BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] (371 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 769 bits (1985), Expect = 0.0, Method: Compositional matrix adjust. Identities = 371/371 (100%), Positives = 371/371 (100%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA Sbjct: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD Sbjct: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS Sbjct: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI Sbjct: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI Sbjct: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGK Sbjct: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 Query: 361 EMVKQRILYNK 371 EMVKQRILYNK Sbjct: 361 EMVKQRILYNK 371 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 408 bits (1048), Expect = e-112, Method: Compositional matrix adjust. Identities = 203/370 (54%), Positives = 273/370 (73%), Gaps = 3/370 (0%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 LNIRNFFYN KG ++ILTAI LP+IF+V+G++IE SH FF+K LH ++D SL++ AT+I Sbjct: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 +N+ NGNN KK K IKN W FRNELR+NGF DI++I RSTSL I++ Q++ Sbjct: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 Query: 124 DYNLSAVSRYEMPFIFCTF-PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Y++SA+SRY++P FCTF PW NS H + ITSSVK++S++D LDMM+VLDVS SM Sbjct: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 F + K+ +A +SI ML+ +K IPDVNNVV+SGLVTFS+KI + F L WGV H+Q Sbjct: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 Query: 243 KINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 KI L FG +T STPGL+YAYN+IFD + +H +YKK I+F+TDGEN S + Sbjct: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK-E 304 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 +++SL+YCNEAK+RGAIVYAIG++ + +FL+ CASP+ FY V+N ++DAF IGK+ Sbjct: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 Query: 362 MVKQRILYNK 371 +V +RI Y+K Sbjct: 365 IVTKRIWYDK 374 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 190 bits (483), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 118/373 (31%), Positives = 205/373 (54%), Gaps = 28/373 (7%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 M + IRNFF N +G I+I +AI+ P+I I+M +V E S+ + K +L ++D +LL T Sbjct: 1 MYCIKIRNFFQNKRGIITITSAIIFPLIIILMAIVFEMSNIYLEKERLQAVIDRALLDTV 60 Query: 61 T--KILNQEN--GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 T K+ N E+ N G + IW + + EL + F+ D+ N+ TS+ + Sbjct: 61 TMIKLKNIEDVVKNVGP----------VNTIWTKNLKYELEHSDFSSDVQNVIDDTSMKL 110 Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFP-WCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 D K +++A+S+Y+MPF C C + + + + SS+KI +D+M+VL Sbjct: 111 ESDSNFKTLSITAISQYKMPFKICNIHLLCPKNKYVTVPVLSSMKIGRNEGSDIDLMIVL 170 Query: 176 DVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 DVS SM+D+F +L VA +SIR+ML+ + +P+ NV R+G V F+ + Sbjct: 171 DVSSSMDDNFMKPEEAPCSRLEVAKKSIRKMLEDFRKVPNYANVFRTGSVGFNDMVQFPM 230 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 PL G++ I I + +T S G++YA+ +++ + + D KK +IFLT Sbjct: 231 PLKRGLKRIYNDIKKYRAFGSTNSYVGMKYAWEQLYGNPQDTK-------DRKKIVIFLT 283 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DGEN N ++++ CN+ K++ A++Y+I + + + L+ C+S Y+ +++ L Sbjct: 284 DGENMIINA-TRKTIELCNDMKKKKAVIYSIALAVDNK-EVLQGCSSSGNVYAADDAQSL 341 Query: 352 HDAFLRIGKEMVK 364 A+ IGK+++K Sbjct: 342 VQAYSLIGKDVMK 354 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 91/358 (25%), Positives = 157/358 (43%), Gaps = 20/358 (5%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 IR N G+ +++ A +PV+F+ L ++T++ +K +L +D + L TA ++ Sbjct: 10 IRKLLRNENGNFALIAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAALATAARLSE 69 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 +EN + Q F+ + + + DF NGF S + ++ ID Sbjct: 70 EENLTAAQAQA--FALKFVNGQVKEDFG---AFNGF---------SVTPTVNIDPVETGG 115 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI--GLDMMMVLDVSLSMND 183 M P L ++ K S + M +VLD S SM+ Sbjct: 116 RTVWKVAVSMEGSQSLTPMARIMGKDKLTVSVVGKSESAGEAQGAFSMALVLDRSGSMDW 175 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 + G K+ V ++ +++ + VR G +++SK+ + L W +E Sbjct: 176 NLN-GQKKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNSKLTGSTKLRWNPGKTKEF 234 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 ++ L T ST ++AY + +E H AK KK+I+F+TDG+N+ + D+ Sbjct: 235 VDALPASGGTDSTDAFDWAYTAVTHKRENNTHDAKSGQVPKKFIVFMTDGDNNYSSADS- 293 Query: 304 ESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASPDR-FYSVQNSRKLHDAFLRIG 359 + C++AK G VY + A Q L CAS + F+ QNS +L +AF IG Sbjct: 294 STKHLCDDAKDDGIEVYTVAFAAPNRGKQLLSYCASTEEHFFDAQNSAQLIEAFKNIG 351 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 91/401 (22%), Positives = 185/401 (46%), Gaps = 62/401 (15%) Query: 20 LTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDF 79 +TAI++ V F+ + I+ +H +++ ++ LD ++L I++ + +K+ Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 Query: 80 SYRIIKNIWQTDFRNELRENGF-AQDINNIERSTSLSIIIDDQHK-DYNLSAVSRYEMP- 136 S I++ + L++ + ++ +I + ++I D + Y + ++YE+P Sbjct: 61 S-----TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 Query: 137 ---FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-------- 185 F+ P ++ L T ++ SS+ ++ + + MVLDVS SM D + Sbjct: 116 ENLFLKGLIPSAL--TNLSLRSTGIIERSSE-NLAISICMVLDVSRSMEDLYLQKHNDNN 172 Query: 186 -------------------------------GPGMDKLGVATRSIREMLDII-KSIPDVN 213 P K+ V S +++ I K+I + Sbjct: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 Query: 214 NV-VRSGLVTFSSKIV--QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 N+ VR G + ++ IV Q PL+ + ++ ++N+L T + P + +AY ++++ K Sbjct: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEA 328 E H G KK++IF+TDGENS + + +L C + G +Y++ V A Sbjct: 293 ES-SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 Query: 329 ADQ-FLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 Q L+ C S +F++V +SR+L ++F +I ++ +Q + Sbjct: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 60/178 (33%), Positives = 92/178 (51%), Gaps = 16/178 (8%) Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI------FGSTTKSTPGL 259 IK I +VN+ VR G F+ +++ +WGV + I + GST + + Sbjct: 285 IKKIDNVNDTVRMGATFFNDRVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAIND-AM 343 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + AY+ I + E H K + + KKYI+ LTDGEN+ DN+E + CN+AK +G + Sbjct: 344 QTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGENTQ---DNEEGIAICNKAKSQGIRI 400 Query: 320 YAIG-----VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL-RIGKEMVKQRILYNK 371 I Q E A FL NCASP+ F+ ++ +L+ F RIG E+ ++ I K Sbjct: 401 MTIAFSVNKTQQEKARYFLSNCASPNSFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 Score = 40.4 bits (93), Expect = 0.48, Method: Compositional matrix adjust. Identities = 41/179 (22%), Positives = 86/179 (48%), Gaps = 9/179 (5%) Query: 12 NCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNN 71 +C G I+TA+L+PV+ V G++++ + + L +++ TA+ L Q Sbjct: 17 SCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAII-TASVPLIQSLEEV 75 Query: 72 GKKQKNDFSY--RIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + KN F++ + I+ +F N L++N +++ +I R T++ ++ + Y + Sbjct: 76 SSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVE--MNPRKSAYQVVL 133 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLIT----SSVKISSKSDIGLDMMMVLDVSLSMNDH 184 SRY++ + + + L+ T +V S + G+ + V+D S SM D+ Sbjct: 134 SSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDY 192 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 56/184 (30%), Positives = 91/184 (49%), Gaps = 4/184 (2%) Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK-INRLIF 249 K+ ++ +LD + S + VR+ + +SS++ + LAWG + I+ L Sbjct: 219 KIAALKTAVGTLLDQLDSADPKSQYVRTAAIAWSSEVDSSSALAWGTTTTRSNVISGLNA 278 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T+S+ + AY + + E AKG+ ++K I+ +TDGEN++ + D K +L C Sbjct: 279 NGGTESSAPMALAYKNVSASSEATAQAAKGNTTFQKIIVLMTDGENNATSSDTK-TLATC 337 Query: 310 NEAKRRGAIVYAIGVQAEAADQ-FLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 AK G ++Y++ A Q LKNCA SP ++ Q L AF IG + KQ Sbjct: 338 KAAKDAGVLIYSVAFMAPDRGQTLLKNCASSPSNYFDAQQMSDLIAAFKTIGNQASKQIT 397 Query: 368 LYNK 371 L K Sbjct: 398 LLTK 401 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 84/361 (23%), Positives = 148/361 (40%), Gaps = 26/361 (7%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 GSI L IL+ ++ V+ + I+ S F + +L D + + T +L +E Sbjct: 80 GSILPLFGILIMLLLAVVTIGIDMSQTFGERTRLQTAADMAAVQTGRALLAEEIT---IA 136 Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNI------ERSTSLSIIIDDQHKDYNLS 128 Q N ++ I + L +G +I + + ++ +D +Y + Sbjct: 137 QANAYAKDAFNRI-----ASGLSASGDGSSGTSIFGTMTVKPAVQITETVDGNTTNYVVK 191 Query: 129 AVSRYEMP---FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 ++P F F + L S +++ L M +VLD S SM Sbjct: 192 VNGTAKIPASPLSFMFFDGETGKNTISLGFESETTAKAEAGASLSMALVLDRSGSMGWER 251 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 M +L A RS+ + L + PD + R G + L W ++ +N Sbjct: 252 PSRMSELKKAVRSLIKELQTVD--PD-DQFTRLGAYAYHWYYAGKKELTWNKNSVRSWVN 308 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 L T++ P ++ A N + E HI K + +I+++TDG + PN +E Sbjct: 309 SLPASGGTRAAPAIQKAKNDLLTNSELNAHINKNEQEPDLFILYMTDGIDGDPNWAKRE- 367 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMV 363 C AK G +Y + +A A+ + LK CA+ D +Y +N+ +L+ F I +E Sbjct: 368 ---CTSAKNAGITIYTVAFKAPASGRNLLKACATSDAHYYDAKNANELNKVFKDIARETT 424 Query: 364 K 364 K Sbjct: 425 K 425 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 55/184 (29%), Positives = 92/184 (50%), Gaps = 6/184 (3%) Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 H+ ++ L +A ++ LD + P+ VR+G V+++ ++ + L WG H+ + Sbjct: 215 HYYTKIEALKLAVGTLTGELDAVD--PE-KEYVRTGAVSYNIEMQKAKALDWGTAHVTKY 271 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 +N+L T S + AYNK+ DA E H+ K KYI+F+TDG+N+ + D Sbjct: 272 VNKLTATDGTDSGEAFKTAYNKLADAAEDKAHVDKTGQVPTKYIVFMTDGDNNYTSADT- 330 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCAS-PDRFYSVQNSRKLHDAFLRIGKE 361 E+ +C++A+ VY I A A Q L CA+ P ++ + L AF IG + Sbjct: 331 ETKTWCDKARDAKMQVYTIAFMAPARGQALLSYCATAPGNYFPAGDMTALLKAFKEIGMK 390 Query: 362 MVKQ 365 Q Sbjct: 391 ASNQ 394 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 99/441 (22%), Positives = 176/441 (39%), Gaps = 90/441 (20%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F N + GS +++A+LLPVIF+V+GL+I+ + L ++ + L + + Sbjct: 8 FFNFKRLKKCYNGSFFVISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTAALSASVQ 67 Query: 63 ILNQ-ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 +LN E+ + K + IK + + L N D I + T ++I ++ Sbjct: 68 LLNSVEDKSKEKALSSVLGENNIKQYLLNNLKISLYNNFGEMDSQRIIQHTKVNIY--NR 125 Query: 122 HKDYNLSAVSRYEMPF--IFCTFPWCANSSHAPLLITSSVKISSKSDI----GLDMMMVL 175 + ++ S Y +P F N P+ +++SK + G+ + ++ Sbjct: 126 KGTHIINVYSHYNLPLNPFSLFFMNLINIKSWPITTVGEAEVTSKKNYHKEEGVSVQWLI 185 Query: 176 DVSLSM----------------------------------NDHFGPGMDKLGVATRSIRE 201 D S SM +D F P + +L RS+ Sbjct: 186 DDSGSMGSIIDRACFGSKQLKSQYNVGSKIGIVRNENADTSDSFYPIVGELVSCDRSLYY 245 Query: 202 MLDIIKSIPD-------VNN----------VVRSGLVTF--------------------- 223 +L+ K + D ++N +VR L TF Sbjct: 246 VLNDKKILEDDDLEEKNLDNHSQYYIRKRYLVRDALATFIKRVRKIDNLKDKLRMSFMYF 305 Query: 224 SSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + +I FP+ WG++ +++++ R + T P L+ AYNK+ E EH K Sbjct: 306 NERIDHYFPMTWGIKEFKQEVSSHYKRKHENTATDIHPILQEAYNKLHSKNEDDEHKKKN 365 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----ADQFLK 334 + KK+I+ LTDG + L C+ AK G ++ I ++ A+ FL Sbjct: 366 SVEVKKFIVLLTDGAQNEGVHSVDSVLKICDAAKEEGIKIFTISYSVDSSERKKANDFLS 425 Query: 335 NCASPDRFYSVQNSRKLHDAF 355 CASPD+F+ ++ KL+ F Sbjct: 426 RCASPDKFFEAYDADKLNMIF 446 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 104/426 (24%), Positives = 159/426 (37%), Gaps = 82/426 (19%) Query: 2 SFLN--IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYT 59 SFL+ +R F + G+ I+TAIL PV+ GL I+ K +L + + Sbjct: 4 SFLHPCLRRMFSDRGGNFGIMTAILAPVLLGAAGLAIQVGDMLLSKQQLQ---EAADSAA 60 Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 NG Q F+ +N N L +NG +I+ +T++++ Sbjct: 61 LATATALGNGTIQTSQAEAFA----RNFVAGQMANYL-QNGV-----DIKNATAVNVQTS 110 Query: 120 DQHKD--YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS-------SKSDIGLD 170 + K Y ++ Y++ ++ H L TSS +S S S + Sbjct: 111 NSGKSASYQVTVTPSYDLTVNPLMQAVGFSTQH---LSTSSTTVSGPSQTPGSNSQGSVS 167 Query: 171 MMMVLDVSLSMND----------------HFGPGMDKLG-----VATRSIREMLDIIKSI 209 M + LD S SM D P ++K G T S I+++ Sbjct: 168 MFLALDKSGSMGDPTETVNKDQPTETFTYDCNPHLNKKGKWVYDTCTGSRTNYYTKIEAL 227 Query: 210 ---------------PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 PD VR+G V++ LAWG + +N L G T Sbjct: 228 KMAAGNLFGQLTSADPDAQ-YVRTGAVSYDIDQYTPSTLAWGTSGVSSYVNALQAGGGTN 286 Query: 255 STPGLEYAY------NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 S+ + AY N + E H K KKYI+F+TDG+N N D+ Y Sbjct: 287 SSGAMGTAYSSLTAKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDN---NNDSSGGRSY 343 Query: 309 -------CNEAKRRGAIVYAIGVQAEAADQ-FLKNCAS-PDRFYSVQNSRKLHDAFLRIG 359 C+ AK +G +Y I A Q L+ CAS ++ + L AF IG Sbjct: 344 DTLTKATCDTAKSKGIEIYTIAFMAPPGGQALLQYCASDAAHYFQAEQMEDLLAAFKAIG 403 Query: 360 KEMVKQ 365 + Q Sbjct: 404 AKASAQ 409 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 44/147 (29%), Positives = 76/147 (51%), Gaps = 3/147 (2%) Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 VR+G ++++S L+WG + ++ L+ T S + A+ K+ +A E EH Sbjct: 263 VRTGAISYNSAQDAASSLSWGTRGAAGYVDALVAIGGTASGNAFKTAFQKVTNAAEDSEH 322 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLK 334 AK KYI+F+TDGEN+ N D+ + +C+ AK +Y++ A + + LK Sbjct: 323 GAKNGQVPTKYIVFMTDGENNHAN-DDTVTRQWCDTAKASKVQIYSVAFMAPDRGQKLLK 381 Query: 335 NCA-SPDRFYSVQNSRKLHDAFLRIGK 360 +CA S ++ + + L AF IG+ Sbjct: 382 SCASSSSHYFEAEEASDLVAAFKAIGE 408 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 45/147 (30%), Positives = 69/147 (46%), Gaps = 3/147 (2%) Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 VR+ ++++S LAWG +N L+ T S + AY K+ A E H Sbjct: 257 VRTAAISYNSVQDTAGTLAWGTSGAAAYVNALVATGGTASAGAFKTAYQKVIAATENTAH 316 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLK 334 AK KY++F+TDGEN+ N D+ + +C+ AK +Y++ A E LK Sbjct: 317 AAKNGQVPSKYMVFMTDGENNYAN-DDTVTKQWCDTAKANKVEIYSVAFMAPERGQALLK 375 Query: 335 NCASPDR-FYSVQNSRKLHDAFLRIGK 360 CAS ++ + L AF IG+ Sbjct: 376 YCASSSSHYFEAEEVTDLVAAFKAIGE 402 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 54/167 (32%), Positives = 73/167 (43%), Gaps = 22/167 (13%) Query: 216 VRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAY------NKIF 267 VR+G V++ IVQ P LAWG + +N L G T S+ + AY N Sbjct: 243 VRTGAVSY--DIVQYTPSALAWGTSGVSTYVNALQAGGGTNSSGAMSTAYSSLTAKNAAG 300 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-------CNEAKRRGAIVY 320 + E H K KKYI+F+TDG+N N D+ Y C+ AK +G +Y Sbjct: 301 NDAEDAAHKLKTGQTPKKYIVFMTDGDN---NDDSSGGRSYDTLTKATCDTAKSKGIEIY 357 Query: 321 AIGVQA-EAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQ 365 I A E L CAS D ++ + L AF IG + Q Sbjct: 358 TIAFMAPEGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASSQ 404 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 54/167 (32%), Positives = 73/167 (43%), Gaps = 22/167 (13%) Query: 216 VRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAY------NKIF 267 VR+G V++ IVQ P LAWG + +N L G T S+ + AY N Sbjct: 162 VRTGAVSY--DIVQYTPSALAWGTSGVSTYVNALQAGGGTNSSGAMSTAYSSLTAKNAAG 219 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-------CNEAKRRGAIVY 320 + E H K KKYI+F+TDG+N N D+ Y C+ AK +G +Y Sbjct: 220 NDAEDAAHKLKTGQTPKKYIVFMTDGDN---NDDSSGGRSYDTLTKATCDTAKSKGIEIY 276 Query: 321 AIGVQA-EAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQ 365 I A E L CAS D ++ + L AF IG + Q Sbjct: 277 TIAFMAPEGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASSQ 323 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 87/390 (22%), Positives = 167/390 (42%), Gaps = 55/390 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N G I++A ++ V I + VI+ +H +K + LD++++ + +++ Sbjct: 16 FSKNKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAIISGCSIVVSDPK 75 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENG----------FAQDINNIERSTSLSIII 118 N+ Q+ I KN + +N E+ F++D+ N I + Sbjct: 76 INDLNPQEERIRDVIKKNAYVNMVQNFPAEHAAYIIENANISFSKDLTN---KYEYKITM 132 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 + +H+ + + + MP + +H + T ++ S + MVLD S Sbjct: 133 EAKHQLSGKNFILGFLMPNVI---------THISSISTGIIQKPSDKK-AFSVEMVLDCS 182 Query: 179 LSMNDHFGPGMD-----------------------KLGVATRSIREML-DIIKSIPDVNN 214 SM D D L A+ ++ + +++ P ++ Sbjct: 183 GSMLDSMQESCDLSSGRGGYYFYSKNNNKPKSKIYALKTASSDFVNLIQETVQTFPQIS- 241 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-AKEKL 273 R GL+TF+ I+Q L+ I++ I+R+ T + + AY + + E Sbjct: 242 -ARIGLITFNHYIMQDSKLSNNFNVIKKTISRMKPKGGTDTFLPMNAAYEYLNNIPNETK 300 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAAD-- 330 H + K+YII +TDGEN+ P+ D K ++ C+ A++ G I+Y+I + E D Sbjct: 301 AHNISDNVPLKRYIILMTDGENNHPSYDLK-TINVCDNARKNGIIIYSIFLNYYEYTDGY 359 Query: 331 QFLKNCASPDR-FYSVQNSRKLHDAFLRIG 359 + + CAS ++ F+ N++ L D+F I Sbjct: 360 ELARKCASSEKHFFYANNTKALLDSFKSIA 389 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 44/184 (23%), Positives = 84/184 (45%), Gaps = 11/184 (5%) Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 +DK+ ++ ++ + + N +R+G +++ + + L WG ++ + L Sbjct: 217 VDKITTLKSAVDKLFTPLAKMDPGNEYLRAGAASYNDRQDRASKLTWGTKNASAHVQGLD 276 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-------ID 301 T S+ A ++ E H+AK +KYI+F+TDGEN+S N ++ Sbjct: 277 ATGGTDSSSAFAAAVEELLLDGENEAHLAKNGQTPEKYIVFMTDGENTSYNGKTSPRDLE 336 Query: 302 NKESL--FYCNEAKRRGAIVYAIGVQA-EAADQFLKNCA-SPDRFYSVQNSRKLHDAFLR 357 +S+ C AK G ++ + A + LK CA SPD + ++ L F + Sbjct: 337 KADSVTKAACTTAKNNGIAIFTVAFMAPQRGKDLLKACATSPDHYKEADDAAALVSEFEK 396 Query: 358 IGKE 361 IG++ Sbjct: 397 IGQK 400 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 98/410 (23%), Positives = 167/410 (40%), Gaps = 61/410 (14%) Query: 2 SFLN--IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYT 59 SFL+ +R + G+ I+TAI++PV+ GL I+ S+ K +L D + L Sbjct: 4 SFLHPCLRRMLGDRGGNFGIMTAIMMPVLLGAAGLAIDYSNMALSKRELQEATDSAALAA 63 Query: 60 ATKILNQENGNNGKKQ--KNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSII 117 AT + + + DF + N TD + ++ G + DI+ +TS Sbjct: 64 ATALASGAASTTADAEAIAKDFVSGQMANYVDTDAISSIKA-GTSVDIDVSATATS---- 118 Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS---SVKISSKSDIGLDMMMV 174 K Y ++ + Y + P+ + + L I + + +S + L M +V Sbjct: 119 -----KSYKVTVATSYGIAAT----PFMSVLGYKTLNIGASTSTSSGTSDTKTALSMELV 169 Query: 175 LDVSLSMNDH--------------FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 LD S SM + + +D L A ++ + LD PD +++VR+G Sbjct: 170 LDQSGSMGEKTTTCATYNGKNCKTYVTKIDALKKAADALFDALDTAD--PD-HSLVRTGA 226 Query: 221 VTFS---------SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK- 270 +++ ++I +AWG ++ + T +T + A I A Sbjct: 227 YSYNNGLIYNSQKTQIKSMSGMAWGTATTATYVSGITASGGTDATEPMRQATLSIAKASD 286 Query: 271 ----EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF------YCNEAKRRGAIVY 320 E H KG+ +YII +TDGE + N +S F C+ K G ++ Sbjct: 287 GSDVETQAHAVKGNTIVSRYIILMTDGEMTG-NTGVWQSSFDQNVRNQCDATKTAGIKIF 345 Query: 321 AIGVQA-EAADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + A + Q L+ CASP +Y + KL +F I KE K L Sbjct: 346 TVAFMAPDKGKQLLQYCASPGGNYYEAETMEKLVASFTSIAKEATKAVTL 395 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 50/189 (26%), Positives = 92/189 (48%), Gaps = 15/189 (7%) Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR---- 246 K+ ++ LD I + V V GL+ +++++ + +WG + +++ + R Sbjct: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEK-------LEHIAKGHDDYKKYIIFLTDGENSSPN 299 LI T STP ++ AY + K++ + + ++K+IIFLTDGEN++ Sbjct: 287 LIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNC-ASPDRFYSVQNSRKLHDAFLR 357 N ++ C++AK + I + A Q LK C +SP+ Y+V N+ L F Sbjct: 346 -SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 Query: 358 IGKEMVKQR 366 I + MV ++ Sbjct: 405 ISQLMVHRK 413 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 52/167 (31%), Positives = 74/167 (44%), Gaps = 22/167 (13%) Query: 216 VRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAY------NKIF 267 VR+G V++ +V+ P LAWG+ + +N L G T S+ + AY N Sbjct: 102 VRTGAVSY--DLVEYTPSKLAWGITAVTSYVNALESGGGTNSSGAVNTAYTSLTAKNAAG 159 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-------CNEAKRRGAIVY 320 + E H K KKYI+F+TDG+N N D++ Y C+ AK +G Y Sbjct: 160 NDAEDAAHKLKTGQLPKKYIVFMTDGDN---NDDSRGGRSYDTLTKATCDTAKAKGIETY 216 Query: 321 AIGVQA-EAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQ 365 I A E L CAS D ++ + L AF IG + Q Sbjct: 217 TIAFMAPEGGQALLHYCASDDAHYFQAEKMEDLLAAFKAIGAKASAQ 263 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 52/168 (30%), Positives = 75/168 (44%), Gaps = 24/168 (14%) Query: 216 VRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF------ 267 VR+G V++ I Q P LAWG + +N L T S+ + AY+ + Sbjct: 244 VRTGAVSY--DINQYTPSNLAWGTAGVTSYVNALQANGGTNSSGAMSTAYSSLTAKNAAG 301 Query: 268 -DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-------CNEAKRRGAIV 319 DA++ + G KKYI+F+TDG+N N D+ Y C+ AK +G + Sbjct: 302 NDAEDSAHKLKTGQTP-KKYIVFMTDGDN---NDDSSGGRSYDTLTKATCDTAKSKGIEI 357 Query: 320 YAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQ 365 Y I A A Q L CAS D ++ + L AF IG + Q Sbjct: 358 YTIAFMAPAGGQTLLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASAQ 405 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 63.2 bits (152), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 52/167 (31%), Positives = 73/167 (43%), Gaps = 22/167 (13%) Query: 216 VRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAY------NKIF 267 VR+G V++ IVQ P LAWG + +N L G T S+ + AY N Sbjct: 222 VRTGAVSY--DIVQYAPSSLAWGAIGVSSYVNALQAGGGTNSSGAMSTAYLSLTAKNAAG 279 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-------CNEAKRRGAIVY 320 + E H K +KYI+F+TDG+N N D+ Y C+ AK +G +Y Sbjct: 280 NDAEDSAHKLKSGQIPQKYIVFMTDGDN---NNDSSGGRSYDTLTKATCDTAKSKGIEIY 336 Query: 321 AIGVQAEAADQ-FLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 I A Q L+ CAS ++ + L AF IG + Q Sbjct: 337 TIAFMAPPGGQALLQYCASDASHYFQAEKMEDLFAAFKAIGAKASTQ 383 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 63.2 bits (152), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 50/165 (30%), Positives = 70/165 (42%), Gaps = 18/165 (10%) Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY------NKIFDA 269 VR+G V++ LAWG+ + +N L T S+ + AY N + Sbjct: 244 VRTGAVSYDINQYAPSSLAWGITGVSSYVNALQANGGTNSSGAMNTAYTSLTAKNAAGND 303 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-------CNEAKRRGAIVYAI 322 E H K KKYI+F+TDG+N N D Y C++AK +G +Y I Sbjct: 304 VENSAHQQKTGQVPKKYIVFMTDGDN---NNDPSGGRSYDTATKKTCDDAKSKGIEIYTI 360 Query: 323 GVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQ 365 A A Q L CAS D ++ + L AF IG + Q Sbjct: 361 AFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFQAIGAKASAQ 405 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 44/154 (28%), Positives = 79/154 (51%), Gaps = 15/154 (9%) Query: 219 GLVTFSSKIVQTFPLAWGV----QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-- 272 GL +++++ + +WG ++I E+I+ + G T STP ++ AY + K++ Sbjct: 262 GLTGYTTRVEKNIEPSWGTGKVRKYIVEEIDVNMLGQT-DSTPAMKKAYQILTSDKKRNF 320 Query: 273 -----LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + I ++K++IFLTDGEN+ P D K ++ C +AK+ + I + A Sbjct: 321 IRNILHKRIKIPPLPFQKFLIFLTDGENNDPKSDVK-TIKICEKAKKNSIKILTISINAS 379 Query: 328 A-ADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIG 359 A + LK C S P+ +Y+V ++ L F I Sbjct: 380 ANGKRLLKKCVSAPEYYYNVVDTGSLLRVFQDIS 413 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 89/409 (21%), Positives = 164/409 (40%), Gaps = 71/409 (17%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G+ ++TAILLPV+ G +E ++ VKA L D + L AT+ + Sbjct: 11 RRFLADTGGNFGMMTAILLPVLLGFAGAGMELANVMQVKADLQNTADSAALAAATEARLK 70 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD--QHKD 124 E + IK I + +++ + ++ +E+++ ++I D + K Sbjct: 71 EGA---------LTDEQIKEIAKAFIASQMEKTLTEEEKKALEKNSPVNIGTTDDARGKT 121 Query: 125 YNLSAVSRYEMPF--IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Y + Y+M + F + L T + + + M +VLD S SM+ Sbjct: 122 YTIQTTINYQMQLNPLLGFF----GAKTLDLAATGTAVSTVNKGAPISMYLVLDRSGSMS 177 Query: 183 DHFGPGMDKLGVATRSIRE-MLDIIKSIPDVNNV-------------------------- 215 D L S + +D S P++ N Sbjct: 178 FK----TDTLNTKKTSCQNYTVDNWGSYPNLKNTSPCYVNKATSLKTAVGYLVATLNKAD 233 Query: 216 -----------VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--FGS-TTKSTPGLEY 261 VR+G ++ + P+ WG + +++ I F S T + L Sbjct: 234 PTYTANGGSELVRTGASVYTHETYAAQPITWGTSSVATYVDKQIPEFPSGGTDARSSLNA 293 Query: 262 AYNKIFDAK--EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL-----FYCNEAKR 314 AYN + A E EH K + +++YI+ +TDGE + + S+ C+ AK+ Sbjct: 294 AYNALKKANTVEAKEHKDKKSESFERYIVLMTDGEMTGNSSSWSSSIDQTVRNTCDTAKK 353 Query: 315 RGAIVYAIGVQA-EAADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKE 361 G ++++ A + L++CAS D +Y+ +N ++ AF I ++ Sbjct: 354 DGIKIFSVAFMAPDKGKSLLQHCASSLDNYYAPENMEQIVTAFGEIARK 402 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 60.1 bits (144), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 40/144 (27%), Positives = 64/144 (44%), Gaps = 9/144 (6%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL+ + I+ + T +T GL + +N + L A + K I+FL Sbjct: 293 MPLSTNWSALNSHIDAMASAGNTNTTIGLAWGWNMLTQGGP-LSSAAAPAANLDKVIVFL 351 Query: 291 TDGENS------SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFY 343 TDG+N+ + N N + CN K G VY++ V E ++NCA+ P +Y Sbjct: 352 TDGDNTRNRWSNNSNTINARTTLICNNIKAAGIKVYSVRV-IEGNATLIRNCATEPGMYY 410 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRI 367 SV + +L F I + + RI Sbjct: 411 SVTTASELTSVFASIAQSLSNLRI 434 >gi|306823858|ref|ZP_07457232.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309802423|ref|ZP_07696530.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] gi|304552856|gb|EFM40769.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221023|gb|EFO77328.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] Length = 1136 Score = 56.2 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 50/183 (27%), Positives = 83/183 (45%), Gaps = 38/183 (20%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD----IIKSIPDVNNVVRSGLVTFS 224 +D +VLDVS SM ++ G + KL ++ LD I K + VR GLV F+ Sbjct: 601 VDFTLVLDVSGSMRENMG-SVTKLQALQSAVNNFLDEAAKINKGAQSGSEPVRVGLVKFA 659 Query: 225 SKIVQ-----TFPLAWGVQH--------------IQEKINRLIFGSTTKSTPGLEYAYNK 265 + T+ W + ++ ++N+L G T++ G ++A+ Sbjct: 660 GNATKKIGNKTYQDKWNTYNYSQIVKKLTADTDGLKNEVNKLTAGGATRADYGFQHAFTV 719 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNK---ESLFYCNEAKRRGAIVYA 321 + +A+ + KK +IF TDG+ S D K +++ Y + K GAIVY+ Sbjct: 720 MSEART----------EAKKVVIFFTDGKPTSEKTFDGKVANDAVEYAKQLKDSGAIVYS 769 Query: 322 IGV 324 IGV Sbjct: 770 IGV 772 >gi|320102588|ref|YP_004178179.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] gi|319749870|gb|ADV61630.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] Length = 688 Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 51/170 (30%), Positives = 76/170 (44%), Gaps = 26/170 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +++++DVS SM GP +D+ A RS + D + R GL+++S ++V Sbjct: 511 LAILLLIDVSSSMA---GPPLDEAREAARSFLDQCDFTTT--------RVGLISYSDQVV 559 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L V+ ++ + RL TT LE + KL + GH KY++ Sbjct: 560 LQTDLTDNVRKVEAGLARLEADGTTNLAGALELG-------RRKLATVPTGH---VKYLV 609 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 LTDG P DN +L AK G + AIG EA +L AS Sbjct: 610 VLTDGYPDDP--DN--ALLEAAHAKGSGIEIVAIGT-GEADQAYLDRIAS 654 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 40/159 (25%), Positives = 75/159 (47%), Gaps = 12/159 (7%) Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--FGST-TKSTPGLEYAYNKIFDAK- 270 +VR+G ++ + + WG + +++ I F S T + L AYN + A Sbjct: 244 LVRTGASVYTHETYVAQSIGWGTSGVTSYVDKQIPEFPSGGTDARSSLNAAYNALKKANP 303 Query: 271 -EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL-----FYCNEAKRRGAIVYAIGV 324 E H KG + +++YI+ +TDGE + + S+ C AK+ G ++++ Sbjct: 304 DEARYHKEKGSESFERYIVLMTDGEMTGNSAAWNSSIDQSVRTTCETAKKDGIKIFSVAF 363 Query: 325 QA-EAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKE 361 A + L+ CA S D +Y+ +N ++ AF I ++ Sbjct: 364 MAPDKGKSLLQYCASSADNYYAPENMEQIVTAFGEIARK 402 >gi|126273404|ref|XP_001377627.1| PREDICTED: similar to AMACO [Monodelphis domestica] Length = 784 Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 49/170 (28%), Positives = 81/170 (47%), Gaps = 21/170 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 VKISS S + + LDV +++ G + +M D + PD VR Sbjct: 35 VKISSASQL-MQCSAALDVLFALDGSHSIGKGSFERSKYFAIKMCDALAIYPDR---VRV 90 Query: 219 GLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEH 275 G++ FSS FPL + + ++EKI +++F G +T++ L+Y +K F Sbjct: 91 GVLQFSSVPQLEFPLDSFFTREEVKEKIKKIVFKGGSTETGLALKYLLHKGFPGGR---- 146 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +I +TDG+ S NID N+ K RG +V+A+GV+ Sbjct: 147 ----NSSVPQLLIIVTDGK-SQGNID-----LPANQLKERGVMVFAVGVR 186 >gi|325954650|ref|YP_004238310.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437268|gb|ADX67732.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 338 Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 62/216 (28%), Positives = 87/216 (40%), Gaps = 44/216 (20%) Query: 132 RYEMPFI-FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGM 189 R +P + + C + P ++ S KI KSD G+D+M+ +D SLSM P Sbjct: 55 RPLLPILRYLALALCIVALARPRIVDVSTKI--KSDKGVDIMLTVDTSLSMLARDLEP-- 110 Query: 190 DKL----GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 D+L VA + +E R GLV++S + + PL + + +IN Sbjct: 111 DRLTALKAVAVKFSKE-----------RQADRLGLVSYSGEALTRVPLTTDREVLIREIN 159 Query: 246 RLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 L G T GL A N I D+K K K II +TDG S I+ Sbjct: 160 ALESGELEDGTAIGIGLATAINHIKDSKAK-----------SKVIILMTDGVES---INP 205 Query: 303 KESLFYCNE------AKRRGAIVYAIGVQAEAADQF 332 L Y + A RG VY IG+ F Sbjct: 206 TNDLMYISPQTAAEMATSRGIKVYTIGIGTRGLAPF 241 >gi|31789431|gb|AAP58546.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 327 Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 62/243 (25%), Positives = 103/243 (42%), Gaps = 39/243 (16%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMD 190 +PF+ P A + L I V S++D+ GLD++++LD+S SM + G G Sbjct: 51 RLPFV----PLSAAAVLIGLAIMQPVIPYSQADLRSRGLDIVLLLDLSSSMQEEMGSGQS 106 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRS------GLVTFSSKIVQTFPLAWGVQHIQEKI 244 T + R +D +K V VR GLV FS PL + Q++ + + Sbjct: 107 LKTGTTAAGRTRMDAVKDA--VRTFVRGRRDDRIGLVVFSDNAYVISPLTFDHQYLLDYL 164 Query: 245 ----NRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 ++ G T GL A + + A+GH + ++ TDGE++ Sbjct: 165 GFVDGEILLGEGQTAIGDGLALASAVL---ARQAGRDARGH----QVVVLFTDGESNR-- 215 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEA------ADQFLKN--CASPDRFYSVQNSRKL 351 ++ + EAK G V+ IGV +A Q L+ A+ R+++ + R L Sbjct: 216 --GRDPIEVVGEAKSAGIRVHVIGVDLDAEVKTRPGVQLLRRGVVAAGGRYFAADSERDL 273 Query: 352 HDA 354 A Sbjct: 274 LTA 276 >gi|109897980|ref|YP_661235.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] gi|109700261|gb|ABG40181.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] Length = 343 Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 61/229 (26%), Positives = 101/229 (44%), Gaps = 38/229 (16%) Query: 137 FIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 I T W A ++S P + V I ++ G D+M+ +D+S SM +D + V Sbjct: 56 LIIATIGWVALVSASARPQWLGEPVSIPAE---GRDLMIAVDLSGSMK------IDDMQV 106 Query: 195 ATRSIREMLDIIKSIPDVNNVV------RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 R + + L +IKS+ +++ + R GL+ F+ PL + + + + +N + Sbjct: 107 NGRQV-DRLQMIKSV--LHDFIQRRIGDRLGLIFFADTAYLQAPLTYDRETVSQLLNESL 163 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G + T I DA K ++ K +I LTDG+N++ NI ++ Sbjct: 164 IGLVGEQT--------AIGDAIGLAIKRFKSKEESNKVLILLTDGQNTAGNITPEQ---- 211 Query: 309 CNE-AKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL 356 NE A G +Y IGV ADQ L R V S++L + L Sbjct: 212 ANELAINNGVTLYTIGV---GADQMLVQSIFGSR--QVNPSQELDEGML 255 >gi|85374104|ref|YP_458166.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] gi|84787187|gb|ABC63369.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] Length = 623 Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 35/120 (29%), Positives = 52/120 (43%), Gaps = 27/120 (22%) Query: 274 EHIAKGHDDYKKYIIFLTDGEN-------SSPNID-------------------NKESLF 307 E+ +K +++IFLTDG+ S ID + LF Sbjct: 503 ENSSKPGRTTSRHLIFLTDGQTEPYDLAYGSYGIDPIDERRWTQTSSLTLAQTVEERFLF 562 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 CNE K+ GA V+ + A D+ +K CA R++ N+ +L+DAF I K RI Sbjct: 563 ACNEVKKLGATVWVVAFGTAANDK-MKTCAGSGRYFEAANASQLNDAFSTIAKSTGDLRI 621 Score = 40.4 bits (93), Expect = 0.41, Method: Compositional matrix adjust. Identities = 48/216 (22%), Positives = 101/216 (46%), Gaps = 30/216 (13%) Query: 16 SISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQ 75 +++++ A LLP++ + G ++ S + +++L D +L A K L E Sbjct: 12 TLALIAAGLLPLLAMA-GSGVDMSRAYLAESRLQQACDSGVL-AARKALGTE-------- 61 Query: 76 KNDFSYRIIKNIWQTDFRNELRE--NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY 133 + +I TD +E N QD N ++ + ++++++ DY++S + Sbjct: 62 -----IATLTDI-PTDAGTRGQEFFNSNFQDGNYGTQNRTFNMVLEN---DYSVSGTATV 112 Query: 134 EMPF-IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDK 191 ++P + F + + P+ + +IS SD+ D+MMVLDV+ SM + + G + K Sbjct: 113 DVPTSVMTVFGF----TKIPVKVECQARISF-SDV--DVMMVLDVTGSMKHTNSGDTLSK 165 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + ++R D ++ +R G V ++S + Sbjct: 166 IDSLKATVRNFYDQMEGAKSAGTRIRYGFVPYASNV 201 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 34/93 (36%), Positives = 46/93 (49%), Gaps = 10/93 (10%) Query: 275 HIAKGHDDYK--KYIIFLTDGENS------SPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + AK + D KY+I +TDGEN+ S + N +L C AK G VY I V Sbjct: 468 NTAKPYSDKTNYKYMIVITDGENTQNRWSTSASTINARTLLACQAAKDLGITVYTIRVM- 526 Query: 327 EAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRI 358 E LK+CAS P+ FY V S +L ++ Sbjct: 527 EGNSDMLKSCASRPEYFYDVTASSQLTSTLAKV 559 >gi|312881786|ref|ZP_07741560.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] gi|309370537|gb|EFP98015.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] Length = 323 Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 52/193 (26%), Positives = 78/193 (40%), Gaps = 34/193 (17%) Query: 144 WCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVAT 196 WCA + P+ V K D+M+VLD+S SM+ D G +D+L Sbjct: 61 WCALITALARPVWYGEPVTTQPKHR---DLMLVLDLSYSMSQEDMQDSSGNYIDRLTAVK 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTT 253 +++ R GLV F+ PL I E++N L+ G T Sbjct: 118 -------NVVSQFAQQRKGDRLGLVLFADHAYLQTPLTLDRNTISEQVNSLVLQLIGQKT 170 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 G+ A D+ D ++ +I L+DG N+S +D E+ N AK Sbjct: 171 AIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTSGVLDPIEA---ANIAK 216 Query: 314 RRGAIVYAIGVQA 326 + A +Y IGV A Sbjct: 217 KYNATIYTIGVGA 229 >gi|85716351|ref|ZP_01047324.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] gi|85696867|gb|EAQ34752.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] Length = 542 Score = 49.7 bits (117), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 41/162 (25%), Positives = 66/162 (40%), Gaps = 41/162 (25%) Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +T S Q+ LAWG Q + + + PG E Y Sbjct: 405 MTPSGNTNQSIGLAWGWQSLSTTNGPI-------AAPGKESGYV---------------- 441 Query: 281 DDYKKYIIFLTDGENS------------SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 Y+ YI+ L+DG N+ P ID +++L C + K G ++ I V + Sbjct: 442 --YQDYIVLLSDGLNTQNRWYSCPPSGPCPTIDARQALL-CQKVKDSGVTIFTIQVNVGS 498 Query: 329 AD---QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 D Q L+NCAS F + ++ + DAF I ++ + R+ Sbjct: 499 KDPLSQVLQNCASDGNFQMITSATETADAFQNILTQISQLRL 540 >gi|147921050|ref|YP_685140.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] gi|110620536|emb|CAJ35814.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] Length = 1310 Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 59/123 (47%), Gaps = 13/123 (10%) Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 ++ IN L T + G++ A ++ DA ++ K+YII LTDG + P Sbjct: 953 VKNAINSLSASGGTDISSGIKKAIAEL-DAHKR--------STAKQYIIVLTDGYSQYPE 1003 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 D L ++AK +G ++ IG+ D K + P+ +Y V + +L A+ IG Sbjct: 1004 FD----LIEADKAKAKGYTIFTIGMGMADEDTLKKIASKPEYYYRVLSPEQLEAAYYDIG 1059 Query: 360 KEM 362 +E+ Sbjct: 1060 QEI 1062 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust. Identities = 39/165 (23%), Positives = 70/165 (42%), Gaps = 15/165 (9%) Query: 210 PDVNNVVRSGLVTFS---SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 P +VV S + S + + + L + ++ L G T T G+++ + Sbjct: 351 PYATDVVDSNYIAASCSTTALAKVLDLTSDFTSVNTYLSSLSPGGNTNITLGVQFGMEML 410 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGEN------SSPNIDNKESLFYCNEAKRRGAIVY 320 A+ + A G D KKY+I +TDG N +S + N + C AK +G ++ Sbjct: 411 SPAEPYTKATAFGDTDVKKYMIIVTDGANTQNRWSTSNSAINARTALACTAAKAQGITLF 470 Query: 321 AIGVQAEAADQFLKNCASPDRFY-----SVQNSRKLHDAFLRIGK 360 + V+ + L+ CAS +Y + ++ + D F I K Sbjct: 471 VVRVE-DGDSSLLEACASQSSYYYDLSQASDLTKTMQDIFATINK 514 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust. Identities = 34/131 (25%), Positives = 63/131 (48%), Gaps = 8/131 (6%) Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS------S 297 I L G T T G+++ + + + G +K++I +TDG+N+ S Sbjct: 281 IKTLQPGGYTNVTMGVQWGMEVLSPNQPFSDATEFGSTKARKFMIVVTDGDNTKSFTSWS 340 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFL 356 ++ +K + C AK +G VY + + + L+ CAS P+ FY + ++ +L+ A Sbjct: 341 ASVIDKRTALACENAKAKGITVYTVKI-IQGNSNMLRKCASAPEYFYDLTSANQLNAAMS 399 Query: 357 RIGKEMVKQRI 367 I K + K R+ Sbjct: 400 GIFKSINKTRL 410 >gi|86134839|ref|ZP_01053421.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821702|gb|EAQ42849.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 336 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 51/183 (27%), Positives = 77/183 (42%), Gaps = 35/183 (19%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSM-NDHFGP----GMDKLGVATRSIREMLDIIKSIPDV 212 SV +K++ G+D++M +DVS SM P + K+ V D + P+ Sbjct: 83 SVSKRTKTNRGIDIVMAIDVSASMLARDLKPNRLEALKKVAV---------DFVDRRPN- 132 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDA 269 R G+V ++ + P+ ++ INRL +G T GL N++ D+ Sbjct: 133 ---DRIGIVVYAGESFTQTPITSDKTIVKRTINRLQWGQLEGGTAIGMGLGSRVNRLKDS 189 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 K K K II LTDG N++ NID + AK G VY IG+ Sbjct: 190 KAK-----------SKVIILLTDGVNNAGNIDPTTA---TELAKELGIKVYTIGIGTNGM 235 Query: 330 DQF 332 F Sbjct: 236 ADF 238 >gi|269926132|ref|YP_003322755.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] gi|269789792|gb|ACZ41933.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] Length = 643 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 51/197 (25%), Positives = 82/197 (41%), Gaps = 31/197 (15%) Query: 169 LDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+++ LD S SMND F D +I + + V GL+TF Sbjct: 97 IDVVLALDTSASMNDDAFTAAQDA----------AYGLINGLSPEDKV---GLITFDKTA 143 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PLA +QE I +L T GL A + +AKG + K I Sbjct: 144 RVIEPLAQDHARVQESIQKLSRSVGTALYQGLSLAA----------QEVAKGQN--TKAI 191 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV 345 + +TDG N+S N +E++ E GA V+ +G + Q L+ A + ++S Sbjct: 192 VLMTDGFNTSRNTTLEEAVAKAQEV---GASVFTVGFGKKVDTQGLQKIANETGGEYFSA 248 Query: 346 QNSRKLHDAFLRIGKEM 362 + +L F I +++ Sbjct: 249 PTNAQLRRVFADISQKL 265 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust. Identities = 54/238 (22%), Positives = 91/238 (38%), Gaps = 38/238 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHF--------------GPGMDKLGVATR---------- 197 ++ S+ L M +VLD S SMND G G +K T Sbjct: 190 TTASESPLSMYLVLDRSGSMNDETATTYTGTCTKTTTSGYGWNKKTTTTSYSCTKNYTKI 249 Query: 198 -----SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 ++ ++ +K + VR+G ++++ ++WG ++ +N L Sbjct: 250 ESLKLAVADLAAQLKKADPNSEYVRTGADSYNASADTAQAMSWGTANVVTYVNALSATGG 309 Query: 253 TKSTPGLEYAYNKI--FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-- 308 T + L AY+ + + E H +YI+F+TDGE + + S+ Sbjct: 310 TDARGALSAAYSALQTSNKTEITAHNVSSVSKIGRYIVFMTDGEMTGNSSSWSSSIDSAV 369 Query: 309 ---CNEAKRRGAIVYAIGVQAEA-ADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKE 361 C K G +Y + A A L CAS +Y ++ L AF IGK+ Sbjct: 370 RSQCTSIKADGIQIYTVAFMAPANGKSLLSACASDASHYYEATDAASLVAAFGEIGKK 427 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust. Identities = 54/230 (23%), Positives = 93/230 (40%), Gaps = 28/230 (12%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 C ++ P + K+ +++ LD S SM+ G +++ ATR Sbjct: 356 CEDNGVDPRTLMGDSKVDPNDASSRHVVLALDTSGSMD---GEPLNETKTATREFAST-- 410 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 I KS DV LV++ S ++ + L G T L +Y Sbjct: 411 IFKSDADVC------LVSYDSSARNVIDSTDNEYALKAAVRDLSAGGGTNIEDALRVSY- 463 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 E+LE G D K+ I+ ++DGE ++ + + + Y NE K G +Y +G Sbjct: 464 ------ERLE--GSGSD--KRIIVLMSDGE-ANEGLVGDDLIAYANEIKDDGVTIYTLGF 512 Query: 325 QAEAADQ-----FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 +D+ ++ ASP Y V ++ +L F IG ++ R +Y Sbjct: 513 FQSVSDKAECQRVMEGIASPGCHYEVDDASQLRYFFGDIGDDINGTRFIY 562 >gi|148256121|ref|YP_001240706.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] gi|146408294|gb|ABQ36800.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] Length = 602 Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust. Identities = 41/171 (23%), Positives = 71/171 (41%), Gaps = 28/171 (16%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S+K+ Q PL++ ++ +N + T G+ +A + + Y Sbjct: 431 STKLGQIVPLSYNWTSLKSAVNAMEPTGGTNQAIGMAWAVQSLIPNGVLGAPAEDANTTY 490 Query: 284 KKYIIFLTDGENSS---PN------------IDNKESLFYC----NEAKRRGAIVYAI-- 322 + II L+DG N+ P+ ID +++L C N +G +Y I Sbjct: 491 NRVIILLSDGLNTEDRWPDYGNGSTQASGNPIDARQALL-CSNLKNTKDSKGNAMYTIYT 549 Query: 323 -----GVQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A+ L+NCAS PD+FY + +S ++ F IG + K R+ Sbjct: 550 IQVNTSSPADPTSTVLQNCASSPDKFYMLTSSSQIVTTFNSIGTALSKLRV 600 >gi|114704798|ref|ZP_01437706.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] gi|114539583|gb|EAU42703.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] Length = 545 Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust. Identities = 35/146 (23%), Positives = 69/146 (47%), Gaps = 12/146 (8%) Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L + +Q ++ +N+L T T G+++ + A L + G + +K +I LTD Sbjct: 403 LTFDLQSVETAVNKLTPSGNTNVTIGVQWGM-EALTAAAPLTGVRTGSE-VRKVMIVLTD 460 Query: 293 GENSS--------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFY 343 G N+ N + +L CN AK G +Y + + E + LK CA + D+++ Sbjct: 461 GLNTQNRWWGSRDRNKIDARTLAACNNAKAMGIELYTVRL-VEGNEDLLKTCAETEDKYH 519 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V ++ +L F + +++ R+ + Sbjct: 520 YVTSASQLKTTFADLARQVKGVRLAW 545 >gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 900 Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust. Identities = 54/195 (27%), Positives = 81/195 (41%), Gaps = 30/195 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+M+VLD S SM D G+ KL +A + +M++ ++S V G++ F Sbjct: 406 IDVMLVLDHSGSMADTEDAGIPKLEIAKSASAKMVEHLESSDGV------GVIAFDHNYY 459 Query: 229 QTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + V + + E I+ + G T P L A + +K K K Sbjct: 460 WAYKFGKLVRKEDVIESISSIEVGGGTAIIPPLSEAVKTLKKSKAK-----------NKL 508 Query: 287 IIFLTD--GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 ++ LTD GE S I +EAKR + IGV L A + RF Sbjct: 509 VVLLTDGMGEQSGYEIP-------ADEAKRNNIKITTIGVGKFVNASVLSWIAAYTSGRF 561 Query: 343 YSVQNSRKLHDAFLR 357 Y V N +L D FL+ Sbjct: 562 YLVSNPSELVDVFLK 576 >gi|75675889|ref|YP_318310.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] gi|74420759|gb|ABA04958.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] Length = 605 Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust. Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 16/151 (10%) Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFL 290 P++ ++ +I+ + +T GL + + + AK Y+ Y++ L Sbjct: 454 PVSSQSSTLKNQIDSMSPSGSTNQAIGLAWGWQTLSTTNGPFPAPAKDKAYVYQDYLVLL 513 Query: 291 TDGENS-----------SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---QFLKNC 336 +DG N+ SP +D +++L C + K G +++ + V D Q L++C Sbjct: 514 SDGLNTRNRWSGNGSDHSPEVDVRQALL-CQKVKDSGTVIFTVQVNVGNRDPLSQVLQDC 572 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 AS F + ++ + DAF I ++ + RI Sbjct: 573 ASNGNFQMITSANQTADAFQNILTQISQLRI 603 >gi|312878233|ref|ZP_07738157.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311794982|gb|EFR11387.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 1221 Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust. Identities = 54/201 (26%), Positives = 87/201 (43%), Gaps = 30/201 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ VLD S SM+ + G K +A +S + L I+ R+ +V F Sbjct: 532 IDLVFVLDSSGSMSWNDPNGYRK--IAAKSFVDAL--IQG-------DRAAVVDFDDYGY 580 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL Q ++ I+R+ T G+ A +++ I++ DD K II Sbjct: 581 LLQPLTTDFQTVKNAIDRIDSWGGTNIAEGIRIANHQL---------ISQSSDDRIKVII 631 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQ 346 LTDGE N EAK G +Y IG+ + L+N A+ ++ V Sbjct: 632 LLTDGEGYYDN-------NLTTEAKNNGITIYTIGLGTSVDENLLRNIATQTGGMYFPVS 684 Query: 347 NSRKLHDAFLRIGKEMVKQRI 367 ++ +L F RI E+V + I Sbjct: 685 SASQLPQVFKRI-TEIVTEPI 704 >gi|37676036|ref|NP_936432.1| hypothetical protein VVA0376 [Vibrio vulnificus YJ016] gi|37200576|dbj|BAC96402.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 323 Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust. Identities = 45/170 (26%), Positives = 70/170 (41%), Gaps = 40/170 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV----------RSG 219 D+M+V+D+S SM I + D I + V NVV R G Sbjct: 87 DLMLVVDLSGSMQQE-------------DILQDGDYIDRLSSVKNVVTQFIEQRQGDRLG 133 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHI 276 LV F+ PL Q + ++N+ I G + T GL A D++ Sbjct: 134 LVLFADHAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALATKTFVDSEAP---- 189 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ +I L+DG N++ +D E+ N AK+ G +Y IG+ A Sbjct: 190 -------QRVVILLSDGSNTAGTLDPIEA---ANIAKKYGVKIYTIGIGA 229 >gi|120616160|gb|ABG80452.1| collagen [Hydra vulgaris] Length = 2439 Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 52/194 (26%), Positives = 84/194 (43%), Gaps = 42/194 (21%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 FP C N +I +DI D+ LD S S + ++ Sbjct: 1824 FPGCTN------------EIGEINDIPTDIAFALDASASFYEE-------------GFQQ 1858 Query: 202 MLDIIKSIPD----VNNVVRSGLVTFS--SKIVQTFPLAWGVQHIQEKINRLIFGST-TK 254 D IKS+ D ++ VR G++T+S +KI F ++ + +++ I+ + + S T+ Sbjct: 1859 EKDFIKSVIDKIELSSSGVRVGVLTYSDEAKIRIRFDYSFDKEDVKKAIDNIPYDSMGTR 1918 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-PNIDNKESLFYCNEAK 313 GLE AKE + G KK +I LTDG+ + P D K+ + Y E Sbjct: 1919 IDLGLE-------AAKELFLEKSGGRGSSKKVLILLTDGQQTYIP--DAKDPVDYAKELA 1969 Query: 314 RRGAIVYAIGVQAE 327 G ++AIG+ E Sbjct: 1970 EYGVDIFAIGIGDE 1983 >gi|320158179|ref|YP_004190557.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933491|gb|ADV88354.1| BatA [Vibrio vulnificus MO6-24/O] Length = 323 Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust. Identities = 45/170 (26%), Positives = 70/170 (41%), Gaps = 40/170 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV----------RSG 219 D+M+V+D+S SM I + D I + V NVV R G Sbjct: 87 DLMLVVDLSGSMQQE-------------DILQDGDYIDRLSAVKNVVTQFIEQRQGDRLG 133 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHI 276 LV F+ PL Q + ++N+ I G + T GL A D++ Sbjct: 134 LVLFADHAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALATKTFVDSEAP---- 189 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ +I L+DG N++ +D E+ N AK+ G +Y IG+ A Sbjct: 190 -------QRVVILLSDGSNTAGTLDPIEA---ANIAKKYGVKIYTIGIGA 229 >gi|145593798|ref|YP_001158095.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303135|gb|ABP53717.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 319 Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust. Identities = 42/136 (30%), Positives = 62/136 (45%), Gaps = 11/136 (8%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +M+ +DVS SM P D+L A S R +D +PD NV GLV F+ Sbjct: 88 VMVAVDVSTSMLAGDVDP--DRLTAAKESARRFVD---GLPDEFNV---GLVAFAGSAAV 139 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + + E I+RL+ G+T + A N A + L+ A D I+ Sbjct: 140 LVPPDTDREALDEGIDRLVEGATGVQGTAIGEAINTSLGAVKALDGEAA-KDPPPARIVL 198 Query: 290 LTDGENSSPNIDNKES 305 L+DG N+S +D E+ Sbjct: 199 LSDGANTS-GMDPMEA 213 >gi|90577284|ref|ZP_01233095.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] gi|90440370|gb|EAS65550.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] Length = 321 Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust. Identities = 46/181 (25%), Positives = 79/181 (43%), Gaps = 32/181 (17%) Query: 170 DMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 DM++ +D+S SM+ G +D+L + + ++ K R GLV F+ Sbjct: 84 DMLLAVDLSGSMSIPDMVTKNGQSIDRLTAVKHVLSDFIEKRKGD-------RLGLVLFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHD 281 PL + + ++++++R + G +ST GL A ++K Sbjct: 137 DHAYLQTPLTFDRKTVEQQLDRTVLGLIGQSTAIGEGLGIATKTFINSKAP--------- 187 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ II L+DG N+S ID E+ AK G +Y +GV ADQ ++ DR Sbjct: 188 --QRVIILLSDGANTSGVIDPLEA---AKLAKESGVKIYTVGV---GADQMVQQGFFGDR 239 Query: 342 F 342 Sbjct: 240 I 240 >gi|315649824|ref|ZP_07902907.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315274798|gb|EFU38179.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 1316 Score = 46.2 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 44/173 (25%), Positives = 81/173 (46%), Gaps = 30/173 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIV 228 D+++++D S SMND +K+ A S + +D++ D++ + G+V +SS + Sbjct: 69 DVILIIDRSGSMNDE-----NKMQSAINSAKGFIDLM----DLSKH-KVGIVDYSSANNI 118 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 +FPL+ + ++ +N L T + + K+ E + D + I+ Sbjct: 119 SSFPLSTDKEAVKNYVNGLRANGGTATGDAI----------KKARELLVNHRPDAQPVIV 168 Query: 289 FLTDGENSSPNIDNKE-SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 LTDG+ + PN + +L NEAK+ G + Y I + L A+PD Sbjct: 169 LLTDGDATEPNGNAYNYALTNSNEAKQEGIVFYTIAL--------LNTNANPD 213 >gi|42524204|ref|NP_969584.1| hypothetical protein Bd2794 [Bdellovibrio bacteriovorus HD100] gi|39576412|emb|CAE80577.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 336 Score = 46.2 bits (108), Expect = 0.009, Method: Compositional matrix adjust. Identities = 45/157 (28%), Positives = 72/157 (45%), Gaps = 15/157 (9%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+++ LDVS SM +++L A +I + + S R GLV F+ + Sbjct: 87 GIDIVICLDVSDSMLIEDMKPLNRLEAAKETIAKFISARTS-------DRIGLVVFAGES 139 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 P Q I +++N + S+ K G + +A +L K + + Sbjct: 140 FTMVPPTLDYQMILQRVNEISSASSAKIKDGTALGV-AMANAAGRL----KDSQARSRVM 194 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 IF+TDGEN+S ID + L AK G VY+IG+ Sbjct: 195 IFMTDGENNSGTIDPETGL---EIAKGYGIKVYSIGI 228 >gi|149181776|ref|ZP_01860267.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] gi|148850517|gb|EDL64676.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] Length = 949 Score = 45.8 bits (107), Expect = 0.010, Method: Compositional matrix adjust. Identities = 28/66 (42%), Positives = 33/66 (50%), Gaps = 11/66 (16%) Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T LEYA +K G D K+YIIFLTDGE +S N DNK+ Y N Sbjct: 185 TNYTQSLEYALSKF-----------SGMRDSKRYIIFLTDGEPTSLNHDNKQYTLYTNGT 233 Query: 313 KRRGAI 318 R G + Sbjct: 234 ARAGNV 239 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 45.8 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 34/110 (30%), Positives = 51/110 (46%), Gaps = 6/110 (5%) Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFYCNEAKRRGA 317 A N + E H K KKYI+F+TDG+N++ + + + C++AK +G Sbjct: 5 AKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKGI 64 Query: 318 IVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQ 365 +Y I A A Q L CAS D ++ + L AF IG + Q Sbjct: 65 EIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFEAIGAKSAAQ 114 >gi|330447847|ref|ZP_08311495.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492038|dbj|GAA05992.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 321 Score = 45.8 bits (107), Expect = 0.012, Method: Compositional matrix adjust. Identities = 46/181 (25%), Positives = 78/181 (43%), Gaps = 32/181 (17%) Query: 170 DMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 DM++ +D+S SM+ G +D+L + + ++ K R GLV F+ Sbjct: 84 DMLLAVDLSGSMSIPDMVTKNGQSIDRLTAVKHVLSDFIEKRKGD-------RLGLVLFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHD 281 PL + ++++++R + G +ST GL A ++K Sbjct: 137 DHAYLQTPLTFDRNTVEQQLDRTVLGLIGQSTAIGEGLGIATKTFINSKAP--------- 187 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ II L+DG N+S ID E+ AK G +Y +GV ADQ ++ DR Sbjct: 188 --QRVIILLSDGANTSGVIDPLEA---AKLAKESGVKIYTVGV---GADQMVQKGFFGDR 239 Query: 342 F 342 Sbjct: 240 L 240 >gi|27367909|ref|NP_763436.1| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|27359482|gb|AAO08426.1| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 323 Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust. Identities = 45/170 (26%), Positives = 70/170 (41%), Gaps = 40/170 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV----------RSG 219 D+M+V+D+S SM I + D I + V NVV R G Sbjct: 87 DLMLVVDLSGSMQQA-------------DILQDGDYIDRLSAVKNVVTQFIEQRQGDRLG 133 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHI 276 LV F+ PL Q + ++N+ I G + T GL A D++ Sbjct: 134 LVLFADHAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALATKTFVDSEAP---- 189 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ +I L+DG N++ +D E+ N AK+ G +Y IG+ A Sbjct: 190 -------QRVVILLSDGSNTAGTLDPIEA---ANIAKKYGVKIYTIGIGA 229 >gi|89072369|ref|ZP_01158948.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] gi|89051901|gb|EAR57353.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] Length = 321 Score = 45.4 bits (106), Expect = 0.013, Method: Compositional matrix adjust. Identities = 46/181 (25%), Positives = 79/181 (43%), Gaps = 32/181 (17%) Query: 170 DMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 DM++ +D+S SM+ G +D+L + + ++ K R GLV F+ Sbjct: 84 DMLLAVDLSGSMSIPDMVTKNGQSVDRLTAVKHVLSDFIEKRKGD-------RLGLVLFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHD 281 PL + + ++++++R + G +ST GL A ++K Sbjct: 137 DHAYLQTPLTFDRKTVEKQLDRTVLGLIGQSTAIGEGLGIATKTFINSKAP--------- 187 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ II L+DG N+S ID E+ AK G +Y +GV ADQ ++ DR Sbjct: 188 --QRVIILLSDGANTSGVIDPLEA---AKLAKESGVKIYTVGV---GADQMVQQGFFGDR 239 Query: 342 F 342 Sbjct: 240 I 240 >gi|86749514|ref|YP_486010.1| hypothetical protein RPB_2394 [Rhodopseudomonas palustris HaA2] gi|86572542|gb|ABD07099.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 456 Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust. Identities = 39/148 (26%), Positives = 64/148 (43%), Gaps = 22/148 (14%) Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFLTDGENS-- 296 I++KI+ L T G+ +A+ + D L AK D Y II L+DG N+ Sbjct: 309 IKDKIDALSPNGGTNQPIGMHWAWMSLQDGAP-LNTPAKDADYKYTDAIILLSDGMNTID 367 Query: 297 ---------SPNIDNKESLFYCNEAKRRGA------IVYAIGVQAEA--ADQFLKNCASP 339 S ++D ++ L C+ + A ++Y I V + + LK CA Sbjct: 368 RWYGNGSSWSKDVDARQKLL-CDNIRAASAASTTKTVIYTIQVNTDGDPESEVLKYCADS 426 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F++ + + AF +IG + K RI Sbjct: 427 GNFFATTTASGISTAFAQIGASLSKLRI 454 >gi|37680183|ref|NP_934792.1| hypothetical protein VV1999 [Vibrio vulnificus YJ016] gi|37198930|dbj|BAC94763.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 481 Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust. Identities = 31/142 (21%), Positives = 68/142 (47%), Gaps = 12/142 (8%) Query: 238 QHIQEKINRLIFGSTTKSTPGLEYA-------YNKIFDAKEKLEHIAKGHDDYK-KYIIF 289 +H + + RL+ G T + G+ +A + I+D K + E + D+ KY++ Sbjct: 340 RHFESTVQRLVPGMNTNNAEGMVWAMRLLSPYWQGIWD-KTRPELPRRYSDETSNKYLVM 398 Query: 290 LTDGENS-SPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +DG + P +K+ C + K+ RG V + A+++ +++CAS +Y V Sbjct: 399 FSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGGAASERLMQSCASGPEYYHVA 458 Query: 347 NSRKLHDAFLRIGKEMVKQRIL 368 + + F +I ++++ ++ Sbjct: 459 SLFSVEKVFEQIAEQVISSSLI 480 >gi|145299821|ref|YP_001142662.1| flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|88866595|gb|ABD57363.1| FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|142852593|gb|ABO90914.1| putative flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] Length = 460 Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust. Identities = 22/68 (32%), Positives = 39/68 (57%), Gaps = 5/68 (7%) Query: 279 GHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 G D +K ++ +DGE+ P + +++ L C E KR+G VY + + +A +F+ C Sbjct: 370 GQADNRKIMVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAFEGDA--RFVAQC 427 Query: 337 ASPDRFYS 344 AS DR ++ Sbjct: 428 AS-DRSHA 434 >gi|254505681|ref|ZP_05117827.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] gi|219551334|gb|EED28313.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] Length = 322 Score = 45.4 bits (106), Expect = 0.015, Method: Compositional matrix adjust. Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 28/164 (17%) Query: 170 DMMMVLDVSLSMN--DHFGPG--MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+M+VLD+S SM+ D G +D+L + ++ R GLV F+ Sbjct: 86 DLMLVLDLSYSMSKEDMLDDGDYVDRLTAVKK-------VVSDFASKREGDRLGLVLFAD 138 Query: 226 KIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL + I E++N+L+ G T G+ A D+ D Sbjct: 139 HAYLQTPLTLDRKTIAEQVNQLVLRLIGEKTAIGEGIGLATKTFVDS-----------DA 187 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ ++ L+DG N+S +D E+ AK+ A +Y IGV A Sbjct: 188 PQRVMVLLSDGSNTSGVLDPLEA---AKIAKKYNATIYTIGVGA 228 >gi|225621320|ref|YP_002722578.1| von Willebrand factor type A (vWA) domain-containing protein [Brachyspira hyodysenteriae WA1] gi|225216140|gb|ACN84874.1| von Willebrand factor type A (vWA) domain containing protein [Brachyspira hyodysenteriae WA1] Length = 289 Score = 45.4 bits (106), Expect = 0.015, Method: Compositional matrix adjust. Identities = 50/182 (27%), Positives = 85/182 (46%), Gaps = 32/182 (17%) Query: 131 SRY---EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDH 184 SRY ++PF+ F + + + ++ K+S SDI G+ + +V+DVS SM Sbjct: 7 SRYYVKDIPFMLIIFAL----AFSIIGLSRPAKVSHLSDINGEGVYISLVVDVSPSM--- 59 Query: 185 FGPGMDKLGVATR---SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 M + + TR S + M+D IK N + LV+F+ + P + ++ Sbjct: 60 ----MAEDMIPTRLEASKKTMIDFIKK----RNFDKISLVSFALRASVLSPATFDYTSLE 111 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 E+I ++ ++ GL A ++ + +D +K II LTDGEN+S ID Sbjct: 112 EEIKKIEIDEEGSTSIGLGIA--------TAVDMLRSVKEDNEKIIILLTDGENNSGEID 163 Query: 302 NK 303 K Sbjct: 164 PK 165 >gi|27365660|ref|NP_761188.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] gi|27361808|gb|AAO10715.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] Length = 465 Score = 45.4 bits (106), Expect = 0.015, Method: Compositional matrix adjust. Identities = 31/142 (21%), Positives = 68/142 (47%), Gaps = 12/142 (8%) Query: 238 QHIQEKINRLIFGSTTKSTPGLEYA-------YNKIFDAKEKLEHIAKGHDDYK-KYIIF 289 +H + + RL+ G T + G+ +A + I+D K + E + D+ KY++ Sbjct: 324 RHFESTVQRLVPGMNTNNAEGMVWAMRLLSPYWQGIWD-KTRPELPRRYSDETSNKYLVM 382 Query: 290 LTDGENS-SPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +DG + P +K+ C + K+ RG V + A+++ +++CAS +Y V Sbjct: 383 FSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGGAASERLMQSCASGPEYYHVA 442 Query: 347 NSRKLHDAFLRIGKEMVKQRIL 368 + + F +I ++++ ++ Sbjct: 443 SLFSVEKVFEQIAEQVISSSLI 464 >gi|117618125|ref|YP_856000.1| hypothetical protein AHA_1462 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559532|gb|ABK36480.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 460 Score = 45.4 bits (106), Expect = 0.016, Method: Compositional matrix adjust. Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 4/62 (6%) Query: 279 GHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 G D +K ++ +DGE+ P + +++ L C E KR+G VY + + +A +F+ C Sbjct: 370 GQADNRKILVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAFEGDA--RFVAQC 427 Query: 337 AS 338 AS Sbjct: 428 AS 429 >gi|320156062|ref|YP_004188441.1| hypothetical protein VVM_02402 [Vibrio vulnificus MO6-24/O] gi|319931374|gb|ADV86238.1| hypothetical protein VVMO6_01216 [Vibrio vulnificus MO6-24/O] Length = 465 Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust. Identities = 31/142 (21%), Positives = 68/142 (47%), Gaps = 12/142 (8%) Query: 238 QHIQEKINRLIFGSTTKSTPGLEYA-------YNKIFDAKEKLEHIAKGHDDYK-KYIIF 289 +H + + RL+ G T + G+ +A + I+D K + E + D+ KY++ Sbjct: 324 RHFESTVQRLVPGMNTNNAEGMVWAMRLLSPYWQGIWD-KTRPELPRRYSDETSNKYLVM 382 Query: 290 LTDGENS-SPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +DG + P +K+ C + K+ RG V + A+++ +++CAS +Y V Sbjct: 383 FSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGGAASERLMQSCASGPEYYHVA 442 Query: 347 NSRKLHDAFLRIGKEMVKQRIL 368 + + F +I ++++ ++ Sbjct: 443 SLFSVEKVFEQIAEQVISSSLI 464 >gi|312877126|ref|ZP_07737097.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311796100|gb|EFR12458.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 900 Score = 45.1 bits (105), Expect = 0.017, Method: Compositional matrix adjust. Identities = 53/195 (27%), Positives = 81/195 (41%), Gaps = 30/195 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S SM D G+ KL +A + +M++ ++S V G++ F Sbjct: 406 IDVVLVLDHSGSMADTEDAGIPKLEIAKSASAKMVEHLESSDGV------GVIAFDHNYY 459 Query: 229 QTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + V + + E I+ + G T P L A + +K K K Sbjct: 460 WAYKFGKLVRKEDVIESISSIEVGGGTAIIPPLSEAVKTLKKSKAK-----------NKL 508 Query: 287 IIFLTDG--ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 ++ LTDG E S I +EAKR + IGV L A + RF Sbjct: 509 VVLLTDGMGEQSGYEIP-------ADEAKRNNIKITTIGVGKFVNASVLSWIADYTSGRF 561 Query: 343 YSVQNSRKLHDAFLR 357 Y V N +L D FL+ Sbjct: 562 YLVSNPSELVDVFLK 576 >gi|323493494|ref|ZP_08098616.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] gi|323312317|gb|EGA65459.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] Length = 393 Score = 45.1 bits (105), Expect = 0.018, Method: Compositional matrix adjust. Identities = 30/132 (22%), Positives = 61/132 (46%), Gaps = 4/132 (3%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA--KGHDDYKKYII 288 PL + + + +NRL +T S GL + ++ + + + D+ ++ ++ Sbjct: 248 LPLTSNLNDVVDAVNRLQTIGSTASYQGLLWGLRQLTPNWQSAWRVGPNRNQDNVQRKLV 307 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG--VQAEAADQFLKNCASPDRFYSVQ 346 +TDG + + ++D + C AK G + IG VQ+ +QF + S +S Sbjct: 308 LMTDGMDDNSHLDELINAGLCTRAKDLGIELNFIGFGVQSWRLEQFTRCAGSAGAVFSAN 367 Query: 347 NSRKLHDAFLRI 358 N++ L D F ++ Sbjct: 368 NTQDLDDYFSQL 379 >gi|159036783|ref|YP_001536036.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157915618|gb|ABV97045.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 319 Score = 45.1 bits (105), Expect = 0.019, Method: Compositional matrix adjust. Identities = 41/136 (30%), Positives = 62/136 (45%), Gaps = 11/136 (8%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +M+ +DVS SM P D+L A + R +D +PD NV GLV F+ Sbjct: 88 VMVAVDVSTSMLAGDVEP--DRLTAAKEAARRFVD---GLPDEFNV---GLVAFAGSAAV 139 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + + E I+RL+ G+T + A N A + L+ A D I+ Sbjct: 140 LVPPDTDREALDEGIDRLVEGATGVQGTAIGEAINTSLGAVKALDGEAA-KDPPPARIVL 198 Query: 290 LTDGENSSPNIDNKES 305 L+DG N+S +D E+ Sbjct: 199 LSDGANTS-GMDPMEA 213 >gi|269104787|ref|ZP_06157483.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] gi|268161427|gb|EEZ39924.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] Length = 321 Score = 45.1 bits (105), Expect = 0.019, Method: Compositional matrix adjust. Identities = 43/175 (24%), Positives = 72/175 (41%), Gaps = 32/175 (18%) Query: 170 DMMMVLDVSLSM-----NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 DMM+ +D+S SM G +D+L + ++ K R GLV F Sbjct: 84 DMMLAVDLSGSMAIKDMQTQSGQSIDRLTAIKHVLSNFIEKRKG-------DRLGLVLFG 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHD 281 PL + ++++++R + G +ST GL A +K Sbjct: 137 DHAYLQTPLTFDRHTVEQQLDRTVLGLVGQSTAIGEGLGIATKTFIKSKAP--------- 187 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 ++ II L+DG N++ ID E+ AK G +Y +G+ AD+ L+ Sbjct: 188 --QRVIILLSDGANTAGVIDPLEA---AKLAKESGVTIYTVGI---GADEMLQRS 234 >gi|257469959|ref|ZP_05634051.1| hypothetical protein FulcA4_11506 [Fusobacterium ulcerans ATCC 49185] gi|317064188|ref|ZP_07928673.1| BatA protein [Fusobacterium ulcerans ATCC 49185] gi|313689864|gb|EFS26699.1| BatA protein [Fusobacterium ulcerans ATCC 49185] Length = 319 Score = 44.7 bits (104), Expect = 0.024, Method: Compositional matrix adjust. Identities = 47/166 (28%), Positives = 73/166 (43%), Gaps = 29/166 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+ + LD+S SM + F P ++L A + E +D K D R L+ F Sbjct: 81 GIDIAISLDLSQSMLQEDFTP--NRLEKAKEVLSEFID--KRTDD-----RLALIVFGGD 131 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-----GLEYAYNKIFDAKEKLEHIAKGHD 281 PL + I+E +L T +T G+ A N++ D++ K Sbjct: 132 AYTKVPLTFDHNVIKEMTGKLTVDDITSNTRTAIGMGIGVALNRLKDSEAK--------- 182 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K II LTDGEN+S + + + AK G +Y IG+ A+ Sbjct: 183 --SKVIILLTDGENNSGEMSPSAA---ADIAKELGIKIYTIGIGAK 223 >gi|238063244|ref|ZP_04607953.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885055|gb|EEP73883.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 265 Score = 44.7 bits (104), Expect = 0.025, Method: Compositional matrix adjust. Identities = 44/144 (30%), Positives = 62/144 (43%), Gaps = 11/144 (7%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +M+ +DVS SM P D+L A + R +D +PD NV GLV F+ Sbjct: 88 VMVAVDVSTSMLASDVKP--DRLSAAKDAARRFVD---GLPDEFNV---GLVAFAGSAAV 139 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + + + I RL GST + A N A L+ A D II Sbjct: 140 LVPPGTDREALHDGIERLAEGSTGVQGTAIGEAINTSLGAVRGLDSQAA-KDLPPARIIL 198 Query: 290 LTDGENSSPNIDNKESLFYCNEAK 313 L+DG N+S +D E+ +AK Sbjct: 199 LSDGANTS-GMDPMEAAAEAVDAK 221 >gi|78484419|ref|YP_390344.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362705|gb|ABB40670.1| Type A von Willebrand factor-like [Thiomicrospira crunogena XCL-2] Length = 349 Score = 44.7 bits (104), Expect = 0.025, Method: Compositional matrix adjust. Identities = 44/170 (25%), Positives = 75/170 (44%), Gaps = 19/170 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G D+M+ +D+S SM M GV + + ++K+ R GLV F S+ Sbjct: 102 GKDLMLAVDLSGSMEKT---DMPLRGVEVDRLTAVKSVVKNFIQKRQGDRMGLVVFGSQA 158 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA-KEKLEHIAKGHDDYKKY 286 PL + + ++ +N G +T I DA L+H+ + + + K Sbjct: 159 FLQSPLTYDLNTVETLLNETEIGMAGNNT--------AIGDAIGIALKHLHQ-NSEKKAV 209 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFL 333 +I LTDG N++ + + L +A+ G +Y IG+ QA D F+ Sbjct: 210 LILLTDGSNTAGAV---QPLDAAKQAQEMGLKIYTIGIGQNQATGLDAFI 256 >gi|312793553|ref|YP_004026476.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180693|gb|ADQ40863.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 726 Score = 44.7 bits (104), Expect = 0.025, Method: Compositional matrix adjust. Identities = 41/153 (26%), Positives = 65/153 (42%), Gaps = 19/153 (12%) Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R+ +V F PL Q ++ I+R+ T G+ A +++ I Sbjct: 74 RAAVVDFDDYGYLLQPLTTDFQTVKNAIDRIDSWGGTNIAEGIRIANHQL---------I 124 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 ++ DD K II LTDGE N EAK G +Y IG+ + L+N Sbjct: 125 SQSSDDRIKVIILLTDGEGYYDN-------NLTTEAKNNGITIYTIGLGTSVDENLLRNI 177 Query: 337 ASP--DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A+ ++ V ++ +L F RI E+V + I Sbjct: 178 ATQTGGMYFPVSSASQLPQVFKRI-TEIVTEPI 209 >gi|306824220|ref|ZP_07457590.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309801684|ref|ZP_07695804.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|304552423|gb|EFM40340.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221626|gb|EFO77918.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 967 Score = 44.7 bits (104), Expect = 0.026, Method: Compositional matrix adjust. Identities = 56/182 (30%), Positives = 80/182 (43%), Gaps = 29/182 (15%) Query: 169 LDMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIK----SIPDVNNVVRSGLVT 222 +D+ +VLDVS SMND FG K+ ++ LD +I D NN V+ LV Sbjct: 244 IDIALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNNKVKVALVK 303 Query: 223 FSSKIVQTFPLAWG--------------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++++I T A G Q +QE G S GL+ A D Sbjct: 304 YANQI-GTATGADGCRISNSRQSDTGNCTQIVQELTTDA--GLLKTSVNGLQAAGATYAD 360 Query: 269 AKEKL--EHIAKGHDDYKKYIIFLTDGE-NSSPNIDN---KESLFYCNEAKRRGAIVYAI 322 A ++ + +A G KKY+IF TDGE N D+ ++ E K G VY+I Sbjct: 361 AAMEVAQQALAGGRAGAKKYVIFFTDGEPNHWSGFDDDVANAAIKKSQELKNAGTTVYSI 420 Query: 323 GV 324 G+ Sbjct: 421 GI 422 >gi|83816834|ref|YP_446668.1| von Willebrand factor type A domain-containing protein [Salinibacter ruber DSM 13855] gi|83758228|gb|ABC46341.1| von Willebrand factor type A domain protein [Salinibacter ruber DSM 13855] Length = 289 Score = 44.7 bits (104), Expect = 0.026, Method: Compositional matrix adjust. Identities = 47/165 (28%), Positives = 70/165 (42%), Gaps = 35/165 (21%) Query: 168 GLDMMMVLDVSLSMN-DHFGPGMDKLGVATR--SIREMLD--IIKSIPDVNNVVRSGLVT 222 G+D+MMVLD S SM + F P TR + RE + + D R GL+ Sbjct: 50 GIDIMMVLDASTSMQAEDFQP--------TRFEAAREAAGAFVEGRVSD-----RVGLIV 96 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKG 279 F+++ PL +Q + + G+ T L A N++ D++ + Sbjct: 97 FAAEAYTQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEAE------- 149 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K I LTDG N+ ID + + A+ G VYAIGV Sbjct: 150 ----SKVAILLTDGRNNRGQIDPRTA---AEVARTMGVRVYAIGV 187 >gi|47208180|emb|CAF89812.1| unnamed protein product [Tetraodon nigroviridis] Length = 1636 Score = 44.3 bits (103), Expect = 0.032, Method: Compositional matrix adjust. Identities = 42/163 (25%), Positives = 77/163 (47%), Gaps = 24/163 (14%) Query: 200 REMLDIIKSIPDVNNV----VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-----FG 250 ++M D +KS+ +N+ V G++ +S++ FPL + ++++++ I G Sbjct: 671 QKMKDFMKSLVQKSNIGKDQVHVGVLQYSTEQKLVFPLIQ--YYTKDQLSKAIDDMQQIG 728 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + + +K FDA+ G D K+ ++ +TDGE+ E+L Sbjct: 729 GGTHTGEAIAVV-SKYFDAQNG------GRPDLKQRLVVVTDGESQDDVKLPAEAL---- 777 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + +G IVY+IGV A Q L+ DR Y+ ++ L D Sbjct: 778 --RAKGVIVYSIGVVAANTSQLLEISGDADRMYAERDFDALKD 818 Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust. Identities = 39/152 (25%), Positives = 68/152 (44%), Gaps = 18/152 (11%) Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKS 255 I E L + P N VR G+V ++ F L V+ +++ I + G T++ Sbjct: 482 ILEFLQTFRVGP---NHVRIGVVKYADSPTLEFDLHTYTDVKSLEKAITNIHQVGGGTET 538 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 L++ + FD +GH K+Y++ +TDG ++ D + L + + Sbjct: 539 GKALDFMRPQ-FDRAV----TTRGHK-VKEYLVVITDGNSTDKVKDPADKL------RAQ 586 Query: 316 GAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 G +VYAIGV+ + L+ P R + V N Sbjct: 587 GVVVYAIGVKDAVEKELLEISGEPQRTFYVNN 618 >gi|46199004|ref|YP_004671.1| hypothetical protein TTC0696 [Thermus thermophilus HB27] gi|46196628|gb|AAS81044.1| hypothetical membrane associated protein [Thermus thermophilus HB27] Length = 706 Score = 44.3 bits (103), Expect = 0.032, Method: Compositional matrix adjust. Identities = 55/206 (26%), Positives = 89/206 (43%), Gaps = 27/206 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G +++VLDVS SM +KL +A + L +++S + R G+V FSS Sbjct: 303 GAALVLVLDVSGSMAG------EKLSMA---VAGALALVESAAPED---RLGVVVFSSGH 350 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 FP +++ L+ L A+ + +L H G +K + Sbjct: 351 RVLFPPRPMTAQAKKEAESLLLSLRAGGGTVLGGAFREAV----RLLHGVPGE---RKAV 403 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV 345 + LTDG + D KE + + A+ G V A+ + +A FLK A RFY Sbjct: 404 LVLTDGLIA----DAKEPIL--DLAQTSGVEVSALALGPDADAPFLKELARRGGGRFYQA 457 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + R+L FLR G+E+ + L + Sbjct: 458 PSPRELPRLFLREGQEVFRGEALEGR 483 >gi|294508603|ref|YP_003572662.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] gi|294344932|emb|CBH25710.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] Length = 317 Score = 44.3 bits (103), Expect = 0.035, Method: Compositional matrix adjust. Identities = 47/165 (28%), Positives = 70/165 (42%), Gaps = 35/165 (21%) Query: 168 GLDMMMVLDVSLSMN-DHFGPGMDKLGVATR--SIREMLD--IIKSIPDVNNVVRSGLVT 222 G+D+MMVLD S SM + F P TR + RE + + D R GL+ Sbjct: 78 GIDIMMVLDASTSMQAEDFQP--------TRFEAAREAAGAFVEGRVSD-----RVGLIV 124 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKG 279 F+++ PL +Q + + G+ T L A N++ D++ + Sbjct: 125 FAAEAYTQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEAE------- 177 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K I LTDG N+ ID + + A+ G VYAIGV Sbjct: 178 ----SKVAILLTDGRNNRGQIDPRTA---AEVAQTMGVRVYAIGV 215 >gi|281340555|gb|EFB16139.1| hypothetical protein PANDA_003424 [Ailuropoda melanoleuca] Length = 191 Score = 43.9 bits (102), Expect = 0.038, Method: Compositional matrix adjust. Identities = 41/153 (26%), Positives = 62/153 (40%), Gaps = 11/153 (7%) Query: 203 LDIIKSIPDV-----NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 +DI + DV N VR +T+S+ + I+E + +L Sbjct: 17 MDIYNMVEDVVKKFDNPKVRISFITYSTDGHTLMKITSDKNEIRENLAKL----QNVVPS 72 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 G + + A E++E G I+ LTDG + +E+ E++R GA Sbjct: 73 GATHMQEGLRKANEQIEQENAGEKKAPIVILALTDG--TLLPFPFEETKMEAEESRRLGA 130 Query: 318 IVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 VY IGV+ DQ L SPD + V N K Sbjct: 131 TVYCIGVKDYRKDQLLDIADSPDHMFGVDNGFK 163 >gi|55981030|ref|YP_144327.1| hypothetical protein TTHA1061 [Thermus thermophilus HB8] gi|55772443|dbj|BAD70884.1| conserved hypothetical protein [Thermus thermophilus HB8] Length = 706 Score = 43.9 bits (102), Expect = 0.040, Method: Compositional matrix adjust. Identities = 53/206 (25%), Positives = 88/206 (42%), Gaps = 27/206 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G +++VLDVS SM +KL +A + L +++S + R G+V FSS Sbjct: 303 GAALVLVLDVSGSMAG------EKLSMA---VAGALALVESAAPED---RLGVVVFSSGH 350 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 FP +++ L+ L A+ + + +G +K + Sbjct: 351 RVLFPPRPMTAQAKKEAESLLLSLRAGGGTVLGGAFREAV-------RLLQGVPGERKAV 403 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV 345 + LTDG + D KE + + A+ G V A+ + +A FLK A RFY Sbjct: 404 LVLTDGLIA----DAKEPIL--DLAQTSGVEVSALALGPDADAPFLKELARRGGGRFYQA 457 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + R+L FLR G+E+ + L + Sbjct: 458 PSPRELPRLFLREGQEVFRGEALEGR 483 >gi|209809314|ref|YP_002264852.1| hypothetical protein VSAL_II0524 [Aliivibrio salmonicida LFI1238] gi|208010876|emb|CAQ81278.1| putative membrane protein [Aliivibrio salmonicida LFI1238] Length = 320 Score = 43.9 bits (102), Expect = 0.042, Method: Compositional matrix adjust. Identities = 43/167 (25%), Positives = 72/167 (43%), Gaps = 34/167 (20%) Query: 170 DMMMVLDVSLSMNDH-------FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 DMM+V+D+S SM++ F +D+L R + + ++ K R GLV Sbjct: 84 DMMLVVDLSGSMSEEDMKTDSGF---VDRLTAVKRVVSDFIEKRKGD-------RLGLVL 133 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKG 279 F PL + +QE++NR + G + T GL A ++ Sbjct: 134 FGDHAYLQTPLTFDRNTVQEQLNRTVLGLVGQRTAIGEGLGLATKTFIESNAP------- 186 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ II L+DG N++ +D E+ AK A +Y +G+ A Sbjct: 187 ----QRTIILLSDGANTAGVLDPIEA---AQLAKDNNAKIYTVGIGA 226 >gi|295092462|emb|CBK78569.1| von Willebrand factor type A domain. [Clostridium cf. saccharolyticum K10] Length = 2061 Score = 43.9 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 55/215 (25%), Positives = 98/215 (45%), Gaps = 33/215 (15%) Query: 169 LDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGL 220 +D++ V+D SLSM+ D D+ + + LD I IPD+ + ++ Sbjct: 1524 VDLVFVIDKSLSMDYDIDGDEIKWWEDETESRKDIVNDALDEI--IPDLCSQQYDIQIAG 1581 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS T L W + Q+ +N L +T+ +T + DA + L+ ++ H Sbjct: 1582 YQFSGS--STRVLDWSREE-QQVLNNLKISNTSYNTE----PSQALADALDMLKTGSQAH 1634 Query: 281 ---DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + KKY+IF+TDGE P + S + ++ GA +Y IGV ++A+ ++ Sbjct: 1635 QNQSNVKKYLIFMTDGE---PTESEELSYYAISKNPVPGASIYTIGVSSDASTDLMEGIR 1691 Query: 338 S---------PDRFYSVQNSRKLHDAFLRIGKEMV 363 S P F +++ + DAF +I E++ Sbjct: 1692 STAEGNGMTAPATFKGT-SAQLIKDAFTQIKDEII 1725 Score = 38.5 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 53/205 (25%), Positives = 80/205 (39%), Gaps = 16/205 (7%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVV 216 SV K +M V+D S SM+ FG G D S E+ + D + + Sbjct: 1093 SVTTGQKDPTPTAVMFVIDKSGSMDQSFGSGNSDARREVVNSALELF--FNQLSDGDYNI 1150 Query: 217 RSGLVTFSSKIVQTFPLAWGVQH------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + G FS + WG Q ++ L T+ T G Y + A Sbjct: 1151 QFGGYKFSDSGERVNFNDWGWQDKYWETDTSNALSHLKL--TSWETDGSTYPSQTLRSAI 1208 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNID-NKESLFYCNEAKRR---GAIVYAIGVQA 326 LE++ G + K+Y+IFLTDGE + +KE C A + G YAI V Sbjct: 1209 SALENVELGENG-KRYLIFLTDGEPGQNSYSFSKEEAENCYSAIKNLDSGTTFYAIQVAN 1267 Query: 327 EAADQFLKNCASPDRFYSVQNSRKL 351 + F+++ S F ++K Sbjct: 1268 SDSHGFMESMVSNANFVDGVTAQKF 1292 >gi|32394600|gb|AAM93998.1| proximal thread matrix protein 1 [Griffithsia japonica] Length = 218 Score = 43.9 bits (102), Expect = 0.045, Method: Compositional matrix adjust. Identities = 33/122 (27%), Positives = 52/122 (42%), Gaps = 10/122 (8%) Query: 233 LAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 A GV+ IQ L F + + L N IF K + +I +T Sbjct: 76 FASGVKLIQAPTQSLSTFNTAVNTVSPLNGGTN-IFRGLRGCYQQLKTKPMTDRVLILVT 134 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY-SVQNSRK 350 DG P + YCN K +G ++ +G+ FLKNCA+ + FY +V+++ + Sbjct: 135 DGFGGQP-------INYCNFIKSKGILLVTVGIGTSINQNFLKNCATSEEFYINVKDTGR 187 Query: 351 LH 352 H Sbjct: 188 PH 189 >gi|91201135|emb|CAJ74194.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 333 Score = 43.5 bits (101), Expect = 0.047, Method: Compositional matrix adjust. Identities = 43/158 (27%), Positives = 71/158 (44%), Gaps = 22/158 (13%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +D S SM D P ++L VA R I ++L I++ R GL+ F+ + Sbjct: 88 GIDIMIAVDTSRSMLADDVKP--NRLEVAKREIEDLLKILEG-------DRVGLIAFAGR 138 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL + +N L + A K DA + +++K Sbjct: 139 AFTYCPLTSDYSAFRLFLNDLNVNIIPVGGTAIAEAIYKGIDA------FGENENNHKAM 192 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 II +TDGEN + L ++AK +G ++Y +GV Sbjct: 193 II-ITDGENHETD-----PLKAASKAKEKGIVIYTVGV 224 >gi|149911739|ref|ZP_01900346.1| von Willebrand factor type A domain protein [Moritella sp. PE36] gi|149805212|gb|EDM65230.1| von Willebrand factor type A domain protein [Moritella sp. PE36] Length = 330 Score = 43.5 bits (101), Expect = 0.052, Method: Compositional matrix adjust. Identities = 44/190 (23%), Positives = 87/190 (45%), Gaps = 35/190 (18%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P+ I I + G +MM+ +D+S SM + + + R + + L ++K Sbjct: 67 AAARPMWIGEPQSIPQQ---GREMMLAVDLSRSMQ------AEDMQINNRMV-DRLSLVK 116 Query: 208 SIPDVNNVV------RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---G 258 ++ V + + R GL+ F+ PL + ++ + + + + G + T G Sbjct: 117 TV--VADFIQQRKGDRVGLIFFADNAYLQAPLTFDLKTVSGYMQQAVLGLVGEQTAIGEG 174 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + A + FDA D+ +K +I LTDG+NS+ + ++ + E +G Sbjct: 175 IGLALKR-FDAA----------DNPQKVLILLTDGQNSAGEVKPLDAAKFAQE---QGVK 220 Query: 319 VYAIGVQAEA 328 +Y IGV A+A Sbjct: 221 IYTIGVGADA 230 >gi|332307030|ref|YP_004434881.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] gi|332174359|gb|AEE23613.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] Length = 338 Score = 43.5 bits (101), Expect = 0.053, Method: Compositional matrix adjust. Identities = 56/229 (24%), Positives = 98/229 (42%), Gaps = 38/229 (16%) Query: 137 FIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + W A +S P + V I ++ G D+M+ +D+S SM +D + V Sbjct: 56 LLVAALAWIALVGASARPQWLGEPVSIPAQ---GRDLMIAVDLSGSMK------IDDMQV 106 Query: 195 ATRSIREMLDIIKSIPDVNNVV------RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 R + + L +IKS+ +++ + R GL+ F+ PL + + + + + + Sbjct: 107 NGRQV-DRLQMIKSV--LHDFIQRRVGDRLGLIFFADTAYLQAPLTYDRETVSQLLGESL 163 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G + T I DA + + K +I LTDG+N++ NI ++ Sbjct: 164 IGLVGEQT--------AIGDAIGLAIKRFQSKKESNKVLILLTDGQNTAGNISPQQ---- 211 Query: 309 CNE-AKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL 356 NE A G +Y IGV ADQ + R V S++L ++ L Sbjct: 212 ANELAINNGVTLYTIGV---GADQMMVQSIFGSR--QVNPSQELDESML 255 >gi|156742542|ref|YP_001432671.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233870|gb|ABU58653.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 547 Score = 43.5 bits (101), Expect = 0.054, Method: Compositional matrix adjust. Identities = 47/178 (26%), Positives = 76/178 (42%), Gaps = 19/178 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+DVS SM DKL A + L I +P+ R GLVTFS++ Sbjct: 373 DILLVVDVSGSMEG------DKLEAAKAGLGTFLSRI--LPED----RVGLVTFSTESRL 420 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P A + + RL G Y+ + D KE L+ + DD + I+ Sbjct: 421 VVPPA----PLSDTRIRLDDAIAVMRAQGRTALYDALIDGKEALDSLPSTGDDRIRAIVL 476 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 L+DG ++S ++ E+ G ++ + A+A L+ A+ R VQ Sbjct: 477 LSDGLDNSSRATLEQVRLAFEES---GISIFPVAYGADADTDALQQIATFSRTILVQG 531 >gi|189219434|ref|YP_001940075.1| hypothetical protein Minf_1423 [Methylacidiphilum infernorum V4] gi|189186292|gb|ACD83477.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 334 Score = 43.1 bits (100), Expect = 0.061, Method: Compositional matrix adjust. Identities = 41/142 (28%), Positives = 69/142 (48%), Gaps = 15/142 (10%) Query: 168 GLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G D+++VLD+S SM D+ +D+ V+ I +L+++K+ D R GLV F+ Sbjct: 86 GYDIILVLDISGSMLAEDY---EIDQKRVSRLDI--VLEVVKTFLDKRTNDRIGLVAFAG 140 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL + ++ KI++L G+ T L A +++ KE E G Sbjct: 141 RAYTVCPLTFDHNWLKRKIDQLQAGTIEDGTAIGDALGLALSRLEGKKESGERKKIGS-- 198 Query: 283 YKKYIIFLTDGENSSPNIDNKE 304 ++I LTDG N+ N+ E Sbjct: 199 ---FLILLTDGANNCGNLTPIE 217 >gi|260769474|ref|ZP_05878407.1| protein BatA [Vibrio furnissii CIP 102972] gi|260614812|gb|EEX39998.1| protein BatA [Vibrio furnissii CIP 102972] gi|315182004|gb|ADT88917.1| von Willebrand factor type A domain protein [Vibrio furnissii NCTC 11218] Length = 322 Score = 43.1 bits (100), Expect = 0.062, Method: Compositional matrix adjust. Identities = 42/162 (25%), Positives = 72/162 (44%), Gaps = 24/162 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD--IIKSIPDVNNVVRSGLVTFSSKI 227 DMM+V+D+S SM+ D+ +++++L I K D R GL+ F+ Sbjct: 86 DMMLVVDLSYSMSQQDMKSGDQFIDRLSAVKQVLSDFIAKRQGD-----RLGLIFFADHA 140 Query: 228 VQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL Q I +++N+ + G+ T G+ A D+ D + Sbjct: 141 YLQTPLTLDRQTIAQQLNQAVLRLIGTQTAIGEGIGLATKTFIDS-----------DAPQ 189 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + +I L+DG N+S +D E+ AK+ +Y +GV A Sbjct: 190 RVMILLSDGSNTSGVLDPMEA---AKIAKKYHTTIYTVGVGA 228 >gi|127513358|ref|YP_001094555.1| von Willebrand factor, type A [Shewanella loihica PV-4] gi|126638653|gb|ABO24296.1| von Willebrand factor, type A [Shewanella loihica PV-4] Length = 339 Score = 43.1 bits (100), Expect = 0.062, Method: Compositional matrix adjust. Identities = 53/236 (22%), Positives = 97/236 (41%), Gaps = 51/236 (21%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHF--GPGMDKLGVATRSIREMLDIIK 207 PL + ++++ SK G D+M+ +D+S SM D G +D+ + + + E ++ K Sbjct: 76 PLWVGDAIELPSK---GRDLMLAVDLSGSMQIEDMVLNGKAVDRFAMVQQVMSEFIERRK 132 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYN 264 + GL+ F+ PL + + + + G K T + A Sbjct: 133 GD-------KLGLILFADHAYLQAPLTQDRRSVAQFLTEAQIGLVGKQTAIGEAIALAVK 185 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + AK+ + +I LTDG N+S +I +++ + A +RG +Y IGV Sbjct: 186 RFDKAKQS-----------NRVLILLTDGSNNSGSITPEQA---ADIAAKRGVTIYTIGV 231 Query: 325 QAEAADQ---FLKNCASPD-----------------RFYSVQNSRKLHDAFLRIGK 360 AE ++ F K +P R++ +NS +L + I K Sbjct: 232 GAEVMERRTLFGKERVNPSMDLDEAQLTLLAQKTKGRYFRARNSDELEQIYQEIDK 287 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 43.1 bits (100), Expect = 0.063, Method: Compositional matrix adjust. Identities = 52/201 (25%), Positives = 87/201 (43%), Gaps = 30/201 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ VLD S SM+ + G K +A +S + L I+ R+ +V F + Sbjct: 498 IDLVFVLDSSGSMSWNDPNGYRK--IAAKSFVDAL--IQG-------DRAAVVDFDNFGY 546 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL Q ++ I+R+ T G+ A ++ I++ +D K II Sbjct: 547 LLQPLTTDFQAVKNAIDRIDSWGGTNIAEGIRIANQQL---------ISRSSEDRIKVII 597 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQ 346 LTDGE N EAK G +Y IG+ + L++ A+ ++ V Sbjct: 598 LLTDGEGYYDNN-------LTTEAKNNGITIYTIGLGTSVDENLLRDIATQTGGMYFPVS 650 Query: 347 NSRKLHDAFLRIGKEMVKQRI 367 ++ +L F RI E+V + I Sbjct: 651 SASQLPQVFKRI-TEIVTEPI 670 >gi|32475534|ref|NP_868528.1| BatB [Rhodopirellula baltica SH 1] gi|32446076|emb|CAD75905.1| BatB [Rhodopirellula baltica SH 1] Length = 747 Score = 43.1 bits (100), Expect = 0.063, Method: Compositional matrix adjust. Identities = 45/162 (27%), Positives = 75/162 (46%), Gaps = 29/162 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G++ + VLDVS SM + P ++LG A + I++M+D + R GLV F+ + Sbjct: 126 GIEAVFVLDVSRSMLAEDVSP--NRLGRAKQQIKDMVDEMPGD-------RVGLVVFAGE 176 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 QT PL V+ ++ ++ + S + L A DA + K D K Sbjct: 177 TRQTLPLTRHVEDFKQTLDSVGIHSVRRGGSRLGDAIRVASDA-----FLDKTTD--HKA 229 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKR----RGAIVYAIGV 324 ++ LTDGE+ + +EAKR +G ++ IG+ Sbjct: 230 MVILTDGEDQESD--------PVSEAKRAYEEQGIRIFTIGL 263 >gi|81897704|sp|Q8BVM2|ANTRL_MOUSE RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|26346064|dbj|BAC36683.1| unnamed protein product [Mus musculus] Length = 641 Score = 43.1 bits (100), Expect = 0.068, Method: Compositional matrix adjust. Identities = 48/178 (26%), Positives = 77/178 (43%), Gaps = 20/178 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D+ +VLD S S+ D++ + S E L +K + N +R ++T+S++ Sbjct: 75 FDLYLVLDKSGSVADNW--------IHIYSFAEGL--VKKFTNPN--LRISIITYSTEAE 122 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL + I + + L+ S GL + + A E++ G II Sbjct: 123 VILPLTSDSKEINKSL--LVLKSIVPQ--GLTHMQKGLRKANEQIRKSTLGGRIVNSVII 178 Query: 289 FLTDGENS-SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 LTDG P +D E +A+R GAIVY +GV + Q + PDR + V Sbjct: 179 ALTDGLLLLKPYLDTMEEA---KKARRMGAIVYTVGVFMYSKQQLVNIAGDPDRCFGV 233 >gi|327542236|gb|EGF28725.1| BatB protein [Rhodopirellula baltica WH47] Length = 700 Score = 43.1 bits (100), Expect = 0.069, Method: Compositional matrix adjust. Identities = 45/162 (27%), Positives = 75/162 (46%), Gaps = 29/162 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G++ + VLDVS SM + P ++LG A + I++M+D + R GLV F+ + Sbjct: 79 GIEAVFVLDVSRSMLAEDVSP--NRLGRAKQQIKDMVDEMPGD-------RVGLVVFAGE 129 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 QT PL V+ ++ ++ + S + L A DA + K D K Sbjct: 130 TRQTLPLTRHVEDFKQTLDSVGIHSVRRGGSRLGDAIRVASDA-----FLDKTTD--HKA 182 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKR----RGAIVYAIGV 324 ++ LTDGE+ + +EAKR +G ++ IG+ Sbjct: 183 MVILTDGEDQESD--------PVSEAKRAHEEQGIRIFTIGL 216 >gi|189485266|ref|YP_001956207.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287225|dbj|BAG13746.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 333 Score = 43.1 bits (100), Expect = 0.073, Method: Compositional matrix adjust. Identities = 48/179 (26%), Positives = 74/179 (41%), Gaps = 31/179 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 SD G+D+++ LD S SM ++++ A + IR+ + K R GLV FS Sbjct: 86 SDQGIDIIVALDTSTSMRSLDFRSLNRMEAAKKVIRDFMKERK-------YDRIGLVIFS 138 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 PL + E IN + G T T + + N++ D++ K Sbjct: 139 GLAFTQCPLTTDKDSLAEFINNINIGDTGLDGTAIGSAIMTSVNRLKDSRAK-------- 190 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------EAADQFL 333 + II +TDG N+ ID L A+ +YA+GV + E D FL Sbjct: 191 ---SRIIILVTDGNNNMGEID---PLTASKIARSYDIKIYAVGVGSLDGAIYEVDDPFL 243 >gi|88858061|ref|ZP_01132703.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] gi|88819678|gb|EAR29491.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] Length = 328 Score = 43.1 bits (100), Expect = 0.077, Method: Compositional matrix adjust. Identities = 42/190 (22%), Positives = 78/190 (41%), Gaps = 22/190 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G D+M+ +D+S SM + M G + + ++K+ R GL+ F Sbjct: 86 GRDIMLAVDLSGSMVEQ---DMAYQGRYVDRLSMVKAVLKNFIAQRQGDRLGLILFGDTA 142 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + + + G ++T I DA D + + Sbjct: 143 FLQTPLTRDLNTVSKMLEEAQIGLVGRAT--------AIGDALGLAVKRFSQKQDSNRIL 194 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + LTDGEN++ N+ +E+L A+ G VY +GV ++ ++F +S+ Sbjct: 195 VLLTDGENTAGNLAPEEALLL---AREEGIKVYTVGVGSQGGNRF--------NLFSMSG 243 Query: 348 SRKLHDAFLR 357 S L ++ L+ Sbjct: 244 SSSLDESLLQ 253 >gi|322378392|ref|ZP_08052846.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] gi|322380073|ref|ZP_08054329.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321147480|gb|EFX42124.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321149148|gb|EFX43594.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] Length = 236 Score = 43.1 bits (100), Expect = 0.079, Method: Compositional matrix adjust. Identities = 51/213 (23%), Positives = 90/213 (42%), Gaps = 26/213 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + + ++LD S SM+ + G ++G ++ M+D++K NV + ++TF + V Sbjct: 15 IPIFLLLDTSSSMSTNMNGGQTRIGCLNDCVQTMIDLLKEEAKRENVSKLAVITFGAGGV 74 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE----YAYNK-IFDAKEKLEHIAKGHDDY 283 + ++ IQ + L G T LE Y NK F K Y Sbjct: 75 KLQTPLSKIESIQ--FSPLGTGGNTPLGMALELTRDYIQNKDTFPGKF-----------Y 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL--FYCNEAKRRG-AIVYA--IGVQAEAADQFLKNCAS 338 Y++ ++DGE PN D + L F N+ R ++ Y+ IG + E S Sbjct: 122 TPYVVMVSDGE---PNDDWQGPLHDFIHNKENRSSKSVRYSVFIGNEGEEPQAVHDFSGS 178 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 P++ Y + + L + F I + + R + K Sbjct: 179 PNQVYYANDVQSLINCFKAITASVTQGRKITAK 211 >gi|242034241|ref|XP_002464515.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] gi|241918369|gb|EER91513.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] Length = 704 Score = 42.7 bits (99), Expect = 0.083, Method: Compositional matrix adjust. Identities = 42/148 (28%), Positives = 69/148 (46%), Gaps = 25/148 (16%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +LI V ++ LD++ VLDVS SM+ GP KL + R++R +++ ++ Sbjct: 218 ILIHLRVPTWVRTRAPLDLVTVLDVSRSMS---GP---KLALLKRAMRFVIE------NL 265 Query: 213 NNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 R +V FSS + FPL A+G Q Q+ ++ L+ T GL A + D Sbjct: 266 EPSDRLSVVAFSSSACRLFPLRKMTAFGQQQSQQAVDSLVADGGTNIAEGLRKAARVVED 325 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + + II L+DG +S Sbjct: 326 RQAR---------NPVCSIILLSDGVDS 344 >gi|237737388|ref|ZP_04567869.1| BatA protein [Fusobacterium mortiferum ATCC 9817] gi|229421250|gb|EEO36297.1| BatA protein [Fusobacterium mortiferum ATCC 9817] Length = 319 Score = 42.7 bits (99), Expect = 0.083, Method: Compositional matrix adjust. Identities = 54/216 (25%), Positives = 89/216 (41%), Gaps = 48/216 (22%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+++ LD+S SM F P ++L A + + E +D K I D R LV F Sbjct: 81 GIDIVVALDLSQSMLQRDFKP--NRLETAKKLLEEFID--KRIND-----RISLVVFGGD 131 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-----GLEYAYNKIFDAKEKLEHIAKGHD 281 PL + +++ ++L T + GL + N++ D++ K Sbjct: 132 AYTKVPLTFDHNVVKDITSKLTTDDITSNNRTAIGMGLGVSLNRLKDSEAK--------- 182 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--------------- 326 K II +TDGEN+S + + AK G +Y IG+ A Sbjct: 183 --SKVIILMTDGENNSGEMS---PMGASEIAKELGIKIYTIGIGAREIQIRVPFGHTTVK 237 Query: 327 --EAADQFLKNCASP--DRFYSVQNSRKLHDAFLRI 358 E + LKN AS ++ + ++ + F RI Sbjct: 238 NTELDENLLKNIASTTGGEYFRAGSEKEFQEIFNRI 273 >gi|313681552|ref|YP_004059290.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154412|gb|ADR33090.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 311 Score = 42.7 bits (99), Expect = 0.087, Method: Compositional matrix adjust. Identities = 52/199 (26%), Positives = 88/199 (44%), Gaps = 32/199 (16%) Query: 168 GLDMMMVLDVSLSMN-DHFG---PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+D+++ LD S SMN F P + + V + + ++K I D NV G+V F Sbjct: 89 GIDIVLSLDGSGSMNASGFSKEEPRLSRFEVVQKIASDF--VMKRIED--NV---GVVLF 141 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGH 280 P+ + + + E I L G ++T G+ + D+K K Sbjct: 142 GDFAFIATPVTYEKEIVSEMIGYLSHGMAGQNTAIGEGIAMGVRALRDSKAK-------- 193 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAEAADQFLKNCA- 337 K II LTDGE++S +I KE++ + R +Y IG+ + E + LK A Sbjct: 194 ---SKVIILLTDGEHNSGSISPKEAVAMVGKEHIR---LYTIGIGQKGEFDNALLKQLAH 247 Query: 338 -SPDRFYSVQNSRKLHDAF 355 +F++ N ++L + Sbjct: 248 DGHGKFFAAANEKELQSVY 266 >gi|119358220|ref|YP_912864.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355569|gb|ABL66440.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 344 Score = 42.7 bits (99), Expect = 0.098, Method: Compositional matrix adjust. Identities = 46/173 (26%), Positives = 77/173 (44%), Gaps = 21/173 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+++VLDVS SM+ G +L A + + + + +S R GLV FS K Sbjct: 102 GIDLLLVLDVSRSMHQQDFNGQSRLE-AVKGVGKQFVLSRSAD------RIGLVVFSGKG 154 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTK--STPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL ++ L G+ S+ ++ I A + + + +K Sbjct: 155 YTPCPLT---------LDHLTLGTVLDNISSEVIQEEGTAIGTAILIAVNRLRASESRQK 205 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 II LTDG+N++ +ID L A + G +Y I A+ A F+++ S Sbjct: 206 AIILLTDGQNNAGDID---PLTAAGFALQDGIKIYTIAATAQDARPFVRSAES 255 >gi|262403351|ref|ZP_06079911.1| protein BatA [Vibrio sp. RC586] gi|262350850|gb|EEY99983.1| protein BatA [Vibrio sp. RC586] Length = 248 Score = 42.7 bits (99), Expect = 0.10, Method: Compositional matrix adjust. Identities = 39/167 (23%), Positives = 72/167 (43%), Gaps = 34/167 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV-------NNVVRSGLVT 222 D+M+V+D+S SM+ +S ++M+D + ++ V R GL+ Sbjct: 15 DLMLVVDLSYSMSQE----------DMQSGQQMVDRLTAVKQVLSEFIAQREGDRIGLIL 64 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F+ PL Q + E++N+ + G+ T G+ A D+ Sbjct: 65 FADHAYLQTPLTLDRQTVTEQLNQAVLKLIGTQTAMGEGIGLATKTFIDSAAP------- 117 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ +I L+DG N++ +D E+ N AK+ +Y +GV A Sbjct: 118 ----QRVMILLSDGSNTAGVLDPLEA---ANIAKQYQTTIYTVGVGA 157 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust. Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 1/60 (1%) Query: 309 CNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 C+ AK G IVY +G +A + + LK CAS D Y + ++ DAF I + K R+ Sbjct: 497 CDAAKDEGIIVYTVGFEAPYSGRRVLKRCASSDSHYYDADGLEISDAFTSIASSIRKLRL 556 >gi|126662671|ref|ZP_01733670.1| batA protein [Flavobacteria bacterium BAL38] gi|126626050|gb|EAZ96739.1| batA protein [Flavobacteria bacterium BAL38] Length = 334 Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust. Identities = 55/207 (26%), Positives = 88/207 (42%), Gaps = 35/207 (16%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDI--GLDMMMVLDVSLSM--NDHFGPGMDK 191 PF++ ++ + SV ++SKS G+D++M +DVS SM ND ++ Sbjct: 56 PFLYVLRLLALSAIIIAMARPRSVDVTSKSKTTRGIDIVMAIDVSSSMLANDLKPNRLEA 115 Query: 192 L-GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--- 247 L VA +++ +N+ R GLV ++ + P+ I + + + Sbjct: 116 LKKVAATFVQD---------RIND--RIGLVVYAGESYTRTPVTSDKTIILQSLKSVEFD 164 Query: 248 --IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 I T GL A N+I D+K K + II LTDG N+S ID + + Sbjct: 165 DSIIADGTGIGVGLATAINRIKDSKAK-----------SRIIILLTDGVNNSGTIDPRTA 213 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQF 332 + AK G VY IG+ F Sbjct: 214 ---ASIAKEYGIKVYTIGIGTNGKAMF 237 >gi|254477542|ref|ZP_05090928.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214031785|gb|EEB72620.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 523 Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust. Identities = 33/110 (30%), Positives = 49/110 (44%), Gaps = 9/110 (8%) Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L Y Y ++F A AK Y Y + T +N+ + C+ AK RG + Sbjct: 420 LRYVYQRLF-ADWMGNSAAKNSWYYGVYDSWGTSTKNA-------RTKAICDAAKARGIV 471 Query: 319 VYAIGVQAEAAD-QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 VY IG +A + LK+CAS D Y ++ DAF I + + R+ Sbjct: 472 VYTIGFEAPSGGVSVLKDCASSDAHYFDVQGLEISDAFASIATSIRQLRL 521 >gi|325286051|ref|YP_004261841.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321505|gb|ADY28970.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 332 Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust. Identities = 53/192 (27%), Positives = 81/192 (42%), Gaps = 28/192 (14%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 T V +K+ G+D++M +DVS SM P D+L T + D IK P+ Sbjct: 78 TKDVSTRTKTTKGIDIVMAIDVSSSMLARDLKP--DRL---TALKKVAADFIKKRPN--- 129 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKE 271 R GLV ++++ P+ + + ++ G T GL A N++ D+K Sbjct: 130 -DRIGLVAYAAESYTKTPITSDKSIVLSSLRQITHGQLEDGTAIGMGLATAVNRLKDSKS 188 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 K K II LTDG N+S I+ K + E K + Y IG+ + Sbjct: 189 K-----------SKVIILLTDGVNNSGFIEPKTAADLAVEYKIK---TYTIGLGTN-GNA 233 Query: 332 FLKNCASPDRFY 343 +PDR Y Sbjct: 234 LTPIAFNPDRTY 245 >gi|253584083|ref|ZP_04861281.1| BatA protein [Fusobacterium varium ATCC 27725] gi|251834655|gb|EES63218.1| BatA protein [Fusobacterium varium ATCC 27725] Length = 319 Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust. Identities = 47/166 (28%), Positives = 73/166 (43%), Gaps = 29/166 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+ + LD+S SM + F P ++L A + E +D K D R L+ F Sbjct: 81 GIDIAISLDLSQSMLQEDFTP--NRLEKAKEVLDEFID--KRGND-----RLSLIVFGGD 131 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-----GLEYAYNKIFDAKEKLEHIAKGHD 281 PL + I+E +L T +T G+ A N++ D++ K Sbjct: 132 AYTKVPLTFDHNVIKEMTRKLTVDDITSNTRTAIGMGIGVALNRLKDSEAK--------- 182 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K II LTDGEN+S + + + AK G +Y IG+ A+ Sbjct: 183 --SKVIILLTDGENNSGEMSPSAA---ADIAKELGIKIYTIGIGAK 223 >gi|260777338|ref|ZP_05886232.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] gi|260607004|gb|EEX33278.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] Length = 271 Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust. Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 28/164 (17%) Query: 170 DMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 DMM+V+D+S SM+ G +D+L + + + I K D R GLV F+ Sbjct: 35 DMMLVVDLSYSMSKEDMQFNGDYIDRLSAVKQVLSDF--ISKRQGD-----RLGLVLFAD 87 Query: 226 KIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL + E++N+ + G+ T G+ A D+ D Sbjct: 88 HAYLQTPLTLDRHTVAEQLNQTVLRLIGTKTAIGEGIGLATKTFVDS-----------DA 136 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ +I L+DG N++ +D E+ AK+ A +Y +GV A Sbjct: 137 PQRVMILLSDGSNTAGVLDPIEA---AKIAKKYNATIYTVGVGA 177 >gi|323499301|ref|ZP_08104278.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] gi|323315689|gb|EGA68723.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] Length = 322 Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust. Identities = 46/162 (28%), Positives = 72/162 (44%), Gaps = 24/162 (14%) Query: 170 DMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKI 227 D+M+VLD+S SM+ + G D + T + + D IK D R G+V F+ Sbjct: 86 DLMLVLDLSYSMSQEDMSDGSDYVDRLTAVKKVVSDFAIKREGD-----RLGVVLFADHA 140 Query: 228 VQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL + +++N+L+ G T G+ A D+ D + Sbjct: 141 YLQTPLTLDRTTVADQVNQLVLRLIGDKTAIGEGIGLATKTFIDS-----------DAPQ 189 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + +I L+DG N+S ID E+ AK+ A +Y IGV A Sbjct: 190 RVMILLSDGSNTSGVIDPIEA---AKIAKKYDATIYTIGVGA 228 >gi|171742038|ref|ZP_02917845.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283456833|ref|YP_003361397.1| hypothetical protein BDP_2000 [Bifidobacterium dentium Bd1] gi|171277652|gb|EDT45313.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283103467|gb|ADB10573.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium dentium Bd1] Length = 967 Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust. Identities = 55/182 (30%), Positives = 79/182 (43%), Gaps = 29/182 (15%) Query: 169 LDMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIK----SIPDVNNVVRSGLVT 222 +D+ +VLDVS SMND FG K+ ++ LD +I D N+ V+ LV Sbjct: 244 IDIALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNDKVKVALVK 303 Query: 223 FSSKIVQTFPLAWG--------------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++++I T A G Q +QE G S GL+ A D Sbjct: 304 YANQI-GTATGADGCRISNSRQSDTGNCTQIVQELTTDA--GLLKTSVNGLQAAGATYAD 360 Query: 269 AKEKL--EHIAKGHDDYKKYIIFLTDGE-NSSPNIDN---KESLFYCNEAKRRGAIVYAI 322 A ++ + +A G KKY+IF TDGE N D ++ E K G VY+I Sbjct: 361 AAMEVAQQALAGGRAGAKKYVIFFTDGEPNHWSGFDGDVANAAIKKSQELKNAGTTVYSI 420 Query: 323 GV 324 G+ Sbjct: 421 GI 422 >gi|33600172|ref|NP_887732.1| putative hemolysin [Bordetella bronchiseptica RB50] gi|33567770|emb|CAE31684.1| putative hemolysin [Bordetella bronchiseptica RB50] Length = 3346 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 46/193 (23%), Positives = 87/193 (45%), Gaps = 15/193 (7%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK---LGVATRSIREMLDIIKSIPDVNNV 215 +K + + ++ +VLD+S SMND +G G +K L A +++ +L+ ++ D Sbjct: 2717 IKQNVTAGTSYNIALVLDLSDSMNDKWGSGSNKPTRLQTAKDALKALLENQLAVHD--GE 2774 Query: 216 VRSGLVTF--SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + L+TF SS ++ +++ E ++ ++ G ST A+++ E Sbjct: 2775 INVSLITFNGSSSALKKSITGLTPENVDEMVD-ILMGLKASSTTPYGAAFDRTTQWFEGQ 2833 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAEAADQ 331 + YK FLTDGE S+ N++ NE AI V+ IG+ + + Sbjct: 2834 PTVDSEGKPYKNLTFFLTDGEPSTEWSYNRD-----NEFAELAAISDVHGIGIGSGVSTS 2888 Query: 332 FLKNCASPDRFYS 344 L + +Y+ Sbjct: 2889 TLNKYDNTGGYYT 2901 >gi|254459074|ref|ZP_05072497.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084345|gb|EDZ61634.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 279 Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust. Identities = 59/227 (25%), Positives = 101/227 (44%), Gaps = 42/227 (18%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR-EMLDII--- 206 +P+++ K+S + G D+++ +D S SMN D++ R R E+ II Sbjct: 39 SPIVVD---KLSPNNRHGKDIVLAIDASGSMNSSGFDFEDEVSDGKRLSRFEITKIIASE 95 Query: 207 ---KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 K I D VV G F + P+ + +++I + G T+ G A Sbjct: 96 FIQKRISDNVGVVLYGDFAFIAS-----PITY-----EKEIVTQMLGYLTQGMAGQNTAI 145 Query: 264 NKIFDAKEKLEHIAKGHDDYK------KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 E IA G +K K I+ L+DGE++S ++ KE+ AK +G Sbjct: 146 G---------EAIAMGVRSFKHSKAKTKVIVLLSDGEHNSGSVSPKEA---TELAKEQGI 193 Query: 318 IVY--AIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +Y A+G + EA + L+ A S F+S ++++L + + I K Sbjct: 194 KIYTIAMGNKGEADEALLETIAKDSNGEFFSASSAKELKNIYDEIDK 240 >gi|88704964|ref|ZP_01102676.1| conserved hypothetical protein [Congregibacter litoralis KT71] gi|88700659|gb|EAQ97766.1| conserved hypothetical protein [Congregibacter litoralis KT71] Length = 344 Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust. Identities = 42/185 (22%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ PL + ++++ + G D+M+ +D+S SM ++ + V R R +D +K Sbjct: 66 AAARPLWVGDAIELPNS---GRDLMLAVDISGSMR------VEDMQVGNRMARR-IDAVK 115 Query: 208 SIPDVNNVVRSG----LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + RSG L+ F S+ PL++ +Q +Q + G + T Sbjct: 116 QLGSDFMSRRSGDRLGLILFGSRAYLQSPLSFDIQTVQRFLLEAQIGFAGQET----AIG 171 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + I A ++L+ + ++ LTDG++++ +D E+ N A G +Y IG Sbjct: 172 DAIGLAVKRLQE----RPASSRVLVLLTDGQDTASTVDPLEA---ANLAADLGVRIYTIG 224 Query: 324 VQAEA 328 + A++ Sbjct: 225 IGADS 229 >gi|301165481|emb|CBW25052.1| putative membrane protein (von Willebrand factor type A) [Bacteriovorax marinus SJ] Length = 329 Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust. Identities = 49/188 (26%), Positives = 80/188 (42%), Gaps = 34/188 (18%) Query: 170 DMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D+ V+DVS SM D F P ++L VA I + + + R GL+ FS + Sbjct: 86 DIFFVIDVSRSMLADDFRP--NRLEVAKDKISDFVAL-------RPTDRIGLIMFSERAF 136 Query: 229 QTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL+ ++ I++ + + + GS T L A +A+G Sbjct: 137 TLLPLSTDLKLIKQMVGEINVGGMLGSGTNIGDALGLA-------------VARGAQSLA 183 Query: 285 --KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAEAADQFLKNCASPD 340 K II LTDG + N+ + EAK++G VY IG+ + +A + KN Sbjct: 184 KNKVIILLTDGVS---NVGFLTPIQAAEEAKKQGIKVYTIGIGGRGDAKIPYGKNIFGRQ 240 Query: 341 RFYSVQNS 348 R+ ++ Sbjct: 241 RYQNIPGG 248 >gi|254514588|ref|ZP_05126649.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] gi|219676831|gb|EED33196.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] Length = 347 Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust. Identities = 43/185 (23%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ PL + ++++ + G D+M+ +D+S SM ++ + V R R +D +K Sbjct: 74 AAARPLWVGEAIELPNS---GRDLMLAVDISGSMR------VEDMQVGNRMARR-IDAVK 123 Query: 208 SIPDVNNVVRSG----LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + RSG L+ F S+ PL++ +Q +Q + G + T Sbjct: 124 QLGSDFMSRRSGDRLGLILFGSRAYLQSPLSFDIQTVQRFLLESQIGFAGQET----AIG 179 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + I A ++L+ + +I LTDG++++ +D E+ N A G +Y IG Sbjct: 180 DAIGLAVKRLQE----RPATSRVLILLTDGQDTASTVDPLEA---ANLAADLGVRIYTIG 232 Query: 324 VQAEA 328 + A++ Sbjct: 233 IGADS 237 >gi|39936212|ref|NP_948488.1| hypothetical protein RPA3149 [Rhodopseudomonas palustris CGA009] gi|39650067|emb|CAE28590.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 455 Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust. Identities = 36/147 (24%), Positives = 62/147 (42%), Gaps = 21/147 (14%) Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFLTDGENS-- 296 I+ KI+ L T G+ +A+ + + L AK + Y II L+DG N+ Sbjct: 309 IKNKIDALSPNGGTNQAIGMHWAWMSLRTG-DPLNTPAKDSNYKYTDAIILLSDGLNTVD 367 Query: 297 ---------SPNIDNKESLFYCNEAKRRGA-----IVYAIGVQAEAADQ--FLKNCASPD 340 SP +D ++ + C+ + ++Y I V + + LK CA Sbjct: 368 RWYGNGRDWSPQVDARQRIL-CDNIRASATNTNPVVIYTIQVNTDGDPESAVLKYCADSG 426 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F++ S + AF +IG + K R+ Sbjct: 427 NFFATTTSSGIGTAFAQIGSSLSKLRV 453 >gi|332982109|ref|YP_004463550.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] gi|332699787|gb|AEE96728.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] Length = 948 Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust. Identities = 52/220 (23%), Positives = 92/220 (41%), Gaps = 29/220 (13%) Query: 154 LITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 ++ + +S K+DI L +++V+D S SM D G+ KL +A + + ++ V Sbjct: 389 MLPVDMDLSKKADIPSLGLVLVIDKSGSMTDG-QYGITKLEMAKEAAIRSTEALRPTDSV 447 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + ++ + Q LA IQ+ I + G T P L+ AY + +A K Sbjct: 448 GVICFDDAASWVVGMRQADDLA----EIQDSIGTIRPGGGTNMYPALDLAYKALEEADTK 503 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 L+H II LTDG++++ + D + G + ++ V +A Sbjct: 504 LKH-----------IIVLTDGQSATGDFDG-----IAHRMAEDGITLSSVAVGMDADKNL 547 Query: 333 LKNCASPDRFYSVQNSRKLH-DAFLRIGKEMVKQRILYNK 371 L R + N R + D F I K + K+ L + Sbjct: 548 LS------RLAEIGNGRYYYTDEFSNIPKILTKETYLATQ 581 >gi|83951473|ref|ZP_00960205.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] gi|83836479|gb|EAP75776.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] Length = 550 Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust. Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 3/62 (4%) Query: 309 CNEAKRRGAIVYAIGV---QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 C K G +V++IG Q A+Q LKNCAS + Y ++DAF I +V Sbjct: 487 CTATKNEGVVVFSIGFEIDQGGTAEQVLKNCASSENHYFRAEGININDAFSAIASNVVNL 546 Query: 366 RI 367 R+ Sbjct: 547 RL 548 >gi|192291928|ref|YP_001992533.1| hypothetical protein Rpal_3558 [Rhodopseudomonas palustris TIE-1] gi|192285677|gb|ACF02058.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 455 Score = 42.0 bits (97), Expect = 0.17, Method: Compositional matrix adjust. Identities = 36/147 (24%), Positives = 62/147 (42%), Gaps = 21/147 (14%) Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFLTDGENS-- 296 I+ KI+ L T G+ +A+ + + L AK + Y II L+DG N+ Sbjct: 309 IKNKIDALSPNGGTNQAIGMHWAWMSLRTG-DPLNTPAKDSNYKYTDAIILLSDGLNTVD 367 Query: 297 ---------SPNIDNKESLFYCNEAKRRGA-----IVYAIGVQAEAADQ--FLKNCASPD 340 SP +D ++ + C+ + ++Y I V + + LK CA Sbjct: 368 RWYGNGRDWSPQVDARQRIL-CDNIRASATNTNPVVIYTIQVNTDGDPESTVLKYCADSG 426 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F++ S + AF +IG + K R+ Sbjct: 427 NFFATTTSSGIGTAFAQIGSSLSKLRV 453 >gi|104782921|ref|YP_609419.1| RTX toxin [Pseudomonas entomophila L48] gi|95111908|emb|CAK16632.1| putative RTX toxin [Pseudomonas entomophila L48] Length = 2350 Score = 42.0 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 27/98 (27%), Positives = 52/98 (53%), Gaps = 5/98 (5%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S+V+ + ++ ++++V+DVS SMN G PG+ +L +A ++I +LD + D Sbjct: 1551 SAVRSITPGEVDSNILLVVDVSSSMNSGSGVPGLTRLELAKQAINTLLDKYDDMGD---- 1606 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 ++ +VTFS+ P+ + + I L G +T Sbjct: 1607 IKVQIVTFSTGATMQTPVWVSISEAKSLIAGLTAGGST 1644 >gi|119470787|ref|ZP_01613398.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] gi|119446014|gb|EAW27293.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] Length = 328 Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust. Identities = 40/169 (23%), Positives = 75/169 (44%), Gaps = 22/169 (13%) Query: 168 GLDMMMVLDVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G D+M+ +D+S SM + + G +D+L + + + ++ + D R GL+ F Sbjct: 86 GRDIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLSDFIE--QRTGD-----RLGLILF 138 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL ++ + + +N G ++T I DA DD Sbjct: 139 GDTAFLQTPLTRDLKTVTKMLNEAQIGLVGRAT--------AIGDALGLSVKRFASKDDS 190 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ LTDG+N++ N++ ++L A+ G VY IGV ++ F Sbjct: 191 NRIVVLLTDGQNTAGNLNPDDALLL---AREEGIKVYTIGVGSDNPRGF 236 >gi|295132198|ref|YP_003582874.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980213|gb|ADF50678.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 334 Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust. Identities = 62/233 (26%), Positives = 92/233 (39%), Gaps = 53/233 (22%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 T V + S G+D++M +DVS SM P + AT+++ E + IK P Sbjct: 79 TVDVSTRTNSTQGIDIVMAIDVSASMLARDLQPNRLE---ATKAVGE--EFIKGRPS--- 130 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAK 270 R GLV +S + P+ + + + I S T GL + N++ D+K Sbjct: 131 -DRIGLVLYSGESFTKTPITSDKSVVLRALEDVEFNNILESGTAIGSGLATSVNRLKDSK 189 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------ 324 + K II LTDG N+S ID K + AK G VY IGV Sbjct: 190 AE-----------SKVIILLTDGVNNSGFIDPKVA---SELAKEFGIKVYTIGVGTNGMA 235 Query: 325 -----------------QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 Q E + LK A + +++ N+ KL D + I Sbjct: 236 LTPVGIAANGRFQFGNRQVEIDEDLLKQIADETGGKYFRATNNEKLEDIYDEI 288 >gi|317483048|ref|ZP_07942050.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915549|gb|EFV36969.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 813 Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust. Identities = 46/183 (25%), Positives = 77/183 (42%), Gaps = 28/183 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS----IPDVNNVVRSGLVTFS 224 LD+++VLDVS SM D+ G K+ ++ +D I D + R LV F+ Sbjct: 123 LDIVLVLDVSGSMADNLSGGPKKIDALKTAVNGFIDATADENAKITDQSQRNRIALVKFA 182 Query: 225 SKIVQT-----FPLAWGVQHIQEKINRLIF----------GSTTKSTPGLEYAYNKIFDA 269 + + W + + ++ L + G + +YA+N+ A Sbjct: 183 GTEKTSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFNR---A 239 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKR---RGAIVYAIGVQ 325 + L + + + KK +IF TDGE N D + N+AK G +Y+IGV Sbjct: 240 QAALTYQPRAN--AKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVV 297 Query: 326 AEA 328 + A Sbjct: 298 SGA 300 >gi|198436415|ref|XP_002121394.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 904 Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust. Identities = 44/185 (23%), Positives = 80/185 (43%), Gaps = 28/185 (15%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 N +HAP S K +D++MVLD S S+ + PG K+ ++ L Sbjct: 695 NWTHAPPCCARQCPASPK----IDIVMVLDSSSSVTE---PGWRKM---INFVKTALGFY 744 Query: 207 KSIPDVNNVV---RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYA 262 + P+ +V + + ++KI + +G + + +I RL + G T++ L YA Sbjct: 745 EMGPNSTSVSVFRYNAEIDEANKISFQYTQTYGKEQLLRRIGRLPYNGQGTRTGQALSYA 804 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + + + + D ++ LTDG++ E+L +R G + YAI Sbjct: 805 LHILTNEINR--------PDAVDVVLVLTDGKSQDAVKAPAEAL------RRNGVLTYAI 850 Query: 323 GVQAE 327 +Q E Sbjct: 851 AIQPE 855 >gi|33595651|ref|NP_883294.1| putative hemolysin [Bordetella parapertussis 12822] gi|33565730|emb|CAE36274.1| putative hemolysin [Bordetella parapertussis] Length = 2215 Score = 41.6 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 46/193 (23%), Positives = 88/193 (45%), Gaps = 15/193 (7%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK---LGVATRSIREMLDIIKSIPDVNNV 215 +K + + ++ +VLD+S SMND +G G +K L A +++ +L+ ++ D Sbjct: 1622 IKQNVTAGTSYNIALVLDLSDSMNDKWGSGSNKPTRLQTAKDALKALLENQLAVHD--GE 1679 Query: 216 VRSGLVTF--SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + L+TF SS ++ +++ E ++ L+ + +TP A+++ E Sbjct: 1680 INVSLITFNGSSSALKKSITGLTPENVDEMVDILMGLKASSATP-YGAAFDRTTQWFEGQ 1738 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAEAADQ 331 + YK FLTDGE S+ N++ NE AI V+ IG+ + + Sbjct: 1739 PTVDSEGKPYKNLTFFLTDGEPSTEWSYNRD-----NEFAELAAISDVHGIGIGSGVSTS 1793 Query: 332 FLKNCASPDRFYS 344 L + +Y+ Sbjct: 1794 TLNKYDNTGGYYT 1806 >gi|120407060|ref|NP_766396.2| anthrax toxin receptor-like precursor [Mus musculus] Length = 641 Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust. Identities = 47/178 (26%), Positives = 77/178 (43%), Gaps = 20/178 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D+ +VLD S S+ D++ + S E L +K + N +R ++T+S++ Sbjct: 75 FDLYLVLDKSGSVADNW--------IHIYSFAEGL--VKKFTNPN--LRISIITYSTEAE 122 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL + I + + L+ + GL + + A E++ G II Sbjct: 123 VILPLTSDSKEINKSL--LVLKNIVPQ--GLTHMQKGLRKANEQIRKSTLGGRIVNSVII 178 Query: 289 FLTDGENS-SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 LTDG P +D E +A+R GAIVY +GV + Q + PDR + V Sbjct: 179 ALTDGLLLLKPYLDTMEEA---KKARRMGAIVYTVGVFMYSKQQLVNIAGDPDRCFGV 233 >gi|254492197|ref|ZP_05105371.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxidans DMS010] gi|224462522|gb|EEF78797.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxydans DMS010] Length = 2740 Score = 41.6 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 40/142 (28%), Positives = 65/142 (45%), Gaps = 13/142 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ ++MM+LDVS SMND GM +L V +S E+LD + D V ++TF+ Sbjct: 2077 EVNTNLMMILDVSGSMNDSANFQGMTRLQVMIKSSLELLDQYDAYGD----VMVNIITFA 2132 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + P V Q K +I G T + A N +A + G + Sbjct: 2133 TS--ASNPSGGWVTVDQAKA--IILGLTAGGNTNYDDALNDAINAFALGGKLGDGQN--- 2185 Query: 285 KYIIFLTDGENSSPNIDNKESL 306 F++DGE +S N+ N ++ Sbjct: 2186 -ISYFMSDGEPNSNNVSNSATV 2206 >gi|156308416|ref|XP_001617662.1| hypothetical protein NEMVEDRAFT_v1g225902 [Nematostella vectensis] gi|156195093|gb|EDO25562.1| predicted protein [Nematostella vectensis] Length = 273 Score = 41.6 bits (96), Expect = 0.22, Method: Compositional matrix adjust. Identities = 52/177 (29%), Positives = 81/177 (45%), Gaps = 35/177 (19%) Query: 158 SVKISSKSDI--GLDMMMVLDVSLSM-NDHFGPG-MDKLG-VATRSIREMLDIIKSIPDV 212 SV +++KS G+D++M +DVS SM F P +D L VA+ I + + Sbjct: 16 SVDVTAKSRTTKGIDIVMAIDVSGSMLAKDFKPNRLDALKRVASTFIEDR---------I 66 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIF 267 N+ R GLV ++ + P+ I + + + I T GL A N+I Sbjct: 67 ND--RIGLVVYAGESYTRTPITSDKTVILQSLKTVEYDDSIIADGTGIGVGLATAINRIK 124 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 D+K K + II LTDG N++ ID + + + AK+ G VY IG+ Sbjct: 125 DSKAK-----------SRVIILLTDGVNNAGTIDPRMA---ADIAKQYGIKVYTIGI 167 >gi|313674519|ref|YP_004052515.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941217|gb|ADR20407.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 345 Score = 41.2 bits (95), Expect = 0.23, Method: Compositional matrix adjust. Identities = 48/186 (25%), Positives = 79/186 (42%), Gaps = 30/186 (16%) Query: 168 GLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+VLD+S SM F P ++L A + + +D D R GL FS + Sbjct: 104 GIDIMLVLDISESMKIQDFTP--NRLEAAKQVANDFID--GRFQD-----RIGLTIFSGE 154 Query: 227 IVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL + ++ +I + F S T L N++ + D Sbjct: 155 AYSLSPLTTDYKMLKNQITDIDFKMMEASGTAIGSALAVGTNRM-----------RESDS 203 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 K +I L+DG+N++ NID + S A G +Y I + E + K+ R+ Sbjct: 204 KSKVLILLSDGDNNAGNIDPETS---AKLANAYGIKIYTIAIGKEGKVPYGKDFFGRTRY 260 Query: 343 YSVQNS 348 ++NS Sbjct: 261 --IENS 264 >gi|261250853|ref|ZP_05943427.1| protein BatA [Vibrio orientalis CIP 102891] gi|260937726|gb|EEX93714.1| protein BatA [Vibrio orientalis CIP 102891] Length = 322 Score = 41.2 bits (95), Expect = 0.25, Method: Compositional matrix adjust. Identities = 43/170 (25%), Positives = 69/170 (40%), Gaps = 40/170 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV----------RSG 219 D+M+VLD+S SM+ ++E I + V NVV R G Sbjct: 86 DLMLVLDLSYSMSQE-------------DMQEGDQYIDRLSAVKNVVSDFVKQREGDRLG 132 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 LV F+ PL + I +++N L+ G T G+ A D++ Sbjct: 133 LVLFADHAYLQTPLTLDRETISDQVNSLVLRLIGDKTAIGEGIGLATKTFVDSEAP---- 188 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ ++ L+DG N+S + E L AK+ A +Y IG+ A Sbjct: 189 -------QRVMVLLSDGSNTSGVL---EPLEAARIAKKYNATIYTIGIGA 228 >gi|223936327|ref|ZP_03628239.1| von Willebrand factor type A [bacterium Ellin514] gi|223894845|gb|EEF61294.1| von Willebrand factor type A [bacterium Ellin514] Length = 338 Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust. Identities = 48/180 (26%), Positives = 78/180 (43%), Gaps = 28/180 (15%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSM---NDHFGPGMDKLGVATRSIREMLDIIKS 208 P + S K+S+ G+D+++ LD+S SM ++ F + ATR I D++K Sbjct: 74 PRFVQSETKVSAS---GVDIVVALDMSGSMLAEDEGFVLNGQQ---ATRFIIAR-DVLKK 126 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYAYN 264 D R GLV F ++ P + + + + RL GS T L + N Sbjct: 127 FVDKRQSDRIGLVVFGTQAYVAVPPTLDHEFLLKNLERLGIGSINGNQTAIGSALSTSMN 186 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 ++ + K K K II +TDG+N++ + L A+ G +Y IGV Sbjct: 187 RLRELKSK-----------SKIIILMTDGQNNAGKV---PPLTAAEAARALGIKIYTIGV 232 >gi|312622403|ref|YP_004024016.1| von willebrand factor type a [Caldicellulosiruptor kronotskyensis 2002] gi|312202870|gb|ADQ46197.1| von Willebrand factor type A [Caldicellulosiruptor kronotskyensis 2002] Length = 1166 Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust. Identities = 52/201 (25%), Positives = 85/201 (42%), Gaps = 30/201 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ VLD S SM+ + G K +A +S + L I+ R+ +V F Sbjct: 498 IDLVFVLDSSGSMSWNDPNGYRK--IAAKSFVDAL--IQG-------DRAAVVDFDDFGY 546 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL Q ++ I+R+ T G+ A ++ I+ +D K II Sbjct: 547 LLQPLTTDFQAVKNAIDRIDSWGGTNIAEGIRIANQQL---------ISLSSEDRIKVII 597 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQ 346 LTDGE N EAK G +Y IG+ + L++ A+ ++ V Sbjct: 598 LLTDGEGYYDNN-------LTTEAKNNGITIYTIGLGTSVDENLLRDIATQTGGMYFPVS 650 Query: 347 NSRKLHDAFLRIGKEMVKQRI 367 ++ +L F RI E+V + I Sbjct: 651 SASQLPQVFKRI-TEIVTEPI 670 >gi|110598614|ref|ZP_01386881.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339783|gb|EAT58291.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 336 Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust. Identities = 42/171 (24%), Positives = 68/171 (39%), Gaps = 36/171 (21%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + C F P ++ + ++ G+D+M+ LD+S SM G G +L A Sbjct: 75 LVLCVFALAG-----PRMVVRQTEAEAR---GIDVMLALDISESMLQKDGSGKSRLDAAR 126 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-------LAWGVQHIQEKINRLIF 249 R+ +++ D R GLV F K P LA + HI ++ I Sbjct: 127 EVARKF--VLRRSSD-----RIGLVVFRGKGYTQCPLTIDHDVLAMLIDHISPQV---IQ 176 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 T + A N+ KG +K II +TDGEN++ ++ Sbjct: 177 DEGTAIGSAILIATNRF-----------KGSTSLQKVIILITDGENNTGDV 216 >gi|326789709|ref|YP_004307530.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540473|gb|ADZ82332.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 593 Score = 41.2 bits (95), Expect = 0.27, Method: Compositional matrix adjust. Identities = 48/185 (25%), Positives = 89/185 (48%), Gaps = 32/185 (17%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 LL+ S+ +++ SD LD ++V+D S SM + +KLG+ ++ +D++ + Sbjct: 17 LLLFPSMLMAATSDAQLDAILVIDASGSMKET---DPNKLGL--EGVKLFVDMLGLTDN- 70 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGST-----TKSTPGLEYAYNKI 266 + G+VT+ S + QT+P++ Q +E I + G T T T GL+ A K+ Sbjct: 71 ----QVGVVTYGSDVSQTYPMSLVKNQSDKENIKNFVDGITRDLEYTDITSGLKEAV-KM 125 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENS-------SPNIDNKESLFYCNEAKRRGAIV 319 + + A G+ I+ TDG N+ +P +K+ ++A+ G + Sbjct: 126 LNQRN-----ASGN---SPLIVVFTDGNNAIGGVANRTPADIDKDLAAIISQAQSEGYPI 177 Query: 320 YAIGV 324 Y IG+ Sbjct: 178 YTIGL 182 >gi|242247116|ref|NP_081039.2| collagen alpha-4(VI) chain precursor [Mus musculus] gi|189082905|sp|A2AX52|CO6A4_MOUSE RecName: Full=Collagen alpha-4(VI) chain; Flags: Precursor Length = 2309 Score = 40.8 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 43/168 (25%), Positives = 76/168 (45%), Gaps = 29/168 (17%) Query: 201 EMLDIIKSIPDVNNV----VRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGS 251 EM D +K + + ++ VR G+V +S KI+ F L G+ + I ++ G+ Sbjct: 867 EMKDFMKEVIKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGT 926 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT A +K+ + I D +Y+I +TDG+++ P + + L Sbjct: 927 TTGK------ALSKMVPVFQNTARI-----DVARYLIVITDGQSTDPVAEAAQGL----- 970 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRF--YSVQNSRKLHDAFLR 357 + G +YAIGV+ +A L+ AS F Y + + +H +R Sbjct: 971 -RDIGVNIYAIGVR-DANTTELEEIASKKMFFIYEFDSLKSIHQEVIR 1016 >gi|123718338|emb|CAJ77152.1| collagen type VI alpha 4 [Mus musculus] Length = 1451 Score = 40.8 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 43/168 (25%), Positives = 76/168 (45%), Gaps = 29/168 (17%) Query: 201 EMLDIIKSIPDVNNV----VRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGS 251 EM D +K + + ++ VR G+V +S KI+ F L G+ + I ++ G+ Sbjct: 9 EMKDFMKEVIKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGT 68 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT A +K+ + I D +Y+I +TDG+++ P + + L Sbjct: 69 TTGK------ALSKMVPVFQNTARI-----DVARYLIVITDGQSTDPVAEAAQGL----- 112 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRF--YSVQNSRKLHDAFLR 357 + G +YAIGV+ +A L+ AS F Y + + +H +R Sbjct: 113 -RDIGVNIYAIGVR-DANTTELEEIASKKMFFIYEFDSLKSIHQEVIR 158 >gi|316933619|ref|YP_004108601.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] gi|315601333|gb|ADU43868.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] Length = 483 Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust. Identities = 35/146 (23%), Positives = 59/146 (40%), Gaps = 19/146 (13%) Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFLTDGENS-- 296 ++ +IN L T G+ +A+ A L AK + Y I+ L+DG N+ Sbjct: 337 LKGRINTLDAQGGTNQGIGMFWAW-MTLQATAPLYTPAKDSEYKYTDAIVLLSDGMNTKN 395 Query: 297 ---------SPNIDNKESLFYCNEAKRRGAI----VYAIGV--QAEAADQFLKNCASPDR 341 SP +D+++ + N + + +Y I V + LK C S Sbjct: 396 RWYGNGSNWSPQVDDRQKILCDNITTKVNGVPETTIYTIQVNTSGDPESSVLKYCGSTGG 455 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRI 367 F+S + + AF +G + K RI Sbjct: 456 FFSTTTASGIQSAFQEVGASLTKLRI 481 >gi|77359908|ref|YP_339483.1| von Willebrand factor type A [Pseudoalteromonas haloplanktis TAC125] gi|76874819|emb|CAI86040.1| conserved protein of unknown function; putative Von Willebrand factor type A domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 328 Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust. Identities = 44/211 (20%), Positives = 90/211 (42%), Gaps = 25/211 (11%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-- 183 ++ A +R PF + + ++ P + + + ++ G D+M+ +D+S SM + Sbjct: 47 SVEAHARRLTPFEWVIWLLLVIAAANPTWLDDPISMPNE---GRDIMLAVDLSGSMTEQD 103 Query: 184 --HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 + G +D+L + + + ++ + R GL+ F PL V+ + Sbjct: 104 MAYNGQYVDRLTMVKAVLTDFIEQRQGD-------RLGLILFGDTAFLQTPLTRDVKTVS 156 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + ++ G ++T I DA D + ++ LTDG+N++ N+ Sbjct: 157 KMLSEAQIGLVGRAT--------AIGDALGLSVKRFANKDKSNRIVVLLTDGQNTAGNLK 208 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +E+L +A G VY IGV ++ F Sbjct: 209 PEEALLLARDA---GIKVYTIGVGSDNPRGF 236 >gi|331006778|ref|ZP_08330044.1| BatA [gamma proteobacterium IMCC1989] gi|330419396|gb|EGG93796.1| BatA [gamma proteobacterium IMCC1989] Length = 364 Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust. Identities = 45/209 (21%), Positives = 82/209 (39%), Gaps = 26/209 (12%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS------SKSDIGLDMMMVLDVSLSM 181 SA + ++F WC LL+ ++ K + G D+++ +D+S SM Sbjct: 54 SAAHHPLLRWLFLIVIWC-------LLVLAAAKPQWLGEPQALPTSGRDLLLAVDISGSM 106 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDV---NNVVRSGLVTFSSKIVQTFPLAWGVQ 238 + + + R + + K + D R GL+ F ++ PL + Q Sbjct: 107 QQ------EDMQINNRPATRLAAVKKVVSDFIDQRQGDRIGLILFGTQAYLQTPLTFDTQ 160 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + + + G K T + A + + K II LTDGEN++ Sbjct: 161 SVNQFLQEAQLGFAGKDT-AIGDAIGLSVKRLKNQSSASSAKPSNSKVIILLTDGENTAG 219 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + E L A++ GA +Y +G+ A+ Sbjct: 220 EV---EPLQAAKLAEKIGAKIYTVGIGAD 245 >gi|34558787|gb|AAQ75132.1| BatA protein [Alvinella pompejana epibiont 6C6] Length = 300 Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust. Identities = 51/201 (25%), Positives = 88/201 (43%), Gaps = 31/201 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRS--GLVTF 223 G D+++ +DVS SM G K S + ++ K I + N G+V F Sbjct: 80 GRDLILTIDVSGSMAQK---GFSK----EESEKSRYEVAKEIAKRFIKNRFSDNIGIVIF 132 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S PL + ++ + E + + I G+ T + IF+A + LE Sbjct: 133 GSFSFSASPLTYDLKALLEMFDLMSDVGIAGNNTAIG-------DAIFEAIKNLE----S 181 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + K II LTDG+++ KE + EAK+RG +Y +G+ + + L+ A Sbjct: 182 GEAKSKVIILLTDGKHNFGKKSPKEGVV---EAKKRGIKIYTVGIGTDYDKKLLEKMAKE 238 Query: 340 D--RFYSVQNSRKLHDAFLRI 358 + + +NS++L + F I Sbjct: 239 TNAKSFFAKNSKELEEVFKEI 259 >gi|224370037|ref|YP_002604201.1| hypothetical protein HRM2_29500 [Desulfobacterium autotrophicum HRM2] gi|223692754|gb|ACN16037.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 332 Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust. Identities = 45/173 (26%), Positives = 74/173 (42%), Gaps = 24/173 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMD-KL-GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G+++++ LD+S SM +D KL G + + +++K + R G+V F S Sbjct: 86 GINIILALDLSKSM-----AALDFKLDGAIVNRLDAVKNVVKDFIMKRSGDRIGMVVFGS 140 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I ++RL G+ ST + + ++ D K K Sbjct: 141 EAFTQMPLTRDYDTIAFVLSRLKIGAAGPSTAIGDAMGISLKRLEDVKSK---------- 190 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +I LTDG+++S I + + A+ RG VY IGV FL N Sbjct: 191 -SNIVILLTDGKSNSGEITPGAA---ADIARERGVKVYTIGVGQRGKAPFLVN 239 >gi|313219850|emb|CBY30766.1| unnamed protein product [Oikopleura dioica] Length = 1473 Score = 40.8 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 46/165 (27%), Positives = 78/165 (47%), Gaps = 28/165 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVA-TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD+ +V+D S S+ P D++ + T ++ M D I N V+ GL +FS Sbjct: 1241 LDIQIVIDTSGSLTS--APNKDQVLMNFTNNLANMYDTI-------NQVKIGLTSFSESS 1291 Query: 228 VQTFPLAWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V PL + Q +Q+ ++ + + GS T T G+E A N + D + ++ + Sbjct: 1292 VLEMPLDFYNQLELQDGVSNMTWQGSFTNITSGVETALNDM-DTSDAVDDV--------- 1341 Query: 286 YIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAEAA 329 +I +TDG S+ N +F ++AK G + A+G E A Sbjct: 1342 -MILITDGFQST----NTTLMFQMIDQAKAEGVRLIALGFFGEFA 1381 >gi|225377140|ref|ZP_03754361.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] gi|225211045|gb|EEG93399.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] Length = 1406 Score = 40.8 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 34/129 (26%), Positives = 54/129 (41%), Gaps = 13/129 (10%) Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + +N L T GLE+AY+++ K D KKY+I +DGE S N D Sbjct: 881 KSVNALFADGGTSPQKGLEHAYSEL----------QKAEDGNKKYVILFSDGEPSDSN-D 929 Query: 302 NKESLFYCNEAKRRG--AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 E+ + K G I +G+ E A + AS ++ + +L+ F I Sbjct: 930 KMETEASAVKLKEAGYTVITVGLGLNNETATWLGEKVASAGCAFTADTAEELNKIFQNIQ 989 Query: 360 KEMVKQRIL 368 + + R L Sbjct: 990 STITQSRSL 998 >gi|149187170|ref|ZP_01865468.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] gi|148838706|gb|EDL55645.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] Length = 324 Score = 40.4 bits (93), Expect = 0.42, Method: Compositional matrix adjust. Identities = 40/167 (23%), Positives = 73/167 (43%), Gaps = 34/167 (20%) Query: 170 DMMMVLDVSLSMN-------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 DMM+V+D+S SM+ D + +D+L + + +D K R GLV Sbjct: 86 DMMLVIDLSYSMSQQDMAYQDDY---IDRLTAVKHVVSDFVDRRKGD-------RVGLVY 135 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F+ PL + + ++ ++N+ + G+ T G+ A D+ Sbjct: 136 FADHAYLQTPLTFDRETVKTQLNQTVLKLIGTQTAIGDGIGLATKTFVDSNAP------- 188 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ +I L+DG N++ +D ++ AK+ G +Y IGV A Sbjct: 189 ----QRVMILLSDGSNNAGVLDPVQA---AEIAKKYGTTIYTIGVGA 228 >gi|14548116|sp|O89029|MATN4_MOUSE RecName: Full=Matrilin-4; Short=MAT-4; Flags: Precursor gi|3766288|emb|CAA06889.1| matrilin-4 precursor [Mus musculus] gi|22477196|gb|AAH36558.1| Matrilin 4 [Mus musculus] gi|148674433|gb|EDL06380.1| matrilin 4, isoform CRA_a [Mus musculus] gi|148674434|gb|EDL06381.1| matrilin 4, isoform CRA_a [Mus musculus] Length = 624 Score = 40.4 bits (93), Expect = 0.42, Method: Compositional matrix adjust. Identities = 37/141 (26%), Positives = 65/141 (46%), Gaps = 13/141 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGL- 259 ++ +++S+ N R G++ +SS++ FPL A+ + E+ R + + GL Sbjct: 59 LVGLLRSLDVGLNATRVGVIQYSSQVQSVFPLGAFSRREDMERAIRAVVPLAQGTMTGLA 118 Query: 260 -EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 +YA N F + E + + ++ +TDG P E +A+ RG Sbjct: 119 IQYAMNVAFS---EAEGARPSEERVPRVLVIVTDGR---PQDRVAE---VAAQARARGIE 169 Query: 319 VYAIGVQAEAADQFLKNCASP 339 +YA+GVQ A L+ ASP Sbjct: 170 IYAVGVQ-RADVGSLRTMASP 189 >gi|294141682|ref|YP_003557660.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578720|dbj|BAG66046.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328151|dbj|BAJ02882.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 334 Score = 40.4 bits (93), Expect = 0.43, Method: Compositional matrix adjust. Identities = 46/182 (25%), Positives = 79/182 (43%), Gaps = 21/182 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 PL + ++++ SK G D+M+ +D+S SM M G A + D++ + Sbjct: 69 PLWMGDAIELPSK---GRDLMIAVDLSGSMQIE---DMVLNGQAVDRFTMIQDVVSDFIE 122 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAYN-KIFDA 269 + GL+ F+ PL + + + + G K T G A K FD Sbjct: 123 RRKGDKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALGVKRFDM 182 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +K I ++ LTDG N+S +I +++ A +RG +YAIGV A+ Sbjct: 183 VDKSNRI----------LVLLTDGSNNSGSISPEQA---AAIAAKRGVKIYAIGVGADVM 229 Query: 330 DQ 331 ++ Sbjct: 230 ER 231 >gi|189912860|ref|YP_001964749.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913185|ref|YP_001964414.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777536|gb|ABZ95836.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781253|gb|ABZ99550.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 317 Score = 40.4 bits (93), Expect = 0.45, Method: Compositional matrix adjust. Identities = 48/173 (27%), Positives = 81/173 (46%), Gaps = 20/173 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDH--FGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 S K+S S G+D+M+ LD+S SM + F P ++L V+ +RE + K + D Sbjct: 79 SKYKLSPDSTKGVDIMIALDISGSMVNSYDFLP-RNRLSVSKDLLREFVK--KRLYD--- 132 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R G+V F+ PL+ + ++ LI G++++ +E + DA Sbjct: 133 --RIGIVVFAGAAYLQSPLSSD----RFALDELIAGTSSED---IEEQGTAVGDALVLSS 183 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + K + K II LTDG +++ +D + + K G VY IG+ E Sbjct: 184 YRLKNSEAKSKVIILLTDGVSNTGKLDPDTAAY---TTKTMGIKVYCIGIGKE 233 >gi|224967060|ref|NP_038620.2| matrilin-4 precursor [Mus musculus] Length = 624 Score = 40.4 bits (93), Expect = 0.45, Method: Compositional matrix adjust. Identities = 37/141 (26%), Positives = 65/141 (46%), Gaps = 13/141 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGL- 259 ++ +++S+ N R G++ +SS++ FPL A+ + E+ R + + GL Sbjct: 59 LVGLLRSLDVGLNATRVGVIQYSSQVQSVFPLGAFSRREDMERAIRAVVPLAQGTMTGLA 118 Query: 260 -EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 +YA N F + E + + ++ +TDG P E +A+ RG Sbjct: 119 IQYAMNVAFS---EAEGARPSEERVPRVLVIVTDGR---PQDRVAE---VAAQARARGIE 169 Query: 319 VYAIGVQAEAADQFLKNCASP 339 +YA+GVQ A L+ ASP Sbjct: 170 IYAVGVQ-RADVGSLRTMASP 189 >gi|54303502|ref|YP_133495.1| hypothetical protein PBPRB1845 [Photobacterium profundum SS9] gi|46916932|emb|CAG23695.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 321 Score = 40.4 bits (93), Expect = 0.46, Method: Compositional matrix adjust. Identities = 40/166 (24%), Positives = 73/166 (43%), Gaps = 29/166 (17%) Query: 170 DMMMVLDVS--LSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 DM++V+D+S +S+ D G +D+L + E ++ K R GLV F+ Sbjct: 84 DMLLVVDLSGSMSIEDMIIKNGESIDRLAAVKDVLAEFIEQRKG-------DRLGLVLFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHD 281 PL + +++++ R + G +ST GL A +++ Sbjct: 137 QHAYLQTPLTFDRNTVKQQLERTVLGLIGQSTAIGEGLGIATKTFINSEAP--------- 187 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 ++ II L+DG N++ I+ E+ E+ +Y +GV AE Sbjct: 188 --QRVIILLSDGANTAGVIEPLEAAKLAAESN---VTIYTVGVGAE 228 >gi|313159758|gb|EFR59115.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 330 Score = 40.4 bits (93), Expect = 0.47, Method: Compositional matrix adjust. Identities = 48/173 (27%), Positives = 73/173 (42%), Gaps = 27/173 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 ++S + G+D+M+ +DVS SM F P D++ A ++ S R Sbjct: 79 RLSRTNTEGIDIMLAIDVSGSMLARDFRP--DRITAAK-------EVAGSFIADRYGDRI 129 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEH 275 GLV F+ + PL +Q + R+ G T GL A N++ +++ K Sbjct: 130 GLVAFAGEAFTQSPLTTDQGTLQTLLARIRSGLIEDGTAIGNGLATAINRLRESEAK--- 186 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 K II LTDG N+ I + + AK +G VY IGV E Sbjct: 187 --------SKVIILLTDGVNNRGEIAPQTA---AEIAKAQGIRVYTIGVGTEG 228 >gi|167752252|ref|ZP_02424379.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] gi|167660493|gb|EDS04623.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] Length = 328 Score = 40.4 bits (93), Expect = 0.47, Method: Compositional matrix adjust. Identities = 54/193 (27%), Positives = 79/193 (40%), Gaps = 33/193 (17%) Query: 142 FPWCANSSHAPLLITS-----SVKISSKSDI-GLDMMMVLDVSLSM-NDHFGPGMDKLGV 194 P+ + LLI + SV S S+ G+D+++ +D+S SM P D++ Sbjct: 55 LPFALRCAAVALLIVALARPQSVDEGSTSNTEGIDIVLAIDISTSMLAQDLQP--DRIQA 112 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + + I P R GLV F+ + PL +Q + RL G Sbjct: 113 AKQVAG---NFITDRPGD----RIGLVAFAGEAFTQSPLTTDQGTLQTLLGRLRSGVVED 165 Query: 255 STP---GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GL A N++ ++ K K II LTDGEN+ I L Sbjct: 166 GTAIGNGLATAINRLRESNAK-----------SKVIILLTDGENNRGEI---APLTAAEI 211 Query: 312 AKRRGAIVYAIGV 324 A+ +G VY IGV Sbjct: 212 ARDQGIRVYTIGV 224 >gi|313675311|ref|YP_004053307.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312942009|gb|ADR21199.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 322 Score = 40.4 bits (93), Expect = 0.48, Method: Compositional matrix adjust. Identities = 46/167 (27%), Positives = 75/167 (44%), Gaps = 19/167 (11%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +G D+M+ +D+S SM D VA + ++ +K+I D N R GL+ FSS+ Sbjct: 76 VGKDIMISVDLSASM--------DANDVAPSRLEKIKYELKNIVDAFNSDRIGLIIFSSE 127 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-AKEKLEHIAKGHDDYK- 284 PL + Q +N I T PG + + A EKL A K Sbjct: 128 AFVQCPLTYD----QNALNLFIETLNTGLVPGSSTDFGSALNMAHEKLTSEAAPSSQQKS 183 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 K II ++DGE+ D + ++ N++ R ++++GV E + Sbjct: 184 KIIILISDGEDFGD--DTEGAVSKINDSGIR---LFSLGVGTEQGSK 225 >gi|126303381|ref|XP_001379571.1| PREDICTED: similar to matrilin-4 [Monodelphis domestica] Length = 623 Score = 40.4 bits (93), Expect = 0.48, Method: Compositional matrix adjust. Identities = 52/205 (25%), Positives = 82/205 (40%), Gaps = 36/205 (17%) Query: 150 HAPLLIT---------SSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 +APLL + SK G LD++ V+D S S+ M + Sbjct: 6 YAPLLFILLVSLNAELQATPAGSKCRTGPLDLVFVIDSSRSVRPFEFETMRRF------- 58 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-----FGSTTK 254 +++II+ + N R G++ +SS++ FPL G +E + R I T Sbjct: 59 --LVNIIRGLDIGPNATRVGVIQYSSQVQSVFPL--GAFSRREDMERAIHAIVPLAQGTM 114 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + ++YA N F E + + +TDG P E +A+ Sbjct: 115 TGLAIQYAMNVAFSVAEGAR---PSQARVPRVAVIVTDGR---PQDRVTE---VAAQARN 165 Query: 315 RGAIVYAIGVQAEAADQFLKNCASP 339 RG +YA+GVQ A L+ ASP Sbjct: 166 RGIEIYAVGVQ-RADVGSLRAMASP 189 >gi|224077994|ref|XP_002192008.1| PREDICTED: similar to matrilin 4 [Taeniopygia guttata] Length = 580 Score = 40.4 bits (93), Expect = 0.49, Method: Compositional matrix adjust. Identities = 45/174 (25%), Positives = 75/174 (43%), Gaps = 22/174 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S+ M + M+DII ++ N R G++ +SS++ Sbjct: 33 LDIVFVIDSSRSVRPFEFETMRRF---------MMDIIGNLDVGPNATRVGVIQYSSQVQ 83 Query: 229 QTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + ++ IN +I T + ++YA N F +E + H + Sbjct: 84 NIFSLKTFFTRADMERAINSIIPLAQGTMTGLAIQYAMNVAFTTQEGARPL---HKRIPR 140 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 I +TDG P E +A+ G +YA+G+Q A L+ ASP Sbjct: 141 IAIVVTDGR---PQDRVTE---VATQARNAGIEIYAVGIQ-RADMNSLRAMASP 187 >gi|283850951|ref|ZP_06368236.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283573597|gb|EFC21572.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 330 Score = 40.4 bits (93), Expect = 0.49, Method: Compositional matrix adjust. Identities = 43/173 (24%), Positives = 71/173 (41%), Gaps = 28/173 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV----VRSGLVTF 223 GLD+M+V+D+S SM MD + +A R++ LD + R GLV F Sbjct: 84 GLDIMLVVDLSESMA-----AMD-MRLADRTVTR-LDAVADAAARFAANHPGDRIGLVAF 136 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGH 280 S+ P + + + RL G+ K T GL A ++ DA Sbjct: 137 GSRAYAVMPPSADRAALTGALARLAVGAAGKRTAMGDGLGLAVKRLSDAP---------- 186 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + + DG +++ + +++ A RG VY++GV + FL Sbjct: 187 -GLSRLAVVFGDGRSNAGEVSPEDA---AKAASERGVTVYSVGVGGDEPAPFL 235 >gi|268316013|ref|YP_003289732.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333547|gb|ACY47344.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 329 Score = 40.4 bits (93), Expect = 0.49, Method: Compositional matrix adjust. Identities = 43/161 (26%), Positives = 66/161 (40%), Gaps = 27/161 (16%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+M+VLD+S SM F P + VA R+ + ++ R GLV F+ + Sbjct: 87 GRDLMLVLDLSSSMLAQDFSP--SRFEVARRT---AIQFVQG----RRADRIGLVVFAGQ 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHDDY 283 P + + + RL G T + A N++ K + Sbjct: 138 AFTQVPPTLDYRFLLTMLQRLQVGRLEDGTAIGTAIATAINRL-----------KNSEAR 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K II LTDG+N+ ID L A++ G +Y IG+ Sbjct: 187 SKVIILLTDGQNNRGEID---PLTAAELARQAGIRIYTIGL 224 >gi|303240108|ref|ZP_07326629.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302592377|gb|EFL62104.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 323 Score = 40.4 bits (93), Expect = 0.50, Method: Compositional matrix adjust. Identities = 42/161 (26%), Positives = 72/161 (44%), Gaps = 20/161 (12%) Query: 168 GLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+ + LDVS +M F P ++L VA ++I++ +D S R L+ F+ Sbjct: 81 GIDIAVALDVSGTMQSVDFEP--NRLEVARKTIQDFVDQRPS-------DRIALIAFAGT 131 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL ++E + + F S + + A + L + K K Sbjct: 132 AYTRVPLTLDHNVVRESLQDISFKSVNEEGTAIGMAIS------VGLNRLKKSTSP-SKI 184 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 +I LTDG+N++ +ID + AK G +Y IGV ++ Sbjct: 185 MILLTDGDNNAGSIDPNTA---STLAKDSGIKIYTIGVGSD 222 >gi|301617277|ref|XP_002938060.1| PREDICTED: matrilin-4-like [Xenopus (Silurana) tropicalis] Length = 721 Score = 40.4 bits (93), Expect = 0.51, Method: Compositional matrix adjust. Identities = 41/160 (25%), Positives = 71/160 (44%), Gaps = 21/160 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ ++D S S+ M K M+DII S+ + R G+V +SS++ Sbjct: 31 MDLVFIIDSSRSVRPFEFETMRKF---------MIDIINSLEVGLSTTRVGVVQYSSQVQ 81 Query: 229 QTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +++ IN +I T + ++YA N F +E ++K + + Sbjct: 82 TVFSLKTFSNKSDMEKAINEIIPLAQGTMTGLAIQYAMNVAFTEEEGARPLSK---NIPR 138 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 I +TDG P E +A+ G +YA+GVQ Sbjct: 139 VAIIVTDGR---PQDRVTEVAV---QAREAGIEIYAVGVQ 172 >gi|139439379|ref|ZP_01772820.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] gi|133775158|gb|EBA38978.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] Length = 2432 Score = 40.0 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 57/219 (26%), Positives = 88/219 (40%), Gaps = 66/219 (30%) Query: 154 LITSSVKISSKSDIG-----LDMMMVLDVSLSMNDHFGPG-----MDKLGVATRSIREML 203 +T+ ISS SD LD++MVLD S SM+D G G +D L A + + Sbjct: 92 FLTTLSAISSTSDTTISGKPLDIVMVLDASGSMDDPMGTGDNTKRIDALKTAANT---FI 148 Query: 204 DII----KSIPDV---------------------NNVVRSGLVTFS-SKIVQTFPLAWG- 236 D I +SI D N+ R G T++ S+ ++ G Sbjct: 149 DAIAAQNQSITDASKQHRVAIVKFAGKKKTDKVGNDTYRDGRYTYNYSQTMKNLTSCKGK 208 Query: 237 -VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 +++ + + +T++ GLE A N I G D KK ++F TDG Sbjct: 209 DADSLKDTVGNINPAGSTQADYGLELAENIT---------INSGRADAKKIVVFFTDGSP 259 Query: 296 SSPN----------IDNKESLFYCNEAKRRGAIVYAIGV 324 +S + I + +SL K GA +Y IG+ Sbjct: 260 TSSSGFQASVADSAIASAKSL------KANGADIYTIGI 292 >gi|328953621|ref|YP_004370955.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] gi|328453945|gb|AEB09774.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] Length = 376 Score = 40.0 bits (92), Expect = 0.52, Method: Compositional matrix adjust. Identities = 16/53 (30%), Positives = 33/53 (62%) Query: 14 KGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 +G+I+++TA+LLPV+ GL I+ + + +K ++ +D ++ K+ NQ Sbjct: 10 EGAIAVITALLLPVLIGFTGLAIDIGNLYVIKTRMQSAVDAAVCGGGLKLPNQ 62 >gi|126738776|ref|ZP_01754472.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] gi|126719957|gb|EBA16664.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] Length = 530 Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust. Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 1/60 (1%) Query: 309 CNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 C+ AK +G IVY IG +A + L++CAS D Y + ++ DAF I + K R+ Sbjct: 469 CSAAKAQGIIVYTIGFEAPSNGVAVLQDCASSDSHYFDVDGLEIRDAFESIATSIRKLRL 528 >gi|330808169|ref|YP_004352631.1| hypothetical protein PSEBR_a1432 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327376277|gb|AEA67627.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 2855 Score = 40.0 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 25/61 (40%), Positives = 38/61 (62%), Gaps = 5/61 (8%) Query: 166 DIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +I ++++VLDVS SM D G PG+ +L +A ++I +LD + D V+ LVTFS Sbjct: 2049 EIDSNLLIVLDVSGSMADDSGVPGLSRLDLAKQAISALLDKYDDLGD----VKVQLVTFS 2104 Query: 225 S 225 S Sbjct: 2105 S 2105 >gi|226314068|ref|YP_002773964.1| hypothetical protein BBR47_44830 [Brevibacillus brevis NBRC 100599] gi|226097018|dbj|BAH45460.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 677 Score = 40.0 bits (92), Expect = 0.54, Method: Compositional matrix adjust. Identities = 49/209 (23%), Positives = 91/209 (43%), Gaps = 43/209 (20%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P +F + P A ++ ++ G+D + V+D S SMN PG Sbjct: 24 LPMLFLSAPLTAGAN-------------GTAEAGVDAVFVVDTSNSMNKT-DPG------ 63 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT-FPLAWGVQHIQEKINRLIFGSTT 253 ++ E++ + + + R G V ++ +IVQ P + +E++ R I Sbjct: 64 --KTAAEVMSMFIDMSEATR-TRIGFVAYNDRIVQAQSPASMAEARNREQLKRTI----- 115 Query: 254 KSTPGLEYA-YNKI-FDAKEKLEHIAKGHDDYKK-YIIFLTDG-----ENS---SPNIDN 302 GL Y+ Y+ + + E I K D +K ++I L+DG +N+ S N Sbjct: 116 ---QGLRYSGYSDLGLGLRRGAEMIEKAKDPARKPFLILLSDGGTDLRQNAGGRSVAASN 172 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 K+ ++AK +G +Y IG+ + + Q Sbjct: 173 KDVETVISKAKAQGYPIYTIGLNNDGSVQ 201 >gi|167624593|ref|YP_001674887.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] gi|167354615|gb|ABZ77228.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] Length = 345 Score = 40.0 bits (92), Expect = 0.55, Method: Compositional matrix adjust. Identities = 52/214 (24%), Positives = 90/214 (42%), Gaps = 34/214 (15%) Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN- 182 + NL+ RY + + CA PL + ++++ SK G D+M+ +D+S SM Sbjct: 47 EKNLATKKRYWIVWCLLVLA-CAR----PLWVGEAIELPSK---GRDLMLSVDLSGSMQI 98 Query: 183 -DHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 D G +D+ + I + ++ K R GL+ F+ PL + Sbjct: 99 EDMVLDGKVVDRFSLIQHVISDFIERRKG-------DRIGLILFADHAYLQSPLTQDRRT 151 Query: 240 IQEKINRLIFGSTTKSTP-GLEYAYN-KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + + G K T G A K FD E+ + +I LTDG N++ Sbjct: 152 VAQYLKEAQIGLVGKQTAIGEAIALAVKRFDKVEQSNRV----------LILLTDGSNNA 201 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 I +++ A +RG +Y IGV A+ ++ Sbjct: 202 GAISPEQA---TQIAAKRGITIYTIGVGADVMER 232 >gi|91977525|ref|YP_570184.1| hypothetical protein RPD_3057 [Rhodopseudomonas palustris BisB5] gi|91683981|gb|ABE40283.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 464 Score = 40.0 bits (92), Expect = 0.55, Method: Compositional matrix adjust. Identities = 47/193 (24%), Positives = 75/193 (38%), Gaps = 30/193 (15%) Query: 204 DIIKSIPDVNNVVRSGLVTF---SSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKS 255 D K P ++ L T +S Q FP+ A Q I++ I+ L+ T Sbjct: 271 DTTKDAPTSDDTRFPALRTLLGTTSCPAQIFPMTSAYAATDAQKIKDVIDDLVADGGTNQ 330 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFLTDGENS--------------SPNI 300 G+ +A+ + L AK + Y II L+DG N+ I Sbjct: 331 PIGMAWAWMSLQQG-NPLNTPAKDPNYKYTDAIILLSDGLNTMDRWPDYGDGQRQFDGKI 389 Query: 301 DNKESLFYCN----EAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRFYSVQNSRKLHDA 354 D ++ L N ++ + +VY I V + LK CA F++ + + A Sbjct: 390 DARQKLLCDNIKLPDSNGKRPVVYTIQVNTTGDPESTILKYCADGGNFFATTTASGIGTA 449 Query: 355 FLRIGKEMVKQRI 367 F +IG + K RI Sbjct: 450 FAQIGSSLSKLRI 462 >gi|284166763|ref|YP_003405042.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284016418|gb|ADB62369.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 853 Score = 40.0 bits (92), Expect = 0.57, Method: Compositional matrix adjust. Identities = 42/188 (22%), Positives = 79/188 (42%), Gaps = 28/188 (14%) Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 PG D + R ++D + D R G+ F+S PL+ ++ +E + Sbjct: 652 PGNDPTNQRVEATRNVIDELDPSAD-----RVGVYDFASSGRALHPLSDDLESAKESVVG 706 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 +G T + GLE A N ++ +G DD ++ +I L+DG+NS N N E + Sbjct: 707 TAYGGTNMAA-GLEAALN---------DYATRGTDDRERIVILLSDGKNS--NTANDERM 754 Query: 307 -FYCNEAKRRGAIVYAIGVQAEAADQFLKN------CASPDRFYSVQNSRKLHDAFLRIG 359 + + ++ +G+ A D ++ + +Y + +L D F Sbjct: 755 DELADRSDDLDYTLHTVGLDALEHDSIPEDKLEGWATETGGNYYQTADPDELLDLF---- 810 Query: 360 KEMVKQRI 367 +E+V + I Sbjct: 811 EEIVDEEI 818 >gi|291087243|ref|ZP_06571866.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291076088|gb|EFE13452.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 2012 Score = 40.0 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 56/216 (25%), Positives = 100/216 (46%), Gaps = 35/216 (16%) Query: 169 LDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLD--IIKSIPDVNNV---VRSGLVT 222 +D++ V+D SLSM+ D G + T S +++++ + + IPD+ + ++ Sbjct: 1475 VDLVFVIDKSLSMDYDIDGNEIKWWDDETESRKDIVNDALEEIIPDLCSQQYDIQIAGYQ 1534 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH-- 280 FS T L W + Q+ ++ L T+ ST + DA + L+ + H Sbjct: 1535 FSGS--STRVLDWSREE-QQVLSGLKIARTSSSTE----PSQALADALDMLKTGSPAHRN 1587 Query: 281 -DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR---GAIVYAIGVQAEAADQFLKNC 336 + KKY+IF+TDGE + P E Y N + GA +Y IGV ++A+ ++ Sbjct: 1588 QSNVKKYLIFMTDGEPTEP-----EDWSY-NAVRNHAVPGASIYTIGVSSDASTNLMEGI 1641 Query: 337 AS---------PDRFYSVQNSRKLHDAFLRIGKEMV 363 S P F +++ + DAF +I E++ Sbjct: 1642 RSTALSNGMYAPATFKGT-SAQLIRDAFTQIKDEII 1676 Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 52/204 (25%), Positives = 86/204 (42%), Gaps = 25/204 (12%) Query: 156 TSSVKISSKSDIG-----------LDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREML 203 T ++ + KSD+G +M V+D S SM+ FG G D S E+ Sbjct: 1031 TYTLTLDVKSDVGSVTTGQKDPTPTAVMFVIDKSGSMDQSFGSGNSDARREVVNSALELF 1090 Query: 204 DIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLI--FGSTTKSTPGL 259 + D + ++ G FS + V W ++ + + + T++ T G Sbjct: 1091 --FNQLSDGDYNIQFGGYKFSDSGERVNFNDQGWETEYWETDTSNALSHLKLTSRETDGS 1148 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE--NSSPNIDNKESLFYCNEAKRR-- 315 Y + A LE++ G + K+Y+IFLTDGE +S + KE+ C A + Sbjct: 1149 TYPSQTLRSAISALENVELGENG-KRYLIFLTDGEPGQNSYSFSEKEAE-NCYSAIKNLD 1206 Query: 316 -GAIVYAIGVQAEAADQFLKNCAS 338 G YAI V + F+++ S Sbjct: 1207 SGTTFYAIQVANSDSHGFMESMVS 1230 >gi|86137906|ref|ZP_01056482.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] gi|85825498|gb|EAQ45697.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] Length = 543 Score = 40.0 bits (92), Expect = 0.63, Method: Compositional matrix adjust. Identities = 33/110 (30%), Positives = 53/110 (48%), Gaps = 9/110 (8%) Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L+Y Y IF + + DD+ Y I+ + G NS+ N + C AK +G + Sbjct: 440 LKYIYRYIF---YEWMNFYDARDDWY-YGIYSSHG-NSTKNARTRS---VCEAAKAKGIV 491 Query: 319 VYAIGVQAEA-ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 VY IG +A + L++CAS D Y + ++ DAF I + + R+ Sbjct: 492 VYTIGFEAPSNGVAVLRDCASSDAHYFDVDGLEIKDAFASIATSIRQLRL 541 >gi|218961690|ref|YP_001741465.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] gi|167730347|emb|CAO81259.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] Length = 270 Score = 40.0 bits (92), Expect = 0.65, Method: Compositional matrix adjust. Identities = 48/179 (26%), Positives = 77/179 (43%), Gaps = 26/179 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 +K S+ G+D++M +D+S SM F P ++L A + D +K P+ Sbjct: 14 GIKTRDLSNKGVDIVMAIDISGSMLAMDFAPK-NRLSAAVSVAK---DFVKRRPN----D 65 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKL 273 R GLV FS + PL + + +++L +T GL A ++ ++ K Sbjct: 66 RFGLVAFSEYALTQVPLTFDHLAMLNSLDKLKVNEEASATAIGMGLAKAVARLKNSTAK- 124 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K II +TDG +++ ID L AK G VY IGV ++ F Sbjct: 125 ----------SKVIILITDGVSNTGEID---PLTAAGMAKELGIKVYPIGVGSKGLVPF 170 >gi|95928343|ref|ZP_01311091.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95135614|gb|EAT17265.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 329 Score = 40.0 bits (92), Expect = 0.67, Method: Compositional matrix adjust. Identities = 51/208 (24%), Positives = 81/208 (38%), Gaps = 39/208 (18%) Query: 140 CTFP-WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 CT P W + P+ G D+M+ +D+S SM D ++ RS Sbjct: 73 CTRPQWLGDPIELPV-------------SGRDLMLAVDLSGSMR------TDDFQLSGRS 113 Query: 199 IREMLDIIKSIP----DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 + + L +K++ D R GL+ F + PL + + ++ + G Sbjct: 114 V-DRLTALKAVAGAFIDQRQGDRIGLILFGEQPYIQAPLTFDHNTVTRLLHEAVVGLAGN 172 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 T I DA + K +I LTDG ++S ++D L A + Sbjct: 173 KT--------AIGDAIGLAVKRLRKDPQAKNVLILLTDGASNSGSLD---PLKAAKLAAQ 221 Query: 315 RGAIVYAIGVQAEAADQ---FLKNCASP 339 RG VY IG+ AEA + F K +P Sbjct: 222 RGLKVYTIGIGAEAVEVGSFFFKRTVNP 249 >gi|167644155|ref|YP_001681818.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] gi|167346585|gb|ABZ69320.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] Length = 562 Score = 39.7 bits (91), Expect = 0.68, Method: Compositional matrix adjust. Identities = 24/70 (34%), Positives = 38/70 (54%), Gaps = 3/70 (4%) Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCAS-PDRFYSVQNSRKLHDAFLR 357 ID++E+ C AK G +YAIGV + + L++CAS P+ +Y V ++ +L F Sbjct: 492 IDSREAR-ACTNAKAAGVQIYAIGVGVSSHSRGILQDCASKPEMYYDVTDAAQLASVFNT 550 Query: 358 IGKEMVKQRI 367 I + RI Sbjct: 551 IAGSIQNLRI 560 >gi|332534652|ref|ZP_08410484.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] gi|332035932|gb|EGI72413.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] Length = 328 Score = 39.7 bits (91), Expect = 0.69, Method: Compositional matrix adjust. Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 22/169 (13%) Query: 168 GLDMMMVLDVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G D+M+ +D+S SM + + G +D+L + + + ++ + R GL+ F Sbjct: 86 GRDIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLSDFIEQRQGD-------RLGLILF 138 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL V+ + + ++ G ++T I DA D+ Sbjct: 139 GDTAFLQTPLTRDVKTVSKMLSEAQIGLVGRAT--------AIGDALGLSVKRFANKDES 190 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ LTDG+N++ N++ +++L A+ G VY IGV ++ F Sbjct: 191 NRIVVLLTDGQNTAGNLNPEDALLL---AREEGIKVYTIGVGSDNPRGF 236 >gi|300796915|ref|NP_001178240.1| matrilin-4 [Bos taurus] Length = 584 Score = 39.7 bits (91), Expect = 0.70, Method: Compositional matrix adjust. Identities = 45/174 (25%), Positives = 76/174 (43%), Gaps = 22/174 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S+ M + ++ +++S+ N R G++ +SS++ Sbjct: 35 LDLVFVIDSSRSVRPFEFETMRQF---------LVGLLRSLDVGPNATRVGVIQYSSQVQ 85 Query: 229 QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGL--EYAYNKIFDAKEKLEHIAKGHDDYKK 285 FPL A+ + E+ R + + GL +YA N F E + H + Sbjct: 86 SVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSVAEGA-RPPEAH--VPR 142 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +TDG P E +A+ RG +YA+GVQ A L+ ASP Sbjct: 143 VAVIVTDGR---PQDRVAE---VAAQARARGIEIYAVGVQ-RADVGSLRAMASP 189 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 39.7 bits (91), Expect = 0.70, Method: Compositional matrix adjust. Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 1/60 (1%) Query: 309 CNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 CN AK +G +VY IG +A ++ LK+CAS D + ++ DAF I + + R+ Sbjct: 497 CNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGLEIRDAFASIATSIRQLRL 556 >gi|297482040|ref|XP_002692521.1| PREDICTED: matrilin 4 [Bos taurus] gi|296480952|gb|DAA23067.1| matrilin 4 [Bos taurus] Length = 584 Score = 39.7 bits (91), Expect = 0.73, Method: Compositional matrix adjust. Identities = 45/174 (25%), Positives = 76/174 (43%), Gaps = 22/174 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S+ M + ++ +++S+ N R G++ +SS++ Sbjct: 35 LDLVFVIDSSRSVRPFEFETMRQF---------LVGLLRSLDVGPNATRVGVIQYSSQVQ 85 Query: 229 QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGL--EYAYNKIFDAKEKLEHIAKGHDDYKK 285 FPL A+ + E+ R + + GL +YA N F E + H + Sbjct: 86 SVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSVAEGA-RPPEAH--VPR 142 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +TDG P E +A+ RG +YA+GVQ A L+ ASP Sbjct: 143 VAVIVTDGR---PQDRVAE---VAAQARARGIEIYAVGVQ-RADVGSLRAMASP 189 >gi|327274976|ref|XP_003222250.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2088 Score = 39.7 bits (91), Expect = 0.74, Method: Compositional matrix adjust. Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 4/89 (4%) Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 EH ++ H + +I +TDG++ D E + + G I+YAIG++ D+ L Sbjct: 893 EHGSRKHRGVPQVLIVITDGDSH----DAAELDEVSKKLRANGIIIYAIGIERARPDELL 948 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 S D+++ V L + RI +++ Sbjct: 949 TMAGSEDKYFYVNTFEGLKHLYPRISEKI 977 >gi|325267447|ref|ZP_08134103.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] gi|324981088|gb|EGC16744.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] Length = 238 Score = 39.7 bits (91), Expect = 0.75, Method: Compositional matrix adjust. Identities = 46/202 (22%), Positives = 88/202 (43%), Gaps = 19/202 (9%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + L ++++LDVS SM+ DK+ +++ MLD + + Sbjct: 6 KFTTPTAKPLPVVLLLDVSSSMSG------DKIDNLNKAVENMLDTFAQEEKMETEILVS 59 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++TF K+ P + Q + + L T L+ A I D KE A Sbjct: 60 VITFGGKVDLHVPF---TKASQVQWHGLQVNGDTPMGTALKMAKAMIED-KETTPSRA-- 113 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK---NC 336 Y+ I+ ++DG+ + NI + + +E + A+ + +A + LK Sbjct: 114 ---YRPTIVLVSDGQPTDGNIWKQAMADFISEGRSSKCDRMAMAIGHDADETVLKRFIEG 170 Query: 337 ASPDRFYSVQNSRKLHDAFLRI 358 + D FY+ +N+ +LH+ F R+ Sbjct: 171 TAHDLFYA-ENAGQLHEFFQRV 191 >gi|90414549|ref|ZP_01222523.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] gi|90324356|gb|EAS40922.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] Length = 321 Score = 39.7 bits (91), Expect = 0.75, Method: Compositional matrix adjust. Identities = 39/166 (23%), Positives = 71/166 (42%), Gaps = 29/166 (17%) Query: 170 DMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 DM++ +D+S SM+ G +D+L + E ++ K R GLV F+ Sbjct: 84 DMLLAVDLSGSMSIEDMITQSGESIDRLAAVKDVLAEFIEQRKG-------DRLGLVLFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHD 281 PL + +++++ R + G +ST GL A +++ Sbjct: 137 QHAYLQTPLTFDRNTVKQQLERTVLGLIGQSTAIGEGLGIATKTFINSEAP--------- 187 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 ++ II L+DG N++ I+ E+ E+ +Y +GV AE Sbjct: 188 --QRVIILLSDGANTAGVIEPLEAAKLAAESN---VTIYTVGVGAE 228 >gi|291409921|ref|XP_002721255.1| PREDICTED: matrilin 4 [Oryctolagus cuniculus] Length = 346 Score = 39.7 bits (91), Expect = 0.76, Method: Compositional matrix adjust. Identities = 40/161 (24%), Positives = 70/161 (43%), Gaps = 23/161 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ V+D S S+ + ++R L +++S+ N R G++ +SS++ Sbjct: 35 LDLVFVIDSSRSVRP----------LEFETMRRFLVGLVRSLDVGPNATRVGVIQYSSQV 84 Query: 228 VQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGL--EYAYNKIFDAKEKLEHIAKGHDDYK 284 FPL A+ + E+ R + + GL +YA N F E + Sbjct: 85 QSVFPLGAFSRREDMERALRTLVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEERVP 141 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG P E +A+ RG +YA+GVQ Sbjct: 142 RVAVIVTDGR---PQDRVAEVA---AQARARGIEIYAVGVQ 176 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 39.7 bits (91), Expect = 0.76, Method: Compositional matrix adjust. Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 1/60 (1%) Query: 309 CNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 CN AK +G +VY IG +A ++ LK+CAS D + ++ DAF I + + R+ Sbjct: 499 CNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGLEIRDAFASIATSIRQLRL 558 >gi|258647263|ref|ZP_05734732.1| BatA protein [Prevotella tannerae ATCC 51259] gi|260852912|gb|EEX72781.1| BatA protein [Prevotella tannerae ATCC 51259] Length = 334 Score = 39.7 bits (91), Expect = 0.83, Method: Compositional matrix adjust. Identities = 48/177 (27%), Positives = 75/177 (42%), Gaps = 30/177 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 +S+K G+++MM +DVS SM P + + T + + + I + PD N GL Sbjct: 80 LSNKETEGINIMMAIDVSTSM---LTPDLPPSRIET-AKQVAYEFINNRPDDN----IGL 131 Query: 221 VTFSSKIVQTFPLAW---GVQHIQEKIN-----RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 F + PL + ++ +++N + T GL A + + +K K Sbjct: 132 TVFGGEAYTQCPLTTDHSALLNMFKQVNCDLQKEGVISPGTAIGMGLSSAVSHLEQSKSK 191 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 K II LTDGEN++ I L AKR G +Y I V +AA Sbjct: 192 -----------SKVIILLTDGENNAGEIS---PLTAAEMAKRLGIRIYTISVGTDAA 234 >gi|332298719|ref|YP_004440641.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181822|gb|AEE17510.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 333 Score = 39.7 bits (91), Expect = 0.86, Method: Compositional matrix adjust. Identities = 52/203 (25%), Positives = 86/203 (42%), Gaps = 36/203 (17%) Query: 128 SAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 SAVSR I C +CA + +P+++ ++K G +++ VLDVS SM Sbjct: 56 SAVSR-----ILCIAGYCAVVAALASPVVMRQEKVYTAK---GSEILFVLDVSPSMAAKD 107 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 GM +L A +++R + +P+ GLV +S+ P + ++N Sbjct: 108 IAGMSRLEAAKQAVRVI------VPEAGGTA-FGLVALASEAALMVPPTLDREAFFARLN 160 Query: 246 RLIFGSTT-KSTPGL---EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 L G S G+ AY+ I A KK I+ +TDGEN++ ++ Sbjct: 161 SLQAGELGDGSAIGMGVSTAAYHLISSAAP------------KKSIVLITDGENNAGSVH 208 Query: 302 NKESLFYCNEAKRRGAIVYAIGV 324 + A G +Y +GV Sbjct: 209 PGTA---AQLAFENGITLYVLGV 228 >gi|291514853|emb|CBK64063.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 328 Score = 39.3 bits (90), Expect = 0.88, Method: Compositional matrix adjust. Identities = 47/165 (28%), Positives = 67/165 (40%), Gaps = 27/165 (16%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM F P D++ A ++ S R GLV F+ + Sbjct: 87 GIDIMLAIDVSGSMLARDFKP--DRITAAK-------EVAGSFIADRYGDRIGLVAFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL +Q + R+ G T GL A N++ ++ K Sbjct: 138 AFTQSPLTTDQSTLQTLLARIRSGLIEDGTAIGNGLATAINRLRESDAK----------- 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 K II LTDG N+ I + AK +G VY IGV E Sbjct: 187 SKVIILLTDGVNNQGQI---APMTAAEIAKAQGIRVYTIGVGTEG 228 >gi|254517645|ref|ZP_05129701.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226911394|gb|EEH96595.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 979 Score = 39.3 bits (90), Expect = 0.88, Method: Compositional matrix adjust. Identities = 38/133 (28%), Positives = 64/133 (48%), Gaps = 29/133 (21%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSI------REMLDIIKSIPDVNNVVRSGLVTF 223 D+++VLDVS M D L + S+ ++ L I KS + GL+TF Sbjct: 520 DIVIVLDVSQEM-------KDSLTIVKNSLFNKLLNKDALKISKS--------QYGLITF 564 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S+++ Q PL + ++ + N + +T TP + K FD+ K+ + G D Sbjct: 565 SNQVKQEIPLTDNITNLND--NYIKSLATDSLTPNIS----KTFDSITKV--LNSGRADA 616 Query: 284 KKYIIFLTDGENS 296 KK +IF++ G+ S Sbjct: 617 KKNVIFISTGQAS 629 >gi|157374763|ref|YP_001473363.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] gi|157317137|gb|ABV36235.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] Length = 330 Score = 39.3 bits (90), Expect = 0.88, Method: Compositional matrix adjust. Identities = 44/184 (23%), Positives = 83/184 (45%), Gaps = 25/184 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHF--GPGMDKLGVATRSIREMLDIIK 207 PL + +++ SK G D+MM +D+S SM D G +D+ + + + ++ K Sbjct: 69 PLWMGDPIELPSK---GRDLMMAVDLSGSMQIEDMVLDGKTVDRFTMIQAVVSDFIERRK 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + GL+ F+ PL + + + + G K T E I Sbjct: 126 G-------DKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGE----AIA 174 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 A ++ + + D+ + ++ LTDG N+S +I +++ A +RG +Y+IGV AE Sbjct: 175 LAVKRFDRV----DESNRILVLLTDGSNNSGSISPEQA---AAIAAKRGVKIYSIGVGAE 227 Query: 328 AADQ 331 ++ Sbjct: 228 VMER 231 >gi|163751139|ref|ZP_02158369.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161329095|gb|EDQ00167.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 334 Score = 39.3 bits (90), Expect = 0.97, Method: Compositional matrix adjust. Identities = 45/182 (24%), Positives = 78/182 (42%), Gaps = 21/182 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 PL + ++++ SK G D+M+ +D+S SM M G + D++ + Sbjct: 69 PLWMGDAIELPSK---GRDLMLAVDLSGSMQIE---DMVLNGQTVDRFTMIQDVVSDFIE 122 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAYN-KIFDA 269 + GL+ F+ PL + + + + G K T G A K FD Sbjct: 123 RRKGDKLGLILFADHAYLQAPLTQDRRSVAQFLQEAQIGLVGKQTAIGEAIALGVKRFDM 182 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +K I ++ LTDG N+S +I +++ A +RG +YAIGV A+ Sbjct: 183 VDKSNRI----------LVLLTDGSNNSGSISPEQA---AAIAAKRGVKIYAIGVGADVM 229 Query: 330 DQ 331 ++ Sbjct: 230 ER 231 >gi|254512360|ref|ZP_05124427.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] gi|221536071|gb|EEE39059.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] Length = 668 Score = 39.3 bits (90), Expect = 1.0, Method: Compositional matrix adjust. Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 2/62 (3%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F + GS++ILT L+ ++F V G ++ + +L Y LD ++L A L+ Sbjct: 23 LRRFAVSTDGSMTILTLFLIMIVFTVAGFAVDLMRYDRERVRLQYALDRAVLAAAD--LD 80 Query: 66 QE 67 QE Sbjct: 81 QE 82 >gi|302870768|ref|YP_003839404.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] gi|302573627|gb|ADL41418.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] Length = 900 Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust. Identities = 52/193 (26%), Positives = 77/193 (39%), Gaps = 26/193 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S SM D G+ KL +A + +M++ ++S V G++ F Sbjct: 406 IDVVLVLDHSGSMADTEDAGISKLEIAKSASAKMIEHLESSDGV------GVIAFDHNYY 459 Query: 229 QTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + + V+ + E I+ + G T P L A K K Sbjct: 460 WAYEFSKLVRKKDVIESISSIEVGGGTAIIPPLSEAV-----------KTLKKSKAKSKL 508 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYS 344 I+ LTDG + NEAKR + IGV L AS RFY Sbjct: 509 IVLLTDGMGEQGGYE-----IPANEAKRNNIKITTIGVGKFVNLPVLSWIASFTSGRFYL 563 Query: 345 VQNSRKLHDAFLR 357 V N +L D FL+ Sbjct: 564 VSNPYELVDVFLK 576 >gi|222528098|ref|YP_002571980.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222454945|gb|ACM59207.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 902 Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust. Identities = 51/193 (26%), Positives = 75/193 (38%), Gaps = 26/193 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S SM D G+ KL +A + +M++ ++S V G++ F Sbjct: 406 IDVVLVLDHSGSMADTEDAGIPKLEIAKSASAKMIEHLESSDGV------GVIAFDHNYY 459 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + + + E I+ + G T P L A K K Sbjct: 460 WAYKFGKISKKEDVIESISSIEVGGGTAIIPPLSEAV-----------KTLKKSKAKSKL 508 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYS 344 I+ LTDG + NEAKR + IGV L AS RFY Sbjct: 509 IVLLTDGMGEQGGYE-----IPANEAKRNNIKITTIGVGKYVNATVLSWIASFTSGRFYL 563 Query: 345 VQNSRKLHDAFLR 357 V N +L D FL+ Sbjct: 564 VSNPSELVDVFLK 576 >gi|109009638|ref|XP_001105446.1| PREDICTED: epithelial chloride channel protein-like [Macaca mulatta] Length = 829 Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust. Identities = 29/74 (39%), Positives = 40/74 (54%), Gaps = 9/74 (12%) Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY--AIGVQAEAADQFLKNCASPDRFYS 344 II LTDGE DN+ S + E K+ GAI++ A+G A+ + L N RFY+ Sbjct: 411 IILLTDGE------DNQMSSCF-EEVKQSGAIIHTIALGPSADRELETLSNMTRGRRFYA 463 Query: 345 VQNSRKLHDAFLRI 358 ++ L DAF RI Sbjct: 464 HKDINGLIDAFSRI 477 >gi|301755498|ref|XP_002913610.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like [Ailuropoda melanoleuca] Length = 765 Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust. Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 19/148 (12%) Query: 211 DVN-NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKI 266 D+N VR G + FSS FPL Q ++ KI R++F G T++ L+Y + Sbjct: 82 DINPERVRVGALQFSSAPRLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLLRRG 141 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 F + + ++ +TDG + P + L K RG V+A+GV+ Sbjct: 142 FPGGR--------NASVPQILVVITDGRSQGPVELPAKQL------KERGVTVFAVGVRF 187 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDA 354 ++ L AS R V + ++ DA Sbjct: 188 PRWEE-LHTLASEPREQHVLMAEQVDDA 214 >gi|118100589|ref|XP_425698.2| PREDICTED: similar to matrilin-4 [Gallus gallus] Length = 564 Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust. Identities = 44/174 (25%), Positives = 74/174 (42%), Gaps = 22/174 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S+ M + M+DII ++ N R G++ +SS++ Sbjct: 33 LDIVFVIDSSRSVRPFEFETMRRF---------MIDIIGNLDVGPNATRVGVIQYSSQVQ 83 Query: 229 QTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + ++ IN ++ T + ++YA N F +E H + Sbjct: 84 NIFSLKTFFTRAEMERAINSIVPLAQGTMTGLAIQYAMNVAFTVQEGAR---PPHKKIPR 140 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 I +TDG P E + A+ G +YA+G+Q A L+ ASP Sbjct: 141 IAIIVTDGR---PQDRVSEVAAH---ARNAGIEIYAVGIQ-RADMNSLRAMASP 187 >gi|313232459|emb|CBY24127.1| unnamed protein product [Oikopleura dioica] Length = 1632 Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 44/159 (27%), Positives = 76/159 (47%), Gaps = 28/159 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVA-TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD+ +V+D S S+ P D++ + T ++ M D I N V+ GL +FS Sbjct: 1400 LDIQIVIDTSGSLTS--APNKDQVLMNFTNNLANMYDTI-------NQVKIGLTSFSESS 1450 Query: 228 VQTFPLAWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V PL + Q +Q+ ++ + + GS T T G+E A N + D + ++ + Sbjct: 1451 VLEMPLDFYNQLELQDGVSNMTWQGSFTNITSGVETALNDM-DTSDAVDDV--------- 1500 Query: 286 YIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIG 323 +I +TDG S+ N +F ++AK G + A+G Sbjct: 1501 -MILITDGFQST----NTTLMFQMIDQAKADGVRLIALG 1534 >gi|116329598|ref|YP_799317.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332487|ref|YP_802204.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122491|gb|ABJ80384.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127354|gb|ABJ77446.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 312 Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust. Identities = 44/162 (27%), Positives = 74/162 (45%), Gaps = 20/162 (12%) Query: 168 GLDMMMVLDVSLSMNDH--FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G+D+M+ LDVS SM+ F P +LGV+ + +R+ +D KS R GLV F+ Sbjct: 85 GVDVMIALDVSGSMSRSRDFLP-ETRLGVSKKLLRKFIDKRKSD-------RLGLVVFAG 136 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL + + E I G+ + T + I DA + + K Sbjct: 137 AAYLQAPLTGDRESLNE-----ILGTIEEET--VAEQGTAIGDAIILSTYRLRASQARSK 189 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 I+ +TDG +++ ID + + A+ G +Y++G+ E Sbjct: 190 VIVLITDGVSNTGKIDPVTA---TDLAEHIGVKIYSVGIGKE 228 >gi|326931809|ref|XP_003212016.1| PREDICTED: matrilin-4-like, partial [Meleagris gallopavo] Length = 465 Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust. Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 13/143 (9%) Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKST 256 R M+DII ++ N R G++ +SS++ F L + ++ IN ++ T + Sbjct: 55 RFMIDIIGNLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTG 114 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 ++YA N F +E H + I +TDG P E + A+ G Sbjct: 115 LAIQYAMNVAFTVQEGAR---PPHKKIPRIAIIVTDGR---PQDRVSEVAAH---ARNAG 165 Query: 317 AIVYAIGVQAEAADQFLKNCASP 339 +YA+G+Q A L+ ASP Sbjct: 166 IEIYAVGIQ-RADMNSLRAMASP 187 >gi|301626452|ref|XP_002942405.1| PREDICTED: collagen alpha-6(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2615 Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust. Identities = 46/183 (25%), Positives = 76/183 (41%), Gaps = 37/183 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+N D + M+ + PD V+ GL+ FSS+ + Sbjct: 1025 DIVFLVDSSASINS------DDYETMKEFMESMVKQAEIGPDR---VQIGLIQFSSETKE 1075 Query: 230 TFPL---------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FPL ++ IQ+ + G K T L Y F A + G Sbjct: 1076 EFPLNRYKRKDEIQSAIRGIQQLSQGTLMGEALKYT--LPY-----FSASKG------GR 1122 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + K+Y+I +TDGE + +++ + G I+YAIGVQ Q L+ + Sbjct: 1123 VNTKQYLIVITDGEAQDAVGNPAKAI------RDHGVIIYAIGVQQANNTQLLEIAGKQE 1176 Query: 341 RFY 343 + Y Sbjct: 1177 QVY 1179 >gi|197336671|ref|YP_002158318.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197313923|gb|ACH63372.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 321 Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust. Identities = 40/165 (24%), Positives = 70/165 (42%), Gaps = 29/165 (17%) Query: 170 DMMMVLDVSLSMNDHF-----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 DMM+V+D+S SM + G +D+L + + + +D K R GLV F Sbjct: 84 DMMLVVDLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQRKGD-------RLGLVLFG 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 PL + ++E+++R + G T GL A ++ Sbjct: 137 DHAYLQTPLTFDRNTVREQLDRTVLRLVGQMTAMGEGLGLATKTFIESNAP--------- 187 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ II L+DG N++ ++ E+ AK A +Y +G+ A Sbjct: 188 --QRTIILLSDGANTAGVLEPLEA---AQLAKDNHAKIYTVGIGA 227 >gi|327271798|ref|XP_003220674.1| PREDICTED: matrilin-4-like [Anolis carolinensis] Length = 592 Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust. Identities = 46/181 (25%), Positives = 78/181 (43%), Gaps = 23/181 (12%) Query: 163 SKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SK G LD++ V+D S S+ M + M+DII ++ N R G++ Sbjct: 38 SKCKTGPLDIVFVIDSSRSVRPFEFETMRRF---------MIDIIHNLDIGPNATRVGVI 88 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +SS++ F L + +++ IN ++ T + ++Y N F +E + Sbjct: 89 QYSSQVQNVFSLKSFFTRAEMEKAINNIVPLAQGTMTGLAIQYVMNVAFTTQEGARPL-- 146 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 H + + +TDG P E +A+ G +YA+GVQ A L+ AS Sbjct: 147 -HKKIPRVAVIVTDGR---PQDRVTE---VSAQARAAGIEIYAVGVQ-RADMNSLRAMAS 198 Query: 339 P 339 P Sbjct: 199 P 199 >gi|212635916|ref|YP_002312441.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212557400|gb|ACJ29854.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 333 Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust. Identities = 48/193 (24%), Positives = 82/193 (42%), Gaps = 24/193 (12%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 PL + +++ SK G D+M+ +D+S SM M G T + +I + Sbjct: 71 PLWMGEPIELPSK---GRDLMLSVDLSGSMQIE---DMVIDGKVTDRFTLIQHVISQFIE 124 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAYN-KIFDA 269 R GL+ F+ PL + + + +N G + T G A K FD Sbjct: 125 RRKGDRIGLILFADHAYLQSPLTQDRRTVAQYLNEAEIGLVGRQTAIGEAIALGVKRFDQ 184 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 E + + +I LTDG N++ +I +++ + A +RG +Y +GV AE Sbjct: 185 VE----------NSNRVLILLTDGSNNAGSISPEQA---TDIAAKRGITIYTVGVGAEVM 231 Query: 330 DQ---FLKNCASP 339 ++ F K +P Sbjct: 232 ERRTLFGKERVNP 244 >gi|163759224|ref|ZP_02166310.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] gi|162283628|gb|EDQ33913.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] Length = 541 Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust. Identities = 41/183 (22%), Positives = 70/183 (38%), Gaps = 40/183 (21%) Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F ++ PL I+ + L +T G+ + + + D + + K Sbjct: 359 FGCEMEPLVPLTTDFSKIRTTVKALEANGSTNMLEGVMWGWRVLSDREPFAQGAPKSDAS 418 Query: 283 YKKYIIFLTDGENSSPNIDN---------------------------------KESLFYC 309 +K +IFLTDG+NS N++N K++ C Sbjct: 419 VEKIMIFLTDGQNSFGNLNNDLGSAYTSMGYLVDGRLDGMTAANIGQTNNALDKKTKAAC 478 Query: 310 NEAKRRGAIVYAIGVQAEAAD----QFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 AK G +Y I + E AD + L+ CA S ++ + ++L F I K +VK Sbjct: 479 ENAKEDGVTIYTI--RLEEADVGTGKMLEECATSSAHYFDAPSRQQLTPIFDAIKKGVVK 536 Query: 365 QRI 367 R+ Sbjct: 537 LRL 539 >gi|281347736|gb|EFB23320.1| hypothetical protein PANDA_001404 [Ailuropoda melanoleuca] Length = 708 Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust. Identities = 33/119 (27%), Positives = 53/119 (44%), Gaps = 18/119 (15%) Query: 211 DVN-NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKI 266 D+N VR G + FSS FPL Q ++ KI R++F G T++ L+Y + Sbjct: 40 DINPERVRVGALQFSSAPRLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLLRRG 99 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 F + + ++ +TDG + P + L K RG V+A+GV+ Sbjct: 100 FPGGR--------NASVPQILVVITDGRSQGPVELPAKQL------KERGVTVFAVGVR 144 >gi|161788949|dbj|BAF95091.1| double von Willebrand factor A domains [Mus musculus] Length = 2309 Score = 38.9 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 42/168 (25%), Positives = 75/168 (44%), Gaps = 29/168 (17%) Query: 201 EMLDIIKSIPDVNNV----VRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGS 251 EM D +K + + ++ VR G+V +S KI+ F L G+ + I + G+ Sbjct: 867 EMKDFMKEVIKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGT 926 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT A +K+ + + D +Y+I +TDG+++ P + + L Sbjct: 927 TTGK------ALSKMVPVFQNTARV-----DVARYLIVITDGQSTDPVAEAAQGL----- 970 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRF--YSVQNSRKLHDAFLR 357 + G +YAIGV+ +A L+ AS F Y + + +H +R Sbjct: 971 -RDIGVNIYAIGVR-DANTTELEEIASKKMFFIYEFDSLKSIHQEVIR 1016 >gi|148689169|gb|EDL21116.1| mCG140660 [Mus musculus] Length = 2242 Score = 38.9 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 42/168 (25%), Positives = 75/168 (44%), Gaps = 29/168 (17%) Query: 201 EMLDIIKSIPDVNNV----VRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGS 251 EM D +K + + ++ VR G+V +S KI+ F L G+ + I + G+ Sbjct: 867 EMKDFMKEVIKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGT 926 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT A +K+ + + D +Y+I +TDG+++ P + + L Sbjct: 927 TTGK------ALSKMVPVFQNTARV-----DVARYLIVITDGQSTDPVAEAAQGL----- 970 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRF--YSVQNSRKLHDAFLR 357 + G +YAIGV+ +A L+ AS F Y + + +H +R Sbjct: 971 -RDIGVNIYAIGVR-DANTTELEEIASKKMFFIYEFDSLKSIHQEVIR 1016 >gi|84502751|ref|ZP_01000870.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] gi|84389146|gb|EAQ01944.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] Length = 470 Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust. Identities = 22/61 (36%), Positives = 37/61 (60%), Gaps = 3/61 (4%) Query: 309 CNEAKRRGAIVYAIGVQA-EAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 CN AK G I+Y+IG + A Q +++CA SP FY V+ ++ +AF I +++ + R Sbjct: 409 CNAAKANGIIIYSIGFEINNDAAQEMEDCASSPSHFYRVEGV-QISEAFSSIAQQLKQLR 467 Query: 367 I 367 + Sbjct: 468 L 468 >gi|296111730|ref|YP_003622112.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] gi|295833262|gb|ADG41143.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] Length = 894 Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust. Identities = 38/136 (27%), Positives = 63/136 (46%), Gaps = 23/136 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-- 226 +D+++V+D+S SMN G D++G A + ++ L I + V G+V FSS Sbjct: 142 IDIVLVVDMSGSMNSSVNGGNDRVGAARQGVKNFLKTINDA-GIGKYVNVGVVGFSSPGY 200 Query: 227 IVQTFPLAWGVQ------HIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 I + L+ + HI +IN L+ F T + G+ + +A Sbjct: 201 ISSSGTLSENIDASDNQAHI-TRINNLLANDFKGGTFTQLGI----------RTGQSMLA 249 Query: 278 KGHDDYKKYIIFLTDG 293 +D+KK +I LTDG Sbjct: 250 GDSNDHKKMMILLTDG 265 >gi|19031199|gb|AAL17973.1| proximal thread matrix protein 1b [Mytilus edulis] Length = 444 Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust. Identities = 44/179 (24%), Positives = 78/179 (43%), Gaps = 22/179 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-KSIPDVNNVVRSGLVTFSSKIV 228 D+ V D S S+N + + G+ +++++D K+ PD +VTF+ + Sbjct: 244 DIAFVFDASSSINAN---NPNNYGLMKDFMKDIVDRFNKTGPDGTQFA---VVTFADRAT 297 Query: 229 QTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + F L ++ +K+ I G T GLE A ++F + G ++ Sbjct: 298 KQFGLKDYSSKAEIKGAIDKVTPSIIGQTAIGD-GLENARLEVFPNRN-----GGGREEV 351 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K +I LTDG+N+ ES ++ G ++ AIGV L N AS + + Sbjct: 352 QKVVILLTDGQNNGHKSPEHESSLL----RKEGVVIVAIGVGTGFLKSELINIASSEEY 406 >gi|304312669|ref|YP_003812267.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] gi|301798402|emb|CBL46626.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] Length = 347 Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust. Identities = 46/186 (24%), Positives = 79/186 (42%), Gaps = 37/186 (19%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDIIK 207 PL + V I+ + D+M+ +D S SM G +D+L V IK Sbjct: 79 PLHVGDPVSINPHAR---DLMLAVDTSQSMEIQDMRLHGEPVDRLTV-----------IK 124 Query: 208 SIPDVNNVV------RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 S+ V++ + R GL+ F ++ PL + + ++ +N +S G+ Sbjct: 125 SV--VDDFISHRKNDRIGLILFGTQAYLQTPLTFDHKTVRTLLN--------ESRIGIAG 174 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 I DA K H K +I LTDG N++ ++ ++ A R+G +Y Sbjct: 175 GQTAIGDAIGLALKRLKNHKTGSKVLILLTDGANTAGSVSPVQA---AELAARQGMKIYT 231 Query: 322 IGVQAE 327 +GV A+ Sbjct: 232 VGVGAD 237 >gi|254443725|ref|ZP_05057201.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258033|gb|EDY82341.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 339 Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust. Identities = 46/202 (22%), Positives = 85/202 (42%), Gaps = 27/202 (13%) Query: 168 GLDMMMVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G D+++ +D+S SM D+F +D+ + ++ + ++ + + R GL+ F+ Sbjct: 85 GYDIVLAVDLSRSMEAEDYF---VDR--KRSNRLQAVKPVLSAFINRRENDRIGLIAFAG 139 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFD-AKEKLEHIAKGHD 281 + PL + + + + RL G T L A +++ + AKE+ + Sbjct: 140 RAYTVAPLTFDHKWLARQTERLQIGLIEDGTAIGDSLAVATSRLLEGAKER------AGE 193 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +I+ LTDGEN++ +D E AK G VY I F + +R Sbjct: 194 REGAFIVLLTDGENTAGMMDPMEG---ATLAKDAGIRVYTIAAGKNGYVPFPRRNERGER 250 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 + Q FLR+ E + Sbjct: 251 IGTTQE-------FLRVDTETL 265 >gi|149042955|gb|EDL96529.1| matrilin 4 (predicted), isoform CRA_a [Rattus norvegicus] Length = 637 Score = 38.5 bits (88), Expect = 1.6, Method: Compositional matrix adjust. Identities = 36/129 (27%), Positives = 57/129 (44%), Gaps = 13/129 (10%) Query: 214 NVVRSGLVTFSSKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGL--EYAYNKIFDAK 270 N R G++ +SS++ FPL A+ + E+ R + + GL +YA N F Sbjct: 84 NATRVGVIQYSSQVQSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEA 143 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + + ++ +TDG P E +A+ RG +YA+GVQ A Sbjct: 144 EGAR---PSEERVPRVLVIVTDGR---PQDRVAE---VAAQARARGIEIYAVGVQ-RADV 193 Query: 331 QFLKNCASP 339 L+ ASP Sbjct: 194 GSLRAMASP 202 >gi|90424817|ref|YP_533187.1| hypothetical protein RPC_3326 [Rhodopseudomonas palustris BisB18] gi|90106831|gb|ABD88868.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 479 Score = 38.5 bits (88), Expect = 1.6, Method: Compositional matrix adjust. Identities = 37/140 (26%), Positives = 59/140 (42%), Gaps = 13/140 (9%) Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFLTDGENSSP 298 ++ KIN+L T GL +A+ + L AK + Y II L+DG+N+ Sbjct: 339 LKGKINKLDAEGNTNQPIGLFWAWMSL-QTGVPLNTPAKDTEYKYTDAIILLSDGDNTQS 397 Query: 299 NIDNKESLF------YCNEAK---RRGAIVYAIGVQAEAADQ--FLKNCASPDRFYSVQN 347 N S C+ K ++ I V + D+ LK CAS +F+ Sbjct: 398 GNSNSVSAIDARQKKLCDNIKDPLNGTTTIFTIQVNTDGDDESAVLKYCASDGQFFQSTT 457 Query: 348 SRKLHDAFLRIGKEMVKQRI 367 + ++ AF IG + K R+ Sbjct: 458 ADQIEIAFQSIGSSLTKLRL 477 >gi|189230272|ref|NP_001121460.1| anthrax toxin receptor 2 [Xenopus (Silurana) tropicalis] gi|183985706|gb|AAI66225.1| LOC100158556 protein [Xenopus (Silurana) tropicalis] Length = 488 Score = 38.5 bits (88), Expect = 1.6, Method: Compositional matrix adjust. Identities = 41/166 (24%), Positives = 73/166 (43%), Gaps = 14/166 (8%) Query: 189 MDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 +DK G + E+ + ++ + + V+ +R + FS++ PL I + + Sbjct: 47 LDKSGSVASNWVEIYEFVEKLTERFVSPRMRLSFIVFSTQAKIILPLTGDRYEITKGLKD 106 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE--NSSPNIDNKE 304 L ++ G Y + A E++ + G II LTDG+ + P + KE Sbjct: 107 L----SSVIPAGETYMHEGFKLANEQI--VKAGGKSTASVIIALTDGKLADQIPVLTEKE 160 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + N A+ RGA VY +GV DQ + A+P+ + V+ K Sbjct: 161 A----NIARGRGARVYCVGVLDFNFDQLKRIAAAPENVFRVEGGFK 202 >gi|300871001|ref|YP_003785873.1| aerotolerance-like membrane protein [Brachyspira pilosicoli 95/1000] gi|300688701|gb|ADK31372.1| aerotolerance-related membrane protein [Brachyspira pilosicoli 95/1000] Length = 328 Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust. Identities = 49/192 (25%), Positives = 78/192 (40%), Gaps = 28/192 (14%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR---SIREMLDIIKS 208 P + SS I+ + G+ + MV+D+S SM M + + TR S + M D IK Sbjct: 72 PATVDSSANINGE---GIYISMVVDISPSM-------MAEDMLPTRLEASKKTMADFIKK 121 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 N + LV F+ + P + ++++I + ++ GL A Sbjct: 122 ----RNFDKISLVAFALRASVLSPSTFDYTSLEKEIGNIKIDEEGSTSIGLGIA------ 171 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 ++ + DD +K II LTDGEN+S ID K A +Y IG+ A Sbjct: 172 --TAVDMLRSVKDDAEKVIILLTDGENNSGEIDPK---LASEIASNFNIKIYTIGIGDAA 226 Query: 329 ADQFLKNCASPD 340 P+ Sbjct: 227 GSHAWVTYTDPN 238 >gi|194224466|ref|XP_001500626.2| PREDICTED: matrilin 4 [Equus caballus] Length = 542 Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust. Identities = 44/174 (25%), Positives = 74/174 (42%), Gaps = 22/174 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S+ M + ++ +++S+ N R G++ +SS++ Sbjct: 35 LDLVFVIDSSRSVRPFEFETMRQF---------LVGLLRSLDVGPNATRVGVIQYSSQVQ 85 Query: 229 QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGL--EYAYNKIFDAKEKLEHIAKGHDDYKK 285 FPL A+ + E+ R + + GL +YA N F E + Sbjct: 86 SVFPLSAFSRREDMERAIRALVPLAQGTMTGLAIQYAMNVAFSVAEGAR---PPEARVPR 142 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +TDG P E +A+ RG +YA+GVQ A L+ ASP Sbjct: 143 VAVIVTDGR---PQDRVAE---VAAQARARGIEIYAVGVQ-RADVGSLRAMASP 189 >gi|120406920|ref|YP_956749.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] gi|119959738|gb|ABM16743.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 248 Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust. Identities = 32/133 (24%), Positives = 58/133 (43%), Gaps = 21/133 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +V DVS SM H +G +S+R+ D + + P + + V+ G++ FS Sbjct: 19 LPFWLVCDVSASMGPH-------IGTLNQSLRDFRDSLATNPVLADKVQFGVIDFSDTAT 71 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL--EHIAKGHDDYKKY 286 + PL E+ G T+ Y + F +++ +A G D ++ + Sbjct: 72 EVIPLGDFSSADLERHQLRTRGGTS---------YGQAFTTVQQIIERDLAAGADRFRYF 122 Query: 287 ---IIFLTDGENS 296 + FLTDG+ + Sbjct: 123 RPAVFFLTDGQPT 135 >gi|319956032|ref|YP_004167295.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418436|gb|ADV45546.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 306 Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust. Identities = 60/236 (25%), Positives = 101/236 (42%), Gaps = 38/236 (16%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-FGPG---MDKLGVATRSIREMLDII 206 +P+L +I K G D+M+V+D S SMN F PG K V + + +D Sbjct: 68 SPVLTNEYKEIKKK---GRDIMLVIDSSDSMNQWGFDPGDPNKSKFDVVKEVVGDFIDKR 124 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 K N+ R GL+ F+S PL + +++ + PG+ I Sbjct: 125 K-----ND--RIGLINFASVAFVASPLTFEKDFLRKILQ--------MQEPGIAGKRTAI 169 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 DA + +I D K I LTDG +++ I E +++ + +Y IG+ + Sbjct: 170 NDALLQTYNILSKSDAKSKIAILLTDGIDNASRISFDEIRRLISDSDIK---LYTIGIGS 226 Query: 327 --EAADQFLKNCASP--DRFYSVQNSR---KLHDAFLRI------GKEMVKQRILY 369 + +LK A RF++ + R K+++A R+ K +V+ R LY Sbjct: 227 YRDFDAPYLKALAQAGHGRFFAASDRRSLQKIYEAIDRLETSKIKSKRVVQHRYLY 282 >gi|77457690|ref|YP_347195.1| VCBS [Pseudomonas fluorescens Pf0-1] gi|77381693|gb|ABA73206.1| putative secreted protein, hemolysin [Pseudomonas fluorescens Pf0-1] Length = 2887 Score = 38.5 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 5/61 (8%) Query: 166 DIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +I ++++VLD+S SM D G PG+ +L +A ++I +LD + D V+ LVTFS Sbjct: 2078 EIDSNILIVLDISGSMADASGVPGLSRLELAKQAISALLDKYDDLGD----VKVQLVTFS 2133 Query: 225 S 225 S Sbjct: 2134 S 2134 >gi|19033105|gb|AAL83537.1|AF414454_1 proximal thread matrix protein 1 variant a [Mytilus edulis] Length = 441 Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust. Identities = 44/179 (24%), Positives = 78/179 (43%), Gaps = 22/179 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-KSIPDVNNVVRSGLVTFSSKIV 228 D+ V D S S+N + + G+ +++++D K+ PD +VTF+ + Sbjct: 241 DIAFVFDASSSINAN---NPNNYGLMKDFMKDIVDRFNKTGPDGTQFA---VVTFADRAT 294 Query: 229 QTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + F L ++ +K+ I G T GLE A ++F + G ++ Sbjct: 295 KQFGLKDYSSKAEIKGAIDKVTPSIIGQTAIGD-GLENARLEVFPNRN-----GGGREEV 348 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K +I LTDG+N+ ES ++ G ++ AIGV L N AS + + Sbjct: 349 QKVVILLTDGQNNGHKSPEHESSLL----RKEGVVIVAIGVGTGFLKSELINIASSEEY 403 >gi|160899637|ref|YP_001565219.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160365221|gb|ABX36834.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 244 Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust. Identities = 49/194 (25%), Positives = 87/194 (44%), Gaps = 29/194 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L ++++LDVS SM+ +K+ ++R+MLD + + ++TF S++ Sbjct: 19 LPVVLLLDVSGSMSG------EKIRNVNDAVRDMLDTFSDTENGETEIHVAIITFGSQVA 72 Query: 229 QTFPLAWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PLA H Q+ L G T L+ A I D K+ + A Y+ + Sbjct: 73 LHQPLASASDIHWQD----LSAGGMTPLGTALQMAKAMIED-KDVIPSRA-----YRPTV 122 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGA----IVYAIGVQAEAA--DQFLKNCASPDR 341 + ++DG PN D E + R A + AIG A+ A +F++ + +R Sbjct: 123 VLVSDG---GPN-DAWEKPLNAFISDGRSAKCDRLAMAIGADADEAVLGKFIEGTS--NR 176 Query: 342 FYSVQNSRKLHDAF 355 + +N+++L D F Sbjct: 177 LFYAENAKQLRDFF 190 >gi|311271483|ref|XP_003133150.1| PREDICTED: anthrax toxin receptor-like [Sus scrofa] Length = 728 Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust. Identities = 32/115 (27%), Positives = 52/115 (45%), Gaps = 10/115 (8%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 G+Q+++++ FG + P Y +A E++E G + II LTDG Sbjct: 190 GIQNLKQES----FGEAFEGLPDPLYR----VEANEQIEQANSGENKVPSMIIALTDGTL 241 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 S I +E+ + A++ GA VY IGV+ +Q + S D + V K Sbjct: 242 ES--ISLQETKQQADRARKLGANVYCIGVKDYETEQLSEIADSSDHVFGVDQGFK 294 >gi|256821501|ref|YP_003145464.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795040|gb|ACV25696.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 582 Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust. Identities = 52/204 (25%), Positives = 85/204 (41%), Gaps = 30/204 (14%) Query: 124 DYNLSAVSRYEMPFIFCTF----PWCANSSHAPLLITSSVKISSKSDIGL---DMMMVLD 176 +Y+ E PF T PW +N+ L+ +K L +++ ++D Sbjct: 161 NYDYQTPDSTEQPFAVNTHVFSAPWNSNA----YLMEIGIKGFEPEQQELPPSNLVYLID 216 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 VS SMN DKLG+ +S++ + ++ VV +G +S +V Sbjct: 217 VSGSMNSE-----DKLGLVKKSLKLLAQESSDQDRISIVVYAG----ASGVVLEPTKGND 267 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 I++ ++RL G +T G+E AY KL A D + +I TDG+ + Sbjct: 268 RMAIEQALDRLSAGGSTNGGAGIELAY--------KLAEQAFIKDGINR-VILATDGDFN 318 Query: 297 SPNIDNKESLFYCNEAKRRGAIVY 320 I N+E L E KR I + Sbjct: 319 VGTI-NREQLIDLVERKRESGISF 341 >gi|254420933|ref|ZP_05034657.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] gi|196187110|gb|EDX82086.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] Length = 646 Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust. Identities = 30/121 (24%), Positives = 46/121 (38%), Gaps = 38/121 (31%) Query: 285 KYIIFLTDGE---------------NSSPNIDN------------KESLFYCNEAKRRGA 317 K ++ +TDGE N S D +++ C K Sbjct: 524 KAVVLMTDGEFNTPYFRGVIASDAGNGSGGADTHINQPATNGSSFEQAYRLCENMKAADV 583 Query: 318 IVYAIGVQAEAA----------DQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 IVY +G AA + + CA+ PDR + +S L DAF IG+++ + R Sbjct: 584 IVYTVGFDIGAARNMTGPIDSAGELMARCATNPDRAFQASSSTDLSDAFRDIGRDITRLR 643 Query: 367 I 367 I Sbjct: 644 I 644 >gi|94499146|ref|ZP_01305684.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] gi|94428778|gb|EAT13750.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] Length = 340 Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust. Identities = 47/220 (21%), Positives = 96/220 (43%), Gaps = 24/220 (10%) Query: 139 FCTFPWCA-NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 T W + ++ A + K ++D +MM+ +D+S SM + M G Sbjct: 59 LVTLTWLSFITAMAQPMFVGEPKALQQTD--RNMMLAVDISKSMLEE---DMQYQGRLVN 113 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKST 256 ++ + ++ + R GL+ F + PL + + ++ ++ + G + K+ Sbjct: 114 RLQTVKAVVTDFVEERKGDRLGLILFGEQAYIQTPLTFDLSTVKRLLDEAVVGLAGNKTA 173 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G + I ++L+ + + + + +I LTDG+N++ I E L A++ G Sbjct: 174 IG-----DAIGLGVKRLQDLPESN----RVLILLTDGQNTAGEI---EPLKAAELAEKAG 221 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL 356 +YAIG+ A+ + ++ P R V SR L + L Sbjct: 222 VKIYAIGIGAD--EMVIQGFFGPRR---VNPSRDLDEDTL 256 >gi|296208411|ref|XP_002751094.1| PREDICTED: epithelial chloride channel protein-like [Callithrix jacchus] Length = 904 Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust. Identities = 28/71 (39%), Positives = 38/71 (53%), Gaps = 9/71 (12%) Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY--AIGVQAEAADQFLKNCASPDRFYS 344 II LTDGE DN+ SL + E ++ GAI++ A+G AE + L N RFY+ Sbjct: 410 IILLTDGE------DNQISLCF-EEVRQSGAIIHTIALGPSAEKELETLSNMTRGHRFYA 462 Query: 345 VQNSRKLHDAF 355 + L DAF Sbjct: 463 HNDINGLIDAF 473 >gi|157818269|ref|NP_001100009.1| matrilin-4 [Rattus norvegicus] gi|149042956|gb|EDL96530.1| matrilin 4 (predicted), isoform CRA_b [Rattus norvegicus] Length = 624 Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust. Identities = 36/129 (27%), Positives = 57/129 (44%), Gaps = 13/129 (10%) Query: 214 NVVRSGLVTFSSKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGL--EYAYNKIFDAK 270 N R G++ +SS++ FPL A+ + E+ R + + GL +YA N F Sbjct: 71 NATRVGVIQYSSQVQSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEA 130 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + + ++ +TDG P E +A+ RG +YA+GVQ A Sbjct: 131 EGAR---PSEERVPRVLVIVTDGR---PQDRVAE---VAAQARARGIEIYAVGVQ-RADV 180 Query: 331 QFLKNCASP 339 L+ ASP Sbjct: 181 GSLRAMASP 189 >gi|329894014|ref|ZP_08270022.1| BatA [gamma proteobacterium IMCC3088] gi|328923357|gb|EGG30676.1| BatA [gamma proteobacterium IMCC3088] Length = 339 Score = 38.5 bits (88), Expect = 1.9, Method: Compositional matrix adjust. Identities = 37/166 (22%), Positives = 75/166 (45%), Gaps = 26/166 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM---LDIIKSIPDVNNVVRSGLVTFS 224 G D+++ +D+S SM ++ + +A +++R + DI + R GL+ F Sbjct: 90 GRDLLLAVDISGSMR------VEDMVIANQAVRRIDAVRDIGAEFIERREGDRVGLILFG 143 Query: 225 SKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ PL++ +++ ++ GS T L A ++ D + Sbjct: 144 SRAYMQSPLSFDRDTVKQFLSEAQIGFAGSETAIGDALGLAVKRLRD-----------KE 192 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D + +I LTDG++++ ++D ++ A G VY IG+ A+ Sbjct: 193 DGDRVVILLTDGQDTASSVDPLDA---TALAANYGVKVYTIGIGAD 235 >gi|299116460|emb|CBN76178.1| similar to integrin alpha Hr1 precursor-like [Ectocarpus siliculosus] Length = 353 Score = 38.5 bits (88), Expect = 1.9, Method: Compositional matrix adjust. Identities = 35/133 (26%), Positives = 54/133 (40%), Gaps = 20/133 (15%) Query: 204 DIIKSIPDVN---NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 D + S D N N + + FSS + G + E N + G+T S+ G + Sbjct: 86 DAVSSFADQNLFTNGGSASIAQFSSSASE-----GGTFYSLEDFNAFVDGNTKYSSGGTD 140 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 I D K + K ++I TDG++SSP + + A+ G IVY Sbjct: 141 -----IIDGIAKGRELLKASPATTSFMIVTTDGQSSSPKAE-------ADAARDEGTIVY 188 Query: 321 AIGVQAEAADQFL 333 A+GV + L Sbjct: 189 AVGVGTGPTQEIL 201 >gi|59713864|ref|YP_206639.1| hypothetical protein VF_A0681 [Vibrio fischeri ES114] gi|59482112|gb|AAW87751.1| hypothetical membrane spanning protein [Vibrio fischeri ES114] Length = 321 Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust. Identities = 40/165 (24%), Positives = 70/165 (42%), Gaps = 29/165 (17%) Query: 170 DMMMVLDVSLSMNDHF-----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 DMM+V+D+S SM + G +D+L + + + +D K R GLV F Sbjct: 84 DMMLVVDLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQRKGD-------RLGLVLFG 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 PL + ++E+++R + G T GL A ++ Sbjct: 137 DHAYLQTPLTFDRNTVREQLDRTVLNLVGQRTAIGEGLGLATKTFIESNAP--------- 187 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ II L+DG N++ ++ E+ AK A +Y +G+ A Sbjct: 188 --QRTIILLSDGANTAGVLEPLEA---AQLAKDNHAKIYTVGIGA 227 >gi|146298482|ref|YP_001193073.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152900|gb|ABQ03754.1| BatA-like protein [Flavobacterium johnsoniae UW101] Length = 334 Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust. Identities = 47/168 (27%), Positives = 76/168 (45%), Gaps = 30/168 (17%) Query: 163 SKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 +K+ G+D++M +DVS SM D M+ L R D ++ P+ R GL Sbjct: 86 TKTTKGIDIVMAIDVSGSMLAKDLKPNRMEALK------RVAADFVEERPN----DRIGL 135 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLEYAYNKIFDAKEKLEHI 276 V ++S+ P+ I E I + + + T GL A N++ D+K K Sbjct: 136 VLYASEAYTKTPVTSDKPIILEAIKGIRYDTVLQDGTGIGMGLATAVNRLKDSKAK---- 191 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + II LTDG N++ I+ + + + AK+ G VY IG+ Sbjct: 192 -------SRVIILLTDGVNNAGFIEPETA---ADIAKQYGIKVYTIGL 229 >gi|163858556|ref|YP_001632854.1| hypothetical protein Bpet4238 [Bordetella petrii DSM 12804] gi|163262284|emb|CAP44587.1| hypothetical protein Bpet4238 [Bordetella petrii] Length = 244 Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust. Identities = 49/194 (25%), Positives = 87/194 (44%), Gaps = 29/194 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L ++++LDVS SM+ +K+ ++R+MLD + + ++TF S++ Sbjct: 19 LPVVLLLDVSGSMSG------EKIRNVNDAVRDMLDTFSDTENGETEIHVAIITFGSQVA 72 Query: 229 QTFPLAWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PLA H Q+ L G T L+ A I D K+ + A Y+ + Sbjct: 73 LHQPLASASDIHWQD----LSAGGMTPLGTALQMAKAMIED-KDVVPSRA-----YRPTV 122 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGA----IVYAIGVQAEAA--DQFLKNCASPDR 341 + ++DG PN D E + R A + AIG A+ A +F++ + +R Sbjct: 123 VLVSDG---GPN-DAWEKPLNAFISDGRSAKCDRLAMAIGADADEAVLGKFIEGTS--NR 176 Query: 342 FYSVQNSRKLHDAF 355 + +N+++L D F Sbjct: 177 LFYAENAKQLRDFF 190 >gi|73953968|ref|XP_853856.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 924 Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust. Identities = 25/80 (31%), Positives = 36/80 (45%), Gaps = 2/80 (2%) Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E++E G + II LTDG + E+ +++ GA +Y+IGV D Sbjct: 472 EQIEEANSGGKKFPSMIIALTDGTLMPEPYE--ETKIEAENSRQLGATIYSIGVMDYRRD 529 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q L SPD + V N K Sbjct: 530 QLLSIADSPDHVFGVDNGFK 549 >gi|294054315|ref|YP_003547973.1| hypothetical protein Caka_0779 [Coraliomargarita akajimensis DSM 45221] gi|293613648|gb|ADE53803.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 345 Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust. Identities = 45/168 (26%), Positives = 69/168 (41%), Gaps = 35/168 (20%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATR-SIREMLDIIKSIPDVNNVV------RSGL 220 G+D+++ LD+S SM L ++TR +I LD K + V + R GL Sbjct: 88 GIDIVLALDLSGSMR--------ALDLSTRENIVTRLDAAKEV--VQEFIGKRPHDRIGL 137 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHI 276 V F++ PL +++ + RL G S T L + N++ D Sbjct: 138 VAFAADAFVVSPLTLNHDWLKKNVQRLELGDINLSGTAIGTALGASVNRLRD-------- 189 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 H+ + +I LTDGEN+S + L AK VY I Sbjct: 190 ---HESRSRIVILLTDGENNSGTLS---PLSAAEAAKSLNVKVYTIAT 231 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust. Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 1/67 (1%) Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 + +L C+ AK +G +V+ IG +A Q L+ CAS Y + ++ DAF I Sbjct: 515 DSRTLAVCDAAKEKGIVVFTIGFEAPWRGQQVLQQCASSASHYYDVDGLEISDAFASIAS 574 Query: 361 EMVKQRI 367 + + R+ Sbjct: 575 AIRQLRL 581 >gi|126341666|ref|XP_001379908.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2347 Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust. Identities = 42/185 (22%), Positives = 82/185 (44%), Gaps = 27/185 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D+M ++D S S+ G++ G ++ +++ + D V+ G+V FS Sbjct: 624 DVKADVMFLVDSSRSI------GLENFGKMKTFMKNLVNKSQIGEDQ---VQVGIVQFSD 674 Query: 226 KIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG-HD 281 + F L W I + I+R+ T + L++ + H +KG Sbjct: 675 VNKEEFQLNRYWTQHEIFDAIDRMSNIDRETLTGSALKFVSDYF--------HPSKGARP 726 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE+ P D +L ++ G I+Y++GV Q ++ P+ Sbjct: 727 GVRKFLILITDGESQDPVKDPAMAL------RQDGVIIYSVGVYGANETQLVEISGKPEM 780 Query: 342 FYSVQ 346 + V+ Sbjct: 781 IFYVE 785 >gi|311747444|ref|ZP_07721229.1| putative BatB protein [Algoriphagus sp. PR1] gi|126574803|gb|EAZ79174.1| putative BatB protein [Algoriphagus sp. PR1] Length = 321 Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust. Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 21/164 (12%) Query: 168 GLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ + +D+S SMN GP R E+ ++ KS P R GL+ FSS+ Sbjct: 77 GKDIFLAVDLSQSMNATDIGPSR-----LQRIKFELKELTKSFPSD----RIGLIIFSSE 127 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-AKEKLEHIAKGHDDYKK 285 PL + +Q I+ G T P N A ++ ++ + + K Sbjct: 128 AFMQCPLTFDQSVLQLYID----GLNTGLVPNFGTDLNAPLRIALDRFQN-DESQEVKSK 182 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +I ++DGEN ++N S E K G V+A+G+ E+ Sbjct: 183 SVILISDGENFGDELENIGS-----ELKNLGVKVFALGIGTESG 221 >gi|310643461|ref|YP_003948219.1| protein [Paenibacillus polymyxa SC2] gi|309248411|gb|ADO57978.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 696 Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust. Identities = 52/201 (25%), Positives = 84/201 (41%), Gaps = 45/201 (22%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + CT N ++A L T+S++ G D + VLD S SM D D G+A Sbjct: 26 VICTV----NQANAASLGTASIE-------GYDAVFVLDTSYSMRD-----TDPEGIAAE 69 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV----QHIQEKINRLIFGST 252 I +D+ + + R G V ++ +V + PL + GV IQ++I L Sbjct: 70 VISMFMDLSDA-----DRTRVGFVAYNHHVVASKPLTSIGVAAQKSQIQQEIRMLNRSGY 124 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK-YIIFLTDGE--------NSSPNIDNK 303 T GL ++ E +A G ++ ++I L+DGE + S N Sbjct: 125 TDLGLGL----------RKGSELLAAGASQGRQPFMILLSDGETDFGVSSGSRSKGDSNN 174 Query: 304 ESLFYCNEAKRRGAIVYAIGV 324 + A+ +G VY IG+ Sbjct: 175 DVSSVIKSAQTKGYPVYTIGL 195 >gi|114798549|ref|YP_759188.1| hypothetical protein HNE_0458 [Hyphomonas neptunium ATCC 15444] gi|114738723|gb|ABI76848.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 460 Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust. Identities = 28/114 (24%), Positives = 53/114 (46%), Gaps = 13/114 (11%) Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-------NSSPN-IDNKESLFYCNEAKR 314 +N ++ A K + D K +I ++DG+ N+ P+ + E++ C++ K Sbjct: 349 WNSVWPAGSKA--LPYDEPDATKVVIMMSDGQYNETRHNNAYPSSVTQAEAI--CDKMKE 404 Query: 315 RGAIVYAIGVQAEAADQFLKNCASPDRF-YSVQNSRKLHDAFLRIGKEMVKQRI 367 + ++Y +G A L CAS F Y N ++L +A+ I + + RI Sbjct: 405 KEVVIYTVGFDAGYGQDVLNYCASNPAFAYKPTNGQELTEAYKSIARSISDLRI 458 Score = 37.7 bits (86), Expect = 3.3, Method: Compositional matrix adjust. Identities = 30/144 (20%), Positives = 67/144 (46%), Gaps = 11/144 (7%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 RNFF N G+++++ A+ + I + G I+ KA++ +D ++L + + Sbjct: 6 RNFFRNESGNVAMIAALTIIPIVGIAGFAIDFQVTTTQKARVQQAVDSAVLAATKSMQDG 65 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIE------RSTSLSIIIDD 120 ++ K+ ND+ I+ + + + + +E ++T+LS + Sbjct: 66 KDRAYSLKEANDYFKGILNQSNNSGLNCTNIDLVYIDETEELEGHVECSQNTTLSKVAGI 125 Query: 121 QHKDYNLSAVSRY-----EMPFIF 139 +H D+N+S+ + Y E+ F+F Sbjct: 126 RHLDFNVSSAATYGIGKLEIAFVF 149 >gi|157962424|ref|YP_001502458.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157847424|gb|ABV87923.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 336 Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust. Identities = 48/196 (24%), Positives = 83/196 (42%), Gaps = 31/196 (15%) Query: 144 WC--ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHF--GPGMDKLGVATR 197 WC + PL + ++++ SK G D+M+ +D+S SM D G +D+ + Sbjct: 60 WCLLVLACARPLWVGEAIELPSK---GRDLMLSVDLSGSMQIEDMVIDGKVVDRFTLIQH 116 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 I + ++ K R GL+ F+ PL + + + + G K T Sbjct: 117 VISDFIERRKG-------DRIGLILFADHAYLQSPLTQDRRSVAQYLKEAQIGLVGKQTA 169 Query: 258 -GLEYAYN-KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 G A K FD E+ + +I LTDG N++ I +++ A +R Sbjct: 170 IGEAIALGVKRFDKVEQSNRV----------LILLTDGSNNAGAITPEQA---SQIAAQR 216 Query: 316 GAIVYAIGVQAEAADQ 331 G +Y IGV A+ ++ Sbjct: 217 GITIYTIGVGADVMER 232 >gi|261212659|ref|ZP_05926943.1| protein BatA [Vibrio sp. RC341] gi|260837724|gb|EEX64401.1| protein BatA [Vibrio sp. RC341] Length = 232 Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust. Identities = 38/165 (23%), Positives = 72/165 (43%), Gaps = 34/165 (20%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV-------NNVVRSGLVTFS 224 M+V+D+S SM+ +S ++M+D + ++ V R GL+ F+ Sbjct: 1 MLVVDLSYSMSQE----------DMQSGQQMVDRLTAVKQVLSDFIAKREGDRIGLILFA 50 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 PL + + +++N+ + G+ T G+ A D+ D Sbjct: 51 DHAYLQTPLTLDRETVTQQLNQAVLKLIGTQTAIGEGIGLATKIFIDS-----------D 99 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ II L+DG N++ +D E+ N AK+ + +Y +GV A Sbjct: 100 APQRVIILLSDGSNTAGVLDPLEA---ANIAKQYHSTIYTVGVGA 141 >gi|149922008|ref|ZP_01910450.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] gi|149817173|gb|EDM76653.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] Length = 996 Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust. Identities = 52/212 (24%), Positives = 88/212 (41%), Gaps = 37/212 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--SSK 226 L +++V+D S SM+ G +D + A R+ LD I G++ F S + Sbjct: 528 LALILVIDKSGSMSS--GDRLDLVKEAARATARTLDPSDEI---------GVIAFDNSPQ 576 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 ++ A I I RL G T + P L AY ++ +K ++H Sbjct: 577 VLVRLQPAANRLRISSSIRRLSAGGGTNAMPALREAYLQLAGSKALVKH----------- 625 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---FY 343 +I L+DGE+ I+ + ++ V ++GV A FL A R FY Sbjct: 626 VILLSDGESPENGIN-----ALLGDMRQSDITVSSVGVGDGAGKDFLIRVAERGRGRYFY 680 Query: 344 SVQNSRKLHDAFLRIGKEMVK----QRILYNK 371 S ++ + F R +E+ + +R LY + Sbjct: 681 S-EDGTDVPRIFSREAREVKRNALVERGLYPR 711 >gi|90418244|ref|ZP_01226156.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337916|gb|EAS51567.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 489 Score = 38.1 bits (87), Expect = 2.4, Method: Compositional matrix adjust. Identities = 35/115 (30%), Positives = 52/115 (45%), Gaps = 30/115 (26%) Query: 282 DYKKYIIFLTDGENSS---PNIDNKESLFY----------------------CNEAKRRG 316 D K+++ +TDG SS P +K +Y C AK++ Sbjct: 376 DTMKFLVLMTDGAISSQRIPKDASKPVQYYNNGSLNTDLYSVGDAERFAAALCTAAKQKN 435 Query: 317 AIVYAIG--VQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 IV+ IG V AA Q + NCAS +RFY V N+ + DAF I + K +++ Sbjct: 436 VIVFTIGFDVNDTAAKQ-MSNCASGAERFYRV-NALDIQDAFKSIATAIQKIKLI 488 >gi|197118196|ref|YP_002138623.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087556|gb|ACH38827.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 331 Score = 38.1 bits (87), Expect = 2.4, Method: Compositional matrix adjust. Identities = 46/178 (25%), Positives = 74/178 (41%), Gaps = 28/178 (15%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSM----NDHFGPGMDKLGVATRSIREMLDIIK 207 P + ++ S+ G+D+++ LD+S SM G G ++L A R + E + K Sbjct: 73 PQAVARESQVQSR---GMDLVLALDLSTSMLAEEQGREGRGENRLAAAKRVLSEFIGARK 129 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R GLV F+ + PL Q +Q + RL S T + I Sbjct: 130 QD-------RIGLVAFAGRPYPAAPLTSDHQWLQGIVERLDTNSVEDGT----ALGDAIL 178 Query: 268 DAKEKL-EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 +L + A+G + +I +TDG N++ E AK G V+AIG+ Sbjct: 179 AGVNRLRQRPAEG-----RALILITDGRNNA----GAEPQLAAQAAKALGIRVHAIGI 227 >gi|325268973|ref|ZP_08135594.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] gi|324988594|gb|EGC20556.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] Length = 318 Score = 38.1 bits (87), Expect = 2.4, Method: Compositional matrix adjust. Identities = 54/213 (25%), Positives = 85/213 (39%), Gaps = 45/213 (21%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM D P +++ VA E I S P+ N GL F+ + Sbjct: 87 GIDIMLTMDVSASMLTDDVYP--NRMAVAKEVASE---FISSRPNDN----IGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQ----------EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 P+ + H + + + T GL A +++ D+K K Sbjct: 138 AFTQCPMT--LDHAALLNLLHNVRPDLVTSGLMKDGTAIGMGLANAVSRLQDSKAK---- 191 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---- 332 K +I LTDG N N+ + + AK+ G VY IG E ++ Sbjct: 192 -------SKIVILLTDGSN---NVGSISPMTAAAIAKKFGIRVYTIGFGRETGEEIGAID 241 Query: 333 ---LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L+N A + FY Q+ +L + I K Sbjct: 242 YRALQNIAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|311695164|gb|ADP98037.1| von Willebrand factor type A domain protein [marine bacterium HP15] Length = 342 Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust. Identities = 37/167 (22%), Positives = 72/167 (43%), Gaps = 20/167 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV---NNVVRSGLVTFS 224 G D+M+V+D+S SM++ + + RSI + + + + D R GL+ F Sbjct: 88 GRDLMLVVDISPSMDEQ------DMVLQGRSINRLQAVKRVLDDFISRRQGDRLGLILFG 141 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 ++ PL + ++ ++ + G ++T I DA + + Sbjct: 142 TEPYVQAPLTFDLETVRTLMREAGLGMAGRAT--------AIGDAVGLATKRLRNRPQDQ 193 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + ++ LTDG N++ I ++ A R +Y IG+ AE+ Q Sbjct: 194 RVVVLLTDGANTAGEITPDKATEIAAAASIR---LYTIGIGAESMVQ 237 >gi|19031201|gb|AAL17974.1| proximal thread matrix protein 1 [Mytilus galloprovincialis] Length = 453 Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust. Identities = 44/172 (25%), Positives = 75/172 (43%), Gaps = 25/172 (14%) Query: 202 MLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQEKINRLIFGST 252 M DI+ K+ PD +VTF+ + + F L ++ +K++ I G T Sbjct: 279 MKDIVDRFNKTGPDGTQFA---VVTFADRATKQFGLKDYSSKADIKGAIDKVSPSIIGQT 335 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 GLE A ++F + G ++ +K +I LTDG+N+ ES Sbjct: 336 AIGD-GLENARLEVFPNRN-----GGGREEVQKVVILLTDGQNNGHKSPEHESSLL---- 385 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ G ++ AIGV L N AS + + +S D +I +++VK Sbjct: 386 RKEGVVIVAIGVGTGFLKSELINIASSEEYVFTTSS---FDKLSKIMEDVVK 434 >gi|255570578|ref|XP_002526246.1| protein binding protein, putative [Ricinus communis] gi|223534440|gb|EEF36143.1| protein binding protein, putative [Ricinus communis] Length = 513 Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust. Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 16/111 (14%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS GLD++ VLDVS SM G + KL +A ML +IK + ++ R +V Sbjct: 55 SSNDRPGLDLVAVLDVSGSM---AGEKIAKLKMA------MLFMIKKLSPID---RLSIV 102 Query: 222 TFSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFD 268 TFS+ + PL ++ Q++ INRL T T GLE A + D Sbjct: 103 TFSTDSTRLCPLRQITENSQKEFENLINRLKADGWTNITAGLETALKVLND 153 >gi|313207255|ref|YP_004046432.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446571|gb|ADQ82926.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023479|gb|EFT36485.1| aerotolerance operon BatA [Riemerella anatipestifer RA-YM] gi|325335298|gb|ADZ11572.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Riemerella anatipestifer RA-GD] Length = 330 Score = 37.7 bits (86), Expect = 2.7, Method: Compositional matrix adjust. Identities = 45/162 (27%), Positives = 70/162 (43%), Gaps = 28/162 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+++ +DVSLSM P D+L + +I ++ R GLV +S + Sbjct: 87 GMDIVLSIDVSLSMLAKDLEP--DRLTA-------LKEIARTFIKQRTTDRIGLVEYSGE 137 Query: 227 IVQTFPLAWGVQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + PL + ++E++ N + T GL A + + +K K Sbjct: 138 ALMRVPLTSDHRVVEEELMSFNPMDLEGGTNIGDGLAVAVSHLRKSKAK----------- 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-VYAIGV 324 K II +TDG N+ IDN S E R I VY IG+ Sbjct: 187 SKIIILMTDGVNT---IDNAMSPLTAAELARNNDIKVYTIGI 225 >gi|311105413|ref|YP_003978266.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] gi|310760102|gb|ADP15551.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] Length = 2061 Score = 37.7 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 39/146 (26%), Positives = 72/146 (49%), Gaps = 14/146 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMD---KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--S 224 ++ +VLD+S SM+ ++G G + +L A ++++ +L+ + + + L+TF S Sbjct: 1444 NIALVLDLSGSMDYYWGSGSNQETRLETAKKALKSLLE--NQLATHDGTINVSLITFADS 1501 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI---FDAKEKLEHIAKGHD 281 S +Q ++ + +N L+ TP A+N+ FD + + A G + Sbjct: 1502 SSKLQKAISGLTPDNVDDMVNILLGLKAGGGTP-YGAAFNETKSWFDGQPTED--ANG-N 1557 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLF 307 YK FLTDGE SS N+++ F Sbjct: 1558 AYKNLTFFLTDGEPSSEYWYNRDTEF 1583 >gi|88801581|ref|ZP_01117109.1| batA protein [Polaribacter irgensii 23-P] gi|88782239|gb|EAR13416.1| batA protein [Polaribacter irgensii 23-P] Length = 334 Score = 37.7 bits (86), Expect = 2.8, Method: Compositional matrix adjust. Identities = 43/180 (23%), Positives = 77/180 (42%), Gaps = 29/180 (16%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 +V +K++ G+D++M +DVS SM D ++ L + +D + P+ Sbjct: 81 AVSKKTKTNSGIDIIMAIDVSASMLARDLKPNRLEALK------KVAIDFVDRRPN---- 130 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEK 272 R G+V ++ + P+ ++ I+ L +G T GL N++ ++ K Sbjct: 131 DRIGIVVYAGESFTQTPITSDKNIVKRTISELQWGQLDGGTAIGMGLGSGVNRLKESTAK 190 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K II LTDG N++ NID + + E + + VY IG+ F Sbjct: 191 -----------SKVIILLTDGVNNAGNIDPRTATELARELEIK---VYTIGIGTNGMADF 236 >gi|21675084|ref|NP_663149.1| hypothetical protein CT2278 [Chlorobium tepidum TLS] gi|21648324|gb|AAM73491.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 332 Score = 37.7 bits (86), Expect = 2.8, Method: Compositional matrix adjust. Identities = 40/143 (27%), Positives = 59/143 (41%), Gaps = 22/143 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 SS+ +G+D+M+ LDVS SMN FG G T ++R + D R GL Sbjct: 91 SSRDTVGIDIMIALDVSDSMNTPDFGGKSRFAGARTAAMRFI--------DNRPADRIGL 142 Query: 221 VTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V FS PL + + E + F PG I A +L Sbjct: 143 VVFSGGSFTRCPLTLDHEVLGRLAETVAPGFF-----DEPGTAIG-TAILTATNRL---- 192 Query: 278 KGHDDYKKYIIFLTDGENSSPNI 300 K +K ++ +TDGEN++ + Sbjct: 193 KASSSKEKALVLITDGENNAGEV 215 >gi|154244802|ref|YP_001415760.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] gi|154158887|gb|ABS66103.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] Length = 345 Score = 37.7 bits (86), Expect = 2.9, Method: Compositional matrix adjust. Identities = 43/185 (23%), Positives = 80/185 (43%), Gaps = 25/185 (13%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P+ + + V I + G +MM+ +D+S SM+ P + + GV L ++K Sbjct: 73 AAARPVYVGTPVAIPVE---GREMMLAVDLSASMSS---PDLVQSGVPA----NRLQVVK 122 Query: 208 SIPDVNNVVRSG----LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + D R+G L+ FS++ PL +++ + G T ++T Sbjct: 123 RVADDFIARRTGDRIGLILFSTRAYVQAPLTLDRNVVRQLLAEASIGMTGRNT------- 175 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 I DA + + +I LTDG N+S +D E+ + R ++ IG Sbjct: 176 -SIGDAIGLAVKTLRDRPAKDRVLILLTDGANTSGVLDPMEAAAIAAKENVR---IHTIG 231 Query: 324 VQAEA 328 V A++ Sbjct: 232 VGADS 236 >gi|332140758|ref|YP_004426496.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] gi|327550780|gb|AEA97498.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] Length = 349 Score = 37.7 bits (86), Expect = 3.0, Method: Compositional matrix adjust. Identities = 51/205 (24%), Positives = 86/205 (41%), Gaps = 41/205 (20%) Query: 136 PFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P I + W S A P + V I ++ G +MM+ +D+S SM +D + Sbjct: 56 PLIISSLIWLLLISAAARPQWLGEPVSIPNE---GREMMLAVDLSGSMK------IDDMQ 106 Query: 194 VATRSIREMLDIIKSIPDVNNVV------RSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 + R + L + KS+ V + + R GL+ F+ PL + + ++ Sbjct: 107 LNGRQVNR-LTMTKSV--VYDFIQRRVGDRLGLILFADTAYVQAPLTYDRDTVSTLLSEA 163 Query: 248 IFGSTTKSTP-----GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 + G + T GL K FD + D+ +I LTDG+N++ NI Sbjct: 164 VIGLVGEQTAIGDAIGLAV---KRFDER----------DESNNVLILLTDGQNTAGNITP 210 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAE 327 +++ A +G VY IGV A+ Sbjct: 211 EQA---KELAINKGVKVYTIGVGAD 232 >gi|255570576|ref|XP_002526245.1| protein binding protein, putative [Ricinus communis] gi|223534439|gb|EEF36142.1| protein binding protein, putative [Ricinus communis] Length = 540 Score = 37.7 bits (86), Expect = 3.0, Method: Compositional matrix adjust. Identities = 38/105 (36%), Positives = 48/105 (45%), Gaps = 16/105 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS GLD++ VLDVS SM G M+K+ A ML IIK + + R +V Sbjct: 93 SSYGRPGLDLVAVLDVSRSME---GDKMEKMKTA------MLFIIKKLGPTD---RLSIV 140 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYA 262 TFS + PL QE+ LI G T T GL+ A Sbjct: 141 TFSGGANRLCPLRQTTGKSQEEFENLINGLNADGATNITAGLQTA 185 >gi|126272975|ref|XP_001371818.1| PREDICTED: similar to anthrax toxin receptor [Monodelphis domestica] Length = 858 Score = 37.7 bits (86), Expect = 3.1, Method: Compositional matrix adjust. Identities = 39/144 (27%), Positives = 65/144 (45%), Gaps = 18/144 (12%) Query: 189 MDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 +DK G E+ ++S+ + ++ ++R + FSSK L + I++ + Sbjct: 351 LDKSGSVKHHWIEIYSFVESLAEKFISPMLRMSFIVFSSKGTTIMKLTEDREAIRQGLEV 410 Query: 247 LIFGSTTKSTPGLEYAYNKIFD-AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 L + PG + +K F+ A E++ H G II LTDGE KE Sbjct: 411 LRY-----EVPGGDTFMHKGFERANEQIYHENYGGLRTASVIIALTDGEL------QKEQ 459 Query: 306 LFYC----NEAKRRGAIVYAIGVQ 325 ++ N A+ GAIVY +GV+ Sbjct: 460 FYFAEKEVNRARTFGAIVYCVGVK 483 >gi|301058342|ref|ZP_07199375.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447578|gb|EFK11310.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 331 Score = 37.7 bits (86), Expect = 3.1, Method: Compositional matrix adjust. Identities = 48/199 (24%), Positives = 75/199 (37%), Gaps = 29/199 (14%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 CT + P L S + S G+D+M+ LD S SM +D V +S+ Sbjct: 61 CTLLLLIVVAARPQLYNVSRDVHSP---GVDIMLCLDTSGSMQ-----ALD-FKVEGKSV 111 Query: 200 REMLDIIKSIPDV---NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTT 253 + + K + D R GLV F + PL + E +NR+ + G T Sbjct: 112 TRLEAVKKVVADFIGKRETDRIGLVVFGEEAFTQSPLTIDKGLLLELVNRMKIGMAGDRT 171 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + ++ D K K K +I LTDG N++ I + + + Sbjct: 172 AIGSAIAIGGKRLKDLKSK-----------SKILILLTDGRNNAGEISPQAA---ARAVR 217 Query: 314 RRGAIVYAIGVQAEAADQF 332 G +Y IGV + F Sbjct: 218 EFGIKLYTIGVGGKGPAPF 236 >gi|315126124|ref|YP_004068127.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] gi|315014638|gb|ADT67976.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] Length = 327 Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust. Identities = 36/167 (21%), Positives = 74/167 (44%), Gaps = 28/167 (16%) Query: 168 GLDMMMVLDVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G D+M+ +D+S SM + + G +D+L + + + ++ + R GL+ F Sbjct: 86 GRDIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLSDFIEQRQGD-------RLGLILF 138 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGH 280 PL V+ + + ++ G ++T L + + KE Sbjct: 139 GDTAFLQTPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRFASKKES-------- 190 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + ++ LTDG+N++ N++ +++L A+ G VY IGV ++ Sbjct: 191 ---NRIVVLLTDGQNTAGNLNPEDALLL---AREEGIKVYTIGVGSD 231 >gi|281349285|gb|EFB24869.1| hypothetical protein PANDA_021744 [Ailuropoda melanoleuca] Length = 493 Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust. Identities = 37/147 (25%), Positives = 64/147 (43%), Gaps = 8/147 (5%) Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D++K P++ +R +T+S++ L I ++RL T +T + + Sbjct: 79 DVVKKFPNLK--MRVSFITYSTQGHTLMELTSDRNKIHNSLSRLKNIKPTGAT-NMHEGF 135 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 K A E++E G ++ II LT G + + +E+ +A+ GA VY +G Sbjct: 136 KK---ANEQIEQENAGGNNAASLIIALTTGPLTPKAL--QETKSEAEKAREMGAKVYCVG 190 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRK 350 V+ DQ D+ Y V N K Sbjct: 191 VKDYRKDQLDAIVGRKDQMYGVGNGFK 217 >gi|153871328|ref|ZP_02000529.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152072210|gb|EDN69475.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 280 Score = 37.7 bits (86), Expect = 3.3, Method: Compositional matrix adjust. Identities = 39/146 (26%), Positives = 62/146 (42%), Gaps = 17/146 (11%) Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+ F SK L +H+ + INRL +T T GL AY K+ K Sbjct: 128 GLIEFGSKAKIISGLTQNAKHLYKAINRLKTNGSTNMTEGLTTAYLKL-----------K 176 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 DD ++II LTDG + P + + C + G + IG +A +L++ A Sbjct: 177 NVDD-PRFIILLTDGLPNHPKNTQQIAQEICAD----GIELITIGT-GDADKTYLQSLAC 230 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVK 364 D+ + + F RI + + + Sbjct: 231 YDQNSFFAKAGTMVSTFSRIAQVLTE 256 >gi|308068881|ref|YP_003870486.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858160|gb|ADM69948.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 600 Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust. Identities = 48/198 (24%), Positives = 85/198 (42%), Gaps = 25/198 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 I CT + + P + +S +SK +D ++V+DVS SMN PG K+G Sbjct: 14 IICTMIMTSILAWQPQMANASSPSASK----VDAVLVVDVSNSMNTS-DPG--KIG--NE 64 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 +++ +D++ + D +V V K + Q ++ I+ L G+ T ++ Sbjct: 65 AMKMFIDMLSTQNDKVGIVAYTDVVQREKALLNITSEADKQELKTFIDGLNRGAYTDTSV 124 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS-------SPNIDNKESLFYCN 310 G++ A + D K GH I+ L DG N + + +++ Sbjct: 125 GVKEALRILQDGK------TAGH---APMIVMLADGNNDFNKTTGRTESQSDQDMAQAVA 175 Query: 311 EAKRRGAIVYAIGVQAEA 328 EAK G +Y IG+ A+ Sbjct: 176 EAKNSGVPIYTIGLNADG 193 >gi|52843052|ref|YP_096851.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630163|gb|AAU28904.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 352 Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust. Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 24/166 (14%) Query: 168 GLDMMMVLDVSLSM--NDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++MMVLD+S SM D G + +L V R+ + ++ + D R GL+ F Sbjct: 97 GYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVE--DRVGD-----RIGLILF 149 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI-AKGHDD 282 ++ PL + + +I+ G K+T + + A ++L+ + +KG Sbjct: 150 GTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTT----SIGDAVGLAVKRLQDVPSKG--- 202 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + II LTDG N+S + L AK+ G +Y IG+ +EA Sbjct: 203 --RVIILLTDGANNSGVL---APLKAAELAKQDGIKIYTIGLGSEA 243 >gi|301792481|ref|XP_002931207.1| PREDICTED: epithelial chloride channel protein-like [Ailuropoda melanoleuca] Length = 904 Score = 37.4 bits (85), Expect = 3.5, Method: Compositional matrix adjust. Identities = 26/71 (36%), Positives = 36/71 (50%), Gaps = 9/71 (12%) Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFYS 344 II LTDGE+ D S F E K+ GA+++ I + AA + L N RFY+ Sbjct: 411 IILLTDGED-----DQISSCF--EEVKQSGAVIHTIALGPSAARELETLSNMTGGYRFYA 463 Query: 345 VQNSRKLHDAF 355 ++ L DAF Sbjct: 464 NKDINGLTDAF 474 >gi|222147837|ref|YP_002548794.1| hypothetical protein Avi_1104 [Agrobacterium vitis S4] gi|221734825|gb|ACM35788.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 483 Score = 37.4 bits (85), Expect = 3.6, Method: Compositional matrix adjust. Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 27/111 (24%) Query: 284 KKYIIFLTDGE------NSSPNIDNKESLFY---------------------CNEAKRRG 316 KK I+F+TDGE ++ P + E L + C+ K Sbjct: 369 KKVIVFMTDGEMNVKFGSTDPAKSSTEKLDWICDKNRTKSCNDTATNALLTTCDSIKSNN 428 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +YAI +EA Q L+ C+S ++Y ++ + D + I K ++ + Sbjct: 429 IEIYAISYSSEADVQNLQTCSSGTKYYFSASTTNIKDVYTAISKNIIGSTV 479 >gi|296108502|ref|YP_003620203.1| hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] gi|295650404|gb|ADG26251.1| Hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] Length = 352 Score = 37.4 bits (85), Expect = 3.7, Method: Compositional matrix adjust. Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 24/166 (14%) Query: 168 GLDMMMVLDVSLSM--NDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++MMVLD+S SM D G + +L V R+ + ++ + D R GL+ F Sbjct: 97 GYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVE--DRVGD-----RIGLILF 149 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI-AKGHDD 282 ++ PL + + +I+ G K+T + + A ++L+ + +KG Sbjct: 150 GTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTT----SIGDAVGLAVKRLQDVPSKG--- 202 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + II LTDG N+S + L AK+ G +Y IG+ +EA Sbjct: 203 --RVIILLTDGANNSGVL---APLKAAELAKQDGIKIYTIGLGSEA 243 >gi|332519334|ref|ZP_08395801.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045182|gb|EGI81375.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 334 Score = 37.4 bits (85), Expect = 3.7, Method: Compositional matrix adjust. Identities = 62/238 (26%), Positives = 98/238 (41%), Gaps = 59/238 (24%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKL-GVATRSIREMLDIIKSIPDV 212 T V +K+ G+D++M +DVS SM D ++ L VA+ + IK P+ Sbjct: 79 TVDVSTKTKTTRGIDIVMAIDVSASMLAKDLKPNRLEALKNVAS-------EFIKGRPN- 130 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWG----VQHIQE-KINRLIFGSTTKSTPGLEYAYNKIF 267 R GLV ++ + P+ ++ +QE + N +I G T GL A N++ Sbjct: 131 ---DRIGLVEYAGESYTKTPITSDKSIVLRSLQEIRYNNIIEGGTAIGM-GLATAVNRLK 186 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG---- 323 D+K K K II LTDG N+S +I+ K A G Y IG Sbjct: 187 DSKAK-----------SKVIILLTDGVNNSGSINPK---IASELAVEFGIKTYTIGLGTN 232 Query: 324 -------------------VQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 V+ E + LK A + +++ N++KL + + I K Sbjct: 233 GMALSPIAIKQNGQFQYGRVKVEIDETLLKEIAQVTGGKYFRATNNKKLAEIYDEINK 290 >gi|54298847|ref|YP_125216.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] gi|53752632|emb|CAH14067.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] Length = 344 Score = 37.4 bits (85), Expect = 3.7, Method: Compositional matrix adjust. Identities = 45/166 (27%), Positives = 79/166 (47%), Gaps = 24/166 (14%) Query: 168 GLDMMMVLDVSLSM--NDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++MMVLD+S SM D G + +L V R+ + ++ + D R GL+ F Sbjct: 89 GYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVE--DRVGD-----RIGLILF 141 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI-AKGHDD 282 ++ PL + + +I+ G K+T + + A ++L+ + +KG Sbjct: 142 GTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTT----SIGDAVGLAVKRLQDVPSKG--- 194 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + II LTDG N+S + ++ AK+ G +Y IG+ +EA Sbjct: 195 --RVIILLTDGANNSGVLAPLKA---AELAKQDGIKIYTIGLGSEA 235 >gi|209884898|ref|YP_002288755.1| hypothetical protein OCAR_5764 [Oligotropha carboxidovorans OM5] gi|209873094|gb|ACI92890.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 600 Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust. Identities = 35/146 (23%), Positives = 61/146 (41%), Gaps = 19/146 (13%) Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFLTDGENS-- 296 + K+N + T GL + + + A + + +K + Y+ YI+ L+DG N+ Sbjct: 454 LNSKVNAMNPSGNTNQAIGLFWGWQTLNTANDPFKAPSKDPNWVYQDYIVILSDGLNTQN 513 Query: 297 ----------SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---QFLKNCASPDR-- 341 P ID +E C+ K ++ I V + D Q LK+CAS Sbjct: 514 RWYTCPNAGPCPTIDGREKTL-CDNIKADKITIFTIQVNINSKDPESQVLKDCASSGSGY 572 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRI 367 F + ++ AF + ++ K RI Sbjct: 573 FQLITSANDTATAFDNVLNKIAKLRI 598 >gi|34541234|ref|NP_905713.1| batA protein [Porphyromonas gingivalis W83] gi|34397550|gb|AAQ66612.1| batA protein [Porphyromonas gingivalis W83] Length = 327 Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust. Identities = 45/169 (26%), Positives = 71/169 (42%), Gaps = 27/169 (15%) Query: 168 GLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM F P ++L A + I + P+ N G+VTF+ + Sbjct: 87 GIDIMLAMDVSGSMQAMDFKP--NRLEAAKDV---AISFINNRPNDN----IGMVTFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + + L G T GL A N++ D+K K Sbjct: 138 SFTQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKAK----------- 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +I LTDG N+ +I + + + A+ G VY +GV F Sbjct: 187 SRVVILLTDGSNNMGDITPRMA---ADIARTFGIRVYTVGVGTRGEAPF 232 >gi|255557532|ref|XP_002519796.1| protein binding protein, putative [Ricinus communis] gi|223541035|gb|EEF42592.1| protein binding protein, putative [Ricinus communis] Length = 477 Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust. Identities = 37/114 (32%), Positives = 57/114 (50%), Gaps = 21/114 (18%) Query: 145 CANSSHAPLLITSSVKI--------SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 C NS+ APL S +K+ SS GLD+++VLD+S SM G ++KL A Sbjct: 31 CMNSTTAPLE-ESKLKVMLELTGGDSSNDRPGLDLVVVLDLSGSME---GEKIEKLKAA- 85 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 +L +IK + ++ R +VTFS + PL ++ Q+ + LI G Sbjct: 86 -----ILFMIKKLSSID---RLSIVTFSRDARRLCPLRQITENSQKDLENLING 131 >gi|254456981|ref|ZP_05070409.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] gi|207085773|gb|EDZ63057.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] Length = 229 Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust. Identities = 48/207 (23%), Positives = 93/207 (44%), Gaps = 26/207 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK--IV 228 ++++LDVS SM G +D L +++ ML+ K + ++ ++TF S+ + Sbjct: 18 VVLLLDVSYSMQ---GENIDTLN---KAVESMLNSFKKAETMETFIKLSIITFGSENGVD 71 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL +++++ F T S A K+ A + + I KG D Y+ I+ Sbjct: 72 LHTPLT--------EVSKIDFKPLTVSGSTPMGAAFKMGKAMIEDKDIFKGRD-YRPTIV 122 Query: 289 FLTDGENSSPNIDNKESL---FYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPDRFYS 344 L+DGE PN D ++ L K+ + AIG + + F++ C + + Sbjct: 123 LLSDGE---PNDDWRQPLDDFVSTGRTKKCDRMALAIGAADKTVLNMFIEGCEN--SLFY 177 Query: 345 VQNSRKLHDAFLRIGKEMVKQRILYNK 371 +++ + D F +I + ++ NK Sbjct: 178 AEDAENIIDEFKKITMSVTQRTKSVNK 204 >gi|148361167|ref|YP_001252374.1| Von Willebrand factor type A (vWA) domain-containing protein [Legionella pneumophila str. Corby] gi|148282940|gb|ABQ57028.1| conserved hypothetical protein [Legionella pneumophila str. Corby] Length = 344 Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust. Identities = 45/166 (27%), Positives = 79/166 (47%), Gaps = 24/166 (14%) Query: 168 GLDMMMVLDVSLSM--NDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++MMVLD+S SM D G + +L V R+ + ++ + D R GL+ F Sbjct: 89 GYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVE--DRVGD-----RIGLILF 141 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI-AKGHDD 282 ++ PL + + +I+ G K+T + + A ++L+ + +KG Sbjct: 142 GTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTT----SIGDAVGLAVKRLQDVPSKG--- 194 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + II LTDG N+S + ++ AK+ G +Y IG+ +EA Sbjct: 195 --RVIILLTDGANNSGVLAPLKA---AELAKQDGIKIYTIGLGSEA 235 >gi|188994393|ref|YP_001928645.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] gi|188594073|dbj|BAG33048.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] Length = 327 Score = 37.4 bits (85), Expect = 3.9, Method: Compositional matrix adjust. Identities = 45/169 (26%), Positives = 71/169 (42%), Gaps = 27/169 (15%) Query: 168 GLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM F P ++L A + I + P+ N G+VTF+ + Sbjct: 87 GIDIMLAMDVSGSMQAMDFKP--NRLEAAKDV---AISFINNRPNDN----IGMVTFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + + L G T GL A N++ D+K K Sbjct: 138 SFTQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKAK----------- 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +I LTDG N+ +I + + + A+ G VY +GV F Sbjct: 187 SRVVILLTDGSNNMGDITPRMA---ADIARTFGIRVYTVGVGTRGEAPF 232 >gi|297286914|ref|XP_001113364.2| PREDICTED: collagen alpha-6(VI) chain-like [Macaca mulatta] Length = 2262 Score = 37.4 bits (85), Expect = 3.9, Method: Compositional matrix adjust. Identities = 51/218 (23%), Positives = 92/218 (42%), Gaps = 42/218 (19%) Query: 155 ITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRSIRE-MLDIIKSIP 210 +T+SV SSK D +D ++ ++D S S+ + ++E M+ +++ Sbjct: 983 VTASVCNSSKVDCEIDKVDLVFLMDGSTSIQQ----------TDFKKMKEFMVSVVQDFD 1032 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL--IFGSTTKSTPGLEYAYNKI 266 N VR G FS FPL G + I +I + IFG+T I Sbjct: 1033 VSNKRVRIGAAQFSDAYRPEFPLGTFIGAKEISIQIENITQIFGNT------------HI 1080 Query: 267 FDAKEKLEHI------AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A K+EH ++ + + ++ LTDG++ E+L +RRG +Y Sbjct: 1081 GAALRKVEHYFRPDMGSRINTGTPQVLLVLTDGQSQDEVAQAAEAL------RRRGIDIY 1134 Query: 321 AIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 ++G+ Q ++ + ++ +V N +L RI Sbjct: 1135 SVGIGDVDDQQLMQITGTAEKKLTVHNFDELKKVNKRI 1172 >gi|54295680|ref|YP_128095.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|53755512|emb|CAH17011.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|307611729|emb|CBX01432.1| hypothetical protein LPW_31221 [Legionella pneumophila 130b] Length = 344 Score = 37.4 bits (85), Expect = 3.9, Method: Compositional matrix adjust. Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 24/166 (14%) Query: 168 GLDMMMVLDVSLSM--NDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++MMVLD+S SM D G + +L V R+ + ++ + D R GL+ F Sbjct: 89 GYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVE--DRVGD-----RIGLILF 141 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI-AKGHDD 282 ++ PL + + +I+ G K+T + + A ++L+ + +KG Sbjct: 142 GTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTT----SIGDAVGLAVKRLQDVPSKG--- 194 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + II LTDG N+S + L AK+ G +Y IG+ +EA Sbjct: 195 --RVIILLTDGANNSGVL---APLKAAELAKQDGIKIYTIGLGSEA 235 >gi|224052500|ref|XP_002194907.1| PREDICTED: similar to anthrax toxin receptor 1 [Taeniopygia guttata] Length = 537 Score = 37.4 bits (85), Expect = 3.9, Method: Compositional matrix adjust. Identities = 41/165 (24%), Positives = 72/165 (43%), Gaps = 34/165 (20%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSSK 226 D+ VLD S S+ +H+ E+ ++S+ + ++ ++R + FSS+ Sbjct: 26 FDLYFVLDKSGSVKNHW--------------TEIYSFVESLAEKFISPMLRMSFIVFSSR 71 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-AKEKLEHIAKGHDDYKK 285 L + I+ ++ L + PG + ++ F A E++ H G Sbjct: 72 GTTIMKLTENREAIRRGLDTL-----KEELPGGDTFMHEGFKRANEQIYHETYGGVRTAS 126 Query: 286 YIIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQ 325 II LTDGE +++ FY N A+ GAIVY +GV+ Sbjct: 127 VIIALTDGEL-------QDAQFYYAEQEANRARSFGAIVYCVGVK 164 >gi|298491708|ref|YP_003721885.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233626|gb|ADI64762.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 418 Score = 37.4 bits (85), Expect = 4.0, Method: Compositional matrix adjust. Identities = 39/154 (25%), Positives = 68/154 (44%), Gaps = 25/154 (16%) Query: 145 CANSSHAPLLIT-SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 C SS L ++ S+V + + L++ ++LD S SMN G A ++++ + Sbjct: 17 CQPSSQRQLAVSISAVGETLDRRVPLNLCLILDHSGSMN----------GRALETVKKAV 66 Query: 204 DIIKSIPDVNNVVRSGLVTFS--SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 ++ + +++ R +V F +KI+ + I+++INRL T GL Sbjct: 67 SLL--VDQLSSEDRLSIVVFDHRAKILVPNQIISDRNQIKQQINRLTADGGTAIDEGLRL 124 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 +E +AKG D LTDGEN Sbjct: 125 G----------IEELAKGKKDTISQAFLLTDGEN 148 >gi|297692554|ref|XP_002823610.1| PREDICTED: centrosomal protein of 290 kDa-like [Pongo abelii] Length = 2479 Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 21/47 (44%), Positives = 30/47 (63%), Gaps = 3/47 (6%) Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + AYNKIF KE+++ + +D+ K+ I LT G P IDNK+SL Sbjct: 1830 QKAYNKIFREKEEID---QENDELKRQIKRLTSGLQGKPLIDNKQSL 1873 >gi|313203640|ref|YP_004042297.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442956|gb|ADQ79312.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 327 Score = 37.4 bits (85), Expect = 4.0, Method: Compositional matrix adjust. Identities = 49/161 (30%), Positives = 67/161 (41%), Gaps = 27/161 (16%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+MM LD+S +M P +L A E I S P+ N GLV F+ + Sbjct: 87 GIDIMMALDISSTMLAGDIKP--TRLEAAKSVATEF---ILSRPNDN----IGLVIFARE 137 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + ++ +N + T GL A N+I D K K Sbjct: 138 SFTQCPLTTDHAVLVNLFNGVNNGMIEDGTAIGLGLANAVNRIKDGKSK----------- 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K II LTDG N+S +I + AK G +Y IGV Sbjct: 187 SKVIILLTDGSNNSGDI---APITAAEIAKTFGIRIYTIGV 224 >gi|149180101|ref|ZP_01858606.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] gi|148852293|gb|EDL66438.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] Length = 931 Score = 37.4 bits (85), Expect = 4.1, Method: Compositional matrix adjust. Identities = 47/195 (24%), Positives = 83/195 (42%), Gaps = 35/195 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-- 226 L M++VLD S SM + K+ +A + ++++ + G + F + Sbjct: 408 LGMVIVLDRSGSMAGY------KIQLAKEAAIRSAELLREKDTL------GFIAFDDRPW 455 Query: 227 -IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 I+ T P+ + + EKIN L G T P LE AY ++ + + +K Sbjct: 456 QIIDTEPIK-DKEKVIEKINGLTSGGGTNIFPSLELAYEQLTPLELQ-----------RK 503 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD---RF 342 +II LTDG++++ + + L E K + + + E +D L S + RF Sbjct: 504 HIILLTDGQSAT----SPDYLTTIQEGKENNITLSTVAI-GEGSDSVLLEELSDEGGGRF 558 Query: 343 YSVQNSRKLHDAFLR 357 Y V +S + R Sbjct: 559 YDVNDSSTIPSILSR 573 >gi|166033217|ref|ZP_02236046.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] gi|166027574|gb|EDR46331.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] Length = 1465 Score = 37.4 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 6/76 (7%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-----MDKLGVATRS-IREMLDIIKS 208 ++S+ KI+ ++ + LD+++VLDVS SM+D G G +D L A S I + Sbjct: 108 LSSTAKITGQTTVPLDIVLVLDVSGSMDDPMGSGDNTKRIDALKAAVNSFIDGSAKVNDQ 167 Query: 209 IPDVNNVVRSGLVTFS 224 DVN R +V F+ Sbjct: 168 RADVNKQNRIAVVKFA 183 >gi|120598362|ref|YP_962936.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|146293560|ref|YP_001183984.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|120558455|gb|ABM24382.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|145565250|gb|ABP76185.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 339 Score = 37.4 bits (85), Expect = 4.1, Method: Compositional matrix adjust. Identities = 51/233 (21%), Positives = 96/233 (41%), Gaps = 45/233 (19%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHF--GPGMDKLGVATRSIREMLDIIK 207 P + +++ SK G D+MM +D+S SM D G +D+ + + + ++ K Sbjct: 70 PQWLGDPIELPSK---GRDLMMAVDLSGSMQIEDMVVNGKTVDRFTLIQHVVSDFIERRK 126 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R GL+ F+ PL + + + + G K T E I Sbjct: 127 GD-------RIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGE----AIA 175 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 A ++ + I D+ + +I LTDG N++ NI+ +++ A R +Y +GV A+ Sbjct: 176 LAVKRFDKI----DESNRVLILLTDGSNNAGNIEPEQA---AQIAANRKVTIYTVGVGAD 228 Query: 328 AADQ---FLKNCASPD-----------------RFYSVQNSRKLHDAFLRIGK 360 ++ F + +P R++ +NS++L + I K Sbjct: 229 VMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDK 281 >gi|289178041|gb|ADC85287.1| Fibronectin-binding protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 2710 Score = 37.4 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 55/120 (45%), Gaps = 14/120 (11%) Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + D N R GLVT++S + L + ++ ++ L T++ G++ A + + Sbjct: 260 VSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVDDLKASGATRADLGMQTANTVLGN 319 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDN---KESLFYCNEAKRRGAIVYAIGV 324 A+ D K +IF TDG+ + N +N +++ K GA VY++G+ Sbjct: 320 ARA----------DASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGI 369 >gi|281348290|gb|EFB23874.1| hypothetical protein PANDA_022043 [Ailuropoda melanoleuca] Length = 426 Score = 37.4 bits (85), Expect = 4.2, Method: Compositional matrix adjust. Identities = 26/71 (36%), Positives = 36/71 (50%), Gaps = 9/71 (12%) Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFYS 344 II LTDGE+ D S F E K+ GA+++ I + AA + L N RFY+ Sbjct: 83 IILLTDGED-----DQISSCF--EEVKQSGAVIHTIALGPSAARELETLSNMTGGYRFYA 135 Query: 345 VQNSRKLHDAF 355 ++ L DAF Sbjct: 136 NKDINGLTDAF 146 >gi|183601829|ref|ZP_02963198.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|241190320|ref|YP_002967714.1| hypothetical protein Balac_0261 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195726|ref|YP_002969281.1| hypothetical protein Balat_0261 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|183218714|gb|EDT89356.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|240248712|gb|ACS45652.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250280|gb|ACS47219.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295793307|gb|ADG32842.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis V9] Length = 2696 Score = 37.4 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 55/120 (45%), Gaps = 14/120 (11%) Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + D N R GLVT++S + L + ++ ++ L T++ G++ A + + Sbjct: 246 VSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVDDLKASGATRADLGMQTANTVLGN 305 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDN---KESLFYCNEAKRRGAIVYAIGV 324 A+ D K +IF TDG+ + N +N +++ K GA VY++G+ Sbjct: 306 ARA----------DASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGI 355 >gi|89055932|ref|YP_511383.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] gi|88865481|gb|ABD56358.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] Length = 612 Score = 37.4 bits (85), Expect = 4.3, Method: Compositional matrix adjust. Identities = 20/61 (32%), Positives = 35/61 (57%), Gaps = 1/61 (1%) Query: 309 CNEAKRRGAIVYAIGVQA-EAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 C+ A G IVYAIG +A + + +++CAS D Y R++ +AF I + + + R+ Sbjct: 551 CDVANAAGIIVYAIGFEAPDRGQRVMEHCASVDANYFDVEGREISEAFASIARSINQLRL 610 Query: 368 L 368 + Sbjct: 611 I 611 >gi|319426861|gb|ADV54935.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 339 Score = 37.0 bits (84), Expect = 4.4, Method: Compositional matrix adjust. Identities = 51/233 (21%), Positives = 96/233 (41%), Gaps = 45/233 (19%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHF--GPGMDKLGVATRSIREMLDIIK 207 P + +++ SK G D+MM +D+S SM D G +D+ + + + ++ K Sbjct: 70 PQWLGDPIELPSK---GRDLMMAVDLSGSMQIEDMVVNGKTVDRFTLIQHVVSDFIERRK 126 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R GL+ F+ PL + + + + G K T E I Sbjct: 127 GD-------RIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGE----AIA 175 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 A ++ + I D+ + +I LTDG N++ NI+ +++ A R +Y +GV A+ Sbjct: 176 LAVKRFDKI----DESNRVLILLTDGSNNAGNIEPEQA---AQIAANRKVTIYTVGVGAD 228 Query: 328 AADQ---FLKNCASPD-----------------RFYSVQNSRKLHDAFLRIGK 360 ++ F + +P R++ +NS++L + I K Sbjct: 229 VMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDK 281 >gi|307721534|ref|YP_003892674.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306979627|gb|ADN09662.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 303 Score = 37.0 bits (84), Expect = 4.4, Method: Compositional matrix adjust. Identities = 51/208 (24%), Positives = 89/208 (42%), Gaps = 25/208 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K SSK G D++ LD S SM + G + V R + ++++S G Sbjct: 73 KTSSKRK-GRDLVFALDTSGSMAES---GFNPENVQNRKFDALKELLRSFITKRYNDNVG 128 Query: 220 LVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + F + PL++ V + + + I G +T GL A L+ + Sbjct: 129 VSIFGTYAYPAIPLSYDMGSVAFLLDFFDVGIAGDSTAIGEGLAMA----------LKIL 178 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKN 335 KG +K II +TDG +S + KE++ +AK++ +Y IG+ +A D L Sbjct: 179 KKGEAK-EKVIILITDGYQNSGAVSVKEAV---QKAKKQHVKIYTIGIGDRSAFDANLLQ 234 Query: 336 CASPD---RFYSVQNSRKLHDAFLRIGK 360 + + + + +N + L D + I K Sbjct: 235 LIAKNTDAKMFEAKNVKMLQDIYKEIDK 262 >gi|219682744|ref|YP_002469127.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] gi|219620394|gb|ACL28551.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] Length = 2582 Score = 37.0 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 55/120 (45%), Gaps = 14/120 (11%) Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + D N R GLVT++S + L + ++ ++ L T++ G++ A + + Sbjct: 260 VSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVDDLKASGATRADLGMQTANTVLGN 319 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDN---KESLFYCNEAKRRGAIVYAIGV 324 A+ D K +IF TDG+ + N +N +++ K GA VY++G+ Sbjct: 320 ARA----------DASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGI 369 >gi|154486447|ref|ZP_02027854.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] gi|154084310|gb|EDN83355.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] Length = 882 Score = 37.0 bits (84), Expect = 4.6, Method: Compositional matrix adjust. Identities = 49/189 (25%), Positives = 81/189 (42%), Gaps = 39/189 (20%) Query: 169 LDMMMVLDVSLSMNDHFGPG--MDKLGVATRSIREMLD-----------------IIKSI 209 +D +VLDVS SM+D +L +++ LD ++K Sbjct: 191 IDFTLVLDVSGSMDDPMSKTDRTRRLDALKEAVKAFLDEAANTNTEAGSELVHVGLVKFA 250 Query: 210 PDV-----NNVVRSGLVTFS-SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D +++ RSG T++ S+IV L + ++ K+++L T++ G Sbjct: 251 GDKTDKIGDDMYRSGGYTYNYSQIVSN--LTADMNGLKNKVSKLKAAGATRADNG----- 303 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNK---ESLFYCNEAKRRGAIV 319 F+ K+ A D KK +IF DG SS + K +++ E K GA V Sbjct: 304 ---FNRAVKVMGSASARTDAKKVVIFFADGSPTSSSGFEGKVANKAVEAAKELKDGGAAV 360 Query: 320 YAIGVQAEA 328 Y+IG+ A A Sbjct: 361 YSIGIFASA 369 >gi|125527010|gb|EAY75124.1| hypothetical protein OsI_03018 [Oryza sativa Indica Group] Length = 589 Score = 37.0 bits (84), Expect = 4.9, Method: Compositional matrix adjust. Identities = 32/107 (29%), Positives = 53/107 (49%), Gaps = 16/107 (14%) Query: 148 SSHAP----LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 +S AP LL+ SS GLD++ V+DVS SM+ G G+DK A + Sbjct: 43 ASMAPHDQELLLELRGSSSSTDRAGLDLVAVIDVSGSMD---GDGIDKAKTALQF----- 94 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 +I+ + D++ + +VTFS+ + PL + Q ++ L+ G Sbjct: 95 -VIRKLSDLDRLC---IVTFSTNATRLCPLRFVTAAAQAELKALVDG 137 >gi|150024244|ref|YP_001295070.1| BatA protein [Flavobacterium psychrophilum JIP02/86] gi|149770785|emb|CAL42250.1| BatA protein [Flavobacterium psychrophilum JIP02/86] Length = 333 Score = 37.0 bits (84), Expect = 5.0, Method: Compositional matrix adjust. Identities = 43/174 (24%), Positives = 75/174 (43%), Gaps = 30/174 (17%) Query: 158 SVKISSKSDI--GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +V +S+K + G+D++M +D+S SM F P + + ++ S + Sbjct: 78 TVDVSNKRNTTNGIDIVMAIDLSSSMLAKDFKP---------NRMEALKEVAASFVEARQ 128 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAK 270 R G+V ++++ P+ + + IN + + T GL A N++ D+K Sbjct: 129 SDRIGVVVYTAEAYTKTPVTSDKAVVLDAINTIKYDNVLQDGTGIGMGLATAVNRLKDSK 188 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K K II +TDG N++ I E + AK G VY IG+ Sbjct: 189 AK-----------SKVIILMTDGVNNAGFI---EPVTAAEFAKEFGIKVYTIGI 228 >gi|329850248|ref|ZP_08265093.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328840563|gb|EGF90134.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 575 Score = 37.0 bits (84), Expect = 5.1, Method: Compositional matrix adjust. Identities = 27/90 (30%), Positives = 42/90 (46%), Gaps = 8/90 (8%) Query: 285 KYIIFLTDGENSSPNIDNKES------LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 KY+I LTDG N+ S C AK G V+ + V+ + L+NCAS Sbjct: 485 KYMIVLTDGINTQNRWTTNNSQINARLALACTNAKNLGITVFTVRVE-QGDSTTLQNCAS 543 Query: 339 PDR-FYSVQNSRKLHDAFLRIGKEMVKQRI 367 +Y++ N+ +L +I K + K R+ Sbjct: 544 QTAYYYNLSNADQLPATMSKIMKSIRKVRL 573 >gi|325678986|ref|ZP_08158584.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109490|gb|EGC03708.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 782 Score = 37.0 bits (84), Expect = 5.5, Method: Compositional matrix adjust. Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 4/87 (4%) Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E A G Y I+ L+DGE+ + + ESL N A + IV +G+ E +L Sbjct: 382 EFSAAGDGKYVNIIVMLSDGESDEVDAETIESL--SNLANEKSVIVLTVGLGREIDRAWL 439 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRI 358 + A + ++YS ++ L D + +I Sbjct: 440 QEVAYSTGGKYYSASDATSLDDVYKQI 466 >gi|296125842|ref|YP_003633094.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296017658|gb|ADG70895.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 328 Score = 37.0 bits (84), Expect = 5.5, Method: Compositional matrix adjust. Identities = 47/150 (31%), Positives = 70/150 (46%), Gaps = 25/150 (16%) Query: 160 KISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLGVATR---SIREMLDIIKSIPDVN 213 K+ SDI G+ + +V+DVS SM M + + TR S + M+D IK Sbjct: 74 KVDHLSDINGEGIYISLVVDVSPSM-------MAEDMMPTRLEASKKTMIDFIKK----R 122 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 N + LV F+ + P + ++E+I + ++ GL A A + L Sbjct: 123 NFDKISLVAFALRASVLSPSTFDYTLLEEEIKNIKIDEEGSTSIGLGIA-----TAVDML 177 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + KG D +K II LTDGEN+S ID K Sbjct: 178 RSV-KG--DNEKIIILLTDGENNSGEIDPK 204 >gi|320101871|ref|YP_004177462.1| hypothetical protein Isop_0318 [Isosphaera pallida ATCC 43644] gi|319749153|gb|ADV60913.1| protein of unknown function DUF1009 [Isosphaera pallida ATCC 43644] Length = 328 Score = 37.0 bits (84), Expect = 5.6, Method: Compositional matrix adjust. Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 1/59 (1%) Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + + A+R+G VY +G++ EAAD+ + CA + + SV + AF R+G + V Sbjct: 50 PILFAEAARRQGLEVYCVGIRYEAADELVPLCARFE-WVSVTRMNSMIQAFQRMGIDEV 107 >gi|255039218|ref|YP_003089839.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951974|gb|ACT96674.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 320 Score = 37.0 bits (84), Expect = 5.6, Method: Compositional matrix adjust. Identities = 41/161 (25%), Positives = 67/161 (41%), Gaps = 25/161 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G D+ MV+D+S SM D V + ++ + + R G++ FS+ Sbjct: 77 GKDIFMVVDLSKSM--------DAADVTPSRLEKVKFELNRFIENERANRIGIIIFSNDA 128 Query: 228 VQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + IQ L+ + T +E AYNK+ ++ + Sbjct: 129 YIHVPLTYDAAALELFIQSLQTDLLPTNGTNVCGAIEMAYNKLMNSADPTSR-------- 180 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K ++ TDGENSS + +LF N +R G VY++ V Sbjct: 181 AKMMVLFTDGENSSSCTN---ALF--NNLRRFGIGVYSVAV 216 >gi|325981245|ref|YP_004293647.1| von Willebrand factor type A [Nitrosomonas sp. AL212] gi|325530764|gb|ADZ25485.1| von Willebrand factor type A [Nitrosomonas sp. AL212] Length = 651 Score = 36.6 bits (83), Expect = 5.9, Method: Compositional matrix adjust. Identities = 40/172 (23%), Positives = 73/172 (42%), Gaps = 21/172 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L + +++DVSLS + G G L + ++ + + + D + TF+S+ Sbjct: 461 LSVAILMDVSLSTDSWIG-GRRILDIEKEALITLATGLAACRDT-----FAIYTFTSR-K 513 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + G++ E N + T PG Y ++ A +H+ + + I+ Sbjct: 514 RDYVRVTGIKDFNESFNTQVLRRITALRPGY---YTRMGAALRHTQHLLSQRSERHRLIL 570 Query: 289 FLTDGENSSPN-IDNKESLFYCN-------EAKRRGAIVYAIGVQAEAADQF 332 LTDG+ PN +D E + EA+R G V+ I + +A D F Sbjct: 571 LLTDGK---PNDLDYYEGRYGVEDTRQAIIEARRAGLSVFGITIDHKAQDYF 619 >gi|312072174|ref|XP_003138945.1| hypothetical protein LOAG_03360 [Loa loa] gi|307765891|gb|EFO25125.1| hypothetical protein LOAG_03360 [Loa loa] Length = 1596 Score = 36.6 bits (83), Expect = 6.1, Method: Compositional matrix adjust. Identities = 50/217 (23%), Positives = 96/217 (44%), Gaps = 27/217 (12%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + + V+I ++ D D+M +LD S ++ + K G++ I E+ D+ PD Sbjct: 895 MRVATPVRICNRVDFQADVMFILDSSDNVTSEEYSNL-KEGISML-IDEIFDL---SPD- 948 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +VR G V +S K PL + K+ L S ++ G + AKE+ Sbjct: 949 --IVRVGFVEYSDKASVPVPLGY----YDNKVQLLADISNSEQLGGTPIILRGLRAAKEQ 1002 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-VYAIGVQAEAADQ 331 + G D+ + ++ +T G N N+ F ++ + + ++ + V A Q Sbjct: 1003 FQR--HGRDNVSRILLLVTSGANRG-NV-----AFAADDLREHLNVSIFVLVVNASQGAQ 1054 Query: 332 FLKNCASPD-----RFYSVQNSRKLHDA-FLRIGKEM 362 + N + D R S+ ++ KL +A L+IG+ + Sbjct: 1055 IMLNRLTSDEYTQQRVISISSANKLQEAELLQIGQAL 1091 >gi|213963729|ref|ZP_03391979.1| BatA protein [Capnocytophaga sputigena Capno] gi|213953609|gb|EEB64941.1| BatA protein [Capnocytophaga sputigena Capno] Length = 333 Score = 36.6 bits (83), Expect = 6.1, Method: Compositional matrix adjust. Identities = 55/234 (23%), Positives = 89/234 (38%), Gaps = 52/234 (22%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 T S +K G+D++M +DVS SM + P + A + + +K P+ Sbjct: 79 THSENAQTKITDGIDIVMAIDVSSSMLSQDLKPNRFE---ALKKVASQF--VKDRPN--- 130 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKE 271 R GLV ++ + P+ I + L +G T GL A N++ ++K Sbjct: 131 -DRIGLVVYAGESYTKTPVTTDKGIILSSLAELTYGQVEDGTAIGMGLATAVNRLKESKA 189 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------- 324 K + II LTDG N++ ID L A G VY +G+ Sbjct: 190 K-----------SRVIILLTDGVNNTGVID---PLIAAELAAEYGIKVYTVGIGTNGMAL 235 Query: 325 ----------------QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 Q E + +K A + R++ N++KL + I K Sbjct: 236 SPYALNPDGSIMYRMLQVEIDESLMKKIAQVTHGRYFRATNNQKLQQIYDEINK 289 >gi|73960091|ref|XP_537088.2| PREDICTED: similar to chloride channel calcium activated 4 [Canis familiaris] Length = 905 Score = 36.6 bits (83), Expect = 6.6, Method: Compositional matrix adjust. Identities = 40/144 (27%), Positives = 61/144 (42%), Gaps = 22/144 (15%) Query: 218 SGLVTF-SSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKL 273 +G+VTF SS +Q + + EKI + G T GL + I Sbjct: 347 TGMVTFESSATIQNYLTEITDHNAYEKILANLPQAAGGGTSICSGLRAGFQAI------- 399 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF- 332 I + I+ LTDGE D+ SL + E K+ G++++ I + AA + Sbjct: 400 --IHSNQNTSGSEIVLLTDGE------DDNISLCF-EEVKKSGSVIHTIALGPSAAKELE 450 Query: 333 -LKNCASPDRFYSVQNSRKLHDAF 355 L N RFY+ ++ L DAF Sbjct: 451 ILSNMTGGHRFYANKDINGLIDAF 474 >gi|78776847|ref|YP_393162.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497387|gb|ABB43927.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 307 Score = 36.6 bits (83), Expect = 6.7, Method: Compositional matrix adjust. Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAEAADQFLKNCA--SPD 340 K ++ LTDGE++S +I K++L AK +Y IG+ + EA + LK A S Sbjct: 192 KIVVLLTDGEHNSGDISPKDALVL---AKEENIKIYTIGMGNRGEADEALLKKIADESGG 248 Query: 341 RFYSVQNSRKLHDAFLRI 358 F+ N+++L + + I Sbjct: 249 EFFYATNAKELKEIYEHI 266 >gi|169624118|ref|XP_001805465.1| hypothetical protein SNOG_15311 [Phaeosphaeria nodorum SN15] gi|111056124|gb|EAT77244.1| hypothetical protein SNOG_15311 [Phaeosphaeria nodorum SN15] Length = 1587 Score = 36.6 bits (83), Expect = 6.9, Method: Compositional matrix adjust. Identities = 34/147 (23%), Positives = 67/147 (45%), Gaps = 19/147 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMN-------DHFGPGMDKLGVATRSIREMLD-IIKSIPDVNN 214 ++S G+D +V +++ N P + +L V +++M D I + N Sbjct: 1158 ARSGRGMDQPLVFKLTILGNRKSSSSNVVKSPHLSRLDV----LKQMFDAFINRLLAYNF 1213 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 GLV F SK P+ V++ + K+N ++ T ++ I A+++L+ Sbjct: 1214 QTHIGLVAFGSKASVAQPITNAVENFRHKLNNMLASGDTA-------IWDSIALAQDQLQ 1266 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNID 301 A+ + + II ++DGE++ N D Sbjct: 1267 TYAEKYPTARLRIICISDGEDTKSNQD 1293 >gi|113969745|ref|YP_733538.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113884429|gb|ABI38481.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 338 Score = 36.6 bits (83), Expect = 7.2, Method: Compositional matrix adjust. Identities = 55/264 (20%), Positives = 105/264 (39%), Gaps = 45/264 (17%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 Q N+S+ SR + + + P + +++ S+ G D+M+ +D+S S Sbjct: 39 QTGKANISSHSRQSRKRYWLMWSLLVLAIARPQWLGDPIELPSQ---GRDLMLAVDLSGS 95 Query: 181 MN--DHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 M D G +D+ + + E ++ K R GL+ F+ PL Sbjct: 96 MQIEDMVINGKVVDRFTLIQHVVSEFIERRKGD-------RIGLILFADHAYLQAPLTQD 148 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + + + G K T E I A ++ + + D+ + +I LTDG N+ Sbjct: 149 RRSVAQFLKEAQIGLVGKQTAIGE----SIALAVKRFDKM----DESNRVLILLTDGSNN 200 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ---FLKNCASPD------------- 340 + NID ++ A R +Y +GV A+ ++ F + +P Sbjct: 201 AGNIDPDQA---AQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAE 257 Query: 341 ----RFYSVQNSRKLHDAFLRIGK 360 R++ +NS++L + I K Sbjct: 258 VTHGRYFRARNSQELEQIYQEIDK 281 >gi|307720603|ref|YP_003891743.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978696|gb|ADN08731.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 310 Score = 36.6 bits (83), Expect = 7.3, Method: Compositional matrix adjust. Identities = 40/157 (25%), Positives = 64/157 (40%), Gaps = 12/157 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G ++ ++LD S SM + G D + A + I+K GLV F S Sbjct: 83 GHEIALILDASGSMKER---GFDPVNPAASRFDVVKSIVKDFISQRTNDNMGLVVFGSYS 139 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + +HI +I ++ G+ Y +++A + ++ K K Sbjct: 140 FIASPLTYD-KHILSRI-------VSQLEVGMAGKYTALYEALAQGVNLLKMSKAKSKVA 191 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 I LTDG S+ D + AK+ G VY IG+ Sbjct: 192 ILLTDG-YSTAGADKIPLDVVLDMAKKEGVKVYPIGI 227 >gi|253701051|ref|YP_003022240.1| von Willebrand factor A [Geobacter sp. M21] gi|251775901|gb|ACT18482.1| von Willebrand factor type A [Geobacter sp. M21] Length = 331 Score = 36.6 bits (83), Expect = 7.3, Method: Compositional matrix adjust. Identities = 43/177 (24%), Positives = 70/177 (39%), Gaps = 26/177 (14%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSM----NDHFGPGMDKLGVATRSIREMLDIIK 207 P + ++ S+ G+D+++ LD+S SM G G +L A R + + K Sbjct: 73 PQAVERESRVRSR---GMDLVLALDLSTSMLAEEQGREGRGESRLAAAKRVLAGFIGGRK 129 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R GLV F+ + PL Q +Q ++RL G+ T + I Sbjct: 130 DD-------RIGLVAFAGRPYPAAPLTSDHQWLQGVVDRLDTGAVEDGT----ALGDAIL 178 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 +L + +I +TDG N++ E AK G V+AIG+ Sbjct: 179 SGVNRLRR----RPAESRALILITDGRNNA----GAEPQLAAQAAKALGIRVHAIGI 227 >gi|158891|gb|AAA29076.1| em100 gene is homologous the Eimeria tenella gene et100 (accession number M73495) encoding the microneme protein Etp100 [Eimeria maxima] Length = 724 Score = 36.2 bits (82), Expect = 7.6, Method: Compositional matrix adjust. Identities = 46/186 (24%), Positives = 76/186 (40%), Gaps = 18/186 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIKSIPDVNNVVRSGLVTFSSKI 227 LD+M+V+D S S+ G G +R + + ++P + VR GLVTF + Sbjct: 46 LDVMLVVDESGSI------GTSNYG----KVRSFISNFAGTMPLSPDDVRVGLVTFGTSA 95 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG-HDDYKKY 286 V + L+ Q G Y + + A+E L KG D+ K Sbjct: 96 VTRWDLS--DSRAQNADLLAAAAKKLPYAAGSTYTHLGLAKAEEILFSFQKGGRDNAPKM 153 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 I+ +TDG +S ++L + + RG I+ +GV ++ A D +V+ Sbjct: 154 ILVMTDGASSR----RSQTLSAAEKLRNRGVIIVVLGVGTGVNSAECRSIAGCDTSDTVE 209 Query: 347 NSRKLH 352 R L Sbjct: 210 CPRYLQ 215 >gi|167758708|ref|ZP_02430835.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] gi|167663904|gb|EDS08034.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] Length = 1865 Score = 36.2 bits (82), Expect = 7.8, Method: Composition-based stats. Identities = 48/196 (24%), Positives = 90/196 (45%), Gaps = 21/196 (10%) Query: 168 GLDMMMVLDVSLSM--NDHFGPGMDKLGVA---TRSIREMLDIIKSIPDVNNVVRSGLVT 222 G+D+++V+D S SM ND+ + + +++ ++D I +PD ++V R ++ Sbjct: 544 GVDVLLVIDKSGSMGLNDNGRTDSNYFNLMPTLKKTVPTLVDTI--LPDSDSVNRVAAIS 601 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FSS ++ W + + NR I G TK + A +A +KL+ A+ Sbjct: 602 FSSDDYTGNDISTDWVDYNGKSGFNRKIEGLGTKGGTNWQLAMR---NADKKLKPRAESQ 658 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-----FLKN 335 + KK ++FL+DGE + K S + E + G Y+ A D+ +LK+ Sbjct: 659 N--KKVVVFLSDGEPTYRY--EKRSYYPYEEYETGGGQSYSSANLTNAVDEVAGSDYLKD 714 Query: 336 CASPDRFYSVQNSRKL 351 + + Q S ++ Sbjct: 715 AEIYSVYLTSQTSTRM 730 >gi|126174972|ref|YP_001051121.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125998177|gb|ABN62252.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 339 Score = 36.2 bits (82), Expect = 7.8, Method: Compositional matrix adjust. Identities = 52/243 (21%), Positives = 99/243 (40%), Gaps = 47/243 (19%) Query: 144 WC--ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHF--GPGMDKLGVATR 197 WC + P + +++ S+ G D+MM +D+S SM D G +D+ + Sbjct: 60 WCLLVLAIARPQWLGEPIELPSQ---GRDLMMAVDLSGSMQIEDMVVNGKTVDRFTLIQH 116 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 + + ++ K R GL+ F+ PL + + + + G K T Sbjct: 117 VVSDFIERRKGD-------RIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTA 169 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 E I A ++ + + D+ + +I LTDG N+S NI+ +++ A R Sbjct: 170 IGE----AIGLAVKRFDKM----DESNRVLILLTDGSNNSGNIEPEQA---AQIAANRKV 218 Query: 318 IVYAIGVQAEAADQ---FLKNCASPD-----------------RFYSVQNSRKLHDAFLR 357 +Y +GV A+ ++ F + +P R++ +NS++L + Sbjct: 219 TIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQE 278 Query: 358 IGK 360 I K Sbjct: 279 IDK 281 >gi|307943460|ref|ZP_07658804.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307773090|gb|EFO32307.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 320 Score = 36.2 bits (82), Expect = 8.6, Method: Compositional matrix adjust. Identities = 29/86 (33%), Positives = 41/86 (47%), Gaps = 9/86 (10%) Query: 291 TDGENSSPNIDNKESLFY----CNEAKRRGAIVYAIGVQAEA---ADQFLKNCA-SPDRF 342 T G +SP S+ Y C++AK +G I+Y +G Q D L CA SP + Sbjct: 235 TRGNMNSPANSKHNSVAYMKTMCDQAKAKGIIIYTVGFQIRRNTLPDLSLSYCATSPSHY 294 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRIL 368 Y V++S L AF I + RI+ Sbjct: 295 YFVESS-DLSAAFKAIASSIKSLRIV 319 >gi|303242740|ref|ZP_07329210.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302589715|gb|EFL59493.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 429 Score = 36.2 bits (82), Expect = 8.8, Method: Compositional matrix adjust. Identities = 37/150 (24%), Positives = 65/150 (43%), Gaps = 29/150 (19%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ +S K I ++ ++D S SM K+ A +++ +D++ ++ Sbjct: 34 ALGLSDKQQIKEAIIFIIDTSGSM------AGGKIENAIKTLHTCIDLL------DDSCH 81 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEK 272 ++TFS FP+ +EK + L+ +G T S P L A Sbjct: 82 FCILTFSESTTTIFPMEKSSIEAKEKAHILVADIKVYGGTKLSAP-LNTA---------- 130 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 + G+DDY K F+TDG NS +I+N Sbjct: 131 -SLVFAGYDDYLKIAYFVTDGNNSQSDINN 159 >gi|327284423|ref|XP_003226937.1| PREDICTED: anthrax toxin receptor 2-like [Anolis carolinensis] Length = 441 Score = 36.2 bits (82), Expect = 8.9, Method: Compositional matrix adjust. Identities = 43/165 (26%), Positives = 69/165 (41%), Gaps = 16/165 (9%) Query: 189 MDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 +DK G T + E++D +K + D V+ +R + FS + L I+ + Sbjct: 48 LDKSGSVTDNWFEIVDFVKQLTDRFVSPRMRLSFIVFSMQAKVILQLTENRAQIERGLEE 107 Query: 247 LIFGSTTKSTPGLE-YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG--ENSSPNIDNK 303 L PG E Y + I +A ++E G II LTDG E P K Sbjct: 108 L-----RNVKPGGETYMHEGIKEANRQIE--TAGGQRTNSIIIALTDGKLEGLIPQYAEK 160 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 ++ + ++R GA VY +GV +Q S D+ + V+ Sbjct: 161 QA----DISRRLGARVYCVGVLNFNQEQLESIADSRDQVFPVKEG 201 >gi|148976671|ref|ZP_01813358.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] gi|145964022|gb|EDK29280.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] Length = 303 Score = 36.2 bits (82), Expect = 9.3, Method: Compositional matrix adjust. Identities = 68/299 (22%), Positives = 124/299 (41%), Gaps = 61/299 (20%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G + +++ I LP I +V+GL I+ + VK+KL +D + + A + N E+ Sbjct: 12 SRGLVVLMSVIALPFILLVVGLSIDAGRAYIVKSKLFAAVDAASIAAARAVANGEDAGRA 71 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQD-INNIERSTSLSIIIDDQHKDYNLSAVS 131 QK F+ I + + L + FA D NI ++SA + Sbjct: 72 AAQKY-FAANIPADFYSA--TPNLGDVNFAYDSFGNIS---------------IDISATA 113 Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++P +F P + P + S++ +D+++V+D + S+ + Sbjct: 114 --QVPTVFL--PLIGLDTFNPGVSAQSIRRP------VDLVLVIDNTTSL---------R 154 Query: 192 LGVATRSIREMLDIIKSIPDV--NNVVRSGLV--TFSSKIVQTFPLAWGVQH--IQEKIN 245 LG ++++D KS + R LV F S++ F G I+ +I+ Sbjct: 155 LGSIGDVTQDVIDRSKSFIENFHEGFDRISLVKFAFGSEVPVGFNATRGHSRSTIKSEID 214 Query: 246 RLIFGST-----TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 FGST T ++ G+ A+N++ + K I+F TDG +PN Sbjct: 215 SFNFGSTSNAQYTNASEGMYRAFNELRTVTDPAN---------LKVIVFFTDG---APN 261 >gi|170727371|ref|YP_001761397.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169812718|gb|ACA87302.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 330 Score = 36.2 bits (82), Expect = 9.5, Method: Compositional matrix adjust. Identities = 49/234 (20%), Positives = 99/234 (42%), Gaps = 47/234 (20%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 PL + +++ SK G D+M+ +D+S SM ++ + + +++ + + + D Sbjct: 69 PLWMGDPIELPSK---GRDLMVAVDLSGSMQ------IEDMVLDGKAVNRFIMVQSVVSD 119 Query: 212 VNNVVRS---GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAYN-KI 266 + GL+ F+ PL + + + + G K T G A + K Sbjct: 120 FIERRKGDKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALSVKR 179 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 FD D+ + ++ LTDG N+S +I +++ + A +RG +Y+IGV A Sbjct: 180 FDLV----------DESNRILVLLTDGSNNSGSISPEQA---ADIAAKRGIKIYSIGVGA 226 Query: 327 EAADQ---FLKNCASPD-----------------RFYSVQNSRKLHDAFLRIGK 360 + ++ F K +P R++ +N+++L + I K Sbjct: 227 DVMERRTLFGKERVNPSMDLDEEQLTSLAQTTGGRYFRARNAQELEQIYQEIDK 280 >gi|47212423|emb|CAF93579.1| unnamed protein product [Tetraodon nigroviridis] Length = 688 Score = 36.2 bits (82), Expect = 9.5, Method: Compositional matrix adjust. Identities = 46/177 (25%), Positives = 72/177 (40%), Gaps = 31/177 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ ++D S S+ H M + M+DI+ ++ N R G+V +SS++ Sbjct: 6 VDLLFLIDSSRSVRPHEFETMRRF---------MIDILNTLDIGLNSTRVGVVQYSSQVR 56 Query: 229 QTFPLAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L V+ IQE + T + + Y N F A E Sbjct: 57 SEFSLRSHASLDSMVKAIQEMVP---LAQGTMTGLAIRYTMNVAFTAAE------GDRPK 107 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 ++ +TDG P E EA+ RG +YA+GV A A L+ ASP Sbjct: 108 VPNVVVIVTDGR---PQDRVAEVA---AEARERGMEIYAVGV-ARADMTSLRAMASP 157 >gi|255531386|ref|YP_003091758.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344370|gb|ACU03696.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 344 Score = 35.8 bits (81), Expect = 9.8, Method: Compositional matrix adjust. Identities = 48/200 (24%), Positives = 81/200 (40%), Gaps = 27/200 (13%) Query: 131 SRYEMPFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGP 187 R + F+F + A A P + T KI G D+M++LDVS SM P Sbjct: 53 GRQGLKFVFFVLAYAALVLGAANPQIGT---KIEEAKRSGSDLMILLDVSNSMLAGDLAP 109 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 ++L A R+I +++D + + R G++ F+ + P+ + +N + Sbjct: 110 --NRLENAKRAISQLIDNLHND-------RIGIIIFAGEAYVQLPITTDYSAAKLFLNNI 160 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 TT P A D K + G K +I +TDGEN +++ Sbjct: 161 ----TTDIVPTQGTAIGAAIDMGMKSFNFVNGTS---KAMILMTDGENHE-----DDAVS 208 Query: 308 YCNEAKRRGAIVYAIGVQAE 327 A + ++ IGV +E Sbjct: 209 AAKRASAKDVAIHVIGVGSE 228 Searching..................................................done Results from round 2 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 471 bits (1211), Expect = e-130, Method: Composition-based stats. Identities = 371/371 (100%), Positives = 371/371 (100%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA Sbjct: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD Sbjct: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS Sbjct: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI Sbjct: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI Sbjct: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGK Sbjct: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 Query: 361 EMVKQRILYNK 371 EMVKQRILYNK Sbjct: 361 EMVKQRILYNK 371 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 393 bits (1010), Expect = e-107, Method: Composition-based stats. Identities = 204/370 (55%), Positives = 272/370 (73%), Gaps = 3/370 (0%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 LNIRNFFYN KG ++ILTAI LP+IF+V+G++IE SH FF+K LH ++D SL++ AT+I Sbjct: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 +N+ NGNN KK K IKN W FRNELR+NGF DI++I RSTSL I++ Q++ Sbjct: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 Query: 124 DYNLSAVSRYEMPFIFCTF-PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Y++SA+SRY++P FCTF PW NS H + ITSSVK++S++D LDMM+VLDVS SM Sbjct: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 F + K+ +A +SI ML+ +K IPDVNNVV+SGLVTFS+KI + F L WGV H+Q Sbjct: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 Query: 243 KINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 KI L FG +T STPGL+YAYN+IFD + +H +YKK I+F+TDGEN S D Sbjct: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 ++SL+YCNEAK+RGAIVYAIG++ + +FL+ CASP+ FY V+N ++DAF IGK+ Sbjct: 306 -QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 Query: 362 MVKQRILYNK 371 +V +RI Y+K Sbjct: 365 IVTKRIWYDK 374 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 309 bits (792), Expect = 4e-82, Method: Composition-based stats. Identities = 117/369 (31%), Positives = 202/369 (54%), Gaps = 20/369 (5%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 M + IRNFF N +G I+I +AI+ P+I I+M +V E S+ + K +L ++D +LL T Sbjct: 1 MYCIKIRNFFQNKRGIITITSAIIFPLIIILMAIVFEMSNIYLEKERLQAVIDRALLDTV 60 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 T I N KN + IW + + EL + F+ D+ N+ TS+ + D Sbjct: 61 TMIKL---KNIEDVVKNV---GPVNTIWTKNLKYELEHSDFSSDVQNVIDDTSMKLESDS 114 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPW-CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 K +++A+S+Y+MPF C C + + + + SS+KI +D+M+VLDVS Sbjct: 115 NFKTLSITAISQYKMPFKICNIHLLCPKNKYVTVPVLSSMKIGRNEGSDIDLMIVLDVSS 174 Query: 180 SMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 SM+D+F +L VA +SIR+ML+ + +P+ NV R+G V F+ + PL Sbjct: 175 SMDDNFMKPEEAPCSRLEVAKKSIRKMLEDFRKVPNYANVFRTGSVGFNDMVQFPMPLKR 234 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 G++ I I + +T S G++YA+ +++ + + D KK +IFLTDGEN Sbjct: 235 GLKRIYNDIKKYRAFGSTNSYVGMKYAWEQLYGNPQDTK-------DRKKIVIFLTDGEN 287 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAF 355 N ++++ CN+ K++ A++Y+I + + + L+ C+S Y+ +++ L A+ Sbjct: 288 MIINA-TRKTIELCNDMKKKKAVIYSIALAVD-NKEVLQGCSSSGNVYAADDAQSLVQAY 345 Query: 356 LRIGKEMVK 364 IGK+++K Sbjct: 346 SLIGKDVMK 354 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 302 bits (773), Expect = 5e-80, Method: Composition-based stats. Identities = 92/371 (24%), Positives = 159/371 (42%), Gaps = 20/371 (5%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 IR N G+ +++ A +PV+F+ L ++T++ +K +L +D + L TA ++ Sbjct: 9 KIRKLLRNENGNFALIAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAALATAARLS 68 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 +EN + Q F+ + + + DF NGF S + ++ ID Sbjct: 69 EEENLTAAQAQA--FALKFVNGQVKEDFG---AFNGF---------SVTPTVNIDPVETG 114 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS--KSDIGLDMMMVLDVSLSMN 182 M P L ++ K S ++ M +VLD S SM+ Sbjct: 115 GRTVWKVAVSMEGSQSLTPMARIMGKDKLTVSVVGKSESAGEAQGAFSMALVLDRSGSMD 174 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 + G K+ V ++ +++ + VR G +++SK+ + L W +E Sbjct: 175 WNLN-GQKKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNSKLTGSTKLRWNPGKTKE 233 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 ++ L T ST ++AY + +E H AK KK+I+F+TDG+N+ + D Sbjct: 234 FVDALPASGGTDSTDAFDWAYTAVTHKRENNTHDAKSGQVPKKFIVFMTDGDNNYSSAD- 292 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGK 360 + C++AK G VY + A Q L CAS + F+ QNS +L +AF IG Sbjct: 293 SSTKHLCDDAKDDGIEVYTVAFAAPNRGKQLLSYCASTEEHFFDAQNSAQLIEAFKNIGY 352 Query: 361 EMVKQRILYNK 371 K + Sbjct: 353 AASKVVSRLTE 363 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 287 bits (734), Expect = 2e-75, Method: Composition-based stats. Identities = 82/365 (22%), Positives = 147/365 (40%), Gaps = 16/365 (4%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 I N GSI L IL+ ++ V+ + I+ S F + +L D + + T +L Sbjct: 71 IPNAHKERDGSILPLFGILIMLLLAVVTIGIDMSQTFGERTRLQTAADMAAVQTGRALLA 130 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDIN-NIERSTSLSIIIDDQHKD 124 +E Q N ++ I + +G + ++ + ++ +D + Sbjct: 131 EEIT---IAQANAYAKDAFNRIASGLSASGDGSSGTSIFGTMTVKPAVQITETVDGNTTN 187 Query: 125 YNLSAVSRYEMP---FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 Y + ++P F F + L S +++ L M +VLD S SM Sbjct: 188 YVVKVNGTAKIPASPLSFMFFDGETGKNTISLGFESETTAKAEAGASLSMALVLDRSGSM 247 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 M +L A RS+ + L + + R G + L W ++ Sbjct: 248 GWERPSRMSELKKAVRSLIKELQTVD---PDDQFTRLGAYAYHWYYAGKKELTWNKNSVR 304 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 +N L T++ P ++ A N + E HI K + +I+++TDG + PN Sbjct: 305 SWVNSLPASGGTRAAPAIQKAKNDLLTNSELNAHINKNEQEPDLFILYMTDGIDGDPNWA 364 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIG 359 +E C AK G +Y + +A A+ + LK CA+ D +Y +N+ +L+ F I Sbjct: 365 KRE----CTSAKNAGITIYTVAFKAPASGRNLLKACATSDAHYYDAKNANELNKVFKDIA 420 Query: 360 KEMVK 364 +E K Sbjct: 421 RETTK 425 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 283 bits (724), Expect = 2e-74, Method: Composition-based stats. Identities = 91/424 (21%), Positives = 149/424 (35%), Gaps = 68/424 (16%) Query: 2 SFLN--IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYT 59 SFL+ +R F + G+ I+TAIL PV+ GL I+ K +L + + Sbjct: 4 SFLHPCLRRMFSDRGGNFGIMTAILAPVLLGAAGLAIQVGDMLLSKQQLQ---EAADSAA 60 Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 NG Q F+ +N N L Q+ +I+ +T++++ Sbjct: 61 LATATALGNGTIQTSQAEAFA----RNFVAGQMANYL------QNGVDIKNATAVNVQTS 110 Query: 120 DQHKD--YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS----SVKISSKSDIGLDMMM 173 + K Y ++ Y++ ++ H T+ S S S + M + Sbjct: 111 NSGKSASYQVTVTPSYDLTVNPLMQAVGFSTQHLSTSSTTVSGPSQTPGSNSQGSVSMFL 170 Query: 174 VLDVSLSMND-----------------------------------HFGPGMDKLGVATRS 198 LD S SM D K+ + Sbjct: 171 ALDKSGSMGDPTETVNKDQPTETFTYDCNPHLNKKGKWVYDTCTGSRTNYYTKIEALKMA 230 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + + S VR+G V++ LAWG + +N L G T S+ Sbjct: 231 AGNLFGQLTSADPDAQYVRTGAVSYDIDQYTPSTLAWGTSGVSSYVNALQAGGGTNSSGA 290 Query: 259 LEYAY------NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFY 308 + AY N + E H K KKYI+F+TDG+N++ + + + Sbjct: 291 MGTAYSSLTAKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTLTKAT 350 Query: 309 CNEAKRRGAIVYAIGVQAEAADQ-FLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 C+ AK +G +Y I A Q L+ CAS ++ + L AF IG + Q Sbjct: 351 CDTAKSKGIEIYTIAFMAPPGGQALLQYCASDAAHYFQAEQMEDLLAAFKAIGAKASAQL 410 Query: 367 ILYN 370 Sbjct: 411 TRLT 414 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 282 bits (722), Expect = 4e-74, Method: Composition-based stats. Identities = 96/458 (20%), Positives = 179/458 (39%), Gaps = 91/458 (19%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F N + GS +++A+LLPVIF+V+GL+I+ + L ++ + L + + Sbjct: 8 FFNFKRLKKCYNGSFFVISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTAALSASVQ 67 Query: 63 ILNQ-ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 +LN E+ + K + IK + + L N D I + T ++I ++ Sbjct: 68 LLNSVEDKSKEKALSSVLGENNIKQYLLNNLKISLYNNFGEMDSQRIIQHTKVNIY--NR 125 Query: 122 HKDYNLSAVSRYEMPFIF--CTFPWCANSSHAPLLITSSVKISSKSDI----GLDMMMVL 175 + ++ S Y +P F N P+ +++SK + G+ + ++ Sbjct: 126 KGTHIINVYSHYNLPLNPFSLFFMNLINIKSWPITTVGEAEVTSKKNYHKEEGVSVQWLI 185 Query: 176 DVSLSMN----------------------------------DHFGPGMDKLGVATRSIRE 201 D S SM D F P + +L RS+ Sbjct: 186 DDSGSMGSIIDRACFGSKQLKSQYNVGSKIGIVRNENADTSDSFYPIVGELVSCDRSLYY 245 Query: 202 MLDI--------------------------------------IKSIPDVNNVVRSGLVTF 223 +L+ ++ I ++ + +R + F Sbjct: 246 VLNDKKILEDDDLEEKNLDNHSQYYIRKRYLVRDALATFIKRVRKIDNLKDKLRMSFMYF 305 Query: 224 SSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + +I FP+ WG++ +++++ R + T P L+ AYNK+ E EH K Sbjct: 306 NERIDHYFPMTWGIKEFKQEVSSHYKRKHENTATDIHPILQEAYNKLHSKNEDDEHKKKN 365 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-----AADQFLK 334 + KK+I+ LTDG + L C+ AK G ++ I + A+ FL Sbjct: 366 SVEVKKFIVLLTDGAQNEGVHSVDSVLKICDAAKEEGIKIFTISYSVDSSERKKANDFLS 425 Query: 335 NCASPDRFYSVQNSRKLHDAFL-RIGKEMVKQRILYNK 371 CASPD+F+ ++ KL+ F IG + ++ + + Sbjct: 426 RCASPDKFFEAYDADKLNMIFKEHIGDAIFERLVKIRR 463 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 274 bits (701), Expect = 1e-71, Method: Composition-based stats. Identities = 86/420 (20%), Positives = 144/420 (34%), Gaps = 65/420 (15%) Query: 2 SFLN--IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYT 59 SFL+ +R F + G+ I+TAIL PV+ G+ I+ K +L + + Sbjct: 4 SFLHPCLRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLISKQQLQ---EAADSAA 60 Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI--I 117 NG Q F+ +N N L Q +I+ +T +++ Sbjct: 61 LATATALANGTIQTSQAEAFA----RNFVAGQMANYL------QSGVDIKSATGVTVQTN 110 Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ Y++ + H +++ S++ + M + LD Sbjct: 111 TSGNSTSYQVTVSPSYDLTVNPLMQAVGFTTQHLS-TSGTTIGGHSQTQGSISMYLALDK 169 Query: 178 SLSMND-----------------------------------HFGPGMDKLGVATRSIREM 202 S SM + K+ + + Sbjct: 170 SGSMGEDTATVNEEDPTESYTYDCNGHYNKKGKWIYDTCTGSRANYYTKIEALKMAAGNL 229 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + S VR+G V++ LAWG + +N L G T S+ + A Sbjct: 230 FGQLSSADPNAQYVRTGAVSYDIVQYTPSALAWGTSGVSTYVNALQAGGGTNSSGAMSTA 289 Query: 263 Y------NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFYCNEA 312 Y N + E H K KKYI+F+TDG+N+ + + + C+ A Sbjct: 290 YSSLTAKNAAGNDAEDAAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTA 349 Query: 313 KRRGAIVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 K +G +Y I A Q L CAS D ++ + L AF IG + Q Sbjct: 350 KSKGIEIYTIAFMAPEGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASSQLTRLT 409 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 269 bits (688), Expect = 5e-70, Method: Composition-based stats. Identities = 83/410 (20%), Positives = 156/410 (38%), Gaps = 56/410 (13%) Query: 2 SFLN--IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYT 59 SFL+ +R + G+ I+TAI+LPV+F G+ I+ K +L + + Sbjct: 5 SFLHPCLRRMLGDRGGNFGIMTAIVLPVLFGAAGMAIQVGDLLLSKQQLQ---EAADSAA 61 Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI--I 117 NG Q F+ + ++ + +I+ +T + + Sbjct: 62 LATATALANGTIQTSQAEAFARDFVAGQMANYLQSGI----------DIKSTTGVDVRTT 111 Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 + Y ++ Y + + + +++ +S++ + M +VLD Sbjct: 112 TSGKSTSYQVTVSPDYNIAVNPLMQTIGFTTQNIS-TSSTTTSGNSQTQGSVSMFLVLDR 170 Query: 178 SLSMNDHFGP-----------------------------------GMDKLGVATRSIREM 202 S SM + K+ ++ + Sbjct: 171 SGSMGEDTATVNASDPTEEYNYDCSEKDRYGNVTKKKTCTDTRPHYYTKIEALKLAVGTL 230 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + ++ VR+G V+++ ++ + L WG H+ + +N+L T S + A Sbjct: 231 TGELDAVDPEKEYVRTGAVSYNIEMQKAKALDWGTAHVTKYVNKLTATDGTDSGEAFKTA 290 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 YNK+ DA E H+ K KYI+F+TDG+N+ + D E+ +C++A+ VY I Sbjct: 291 YNKLADAAEDKAHVDKTGQVPTKYIVFMTDGDNNYTSADT-ETKTWCDKARDAKMQVYTI 349 Query: 323 GVQAEAADQ-FLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 A A Q L CA+ P ++ + L AF IG + Q Sbjct: 350 AFMAPARGQALLSYCATAPGNYFPAGDMTALLKAFKEIGMKASNQVTRLT 399 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 268 bits (685), Expect = 9e-70, Method: Composition-based stats. Identities = 84/415 (20%), Positives = 143/415 (34%), Gaps = 64/415 (15%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F + G+ I+TAIL PV+ G+ I+ K +L + + Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSKQQLQ---EAADSAALATATA 66 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD- 124 NG + F+ +N N L Q +I+ +TS+++ K Sbjct: 67 LANGTIQTTEAEAFA----RNFVAGQMANYL------QSGTDIKSTTSVNVQTTTSGKST 116 Query: 125 -YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 Y ++ Y + + H +++ S++ + M + LD S SM + Sbjct: 117 SYQVTVSPAYVLTVNPLMQAVGFTTQHLS-TSGTTIGGHSQTQGSISMFLALDKSGSMGE 175 Query: 184 ------------------------------------HFGPGMDKLGVATRSIREMLDIIK 207 K+ + + + Sbjct: 176 DTATVNEESPTESYTYDCNLHYNTKNNKWVYDKCTGSRTNYYTKIEALKMAAGNLFSQLN 235 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY---- 263 S VR+G V++ LAWG+ + +N L T S+ + AY Sbjct: 236 SADPNAQYVRTGAVSYDINQYAPSSLAWGITGVSSYVNALQANGGTNSSGAMNTAYTSLT 295 Query: 264 --NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFYCNEAKRRGA 317 N + E H K KKYI+F+TDG+N++ + + C++AK +G Sbjct: 296 AKNAAGNDVENSAHQQKTGQVPKKYIVFMTDGDNNNDPSGGRSYDTATKKTCDDAKSKGI 355 Query: 318 IVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 +Y I A A Q L CAS D ++ + L AF IG + Q Sbjct: 356 EIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFQAIGAKASAQLTRLT 410 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 267 bits (682), Expect = 2e-69, Method: Composition-based stats. Identities = 81/416 (19%), Positives = 140/416 (33%), Gaps = 64/416 (15%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 +R F + G+ I+TAI+ PV+ V G+ I+ K +L + + Sbjct: 9 RLRRMFSDRGGNFGIMTAIMAPVLLGVAGVAIQVGDMMLSKQQLQ---EAADSAALATAT 65 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK- 123 NG Q F+ + ++ + + + TS+++ K Sbjct: 66 ALANGTIQTSQAEAFAQNFVAGQMANYVQSGV----------DFKSGTSVNVQTSTSGKS 115 Query: 124 -DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Y ++ Y++ + H ++V S++ + M + LD S SM Sbjct: 116 TSYQVTVSPSYDLTVNPLMQAVGFKTQHLS-TSGTTVGGHSQTQGSISMFLALDKSGSMG 174 Query: 183 D------------------------------------HFGPGMDKLGVATRSIREMLDII 206 + K+ + + + Sbjct: 175 EATATVNADDPTESYTYDCNLHYNSKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLFGQL 234 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY--- 263 S VR+G V++ LAWG + +N L T S+ + AY Sbjct: 235 NSADPNAEYVRTGAVSYDINQYTPSNLAWGTAGVTSYVNALQANGGTNSSGAMSTAYSSL 294 Query: 264 ---NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFYCNEAKRRG 316 N + E H K KKYI+F+TDG+N+ + + + C+ AK +G Sbjct: 295 TAKNAAGNDAEDSAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKG 354 Query: 317 AIVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 +Y I A A Q L CAS D ++ + L AF IG + Q Sbjct: 355 IEIYTIAFMAPAGGQTLLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASAQMTRLT 410 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 259 bits (662), Expect = 4e-67, Method: Composition-based stats. Identities = 83/397 (20%), Positives = 163/397 (41%), Gaps = 49/397 (12%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N G I++A ++ V I + VI+ +H +K + LD++++ + +++ Sbjct: 16 FSKNKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAIISGCSIVVSDPK 75 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELR-------ENGFAQDINNIERSTSLSIIIDDQ 121 N+ Q+ I KN + +N EN ++ I ++ + Sbjct: 76 INDLNPQEERIRDVIKKNAYVNMVQNFPAEHAAYIIENANISFSKDLTNKYEYKITMEAK 135 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 H+ + + + MP + +H + T ++ S + MVLD S SM Sbjct: 136 HQLSGKNFILGFLMPNVI---------THISSISTGIIQKPSDKK-AFSVEMVLDCSGSM 185 Query: 182 NDHFGPGMD-----------------------KLGVATRSIREML-DIIKSIPDVNNVVR 217 D D L A+ ++ + +++ P ++ R Sbjct: 186 LDSMQESCDLSSGRGGYYFYSKNNNKPKSKIYALKTASSDFVNLIQETVQTFPQIS--AR 243 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-AKEKLEHI 276 GL+TF+ I+Q L+ I++ I+R+ T + + AY + + E H Sbjct: 244 IGLITFNHYIMQDSKLSNNFNVIKKTISRMKPKGGTDTFLPMNAAYEYLNNIPNETKAHN 303 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQFL 333 + K+YII +TDGEN+ P+ D K ++ C+ A++ G I+Y+I + + Sbjct: 304 ISDNVPLKRYIILMTDGENNHPSYDLK-TINVCDNARKNGIIIYSIFLNYYEYTDGYELA 362 Query: 334 KNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 + CAS + F+ N++ L D+F I + + + Sbjct: 363 RKCASSEKHFFYANNTKALLDSFKSIAHAIQDKAVRI 399 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 253 bits (647), Expect = 3e-65, Method: Composition-based stats. Identities = 80/416 (19%), Positives = 163/416 (39%), Gaps = 53/416 (12%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 L+ + G+ ++TA++ P++ V G+ ++ ++ K +L D + L A+ + Sbjct: 9 LSFMRMLRDRGGNFGMMTALVAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASAL 68 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENG------FAQDINNIERSTS---- 113 ++ + + D + + +K ++L + G + ++ +T Sbjct: 69 VSDARPDIE--EAKDLARKFLKTQAAAATASDLPDEGPSIGARGGGNADDEVPATPRWED 126 Query: 114 -----LSIIIDDQH---KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS 165 + I K + ++ +++ + F T S T+ SK+ Sbjct: 127 VNATEIDITATPNGAKGKSFQVTVANKHLLQFNAMTRLLGPESIEIETRSTAESATESKN 186 Query: 166 DIGLDMMMVLDVSLSMNDHFGP----------------------------GMDKLGVATR 197 L M +VLD S SM + K+ Sbjct: 187 --ALSMYLVLDRSGSMAWKTNTINTGKAKCPNYTEANWSKYPDLKATGPCYVTKIDALKT 244 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 ++ ++L + + + VR+G ++++S L+WG + ++ L+ T S Sbjct: 245 AVGDLLAQLVTADPESAYVRTGAISYNSAQDAASSLSWGTRGAAGYVDALVAIGGTASGN 304 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 + A+ K+ +A E EH AK KYI+F+TDGEN+ N D+ + +C+ AK Sbjct: 305 AFKTAFQKVTNAAEDSEHGAKNGQVPTKYIVFMTDGENNHAN-DDTVTRQWCDTAKASKV 363 Query: 318 IVYAIGVQAE-AADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 +Y++ A + LK+CA S ++ + + L AF IG+ K Sbjct: 364 QIYSVAFMAPDRGQKLLKSCASSSSHYFEAEEASDLVAAFKAIGERAAASVSRLTK 419 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 249 bits (635), Expect = 5e-64, Method: Composition-based stats. Identities = 91/403 (22%), Positives = 186/403 (46%), Gaps = 62/403 (15%) Query: 20 LTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDF 79 +TAI++ V F+ + I+ +H +++ ++ LD ++L I++ + +K+ Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 Query: 80 SYRIIKNIWQTDFRNELRENGF-AQDINNIERSTSLSIIIDDQHK-DYNLSAVSRYEMP- 136 S I++ + L++ + ++ +I + ++I D + Y + ++YE+P Sbjct: 61 S-----TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 Query: 137 ---FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-------- 185 F+ P + ++ L T ++ SS+ ++ + + MVLDVS SM D + Sbjct: 116 ENLFLKGLIP--SALTNLSLRSTGIIERSSE-NLAISICMVLDVSRSMEDLYLQKHNDNN 172 Query: 186 -------------------------------GPGMDKLGVATRSIREMLDII-KSIPDVN 213 P K+ V S +++ I K+I + Sbjct: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 Query: 214 NV-VRSGLVTFSSKIV--QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 N+ VR G + ++ IV Q PL+ + ++ ++N+L T + P + +AY ++++ K Sbjct: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEA 328 E H G KK++IF+TDGENS + + +L C + G +Y++ V A Sbjct: 293 E-SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 Query: 329 ADQ-FLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 Q L+ C S +F++V +SR+L ++F +I ++ +Q + Sbjct: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 248 bits (632), Expect = 1e-63, Method: Composition-based stats. Identities = 80/404 (19%), Positives = 158/404 (39%), Gaps = 59/404 (14%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G+ ++TAILLPV+ G +E ++ VKA L D + L AT+ + Sbjct: 11 RRFLADTGGNFGMMTAILLPVLLGFAGAGMELANVMQVKADLQNTADSAALAAATEARLK 70 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII--DDQHKD 124 E + IK I + +++ + ++ +E+++ ++I D + K Sbjct: 71 E---------GALTDEQIKEIAKAFIASQMEKTLTEEEKKALEKNSPVNIGTTDDARGKT 121 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 Y + Y+M + L T + + + M +VLD S SM+ Sbjct: 122 YTIQTTINYQMQLNPLL--GFFGAKTLDLAATGTAVSTVNKGAPISMYLVLDRSGSMSFK 179 Query: 185 FGP----------------------------GMDKLGVATRSIREMLDIIKSIPD----- 211 ++K ++ ++ + Sbjct: 180 TDTLNTKKTSCQNYTVDNWGSYPNLKNTSPCYVNKATSLKTAVGYLVATLNKADPTYTAN 239 Query: 212 -VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIF 267 + +VR+G ++ + P+ WG + +++ I T + L AYN + Sbjct: 240 GGSELVRTGASVYTHETYAAQPITWGTSSVATYVDKQIPEFPSGGTDARSSLNAAYNALK 299 Query: 268 DAK--EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL-----FYCNEAKRRGAIVY 320 A E EH K + +++YI+ +TDGE + + S+ C+ AK+ G ++ Sbjct: 300 KANTVEAKEHKDKKSESFERYIVLMTDGEMTGNSSSWSSSIDQTVRNTCDTAKKDGIKIF 359 Query: 321 AIGVQAE-AADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEM 362 ++ A L++CAS D +Y+ +N ++ AF I ++ Sbjct: 360 SVAFMAPDKGKSLLQHCASSLDNYYAPENMEQIVTAFGEIARKA 403 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 242 bits (617), Expect = 7e-62, Method: Composition-based stats. Identities = 91/408 (22%), Positives = 162/408 (39%), Gaps = 53/408 (12%) Query: 2 SFLN--IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYT 59 SFL+ +R + G+ I+TAI++PV+ GL I+ S+ K +L D + L Sbjct: 4 SFLHPCLRRMLGDRGGNFGIMTAIMMPVLLGAAGLAIDYSNMALSKRELQEATDSAALAA 63 Query: 60 ATKILNQENGNNGKKQ--KNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSII 117 AT + + + DF + N TD + ++ G + DI+ +TS Sbjct: 64 ATALASGAASTTADAEAIAKDFVSGQMANYVDTDAISSIKA-GTSVDIDVSATATS---- 118 Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS---SVKISSKSDIGLDMMMV 174 K Y ++ + Y + P+ + + L I + + +S + L M +V Sbjct: 119 -----KSYKVTVATSYGI----AATPFMSVLGYKTLNIGASTSTSSGTSDTKTALSMELV 169 Query: 175 LDVSLSMND-----------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 LD S SM + + + K+ ++ + D + + +++VR+G ++ Sbjct: 170 LDQSGSMGEKTTTCATYNGKNCKTYVTKIDALKKAADALFDALDTADPDHSLVRTGAYSY 229 Query: 224 S---------SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK---- 270 + ++I +AWG ++ + T +T + A I A Sbjct: 230 NNGLIYNSQKTQIKSMSGMAWGTATTATYVSGITASGGTDATEPMRQATLSIAKASDGSD 289 Query: 271 -EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES-----LFYCNEAKRRGAIVYAIGV 324 E H KG+ +YII +TDGE + + S C+ K G ++ + Sbjct: 290 VETQAHAVKGNTIVSRYIILMTDGEMTGNTGVWQSSFDQNVRNQCDATKTAGIKIFTVAF 349 Query: 325 QAE-AADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 A Q L+ CASP +Y + KL +F I KE K L Sbjct: 350 MAPDKGKQLLQYCASPGGNYYEAETMEKLVASFTSIAKEATKAVTLLT 397 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 72/399 (18%), Positives = 126/399 (31%), Gaps = 60/399 (15%) Query: 21 TAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFS 80 TAIL PV+ G+ + K +L + + NG + ++ Sbjct: 1 TAILAPVLLGAAGMAVHVGDMLLSKQQLQ---EAADSAALATATALANGKIQTSEAEAYA 57 Query: 81 YRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC 140 +N N L+ + + S+ Y ++ Y++ Sbjct: 58 ----RNFVAGQMANYLQSGVDIKSATGV----SVQTNTSGNSTSYQVTVSPSYDLTVNPL 109 Query: 141 TFPWCANSSHAPLLITSSVKISSKSDI--GLDMMMVLDVSLSMND--------------- 183 + H T+ S++ + M + LD S SM + Sbjct: 110 MQAVGFTTQHLSTSGTTIGGGHSQTQGQGSISMYLALDKSGSMGEDTATVNEEDPTESYT 169 Query: 184 --------------------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ + + + VR+G V++ Sbjct: 170 YPCNPHYNRKGKEVWDTCTGSRANYYTKIEALKMAAGNLFAQLSGADPNAQYVRTGAVSY 229 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY------NKIFDAKEKLEHIA 277 LAWG + +N L G T S+ + AY N + E H Sbjct: 230 DIVQYAPSSLAWGAIGVSSYVNALQAGGGTNSSGAMSTAYLSLTAKNAAGNDAEDSAHKL 289 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKE----SLFYCNEAKRRGAIVYAIGVQAEAADQ-F 332 K +KYI+F+TDG+N++ + + + C+ AK +G +Y I A Q Sbjct: 290 KSGQIPQKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPPGGQAL 349 Query: 333 LKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 L+ CAS ++ + L AF IG + Q Sbjct: 350 LQYCASDASHYFQAEKMEDLFAAFKAIGAKASTQVTRLT 388 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 240 bits (611), Expect = 4e-61, Method: Composition-based stats. Identities = 78/412 (18%), Positives = 156/412 (37%), Gaps = 59/412 (14%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G+ ++TAILLPV+ V G +E ++ VKA + D + L AT+ + Sbjct: 11 RRFLADTSGNFGMMTAILLPVLLGVAGAGMELANVMQVKADMQNTADSAALAAATEARLR 70 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ--HKD 124 E S IK I + ++ +N A++ +E+++ + + K Sbjct: 71 EGK---------LSDEQIKEIAKNFIAAQMEKNLTAEEKIELEKNSPTRVTTTENARGKT 121 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-- 182 Y + ++++ + L +T + K + + M + LD S SM+ Sbjct: 122 YAVETTIKHQIQLNPML--GFIGAKTLDLSVTGTAKSTINKGAPISMYLALDRSGSMSFK 179 Query: 183 ------------------DHFGP--------GMDKLGVATRSIREMLDIIKSIPD----- 211 P ++K ++ ++ + Sbjct: 180 TDTVDTTKTSCQNYTSDNWSKYPNLAKTSPCYVNKAASLKTAVGFLVATLNKADPTYTVN 239 Query: 212 -VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIF 267 + +VR+G ++ + + WG + +++ I T + L AYN + Sbjct: 240 GGSELVRTGASVYTHETYVAQSIGWGTSGVTSYVDKQIPEFPSGGTDARSSLNAAYNALK 299 Query: 268 DAKEKLE--HIAKGHDDYKKYIIFLTDGENSSPNIDNKESL-----FYCNEAKRRGAIVY 320 A H KG + +++YI+ +TDGE + + S+ C AK+ G ++ Sbjct: 300 KANPDEARYHKEKGSESFERYIVLMTDGEMTGNSAAWNSSIDQSVRTTCETAKKDGIKIF 359 Query: 321 AIGVQAE-AADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 ++ A L+ CAS D +Y+ +N ++ AF I ++ Sbjct: 360 SVAFMAPDKGKSLLQYCASSADNYYAPENMEQIVTAFGEIARKAAGSIATLT 411 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 239 bits (609), Expect = 6e-61, Method: Composition-based stats. Identities = 75/410 (18%), Positives = 151/410 (36%), Gaps = 49/410 (11%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ-- 66 + G+ ++TA++ P++ V G+ ++ ++ K +L D + L A+ +++ Sbjct: 3 MLRDRGGNFGMMTALIAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVSDAR 62 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNE----LRENGFAQDINNIERSTSLSIIIDDQH 122 + K F + D E + G + ++ + I+ Sbjct: 63 PDIEEAKAIARKFLKTQMAATSSADVPGEAVGTMAAAGSTAPSWDDVNTSEVVIVETPNG 122 Query: 123 ---KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 K + +S +++ + F T L S+ +++S + M +VLD S Sbjct: 123 TKGKSFQVSVANKHLLQFNAMTR--LLGKESIELETRSTADSATESKNAISMYLVLDRSG 180 Query: 180 SMNDHFGP---------------------------GMDKLGVATRSIREMLDIIKSIPDV 212 SM +DK+ ++ ++ + + Sbjct: 181 SMAWKTDTVDTSRPRCINWTASNWGESNVRATSPCYVDKITTLKSAVDKLFTPLAKMDPG 240 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 N +R+G +++ + + L WG ++ + L T S+ A ++ E Sbjct: 241 NEYLRAGAASYNDRQDRASKLTWGTKNASAHVQGLDATGGTDSSSAFAAAVEELLLDGEN 300 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPN---------IDNKESLFYCNEAKRRGAIVYAIG 323 H+AK +KYI+F+TDGEN+S N + + C AK G ++ + Sbjct: 301 EAHLAKNGQTPEKYIVFMTDGENTSYNGKTSPRDLEKADSVTKAACTTAKNNGIAIFTVA 360 Query: 324 VQAE-AADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 A LK CA SPD + ++ L F +IG++ K Sbjct: 361 FMAPQRGKDLLKACATSPDHYKEADDAAALVSEFEKIGQKAAAMIARLTK 410 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 223 bits (567), Expect = 5e-56, Method: Composition-based stats. Identities = 78/403 (19%), Positives = 149/403 (36%), Gaps = 43/403 (10%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE- 67 + G+ ++TA+ P++ G+ I+ ++ K +L D + L A+ +++ E Sbjct: 14 MLKDRGGNFGMMTAVAAPLLLAAGGVSIDMANMLMTKNQLQDATDAAALAAASALVSDEQ 73 Query: 68 -NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDIN-----NIERSTSLSIIIDDQ 121 + K+ F D + E + + + + ++I Sbjct: 74 PDIAAAKEIARKFLKTQAGGTTTPDAPADSGEGASSGAASSTPDWDDVNTLEVNITETPN 133 Query: 122 H---KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 K + ++ +++ F T + L +S+ + +++S L M +VLD S Sbjct: 134 GTKGKIFQVTVINKRVTEFNAMTR--LLGTDSIELEASSTAESATESKNALSMYLVLDRS 191 Query: 179 LSMNDHFGP----------------------------GMDKLGVATRSIREMLDIIKSIP 210 SM + K+ ++ ++L + Sbjct: 192 GSMAWKTNTINAAKKSCPNYTESNWSRYPNLWASSPCYVTKIDALKTAVTDLLAQLLVAD 251 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 VR+ ++++S LAWG +N L+ T S + AY K+ A Sbjct: 252 PDQIYVRTAAISYNSVQDTAGTLAWGTSGAAAYVNALVATGGTASAGAFKTAYQKVIAAT 311 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AA 329 E H AK KY++F+TDGEN+ N D+ + +C+ AK +Y++ A Sbjct: 312 ENTAHAAKNGQVPSKYMVFMTDGENNYAN-DDTVTKQWCDTAKANKVEIYSVAFMAPERG 370 Query: 330 DQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 LK CA S ++ + L AF IG+ K Sbjct: 371 QALLKYCASSSSHYFEAEEVTDLVAAFKAIGERAAAVVSRLTK 413 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 216 bits (550), Expect = 4e-54, Method: Composition-based stats. Identities = 65/335 (19%), Positives = 111/335 (33%), Gaps = 56/335 (16%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSI--IIDDQHKDYNLSAVSRYEMPFIFCTF 142 +N N L Q +I+ +T +++ Y ++ Y++ Sbjct: 1 RNFVAGQMANYL------QSGVDIKSATGVTVQTNTSGNSTSYQVTVSPSYDLTVNPLMQ 54 Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND------------------- 183 + H +++ S++ + M + LD S SM + Sbjct: 55 AVGFTTQHLS-TSGTTIGGHSQTQGSISMYLALDKSGSMGEDTATVNEEDPTESYTYDCN 113 Query: 184 ----------------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 K+ + + + S VR+G V++ Sbjct: 114 GHYNKKGKWIYDTCTGSRANYYTKIEALKMAAGNLFGQLSSADPNAQYVRTGAVSYDIVQ 173 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY------NKIFDAKEKLEHIAKGHD 281 LAWG + +N L G T S+ + AY N + E H K Sbjct: 174 YTPSALAWGTSGVSTYVNALQAGGGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKTGQ 233 Query: 282 DYKKYIIFLTDGENSSPNIDNKE----SLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNC 336 KKYI+F+TDG+N+ + + + C+ AK +G +Y I A Q L C Sbjct: 234 TPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPEGGQALLHYC 293 Query: 337 ASPD-RFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 AS D ++ + L AF IG + Q Sbjct: 294 ASDDSHYFQAEKMEDLLAAFKAIGAKASSQLTRLT 328 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 200 bits (508), Expect = 3e-49, Method: Composition-based stats. Identities = 91/405 (22%), Positives = 165/405 (40%), Gaps = 47/405 (11%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F ++ ++ G+ ILTAI +PV+ G+ ++ ++ ++L D + L TAT Sbjct: 8 FRSLIQLIHDRTGNFGILTAIAIPVVAATAGVAVDVTNMTVSNSQLQQATDAAALATATA 67 Query: 63 ILNQ-ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 + N +N ++ F + N +G + ++ T+ ++ Sbjct: 68 LANGNATTSNAQQLATQFVTGQMSNYL----------SGDTNTADALKAGTTANVTSATN 117 Query: 122 ---HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG-LDMMMVLDV 177 Y ++ + Y+M + + H TS+ ++ + L M + LD Sbjct: 118 SSGGTSYTVAVNASYDMSVNGMSQLLGIKTMHVSAASTSTSGSAAAAKQAALSMEIALDK 177 Query: 178 SLSM----------------------------NDHFGPGMDKLGVATRSIREMLDIIKSI 209 S SM + K+ ++ +LD + S Sbjct: 178 SGSMLLNTDVIDTSQKSCTQYYTEGNYLYQYPKAKSPCYIKKIAALKTAVGTLLDQLDSA 237 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK-INRLIFGSTTKSTPGLEYAYNKIFD 268 + VR+ + +SS++ + LAWG + I+ L T+S+ + AY + Sbjct: 238 DPKSQYVRTAAIAWSSEVDSSSALAWGTTTTRSNVISGLNANGGTESSAPMALAYKNVSA 297 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + E AKG+ ++K I+ +TDGEN++ + D K +L C AK G ++Y++ A Sbjct: 298 SSEATAQAAKGNTTFQKIIVLMTDGENNATSSDTK-TLATCKAAKDAGVLIYSVAFMAPD 356 Query: 329 ADQ-FLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 Q LKNCA SP ++ Q L AF IG + KQ L K Sbjct: 357 RGQTLLKNCASSPSNYFDAQQMSDLIAAFKTIGNQASKQITLLTK 401 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 200 bits (507), Expect = 4e-49, Method: Composition-based stats. Identities = 79/407 (19%), Positives = 165/407 (40%), Gaps = 46/407 (11%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 + + K + SI+ A+ + +++G +I + K + + ++L A+K+ Sbjct: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 Query: 64 LN---------QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL 114 ++ + N+ K+ D + R IKN + + + +I NI S+ + Sbjct: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL-SGYSAVFYNTEIQNIVNSSRI 127 Query: 115 SIIIDDQHKDYNLSAVSRYEMPF---------IFCTFPWCANSSHAPLLITSSVKISSKS 165 S+ ++ + + Y M + I + ++I Sbjct: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 Query: 166 DIGLDMMMVLDVSLSMNDHFGP--------------GMDKLGVATRSIREMLDIIKSIPD 211 + +V+D+S SM+ K+ ++ LD I + Sbjct: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFD 268 V V GL+ +++++ + +WG + +++ + R + T STP ++ AY + Sbjct: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 Query: 269 AKEK-------LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 K++ + + ++K+IIFLTDGEN++ N ++ C++AK + Sbjct: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKIVT 366 Query: 322 IGVQAEAADQ-FLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 I + A Q LK C +SP+ Y+V N+ L F I + MV ++ Sbjct: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 197 bits (501), Expect = 2e-48, Method: Composition-based stats. Identities = 72/400 (18%), Positives = 139/400 (34%), Gaps = 43/400 (10%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 + G+ ++TA+LLPV V GL ++ + ++ L +D + L A+ + N Sbjct: 41 RLLRHSGGNFGMMTAVLLPVSIGVAGLAMDATEMVQSRSALQSSVDAAALAAASAMSNGM 100 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + + F + N D + + + ++T ++ Y++ Sbjct: 101 SEADAIALAKSFLSSQLANTMARDENTSSVDQITQAEPDISVKTTQVN----SSSTSYDV 156 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 Y + + + + ++ S+ L M +VLD S SMND Sbjct: 157 ELTGSYTITMNPLSRVLGWETVTLK-AYGKAQAATTASESPLSMYLVLDRSGSMNDETAT 215 Query: 188 GM-----------------------------DKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ ++ ++ +K + VR+ Sbjct: 216 TYTGTCTKTTTSGYGWNKKTTTTSYSCTKNYTKIESLKLAVADLAAQLKKADPNSEYVRT 275 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI--FDAKEKLEHI 276 G ++++ ++WG ++ +N L T + L AY+ + + E H Sbjct: 276 GADSYNASADTAQAMSWGTANVVTYVNALSATGGTDARGALSAAYSALQTSNKTEITAHN 335 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKES-----LFYCNEAKRRGAIVYAIGVQAEAADQ 331 +YI+F+TDGE + + S C K G +Y + A A + Sbjct: 336 VSSVSKIGRYIVFMTDGEMTGNSSSWSSSIDSAVRSQCTSIKADGIQIYTVAFMAPANGK 395 Query: 332 -FLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 L CAS +Y ++ L AF IGK+ Sbjct: 396 SLLSACASDASHYYEATDAASLVAAFGEIGKKATSTSTRL 435 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 191 bits (485), Expect = 2e-46, Method: Composition-based stats. Identities = 60/265 (22%), Positives = 91/265 (34%), Gaps = 48/265 (18%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMND----------------HFGPGM-------- 189 +V S+S + M + LD S SM D P + Sbjct: 4 TSGRTVSGHSQSQGSISMFLALDKSGSMGDPTATVNADDPTEPFTYDCNPHLNKKGTKII 63 Query: 190 ------------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 K+ + + + S VR+G V++ LAWG+ Sbjct: 64 YDTCTGSRAHYYTKIEALKIAAGNLFSQLNSADPNAEYVRTGAVSYDLVEYTPSKLAWGI 123 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAY------NKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + +N L G T S+ + AY N + E H K KKYI+F+T Sbjct: 124 TAVTSYVNALESGGGTNSSGAVNTAYTSLTAKNAAGNDAEDAAHKLKTGQLPKKYIVFMT 183 Query: 292 DGENSSPNIDNKE----SLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPD-RFYSV 345 DG+N+ + + + C+ AK +G Y I A Q L CAS D ++ Sbjct: 184 DGDNNDDSRGGRSYDTLTKATCDTAKAKGIETYTIAFMAPEGGQALLHYCASDDAHYFQA 243 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYN 370 + L AF IG + Q Sbjct: 244 EKMEDLLAAFKAIGAKASAQVTRLT 268 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 190 bits (483), Expect = 2e-46, Method: Composition-based stats. Identities = 90/452 (19%), Positives = 168/452 (37%), Gaps = 92/452 (20%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 + +C G I+TA+L+PV+ V G++++ + + L +++ + ++ Sbjct: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 Query: 67 --------------ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN----- 107 + +F + KN + R+ +R+ + Sbjct: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 Query: 108 --------------------------IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT 141 I+ + HK++ +S ++ + F Sbjct: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSI--QWVIDFSRSM 189 Query: 142 FPWCANSSHAPL-----LITSSVKISSKSDIGLDM-------MMV--------------L 175 + +S PL +VK S + + + MV L Sbjct: 190 LDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPL 249 Query: 176 DVSLSMNDHFGPG-----MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 D SLS + K + ++ ++ IK I +VN+ VR G F+ +++ Sbjct: 250 DPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISD 309 Query: 231 FPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 +WGV + I + +T ++ AY+ I + E H K + + KK Sbjct: 310 PSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKK 369 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-----FLKNCASPD 340 YI+ LTDGEN+ DN+E + CN+AK +G + I Q FL NCASP+ Sbjct: 370 YIVLLTDGENTQ---DNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPN 426 Query: 341 RFYSVQNSRKLHDAFLR-IGKEMVKQRILYNK 371 F+ ++ +L+ F IG E+ ++ I K Sbjct: 427 SFFEANSTHELNKIFRDRIGNEIFERVIRITK 458 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 176 bits (447), Expect = 3e-42, Method: Composition-based stats. Identities = 79/413 (19%), Positives = 166/413 (40%), Gaps = 55/413 (13%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 N + + K + SIL +++L I + +G++I + K + ++L A+KI Sbjct: 9 FNFKKIILSPKANFSILFSVILISILLFIGILIYVLDYYHKKNAMENANTSAILSGASKI 68 Query: 64 LNQEN--GNNGKKQKNDFSYRIIKNIWQTDFRNEL---RENGFAQDINNIERSTSLSIII 118 +++ + G+N + + ++ + L + N I +++ +SI Sbjct: 69 ISRISYFGDNMSSHTHRAIVDDVTRFIKSYIKESLLMDSSVFDISEKNIISQNSKVSITR 128 Query: 119 DD----------------QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT---SSV 159 + + Y++S + Y+ + F N +I+ + V Sbjct: 129 EPHPNVFHEFNNQSILQNKKTFYHISVETFYD--YHIKFFDNLLNKKINSKIISFVPALV 186 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGP---------------GMDKLGVATRSIREMLD 204 KI + + +V+D+S SM+ K+ +++ LD Sbjct: 187 KIDTGEHPFFFVQLVVDLSASMSCLMNSDPEHATEFSVCGKSKKNSKMDALKKAVLLFLD 246 Query: 205 IIKSIPD-VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLE 260 + + GL +++++ + +WG +++ I + T STP ++ Sbjct: 247 SVDRGSKTQKDTHYIGLTGYTTRVEKNIEPSWGTGKVRKYIVEEIDVNMLGQTDSTPAMK 306 Query: 261 YAYNKIFDAKEKL-------EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 AY + K++ + I ++K++IFLTDGEN+ P D K ++ C +AK Sbjct: 307 KAYQILTSDKKRNFIRNILHKRIKIPPLPFQKFLIFLTDGENNDPKSDVK-TIKICEKAK 365 Query: 314 RRGAIVYAIGVQAEA-ADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + I + A A + LK C S P+ +Y+V ++ L F I + Sbjct: 366 KNSIKILTISINASANGKRLLKKCVSAPEYYYNVVDTGSLLRVFQDISTLITH 418 >gi|31789431|gb|AAP58546.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 327 Score = 142 bits (358), Expect = 7e-32, Method: Composition-based stats. Identities = 60/251 (23%), Positives = 106/251 (42%), Gaps = 33/251 (13%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 P A + L I V S++D+ GLD++++LD+S SM + G G T Sbjct: 54 FVPLSAAAVLIGLAIMQPVIPYSQADLRSRGLDIVLLLDLSSSMQEEMGSGQSLKTGTTA 113 Query: 198 SIREMLDIIK----SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 + R +D +K + R GLV FS PL + Q++ + + ++ Sbjct: 114 AGRTRMDAVKDAVRTFVRGRRDDRIGLVVFSDNAYVISPLTFDHQYLLDYLGFVDGEILL 173 Query: 250 G-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T GL A + + A+GH ++ TDGE++ ++ + Sbjct: 174 GEGQTAIGDGLALASAVL---ARQAGRDARGHQ----VVVLFTDGESNRG----RDPIEV 222 Query: 309 CNEAKRRGAIVYAIGVQAEA------ADQFLKN--CASPDRFYSVQNSRKLHDAFLRIGK 360 EAK G V+ IGV +A Q L+ A+ R+++ + R L A I Sbjct: 223 VGEAKSAGIRVHVIGVDLDAEVKTRPGVQLLRRGVVAAGGRYFAADSERDLLTASRTI-- 280 Query: 361 EMVKQRILYNK 371 + +++ +L ++ Sbjct: 281 DAMEKGVLVSR 291 >gi|312881786|ref|ZP_07741560.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] gi|309370537|gb|EFP98015.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] Length = 323 Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats. Identities = 51/251 (20%), Positives = 88/251 (35%), Gaps = 52/251 (20%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMD 190 + + P+ V K D+M+VLD+S SM+ D G +D Sbjct: 55 TLSILLWCALITALARPVWYGEPVTTQPKHR---DLMLVLDLSYSMSQEDMQDSSGNYID 111 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF- 249 +L +++ R GLV F+ PL I E++N L+ Sbjct: 112 RLTAVK-------NVVSQFAQQRKGDRLGLVLFADHAYLQTPLTLDRNTISEQVNSLVLQ 164 Query: 250 --GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G T G+ A D+ D ++ +I L+DG N+S +D + Sbjct: 165 LIGQKTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTSGVLDP---IE 210 Query: 308 YCNEAKRRGAIVYAIGVQAE-------------------AADQFLKNC-ASPDRFYSVQN 347 N AK+ A +Y IGV A + + +++ +N Sbjct: 211 AANIAKKYNATIYTIGVGAGEMMVKDFFMTRKVNTAQDLDEKTLMSIAKITGGQYFRARN 270 Query: 348 SRKLHDAFLRI 358 +++L + I Sbjct: 271 AQELATIYDTI 281 >gi|109897980|ref|YP_661235.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] gi|109700261|gb|ABG40181.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] Length = 343 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 63/264 (23%), Positives = 105/264 (39%), Gaps = 32/264 (12%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSD 166 +S +L + Q + + I T W A ++S P + V I ++ Sbjct: 28 VQSAALRVPHLPQGIQNVSKPPTSRKAFLIIATIGWVALVSASARPQWLGEPVSIPAE-- 85 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS----IPDVNNVVRSGLVT 222 G D+M+ +D+S SM +D + V R + + L +IKS R GL+ Sbjct: 86 -GRDLMIAVDLSGSM------KIDDMQVNGRQV-DRLQMIKSVLHDFIQRRIGDRLGLIF 137 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F+ PL + + + + +N + G + T I DA K ++ Sbjct: 138 FADTAYLQAPLTYDRETVSQLLNESLIGLVGEQT--------AIGDAIGLAIKRFKSKEE 189 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 K +I LTDG+N++ NI +++ A G +Y IGV A+ Q L R Sbjct: 190 SNKVLILLTDGQNTAGNITPEQANEL---AINNGVTLYTIGVGAD---QMLVQSIFGSR- 242 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 V S++L + L E R Sbjct: 243 -QVNPSQELDEGMLTTLAESTGGR 265 >gi|209809314|ref|YP_002264852.1| hypothetical protein VSAL_II0524 [Aliivibrio salmonicida LFI1238] gi|208010876|emb|CAQ81278.1| putative membrane protein [Aliivibrio salmonicida LFI1238] Length = 320 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 50/256 (19%), Positives = 95/256 (37%), Gaps = 51/256 (19%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GP 187 R + + + + P+ V I + DMM+V+D+S SM++ Sbjct: 49 RVNIGLMSIAWVLLVGALARPVWYGDPVDIQPEHR---DMMLVVDLSGSMSEEDMKTDSG 105 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +D+L R + + ++ R GLV F PL + +QE++NR Sbjct: 106 FVDRLTAVKRVVSDFIE-------KRKGDRLGLVLFGDHAYLQTPLTFDRNTVQEQLNRT 158 Query: 248 IF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 + G T GL A ++ ++ II L+DG N++ +D Sbjct: 159 VLGLVGQRTAIGEGLGLATKTFIESN-----------APQRTIILLSDGANTAGVLDP-- 205 Query: 305 SLFYCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFYS 344 + AK A +Y +G+ A + + L A + +++ Sbjct: 206 -IEAAQLAKDNNAKIYTVGIGAGEMQVRGFFGNQTVNTARDLDEDTLTKIATMTGGQYFR 264 Query: 345 VQNSRKLHDAFLRIGK 360 +N+ +L + + I K Sbjct: 265 ARNADELAEIYQTIDK 280 >gi|320158179|ref|YP_004190557.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933491|gb|ADV88354.1| BatA [Vibrio vulnificus MO6-24/O] Length = 323 Score = 126 bits (315), Expect = 7e-27, Method: Composition-based stats. Identities = 48/252 (19%), Positives = 96/252 (38%), Gaps = 53/252 (21%) Query: 138 IFCTFPWC--ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDK 191 + WC +S P+ ++ + D+M+V+D+S SM G +D+ Sbjct: 56 LITLLIWCSLIVASARPVWFGEPIEHFPEYR---DLMLVVDLSGSMQQEDILQDGDYIDR 112 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 L + + ++ R GLV F+ PL Q + ++N+ I Sbjct: 113 LSAVKNVVTQFIEQ-------RQGDRLGLVLFADHAYLQTPLTADRQTVANQLNQTIIGL 165 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T GL A D+ + ++ +I L+DG N++ +D + Sbjct: 166 IGQKTAIGDGLALATKTFVDS-----------EAPQRVVILLSDGSNTAGTLDP---IEA 211 Query: 309 CNEAKRRGAIVYAIGVQ------------------AEAADQFLKNCA--SPDRFYSVQNS 348 N AK+ G +Y IG+ A+ ++ L A + +++ +++ Sbjct: 212 ANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRARDA 271 Query: 349 RKLHDAFLRIGK 360 ++L + I + Sbjct: 272 QELQAIYQAINQ 283 >gi|330447847|ref|ZP_08311495.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492038|dbj|GAA05992.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 321 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 49/241 (20%), Positives = 94/241 (39%), Gaps = 52/241 (21%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREM 202 ++ P+ + ++I + DM++ +D+S SM+ G +D+L + + Sbjct: 65 AAARPVWYGNPIEIKPEHR---DMLLAVDLSGSMSIPDMVTKNGQSIDRLTAVKHVLSDF 121 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGL 259 ++ R GLV F+ PL + ++++++R + G +T GL Sbjct: 122 IE-------KRKGDRLGLVLFADHAYLQTPLTFDRNTVEQQLDRTVLGLIGQSTAIGEGL 174 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A ++K ++ II L+DG N+S ID L AK G + Sbjct: 175 GIATKTFINSK-----------APQRVIILLSDGANTSGVIDP---LEAAKLAKESGVKI 220 Query: 320 YAIGVQAEA-------ADQF-----------LKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 Y +GV A+ D+ L A + ++ +N ++L + I Sbjct: 221 YTVGVGADQMVQKGFFGDRLVNPSQDLDEKTLTEIAKMTGGEYFRARNPQQLEKIYDIIN 280 Query: 360 K 360 K Sbjct: 281 K 281 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 73/429 (17%), Positives = 146/429 (34%), Gaps = 93/429 (21%) Query: 3 FLNIRNFF----YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLY 58 F R + G++ ++ A+ + VIF +G I+ S + + +L D ++L Sbjct: 11 FAGFRRRLGESCRDQSGNVIMIFALSVFVIFGFVGAAIDFSRVDYARRRLQDAADSAVLR 70 Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII 118 + + + + G F+ G ++R + +II Sbjct: 71 AMA-LKSATDESRGVAADKAFAENF-------------GHPGVYDLNGALKREVNENIIS 116 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 + Y + A F P+ + S K S ++ VLD + Sbjct: 117 ----QTYTVHATVS-------SYFGAFFGKDSYPVTVVSQAKTSLDV---FEIAFVLDTT 162 Query: 179 LSMND-----HFGPGMD-KLGVATRSIREMLDIIKSIPDVNNVVRS-----------GLV 221 SM + + +D + ++ + + ++ N VR GL Sbjct: 163 GSMAEANKMPNLKSSVDSAMAGLLQNGKNLSGSKIAVVPFNTQVRLSDATVTTMSSQGLS 222 Query: 222 TFSSKIVQT------------------------------------FPLAWGVQHIQEKIN 245 + V L+ + + I Sbjct: 223 SGWGNCVHDRDLATSHDVSASAAQKGKAQTLYPLETCDEASLKPVQGLSDNISSARNFIK 282 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS------SPN 299 L G T T G+++ + + + G +K++I +TDG+N+ S + Sbjct: 283 TLQPGGYTNVTMGVQWGMEVLSPNQPFSDATEFGSTKARKFMIVVTDGDNTKSFTSWSAS 342 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRI 358 + +K + C AK +G VY + + + L+ CAS P+ FY + ++ +L+ A I Sbjct: 343 VIDKRTALACENAKAKGITVYTVKI-IQGNSNMLRKCASAPEYFYDLTSANQLNAAMSGI 401 Query: 359 GKEMVKQRI 367 K + K R+ Sbjct: 402 FKSINKTRL 410 >gi|269104787|ref|ZP_06157483.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] gi|268161427|gb|EEZ39924.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] Length = 321 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 87/241 (36%), Gaps = 52/241 (21%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-----NDHFGPGMDKLGVATRSIREM 202 + P+ ++I DMM+ +D+S SM G +D+L + Sbjct: 65 ALARPVWYGDPIEIKPDHR---DMMLAVDLSGSMAIKDMQTQSGQSIDRLTAIKHVLSNF 121 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGL 259 ++ R GLV F PL + ++++++R + G +T GL Sbjct: 122 IE-------KRKGDRLGLVLFGDHAYLQTPLTFDRHTVEQQLDRTVLGLVGQSTAIGEGL 174 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A +K ++ II L+DG N++ ID L AK G + Sbjct: 175 GIATKTFIKSK-----------APQRVIILLSDGANTAGVIDP---LEAAKLAKESGVTI 220 Query: 320 YAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 Y +G+ A+ + L A + +++ +N ++L + I Sbjct: 221 YTVGIGADEMLQRSIFGVQKVNPSQDLDEKTLTKIAQMTGGKYFRARNPQELDKIYQIIN 280 Query: 360 K 360 + Sbjct: 281 Q 281 >gi|90577284|ref|ZP_01233095.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] gi|90440370|gb|EAS65550.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] Length = 321 Score = 124 bits (311), Expect = 2e-26, Method: Composition-based stats. Identities = 48/241 (19%), Positives = 94/241 (39%), Gaps = 52/241 (21%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREM 202 ++ P+ + ++I + DM++ +D+S SM+ G +D+L + + Sbjct: 65 AAARPVWYGNPIEIKPEHR---DMLLAVDLSGSMSIPDMVTKNGQSIDRLTAVKHVLSDF 121 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGL 259 ++ R GLV F+ PL + + ++++++R + G +T GL Sbjct: 122 IE-------KRKGDRLGLVLFADHAYLQTPLTFDRKTVEQQLDRTVLGLIGQSTAIGEGL 174 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A ++K ++ II L+DG N+S ID L AK G + Sbjct: 175 GIATKTFINSK-----------APQRVIILLSDGANTSGVIDP---LEAAKLAKESGVKI 220 Query: 320 YAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 Y +GV A+ + L A + ++ +N ++L + I Sbjct: 221 YTVGVGADQMVQQGFFGDRIVNPSQDLDEKTLTEIAKMTGGEYFRARNPQQLEKIYDIIN 280 Query: 360 K 360 K Sbjct: 281 K 281 >gi|37676036|ref|NP_936432.1| hypothetical protein VVA0376 [Vibrio vulnificus YJ016] gi|37200576|dbj|BAC96402.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 323 Score = 124 bits (311), Expect = 2e-26, Method: Composition-based stats. Identities = 48/252 (19%), Positives = 96/252 (38%), Gaps = 53/252 (21%) Query: 138 IFCTFPWC--ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDK 191 + WC +S P+ ++ + D+M+V+D+S SM G +D+ Sbjct: 56 LVTLLIWCSLIVASARPVWFGEPIEHFPEYR---DLMLVVDLSGSMQQEDILQDGDYIDR 112 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 L + + ++ R GLV F+ PL Q + ++N+ I Sbjct: 113 LSSVKNVVTQFIEQ-------RQGDRLGLVLFADHAYLQTPLTADRQTVANQLNQTIIGL 165 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T GL A D+ + ++ +I L+DG N++ +D + Sbjct: 166 IGQKTAIGDGLALATKTFVDS-----------EAPQRVVILLSDGSNTAGTLDP---IEA 211 Query: 309 CNEAKRRGAIVYAIGVQ------------------AEAADQFLKNCA--SPDRFYSVQNS 348 N AK+ G +Y IG+ A+ ++ L A + +++ +++ Sbjct: 212 ANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRARDA 271 Query: 349 RKLHDAFLRIGK 360 ++L + I + Sbjct: 272 QELQTIYQAINQ 283 >gi|27367909|ref|NP_763436.1| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|27359482|gb|AAO08426.1| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 323 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 48/252 (19%), Positives = 96/252 (38%), Gaps = 53/252 (21%) Query: 138 IFCTFPWC--ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDK 191 + WC +S P+ ++ + D+M+V+D+S SM G +D+ Sbjct: 56 LVTLLIWCSLIVASARPVWFGEPIEHFPEYR---DLMLVVDLSGSMQQADILQDGDYIDR 112 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 L + + ++ R GLV F+ PL Q + ++N+ I Sbjct: 113 LSAVKNVVTQFIEQ-------RQGDRLGLVLFADHAYLQTPLTADRQTVANQLNQTIIGL 165 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T GL A D+ + ++ +I L+DG N++ +D + Sbjct: 166 IGQKTAIGDGLALATKTFVDS-----------EAPQRVVILLSDGSNTAGTLDP---IEA 211 Query: 309 CNEAKRRGAIVYAIGVQ------------------AEAADQFLKNCA--SPDRFYSVQNS 348 N AK+ G +Y IG+ A+ ++ L A + +++ +++ Sbjct: 212 ANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKVATMTGGQYFRARDA 271 Query: 349 RKLHDAFLRIGK 360 ++L + I + Sbjct: 272 QELQTIYQAINQ 283 >gi|149187170|ref|ZP_01865468.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] gi|148838706|gb|EDL55645.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] Length = 324 Score = 123 bits (308), Expect = 5e-26, Method: Composition-based stats. Identities = 45/250 (18%), Positives = 92/250 (36%), Gaps = 51/250 (20%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDK 191 + ++ P+ ++I K DMM+V+D+S SM+ +D+ Sbjct: 55 TIAIMAWVSLVVAAARPIWYGEPIEIQPKHR---DMMLVIDLSYSMSQQDMAYQDDYIDR 111 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF-- 249 L + + +D R GLV F+ PL + + ++ ++N+ + Sbjct: 112 LTAVKHVVSDFVD-------RRKGDRVGLVYFADHAYLQTPLTFDRETVKTQLNQTVLKL 164 Query: 250 -GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G+ T G+ A D+ ++ +I L+DG N++ +D ++ Sbjct: 165 IGTQTAIGDGIGLATKTFVDSN-----------APQRVMILLSDGSNNAGVLDPVQA--- 210 Query: 309 CNEAKRRGAIVYAIGVQAE-------------------AADQFLKNC-ASPDRFYSVQNS 348 AK+ G +Y IGV A +K + +++ +N+ Sbjct: 211 AEIAKKYGTTIYTIGVGAGEMQVKDFFMTRTVNTAEDLDEKTLIKIANITGGQYFRARNA 270 Query: 349 RKLHDAFLRI 358 +L + I Sbjct: 271 DELATIYDTI 280 >gi|89072369|ref|ZP_01158948.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] gi|89051901|gb|EAR57353.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] Length = 321 Score = 123 bits (308), Expect = 5e-26, Method: Composition-based stats. Identities = 48/241 (19%), Positives = 95/241 (39%), Gaps = 52/241 (21%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREM 202 ++ P+ + ++I + DM++ +D+S SM+ G +D+L + + Sbjct: 65 AAARPVWYGNPIEIKPEHR---DMLLAVDLSGSMSIPDMVTKNGQSVDRLTAVKHVLSDF 121 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGL 259 ++ R GLV F+ PL + + ++++++R + G +T GL Sbjct: 122 IE-------KRKGDRLGLVLFADHAYLQTPLTFDRKTVEKQLDRTVLGLIGQSTAIGEGL 174 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A ++K ++ II L+DG N+S ID L AK G + Sbjct: 175 GIATKTFINSK-----------APQRVIILLSDGANTSGVIDP---LEAAKLAKESGVKI 220 Query: 320 YAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 Y +GV A+ + L + A + ++ +N ++L + I Sbjct: 221 YTVGVGADQMVQQGFFGDRIVNPSQDLDEKTLTDIAKMTGGEYFRARNPQQLEKIYDIIN 280 Query: 360 K 360 K Sbjct: 281 K 281 >gi|269962784|ref|ZP_06177125.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269832474|gb|EEZ86592.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 353 Score = 122 bits (306), Expect = 9e-26, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 101/269 (37%), Gaps = 54/269 (20%) Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 D++H + V ++ + + P+ V+ K DMM+V+D+S Sbjct: 72 DNKHSNKPKQVVQKF---LSIGVWALLVVACARPVWYGDPVEFQPKYR---DMMLVVDLS 125 Query: 179 LSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 SM + G +D+L R + + ++ R G+V F PL Sbjct: 126 GSMQKEDMNDNGEYIDRLTTVKRVLSDFVE-------KRQGDRLGVVLFGDHAYLQTPLT 178 Query: 235 WGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + ++IN+ + G T G+ D+ D ++ +I L+ Sbjct: 179 ADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLS 227 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------------------EAADQFL 333 DG N++ +D L AK+ A +Y +GV A + +Q L Sbjct: 228 DGSNTAGVLDP---LEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASDLDEQTL 284 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A + +++ +++++L + I + Sbjct: 285 TKIAEMTGGKYFRARDAKELETIYDTINQ 313 >gi|332307030|ref|YP_004434881.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] gi|332174359|gb|AEE23613.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] Length = 338 Score = 122 bits (306), Expect = 9e-26, Method: Composition-based stats. Identities = 55/264 (20%), Positives = 100/264 (37%), Gaps = 53/264 (20%) Query: 137 FIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + W A +S P + V I ++ G D+M+ +D+S SM +D + V Sbjct: 56 LLVAALAWIALVGASARPQWLGEPVSIPAQ---GRDLMIAVDLSGSM------KIDDMQV 106 Query: 195 ATRSIREMLDIIKS----IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 R + + L +IKS R GL+ F+ PL + + + + + + G Sbjct: 107 NGRQV-DRLQMIKSVLHDFIQRRVGDRLGLIFFADTAYLQAPLTYDRETVSQLLGESLIG 165 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 + T I DA + + K +I LTDG+N++ NI +++ Sbjct: 166 LVGEQT--------AIGDAIGLAIKRFQSKKESNKVLILLTDGQNTAGNISPQQANEL-- 215 Query: 311 EAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRK 350 A G +Y IGV A+ + L A + R++ +++ Sbjct: 216 -AINNGVTLYTIGVGADQMMVQSIFGSRQVNPSQELDESMLTQLAESTGGRYFRARDAES 274 Query: 351 LHDAFLR------IGKEMVKQRIL 368 L + + I +E + R L Sbjct: 275 LKAIYDKLDELEPIARESRQMRPL 298 >gi|325954650|ref|YP_004238310.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437268|gb|ADX67732.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 338 Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats. Identities = 60/260 (23%), Positives = 94/260 (36%), Gaps = 55/260 (21%) Query: 132 RYEMPFI-FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGM 189 R +P + + C + P ++ S KI S G+D+M+ +D SLSM P Sbjct: 55 RPLLPILRYLALALCIVALARPRIVDVSTKIKSDK--GVDIMLTVDTSLSMLARDLEP-- 110 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 D+L + R GLV++S + + PL + + +IN L Sbjct: 111 DRLTALKAVAVK-------FSKERQADRLGLVSYSGEALTRVPLTTDREVLIREINALES 163 Query: 250 GS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDN--K 303 G T GL A N I D+K K K II +TDG E+ +P D Sbjct: 164 GELEDGTAIGIGLATAINHIKDSKAK-----------SKVIILMTDGVESINPTNDLMYI 212 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEA-----------------------ADQFLKNCA--S 338 A RG VY IG+ ++ L+N A + Sbjct: 213 SPQTAAEMATSRGIKVYTIGIGTRGLAPFPTAYDMYGNYIFDMMPVDIDEKLLQNIADLT 272 Query: 339 PDRFYSVQNSRKLHDAFLRI 358 ++ +++ L + I Sbjct: 273 GGLYFRATDNQSLQKIYQEI 292 >gi|59713864|ref|YP_206639.1| hypothetical protein VF_A0681 [Vibrio fischeri ES114] gi|59482112|gb|AAW87751.1| hypothetical membrane spanning protein [Vibrio fischeri ES114] Length = 321 Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats. Identities = 47/236 (19%), Positives = 89/236 (37%), Gaps = 52/236 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRSIREMLDI 205 P+ V I + DMM+V+D+S SM + G +D+L + + + +D Sbjct: 68 RPVWYGEPVDIQPEHR---DMMLVVDLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQ 124 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYA 262 R GLV F PL + ++E+++R + G T GL A Sbjct: 125 -------RKGDRLGLVLFGDHAYLQTPLTFDRNTVREQLDRTVLNLVGQRTAIGEGLGLA 177 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ ++ II L+DG N++ ++ L AK A +Y + Sbjct: 178 TKTFIESN-----------APQRTIILLSDGANTAGVLEP---LEAAQLAKDNHAKIYTV 223 Query: 323 GVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 G+ A + + L A + +++ +N+ +L + + I Sbjct: 224 GIGAGEMQVRGFFGKQTVNTARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTI 279 >gi|254505681|ref|ZP_05117827.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] gi|219551334|gb|EED28313.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] Length = 322 Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats. Identities = 53/265 (20%), Positives = 98/265 (36%), Gaps = 53/265 (20%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + I W A + P+ V K D+M+VLD+S S Sbjct: 40 PSHGQGNAPKNGLAKIIAAIVWVALLAAMARPVWYGEPVTTQPKHR---DLMLVLDLSYS 96 Query: 181 MND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 M+ G +D+L + ++ R GLV F+ PL Sbjct: 97 MSKEDMLDDGDYVDRLTAVKK-------VVSDFASKREGDRLGLVLFADHAYLQTPLTLD 149 Query: 237 VQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I E++N+L+ G T G+ A D+ D ++ ++ L+DG Sbjct: 150 RKTIAEQVNQLVLRLIGEKTAIGEGIGLATKTFVDS-----------DAPQRVMVLLSDG 198 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------------------EAADQFLKN 335 N+S +D L AK+ A +Y IGV A + ++ L + Sbjct: 199 SNTSGVLDP---LEAAKIAKKYNATIYTIGVGAGEMVVKEFFMTRKVNTAQDLDERTLMD 255 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRI 358 A + +++ +++++L + I Sbjct: 256 IAQVTGGQYFRARDAKELATIYDTI 280 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats. Identities = 34/116 (29%), Positives = 51/116 (43%), Gaps = 6/116 (5%) Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFYCNEAKRRG 316 A N + E H K KKYI+F+TDG+N++ + + + C++AK +G Sbjct: 4 TAKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKG 63 Query: 317 AIVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 +Y I A A Q L CAS D ++ + L AF IG + Q Sbjct: 64 IEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFEAIGAKSAAQVTRLT 119 >gi|156976371|ref|YP_001447277.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] gi|156527965|gb|ABU73050.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] Length = 334 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 48/251 (19%), Positives = 93/251 (37%), Gaps = 51/251 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKL 192 + + P+ V+ K DMM+V+D+S SM + G +D+L Sbjct: 68 LSIGVWALLVVACARPVWYGDPVEFQPKYR---DMMLVVDLSGSMQKEDMNDNGEYIDRL 124 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF--- 249 R + + ++ R G+V F PL + + ++IN+ + Sbjct: 125 TAVKRVLSDFVE-------KRQGDRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLV 177 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T G+ D+ D ++ +I L+DG N++ +D L Sbjct: 178 GQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAA 223 Query: 310 NEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSR 349 AK+ A +Y +GV A E +Q L A + +++ ++++ Sbjct: 224 EIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASELDEQTLTKIAEMTGGKYFRARDAK 283 Query: 350 KLHDAFLRIGK 360 +L + I + Sbjct: 284 ELETIYDTINQ 294 >gi|84385370|ref|ZP_00988402.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] gi|84379967|gb|EAP96818.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] Length = 319 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 48/262 (18%), Positives = 98/262 (37%), Gaps = 53/262 (20%) Query: 128 SAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH- 184 + +P W + P+ V+ K D+M+V+D+S SM+ Sbjct: 42 TKTPSSRLPKALSVIIWLLLVTAMARPVWYGEPVEFQPKHR---DLMLVVDLSGSMSQED 98 Query: 185 ---FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G +D+L + + ++ R GLV F+ PL + Sbjct: 99 MQFNGEYIDRLSAVKHVLSDFIE-------RRKGDRVGLVLFADHAYLQTPLTLDRDTLS 151 Query: 242 EKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 +++N+ + G+ T G+ A D+ D ++ +I L+DG N++ Sbjct: 152 QQLNQAVLKLIGTQTAIGDGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAG 200 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--S 338 +D L + AK+ A +Y +GV A + ++ L A + Sbjct: 201 VLDP---LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRT 257 Query: 339 PDRFYSVQNSRKLHDAFLRIGK 360 +++ ++S++L + I + Sbjct: 258 GGQYFRARDSKELATIYDTINQ 279 >gi|153831781|ref|ZP_01984448.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148872291|gb|EDL71108.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 334 Score = 119 bits (299), Expect = 5e-25, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 101/269 (37%), Gaps = 54/269 (20%) Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 D++H + V ++ + + P+ V+ K DMM+V+D+S Sbjct: 53 DNKHSNKPKQVVQKF---LSIGVWSLLVVACARPVWYGDPVEFQPKYR---DMMLVVDLS 106 Query: 179 LSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 SM + G +D+L R + + ++ R G+V F PL Sbjct: 107 GSMQKEDMNDNGEYIDRLTAVKRVLSDFVE-------KRQGDRLGVVLFGDHAYLQTPLT 159 Query: 235 WGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + ++IN+ + G T G+ D+ D ++ +I L+ Sbjct: 160 ADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLS 208 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------------------EAADQFL 333 DG N++ +D L AK+ A +Y +GV A + +Q L Sbjct: 209 DGSNTAGVLDP---LEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASDLDEQTL 265 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A + +++ +++++L + I + Sbjct: 266 TKIAEMTGGKYFRARDAKELETIYDTINQ 294 >gi|323493530|ref|ZP_08098652.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] gi|323312353|gb|EGA65495.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] Length = 322 Score = 119 bits (298), Expect = 7e-25, Method: Composition-based stats. Identities = 50/279 (17%), Positives = 98/279 (35%), Gaps = 52/279 (18%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 ++ ST + + Q + ++ + ++ P+ V K Sbjct: 27 SVTASTPIRLPYLPQSNSGKAPKNNVAKL-LATLFWLSLLAAAARPVWYGEPVTTQPKHR 85 Query: 167 IGLDMMMVLDVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D+M+V+D+S SM+ +D+L + ++ R GLV Sbjct: 86 ---DLMLVVDLSYSMSQKDMLSQDDYIDRLTAVKK-------VVSDFAQQREGDRLGLVL 135 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F+ PL I +++ L+ G T G+ A D+ Sbjct: 136 FADHAYLQTPLTLDRNTIAKQVESLVLRLIGDKTAIGEGIGLATKTFIDS---------- 185 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------------- 326 D ++ +I L+DG N+S +D + AK+ A +Y IGV A Sbjct: 186 -DAPQRVMILLSDGSNTSGVLDP---IEAAKIAKKYNATIYTIGVGAGEMMVKEFFMTRK 241 Query: 327 -----EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + + L+ A + +++ +N+ +L + I Sbjct: 242 VNTAKDLDENTLQEIADLTGGQYFRARNADELATIYDTI 280 >gi|54303502|ref|YP_133495.1| hypothetical protein PBPRB1845 [Photobacterium profundum SS9] gi|46916932|emb|CAG23695.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 321 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 90/242 (37%), Gaps = 52/242 (21%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-----FGPGMDKLGVATRSIRE 201 + P+ +++ DM++V+D+S SM+ G +D+L Sbjct: 64 GALARPVWYGDPIEVQPDHR---DMLLVVDLSGSMSIEDMIIKNGESIDRLAAVK----- 115 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPG 258 D++ + R GLV F+ PL + +++++ R + G +T G Sbjct: 116 --DVLAEFIEQRKGDRLGLVLFAQHAYLQTPLTFDRNTVKQQLERTVLGLIGQSTAIGEG 173 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L A ++ + ++ II L+DG N++ I+ L A Sbjct: 174 LGIATKTFINS-----------EAPQRVIILLSDGANTAGVIEP---LEAAKLAAESNVT 219 Query: 319 VYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 +Y +GV AE ++ L A + +++ +N ++L + I Sbjct: 220 IYTVGVGAEEMIQKSFFGNRKVNPSQDLDERMLTKIADMTGGQYFRARNPQELEHIYQLI 279 Query: 359 GK 360 + Sbjct: 280 DQ 281 >gi|261250853|ref|ZP_05943427.1| protein BatA [Vibrio orientalis CIP 102891] gi|260937726|gb|EEX93714.1| protein BatA [Vibrio orientalis CIP 102891] Length = 322 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 48/278 (17%), Positives = 100/278 (35%), Gaps = 52/278 (18%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 ++++ + + + + + + P+ V K Sbjct: 28 LKQAAPVRLPYLPESGTAVAPTNTLARVLVAITWIAL-IAALARPVWYGDPVTTQPKHR- 85 Query: 168 GLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D+M+VLD+S SM+ + G +D+L +++ R GLV F Sbjct: 86 --DLMLVLDLSYSMSQEDMQEGDQYIDRLSAVK-------NVVSDFVKQREGDRLGLVLF 136 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + PL + I +++N L+ G T G+ A D+ Sbjct: 137 ADHAYLQTPLTLDRETISDQVNSLVLRLIGDKTAIGEGIGLATKTFVDS----------- 185 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-------------- 326 + ++ ++ L+DG N+S ++ L AK+ A +Y IG+ A Sbjct: 186 EAPQRVMVLLSDGSNTSGVLEP---LEAARIAKKYNATIYTIGIGAGEMMVKEFFMTRKV 242 Query: 327 ----EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + ++ LK A + +++ +N+ +L + I Sbjct: 243 NTAKDLDEKTLKQIADLTGGQYFRARNADELATIYDTI 280 >gi|86144576|ref|ZP_01062908.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] gi|85837475|gb|EAQ55587.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] Length = 330 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 48/262 (18%), Positives = 98/262 (37%), Gaps = 53/262 (20%) Query: 128 SAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH- 184 + +P W + P+ V+ K D+M+V+D+S SM+ Sbjct: 53 TKTPSSRLPKALSVIIWLLLVTAMARPVWYGEPVEFQPKHR---DLMLVVDLSYSMSQED 109 Query: 185 ---FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G +D+L + + ++ R GLV F+ PL + Sbjct: 110 MQFNGEYIDRLSAVKHVLSDFIE-------RRKGDRVGLVLFADHAYLQTPLTLDRDTLS 162 Query: 242 EKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 +++N+ + G+ T G+ A D+ D ++ +I L+DG N++ Sbjct: 163 QQLNQAVLKLIGTQTAIGDGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAG 211 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--S 338 +D L + AK+ A +Y +GV A + ++ L A + Sbjct: 212 VLDP---LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRT 268 Query: 339 PDRFYSVQNSRKLHDAFLRIGK 360 +++ ++S++L + I + Sbjct: 269 GGQYFRARDSKELATIYDTINQ 290 >gi|218675994|ref|YP_002394813.1| hypothetical protein VS_II0212 [Vibrio splendidus LGP32] gi|218324262|emb|CAV25554.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 347 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 48/262 (18%), Positives = 98/262 (37%), Gaps = 53/262 (20%) Query: 128 SAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH- 184 + +P W + P+ V+ K D+M+V+D+S SM+ Sbjct: 70 TKTPSSRLPKALSVIIWLLLVTAMARPVWYGEPVEFQPKHR---DLMLVVDLSYSMSQED 126 Query: 185 ---FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G +D+L + + ++ R GLV F+ PL + Sbjct: 127 MQFNGEYIDRLSAVKHVLSDFIE-------RRKGDRVGLVLFADHAYLQTPLTLDRDTLS 179 Query: 242 EKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 +++N+ + G+ T G+ A D+ D ++ +I L+DG N++ Sbjct: 180 QQLNQAVLKLIGTQTAIGDGIGLATKTFVDS-----------DAPQRVMILLSDGSNTAG 228 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--S 338 +D L + AK+ A +Y +GV A + ++ L A + Sbjct: 229 VLDP---LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRT 285 Query: 339 PDRFYSVQNSRKLHDAFLRIGK 360 +++ ++S++L + I + Sbjct: 286 GGQYFRARDSKELATIYDTINQ 307 >gi|197336671|ref|YP_002158318.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197313923|gb|ACH63372.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 321 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 47/236 (19%), Positives = 89/236 (37%), Gaps = 52/236 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRSIREMLDI 205 P+ V I + DMM+V+D+S SM + G +D+L + + + +D Sbjct: 68 RPVWYGDPVDIQPEHR---DMMLVVDLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQ 124 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYA 262 R GLV F PL + ++E+++R + G T GL A Sbjct: 125 -------RKGDRLGLVLFGDHAYLQTPLTFDRNTVREQLDRTVLRLVGQMTAMGEGLGLA 177 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ ++ II L+DG N++ ++ L AK A +Y + Sbjct: 178 TKTFIESN-----------APQRTIILLSDGANTAGVLEP---LEAAQLAKDNHAKIYTV 223 Query: 323 GVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 G+ A + + L A + +++ +N+ +L + + I Sbjct: 224 GIGAGEMQVRGFFGKQTVNTARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTI 279 >gi|148974032|ref|ZP_01811565.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] gi|145965729|gb|EDK30977.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] Length = 330 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 48/262 (18%), Positives = 99/262 (37%), Gaps = 53/262 (20%) Query: 128 SAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH- 184 + +P W +S P+ V+ K D+M+V+D+S SM+ Sbjct: 53 TKTPSTRLPKALSVIIWLLLVTASARPVWYGEPVEFQPKHR---DLMLVVDLSYSMSQED 109 Query: 185 ---FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G +D+L + + ++ R GLV F+ PL + Sbjct: 110 MQFNGEYIDRLSAVKHVLSDFIE-------RRKGDRVGLVLFADHAYLQTPLTLDRDTLS 162 Query: 242 EKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 +++N+ + G+ T G+ A D+ D ++ ++ L+DG N++ Sbjct: 163 QQLNQAVLRLIGNQTAIGDGIGLATKTFVDS-----------DAPQRVMVLLSDGSNTAG 211 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--S 338 +D L + AK+ A +Y +GV A + ++ L A + Sbjct: 212 VLDP---LEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRT 268 Query: 339 PDRFYSVQNSRKLHDAFLRIGK 360 +++ ++S++L + I + Sbjct: 269 GGQYFRARDSKELATIYDTINQ 290 >gi|90414549|ref|ZP_01222523.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] gi|90324356|gb|EAS40922.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] Length = 321 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 89/242 (36%), Gaps = 52/242 (21%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-----FGPGMDKLGVATRSIRE 201 + P+ +++ DM++ +D+S SM+ G +D+L Sbjct: 64 GALARPVWYGDPIEVQPDHR---DMLLAVDLSGSMSIEDMITQSGESIDRLAAVK----- 115 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPG 258 D++ + R GLV F+ PL + +++++ R + G +T G Sbjct: 116 --DVLAEFIEQRKGDRLGLVLFAQHAYLQTPLTFDRNTVKQQLERTVLGLIGQSTAIGEG 173 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L A ++ + ++ II L+DG N++ I+ L A Sbjct: 174 LGIATKTFINS-----------EAPQRVIILLSDGANTAGVIEP---LEAAKLAAESNVT 219 Query: 319 VYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 +Y +GV AE ++ L A + +++ +N ++L + I Sbjct: 220 IYTVGVGAEEMIQKSFFGNRKVNPSQDLDERMLTKIADMTGGQYFRARNPQELEHIYQLI 279 Query: 359 GK 360 + Sbjct: 280 DQ 281 >gi|32477945|ref|NP_870939.1| hypothetical protein RB13237 [Rhodopirellula baltica SH 1] gi|32448502|emb|CAD78017.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 388 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 62/381 (16%), Positives = 122/381 (32%), Gaps = 45/381 (11%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 + G+ ++ ILLPV+ V I + + +L D + + + Sbjct: 29 LRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVTGDK 88 Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + + F + N E + D + +L A Sbjct: 89 AEAIEAAERLLEA--NPYLDRTLSIGDADIIFGKS-NRTEENRRYEFTPDKKVNSVSLRA 145 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG--- 186 ++P +F T +K + + + LD+ +VLD S SM Sbjct: 146 FGADDVPMLFPTMGVPIEFR--------PIKQAVATQVELDIAIVLDRSGSMAFSHDEVA 197 Query: 187 --------PGMDKLGVATRSIREMLDII-------KSIPDVNNVVRSGLVTFSSKIVQTF 231 P K+G A LD + + D ++ R L T+S K Sbjct: 198 KNGSPSSAPPGWKMGHAVPENARWLDTVAAVNGFLDIMEDSSHDERVSLSTYSDKSKADV 257 Query: 232 PLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L I+ +N G T G+ + D K + +I Sbjct: 258 KLTGDYTEIRAAMNAHSTKFKGGATNIGSGILEGGATLGDKKLARSWA-------SRVLI 310 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQ 346 ++DG +++ E + + +++ + EA Q ++ A + + + Sbjct: 311 VMSDGIHNTG----IEPIPAAQQVANEKIMIFTVTFSDEANVQEMEKVAVSGGGQHFHAK 366 Query: 347 NSRKLHDAFLRIGKEMVKQRI 367 +S++L +AF +I K + Sbjct: 367 DSQQLTEAFRKIAKSLPTLIT 387 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 39/151 (25%), Positives = 65/151 (43%), Gaps = 9/151 (5%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + PL+ + I+ + T +T GL + +N + A + Sbjct: 286 NCNLQTIMPLSTNWSALNSHIDAMASAGNTNTTIGLAWGWNMLTQGGPLSS-AAAPAANL 344 Query: 284 KKYIIFLTDGEN------SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K I+FLTDG+N ++ N N + CN K G VY++ V E ++NCA Sbjct: 345 DKVIVFLTDGDNTRNRWSNNSNTINARTTLICNNIKAAGIKVYSVRV-IEGNATLIRNCA 403 Query: 338 S-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + P +YSV + +L F I + + RI Sbjct: 404 TEPGMYYSVTTASELTSVFASIAQSLSNLRI 434 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 75/220 (34%), Gaps = 36/220 (16%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 + + +G+ + + AI + + G ++ S + V+++L LD S L Sbjct: 11 LWQDRRGNFAAIFAIAIIPVVAAAGATVDISRAYIVESRLKAALDASALAVGGAT----G 66 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + Q + ++ F + + ++ + +LS Sbjct: 67 MTTSQMQA-------------------MAQSFFNANYPASKLGVPGTLSVSQSGNVVSLS 107 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 ++ + T + + TS V K L++ +VLD + SM Sbjct: 108 VHAQ-----LPTTLMGVVGINTLNVSATSQVTRMGKK---LEVALVLDNTGSMASG---- 155 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 ++ V + + ++ + + V+ +V F+ + Sbjct: 156 -GRMTVLKTAAKNLITTVSAAATNPGDVKVAIVPFNVDVN 194 >gi|327538644|gb|EGF25299.1| protein containing von Willebrand factor, type A domains [Rhodopirellula baltica WH47] Length = 388 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 61/381 (16%), Positives = 120/381 (31%), Gaps = 45/381 (11%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 + G+ ++ ILLPV+ V I + + +L D + + + Sbjct: 29 LRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVTGDK 88 Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + + F + N E + D + L A Sbjct: 89 AEAIEAAERLLEA--NPYLDRTLSIGDADIIFGKS-NRTEENRRYEFTPDKKVNSVGLRA 145 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG--- 186 ++P +F T +K + + + LD+ +VLD S SM Sbjct: 146 FGADDVPMLFPTMGVPIEFR--------PIKQAVATQVELDIAIVLDRSGSMAFSHDEVA 197 Query: 187 --------PGMDKLGVATRSIREMLDII-------KSIPDVNNVVRSGLVTFSSKIVQTF 231 P K+G A LD + + D ++ R L T+S K Sbjct: 198 KNGSPSSAPPGWKMGHAVPKNARWLDTVAAVNGFLDIMEDSSHDERVSLSTYSDKSKADV 257 Query: 232 PLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L I+ +N G T G+ + D + +I Sbjct: 258 KLTGDYTEIRAAMNAHSTNFKGGATNIGSGILEGGATLGDKNLARSWA-------SRVLI 310 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQ 346 ++DG +++ E + + +++ + EA Q ++ A + + + Sbjct: 311 VMSDGIHNTG----IEPIPAAQQVANEKIMIFTVTFSNEANVQEMEKVAVSGGGQHFHAK 366 Query: 347 NSRKLHDAFLRIGKEMVKQRI 367 +S++L +AF +I K + Sbjct: 367 DSQQLAEAFRKIAKSLPTLIT 387 >gi|260769474|ref|ZP_05878407.1| protein BatA [Vibrio furnissii CIP 102972] gi|260614812|gb|EEX39998.1| protein BatA [Vibrio furnissii CIP 102972] gi|315182004|gb|ADT88917.1| von Willebrand factor type A domain protein [Vibrio furnissii NCTC 11218] Length = 322 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 48/251 (19%), Positives = 95/251 (37%), Gaps = 51/251 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDKL 192 + + ++ P+ V S K DMM+V+D+S SM+ +D+L Sbjct: 56 LVIALWIGLVTAAARPVWYGDPVTTSPKHR---DMMLVVDLSYSMSQQDMKSGDQFIDRL 112 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF--- 249 + + + + R GL+ F+ PL Q I +++N+ + Sbjct: 113 SAVKQVLSDFI-------AKRQGDRLGLIFFADHAYLQTPLTLDRQTIAQQLNQAVLRLI 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G+ T G+ A D+ D ++ +I L+DG N+S +D E+ Sbjct: 166 GTQTAIGEGIGLATKTFIDS-----------DAPQRVMILLSDGSNTSGVLDPMEA---A 211 Query: 310 NEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSR 349 AK+ +Y +GV A + ++ L+ A + +++ +N + Sbjct: 212 KIAKKYHTTIYTVGVGAGEMMVKEFFMTRKINTAEDLDEKTLQAIADETGGQYFRARNQQ 271 Query: 350 KLHDAFLRIGK 360 L + I + Sbjct: 272 DLQHIYDTINQ 282 >gi|323499301|ref|ZP_08104278.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] gi|323315689|gb|EGA68723.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] Length = 322 Score = 116 bits (289), Expect = 9e-24, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 86/238 (36%), Gaps = 51/238 (21%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREML 203 + P+ V K D+M+VLD+S SM+ +D+L + Sbjct: 67 AMARPVWYGEPVTTQPKHR---DLMLVLDLSYSMSQEDMSDGSDYVDRLTAVKK------ 117 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLE 260 ++ R G+V F+ PL + +++N+L+ G T G+ Sbjct: 118 -VVSDFAIKREGDRLGVVLFADHAYLQTPLTLDRTTVADQVNQLVLRLIGDKTAIGEGIG 176 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A D+ D ++ +I L+DG N+S ID + AK+ A +Y Sbjct: 177 LATKTFIDS-----------DAPQRVMILLSDGSNTSGVIDP---IEAAKIAKKYDATIY 222 Query: 321 AIGVQAE-------------------AADQFLKNC-ASPDRFYSVQNSRKLHDAFLRI 358 IGV A ++ + +++ +++++L + I Sbjct: 223 TIGVGAGEMMVKEFFMTRKVNTAQDLDEKALMQIAQITGGQYFRARDAKELATIYDTI 280 >gi|239995770|ref|ZP_04716294.1| von Willebrand factor, type A [Alteromonas macleodii ATCC 27126] Length = 358 Score = 115 bits (288), Expect = 9e-24, Method: Composition-based stats. Identities = 47/253 (18%), Positives = 99/253 (39%), Gaps = 47/253 (18%) Query: 132 RYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHF--- 185 + ++P + + W ++ P + V I ++ G +MM+ +D+S SM D Sbjct: 52 QTKIPLLVSSLIWLLLVTAAARPQWLGEPVSIPNE---GREMMLAVDLSGSMKIDDMQLN 108 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 G +++L + + + + R GL+ F+ PL + + ++ Sbjct: 109 GRQVNRLTMTKSVVYDFIQ-------RRVGDRIGLILFADTAYVQAPLTYDRDTVSTLLS 161 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + G + T I DA ++ +I LTDG+N++ NI +++ Sbjct: 162 EAVIGLVGEQT--------AIGDAIGLAVKRFDEREESNNVLILLTDGQNTAGNITPEQA 213 Query: 306 LFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCASP--DRFYSV 345 A +G VY IGV A+ + L N A+ +++ Sbjct: 214 KEL---AISKGVKVYTIGVGADKMLIQSFFGSRQINPSQELDEGMLTNIATSTGGQYFRA 270 Query: 346 QNSRKLHDAFLRI 358 +N+++L + ++ Sbjct: 271 RNAQELQAIYQQL 283 >gi|91223292|ref|ZP_01258558.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] gi|91192105|gb|EAS78368.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] Length = 334 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 45/278 (16%), Positives = 100/278 (35%), Gaps = 52/278 (18%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 + + + +++ + S + + + P+ V+ K Sbjct: 42 QQAEIKLAYLPTNENSSKS-KQALQKALSIAIWGLLVVACARPVWFGDPVEFQPKYR--- 97 Query: 170 DMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+M+V+D+S SM + G +D+L + + + + R G+V F Sbjct: 98 DLMLVVDLSGSMQQEDMELNGEYIDRLTAVKKVLSDFV-------AKRKGDRLGVVLFGD 150 Query: 226 KIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL + + ++IN+ + G T G+ D+ D Sbjct: 151 HAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDS-----------DA 199 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ----------------- 325 ++ +I L+DG N++ ++ L AK+ A +Y +GV Sbjct: 200 PQRVMILLSDGSNTAGVLEP---LEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNT 256 Query: 326 -AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A+ +Q L A + +++ +++ +L + I + Sbjct: 257 AADLDEQTLTKVAEVTGGQYFRARDTEELEKIYDTINQ 294 >gi|296108502|ref|YP_003620203.1| hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] gi|295650404|gb|ADG26251.1| Hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] Length = 352 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 56/264 (21%), Positives = 99/264 (37%), Gaps = 48/264 (18%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-- 184 +SA + +P + + P + ++ + G ++MMVLD+S SM Sbjct: 61 ISAKTLLLIPVLIWV--LLVIALSGPRWVGEPKPVARE---GYNIMMVLDLSGSMEITDM 115 Query: 185 --FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 G + +L V R+ + ++ R GL+ F ++ PL + + Sbjct: 116 LLHGRPVSRLLVVKRAAEQFVED-------RVGDRIGLILFGTRAYLQTPLTYDRHSVLM 168 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 +I+ +T GL I DA + + II LTDG N+S + Sbjct: 169 RID--------DATAGLAGKTTSIGDAVGLAVKRLQDVPSKGRVIILLTDGANNSGVLAP 220 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRF 342 L AK+ G +Y IG+ +EA ++ L+ A + R+ Sbjct: 221 ---LKAAELAKQDGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRY 277 Query: 343 YSVQNSRKLHDAFLRIGK-EMVKQ 365 + + L + I + E VKQ Sbjct: 278 FRATDPESLQSIYQTINQLETVKQ 301 >gi|148361167|ref|YP_001252374.1| Von Willebrand factor type A (vWA) domain-containing protein [Legionella pneumophila str. Corby] gi|148282940|gb|ABQ57028.1| conserved hypothetical protein [Legionella pneumophila str. Corby] Length = 344 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 56/264 (21%), Positives = 99/264 (37%), Gaps = 48/264 (18%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-- 184 +SA + +P + + P + ++ + G ++MMVLD+S SM Sbjct: 53 ISAKTLLLIPVLIWV--LLVIALSGPRWVGEPKPVARE---GYNIMMVLDLSGSMEITDM 107 Query: 185 --FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 G + +L V R+ + ++ R GL+ F ++ PL + + Sbjct: 108 LLHGRPVSRLLVVKRAAEQFVED-------RVGDRIGLILFGTRAYLQTPLTYDRHSVLM 160 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 +I+ +T GL I DA + + II LTDG N+S + Sbjct: 161 RID--------DATAGLAGKTTSIGDAVGLAVKRLQDVPSKGRVIILLTDGANNSGVLAP 212 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRF 342 L AK+ G +Y IG+ +EA ++ L+ A + R+ Sbjct: 213 ---LKAAELAKQDGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRY 269 Query: 343 YSVQNSRKLHDAFLRIGK-EMVKQ 365 + + L + I + E VKQ Sbjct: 270 FRATDPESLQSIYQTINQLETVKQ 293 >gi|15600942|ref|NP_232572.1| hypothetical protein VCA0172 [Vibrio cholerae O1 biovar eltor str. N16961] gi|229510539|ref|ZP_04400019.1| protein BatA [Vibrio cholerae B33] gi|229517329|ref|ZP_04406774.1| protein BatA [Vibrio cholerae RC9] gi|229605140|ref|YP_002875844.1| protein BatA [Vibrio cholerae MJ-1236] gi|254286663|ref|ZP_04961618.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|254850438|ref|ZP_05239788.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255746016|ref|ZP_05419963.1| protein BatA [Vibrio cholera CIRS 101] gi|262162145|ref|ZP_06031160.1| protein BatA [Vibrio cholerae INDRE 91/1] gi|9657562|gb|AAF96085.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|150423247|gb|EDN15193.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|229345365|gb|EEO10338.1| protein BatA [Vibrio cholerae RC9] gi|229352984|gb|EEO17924.1| protein BatA [Vibrio cholerae B33] gi|229371626|gb|ACQ62048.1| protein BatA [Vibrio cholerae MJ-1236] gi|254846143|gb|EET24557.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255735770|gb|EET91168.1| protein BatA [Vibrio cholera CIRS 101] gi|262028220|gb|EEY46878.1| protein BatA [Vibrio cholerae INDRE 91/1] Length = 318 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 94/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQTVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ D ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAANIAKQYN 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|312878233|ref|ZP_07738157.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311794982|gb|EFR11387.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 1221 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 47/198 (23%), Positives = 80/198 (40%), Gaps = 29/198 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ VLD S SM+ + G K+ A + ++ R+ +V F Sbjct: 532 IDLVFVLDSSGSMSWNDPNGYRKI-AAKSFVDALI----------QGDRAAVVDFDDYGY 580 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL Q ++ I+R+ T G+ A +++ + DD K II Sbjct: 581 LLQPLTTDFQTVKNAIDRIDSWGGTNIAEGIRIANHQLIS---------QSSDDRIKVII 631 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQ 346 LTDGE N EAK G +Y IG+ + L+N A + ++ V Sbjct: 632 LLTDGEGYYDNNLTT-------EAKNNGITIYTIGLGTSVDENLLRNIATQTGGMYFPVS 684 Query: 347 NSRKLHDAFLRIGKEMVK 364 ++ +L F RI + + + Sbjct: 685 SASQLPQVFKRITEIVTE 702 >gi|326795817|ref|YP_004313637.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546581|gb|ADZ91801.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 337 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 45/255 (17%), Positives = 98/255 (38%), Gaps = 53/255 (20%) Query: 133 YEMPFIFCTFPW--CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGP- 187 ++ I F W + P+ + S +I+ G D+ + LD+S SM +D + Sbjct: 58 FKTSNILLMFAWISLVVAIARPIWVGSPTQITP---SGRDLFVALDLSGSMQISDMYYQS 114 Query: 188 -GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 +++L ++ + + ++ R G++ F +K PL++ + +++ I Sbjct: 115 RPVNRLVISKHVLSDFIE-------KRKGDRIGVIVFGTKAYLQAPLSFDTKTVRQLIQE 167 Query: 247 LI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 G T + ++ + KK +I +TDG N++ + Sbjct: 168 TQIGFAGEKTAIGDAIGLGIKQLSELPSD-----------KKVLILMTDGANTAGRVSP- 215 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCAS--PDRFY 343 L N A +G ++ IG+ A+ + L+N AS ++Y Sbjct: 216 --LQAANFAAEQGVTIHTIGIGADEMEVQGFFGPQTVNPSEDLDEALLENVASLTGGKYY 273 Query: 344 SVQNSRKLHDAFLRI 358 +++ L + + I Sbjct: 274 RAKSTSDLEEIYGDI 288 >gi|320102588|ref|YP_004178179.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] gi|319749870|gb|ADV61630.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] Length = 688 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 53/199 (26%), Positives = 84/199 (42%), Gaps = 30/199 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L +++++DVS SM GP +D+ A RS + D + R GL+++S ++ Sbjct: 510 PLAILLLIDVSSSMA---GPPLDEAREAARSFLDQCDFTTT--------RVGLISYSDQV 558 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 V L V+ ++ + RL TT LE + KL + GH KY+ Sbjct: 559 VLQTDLTDNVRKVEAGLARLEADGTTNLAGALELG-------RRKLATVPTGHV---KYL 608 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSV 345 + LTDG D +L AK G + AIG EA +L AS ++ Sbjct: 609 VVLTDGYPD----DPDNALLEAAHAKGSGIEIVAIGTG-EADQAYLDRIASTQAGSIFA- 662 Query: 346 QNSRKLHDAFLRIGKEMVK 364 +L AF I + + + Sbjct: 663 -RKGELVRAFGHIARVIAE 680 >gi|52843052|ref|YP_096851.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630163|gb|AAU28904.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 352 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 56/264 (21%), Positives = 99/264 (37%), Gaps = 48/264 (18%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-- 184 +SA + +P + + P + ++ + G ++MMVLD+S SM Sbjct: 61 ISAKTLLLIPVLVWV--LLVIALSGPRWVGEPKPVARE---GYNIMMVLDLSGSMEITDM 115 Query: 185 --FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 G + +L V R+ + ++ R GL+ F ++ PL + + Sbjct: 116 LLHGRPVSRLLVVKRAAEQFVED-------RVGDRIGLILFGTRAYLQTPLTYDRHSVLM 168 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 +I+ +T GL I DA + + II LTDG N+S + Sbjct: 169 RID--------DATAGLAGKTTSIGDAVGLAVKRLQDVPSKGRVIILLTDGANNSGVLAP 220 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRF 342 L AK+ G +Y IG+ +EA ++ L+ A + R+ Sbjct: 221 ---LKAAELAKQDGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRY 277 Query: 343 YSVQNSRKLHDAFLRIGK-EMVKQ 365 + + L + I + E VKQ Sbjct: 278 FRATDPESLQSIYQTINQLETVKQ 301 >gi|54295680|ref|YP_128095.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|53755512|emb|CAH17011.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|307611729|emb|CBX01432.1| hypothetical protein LPW_31221 [Legionella pneumophila 130b] Length = 344 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 56/264 (21%), Positives = 99/264 (37%), Gaps = 48/264 (18%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-- 184 +SA + +P + + P + ++ + G ++MMVLD+S SM Sbjct: 53 ISAKTLLLIPVLVWV--LLVIALSGPRWVGEPKPVARE---GYNIMMVLDLSGSMEITDM 107 Query: 185 --FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 G + +L V R+ + ++ R GL+ F ++ PL + + Sbjct: 108 LLHGRPVSRLLVVKRAAEQFVED-------RVGDRIGLILFGTRAYLQTPLTYDRHSVLM 160 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 +I+ +T GL I DA + + II LTDG N+S + Sbjct: 161 RID--------DATAGLAGKTTSIGDAVGLAVKRLQDVPSKGRVIILLTDGANNSGVLAP 212 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRF 342 L AK+ G +Y IG+ +EA ++ L+ A + R+ Sbjct: 213 ---LKAAELAKQDGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRY 269 Query: 343 YSVQNSRKLHDAFLRIGK-EMVKQ 365 + + L + I + E VKQ Sbjct: 270 FRATDPESLQSIYQTINQLETVKQ 293 >gi|163800205|ref|ZP_02194106.1| hypothetical protein 1103602000595_AND4_05979 [Vibrio sp. AND4] gi|159175648|gb|EDP60442.1| hypothetical protein AND4_05979 [Vibrio sp. AND4] Length = 334 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 45/275 (16%), Positives = 100/275 (36%), Gaps = 46/275 (16%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 + + + +K + + + + P+ V+ K Sbjct: 42 QQAEIKLAYLPNNK-HASKPKQIVQKALSTGVWMLLVIACARPVWYGDPVEFQPKYR--- 97 Query: 170 DMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 DMM+++D+S SM + G +D+L R + + ++ R G+V F Sbjct: 98 DMMLLVDLSGSMQKEDMNDNGEYIDRLTAVKRVLSDFVE-------KRQGDRLGVVLFGD 150 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL + + ++IN+ + G + T I D D ++ Sbjct: 151 HAYLQTPLTADRRTVMQQINQAVIGLVGERT--------AIGDGIGLGTKTFVDSDAPQR 202 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------------------E 327 +I L+DG N++ ++ E+ A++ A +Y +GV A + Sbjct: 203 VMILLSDGSNTAGVLEPLEATEI---AQKYNATIYTVGVGAGEMMVKDFFMTRKVNTASD 259 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +Q L A + +++ +++++L + I + Sbjct: 260 LDEQTLTKIAEMTGGKYFRARDAKELEAIYDTINQ 294 >gi|153830331|ref|ZP_01982998.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|148874174|gb|EDL72309.1| conserved hypothetical protein [Vibrio cholerae 623-39] Length = 318 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 93/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLITAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDSN-----------APQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|262191198|ref|ZP_06049398.1| protein BatA [Vibrio cholerae CT 5369-93] gi|262032938|gb|EEY51476.1| protein BatA [Vibrio cholerae CT 5369-93] Length = 477 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 94/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 221 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 277 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 278 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 330 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ D ++ +I L+DG N++ +D L N AK+ Sbjct: 331 EGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 376 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 377 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 436 Query: 357 RIGK 360 I + Sbjct: 437 TINQ 440 >gi|332140758|ref|YP_004426496.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] gi|327550780|gb|AEA97498.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] Length = 349 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 48/249 (19%), Positives = 97/249 (38%), Gaps = 47/249 (18%) Query: 136 PFIFCTFPWC--ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHF---GPGM 189 P I + W +++ P + V I ++ G +MM+ +D+S SM D G + Sbjct: 56 PLIISSLIWLLLISAAARPQWLGEPVSIPNE---GREMMLAVDLSGSMKIDDMQLNGRQV 112 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 ++L + + + + R GL+ F+ PL + + ++ + Sbjct: 113 NRLTMTKSVVYDFIQ-------RRVGDRLGLILFADTAYVQAPLTYDRDTVSTLLSEAVI 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G + T I DA D+ +I LTDG+N++ NI +++ Sbjct: 166 GLVGEQT--------AIGDAIGLAVKRFDERDESNNVLILLTDGQNTAGNITPEQAKEL- 216 Query: 310 NEAKRRGAIVYAIGVQAEA------------------ADQFLKNCASP--DRFYSVQNSR 349 A +G VY IGV A+ + L + A+ +++ +N++ Sbjct: 217 --AINKGVKVYTIGVGADKMLIQSFFGSREINPSQELDEGMLTDIATSTGGQYFRARNAQ 274 Query: 350 KLHDAFLRI 358 +L + ++ Sbjct: 275 ELEAIYQQL 283 >gi|229527849|ref|ZP_04417240.1| protein BatA [Vibrio cholerae 12129(1)] gi|229334211|gb|EEN99696.1| protein BatA [Vibrio cholerae 12129(1)] gi|327485392|gb|AEA79798.1| BatA aerotolerance operon [Vibrio cholerae LMA3894-4] Length = 318 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 93/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDSN-----------APQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKLNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|153835956|ref|ZP_01988623.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|260880154|ref|ZP_05892509.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|260895271|ref|ZP_05903767.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|260900622|ref|ZP_05909017.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] gi|149750710|gb|EDM61455.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|308085798|gb|EFO35493.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|308091801|gb|EFO41496.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|308107055|gb|EFO44595.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] Length = 334 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 89/251 (35%), Gaps = 51/251 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDKL 192 + + P+ V+ K D+M+V+D+S SM G +D+L Sbjct: 68 LAVGVWTLLVVACARPVWFGEPVEFQPKYR---DLMLVVDLSGSMQKEDMNLDGEYIDRL 124 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF--- 249 + + + + R G+V F PL Q + ++I + + Sbjct: 125 SAVKKVLSDFV-------AKRKGDRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVIGLV 177 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T G+ D+ D ++ +I L+DG N++ +D + Sbjct: 178 GQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP---IEAA 223 Query: 310 NEAKRRGAIVYAIGVQ------------------AEAADQFLKNCA--SPDRFYSVQNSR 349 AK+ A +Y +GV A+ +Q L A + +++ +++ Sbjct: 224 EIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAE 283 Query: 350 KLHDAFLRIGK 360 +L + I K Sbjct: 284 QLEKIYDTINK 294 >gi|121586746|ref|ZP_01676529.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121728206|ref|ZP_01681240.1| conserved hypothetical protein [Vibrio cholerae V52] gi|147672023|ref|YP_001215942.1| hypothetical protein VC0395_1106 [Vibrio cholerae O395] gi|153816797|ref|ZP_01969464.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|227811796|ref|YP_002811806.1| hypothetical protein VCM66_A0168 [Vibrio cholerae M66-2] gi|229506663|ref|ZP_04396172.1| protein BatA [Vibrio cholerae BX 330286] gi|262167807|ref|ZP_06035508.1| protein BatA [Vibrio cholerae RC27] gi|298500027|ref|ZP_07009833.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|121549043|gb|EAX59080.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121629529|gb|EAX61953.1| conserved hypothetical protein [Vibrio cholerae V52] gi|126512600|gb|EAZ75194.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|146314406|gb|ABQ18946.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227010938|gb|ACP07149.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|227014797|gb|ACP11006.1| conserved hypothetical protein [Vibrio cholerae O395] gi|229357014|gb|EEO21932.1| protein BatA [Vibrio cholerae BX 330286] gi|262023715|gb|EEY42415.1| protein BatA [Vibrio cholerae RC27] gi|297542008|gb|EFH78059.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 318 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 93/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDSN-----------APQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|28901309|ref|NP_800964.1| hypothetical protein VPA1454 [Vibrio parahaemolyticus RIMD 2210633] gi|308125557|ref|ZP_05775735.2| von Willebrand factor type A [Vibrio parahaemolyticus K5030] gi|28809856|dbj|BAC62797.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308112309|gb|EFO49849.1| von Willebrand factor type A [Vibrio parahaemolyticus K5030] Length = 328 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 89/251 (35%), Gaps = 51/251 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDKL 192 + + P+ V+ K D+M+V+D+S SM G +D+L Sbjct: 62 LAVGVWTLLVVACARPVWFGEPVEFQPKYR---DLMLVVDLSGSMQKEDMNLDGEYIDRL 118 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF--- 249 + + + + R G+V F PL Q + ++I + + Sbjct: 119 SAVKKVLSDFV-------AKRKGDRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVIGLV 171 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T G+ D+ D ++ +I L+DG N++ +D + Sbjct: 172 GQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP---IEAA 217 Query: 310 NEAKRRGAIVYAIGVQ------------------AEAADQFLKNCA--SPDRFYSVQNSR 349 AK+ A +Y +GV A+ +Q L A + +++ +++ Sbjct: 218 EIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAE 277 Query: 350 KLHDAFLRIGK 360 +L + I K Sbjct: 278 QLEKIYDTINK 288 >gi|262164788|ref|ZP_06032526.1| protein BatA [Vibrio mimicus VM223] gi|262027168|gb|EEY45835.1| protein BatA [Vibrio mimicus VM223] Length = 318 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 92/237 (38%), Gaps = 51/237 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSIREMLDII 206 P+ V S+ D+M+V+D+S SM+ + G +D+L + + E + Sbjct: 69 RPVWYGEPVSTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVLSEFIT-- 123 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAY 263 R GL+ F+ PL Q + ++N+ + G+ T G+ A Sbjct: 124 -----KREGDRVGLILFADHAYLQTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLAT 178 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 D+ D ++ +I L+DG N++ +D L N AK+ +Y +G Sbjct: 179 KTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAANIAKQYQTTIYTVG 224 Query: 324 VQA------------------EAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 V A + ++ L+ AS +++ +N + L + I + Sbjct: 225 VGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 281 >gi|328469247|gb|EGF40193.1| hypothetical protein VP10329_10201 [Vibrio parahaemolyticus 10329] Length = 334 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 89/251 (35%), Gaps = 51/251 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDKL 192 + + P+ V+ K D+M+V+D+S SM G +D+L Sbjct: 68 LSVAIWTLLVVACARPVWFGEPVEFQPKYR---DLMLVVDLSGSMQKEDMNLDGEYIDRL 124 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF--- 249 + + + + R G+V F PL Q + ++I + + Sbjct: 125 SAVKKVLSDFV-------AKRKGDRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVIGLV 177 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T G+ D+ D ++ +I L+DG N++ +D + Sbjct: 178 GQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP---IEAA 223 Query: 310 NEAKRRGAIVYAIGVQ------------------AEAADQFLKNCA--SPDRFYSVQNSR 349 AK+ A +Y +GV A+ +Q L A + +++ +++ Sbjct: 224 EIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAE 283 Query: 350 KLHDAFLRIGK 360 +L + I K Sbjct: 284 QLEKIYNTINK 294 >gi|297579701|ref|ZP_06941628.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297535347|gb|EFH74181.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 318 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 94/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVASQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ D ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|153802375|ref|ZP_01956961.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|124122094|gb|EAY40837.1| conserved hypothetical protein [Vibrio cholerae MZO-3] Length = 318 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 94/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ D ++ +I L+DG N++ +D L + AK+ Sbjct: 172 EGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAADIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|212635916|ref|YP_002312441.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212557400|gb|ACJ29854.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 333 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 53/300 (17%), Positives = 103/300 (34%), Gaps = 59/300 (19%) Query: 83 IIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF 142 K Q L G ++ ++ +K Y L V Sbjct: 20 RKKQQQQAKLGGHLYLPGSDVEVQSLPNKVE-----TSSNKGYWLVWVLL---------- 64 Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIRE 201 + PL + +++ SK G D+M+ +D+S SM + + Sbjct: 65 ---VLAVARPLWMGEPIELPSK---GRDLMLSVDLSGSMQIEDMV-----IDGKVTDRFT 113 Query: 202 MLD-IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++ +I + R GL+ F+ PL + + + +N G + T Sbjct: 114 LIQHVISQFIERRKGDRIGLILFADHAYLQSPLTQDRRTVAQYLNEAEIGLVGRQT---- 169 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 I +A ++ + +I LTDG N++ +I +++ + A +RG +Y Sbjct: 170 ----AIGEAIALGVKRFDQVENSNRVLILLTDGSNNAGSISPEQAT---DIAAKRGITIY 222 Query: 321 AIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +GV AE + LK A + ++ +N+ +L + I K Sbjct: 223 TVGVGAEVMERRTLFGKERVNPSMDLDETQLKQIAEKTGGSYFRARNTEELERIYQEIDK 282 >gi|153214389|ref|ZP_01949360.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|124115338|gb|EAY34158.1| conserved hypothetical protein [Vibrio cholerae 1587] Length = 318 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 94/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ D ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|229522840|ref|ZP_04412254.1| protein BatA [Vibrio cholerae TM 11079-80] gi|229340057|gb|EEO05065.1| protein BatA [Vibrio cholerae TM 11079-80] Length = 318 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 94/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ D ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|269965331|ref|ZP_06179451.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] gi|269829977|gb|EEZ84206.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] Length = 334 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 44/251 (17%), Positives = 92/251 (36%), Gaps = 51/251 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKL 192 + + + P+ V+ K D+M+V+D+S SM + G +D+L Sbjct: 68 LSIAIWGFLVVACARPVWFGDPVEFQPKYR---DLMLVVDLSGSMQQEDMELNGEYIDRL 124 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF--- 249 + + + + R G+V F PL + + ++IN+ + Sbjct: 125 TAVKQVLSDFV-------AKRKGDRLGVVLFGDHAYLQTPLTADRKSVMQQINQTVIGLV 177 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T G+ D+ D ++ +I L+DG N++ ++ L Sbjct: 178 GQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLEP---LEAA 223 Query: 310 NEAKRRGAIVYAIGVQ------------------AEAADQFLKNCA--SPDRFYSVQNSR 349 AK+ A +Y +GV A+ +Q L A + +++ +++ Sbjct: 224 EIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEVTGGQYFRARDTE 283 Query: 350 KLHDAFLRIGK 360 +L + I + Sbjct: 284 ELEKIYDTINQ 294 >gi|54298847|ref|YP_125216.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] gi|53752632|emb|CAH14067.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] Length = 344 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 56/264 (21%), Positives = 99/264 (37%), Gaps = 48/264 (18%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-- 184 +SA + +P + + P + ++ + G ++MMVLD+S SM Sbjct: 53 ISAKTLLLIPVLVWV--LLVIALSGPRWVGEPKPVARE---GYNIMMVLDLSGSMEITDM 107 Query: 185 --FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 G + +L V R+ + ++ R GL+ F ++ PL + + Sbjct: 108 LLHGRPVSRLLVVKRAAEQFVED-------RVGDRIGLILFGTRAYLQTPLTYDRHSVLM 160 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 +I+ +T GL I DA + + II LTDG N+S + Sbjct: 161 RID--------DATAGLAGKTTSIGDAVGLAVKRLQDVPSKGRVIILLTDGANNSGVLAP 212 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRF 342 L AK+ G +Y IG+ +EA ++ L+ A + R+ Sbjct: 213 ---LKAAELAKQDGIKIYTIGLGSEADPRALTGDFFAPTLSAELDEKTLEEMAKMTGGRY 269 Query: 343 YSVQNSRKLHDAFLRIGK-EMVKQ 365 + + L + I + E VKQ Sbjct: 270 FRATDPESLQSIYQTINQLETVKQ 293 >gi|262172998|ref|ZP_06040675.1| protein BatA [Vibrio mimicus MB-451] gi|261890356|gb|EEY36343.1| protein BatA [Vibrio mimicus MB-451] Length = 318 Score = 113 bits (282), Expect = 5e-23, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 92/237 (38%), Gaps = 51/237 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSIREMLDII 206 P+ V S+ D+M+V+D+S SM+ + G +D+L + + E + Sbjct: 69 RPVWYGEPVSTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVLSEFIT-- 123 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAY 263 R GL+ F+ PL Q + ++N+ + G+ T G+ A Sbjct: 124 -----KREGDRVGLILFADHAYLQTPLTMDRQTVISQLNQAVLKLIGTQTAIGEGIGLAT 178 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 D+ D ++ +I L+DG N++ +D L N AK+ +Y +G Sbjct: 179 KTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAANIAKQYQTTIYTVG 224 Query: 324 VQA------------------EAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 V A + ++ L+ AS +++ +N + L + I + Sbjct: 225 VGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 281 >gi|229514670|ref|ZP_04404131.1| protein BatA [Vibrio cholerae TMA 21] gi|229348650|gb|EEO13608.1| protein BatA [Vibrio cholerae TMA 21] Length = 318 Score = 113 bits (282), Expect = 5e-23, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 92/244 (37%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSMSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDSN-----------APQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|332993941|gb|AEF03996.1| von Willebrand factor, type A [Alteromonas sp. SN2] Length = 344 Score = 113 bits (282), Expect = 5e-23, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 94/251 (37%), Gaps = 47/251 (18%) Query: 134 EMPFIFCTFPW--CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGP 187 ++P + + W ++ P + + I S+ G +MM+ +D+S SM + G Sbjct: 54 KIPVVIASLIWLCVVIAAARPQWLGEPISIPSE---GREMMLAVDLSGSMKIDDMELNGR 110 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +++L + + + + R GL+ F+ PL + + ++ Sbjct: 111 QVNRLTMTKSVLYDFIQ-------RRVGDRLGLILFADTAYLQAPLTYDRDTVSTLLSES 163 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + G + T I DA ++ +I LTDG+N++ I +++ Sbjct: 164 VIGLVGEQT--------AIGDAIGLAVKRFDEKEESNNVLILLTDGQNTAGFITPEQAKE 215 Query: 308 YCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCASP--DRFYSVQN 347 A + VY IGV A+ + L + AS +++ ++ Sbjct: 216 L---AVNKKVKVYTIGVGADKMLIQSFFGSRQVNPSQELDEDMLSDLASSTGGQYFRARD 272 Query: 348 SRKLHDAFLRI 358 +L + ++ Sbjct: 273 VNELEAIYAKL 283 >gi|254225237|ref|ZP_04918850.1| conserved hypothetical protein [Vibrio cholerae V51] gi|125622336|gb|EAZ50657.1| conserved hypothetical protein [Vibrio cholerae V51] Length = 318 Score = 112 bits (281), Expect = 6e-23, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 95/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ D ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L++ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTSQDLDEKTLQSIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|258620051|ref|ZP_05715090.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258587409|gb|EEW12119.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 308 Score = 112 bits (281), Expect = 6e-23, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 92/237 (38%), Gaps = 51/237 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSIREMLDII 206 P+ V S+ D+M+V+D+S SM+ + G +D+L + + E + Sbjct: 59 RPVWYGEPVSTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVLSEFIT-- 113 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAY 263 R GL+ F+ PL Q + ++N+ + G+ T G+ A Sbjct: 114 -----KREGDRMGLILFADHAYLQTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLAT 168 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 D+ D ++ +I L+DG N++ +D L N AK+ +Y +G Sbjct: 169 KTFIDS-----------DAPQRVMILLSDGSNTAGVLDP---LEAANIAKQYQTTIYTVG 214 Query: 324 VQA------------------EAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 V A + ++ L+ AS +++ +N + L + I + Sbjct: 215 VGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 271 >gi|126731914|ref|ZP_01747718.1| BatB protein, putative [Sagittula stellata E-37] gi|126707741|gb|EBA06803.1| BatB protein, putative [Sagittula stellata E-37] Length = 323 Score = 112 bits (281), Expect = 7e-23, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 91/238 (38%), Gaps = 37/238 (15%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGM 189 ++ + + A + P L+ + + + G D+MM +D+S SM + G Sbjct: 59 KLATLTLVWILLATALARPALVGPEMPLPVE---GRDIMMAIDLSGSMEERDFAVGGRPA 115 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 +L + + + + + R GLV FS + PL + + +++ +++ Sbjct: 116 TRLSIVKETADDFI-------SRRDGDRLGLVLFSDRAYLQAPLTFDREAVRKLLDQAQV 168 Query: 250 ---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 G T + + ++ D E + ++ LTDG N+ + ++ Sbjct: 169 GLTGQKTAIGDAIAVSVKRLKDRPEDG-----------RVLVLLTDGANNEGVMSPDKA- 216 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF----LKNC--ASPDRFYSVQNSRKLHDAFLRI 358 + A + G +Y IGV + + L+ A+ ++ + + L + I Sbjct: 217 --ADLAAKLGIRIYTIGVGSARSRDLDERTLRQIADATGGAYFRATDVQGLAQIYRAI 272 >gi|153825062|ref|ZP_01977729.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] gi|149741387|gb|EDM55421.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] Length = 318 Score = 112 bits (280), Expect = 8e-23, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 93/244 (38%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLITAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ ++ +I L+DG N++ +D L N AK+ Sbjct: 172 EGIGLATKTFIDSN-----------APQRVMILLSDGSNTAGVLDP---LEAANIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTLGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|254786433|ref|YP_003073862.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687231|gb|ACR14495.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 347 Score = 112 bits (280), Expect = 8e-23, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 90/240 (37%), Gaps = 40/240 (16%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 +S P + V + + G D+++ +D+S SM DK ++ ++ Sbjct: 69 LLVAASARPQWVGEPVTLPA---TGRDLLLAVDISGSMKTPDMVVQDKQIARILVVKYVV 125 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + + R GL+ F S+ PL + + + ++ G + T Sbjct: 126 NE---FIERRESDRLGLILFGSQAYLQAPLTFDRKTVSTLLDEAQLGFAGEQT------- 175 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 I DA + ++ +I LTDG N++ + +++ + AK+ G +Y +G Sbjct: 176 -AIGDAVGLAIKRLRERPASQRVLILLTDGANTAGEVAPRQA---ADLAKQAGIKIYTVG 231 Query: 324 VQAEAADQ---------------------FLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 V A+ +Q L+ A + ++ +N ++L + + K Sbjct: 232 VGADQMEQRMGLFGGFSRTVNPSSDLDEDTLRYMAETTGGLYFRARNPQELQAIYEELDK 291 >gi|260777338|ref|ZP_05886232.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] gi|260607004|gb|EEX33278.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] Length = 271 Score = 112 bits (280), Expect = 8e-23, Method: Composition-based stats. Identities = 47/240 (19%), Positives = 91/240 (37%), Gaps = 51/240 (21%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIRE 201 + P+ V K DMM+V+D+S SM+ G +D+L + + + Sbjct: 14 ITAVARPVWYGDPVTTQPKHR---DMMLVVDLSYSMSKEDMQFNGDYIDRLSAVKQVLSD 70 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPG 258 + R GLV F+ PL + E++N+ + G+ T G Sbjct: 71 FI-------SKRQGDRLGLVLFADHAYLQTPLTLDRHTVAEQLNQTVLRLIGTKTAIGEG 123 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + A D+ D ++ +I L+DG N++ +D + AK+ A Sbjct: 124 IGLATKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP---IEAAKIAKKYNAT 169 Query: 319 VYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 +Y +GV A + ++ L A + +++ ++S++L + I Sbjct: 170 IYTVGVGAGEMMVKEFFMTRKVNTAQDLDEKSLMEIAKLTGGQYFRARDSKELATIYDTI 229 >gi|254228714|ref|ZP_04922137.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262396564|ref|YP_003288417.1| protein BatA [Vibrio sp. Ex25] gi|151938661|gb|EDN57496.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262340158|gb|ACY53952.1| protein BatA [Vibrio sp. Ex25] Length = 334 Score = 112 bits (280), Expect = 9e-23, Method: Composition-based stats. Identities = 44/251 (17%), Positives = 91/251 (36%), Gaps = 51/251 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKL 192 + + P+ V+ K D+M+V+D+S SM + G +D+L Sbjct: 68 LSMAIWVLLIIACARPVWFGDPVEFQPKYR---DLMLVVDLSGSMQQEDMELNGEYIDRL 124 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF--- 249 + + + + R G+V F PL + + ++IN+ + Sbjct: 125 TAVKKVLSDFV-------AKRKGDRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLV 177 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T G+ D+ D ++ +I L+DG N++ ++ L Sbjct: 178 GQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLEP---LEAA 223 Query: 310 NEAKRRGAIVYAIGVQ------------------AEAADQFLKNCA--SPDRFYSVQNSR 349 AK+ A +Y +GV A+ +Q L A + +++ +++ Sbjct: 224 EIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEMTGGQYFRARDTD 283 Query: 350 KLHDAFLRIGK 360 +L + I + Sbjct: 284 QLEKIYDTINQ 294 >gi|260774144|ref|ZP_05883059.1| protein BatA [Vibrio metschnikovii CIP 69.14] gi|260611105|gb|EEX36309.1| protein BatA [Vibrio metschnikovii CIP 69.14] Length = 322 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 96/257 (37%), Gaps = 52/257 (20%) Query: 132 RYEMPFIFCTFPWCANSSH-APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----G 186 ++ M + T C ++ P+ + S DMM+V+D+S SMN Sbjct: 50 QWLMKGLVLTLWICLVAAAARPVWYGEPIT---HSQPHRDMMLVVDLSYSMNQKDMQIGD 106 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 +D+L + + + + R GL+ F+ PL + +++N+ Sbjct: 107 DYIDRLTAVKQVLSDFI-------AQRQGDRLGLIFFADHAYLQTPLTLDRTTVAQQLNQ 159 Query: 247 LIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + G+ T G+ A ++ D ++ +I L+DG N++ +D Sbjct: 160 AVLRLIGTQTAIGDGIGLATKTFIES-----------DAPQRVMILLSDGSNNAGVLDP- 207 Query: 304 ESLFYCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFY 343 + + AK+ +Y +GV A + ++ L+ A + +++ Sbjct: 208 --IEAAHIAKQYHTTIYTVGVGAGEMMVRDFFMTRRINTAEDLDEETLQKIADLTGGQYF 265 Query: 344 SVQNSRKLHDAFLRIGK 360 ++ L + I K Sbjct: 266 RARDKHDLQTIYQTIDK 282 >gi|149911739|ref|ZP_01900346.1| von Willebrand factor type A domain protein [Moritella sp. PE36] gi|149805212|gb|EDM65230.1| von Willebrand factor type A domain protein [Moritella sp. PE36] Length = 330 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 51/264 (19%), Positives = 97/264 (36%), Gaps = 52/264 (19%) Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + S SR P + P+ I I + G +MM+ +D+S SM Sbjct: 44 TGSASKKSRAIWPLTLMWLCLVVAA-ARPMWIGEPQSIPQQ---GREMMLAVDLSRSMQA 99 Query: 184 H----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 +D+L + + + + R GL+ F+ PL + ++ Sbjct: 100 EDMQINNRMVDRLSLVKTVVADFIQQ-------RKGDRVGLIFFADNAYLQAPLTFDLKT 152 Query: 240 IQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + + + G T G+ A + A D+ +K +I LTDG+NS Sbjct: 153 VSGYMQQAVLGLVGEQTAIGEGIGLALKRFDAA-----------DNPQKVLILLTDGQNS 201 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-------ADQ-----------FLKNCA- 337 + + + L A+ +G +Y IGV A+A +Q LK A Sbjct: 202 AGEV---KPLDAAKFAQEQGVKIYTIGVGADAYYKRTLFGNQKVDPSRDLDEVTLKTIAA 258 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGK 360 + +++ +++ L + + K Sbjct: 259 QTGGQYFRARDASSLAAIYAELDK 282 >gi|42524204|ref|NP_969584.1| hypothetical protein Bd2794 [Bdellovibrio bacteriovorus HD100] gi|39576412|emb|CAE80577.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 336 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 54/252 (21%), Positives = 85/252 (33%), Gaps = 44/252 (17%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P I + + KI + G+D+++ LDVS SM +++L Sbjct: 55 LPVILKSLALVFAIVALARPQEMNTKIRKNVE-GIDIVICLDVSDSMLIEDMKPLNRLEA 113 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + I R GLV F+ + P Q I +++N + S+ K Sbjct: 114 AK-------ETIAKFISARTSDRIGLVVFAGESFTMVPPTLDYQMILQRVNEISSASSAK 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 G + A K + +IF+TDGEN+S ID + L AK Sbjct: 167 IKDG-----TALGVAMANAAGRLKDSQARSRVMIFMTDGENNSGTIDPETGLEI---AKG 218 Query: 315 RGAIVYAIGVQAEA--------------------------ADQFLKNCASP--DRFYSVQ 346 G VY+IG+ + + L AS ++Y Sbjct: 219 YGIKVYSIGIGKDGPTRIPVYSRDIFGQKVKTYQPFESTVNEDLLGRMASDTGGKYYRAT 278 Query: 347 NSRKLHDAFLRI 358 L F I Sbjct: 279 TEGALQKVFSDI 290 >gi|304411849|ref|ZP_07393460.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303383|ref|ZP_07583138.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349709|gb|EFM14116.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306913743|gb|EFN44165.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 339 Score = 111 bits (278), Expect = 1e-22, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 92/247 (37%), Gaps = 51/247 (20%) Query: 142 FPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 WC + P + +++ S+ G D+MM +D+S SM ++ + V +++ Sbjct: 58 LMWCLLVLAIARPQWLGEPIELPSQ---GRDLMMAVDLSGSMQ------IEDMVVNGKTV 108 Query: 200 REML---DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STT 253 ++ + R GL+ F+ PL + + + + G T Sbjct: 109 DRFTLIQHVVSDFIERRKGDRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQT 168 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + A + E + +I LTDG N++ NI+ +++ A Sbjct: 169 AIGEAIALAVKRFDKMDESN-----------RVLILLTDGSNNAGNIEPEQA---AQIAA 214 Query: 314 RRGAIVYAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHD 353 R +Y +GV A+ + LK+ A + R++ +NS++L Sbjct: 215 NRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQ 274 Query: 354 AFLRIGK 360 + I K Sbjct: 275 IYQEIDK 281 >gi|217972770|ref|YP_002357521.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217497905|gb|ACK46098.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 340 Score = 111 bits (278), Expect = 1e-22, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 92/247 (37%), Gaps = 51/247 (20%) Query: 142 FPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 WC + P + +++ S+ G D+MM +D+S SM ++ + V +++ Sbjct: 58 LMWCLLVLAIARPQWLGEPIELPSQ---GRDLMMAVDLSGSMQ------IEDMVVNGKTV 108 Query: 200 REML---DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STT 253 ++ + R GL+ F+ PL + + + + G T Sbjct: 109 DRFTLIQHVVSDFIERRKGDRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQT 168 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + A + E + +I LTDG N++ NI+ +++ A Sbjct: 169 AIGEAIALAVKRFDKMDESN-----------RVLILLTDGSNNAGNIEPEQA---AQIAA 214 Query: 314 RRGAIVYAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHD 353 R +Y +GV A+ + LK+ A + R++ +NS++L Sbjct: 215 NRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQ 274 Query: 354 AFLRIGK 360 + I K Sbjct: 275 IYQEIDK 281 >gi|126174972|ref|YP_001051121.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125998177|gb|ABN62252.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 339 Score = 111 bits (278), Expect = 1e-22, Method: Composition-based stats. Identities = 47/247 (19%), Positives = 92/247 (37%), Gaps = 51/247 (20%) Query: 142 FPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 WC + P + +++ S+ G D+MM +D+S SM ++ + V +++ Sbjct: 58 LMWCLLVLAIARPQWLGEPIELPSQ---GRDLMMAVDLSGSMQ------IEDMVVNGKTV 108 Query: 200 REML---DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STT 253 ++ + R GL+ F+ PL + + + + G T Sbjct: 109 DRFTLIQHVVSDFIERRKGDRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQT 168 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + A + E + +I LTDG N+S NI+ +++ A Sbjct: 169 AIGEAIGLAVKRFDKMDESN-----------RVLILLTDGSNNSGNIEPEQA---AQIAA 214 Query: 314 RRGAIVYAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHD 353 R +Y +GV A+ + LK+ A + R++ +NS++L Sbjct: 215 NRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQ 274 Query: 354 AFLRIGK 360 + I K Sbjct: 275 IYQEIDK 281 >gi|312793553|ref|YP_004026476.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180693|gb|ADQ40863.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 726 Score = 111 bits (278), Expect = 1e-22, Method: Composition-based stats. Identities = 46/196 (23%), Positives = 78/196 (39%), Gaps = 29/196 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ VLD S SM+ + G K+ A + ++ R+ +V F Sbjct: 39 LVFVLDSSGSMSWNDPNGYRKI-AAKSFVDALI----------QGDRAAVVDFDDYGYLL 87 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL Q ++ I+R+ T G+ A +++ + DD K II L Sbjct: 88 QPLTTDFQTVKNAIDRIDSWGGTNIAEGIRIANHQLIS---------QSSDDRIKVIILL 138 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNS 348 TDGE N EAK G +Y IG+ + L+N A + ++ V ++ Sbjct: 139 TDGEGYYDNNLTT-------EAKNNGITIYTIGLGTSVDENLLRNIATQTGGMYFPVSSA 191 Query: 349 RKLHDAFLRIGKEMVK 364 +L F RI + + + Sbjct: 192 SQLPQVFKRITEIVTE 207 >gi|229526203|ref|ZP_04415607.1| protein BatA [Vibrio cholerae bv. albensis VL426] gi|229336361|gb|EEO01379.1| protein BatA [Vibrio cholerae bv. albensis VL426] Length = 318 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 44/244 (18%), Positives = 92/244 (37%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMQVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 E + R GL+ F+ PL Q + ++N+ + G+ T Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ A D+ ++ +I L+DG N++ +D L + AK+ Sbjct: 172 EGIGLATKTFIDSN-----------APQRVMILLSDGSNTAGVLDP---LEAADIAKQYH 217 Query: 317 AIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y +GV A + ++ L+ A + ++ +N + L + + Sbjct: 218 TTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYD 277 Query: 357 RIGK 360 I + Sbjct: 278 TINQ 281 >gi|153001301|ref|YP_001366982.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151365919|gb|ABS08919.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 340 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 92/247 (37%), Gaps = 51/247 (20%) Query: 142 FPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 WC + P + +++ S+ G D+MM +D+S SM ++ + V +++ Sbjct: 58 LMWCLLVLAIARPQWLGEPIELPSQ---GRDLMMAVDLSGSMQ------IEDMVVNGKTV 108 Query: 200 REML---DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STT 253 ++ + R GL+ F+ PL + + + + G T Sbjct: 109 DRFTLIQHVVSDFIERRKGDRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQT 168 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + A + E + +I LTDG N++ NI+ +++ A Sbjct: 169 AIGEAIGLAVKRFDKMDESN-----------RVLILLTDGSNNAGNIEPEQA---AQIAA 214 Query: 314 RRGAIVYAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHD 353 R +Y +GV A+ + LK+ A + R++ +NS++L Sbjct: 215 NRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQ 274 Query: 354 AFLRIGK 360 + I K Sbjct: 275 IYKEIDK 281 >gi|77359908|ref|YP_339483.1| von Willebrand factor type A [Pseudoalteromonas haloplanktis TAC125] gi|76874819|emb|CAI86040.1| conserved protein of unknown function; putative Von Willebrand factor type A domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 328 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 52/275 (18%), Positives = 105/275 (38%), Gaps = 46/275 (16%) Query: 111 STSLSIIIDDQH--KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 T L I +H ++ A +R PF + + ++ P + + + ++ G Sbjct: 30 HTRLRIPSFAKHNLTSQSVEAHARRLTPFEWVIWLLLVIAAANPTWLDDPISMPNE---G 86 Query: 169 LDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+M+ +D+S SM + G +D+L + + + ++ R GL+ F Sbjct: 87 RDIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLTDFIEQ-------RQGDRLGLILFG 139 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL V+ + + ++ G ++T I DA D Sbjct: 140 DTAFLQTPLTRDVKTVSKMLSEAQIGLVGRAT--------AIGDALGLSVKRFANKDKSN 191 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------------- 328 + ++ LTDG+N++ N+ +E+L A+ G VY IGV ++ Sbjct: 192 RIVVLLTDGQNTAGNLKPEEALLL---ARDAGIKVYTIGVGSDNPRGFSLFNMGGMSGDT 248 Query: 329 -ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + LK A + ++ ++ L + + K Sbjct: 249 IDEGLLKRIAEQTGGLYFRAKDVAGLQQIYAELDK 283 >gi|327541799|gb|EGF28311.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 363 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 51/349 (14%), Positives = 125/349 (35%), Gaps = 29/349 (8%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 G++ +L AI++ + IV+ I+ + + +L D + AT + + + N Sbjct: 33 RHGAMLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRSSTDAAANAAATTLADTLDRNLA 92 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 ++ + + N + + F + + + + + + + Sbjct: 93 IQRGQQIAQANLVNGQPLLLADGDFQ--FGRSDRQVNGKYAFNAGEAPFNG---VRVNGQ 147 Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + P + + ++ + + D+ +V+D S SM G + L Sbjct: 148 RTAGSLSGPVPLFFGNVTGTSIFEPEAFATA-TYVERDITLVVDRSGSMA---GSRFNDL 203 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 A R ++L V+ + GL +++ + + L + ++RL G Sbjct: 204 QAAIRIFTDLLATT----PVDE--QIGLASYNDRASEDVQLTENFAEVNNAMDRLRTGGF 257 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + G++ A +++ + + ++ +I +TDG ++ E + Sbjct: 258 TSISRGMQ--------AGQEIALRGRPPEFVERTMIVMTDGRHNRG----PEPRVVATDL 305 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 G ++ I A A +++ A R + N +L D + I Sbjct: 306 AADGVTIHTITFGAGADFGRMQDVARIGGGRHFHATNGDQLRDIYREIA 354 >gi|312622403|ref|YP_004024016.1| von willebrand factor type a [Caldicellulosiruptor kronotskyensis 2002] gi|312202870|gb|ADQ46197.1| von Willebrand factor type A [Caldicellulosiruptor kronotskyensis 2002] Length = 1166 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 52/250 (20%), Positives = 96/250 (38%), Gaps = 33/250 (13%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 +D +K + A + + F+ P P+ +I+ + + +D++ VLD Sbjct: 450 GVDPVNK--VVWAKTNHFTTFVLFYIPTWKAIWEVPIN-KGEREINQQINY-IDLVFVLD 505 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 S SM+ + G K+ A + ++ R+ +V F PL Sbjct: 506 SSGSMSWNDPNGYRKI-AAKSFVDALI----------QGDRAAVVDFDDFGYLLQPLTTD 554 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 Q ++ I+R+ T G+ A ++ +D K II LTDGE Sbjct: 555 FQAVKNAIDRIDSWGGTNIAEGIRIANQQLIS---------LSSEDRIKVIILLTDGEGY 605 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDA 354 N EAK G +Y IG+ + L++ A + ++ V ++ +L Sbjct: 606 YDNNLTT-------EAKNNGITIYTIGLGTSVDENLLRDIATQTGGMYFPVSSASQLPQV 658 Query: 355 FLRIGKEMVK 364 F RI + + + Sbjct: 659 FKRITEIVTE 668 >gi|221135318|ref|ZP_03561621.1| von Willebrand factor, type A [Glaciecola sp. HTCC2999] Length = 342 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 45/245 (18%), Positives = 90/245 (36%), Gaps = 47/245 (19%) Query: 138 IFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDK 191 W S P+ + I ++ G D+M+ +D+S SM G +D+ Sbjct: 67 TLLFIAWSTLIISVCRPVYYGEPISIPNE---GRDLMVAVDLSGSMQTQDMVVNGNEVDR 123 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + + + + R GL+ F+ PL + +++ ++ + G Sbjct: 124 LVMVKTVLGDFIQ-------RRVGDRIGLILFADTAYLQAPLTFDRTTVEQLLSETVIGL 176 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 ST I DA + K ++ LTDG+N++ NI ++L Sbjct: 177 VGDST--------AIGDAIGLAAKRFSDKPNVNKVLVLLTDGQNTAGNITPDQALSL--- 225 Query: 312 AKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKL 351 A + +Y IGV A+A + L A + +++ +++++L Sbjct: 226 AVDQNIKIYPIGVGADAMTVNSLFGQRQVNPSADLDEGLLTRLAKDTGGQYFRARDTQEL 285 Query: 352 HDAFL 356 + Sbjct: 286 EQIYR 290 >gi|307825379|ref|ZP_07655598.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733554|gb|EFO04412.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 326 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 40/246 (16%), Positives = 87/246 (35%), Gaps = 47/246 (19%) Query: 136 PFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGM 189 P + W + P + ++ ++ G D+M+ +D+S SM + + Sbjct: 57 PLLLAAIAWLFLVIACTRPQWLGEPIE---QAVSGRDLMLAVDLSGSMEEQDFVINKRSV 113 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 D+L A + ++ R GL+ F ++ PL + + + +N + Sbjct: 114 DRLTAAKMVAADFIN-------RRVGDRVGLILFGTQAYLQTPLTFDRKTVMTLLNEAVI 166 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T I DA K + ++ +TDG N++ + L Sbjct: 167 GLAGDNT--------AIGDAIGLAVKRLKSEQVNSRVLVLMTDGANTAGEVSP---LKAA 215 Query: 310 NEAKRRGAIVYAIGVQAE-------------------AADQFLKNC-ASPDRFYSVQNSR 349 A +Y IG+ A+ +K ++ ++Y +N+ Sbjct: 216 ELAAANHLKIYTIGIGADEMIVRSFFGNRKINPSVDLDEKTLIKIAESTGGQYYRARNTD 275 Query: 350 KLHDAF 355 +L++ + Sbjct: 276 ELNNIY 281 >gi|56460106|ref|YP_155387.1| von Willebrand factor type A (vWA) domain-containing protein [Idiomarina loihiensis L2TR] gi|56179116|gb|AAV81838.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Idiomarina loihiensis L2TR] Length = 327 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 39/251 (15%), Positives = 98/251 (39%), Gaps = 45/251 (17%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGM 189 +P + + + P + + I ++ G ++M+ +D+S SM G + Sbjct: 54 LLPLMSLCWIMLVAAVAKPQWLGEPLAIRAE---GREIMLAVDLSGSMEIADMQLEGRSV 110 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 ++L + + + ++ R GL+ F+ P+ + +++ +N + Sbjct: 111 NRLTMVKHVLSDFIE-------RREGDRLGLILFADTAYLQTPMTYDRNTVKQMLNESVL 163 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G + T I DA + + + ++ LTDG+N++ N+ +++L Sbjct: 164 GLVGERT--------AIGDAIALSVKRFRDDEKSNRVLVLLTDGQNTAGNLPPEQALEL- 214 Query: 310 NEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSR 349 A+ +Y I V AE +++ A + +++ +++ Sbjct: 215 --AQAYDVTIYPIAVGAEEVVVDSFFGQRRVNPSRDLDVPLMQSIAKQTGGKYFRARSTN 272 Query: 350 KLHDAFLRIGK 360 +L + + R+ K Sbjct: 273 ELEEIYQRLDK 283 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 52/250 (20%), Positives = 98/250 (39%), Gaps = 33/250 (13%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 +D +K + A + + F+ P P+ +I+ + + +D++ VLD Sbjct: 450 GVDPVNK--VVWAKTNHFTTFVLFYIPTWKAIWEVPIN-KGEREINQQVNY-IDLVFVLD 505 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 S SM+ + G K+ A + ++ R+ +V F + PL Sbjct: 506 SSGSMSWNDPNGYRKI-AAKSFVDALI----------QGDRAAVVDFDNFGYLLQPLTTD 554 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 Q ++ I+R+ T G+ A ++ + +D K II LTDGE Sbjct: 555 FQAVKNAIDRIDSWGGTNIAEGIRIANQQLIS---------RSSEDRIKVIILLTDGEGY 605 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDA 354 N EAK G +Y IG+ + L++ A + ++ V ++ +L Sbjct: 606 YDNNLTT-------EAKNNGITIYTIGLGTSVDENLLRDIATQTGGMYFPVSSASQLPQV 658 Query: 355 FLRIGKEMVK 364 F RI + + + Sbjct: 659 FKRITEIVTE 668 >gi|78484419|ref|YP_390344.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362705|gb|ABB40670.1| Type A von Willebrand factor-like [Thiomicrospira crunogena XCL-2] Length = 349 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 46/257 (17%), Positives = 92/257 (35%), Gaps = 49/257 (19%) Query: 134 EMPFIFCTF-PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPG 188 ++P ++ P+ ++ + G D+M+ +D+S SM G Sbjct: 70 KIPLTGIFLWSLVVLAAMRPVWFLNTTPFQA---SGKDLMLAVDLSGSMEKTDMPLRGVE 126 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 +D+L ++ + R GLV F S+ PL + + ++ +N Sbjct: 127 VDRLTAVKSVVKNFIQ-------KRQGDRMGLVVFGSQAFLQSPLTYDLNTVETLLNETE 179 Query: 249 ---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 G+ T + A + EK +I LTDG N++ + + Sbjct: 180 IGMAGNNTAIGDAIGIALKHLHQNSEKKA-----------VLILLTDGSNTAGAV---QP 225 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQF---------------LKNCA--SPDRFYSVQNS 348 L +A+ G +Y IG+ A L+ A + RF+ +++ Sbjct: 226 LDAAKQAQEMGLKIYTIGIGQNQATGLDAFIFGPNRNMDTTTLQKIAELTQGRFFMAKDT 285 Query: 349 RKLHDAFLRIGKEMVKQ 365 +L++ + I + Q Sbjct: 286 NQLNEIYQLIDQLEASQ 302 >gi|332534652|ref|ZP_08410484.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] gi|332035932|gb|EGI72413.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] Length = 328 Score = 111 bits (276), Expect = 2e-22, Method: Composition-based stats. Identities = 45/263 (17%), Positives = 100/263 (38%), Gaps = 44/263 (16%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 ++ +R P + + ++ P + + + ++ G D+M+ +D+S S Sbjct: 42 NLTSQSIEPHARRLNPLEWIIWLLLVTAAANPTWLDEPISLPNE---GRDIMLAVDLSGS 98 Query: 181 MNDH----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 M + G +D+L + + + ++ R GL+ F PL Sbjct: 99 MTEQDMAYNGQYVDRLTMVKAVLSDFIEQ-------RQGDRLGLILFGDTAFLQTPLTRD 151 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 V+ + + ++ G ++T I DA D+ + ++ LTDG+N+ Sbjct: 152 VKTVSKMLSEAQIGLVGRAT--------AIGDALGLSVKRFANKDESNRIVVLLTDGQNT 203 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----------------ADQFLKNCA-- 337 + N++ +++L A+ G VY IGV ++ + LK A Sbjct: 204 AGNLNPEDALLL---AREEGIKVYTIGVGSDNPRGFSLFNVGGSSGSNLDESLLKKIAEQ 260 Query: 338 SPDRFYSVQNSRKLHDAFLRIGK 360 + ++ ++ L + + K Sbjct: 261 TGGLYFRAKDVAGLQQIYAELDK 283 >gi|160875970|ref|YP_001555286.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160861492|gb|ABX50026.1| von Willebrand factor type A [Shewanella baltica OS195] gi|315268165|gb|ADT95018.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 339 Score = 111 bits (276), Expect = 2e-22, Method: Composition-based stats. Identities = 47/248 (18%), Positives = 91/248 (36%), Gaps = 53/248 (21%) Query: 142 FPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVA 195 WC + P + +++ S+ G D+MM +D+S SM G +D+ + Sbjct: 58 LMWCLLVLAIARPQWLGEPIELPSQ---GRDLMMAVDLSGSMQIEDMVINGKTVDRFTLI 114 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---ST 252 + + ++ R GL+ F+ PL + + + + G Sbjct: 115 QHVVSDFIE-------RRKGDRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQ 167 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + A + E + +I LTDG N++ NI+ +++ A Sbjct: 168 TAIGEAIGLAVKRFDKMDESN-----------RVLILLTDGSNNAGNIEPEQA---AQIA 213 Query: 313 KRRGAIVYAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLH 352 R +Y +GV A+ + LK+ A + R++ +NS++L Sbjct: 214 ANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELD 273 Query: 353 DAFLRIGK 360 + I K Sbjct: 274 QIYQEIDK 281 >gi|24374613|ref|NP_718656.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349233|gb|AAN56100.1|AE015746_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 338 Score = 110 bits (275), Expect = 3e-22, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 94/267 (35%), Gaps = 53/267 (19%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + ++ WC + P + +++ S+ G D+M+ +D+S S Sbjct: 39 QTSKATLATQTRQTRKRYWLMWCLLVLAIARPQWLGDPIELPSQ---GRDLMLAVDLSGS 95 Query: 181 MN-DHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 M + G +D+ + + E ++ R GL+ F+ PL Sbjct: 96 MQIEDMVIDGKVVDRFTLIQHVVSEFIE-------RRKGDRIGLILFADHAYLQAPLTQD 148 Query: 237 VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + + + G T + A + E + ++ LTDG Sbjct: 149 RRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDESN-----------RVLVLLTDG 197 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-------------------AADQFLK 334 N++ NI+ +++ A R +Y +GV A+ +Q + Sbjct: 198 SNNAGNIEPQQA---AQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLQQ 254 Query: 335 NC-ASPDRFYSVQNSRKLHDAFLRIGK 360 A+ R++ +NS +L + I K Sbjct: 255 IADATHGRYFRARNSEELEQIYQEIDK 281 >gi|88858061|ref|ZP_01132703.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] gi|88819678|gb|EAR29491.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] Length = 328 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 45/259 (17%), Positives = 95/259 (36%), Gaps = 45/259 (17%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 +++ S M + A++ P + + + ++ G D+M+ +D+S SM Sbjct: 47 GSHSVKQRSSILMWLFWLCLCLGASN---PRWLGEPISLPNE---GRDIMLAVDLSGSMV 100 Query: 183 DH----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 + G +D+L + ++ + R GL+ F PL + Sbjct: 101 EQDMAYQGRYVDRLSMVKAVLKNFI-------AQRQGDRLGLILFGDTAFLQTPLTRDLN 153 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + + + G ++T I DA D + ++ LTDGEN++ Sbjct: 154 TVSKMLEEAQIGLVGRAT--------AIGDALGLAVKRFSQKQDSNRILVLLTDGENTAG 205 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ---------------FLKNCA--SPDR 341 N+ +E+L A+ G VY +GV ++ ++ L+ A + Sbjct: 206 NLAPEEALLL---AREEGIKVYTVGVGSQGGNRFNLFSMSGSSSLDESLLQKIATETGGL 262 Query: 342 FYSVQNSRKLHDAFLRIGK 360 ++ + L + + K Sbjct: 263 YFRATDVASLQQIYQELDK 281 >gi|304312669|ref|YP_003812267.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] gi|301798402|emb|CBL46626.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] Length = 347 Score = 110 bits (274), Expect = 4e-22, Method: Composition-based stats. Identities = 48/242 (19%), Positives = 92/242 (38%), Gaps = 53/242 (21%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREML 203 ++ PL + V I+ + D+M+ +D S SM G +D+L V + + + Sbjct: 75 AAAQPLHVGDPVSINPHAR---DLMLAVDTSQSMEIQDMRLHGEPVDRLTVIKSVVDDFI 131 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLE 260 R GL+ F ++ PL + + ++ +N I G T + Sbjct: 132 -------SHRKNDRIGLILFGTQAYLQTPLTFDHKTVRTLLNESRIGIAGGQTAIGDAIG 184 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A ++ K H K +I LTDG N++ ++ ++ A R+G +Y Sbjct: 185 LALKRL-----------KNHKTGSKVLILLTDGANTAGSVSPVQA---AELAARQGMKIY 230 Query: 321 AIGVQAE----------------AADQF----LKNCAS--PDRFYSVQNSRKLHDAFLRI 358 +GV A+ + +K AS +++ +N+ +L + I Sbjct: 231 TVGVGADEMRIPGVLGFGSQIVNPSADLDEVTMKKIASLTGAQYFRARNTDELRRIYQHI 290 Query: 359 GK 360 K Sbjct: 291 DK 292 >gi|284040938|ref|YP_003390868.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283820231|gb|ADB42069.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 359 Score = 110 bits (274), Expect = 4e-22, Method: Composition-based stats. Identities = 56/251 (22%), Positives = 96/251 (38%), Gaps = 35/251 (13%) Query: 131 SRYEMPF-IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 RY +P +F + P +I + S+ G+D+M+ +DVS+SM++ Sbjct: 77 GRYLLPLCMFLGTACLLIALARPQIIRELREEQSE---GIDIMLAMDVSVSMSESDILP- 132 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 +L A R + + R GLV F+ + PL + + +N L Sbjct: 133 TRLAAARRVAQAFV-------RGRRNDRIGLVIFAGEAFSLCPLTTDYNLLNQYLNDLND 185 Query: 250 G----STTKSTPGLEYAYNKIFD-----AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 G S T L N++ D + + + K II L+DG+N++ N+ Sbjct: 186 GMIRTSGTAIGDALARCINRMRDRPAASSDTTQAKTEQWKSERSKVIILLSDGDNTAGNL 245 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQ---------FLKNCAS--PDRFYSVQNSR 349 D + + AK +Y I V A LK A+ F+ +S Sbjct: 246 DP---ITAASLAKAFNIKIYTIAVGQPVASASEASTVDEGILKKIATIGKGSFFRAVDSG 302 Query: 350 KLHDAFLRIGK 360 +L F +I + Sbjct: 303 RLKTVFAQISQ 313 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 110 bits (274), Expect = 4e-22, Method: Composition-based stats. Identities = 58/336 (17%), Positives = 116/336 (34%), Gaps = 38/336 (11%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENG------- 100 L D L+ + + N E N ++ + + ++ +E + Sbjct: 255 LFSAEDAKLIDDSVQAFNDEIQNLVEECQQEDMNIYFVSVQDEFAGHEAYTSDEYINRVK 314 Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 +++ + + K A M C ++ P + K Sbjct: 315 INAQPQDLKPEGPSAYSVHPNDKGAAAYARCVQRM---IDDIQECEDNGVDPRTLMGDSK 371 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + +++ LD S SM+ G +++ ATR + KS DV L Sbjct: 372 VDPNDASSRHVVLALDTSGSMD---GEPLNETKTATREFASTI--FKSDADVC------L 420 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V++ S ++ + L G T L +Y ++ + Sbjct: 421 VSYDSSARNVIDSTDNEYALKAAVRDLSAGGGTNIEDALRVSYERLEGSGSD-------- 472 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-----QFLKN 335 K+ I+ ++DGE + + + + Y NE K G +Y +G +D + ++ Sbjct: 473 ---KRIIVLMSDGEANEG-LVGDDLIAYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEG 528 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 ASP Y V ++ +L F IG ++ R +Y + Sbjct: 529 IASPGCHYEVDDASQLRYFFGDIGDDINGTRFIYVR 564 >gi|85859126|ref|YP_461328.1| von Willebrand factor type A domain-containing protein [Syntrophus aciditrophicus SB] gi|85722217|gb|ABC77160.1| von Willebrand factor type A domain protein [Syntrophus aciditrophicus SB] Length = 447 Score = 110 bits (274), Expect = 4e-22, Method: Composition-based stats. Identities = 62/319 (19%), Positives = 118/319 (36%), Gaps = 43/319 (13%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 N KG++ I+ A+LL V+ L ++ + +++L +D + A I N G Sbjct: 6 KNQKGAVLIIFALLLIVLLGFTALAVDVGRWYTTRSELSKSVDAGAIAGAKNISNPYLGE 65 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 +G + + E G+ ++ ERS + + D +D+ + Sbjct: 66 DGHLR------------LAEEVARENFSAGYLMTPDSGERSATFTAYAD---EDHRIRVE 110 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 P + + ++ + +++M+VLD S SM+ G M+ Sbjct: 111 GTVSSPGNL--------AGLFGVDWVATSAMGVAKKNEVEIMLVLDRSGSMD---GTPMN 159 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG-VQHIQEKINRLIF 249 L A RS + + + GLV+F++ + PL V + KIN + Sbjct: 160 DLKKAARSFVSFFEETQDQD------KMGLVSFATSVKVDVPLGNNYVSSMTSKINAMDA 213 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS-------SPNIDN 302 T + L A N + G+ ++++IF +DG + DN Sbjct: 214 VGATNAEDSLSQAGNPAKGGLTDQSGVP-GNKRVQQFVIFFSDGNPTAFRGKFKYNGTDN 272 Query: 303 KESLFYCNEAKRRGAIVYA 321 +++ C G VY Sbjct: 273 IDAV-VCGTGNDCG-TVYT 289 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 12/58 (20%), Positives = 24/58 (41%) Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ + E K +Y IG+ D + + P + S +L F +I K++ Sbjct: 384 AVEHAQELKDNNVKIYTIGLGNIDRDFLSQIASGPSFEFYAPTSGELQAIFNKIAKDI 441 >gi|331006778|ref|ZP_08330044.1| BatA [gamma proteobacterium IMCC1989] gi|330419396|gb|EGG93796.1| BatA [gamma proteobacterium IMCC1989] Length = 364 Score = 110 bits (274), Expect = 4e-22, Method: Composition-based stats. Identities = 46/262 (17%), Positives = 101/262 (38%), Gaps = 46/262 (17%) Query: 128 SAVSRYEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH- 184 SA + ++F WC ++ P + + + G D+++ +D+S SM Sbjct: 54 SAAHHPLLRWLFLIVIWCLLVLAAAKPQWLGEPQALPT---SGRDLLLAVDISGSMQQED 110 Query: 185 ---FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 +L + + + +D R GL+ F ++ PL + Q + Sbjct: 111 MQINNRPATRLAAVKKVVSDFIDQ-------RQGDRIGLILFGTQAYLQTPLTFDTQSVN 163 Query: 242 EKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + + G T + + ++ + + AK + K II LTDGEN++ Sbjct: 164 QFLQEAQLGFAGKDTAIGDAIGLSVKRLKN--QSSASSAKPSN--SKVIILLTDGENTAG 219 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--S 338 ++ L A++ GA +Y +G+ A+ ++ L A + Sbjct: 220 EVEP---LQAAKLAEKIGAKIYTVGIGADEMIVRGFFGNRRVNPSASLDEETLTAIANTT 276 Query: 339 PDRFYSVQNSRKLHDAFLRIGK 360 ++ +N+++L++ + + K Sbjct: 277 GGLYFRARNTQELNNIYSELDK 298 >gi|303240108|ref|ZP_07326629.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302592377|gb|EFL62104.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 323 Score = 110 bits (274), Expect = 4e-22, Method: Composition-based stats. Identities = 58/278 (20%), Positives = 103/278 (37%), Gaps = 49/278 (17%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMP--FIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 + ++L K Y R+++ + + + P L K S S Sbjct: 24 KKSALKFSSVKLLKSYGYKNKIRHKIGRYLVVLSLILMVIAMARPQL---PEKNSPISKQ 80 Query: 168 GLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+ + LDVS +M F P ++L VA ++I++ +D R L+ F+ Sbjct: 81 GIDIAVALDVSGTMQSVDFEP--NRLEVARKTIQDFVDQ-------RPSDRIALIAFAGT 131 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL ++E + + F S + + A + + K K Sbjct: 132 AYTRVPLTLDHNVVRESLQDISFKSVNEEGTAIGMAISVGLN-------RLKKSTSPSKI 184 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------------------ 328 +I LTDG+N++ +ID + AK G +Y IGV ++ Sbjct: 185 MILLTDGDNNAGSIDPNTASTL---AKDSGIKIYTIGVGSDKTIIPGTNEFGQTVYQEYE 241 Query: 329 ----ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + LK A + ++Y ++S L F I K Sbjct: 242 SGLLNEDLLKKIAETTNGQYYRAKDSNALSQVFANINK 279 >gi|152990340|ref|YP_001356062.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422201|dbj|BAF69705.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 289 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 42/207 (20%), Positives = 75/207 (36%), Gaps = 27/207 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 I G D+++ LD S SM + K V + Sbjct: 67 SSIKLDDRKGRDLVLALDASGSMEESLYDEKSKFEVVKSMAQNFF-------HKRFDDNI 119 Query: 219 GLVTFSSKIVQTFPLAWGVQH---IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 G+V F S PL + + + + I G+ T GL + Sbjct: 120 GIVIFGSFAYIAAPLTYDTKALDFLINYLEPSIAGNNTAIGEGLWQGIKALQAD------ 173 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +K +I +TDG ++S +I ++++ +AK+ G +Y IG+ +A L+ Sbjct: 174 -----TAKQKVLILITDGHHNSGSISPRQAV---EKAKKLGIKIYTIGLG-DADKHLLEQ 224 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGK 360 A S +F+ ++ L F + K Sbjct: 225 IAKESGGKFFYAKSEEDLQSIFSELNK 251 >gi|120598362|ref|YP_962936.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|146293560|ref|YP_001183984.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|120558455|gb|ABM24382.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|145565250|gb|ABP76185.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 339 Score = 109 bits (273), Expect = 5e-22, Method: Composition-based stats. Identities = 45/247 (18%), Positives = 91/247 (36%), Gaps = 49/247 (19%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 + + P + +++ SK G D+MM +D+S SM ++ + V +++ Sbjct: 58 LMWSLLVLAIARPQWLGDPIELPSK---GRDLMMAVDLSGSMQ------IEDMVVNGKTV 108 Query: 200 REML---DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STT 253 ++ + R GL+ F+ PL + + + + G T Sbjct: 109 DRFTLIQHVVSDFIERRKGDRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQT 168 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + A + E + +I LTDG N++ NI+ +++ A Sbjct: 169 AIGEAIALAVKRFDKIDESN-----------RVLILLTDGSNNAGNIEPEQA---AQIAA 214 Query: 314 RRGAIVYAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHD 353 R +Y +GV A+ + LK+ A + R++ +NS++L Sbjct: 215 NRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQ 274 Query: 354 AFLRIGK 360 + I K Sbjct: 275 IYQEIDK 281 >gi|319426861|gb|ADV54935.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 339 Score = 109 bits (273), Expect = 6e-22, Method: Composition-based stats. Identities = 45/247 (18%), Positives = 91/247 (36%), Gaps = 49/247 (19%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 + + P + +++ SK G D+MM +D+S SM ++ + V +++ Sbjct: 58 LMWSLLVLAIARPQWLGDPIELPSK---GRDLMMAVDLSGSMQ------IEDMVVNGKTV 108 Query: 200 REML---DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STT 253 ++ + R GL+ F+ PL + + + + G T Sbjct: 109 DRFTLIQHVVSDFIERRKGDRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQT 168 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + A + E + +I LTDG N++ NI+ +++ A Sbjct: 169 AIGEAIALAVKRFDKIDESN-----------RVLILLTDGSNNAGNIEPEQA---AQIAA 214 Query: 314 RRGAIVYAIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHD 353 R +Y +GV A+ + LK+ A + R++ +NS++L Sbjct: 215 NRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQ 274 Query: 354 AFLRIGK 360 + I K Sbjct: 275 IYQEIDK 281 >gi|113969745|ref|YP_733538.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113884429|gb|ABI38481.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 338 Score = 109 bits (273), Expect = 6e-22, Method: Composition-based stats. Identities = 49/264 (18%), Positives = 94/264 (35%), Gaps = 43/264 (16%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 Q N+S+ SR + + + P + +++ S+ G D+M+ +D+S Sbjct: 38 SQTGKANISSHSRQSRKRYWLMWSLLVLAIARPQWLGDPIELPSQ---GRDLMLAVDLSG 94 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 SM K+ + ++ + R GL+ F+ PL + Sbjct: 95 SMQIEDMVINGKV---VDRFTLIQHVVSEFIERRKGDRIGLILFADHAYLQAPLTQDRRS 151 Query: 240 IQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + + G T + A + E + +I LTDG N+ Sbjct: 152 VAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDESN-----------RVLILLTDGSNN 200 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE--------------AADQF----LKNCA- 337 + NID ++ A R +Y +GV A+ + LK+ A Sbjct: 201 AGNIDPDQA---AQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAE 257 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGK 360 + R++ +NS++L + I K Sbjct: 258 VTHGRYFRARNSQELEQIYQEIDK 281 >gi|32471725|ref|NP_864718.1| hypothetical protein RB2055 [Rhodopirellula baltica SH 1] gi|32397096|emb|CAD72400.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 402 Score = 109 bits (272), Expect = 6e-22, Method: Composition-based stats. Identities = 61/393 (15%), Positives = 130/393 (33%), Gaps = 51/393 (12%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 ++N G++ +L I+LPV+ I+ VI ++ V A + D ++ + Sbjct: 29 LKNQPKQRSGAVIVLLVIMLPVLLILAAYVINVAYVEAVTADSQVVTDAAVCAAGRVYIQ 88 Query: 66 QENGNNGKKQKNDFSYRI-IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 + N D + R + ++L ++ + S D+ Sbjct: 89 TGDKNAALAAARDAAERNPVAGKVVPINMSDLEFGISLRESLDEGYSFQPLSDDDEFGNA 148 Query: 125 YNLSAVSRYEMP---FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 L+ +S P F N P + S + + +D+ +V+D S SM Sbjct: 149 VRLTTLSLSNSPQPVFSPLFPTMGTNLEIRPQRVAVSTQST------MDVALVIDRSGSM 202 Query: 182 -------------------NDHFG---PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +G P + S+ + P + Sbjct: 203 AYANDEAPDPYVNPAAAPPGWTYGDPVPPNSRWLDLVASVNAFNGFLADSPQYEKLC--- 259 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L T+S + L I +++ + + G T GLE+ + DA + Sbjct: 260 LATYSDNASRDCDLTHTYAEISNQLDAISYQFNGGGTSVGYGLEHGLAVLTDATHARKFA 319 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + ++ +TDG +++ + + G ++ I +A + N Sbjct: 320 V-------RVMVLMTDGHHNTGKSPESMTY----HLQNHGVTLFTITFSDDADQSRMSNL 368 Query: 337 AS--PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A+ + ++ +L +AF +I K++ Sbjct: 369 ANACGGENFHATDASQLQNAFQKIAKKLPSLMT 401 >gi|117919904|ref|YP_869096.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117612236|gb|ABK47690.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 338 Score = 109 bits (271), Expect = 9e-22, Method: Composition-based stats. Identities = 46/270 (17%), Positives = 95/270 (35%), Gaps = 48/270 (17%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 +S ++ + RY + + + P + +++ S+ G D+M+ Sbjct: 37 ISQTDKANITSHSRQSRKRY-----WLMWSLLVLAIARPQWLGDPIELPSQ---GRDLML 88 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +D+S SM K+ + ++ + R GL+ F+ PL Sbjct: 89 AVDLSGSMQIEDMVINGKV---VDRFTLIQHVVSEFIERRKGDRIGLILFADHAYLQAPL 145 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + + + G T + A + E + +I L Sbjct: 146 TQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDESN-----------RVLILL 194 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE--------------AADQF---- 332 TDG N++ NI+ +++ A R +Y +GV A+ + Sbjct: 195 TDGSNNAGNIEPEQA---AQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQ 251 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 LK+ A + R++ +NS++L + I K Sbjct: 252 LKHIADVTHGRYFRARNSQELEQIYQEIDK 281 >gi|114046974|ref|YP_737524.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113888416|gb|ABI42467.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 338 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 47/264 (17%), Positives = 95/264 (35%), Gaps = 43/264 (16%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 Q N+++ SR + + + P + +++ S+ G D+M+ +D+S Sbjct: 38 SQTGKTNITSRSRQSRKRYWLMWSLLVLAIARPQWLGDPIELPSQ---GRDLMLAVDLSG 94 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 SM K+ + ++ + R GL+ F+ PL + Sbjct: 95 SMQIEDMVINGKV---VDRFTLIQHVVSEFIERRKGDRIGLILFADHAYLQAPLTQDRRS 151 Query: 240 IQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + + G T + A + E + +I LTDG N+ Sbjct: 152 VAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDESN-----------RVLILLTDGSNN 200 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE--------------AADQF----LKNCA- 337 + NI+ +++ A R +Y +GV A+ + LK+ A Sbjct: 201 AGNIEPEQA---AQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAD 257 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGK 360 + R++ +NS++L + I K Sbjct: 258 VTHGRYFRARNSQELEQIYQEIDK 281 >gi|188578240|ref|YP_001915169.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522692|gb|ACD60637.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 335 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 48/258 (18%), Positives = 89/258 (34%), Gaps = 46/258 (17%) Query: 126 NLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 MP W + P + ++ ++ MM+ +D+S SMN+ Sbjct: 57 QAKTTPSLRMPRWLAWLGWFLLCAALARPQQLGEVIQPPREAR---QMMLAVDLSGSMNE 113 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 G +D+L A + + LD + R GL+ F + PL + Sbjct: 114 PDMVLGGKVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYALTPLTADLTS 166 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 +++++ S GL I DA + ++ ++ LTDG N++ Sbjct: 167 VRDQLR--------DSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGV 218 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQ-------------AEAADQF----LKNCA--SPD 340 +D L AK G +Y I A D L+ A + Sbjct: 219 LDP---LKAAELAKAEGVRIYTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGG 275 Query: 341 RFYSVQNSRKLHDAFLRI 358 RF+ +++ +L + + Sbjct: 276 RFFRARDTEELAGIYAEL 293 >gi|158425008|ref|YP_001526300.1| von Willebrand factor type A domain-containing protein [Azorhizobium caulinodans ORS 571] gi|158331897|dbj|BAF89382.1| von Willebrand factor type A domain protein [Azorhizobium caulinodans ORS 571] Length = 343 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 88/235 (37%), Gaps = 41/235 (17%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSI 199 ++ P + V I+ + G D+M+ +D+S SM+ +D+L + Sbjct: 69 LLVGAAMRPAYVGKPVPIAVE---GRDLMLAVDLSGSMSRQDLSYDNIPVDRLTIIKGVA 125 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 + + R GL+ FS++ PL + +++ + S+ G+ Sbjct: 126 DDFI-------AKRKGDRIGLILFSTRAYVQAPLTFDRNVVRDLLRT--------SSIGM 170 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 I DA + ++ ++ LTDG N+S + + AK G + Sbjct: 171 TGQETAIGDAIALAVKTLRTRPQEQRVLVLLTDGANNSGMLSP---IPAAEIAKANGVKI 227 Query: 320 YAIGVQAEA---ADQF-----------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 Y IGV A+A + L+ A + R++ +++ L + I Sbjct: 228 YTIGVGADAFAVGQRMVNPSFDLDEGALEQIAQMTGGRYFRARDAAGLAAIYNDI 282 >gi|189219434|ref|YP_001940075.1| hypothetical protein Minf_1423 [Methylacidiphilum infernorum V4] gi|189186292|gb|ACD83477.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 334 Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats. Identities = 54/262 (20%), Positives = 98/262 (37%), Gaps = 44/262 (16%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGP 187 R + FI+ F + + P K+ + + G D+++VLD+S SM + + Sbjct: 50 ITERVSLFFIYIAFLFFVIALARPQ--EEKGKVPLRKE-GYDIILVLDISGSMLAEDYEI 106 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 ++ +L+++K+ D R GLV F+ + PL + ++ KI++L Sbjct: 107 DQKRVSRLD----IVLEVVKTFLDKRTNDRIGLVAFAGRAYTVCPLTFDHNWLKRKIDQL 162 Query: 248 IFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 G+ T L A +++ KE E G ++I LTDG N+ N+ E Sbjct: 163 QAGTIEDGTAIGDALGLALSRLEGKKESGERKKIGS-----FLILLTDGANNCGNLTPIE 217 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQ-----------------------FLKNCA--SP 339 + A V+ IG L+N A + Sbjct: 218 AARL---AAHAAVPVFTIGAGINGEVTMPVMDEERRKIGSQTVVSEVDEGLLRNIAQLTG 274 Query: 340 DRFYSVQNSRKLHDAFLRIGKE 361 ++ +S + AF I + Sbjct: 275 GEYFRATDSNAIVSAFQAIDAQ 296 >gi|116329598|ref|YP_799317.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332487|ref|YP_802204.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122491|gb|ABJ80384.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127354|gb|ABJ77446.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 312 Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 87/233 (37%), Gaps = 30/233 (12%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKIS---SKSDIGLDMMMVLDVSLSMNDHFG-PGMDKL 192 F+ P + + + K + G+D+M+ LDVS SM+ +L Sbjct: 53 FLPLLRPIAISL--VVVALAGPGKKTTFLPNEKKGVDVMIALDVSGSMSRSRDFLPETRL 110 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 GV+ + +R+ +D R GLV F+ PL + + E + + + Sbjct: 111 GVSKKLLRKFID-------KRKSDRLGLVVFAGAAYLQAPLTGDRESLNEILGTIEEETV 163 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + I DA + + K I+ +TDG +++ ID + Sbjct: 164 AEQGTA-------IGDAIILSTYRLRASQARSKVIVLITDGVSNTGKIDPVTATDLAEHI 216 Query: 313 KRRGAIVYAIGVQAEAAD-----QFLK--NCASPDRFYSVQNSRKLHDAFLRI 358 G +Y++G+ E + L+ + ++ +F+ ++ ++ I Sbjct: 217 ---GVKIYSVGIGKEDGSYEINFEILRELSASTGGKFFRAEDPEEMKAVLTSI 266 >gi|90417299|ref|ZP_01225225.1| batB protein, putative [marine gamma proteobacterium HTCC2207] gi|90330884|gb|EAS46147.1| batB protein, putative [marine gamma proteobacterium HTCC2207] Length = 330 Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 84/238 (35%), Gaps = 51/238 (21%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDI 205 P+ I V + + G D+++ +D+S SM G +++L + + Sbjct: 74 ARPVWIGEPVSLPT---SGRDILLAVDISGSMEREDMQLSGQTVNRLMAVKAVVGNFVTE 130 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYA 262 R GL+ F K PL + + +Q + G+ T + + Sbjct: 131 -------REGDRLGLILFGEKAYLQTPLTFDRKTMQTLLYEAQLGFAGNGTAIGDAIGLS 183 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ E + +I LTDG N++ +D ++ + AK +Y I Sbjct: 184 VKRLQQRPENH-----------RVVILLTDGANNAGELDPLKAAELASSAK---VKIYTI 229 Query: 323 GVQAEA------------------ADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGK 360 GV AE +Q L A+ +++ +N +L + + + Sbjct: 230 GVGAETQEAWGLFGKRVTNPSADLDEQTLTAIAEATGGQYFRARNPEELMAIYQELNR 287 >gi|170727371|ref|YP_001761397.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169812718|gb|ACA87302.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 330 Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats. Identities = 43/240 (17%), Positives = 94/240 (39%), Gaps = 43/240 (17%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + PL + +++ SK G D+M+ +D+S SM ++ + + +++ + Sbjct: 61 LLVIAVARPLWMGDPIELPSK---GRDLMVAVDLSGSMQ------IEDMVLDGKAVNRFI 111 Query: 204 ---DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++ + + GL+ F+ PL + + + + G K T Sbjct: 112 MVQSVVSDFIERRKGDKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQT---- 167 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 I +A D+ + ++ LTDG N+S +I +++ + A +RG +Y Sbjct: 168 ----AIGEAIALSVKRFDLVDESNRILVLLTDGSNNSGSISPEQA---ADIAAKRGIKIY 220 Query: 321 AIGVQAE-------------------AADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGK 360 +IGV A+ +Q + R++ +N+++L + I K Sbjct: 221 SIGVGADVMERRTLFGKERVNPSMDLDEEQLTSLAQTTGGRYFRARNAQELEQIYQEIDK 280 >gi|95928343|ref|ZP_01311091.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95135614|gb|EAT17265.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 329 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 48/248 (19%), Positives = 88/248 (35%), Gaps = 45/248 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKL 192 + S P + +++ G D+M+ +D+S SM G +D+L Sbjct: 61 LTILGWVLLVLSCTRPQWLGDPIELPV---SGRDLMLAVDLSGSMRTDDFQLSGRSVDRL 117 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 +D R GL+ F + PL + + ++ + G Sbjct: 118 TALKAVAGAFIDQ-------RQGDRIGLILFGEQPYIQAPLTFDHNTVTRLLHEAVVGLA 170 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T I DA + K +I LTDG ++S ++D L A Sbjct: 171 GNKT--------AIGDAIGLAVKRLRKDPQAKNVLILLTDGASNSGSLDP---LKAAKLA 219 Query: 313 KRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLH 352 +RG VY IG+ AEA ++ LK A + R++ +++ +L Sbjct: 220 AQRGLKVYTIGIGAEAVEVGSFFFKRTVNPSLDLDEKTLKAIAETTGGRYFRARDTEELA 279 Query: 353 DAFLRIGK 360 + ++ + Sbjct: 280 QIYQQLDQ 287 >gi|147921050|ref|YP_685140.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] gi|110620536|emb|CAJ35814.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] Length = 1310 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 70/182 (38%), Gaps = 18/182 (9%) Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQ 241 P + T + + + + G+V+F + L ++ Sbjct: 895 PPLRLGSKLTTDSVAKTSAVSFVESRGDGDQVGVVSFYTSASLNSALKQMNSGTNKTTVK 954 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 IN L T + G++ A ++ K K+YII LTDG + P D Sbjct: 955 NAINSLSASGGTDISSGIKKAIAELDAHKRSTA---------KQYIIVLTDGYSQYPEFD 1005 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 L ++AK +G ++ IG+ D K + P+ +Y V + +L A+ IG+E Sbjct: 1006 ----LIEADKAKAKGYTIFTIGMGMADEDTLKKIASKPEYYYRVLSPEQLEAAYYDIGQE 1061 Query: 362 MV 363 + Sbjct: 1062 IG 1063 >gi|270158235|ref|ZP_06186892.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|289163509|ref|YP_003453647.1| hypothetical protein LLO_0165 [Legionella longbeachae NSW150] gi|269990260|gb|EEZ96514.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|288856682|emb|CBJ10493.1| putative unknown protein [Legionella longbeachae NSW150] Length = 342 Score = 107 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 54/269 (20%), Positives = 96/269 (35%), Gaps = 48/269 (17%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 I I DQ K ++S +P + A + P + + +S + G ++MM L Sbjct: 43 IDIADQEKS-SISVQHSLLIPALVWLLLVFALAG--PRWVGAPKPVSRE---GYNIMMAL 96 Query: 176 DVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 D+S SM G +L + + + + + + GL+ F ++ Sbjct: 97 DLSGSMEIPDMILHGRPTSRLNIVKSAAEQFV-------RERSGDKIGLILFGTRAYLQT 149 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 PL + I ++ +T GL I DA + II LT Sbjct: 150 PLTYDRHSILLRL--------EDATAGLAGKTTSIGDAVGLAVKRLDSAPKKGRVIILLT 201 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------ADQFLKNCA-------- 337 DG N+S + L AK G +Y IG+ +E D +++ A Sbjct: 202 DGANNSGVLAP---LKAAELAKEEGIKIYTIGLGSEGDSRALVGDFLMQSPAADLDEETL 258 Query: 338 ------SPDRFYSVQNSRKLHDAFLRIGK 360 + R++ ++ LH + I + Sbjct: 259 KKMSDMTGGRYFRATDTESLHLIYKTINQ 287 >gi|325922265|ref|ZP_08184046.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] gi|325547218|gb|EGD18291.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] Length = 335 Score = 107 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 47/267 (17%), Positives = 89/267 (33%), Gaps = 46/267 (17%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMV 174 D H V MP W + P + ++ ++ MM+ Sbjct: 48 YADQLHAVAQAQRVPALRMPRWLAWLGWFLLCAALARPQQLGVVIQPPREAR---QMMLA 104 Query: 175 LDVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +D+S SM++ G +D+L A + + LD R GL+ F + Sbjct: 105 VDLSGSMSEPDMVLGGSVVDRLTAAKAVLSDFLD-------RREGDRVGLLVFGQRAYAL 157 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL + +++++ S GL I DA + ++ ++ L Sbjct: 158 TPLTADLTSVRDQL--------ADSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLL 209 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----------------L 333 TDG N++ ++ L AK G V+ I L Sbjct: 210 TDGVNTAGVLNP---LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEAGL 266 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRI 358 + A + RF+ +++ +L + + Sbjct: 267 RKIAEQTGGRFFRARDTEELAGIYAEL 293 >gi|254281808|ref|ZP_04956776.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] gi|219678011|gb|EED34360.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] Length = 328 Score = 107 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 45/259 (17%), Positives = 93/259 (35%), Gaps = 43/259 (16%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + LS + ++ + + P + +++ S G D+M+ +D+S S Sbjct: 46 KRGTGRLSNRPSAALWILWAIWLLLVLALSRPQWVGDPIELPS---SGRDLMLAIDLSGS 102 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPD----VNNVVRSGLVTFSSKIVQTFPLAWG 236 M ++ + V R + ++ +K+I R GL+ F ++ PL + Sbjct: 103 MQ------IEDMQVGAR-LVSRIEAVKAIASDFTSQRVGDRVGLILFGTRAYVQAPLTFD 155 Query: 237 VQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I G T L A ++ + + + +I LTDG Sbjct: 156 TATVTRFIREAQLGFAGEDTAIGDALGLAIKRLRERPAE-----------SRVLILLTDG 204 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAAD------QFLKNC--ASPDR 341 ++++ +D E+ A G VY IG + A A L A+ Sbjct: 205 QDTASTVDPMEATAL---AAESGIKVYTIGISRRIGARAGGSGEVDEALLNAIAEATGGE 261 Query: 342 FYSVQNSRKLHDAFLRIGK 360 ++ +N +L + + + Sbjct: 262 YFRARNPAELQSIYGVVDQ 280 >gi|306823858|ref|ZP_07457232.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309802423|ref|ZP_07696530.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] gi|304552856|gb|EFM40769.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221023|gb|EFO77328.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] Length = 1136 Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats. Identities = 53/258 (20%), Positives = 100/258 (38%), Gaps = 62/258 (24%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KSIP 210 ++ + + +D +VLDVS SM ++ G + KL ++ LD K Sbjct: 587 TGAASSSTITTTQPVDFTLVLDVSGSMRENMGS-VTKLQALQSAVNNFLDEAAKINKGAQ 645 Query: 211 DVNNVVRSGLVTFSSKIVQTF-------------------PLAWGVQHIQEKINRLIFGS 251 + VR GLV F+ + L ++ ++N+L G Sbjct: 646 SGSEPVRVGLVKFAGNATKKIGNKTYQDKWNTYNYSQIVKKLTADTDGLKNEVNKLTAGG 705 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN----KESLF 307 T++ G ++A+ + +A+ + KK +IF TDG+ +S + +++ Sbjct: 706 ATRADYGFQHAFTVMSEAR----------TEAKKVVIFFTDGKPTSEKTFDGKVANDAVE 755 Query: 308 YCNEAKRRGAIVYAIGV--------QAEAADQFLKNCAS----------------PDRFY 343 Y + K GAIVY+IGV A + ++F+ +S + Sbjct: 756 YAKQLKDSGAIVYSIGVFDGANPASTATSENKFMHAVSSNYPNAANYEDLSEGSNAGYYK 815 Query: 344 SVQNSRKLHDAFLRIGKE 361 + ++ L+ F I K Sbjct: 816 TATDASGLNSIFEEIRKS 833 >gi|85712923|ref|ZP_01043963.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] gi|85693229|gb|EAQ31187.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] Length = 328 Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 89/234 (38%), Gaps = 45/234 (19%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREM 202 + P + + + ++ G ++M+ +D+S SM G +D+L + + + Sbjct: 67 IAVARPQWLGEPLPVRNE---GREIMLAVDLSGSMEIADMTLDGRNVDRLEMVKAVLGDF 123 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ R GL+ F+ P+ + +Q+ ++ + G + T Sbjct: 124 IE-------RRKGDRLGLILFADTAFLQTPITYDRNTVQQMLDESVLGLVGERT------ 170 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I DA KG + ++ LTDG+N++ N+ +++L AK +Y I Sbjct: 171 --AIGDAIALAVKRFKGKQQTNRVLVLLTDGQNTAGNLSPEQALEL---AKAYDVRIYPI 225 Query: 323 GVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 V AE ++N A + ++ +++ +L + Sbjct: 226 AVGAEEVVVDSVFGRRKVNPSRDLDVPLMQNLADETGGEYFRARSTEELERIYQ 279 >gi|127513358|ref|YP_001094555.1| von Willebrand factor, type A [Shewanella loihica PV-4] gi|126638653|gb|ABO24296.1| von Willebrand factor, type A [Shewanella loihica PV-4] Length = 339 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 46/239 (19%), Positives = 92/239 (38%), Gaps = 49/239 (20%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML---D 204 + PL + ++++ SK G D+M+ +D+S SM ++ + + +++ Sbjct: 72 AVARPLWVGDAIELPSK---GRDLMLAVDLSGSMQ------IEDMVLNGKAVDRFAMVQQ 122 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEY 261 ++ + + GL+ F+ PL + + + + G T + Sbjct: 123 VMSEFIERRKGDKLGLILFADHAYLQAPLTQDRRSVAQFLTEAQIGLVGKQTAIGEAIAL 182 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A + AK+ + +I LTDG N+S +I +++ + A +RG +Y Sbjct: 183 AVKRFDKAKQSN-----------RVLILLTDGSNNSGSITPEQA---ADIAAKRGVTIYT 228 Query: 322 IGVQAE-------------------AADQF-LKNCASPDRFYSVQNSRKLHDAFLRIGK 360 IGV AE Q L + R++ +NS +L + I K Sbjct: 229 IGVGAEVMERRTLFGKERVNPSMDLDEAQLTLLAQKTKGRYFRARNSDELEQIYQEIDK 287 >gi|167752252|ref|ZP_02424379.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] gi|167660493|gb|EDS04623.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] Length = 328 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 53/256 (20%), Positives = 89/256 (34%), Gaps = 56/256 (21%) Query: 135 MPFIF--CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDK 191 +PF + P + +++ G+D+++ +D+S SM P D+ Sbjct: 55 LPFALRCAAVALLIVALARPQSVDEGSTSNTE---GIDIVLAIDISTSMLAQDLQP--DR 109 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG- 250 + A + + R GLV F+ + PL +Q + RL G Sbjct: 110 IQAAKQVAGNFITD-------RPGDRIGLVAFAGEAFTQSPLTTDQGTLQTLLGRLRSGV 162 Query: 251 --STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T GL A N++ ++ K K II LTDGEN+ I L Sbjct: 163 VEDGTAIGNGLATAINRLRESNAK-----------SKVIILLTDGENNRGEIAP---LTA 208 Query: 309 CNEAKRRGAIVYAIGVQAEA----------------------ADQFLKNCA--SPDRFYS 344 A+ +G VY IGV ++ L A + R++ Sbjct: 209 AEIARDQGIRVYTIGVGTRGTAPYPTVDFFGNPTVVQAKVQIDEKILGEIADLTGGRYFR 268 Query: 345 VQNSRKLHDAFLRIGK 360 ++ KL + I + Sbjct: 269 ATDNAKLQSIYDEINQ 284 >gi|32474888|ref|NP_867882.1| hypothetical protein RB7557 [Rhodopirellula baltica SH 1] gi|32445428|emb|CAD75429.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 327 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 50/345 (14%), Positives = 123/345 (35%), Gaps = 29/345 (8%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK 76 + +L AI++ + IV+ I+ + + +L D + AT + + + N ++ Sbjct: 1 MLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRSSTDAAANAAATTLADTLDRNLAIQRG 60 Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP 136 + + N + + F + + + + + + + Sbjct: 61 QQIAQANLVNGQPLLLADGDFQ--FGRSDRQVNGKYAFNAGEAPFNG---VRVNGQRTTG 115 Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + P + + ++ + + D+ +V+D S SM G + L A Sbjct: 116 SLSGPVPLFFGNVTGTSIFEPEAFATA-TYVERDITLVVDRSGSMA---GSRFNDLQAAI 171 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R ++L V+ + GL +++ + + L + ++RL G T + Sbjct: 172 RIFTDLLATT----PVDE--QIGLASYNDRASEDVQLTENFAEVNNAMDRLRTGGFTSIS 225 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G++ A +++ + + ++ +I +TDG ++ E + G Sbjct: 226 RGMQ--------AGQEIALRGRPPEFVERTMIVMTDGRHNRG----PEPRVVATDLAADG 273 Query: 317 AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 ++ I A A +++ A R + N +L D + I Sbjct: 274 VTIHTITFGAGADFGRMQDVARIGGGRHFHATNGDQLRDIYREIA 318 >gi|114563846|ref|YP_751360.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114335139|gb|ABI72521.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 334 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 88/243 (36%), Gaps = 49/243 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + P + +++ +K G D+M+ +D+S SM ++ + + +++ Sbjct: 61 LLLLAIARPQWLGDPIELPAK---GRDLMIAVDLSGSMQ------IEDMVINGQTVNRFT 111 Query: 204 ---DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTP 257 ++ + R GL+ F+ PL + + ++ G T Sbjct: 112 LIQHVLSDFIERRKGDRLGLILFADHAYLQAPLTLDRRSVATFLDDAQIGLVGKQTAIGE 171 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 + A + E + +I LTDG N++ NI+ + + A +R Sbjct: 172 AIALAVKRFDKVDESN-----------RVLILLTDGSNNAGNIEPEVA---AQIAAKRNI 217 Query: 318 IVYAIGVQAE-------------------AADQFLKNCA-SPDRFYSVQNSRKLHDAFLR 357 +Y IGV AE DQ K A + R++ +NS +L + Sbjct: 218 TIYTIGVGAEILERRTIFGKERINPSMDLDEDQLKKLAAMTKGRYFRARNSEELASIYQE 277 Query: 358 IGK 360 I K Sbjct: 278 IDK 280 >gi|91792882|ref|YP_562533.1| von Willebrand factor, type A [Shewanella denitrificans OS217] gi|91714884|gb|ABE54810.1| von Willebrand factor, type A [Shewanella denitrificans OS217] Length = 330 Score = 107 bits (266), Expect = 3e-21, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 85/240 (35%), Gaps = 51/240 (21%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREML 203 + P + +++ SK G D+M+ +D+S SM G +D+ + + E + Sbjct: 66 AIARPQWLGEPIELPSK---GRDLMLAVDLSGSMQIEDMVINGKTVDRFSLIQNVLGEFI 122 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLE 260 + N R GL+ F+ PL + I + G T + Sbjct: 123 E-------RRNGDRLGLILFADHAYLQAPLTQDRRSIATFLADAQIGLVGKQTAIGEAIA 175 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + E + ++ LTDG N++ NI+ + A +R +Y Sbjct: 176 LAVKRFDQVSESN-----------RVLVLLTDGSNNAGNIEPDVA---AEIAAKRNVTIY 221 Query: 321 AIGVQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +GV AE + L+ A + ++ +NS L + +I + Sbjct: 222 TVGVGAELMERRTIFGKERVNPSMDLDEAQLQRLATMTNGYYFRAKNSEDLAQIYQKIDQ 281 >gi|325917650|ref|ZP_08179844.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] gi|325536114|gb|EGD07916.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] Length = 335 Score = 107 bits (266), Expect = 3e-21, Method: Composition-based stats. Identities = 47/267 (17%), Positives = 90/267 (33%), Gaps = 46/267 (17%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMV 174 D H V MP W + P + ++ ++ MM+ Sbjct: 48 YADQLHAVAQARRVPALRMPRWLAWLGWFLLCAALARPQQLGEVIQPPREAR---QMMLA 104 Query: 175 LDVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +D+S SM++ G +D+L A + + LD + R GL+ F + Sbjct: 105 VDLSGSMSEPDMVLGGNVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYAL 157 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL + +++++ S GL I DA + ++ ++ L Sbjct: 158 TPLTADLTSVRDQL--------ADSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLL 209 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----------------L 333 TDG N++ ++ L AK G V+ I L Sbjct: 210 TDGVNTAGVLNP---LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEAGL 266 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRI 358 + A + RF+ +++ +L + + Sbjct: 267 RKIAEQTGGRFFRARDTEELAGIYAEL 293 >gi|167624593|ref|YP_001674887.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] gi|167354615|gb|ABZ77228.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] Length = 345 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 89/234 (38%), Gaps = 43/234 (18%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + PL + ++++ SK G D+M+ +D+S SM +D V S+ + + I Sbjct: 66 ACARPLWVGEAIELPSK---GRDLMLSVDLSGSMQIE-DMVLDGKVVDRFSLIQHV--IS 119 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYN 264 + R GL+ F+ PL + + + + G T + A Sbjct: 120 DFIERRKGDRIGLILFADHAYLQSPLTQDRRTVAQYLKEAQIGLVGKQTAIGEAIALAVK 179 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + ++ + +I LTDG N++ I +++ A +RG +Y IGV Sbjct: 180 RFDKVEQSN-----------RVLILLTDGSNNAGAISPEQATQI---AAKRGITIYTIGV 225 Query: 325 QAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 A+ + L+ A + +++ +N+ +L + I Sbjct: 226 GADVMERRTLFGKERVNPSMDLDESQLQEIAKTTGGQYFRARNTEELEQIYQVI 279 >gi|291514853|emb|CBK64063.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 328 Score = 106 bits (265), Expect = 4e-21, Method: Composition-based stats. Identities = 54/238 (22%), Positives = 87/238 (36%), Gaps = 54/238 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSI 209 P + +V+ +++ G+D+M+ +DVS SM F P D++ A + Sbjct: 73 RPQDVEQNVRTNTE---GIDIMLAIDVSGSMLARDFKP--DRITAAKEVAGSFIAD---- 123 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKI 266 R GLV F+ + PL +Q + R+ G T GL A N++ Sbjct: 124 ---RYGDRIGLVAFAGEAFTQSPLTTDQSTLQTLLARIRSGLIEDGTAIGNGLATAINRL 180 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + D K II LTDG N+ I + AK +G VY IGV Sbjct: 181 -----------RESDAKSKVIILLTDGVNNQGQIAP---MTAAEIAKAQGIRVYTIGVGT 226 Query: 327 EA----------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 E ++ LK + + R++ + KL + I + Sbjct: 227 EGMAPYPAIDMFGNLTFVNQKVEIDEKVLKAISDMTGGRYFRATDKEKLKAVYDEINQ 284 >gi|78049050|ref|YP_365225.1| hypothetical protein XCV3494 [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037480|emb|CAJ25225.1| putative membrane protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 451 Score = 106 bits (265), Expect = 4e-21, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 89/258 (34%), Gaps = 46/258 (17%) Query: 126 NLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + MP W + P + ++ ++ MM+ +D+S SM++ Sbjct: 170 QAKSAPVLRMPRWLAWLAWFLLCAALARPQQLGEVIQPPREAR---QMMLAVDLSGSMSE 226 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 G +D+L A + + LD + R GL+ F + PL + Sbjct: 227 PDMVLGGKVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYALTPLTADLTS 279 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 ++++++ S GL I DA + ++ ++ LTDG N++ Sbjct: 280 VRDQLS--------DSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGV 331 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----------------LKNCA--SPD 340 ++ L AK G V+ I L+ A + Sbjct: 332 LNP---LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGG 388 Query: 341 RFYSVQNSRKLHDAFLRI 358 RF+ +++ +L + + Sbjct: 389 RFFRARDTEELAGIYAEL 406 >gi|166713250|ref|ZP_02244457.1| hypothetical protein Xoryp_17865 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 335 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 89/258 (34%), Gaps = 46/258 (17%) Query: 126 NLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 MP W + P + ++ ++ MM+ +D+S SMN+ Sbjct: 57 QAKTTPSLRMPRWLAWLGWFLLCAALARPQQLGEVIQPPREAR---QMMLAVDLSGSMNE 113 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 G +D+L A + + LD + R GL+ F + PL + Sbjct: 114 PDMVLGGKVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYALTPLTADLTS 166 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 +++++ S GL I DA + ++ ++ LTDG N++ Sbjct: 167 VRDQLR--------DSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGV 218 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQ-------------AEAADQF----LKNCA--SPD 340 +D L AK G ++ I A D L+ A + Sbjct: 219 LDP---LKAAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGG 275 Query: 341 RFYSVQNSRKLHDAFLRI 358 RF+ +++ +L + + Sbjct: 276 RFFRARDTEELAGIYAEL 293 >gi|58580793|ref|YP_199809.1| hypothetical protein XOO1170 [Xanthomonas oryzae pv. oryzae KACC10331] gi|58425387|gb|AAW74424.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 335 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 89/258 (34%), Gaps = 46/258 (17%) Query: 126 NLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 MP W + P + ++ ++ MM+ +D+S SMN+ Sbjct: 57 QAKTTPSLRMPRWLAWLGWFLLCAALARPQQLGEVIQPPREAR---QMMLAVDLSGSMNE 113 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 G +D+L A + + LD + R GL+ F + PL + Sbjct: 114 PDMVLGGKVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYALTPLTADLTS 166 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 +++++ S GL I DA + ++ ++ LTDG N++ Sbjct: 167 VRDQLR--------DSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGV 218 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQ-------------AEAADQF----LKNCA--SPD 340 +D L AK G ++ I A D L+ A + Sbjct: 219 LDP---LKAAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGG 275 Query: 341 RFYSVQNSRKLHDAFLRI 358 RF+ +++ +L + + Sbjct: 276 RFFRARDTEELAGIYAEL 293 >gi|268316013|ref|YP_003289732.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333547|gb|ACY47344.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 329 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 48/218 (22%), Positives = 78/218 (35%), Gaps = 50/218 (22%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+M+VLD+S SM F P + VA R+ + + R GLV F+ + Sbjct: 87 GRDLMLVLDLSSSMLAQDFSP--SRFEVARRTAIQFVQ-------GRRADRIGLVVFAGQ 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P + + + RL G T + A N++ K + Sbjct: 138 AFTQVPPTLDYRFLLTMLQRLQVGRLEDGTAIGTAIATAINRL-----------KNSEAR 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------------ 331 K II LTDG+N+ ID L A++ G +Y IG+ Sbjct: 187 SKVIILLTDGQNNRGEIDP---LTAAELARQAGIRIYTIGLSGRGEAPYPVQTPFGTRPQ 243 Query: 332 ---------FLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 ++ A + R++ ++R L + I Sbjct: 244 PVPVEIDEAMMREVAEKTGGRYFRATDARTLEAIYAEI 281 >gi|325927915|ref|ZP_08189139.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] gi|325541755|gb|EGD13273.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] Length = 338 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 89/258 (34%), Gaps = 46/258 (17%) Query: 126 NLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + MP W + P + ++ ++ MM+ +D+S SM++ Sbjct: 57 QAKSAPVLRMPRWLAWLAWFLLCAALARPQQLGEVIQPPREAR---QMMLAVDLSGSMSE 113 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 G +D+L A + + LD + R GL+ F + PL + Sbjct: 114 PDMVLGGKVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYALTPLTADLTS 166 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 ++++++ S GL I DA + ++ ++ LTDG N++ Sbjct: 167 VRDQLS--------DSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGA 218 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----------------LKNCA--SPD 340 ++ L AK G V+ I L+ A + Sbjct: 219 LNP---LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGG 275 Query: 341 RFYSVQNSRKLHDAFLRI 358 RF+ +++ +L + + Sbjct: 276 RFFRARDTEELAGIYAEL 293 >gi|87311197|ref|ZP_01093320.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] gi|87286105|gb|EAQ78016.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] Length = 373 Score = 106 bits (264), Expect = 5e-21, Method: Composition-based stats. Identities = 55/378 (14%), Positives = 123/378 (32%), Gaps = 46/378 (12%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G++ IL A+LLPVI + ++ ++ + +L D + A + +++ + Sbjct: 18 RRGAVLILIAVLLPVILWMAAFCVDVAYMQLTRTELRIATDSAARAGARTLSLEQDASL- 76 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 K+ Y N+ L ++ ++ +++ R Sbjct: 77 -AHKSAIEYAAKNNVAGNTL--TLADSDVQIGLSVRTDDVGRFTFSSGGKLLNSVNVTGR 133 Query: 133 YEMPFIFCTFPWCANSSHAPLLIT----SSVKISSKSDIGLDMMMVLDVSLSMNDHFG-- 186 P A + + V ++ S I D+ +V+D S SM Sbjct: 134 RTQQA-----PDGAVRLYLTPIFGHEFFQPVADATASQIDRDIALVVDRSGSMTFRINRN 188 Query: 187 ------------PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 P + S+ L + S P + L T++S L Sbjct: 189 SYESGWRNNDPVPSRARWWALVDSVDGFLTELGSTPQLEL---VSLSTYNSSAKIDEQLT 245 Query: 235 WGVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 I++ ++ R +T T G++ + + + K + +K ++ +T Sbjct: 246 DKYSRIEDALDDYSRRYPDGSTNITAGMDRGISTLQNKKYARPYASKT-------MVVMT 298 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSR 349 DG ++ + +A +V+ I A ++ A + + + Sbjct: 299 DGNHNYGS----SPTNAAYDAASDDIVVHTITYSDGANQSLMREVARIGGGQHWHAPDGD 354 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L + F I + Sbjct: 355 ELEEIFREIARNAPTLLT 372 >gi|294664114|ref|ZP_06729507.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292606114|gb|EFF49372.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 451 Score = 106 bits (264), Expect = 6e-21, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 89/258 (34%), Gaps = 46/258 (17%) Query: 126 NLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 ++ MP W + P + ++ ++ MM+ +D+S SM++ Sbjct: 170 QATSAPVLRMPRWLAWLAWFLLCAALARPQQLGEVIQPPREAR---QMMLAVDLSGSMSE 226 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 G +D+L A + + LD + R GL+ F + PL + Sbjct: 227 PDMVLGGKVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYALTPLTADLTS 279 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 +++++ S GL I DA + ++ ++ LTDG N++ Sbjct: 280 VRDQLR--------DSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGV 331 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----------------LKNCA--SPD 340 ++ L AK G V+ I L+ A + Sbjct: 332 LNP---LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGG 388 Query: 341 RFYSVQNSRKLHDAFLRI 358 RF+ +++ +L + + Sbjct: 389 RFFRARDTEELAGIYAEL 406 >gi|294627092|ref|ZP_06705680.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292598525|gb|EFF42674.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 451 Score = 106 bits (264), Expect = 6e-21, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 89/258 (34%), Gaps = 46/258 (17%) Query: 126 NLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 ++ MP W + P + ++ ++ MM+ +D+S SM++ Sbjct: 170 QATSAPVLRMPRWLAWLAWFLLCAALARPQQLGEVIQPPREAR---QMMLAVDLSGSMSE 226 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 G +D+L A + + LD + R GL+ F + PL + Sbjct: 227 PDMVLGGKVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYALTPLTADLTS 279 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 +++++ S GL I DA + ++ ++ LTDG N++ Sbjct: 280 VRDQLR--------DSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGV 331 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----------------LKNCA--SPD 340 ++ L AK G V+ I L+ A + Sbjct: 332 LNP---LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGG 388 Query: 341 RFYSVQNSRKLHDAFLRI 358 RF+ +++ +L + + Sbjct: 389 RFFRARDTEELAGIYAEL 406 >gi|157962424|ref|YP_001502458.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157847424|gb|ABV87923.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 336 Score = 106 bits (264), Expect = 6e-21, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 89/235 (37%), Gaps = 45/235 (19%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHF---GPGMDKLGVATRSIREML 203 + PL + ++++ SK G D+M+ +D+S SM + G +D+ + I + + Sbjct: 66 ACARPLWVGEAIELPSK---GRDLMLSVDLSGSMQIEDMVIDGKVVDRFTLIQHVISDFI 122 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + R GL+ F+ PL + + + + G K T Sbjct: 123 E-------RRKGDRIGLILFADHAYLQSPLTQDRRSVAQYLKEAQIGLVGKQT------- 168 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 I +A + + +I LTDG N++ I +++ + RG +Y IG Sbjct: 169 -AIGEAIALGVKRFDKVEQSNRVLILLTDGSNNAGAITPEQASQIAAQ---RGITIYTIG 224 Query: 324 VQAE--------------AADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 V A+ + L+ A + +++ +N+ +L + I Sbjct: 225 VGADVMERRTLFGKERVNPSMDLDESQLQEIAKVTGGQYFRARNTEELEQIYQVI 279 >gi|254514588|ref|ZP_05126649.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] gi|219676831|gb|EED33196.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] Length = 347 Score = 106 bits (264), Expect = 6e-21, Method: Composition-based stats. Identities = 43/257 (16%), Positives = 95/257 (36%), Gaps = 49/257 (19%) Query: 129 AVSRYEMPFIFCTFPW--CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 ++P I W ++ PL + ++++ + G D+M+ +D+S SM Sbjct: 53 VRVSAKIPAIALWTIWLCMLLAAARPLWVGEAIELP---NSGRDLMLAVDISGSMRVEDM 109 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 +++ +++++ S + R GL+ F S+ PL++ +Q +Q + Sbjct: 110 QVGNRMARRIDAVKQLGSDFMS---RRSGDRLGLILFGSRAYLQSPLSFDIQTVQRFLLE 166 Query: 247 LI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 G T + A ++ + + +I LTDG++++ +D Sbjct: 167 SQIGFAGQETAIGDAIGLAVKRLQERPAT-----------SRVLILLTDGQDTASTVDP- 214 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEA-------ADQF---------------LKNCASP-- 339 L N A G +Y IG+ A++ L AS Sbjct: 215 --LEAANLAADLGVRIYTIGIGADSLTLPGLLGSPLGARTVNPSADLDENSLIAIASSTG 272 Query: 340 DRFYSVQNSRKLHDAFL 356 +++ ++ +L + Sbjct: 273 GQYFRARDPEELATVYR 289 >gi|152995759|ref|YP_001340594.1| von Willebrand factor type A [Marinomonas sp. MWYL1] gi|150836683|gb|ABR70659.1| von Willebrand factor type A [Marinomonas sp. MWYL1] Length = 342 Score = 106 bits (264), Expect = 6e-21, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 86/242 (35%), Gaps = 51/242 (21%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSI 199 + P+ + K++ G D+++ LD+S SM G ++L A + Sbjct: 66 LLIFAMTQPVWLGEPTKVTP---SGRDLLIALDLSGSMQVTDMALNGQPANRLEAAKSVL 122 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKST 256 + + R G++ F SK PL++ + I + + G T Sbjct: 123 SDFIQE-------RRGDRIGIIVFGSKAYLQAPLSFDTKTINQLVQEAQIGFAGEQTAIG 175 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + ++ D KK +I +TDG N++ + +++ A + Sbjct: 176 DAIGLGIKRLEDKPSD-----------KKVLILMTDGANTAGRVQPQQA---ATFAASQN 221 Query: 317 AIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 ++ IG+ A++ + LKN A + ++ +++ L + Sbjct: 222 VKIHTIGIGADSMIVQSFFGPKAINPSSDLDETLLKNIAAQTGGEYFRAKSTEDLQAIYQ 281 Query: 357 RI 358 + Sbjct: 282 TL 283 >gi|254481548|ref|ZP_05094792.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038176|gb|EEB78839.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 345 Score = 106 bits (263), Expect = 8e-21, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 92/252 (36%), Gaps = 49/252 (19%) Query: 134 EMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 +P + W ++ P+ + +++ + G D+M+ +D+S SM D+ Sbjct: 57 RLPVLALWSIWLLLLMAAARPVWVGEPIELP---NSGRDLMLAVDISGSMKIEDMEVSDE 113 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 L R+++ + + R GL+ F S PL++ ++ + Sbjct: 114 LVSRIRAVK---QVGSRFIEQREGDRLGLILFGSNAYVQSPLSFDTATVKRFLLEAQIGF 170 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + A ++ K + +I L+DG++++ ++ + L Sbjct: 171 AGQDTAIGDAIGLAVKRL-----------KERPAENRVLILLSDGKDTASSV---QPLNA 216 Query: 309 CNEAKRRGAIVYAIGVQAE------------AADQF----------LKNCA--SPDRFYS 344 A G +Y IG+ A+ A Q L+ A + +++ Sbjct: 217 AKLAADLGIRIYTIGIGADSLTMPGLFGSSFGARQVNPSAELDEAGLQQIAKITDGKYFR 276 Query: 345 VQNSRKLHDAFL 356 +N +L + + Sbjct: 277 ARNPEELANIYQ 288 >gi|84386025|ref|ZP_00989055.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] gi|84379341|gb|EAP96194.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] Length = 345 Score = 106 bits (263), Expect = 8e-21, Method: Composition-based stats. Identities = 42/261 (16%), Positives = 95/261 (36%), Gaps = 37/261 (14%) Query: 116 IIIDDQHKDYNLSAVSRYEMP--FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 I + Q +SR ++ + ++ + P+ + ++ + ++M+ Sbjct: 42 IAVSSQEPSETAVKMSRRKVQWLLVIISYLALIVAIAKPMWVGEPIE---QKKSAREIMV 98 Query: 174 VLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+S SM+ D G D+L +A +++ R GL+ F+ Sbjct: 99 ALDLSGSMSEEDFADKKGNKHDRLTIAK-------QVLREFAAQREHDRLGLILFADSAY 151 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P + Q + + G G + A+ DA + + ++ +I Sbjct: 152 VQAPFTEDINVWQSLLEDVELG-----YAGFKTAF---GDAIGLSIAVFEQEQSRQRVMI 203 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK---------NCASP 339 LTDG+++S + + A + G +Y I + + K + A+ Sbjct: 204 LLTDGDDTSSKMPP---VKAAEIAAKYGVKIYTIAIGDPSTKGRYKMDLPTLEKVSAATG 260 Query: 340 DRFYSVQNSRKLHDAFLRIGK 360 + + + ++L A+ I + Sbjct: 261 GQMFHAMDRKQLDQAYATIDQ 281 >gi|120554865|ref|YP_959216.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324714|gb|ABM19029.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 339 Score = 106 bits (263), Expect = 9e-21, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 90/238 (37%), Gaps = 45/238 (18%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREM 202 + P + +++ G D+M+V+D+S SM++ G +++L R + + Sbjct: 70 VALARPQHVGEEIQMPV---TGRDLMLVVDISPSMDEQDMVLQGRSINRLQAVKRVLDDF 126 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 +D R GL+ F ++ PL + + ++ + ++ G+ Sbjct: 127 IDQ-------REGDRLGLILFGTEPYVQAPLTFDRETVRTLL--------FEAGLGMAGR 171 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I DA + ++ +I LTDG N++ + ++ A G +Y I Sbjct: 172 ATAIGDAIGLSVKRLRERPQEQRVVILLTDGANTAGQVSPDKATEIAQAA---GVRLYTI 228 Query: 323 GVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 G+ A+ ++ L A + R++ ++ +L + I + Sbjct: 229 GIGADTMIQRGLLGSRRVNPSRDLDEELLTRMAEQTGGRYFRARSLPELEMIYDSINQ 286 >gi|21244101|ref|NP_643683.1| hypothetical protein XAC3376 [Xanthomonas axonopodis pv. citri str. 306] gi|21109728|gb|AAM38219.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 323 Score = 106 bits (263), Expect = 9e-21, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 89/258 (34%), Gaps = 46/258 (17%) Query: 126 NLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 ++ MP W + P + ++ ++ MM+ +D+S SM++ Sbjct: 45 QATSAPVLRMPRWLAWLAWFLLCAALARPQQLGEVIQPPREAR---QMMLAVDLSGSMSE 101 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 G +D+L A + + LD + R GL+ F + PL + Sbjct: 102 PDMVLGGKVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYALTPLTADLTS 154 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 +++++ S GL I DA + ++ ++ LTDG N++ Sbjct: 155 VRDQLR--------DSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGV 206 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----------------LKNCA--SPD 340 ++ L AK G V+ I L+ A + Sbjct: 207 LNP---LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGG 263 Query: 341 RFYSVQNSRKLHDAFLRI 358 RF+ +++ +L + + Sbjct: 264 RFFRARDTDELAGIYAEL 281 >gi|119470787|ref|ZP_01613398.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] gi|119446014|gb|EAW27293.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] Length = 328 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 45/237 (18%), Positives = 95/237 (40%), Gaps = 44/237 (18%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREM 202 +++ P + + + ++ G D+M+ +D+S SM + G +D+L + + + Sbjct: 68 SATANPTWLDEPILLPNE---GRDIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLSDF 124 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ R GL+ F PL ++ + + +N G ++T Sbjct: 125 IEQ-------RTGDRLGLILFGDTAFLQTPLTRDLKTVTKMLNEAQIGLVGRAT------ 171 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I DA DD + ++ LTDG+N++ N++ ++L A+ G VY I Sbjct: 172 --AIGDALGLSVKRFASKDDSNRIVVLLTDGQNTAGNLNPDDALLL---AREEGIKVYTI 226 Query: 323 GVQAEA-----------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 GV ++ ++ LKN A + ++ ++ L + + K Sbjct: 227 GVGSDNPRGFSLFNMGGSGGSNLDERLLKNIADDTGGLYFRAKDVAGLKQIYAELDK 283 >gi|90021389|ref|YP_527216.1| BatB protein [Saccharophagus degradans 2-40] gi|89950989|gb|ABD81004.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 341 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 42/247 (17%), Positives = 90/247 (36%), Gaps = 48/247 (19%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKL 192 C + ++ P+ I V + + G D+++ +D+S SM+ + ++ Sbjct: 62 LAVCVWLLLVTAAAKPVWIGEEVHLPT---TGRDLLVAVDISGSMDTKDMVVQNQQIPRI 118 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 V + + ++ R GLV F + PL + +++ + Sbjct: 119 AVVKHIVGDFIE-------RRVGDRLGLVLFGTSAYLQSPLTFDRTTVKQLL-------- 163 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +S G I DA + + +I LTDG+N++ + +++ + A Sbjct: 164 VESQIGFAGPNTAIGDAIGLSIKRLRDRPAENRVVILLTDGQNTAGEVSPRQA---ADLA 220 Query: 313 KRRGAIVYAIGVQA---------------------EAADQFLKNCA--SPDRFYSVQNSR 349 K+ G VY IGV A + + L A + R++ + + Sbjct: 221 KQSGVKVYTIGVGANEMIVSDGFFGNFQRKINPSRDLDEDTLTYIAETTGGRYFRAHSPQ 280 Query: 350 KLHDAFL 356 +L+ + Sbjct: 281 ELNQIYQ 287 >gi|157374763|ref|YP_001473363.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] gi|157317137|gb|ABV36235.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] Length = 330 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 90/244 (36%), Gaps = 51/244 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHF---GPGMDKLGVATRSI 199 + PL + +++ SK G D+MM +D+S SM + G +D+ + + Sbjct: 61 LLIIAVARPLWMGDPIELPSK---GRDLMMAVDLSGSMQIEDMVLDGKTVDRFTMIQAVV 117 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKST 256 + ++ + GL+ F+ PL + + + + G T Sbjct: 118 SDFIE-------RRKGDKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIG 170 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A + E + ++ LTDG N+S +I +++ A +RG Sbjct: 171 EAIALAVKRFDRVDESN-----------RILVLLTDGSNNSGSISPEQAAAI---AAKRG 216 Query: 317 AIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 +Y+IGV AE + L A + ++ +N+++L + Sbjct: 217 VKIYSIGVGAEVMERRTLFGKERVNPSMDLDETQLTALAQTTGGLYFRARNAQELESIYQ 276 Query: 357 RIGK 360 I K Sbjct: 277 EIDK 280 >gi|53802771|ref|YP_115472.1| batB protein [Methylococcus capsulatus str. Bath] gi|53756532|gb|AAU90823.1| putative batB protein [Methylococcus capsulatus str. Bath] Length = 328 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 45/259 (17%), Positives = 96/259 (37%), Gaps = 55/259 (21%) Query: 128 SAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DH 184 S + P I W ++ P + ++ ++ G D+M+ +D+S SM+ + Sbjct: 50 STMPGSRAPLILAVVGWLLLVAAAARPQWLGEPIE---QTVSGRDLMLAVDLSGSMDIED 106 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIP----DVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 F V + L+ +K + + + R GL+ F + PL + + + Sbjct: 107 F--------VVDGEVSNRLEAVKRVASAFIERRSGDRIGLILFGEQAYLQVPLTFDRKTV 158 Query: 241 QEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 ++ ++ + G T + A ++ D ++ +I L+DG N++ Sbjct: 159 EKLLDEAAIGLAGDKTAIGDAIGLAIKRLRDNPAD-----------QRVLILLSDGANTA 207 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAE-------------AADQFLKNCA------- 337 + + L A R G +Y IGV A+ + L A Sbjct: 208 GQV---QPLQAAELAAREGLKIYTIGVGADEMIVRDFFGTRRVNPSEDLDEAAMTAIAEK 264 Query: 338 SPDRFYSVQNSRKLHDAFL 356 + R++ +N+ +L + Sbjct: 265 TGGRYFRARNTEELDRIYA 283 >gi|256823198|ref|YP_003147161.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796737|gb|ACV27393.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 348 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 89/247 (36%), Gaps = 45/247 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKL 192 F + + P + ++ + + G D+M+ +D+S SM +D+L Sbjct: 57 LTFLIWALMITALARPQWVGDTMDLPA---TGRDLMISIDISGSMEMPDMVIEDKEVDRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + + + R G++ F + PL + ++ +Q ++ G Sbjct: 114 VAVKALLTDFI-------ARRKGDRVGMILFGEQAYLQTPLTFDLKTVQTMLDETTIGLA 166 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 S I D + D + +I LTDG+N++ ++ ++ A Sbjct: 167 GSSRTA-------IGDGIGLAVKRLRERDANNRVLILLTDGQNNTGALNPLQAAELAEHA 219 Query: 313 KRRGAIVYAIGVQAEA-------------------ADQFLKNCA--SPDRFYSVQNSRKL 351 G +Y IGV A+ ++ L A + R++ ++++++ Sbjct: 220 ---GITIYTIGVGADEMIVKNRFFGNRRINPSLELDEESLIAVAEKTGGRYFRARDTKEM 276 Query: 352 HDAFLRI 358 + + I Sbjct: 277 EEIYQII 283 >gi|84622723|ref|YP_450095.1| hypothetical protein XOO_1066 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84366663|dbj|BAE67821.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 335 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 90/258 (34%), Gaps = 46/258 (17%) Query: 126 NLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 MP W + P + ++ ++ MM+ +D+S SMN+ Sbjct: 57 QAKITPSLRMPRWLAWLGWFLLCAALARPQQLGEVIQPPREAR---QMMLAVDLSGSMNE 113 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 G +D+L A + + LD + R GL+ F + PL + Sbjct: 114 PDMVLGGKVVDRLTAAKAVLSDFLD-------RRDGDRVGLLVFGQRAYALTPLTADLTS 166 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 +++++ S GL I DA + ++ ++ LTDG N++ Sbjct: 167 VRDQLR--------DSVVGLAGRETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGV 218 Query: 300 IDNKESLFYCNEAKRRGAIVYAIG-------------VQAEAADQF----LKNCA--SPD 340 +D L AK G ++ I + A D L+ A + Sbjct: 219 LDP---LKAAELAKAEGVRIHTIAFGGGGGSSLFGVPIPAGGNDDIDEDGLRKIAQQTGG 275 Query: 341 RFYSVQNSRKLHDAFLRI 358 RF+ +++ +L + + Sbjct: 276 RFFRARDTEELAGIYAEL 293 >gi|194367004|ref|YP_002029614.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] gi|194349808|gb|ACF52931.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] Length = 334 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 43/245 (17%), Positives = 90/245 (36%), Gaps = 45/245 (18%) Query: 138 IFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDK 191 + WCA + P + ++ + G MM+ +DVS SM + G +D+ Sbjct: 69 LLLWLGWCALCVALARPQQLGEAITPPQE---GRQMMLAMDVSGSMGEGDMVLGGQAVDR 125 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L A + + LD R GL+ F + PL + +++++ Sbjct: 126 LTAAKAVLADFLD-------RRAGDRIGLLVFGDRAYTLTPLTADLASVRDQLR------ 172 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 S GL I DA + + ++ +I LTDG +++ ++ L Sbjct: 173 --DSVVGLAGRETAIGDAIGLAVKRLRSQPEGQRVLILLTDGVSNAGVLEP---LRAAEV 227 Query: 312 AKRRGAIVYAIGVQAEAADQF----------------LKNCA--SPDRFYSVQNSRKLHD 353 A+ G ++ + + + + LK A + +F+ +++ +L Sbjct: 228 ARAEGVRIHTVAFGGDGSMRLFGIPISADQDPVDEATLKKIATMTGGQFFRARDTAQLAG 287 Query: 354 AFLRI 358 + + Sbjct: 288 IYAEL 292 >gi|114778216|ref|ZP_01453088.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] gi|114551463|gb|EAU54018.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] Length = 355 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 44/268 (16%), Positives = 94/268 (35%), Gaps = 45/268 (16%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 +S R M + ++ P +V + G D+++ +D+S Sbjct: 61 PHASTSTVSRQERIRMLLAALVWVLLLFAAARPAWYDDAVALPV---SGRDLLLAVDISG 117 Query: 180 SMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 SM + G + +L R+ + R GL+ F S PL + Sbjct: 118 SMQIKDFEMNGQQVSRLTATKAVARQFI-------SRRVGDRVGLILFGSNAYVQTPLTF 170 Query: 236 GVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + ++ + G T + A ++ + +K ++ +I LTD Sbjct: 171 DRKTVITLLDEAAVGLAGKATAIGDAIGLAVKRLEQSNRDKRIASK-----EQVLILLTD 225 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLK 334 G N++ + ++ A G +Y IG+ A+A ++ L Sbjct: 226 GVNTAGQLSAPQA---AELAAEHGLTIYTIGIGADAMTVQSFFGTQRVNPSADLDEKMLT 282 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + A + R++ ++++L + I K Sbjct: 283 DIATKTGGRYFRAHDTQELQKIYAMIDK 310 >gi|329896848|ref|ZP_08271743.1| BatA [gamma proteobacterium IMCC3088] gi|328921553|gb|EGG28934.1| BatA [gamma proteobacterium IMCC3088] Length = 328 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 44/259 (16%), Positives = 83/259 (32%), Gaps = 37/259 (14%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMM 173 + + + S R + + WC + P+ + +KI D+M+ Sbjct: 42 VDLSGETPRSGASVRRRLVIQAVASLIGWCLLVLALARPVWVGEPIKI---EKTARDLML 98 Query: 174 VLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+S SM D G D+L A ++K R GL+ F S Sbjct: 99 AVDISGSMEATDFVDATGKQTDRLSAAK-------QVLKQFVAGREGDRLGLIVFGSAAY 151 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P + ++ I + + DA + + +I Sbjct: 152 LQAPFTDDRETWLALLDESIVN--------MAGPSTALGDAIGLSIAHFRESKTKNRVLI 203 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SP 339 LTDG ++ + L AK G +Y + V + L + A + Sbjct: 204 VLTDGNDTGSKVPP---LDAAQVAKAEGVTIYTVAVGDPETVGEEALDLEVLDSIAQTTG 260 Query: 340 DRFYSVQNSRKLHDAFLRI 358 ++ + + L + + RI Sbjct: 261 GVSFNAADLKALQETYQRI 279 >gi|119504633|ref|ZP_01626712.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] gi|119459655|gb|EAW40751.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] Length = 332 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 89/238 (37%), Gaps = 35/238 (14%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 ++ + + ++ P + +++ ++ G D+++ +D+S SM + L Sbjct: 58 KLVVLVLIWLSLVTAATRPQWVGEPIEL---ANSGRDLLLAIDLSGSMQIEDMQIGNSLV 114 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FG 250 +++ I R GL+ F ++ PL + V+ +++ I G Sbjct: 115 SRITAVKA---IAADFASRRTGDRVGLILFGTRAYVQAPLTFDVKTVKQFIEEAQLGFAG 171 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L A ++ + + +I LTDG++++ +D E+ + Sbjct: 172 EDTAIGDALGLAVKRLRERPAD-----------SRVLILLTDGQDTASTVDPMEAAALAS 220 Query: 311 EAKRRGAIVYAIGV----------QAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFL 356 E +Y IG+ E + L A+ R++ + ++L D + Sbjct: 221 EM---NVKIYTIGISRRLGTSSNSSGEVDEALLTAIAQATGGRYFRARTPKELQDIYQ 275 >gi|21232653|ref|NP_638570.1| hypothetical protein XCC3224 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767265|ref|YP_242027.1| hypothetical protein XC_0933 [Xanthomonas campestris pv. campestris str. 8004] gi|21114459|gb|AAM42494.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572597|gb|AAY48007.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 335 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 87/243 (35%), Gaps = 45/243 (18%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDKLGV 194 F CA + P + ++ ++ MM+ +D+S SM++ G +D+L Sbjct: 73 LGWFLLCA-ALARPQQLGEVIQPPREAR---QMMLAVDLSGSMSEPDMVLGGNVVDRLTA 128 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + + LD R GL+ F + PL + +++++ Sbjct: 129 AKAVLSDFLD-------RREGDRVGLLVFGQRAYALTPLTADLTSVRDQL--------AD 173 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 S GL I DA + ++ ++ LTDG N++ ++ L AK Sbjct: 174 SVVGLAGRETAIGDAIALSVKRLREQRQGQRVVVLLTDGVNTAGVLNP---LKAAELAKA 230 Query: 315 RGAIVYAIGVQAEAADQF-----------------LKNCA--SPDRFYSVQNSRKLHDAF 355 G V+ I + L+ A + RF+ +++ +L + Sbjct: 231 EGVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRARDTEELAGIY 290 Query: 356 LRI 358 + Sbjct: 291 AEL 293 >gi|119775307|ref|YP_928047.1| von Willebrand factor type A domain-containing protein [Shewanella amazonensis SB2B] gi|119767807|gb|ABM00378.1| von Willebrand factor type A domain protein [Shewanella amazonensis SB2B] Length = 327 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 88/248 (35%), Gaps = 45/248 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHF---GPGMDKL 192 + + + P + +++ SK G D+M+ +D+S SM + +D+ Sbjct: 54 LKWLMWTALVLAVARPQWLGDPIELPSK---GRDLMVAVDLSGSMQIEDMVLDNKTVDRF 110 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + + + ++ R GL+ F P+ + + + + G Sbjct: 111 TLVQHVVSDFIE-------RRVGDRIGLILFGDHAYLQSPMTQDRRSVAQYLREAQIGLV 163 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 K T I ++ + ++ + ++ LTDG N++ +I ++ E Sbjct: 164 GKQT--------AIGESIALAVKRFENLEESNRVLVLLTDGTNNAGSISPDKAAAIAAER 215 Query: 313 KRRGAIVYAIGVQAE-------------------AADQFLKNC-ASPDRFYSVQNSRKLH 352 K +Y IGV AE +Q + A+ +++ ++S L Sbjct: 216 K---VTIYTIGVGAEMMERRSFFGRDRVNPSMDLDEEQLQRIANATQGKYFRARSSEDLA 272 Query: 353 DAFLRIGK 360 + I K Sbjct: 273 AIYQEIDK 280 >gi|308050346|ref|YP_003913912.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] gi|307632536|gb|ADN76838.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] Length = 322 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 91/240 (37%), Gaps = 53/240 (22%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSIREMLDI 205 PL I V + + G D+M+ +D+S SM + G +D+ + + + ++ Sbjct: 68 SRPLWIGEPVAMKRE---GRDLMLAVDLSGSMQIEDMELGNRVVDRFTMVRHVLSDFIE- 123 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYA 262 + R GL+ F+ + PL + + ++ + G T + Sbjct: 124 ------RRDGDRLGLILFADQAYLQAPLTFDRFAVARFLDEAVLGLVGQQTAIGDAIALG 177 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + D ++ + ++ LTDGEN++ +++ A++ G +Y I Sbjct: 178 VKRFNDLEQS-----------SRVLVLLTDGENNAGRFTPAQAVSL---ARQSGVKLYTI 223 Query: 323 GVQA---------------------EAADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGK 360 G+ + +A F++ ++ R++ +++ +L + + + Sbjct: 224 GIGSAEIRRRGLLGTRTVNPSSDLDQAEKSFIQLSESTGGRYFRARSTEELESIYQELDQ 283 >gi|332664649|ref|YP_004447437.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333463|gb|AEE50564.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 328 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 60/278 (21%), Positives = 101/278 (36%), Gaps = 58/278 (20%) Query: 126 NLSAVSRYEMPFIFCT-FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-ND 183 + A R +P F + P L+ KI + G+D+M+ +D+S SM Sbjct: 46 TIKAKLRRWLPIFRALGFAALVIALARPQLVLKEEKIKAN---GIDIMLSMDLSSSMLAQ 102 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 F P ++L V+ + ++ +K P R GLV F+ + PL + ++ Sbjct: 103 DFEP--NRLEVSKKMA---IEFVKGRPH----DRIGLVVFAGEAFTQCPLTTDHKILETF 153 Query: 244 INRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 + +L G T GL A N++ K K II LTDG N+ Sbjct: 154 LEQLECGNLEDGTAIGMGLAGAVNRL-----------KKSPAKSKVIILLTDGVNNVGYF 202 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAAD-----------------------QFLKNCA 337 + L AK G VY+IGV + L+ A Sbjct: 203 ---KPLTAGELAKELGIKVYSIGVGTIGEALTPVSRLSDGSFFLDYAQVEIDEELLREIA 259 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGK--EMVKQRILYNK 371 + +++ +N++ L + I + + Q K Sbjct: 260 RMTGGQYFRAKNNQDLRQIYNTIDRLEKTEIQVTRIKK 297 >gi|315126124|ref|YP_004068127.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] gi|315014638|gb|ADT67976.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] Length = 327 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 92/236 (38%), Gaps = 43/236 (18%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREM 202 +S P + + + ++ G D+M+ +D+S SM + G +D+L + + + Sbjct: 68 TASANPTWLDEPISLPNE---GRDIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLSDF 124 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ R GL+ F PL V+ + + ++ G ++T Sbjct: 125 IEQ-------RQGDRLGLILFGDTAFLQTPLTRDVKTVSKMLSEAQIGLVGRAT------ 171 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I DA + + ++ LTDG+N++ N++ +++L A+ G VY I Sbjct: 172 --AIGDALGLSVKRFASKKESNRIVVLLTDGQNTAGNLNPEDALLL---AREEGIKVYTI 226 Query: 323 GVQAEA----------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 GV ++ + LK A + ++ ++ L + + K Sbjct: 227 GVGSDNPRGFSLFNMGSGGSNLDEGLLKKIAEQTGGLYFRAKDVAGLQQIYAELDK 282 >gi|313159758|gb|EFR59115.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 330 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 50/223 (22%), Positives = 81/223 (36%), Gaps = 53/223 (23%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM F P D++ A + R GLV F+ + Sbjct: 87 GIDIMLAIDVSGSMLARDFRP--DRITAAKEVAGSFIAD-------RYGDRIGLVAFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL +Q + R+ G T GL A N++ + + Sbjct: 138 AFTQSPLTTDQGTLQTLLARIRSGLIEDGTAIGNGLATAINRL-----------RESEAK 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF----------- 332 K II LTDG N+ I + + AK +G VY IGV E + Sbjct: 187 SKVIILLTDGVNNRGEIAPQTA---AEIAKAQGIRVYTIGVGTEGMAPYPAVDIYGTPTG 243 Query: 333 -------------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L++ A + +++ + KL + +I + Sbjct: 244 GTVMAKVEIDEKTLRSIAEQTGGQYFRATDKAKLKAIYDQINQ 286 >gi|289667993|ref|ZP_06489068.1| hypothetical protein XcampmN_05693 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 310 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 84/235 (35%), Gaps = 44/235 (18%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDKLGVATRSIREM 202 + P + ++ ++ MM+ +D+S SM++ G +D+L A + + Sbjct: 55 AALARPQQLGEVIQPPREAR---QMMLAVDLSGSMSEPDMVLGGKVVDRLTAAKAVLSDF 111 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 LD + R GL+ F + PL + +++++ S GL Sbjct: 112 LD-------RRDGDRVGLLVFGQRAYALTPLTADLTSVRDQLR--------DSVVGLAGR 156 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I DA + ++ ++ LTDG N++ ++ L AK G V+ I Sbjct: 157 ETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGVLNP---LKAAELAKAEGVRVHTI 213 Query: 323 GVQAEAADQF-----------------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A + RF+ +++ +L + + Sbjct: 214 AFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRARDTEELAGIYAEL 268 >gi|289662175|ref|ZP_06483756.1| hypothetical protein XcampvN_03493 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 335 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 84/235 (35%), Gaps = 44/235 (18%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDKLGVATRSIREM 202 + P + ++ ++ MM+ +D+S SM++ G +D+L A + + Sbjct: 80 AALARPQQLGEVIQPPREAR---QMMLAVDLSGSMSEPDMVLGGKVVDRLTAAKAVLSDF 136 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 LD + R GL+ F + PL + +++++ S GL Sbjct: 137 LD-------RRDGDRVGLLVFGQRAYALTPLTADLTSVRDQLR--------DSVVGLAGR 181 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I DA + ++ ++ LTDG N++ ++ L AK G V+ I Sbjct: 182 ETAIGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGVLNP---LKAAELAKAEGVRVHTI 238 Query: 323 GVQAEAADQF-----------------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A + RF+ +++ +L + + Sbjct: 239 AFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRARDTEELAGIYAEL 293 >gi|154244802|ref|YP_001415760.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] gi|154158887|gb|ABS66103.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] Length = 345 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 48/251 (19%), Positives = 96/251 (38%), Gaps = 44/251 (17%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFG 186 SR + + + ++ P+ + + V I + G +MM+ +D+S SM+ G Sbjct: 56 SRLRLATLAFIWTLLVIAAARPVYVGTPVAIPVE---GREMMLAVDLSASMSSPDLVQSG 112 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 ++L V R + + R GL+ FS++ PL +++ + Sbjct: 113 VPANRLQVVKRVADDFI-------ARRTGDRIGLILFSTRAYVQAPLTLDRNVVRQLL-- 163 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 +++ G+ I DA + + +I LTDG N+S +D E+ Sbjct: 164 ------AEASIGMTGRNTSIGDAIGLAVKTLRDRPAKDRVLILLTDGANTSGVLDPMEAA 217 Query: 307 FYCNEAKRRGAIVYAIGVQAEAA-----------------DQFLKNCA--SPDRFYSVQN 347 AK ++ IGV A++ ++ LK A + +++ +N Sbjct: 218 AIA--AKEN-VRIHTIGVGADSNFTDIQPGMLMNPSGDLDEEALKKIAGLTGGQYFRARN 274 Query: 348 SRKLHDAFLRI 358 + L + I Sbjct: 275 DKGLAAIYADI 285 >gi|193214188|ref|YP_001995387.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193087665|gb|ACF12940.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 340 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 47/250 (18%), Positives = 83/250 (33%), Gaps = 52/250 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + S P L K+ S+ G+D+++ +D+S SM +++ A Sbjct: 69 LRMLALAFLILSFARPRLENQREKVFSE---GIDIVLAIDLSGSMLAEDFEPKNRIEAAK 125 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----T 252 + + R GLV FS K PL + + I+ L G+ Sbjct: 126 SVATDFI-------HQRLSDRIGLVVFSGKSFTQCPLTLDYRLLTNFISELKAGTIEEDG 178 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + A N++ + K II LTDG+N++ I+ + A Sbjct: 179 TAIGTAIATATNRL-----------RESTAKSKVIILLTDGQNNAGEIEP---VTAAELA 224 Query: 313 KRRGAIVYAIG----------------------VQAEAADQFLKNCA--SPDRFYSVQNS 348 G +Y +G ++ + D L A S R++ + Sbjct: 225 AALGIKIYTVGAGTRGYARYPIPDPLFGKRYVQMKVDVDDSTLTRIARISGGRYFRATDL 284 Query: 349 RKLHDAFLRI 358 L + I Sbjct: 285 ESLKKTYHEI 294 >gi|190575666|ref|YP_001973511.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] gi|190013588|emb|CAQ47223.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] Length = 334 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 45/245 (18%), Positives = 91/245 (37%), Gaps = 45/245 (18%) Query: 138 IFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDK 191 + WCA + P + ++ + G MM+ +DVS SM + G +D+ Sbjct: 69 LLLWLGWCALCVALARPQQLGEAITPPQQ---GRQMMLAMDVSGSMGEGDMVLGGQAVDR 125 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L A + + LD R GL+ F + PL + +++++ Sbjct: 126 LTAAKAVLADFLD-------RRAGDRIGLLIFGDRAYTLTPLTADLASVRDQLR------ 172 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 S GL I DA + + ++ +I LTDG +++ ++ L Sbjct: 173 --DSVVGLAGRETAIGDAIGLAVKRLRSQPEGQRVLILLTDGVSNAGVLEP---LRAAEV 227 Query: 312 AKRRGAIVYAIGVQAEAADQF----------------LKNCAS--PDRFYSVQNSRKLHD 353 A+ G ++ + + + +F LK AS +F+ +++ +L Sbjct: 228 AQAEGVRIHTVAFGGDGSMRFLGIPISADQDPVDEATLKKIASLTGGQFFRARDTAQLAG 287 Query: 354 AFLRI 358 + + Sbjct: 288 IYAEL 292 >gi|241667423|ref|ZP_04755001.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 333 Score = 103 bits (256), Expect = 5e-20, Method: Composition-based stats. Identities = 46/253 (18%), Positives = 86/253 (33%), Gaps = 38/253 (15%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGP 187 + Y + + S + V + G D+MM +D+S SM Sbjct: 55 TKANYLKYILSTIWILLIISGSGIQWLGKPVSLP---QSGRDLMMAIDLSGSMAIQDMQK 111 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 K+ ++ + D R GL+ F + PL + + +++ ++ Sbjct: 112 SNGKME---SRFDLVMRVANEFLDTRQGDRVGLILFGTWAYLQTPLTFDIPTVKKMLD-- 166 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 ++ L I DA K + K ++ LTDGEN+S + + L Sbjct: 167 ------DASIALPGPQTAIGDAIGLAVKKLKRYPGDSKALVLLTDGENNSGAL---QPLQ 217 Query: 308 YCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQN 347 AK+ +Y IG+ + + L+ A + +F+ QN Sbjct: 218 AAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIATMTGGKFFRAQN 277 Query: 348 SRKLHDAFLRIGK 360 S L + I + Sbjct: 278 STDLKQVYESIDQ 290 >gi|86134839|ref|ZP_01053421.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821702|gb|EAQ42849.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 336 Score = 103 bits (256), Expect = 5e-20, Method: Composition-based stats. Identities = 54/232 (23%), Positives = 90/232 (38%), Gaps = 53/232 (22%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 SV +K++ G+D++M +DVS SM P ++L + + +D Sbjct: 82 VSVSKRTKTNRGIDIVMAIDVSASMLARDLKP--NRLEALKKVAVDFVD-------RRPN 132 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEK 272 R G+V ++ + P+ ++ INRL +G T GL N++ D+K K Sbjct: 133 DRIGIVVYAGESFTQTPITSDKTIVKRTINRLQWGQLEGGTAIGMGLGSRVNRLKDSKAK 192 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---- 328 K II LTDG N++ NID + AK G VY IG+ Sbjct: 193 -----------SKVIILLTDGVNNAGNIDPTTATEL---AKELGIKVYTIGIGTNGMADF 238 Query: 329 --------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + LKN A + +++ ++ L + + I Sbjct: 239 PWSKDPRTGMLNFRKQQVQIDEDLLKNIAEETQGKYFRATDNTSLKEIYDEI 290 >gi|167626845|ref|YP_001677345.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596846|gb|ABZ86844.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 333 Score = 103 bits (256), Expect = 5e-20, Method: Composition-based stats. Identities = 46/253 (18%), Positives = 86/253 (33%), Gaps = 38/253 (15%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGP 187 + Y + + S + V + G D+MM +D+S SM Sbjct: 55 TKANYLKYILSAIWILLIISGSGIQWLGKPVSLP---QSGRDLMMAIDLSGSMAIQDMQK 111 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 K+ ++ + D R GL+ F + PL + + +++ ++ Sbjct: 112 SNGKME---SRFDLVMRVANEFLDTRQGDRVGLILFGTWAYLQTPLTFDIPTVKKMLD-- 166 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 ++ L I DA K + K ++ LTDGEN+S + + L Sbjct: 167 ------DASIALPGPQTAIGDAIGLAVKKLKRYPGDSKALVLLTDGENNSGAL---QPLQ 217 Query: 308 YCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQN 347 AK+ +Y IG+ + + L+ A + +F+ QN Sbjct: 218 AAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIATMTGGKFFRAQN 277 Query: 348 SRKLHDAFLRIGK 360 S L + I + Sbjct: 278 SADLKQVYESIDQ 290 >gi|254875972|ref|ZP_05248682.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841993|gb|EET20407.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 339 Score = 102 bits (255), Expect = 6e-20, Method: Composition-based stats. Identities = 46/253 (18%), Positives = 86/253 (33%), Gaps = 38/253 (15%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGP 187 + Y + + S + V + G D+MM +D+S SM Sbjct: 61 TKANYLKYILSTIWILLIISGSGIQWLGKPVSLP---QSGRDLMMAIDLSGSMAIQDMQK 117 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 K+ ++ + D R GL+ F + PL + + +++ ++ Sbjct: 118 SNGKME---SRFDLVMRVANEFLDTRQGDRVGLILFGTWAYLQTPLTFDIPTVKKMLD-- 172 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 ++ L I DA K + K ++ LTDGEN+S + + L Sbjct: 173 ------DASIALPGPQTAIGDAIGLAVKKLKRYPGDSKALVLLTDGENNSGAL---QPLQ 223 Query: 308 YCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQN 347 AK+ +Y IG+ + + L+ A + +F+ QN Sbjct: 224 AAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIATMTGGKFFRAQN 283 Query: 348 SRKLHDAFLRIGK 360 S L + I + Sbjct: 284 STDLKQVYESIDQ 296 >gi|329894014|ref|ZP_08270022.1| BatA [gamma proteobacterium IMCC3088] gi|328923357|gb|EGG30676.1| BatA [gamma proteobacterium IMCC3088] Length = 339 Score = 102 bits (255), Expect = 6e-20, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 90/243 (37%), Gaps = 45/243 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVA 195 F + + P I + + G D+++ +D+S SM + A Sbjct: 62 LQFLIWSALVLALARPHWIGEPILLP---QSGRDLLLAVDISGSMRVEDMVIA----NQA 114 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGST 252 R I + DI + R GL+ F S+ PL++ +++ ++ GS Sbjct: 115 VRRIDAVRDIGAEFIERREGDRVGLILFGSRAYMQSPLSFDRDTVKQFLSEAQIGFAGSE 174 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T L A ++ D +D + +I LTDG++++ ++D ++ A Sbjct: 175 TAIGDALGLAVKRLRDK-----------EDGDRVVILLTDGQDTASSVDPLDATAL---A 220 Query: 313 KRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLH 352 G VY IG+ A+ ++ L A + R++ ++ +L Sbjct: 221 ANYGVKVYTIGIGADEMLVPSLFGNRRVNPSAELDEETLSAMAESTGGRYFRARSPDELA 280 Query: 353 DAF 355 + Sbjct: 281 KIY 283 >gi|146307954|ref|YP_001188419.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145576155|gb|ABP85687.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 334 Score = 102 bits (255), Expect = 7e-20, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 79/229 (34%), Gaps = 49/229 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDII 206 P + + + + G D+++ +DVS SM + +L + R + + ++ Sbjct: 75 RPQWVGEPLPLPA---SGRDLLLAVDVSGSMAYEDMHWDEQPISRLELVKRLLGDFIED- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAY 263 R GL+ F S+ PL + ++ ++ I G T + A Sbjct: 131 ------RRGDRVGLILFGSQAYLQAPLTFDRHTVRTWLDEAMIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ ID + A G +YAIG Sbjct: 185 KRLRQRPAQ-----------SRVLVLITDGANNGGEIDP---MVAAQLAAEEGVRIYAIG 230 Query: 324 VQAEAAD----------------QFLKNC--ASPDRFYSVQNSRKLHDA 354 + A+ L+ A+ ++ +N +L Sbjct: 231 IGADPRQSGVLGAFGFSALDLDETSLRAIAEATGGEYFRARNQAELTQI 279 >gi|254283762|ref|ZP_04958730.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] gi|219679965|gb|EED36314.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] Length = 325 Score = 102 bits (255), Expect = 7e-20, Method: Composition-based stats. Identities = 46/264 (17%), Positives = 97/264 (36%), Gaps = 28/264 (10%) Query: 112 TSLSIIIDDQHKDYNLSAVSR--YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 TS++ + + ++ A + Y + + + P + + K++ Sbjct: 39 TSITTAAGAEIRSGSVIASASWWYRL-VVIAVWLLLLVGLAKPQWVGEPIT---KTETAR 94 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+M+ +D+S SM+ PG D V+ + + + R GL+ F +K Sbjct: 95 DVMLAIDLSASMDYRDFPGPDGKPVSRFDAVQRV--VDQFVANREGDRVGLIVFGAKAYL 152 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + + ++ + G + + D+ + + + +I Sbjct: 153 QLPFTRDLNTARALVDLMQVG--------MAGPQTALGDSIGLAIRAFESSEVDDRVLIL 204 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAAD------QFLKNCA--SPD 340 LTDG +++ + + A+ G +Y IG+ AEA + L + A S Sbjct: 205 LTDGNDTASKMTP---INAAEIAQLNGIEIYTIGIGDAEATGEDRIDFETLASIAERSGG 261 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 +F+ Q+ L + RI V Sbjct: 262 QFFDAQDETALRQVYDRIDALAVA 285 >gi|114762302|ref|ZP_01441760.1| von Willebrand factor type A domain protein [Pelagibaca bermudensis HTCC2601] gi|114544920|gb|EAU47924.1| von Willebrand factor type A domain protein [Roseovarius sp. HTCC2601] Length = 335 Score = 102 bits (255), Expect = 7e-20, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 95/243 (39%), Gaps = 31/243 (12%) Query: 129 AVSRYEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DH 184 +SR ++ I W + P + + + I + D++M +D+S SM+ D Sbjct: 55 VLSRPKLSGISAVLCWGLLVLALARPERVGAPITI---ENAARDVVMAIDISGSMDARDF 111 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 P +++ + + +++++ R L+ F + PL ++ I + Sbjct: 112 ATPEGERIQRLS----GVREVVRAFVSGREGDRMALIVFGTSAYLQAPLTDDLETIIALL 167 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 +R + G+ + + DA + + ++ +I L+DG +++ + Sbjct: 168 DR--------TEVGMAGPHTALGDAIGLSIRTFETSEIDQRLLILLSDGSDTASRMSPVN 219 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAF 355 + A RG +Y IGV A LK A + +++ +++ L + Sbjct: 220 A---AEIAADRGVEIYTIGVGDPDATGENRVDLTTLKEVAQRTGGQYFFAEDAASLEAVY 276 Query: 356 LRI 358 RI Sbjct: 277 DRI 279 >gi|262403351|ref|ZP_06079911.1| protein BatA [Vibrio sp. RC586] gi|262350850|gb|EEY99983.1| protein BatA [Vibrio sp. RC586] Length = 248 Score = 102 bits (255), Expect = 7e-20, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 91/235 (38%), Gaps = 51/235 (21%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSIREMLDIIKS 208 + + ++ D+M+V+D+S SM+ + G +D+L + + E + Sbjct: 1 MWYGEPISTTTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVLSEFI----- 52 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNK 265 R GL+ F+ PL Q + E++N+ + G+ T G+ A Sbjct: 53 --AQREGDRIGLILFADHAYLQTPLTLDRQTVTEQLNQAVLKLIGTQTAMGEGIGLATKT 110 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ ++ +I L+DG N++ +D L N AK+ +Y +GV Sbjct: 111 FIDSA-----------APQRVMILLSDGSNTAGVLDP---LEAANIAKQYQTTIYTVGVG 156 Query: 326 A------------------EAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 A + ++ L+ AS +++ +N + L + I + Sbjct: 157 AGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 211 >gi|88707026|ref|ZP_01104723.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] gi|88698754|gb|EAQ95876.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] Length = 330 Score = 102 bits (254), Expect = 8e-20, Method: Composition-based stats. Identities = 45/261 (17%), Positives = 89/261 (34%), Gaps = 37/261 (14%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMM 173 +++ + S + R M I WC ++ P + + I D+M+ Sbjct: 42 VLLSGETPRSGASVLRRRRMQGIVSILGWCLLVLAAARPEWVGDPINI---EKSARDLML 98 Query: 174 VLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+S SM+ D G ++L A D+++ R GL+ F + Sbjct: 99 ALDLSGSMDARDFRDAEGHEQNRLTAAK-------DVLEGFAAQREGDRLGLIVFGNAAY 151 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P + Q + +S + + DA I + D + +I Sbjct: 152 LQAPFTDDRETWQTLL--------EESEVAMAGQSTALGDAIGLAISIFQASDTTNRVLI 203 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-------ADQFLKNCA--SP 339 LTDG ++ + +++ A +Y + V A + L A + Sbjct: 204 VLTDGNDTGSRVPPRDA---ATIAAANDVTIYTVAVGDPATIGEEALDLETLNAVAETTG 260 Query: 340 DRFYSVQNSRKLHDAFLRIGK 360 + +++ L A+ I + Sbjct: 261 GASFQALDTQALEKAYDEINR 281 >gi|188990358|ref|YP_001902368.1| hypothetical protein xccb100_0962 [Xanthomonas campestris pv. campestris str. B100] gi|167732118|emb|CAP50310.1| putative membrane protein [Xanthomonas campestris pv. campestris] Length = 335 Score = 102 bits (254), Expect = 8e-20, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 87/243 (35%), Gaps = 45/243 (18%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDKLGV 194 F CA + P + ++ ++ MM+ +D+S SM++ G +D+L Sbjct: 73 LGWFLLCA-ALARPQQLGDVIQPPREAR---QMMLAVDLSGSMSEPDMVLGGNVVDRLTA 128 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + + LD R GL+ F + PL + +++++ Sbjct: 129 AKAVLSDFLD-------RREGDRVGLLVFGQRAYALTPLTADLTSVRDQL--------AD 173 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 S GL I DA + ++ ++ LTDG N++ ++ L AK Sbjct: 174 SVVGLAGRETAIGDAIALSVKRLREQRHGQRVVVLLTDGVNTAGVLNP---LKAAELAKA 230 Query: 315 RGAIVYAIGVQAEAADQF-----------------LKNCA--SPDRFYSVQNSRKLHDAF 355 G V+ I + L+ A + RF+ +++ +L + Sbjct: 231 EGVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRARDTEELAGIY 290 Query: 356 LRI 358 + Sbjct: 291 AEL 293 >gi|323700353|ref|ZP_08112265.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323460285|gb|EGB16150.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 400 Score = 102 bits (254), Expect = 8e-20, Method: Composition-based stats. Identities = 63/432 (14%), Positives = 132/432 (30%), Gaps = 127/432 (29%) Query: 22 AILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSY 81 A+LLPV+ V G+ ++ + + +L +D L + ++ + + Sbjct: 2 ALLLPVLLGVAGIAVDMGNMYMTHTRLQAAVDAGALAGSLELPYDPDLS----------- 50 Query: 82 RIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT 141 K I + + N + I T + ++ ++ E+ + Sbjct: 51 ---KGIVTQAVNDMVETNMEEAVVTEISAGTEIR----------SVKVTAQAEVRMLLME 97 Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 A+ + + K L+++ V+D S SM G +D + A+ + + Sbjct: 98 VLGMADKTVEASAMAGFNK--------LEVVFVIDNSGSMK---GTPIDLVKQASEELTD 146 Query: 202 ML-------DIIKSIPDVNNVVRSGLVT-------------------------FS----- 224 +L D + +R G ++ Sbjct: 147 LLIPDGTTPDTKVGLVPFRGKIRLGEAVDGYAEGCVNADGSLNTGINEEFMDEYNALPYY 206 Query: 225 ----------SKIVQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKE 271 S I PL+ I I T + G+++ N + Sbjct: 207 YKRYITLDTCSDIPTVLPLSKNKSTIIAAIGSQTATGAASGTVISEGIKWGRNILTPDAP 266 Query: 272 KLEHIAKGHDDYKKYIIFLTDGEN---------------------------------SSP 298 + +K +D++K +I LTDG+ + Sbjct: 267 FTQAGSK--EDFRKIMIVLTDGDTEDGECGGTYRATYRPNNYWTNAYYGMGVDTAHCNDG 324 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCASP-----DRFYSVQNSRKL 351 + N + L AK G +++I + +K AS D ++ + + Sbjct: 325 GVLNADMLSEAQLAKDAGIEIFSIRFGSSDTTDINLMKEIASSKAGTDDHYFDAPSVYDI 384 Query: 352 HDAFLRIGKEMV 363 D F +IGK++ Sbjct: 385 PDIFKQIGKQLG 396 >gi|330504126|ref|YP_004380995.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328918412|gb|AEB59243.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 334 Score = 102 bits (254), Expect = 9e-20, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 79/229 (34%), Gaps = 49/229 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDII 206 P + + + + G D+++ +DVS SM+ + +L + R + + ++ Sbjct: 75 RPQWVGEPLPLPA---SGRDLLLAVDVSGSMDYADMQWDDEPISRLELVKRLLGDFIE-- 129 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAY 263 R GL+ F S+ PL + ++ ++ I G T + A Sbjct: 130 -----GRRGDRVGLILFGSQAYLQAPLTFDRHTVRTWLDEALIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ ID + A G +Y IG Sbjct: 185 KRLRQRPAQ-----------SRVLVLITDGANNGGEIDP---MVAAQLAADEGVRIYTIG 230 Query: 324 VQAEAAD----------------QFLKNCA--SPDRFYSVQNSRKLHDA 354 + A+ L+ + + ++ +N +L Sbjct: 231 IGADPQQSGAFGSFGFSALDLDETSLRAISDTTGGEYFRARNQAELEQI 279 >gi|254525166|ref|ZP_05137221.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] gi|219722757|gb|EED41282.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] Length = 334 Score = 102 bits (254), Expect = 9e-20, Method: Composition-based stats. Identities = 43/245 (17%), Positives = 90/245 (36%), Gaps = 45/245 (18%) Query: 138 IFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDK 191 + WCA + P + ++ + G MM+ +DVS SM + G +D+ Sbjct: 69 LLLWLGWCALCVALARPQQLGEAITPPQQ---GRQMMLAMDVSGSMGEGDMVLGGQAVDR 125 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L A + + LD R GL+ F + PL + +++++ Sbjct: 126 LTAAKAVLADFLD-------RRAGDRIGLLIFGDRAYTLTPLTADLASVRDQLR------ 172 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 S GL I DA + + ++ +I LTDG +++ ++ L Sbjct: 173 --DSVVGLAGRETAIGDAIGLAVKRLRSQPEGQRVLILLTDGVSNAGVLEP---LRAAEV 227 Query: 312 AKRRGAIVYAIGVQAEAADQF----------------LKNCA--SPDRFYSVQNSRKLHD 353 A+ G ++ + + + + LK A + +F+ +++ +L Sbjct: 228 ARAEGVRIHTVAFGGDGSMRVFGISISADQDPVDEATLKKIAGMTGGQFFRARDTAQLAG 287 Query: 354 AFLRI 358 + + Sbjct: 288 IYAEL 292 >gi|87121300|ref|ZP_01077190.1| batB protein, putative [Marinomonas sp. MED121] gi|86163457|gb|EAQ64732.1| batB protein, putative [Marinomonas sp. MED121] Length = 333 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 38/249 (15%), Positives = 95/249 (38%), Gaps = 51/249 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F++ ++ + P+ + ++ G D+++ LD+S SM + + Sbjct: 62 FLWISWLLLVVAIARPVWLGEPKSVTP---SGRDLLIALDLSGSMQ------TADMKINQ 112 Query: 197 RSIREMLDIIKSIPDV----NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---F 249 ++ LD K + + R G++ F SK PL++ + I + +N Sbjct: 113 QAA-NRLDAAKQVLNRFITERQGDRIGIIVFGSKAYLQAPLSYDLDTIAQLVNETQIGFA 171 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T + ++ + K+ +I +TDG N++ + ++ Sbjct: 172 GENTAIGDAIGLGIKRLANIDAD-----------KRVMILMTDGANTAGRVKPDQA---A 217 Query: 310 NEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSR 349 A ++G ++ IG+ AE ++ L+ A + +++ ++++ Sbjct: 218 QFAAKQGVKIHTIGIGAEQMVSQGFFGPRVINPSTDLDEELLQKVADLTQGQYFRAKSTQ 277 Query: 350 KLHDAFLRI 358 +L + + Sbjct: 278 ELASIYATL 286 >gi|118591415|ref|ZP_01548813.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] gi|118436087|gb|EAV42730.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] Length = 474 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 69/464 (14%), Positives = 140/464 (30%), Gaps = 112/464 (24%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 ++ F + K SI + +++ +I ++ G+ I+ S + KL + +D + L A + Sbjct: 13 LKGFTGDRKASILPVFGLMVVLIVVIAGITIDVSRTVNAREKLSFAIDAAALSVAADLST 72 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNE-LRENGFAQDINNIERSTSLSIIIDD---- 120 + K + N+ +F +E ++ F D N S +D+ Sbjct: 73 SVMSDEQI--KAALADSFKANLADVEFLDEAIKNLSFVVDAENGTIKVSSFATLDNYFID 130 Query: 121 --------------QHKDYNLSAVSRYEMPFIFC----------TFPWCANSSHAPLLIT 156 + SR+++ S ++ Sbjct: 131 MGGYGMQALGPETFNFGTSSQVTYSRFDVELALVVDVTGSMRNDMDTLRDASKGLVNILI 190 Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG----------VATRSIREMLD-- 204 + S + + + V S + G K+ V ++ D Sbjct: 191 PETTEEADSKVRISL-----VPYSQGVNLGTYAAKVKGGVYGYADSSVCVTERQDYDDGE 245 Query: 205 --------------IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 +K+ P +V G S + PL + + I L Sbjct: 246 DIYKVRYTDMPYNYYVKTDPPPKDVFYGGGSNRCSGTSKMIPLTADRDTLLDAIADLDDN 305 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK-----KYIIFLTDGENS--------- 296 T G+ + +N I + +A + Y K+ I +TDG+N+ Sbjct: 306 GGTAGQTGVVWGWNSISPNYSDVWPLASKPEPYDNDDVLKFAIIMTDGDNNRFYEFVKER 365 Query: 297 ---------------------------------SPNIDNKESLFYCNEAKRRGAIVYAIG 323 N +K C K G ++ + Sbjct: 366 EECDWVYSRRYGWQWTCEMVSVNQWQERSESESYNNNSSKAQRALCQAMKDEGISIFGVY 425 Query: 324 VQA---EAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A + +++CAS +Y +S +L +AF I K++ + Sbjct: 426 FGTNDSSAGSKNMQSCASTGNYYKATSSDELINAFANIAKKIQQ 469 >gi|254448210|ref|ZP_05061672.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] gi|198262077|gb|EDY86360.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] Length = 336 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 46/236 (19%), Positives = 81/236 (34%), Gaps = 39/236 (16%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREM 202 ++ PL + V + + G +++ LD+S SM D G ++ V + Sbjct: 67 VAAMRPLWVGEPVAMPRE---GRALVVALDISGSMEEQDMDDNGQRRSRIAVTKDVAMDF 123 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + R LV F + PL + + + I G+ Sbjct: 124 VKQ-------REGDRIALVLFGTHPYLQTPLTFDHPTVMQHIYEAQLTMADDLQRGIHA- 175 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I DA + D K +I LTDG +++ + L A R G +Y I Sbjct: 176 -TAIGDAIGLAVKRLRDIDAPDKTLILLTDGSDNASQVAP---LKAAQIAAREGLKIYTI 231 Query: 323 GVQAEAAD------------------QFLKNC--ASPDRFYSVQNSRKLHDAFLRI 358 G+ AE + LK+ A+ R++ +N +L + + I Sbjct: 232 GLGAEQRQASLLGFDFGFGKNREIDEKTLKDIAKATDGRYFRARNPEELREIYQHI 287 >gi|85716351|ref|ZP_01047324.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] gi|85696867|gb|EAQ34752.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] Length = 542 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 65/153 (42%), Gaps = 17/153 (11%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI-FDAKEKLEHIAKGHDDYKKYIIF 289 ++ ++ +I+ + T + GL + + + + Y+ YI+ Sbjct: 389 TAMSSQWSTLKNQIDAMTPSGNTNQSIGLAWGWQSLSTTNGPIAAPGKESGYVYQDYIVL 448 Query: 290 LTDGENSS------------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---QFLK 334 L+DG N+ P ID +++L C + K G ++ I V + D Q L+ Sbjct: 449 LSDGLNTQNRWYSCPPSGPCPTIDARQALL-CQKVKDSGVTIFTIQVNVGSKDPLSQVLQ 507 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 NCAS F + ++ + DAF I ++ + R+ Sbjct: 508 NCASDGNFQMITSATETADAFQNILTQISQLRL 540 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 62/204 (30%), Gaps = 27/204 (13%) Query: 29 FIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIW 88 +G ++ + ++ + LD ++L + + I N Sbjct: 1 MGFVGAAVDYTRANAARSSMQAALDSAVLMVSKDAAANPTMTSQ----------QITNAV 50 Query: 89 QTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS 148 Q D + + S + + A + + F Sbjct: 51 QR------YFTSLYNDKSAFGVTVSATYTPSSSSAAAKILASGQGAIQTDFMKIAG---- 100 Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 P L + S+ + + + +VLD + SM+ + K+ R+ ++M+D + + Sbjct: 101 --FPQLSFGTSSTSTWGNSRMRVALVLDNTGSMSSN-----GKMSALQRAAKDMIDSLSA 153 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFP 232 V ++ FS + Sbjct: 154 FAKKTGDVYISIIPFSKDVNVDTS 177 >gi|254496635|ref|ZP_05109500.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254354157|gb|EET12827.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 342 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 53/269 (19%), Positives = 91/269 (33%), Gaps = 48/269 (17%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + I +Q K + A S +P + A + P + I + G ++MM L Sbjct: 43 VGIANQEKQSFV-AQSSLLIPALVWLLLVFAMAG--PRWLGEPKPIERE---GYNIMMAL 96 Query: 176 DVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 D+S SM +L V + + + + GL+ F S+ Sbjct: 97 DLSGSMEIPDMILHDRPASRLTVVKNAAEQFVRD-------RLGDKIGLILFGSRAYLQT 149 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 PL + Q + +I +T GL I DA + II LT Sbjct: 150 PLTYDRQTVLLRI--------EDATVGLAGKTTSIGDAVGLAVKRLDAVPQKGRVIILLT 201 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE------AADQFLKNCA-------- 337 DG N+S ++ L AK G +Y IG+ A ++ A Sbjct: 202 DGANNSGILEP---LKAAELAKDEGIKIYTIGLGAATDPRALTNGFLMQAAAADLDEETL 258 Query: 338 ------SPDRFYSVQNSRKLHDAFLRIGK 360 + R++ ++ L+ + I + Sbjct: 259 KEMSAMTGGRYFRATDTATLNSIYKTINQ 287 >gi|254443725|ref|ZP_05057201.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258033|gb|EDY82341.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 339 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 48/259 (18%), Positives = 91/259 (35%), Gaps = 40/259 (15%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + V ++ + + SKS G D+++ +D+S SM Sbjct: 46 NIVGGTKLATTLAFLSAVFIIIALARPQAVTTERHSKSR-GYDIVLAVDLSRSMEAEDY- 103 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +D+ + + + + + R GL+ F+ + PL + + + + RL Sbjct: 104 FVDRKRSNRLQAVKPV--LSAFINRRENDRIGLIAFAGRAYTVAPLTFDHKWLARQTERL 161 Query: 248 IFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 G T L A +++ + ++ +G +I+ LTDGEN++ +D E Sbjct: 162 QIGLIEDGTAIGDSLAVATSRLLEGAKERAGEREGA-----FIVLLTDGENTAGMMDPME 216 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEA----------------ADQFLK-------NCA--SP 339 AK G VY I +FL+ A + Sbjct: 217 G---ATLAKDAGIRVYTIAAGKNGYVPFPRRNERGERIGTTQEFLRVDTETLMKIANETN 273 Query: 340 DRFYSVQNSRKLHDAFLRI 358 F+ +NS + AF +I Sbjct: 274 GEFFRAENSDTIDQAFEKI 292 >gi|285019106|ref|YP_003376817.1| von willebrand factor, type a protein [Xanthomonas albilineans GPE PC73] gi|283474324|emb|CBA16825.1| putative von willebrand factor, type a protein [Xanthomonas albilineans] Length = 343 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 46/242 (19%), Positives = 88/242 (36%), Gaps = 47/242 (19%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDKLGVATR 197 F CA + PL + + ++ +M+ +D+S SM+D G +D+L A Sbjct: 82 FCLCAAA-ARPLQLGEPIAPPQQAR---QLMLAVDLSGSMSDPDMRLGGRVVDRLTAAKA 137 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 + + LD + R GL+ F + PL + +++++ S Sbjct: 138 VLADFLD-------RRDGDRIGLLVFGQQAYALTPLTADLATVRDQLR--------DSVV 182 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GL + DA + ++ +I LTDG N++ +D L AK Sbjct: 183 GLAGRETALGDAIALAVKRLREQPQGERVLILLTDGVNTAGVLDP---LKAAELAKAEHV 239 Query: 318 IVYAIGVQAEAAD-------------------QFLKNCA--SPDRFYSVQNSRKLHDAFL 356 VY I + + L+ A + RF+ +++ +L + Sbjct: 240 RVYTIALGGDGGGMSLFGMPIPGSGGDDEVDEDTLRKIAQDTGGRFFRARDTAQLASIYA 299 Query: 357 RI 358 + Sbjct: 300 EL 301 >gi|328676285|gb|AEB27155.1| BatA in aerotolerance operon [Francisella cf. novicida Fx1] Length = 333 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 47/254 (18%), Positives = 88/254 (34%), Gaps = 44/254 (17%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 + Y + + S + V + G D++M +D+S SM + Sbjct: 57 ANYLKYLLGVIWILLIISGSGIQWLGKPVSLP---QSGRDLIMAIDLSGSMA------IQ 107 Query: 191 KLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + A + D++ D R GL+ F ++ PL + + +++ ++ Sbjct: 108 DMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD- 166 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 ++ L I DA K + K +I LTDGEN+S + + L Sbjct: 167 -------DASIALPGPQTAIGDAIGLAVKKLKKYPGDSKALILLTDGENNSGTL---QPL 216 Query: 307 FYCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQ 346 AK+ +Y IG+ + L+ A + +++ Q Sbjct: 217 QAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQ 276 Query: 347 NSRKLHDAFLRIGK 360 NS L + I K Sbjct: 277 NSSDLKKVYESIDK 290 >gi|167946540|ref|ZP_02533614.1| BatB protein, putative [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 345 Score = 101 bits (250), Expect = 2e-19, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 77/227 (33%), Gaps = 51/227 (22%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + G D+M+ +D S SM G + +L V + + +D R Sbjct: 89 TENRTAGYDLMLAVDTSRSMTAEDFTVHGREVSRLSVLKGIMGKFVD-------GRVGDR 141 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLE 274 GL+ F PL + I + ++ ++ G T G+ K+ + E Sbjct: 142 IGLIIFGDTSYVLSPLTFDRNAIHQLLDGIVPTLAGGGTAIGDGIGLGIKKLRERPEG-- 199 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---- 330 + +I +TDG+N + I L AK+ G +Y IGV + Sbjct: 200 ---------SRVLILVTDGKNETGTIPP---LKAAQLAKQEGIRIYTIGVGSTKNRVRLL 247 Query: 331 -----------------QFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + L+ A + ++ ++ L + RI Sbjct: 248 SPDLRTYEIATGLAIDEETLQQIAETTGGAYFRANDTAGLEKVYQRI 294 >gi|182414212|ref|YP_001819278.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841426|gb|ACB75678.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 377 Score = 101 bits (250), Expect = 2e-19, Method: Composition-based stats. Identities = 49/259 (18%), Positives = 99/259 (38%), Gaps = 44/259 (16%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGP 187 +VSR+ + + P + + S+ G D+M+ +D+S SM ++ + Sbjct: 74 SVSRWPIGLAVLGVILIIAALARPQKVEDKRDVHSQ---GYDLMLCIDLSGSMLSEDYER 130 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 G D++ + +I++ + R G+V FS + PL + + + ++ R+ Sbjct: 131 GGDRINRLQA----IKPVIQAFIERRPSDRIGIVLFSGRAYTMAPLTFDHRWLGSQLERI 186 Query: 248 IFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 G T GL ++ A+ + +G +++ LTDG N+ ++ ++ Sbjct: 187 KVGLIEDGTAIGDGLGVGLTRLEQAQRESGGKRQGA-----FVVLLTDGANNRGSLTPQQ 241 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQF-----------------------LKNCA--SP 339 + AK RG VY IG + F L++ A + Sbjct: 242 A---AELAKARGIPVYTIGAGQDGIVPFPVFDDKGRKLGYRRIMSDLDEGALRDIAEMTG 298 Query: 340 DRFYSVQNSRKLHDAFLRI 358 F+ + + AF I Sbjct: 299 GHFFRAADVGTVESAFRAI 317 >gi|254501086|ref|ZP_05113237.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] gi|222437157|gb|EEE43836.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] Length = 465 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 69/473 (14%), Positives = 129/473 (27%), Gaps = 132/473 (27%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 I F N GSI + A ++ V+ ++ G I+ S + KL Y +D + L AT + Sbjct: 6 ISKFNRNQDGSILPIFAGMVLVLVVIGGAAIDISRAVNAREKLAYAIDAAALSVATDLST 65 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 IK + FR L + F + ++ D Sbjct: 66 -----------TVLRDNQIKTRIENSFRANLSDAEFLDQAID-------NLDFDVDSNAG 107 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSL----- 179 ++ S + F P P + S +++ +V+DV+ Sbjct: 108 TVTVSSSAGLNNYFLNIPGFGKDGLGPDVFNFGTSAEVNYSRFDVELALVVDVTGSMAGD 167 Query: 180 ----------------------------------SMNDHFGPGMDKLGVATRSIREMLDI 205 S + G + + S R ++ Sbjct: 168 MGALRDAAEEVVDILIEDDASNSASKVRISLVPYSQGVNLGSYASTVTNGSTSWRNCVNE 227 Query: 206 IKSIPDVNNVVRS----------GLVTFSSKIVQTF----------------PLAWGVQH 239 + + V + GL ++ + PL Sbjct: 228 REGQQKYTDAVYNYDGTNSEYFHGLQSYFIWDYGSSENWSSARDDCPSSSLQPLTSDKNT 287 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKI---------FDAKEKLEHIAKGHDDYKKYIIFL 290 + I L G T G+ + + + D+ + DD KK+ + + Sbjct: 288 LISDIRNLSSGGGTGGQTGVAWGWYTLSPNWTSLWPTDSDPEPYGNGTPDDDVKKFALIM 347 Query: 291 TDGE-----------------------------------NSSPNIDNKESLFYCNEAKRR 315 TDG+ + + + C+ K Sbjct: 348 TDGDFNAQYGKEERTTCTGRGRNRVCTTNEYWVERYHRYSDYNDPPATRARTLCDAMKAE 407 Query: 316 GAIVYAIGVQAEA---ADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ + D + CAS D +Y N +L AF I K + + Sbjct: 408 NIEIFTVFFDTGGSAFGDDLMSYCASGSDYYYEADNKDELITAFSNIAKRIQQ 460 >gi|83647467|ref|YP_435902.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83635510|gb|ABC31477.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 345 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 39/248 (15%), Positives = 87/248 (35%), Gaps = 45/248 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKL 192 + + P + + ++ D+++ +D+S SM + G +L Sbjct: 65 LQVLAWTLLVVAVCRPQWQGEPIPMDYEAR---DLLLAVDISPSMQETDLQLKGNQATRL 121 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 V + + + V R GL+ F ++ PL + + + E +N G Sbjct: 122 DVVKSVVTDFIQ-------VRQGDRLGLILFGAQPYIQAPLTYDLVTVGELLNEATLGIA 174 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +T I DA + + ++ LTDG N+ + +++ A Sbjct: 175 GNAT--------AIGDAIGLGIKRLRERPADSRVLVLLTDGANTGGEVSPEQA---AKLA 223 Query: 313 KRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQNSRKLH 352 G +Y +GV A+ + L++ A + +++ +N+ +L Sbjct: 224 ADAGIKIYTVGVGADEIIRRGIFGYRKENPSADLDETLLQSIADETDGQYFRARNTGELE 283 Query: 353 DAFLRIGK 360 + I + Sbjct: 284 LIYESINQ 291 >gi|254372185|ref|ZP_04987677.1| conserved hypothetical protein [Francisella tularensis subsp. novicida GA99-3549] gi|151569915|gb|EDN35569.1| conserved hypothetical protein [Francisella novicida GA99-3549] Length = 339 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 84/241 (34%), Gaps = 44/241 (18%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 S + V + G D++M +D+S SM + + A + Sbjct: 76 LLIISGSGIQWLGKPVSLP---QSGRDLIMAIDLSGSMA------IQDMKKANGQMESRF 126 Query: 204 DII----KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 D++ D R GL+ F ++ PL + + +++ ++ ++ L Sbjct: 127 DLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD--------DASIAL 178 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 I DA K + K +I LTDGEN+S + + L AK+ + Sbjct: 179 PGPQTAIGDAIGLAVKKLKKYPGDSKALILLTDGENNSGTL---QPLQAAEIAKQYHIKI 235 Query: 320 YAIGVQA------------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 Y IG+ + L+ A + +++ QNS L + I Sbjct: 236 YTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESID 295 Query: 360 K 360 K Sbjct: 296 K 296 >gi|257469959|ref|ZP_05634051.1| hypothetical protein FulcA4_11506 [Fusobacterium ulcerans ATCC 49185] gi|317064188|ref|ZP_07928673.1| BatA protein [Fusobacterium ulcerans ATCC 49185] gi|313689864|gb|EFS26699.1| BatA protein [Fusobacterium ulcerans ATCC 49185] Length = 319 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 53/235 (22%), Positives = 90/235 (38%), Gaps = 51/235 (21%) Query: 165 SDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+D+ + LD+S SM + F P ++L A + E +D R L+ F Sbjct: 78 KKDGIDIAISLDLSQSMLQEDFTP--NRLEKAKEVLSEFID-------KRTDDRLALIVF 128 Query: 224 SSKIVQTFPLAWGVQHIQE-----KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 PL + I+E ++ + + T G+ A N++ K Sbjct: 129 GGDAYTKVPLTFDHNVIKEMTGKLTVDDITSNTRTAIGMGIGVALNRL-----------K 177 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------------ 326 + K II LTDGEN+S + + + AK G +Y IG+ A Sbjct: 178 DSEAKSKVIILLTDGENNSGEMSPSAA---ADIAKELGIKIYTIGIGAKEIKVPSFFGYT 234 Query: 327 -----EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ---RILYNK 371 E + LK+ A + ++ +S++ + F +I Q R Y+K Sbjct: 235 TVKNTELDENMLKSIAETTGGEYFRASDSKEFKEIFNKIDALEKTQIDGRSFYDK 289 >gi|149376601|ref|ZP_01894361.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] gi|149359119|gb|EDM47583.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] Length = 340 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 89/238 (37%), Gaps = 51/238 (21%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDI 205 P + V++ G D+++ +D+S SM++ G +++L R + + + Sbjct: 73 ARPQHVGEQVQLPV---SGRDLLLAVDISPSMDEQDMVIQGRSINRLQAVKRVLDDFIQ- 128 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYA 262 R GL+ F ++ PL + + ++ ++ + G T L A Sbjct: 129 ------RREGDRLGLLLFGTEPYIQAPLTFDLATVRTLLHEAGIGMAGRATAIGDALGLA 182 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ D ++ ++ +I LTDG N++ I ++ AK G +Y I Sbjct: 183 VKRLRDRPQE-----------QRVVILLTDGANTAGEIAPDKA---AEIAKAAGVRIYTI 228 Query: 323 GVQAEAADQ------------------FLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 G+ AE Q L A + ++ ++ +L + I + Sbjct: 229 GIGAETMVQRGLLGSRRVNPSRDLDEGLLTRIAQQTGGEYFRARSLPELELIYESINQ 286 >gi|327542237|gb|EGF28726.1| BatA aerotolerance operon protein [Rhodopirellula baltica WH47] Length = 345 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 85/258 (32%), Gaps = 46/258 (17%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKL 192 F + P ++ G+ + MV+D S SM + G +D+L Sbjct: 45 LTLIAFALLVVALARPREGREQTVSQTE---GIAIEMVIDRSGSMQAMDFNIDGEPVDRL 101 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS- 251 + + + + + GL+TF++ P + ++N+ S Sbjct: 102 TAVKNVASKFITGGEDL-EGRFSDLVGLITFAAYADAETPPTLDHSFVVSRLNQTEIVSR 160 Query: 252 ----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T + + K+ + E + K +I LTDGEN++ +D + Sbjct: 161 RDEDGTAIGDAIALSVEKLNALDARQERKVQ-----SKILILLTDGENTAGELDP---IQ 212 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF-----------------------LKNCA--SPDRF 342 A+ G +YAIGV + L+ A + ++ Sbjct: 213 AAELAETLGIKIYAIGVGTKGKAPVPVRDPFTGRQRLHYMEVNIDEATLQKVAEITGGKY 272 Query: 343 YSVQNSRKLHDAFLRIGK 360 + ++ L + I + Sbjct: 273 FRATDTDSLDAIYREIDQ 290 >gi|327403932|ref|YP_004344770.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319440|gb|AEA43932.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 341 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 55/273 (20%), Positives = 93/273 (34%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSA--VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 + + Y++ R+ + + C A S G+D+++ Sbjct: 48 NSEKEQLAYSVKWVRYIRWGITSFYAFSMACLVFVFAEPYNNSIDPPKIDYKNGIDIILS 107 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +D S SM F P ++L VA R ++ +D R GLV + + P Sbjct: 108 IDASGSMLAQDFDP--NRLEVAKRVAKKFVDS-------RKGDRVGLVVYEGEAYTACPA 158 Query: 234 AWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + ++E+I+ + G T GL A ++ K II L Sbjct: 159 TLDYKLLKEQISAIEPGHLEPGTAIGSGLGVAVTRLRSD-----------SLISKVIILL 207 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------------------- 331 TDG +++ E L AK + VY IGV A+ Sbjct: 208 TDGSSNTG----PEPLEVAELAKAKKCRVYTIGVGADGMAPTPVNTPFGVVYQNLPVEID 263 Query: 332 --FLKNCASP--DRFYSVQNSRKLHDAFLRIGK 360 LK AS +++ Q+ + L + I K Sbjct: 264 EGVLKEIASATNGKYFRAQDEKSLEKIYAEIDK 296 >gi|88704964|ref|ZP_01102676.1| conserved hypothetical protein [Congregibacter litoralis KT71] gi|88700659|gb|EAQ97766.1| conserved hypothetical protein [Congregibacter litoralis KT71] Length = 344 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 38/257 (14%), Positives = 97/257 (37%), Gaps = 49/257 (19%) Query: 129 AVSRYEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + +P + W + ++ PL + ++++ + G D+M+ +D+S SM Sbjct: 45 VSTARRVPAVALWVIWLSLLLAAARPLWVGDAIELP---NSGRDLMLAVDISGSMRVEDM 101 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 +++ +++++ S + R GL+ F S+ PL++ +Q +Q + Sbjct: 102 QVGNRMARRIDAVKQLGSDFMS---RRSGDRLGLILFGSRAYLQSPLSFDIQTVQRFLLE 158 Query: 247 LI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 G T + A ++ + + ++ LTDG++++ +D Sbjct: 159 AQIGFAGQETAIGDAIGLAVKRLQERPAS-----------SRVLVLLTDGQDTASTVDP- 206 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEA-----------------------ADQFLKNC-ASP 339 L N A G +Y IG+ A++ ++ ++ Sbjct: 207 --LEAANLAADLGVRIYTIGIGADSLTLPGLLGSPLGARTVNPSADLDESTLIEIARSTG 264 Query: 340 DRFYSVQNSRKLHDAFL 356 +++ ++ +L + Sbjct: 265 GQYFRARDPEELATVYR 281 >gi|328675375|gb|AEB28050.1| BatA in aerotolerance operon [Francisella cf. novicida 3523] Length = 333 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. Identities = 45/254 (17%), Positives = 88/254 (34%), Gaps = 44/254 (17%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 + Y + + S + + + G D++M +D+S SM + Sbjct: 57 ANYLKYLLGVIWILLIISGSGIQWLGKPISLP---QSGRDLIMAIDLSGSMA------IQ 107 Query: 191 KLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + + + D++ D R GL+ F ++ PL + + +++ ++ Sbjct: 108 DMKKSNGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD- 166 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 ++ L I DA K + K +I LTDGEN+S + + L Sbjct: 167 -------DASIALPGPQTAIGDAIGLAVKKLKKYPGDSKALILLTDGENNSGTL---QPL 216 Query: 307 FYCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQ 346 AK+ +Y IG+ + L+ A + +++ Q Sbjct: 217 QAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLINTSEDLDTTVLEKIAEMTGGKYFRAQ 276 Query: 347 NSRKLHDAFLRIGK 360 NS L + I K Sbjct: 277 NSSDLKKVYESIDK 290 >gi|254368552|ref|ZP_04984568.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] gi|157121455|gb|EDO65646.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] Length = 339 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. Identities = 45/232 (19%), Positives = 83/232 (35%), Gaps = 44/232 (18%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KS 208 + V + G D++M +D+S SM + + A + D++ Sbjct: 85 QWLGKPVSLP---QSGRDLIMAIDLSGSMA------IQDMKKANGQMESRFDLVMRVANQ 135 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 D R GL+ F ++ PL + + +++ ++ ++ L I D Sbjct: 136 FIDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD--------DASIALPGPQTAIGD 187 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 A K + K +I LTDGEN+S + + L AK+ +Y IG+ + Sbjct: 188 AIGLAVKKLKKYPGDSKALILLTDGENNSGTL---QPLQAAEIAKQYHIKIYTIGLGGDQ 244 Query: 329 ------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L+ A + +++ QNS L + I K Sbjct: 245 MIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|225028486|ref|ZP_03717678.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] gi|224954191|gb|EEG35400.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] Length = 538 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. Identities = 47/218 (21%), Positives = 89/218 (40%), Gaps = 32/218 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K++SK D+++ LD+S SM+ G +D+ ++ + +D I +N G Sbjct: 219 KVTSKKR---DIVLTLDISASMD---GIPLDE---TKKAAAKFVDSI-----LNKNSNIG 264 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 LV++S + + ++ I L T GL AY+ + + Sbjct: 265 LVSYSDEATSLSGICSNDVFLKNTITSLSSAENTNIEDGLSRAYSML-----------QL 313 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------EAADQFL 333 KK I+ ++DG + D +E + Y + K +G ++Y +G + Sbjct: 314 GQSKKKLIVLMSDGLPTLGK-DGEELIKYAEKIKDQGVLIYTLGFFQNTEEYKAEGQYLM 372 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + AS Y V +S L F + ++ Q+ +Y K Sbjct: 373 EKIASEGCHYEVSSSEDLVFFFEDVAGQIGGQKYIYVK 410 >gi|226226933|ref|YP_002761039.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090124|dbj|BAH38569.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 326 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 75/215 (34%), Gaps = 42/215 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+ + +D+S SM +++ VA D +K R GLV FS + Sbjct: 85 GIDIALTVDISSSMLAEDFQPQNRMEVAK-------DKVKRFVMGRKSDRVGLVAFSGEA 137 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL + I+ L G T I A + + + + Sbjct: 138 LTQVPLTTDYPVVLAAIDNLQVGQLEDGT--------AIGTAIATAANRLRNSPGRSRVM 189 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------------------- 328 + LTDGEN+ ID + + A G +Y IGV + Sbjct: 190 VLLTDGENNRGAIDPRTA---AQAAGTFGIRIYTIGVGTDGMAAVPVGRGLFGLRYENRP 246 Query: 329 ---ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + L A + R++ +++ L + +I Sbjct: 247 VKIDEALLTEIANSTGGRYFRAKDAAALQSIYEQI 281 >gi|118496821|ref|YP_897871.1| von Willebrand factor type A domain-containing protein [Francisella tularensis subsp. novicida U112] gi|118422727|gb|ABK89117.1| von Willebrand factor type A domain protein [Francisella novicida U112] Length = 333 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. Identities = 46/225 (20%), Positives = 83/225 (36%), Gaps = 41/225 (18%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KSIPDVNNV 215 K S + G D++M +D+S SM + + A + D++ D Sbjct: 83 KPVSLAQSGRDLIMAIDLSGSMA------IQDMKKANGQMESRFDLVMRVANQFLDTRKG 136 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 R GL+ F ++ PL + + +++ ++ ++ L I DA Sbjct: 137 DRVGLILFGTRAYLQTPLTFDIATVKKMLD--------DASIALPGPQTAIGDAIGLAVK 188 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--------- 326 K + K +I LTDGEN+S + + L AK+ +Y IG+ Sbjct: 189 KLKKYPGDSKALILLTDGENNSGTL---QPLQAAEIAKQYHIKIYTIGLGGGQMIVETTF 245 Query: 327 ---------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L+ A + +++ QNS L + I K Sbjct: 246 GQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|149371021|ref|ZP_01890616.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] gi|149355807|gb|EDM44365.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] Length = 334 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. Identities = 52/244 (21%), Positives = 85/244 (34%), Gaps = 53/244 (21%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + P + S ++ S G+D++M +DVS SM D+L + Sbjct: 71 IVALARPRFVDESTRVKSTK--GIDIVMAIDVSASML-ARDLKPDRLQALK-------QV 120 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEY 261 + R GLV ++ + PL + +N + + T GL Sbjct: 121 AARFINGRPNDRIGLVEYAGESYTKTPLTSDKTVVLSSLNSIEYNSIIEGGTAIGMGLAT 180 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A N++ K K II LTDGEN+S ID K + A G VY Sbjct: 181 AVNRL-----------KESTAKSKVIILLTDGENNSGFIDPKIASEL---AVEFGIKVYT 226 Query: 322 IGVQAEA-----------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 IG+ + LK A + +++ ++ KL++ + Sbjct: 227 IGLGTNGMASSPIGILPNGRFQYGNQPVKIDETLLKEIAKTTGGQYFRATSNTKLNEIYE 286 Query: 357 RIGK 360 I K Sbjct: 287 EINK 290 >gi|254373668|ref|ZP_04989152.1| conserved hypothetical protein [Francisella novicida GA99-3548] gi|151571390|gb|EDN37044.1| conserved hypothetical protein [Francisella novicida GA99-3548] Length = 339 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. Identities = 47/254 (18%), Positives = 87/254 (34%), Gaps = 44/254 (17%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 + Y + + S + V + G D++M +D+S SM + Sbjct: 63 ANYLKYLLGVIWILLIISGSGIQWLGKPVSLP---QSGRDLIMAIDLSGSMA------IQ 113 Query: 191 KLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + A + D++ D R GL+ F ++ PL + + +++ ++ Sbjct: 114 DMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD- 172 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 ++ L I DA K K +I LTDGEN+S + + L Sbjct: 173 -------DASIALPGPQTAIGDAIGLAVKKLKKFPGDSKALILLTDGENNSGTL---QPL 222 Query: 307 FYCNEAKRRGAIVYAIGVQA------------------EAADQFLKNCA--SPDRFYSVQ 346 AK+ +Y IG+ + L+ A + +++ Q Sbjct: 223 QAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQ 282 Query: 347 NSRKLHDAFLRIGK 360 NS L + I K Sbjct: 283 NSSDLKKVYESIDK 296 >gi|149176865|ref|ZP_01855475.1| BatA [Planctomyces maris DSM 8797] gi|148844302|gb|EDL58655.1| BatA [Planctomyces maris DSM 8797] Length = 356 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 43/256 (16%), Positives = 83/256 (32%), Gaps = 46/256 (17%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDKL 192 + P +S+ G+ + MV+D S SM G +D+L Sbjct: 56 LTLGAILFMILGLARPREGREQQVTTSE---GIAIEMVVDRSGSMQAMDFKIDGEHVDRL 112 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---- 248 + ++ + + N + GL+TF+ P ++ ++N + Sbjct: 113 TAIKNVAGKFVEGKEELEGRFNDL-VGLMTFAGYADGITPPTLDHPYLVSQLNNIQIVTN 171 Query: 249 -FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T + A K+ + + K K II LTDGEN++ ++ + Sbjct: 172 RSEDGTAIGDAISLAVEKLNALDARRDEKVK-----SKVIILLTDGENNAGEVEP---IQ 223 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF-----------------------LKNCA--SPDRF 342 A+ G VY IGV + L+ A + ++ Sbjct: 224 AAELAETLGIKVYTIGVGTKGEAPVPVTDPFSGKQVVQWMPVNIDEATLQKVADLTHGKY 283 Query: 343 YSVQNSRKLHDAFLRI 358 + ++ L + I Sbjct: 284 FRATDTDSLEKIYHEI 299 >gi|89255637|ref|YP_512998.1| hypothetical protein FTL_0203 [Francisella tularensis subsp. holarctica LVS] gi|134302613|ref|YP_001122584.1| hypothetical protein FTW_1793 [Francisella tularensis subsp. tularensis WY96-3418] gi|156501587|ref|YP_001427652.1| hypothetical protein FTA_0219 [Francisella tularensis subsp. holarctica FTNF002-00] gi|167009921|ref|ZP_02274852.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella tularensis subsp. holarctica FSC200] gi|224456527|ref|ZP_03665000.1| hypothetical protein FtultM_01598 [Francisella tularensis subsp. tularensis MA00-2987] gi|254367031|ref|ZP_04983067.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|290953465|ref|ZP_06558086.1| hypothetical protein FtulhU_03745 [Francisella tularensis subsp. holarctica URFT1] gi|295313263|ref|ZP_06803900.1| hypothetical protein FtulhU_03730 [Francisella tularensis subsp. holarctica URFT1] gi|89143468|emb|CAJ78644.1| hypothetical membrane protein [Francisella tularensis subsp. holarctica LVS] gi|134050390|gb|ABO47461.1| conserved membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis WY96-3418] gi|134252857|gb|EBA51951.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|156252190|gb|ABU60696.1| conserved membrane protein with von Willebrand factor, type A domain [Francisella tularensis subsp. holarctica FTNF002-00] gi|282158589|gb|ADA77980.1| hypothetical protein NE061598_01650 [Francisella tularensis subsp. tularensis NE061598] Length = 333 Score = 99.9 bits (247), Expect = 5e-19, Method: Composition-based stats. Identities = 45/232 (19%), Positives = 83/232 (35%), Gaps = 44/232 (18%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KS 208 + V + G D++M +D+S SM + + A + D++ Sbjct: 79 QWLGKPVSLP---QSGRDLIMAIDLSGSMA------IQDMKKANGQMESRFDLVMRVANQ 129 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 D R GL+ F ++ PL + + +++ ++ ++ L I D Sbjct: 130 FIDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD--------DASIALPGPQTAIGD 181 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-- 326 A K + K +I LTDGEN+S + + L AK+ +Y IG+ Sbjct: 182 AIGLAVKKLKKYPGDSKALILLTDGENNSGTL---QPLQAAEIAKQYHIKIYTIGLGGGQ 238 Query: 327 ----------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L+ A + +++ QNS L + I K Sbjct: 239 MIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|188994393|ref|YP_001928645.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] gi|188594073|dbj|BAG33048.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] Length = 327 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 99/285 (34%), Gaps = 54/285 (18%) Query: 105 INNIERSTSLSIIIDDQHKDYN--LSAVSRYEMPFIFCT-FPWCANSSHAPLLITSSVKI 161 + + S +++I + L R+ +P + + + P S K Sbjct: 24 LQARKTSATMTISSLKPFEGGRRGLRVYLRHSLPILRALSVGFLIIALARPQNTNSWQKD 83 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + G+D+M+ +DVS SM ++L A ++ G+V Sbjct: 84 SIE---GIDIMLAMDVSGSMQ-AMDFKPNRLEAAKDVAISFINN-------RPNDNIGMV 132 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAK 278 TF+ + PL + + L G T GL A N++ D+K K Sbjct: 133 TFAGESFTQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKAK------ 186 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------ 332 + +I LTDG N+ +I + + A+ G VY +GV F Sbjct: 187 -----SRVVILLTDGSNNMGDITPR---MAADIARTFGIRVYTVGVGTRGEAPFPIQTEF 238 Query: 333 ---------------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A S +++ ++ L++ + I K Sbjct: 239 GVRIQNVPVDIDEPTLDGIAEVSGGKYFRAVDNETLNEIYKEIDK 283 >gi|119476361|ref|ZP_01616712.1| batB protein, putative [marine gamma proteobacterium HTCC2143] gi|119450225|gb|EAW31460.1| batB protein, putative [marine gamma proteobacterium HTCC2143] Length = 354 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 90/265 (33%), Gaps = 58/265 (21%) Query: 126 NLSAVSRYEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN- 182 + + ++ WC+ ++ P I + + S G D+++ +D+S SM Sbjct: 48 KKNVIGLAKLQTAVVITIWCSLVLAAAGPTWIGDPINLPS---SGRDLLLAVDLSGSMKI 104 Query: 183 ---DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 + G + ++ + E + R GL+ F S+ PL + Sbjct: 105 EDMEVNGDRVPRIVAVKTVLNEFIQ-------RRKGDRLGLILFGSQAYVQAPLTFDQTT 157 Query: 240 IQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 +Q + G T + + ++ D +I LTDG+N Sbjct: 158 VQRFMREAQIGFAGEENTAIGDAIGLSVKRLRDRPGDRH-----------VMILLTDGQN 206 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA----------------------ADQFL 333 + I+ + A G I+Y IGV A+ ++ L Sbjct: 207 NGGKINPIPASKI---AANNGIIIYTIGVGADEMVMPGVLGSSFGSRRVNPSADLDEKTL 263 Query: 334 KNC--ASPDRFYSVQNSRKLHDAFL 356 + A+ +++ +N ++L + Sbjct: 264 QQVATATGGQYFRARNPQELEKIYR 288 >gi|194324498|ref|ZP_03058270.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|194321333|gb|EDX18819.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 339 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 46/225 (20%), Positives = 83/225 (36%), Gaps = 41/225 (18%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KSIPDVNNV 215 K S + G D++M +D+S SM + + A + D++ D Sbjct: 89 KPVSLAQSGRDLIMAIDLSGSMA------IQDMKKANGQMESRFDLVMRVANQFLDTRKG 142 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 R GL+ F ++ PL + + +++ ++ ++ L I DA Sbjct: 143 DRVGLILFGTRAYLQTPLTFDIATVKKMLD--------DASIALPGPQTAIGDAIGLAVK 194 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--------- 326 K + K +I LTDGEN+S + + L AK+ +Y IG+ Sbjct: 195 KLKKYPGDSKALILLTDGENNSGTL---QPLQAAEIAKQYHIKIYTIGLGGGQMIVETTF 251 Query: 327 ---------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L+ A + +++ QNS L + I K Sbjct: 252 GQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|326335930|ref|ZP_08202107.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691894|gb|EGD33856.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 332 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 52/283 (18%), Positives = 92/283 (32%), Gaps = 55/283 (19%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYE-MPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 ++ ++ Q + R + FI +SS +K+ Sbjct: 30 KKEIPPVLLSSSQALTSIHTWKIRLRPILFILRLLALSCLIIALARPQSSSEITKTKTTE 89 Query: 168 GLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G+D+++ +D+S SM D ++ L E R G+V +S Sbjct: 90 GIDIILAIDMSSSMLAKDLKPNRIEALKRVASQFIEE----------RKSDRIGIVVYSG 139 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + + + G T GL A N++ D+K Sbjct: 140 ESYTKVPATTDKSIVLQSLKDIKQGEIEDGTAIGMGLGTAINRLKDSK-----------T 188 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD------------ 330 K II +TDG N++ ID L AK G VY IG+ Sbjct: 189 KSKVIILMTDGVNNTGVIDP---LSAAELAKEYGIRVYTIGIGTNGKALSPVAYNPDGSL 245 Query: 331 -----------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L A + +++ +++KL + I K Sbjct: 246 QYDMVPVEIDEKLLGEIAQSTGGKYFRATDNKKLAQIYTEIDK 288 >gi|187932172|ref|YP_001892157.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713081|gb|ACD31378.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 333 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 45/232 (19%), Positives = 83/232 (35%), Gaps = 44/232 (18%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KS 208 + V + G D++M +D+S SM + + A + D++ Sbjct: 79 QWLGKPVSLP---QSGRDLIMAIDLSGSMA------IQDMKKANGQMESRFDLVMRVANQ 129 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 D R GL+ F ++ PL + + +++ ++ ++ L I D Sbjct: 130 FIDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD--------DASIALPGPQTAIGD 181 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-- 326 A K + K +I LTDGEN+S + + L AK+ +Y IG+ Sbjct: 182 AIGLAVKKLKKYPGDSKALILLTDGENNSGTL---QPLQAAEIAKQYHIKIYTIGLGGGQ 238 Query: 327 ----------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L+ A + +++ QNS L + I K Sbjct: 239 MIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|163751139|ref|ZP_02158369.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161329095|gb|EDQ00167.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 334 Score = 99.5 bits (246), Expect = 7e-19, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 89/241 (36%), Gaps = 45/241 (18%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSI 199 + PL + ++++ SK G D+M+ +D+S SM G +D+ + Sbjct: 61 LLVIAVARPLWMGDAIELPSK---GRDLMLAVDLSGSMQIEDMVLNGQTVDRFTMIQ--- 114 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 D++ + + GL+ F+ PL + + + + G K T Sbjct: 115 ----DVVSDFIERRKGDKLGLILFADHAYLQAPLTQDRRSVAQFLQEAQIGLVGKQT--- 167 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 I +A D + ++ LTDG N+S +I +++ A +RG + Sbjct: 168 -----AIGEAIALGVKRFDMVDKSNRILVLLTDGSNNSGSISPEQAAAI---AAKRGVKI 219 Query: 320 YAIGVQAE-------------------AADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIG 359 YAIGV A+ Q + + ++ ++S+ L + I Sbjct: 220 YAIGVGADVMERRSIFGTERVNPSMDLDEAQLISLAKTTGGLYFRARSSQDLQLIYQEID 279 Query: 360 K 360 K Sbjct: 280 K 280 >gi|56707447|ref|YP_169343.1| hypothetical protein FTT_0293 [Francisella tularensis subsp. tularensis SCHU S4] gi|110669918|ref|YP_666475.1| hypothetical protein FTF0293 [Francisella tularensis subsp. tularensis FSC198] gi|115314141|ref|YP_762864.1| hypothetical protein FTH_0198 [Francisella tularensis subsp. holarctica OSU18] gi|254370860|ref|ZP_04986865.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874284|ref|ZP_05246994.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] gi|56603939|emb|CAG44926.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis SCHU S4] gi|110320251|emb|CAL08309.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis FSC198] gi|115129040|gb|ABI82227.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica OSU18] gi|151569103|gb|EDN34757.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254840283|gb|EET18719.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] Length = 339 Score = 99.5 bits (246), Expect = 7e-19, Method: Composition-based stats. Identities = 45/232 (19%), Positives = 83/232 (35%), Gaps = 44/232 (18%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KS 208 + V + G D++M +D+S SM + + A + D++ Sbjct: 85 QWLGKPVSLP---QSGRDLIMAIDLSGSMA------IQDMKKANGQMESRFDLVMRVANQ 135 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 D R GL+ F ++ PL + + +++ ++ ++ L I D Sbjct: 136 FIDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD--------DASIALPGPQTAIGD 187 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-- 326 A K + K +I LTDGEN+S + + L AK+ +Y IG+ Sbjct: 188 AIGLAVKKLKKYPGDSKALILLTDGENNSGTL---QPLQAAEIAKQYHIKIYTIGLGGGQ 244 Query: 327 ----------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L+ A + +++ QNS L + I K Sbjct: 245 MIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|208780564|ref|ZP_03247903.1| von Willebrand factor type A domain protein [Francisella novicida FTG] gi|208743539|gb|EDZ89844.1| von Willebrand factor type A domain protein [Francisella novicida FTG] Length = 333 Score = 99.5 bits (246), Expect = 7e-19, Method: Composition-based stats. Identities = 45/232 (19%), Positives = 82/232 (35%), Gaps = 44/232 (18%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KS 208 + V + G D++M +D+S SM + + A + D++ Sbjct: 79 QWLGKPVSLP---QSGRDLIMAIDLSGSMA------IQDMKKANGQMESRFDLVMRVANQ 129 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 D R GL+ F ++ PL + + +++ ++ ++ L I D Sbjct: 130 FLDTRKGDRVGLILFGTRAYLQTPLTFDIATVKKMLD--------DASIALPGPQTAIGD 181 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-- 326 A K K +I LTDGEN+S + + L AK+ +Y IG+ Sbjct: 182 AIGLAVKKLKKFPGDSKALILLTDGENNSGTL---QPLQAAEIAKQYHIKIYTIGLGGGQ 238 Query: 327 ----------------EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L+ A + +++ QNS L + I K Sbjct: 239 MIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|148975506|ref|ZP_01812377.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] gi|145964934|gb|EDK30185.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] Length = 357 Score = 99.5 bits (246), Expect = 7e-19, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 86/241 (35%), Gaps = 31/241 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + T+ + P ++ + +G D+M+V+D+S SM + G + +L Sbjct: 73 LIITWVLVVCALAKPTILGEP---QVREQLGRDVMVVVDLSGSMAEQDFTSKQGKKISRL 129 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 +++ R GL+ F P + + + + Sbjct: 130 QATK-------EVLADFAKTRKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMA 182 Query: 250 GSTTKSTPGLEYAYNKI-FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G +T + A K+ A+ + +K +I LTDG ++ ++ ++ Sbjct: 183 GQSTHLGDAIGLAIKVFEQSGKQMSAEQAQNDIEREKVVIVLTDGNDTGSFVEPIDA--- 239 Query: 309 CNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 AK +G ++ I + + +K A S + N +L A+ +IG Sbjct: 240 AKVAKAKGVRIHVIAMGDPQTVGEVALDMETIKRIASESGGEAFEALNRDELSTAYEQIG 299 Query: 360 K 360 + Sbjct: 300 Q 300 >gi|86131264|ref|ZP_01049863.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] gi|85818675|gb|EAQ39835.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] Length = 334 Score = 99.5 bits (246), Expect = 7e-19, Method: Composition-based stats. Identities = 51/283 (18%), Positives = 93/283 (32%), Gaps = 55/283 (19%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +++ ++ I K + FI V +K+ G Sbjct: 32 KQTPAVKISSIKGFKTSTSILPKLRPLLFILRLAALSLIIVALARPRNVEVSTKTKTTKG 91 Query: 169 LDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D+++ +DVS SM D ++ L S R GLV ++ + Sbjct: 92 IDIVIAIDVSASMLAKDLRPNRLEALKKVASSFIN----------GRPNDRIGLVEYAGE 141 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P+ + + + + T GL N+I D+K Sbjct: 142 SFTKTPITSDKSIVLSALKGIQYNSIIEGGTAIGMGLATGVNRIKDSK-----------A 190 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-------------- 328 K II +TDGEN++ ID + A+ G VY IG+ Sbjct: 191 LSKVIILMTDGENNAGQIDPR---IAAELAQEFGIKVYTIGMGTNGTALSPYARNPNGTF 247 Query: 329 ---------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 ++ L+ A + +++ N++KL + + I K Sbjct: 248 VYENIQVTIDEELLEEIAETTGGQYFRATNNKKLQEIYDEIDK 290 >gi|34541234|ref|NP_905713.1| batA protein [Porphyromonas gingivalis W83] gi|34397550|gb|AAQ66612.1| batA protein [Porphyromonas gingivalis W83] Length = 327 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 99/285 (34%), Gaps = 54/285 (18%) Query: 105 INNIERSTSLSIIIDDQHKDYN--LSAVSRYEMPFIFCT-FPWCANSSHAPLLITSSVKI 161 + + S +++I + L R+ +P + + + P S K Sbjct: 24 LQARKTSATMTISSLKPFEGSRRGLRVYLRHSLPILRALSVGFLIIALARPQNTNSWQKD 83 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + G+D+M+ +DVS SM ++L A ++ G+V Sbjct: 84 SIE---GIDIMLAMDVSGSMQ-AMDFKPNRLEAAKDVAISFINN-------RPNDNIGMV 132 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAK 278 TF+ + PL + + L G T GL A N++ D+K K Sbjct: 133 TFAGESFTQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKAK------ 186 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------ 332 + +I LTDG N+ +I + + A+ G VY +GV F Sbjct: 187 -----SRVVILLTDGSNNMGDITPR---MAADIARTFGIRVYTVGVGTRGEAPFPIQTEF 238 Query: 333 ---------------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A S +++ ++ L++ + I K Sbjct: 239 GVRIQNVPVDIDEPTLDGIAEVSGGKYFRAVDNETLNEIYKEIDK 283 >gi|253584083|ref|ZP_04861281.1| BatA protein [Fusobacterium varium ATCC 27725] gi|251834655|gb|EES63218.1| BatA protein [Fusobacterium varium ATCC 27725] Length = 319 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 52/235 (22%), Positives = 91/235 (38%), Gaps = 51/235 (21%) Query: 165 SDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+D+ + LD+S SM + F P ++L A +++ D R L+ F Sbjct: 78 KKDGIDIAISLDLSQSMLQEDFTP--NRLEKAK-------EVLDEFIDKRGNDRLSLIVF 128 Query: 224 SSKIVQTFPLAWGVQHIQE-----KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 PL + I+E ++ + + T G+ A N++ K Sbjct: 129 GGDAYTKVPLTFDHNVIKEMTRKLTVDDITSNTRTAIGMGIGVALNRL-----------K 177 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------------ 326 + K II LTDGEN+S + + + AK G +Y IG+ A Sbjct: 178 DSEAKSKVIILLTDGENNSGEMSPSAA---ADIAKELGIKIYTIGIGAKEIKVPSFFGYK 234 Query: 327 -----EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG---KEMVKQRILYNK 371 E + LK+ A + ++ +S++ + F +I K + R Y+K Sbjct: 235 TVKNTELDENMLKSIAETTGGEYFRASDSKEFKEIFNKIDALEKTKIDGRTFYDK 289 >gi|254513911|ref|ZP_05125972.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] gi|219676154|gb|EED32519.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] Length = 330 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 90/261 (34%), Gaps = 37/261 (14%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMM 173 +++ + S + R M + WC ++ P + V + D+M+ Sbjct: 42 VLLSGETPRSGASVLRRRRMQSVVSVIGWCLLVTAAARPEWVGEPVSV---EKSARDLML 98 Query: 174 VLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+S SM+ D G ++L A ++K + R GL+ F + Sbjct: 99 ALDLSGSMDARDFRDQNGQQQNRLSAAK-------QVLKDFAAQRDGDRLGLIVFGNAAY 151 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P I + L +S + + DA + + + + ++ Sbjct: 152 LQAPFTDD---IATWLTLLE-----ESEVAMAGPSTALGDAIGLAISLFQASETRNRVLV 203 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-------ADQFLKNCASP-- 339 LTDG ++ + ++ + A +Y + V + + L AS Sbjct: 204 VLTDGNDTGSRVPPVDA---ASIAAANDVTIYTVAVGDPSTIGEEALDLETLDAIASSTR 260 Query: 340 DRFYSVQNSRKLHDAFLRIGK 360 + ++ L DA+ +I + Sbjct: 261 GASFLALDTLALKDAYEQINR 281 >gi|330810109|ref|YP_004354571.1| hypothetical protein PSEBR_a3255 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327378217|gb|AEA69567.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 359 Score = 99.1 bits (245), Expect = 8e-19, Method: Composition-based stats. Identities = 42/228 (18%), Positives = 83/228 (36%), Gaps = 50/228 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + I++ G D+++ +DVS SM+ + +L + + + L+ Sbjct: 75 RPQWLGEPLPIAA---SGRDLLVAVDVSGSMDFPDMQWKDEEVSRLALVQHMLGDFLE-- 129 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAY 263 R GL+ F S+ PL + + ++ ++ I G T + A Sbjct: 130 -----GREGDRVGLILFGSQAYLQAPLTFDRRTVRHWLDEARIGIAGKNTAIGDAIGLAL 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ +++ H + +I +TDG N+ ID L A G +Y IG Sbjct: 185 KRL---RQRPAH--------SRVLILVTDGANNGGEIDP---LTAARLAADEGVKIYPIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKLH 352 + A LK A + +++ Q+ +L Sbjct: 231 IGAAPEQSGTTGSLGVNPSLDLDEPTLKEIAEVTGGQYFRAQDGEQLL 278 >gi|332291974|ref|YP_004430583.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170060|gb|AEE19315.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 334 Score = 99.1 bits (245), Expect = 9e-19, Method: Composition-based stats. Identities = 47/255 (18%), Positives = 85/255 (33%), Gaps = 57/255 (22%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGV 194 + P + S K K+ G+D+++ +DVS SM D ++ L Sbjct: 62 LRLAALSLIIVALARPRNVEVSTKT--KTTKGIDIVIAIDVSASMLAKDLRPNRLEALKK 119 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---- 250 S R GL+ ++ + P+ + + + + Sbjct: 120 VAASFIN----------GRPNDRIGLIEYAGESFTKTPITSDKSIVLSALKSIQYNNIIE 169 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T GL N++ D+K K II +TDGEN++ ID + Sbjct: 170 GGTAIGMGLATGVNRLKDSK-----------ALSKVIILMTDGENNAGQIDPR---IAAE 215 Query: 311 EAKRRGAIVYAIGVQAEA-----------------------ADQFLKNCA--SPDRFYSV 345 A+ G VY IG+ ++ L+ A + +++ Sbjct: 216 LAQEFGIKVYTIGMGTNGMALSPYARNANGTFVYENIQVTIDEELLEEIAATTGGQYFRA 275 Query: 346 QNSRKLHDAFLRIGK 360 N+ KL + + I K Sbjct: 276 TNNEKLQEIYDEIDK 290 >gi|313674519|ref|YP_004052515.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941217|gb|ADR20407.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 345 Score = 99.1 bits (245), Expect = 9e-19, Method: Composition-based stats. Identities = 49/218 (22%), Positives = 84/218 (38%), Gaps = 47/218 (21%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+M+VLD+S SM ++L A + + +D R GL FS + Sbjct: 104 GIDIMLVLDISESMKIQDFTP-NRLEAAKQVANDFID-------GRFQDRIGLTIFSGEA 155 Query: 228 VQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + ++ +I +++ S T L N++ + D Sbjct: 156 YSLSPLTTDYKMLKNQITDIDFKMMEASGTAIGSALAVGTNRM-----------RESDSK 204 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF----------- 332 K +I L+DG+N++ NID + S N G +Y I + E + Sbjct: 205 SKVLILLSDGDNNAGNIDPETSAKLAN---AYGIKIYTIAIGKEGKVPYGKDFFGRTRYI 261 Query: 333 --------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 LKN A +FY +++ L + F I + Sbjct: 262 ENSMDVTGLKNIAKIGEGQFYRATDNQALEEVFSIIDQ 299 >gi|189912860|ref|YP_001964749.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913185|ref|YP_001964414.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777536|gb|ABZ95836.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781253|gb|ABZ99550.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 317 Score = 99.1 bits (245), Expect = 9e-19, Method: Composition-based stats. Identities = 53/260 (20%), Positives = 102/260 (39%), Gaps = 29/260 (11%) Query: 113 SLSIIIDDQHKDYNLSAVSRYE-MPFIFCTFPWCANSSHAPLLITSSV---KISSKSDIG 168 + I D+ + N + ++ + + F F + + K+S S G Sbjct: 31 PIFFIKSDRFQKLNSTLGIQFRRILYSFTEFLVYLSMVFLVVAAAGPGSKYKLSPDSTKG 90 Query: 169 LDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+M+ LD+S SM + + ++L V+ D+++ R G+V F+ Sbjct: 91 VDIMIALDISGSMVNSYDFLPRNRLSVSK-------DLLREFVKKRLYDRIGIVVFAGAA 143 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL+ + + LI G+ S+ +E + DA + K + K I Sbjct: 144 YLQSPLSSDRFAL----DELIAGT---SSEDIEEQGTAVGDALVLSSYRLKNSEAKSKVI 196 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-----QFLKNCAS--PD 340 I LTDG +++ +D + + K G VY IG+ E + L+ +S Sbjct: 197 ILLTDGVSNTGKLDPDTAAY---TTKTMGIKVYCIGIGKEEGQYEINYESLQKISSNTNG 253 Query: 341 RFYSVQNSRKLHDAFLRIGK 360 +F+ ++ L I + Sbjct: 254 KFFRAESPEVLESVLNEIDQ 273 >gi|237737388|ref|ZP_04567869.1| BatA protein [Fusobacterium mortiferum ATCC 9817] gi|229421250|gb|EEO36297.1| BatA protein [Fusobacterium mortiferum ATCC 9817] Length = 319 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 55/245 (22%), Positives = 93/245 (37%), Gaps = 42/245 (17%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKL 192 ++ +F C + P +I+ + I G+D+++ LD+S SM F P ++L Sbjct: 51 KILILFSLILMCI-ALARPQIISENKII---KKEGIDIVVALDLSQSMLQRDFKP--NRL 104 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 A + + E +D R LV F PL + +++ ++L T Sbjct: 105 ETAKKLLEEFID-------KRINDRISLVVFGGDAYTKVPLTFDHNVVKDITSKLT---T 154 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T A I + K + K II +TDGEN+S + + A Sbjct: 155 DDITSNNRTA---IGMGLGVSLNRLKDSEAKSKVIILMTDGENNSGEMSPMGASEI---A 208 Query: 313 KRRGAIVYAIGVQA-----------------EAADQFLKNCAS--PDRFYSVQNSRKLHD 353 K G +Y IG+ A E + LKN AS ++ + ++ + Sbjct: 209 KELGIKIYTIGIGAREIQIRVPFGHTTVKNTELDENLLKNIASTTGGEYFRAGSEKEFQE 268 Query: 354 AFLRI 358 F RI Sbjct: 269 IFNRI 273 >gi|311695164|gb|ADP98037.1| von Willebrand factor type A domain protein [marine bacterium HP15] Length = 342 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 87/236 (36%), Gaps = 45/236 (19%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREM 202 + P + V++ G D+M+V+D+S SM++ G +++L R + + Sbjct: 70 VALARPQHVGEQVQMPV---SGRDLMLVVDISPSMDEQDMVLQGRSINRLQAVKRVLDDF 126 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + R GL+ F ++ PL + ++ ++ + ++ G+ Sbjct: 127 I-------SRRQGDRLGLILFGTEPYVQAPLTFDLETVRTLMR--------EAGLGMAGR 171 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I DA + ++ ++ LTDG N++ I ++ A +Y I Sbjct: 172 ATAIGDAVGLATKRLRNRPQDQRVVVLLTDGANTAGEITPDKATEIAAAA---SIRLYTI 228 Query: 323 GVQAEAADQ------------------FLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 G+ AE+ Q L A + ++ ++ +L + I Sbjct: 229 GIGAESMVQRGLLGSRRVNPSRDLDENLLTRMAQQTGGEYFRARSLPELELIYESI 284 >gi|77459433|ref|YP_348940.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77383436|gb|ABA74949.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 359 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 43/245 (17%), Positives = 90/245 (36%), Gaps = 51/245 (20%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDK 191 PF+ + ++ P + + I++ G D+++ +DVS SM+ + + Sbjct: 61 PFMLLWL-FLLIAAARPQWLGEPLPIAA---SGRDLLVAVDVSGSMDFPDMQWNDEDVSR 116 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---I 248 L + + + L+ + R GL+ F S+ PL + + ++ ++ I Sbjct: 117 LSLVQHLLGDFLES-------RDGDRVGLILFGSQAYLQAPLTFDRRTVRVWLDEARIGI 169 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + A ++ + + +I +TDG N+ ID L Sbjct: 170 AGKNTAIGDAIGLALKRL---------RMRPAQ--SRVLILVTDGANNGGEIDP---LTA 215 Query: 309 CNEAKRRGAIVYAIGVQAEAA-----------------DQFLKNCA--SPDRFYSVQNSR 349 A G +Y IG+ A+ + LK A + R++ ++ + Sbjct: 216 AKLAASEGVKIYPIGIGADPEESGATALLGGNPTLDLDEPALKAIAEVTGGRYFRARDGK 275 Query: 350 KLHDA 354 +L Sbjct: 276 QLQAI 280 >gi|301058342|ref|ZP_07199375.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447578|gb|EFK11310.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 331 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 53/255 (20%), Positives = 85/255 (33%), Gaps = 50/255 (19%) Query: 133 YEMP--FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FG 186 +P CT + P L S + S G+D+M+ LD S SM G Sbjct: 52 PRIPKILRACTLLLLIVVAARPQLYNVSRDVHS---PGVDIMLCLDTSGSMQALDFKVEG 108 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + +L + + + + R GLV F + PL + E +NR Sbjct: 109 KSVTRLEAVKKVVADFI-------GKRETDRIGLVVFGEEAFTQSPLTIDKGLLLELVNR 161 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + G T I A K K +I LTDG N++ I + + Sbjct: 162 MKIGMAGDRT--------AIGSAIAIGGKRLKDLKSKSKILILLTDGRNNAGEISPQAA- 212 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF---------------------LKNCA--SPDRFY 343 + G +Y IGV + F L+N A +++ Sbjct: 213 --ARAVREFGIKLYTIGVGGKGPAPFRMKTLFGTRLVPQHVDLDEVTLRNVAKTGGGKYF 270 Query: 344 SVQNSRKLHDAFLRI 358 NS++L + + I Sbjct: 271 RAANSQELQEIYDII 285 >gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 900 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 51/198 (25%), Positives = 81/198 (40%), Gaps = 26/198 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +D+M+VLD S SM D G+ KL +A + +M++ ++S V G++ F Sbjct: 401 EKEKNIDVMLVLDHSGSMADTEDAGIPKLEIAKSASAKMVEHLESSDGV------GVIAF 454 Query: 224 SSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + V+ + E I+ + G T P L A + +K K Sbjct: 455 DHNYYWAYKFGKLVRKEDVIESISSIEVGGGTAIIPPLSEAVKTLKKSKAKN-------- 506 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K ++ LTDG + +EAKR + IGV L A + Sbjct: 507 ---KLVVLLTDGMGEQSGYE-----IPADEAKRNNIKITTIGVGKFVNASVLSWIAAYTS 558 Query: 340 DRFYSVQNSRKLHDAFLR 357 RFY V N +L D FL+ Sbjct: 559 GRFYLVSNPSELVDVFLK 576 >gi|87310694|ref|ZP_01092822.1| BatA [Blastopirellula marina DSM 3645] gi|87286675|gb|EAQ78581.1| BatA [Blastopirellula marina DSM 3645] Length = 355 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 79/259 (30%), Gaps = 43/259 (16%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMD 190 +P + ++D G+ + MV+D S SM +D Sbjct: 52 LPGALTLTAMVLLIIALARPREGREQAIVEND-GIAIEMVVDRSGSMQAMDFQLGDEHVD 110 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-- 248 +L + + + ++ D GL+TF+ P + ++N Sbjct: 111 RLTAIKKVAGDFVTGGDNL-DGRLSDLVGLITFAGYADGVTPPTLDHAFLVSQLNHSQIV 169 Query: 249 ---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 T + A K+ + K II LTDGEN++ +++ ++ Sbjct: 170 TNRSEDGTAIGDAISLAVEKL-----NALDARRKEKIQSKIIILLTDGENNAGDLEPIQA 224 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQF----------------------LKNCAS--PDR 341 G VY IGV + L+ AS + Sbjct: 225 AELAQTM---GIKVYTIGVGTKGRAPMPVTDMFGRQSMQWMSVNIDEETLQKVASITGGK 281 Query: 342 FYSVQNSRKLHDAFLRIGK 360 ++ ++ L + I + Sbjct: 282 YFRATDTDSLAKIYGEIDQ 300 >gi|229590954|ref|YP_002873073.1| hypothetical protein PFLU3509 [Pseudomonas fluorescens SBW25] gi|229362820|emb|CAY49730.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 362 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 83/230 (36%), Gaps = 50/230 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + I++ G D+++ +DVS SM+ G + +L + + + L+ Sbjct: 75 RPEWLGEPLPIAA---SGRDLLVAVDVSGSMDFPDMNWQGEDVSRLSLVKHLLGDFLE-- 129 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAY 263 R GL+ F S+ PL + + ++ ++ I G T + A Sbjct: 130 -----GREGDRVGLILFGSQAYLQAPLTFDRRTVRTWLDEARIGIAGKNTAIGDAIGLAL 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + +I +TDG N++ ID L A G +Y IG Sbjct: 185 KRLRQRPAQ-----------SRVLILVTDGANNAGQIDP---LTAARLAAEEGVKIYPIG 230 Query: 324 VQAEAADQ-----------------FLKNC--ASPDRFYSVQNSRKLHDA 354 + A+ LK A+ +++ ++ +L Sbjct: 231 IGADPEQTGSLGILGVNPSLDLDEPALKAIAEATGGQYFRARDGEELQAI 280 >gi|88801581|ref|ZP_01117109.1| batA protein [Polaribacter irgensii 23-P] gi|88782239|gb|EAR13416.1| batA protein [Polaribacter irgensii 23-P] Length = 334 Score = 98.3 bits (243), Expect = 1e-18, Method: Composition-based stats. Identities = 45/230 (19%), Positives = 86/230 (37%), Gaps = 53/230 (23%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 V +K++ G+D++M +DVS SM P ++L + + +D R Sbjct: 82 VSKKTKTNSGIDIIMAIDVSASMLARDLKP--NRLEALKKVAIDFVD-------RRPNDR 132 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLE 274 G+V ++ + P+ ++ I+ L +G T GL N++ Sbjct: 133 IGIVVYAGESFTQTPITSDKNIVKRTISELQWGQLDGGTAIGMGLGSGVNRL-------- 184 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------ 328 K K II LTDG N++ NID + + A+ VY IG+ Sbjct: 185 ---KESTAKSKVIILLTDGVNNAGNIDPRTATEL---ARELEIKVYTIGIGTNGMADFPW 238 Query: 329 ------------------ADQFLKNC--ASPDRFYSVQNSRKLHDAFLRI 358 ++ L+ A+ +++ +++ L + + I Sbjct: 239 SKDPRTGKLNFRKQQVEIDEKLLQEIATATDGKYFRATDNQSLKEIYDEI 288 >gi|94499146|ref|ZP_01305684.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] gi|94428778|gb|EAT13750.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] Length = 340 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 91/248 (36%), Gaps = 43/248 (17%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S+ + + T+ + P+ + + +MM+ +D+S SM + Sbjct: 53 SKLLISLVTLTWLSFITAMAQPMFVGEP---KALQQTDRNMMLAVDISKSMLEEDMQYQG 109 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-- 248 +L ++ + ++ + R GL+ F + PL + + ++ ++ + Sbjct: 110 RL---VNRLQTVKAVVTDFVEERKGDRLGLILFGEQAYIQTPLTFDLSTVKRLLDEAVVG 166 Query: 249 -FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G+ T + ++ D E + +I LTDG+N++ I+ L Sbjct: 167 LAGNKTAIGDAIGLGVKRLQDLPESN-----------RVLILLTDGQNTAGEIEP---LK 212 Query: 308 YCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDRFYSVQN 347 A++ G +YAIG+ A+ + L A + ++Y +N Sbjct: 213 AAELAEKAGVKIYAIGIGADEMVIQGFFGPRRVNPSRDLDEDTLTAIAENTGGQYYRARN 272 Query: 348 SRKLHDAF 355 +L + Sbjct: 273 VNELEQIY 280 >gi|294141682|ref|YP_003557660.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578720|dbj|BAG66046.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328151|dbj|BAJ02882.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 334 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 50/283 (17%), Positives = 104/283 (36%), Gaps = 44/283 (15%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 F + IE S L + ++ + S + + + + PL + +++ Sbjct: 19 FRKQQQKIEISGHLHLPGIGENGSEQVQQSSHSRKAY-WLVWVLLVIAVARPLWMGDAIE 77 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML---DIIKSIPDVNNVVR 217 + SK G D+M+ +D+S SM ++ + + +++ D++ + + Sbjct: 78 LPSK---GRDLMIAVDLSGSMQ------IEDMVLNGQAVDRFTMIQDVVSDFIERRKGDK 128 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 GL+ F+ PL + + + + G K T I +A Sbjct: 129 LGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQT--------AIGEAIALGVKRF 180 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE---------- 327 D + ++ LTDG N+S +I +++ A +RG +YAIGV A+ Sbjct: 181 DMVDKSNRILVLLTDGSNNSGSISPEQAAAI---AAKRGVKIYAIGVGADVMERRSIFGT 237 Query: 328 ---------AADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGK 360 Q + ++ ++S+ L + I K Sbjct: 238 ERVNPSMDLDEAQLSSLAKITGGLYFRARSSQDLQQIYQEIDK 280 >gi|294653581|ref|NP_714598.2| von Willebrand factor type A domain-containing protein [Leptospira interrogans serovar Lai str. 56601] gi|293630705|gb|AAN51613.2| BatA [Leptospira interrogans serovar Lai str. 56601] Length = 312 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 47/252 (18%), Positives = 95/252 (37%), Gaps = 28/252 (11%) Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS---SKSDIGLDMMMV 174 ++ + S S++E F S L ++ K G+D+++ Sbjct: 32 MEIRFPGRRESTFSKFENLGKFLPILRPIAISLMILSLSGPGKKITFLPDEKEGVDILIA 91 Query: 175 LDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 LDVS SM+ +LGV+ + ++ ++ N R GLV F+ PL Sbjct: 92 LDVSGSMSRSRDFLPETRLGVSKKLLKRFIE-------KRNNDRLGLVVFAGAAYLQAPL 144 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + E + + + T+ I DA + + K I+ +TDG Sbjct: 145 TGDRESLSEILETIEEETVTEQGTA-------IGDAIILSTYRLRNSKARSKVIVLITDG 197 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-----QFLKNCA--SPDRFYSVQ 346 +++ ID + + GA +Y++G+ E + L+ + + RF+ + Sbjct: 198 VSNTGKIDPVTATDLAEQI---GAKIYSVGIGKEDGSYEINFEILQELSANTGGRFFRAE 254 Query: 347 NSRKLHDAFLRI 358 + ++ I Sbjct: 255 DPEEMKAVLSSI 266 >gi|45655623|ref|YP_003432.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602594|gb|AAS72069.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 320 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 47/252 (18%), Positives = 95/252 (37%), Gaps = 28/252 (11%) Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS---SKSDIGLDMMMV 174 ++ + S S++E F S L ++ K G+D+++ Sbjct: 40 MEIRFPGRRESTFSKFENLGKFLPILRPIAISLMILSLSGPGKKITFLPDEKEGVDILIA 99 Query: 175 LDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 LDVS SM+ +LGV+ + ++ ++ N R GLV F+ PL Sbjct: 100 LDVSGSMSRSRDFLPETRLGVSKKLLKRFIE-------KRNNDRLGLVVFAGAAYLQAPL 152 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + E + + + T+ I DA + + K I+ +TDG Sbjct: 153 TGDRESLSEILETIEEETVTEQGTA-------IGDAIILSTYRLRNSKARSKVIVLITDG 205 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-----QFLKNCA--SPDRFYSVQ 346 +++ ID + + GA +Y++G+ E + L+ + + RF+ + Sbjct: 206 VSNTGKIDPVTATDLAEQI---GAKIYSVGIGKEDGSYEINFEILQELSANTGGRFFRAE 262 Query: 347 NSRKLHDAFLRI 358 + ++ I Sbjct: 263 DPEEMKAVLSSI 274 >gi|86749514|ref|YP_486010.1| hypothetical protein RPB_2394 [Rhodopseudomonas palustris HaA2] gi|86572542|gb|ABD07099.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 456 Score = 98.0 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 59/439 (13%), Positives = 125/439 (28%), Gaps = 79/439 (17%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F G+++I+ AI L + +G I+ S + + LD + L + + + Sbjct: 16 RFVKTDGGNVAIIFAIALLPMIGFIGAAIDYSRANKARTSMQAALDSAALMVSKDLASGV 75 Query: 68 NGNNG-KKQKNDFSYRIIKNIWQTDF---RNELRENGFAQDINNIERSTSLSIIIDDQHK 123 + + + N + ++ ++ + +S + Sbjct: 76 ITAGQVSAKAQSYFASLYNNTEAPNITVTATYTAKDSTGSSTVLLKGTGDISTEFMNMFG 135 Query: 124 -------DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS------SKSDIGLD 170 + + + +S + S + + + D Sbjct: 136 FPTLGIGSAATATWGGTRLRVAIALDVTGSMASAGKMPAMQSAAKTLVDNLRANAQTADD 195 Query: 171 MMMVL-------------------------DVSLSMNDHFGPGMDKLGVATRS------- 198 + + + D + S N + A R+ Sbjct: 196 LYISIIPFAQMVNVGKSNKNASWIKWDYWEDTTGSCNWWWLTTKSSCESAGRTWSSTNQS 255 Query: 199 -----IREMLDIIKSIPDVNNV--VRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINR 246 + + + D R +S+ Q P+ I++KI+ Sbjct: 256 QWGGCVTDRDQPADTTKDAPTTAATRFPAANYSACPEQILPMTSAYSSSNATTIKDKIDA 315 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS--------- 297 L T G+ +A+ + D Y II L+DG N+ Sbjct: 316 LSPNGGTNQPIGMHWAWMSLQDGAPLNTPAKDADYKYTDAIILLSDGMNTIDRWYGNGSS 375 Query: 298 -PNIDNKESLFYCNEAKRRGA------IVYAIGVQAEA--ADQFLKNCASPDRFYSVQNS 348 + C+ + A ++Y I V + + LK CA F++ + Sbjct: 376 WSKDVDARQKLLCDNIRAASAASTTKTVIYTIQVNTDGDPESEVLKYCADSGNFFATTTA 435 Query: 349 RKLHDAFLRIGKEMVKQRI 367 + AF +IG + K RI Sbjct: 436 SGISTAFAQIGASLSKLRI 454 >gi|312961300|ref|ZP_07775805.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] gi|311284958|gb|EFQ63534.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] Length = 362 Score = 98.0 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 81/230 (35%), Gaps = 50/230 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + I++ G D+++ +DVS SM+ + +L + + + L Sbjct: 75 RPEWLGEPLPIAA---SGRDLLVAVDVSGSMDFPDMHWRDEDVSRLSLVKHLLGDFLQQ- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAY 263 R GL+ F S+ PL + + ++ ++ I G T + A Sbjct: 131 ------REGDRVGLILFGSQAYLQAPLTFDRRTVRTWLDEARIGIAGKNTAIGDAIGLAL 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + +I +TDG N++ ID L A G +Y IG Sbjct: 185 KRL---------RLRPAQ--SRVLILITDGANNAGQIDP---LTAARLAAEEGVKIYPIG 230 Query: 324 VQAEAADQ-----------------FLKNC--ASPDRFYSVQNSRKLHDA 354 + A+ LK A+ +++ ++ +L Sbjct: 231 IGADPEQTGSLGILGVNPSLDLDEPALKAIAAATGGQYFRARDGEELQAI 280 >gi|325106974|ref|YP_004268042.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324967242|gb|ADY58020.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 396 Score = 98.0 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 65/405 (16%), Positives = 140/405 (34%), Gaps = 89/405 (21%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN-GNN 71 +G++ +L A LL V+ I++ + ++ V+ +LH D + + E+ G Sbjct: 16 RRGAMLVLIAALLSVMLILVVFTTDVAYMQLVRTQLHVSTDAAAKAGMEALARTESRGQA 75 Query: 72 GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDIN---------NIERSTSLSIIIDDQH 122 K+ FS +I ++ + + ++ +S+ +DD Sbjct: 76 RVVAKDIFSKNLIGGRELKLHNKDIEFGRTDANPDGTWEFLPNERPFQAIRISVNLDDNR 135 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM- 181 + +P +F ++ S +++ ++++ LD S SM Sbjct: 136 QKGR-----NGSVPLLFG--------KVLGQSSFATNHSSVAANLVHEIVLCLDRSHSMC 182 Query: 182 ------NDHFGPGM------------------DKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + PG KL A + + LD ++ +PDV Sbjct: 183 FDETGVDYAYPPGTPSYPAGYITPPNPVGSRWAKLQGAIQVFVDTLDDLQIVPDV----- 237 Query: 218 SGLVTFSSKIVQT-----------------FPLAWGVQHIQEKINRL---IFGSTTKSTP 257 G+VT+ S I + PL + + I I T + Sbjct: 238 -GVVTWGSDITLSWSWYPFQGRSFPAVMVDVPLGQNLNLVSPAIAAKLGDIMMGGTNMSS 296 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 G++ + + + H +K II ++DG+ ++ + L N+A + Sbjct: 297 GIDRSVSLLTANG--------THSLAQKTIILMSDGQWNAG----RNPLDAANDAADKNI 344 Query: 318 IVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 ++ I ++ A + +F++ + L D F + K Sbjct: 345 TIHTIAF-LNGDQSVMRQIAERTGGKFFNAPDGESLEDTFKELAK 388 >gi|313207255|ref|YP_004046432.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446571|gb|ADQ82926.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023479|gb|EFT36485.1| aerotolerance operon BatA [Riemerella anatipestifer RA-YM] gi|325335298|gb|ADZ11572.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Riemerella anatipestifer RA-GD] Length = 330 Score = 98.0 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 48/247 (19%), Positives = 83/247 (33%), Gaps = 51/247 (20%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSI 199 A T S+ G+D+++ +DVSLSM P D+L Sbjct: 60 YIILSALILAIARPRTFSISEDRDETKGMDIVLSIDVSLSMLAKDLEP--DRLTALKEIA 117 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI---NRLIFGSTTKST 256 R + R GLV +S + + PL + ++E++ N + T Sbjct: 118 RTFIKQ-------RTTDRIGLVEYSGEALMRVPLTSDHRVVEEELMSFNPMDLEGGTNIG 170 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 GL A + + +K K K II +TDG N+ N L A+ Sbjct: 171 DGLAVAVSHLRKSKAK-----------SKIIILMTDGVNTIDNA--MSPLTAAELARNND 217 Query: 317 AIVYAIGVQAEA-----------------------ADQFLKNCA--SPDRFYSVQNSRKL 351 VY IG+ + + L++ A + +++ ++ L Sbjct: 218 IKVYTIGIGSNGLALMPTQQDIFGNLVFTEEQVKIDEYLLRDVAQITGGKYFRATSNESL 277 Query: 352 HDAFLRI 358 + I Sbjct: 278 KQIYEEI 284 >gi|330829762|ref|YP_004392714.1| von Willebrand factor, type A [Aeromonas veronii B565] gi|328804898|gb|AEB50097.1| von Willebrand factor, type A [Aeromonas veronii B565] Length = 347 Score = 98.0 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 84/234 (35%), Gaps = 31/234 (13%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRS 198 + P + + G D+M+VLD+S SM + G + +L A Sbjct: 76 LTVVALAKPTIYGPP---QVRERFGRDVMIVLDLSGSMAETDFSPDPGKSLSRLDAAK-- 130 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKS 255 +++K R GL+ F P Q + ++ + + G +T Sbjct: 131 -----EVLKQFAATREGDRLGLILFGDAAFLQAPFTADLETWQTLLQETDVAMAGQSTHL 185 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + A K+F+ ++ + +K I LTDG ++ + +++ A Sbjct: 186 GDAIGLAI-KVFNNSDRHGQQDQNSAKREKVAIILTDGNDTGSFVSPRDA---ARVAAVN 241 Query: 316 GAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 G ++ I + A L+ A + + + + +L A+ IG+ Sbjct: 242 GVRLHTIAMGDPATVGEQALDLDTLQQLATLTGGQLFQALDEAQLTRAYQVIGE 295 >gi|330959358|gb|EGH59618.1| von Willebrand factor, type A [Pseudomonas syringae pv. maculicola str. ES4326] Length = 353 Score = 98.0 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 86/247 (34%), Gaps = 48/247 (19%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ACARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKNDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYN 264 + R GL+ F ++ PL + + ++ ++ I G T + A Sbjct: 126 DFLEGRKGDRVGLILFGTQAFLQAPLTYDRRTVRVWLDEAKIGIAGKNTAIGDAIGLALK 185 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 ++ + ++ +TDG N+ ID + A G +Y +G+ Sbjct: 186 RLRMRPAN-----------SRVLVLVTDGANNGGQIDP---ITAARLAADEGVKIYTVGI 231 Query: 325 QAEAADQFLK-------------------NCASPDRFYSVQNSRKL------HDAFLRIG 359 ++ L+ S +++ ++ +L DA + Sbjct: 232 GSDPDKNALQGVLGLNPSLDLDEPTLKDIASLSGGQYFRARDGAELDKIRTALDALEPVA 291 Query: 360 KEMVKQR 366 ++ + R Sbjct: 292 QQPTQAR 298 >gi|91216721|ref|ZP_01253686.1| batA protein [Psychroflexus torquis ATCC 700755] gi|91185190|gb|EAS71568.1| batA protein [Psychroflexus torquis ATCC 700755] Length = 334 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 54/279 (19%), Positives = 94/279 (33%), Gaps = 55/279 (19%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++ S+ + K L+ + + P + + K+ K G+ Sbjct: 35 QNASIKMSSTQGFKMSTLAKLRPLLFILKMLALVLLTIAMARPRTVDVTTKV--KKTEGI 92 Query: 170 DMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++M +D+S SM P ++L + ++ R GLV ++ + Sbjct: 93 DIIMAVDISASMLARDLEP--NRLEALKKVAINFIE-------GRPNDRIGLVIYAGESY 143 Query: 229 QTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL I IN L + T GL + NK+ D+K + Sbjct: 144 TKTPLTTDKSIIFNAINDLEYSQNIEGGTAIGMGLATSVNKLKDSKAE-----------S 192 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------------- 328 K II LTDGEN++ ID K + A Y IGV + Sbjct: 193 KVIILLTDGENNAGFIDPKTATQL---ATEYDIKTYTIGVGSNGMALSPVGIKANGQFEY 249 Query: 329 -------ADQFLKNCASP--DRFYSVQNSRKLHDAFLRI 358 + LK A +++ +++K + I Sbjct: 250 RNIEVKIDEALLKTIAESNGGKYFRATDNQKFEAIYEEI 288 >gi|126273404|ref|XP_001377627.1| PREDICTED: similar to AMACO [Monodelphis domestica] Length = 784 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 55/200 (27%), Positives = 91/200 (45%), Gaps = 22/200 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 VKISS S + + LDV +++ G + +M D + PD VR Sbjct: 34 IVKISSASQL-MQCSAALDVLFALDGSHSIGKGSFERSKYFAIKMCDALAIYPDR---VR 89 Query: 218 SGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLE 274 G++ FSS FPL + + ++EKI +++F G +T++ L+Y +K F Sbjct: 90 VGVLQFSSVPQLEFPLDSFFTREEVKEKIKKIVFKGGSTETGLALKYLLHKGFPGGR--- 146 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + + +I +TDG+ S NID N+ K RG +V+A+GV+ + L Sbjct: 147 -----NSSVPQLLIIVTDGK-SQGNID-----LPANQLKERGVMVFAVGVRFPRWTE-LH 194 Query: 335 NCASPDRFYSVQNSRKLHDA 354 AS + V + + DA Sbjct: 195 TLASEPKDQYVLFAEHVDDA 214 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 64/184 (34%), Gaps = 18/184 (9%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S ++ KL R +I +V + GLV + S++ Sbjct: 532 SLDLVFLLDASATVGQE---NFTKLQSFVRGSSLQFNIN------RDVTQIGLVVYGSRV 582 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 TF L + + A +++ ++ A+ K + Sbjct: 583 QTTFALDT-HPTSSSLLQAISQAPYMDGAGSTGSALLHVYEEVMTVQKGARPG--VSKAV 639 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + +T+G ++++ + + G V I V + L+ SPD V + Sbjct: 640 VVITEG------TGTEDAVVPAQKLRNNGISVLVIAVGPVLKETLLRLAGSPDFLIHVAS 693 Query: 348 SRKL 351 L Sbjct: 694 YEDL 697 >gi|330878848|gb|EGH12997.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 352 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 88/248 (35%), Gaps = 48/248 (19%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 71 TATARPQWLGEPLPVAA---SGRDLLVAIDVSGSMD---YPDMQWKSDEVSRLVLVQQLL 124 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAY 263 + R GL+ F ++ PL + + ++ ++ I G T + A Sbjct: 125 GDFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLAL 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + ++ +TDG N++ ID + A G +Y IG Sbjct: 185 KRLRLRPAN-----------SRVLVLVTDGANNAGQIDP---ITAARLAADEGVKIYPIG 230 Query: 324 VQAEAADQFLK-------------------NCASPDRFYSVQNSRKL------HDAFLRI 358 + ++ L+ S +++ ++ +L DA + Sbjct: 231 IGSDPDKDALQSALGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKIRVTLDALEPV 290 Query: 359 GKEMVKQR 366 ++ + R Sbjct: 291 AQQPTQAR 298 >gi|223558081|gb|ACM91085.1| aerotolerance protein BatA [uncultured bacterium Rlip1] Length = 332 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 50/220 (22%), Positives = 82/220 (37%), Gaps = 50/220 (22%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D++M +DVS SM P D+L A + + R GLV FS + Sbjct: 91 GIDIVMAMDVSGSMLARDLKP--DRLTAAKNVASDFV-------KGRPGDRMGLVIFSGE 141 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + + + G T GL A +++ K + Sbjct: 142 TFTQVPLTTDHGVMLNMLAEMKNGLIDDGTAIGDGLATAISRL-----------KDSEAI 190 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------------ 325 K +I LTDG N++ ++D + AK G VY IGV Sbjct: 191 SKVVILLTDGMNNAGSVDPYTA---AEIAKLYGIRVYTIGVGSYGTAPYPVQTPFGTQIQ 247 Query: 326 ---AEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 E ++ L + AS +++ +++KL + + I K Sbjct: 248 QMKVEIDEKLLASVASMTGGKYFRATSNQKLDEIYEEIDK 287 >gi|218781310|ref|YP_002432628.1| hypothetical protein Dalk_3472 [Desulfatibacillum alkenivorans AK-01] gi|218762694|gb|ACL05160.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length = 308 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 46/242 (19%), Positives = 83/242 (34%), Gaps = 26/242 (10%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +A R +P + T + T K+ G+D+++ LD S SM Sbjct: 44 GKNAEIRARIPLLVRTLALVLLVAAIARPQTVDASREIKT-PGVDIILCLDASESMAQPD 102 Query: 186 ----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G +++L + + + + + R GLV F PL Sbjct: 103 FAIDGQRVNRLTAVKKVVHDFV-------KRRDTDRIGLVVFGDYAFTQAPLTLD----- 150 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 L+ G+ I DA K K +I L+DGEN++ ++ Sbjct: 151 ---KGLLLNLIENLRIGMAGRKTAIGDALGVAGKRIKDIPAMSKVVILLSDGENTAGDMT 207 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAE--AADQFLKNCA-SPDRFYSVQNSRKLHDAFLRI 358 + + G +Y IG+ E + + + A ++Y N+ +L + I Sbjct: 208 PQGAAEA---LAALGIKIYTIGMGTEQAGSKELAQIAAIGQGKYYHASNTEQLDSIYKEI 264 Query: 359 GK 360 K Sbjct: 265 DK 266 >gi|114704798|ref|ZP_01437706.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] gi|114539583|gb|EAU42703.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] Length = 545 Score = 97.2 bits (240), Expect = 3e-18, Method: Composition-based stats. Identities = 33/146 (22%), Positives = 66/146 (45%), Gaps = 12/146 (8%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L + +Q ++ +N+L T T G+++ + A + + +K +I L Sbjct: 401 TGLTFDLQSVETAVNKLTPSGNTNVTIGVQWGMEALTAAAPLTG--VRTGSEVRKVMIVL 458 Query: 291 TDGENSSP--------NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDR 341 TDG N+ N + +L CN AK G +Y + + E + LK CA + D+ Sbjct: 459 TDGLNTQNRWWGSRDRNKIDARTLAACNNAKAMGIELYTVRL-VEGNEDLLKTCAETEDK 517 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ V ++ +L F + +++ R+ Sbjct: 518 YHYVTSASQLKTTFADLARQVKGVRL 543 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 37/254 (14%), Positives = 89/254 (35%), Gaps = 44/254 (17%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 ++ + + +G +++T + L + G ++ + V+ + +D S L + + Sbjct: 28 LKQYRDDRRGQFAVITCLALVPLIAAAGGAVDLWNARRVQNAVQNAVDTSALAAVS--YS 85 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 E +K+ + + + +++ I + Sbjct: 86 GEEQTEREKRADTL-------FLNNTAGIAIEDTDLSEEDGAWVYKAEYKI----KTNFL 134 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + + +EM S ++ ++ +D+++VLD S SM Sbjct: 135 RVVGIDEFEM---------------------ESQGAAALANSPMDVVLVLDSSGSMAQD- 172 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 +++ S++ L+ KS N++ + LV F +++ T L +G N Sbjct: 173 ----NRMVELKASVKLFLEEFKS----NDLTQVALVPFDTQVKATSSL-FGAAGNVSVAN 223 Query: 246 RLIFGSTTKSTPGL 259 L GS + L Sbjct: 224 PLATGSCATISDPL 237 >gi|163754426|ref|ZP_02161548.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] gi|161325367|gb|EDP96694.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] Length = 335 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 51/254 (20%), Positives = 90/254 (35%), Gaps = 52/254 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+ A + T V K++ G+D++M +DVS SM ++L Sbjct: 60 FVLRMLAIMALITALARPQTKEVSTRIKTNKGIDIVMAIDVSASMLSK-DLRPNRLTALK 118 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----IFGS 251 + E ++ R GLV ++ + P+ IQ + + Sbjct: 119 KVAAEFIE-------GRPSDRIGLVVYAGESFTKTPITTDKSIIQNALKDIKYKHGELIG 171 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GL A N++ D+K K K II LTDG N++ I+ + + Sbjct: 172 GTAIGMGLATAVNRLKDSKAK-----------SKVIILLTDGVNNAGFIEPQIASEL--- 217 Query: 312 AKRRGAIVYAIGVQAEA-----------------------ADQFLKNCA--SPDRFYSVQ 346 A G Y IG+ ++ L+ A + +++ Sbjct: 218 AVEYGIKTYTIGIGTNGMASTPVALNPDGTILFRNMQVEIDEKLLQQIAKTTGGKYFRAT 277 Query: 347 NSRKLHDAFLRIGK 360 N++KL + + I K Sbjct: 278 NTKKLAEIYDEINK 291 >gi|159901412|ref|YP_001547659.1| hypothetical protein Haur_4901 [Herpetosiphon aurantiacus ATCC 23779] gi|159894451|gb|ABX07531.1| conserved hypothetical membrane protein [Herpetosiphon aurantiacus ATCC 23779] Length = 330 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 47/272 (17%), Positives = 90/272 (33%), Gaps = 57/272 (20%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + P SS ++ + G+D+ + LD+SLSM D++ Sbjct: 55 VLISLRAAAVGLLVVVLTRPQYAQSSERVVRE---GIDIQLALDISLSMKAGDFDPKDRI 111 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 VA ++I R GLV FS PL +Q++ ++ + Sbjct: 112 TVAK-------EVIAEFVKGRKDDRIGLVVFSGHAFTQVPLTLDYDFLQNLLGQVQTVRR 164 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T L ++ N + + K +I LTDG N+ +I+ ++ Sbjct: 165 PDGTAIGLALAHSVNGL-----------RNSTTKSKVVILLTDGSNNRGDIEPAQA---A 210 Query: 310 NEAKRRGAIVYAIGVQAEAADQF-----------------------LKNCA--SPDRFYS 344 A+ VY I V ++ L++ A + F+ Sbjct: 211 EIARALDVRVYTILVGKPGNGEYPVHDPWRDETYLIPAPTAEDEVALRDIAEQTGGIFFR 270 Query: 345 VQNSRKLHDAFLRI-----GKEMVKQRILYNK 371 + + L D + I + ++ + Y + Sbjct: 271 AGDEQGLRDVYDTIDKMERSQVASEKLVRYTE 302 >gi|261212659|ref|ZP_05926943.1| protein BatA [Vibrio sp. RC341] gi|260837724|gb|EEX64401.1| protein BatA [Vibrio sp. RC341] Length = 232 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 88/216 (40%), Gaps = 48/216 (22%) Query: 172 MMVLDVSLSMN-DHFGPG---MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 M+V+D+S SM+ + G +D+L + + + + R GL+ F+ Sbjct: 1 MLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVLSDFI-------AKREGDRIGLILFADHA 53 Query: 228 VQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL + + +++N+ + G+ T G+ A D+ D + Sbjct: 54 YLQTPLTLDRETVTQQLNQAVLKLIGTQTAIGEGIGLATKIFIDS-----------DAPQ 102 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------------------ 326 + II L+DG N++ +D L N AK+ + +Y +GV A Sbjct: 103 RVIILLSDGSNTAGVLDP---LEAANIAKQYHSTIYTVGVGAGEMVVKDFLFSRKVNTAQ 159 Query: 327 EAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 + ++ L+ AS +++ +N + L + + I + Sbjct: 160 DLDEKTLQTIASTTGGQYFRARNQQDLQNIYDTINQ 195 >gi|317483048|ref|ZP_07942050.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915549|gb|EFV36969.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 813 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 52/314 (16%), Positives = 100/314 (31%), Gaps = 85/314 (27%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 ++ + K + Y++ + + V + LD+++ Sbjct: 81 PTLSAPAREKTVTANGDGTYKVALN--------VTGAKSAGTGAIVT-----NQPLDIVL 127 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSS---- 225 VLDVS SM D+ G K+ ++ +D I D + R LV F+ Sbjct: 128 VLDVSGSMADNLSGGPKKIDALKTAVNGFIDATADENAKITDQSQRNRIALVKFAGTEKT 187 Query: 226 ---------------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 L + V + +N L T + A + Sbjct: 188 SVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFNRAQAALT--- 244 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEA---KRRGAIVYAIGVQA 326 + + KK +IF TDGE + + D + N+A K G +Y+IGV + Sbjct: 245 ------YQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVVS 298 Query: 327 EAA--------DQFLKNCAS----------------------------PDRFYSVQNSRK 350 A ++++ +S + + ++ + Sbjct: 299 GANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKAATDAGQ 358 Query: 351 LHDAFLRIGKEMVK 364 L++ F I +E+ K Sbjct: 359 LNNIFESIYQEITK 372 >gi|320158501|ref|YP_004190879.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933813|gb|ADV88676.1| BatA [Vibrio vulnificus MO6-24/O] Length = 362 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 83/233 (35%), Gaps = 31/233 (13%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 ++ P ++ + S +G D+M+V+D+S SM + A+ + L Sbjct: 84 LVVSALAKPTILGAPQIRES---LGRDVMVVVDLSGSMAEQ------DFTSASGANISRL 134 Query: 204 DIIK----SIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKST 256 D K R GL+ F P + + + + G +T Sbjct: 135 DATKEVLAEFAKTRQGDRLGLILFGDAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLG 194 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A ++ +K +K I LTDG ++ ++ ++ AK +G Sbjct: 195 DAIGLAIKVFEQSEPSQAASSKP---RQKVAIVLTDGNDTGSFVEPIDA---AKVAKAKG 248 Query: 317 AIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 ++ I + + Q ++ A S + + N +L A+ IGK Sbjct: 249 VRIHVIAMGDPSTVGESALDLQTIERIASESGGKAFQALNRDELASAYDDIGK 301 >gi|330963348|gb|EGH63608.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. actinidiae str. M302091] Length = 352 Score = 96.8 bits (239), Expect = 4e-18, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 89/248 (35%), Gaps = 48/248 (19%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 71 TATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLL 124 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAY 263 + R GL+ F ++ PL + + ++ ++ I G T + A Sbjct: 125 GDFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLAL 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + ++ +TDG N++ ID + A G +Y IG Sbjct: 185 KRLRLRPAN-----------SRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIG 230 Query: 324 VQAEAADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRI 358 + ++ L++ S +++ ++ +L DA + Sbjct: 231 IGSDPDKDALQSVLGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKIRATLDALEPV 290 Query: 359 GKEMVKQR 366 ++ + R Sbjct: 291 AQQPTQAR 298 >gi|312877126|ref|ZP_07737097.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311796100|gb|EFR12458.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 900 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 50/198 (25%), Positives = 81/198 (40%), Gaps = 26/198 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +D+++VLD S SM D G+ KL +A + +M++ ++S V G++ F Sbjct: 401 EKEKNIDVVLVLDHSGSMADTEDAGIPKLEIAKSASAKMVEHLESSDGV------GVIAF 454 Query: 224 SSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + V+ + E I+ + G T P L A + +K K Sbjct: 455 DHNYYWAYKFGKLVRKEDVIESISSIEVGGGTAIIPPLSEAVKTLKKSKAKN-------- 506 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K ++ LTDG + +EAKR + IGV L A + Sbjct: 507 ---KLVVLLTDGMGEQSGYE-----IPADEAKRNNIKITTIGVGKFVNASVLSWIADYTS 558 Query: 340 DRFYSVQNSRKLHDAFLR 357 RFY V N +L D FL+ Sbjct: 559 GRFYLVSNPSELVDVFLK 576 >gi|217978613|ref|YP_002362760.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217503989|gb|ACK51398.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 325 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 79/251 (31%), Gaps = 48/251 (19%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-------ND 183 S + + W T ++ G D++ LD+S SM + Sbjct: 58 SGSRLGVLLAWIAWILLVVALAGPRT-IAASPAQPASGRDIVFALDLSGSMAAEDFVLDG 116 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 H +D L ++ + R GLV F+ + PL++ V + Sbjct: 117 HAASRIDALKRVGAALIKR----------RTGDRIGLVIFAERAYAAAPLSFDVDAVSRT 166 Query: 244 INRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 + + G +T GL A ++ ++K + I+ L+DG N + Sbjct: 167 LAEIPLGLVGHSTAIGEGLGLALKRLTESK-----------APSRVIVLLSDGANDAGTT 215 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----------LKNCA--SPDRFYSVQN 347 D N G +Y IG+ F L+ A + V+ Sbjct: 216 DPTGVAELAN---NLGVKIYTIGLGVVDTQTFNGLGDPVDFLALQRLAEIGGGEAFRVRT 272 Query: 348 SRKLHDAFLRI 358 + L A I Sbjct: 273 TEDLAYASAAI 283 >gi|32475535|ref|NP_868529.1| BatA [Rhodopirellula baltica SH 1] gi|32446077|emb|CAD75906.1| BatA [Rhodopirellula baltica SH 1] Length = 357 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 78/227 (34%), Gaps = 43/227 (18%) Query: 168 GLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+ + MV+D S SM + G +D+L + + + + + GL+TF Sbjct: 85 GIAIEMVIDRSGSMQALDFNIDGEPVDRLTAVKNVASKFITGGEDL-EGRFSDLVGLITF 143 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAK 278 ++ P + ++N+ S T + + K+ + E + Sbjct: 144 AAYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALSVEKLNALDARQERKVQ 203 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------ 332 K +I LTDGEN++ +D ++ G +YAIGV Sbjct: 204 -----SKILILLTDGENTAGELDPVQAAELAETL---GIKIYAIGVGTTGKAPVPVRDPF 255 Query: 333 -----------------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L+ A + +++ ++ L + I + Sbjct: 256 TGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSLDAIYREIDQ 302 >gi|84385834|ref|ZP_00988864.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] gi|84379150|gb|EAP96003.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] Length = 359 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 87/243 (35%), Gaps = 33/243 (13%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + T+ + P ++ + +G D+M+V+D+S SM + G + +L Sbjct: 73 LIITWALVVCALAKPTILGEP---QVREQLGRDVMVVVDLSGSMAEQDFTSKRGDKISRL 129 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 +++ R GL+ F P + + + + Sbjct: 130 DATK-------EVLADFATTRKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMA 182 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY---KKYIIFLTDGENSSPNIDNKESL 306 G +T + A ++++ + D +K +I LTDG ++ ++ ++ Sbjct: 183 GQSTHLGDAIGLAIKVFEQSEKQSAAVQDSSVDANEKEKVVIVLTDGNDTGSFVEPIDA- 241 Query: 307 FYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 AK +G ++ I + + +K A S + N +L A+ + Sbjct: 242 --AKVAKAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQ 299 Query: 358 IGK 360 IG+ Sbjct: 300 IGE 302 >gi|156308416|ref|XP_001617662.1| hypothetical protein NEMVEDRAFT_v1g225902 [Nematostella vectensis] gi|156195093|gb|EDO25562.1| predicted protein [Nematostella vectensis] Length = 273 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 54/254 (21%), Positives = 93/254 (36%), Gaps = 59/254 (23%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLG-VATR 197 + + V S++ G+D++M +DVS SM D +D L VA+ Sbjct: 1 LLALSSIIIALARPRSVDVTAKSRTTKGIDIVMAIDVSGSMLAKDFKPNRLDALKRVAST 60 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-----NRLIFGST 252 I + ++ R GLV ++ + P+ I + + + I Sbjct: 61 FIEDRIND-----------RIGLVVYAGESYTRTPITSDKTVILQSLKTVEYDDSIIADG 109 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GL A N+I D+K K + II LTDG N++ ID + + A Sbjct: 110 TGIGVGLATAINRIKDSKAK-----------SRVIILLTDGVNNAGTIDPR---MAADIA 155 Query: 313 KRRGAIVYAIGVQAEA------------------------ADQFLKNCA--SPDRFYSVQ 346 K+ G VY IG+ ++ +K A + +++ Sbjct: 156 KQYGIKVYTIGIGTNGMALFPYAKDQETGKFLFRNMQVEIDEKLMKEIAEMTDGKYFRAT 215 Query: 347 NSRKLHDAFLRIGK 360 + +KL + I K Sbjct: 216 DDKKLKAIYAEINK 229 >gi|213968792|ref|ZP_03396933.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] gi|213926395|gb|EEB59949.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] Length = 328 Score = 96.4 bits (238), Expect = 6e-18, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 88/247 (35%), Gaps = 48/247 (19%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 48 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 101 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYN 264 + R GL+ F ++ PL + + ++ ++ I G T + A Sbjct: 102 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALK 161 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 ++ + ++ +TDG N++ ID + A G +Y IG+ Sbjct: 162 RLRLRPAN-----------SRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGI 207 Query: 325 QAEAADQFLK-------------------NCASPDRFYSVQNSRKL------HDAFLRIG 359 ++ L+ S +++ ++ +L DA + Sbjct: 208 GSDPDKDALQSALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKIRATLDALEPVA 267 Query: 360 KEMVKQR 366 ++ + R Sbjct: 268 QQPTQAR 274 >gi|28870917|ref|NP_793536.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|301385766|ref|ZP_07234184.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato Max13] gi|302061830|ref|ZP_07253371.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato K40] gi|302134226|ref|ZP_07260216.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato NCPPB 1108] gi|28854166|gb|AAO57231.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] gi|331018299|gb|EGH98355.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 352 Score = 96.4 bits (238), Expect = 6e-18, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 88/247 (35%), Gaps = 48/247 (19%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYN 264 + R GL+ F ++ PL + + ++ ++ I G T + A Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALK 185 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 ++ + ++ +TDG N++ ID + A G +Y IG+ Sbjct: 186 RLRLRPAN-----------SRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGI 231 Query: 325 QAEAADQFLK-------------------NCASPDRFYSVQNSRKL------HDAFLRIG 359 ++ L+ S +++ ++ +L DA + Sbjct: 232 GSDPDKDALQSALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKIRATLDALEPVA 291 Query: 360 KEMVKQR 366 ++ + R Sbjct: 292 QQPTQAR 298 >gi|269926132|ref|YP_003322755.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] gi|269789792|gb|ACZ41933.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] Length = 643 Score = 96.4 bits (238), Expect = 6e-18, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 105/275 (38%), Gaps = 37/275 (13%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 N I + ++ I++ LSA + +P + + + Sbjct: 34 NTVRVSIREVSTTSQPKIVM-------TLSANNSKGLPVTDLSADDFIVKENGKEQSDIA 86 Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V ++ +D+++ LD S SMND D A + +++ + + Sbjct: 87 VYPFYQNPDPIDVVLALDTSASMND------DAFTAAQDAAYGLINGLSPED------KV 134 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+TF PLA +QE I +L T GL A ++ + Sbjct: 135 GLITFDKTARVIEPLAQDHARVQESIQKLSRSVGTALYQGLSLAAQEVAKGQNTKA---- 190 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 I+ +TDG N+S N +E++ +A+ GA V+ +G + Q L+ A Sbjct: 191 --------IVLMTDGFNTSRNTTLEEAVA---KAQEVGASVFTVGFGKKVDTQGLQKIAN 239 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGKEM-VKQRILYN 370 + ++S + +L F I +++ + R+ Y Sbjct: 240 ETGGEYFSAPTNAQLRRVFADISQKLHQEYRLSYT 274 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 96.4 bits (238), Expect = 6e-18, Method: Composition-based stats. Identities = 51/263 (19%), Positives = 93/263 (35%), Gaps = 15/263 (5%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + P+ ++ + S+A T + +S G + + + Sbjct: 311 STTTWTRTNNASNSTPWPSASYYGTPSYSYAQYNGTITATPTSAGGYGSGSTTTIKDNST 370 Query: 181 M---NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS--GLVTFSSKIVQTFPLAW 235 + +D G G D ++ D+ P +N ++ ++ L Sbjct: 371 ITANSDLLGVGTDSWNGCVIDRKQPYDVSGQSPIASNTDTLYPAAKCATNNLLPVMGLTT 430 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + ++ +L T T G+++ + KY+I +TDGEN Sbjct: 431 DIAAVRAHAQKLTPAGNTNITIGVQWGMELLSPELPFNTAKPYSDKTNYKYMIVITDGEN 490 Query: 296 S------SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQNS 348 + S + N +L C AK G VY I V E LK+CAS P+ FY V S Sbjct: 491 TQNRWSTSASTINARTLLACQAAKDLGITVYTIRVM-EGNSDMLKSCASRPEYFYDVTAS 549 Query: 349 RKLHDAFLRIGKEMVKQRILYNK 371 +L ++ + Q K Sbjct: 550 SQLTSTLAKVFYSI--QSTRLTK 570 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 72/219 (32%), Gaps = 34/219 (15%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F+ + G+ I ++ + G +E S +L D + L A ++ ++ Sbjct: 21 FWRDVSGNTMIAFGLIAATLVAAAGAGVEFSQAQEQTNRLQDAADAAALRGA--LMAKDE 78 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 F+ + D S L I HK + Sbjct: 79 TAAKAAADTVFTLNL-------------------TDSGIAPTSKGLKFEISGSHKTAVYT 119 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 A ++ + F+ + L + ++ ++ ++ +VLD + SM+ Sbjct: 120 ATAQIKTTFLKL-------AGIETLTVGATSTAEAEMRKS-EIALVLDSTGSMSRD---- 167 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 ++ ++ +L + + R G+V F +++ Sbjct: 168 -SRMTNLKAAVDSVLASLLVSGENVWDARVGIVPFDTQV 205 >gi|37676326|ref|NP_936722.1| hypothetical protein VVA0666 [Vibrio vulnificus YJ016] gi|37200868|dbj|BAC96692.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 362 Score = 96.4 bits (238), Expect = 6e-18, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 83/233 (35%), Gaps = 31/233 (13%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 ++ P ++ + S +G D+M+V+D+S SM + A+ + L Sbjct: 84 LVVSALAKPTILGAPQIRES---LGRDVMVVVDLSGSMAEQ------DFTSASGANISRL 134 Query: 204 DIIKS----IPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKST 256 D K R GL+ F P + + + + G +T Sbjct: 135 DATKEVLAEFAKTRQGDRLGLILFGDAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLG 194 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A ++ +K +K I LTDG ++ ++ ++ AK +G Sbjct: 195 DAIGLAIKVFEQSESNQAASSKP---RQKVAIVLTDGNDTGSFVEPIDA---AKVAKAKG 248 Query: 317 AIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 ++ I + + Q ++ A S + + N +L A+ IGK Sbjct: 249 VRIHVIAMGDPSTVGESALDLQTIERIASESGGKAFQALNRDELARAYDDIGK 301 >gi|225012026|ref|ZP_03702463.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003581|gb|EEG41554.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 334 Score = 96.4 bits (238), Expect = 6e-18, Method: Composition-based stats. Identities = 53/282 (18%), Positives = 97/282 (34%), Gaps = 53/282 (18%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + L + K ++ + F+F T + +K++ G Sbjct: 32 KAQAELKMSSLSSFKQHSSFWSLLRPILFVFRLVALAMIILAIARPQTVDISTRTKTNKG 91 Query: 169 LDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D++M +DVS SM P D+L R +D R GLV ++ + Sbjct: 92 IDIVMAIDVSSSMLAQDLKP--DRLSALKRVASAFVDD-------RLSDRIGLVVYAGES 142 Query: 228 VQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P+ ++ + + + T GL + N++ D++ K Sbjct: 143 YTLTPITSDKGIVKGSLREISYQGLIEDGTAIGMGLATSVNRLKDSRAK----------- 191 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------------ 331 K II LTDG N+S ID K + A G Y IG+ + + Sbjct: 192 SKVIILLTDGVNNSGFIDPKIATEL---AVEFGIKTYTIGLGSNGTARAPVGILPNGSFQ 248 Query: 332 -----------FLKNC--ASPDRFYSVQNSRKLHDAFLRIGK 360 L+ A+ ++ +++KL + + I K Sbjct: 249 YAMTKVEIDEALLQEIATATGGIYFRATDNKKLEEIYEEINK 290 >gi|170750695|ref|YP_001756955.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] gi|170657217|gb|ACB26272.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] Length = 345 Score = 96.4 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 76/214 (35%), Gaps = 32/214 (14%) Query: 167 IGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G ++M+ +D+S SM G +++L R + + R GLV Sbjct: 101 SGREIMIAMDLSGSMERRDFALDGETVNRLTAVKRVGTDFI-------RRRAGDRIGLVI 153 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKI--FDAKEKLEHIA 277 F+ + ++ + ++ I G +T GL A ++ DA + + Sbjct: 154 FADQAYVAAAPSFDTAAVARALDEATIGISGRSTGIGDGLGLALRRLDPRDAGGEAASGS 213 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------AEAADQ 331 K + K +I L+DG N++ K+ A+ G VY I + A+ Sbjct: 214 KPGEKPAKAVILLSDGANNAGQTAPKD---VAELARELGIKVYTIALGPRDMADADGEQD 270 Query: 332 FLKN-------CASPDRFYSVQNSRKLHDAFLRI 358 + AS + V+ + L I Sbjct: 271 VVDTETLRDMARASGGEAFRVRTTEDLVRVADAI 304 >gi|326424188|ref|NP_762140.2| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|319999572|gb|AAO07130.2| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 362 Score = 96.4 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 84/234 (35%), Gaps = 33/234 (14%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-----FGPGMDKLGVATRS 198 ++ P ++ + S +G D+M+V+D+S SM + G + +L Sbjct: 84 LVVSALAKPTILGAPQIRES---LGRDVMVVVDLSGSMAEQDFTSASGAKISRLDATK-- 138 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKS 255 +++ R GL+ F P + + + + G +T Sbjct: 139 -----EVLADFAKTRQGDRLGLILFGDAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHL 193 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + A ++ +K +K I LTDG ++ ++ ++ AK + Sbjct: 194 GDAIGLAIKVFEQSEPSQAAFSKP---RQKVAIVLTDGNDTGSFVEPIDA---AKVAKAK 247 Query: 316 GAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 G ++ I + + Q ++ A S + + N +L A+ IGK Sbjct: 248 GVRIHVIAMGDPSTVGESALDLQTIERIASESGGKAFQALNRDELASAYDDIGK 301 >gi|255535987|ref|YP_003096358.1| aerotolerance operon BatA [Flavobacteriaceae bacterium 3519-10] gi|255342183|gb|ACU08296.1| BatA (Bacteroides aerotolerance operon) [Flavobacteriaceae bacterium 3519-10] Length = 334 Score = 96.4 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 52/252 (20%), Positives = 88/252 (34%), Gaps = 53/252 (21%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVAT 196 + + P T + ++ G+D+MM +DVSLSM P D+L Sbjct: 63 KYILLSALIIAMARPRTFT--ISENNDDTKGIDIMMSVDVSLSMLARDLEP--DRLTALK 118 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK---INRLIFGSTT 253 +I K D R GLVT+S + P+ + E+ +N L T Sbjct: 119 -------NIAKKFVDKRPGDRIGLVTYSGEAFTKVPVTSDHAVLLEELENLNPLELQPGT 171 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 GL A + + +K K K II +TDG N+ N + AK Sbjct: 172 AIGEGLSVAVSHLRHSKAK-----------SKIIILMTDGVNTIENAMPAQV--GAQLAK 218 Query: 314 RRGAIVYAIGVQAEA-----------------------ADQFLKNCA--SPDRFYSVQNS 348 VY+IG+ + L+ A + +++ ++ Sbjct: 219 SNDIRVYSIGIGTNGYALMPTQTDIFGDLVFTEVEVKIDEPVLREIAQTTGGKYFRATSN 278 Query: 349 RKLHDAFLRIGK 360 + L + + I + Sbjct: 279 QSLEEVYEEINQ 290 >gi|315649824|ref|ZP_07902907.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315274798|gb|EFU38179.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 1316 Score = 96.0 bits (237), Expect = 7e-18, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 87/204 (42%), Gaps = 30/204 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIV 228 D+++++D S SMND +K+ A S + +D++ G+V +SS + Sbjct: 69 DVILIIDRSGSMNDE-----NKMQSAINSAKGFIDLMDLSKHK-----VGIVDYSSANNI 118 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 +FPL+ + ++ +N L T + ++ A + + + + I+ Sbjct: 119 SSFPLSTDKEAVKNYVNGLRANGGTATGDAIKKARELLVNHRPDA----------QPVIV 168 Query: 289 FLTDGENSSPNIDNKE-SLFYCNEAKRRGAIVYAIGV-------QAEAADQFLKNCASPD 340 LTDG+ + PN + +L NEAK+ G + Y I + + LK A+ Sbjct: 169 LLTDGDATEPNGNAYNYALTNSNEAKQEGIVFYTIALLNTNANPDTSGPNLLLKQMATTS 228 Query: 341 RFYS-VQNSRKLHDAFLRIGKEMV 363 + V S L D + I +E+ Sbjct: 229 HHHHFVLGSVGLGDIYAAIVQEIG 252 >gi|295132198|ref|YP_003582874.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980213|gb|ADF50678.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 334 Score = 96.0 bits (237), Expect = 7e-18, Method: Composition-based stats. Identities = 56/238 (23%), Positives = 85/238 (35%), Gaps = 53/238 (22%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 T V + S G+D++M +DVS SM P ++L E + Sbjct: 76 RPRTVDVSTRTNSTQGIDIVMAIDVSASMLARDLQP--NRLEATKAVGEEFI-------K 126 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIF 267 R GLV +S + P+ + + + F S T GL + N++ Sbjct: 127 GRPSDRIGLVLYSGESFTKTPITSDKSVVLRALEDVEFNNILESGTAIGSGLATSVNRLK 186 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D+K + K II LTDG N+S ID K + AK G VY IGV Sbjct: 187 DSKAE-----------SKVIILLTDGVNNSGFIDPKVASEL---AKEFGIKVYTIGVGTN 232 Query: 328 A-----------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + LK A + +++ N+ KL D + I + Sbjct: 233 GMALTPVGIAANGRFQFGNRQVEIDEDLLKQIADETGGKYFRATNNEKLEDIYDEIDQ 290 >gi|228472814|ref|ZP_04057572.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] gi|228275865|gb|EEK14631.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] Length = 332 Score = 96.0 bits (237), Expect = 7e-18, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 82/243 (33%), Gaps = 52/243 (21%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + P + K K+ G+D+++ +D+S SM +++ R + + Sbjct: 70 IIALARPRSSSEITKT--KTTEGIDIILSIDMSSSMLAK-DLKPNRIEALKRVAAQFIQQ 126 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYA 262 R G+V +S + P + + + + G T GL A Sbjct: 127 -------RASDRIGIVVYSGESYTKVPATTDKSIVLQALKEIRQGEIEDGTAIGMGLGTA 179 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 N++ D+K K II +TDG N++ ID L AK G VY I Sbjct: 180 INRLKDSK-----------TKSKVIILMTDGVNNTGVIDP---LSAAELAKEYGIRVYTI 225 Query: 323 GVQAEAAD-----------------------QFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 G+ + L + + +++ ++ KL + Sbjct: 226 GIGTNGKALSPVAYNPDGSFQYDMVPVEIDEKLLAEISKITGGKYFRATDNNKLAQIYTE 285 Query: 358 IGK 360 I K Sbjct: 286 IDK 288 >gi|163801617|ref|ZP_02195515.1| hypothetical protein 1103602000597_AND4_09192 [Vibrio sp. AND4] gi|159174534|gb|EDP59336.1| hypothetical protein AND4_09192 [Vibrio sp. AND4] Length = 367 Score = 96.0 bits (237), Expect = 7e-18, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 84/234 (35%), Gaps = 34/234 (14%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRS 198 + P+++ S +G D+M+V+D+S SM + G + +L A Sbjct: 84 LVVTAMAKPMVLGEPQLRES---LGRDVMVVVDLSGSMAEQDFTSKAGENISRLNAAKEV 140 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKS 255 + + + R GL+ F P + + + + G +T Sbjct: 141 LSDFVKT-------RKGDRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHL 193 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + A E+ + ++ +K I LTDG ++ ++ + AK + Sbjct: 194 GDAMGLAIKVF----EQSKSRIGVEENKEKVAIVLTDGNDTGSFVEP---IEAAKVAKAK 246 Query: 316 GAIVYAIGVQ-------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 G ++ I + A + ++ A S + + N +L A+ IG+ Sbjct: 247 GVRIHVIAMGDPQTLGEAALDMKTIRRIAKESGGKAFEAMNRDELAKAYDDIGR 300 >gi|85374104|ref|YP_458166.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] gi|84787187|gb|ABC63369.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] Length = 623 Score = 96.0 bits (237), Expect = 7e-18, Method: Composition-based stats. Identities = 42/187 (22%), Positives = 67/187 (35%), Gaps = 29/187 (15%) Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 +++G FS L + ++ L T G+ + + Sbjct: 436 YDTKKEFIQTGNWWFSGCPAPAQKLKAMTSGELDSYLDSLTPHGATYHDGGMIWGGRLLS 495 Query: 268 DAKE-KLEHIAKGHDDYKKYIIFLTDGEN-------SSPNIDN----------------- 302 E+ +K +++IFLTDG+ S ID Sbjct: 496 QYGLFAAENSSKPGRTTSRHLIFLTDGQTEPYDLAYGSYGIDPIDERRWTQTSSLTLAQT 555 Query: 303 --KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 + LF CNE K+ GA V+ + A D+ K CA R++ N+ +L+DAF I K Sbjct: 556 VEERFLFACNEVKKLGATVWVVAFGTAANDKM-KTCAGSGRYFEAANASQLNDAFSTIAK 614 Query: 361 EMVKQRI 367 RI Sbjct: 615 STGDLRI 621 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 44/272 (16%), Positives = 100/272 (36%), Gaps = 27/272 (9%) Query: 6 IRNFFYNCKGS-ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI- 63 +R + G+ ++++ A LLP + + G ++ S + +++L D +L + Sbjct: 1 MRRLASDRSGNTLALIAAGLLP-LLAMAGSGVDMSRAYLAESRLQQACDSGVLAARKALG 59 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 + + + QD N ++ + ++++++ Sbjct: 60 TEIATLTDIPTDAGTRGQEFFNSNF--------------QDGNYGTQNRTFNMVLEN--- 102 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN- 182 DY++S + ++P T + P+ + +IS +D+MMVLDV+ SM Sbjct: 103 DYSVSGTATVDVPTSVMT---VFGFTKIPVKVECQARISF---SDVDVMMVLDVTGSMKH 156 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 + G + K+ ++R D ++ +R G V ++S + L Sbjct: 157 TNSGDTLSKIDSLKATVRNFYDQMEGAKSAGTRIRYGFVPYASNVNVGHLLKDEWVVNSW 216 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 TT G + N + + + Sbjct: 217 AYQSRAISGTTTVEAGTKTRENWAYKSGSRSA 248 >gi|150024244|ref|YP_001295070.1| BatA protein [Flavobacterium psychrophilum JIP02/86] gi|149770785|emb|CAL42250.1| BatA protein [Flavobacterium psychrophilum JIP02/86] Length = 333 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 51/255 (20%), Positives = 87/255 (34%), Gaps = 55/255 (21%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGV 194 F A T V + G+D++M +D+S SM D M+ L Sbjct: 59 FAMRLLALSALIVAMARPRTVDVSNKRNTTNGIDIVMAIDLSSSMLAKDFKPNRMEALKE 118 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFG 250 S + R G+V ++++ P+ + + IN + + Sbjct: 119 VAASFV----------EARQSDRIGVVVYTAEAYTKTPVTSDKAVVLDAINTIKYDNVLQ 168 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T GL A N++ D+K K K II +TDG N++ I+ + Sbjct: 169 DGTGIGMGLATAVNRLKDSKAK-----------SKVIILMTDGVNNAGFIEP---VTAAE 214 Query: 311 EAKRRGAIVYAIGVQAEAAD-----------------------QFLKNCA--SPDRFYSV 345 AK G VY IG+ Q +K+ A + +++ Sbjct: 215 FAKEFGIKVYTIGIGTNGNAPFPYAIAPNGGFLYKMLPVEIDEQLMKDIAKKTGGKYFRA 274 Query: 346 QNSRKLHDAFLRIGK 360 Q++ L + I K Sbjct: 275 QSNSSLESIYSEINK 289 >gi|152985991|ref|YP_001347440.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] gi|150961149|gb|ABR83174.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] Length = 337 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 35/248 (14%), Positives = 84/248 (33%), Gaps = 56/248 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + + + G D+++ +DVS SM+ + +L + + + ++ Sbjct: 75 RPQWVGEPLPLPA---SGRDLLLAVDVSGSMDYRDMRWQDDEISRLELVKKLFGDFIE-- 129 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAY 263 R GL+ F S+ PL + ++ ++ G T + A Sbjct: 130 -----GRRGDRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAL 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ I + + E + +Y IG Sbjct: 185 KRLRQRPAE-----------SRVLVLITDGANTGGQISPQTAARLAAEER---VKIYTIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKL------HDAFLRI 358 + A+ L+ A + ++ ++S +L D + Sbjct: 231 IGADPQQGGVIGLFGLNPGLDLDEPVLRGIAETTGGEYFRARSSAELESISATLDRLEPV 290 Query: 359 GKEMVKQR 366 ++ + R Sbjct: 291 AQQTTRAR 298 >gi|34558787|gb|AAQ75132.1| BatA protein [Alvinella pompejana epibiont 6C6] Length = 300 Score = 96.0 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 92/240 (38%), Gaps = 26/240 (10%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDH 184 + +Y + + + L + S +K + G D+++ +DVS SM Sbjct: 37 EWLPKYSIWWDNSILWIVTIYTLLVLALASPFTYEAKELSTKKGRDLILTIDVSGSMAQK 96 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 G K +I K G+V F S PL + ++ + E Sbjct: 97 ---GFSKEESEKSRYEVAKEIAKRFIKNRFSDNIGIVIFGSFSFSASPLTYDLKALLEMF 153 Query: 245 NRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 + + I G+ T + A + + + K II LTDG+++ Sbjct: 154 DLMSDVGIAGNNTAIGDAIFEAIKNL-----------ESGEAKSKVIILLTDGKHNFGKK 202 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 KE + EAK+RG +Y +G+ + + L+ A + + + +NS++L + F I Sbjct: 203 SPKEGVV---EAKKRGIKIYTVGIGTDYDKKLLEKMAKETNAKSFFAKNSKELEEVFKEI 259 >gi|223936327|ref|ZP_03628239.1| von Willebrand factor type A [bacterium Ellin514] gi|223894845|gb|EEF61294.1| von Willebrand factor type A [bacterium Ellin514] Length = 338 Score = 96.0 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 47/195 (24%), Positives = 78/195 (40%), Gaps = 22/195 (11%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + P + S K+S+ G+D+++ LD+S SM + ATR I Sbjct: 64 LALFIFALAQPRFVQSETKVSA---SGVDIVVALDMSGSMLAEDEGFVLNGQQATRFIIA 120 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTP 257 D++K D R GLV F ++ P + + + + RL GS T Sbjct: 121 R-DVLKKFVDKRQSDRIGLVVFGTQAYVAVPPTLDHEFLLKNLERLGIGSINGNQTAIGS 179 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 L + N++ + K K K II +TDG+N++ + L A+ G Sbjct: 180 ALSTSMNRLRELKSK-----------SKIIILMTDGQNNAGKVPP---LTAAEAARALGI 225 Query: 318 IVYAIGVQAEAADQF 332 +Y IGV + + Sbjct: 226 KIYTIGVGTKGVARM 240 >gi|91773457|ref|YP_566149.1| von Willebrand factor, type A [Methanococcoides burtonii DSM 6242] gi|91712472|gb|ABE52399.1| hypothetical protein with von Willebrand factor type A domain and Invasin domain [Methanococcoides burtonii DSM 6242] Length = 892 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 62/347 (17%), Positives = 130/347 (37%), Gaps = 35/347 (10%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 +I ++ L+++ T L+ N +N D R+I ++F Sbjct: 475 LIALDAWGHSLQNINVTLNNTAPSLGTLSLDGSNESNLINFTTDQYGRVITEFTSSNFVG 534 Query: 95 ELRENGFA---QDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIF---CTFPWCANS 148 G + D +IE I + Y +++ + + P ++ Sbjct: 535 NCTIIGLSDAINDSLSIEIRNQPFISASIDAEPYVVTSGDIVNITTVITVEGELPVSRSA 594 Query: 149 SHAPLLITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + + L++ S + + LD+++VLD S SM L A + + ++ + Sbjct: 595 ATSMLILDRSGSMDPDYYAGTALDIVLVLDRSGSMKFLGNAPEQPLTDAKSAAKIFMENL 654 Query: 207 KSIPDVNNVVRSGLVTFSSK-IVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEY 261 S +V G+V+FSS V P++ + + I+ ++ T + Sbjct: 655 LSNTEV------GVVSFSSTSTVDRQPVSLNISGNKDLLHNAIDSMVADGGTAIGDAMAD 708 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A N + + + KK +I LTDG ++ + +++ + A +Y+ Sbjct: 709 ANNLLINGRPDA----------KKIMIVLTDGVATAGS--DRDGSDAISTANLNNIRIYS 756 Query: 322 IGVQAEA--ADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVK 364 IG+ + + LK AS +Y+ + +L + I KE+ Sbjct: 757 IGLGSSEYIDEPMLKRIASETGGSYYNAPSGSELQTVYNTISKEISD 803 >gi|269968855|ref|ZP_06182838.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] gi|269826535|gb|EEZ80886.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] Length = 356 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 87/240 (36%), Gaps = 31/240 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + ++ + P ++ S +G D+M+V+D+S SM + G + +L Sbjct: 78 LILSWLLVVCAMAKPTVLGEPQVRES---LGRDVMVVVDLSGSMAEPDFTSRTGEKISRL 134 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 A + E + R GLV F P + + + + Sbjct: 135 DAAKEVLSEFVQS-------RKGDRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMA 187 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + A K+F+ +K + + +K I LTDG ++ ++ ++ Sbjct: 188 GQSTHLGDAIGLAI-KVFEQSDKSSGALEQDQNREKVAIVLTDGNDTGSFVEPIDA---A 243 Query: 310 NEAKRRGAIVYAIGVQAEA-------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 AK +G V+ I + + + A S + N +L A+ IGK Sbjct: 244 KVAKAKGVRVHVIAMGDPETIGETALDMETIHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|148256121|ref|YP_001240706.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] gi|146408294|gb|ABQ36800.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] Length = 602 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 78/204 (38%), Gaps = 26/204 (12%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 + + + ++ + + + N V+ + S+K+ Q PL++ ++ +N + Sbjct: 397 NDANAVSPASSDVATLFPANQHMENNVQYCSSSASTKLGQIVPLSYNWTSLKSAVNAMEP 456 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS------------- 296 T G+ +A + + Y + II L+DG N+ Sbjct: 457 TGGTNQAIGMAWAVQSLIPNGVLGAPAEDANTTYNRVIILLSDGLNTEDRWPDYGNGSTQ 516 Query: 297 -SPNIDNKESLFYCNEAKRRG-------AIVYAIGVQ----AEAADQFLKNCA-SPDRFY 343 S N + C+ K +Y I V A+ L+NCA SPD+FY Sbjct: 517 ASGNPIDARQALLCSNLKNTKDSKGNAMYTIYTIQVNTSSPADPTSTVLQNCASSPDKFY 576 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRI 367 + +S ++ F IG + K R+ Sbjct: 577 MLTSSSQIVTTFNSIGTALSKLRV 600 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 71/224 (31%), Gaps = 29/224 (12%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G+I+ L AI L I +G I+ S ++ + LD + L + + +Q Sbjct: 26 RRFSGDISGNIATLFAIALLPILAFIGAAIDYSRANAARSAMQGALDSTALMLSRDL-SQ 84 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + D ++ + S + + Sbjct: 85 GTITAADVAAKA--------------STYFKALYTSTDAQSVAVTASYTASTSSSASNIQ 130 Query: 127 LSAVSRYEMPF--IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 L+A + F + N+ D+ + + + LD + SM Sbjct: 131 LNASGQIVTQFMKLVGFPTMTFNTKATTTW----------GDVKMRVALALDNTGSMA-- 178 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + M L A ++D + ++ V L+ F+ + Sbjct: 179 YSGKMTALQNAVAGSGGLIDQLSALAKSPGDVYISLIPFAKVVN 222 >gi|146298482|ref|YP_001193073.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152900|gb|ABQ03754.1| BatA-like protein [Flavobacterium johnsoniae UW101] Length = 334 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 53/280 (18%), Positives = 102/280 (36%), Gaps = 51/280 (18%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 +S +L + K+ ++F + T + +K+ G+ Sbjct: 33 QSATLKMSSTAGFKNSQSLLTKLKPCLYVFRIIALSSLIIALARPRTVDISNQTKTTKGI 92 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++M +DVS SM +++ R + ++ R GLV ++S+ Sbjct: 93 DIVMAIDVSGSMLAK-DLKPNRMEALKRVAADFVEE-------RPNDRIGLVLYASEAYT 144 Query: 230 TFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ I E I + + T GL A N++ D+K K + Sbjct: 145 KTPVTSDKPIILEAIKGIRYDTVLQDGTGIGMGLATAVNRLKDSKAK-----------SR 193 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA----------------- 328 II LTDG N++ I+ + + + AK+ G VY IG+ Sbjct: 194 VIILLTDGVNNAGFIEPETA---ADIAKQYGIKVYTIGLGTNGMAESPYAYAPNGGFLFK 250 Query: 329 ------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 ++ +K+ A + ++ ++ KL + + I K Sbjct: 251 MQKVEIDERLMKSIAKKTDGTYFRATSNDKLAEIYNSINK 290 >gi|327481077|gb|AEA84387.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri DSM 4166] Length = 339 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 42/255 (16%), Positives = 95/255 (37%), Gaps = 55/255 (21%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGP 187 R ++P++ + P + + + + G D+++ +DVS SM+ G Sbjct: 57 RQQLPYLTIWLLLLFAA-ARPQWLGEPLPLPT---SGRDLLLAVDVSGSMDYPDMQWQGE 112 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 + +L + + + ++ + R GL+ F SK PL + + ++ ++ Sbjct: 113 ELTRLELVKVLLGDFIEQ-------RHGDRVGLILFGSKAYLQSPLTFDRRTVRVWLDEA 165 Query: 248 ---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 I GS T + A ++ +E+ + + ++ +TDG N+ I+ Sbjct: 166 SVGIAGSNTAIGDAIGLALKRL---RERPAN--------SRVLVLVTDGANNGGEIEP-- 212 Query: 305 SLFYCNEAKRRGAIVYAIGVQA-----------------EAADQFLKNCA--SPDRFYSV 345 L A ++ IG+ A + + L+ A + ++ Sbjct: 213 -LLAATLAAEENVRIHTIGIGAVPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRA 271 Query: 346 QNSRKLHDAFLRIGK 360 +S +L IG+ Sbjct: 272 ASSEQL----QAIGE 282 >gi|302188504|ref|ZP_07265177.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. syringae 642] Length = 352 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 90/244 (36%), Gaps = 42/244 (17%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRIWLDEARIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +Y IG+ ++ Sbjct: 178 DAIGLALKRLRLRPATSRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGSD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L DA + ++ Sbjct: 235 PDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDALEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|254229828|ref|ZP_04923234.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262395606|ref|YP_003287459.1| protein BatA [Vibrio sp. Ex25] gi|151937664|gb|EDN56516.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262339200|gb|ACY52994.1| protein BatA [Vibrio sp. Ex25] Length = 356 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 86/240 (35%), Gaps = 31/240 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + ++ + P ++ S +G D+M+V+D+S SM + G + +L Sbjct: 78 LILSWLLVVCAMAKPTVLGEPQVRES---LGRDVMVVVDLSGSMAEPDFTSRTGEKISRL 134 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 A + E + R GLV F P + + + + Sbjct: 135 DAAKEVLTEFVQS-------RKGDRLGLVLFGDAAFVQTPFTVDQKVWLELLNQTDVAMA 187 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + A K+F+ +K + + +K I LTDG ++ ++ ++ Sbjct: 188 GQSTHLGDAIGLAI-KVFEQSDKSRGALEQDQNREKVAIVLTDGNDTGSFVEPIDA---A 243 Query: 310 NEAKRRGAIVYAIGVQAEA-------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 AK +G V+ I + + A S + N +L A+ IGK Sbjct: 244 KVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|306821351|ref|ZP_07454960.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304550638|gb|EFM38620.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 467 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 78/206 (37%), Gaps = 26/206 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 ++ G+++ V+D S SM + T + L Sbjct: 17 IEDEKDKYDGINIAFVIDSSGSM--FYNDPNGLRREVTHKFIDRLTDNDMA--------- 65 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 ++ F K + + + ++++ T + AY+ + + Sbjct: 66 AVIGFDYKATVLEQFTSNKEKLHDAVDKIRSDGGTNIGRAVSIAYDLFNNLDNNRK---- 121 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 + Y K++I LTDG+ + + ++ AK+ G +Y IG+ +++ LK+ A Sbjct: 122 --EKYPKFLILLTDGD---GDYSEEYTIL----AKKAGIKIYTIGLGNGVSEKLLKDIAK 172 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ +++ KL+ F +I + Sbjct: 173 GTDGEYFHAKDASKLNKIFEKIADKT 198 >gi|294055226|ref|YP_003548884.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614559|gb|ADE54714.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 330 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 82/259 (31%), Gaps = 37/259 (14%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMM 173 + + + S R + + WC + P + V++ D+M+ Sbjct: 42 VALSGETPRTGASVRQRLTVQALGSLLGWCLLVGALARPEWVGEPVQL---EKTARDLML 98 Query: 174 VLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+S SM+ D G +D+L A + E + R GL+ F + Sbjct: 99 AVDLSGSMDAADFVDASGEQIDRLSAAKGVLNEFV-------AGREGDRLGLIVFGNAAY 151 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P + ++ I + + D+ + + +I Sbjct: 152 LQAPFTDDHETWLALLDESIVN--------MAGPSTALGDSIGLAIAHFRQSKTENRVLI 203 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SP 339 LTDG ++ + L AK G +Y + V + L+ A + Sbjct: 204 VLTDGNDTGSRVPP---LDAAEVAKVEGVTIYTVAVGDPTTVGEEALDMETLETVARLTG 260 Query: 340 DRFYSVQNSRKLHDAFLRI 358 + + L + + RI Sbjct: 261 GDSFVASDLVALRETYQRI 279 >gi|260425757|ref|ZP_05779737.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260423697|gb|EEX16947.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 479 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 75/487 (15%), Positives = 153/487 (31%), Gaps = 163/487 (33%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + GS++ + +L ++ I GL I+ + + K+ LD ++L A Sbjct: 28 RRFAGDESGSMTYMAVVLSMMMMIFGGLGIDMIYAELQRTKVQNTLDRAVLAAA------ 81 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID-DQHKDY 125 D NEL G +D + I +D D+ +Y Sbjct: 82 ------------------------DLDNELEAQGVVEDYMDKMALADALISVDVDEGLNY 117 Query: 126 NLSAVSRYE-MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 Y+ MP F N L +++ +++ +VLD+S SM+D+ Sbjct: 118 RTVVAEGYKTMPSNFMQILGVDNLQAYGLA------EATERINKVEVSLVLDISGSMDDN 171 Query: 185 ---------FGPGMDKL------GVATRSIREMLDIIKSIPDV----------------- 212 G +D L + + S+ + + + P++ Sbjct: 172 DKLANMQDAAGTFIDTLLAEGNEDLVSISLVPYSEQVNAGPEILSYLSANWKHGYSHCIE 231 Query: 213 --NNVVRSGLVTFSSKIVQTFPLAWG---------------------------VQHIQEK 243 N+V S + FS Q W ++ + Sbjct: 232 MPNSVFGSAALDFSRTYEQMQHYQWNYDGYNNTLSDTVCPRYGYERIQAWSHDASALKAQ 291 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY----------------I 287 +N+L + T G+++ + + + + + Sbjct: 292 VNQLQPRAGTSIFMGMKWGTALLDPSTRPIASGMIARGSVDQVFEGRPVAYDDTDVLKTV 351 Query: 288 IFLTDGENSS-------------------------------------------PNIDNKE 304 + +TDG++ N D + Sbjct: 352 VLMTDGQHDRSYRIQDWAYNSESEYAHWNRYNLWYYLSRYVSSYERSSFYYQKYNADLGD 411 Query: 305 SLF--YCNEAKRRGAIVYAIGVQA-EAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGK 360 +L C AK +G I++++G + + +++CA SP F+ V+ ++ +AF I Sbjct: 412 ALLGSICAAAKAQGIIIWSVGFEVGDHGADVMESCASSPAHFFRVEGV-EITEAFSTIAH 470 Query: 361 EMVKQRI 367 + + R+ Sbjct: 471 TLNQLRL 477 >gi|331012285|gb|EGH92341.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 352 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 90/244 (36%), Gaps = 42/244 (17%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +Y IG+ A+ Sbjct: 178 DAIGLGLKRLRLRPATSRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGAD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L D+ + ++ Sbjct: 235 PDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDSLEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|330989218|gb|EGH87321.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. lachrymans str. M301315] Length = 352 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 90/244 (36%), Gaps = 42/244 (17%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +Y IG+ A+ Sbjct: 178 DAIGLGLKRLRLRPATSRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGAD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L D+ + ++ Sbjct: 235 PDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDSLEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|311746225|ref|ZP_07720010.1| BatA protein [Algoriphagus sp. PR1] gi|126576455|gb|EAZ80733.1| BatA protein [Algoriphagus sp. PR1] Length = 347 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 50/248 (20%), Positives = 89/248 (35%), Gaps = 52/248 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVA 195 F F + P V+ ++ G+D+M+V+D+S SM+ F P ++L A Sbjct: 78 FFFLALIMVIIALARPQKSNERVEQFTE---GIDIMLVMDISESMDLQDFKP--NRLEAA 132 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----S 251 + + ++ R G+V F+ + PL + + + I + F Sbjct: 133 KATAIDFIN-------GRFGDRIGMVVFAGEAYSLAPLTNDYKLLTDLIQDISFNMMEAK 185 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + A N++ K + K +I L+DGE+++ N+D LF Sbjct: 186 GTAIGSAIASATNRM-----------KESESASKVLILLSDGESNAGNVDP---LFAAQL 231 Query: 312 AKRRGAIVYAIGV----QAEAADQF---------------LKNCASPDR--FYSVQNSRK 350 A +Y I V F L+ A F+ + Sbjct: 232 ASALDIKIYTIAVGKDGMVPYGTDFFGRPQMVESYLDETNLREIAKIGNGEFFRASDGGT 291 Query: 351 LHDAFLRI 358 L++ F RI Sbjct: 292 LNNIFDRI 299 >gi|71737462|ref|YP_275714.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|71558015|gb|AAZ37226.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|320329710|gb|EFW85699.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] gi|330882170|gb|EGH16319.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] Length = 352 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 90/244 (36%), Gaps = 42/244 (17%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +Y IG+ A+ Sbjct: 178 DAIGLGLKRLRLRPATSRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGAD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L D+ + ++ Sbjct: 235 PDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDSLEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|313681552|ref|YP_004059290.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154412|gb|ADR33090.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 311 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 51/240 (21%), Positives = 92/240 (38%), Gaps = 38/240 (15%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKI------SSKSDIGLDMMMVLDVSLSMN----DHF 185 P + W + L++ + ++ G+D+++ LD S SMN Sbjct: 51 PGKWRNLEWLFKALAVTLMVGALATPVVVDYSDPRNRNGIDIVLSLDGSGSMNASGFSKE 110 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 P + + V + + + +K I D G+V F P+ + + + E I Sbjct: 111 EPRLSRFEVVQKIASDFV--MKRIEDN-----VGVVLFGDFAFIATPVTYEKEIVSEMIG 163 Query: 246 RL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 L + G T G+ + D+K K K II LTDGE++S +I Sbjct: 164 YLSHGMAGQNTAIGEGIAMGVRALRDSKAK-----------SKVIILLTDGEHNSGSISP 212 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNCASPDR--FYSVQNSRKLHDAFLRI 358 KE++ + +Y IG+ + + LK A F++ N ++L + I Sbjct: 213 KEAVAMVG---KEHIRLYTIGIGQKGEFDNALLKQLAHDGHGKFFAAANEKELQSVYDEI 269 >gi|260901770|ref|ZP_05910165.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|308108909|gb|EFO46449.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|328470487|gb|EGF41398.1| protein BatA [Vibrio parahaemolyticus 10329] Length = 356 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 86/240 (35%), Gaps = 31/240 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + ++ + P ++ S +G D+M+V+D+S SM + G + +L Sbjct: 78 LILSWLLVVCAMAKPTVLGEPQVRES---LGRDVMVVVDLSGSMAEPDFTSRTGEKISRL 134 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 A + E + R GLV F P + + + + Sbjct: 135 DAAKEVLTEFVQS-------RKGDRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMA 187 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + A K+F+ +K + + +K I LTDG ++ ++ ++ Sbjct: 188 GQSTHLGDAIGLAI-KVFEQSDKSRGALEQDQNREKVAIVLTDGNDTGSFVEPIDA---A 243 Query: 310 NEAKRRGAIVYAIGVQAEA-------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 AK +G V+ I + + A S + N +L A+ IGK Sbjct: 244 KVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|163759224|ref|ZP_02166310.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] gi|162283628|gb|EDQ33913.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] Length = 541 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 68/182 (37%), Gaps = 36/182 (19%) Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F ++ PL I+ + L +T G+ + + + D + + K Sbjct: 358 GFGCEMEPLVPLTTDFSKIRTTVKALEANGSTNMLEGVMWGWRVLSDREPFAQGAPKSDA 417 Query: 282 DYKKYIIFLTDGENSSPNIDN---------------------------------KESLFY 308 +K +IFLTDG+NS N++N K++ Sbjct: 418 SVEKIMIFLTDGQNSFGNLNNDLGSAYTSMGYLVDGRLDGMTAANIGQTNNALDKKTKAA 477 Query: 309 CNEAKRRGAIVYAIGVQAE--AADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 C AK G +Y I ++ + L+ CA S ++ + ++L F I K +VK Sbjct: 478 CENAKEDGVTIYTIRLEEADVGTGKMLEECATSSAHYFDAPSRQQLTPIFDAIKKGVVKL 537 Query: 366 RI 367 R+ Sbjct: 538 RL 539 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/224 (12%), Positives = 78/224 (34%), Gaps = 41/224 (18%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 + +F + G+ +++ IL + + GL ++ K+KL +D + L A Sbjct: 3 HTSSFLKDTSGNFALVFGILAVPVMVAGGLAVDYVGLSVEKSKLQNAVDSAALLIARA-- 60 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 G+ + Q + K T++ + + + + Sbjct: 61 ----GDMSETQAMKLA----KTTITTNYGINVAKVAVSMVDGD----------------- 95 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + + + +F F N+ +++ + ++ +VLD + SM Sbjct: 96 --ATVKASMDQALVFGGFMGRKNA-----AVSAEATATYAY-TKYEIALVLDTTGSMLG- 146 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 KL ++ ++D ++++ ++ +V ++ + Sbjct: 147 -----GKLTSLQNAVIGLVDGMEALGLNKEQLKFAVVPYAGFVN 185 >gi|192360615|ref|YP_001982630.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686780|gb|ACE84458.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 318 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 43/229 (18%), Positives = 89/229 (38%), Gaps = 32/229 (13%) Query: 137 FIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMD 190 + W A+ ++ P + + + + G D+++ +D+S SM + + + Sbjct: 58 ILALWIIWLASVLAAANPQWVGEATSMP---NSGRDLLLAVDISGSMREPDMVYNNRRIT 114 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 +L + + + + R GLV F ++ PL + V+ +QE + G Sbjct: 115 RLMAVKKVVGDFV-------ARRQSDRLGLVLFGTQAFLQAPLTFDVKTVQEMLIEAESG 167 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 ++T I DA + + K+ II LTDGEN++ + + + Sbjct: 168 YAGEAT--------AIGDAIALSIKRLREQPNAKRVIILLTDGENTAGELGIATAT---D 216 Query: 311 EAKRRGAIVYAIGVQA---EAADQFLKNCA--SPDRFYSVQNSRKLHDA 354 A + +Y I E ++ A + F+ +N+R L + Sbjct: 217 LAVKANTKIYTIAFSPYDREVDSHSMQQIAEQTGGEFFRARNTRDLEEI 265 >gi|153836342|ref|ZP_01989009.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|149750244|gb|EDM60989.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] Length = 356 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 87/240 (36%), Gaps = 31/240 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + ++ + + P ++ S +G D+M+V+D+S SM + G + +L Sbjct: 78 LILSWLFVVCAMAKPTVLGEPQVRES---LGRDVMVVVDLSGSMAEPDFTSRTGEKISRL 134 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 A + E + R GLV F P + + + + Sbjct: 135 DAAKEVLTEFVQS-------RKGDRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMA 187 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + A K+F+ +K + + +K I LTDG ++ ++ ++ Sbjct: 188 GQSTHLGDAIGLAI-KVFEQSDKSRGALEQDQNREKVAIVLTDGNDTGSFVEPIDA---A 243 Query: 310 NEAKRRGAIVYAIGVQAEA-------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 AK +G V+ I + + A S + N +L A+ IGK Sbjct: 244 KVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|116051069|ref|YP_790101.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa UCBPP-PA14] gi|296388430|ref|ZP_06877905.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa PAb1] gi|313108364|ref|ZP_07794396.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] gi|115586290|gb|ABJ12305.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa UCBPP-PA14] gi|310880898|gb|EFQ39492.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] Length = 340 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 36/248 (14%), Positives = 83/248 (33%), Gaps = 56/248 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + + + G D+++ +DVS SM+ + +L + + + ++ Sbjct: 75 RPQWVGEPLPLPA---SGRDLLLAVDVSGSMDYRDMRWQDDEISRLELIKKLFGDFIED- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAY 263 R GL+ F S+ PL + ++ ++ G T + A Sbjct: 131 ------RRGDRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ I + A + +Y IG Sbjct: 185 KRLRQRPAE-----------SRVLVLITDGANTGGQIAPQ---IAAQLAAEQQVKIYTIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKL------HDAFLRI 358 V A+ L+ A + ++ ++S +L D + Sbjct: 231 VGADPQQGGVPGLFGFNPGLDLDEPTLRGIAEITGGEYFRARSSAELESISATLDRLEPV 290 Query: 359 GKEMVKQR 366 ++ + R Sbjct: 291 AQQTTRAR 298 >gi|325279872|ref|YP_004252414.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311681|gb|ADY32234.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 330 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 51/251 (20%), Positives = 82/251 (32%), Gaps = 53/251 (21%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 + P S +S+ G+D+ + LDVS SM F P D+L A Sbjct: 62 LELLAIILLVTALARPQSSNSWQTYTSE---GIDIALALDVSTSMLARDFTP--DRLEAA 116 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---ST 252 + + + GLV F+ + PL + + + G Sbjct: 117 KEVATKFI-------LERPQDKIGLVVFAGESFTQCPLTTDQAVLVNLLREVKSGMIQDG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GL A N++ D+ K K +I LTDG N+ I + A Sbjct: 170 TAIGLGLANAVNRLKDSPGK-----------SKVVILLTDGINNQGAIAP---VTAAELA 215 Query: 313 KRRGAIVYAIGVQAEAADQF---------------------LKNCA--SPDRFYSVQNSR 349 K G VY IGV + L+ A + +++ ++ Sbjct: 216 KAFGIRVYTIGVGTYGEAPYPVPTPFGVQLQNMPVEIDEGVLQQIANVTGGKYFRATDND 275 Query: 350 KLHDAFLRIGK 360 KL + I + Sbjct: 276 KLQQIYSEIDQ 286 >gi|294508603|ref|YP_003572662.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] gi|294344932|emb|CBH25710.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] Length = 317 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 48/217 (22%), Positives = 77/217 (35%), Gaps = 49/217 (22%) Query: 168 GLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+MMVLD S SM + F P + A + ++ R GL+ F+++ Sbjct: 78 GIDIMMVLDASTSMQAEDFQP--TRFEAAREAAGAFVE-------GRVSDRVGLIVFAAE 128 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL +Q + + G T L A N++ K + Sbjct: 129 AYTQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRL-----------KDSEAE 177 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------------ 325 K I LTDG N+ ID + + G VYAIGV Sbjct: 178 SKVAILLTDGRNNRGQIDPRTAAEVAQTM---GVRVYAIGVGSSEDRDTWEEPLPQGQRD 234 Query: 326 --AEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRI 358 A + L++ ++ +++S N L + I Sbjct: 235 ESAGVDAEMLRSVSTSTGGQYFSATNRDALERIYAEI 271 >gi|28900543|ref|NP_800198.1| hypothetical protein VPA0688 [Vibrio parahaemolyticus RIMD 2210633] gi|260365425|ref|ZP_05777962.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] gi|260877490|ref|ZP_05889845.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|260894838|ref|ZP_05903334.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|28808923|dbj|BAC62031.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308085296|gb|EFO34991.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|308090935|gb|EFO40630.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|308114289|gb|EFO51829.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] Length = 356 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 86/240 (35%), Gaps = 31/240 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + ++ + P ++ S +G D+M+V+D+S SM + G + +L Sbjct: 78 LILSWLLVVCAMAKPTVLGEPQVRES---LGRDVMVVVDLSGSMAEPDFTSRTGEKISRL 134 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 A + E + R GLV F P + + + + Sbjct: 135 DAAKEVLTEFVQS-------RKGDRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMA 187 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + A K+F+ +K + + +K I LTDG ++ ++ ++ Sbjct: 188 GQSTHLGDAIGLAI-KVFEQSDKSRGALEQDQNREKVAIVLTDGNDTGSFVEPIDA---A 243 Query: 310 NEAKRRGAIVYAIGVQAEA-------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 AK +G V+ I + + A S + N +L A+ IGK Sbjct: 244 KVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|86145196|ref|ZP_01063527.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] gi|85836773|gb|EAQ54893.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] Length = 359 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 87/243 (35%), Gaps = 33/243 (13%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + T+ + P ++ + +G D+M+V+D+S SM + G + +L Sbjct: 73 LIITWVLVICALAKPTILGEP---QVREQLGRDVMVVVDLSGSMAEQDFTSKQGDKISRL 129 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 +++ R GL+ F P + + + + Sbjct: 130 DATK-------EVLADFAKTRKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMA 182 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY---KKYIIFLTDGENSSPNIDNKESL 306 G +T + A ++++ + D +K +I LTDG ++ ++ ++ Sbjct: 183 GQSTHLGDAIGLATKVFEQSEKQSAAVQDSSIDANVKEKVVIVLTDGNDTGSFVEPIDA- 241 Query: 307 FYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 AK +G ++ I + + +K A S + N +L A+ + Sbjct: 242 --AKVAKAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQ 299 Query: 358 IGK 360 IG+ Sbjct: 300 IGE 302 >gi|320323259|gb|EFW79347.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. B076] Length = 352 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 90/244 (36%), Gaps = 42/244 (17%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRFWLDEAKIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +Y IG+ A+ Sbjct: 178 DAIGLGLKRLRLRPATSRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGAD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L D+ + ++ Sbjct: 235 PDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDSLEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|325286051|ref|YP_004261841.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321505|gb|ADY28970.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 332 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 58/280 (20%), Positives = 102/280 (36%), Gaps = 51/280 (18%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 E + +L I D ++ + + + F+ A + T V +K+ G Sbjct: 32 EETAALKISSTQGFTDTSILSKLKPIL-FVLRLLAMAAIITALARPQTKDVSTRTKTTKG 90 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++M +DVS SM D+L + D IK P+ R GLV ++++ Sbjct: 91 IDIVMAIDVSSSML-ARDLKPDRLTALKKVAA---DFIKKRPN----DRIGLVAYAAESY 142 Query: 229 QTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ + + ++ G T GL A N++ D+K K K Sbjct: 143 TKTPITSDKSIVLSSLRQITHGQLEDGTAIGMGLATAVNRLKDSKSK-----------SK 191 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--------------- 330 II LTDG N+S I+ K + + A Y IG+ Sbjct: 192 VIILLTDGVNNSGFIEPKTA---ADLAVEYKIKTYTIGLGTNGNALTPIAFNPDRTYRYG 248 Query: 331 --------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L+ A + +++ ++ KL + I K Sbjct: 249 MRQVEIDEKLLEEIATVTGGQYFRATDNEKLSAIYNEINK 288 >gi|167032571|ref|YP_001667802.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166859059|gb|ABY97466.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 358 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 51/245 (20%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDK 191 PF+ ++ P + V +++ G D+++ +DVS SM+ + + Sbjct: 61 PFVVIWL-LLLCAAARPQWLGEPVPVAA---SGRDLLVAVDVSGSMDFPDMQWKNEDISR 116 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---I 248 L + + + L R GL+ F S+ PL + + ++ ++ I Sbjct: 117 LDLVKALMGDFLQD-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAKIGI 169 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + A ++ + + ++ +TDG N+ I L Sbjct: 170 AGKNTAIGDAIGLAVKRLRQRPAQ-----------SRVLVLITDGANNGGQIHP---LTA 215 Query: 309 CNEAKRRGAIVYAIGVQAEAAD-----------------QFLKNCA--SPDRFYSVQNSR 349 A + G +Y IG+ A LK A + ++ + Sbjct: 216 ARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAHDGA 275 Query: 350 KLHDA 354 +L Sbjct: 276 ELDAI 280 >gi|288941617|ref|YP_003443857.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288896989|gb|ADC62825.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 341 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 85/252 (33%), Gaps = 47/252 (18%) Query: 133 YEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP--- 187 Y + W ++ P + + V + G D+M+ +DVS SM Sbjct: 56 YRWRVLIGLLAWGLLVLAAARPQWVGAPVPLPL---AGRDLMLAIDVSGSMAQEDYELDG 112 Query: 188 -GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + +L V ++ R GL+ F ++ PL + + + Sbjct: 113 RPVSRLAVVRTVASAFVE-------RRAGDRLGLILFGTRAYLQTPLTFDGATVAAMLR- 164 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 S GL I DA + + ++ +I LTDG+N++ +D E+ Sbjct: 165 -------DSVVGLAGRETAIGDAIGLAVKRLREQPEGQRVLILLTDGDNTAGALDPLEAA 217 Query: 307 FYCNEAKRRGAIVYAIGVQAEA-------ADQFLKNCA-------------SPDRFYSVQ 346 A + G VY IG+ + L+ + + R ++ Sbjct: 218 EL---AAQAGVRVYTIGIGGGELGVRSLFGMRLLRQASDFDPATLERIAEITGGRAFTAD 274 Query: 347 NSRKLHDAFLRI 358 + ++L + + Sbjct: 275 SRQQLEAVYDEL 286 >gi|119478003|ref|ZP_01618103.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] gi|119448916|gb|EAW30158.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] Length = 341 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 75/230 (32%), Gaps = 35/230 (15%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATR 197 ++ P + ++I+ D+M+ +D+S SM G +D+L Sbjct: 81 GLLVVAAARPEWVGEPIEIN---RSARDLMVAVDLSGSMEAQDFTTEQGEKIDRLTAVK- 136 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 ++ + R GL+ F S P + ++ Sbjct: 137 ------QVLTEFSQRRDGDRLGLIVFGSAAYLQAPFTADKDTWLTLLQ--------ETEI 182 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 + A I DA + D + +I LTDG ++ + ++ N R Sbjct: 183 AMAGASTSIGDAIGLSISTFEHSDTDNRVLIVLTDGNDTGSRVPPVDAARVAN---ARDV 239 Query: 318 IVYAIGVQAE---AADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 +Y I + D LK + + ++ + + L A+L I Sbjct: 240 KIYTIAIGDPETIGEDAMDVDTLKQVSDITGGAYFEALDRQALERAYLDI 289 >gi|330973664|gb|EGH73730.1| von Willebrand factor, type A [Pseudomonas syringae pv. aceris str. M302273PT] Length = 352 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 91/244 (37%), Gaps = 42/244 (17%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +YAIG+ ++ Sbjct: 178 DAIGLALKRLRMRPATSRALVLVTDGANNAGQIDP---VTAARLAAEEGVKIYAIGIGSD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L DA + ++ Sbjct: 235 PDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDALEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|146282738|ref|YP_001172891.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri A1501] gi|145570943|gb|ABP80049.1| von Willebrand factor type A domain protein [Pseudomonas stutzeri A1501] Length = 339 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 91/255 (35%), Gaps = 55/255 (21%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGP 187 R ++P++ + P + + + + G D+++ +DVS SM+ G Sbjct: 57 RQQLPYLTVWLLLLFAA-ARPQWLGEPLPLPT---SGRDLLLAVDVSGSMDYPDMQWQGE 112 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 + +L + + + ++ + R GL+ F SK PL + + ++ ++ Sbjct: 113 ELTRLELVKVLLGDFIEQ-------RHGDRVGLILFGSKAYLQAPLTFDRRTVRVWLDEA 165 Query: 248 ---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 I GS T + A ++ + + ++ +TDG N+ +D Sbjct: 166 RVGIAGSNTAIGDAIGLAVKRLRERPTN-----------SRVLVLITDGANNGGELDP-- 212 Query: 305 SLFYCNEAKRRGAIVYAIGVQA-----------------EAADQFLKNCA--SPDRFYSV 345 L A ++ IG+ A + + L+ A + ++ Sbjct: 213 -LLAATLAAEESVRIHTIGIGAVPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRA 271 Query: 346 QNSRKLHDAFLRIGK 360 +S +L IG Sbjct: 272 ASSAEL----KAIGA 282 >gi|104780882|ref|YP_607380.1| hypothetical protein PSEEN1727 [Pseudomonas entomophila L48] gi|95109869|emb|CAK14574.1| conserved hypothetical protein; Willebrand factor type A domain protein [Pseudomonas entomophila L48] Length = 358 Score = 94.5 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 78/230 (33%), Gaps = 50/230 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + V +S+ G D+++ +DVS SM+ G + +L + + + L Sbjct: 75 RPQWLGDPVPVSA---SGRDLLVAVDVSGSMDFPDMQWQGDEISRLDLVKALMGDFLQD- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAY 263 R GL+ F S+ PL + + ++ ++ G T + A Sbjct: 131 ------RQGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + +I +TDG N+ I L A + G +Y IG Sbjct: 185 KRLRQRPAQ-----------SRVLILITDGANNGGQIHP---LTAARLAAQEGVRIYTIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKLHDA 354 + A LK A + ++ + +L+ Sbjct: 231 IGANPEASGTPGLLGLNPSLDLDEASLKEIAGITHGTYFRAHDGAELNAI 280 >gi|240137370|ref|YP_002961839.1| hypothetical protein MexAM1_META1p0632 [Methylobacterium extorquens AM1] gi|240007336|gb|ACS38562.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens AM1] Length = 339 Score = 94.5 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 45/254 (17%), Positives = 87/254 (34%), Gaps = 34/254 (13%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +++A R + + + + P L+ + + + ++++ LD+S SM Sbjct: 58 SIAARGRRRLWLVGTLWAALVVALAGPRLVLPATALPA---SAREIVLALDLSGSMERKD 114 Query: 186 ----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G + +L R E + R GLV F+ + + + Sbjct: 115 FSLDGETVSRLAAVKRVGAEFI-------RRRAGDRIGLVEFADQAYVAAAPTFDTATVA 167 Query: 242 EKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + G +T GL A ++ A+ A K ++ L+DG N++ Sbjct: 168 RTLEEATIGLVGRSTGIGDGLGLALKRLAPAQVAAADGAGPPPSRDKVVVLLSDGANNAG 227 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGV----QAEA--------ADQFLKNCA--SPDRFYS 344 K+ AK G VY I + A+ + L+ A S R + Sbjct: 228 QTAPKD---VAALAKDLGVRVYTIALGPIDMADNPNNEQDVVDVETLRAMAETSGGRAFR 284 Query: 345 VQNSRKLHDAFLRI 358 V+ + L + I Sbjct: 285 VKTTDDLENVANAI 298 >gi|91225506|ref|ZP_01260628.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] gi|91189869|gb|EAS76142.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] Length = 356 Score = 94.5 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 86/240 (35%), Gaps = 31/240 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + ++ + P ++ S +G D+M+V+D+S SM + G + +L Sbjct: 78 LILSWLLVVCAMAKPTVLGEPQVRES---LGRDVMVVVDLSGSMAEPDFTSRTGEKISRL 134 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 A + E + R GLV F P + + + + Sbjct: 135 DAAKEVLSEFVQS-------RKGDRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMA 187 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + A K+F+ ++ + + +K I LTDG ++ ++ ++ Sbjct: 188 GQSTHLGDAIGLAI-KVFEQSDQSRGALEQDQNREKVAIVLTDGNDTGSFVEPIDA---A 243 Query: 310 NEAKRRGAIVYAIGVQAEA-------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 AK +G V+ I + + A S + N +L A+ IGK Sbjct: 244 KVAKAKGVRVHVIAMGDPETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|226943994|ref|YP_002799067.1| von Willebrand factor, type A (VWA) domain-containing protein [Azotobacter vinelandii DJ] gi|226718921|gb|ACO78092.1| von Willebrand factor, type A (VWA) domain protein [Azotobacter vinelandii DJ] Length = 335 Score = 94.5 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 82/237 (34%), Gaps = 44/237 (18%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREM 202 ++ P + + + G D+++ +DVS SM G + +L + + + Sbjct: 71 GAAARPEWVGEPRPLPA---SGRDLLLAVDVSGSMEYADMHWQGESIGRLELVKHLLGQF 127 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ R GL+ F S+ PL + + ++ + ++ G+ Sbjct: 128 IED-------RRGDRVGLILFGSQAYLQAPLTFDRRTVRTWL--------EEAAIGIAGK 172 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I DA + + +I +TDG N++ I + A G ++ I Sbjct: 173 DTAIGDAIGLGLKRLRQRPAQSRVLILVTDGANTAGEIAPSVA---ARLAAAEGVRIHTI 229 Query: 323 GVQAEAADQ-----------------FLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 G+ A+ L+ A + ++ ++S +L + + Sbjct: 230 GIGADPRQDGPPGLLGLTPGLDLDEPTLRAIAEETGGSYFRARSSEELRAIEETLAR 286 >gi|305665951|ref|YP_003862238.1| BatA protein [Maribacter sp. HTCC2170] gi|88710726|gb|EAR02958.1| batA protein [Maribacter sp. HTCC2170] Length = 332 Score = 94.5 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 53/286 (18%), Positives = 98/286 (34%), Gaps = 55/286 (19%) Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 + E++ SL I ++ + + F+F + T + +K Sbjct: 28 LKRREQTASLKISSLKGFSKSSILPKIKPLL-FVFRILALASIIVAMARPQTEDISTRTK 86 Query: 165 SDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + G+D++M +DVS SM D + L D I+ P+ R GLV Sbjct: 87 TTKGIDIVMAIDVSSSMLARDLKPNRLSALKEVAA------DFIRQRPN----DRIGLVA 136 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ + P+ + + + +G T GL + N++ K Sbjct: 137 YAGEAFTKTPITSDKSIVLNSLREITYGQLNDGTAIGMGLATSVNRL-----------KE 185 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--------- 330 K II LTDG N+S I+ + + + A G Y IG+ Sbjct: 186 SKAISKIIILLTDGVNNSGFIEPQTA---ADLAVEYGIKSYTIGLGTNGNALSPIAYNAD 242 Query: 331 --------------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L+ A + +++ ++ KL + I K Sbjct: 243 GSYRYGMRQVEIDEKLLEGIAETTGGKYFRATDNEKLEAIYDEINK 288 >gi|163786711|ref|ZP_02181159.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878571|gb|EDP72627.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 335 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 52/281 (18%), Positives = 94/281 (33%), Gaps = 51/281 (18%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +++ L I K + + F + T V +K+ G Sbjct: 33 KQTAELKISSIQGFKVTSSIWSKLRHLLFALRLIALGLLITALVRPRTVDVSTKTKTTRG 92 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++M +DVS SM ++L + + ++ R GLV ++ + Sbjct: 93 IDIVMSIDVSASMLAKDLLP-NRLEALKKVAADFIE-------GRPNDRIGLVEYAGEAY 144 Query: 229 QTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P+ + + + + T GL + N++ D++ K Sbjct: 145 TKTPITSDKSIVLRSMRDIKYNTIIEGGTAIGMGLATSVNRLKDSRAK-----------S 193 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------------- 328 K II LTDG N+ ID K + A G VY IG+ Sbjct: 194 KVIILLTDGVNNGGFIDPKIASEL---AVEYGIKVYTIGLGTNGTALSPVRINPNGSFQY 250 Query: 329 -------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + LK A + +++ N++KL + I K Sbjct: 251 GRQKVEIDEDLLKEIADVTGGKYFRATNNKKLAQIYDEINK 291 >gi|150375951|ref|YP_001312547.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150030498|gb|ABR62614.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 334 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 81/222 (36%), Gaps = 27/222 (12%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREM 202 + P + ++ K++ D+M+ LD+S SM+ F L +++ + Sbjct: 72 LVLTALARPQFVEPPIE---KTEPQRDLMLALDLSQSMDTRDFSDPQGNLQARVDAVKTV 128 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + D R GLV F P ++ + T + PG+ Sbjct: 129 ---VADFVDRRPYDRLGLVAFGDAPYPLVPFTMDHATVRSML--------TGALPGMAGP 177 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + DA + + K ++ LTDG +++ + ++ A + ++ + Sbjct: 178 KTALGDALGLSIKLFQQSQAPDKVLVVLTDGNDTASKMPPDKA---AEIASQNHIRIHTV 234 Query: 323 GVQAEAAD-------QFLKNC--ASPDRFYSVQNSRKLHDAF 355 G+ A + L+ A+ R++ Q+ + L + + Sbjct: 235 GIGNPDAQGEEKLDTETLQKIATATGGRYFFGQDQQALAEIY 276 >gi|83816834|ref|YP_446668.1| von Willebrand factor type A domain-containing protein [Salinibacter ruber DSM 13855] gi|83758228|gb|ABC46341.1| von Willebrand factor type A domain protein [Salinibacter ruber DSM 13855] Length = 289 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 49/217 (22%), Positives = 79/217 (36%), Gaps = 49/217 (22%) Query: 168 GLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+MMVLD S SM + F P + A + ++ R GL+ F+++ Sbjct: 50 GIDIMMVLDASTSMQAEDFQP--TRFEAAREAAGAFVE-------GRVSDRVGLIVFAAE 100 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL +Q + + G T L A N++ K + Sbjct: 101 AYTQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRL-----------KDSEAE 149 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------------ 325 K I LTDG N+ ID + + A+ G VYAIGV Sbjct: 150 SKVAILLTDGRNNRGQIDPRTA---AEVARTMGVRVYAIGVGSSEDRDTWEEPLPQGQRD 206 Query: 326 --AEAADQFLK--NCASPDRFYSVQNSRKLHDAFLRI 358 A + L+ + ++ +++S N L + I Sbjct: 207 ESAGVDAEMLRSVSVSTGGQYFSATNRDALERIYAEI 243 >gi|110598614|ref|ZP_01386881.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339783|gb|EAT58291.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 336 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 43/241 (17%), Positives = 83/241 (34%), Gaps = 39/241 (16%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 C + P ++ + ++ G+D+M+ LD+S SM G G +L A R+ + Sbjct: 77 LCVFALAGPRMVVRQTEAEAR---GIDVMLALDISESMLQKDGSGKSRLDAAREVARKFV 133 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + R GLV F K PL + I+ + + A Sbjct: 134 -------LRRSSDRIGLVVFRGKGYTQCPLTIDHDVLAMLIDHISPQVIQDEGTAIGSAI 186 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + KG +K II +TDGEN++ ++ + A + G +Y + Sbjct: 187 LIATN-------RFKGSTSLQKVIILITDGENNTGDVGPATA---ATLAAQNGIRIYVVN 236 Query: 324 VQAEAADQF-----------------LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ L+ A + ++ ++ L + IG+ Sbjct: 237 AGFKSGGSAGNLSAESSAHAAMDEASLRGIARTTGGGYFRAEDPSVLDNTIKTIGRLETA 296 Query: 365 Q 365 + Sbjct: 297 R 297 >gi|300023811|ref|YP_003756422.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299525632|gb|ADJ24101.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 466 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 61/450 (13%), Positives = 138/450 (30%), Gaps = 106/450 (23%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 I F + +G ++IL ++ V+F ++GL ++ +++ D ++L A + Sbjct: 29 IEKFSRDTRGDVAILFGLMALVLFAMIGLAVDYGRFVNARSQTIAATDAAVLAGARALQT 88 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + R+ ++ + +N L + + + +T++ + Sbjct: 89 NGGDQAA-------ALRVAQSYYAQATKNRLSLSNDTINFAIADNATAMVTTGNAVITTP 141 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS----- 180 + +P + + + + + +++ L++ M+LD++ S Sbjct: 142 FMGLAGTGSLPILRKDGSDYSKA---------VLAVGGNAELNLEIAMMLDITGSMRGQK 192 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR----------------------- 217 + D D L + + + +I VR Sbjct: 193 LTDMKAAASDLLNIVVWTDQSKFTSKVAIVPFAYDVRLPAAAFKKATGTTSTNYPCVVER 252 Query: 218 SGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEKINR 246 +G ++ + PL + K+N Sbjct: 253 TGTEKYTDAAPATGKYVMVHNTSSTKKNKTTYSPTCDVASSAEVLPLTSDKSTLLAKVNG 312 Query: 247 LIFGSTTKSTPGLEYAYNKIFDA------KEKLEHIAKGHDDYKKYIIFLTDGE-NSSPN 299 L +T G +A+ + A D+ +K + +TDGE N+ Sbjct: 313 LSTAGSTAGHIGTAWAWYMLAPNWSSLWTSASSTPAAYNADNLRKIAVLMTDGEYNTQYT 372 Query: 300 ID---------------------NKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNC 336 + + +++ C K +G VY +G Q + L C Sbjct: 373 TNGVPDDSSSLTRCPNAANGVCSSAQAVSQCTAMKAKGIEVYTVGFQLDNQTAIDTLSQC 432 Query: 337 AS-PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A+ FY+ L AF I ++ Sbjct: 433 ATDSSHFYNSTTGDALKAAFRDIALKISTL 462 >gi|325273881|ref|ZP_08140055.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] gi|324100983|gb|EGB98655.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] Length = 311 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 72/203 (35%), Gaps = 32/203 (15%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDK 191 PF+ ++ P + V +++ G D+++ +DVS SM+ + + Sbjct: 14 PFVVIWL-LLLCAAARPQWLGDPVPVAA---SGRDLLVAVDVSGSMDFPDMQWKNEDISR 69 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 L + + + L R GL+ F S+ PL + + ++ ++ Sbjct: 70 LDLVKALLGDFLQD-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGI 122 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + A ++ + + ++ +TDG N+ I L Sbjct: 123 AGKNTAIGDAIGLAVKRLRQRPAQ-----------SRVLVLITDGANNGGQIHP---LTA 168 Query: 309 CNEAKRRGAIVYAIGVQAEAADQ 331 A + G +Y IG+ A Sbjct: 169 ARLAAQEGVRIYTIGIGANPEAS 191 >gi|114798549|ref|YP_759188.1| hypothetical protein HNE_0458 [Hyphomonas neptunium ATCC 15444] gi|114738723|gb|ABI76848.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 460 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 61/152 (40%), Gaps = 12/152 (7%) Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF---DAKEKLEHIAKGHDDYK 284 PL + + + I+ + T G + + + ++ A +D+ Sbjct: 307 NTPIPLTYNRNKLHDFIDDMTPRRNTAGHIGQAWGWYLVSPEWNSVWPAGSKALPYDEPD 366 Query: 285 --KYIIFLTDGE------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 K +I ++DG+ N++ ++ C++ K + ++Y +G A L C Sbjct: 367 ATKVVIMMSDGQYNETRHNNAYPSSVTQAEAICDKMKEKEVVIYTVGFDAGYGQDVLNYC 426 Query: 337 AS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 AS P Y N ++L +A+ I + + RI Sbjct: 427 ASNPAFAYKPTNGQELTEAYKSIARSISDLRI 458 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 44/222 (19%), Positives = 83/222 (37%), Gaps = 27/222 (12%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 RNFF N G+++++ A+ + I + G I+ KA++ +D ++L + + Sbjct: 6 RNFFRNESGNVAMIAALTIIPIVGIAGFAIDFQVTTTQKARVQQAVDSAVLAATKSMQDG 65 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 ++ K+ ND+ K I + L N D+ I+ + L ++ Sbjct: 66 KDRAYSLKEANDY----FKGILNQSNNSGL--NCTNIDLVYIDETEELEGHVECSQNT-- 117 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 T A H ++S+ L++ V DVS SM + Sbjct: 118 --------------TLSKVAGIRHLDFNVSSAATYGIGK---LEIAFVFDVSGSMAND-- 158 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 M L VA R L ++ VR +V++ + + Sbjct: 159 NRMGNLKVAAREAVNTLLPVEGYAGDPEDVRLAMVSYDTMVN 200 >gi|157961563|ref|YP_001501597.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157846563|gb|ABV87062.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 328 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 81/222 (36%), Gaps = 33/222 (14%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDII 206 ++ P+ + ++I +G +MM+ +D+S SM F ++ ++ +L Sbjct: 75 AAVRPVWLGEPIQI---EQVGREMMIAVDLSGSMEARDFVDPQGEILRRVDGVKALLQSF 131 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAY 263 + R GL+ F P Q + + + ++ + G+ T + A Sbjct: 132 LL---KRDSDRIGLIAFGENAYLQAPFTQDKQILSQLLQQMDVRMAGAGTAIGDAIGVAV 188 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N + + K ++ LTDG ++S ++ Y E +G ++Y I Sbjct: 189 NHFEQS-----------EVENKVLLLLTDGNDTSSEFPPLDAAHYAGE---QGVVIYPIA 234 Query: 324 VQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFL 356 + L+ A + R + + + L + + Sbjct: 235 IGDPKNVGEDSLDIATLERIADLTQGRVFEADDGQSLIEVYK 276 >gi|224370036|ref|YP_002604200.1| hypothetical protein HRM2_29490 [Desulfobacterium autotrophicum HRM2] gi|223692753|gb|ACN16036.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 598 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 42/172 (24%), Positives = 68/172 (39%), Gaps = 28/172 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G+D+M+ LD S SM +L A R I +++ +I+S R+GLV F+ Sbjct: 85 EQKGVDIMICLDCSRSMLAQ-DIKPTRLERAKREIIDLMGMIQS-------DRAGLVAFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + PL +N L T +E A N E Sbjct: 137 GRAILQCPLTLDHSAFNLFLNALEPDYLPVGGTDLGGAIETALNGFEKEVES-------- 188 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +K II +TDGEN++ +S+ +A +G ++ IGV + Sbjct: 189 ---EKAIILITDGENTTG-----DSIEMAKKAADQGVKIFCIGVGSPEGAPV 232 >gi|317154611|ref|YP_004122659.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] gi|316944862|gb|ADU63913.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] Length = 395 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 61/427 (14%), Positives = 128/427 (29%), Gaps = 127/427 (29%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 ++ V GL ++ + + +L +D L + ++ + + K Sbjct: 1 MLLAVAGLAVDMGNMYVTHTRLQAAVDAGALAGSLELPYDPDLS--------------KG 46 Query: 87 IWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCA 146 I Q + + N + ++ T + ++ ++ ++ + F A Sbjct: 47 IVQQAVSDMIHTNMPDAVVESVSPGTEVR----------SVVVTAKAKVNLLVMGFLNLA 96 Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + + L+++ V+D S SM G ++ + A+ + ++L Sbjct: 97 -----DQWVEAGAAAGFNK---LEIVFVIDNSGSMK---GTPINLVKEASIGLTDLLIPD 145 Query: 207 KSIPDV-------NNVVRSG--------------------------------LVTFSSKI 227 PD VR G + ++I Sbjct: 146 GQQPDTKVGLVAFRGKVRLGGDVDGLEAGCRNADGSVNTGIHEDFMSMYWALSSYYRNQI 205 Query: 228 VQTF--------PLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHI 276 PL+ I E IN T + G+++A + + + Sbjct: 206 DLDTCSSIPESRPLSQDKGDIVEGINSQTALGSASGTVISEGIKWARHMLTPEAPYTQAG 265 Query: 277 AKGHDDYKKYIIFLTDGEN---------------------------------SSPNIDNK 303 K D++K +I LTDG+ + N+ Sbjct: 266 DK--KDFRKIMIVLTDGDTEDGECGGSYRASFRPNNYWTNAYYGMGVDTAHCQDGGVLNQ 323 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCASP-----DRFYSVQNSRKLHDAFL 356 + L AK G ++AI +K AS D ++ + + D F Sbjct: 324 DMLAEAQLAKDEGIEIFAIRFGVSDNTDISLMKQIASSKAGTNDHYFDAPSVYDIPDVFK 383 Query: 357 RIGKEMV 363 +IGK++ Sbjct: 384 KIGKQLG 390 >gi|152990152|ref|YP_001355874.1| von Willebrand factor A [Nitratiruptor sp. SB155-2] gi|151422013|dbj|BAF69517.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 305 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 85/218 (38%), Gaps = 27/218 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G D+++ +D S SM + D +R + +K+ G+V F Sbjct: 79 KKKGYDIVLAIDASGSMQEKGFDPTDPQKTKFDVVRSL---VKAFISKRRNDNIGVVIFG 135 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S PL + + +++ ++ L G T I DA + + K Sbjct: 136 SFAYIASPLTFNKEAVKKILDYLDIGVAGSKT--------AIDDALIESVRLLKESQAKS 187 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNCASPDR- 341 K +I LTDG +++ ++ AK+ G +Y IG+ + + FL+ A Sbjct: 188 KIVILLTDGIDTASKTPPDVAVKM---AKKYGVKIYTIGIGDKRGIDEAFLRWLAQQGHG 244 Query: 342 -FYSVQNSRKLHDAFLRI---------GKEMVKQRILY 369 ++ +++ L + I GKE+VK+ LY Sbjct: 245 YYFYAKDASMLRKIYDEINRLEPSEIRGKEIVKKDELY 282 >gi|83941160|ref|ZP_00953622.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] gi|83846980|gb|EAP84855.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] Length = 480 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 64/453 (14%), Positives = 148/453 (32%), Gaps = 91/453 (20%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA-TKI 63 + F G ++IL ++ ++ V G+ ++ +++L + D ++L A Sbjct: 27 RLTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAADLDQ 86 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTD---FRNELRENGFAQDINNIERSTSLSIIIDD 120 + + N+ D FR + D I R ++ + Sbjct: 87 MRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASKDMDTQFIGRFGFPTLEVPA 146 Query: 121 QHKDYNLSAVSRYEMPFIFC------------------TFPWCANSSHAPLLITSSVKIS 162 + A + + L+ S V S Sbjct: 147 HSQAEERVAKVEISLVLDISGSMATNNRLGEVQDAADIFLDTVLKDENEDLISVSLVPYS 206 Query: 163 SKSDIGLDMMMVLDVSLSMNDHF-----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + G +M ++V+ + D + + + + + + D N R Sbjct: 207 EQVNAGPLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTRYNQMQHFQWNYDGRNNYR 266 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA-------YNKIFDAK 270 V + P + + ++ +I+ L+ + T G+++A + I ++ Sbjct: 267 DDTVCPRYDYERITPFSQNKRTLKNQIDDLVPRAGTSIFLGMKWAAAMLDPAFRDINNSL 326 Query: 271 EKLEHIAK---------GHDDYKKYIIFLTDG--------ENSSPNIDNK---------- 303 H+ + + K +I +TDG N+ N D++ Sbjct: 327 VNAGHVDREFYNRPASYTDSETLKTVILMTDGANDNSFRISNTYYNEDSEYVHWNRYNLW 386 Query: 304 -------------------------ESLF--YCNEAKRRGAIVYAIGVQAEAAD-QFLKN 335 +L C+ AK + ++++IG + + D +++ Sbjct: 387 WYLRREVNSRYWGYFYYQKYNKSLGNTLLSNICDAAKAKRIVIWSIGFEVDDEDVPAMQD 446 Query: 336 CA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 CA SP F+ V+ +L +AF I +++ + R+ Sbjct: 447 CASSPSHFFRVEGV-ELSEAFRAIARQINQLRL 478 >gi|66044963|ref|YP_234804.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] gi|63255670|gb|AAY36766.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] Length = 352 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 91/244 (37%), Gaps = 42/244 (17%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 TTARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +YAIG+ ++ Sbjct: 178 DAIGLALKRLRMRPATSRALVLVTDGANNAGQIDP---VTAARLAAEEGVKIYAIGIGSD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L DA + ++ Sbjct: 235 PDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDALEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|330975134|gb|EGH75200.1| von Willebrand factor, type A [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 352 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 90/244 (36%), Gaps = 42/244 (17%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +Y IG+ ++ Sbjct: 178 DAIGLALKRLRMRPATSRALVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGSD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L DA + ++ Sbjct: 235 PDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDALEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|126662671|ref|ZP_01733670.1| batA protein [Flavobacteria bacterium BAL38] gi|126626050|gb|EAZ96739.1| batA protein [Flavobacteria bacterium BAL38] Length = 334 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 53/257 (20%), Positives = 92/257 (35%), Gaps = 54/257 (21%) Query: 136 PFIFCT--FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 PF++ A + V SK+ G+D++M +DVS SM + ++L Sbjct: 56 PFLYVLRLLALSAIIIAMARPRSVDVTSKSKTTRGIDIVMAIDVSSSMLAN-DLKPNRLE 114 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-- 251 + + R GLV ++ + P+ I + + + F Sbjct: 115 ALKKVAATFVQD-------RINDRIGLVVYAGESYTRTPVTSDKTIILQSLKSVEFDDSI 167 Query: 252 ---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T GL A N+I D+K K + II LTDG N+S ID + + Sbjct: 168 IADGTGIGVGLATAINRIKDSKAK-----------SRIIILLTDGVNNSGTIDPRTA--- 213 Query: 309 CNEAKRRGAIVYAIGVQAEAAD-----------------------QFLKNCA--SPDRFY 343 + AK G VY IG+ + ++ A + +++ Sbjct: 214 ASIAKEYGIKVYTIGIGTNGKAMFPVAKDANGKLVFKMMPVEIDEKLMQEIAKNTDAKYF 273 Query: 344 SVQNSRKLHDAFLRIGK 360 +++KL + I K Sbjct: 274 RATSNKKLQAIYDEINK 290 >gi|218890727|ref|YP_002439591.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] gi|254236045|ref|ZP_04929368.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|126167976|gb|EAZ53487.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|218770950|emb|CAW26715.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] Length = 340 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 35/248 (14%), Positives = 83/248 (33%), Gaps = 56/248 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + + + G D+++ +DVS SM+ + +L + + + ++ Sbjct: 75 RPQWVGDPLPLPA---SGRDLLLAVDVSGSMDYRDMRWQEDEISRLELIKKLFGDFIED- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAY 263 R GL+ F S+ PL + ++ ++ G T + A Sbjct: 131 ------RRGDRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ I + A + +Y IG Sbjct: 185 KRLRQRPAE-----------SRVLVLITDGANTGGQIAPQ---IAAQLAAEQQVKIYTIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKL------HDAFLRI 358 + A+ L+ A + ++ ++S +L D + Sbjct: 231 IGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESISATLDRLEPV 290 Query: 359 GKEMVKQR 366 ++ + R Sbjct: 291 AQQTTRAR 298 >gi|218676637|ref|YP_002395456.1| hypothetical protein VS_II0874 [Vibrio splendidus LGP32] gi|218324905|emb|CAV26683.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 355 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 89/240 (37%), Gaps = 31/240 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 + T+ + P ++ + +G D+M+V+D+S SM + G + +L Sbjct: 73 LIITWVLVICALAKPTILGEP---QVREQLGRDVMVVVDLSGSMAEQDFTSKQGDKISRL 129 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 +++ R GL+ F P + + + + Sbjct: 130 DATK-------EVLADFAKTRKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMA 182 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + A K+F+ ++ + + +K +I LTDG ++ ++ ++ Sbjct: 183 GQSTHLGDAIGLAI-KVFEQSAAVQDSSVDANVKEKVVIVLTDGNDTGSFVEPIDA---A 238 Query: 310 NEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 AK +G ++ I + + +K A S + N +L A+ +IG+ Sbjct: 239 KVAKAKGVRIHVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGE 298 >gi|298488105|ref|ZP_07006142.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298157384|gb|EFH98467.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 352 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 37/244 (15%), Positives = 90/244 (36%), Gaps = 42/244 (17%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +Y IG+ ++ Sbjct: 178 DAIGLGLKRLRLRPATSRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGSD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L D+ + ++ Sbjct: 235 PDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDSLEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|15598269|ref|NP_251763.1| hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] gi|9949180|gb|AAG06461.1|AE004731_9 hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] Length = 340 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 35/248 (14%), Positives = 83/248 (33%), Gaps = 56/248 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + + + G D+++ +DVS SM+ + +L + + + ++ Sbjct: 75 RPQWVGDPLPLPA---SGRDLLLAVDVSGSMDYRDMRWQDDEISRLELIKKLFGDFIED- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAY 263 R GL+ F S+ PL + ++ ++ G T + A Sbjct: 131 ------RRGDRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ I + A + +Y IG Sbjct: 185 KRLRQRPAE-----------SRVLVLITDGANTGGQIAPQ---IAAQLAAEQQVKIYTIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKL------HDAFLRI 358 + A+ L+ A + ++ ++S +L D + Sbjct: 231 IGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESISATLDRLEPV 290 Query: 359 GKEMVKQR 366 ++ + R Sbjct: 291 AQQTTRAR 298 >gi|254241773|ref|ZP_04935095.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] gi|126195151|gb|EAZ59214.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] Length = 340 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 35/248 (14%), Positives = 83/248 (33%), Gaps = 56/248 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + + + G D+++ +DVS SM+ + +L + + + ++ Sbjct: 75 RPQWVGDPLPLPA---SGRDLLLAVDVSGSMDYRDMRWQDDEISRLELIKKLFGDFIED- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAY 263 R GL+ F S+ PL + ++ ++ G T + A Sbjct: 131 ------RRGDRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ I + A + +Y IG Sbjct: 185 KRLRQRPAE-----------SRVLVLITDGANTGGQIAPQ---IAAQLAAEQQVKIYTIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKL------HDAFLRI 358 + A+ L+ A + ++ ++S +L D + Sbjct: 231 IGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESISATLDRLEPV 290 Query: 359 GKEMVKQR 366 ++ + R Sbjct: 291 AQQTTRAR 298 >gi|163850298|ref|YP_001638341.1| von Willebrand factor type A [Methylobacterium extorquens PA1] gi|163661903|gb|ABY29270.1| von Willebrand factor type A [Methylobacterium extorquens PA1] Length = 339 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 45/254 (17%), Positives = 86/254 (33%), Gaps = 34/254 (13%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +++A R + I + + P L+ + + + ++++ LD+S SM Sbjct: 58 SIAARGRRRLWLIGTLWAALVVALAGPRLVLPATALPA---SAREIVLALDLSGSMERKD 114 Query: 186 ----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G + +L R E + R GLV F+ + + + Sbjct: 115 FSLDGETVSRLAAVKRVGAEFI-------RRRAGDRIGLVEFADQAYVAAAPTFDTATVA 167 Query: 242 EKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + G +T GL A ++ A+ K ++ L+DG N++ Sbjct: 168 RTLEEATIGLVGRSTGIGDGLGLALKRLAPAQVAAADGEGPPPARDKVVVLLSDGANNAG 227 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGV----QAEA--------ADQFLKNCA--SPDRFYS 344 K+ AK G VY I + A+ + L+ A S R + Sbjct: 228 QTAPKD---VAALAKDLGVRVYTIALGPIDMADNPNNEQDVVDVETLRAMAETSGGRAFR 284 Query: 345 VQNSRKLHDAFLRI 358 V+ + L + I Sbjct: 285 VKTTDDLENVANAI 298 >gi|108799422|ref|YP_639619.1| hypothetical protein Mmcs_2455 [Mycobacterium sp. MCS] gi|119868535|ref|YP_938487.1| hypothetical protein Mkms_2500 [Mycobacterium sp. KMS] gi|126435076|ref|YP_001070767.1| hypothetical protein Mjls_2492 [Mycobacterium sp. JLS] gi|122976988|sp|Q1B971|Y2455_MYCSS RecName: Full=UPF0353 protein Mmcs_2455 gi|166987492|sp|A3PZE9|Y2492_MYCSJ RecName: Full=UPF0353 protein Mjls_2492 gi|166987495|sp|A1UFT9|Y2500_MYCSK RecName: Full=UPF0353 protein Mkms_2500 gi|108769841|gb|ABG08563.1| von Willebrand factor, type A [Mycobacterium sp. MCS] gi|119694624|gb|ABL91697.1| von Willebrand factor, type A [Mycobacterium sp. KMS] gi|126234876|gb|ABN98276.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 335 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 81/245 (33%), Gaps = 28/245 (11%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 A+ + + + +M+V+DVS SM +L A + + Sbjct: 69 AILLVASLVLLTVAMAGPTRDVRVPRNRAVVMLVIDVSQSMRATDVSP-SRLAAAQEASK 127 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 + D + + GL+ ++ + + I++L T + G+ Sbjct: 128 QFADELTP------GINLGLIAYAGTATVLVSPTTNREATKTAIDKLQLADRTATGEGIF 181 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAI 318 A I I G + I+ +DG+ + P+ + K + AK +G Sbjct: 182 TALQAIATVG---AVIGGGDEPPPARIVLFSDGKETVPSNPDNPKGAFTAARTAKDQGVP 238 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 + I DQ LK A S ++ + +L + + + +++ Sbjct: 239 ISTISFGTPYGYVEINEQRQPVPVDDQMLKKIADLSEGEAFTASSLEQLREVYANLQQQI 298 Query: 363 VKQRI 367 + I Sbjct: 299 GYETI 303 >gi|237800421|ref|ZP_04588882.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] gi|331023280|gb|EGI03337.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] Length = 352 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 87/244 (35%), Gaps = 48/244 (19%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 75 RPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKNDEVSRLVLVQQLLGDFL 128 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + Q ++ ++ I G T + A ++ Sbjct: 129 ESRKGDRVGLILFGTQAFLQAPLTYDRQTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLR 188 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + ++ +TDG N++ ID L A G +Y IG+ ++ Sbjct: 189 MRPAN-----------SRVLVLVTDGANNAGQIDP---LTAARLAADEGVKIYTIGIGSD 234 Query: 328 AADQFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEM 362 L++ S +++ ++ +L D+ + ++ Sbjct: 235 PEKNALQSALGLSASLDLDEPTLKEIARLSGGQYFRTRDGDQLEKIRVTLDSLEPVAQQP 294 Query: 363 VKQR 366 + R Sbjct: 295 TQAR 298 >gi|107102622|ref|ZP_01366540.1| hypothetical protein PaerPA_01003686 [Pseudomonas aeruginosa PACS2] Length = 340 Score = 93.3 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 33/230 (14%), Positives = 77/230 (33%), Gaps = 50/230 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + + + G D+++ +DVS SM+ + +L + + + ++ Sbjct: 75 RPQWVGDPLPLPA---SGRDLLLAVDVSGSMDYRDMRWQDDEISRLELIKKLFGDFIED- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAY 263 R GL+ F S+ PL + ++ ++ G T + A Sbjct: 131 ------RRGDRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ I + A + +Y IG Sbjct: 185 KRLRQRPAE-----------SRVLVLITDGANTGGQIAPQ---IAAQLAAEQQVKIYTIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKLHDA 354 + A+ L+ A + ++ ++S +L Sbjct: 231 IGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESI 280 >gi|256419476|ref|YP_003120129.1| hypothetical protein Cpin_0430 [Chitinophaga pinensis DSM 2588] gi|256034384|gb|ACU57928.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588] Length = 336 Score = 93.3 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 52/238 (21%), Positives = 88/238 (36%), Gaps = 54/238 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSI 209 P +S I S+ G+D+++ +D+S SM P D+L A R +D Sbjct: 81 RPQTSNTSESIDSE---GIDIVLAMDISGSMLAQDLQP--DRLEAAKRVAMNFVDS---- 131 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKI 266 R GLV FS + P+ ++ +I ++ G T GL + ++ Sbjct: 132 ---RISDRIGLVIFSGESFTQCPITTDHGVLKNQIAQVKSGMLQDGTAIGMGLATSVERL 188 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +K K K II LTDG N++ ID +L AK VY IGV Sbjct: 189 RTSKAK-----------SKVIILLTDGVNNTGLIDPLTALEI---AKAFKIRVYTIGVGT 234 Query: 327 EAAD----------------------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +K + + +++ ++++L + + I K Sbjct: 235 IGKAPFPMTMPDGSIQMQMQDVQLDEPLMKKISVETGGKYFRATSNKELENIYGEIDK 292 >gi|218528855|ref|YP_002419671.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254559548|ref|YP_003066643.1| hypothetical protein METDI1003 [Methylobacterium extorquens DM4] gi|218521158|gb|ACK81743.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254266826|emb|CAX22625.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens DM4] Length = 339 Score = 93.3 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 44/254 (17%), Positives = 86/254 (33%), Gaps = 34/254 (13%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +++A R + + + + P L+ + + + ++++ LD+S SM Sbjct: 58 SIAARGRRRLWLVGTLWAALVVALAGPRLVLPATALPA---SAREIVLALDLSGSMERKD 114 Query: 186 ----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G + +L R E + R GLV F+ + + + Sbjct: 115 FSLDGETVSRLAAVKRVGAEFI-------RRRAGDRIGLVEFADQAYVAAAPTFDTATVA 167 Query: 242 EKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + G +T GL A ++ A+ K ++ L+DG N++ Sbjct: 168 RTLEEATIGLVGRSTGIGDGLGLALKRLAPAQVAAADGEGPPPARDKVVVLLSDGANNAG 227 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGV----QAEA--------ADQFLKNCA--SPDRFYS 344 K+ AK G VY I + A+ + L+ A S R + Sbjct: 228 QTAPKD---VAALAKDLGVRVYTIALGPIDMADNPNNEQDVVDVETLRAMAETSGGRAFR 284 Query: 345 VQNSRKLHDAFLRI 358 V+ + L + I Sbjct: 285 VKTTDDLENVANAI 298 >gi|312131680|ref|YP_003999020.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311908226|gb|ADQ18667.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 328 Score = 93.3 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 47/255 (18%), Positives = 91/255 (35%), Gaps = 31/255 (12%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIF-CTFPWCANSSHAPLLITSSVKISSKSDIGLDM 171 +L + + + LS R+ +P F C + P L S + G+D+ Sbjct: 48 TLILTLPQKSTSKGLSVKFRFLVPATFMCGVACLVLALARPQLSGESTDSLT---AGVDI 104 Query: 172 MMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +D+S SM + P ++L A ++ R LV F+ + Sbjct: 105 AIAIDISDSMLAEDLKP--NRLEAAKGMALRFIE-------GRTTDRIALVAFAGETATL 155 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL ++E + + S L A + + + + I + Sbjct: 156 SPLTTDYTALKEYLASINTNIIRTSGTALGMALSSCVNKLRDVAGK-------SRIAIII 208 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-----FLKNCASP--DRFY 343 +DG+N++ I + ++ AK G VY I + +++ L+ A RF+ Sbjct: 209 SDGDNTAGTIPPETAVEL---AKSFGVRVYTIAIGKPGSEEGVDEKTLRMLAGGPNGRFF 265 Query: 344 SVQNSRKLHDAFLRI 358 ++ L F I Sbjct: 266 KAADNSSLSKIFDEI 280 >gi|91201136|emb|CAJ74195.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 331 Score = 93.3 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 48/250 (19%), Positives = 88/250 (35%), Gaps = 49/250 (19%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHF---GPGMDKL 192 C + P K+ ++ G+D+++ +D+S SM + F G ++L Sbjct: 57 LRSLAIALCIIALARPQQGNEQTKVKTE---GIDIVLAVDISGSMLAEDFEMDGKRQNRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 V + +++ ++ + P GLV FS+ PL + + + + G Sbjct: 114 YVVKQVVKDFINKRSTDP-------IGLVVFSANAYTQCPLTLDYGILLQFLEKTEIGLL 166 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T I A + K I+ LTDG N+S ID L A Sbjct: 167 EDGT--------AIGSAIASSVDRLRNTKAQSKVIVLLTDGRNNSGQIDP---LTAAELA 215 Query: 313 KRRGAIVYAIG----------------------VQAEAADQFLKNCA--SPDRFYSVQNS 348 + +Y IG V+ + D+ L A + R+Y ++ Sbjct: 216 QAFNIKIYTIGAGSKGLVPYPARDLFGNRVMRQVKIDIDDESLAEIANITGGRYYRATDT 275 Query: 349 RKLHDAFLRI 358 L + + +I Sbjct: 276 GSLKEIYQQI 285 >gi|332519334|ref|ZP_08395801.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045182|gb|EGI81375.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 334 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 54/253 (21%), Positives = 89/253 (35%), Gaps = 51/253 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F + T V +K+ G+D++M +DVS SM ++L Sbjct: 60 FALRLIALALIITAIARPRTVDVSTKTKTTRGIDIVMAIDVSASMLAK-DLKPNRLEALK 118 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----ST 252 E IK P+ R GLV ++ + P+ + + + + Sbjct: 119 NVASEF---IKGRPN----DRIGLVEYAGESYTKTPITSDKSIVLRSLQEIRYNNIIEGG 171 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GL A N++ D+K K K II LTDG N+S +I+ K + A Sbjct: 172 TAIGMGLATAVNRLKDSKAK-----------SKVIILLTDGVNNSGSINPKIASEL---A 217 Query: 313 KRRGAIVYAIGVQAEA-----------------------ADQFLKNCA--SPDRFYSVQN 347 G Y IG+ + LK A + +++ N Sbjct: 218 VEFGIKTYTIGLGTNGMALSPIAIKQNGQFQYGRVKVEIDETLLKEIAQVTGGKYFRATN 277 Query: 348 SRKLHDAFLRIGK 360 ++KL + + I K Sbjct: 278 NKKLAEIYDEINK 290 >gi|260426945|ref|ZP_05780924.1| von Willebrand factor type A [Citreicella sp. SE45] gi|260421437|gb|EEX14688.1| von Willebrand factor type A [Citreicella sp. SE45] Length = 334 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 90/244 (36%), Gaps = 27/244 (11%) Query: 126 NLSAVSRYEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 +SR ++ I + W + P + + V I + D+++ +D+S SM+ Sbjct: 52 GAVVLSRPKVSVIAASLCWVLVVLALARPEQLGAPVTIET---AARDVVLAIDISGSMDS 108 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 D G + + + D+++ + + R L+ F S PL + I Sbjct: 109 RDFTAPD--GTRKQRLAGVRDVVRGFVEGRDGDRMALIVFGSAAYLQAPLTADLDAILAL 166 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 ++R + G+ + + D+ + + ++ +I L+DG +++ +D Sbjct: 167 LDR--------TQVGMAGPHTALGDSIGLAIRTFESSEIDQRLLILLSDGSDTASRMDPV 218 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDA 354 + A R ++ +GV A L+ A + ++ + L Sbjct: 219 NA---AEIAAGRDVEIFTVGVGDPDATGENRVDLDTLRAIADRTGGAYFFAADEAALTAV 275 Query: 355 FLRI 358 + I Sbjct: 276 YESI 279 >gi|89889805|ref|ZP_01201316.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] gi|89518078|gb|EAS20734.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] Length = 337 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 54/254 (21%), Positives = 90/254 (35%), Gaps = 53/254 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 FI + T+ V +K G+D+++ +DVS SM + P ++L Sbjct: 63 FILRLIVLALIITALARPQTTDVTTKTKKTEGIDIVLAVDVSASMLAEDLKP--NRLEAT 120 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS---- 251 + D IK P+ R G+V ++ + P+ IN + F Sbjct: 121 KKVAA---DFIKGRPN----DRIGVVVYAGESYTKTPITTDEMISLRAINEIAFDGVLEN 173 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GL A N++ K + K II +TDG N+S ID K + Sbjct: 174 GTAIGMGLATAVNRL-----------KDSEALSKVIILMTDGVNNSGFIDPKIASEL--- 219 Query: 312 AKRRGAIVYAIGVQAEAAD-----------------------QFLKNCA--SPDRFYSVQ 346 A VY IG+ + +K A + +++ Sbjct: 220 ALEYDIKVYTIGIGTNGNAPSPVAQIGRNKFRMAMMPVEIDEELMKQIAVDTGGKYFRAT 279 Query: 347 NSRKLHDAFLRIGK 360 N++KL + + I K Sbjct: 280 NNKKLEEIYGEIDK 293 >gi|49087064|gb|AAT51411.1| PA3073 [synthetic construct] Length = 341 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 35/248 (14%), Positives = 83/248 (33%), Gaps = 56/248 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + + + G D+++ +DVS SM+ + +L + + + ++ Sbjct: 75 RPQWVGDPLPLPA---SGRDLLLAVDVSGSMDYRDMRWQDYEISRLELIKKLFGDFIED- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAY 263 R GL+ F S+ PL + ++ ++ G T + A Sbjct: 131 ------RRGDRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ I + A + +Y IG Sbjct: 185 KRLRQRPAE-----------SRVLVLITDGANTGGQIAPQ---IAAQLAAEQQVKIYTIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKL------HDAFLRI 358 + A+ L+ A + ++ ++S +L D + Sbjct: 231 IGADPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESISATLDRLEPV 290 Query: 359 GKEMVKQR 366 ++ + R Sbjct: 291 AQQTTRAR 298 >gi|294054315|ref|YP_003547973.1| hypothetical protein Caka_0779 [Coraliomargarita akajimensis DSM 45221] gi|293613648|gb|ADE53803.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 345 Score = 93.0 bits (229), Expect = 6e-17, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 76/233 (32%), Gaps = 55/233 (23%) Query: 164 KSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + G+D+++ LD+S SM + +L A ++++ R G Sbjct: 84 QEAEGIDIVLALDLSGSMRALDLSTRENIVTRLDAAK-------EVVQEFIGKRPHDRIG 136 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 LV F++ PL +++ + RL G S + A + + Sbjct: 137 LVAFAADAFVVSPLTLNHDWLKKNVQRLELGDINLSGTAIGTAL-------GASVNRLRD 189 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------- 332 H+ + +I LTDGEN+S + L AK VY I + + Sbjct: 190 HESRSRIVILLTDGENNSGTLSP---LSAAEAAKSLNVKVYTIATGRKGRVEVAEMSRDG 246 Query: 333 -------------------------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A + +F+ S L + + I Sbjct: 247 RVIRDRNGNPLYRGRSELSDYDESELREIAKLTGGQFFKASESGDLENIYDEI 299 >gi|319786316|ref|YP_004145791.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317464828|gb|ADV26560.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 340 Score = 93.0 bits (229), Expect = 7e-17, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 83/247 (33%), Gaps = 46/247 (18%) Query: 137 FIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMD 190 + W ++ P + V + M++ +D+S SM++ +D Sbjct: 65 LMVMALAWVLLCTAAARPQRLGEPVAP---AQQARQMLLAVDLSGSMSEVDMMLGMQPVD 121 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 +L A I + L + R GL+ F + PL + ++E++ + G Sbjct: 122 RLTAAKAVIADFLQ-------RRSGDRVGLLVFGQRAYMLTPLTLDLSAVREQLRDTVAG 174 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 + T + DA + + ++ ++ LTDG N++ + ++ Sbjct: 175 LAGRET--------ALGDAIGLAVKRLRTQPEGQRVLVLLTDGVNTTGVLQPLKAAELAA 226 Query: 311 EAKRRGAIVYAIGVQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKL 351 + VY I + L+ A + RF+ ++ +L Sbjct: 227 AEQ---VRVYTIAFGGDGGGFSLFGVQVPVQGDEVDEATLRKVAEITGGRFFRAHDANQL 283 Query: 352 HDAFLRI 358 + + Sbjct: 284 AGIYAEL 290 >gi|88798929|ref|ZP_01114511.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] gi|88778409|gb|EAR09602.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] Length = 322 Score = 93.0 bits (229), Expect = 7e-17, Method: Composition-based stats. Identities = 46/246 (18%), Positives = 78/246 (31%), Gaps = 43/246 (17%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSI 199 + P I + + G + + +D+S SM + + + I Sbjct: 62 LLIAALIRPQWIGDPLNL---DQRGRSLYLAVDLSESMLEQDMIWNQRPVSRYEAMQAVI 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 E ++ GLV F S PL + IQ + PG+ Sbjct: 119 SEFVED-------RRGDFIGLVVFGSFADVQAPLTPDLNAIQSLL--------ADLRPGM 163 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + I D + + ++ L+DGEN+S I E+ A V Sbjct: 164 ADSRTAIGDGLALAVRQLRESTTEDRVVVLLSDGENNSGEIRPDEATAV---AAAENIRV 220 Query: 320 YAIGVQAEAADQFLK----NCAS-------------PDRFYSVQNSRKLHDAFLRIGK-E 361 Y IG + D L+ +S R+Y +S +L + F I + E Sbjct: 221 YTIGFGSAGRDSLLQSFGLRSSSLDEQTLREIAEQTQGRYYRATSSAELAEVFRDIERLE 280 Query: 362 MVKQRI 367 Q+ Sbjct: 281 PSDQKT 286 >gi|301064759|ref|ZP_07205139.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441134|gb|EFK05519.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 332 Score = 93.0 bits (229), Expect = 7e-17, Method: Composition-based stats. Identities = 45/274 (16%), Positives = 87/274 (31%), Gaps = 50/274 (18%) Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 S R+ + P ++ + ++ S G+D+++ +DVS SM Sbjct: 45 GARKSRAGRWLGTLRILALGALVVALAQPQIVHGTSEVDS---SGIDIVLAVDVSGSMEA 101 Query: 184 H----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 +++ V + + + R GLV F+ + PL Sbjct: 102 LDFTINNEPANRVDVVKKVVFRFI-------GERPDDRIGLVAFAGRPYMVSPLTLDHDW 154 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + ++ + G T I A + + K +I LTDG N++ Sbjct: 155 LGRRLQTIHPGMVEDGT--------AIGSAIGSSINRLRDQKAKSKVVILLTDGMNNAGK 206 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF----------------------LKNCA 337 I + A+ G +Y IG + L+ A Sbjct: 207 ILP---VTAAEAAETLGIKIYTIGAGSRGEVPVPITDKFGNQKIVRAKVDIDEATLEKVA 263 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGK-EMVKQRIL 368 + ++Y ++ L + I K E K++I Sbjct: 264 QMTGAKYYRATDTDSLKKIYSEINKLETTKRKIR 297 >gi|26988754|ref|NP_744179.1| von Willebrand factor type A domain-containing protein [Pseudomonas putida KT2440] gi|24983548|gb|AAN67643.1|AE016394_4 von Willebrand factor type A domain protein [Pseudomonas putida KT2440] gi|313499848|gb|ADR61214.1| Von Willebrand factor type A domain-containing protein [Pseudomonas putida BIRD-1] Length = 358 Score = 93.0 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 41/245 (16%), Positives = 82/245 (33%), Gaps = 51/245 (20%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDK 191 PF+ ++ P + V I++ G D+++ +DVS SM+ + + Sbjct: 61 PFVVIWL-LLLCAAARPQWLGEPVPIAA---SGRDLLVAVDVSGSMDFPDMQWQNEDISR 116 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 L + + + L R GL+ F S+ PL + + ++ + Sbjct: 117 LDLVKALMGDFLQD-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLIEAQIGI 169 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + A ++ +E+ + ++ +TDG N+ I L Sbjct: 170 AGKNTAIGDAIGLAVKRL---RERPAQ--------SRVLVLITDGANNGGQIHP---LTA 215 Query: 309 CNEAKRRGAIVYAIGVQAEAAD-----------------QFLKNCA--SPDRFYSVQNSR 349 A + G +Y IG+ A LK A + ++ + Sbjct: 216 ARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAHDGA 275 Query: 350 KLHDA 354 +L Sbjct: 276 ELDAI 280 >gi|70730104|ref|YP_259843.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344403|gb|AAY92009.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 358 Score = 92.6 bits (228), Expect = 8e-17, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 80/230 (34%), Gaps = 50/230 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + + +++ G D+++ +DVS SM+ + +L + + + L+ Sbjct: 75 RPQWLGDPLPVAA---SGRDLLVAVDVSGSMDFPDMQWQDEDVSRLNLVKHLLGDFLE-- 129 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAY 263 R GL+ F SK PL + ++ ++ I G T + A Sbjct: 130 -----HREGDRVGLILFGSKAYLQAPLTFDRHTVRVWLDEAKIGIAGKNTAIGDAIGLAL 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + +I +TDG N+ I + A G +Y IG Sbjct: 185 KRLRQRPAQ-----------SRVLILVTDGANNGGEIAP---ITAARLAAEEGVKIYPIG 230 Query: 324 VQAEAAD-----------------QFLKNCA--SPDRFYSVQNSRKLHDA 354 + A+ LK A + R++ ++ ++L Sbjct: 231 IGADPEQSATLGVLGINPSLDLDEPALKELAQVTGGRYFRARDGQELQAI 280 >gi|237808477|ref|YP_002892917.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] gi|237500738|gb|ACQ93331.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] Length = 316 Score = 92.6 bits (228), Expect = 8e-17, Method: Composition-based stats. Identities = 45/269 (16%), Positives = 97/269 (36%), Gaps = 42/269 (15%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDI 167 + + + SR+ + W S P + + V +S Sbjct: 29 QHIIIDAPTLPYFASNITTGKSRHLLTRTLLVLSWLLIVLSLTRPQWLETPVV---QSFP 85 Query: 168 GLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D+++ +D+S SM G +D+L + ++ + R G++ F Sbjct: 86 SRDLLLAVDISQSMQIKDMTINGEAVDRLSMVKSYLQSFIKQ-------RQGDRIGIILF 138 Query: 224 SSKIVQTFPLAWGVQH---IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + P Q + +++N + G T + A +K H K Sbjct: 139 ADHAYLMVPFTQDWQAAGLLLDEVNIGLAGKFTAIGEAITLAV-------KKTLHEPKPI 191 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---------Q 331 + K +I L+DG++S I ++ AK G +Y IG+ +++ D Sbjct: 192 QN--KTLILLSDGKDSINTIQPTDA---AALAKASGLKIYTIGIGSDSTDAEAESDLDET 246 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A + +++ ++ + L + + +I Sbjct: 247 TLEEIANMTGGQYFRARSEQDLSEIYQQI 275 >gi|148548919|ref|YP_001269021.1| von Willebrand factor, type A [Pseudomonas putida F1] gi|148512977|gb|ABQ79837.1| von Willebrand factor, type A [Pseudomonas putida F1] Length = 358 Score = 92.6 bits (228), Expect = 8e-17, Method: Composition-based stats. Identities = 41/245 (16%), Positives = 82/245 (33%), Gaps = 51/245 (20%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDK 191 PF+ ++ P + V I++ G D+++ +DVS SM+ + + Sbjct: 61 PFVVIWL-LLLCAAARPQWLGEPVPIAA---SGRDLLVAVDVSGSMDFPDMQWKNEDISR 116 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 L + + + L R GL+ F S+ PL + + ++ + Sbjct: 117 LDLVKALMGDFLQD-------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLIEAQIGI 169 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + A ++ +E+ + ++ +TDG N+ I L Sbjct: 170 AGKNTAIGDAIGLAVKRL---RERPAQ--------SRVLVLITDGANNGGQIHP---LTA 215 Query: 309 CNEAKRRGAIVYAIGVQAEAAD-----------------QFLKNCA--SPDRFYSVQNSR 349 A + G +Y IG+ A LK A + ++ + Sbjct: 216 ARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAHDGA 275 Query: 350 KLHDA 354 +L Sbjct: 276 ELDAI 280 >gi|110677910|ref|YP_680917.1| hypothetical protein RD1_0526 [Roseobacter denitrificans OCh 114] gi|109454026|gb|ABG30231.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 327 Score = 92.6 bits (228), Expect = 8e-17, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 83/226 (36%), Gaps = 29/226 (12%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHFGPGMDKLGVATRSIRE 201 + P + + ++ S D+++ +D+S SM+ D + L Sbjct: 72 LLVIAISQPERLGAPIET---SKSARDLILAIDISGSMDTRDFTDASNENLQRLA----G 124 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 + D++++ + R L+ F SK PL I E +++ + G+ Sbjct: 125 VRDVVRAFVEGREGDRMALIVFGSKAYLQSPLTEDTGTIVELLDQ--------TEVGMAG 176 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + I DA + + ++ +I L+DG +++ + L A+ G ++ Sbjct: 177 PHTAIGDAIGLSIRTFEASEIEQRLLILLSDGADTASRMSP---LNAAEIARGAGVEIFT 233 Query: 322 IGVQAEAA-------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 I V L++ A + ++ + L + + RI Sbjct: 234 IAVGDPDGTGENRVDVAALQDIANRTSGSYFFAADQAALDEIYARI 279 >gi|298207017|ref|YP_003715196.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] gi|83849651|gb|EAP87519.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] Length = 334 Score = 92.6 bits (228), Expect = 8e-17, Method: Composition-based stats. Identities = 51/247 (20%), Positives = 86/247 (34%), Gaps = 53/247 (21%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 + + P + S K K+ G+D++M +DVS SM ++L Sbjct: 68 AFLITAMARPRTVDVSTKT--KTTKGIDIVMAIDVSASML-ARDLRPNRLEALK------ 118 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPG 258 D+ R G+V ++ + P+ + +N + F T G Sbjct: 119 -DVASEFIQGRPNDRVGIVLYAGESYTKTPITSDKSIVLGALNDVKFSEVLENGTAIGMG 177 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L + N++ D+K K II LTDG N+S ID K + A G Sbjct: 178 LATSVNRLKDSK-----------ALSKVIILLTDGVNNSGTIDPKLASEL---AVEYGIK 223 Query: 319 VYAIGVQAEA-----------------------ADQFLKNCA--SPDRFYSVQNSRKLHD 353 Y IG+ + + LK A + +++ N++KL Sbjct: 224 TYTIGIGSNGMALSPIGIKSNGQFQYGNQKVEIDEDLLKQIATVTGGQYFRATNNQKLEA 283 Query: 354 AFLRIGK 360 + I K Sbjct: 284 IYEEINK 290 >gi|304407684|ref|ZP_07389335.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304343167|gb|EFM09010.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 966 Score = 92.6 bits (228), Expect = 9e-17, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 85/221 (38%), Gaps = 25/221 (11%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + + + + D+++++D S SM + G DK+ A + + +D++ Sbjct: 54 VTLNVTGIPPANVVVPNDVVLIIDKSGSMAPSYNNGEDKMLNAKEAAKGFVDLMDLTKH- 112 Query: 213 NNVVRSGLVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R +V F SS ++ P + I+ + +T + ++ A + + + Sbjct: 113 ----RVAIVDFSSSNMIGNLPFTTNPTEAKNYIDTINANGSTATGDAIDSAIALLANHRP 168 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE-SLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + I+ +TDG+ + P+ D + AK G I Y I + D Sbjct: 169 ----------EAQPVIVIMTDGDATQPSTDPYGYAKQKALLAKDNGIIFYTIALLKSTDD 218 Query: 331 Q-------FLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMV 363 LK A + D + V S L + I KE+ Sbjct: 219 PVTSGPNILLKEMATTSDHHHFVLGSTGLSQIYAAIVKEIG 259 >gi|284046349|ref|YP_003396689.1| von Willebrand factor A [Conexibacter woesei DSM 14684] gi|283950570|gb|ADB53314.1| von Willebrand factor type A [Conexibacter woesei DSM 14684] Length = 319 Score = 92.6 bits (228), Expect = 9e-17, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 73/217 (33%), Gaps = 38/217 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +V DVS SM +++ A R+ R +D + V G+++F++ Sbjct: 87 SIALVTDVSGSMLATDVQP-NRMIAAKRAARRFVDEVP------RTVNLGVISFNNTATV 139 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + I+RL T + + A + + + I+ Sbjct: 140 LQSPTRNRSDVLTAIDRLAVSGGTATGEAIATATEMLRNQPGE------NGRRPPSAIVL 193 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD------------------- 330 ++DG + + ++ + EA+R +Y + + Sbjct: 194 ISDG----TSTNGRDPIEAAAEARRLRIPIYTVAFGTDQGTITVPGRDGVERTERVPPDP 249 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 L A + ++ ++ +L F R+G ++ + Sbjct: 250 TALAQIAEMTGGETFTADSADRLDTVFERLGSQLGTR 286 >gi|194334883|ref|YP_002016743.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194312701|gb|ACF47096.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 327 Score = 92.6 bits (228), Expect = 9e-17, Method: Composition-based stats. Identities = 42/237 (17%), Positives = 82/237 (34%), Gaps = 40/237 (16%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 C + P+ ++S G+D+M+ LD+S SM G +L A Sbjct: 71 CGVVLLVIALARPIFSE---RLSPGEAKGIDIMLALDISKSMLQEDFDGKSRLDAAKTVA 127 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGSTTKS 255 + ++ R GLV F K PL + + + S T + Sbjct: 128 LQFIEN-------RRRDRIGLVLFKGKSFTQCPLTLDHDVLSMLVRAASVDAVPESGTAT 180 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + A N++ + + ++ +I LTDGE+++ +D + A Sbjct: 181 GSAILIAVNRL-----------RASESPERVLILLTDGEHNAGEVDP---VTAAGIAAGE 226 Query: 316 GAIVY--AIGVQAEAADQFLKNCA----------SPDRFYSVQNSRKLHDAFLRIGK 360 G +Y + V + + + A + R + ++ L+ F I + Sbjct: 227 GVRIYMATVSVPGSRSGEDMLASARDLSGEVSRITGGRSFRANDANSLNRTFSEIDQ 283 >gi|75675889|ref|YP_318310.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] gi|74420759|gb|ABA04958.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] Length = 605 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 68/152 (44%), Gaps = 16/152 (10%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIF 289 P++ ++ +I+ + +T GL + + + AK Y+ Y++ Sbjct: 453 TPVSSQSSTLKNQIDSMSPSGSTNQAIGLAWGWQTLSTTNGPFPAPAKDKAYVYQDYLVL 512 Query: 290 LTDGENS-----------SPNIDNKESLFYCNEAKRRGAIVYAIGVQA---EAADQFLKN 335 L+DG N+ SP +D +++L C + K G +++ + V + Q L++ Sbjct: 513 LSDGLNTRNRWSGNGSDHSPEVDVRQALL-CQKVKDSGTVIFTVQVNVGNRDPLSQVLQD 571 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 CAS F + ++ + DAF I ++ + RI Sbjct: 572 CASNGNFQMITSANQTADAFQNILTQISQLRI 603 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 77/227 (33%), Gaps = 27/227 (11%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 I F + + +I+ + AI L + +G ++ + ++ + +D ++L + Sbjct: 14 ISRFDRDIRANIAPIFAIALLPVLGFVGAAVDYTRANAARSSMQAAMDSAVL------MV 67 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + ++ + N D + S S + Sbjct: 68 SRDAAANPAMTSQQITDAVQRYF----------NSLYNDKSAFNVSVSAAYTPSTSSAAA 117 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + A + + F P L + S+ + + + +VLD + SM D+ Sbjct: 118 KILASGQGAIETDFMKIAG------FPQLSFGTSSTSTWGNSRMRVALVLDNTGSMRDN- 170 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 K+ R+ ++M+D + + + V ++ F+ + Sbjct: 171 ----GKMAALQRAAKDMIDSLSAFAKTADDVYISIIPFAKDVNVDKS 213 >gi|116753518|ref|YP_842636.1| von Willebrand factor, type A [Methanosaeta thermophila PT] gi|116664969|gb|ABK13996.1| von Willebrand factor, type A [Methanosaeta thermophila PT] Length = 795 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 81/201 (40%), Gaps = 27/201 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D+++ +D S SM PG + A + + + R G+V++++ Sbjct: 62 SPVDVVLSIDSSGSMTTS-DPGDLRKSAAKEFVTGLDLSMD---------RVGVVSWNTS 111 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + ++PL + I+ I+ T GL+ A + + + K Sbjct: 112 AI-SWPLTNNTKDIESAIDSTGADGNTCLDTGLKSAIDLLSECSG------------SKV 158 Query: 287 IIFLTDGENSSPNIDNKESL--FYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 I+ LTDG ++ + +EA+ +G +V+ IG+ +A + L A + F Sbjct: 159 IVLLTDGISTDGGHYTPPGVPGSPVDEARSKGILVFTIGLGPDADARNLTEIAHSTGGEF 218 Query: 343 YSVQNSRKLHDAFLRIGKEMV 363 YS ++ L + RI + Sbjct: 219 YSAPDANALAGIYKRIRSSIT 239 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 32/143 (22%), Positives = 59/143 (41%), Gaps = 16/143 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + D++ V+D S SM D G+ + ++ EML++ + P++ N R +V+ Sbjct: 393 PEKSCSADILFVIDKSGSMRDLDKSGLKNYEIMQAALYEMLNMASNTPELRNA-RIAIVS 451 Query: 223 FS-------SKIVQTFP--LAWGVQHIQEKINRLIFGS--TTKST---PGLEYAYNKIFD 268 + +I P LA G I+ I + + T T GL+ A + Sbjct: 452 WDDLDGTDEDRITTLDPQWLAVGDPRIKATIQQYNEETCKETDLTFYETGLQKAMQIMHS 511 Query: 269 AKEKLEHIAKGHDDYKKYIIFLT 291 + D +++IIF+T Sbjct: 512 RILSQANDPL-SCDTRRFIIFIT 533 >gi|260061451|ref|YP_003194531.1| aerotolerance operon BatA [Robiginitalea biformata HTCC2501] gi|88785583|gb|EAR16752.1| BatA (Bacteroides aerotolerance operon) [Robiginitalea biformata HTCC2501] Length = 333 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 83/244 (34%), Gaps = 54/244 (22%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLD 204 + T + +K+ G+D++M +DVS SM D + L + Sbjct: 70 VITAMARPQTQDISTRTKTTKGIDIVMAIDVSSSMLARDLRPNRLSALKEVAA------E 123 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEY 261 I+ P+ R GLV ++ + P+ + + + +G T GL Sbjct: 124 FIRKRPN----DRIGLVAYAGESYTKTPITSDKSIVLGALREITYGQLNDGTAIGMGLAT 179 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + N++ K K II LTDG N++ I+ + + + A G Y Sbjct: 180 SVNRL-----------KESTAISKVIILLTDGVNNAGFIEPQTA---ADLALEYGIKTYT 225 Query: 322 IGVQAEAAD-----------------------QFLKNC--ASPDRFYSVQNSRKLHDAFL 356 IG+ + L + A+ ++ ++ KL + Sbjct: 226 IGLGTNGNALSPIGYNPDGSFRYGMRQVEIDEELLTDIATATGGEYFRATDNEKLEAIYE 285 Query: 357 RIGK 360 I K Sbjct: 286 EINK 289 >gi|83859216|ref|ZP_00952737.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] gi|83852663|gb|EAP90516.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] Length = 441 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 72/449 (16%), Positives = 154/449 (34%), Gaps = 120/449 (26%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R+F + +G+++++ A+LL + + +G ++ S F + A++ +D L A+ + Sbjct: 24 LRHFTQDVRGNVAMMFAMLLGPLVVSVGGALDYSRTFTIGAEIQSAMDAGTLAAAS-LSQ 82 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 E+ + I + L E+ + N++ S+ L+I + D Sbjct: 83 GED---------------PETIVRNYITAALSEHNGVLERLNVQVSSDLAINSREVTADA 127 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +S + + + A + + V +++ L++ +VLD+S SM+ Sbjct: 128 VISVPT-----LMLGIIGYDALTLNR-------VSEANERVRNLEISLVLDISGSMS--- 172 Query: 186 GPGMDKLGVATRSIREML----------------------------DIIKSIPDVNNVVR 217 G + L A ++ D++ P+ + + Sbjct: 173 GSKITALRDAAEEFVGVMMDPDLEGLTSLSVIPYNGGVRLPQTVTNDLVPGTPNDSGCLE 232 Query: 218 SGL---VTFSSKIVQTFPLAW------GVQH--------------------IQEKINRLI 248 G+ VT L W G + + I L Sbjct: 233 LGVSDPVTMDLAANGYDWLDWQDRDQRGWRSSAFCPEENEATVFLEQTPSVLVNLIRDLD 292 Query: 249 FGSTTKSTPGLEYAYNKIFDA-----KEKLEHIAKGHDDYK--KYIIFLTDG-------- 293 G T + + A +DD K ++ +TDG Sbjct: 293 AGGNTGLDVATAWGARALDPAWRGRLGGDFASRPAAYDDPSTMKVLVVMTDGAATAQIRR 352 Query: 294 -ENSSPNIDNKESLFYCNEAKRR-----------GAIVYAIGVQAEA--ADQFLKNCAS- 338 +N + + E ++ ++A+ G +Y I Q +++CAS Sbjct: 353 AQNWYGDWYSYE-IYSASQARDNMADACDAAEAEGVHIYTIAFQVSGSTNRNLMRDCASR 411 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 P+ +Y+V+N + AF I ++ R+ Sbjct: 412 PENYYAVENL-DISAAFNSIAADLNNLRL 439 >gi|83955719|ref|ZP_00964299.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] gi|83840013|gb|EAP79189.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] Length = 480 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 62/453 (13%), Positives = 145/453 (32%), Gaps = 91/453 (20%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA-TKI 63 + F G ++IL ++ ++ V G+ ++ +++L + D ++L A Sbjct: 27 RLTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAADLDQ 86 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTD---FRNELRENGFAQDINNIERSTSLSIIIDD 120 + + N+ D FR + D I R ++ + Sbjct: 87 MRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASKNMDTQFIGRFGFPTLEVPA 146 Query: 121 QHKDYNLSAVSRYEMPFIFC------------------TFPWCANSSHAPLLITSSVKIS 162 + A + + L+ S V S Sbjct: 147 HSQAEERVAKVEISLVLDISGSMATNNRLGEVQNAADIFLDTVLKDENQDLISVSLVPYS 206 Query: 163 SKSDIGLDMMMVLDVSLSMNDHF-----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + G +M ++V+ + D + + + + + + D N R Sbjct: 207 EQVNAGPLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTRYNQMQHFQWNYDGRNNYR 266 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V + P + + ++ +I+ L+ + T G+++A + A + + Sbjct: 267 DDTVCPRYDYERITPFSQNKRTLKNQIDDLVPRAGTSIFLGMKWAAAMLDPAFRDINNSL 326 Query: 278 KG----------------HDDYKKYIIFLTDGEN--------SSPNIDNK---------- 303 + K +I +TDG N + + D++ Sbjct: 327 VNAGYVDREFYNRPASYTDSETLKTVILMTDGANDNSYRIRSNYYDSDSEYVHWNKYNLW 386 Query: 304 -------------------------ESLF--YCNEAKRRGAIVYAIGVQAEAAD-QFLKN 335 +L C+ AK + ++++IG + + D +++ Sbjct: 387 WYLRREVDSRYWGYFYYHKYNKTLGNTLLSNICDAAKAKRIVIWSIGFEVDDEDVPAMQD 446 Query: 336 CA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 CA SP F+ V+ +L +AF I +++ + R+ Sbjct: 447 CASSPSHFFRVEGV-ELSEAFRAIARQINQLRL 478 >gi|163734461|ref|ZP_02141901.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] gi|161392469|gb|EDQ16798.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] Length = 327 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 84/224 (37%), Gaps = 25/224 (11%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + P + + ++ S D+++ +D+S SM+ D + + + Sbjct: 72 LLVIAISQPERLGAPIET---SKSARDLILAIDISGSMDTR--DFTDASNENVQRLAGVR 126 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D++++ + R L+ F SK PL I E +++ + G+ + Sbjct: 127 DVVRAFVEGREGDRMALIVFGSKAYLQSPLTEDTGTIVELLDQ--------TEVGMAGPH 178 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 I DA + + ++ +I L+DG +++ + L A+ G ++ I Sbjct: 179 TAIGDAIGLSIRTFEASEIEQRLLILLSDGADTASRMSP---LNAAEIARGAGVEIFTIA 235 Query: 324 VQAEAA-------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 V A L++ A + ++ + L + + RI Sbjct: 236 VGDPDATGENRVDVAALQDIANRTSGSYFFAADQAALDEIYARI 279 >gi|86143679|ref|ZP_01062055.1| batA protein [Leeuwenhoekiella blandensis MED217] gi|85829722|gb|EAQ48184.1| batA protein [Leeuwenhoekiella blandensis MED217] Length = 334 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 42/255 (16%), Positives = 82/255 (32%), Gaps = 57/255 (22%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGV 194 + P + S + + G+D+++ +DVS SM D ++ L Sbjct: 62 LRLLALACLITALARPRNVDVSTRT--NTTRGIDIVIAIDVSASMLARDLKPNRLEALKE 119 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---- 250 R GLV ++ + P+ + +N + + Sbjct: 120 VASQFIA----------DRPSDRIGLVEYAGESYTRTPITSDKSIVLSSLNDIQYNSIIE 169 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T GL + N++ D++ K K II +TDG N++ I+ + Sbjct: 170 GGTAIGMGLATSVNRLKDSRAK-----------SKVIILMTDGVNNAGFIEPSTASEL-- 216 Query: 311 EAKRRGAIVYAIGVQAEA-----------------------ADQFLKNCA--SPDRFYSV 345 A+ G VY IG+ + L+ A + ++ Sbjct: 217 -AQEFGIKVYTIGLGTNGTALSPVALRPDGSFQYGSIPVEIDEALLQEIADKTGGLYFRA 275 Query: 346 QNSRKLHDAFLRIGK 360 ++ L + + I K Sbjct: 276 TDNESLEEIYAEINK 290 >gi|329928736|ref|ZP_08282585.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] gi|328937517|gb|EGG33935.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] Length = 964 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 84/213 (39%), Gaps = 33/213 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 S + D+++++D S SM +++ A + + +D++ + G Sbjct: 60 TPPSNVIMPNDVILIIDKSGSMQTD-----NRINAAKNAAKGFIDLMDMTKH-----QVG 109 Query: 220 LVTFSSKIVQTF-PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +V +SS + PL ++ I+ ++ T++ ++ A + + Sbjct: 110 IVGYSSVAETSSLPLTTDTAAAKQFIDPIVASGGTETGYAIDQAITLLSSHRP------- 162 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------DQ 331 + + I+ +TDGE +S ++ +L AK G + Y I + ++ Sbjct: 163 ---EAQPVIVIMTDGEANS----SQAALERAQAAKDAGIVFYTIALLGPNDNPDTSAPNE 215 Query: 332 FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMV 363 LK A+ + + V S L + + I E+ Sbjct: 216 LLKQMATTNSHHHFVLGSTGLAEIYAAIVAEIG 248 >gi|119946440|ref|YP_944120.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] gi|119865044|gb|ABM04521.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] Length = 327 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 93/230 (40%), Gaps = 37/230 (16%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P+ +++ +S DM++ LD+S SM + P + + + D++K+ Sbjct: 71 KPVWFGDPIRLQQQSR---DMIISLDLSGSMQEVDMPLNGQ---TVDRLTLLKDLLKTFI 124 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 R GL+ F+ PL + ++ IQ+ ++ +S GL I ++ Sbjct: 125 KQRQGDRLGLILFADHAYLQTPLTFDLKTIQQMVD--------ESEIGLAGTRTAIGESI 176 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-- 328 + + ++ +I ++DG N+S +I+ + +A + +Y IG+ AE Sbjct: 177 AMAIKRFVENKNEQRVLILVSDGANNSGSIEP---IQAAKQAAKNNITIYTIGMGAEQMI 233 Query: 329 -----ADQFLKNCA-------------SPDRFYSVQNSRKLHDAFLRIGK 360 +Q + A + +++ +N +L + + + K Sbjct: 234 KRGLFGNQRINPSADLDEKTLTEIANLTGGKYFRARNQTELQNIYQTLNK 283 >gi|288956977|ref|YP_003447318.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] gi|288909285|dbj|BAI70774.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] Length = 456 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 61/448 (13%), Positives = 147/448 (32%), Gaps = 107/448 (23%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 + +GS++I+ A+ V+ ++G+ I+ + FV ++++Y D + L + + + Sbjct: 23 ADRRGSVAIMVALSFLVLLGMLGVAIDFARAQFVSSRIYYAADAATLAVSRENFQVSTND 82 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 K + G ++ +TS + + ++ Sbjct: 83 QLKALAQSYFDANFP-------------PGTMGATTSLSVATS---GTPPTVQGFTVTVT 126 Query: 131 SRYEM---PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS----LSMND 183 + + P + S + + + G+++++VLD S S D Sbjct: 127 ATLPLVFAPLVETLGGPTIGSVGISKASGAVFTTQTSNQGGMELVIVLDNSASMKGSQED 186 Query: 184 HFGPGMDKLGVA-----------------TRSIREMLDIIKSIPDVNNVVRSGL------ 220 G L + + ++ + +K+ D+ V G+ Sbjct: 187 LRGGVKALLDMLYGNADTRKNLYVGIVHYSGAVNVLQSALKNKADIVAPVVGGMANCPMA 246 Query: 221 -----------------------------VTF--SSKIVQTFPLAWGVQHIQEKINRLIF 249 + + +S + + L+ + I + Sbjct: 247 TVNGKLNGSRLSNAPPKTFKFDSTTDGVEIQYCGASTLGTSSALSPNRGDADKAIKSYVA 306 Query: 250 GSTTKSTPGLEYAYNKIFDA-KEKLEHIAKGHDDY---------KKYIIFLTDGEN---- 295 G T GL + + + + + + KK ++ +TDG N Sbjct: 307 GGDTLIGEGLVWGWRMLTPSWRGLWNTKDQPGASLPLDYDLPYMKKVLVLMTDGVNHIAG 366 Query: 296 ---SSPNIDNKESL-----------FYCNEAKRR-GAIVYAIGVQAEAADQFLKNCAS-P 339 ++ D +++ CN AK+ ++Y I ++ +Q + +CAS P Sbjct: 367 RNYTAYYSDPYQTVADASKADADLMTICNAAKKDHNVVLYTITYGSDTDEQQMSDCASDP 426 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + Y + L AF ++G ++ ++ Sbjct: 427 SKHYHAALPQDLAKAFTQVGTDLTTMKL 454 >gi|119358220|ref|YP_912864.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355569|gb|ABL66440.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 344 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 76/205 (37%), Gaps = 18/205 (8%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 +PF + + +++ G+D+++VLDVS SM+ G +L Sbjct: 69 RLPFFLRAAVLLLSVFAMAQPRIVQRQTVAETR-GIDLLLVLDVSRSMHQQDFNGQSRLE 127 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 K + R GLV FS K PL + ++ + Sbjct: 128 AVKGVG-------KQFVLSRSADRIGLVVFSGKGYTPCPLTLDHLTLGTVLDNISSEVIQ 180 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + + A + + + +K II LTDG+N++ +ID L A Sbjct: 181 EEGTAIGTAILI-------AVNRLRASESRQKAIILLTDGQNNAGDIDP---LTAAGFAL 230 Query: 314 RRGAIVYAIGVQAEAADQFLKNCAS 338 + G +Y I A+ A F+++ S Sbjct: 231 QDGIKIYTIAATAQDARPFVRSAES 255 >gi|256376278|ref|YP_003099938.1| hypothetical protein Amir_2147 [Actinosynnema mirum DSM 43827] gi|255920581|gb|ACU36092.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 321 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 42/253 (16%), Positives = 86/253 (33%), Gaps = 39/253 (15%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHFG 186 F+ F + P T+ K+ +M+V+DVSLSM D Sbjct: 55 WYRHVPAAFLMVAFMLLTVALAGP---TAEQKVPRNRAT---VMLVIDVSLSMKATDVQP 108 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 ++ VA +S E L + GL++F+ + + I+ Sbjct: 109 TRLEAAQVAAKSFAEGLTP---------GINLGLISFAGSATVLVAPTTDRSAVSQGIDG 159 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 L +T + + A + I + + ++ +TDG+ + +++ Sbjct: 160 LKLAQSTATGDAIVAALSAIDSFGKVVGGADGP---PPARVVLMTDGKETVG---TRKAT 213 Query: 307 FYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRK 350 +AK G + I E D+ +K A S F+ ++ + Sbjct: 214 DAAGDAKEAGIPISTISFGTERGSVDINGKAQEVPVDDESMKEIAKISGGEFFKAASAEE 273 Query: 351 LHDAFLRIGKEMV 363 L + +G+++ Sbjct: 274 LRRVYDTLGEQIG 286 >gi|323138519|ref|ZP_08073587.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] gi|322396153|gb|EFX98686.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] Length = 458 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 59/443 (13%), Positives = 137/443 (30%), Gaps = 85/443 (19%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 +RNF N +GSI+++ + L +F++MG ++ + V+++L+++ D + L Sbjct: 18 RLRNFRANERGSIAMIFGLALIPMFMMMGAAVDYTQAVTVRSRLNHLADRAALAAVKAAA 77 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST--SLSIIIDDQH 122 +E+ N+ S + + + + + + R +I + Sbjct: 78 QKESDCVANPAGNNVSNFQGCGQ-KDIIKAGVAAGVQYMNGDPLMRGADRKPTIELSSSE 136 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 ++ + ++P P+ + I+ + + L+ ++LD S+SM Sbjct: 137 GSWSATVNYSADIPTNIARLM---GVQTIPVNGKVTSNIALGTHMYLNFHLLLDRSMSMG 193 Query: 183 DHFGP------------------------------------GMDKLGVATRSIREMLDII 206 +D L AT ++ ++ Sbjct: 194 IGATSDDISRLQALTGCAFACHSEGYEAQYYDQPKAQGIRFRIDDLRDATGALVAQAKMV 253 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 S + ++ G+ F+ + + + ++ + L + T A + Sbjct: 254 ASA-NAREHIQMGVYAFNHHVSPLVEMTSDLTNVANAVKNLDLPTHDDGTQA-ADAVTWL 311 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE---------------------- 304 K K + + +TDG + Sbjct: 312 VANKIKGNGTGLTSAAPLEIVFLVTDGVEDGIYTGWNKMVGPTGLPLPWWPSWMTKAPTS 371 Query: 305 --SLFYCNEAKRRGAI---VYAIGVQAEAADQF--------------LKNCASPDRFYSV 345 + C+ K +GAI VY V Q+ L+ CAS F++ Sbjct: 372 AFPVTACDALKSKGAIVAVVYTTYVPFPGTVQYDRLIGPFAPNISPNLQGCASQGYFFTA 431 Query: 346 QNSRKLHDAFLRIGKEMVKQRIL 368 + + +++ L Sbjct: 432 SEPGDITRGMQSLFNRALQELAL 454 >gi|188580059|ref|YP_001923504.1| von Willebrand factor type A [Methylobacterium populi BJ001] gi|179343557|gb|ACB78969.1| von Willebrand factor type A [Methylobacterium populi BJ001] Length = 339 Score = 91.8 bits (226), Expect = 1e-16, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 70/213 (32%), Gaps = 31/213 (14%) Query: 167 IGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++++ LD+S SM G + +L R E + R GLV Sbjct: 96 SAREIVLALDLSGSMERKDFSLDGETVSRLAAVKRVGAEFI-------RRRAGDRIGLVE 148 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F+ + + + + G +T GL A ++ A+ Sbjct: 149 FADQAYVAAAPTFDTAAVARTLEEATIGLVGRSTGIGDGLGLALKRLAPAQLADAEGGGP 208 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV----QAEA------- 328 K ++ L+DG N++ K+ AK G VY I + A+ Sbjct: 209 PPSRDKVVVLLSDGANNAGQTAPKD---VAALAKDLGVRVYTIALGPIDMADNPNNEQDV 265 Query: 329 -ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + L+ A S R + V+ + L I Sbjct: 266 VDVETLRAMAETSGGRAFRVKTTDDLESVAAAI 298 >gi|302870768|ref|YP_003839404.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] gi|302573627|gb|ADL41418.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] Length = 900 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 52/198 (26%), Positives = 82/198 (41%), Gaps = 26/198 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +D+++VLD S SM D G+ KL +A + +M++ ++S V G++ F Sbjct: 401 EKEKNIDVVLVLDHSGSMADTEDAGISKLEIAKSASAKMIEHLESSDGV------GVIAF 454 Query: 224 SSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + V+ + E I+ + G T P L A + +K K + Sbjct: 455 DHNYYWAYEFSKLVRKKDVIESISSIEVGGGTAIIPPLSEAVKTLKKSKAKSKL------ 508 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--P 339 I+ LTDG + NEAKR + IGV L AS Sbjct: 509 -----IVLLTDGMGEQGGYE-----IPANEAKRNNIKITTIGVGKFVNLPVLSWIASFTS 558 Query: 340 DRFYSVQNSRKLHDAFLR 357 RFY V N +L D FL+ Sbjct: 559 GRFYLVSNPYELVDVFLK 576 >gi|149918750|ref|ZP_01907237.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] gi|149820351|gb|EDM79767.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] Length = 350 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 84/216 (38%), Gaps = 37/216 (17%) Query: 168 GLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+D+++ LD+S SM++ G G+D+L VA +I R LV F Sbjct: 120 GIDIVIALDLSDSMSNPMDGRRGLGLDRLTVAK-------QVIDEFIRRRPHDRIALVGF 172 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + PL ++ I ++ G T GL + N++ K Sbjct: 173 GAHASTIAPLTLDHAVLRNLIVQVRLGVVDGQETAIGAGLGVSLNRL-----------KE 221 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QF 332 K I+ LTDG +++ +D A RG ++Y + + + D Q Sbjct: 222 SQAATKIIVLLTDGVHNADGMDPD---TVAQTAAERGVVIYTVLMGQQTGDRSSVDAGQL 278 Query: 333 LKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + A+ Y ++++ L +F + ++ K I Sbjct: 279 ERLAGATDGYAYLAEDTQTLETSFQDLLDKLEKSSI 314 >gi|262202333|ref|YP_003273541.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262085680|gb|ACY21648.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 325 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 81/211 (38%), Gaps = 28/211 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+DVS SMN ++ A + ++ D+ + GL++F+ Sbjct: 90 VILVMDVSRSMNATDVAP-SRIRAAQSAAKKF------ADDLTEGINLGLISFAGTPSTL 142 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ +++L+ T + G+ A ++I + + +I+ L Sbjct: 143 VSPTPDHTATKKAVDKLVLADKTATGEGIFAALDQI---RTLNAVLGGPEAAPPAHIVLL 199 Query: 291 TDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P+ D + + +AK G V I D LK Sbjct: 200 SDGKQTVPDEPTDPRGAFTAARKAKEEGIPVSTISFGTAYGTVELDGDRVPVPVDDPSLK 259 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 A S F++ + +L++ + ++ E+ Sbjct: 260 QIANLSGGNFFTASSLDELNEVYEKLQSEIG 290 >gi|255037594|ref|YP_003088215.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254950350|gb|ACT95050.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 339 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 45/214 (21%), Positives = 78/214 (36%), Gaps = 43/214 (20%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+M++LD+S SM + ++L A R R+ + R GL+ F+ + Sbjct: 102 GIDIMLLLDISDSMIEKDLSP-NRLEAAKRMARQFI-------KGRLQDRIGLIVFAGEA 153 Query: 228 VQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V PL + + ++ LI T L A N++ D + Sbjct: 154 VSLCPLTTDYELLYGFLDEVTPSLIPTPGTAIGSALAVAVNRMRDTAGE----------- 202 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---- 339 K I ++DG+N+S N+ S N G VY I V + AS Sbjct: 203 SKVAILISDGDNTSGNLGPTTSAQLAN---AFGVKVYTISVGKPKSASKADTTASAGALM 259 Query: 340 -------------DRFYSVQNSRKLHDAFLRIGK 360 +++ ++ L F +I + Sbjct: 260 DEGELQNIAGIGNGKYFRATDNTALESVFKQIDQ 293 >gi|213963729|ref|ZP_03391979.1| BatA protein [Capnocytophaga sputigena Capno] gi|213953609|gb|EEB64941.1| BatA protein [Capnocytophaga sputigena Capno] Length = 333 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 55/254 (21%), Positives = 84/254 (33%), Gaps = 54/254 (21%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGV 194 F F F T S +K G+D++M +DVS SM D + L Sbjct: 60 FAFRLFAVTLIIIALARPQTHSENAQTKITDGIDIVMAIDVSSSMLSQDLKPNRFEALKK 119 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--- 251 +K P+ R GLV ++ + P+ I + L +G Sbjct: 120 VASQF------VKDRPN----DRIGLVVYAGESYTKTPVTTDKGIILSSLAELTYGQVED 169 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GL A N++ K + II LTDG N++ ID L Sbjct: 170 GTAIGMGLATAVNRL-----------KESKAKSRVIILLTDGVNNTGVIDP---LIAAEL 215 Query: 312 AKRRGAIVYAIGVQAEA-----------------------ADQFLKNCA--SPDRFYSVQ 346 A G VY +G+ + +K A + R++ Sbjct: 216 AAEYGIKVYTVGIGTNGMALSPYALNPDGSIMYRMLQVEIDESLMKKIAQVTHGRYFRAT 275 Query: 347 NSRKLHDAFLRIGK 360 N++KL + I K Sbjct: 276 NNQKLQQIYDEINK 289 >gi|222528098|ref|YP_002571980.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222454945|gb|ACM59207.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 902 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 51/198 (25%), Positives = 80/198 (40%), Gaps = 26/198 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +D+++VLD S SM D G+ KL +A + +M++ ++S V G++ F Sbjct: 401 EKEKNIDVVLVLDHSGSMADTEDAGIPKLEIAKSASAKMIEHLESSDGV------GVIAF 454 Query: 224 SSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + + E I+ + G T P L A + +K K + Sbjct: 455 DHNYYWAYKFGKISKKEDVIESISSIEVGGGTAIIPPLSEAVKTLKKSKAKSKL------ 508 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--P 339 I+ LTDG + NEAKR + IGV L AS Sbjct: 509 -----IVLLTDGMGEQGGYE-----IPANEAKRNNIKITTIGVGKYVNATVLSWIASFTS 558 Query: 340 DRFYSVQNSRKLHDAFLR 357 RFY V N +L D FL+ Sbjct: 559 GRFYLVSNPSELVDVFLK 576 >gi|78776847|ref|YP_393162.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497387|gb|ABB43927.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 307 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 45/231 (19%), Positives = 85/231 (36%), Gaps = 26/231 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F + + +P+L+ ++ + G D+++ +D S SMN + + Sbjct: 53 LKIAIFVLLSIALSSPILVD---RVDPLNRNGKDIVLAIDASGSMNSTGFDFEGEAALPQ 109 Query: 197 RSIREMLDII--KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGS 251 + R + I G+V + P+ + I +N+ + G Sbjct: 110 KLSRFEIAKIVASEFIQKRLSDNVGIVLYGDFAFIASPITYEKNIIIEMLSYLNQGMAGQ 169 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + K K ++ LTDGE++S +I K++L Sbjct: 170 NTAIGEAIA-----------MSLRAFKHSKAKSKIVVLLTDGEHNSGDISPKDALVL--- 215 Query: 312 AKRRGAIVYAIGVQ--AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 AK +Y IG+ EA + LK A S F+ N+++L + + I Sbjct: 216 AKEENIKIYTIGMGNRGEADEALLKKIADESGGEFFYATNAKELKEIYEHI 266 >gi|290769676|gb|ADD61455.1| putative protein [uncultured organism] Length = 816 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 55/375 (14%), Positives = 113/375 (30%), Gaps = 89/375 (23%) Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 + A +L G + +D + ++ + + + N + Sbjct: 24 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPD-DPT 82 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + + K + Y + + V + LD+++VL Sbjct: 83 LSAPAREKTVTANEDGTYTVALN--------VTGAKSAGTGEIVT-----NQPLDIVLVL 129 Query: 176 DVSLSMNDHFGPG---MDKLGVATRSIREMLDIIK----SIPDVNNVVRSGLVTFSS--- 225 DVS SM + G K+ ++ + ++ I D + R LV F+ Sbjct: 130 DVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGTEK 189 Query: 226 ----------------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 L + V + +N L T + A + Sbjct: 190 TSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFNRAQAALT-- 247 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEA---KRRGAIVYAIGVQ 325 + + KK +IF TDGE + + D + N+A K G +Y+IGV Sbjct: 248 -------YQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVV 300 Query: 326 AEAA--------DQFLKNCAS----------------------------PDRFYSVQNSR 349 + A ++++ +S + + ++ Sbjct: 301 SGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKAATDAG 360 Query: 350 KLHDAFLRIGKEMVK 364 +L++ F I +E+ K Sbjct: 361 QLNNIFESIYQEITK 375 >gi|145224243|ref|YP_001134921.1| hypothetical protein Mflv_3659 [Mycobacterium gilvum PYR-GCK] gi|189040172|sp|A4T9I4|Y3659_MYCGI RecName: Full=UPF0353 protein Mflv_3659 gi|145216729|gb|ABP46133.1| von Willebrand factor, type A [Mycobacterium gilvum PYR-GCK] Length = 335 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 77/215 (35%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM ++L A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVAP-NRLTAAQEAAKQFADQLTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + I++L T + G+ A I I G + ++ + Sbjct: 152 VSPTTNRESTKTAIDKLQLADRTATGEGIFTALQAIATVG---AVIGGGDEPPPARVVLM 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P+ + K + AK +G + + D+ LK Sbjct: 209 SDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPVDDEMLK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S ++ + +L F + +++ + I Sbjct: 269 KIADLSGGEAFTASSLEQLKQVFTNLQEQIGYETI 303 >gi|239620965|ref|ZP_04663996.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239516066|gb|EEQ55933.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 816 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 55/375 (14%), Positives = 113/375 (30%), Gaps = 89/375 (23%) Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 + A +L G + +D + ++ + + + N + Sbjct: 24 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPD-DPT 82 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + + K + Y + + V + LD+++VL Sbjct: 83 LSAPAREKTVTANEDGTYTVALN--------VTGAKSAGTGEIVT-----NQPLDIVLVL 129 Query: 176 DVSLSMNDHFGPG---MDKLGVATRSIREMLDIIK----SIPDVNNVVRSGLVTFSS--- 225 DVS SM + G K+ ++ + ++ I D + R LV F+ Sbjct: 130 DVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGTEK 189 Query: 226 ----------------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 L + V + +N L T + A + Sbjct: 190 TSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFNRAQAALT-- 247 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEA---KRRGAIVYAIGVQ 325 + + KK +IF TDGE + + D + N+A K G +Y+IGV Sbjct: 248 -------YQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVV 300 Query: 326 AEAA--------DQFLKNCAS----------------------------PDRFYSVQNSR 349 + A ++++ +S + + ++ Sbjct: 301 SGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKAATDAG 360 Query: 350 KLHDAFLRIGKEMVK 364 +L++ F I +E+ K Sbjct: 361 QLNNIFESIYQEITK 375 >gi|70730213|ref|YP_259952.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344512|gb|AAY92118.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 332 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 43/230 (18%), Positives = 85/230 (36%), Gaps = 35/230 (15%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDI 205 P+ + ++ + D+M+ +D+S SM D G +D+L ++ Sbjct: 77 RPVWVEPPLE---QQRPVRDLMLAIDLSQSMQTQDFNDANGQRIDRLSAVK-------EV 126 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 ++ R GL+ F S PL + + S G+ Sbjct: 127 VQGFIQRRKDDRLGLIVFGSGAFAQAPLTLDHASLSLLL--------EDSGIGMAGPNTA 178 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 I DA + + + +K +I LTDG ++S I + + A RG +++ IG+ Sbjct: 179 IGDAIGLALKLLEQAHEPEKVLILLTDGNDTSSAITPQHAAA---MAAARGVVIHTIGIG 235 Query: 326 ---AEAADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 AE + L+ A + R++ ++ L + + + Q Sbjct: 236 DPSAEGEAKVDLSALEQIARTTGGRYFRAEDRSALDQVYATLDRLTPHQV 285 >gi|315444579|ref|YP_004077458.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] gi|315262882|gb|ADT99623.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] Length = 335 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 81/242 (33%), Gaps = 28/242 (11%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + + + S +M+V+DVS SM ++L A + ++ Sbjct: 72 MILSLVSFTIAMAGPTHDVRISRNRAVVMLVIDVSQSMRATDVAP-NRLTAAQEAAKQFA 130 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D + + GL+ ++ + + I++L T + G+ A Sbjct: 131 DQLTP------GINLGLIAYAGTATVLVSPTTNRESTKTAIDKLQLADRTATGEGIFTAL 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYA 321 I I G + ++ ++DG+ + P+ + K + AK +G + Sbjct: 185 QAIATVG---AVIGGGDEPPPARVVLMSDGKETVPSNPDNPKGAYTAARTAKDQGVPIST 241 Query: 322 IGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + D+ LK A S ++ + +L F + +++ + Sbjct: 242 VSFGTPYGYVEINEQRQPVPVDDEMLKKIADLSGGEAFTASSLEQLKQVFTNLQEQIGYE 301 Query: 366 RI 367 I Sbjct: 302 TI 303 >gi|254459074|ref|ZP_05072497.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084345|gb|EDZ61634.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 279 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 46/207 (22%), Positives = 84/207 (40%), Gaps = 17/207 (8%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII--KSIPDVNNVVR 217 K+S + G D+++ +D S SMN D++ R R + I Sbjct: 45 KLSPNNRHGKDIVLAIDASGSMNSSGFDFEDEVSDGKRLSRFEITKIIASEFIQKRISDN 104 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G+V + P+ + + + + + L T G+ I +A Sbjct: 105 VGVVLYGDFAFIASPITYEKEIVTQMLGYL--------TQGMAGQNTAIGEAIAMGVRSF 156 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKN 335 K K I+ L+DGE++S ++ KE+ AK +G +Y I + EA + L+ Sbjct: 157 KHSKAKTKVIVLLSDGEHNSGSVSPKEATEL---AKEQGIKIYTIAMGNKGEADEALLET 213 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGK 360 A S F+S ++++L + + I K Sbjct: 214 IAKDSNGEFFSASSAKELKNIYDEIDK 240 >gi|209884898|ref|YP_002288755.1| hypothetical protein OCAR_5764 [Oligotropha carboxidovorans OM5] gi|209873094|gb|ACI92890.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 600 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 34/155 (21%), Positives = 60/155 (38%), Gaps = 19/155 (12%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-LEHIAKGHDDYKKYIIF 289 P++ + K+N + T GL + + + A + + Y+ YI+ Sbjct: 445 TPMSNQWATLNSKVNAMNPSGNTNQAIGLFWGWQTLNTANDPFKAPSKDPNWVYQDYIVI 504 Query: 290 LTDGENSS------------PNIDNKESLFYCNEAKRRGAIVYAIGVQA---EAADQFLK 334 L+DG N+ P ID +E C+ K ++ I V + Q LK Sbjct: 505 LSDGLNTQNRWYTCPNAGPCPTIDGRE-KTLCDNIKADKITIFTIQVNINSKDPESQVLK 563 Query: 335 NCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +CAS F + ++ AF + ++ K RI Sbjct: 564 DCASSGSGYFQLITSANDTATAFDNVLNKIAKLRI 598 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 34/265 (12%), Positives = 89/265 (33%), Gaps = 38/265 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + +G+++I+ ++ + ++G ++ + + + LD + L +++++ Sbjct: 16 FRKDARGNVAIIFTLVAIPLVALVGAAVDYTRVSSARTAMQSALDSAAL-----MISKDA 70 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + + + + +++ T ++ + + Sbjct: 71 ATMSDSEITTRARQYVNSLYTN---------------TETPIQTFSAVYTPNNGSGATIL 115 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + MP F N S P+ S+ K S + + +VLD + SM+ + G Sbjct: 116 LNAGGNMPTYFMKI-VGTNFSTLPINTASTTKWGS---SRMRVALVLDNTGSMDQN-GKM 170 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP------LAWGVQHIQE 242 A + ++ + + V +V F+ + L W Sbjct: 171 TALKKAAANATTGLIKKLSAFNTNEGDVYISVVPFAKDVNVGTSNVGASWLNWSEWEAAP 230 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIF 267 +I T ++ ++ YN I Sbjct: 231 RI-------LTDNSYPIKVKYNNIT 248 >gi|325678004|ref|ZP_08157643.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324110284|gb|EGC04461.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 812 Score = 91.0 bits (224), Expect = 2e-16, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 80/196 (40%), Gaps = 25/196 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + D S SM+ + G KL +I +D + ++ L++F + Sbjct: 242 PLAAAICYDCSGSMSGNDPKGYRKL-----AIDNFIDSMTLTD------KTALISFEDEA 290 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + + ++ +N FG T +E A ++ Y ++I Sbjct: 291 KLVSEFSDNKEELKGLVNPY-FGGGTNVRASVEMAIEQL----------NTVQHWYTRHI 339 Query: 288 IFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYS 344 I L+DG+ N + N+ N +A ++ IG+ + A +Q LK+CA + ++++ Sbjct: 340 ILLSDGDVNININLANNTVDDLIKKAVDNNIKIHTIGLGSGADNQKLKDCAEYTGGQYFT 399 Query: 345 VQNSRKLHDAFLRIGK 360 + + KL + + K Sbjct: 400 AETAEKLDAIYKDLSK 415 >gi|149634622|ref|XP_001513644.1| PREDICTED: similar to AMACO [Ornithorhynchus anatinus] Length = 801 Score = 91.0 bits (224), Expect = 2e-16, Method: Composition-based stats. Identities = 50/222 (22%), Positives = 88/222 (39%), Gaps = 24/222 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F C HA + +KIS+ S + + +D+ ++ + G A Sbjct: 29 FSQAFLSHCIQELHASK--ETIMKISAASQL-MQCSAAVDILFLLDGSYSIGKGSFETAK 85 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTT 253 + ++ D + PD VR G V FS FPL Q ++KI +++F G +T Sbjct: 86 YFVVKLCDALDINPDR---VRVGAVQFSVASWLEFPLDSCLTRQEAKDKIKKIVFRGGST 142 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ L+Y K F + + +I +TDG++ + K Sbjct: 143 ETGLALKYILWKGFPGGR--------NASVPQILILITDGKSQGNVTVP------AQQLK 188 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAF 355 RG V+A+GV+ ++ L AS V + + DA+ Sbjct: 189 DRGITVFAVGVRFPRWEE-LHLLASEPNEQHVLFAEDVDDAY 229 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 67/186 (36%), Gaps = 22/186 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ ++D S ++ RS DI +V + GLV F +I Sbjct: 544 SLDLVFLVDASAAVGRENFTHTRNF---VRSSSLRFDIN------RDVTQIGLVVFGRQI 594 Query: 228 VQTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + E ++++ F S A ++D ++ A+ K Sbjct: 595 RTVFALDTHPTGSGVLEAVSQMPFVGGVGSAG---TALLHVYDEVMTVQKGARPG--VSK 649 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ +TDG ++++ + + G V+ I V + L+ SP Sbjct: 650 AVVLITDG------TGIEDAVVPAQKLRSNGVSVFVIRVGPFQKEALLRIAGSPSYLVQA 703 Query: 346 QNSRKL 351 + + L Sbjct: 704 SSYKDL 709 >gi|289624057|ref|ZP_06457011.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289650363|ref|ZP_06481706.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 2250] gi|330866187|gb|EGH00896.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 352 Score = 91.0 bits (224), Expect = 2e-16, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 89/241 (36%), Gaps = 42/241 (17%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P + + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 75 RPQWLGAPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLGDFL 128 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + R GL+ F ++ PL + + ++ ++ G K+T + DA Sbjct: 129 EGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNT--------ALGDAI 180 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ +TDG N++ ID + A G +Y IG+ ++ Sbjct: 181 GLGLKRLRLRPATSRVLVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGSDPDK 237 Query: 331 QFLKNC-------------------ASPDRFYSVQNSRKL------HDAFLRIGKEMVKQ 365 L++ S +++ ++ +L D+ + ++ + Sbjct: 238 DALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKIRATLDSLEPVAQQPTQA 297 Query: 366 R 366 R Sbjct: 298 R 298 >gi|120403735|ref|YP_953564.1| hypothetical protein Mvan_2751 [Mycobacterium vanbaalenii PYR-1] gi|166988604|sp|A1T8Q8|Y2751_MYCVP RecName: Full=UPF0353 protein Mvan_2751 gi|119956553|gb|ABM13558.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 335 Score = 91.0 bits (224), Expect = 2e-16, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 77/215 (35%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM ++L A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVAP-NRLVAAQEAAKQFADQLTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + I++L T + G+ A + I G + I+ + Sbjct: 152 VSPTTNREATKAAIDKLQLADRTATGEGIFTALQAVATVG---AVIGGGDEPPPARIVLM 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P+ + K + AK +G + + D+ LK Sbjct: 209 SDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPVDDEMLK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S ++ + +L F + +++ + I Sbjct: 269 KIADLSGGDAFTASSLEQLKQVFTNLQEQIGYETI 303 >gi|331697176|ref|YP_004333415.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326951865|gb|AEA25562.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 327 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 38/268 (14%), Positives = 88/268 (32%), Gaps = 35/268 (13%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDV 177 + + L P + P A +L+ + + + + + +++V+DV Sbjct: 35 KFTNLELLDTVAPRRPGWYRHVPAAAMILALAVLVVAIAGPQADAKVPRNRATVVLVIDV 94 Query: 178 SLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 SLSM D + A +S + L + GLV+F+ Sbjct: 95 SLSMQATDVAPTRLAAAQAAAKSFADQLTP---------GINLGLVSFAGTAAVLVSPTT 145 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 +++ ++ L +T + + A I + + I+ ++DG+ Sbjct: 146 DRTAVKQAVDGLKLSESTATGEAIFAALQSIDSFSRTVAASGTEGPPPAR-IVLMSDGKQ 204 Query: 296 S----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNCA 337 + D + S +A V I + D ++ A Sbjct: 205 TVPGPDGENDPRGSFTAAKQAAAEKIPVSTISFGTDYGTIDIEGGRTRVAVDDASMQQIA 264 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMV 363 S +F++ + +L + +G+++ Sbjct: 265 SLSGGQFFTAASESQLRQVYSELGEQIG 292 >gi|312133821|ref|YP_004001160.1| von willebrand factor (vwf) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773110|gb|ADQ02598.1| Von Willebrand factor (VWF) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] Length = 794 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 55/375 (14%), Positives = 113/375 (30%), Gaps = 89/375 (23%) Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 + A +L G + +D + ++ + + + N + Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPD-DPT 60 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + + K + Y + + V + LD+++VL Sbjct: 61 LSAPAREKTVTANEDGTYTVALN--------VTGAKSAGTGEIVT-----NQPLDIVLVL 107 Query: 176 DVSLSMNDHFGPG---MDKLGVATRSIREMLDIIK----SIPDVNNVVRSGLVTFSS--- 225 DVS SM + G K+ ++ + ++ I D + R LV F+ Sbjct: 108 DVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGTEK 167 Query: 226 ----------------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 L + V + +N L T + A + Sbjct: 168 TSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFNRAQAALT-- 225 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEA---KRRGAIVYAIGVQ 325 + + KK +IF TDGE + + D + N+A K G +Y+IGV Sbjct: 226 -------YQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVV 278 Query: 326 AEAA--------DQFLKNCAS----------------------------PDRFYSVQNSR 349 + A ++++ +S + + ++ Sbjct: 279 SGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKAATDAG 338 Query: 350 KLHDAFLRIGKEMVK 364 +L++ F I +E+ K Sbjct: 339 QLNNIFESIYQEITK 353 >gi|23466092|ref|NP_696695.1| hypothetical protein BL1539 [Bifidobacterium longum NCC2705] gi|322691915|ref|YP_004221485.1| cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] gi|23326823|gb|AAN25331.1| hypothetical protein with gram positive cell wall anchoring domain [Bifidobacterium longum NCC2705] gi|320456771|dbj|BAJ67393.1| putative cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 794 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 55/375 (14%), Positives = 113/375 (30%), Gaps = 89/375 (23%) Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 + A +L G + +D + ++ + + + N + Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPD-DPT 60 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + + K + Y + + V + LD+++VL Sbjct: 61 LSAPAREKTVTANEDGTYTVALN--------VTGAKSAGTGEIVT-----NQPLDIVLVL 107 Query: 176 DVSLSMNDHFGPG---MDKLGVATRSIREMLDIIK----SIPDVNNVVRSGLVTFSS--- 225 DVS SM + G K+ ++ + ++ I D + R LV F+ Sbjct: 108 DVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGTEK 167 Query: 226 ----------------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 L + V + +N L T + A + Sbjct: 168 TSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFNRAQAALT-- 225 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEA---KRRGAIVYAIGVQ 325 + + KK +IF TDGE + + D + N+A K G +Y+IGV Sbjct: 226 -------YQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVV 278 Query: 326 AEAA--------DQFLKNCAS----------------------------PDRFYSVQNSR 349 + A ++++ +S + + ++ Sbjct: 279 SGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKAATDAG 338 Query: 350 KLHDAFLRIGKEMVK 364 +L++ F I +E+ K Sbjct: 339 QLNNIFESIYQEITK 353 >gi|126657060|ref|ZP_01728231.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] gi|126621603|gb|EAZ92313.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] Length = 328 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 36/243 (14%), Positives = 87/243 (35%), Gaps = 36/243 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDK 191 + + + P + + + D+M+++D+S SM D G +D+ Sbjct: 65 LVPICWLLVVIAIAQPQWLGEPIT---QVQSARDLMLLVDLSQSMEAKDFQDQQGNKIDR 121 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + + ++ R GL+ F +K P ++ + ++ Sbjct: 122 LEAVKLVVDDFIE-------RREGDRIGLILFGTKAYLQVPFTQDLETARFLLD------ 168 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 ++ G+ A + DA + + +I LTDG ++ + ++ Sbjct: 169 --EAQIGMAGAQTMLGDAIGLAIQTFEDSKTENRVLILLTDGNDTGSQVPPDQA---AKF 223 Query: 312 AKRRGAIVYAIGVQAEA-------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 A +R ++Y I + ++ L+ A + +F+ + + L + I ++ Sbjct: 224 AAQRNIVIYTIAIGNPETEGTEKIDEETLQLIADQTGGQFFRGSDRQGLIQIY-DILDQL 282 Query: 363 VKQ 365 Q Sbjct: 283 EPQ 285 >gi|283782262|ref|YP_003373017.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283440715|gb|ADB19157.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 395 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 66/401 (16%), Positives = 130/401 (32%), Gaps = 84/401 (20%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G++ +L A LL V+ + I+ S+ V+++L + T L + +G+ Sbjct: 16 RRGAMLVLIAFLLVVVVCMAAFAIDVSYMQLVRSELRAAT-DAAAKAGTLALAKTDGDAA 74 Query: 73 KKQKNDFSYRIIKNIWQTDF----------RNELRENGFAQDINNIERSTSLSIIIDDQH 122 + + R+ + NG N TS+ I+ Sbjct: 75 SARTAAIQAAARNKVAGRALVLTTDQVQVGRSAAQANGTWSFTANQTPYTSVKILSSMSD 134 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM- 181 S P TF + ++ S + ++ +V+D S SM Sbjct: 135 STAAGSV------PLFLGTFM--------GRGSFQPAQSATASQMEQEICLVIDRSHSMC 180 Query: 182 ------NDHFGPG---------------MDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + PG + + S+ +D I + NN R L Sbjct: 181 FNMSGVEWSYPPGTKTTPHTICYPPHATLSRWAALQSSVNLFMDTI---LETNNTPRVAL 237 Query: 221 VTFSSKIVQTF-----------------PLAWGVQHIQEKINR---LIFGSTTKSTPGLE 260 +T+ S I L+ ++ KI + T + G+ Sbjct: 238 ITWGSTIGTNTAEYSYTKKTEVAVANELGLSTDYAAVKSKIAARTTKVMLGGTNMSAGI- 296 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 DA L + KK +I +TDG+ + ++ + +A G ++ Sbjct: 297 -------DAGRTLLNGNTVRALAKKTMILMTDGQWNQG----RDPIDAAEDAADEGIQIH 345 Query: 321 AIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 I + +A ++ A + ++Y N +L +AF + Sbjct: 346 TITFLSGSAQNTMRQVAEITGGKYYVSSNQAELEEAFRDLA 386 >gi|322689979|ref|YP_004209713.1| cell surface protein [Bifidobacterium longum subsp. infantis 157F] gi|320461315|dbj|BAJ71935.1| putative cell surface protein [Bifidobacterium longum subsp. infantis 157F] Length = 794 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 55/375 (14%), Positives = 113/375 (30%), Gaps = 89/375 (23%) Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 + A +L G + +D + ++ + + + N + Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPD-DPT 60 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + + K + Y + + V + LD+++VL Sbjct: 61 LSAPAREKTVTANEDGTYTVALN--------VTGAKSAGTGEIVT-----NQPLDIVLVL 107 Query: 176 DVSLSMNDHFGPG---MDKLGVATRSIREMLDIIK----SIPDVNNVVRSGLVTFSS--- 225 DVS SM + G K+ ++ + ++ I D + R LV F+ Sbjct: 108 DVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGTEK 167 Query: 226 ----------------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 L + V + +N L T + A + Sbjct: 168 TSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFNRAQAALT-- 225 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEA---KRRGAIVYAIGVQ 325 + + KK +IF TDGE + + D + N+A K G +Y+IGV Sbjct: 226 -------YQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVV 278 Query: 326 AEAA--------DQFLKNCAS----------------------------PDRFYSVQNSR 349 + A ++++ +S + + ++ Sbjct: 279 SGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKAATDAG 338 Query: 350 KLHDAFLRIGKEMVK 364 +L++ F I +E+ K Sbjct: 339 QLNNIFESIYQEITK 353 >gi|118468162|ref|YP_887464.1| hypothetical protein MSMEG_3149 [Mycobacterium smegmatis str. MC2 155] gi|118169449|gb|ABK70345.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length = 327 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 76/215 (35%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM +L A + ++ D + + GL+ ++ Sbjct: 91 VMLVIDVSQSMRATDVAP-SRLVAAQEAAKQFADQLTP------GINLGLIAYAGTATVL 143 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + +++L T + G+ A I I G + I+ + Sbjct: 144 VQPTTNREATKNGLDKLQLADRTATGEGIFTALQAIATVG---AVIGGGDEPPPARIVLM 200 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P+ + K + AK +G + + D+ L+ Sbjct: 201 SDGKETVPSNPDNPKGAFTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPVDDEMLE 260 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S ++ + +L F + +++ + I Sbjct: 261 KIAQLSGGDAFTASSLEQLKAVFTSLQQQIGYETI 295 >gi|301165481|emb|CBW25052.1| putative membrane protein (von Willebrand factor type A) [Bacteriovorax marinus SJ] Length = 329 Score = 90.6 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 54/235 (22%), Positives = 86/235 (36%), Gaps = 36/235 (15%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGL------DMMMVLDVSLSM-NDHFGPGMDKLGVAT 196 W + L+ S + S D+ V+DVS SM D F P ++L VA Sbjct: 54 WLVGAVGWLLIAYSLTQPRSPQGFAKNKIEVNDIFFVIDVSRSMLADDFRP--NRLEVAK 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----T 252 D I + R GL+ FS + PL+ ++ I++ + + G Sbjct: 112 -------DKISDFVALRPTDRIGLIMFSERAFTLLPLSTDLKLIKQMVGEINVGGMLGSG 164 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T L A + +AK K II LTDG ++ + + EA Sbjct: 165 TNIGDALGLAV------ARGAQSLAKN-----KVIILLTDGVSNVGFLTP---IQAAEEA 210 Query: 313 KRRGAIVYAIGVQ--AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K++G VY IG+ +A + KN R+ ++ I + Q Sbjct: 211 KKQGIKVYTIGIGGRGDAKIPYGKNIFGRQRYQNIPGGSIDFKTLKEIADKTNGQ 265 >gi|149188995|ref|ZP_01867284.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] gi|148837181|gb|EDL54129.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] Length = 266 Score = 90.6 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 81/231 (35%), Gaps = 31/231 (13%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 TF ++ P+ + + I G +MM+ +D+S SM + V Sbjct: 2 LTFLGLVLAATKPVWVGEPIAI---EKSGREMMVAVDLSGSMEAKDFVDQQGINVRRIDG 58 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKST 256 ++L ++S R GL+ F P+ + E+++ + G+ T Sbjct: 59 VKLL--LESFLQQRTGDRVGLIAFGDDAYLQAPVTEDFDTLSLLLEQMDVRMAGAGTALG 116 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N EH + K ++ LTDG++++ ++ + E RG Sbjct: 117 DAIGVAVNHF-------EHSESNN----KVLLLLTDGKDTTSQFPPVDAAHFAGE---RG 162 Query: 317 AIVYAIGVQAEAAD-------QFLKNCAS--PDRFYSVQNSRKLHDAFLRI 358 +Y I + + L AS + + + L + + Sbjct: 163 VTIYPIAIGDATNVGEEAIDLEMLARIASYTGGQVFEALDGDALAAVYQTL 213 >gi|312126757|ref|YP_003991631.1| hypothetical protein Calhy_0520 [Caldicellulosiruptor hydrothermalis 108] gi|311776776|gb|ADQ06262.1| protein of unknown function DUF1355 [Caldicellulosiruptor hydrothermalis 108] Length = 909 Score = 90.6 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 48/212 (22%), Positives = 84/212 (39%), Gaps = 26/212 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + +++V+D S SM + ++KL +A + +M+D ++S V G++ F Sbjct: 401 EKERNVAVVLVIDHSGSMGESNLGNINKLEIAKSAAAKMIDHLESSDSV------GVIAF 454 Query: 224 SSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + E I+ + G T P L A N + +K K Sbjct: 455 DHNFYWASKFGKLKSKNEVIENISGIQIGGGTAIIPPLTEAVNTLRKSKAK--------- 505 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K I+ LTD + + + AKR + IGV + L A + Sbjct: 506 --DKVIVLLTD-----GYGEEGGYEYPASIAKRNNIKITTIGVGSSINAPILSWMAAYTS 558 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 RFY V+++ L D FL+ K + + I K Sbjct: 559 GRFYYVKDASNLIDVFLKEAKIIKGKYIKEKK 590 >gi|152993979|ref|YP_001359700.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425840|dbj|BAF73343.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 307 Score = 90.6 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 43/231 (18%), Positives = 83/231 (35%), Gaps = 27/231 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + + P + + ++ G D+++ +D S SM G D+ Sbjct: 51 LKMLIYTFLVVALAKPFVYDA---AGNQHKKGRDLVLAIDASGSMA---QSGFDEKDRFK 104 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTT 253 LD+ G+V F + PL + ++ ++ + I G +T Sbjct: 105 TKYETTLDLSADFIKHRFDDNMGVVIFGTFAYTASPLTYDLEAMESMLKMTTVGIAGEST 164 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 L A + + K II LTDG +++ K ++ AK Sbjct: 165 AIGDALMQAMRTLS-----------YGEAQSKAIILLTDGYHNAGRSSPKAAVAK---AK 210 Query: 314 RRGAIVYAIGVQ--AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +G +Y IGV ++ L A S + Y+ ++ +L + + I K Sbjct: 211 EKGIKIYTIGVGKSSDYDAALLDTIAKESGGKSYAAASAAQLKEVYKEIDK 261 >gi|46190503|ref|ZP_00121395.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440499|ref|YP_001955580.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] gi|189428934|gb|ACD99082.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] Length = 794 Score = 90.6 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 55/375 (14%), Positives = 113/375 (30%), Gaps = 89/375 (23%) Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 + A +L G + +D + ++ + + + N + Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPD-DPT 60 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + + K + Y + + V + LD+++VL Sbjct: 61 LSAPAREKTVTANEDGTYTVALN--------VTGAKSAGTGEIVT-----NQPLDIVLVL 107 Query: 176 DVSLSMNDHFGPG---MDKLGVATRSIREMLDIIK----SIPDVNNVVRSGLVTFSS--- 225 DVS SM + G K+ ++ + ++ I D + R LV F+ Sbjct: 108 DVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNRIALVKFAGTEK 167 Query: 226 ----------------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 L + V + +N L T + A + Sbjct: 168 TSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFNRAQAALT-- 225 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEA---KRRGAIVYAIGVQ 325 + + KK +IF TDGE + + D + N+A K G +Y+IGV Sbjct: 226 -------YQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGVV 278 Query: 326 AEAA--------DQFLKNCAS----------------------------PDRFYSVQNSR 349 + A ++++ +S + + ++ Sbjct: 279 SGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKAATDAG 338 Query: 350 KLHDAFLRIGKEMVK 364 +L++ F I +E+ K Sbjct: 339 QLNNIFESIYQEITK 353 >gi|254482897|ref|ZP_05096133.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214036769|gb|EEB77440.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 330 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 90/261 (34%), Gaps = 35/261 (13%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 + + + + S + + T+ ++ P I ++ + G D+M+ Sbjct: 41 VDLSEQKPEQGARILRRSTLQKILVNLTWLLLILAAAKPQWIGPPIE---QQKSGRDLMI 97 Query: 174 VLDVSLSMNDH-----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+S SM G +D+L ++K + R GL+ F + Sbjct: 98 AVDLSGSMEARDFTLPSGVTVDRLDAVK-------QVLKELAANRESDRLGLIVFGAAAY 150 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P Q Q+ ++ ++ G+ DA + D + +I Sbjct: 151 LQTPFTDDHQVWQQLLD--------ETEIGMAGPSTVFGDAIGLSIKLFSDSDSDNRVLI 202 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLK-------NCASP 339 LTDG ++ + ++ A G +Y I + A + L + + Sbjct: 203 MLTDGNDTGSTVPPVDA---AKVAAANGVRIYTIAIGDPATVGEDALDMDTITRVSKIAD 259 Query: 340 DRFYSVQNSRKLHDAFLRIGK 360 R + + ++ A++ IG+ Sbjct: 260 GRTFRALDQDEMRQAYITIGE 280 >gi|307943467|ref|ZP_07658811.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773097|gb|EFO32314.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 466 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 69/474 (14%), Positives = 134/474 (28%), Gaps = 119/474 (25%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 S+ + F + +GS+ L A + ++ +V G ++ + K+ +D + L A Sbjct: 8 SYFH--KFGSDERGSLLPLVAGVCLILLVVAGSAVDYGRALGYRHKIANAVDAAALTVAK 65 Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF-AQDINNIERSTSLSIIIDD 120 ++ + I+ + FR L G +Q I+N++ D Sbjct: 66 QLST-----------TVLTENQIRTGLKNAFRANLNAAGINSQGIDNLDFKVDPGEGTLD 114 Query: 121 QHKDYNLSA-------VSRYEMPFIFCTFPWCA---NSSHAPLLITSSVKISSK--SDIG 168 ++ + ++ + + L +T S++ + Sbjct: 115 VWSSVDIQTNFIKLGGIGPEKLEVGAASQVNYSRFDVELALVLDVTGSMRPDMNALKEAS 174 Query: 169 LDMMMVL--DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV--RSGLVTFS 224 ++ +L D S S + + N V RSG F+ Sbjct: 175 KSIVNILLPDDSNSRESKVRISLVPYSQGVNLGSYATRVTNGGSTWRNCVNERSGPQKFT 234 Query: 225 SKIVQTF-------------------------------------PLAWGVQHIQEKINRL 247 PL + I+ L Sbjct: 235 DAPYNYAGSRSDFFHGKPKQFVWDYGWTEQWQTRPEACPKTAVEPLTADRTKLLRAISGL 294 Query: 248 IFGSTTKSTPGLEYAYNKIF---------DAKEKLEHIAKGHDDYKKYIIFLTDGE-NSS 297 G T G+ + + + D+ DD KK+ + +TDG+ N++ Sbjct: 295 KDGGGTGGQTGIAWGWYTLSPKWKNLWPRDSAPATYGTGSHTDDTKKFALIMTDGDFNAA 354 Query: 298 PNID------------------------------------NKESLFYCNEAKRRGAIVYA 321 D + + C+E K + ++ Sbjct: 355 YGWDCGCRKIRDKPLYCRKKSNKKSWIERYFSPSKISHAPAQRAKKLCDEMKSKNIEIFT 414 Query: 322 IGVQAEA---ADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + D + CAS +Y NS +L AF I E+ Q I K Sbjct: 415 VYFDTGGATFGDDLMSYCASGSRNYYRADNSNELIQAFSNIANEI--QSIYIAK 466 >gi|332877593|ref|ZP_08445337.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684442|gb|EGJ57295.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 333 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 51/258 (19%), Positives = 87/258 (33%), Gaps = 54/258 (20%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMD 190 Y + F+ T S +K G+D++M +DVS SM D + Sbjct: 56 YHLLFVLRMIAIALIVVALARPQTHSENAKTKITDGIDIVMAIDVSASMLSQDLKPNRFE 115 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 L +K P+ R GLV ++ + P+ I ++ + +G Sbjct: 116 ALKKVASQF------VKDRPN----DRIGLVIYAGESYTKTPVTTDKLIILNALSEITYG 165 Query: 251 S---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T GL A N++ K + II LTDG N++ ID + + Sbjct: 166 QIEDGTAIGMGLATAVNRL-----------KESKAKSRVIILLTDGVNNTGFIDPQTA-- 212 Query: 308 YCNEAKRRGAIVYAIGVQAEA-----------------------ADQFLKNCA--SPDRF 342 A G VY +G+ + +K A + R+ Sbjct: 213 -AELAAEYGIKVYTVGIGTNGMALSPYALNADGSIIYRMQQVDIDEPLMKKIAQVTKGRY 271 Query: 343 YSVQNSRKLHDAFLRIGK 360 + N++KL + I + Sbjct: 272 FRATNNQKLQQIYDEINQ 289 >gi|114799275|ref|YP_759187.1| hypothetical protein HNE_0457 [Hyphomonas neptunium ATCC 15444] gi|114739449|gb|ABI77574.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 512 Score = 90.3 bits (222), Expect = 4e-16, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 61/158 (38%), Gaps = 20/158 (12%) Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA-----KEKLEHIAKGHDDYK 284 PL +++ +N L+ T G+ + + + E E + Sbjct: 353 PVPLTEDKTVLKDHVNALVAEGGTAGHLGIAWGWYLVSPEWAAIWPEASEPLPYRQPQTS 412 Query: 285 KYIIFLTDGE-----NSSPNIDNKESLFYCNEAKRRG--AIVYAIGVQAEAADQ------ 331 K +I +TDG+ ++ ++S+ C+ K +Y +G Q ++ Q Sbjct: 413 KAVILMTDGDFNIEHPTASRDSFRQSMDLCDGMKASSRRIQIYTVGFQVPSSVQRTGDGR 472 Query: 332 -FLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L+ CA SP +S + +L + + I + + R+ Sbjct: 473 TILEYCATSPSHAFSADSGEELIEVYRSIARSISDLRL 510 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 75/223 (33%), Gaps = 50/223 (22%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA--TKILNQENGN 70 G+++++TA ++P I + G+ I+ + K+K+ LD ++L A + N Sbjct: 18 EGGNVAMITAFVIPCILALTGIAIDLQNTVRQKSKVQAALDSAVLAGALGRQAGNTAAET 77 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIE---RSTSLSI--IIDDQHKDY 125 Q + + G D + + T+L I + + Y Sbjct: 78 TLDVQTYALAL--------------FTDQGGGLDCDPVAVTFDETNLDILGTVRCRQPTY 123 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 S + E+ F + LD+ + DVS SM Sbjct: 124 LSSLIGHDELEFNVASTSTYGVGK-------------------LDVAFIFDVSGSM---- 160 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNN---VVRSGLVTFSS 225 ++L + +D + +PD VR + +++ Sbjct: 161 -NSYNRLAQLKTAAVAAVDEL--LPDSRERDGTVRLAIASYNH 200 >gi|154486447|ref|ZP_02027854.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] gi|154084310|gb|EDN83355.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] Length = 882 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 51/258 (19%), Positives = 90/258 (34%), Gaps = 60/258 (23%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLD--IIKSIPD 211 + + + +D +VLDVS SM+D +L +++ LD + Sbjct: 178 GAVNSTTVTTTQPIDFTLVLDVSGSMDDPMSKTDRTRRLDALKEAVKAFLDEAANTNTEA 237 Query: 212 VNNVVRSGLVTFSS-------------------KIVQTFPLAWGVQHIQEKINRLIFGST 252 + +V GLV F+ L + ++ K+++L Sbjct: 238 GSELVHVGLVKFAGDKTDKIGDDMYRSGGYTYNYSQIVSNLTADMNGLKNKVSKLKAAGA 297 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN----IDNKESLFY 308 T++ G A + A + D KK +IF DG +S + +++ Sbjct: 298 TRADNGFNRAVKVMGSASAR--------TDAKKVVIFFADGSPTSSSGFEGKVANKAVEA 349 Query: 309 CNEAKRRGAIVYAIGVQAEAA--------DQFLKNCAS-----------------PDRFY 343 E K GA VY+IG+ A A +QF+ +S + Sbjct: 350 AKELKDGGAAVYSIGIFASANPSSLSSNENQFMHAVSSNFPKATKYNQLGEGNIEAGYYK 409 Query: 344 SVQNSRKLHDAFLRIGKE 361 S N+ +L+ F I K Sbjct: 410 SATNASELNTIFDEIEKS 427 >gi|297565073|ref|YP_003684045.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296849522|gb|ADH62537.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 308 Score = 89.9 bits (221), Expect = 5e-16, Method: Composition-based stats. Identities = 47/217 (21%), Positives = 83/217 (38%), Gaps = 36/217 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + G+ ++ +DVS SM +D +D A RS E + V+ Sbjct: 80 PTPDEQAGV--VLAIDVSGSMMADDLKPSRLDAAKAAARSFVERMPA---------GVKV 128 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GLV+F++ V L Q + E+I+ L + T GL + + Sbjct: 129 GLVSFAAGAVLESGLTADHQGVIERIDLLERRANTAIGEGLLESLKAFPTGANHQVAVPA 188 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------- 328 +I L+DG N EAKRRG VY IGV ++ Sbjct: 189 T-------VILLSDGRNRIGIA----PQEAAQEAKRRGVRVYTIGVGSDDPNASVDWAGF 237 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 + L+ A + R+++ ++ +L + + +G ++ Sbjct: 238 DEAELRGIAEVTGGRYFAADSADRLQEIYRELGSQIG 274 >gi|126664966|ref|ZP_01735949.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] gi|126630336|gb|EBA00951.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] Length = 341 Score = 89.9 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 79/216 (36%), Gaps = 40/216 (18%) Query: 167 IGLDMMMVLDVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G D+M+V+D+S SM++ G +++L + + E +D R GL+ Sbjct: 86 TGRDLMLVVDISPSMDEPDMVRQGRRINRLQAVKQVLAEFIDQ-------RQGDRLGLIL 138 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F S+ PL + + + ++ G+ I DA + Sbjct: 139 FGSQAYVQAPLTFDRTTVNILLQ--------EAGLGMAGNATAIGDAVGLAVKRLRERPL 190 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD------------ 330 ++ I LTDG N++ I ++ A+ +Y IG+ A A Sbjct: 191 EQRVAIVLTDGANTAGEITPDKASEL---AQASAVRLYTIGIGAGADSAITGLLQRNPSR 247 Query: 331 ----QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + +++ +N +L + I + Sbjct: 248 DLDEALLTRMAQQTGGQYFRARNLAELGGIYTSINQ 283 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 89.9 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 34/143 (23%), Positives = 61/143 (42%), Gaps = 8/143 (5%) Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L + ++ L G T T G+++ + A+ + A G D KKY+I +TD Sbjct: 377 LTSDFTSVNTYLSSLSPGGNTNITLGVQFGMEMLSPAEPYTKATAFGDTDVKKYMIIVTD 436 Query: 293 GEN------SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSV 345 G N +S + N + C AK +G ++ + V+ + L+ CAS +Y + Sbjct: 437 GANTQNRWSTSNSAINARTALACTAAKAQGITLFVVRVE-DGDSSLLEACASQSSYYYDL 495 Query: 346 QNSRKLHDAFLRIGKEMVKQRIL 368 + L I + K R++ Sbjct: 496 SQASDLTKTMQDIFATINKLRLI 518 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/231 (12%), Positives = 75/231 (32%), Gaps = 33/231 (14%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F + +G+ I+ + ++ G ++ F ++L D ++L KI Sbjct: 8 LTKFIGDRQGNTLIIFGLCAVILVGAAGGAVDMMRYFDTSSRLQDATDAAVLKATQKIEV 67 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 E ++ + + + + ++ + + Y Sbjct: 68 SEAAAKTAAAMA----------FEMNLSDHPE-------LQTASHTFAIETSDNAKVVHY 110 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 R F + + SS + S +++ VLD + SM + Sbjct: 111 TSEITQRP-------YFLQLLGLGEQTIRVASSAQSESD---PFELLFVLDTTGSMASN- 159 Query: 186 GPGMDKLGVATR-SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 +K+ + +I + D N V+ G+V F++++ ++ Sbjct: 160 ----NKMTYLKTSVSSVLSSLISTYGDGNEDVKVGVVAFNTQVRLPASTSY 206 >gi|262193845|ref|YP_003265054.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077192|gb|ACY13161.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 346 Score = 89.9 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 48/261 (18%), Positives = 86/261 (32%), Gaps = 56/261 (21%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND----HFGPGMDKLGVATRSIRE 201 + ++ P + I + G+ +MMV+D S SM G +L V R Sbjct: 71 SVAAAGPRVAVGENTIRRE---GIAIMMVVDTSGSMRALDLADGGLDQTRLEVVKDVFRA 127 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-----FGSTTKST 256 + + +N GLV+F+ PL + ++ L T Sbjct: 128 FVAGEDGLDGRSNDT-IGLVSFAGFADTRCPLTLNHGSLLTILDDLEIVRERAEDGTAIG 186 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 GL A ++ + + + II LTDG N++ + L A R G Sbjct: 187 DGLGLAVERL-----------RESEASSRVIILLTDGVNNAG---IETPLEAAELASRLG 232 Query: 317 AIVYAIGVQAEAADQF-----------------------LKNCA--SPDRFYSVQNSRKL 351 VY IG + L+ A + R++ + L Sbjct: 233 IKVYTIGAGTDGVAPVRVTNPLTGAEELRPMPVEIDEATLEAIAEHTGGRYFRATDGDGL 292 Query: 352 HDAFLRIGK----EMVKQRIL 368 + +I + E+ ++R+ Sbjct: 293 RQVYEQIDRLERTEISERRLR 313 >gi|94313457|ref|YP_586666.1| hypothetical protein Rmet_4532 [Cupriavidus metallidurans CH34] gi|93357309|gb|ABF11397.1| conserved hypothetical protein, (Von Willebrand factor, type A); putative membrane protein [Cupriavidus metallidurans CH34] Length = 334 Score = 89.9 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 87/249 (34%), Gaps = 29/249 (11%) Query: 134 EMPFIFCTFPW--CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMD 190 + + W + P + ++ + D+++ LD+S SM+ F Sbjct: 60 LVQLLLGPIAWGLVVVALARPQYLEPPLQRT---QPVRDLLLALDLSQSMDTRDFKTPSG 116 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 L ++R ++ R GL+ F P ++E + ++ Sbjct: 117 VLEPRVDAVR---QVVADFVARRTGDRIGLIVFGDAPYPLAPFTLDHALVRELLADMV-- 171 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 PG+ A + DA + +K +I LTDG +++ + ++ + Sbjct: 172 ------PGMAGASTSLGDAIGLGIKMFDQSHAQEKVMILLTDGNDTASRMPPAQA---AD 222 Query: 311 EAKRRGAIVYAIGVQAEAAD-------QFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKE 361 AK RG +V+ +G+ A LK+ AS R++ + L + + + Sbjct: 223 IAKTRGVVVHTVGIGDPATTGEQKVDLDALKHIASTTGGRYFFGADQTSLASIYATLDRV 282 Query: 362 MVKQRILYN 370 + Sbjct: 283 TPHREKTLT 291 >gi|108762540|ref|YP_633801.1| BatA protein [Myxococcus xanthus DK 1622] gi|108466420|gb|ABF91605.1| batA protein [Myxococcus xanthus DK 1622] Length = 336 Score = 89.9 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 84/229 (36%), Gaps = 49/229 (21%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+++ LD+S SM +++ VA + E + R GLV F+ Sbjct: 91 GIDIVVALDLSTSMEAGDFRPQNRMHVAKEVLSEFIAN-------RVNDRIGLVVFAGAA 143 Query: 228 VQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL ++E + +L + T L + N++ D+ + Sbjct: 144 YTQAPLTLDYGVLKEVVKQLRTRVLEDGTAIGDALATSLNRLRDS-----------EAKS 192 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------------- 325 + ++ +TDG+N+S I +S N A+ +Y I V Sbjct: 193 RVVVLITDGDNNSGKISPMDS---ANMAQALKVPIYTILVGKGGKVPFPQGTDLFGNTVW 249 Query: 326 ----AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + +++ A + +Y + +L + ++ + + +++ Sbjct: 250 RDTEIPINPELMQDIADRTGGEYYRATDPEQLREGLQKVLDSLERSKLM 298 >gi|291613312|ref|YP_003523469.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291583424|gb|ADE11082.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 321 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 82/226 (36%), Gaps = 37/226 (16%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSI 209 P + +S+ G +++ +DVS SM+ ++L V + + + Sbjct: 73 RPQWLGDPEAVST---TGRRLLLAVDVSGSMSTQDMAGNANRLQVVQKVAGDFI------ 123 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKI 266 + + GL+ F ++ PL + + + ++ + G+ T + A ++ Sbjct: 124 -RRRHGDQVGLILFGTQPYLQAPLTTDLNTVGQFLDEAMIGVAGTQTAIGDAIGLAIKRL 182 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 DA + +I LTDG N + + E+ A G +Y IGV + Sbjct: 183 RDATNVSGRKGET------VLILLTDGSNDAGAMPPDEAAKM---AAAAGLRIYTIGVGS 233 Query: 327 EAADQF------------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + D F LK A + ++ + L + RI Sbjct: 234 DQTDPFGMGGANDLDEDTLKLIAKTTGGEYFRATDVENLQQVYTRI 279 >gi|218887819|ref|YP_002437140.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758773|gb|ACL09672.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 406 Score = 89.5 bits (220), Expect = 7e-16, Method: Composition-based stats. Identities = 59/448 (13%), Positives = 135/448 (30%), Gaps = 138/448 (30%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK 76 +++L A+LLPV+ + GL I++ + +L +D + L + ++ + G Sbjct: 1 MAMLMAVLLPVVLGLAGLGIDSGMLYLAHNRLQGAVDAAALAGSLELPYDPQLDKG---- 56 Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP 136 + ++ ++ + ++ +++ + + Sbjct: 57 --LVKGAVNQYMAANY------------------PAAVLKGVTPGTEERSVTVKAEATVD 96 Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 IF + ++ ++ L+++ V+D + SM G + + A Sbjct: 97 TIFM--------GALGIGSSTVRAQATAGYNNLEVVFVIDNTGSMK---GTAIQQANAAA 145 Query: 197 RSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQTFP----------------------- 232 + E++ +PD V+ GLV F K+ Sbjct: 146 TQLAELI-----MPDGMETSVKVGLVPFRGKVHIPAGVDGLADGCRNADGTLAPSWILEE 200 Query: 233 -----------------------------LAWGVQHIQEKINRLIFGS---TTKSTPGLE 260 L I I + T + G++ Sbjct: 201 YKQTKYRYPTGSSLNVPKGTCDSIPRVQALTSNRTTIVSAIAKQDALGDASGTVISEGIK 260 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY------------ 308 + + + + + D +K +I LTDG+ +L Y Sbjct: 261 WGRHVLTPEAPFTQGS--SNKDMRKVMIVLTDGDTEDGKCGGNYALNYTPNAYWTNAYYG 318 Query: 309 ---------------------CNEAKRRGAIVYAIGVQAEAADQ--FLKNCASP-----D 340 AK +G ++AI + +K AS D Sbjct: 319 MFDMNTHCENGGKLNAAMLSEAQIAKDKGIEIFAIRYGDSDSTDISLMKAIASSKAGTDD 378 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 +Y+ ++ L + F +IG+++ + + Sbjct: 379 HYYNAPSAYDLEEIFKKIGRQLGWRLLR 406 >gi|119720657|ref|YP_921152.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] gi|119525777|gb|ABL79149.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] Length = 327 Score = 89.5 bits (220), Expect = 7e-16, Method: Composition-based stats. Identities = 45/201 (22%), Positives = 82/201 (40%), Gaps = 24/201 (11%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+DVS SM D G+ K+ VA R+ +++ + V GL+ FS +IV + Sbjct: 103 VLVVDVSGSMEDSIPGGV-KIEVARRAATLLVERMPG------GVDVGLLAFSDRIVLSL 155 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 P + + + I L G T T L+ A + + K ++F++ Sbjct: 156 PPTGDRRRVLDAIESLKPGGGTMYTYPLQAALSWLKPYKLFNAST---------LVVFVS 206 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFLKNCA--SPDRFYSVQN 347 DG D +E + G VY IG + ++ LK A + Y+ + Sbjct: 207 DGLP----ADAATYRTLLSEFRSLGIPVYTVYIGPGGDEGERELKLIAGSTGGEEYTAGS 262 Query: 348 SRKLHDAFLRIGKEMVKQRIL 368 + +L AF + ++ + Sbjct: 263 AEELLKAFKTLAEKASSILVR 283 >gi|308270599|emb|CBX27211.1| hypothetical protein N47_A12400 [uncultured Desulfobacterium sp.] Length = 330 Score = 89.1 bits (219), Expect = 9e-16, Method: Composition-based stats. Identities = 52/257 (20%), Positives = 91/257 (35%), Gaps = 48/257 (18%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFG 186 +R + + ++ P L S I S G+D+M+ LD S SM + G Sbjct: 51 ARVPIILRALCLIFLVIAAARPQLYNISRDIRS---SGVDIMLCLDTSGSMQALDFELDG 107 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + +L V + + + + R GLV F + PL + +++ Sbjct: 108 KPVTRLTVVKKVVADFIKE-------RETDRIGLVVFGQEAFTQSPLTMDKGLLLSLVDK 160 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + G ST I +A K K +I LTDG +++ +I +E+ Sbjct: 161 MEIGMAGDST--------AIGNAIAVAGKRLKDLKAKSKIMIILTDGRSNTGDITPEEAA 212 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF---------------------LKNCAS--PDRFY 343 A G +Y IGV F LK A+ +++ Sbjct: 213 GA---AAALGIKIYTIGVGGTGPAPFKVNTFFGPRIVNQSVDLDEKTLKEIAAIGKGKYF 269 Query: 344 SVQNSRKLHDAFLRIGK 360 +S++L + + I K Sbjct: 270 RATDSKELANIYEIINK 286 >gi|255531385|ref|YP_003091757.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344369|gb|ACU03695.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 332 Score = 89.1 bits (219), Expect = 9e-16, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 95/267 (35%), Gaps = 54/267 (20%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMD 190 Y + + P S +++ G+D+++ D+S SM + P + Sbjct: 58 HYGIALRILALMALIIALARPQSAFSWQNSTTE---GIDIVIATDISGSMLAEDLKP--N 112 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 +L +D IK P+ R GLV FS + PL + + + G Sbjct: 113 RLEAGKNIA---IDFIKGRPE----DRIGLVIFSGESFTQCPLTIDHDVLINLFSDISNG 165 Query: 251 ---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T GL A N++ K + K +I LTDG N++ +I L Sbjct: 166 MVEDGTAIGMGLATAVNRL-----------KDSEAKSKVVILLTDGSNTTGSIPP---LT 211 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF---------------------LKNCA--SPDRFYS 344 AK+ VY IGV + + L A + +++ Sbjct: 212 AAEIAKQMKVRVYTIGVGTKGYAPYPVKTPFGTQYQQVPVTIDEGVLSKIAGITGGKYFR 271 Query: 345 VQNSRKLHDAFLRIGKEMVKQRILYNK 371 N+ KL + + +I ++ + +I + Sbjct: 272 ATNNEKLKEIYQQI-DKLERAKIAVTQ 297 >gi|330952765|gb|EGH53025.1| von Willebrand factor, type A [Pseudomonas syringae Cit 7] Length = 262 Score = 89.1 bits (219), Expect = 9e-16, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 75/188 (39%), Gaps = 17/188 (9%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F ++ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 DA + + ++ +TDG N++ ID + A G +Y IG+ ++ Sbjct: 178 DAIGLALKRLRMRPATSRALVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGSD 234 Query: 328 AADQFLKN 335 L++ Sbjct: 235 PDKDALQS 242 >gi|228471029|ref|ZP_04055873.1| BatA protein [Porphyromonas uenonis 60-3] gi|228307249|gb|EEK16272.1| BatA protein [Porphyromonas uenonis 60-3] Length = 326 Score = 89.1 bits (219), Expect = 9e-16, Method: Composition-based stats. Identities = 52/250 (20%), Positives = 89/250 (35%), Gaps = 49/250 (19%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +PF + SS +S G+D+++ +D+S SM ++ Sbjct: 54 LPFALELLALASMILALARPQDSS-HWEERSIQGIDLVLAMDLSGSMQ-ALDLKPNRFEA 111 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--- 251 A EM I + P+ N GLV F+ + PL I + ++ G Sbjct: 112 ARDVASEM---IAARPNDN----IGLVVFAGESFTLCPLTVDHDVILQMLDATEIGQLED 164 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GL A N + +G D+ K II LTDG N++ +I Sbjct: 165 GTAIGLGLATAINTL-----------RGSDNKSKVIILLTDGSNNAGDITPS---MAAEL 210 Query: 312 AKRRGAIVYAIGVQAEAADQF---------------------LKNCA--SPDRFYSVQNS 348 A++ G +Y + +F L++ A + ++Y + Sbjct: 211 AQQYGIRIYTVAAGTNGVAKFPVQTASGIEYVEADVQIDEGTLRHIAQQTGGKYYRATDE 270 Query: 349 RKLHDAFLRI 358 KLH+ + I Sbjct: 271 TKLHEIYKEI 280 >gi|269961128|ref|ZP_06175496.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269834079|gb|EEZ88170.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 362 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 80/234 (34%), Gaps = 33/234 (14%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRS 198 + P ++ S +G D+M+V+D+S SM + G + +L A Sbjct: 84 LVVTAMAKPTVLGEPQVRES---LGRDVMVVVDLSGSMAEQDFTSKTGEKISRLDAAK-- 138 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKS 255 +++ R GL+ F P + + + + G +T Sbjct: 139 -----EVLSDFAKTRKGDRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHL 193 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + A +++ + + +K I LTDG ++ ++ ++ AK + Sbjct: 194 GDAIGLAIKVFEQSEKSRTDV---EESKEKVAIVLTDGNDTGSFVEPIDA---AKVAKAK 247 Query: 316 GAIVYAIGVQAE---AADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 ++ I + +K A S + N +L A+ IGK Sbjct: 248 DVRIHVIAMGDPQTVGETALDMNTIKRIAKESGGEAFEALNRDELAKAYDEIGK 301 >gi|257482758|ref|ZP_05636799.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 265 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 82/225 (36%), Gaps = 39/225 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+++ +DVS SM+ P M + + ++ + R GL+ F ++ Sbjct: 1 SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLGDFLEGRKGDRVGLILFGTQ 57 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL + + ++ ++ G K+T + DA + + Sbjct: 58 AFVQAPLTYDRRTVRVWLDEAKIGIAGKNT--------ALGDAIGLGLKRLRLRPATSRV 109 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---------- 336 ++ +TDG N++ ID + A G +Y IG+ A+ L++ Sbjct: 110 LVLVTDGANNAGQIDP---ITAARLAAEEGVKIYPIGIGADPDKDALQSVLGLNPSLDLD 166 Query: 337 ---------ASPDRFYSVQNSRKL------HDAFLRIGKEMVKQR 366 S +++ ++ +L D+ + ++ + R Sbjct: 167 EPTLKEIASLSGGQYFRARDGDQLEKIRATLDSLEPVAQQPTQAR 211 >gi|124006869|ref|ZP_01691699.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987550|gb|EAY27259.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 351 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 45/266 (16%), Positives = 90/266 (33%), Gaps = 46/266 (17%) Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCA-NSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 ++Q + + R+ I C + P + +S+ G+D+++ LD Sbjct: 63 SEEQMQSSRVKW-LRFVPSLILLQAMICIMVALARPQRTNEEIIQTSE---GIDILLTLD 118 Query: 177 VSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 +S SM + F P ++L A + + R GLV FS + PL Sbjct: 119 ISESMLIEDFTP--NRLEAAKLVAKNFV-------HGRKYDRIGLVIFSGEAYSVSPLTT 169 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + ++ I + ++ + A + K +I ++DG+N Sbjct: 170 DYKLLKRYIEDIREDMIQENGTAIGSAL-------GMGTIRMQESASRSKVVILISDGDN 222 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------------------DQFLKNC 336 ++ N+D + A +Y I V + L+ Sbjct: 223 TAGNLDPITASRL---ATAHNIKIYTILVGRSGKVPYGRDMFGQPQYVNNTVDESVLREI 279 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGK 360 A +FY +++ L + F I + Sbjct: 280 AKIGEGKFYRASDNQALKNVFAEINR 305 >gi|196231436|ref|ZP_03130294.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224289|gb|EDY18801.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 341 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 84/226 (37%), Gaps = 52/226 (23%) Query: 167 IGLDMMMVLDVSLSM-NDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G+D+M+ LDVS SM + F G ++ V + ++ ++ R G++ Sbjct: 92 SGIDIMLALDVSGSMIAEDFTIGGERASRVDVVKQVTQKFIEA-------RPNDRIGMIA 144 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F+++ PL + + ++R+ G T + ++ + K+ Sbjct: 145 FAARPYLVSPLTLDHGWLIQNLDRVKLGLVEDGTAIGSAIASCTTRLIERKDSK------ 198 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG---------------- 323 + ++ LTDG+N++ + L A G VY IG Sbjct: 199 ----SRIVVLLTDGDNNAGKVSP---LTAAEAASALGVKVYTIGAGTKGFAPMPVGRDVF 251 Query: 324 -------VQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 V+ + + LK A + +FY +++ L + I + Sbjct: 252 GRKVYQNVKVDVDEDTLKKIADMTKAKFYRATDTKSLTQIYEEIDQ 297 >gi|332884779|gb|EGK05035.1| hypothetical protein HMPREF9456_03188 [Dysgonomonas mossii DSM 22836] Length = 327 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 47/250 (18%), Positives = 83/250 (33%), Gaps = 51/250 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 P + SS S+ G+D++M LD+S +M + +L A Sbjct: 59 LRVIAIALVIIVLARPQSVNSSDISKSE---GIDIIMALDISGTMMANDFSP-TRLEAAK 114 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STT 253 + E ++ R GLV F+ + PL + + ++ + FG T Sbjct: 115 KVASEFIND-------RQSDRIGLVIFAGESFTQCPLTTDHRVLLNLLSEVKFGMIEDGT 167 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 GL + N++ K + +I LTDG N++ I L A Sbjct: 168 AIGLGLANSVNRL-----------KDSQSKSRVVILLTDGSNNAGQIAP---LTAAELAA 213 Query: 314 RRGAIVYAIGVQAEA---------------------ADQFLKNCAS--PDRFYSVQNSRK 350 G VY IG+ + ++ L AS ++ ++ Sbjct: 214 SYGIRVYTIGIGSRGTSVARVMTPYGMQSMNVSGDFDERTLTEIASKTGGSYFRATDNTS 273 Query: 351 LHDAFLRIGK 360 L + I + Sbjct: 274 LSGIYDEIDQ 283 >gi|307720603|ref|YP_003891743.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978696|gb|ADN08731.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 310 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 48/228 (21%), Positives = 83/228 (36%), Gaps = 20/228 (8%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISS---KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + W L + S VK K G ++ ++LD S SM + G D + Sbjct: 51 LLFFLKWLTIIMMI-LALMSPVKDEPYELKPKHGHEIALILDASGSMKER---GFDPVNP 106 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + I+K GLV F S PL + + +++L G Sbjct: 107 AASRFDVVKSIVKDFISQRTNDNMGLVVFGSYSFIASPLTYDKHILSRIVSQLEVG---- 162 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + Y +++A + ++ K K I LTDG +++ D + AK+ Sbjct: 163 ----MAGKYTALYEALAQGVNLLKMSKAKSKVAILLTDGYSTAG-ADKIPLDVVLDMAKK 217 Query: 315 RGAIVYAIGVQAEA--ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 G VY IG+ L A + + N+ +L + + +I Sbjct: 218 EGVKVYPIGIGGPDEYNRAVLLKIAKETGGVAFGASNASQLKEVYKKI 265 >gi|187736265|ref|YP_001878377.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426317|gb|ACD05596.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 328 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 49/216 (22%), Positives = 75/216 (34%), Gaps = 43/216 (19%) Query: 168 GLDMMMVLDVSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+D+M+ D+S SM +++L A I + +D R G+V F Sbjct: 89 GIDIMIAFDLSYSMETPDMVLNRMPINRLVAAKHVITQFVDS-------RPDDRIGIVGF 141 Query: 224 SSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + K PL + I R+I T + A ++ D KE Sbjct: 142 AGKTKSFCPLTLDHALVNSIIRDFHPRMIQADGTAIGSAIAAAATRLDDRKETK------ 195 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---------- 329 K II +TDG ++S I L A + G +Y I V E Sbjct: 196 ----SKIIILVTDGASNSGQISP---LVAAENAAKLGIKIYTIAVGTEEGTLANGMVVQS 248 Query: 330 ---DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + L+ A + + N + AF IGK Sbjct: 249 EFDEPTLRKIAQLTGGEHFRATNMASFNKAFTSIGK 284 >gi|146295744|ref|YP_001179515.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409320|gb|ABP66324.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 909 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 48/212 (22%), Positives = 83/212 (39%), Gaps = 26/212 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + +++V+D S SM ++KL +A + +M+D ++S V G++ F Sbjct: 401 EKERNVAVVLVIDHSGSMGGSNLRNINKLEIAKSAAAKMIDHLESSDSV------GVIAF 454 Query: 224 SSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + E I+ + G T P L A N + +K K Sbjct: 455 DHNFYWASKFGKLKSKNEVIENISTIQVGGGTAIIPPLTEAVNLLKKSKAK--------- 505 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K I+ LTD + + + AKR + IGV + L A + Sbjct: 506 --DKVIVLLTD-----GYGEEGGYEYPASIAKRNNIKITTIGVGSSINAPILSWMAAYTS 558 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 RFY V+++ L D FL+ K + + I K Sbjct: 559 GRFYYVKDASNLIDVFLKEAKIIKGKYIKEKK 590 >gi|32474857|ref|NP_867851.1| chloride channel [Rhodopirellula baltica SH 1] gi|32445397|emb|CAD75398.1| conserved hypothetical protein-putative chloride channel [Rhodopirellula baltica SH 1] Length = 900 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 71/207 (34%), Gaps = 32/207 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 L MM+V+D S SM K+ +A + + ++++ G++ F Sbjct: 459 EKPSLAMMLVIDKSGSMGGQ------KIELAKDAAQAAVELLGPKDA------IGVIAFD 506 Query: 225 SKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L I + I+ + T P + AY + A KL+H Sbjct: 507 GDSYTVSELRSTSDRGAISDAISTIEASGGTNMYPAMADAYEALLGATAKLKH------- 559 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 +I +TDG +S + + + + + +++ L+ A Sbjct: 560 ----VILMTDGVSSPGDFQG-----VAGDMSASRITLSTVALGQGSSEDLLEELAQIGGG 610 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 R+Y + + + F + E K I Sbjct: 611 RYYFCDDPQSVPQVFAKETVEASKSAI 637 >gi|310823567|ref|YP_003955925.1| Bata protein [Stigmatella aurantiaca DW4/3-1] gi|309396639|gb|ADO74098.1| BatA protein [Stigmatella aurantiaca DW4/3-1] Length = 302 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 80/229 (34%), Gaps = 49/229 (21%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+++ LD+S SM ++L VA + E + R GLV F+ Sbjct: 57 GIDIVVALDLSTSMEAGDFRPQNRLHVAKEVLAEFISN-------RVNDRIGLVVFAGAA 109 Query: 228 VQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL ++E + ++ + T L + N++ D+ + Sbjct: 110 YTQAPLTLDYGVVREVLKQIRTRVLEDGTAIGDALATSLNRLRDS-----------EAKS 158 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------------- 325 + ++ +TDG+N++ I ++ K +Y I V Sbjct: 159 RVVVLITDGDNNAGKISPLDAASMAESLK---IPIYTILVGKGGKVPFPQGQDLFGNTVW 215 Query: 326 ----AEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + L++ AS +Y + L ++ + + +++ Sbjct: 216 RDTEIPINPELLQDIASRTGGEYYRATDPEGLKQGLQKVLDSLERSKLM 264 >gi|115379116|ref|ZP_01466240.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] gi|115363899|gb|EAU63010.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] Length = 284 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 80/229 (34%), Gaps = 49/229 (21%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+++ LD+S SM ++L VA + E + R GLV F+ Sbjct: 39 GIDIVVALDLSTSMEAGDFRPQNRLHVAKEVLAEFISN-------RVNDRIGLVVFAGAA 91 Query: 228 VQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL ++E + ++ + T L + N++ D+ + Sbjct: 92 YTQAPLTLDYGVVREVLKQIRTRVLEDGTAIGDALATSLNRLRDS-----------EAKS 140 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------------- 325 + ++ +TDG+N++ I ++ K +Y I V Sbjct: 141 RVVVLITDGDNNAGKISPLDAASMAESLK---IPIYTILVGKGGKVPFPQGQDLFGNTVW 197 Query: 326 ----AEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + L++ AS +Y + L ++ + + +++ Sbjct: 198 RDTEIPINPELLQDIASRTGGEYYRATDPEGLKQGLQKVLDSLERSKLM 246 >gi|241113476|ref|YP_002973311.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861684|gb|ACS59350.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 329 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 49/241 (20%), Positives = 91/241 (37%), Gaps = 28/241 (11%) Query: 128 SAVSRYEMPFIFC-TFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 S V+R P + C T WC + P + ++ K + D+++ LD+S SM+ Sbjct: 53 SVVTRRTWPQLVCETLAWCLVVLALARPQFVEPPIE---KVEPQRDILLALDLSQSMDAR 109 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 PG D +A + + ++ R GLV F P + ++ I Sbjct: 110 DFPGADGKPLAR--VEAVRQVVADFVGKRPGDRIGLVAFGDAPYPLAPFTMDHELVRTMI 167 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 + PG+ + DA + +K +I LTDG +++ + Sbjct: 168 --------ADTVPGMAGPRTSLGDALGLAIKMFGKTTAPEKVLIVLTDGNDTASRMPP-- 217 Query: 305 SLFYCNEAKRRGAIVYAIGVQ---AEAADQF----LKNCASP--DRFYSVQNSRKLHDAF 355 L AK +G I + +G+ A D+ L+ A+ R++ + +L + Sbjct: 218 -LKAAEIAKSKGVIFHTVGIGDPAATGEDKLDTATLQKIAASTGGRYFFGGDQSQLAAIY 276 Query: 356 L 356 Sbjct: 277 E 277 >gi|307718398|ref|YP_003873930.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] gi|306532123|gb|ADN01657.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] Length = 458 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 52/226 (23%), Positives = 95/226 (42%), Gaps = 25/226 (11%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF------GPGMDKLGVATRSIREML 203 LL SV+ + G+ ++VLD S SM D P ++ A R+IRE L Sbjct: 70 SWRLLPVRSVRRGVNREEGISFLLVLDASGSMWDALDGTPTEDPDRMRITHAKRAIREFL 129 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 P ++ R GL F+ P+ + EK++ + P E AY Sbjct: 130 ------PLLSGRDRVGLAVFNRTYRVIQPIVGDPSLVLEKLDAIE-------RPSREQAY 176 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN--SSPNIDNKESLFYCNEAKRRGAIVYA 321 +++ + E+ G + ++ ++ L+DGEN P+ + A R G Y Sbjct: 177 TELYRSMEEALTDF-GEEGRRRVLVVLSDGENFPVDPSESPSTPGTAIDLAHRYGITCYV 235 Query: 322 IGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 I E D+ + + AS R + +N+ +L + I ++++++ Sbjct: 236 IHFGTEK-DRLIGDLASETGGRVFDARNALELASVYTAIQEQVLQE 280 >gi|78357411|ref|YP_388860.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219816|gb|ABB39165.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 402 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 63/409 (15%), Positives = 136/409 (33%), Gaps = 70/409 (17%) Query: 22 AILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSY 81 A+LLPVI +MGL +++ + ++L +D + L + ++ Sbjct: 2 AVLLPVILGIMGLGLDSGMLYLSHSRLQAAVDAAALAGSLQLPYD------PAMDKGLVR 55 Query: 82 RIIKNIWQTDFRNELRENGF-AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC 140 + +F + ++ + ++ + ++ S+ R + + Sbjct: 56 AAVDEYMHANFPQAVVQSVLPGAEERSVTVNAEATVGTIFMGALGIGSSTVRAQASAGYN 115 Query: 141 TFPWCANSSHAPLLITSSV-KISSKSDIGLDMMMVLDVSLSMNDHFGP---------GMD 190 ++ + S + + ++ + +D++M ++ S+ P +D Sbjct: 116 NLEVVFVIDNSGSMKGSPINETNAAATRLVDLIMPEGMATSVKIGLVPFRGKVRIPADVD 175 Query: 191 KLGVATRSIREMLDIIKSIPDVNNV-------VRSGLVTFS-SKIVQTFPLAWGVQHIQE 242 L R+ L+ + + R + +S S I T L I + Sbjct: 176 GLPSGCRNADGSLNEDGLLDEYKKPEYRYPYNDRLRVTPYSCSSIPLTQGLTADRATITQ 235 Query: 243 KINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I R T + GL++A + + E D +K II LTDG+ N Sbjct: 236 AIGRQDARGDSSGTVISEGLKWARHVLTPEAPFTEGS--SAKDMRKVIILLTDGDTEDGN 293 Query: 300 IDNKESLFY-------------------CNE--------------AKRRGAIVYAIGVQA 326 S++Y C + AK G ++AI + Sbjct: 294 CGGNYSVYYRPNNYWTNAYYGMMDMDSHCEDGGVLNNAMLSEAALAKDAGIEIFAIRYGS 353 Query: 327 EAADQ--FLKNCASP-----DRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A ++ AS D ++ + + D F IG+++ + + Sbjct: 354 SDAVDRNLMRAVASSKEGTDDHYFDAPSPYDIDDVFKLIGRQLGWRLLR 402 >gi|254820233|ref|ZP_05225234.1| hypothetical protein MintA_09911 [Mycobacterium intracellulare ATCC 13950] Length = 339 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 81/243 (33%), Gaps = 31/243 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 P ++ LL T+ +S I L+ +M+V+DVS SM + P ++L A + Sbjct: 71 VPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMASNDVPP-NRLAAAKEA 129 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ D + + GLV F++ P ++ I+ L T + G Sbjct: 130 GKQFADQLTPAIN------LGLVEFAANATLLVPPTTNRGAVKSGIDSLQPAPKTATGEG 183 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFYCNEAKRRG 316 + A I + G I+ +DG + P + + AK +G Sbjct: 184 IFTALQAIATVGSVMGG---GEGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKGQG 240 Query: 317 AIVYAIGVQAEAA---------------DQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGK 360 + I K C + + + L + + + + Sbjct: 241 VQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGEAFHADSLDSLKNVYTTLQR 300 Query: 361 EMV 363 ++ Sbjct: 301 QIG 303 >gi|114705525|ref|ZP_01438428.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] gi|114538371|gb|EAU41492.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] Length = 461 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 63/163 (38%), Gaps = 26/163 (15%) Query: 232 PLAWGVQHIQEKINRLIFGST-----TKSTPGLEYAYNKIFDAKEKLEHI-AKGHDDYKK 285 L+ ++ I+ L +G T GL + N + E + +K Sbjct: 298 SLSTREADVKTSISNLSYGGGGYRPSTFIPAGLIWGLNVLSPPAPFEEQAYDPNNKLPRK 357 Query: 286 YIIFLTDGENS-----------------SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 ++ +TDG N+ +++++ CN KR G ++ +G + Sbjct: 358 ALVLMTDGANTMVFNSSDGRHRNARSGTEVAQSDRDTISICNNIKRSGIEIFTVGFMVNS 417 Query: 329 AD--QFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + LK CA+ ++ + +LH AF RI + + R++ Sbjct: 418 SSALDLLKECATDGEHYFDATSPEELHSAFGRIADGLTQIRLI 460 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 73/209 (34%), Gaps = 36/209 (17%) Query: 20 LTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDF 79 + +LP++ V G ++ + + L +D + L+ A ++ + D Sbjct: 1 MALAILPMLLAVGG-TVDVGRQSSLATDLQEAIDIAALHIAKA------PSDAIPGEEDV 53 Query: 80 SYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIF 139 I NI D R L++ + ++ + I PF Sbjct: 54 LQLIKSNITTKDSRIALKKLDVTE--KDVSLHATAEIT------------------PFFL 93 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 + L + K + ++ +++ +VLD + SM++ G +L + Sbjct: 94 GL------AGIKNLTAQRATKTAREARGEIEVALVLDTTWSMSEKDSSGKSRLDSLKGAA 147 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +++D I + R +V ++ + Sbjct: 148 AKLVDTIFTEDGK---TRVAVVPYADYVN 173 >gi|170720775|ref|YP_001748463.1| von Willebrand factor type A [Pseudomonas putida W619] gi|169758778|gb|ACA72094.1| von Willebrand factor type A [Pseudomonas putida W619] Length = 358 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 66/188 (35%), Gaps = 31/188 (16%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDII 206 P + V +++ G D+++ +DVS SM+ + +L + + + L Sbjct: 75 RPQWLGDPVPVAA---SGRDLLVAVDVSGSMDFPDMQWKDEEVSRLDLVKALLGDFLQD- 130 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAY 263 R GL+ F S+ PL + + ++ ++ G T + A Sbjct: 131 ------REGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAV 184 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + ++ +TDG N+ I L A + +Y IG Sbjct: 185 KRLRQRPAQ-----------SRVLVLITDGANNGGRIHP---LTAARLAAQEDVRIYTIG 230 Query: 324 VQAEAADQ 331 + A Sbjct: 231 IGANPEAS 238 >gi|153833319|ref|ZP_01985986.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148870455|gb|EDL69376.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 363 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 80/234 (34%), Gaps = 33/234 (14%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRS 198 + P ++ S +G D+M+V+D+S SM + G + +L A Sbjct: 84 LVVTAMAKPTVLGEPQVRES---LGRDVMVVVDLSGSMAEQDFTSKNGEKISRLDAAK-- 138 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKS 255 +++ R GL+ F P + + + + G +T Sbjct: 139 -----EVLSDFAKTRKGDRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHL 193 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + A +++ + + +K I LTDG ++ ++ ++ AK + Sbjct: 194 GDAIGLAIKVFEQSEKSRTDV---EESKEKVAIVLTDGNDTGSFVEPIDA---AKVAKAK 247 Query: 316 GAIVYAIGVQAE---AADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 ++ I + +K A S + N +L A+ IGK Sbjct: 248 DVRIHVIAMGDPQTVGETALDMNTIKRIAKESGGEAFEALNRDELAKAYDEIGK 301 >gi|284046352|ref|YP_003396692.1| hypothetical protein Cwoe_4905 [Conexibacter woesei DSM 14684] gi|283950573|gb|ADB53317.1| conserved hypothetical protein [Conexibacter woesei DSM 14684] Length = 317 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 75/216 (34%), Gaps = 38/216 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +M+V D S SM D++ A R+ LD + P +R G+ TFS Sbjct: 87 SVMLVTDHSRSMLAEDVEP-DRITAAKRAASRFLDQL---PPG---IRVGVTTFSDVPDG 139 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T + I+ I I T + L+ A + + E + + + ++ Sbjct: 140 TQTPTYDHDLIRRTIEAQIADGGTATGDALQVALDTL-------ERLEQNGERTPAAMVL 192 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------------AEAADQ 331 L+DG ++ ++ + A +Y + + + Sbjct: 193 LSDGATTTG----RDPVMVARAAGEARIPIYTVALGTRDATVPNPGPTGPPLLPVAPDPE 248 Query: 332 FLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 L+ AS R + Q+ ++L + +G + + Sbjct: 249 TLQAIADASGGRAFQAQDDQELSSIYETLGSRLGTR 284 >gi|225621320|ref|YP_002722578.1| von Willebrand factor type A (vWA) domain-containing protein [Brachyspira hyodysenteriae WA1] gi|225216140|gb|ACN84874.1| von Willebrand factor type A (vWA) domain containing protein [Brachyspira hyodysenteriae WA1] Length = 289 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 51/254 (20%), Positives = 93/254 (36%), Gaps = 52/254 (20%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKL 192 ++PF+ F + S + G+ + +V+DVS SM + P +L Sbjct: 13 DIPFMLIIFALAFSIIGLSRPAKVSHLSDINGE-GVYISLVVDVSPSMMAEDMIP--TRL 69 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 + + M+D IK N + LV+F+ + P + ++E+I ++ Sbjct: 70 EASKK---TMIDFIKK----RNFDKISLVSFALRASVLSPATFDYTSLEEEIKKIEIDEE 122 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+ A + + KE E K II LTDGEN+S ID K + Sbjct: 123 GSTSIGLGIATAVDMLRSVKEDNE----------KIIILLTDGENNSGEIDPKLASEI-- 170 Query: 311 EAKRRGAIVYAIGVQAEAAD-------------------------QFLKNCA-SPDRFYS 344 A +Y IG+ + A + ++++ Sbjct: 171 -ASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIRADFTLNEESLIDIAATTGGKYFN 229 Query: 345 VQNSRKLHDAFLRI 358 +N+ L + + I Sbjct: 230 AKNASALDNVYNTI 243 >gi|197336748|ref|YP_002158568.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197314000|gb|ACH63449.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 350 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 42/237 (17%), Positives = 88/237 (37%), Gaps = 25/237 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + ++ + P L+ ++ G D+M+V+D+S SM + +D + ++ Sbjct: 72 LLSLSWIMILVALTKPTLLGPP---QTREQFGRDVMVVVDLSGSMAEKDFTSIDGIKISR 128 Query: 197 -RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGST 252 +++++L+ R GL+ F P + + +N+ + G + Sbjct: 129 LDAVKKVLNDFAK---TRKGDRLGLILFGDAAFVQTPFTADHEVWLDLLNQTRVEMAGKS 185 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + I +E KK I LTDG ++ + ++ A Sbjct: 186 THLGDAIGL---TIKRFEENDNSQPLSTTSRKKVAIILTDGNDTDSYVPPMDA---AKVA 239 Query: 313 KRRGAIVYAIGVQAEA--ADQFLK-------NCASPDRFYSVQNSRKLHDAFLRIGK 360 K +G ++ I + +Q L AS + + N +L +A+ I K Sbjct: 240 KVKGIRIHMIAIGDPQTVGEQALDMDTINTIADASGGQAFQALNQDELINAYAEISK 296 >gi|120437734|ref|YP_863420.1| von Willebrand factor(vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579884|emb|CAL68353.1| membrane protein containing von Willebrand factor(vWA) type A domain [Gramella forsetii KT0803] Length = 335 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 52/242 (21%), Positives = 85/242 (35%), Gaps = 56/242 (23%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDII 206 + P + S + SS G+D++M +DVS SM P ++L E + Sbjct: 73 AMARPRSVDVSTQTSSTQ--GIDIVMAIDVSASMLARDLQP--NRLDATKNVAEEFIQD- 127 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYA 262 R GLV ++ + P+ + + + + + + T GL A Sbjct: 128 ------RPGDRIGLVVYAGESFTKTPITSDKAIVLDALEDIEYNNVLENGTAIGSGLATA 181 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 N+I K D K II LTDG N++ ID + A G VY I Sbjct: 182 VNRI-----------KDSDAESKVIILLTDGVNNAGFIDPSTASEL---AVEFGIKVYTI 227 Query: 323 GVQAEA------------------------ADQFLK--NCASPDRFYSVQNSRKLHDAFL 356 GV + + LK A+ +++ N+ KL + + Sbjct: 228 GVGSNGMALSPVGVNPANGRLRFGNVQVEIDEDLLKEIAAATGGKYFRATNNEKLEEIYA 287 Query: 357 RI 358 I Sbjct: 288 EI 289 >gi|189347765|ref|YP_001944294.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341912|gb|ACD91315.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 325 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 28/223 (12%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + P L+ VK + G+D+++ LD+S SM G +L RE + Sbjct: 77 LGVAAMTRPQLV---VKQTVAHSRGIDVILALDISESMQLKDAGGRSRLDAVKSVAREFV 133 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 ++ R G+V F K PL + I+ + + A Sbjct: 134 T-------RHSNDRIGVVVFKGKGYTLSPLTLDHRVTGMLIDNVSPDVIRDEGTAVGTAV 186 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + +K II +DG +++ ID + + A +G +Y G Sbjct: 187 LI-------AVNRLRASQSDQKVIILFSDGVSNAGEIDP---VTAASFAAAQGIRIYTAG 236 Query: 324 VQAEAADQF------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + ++ L+ A + R++ S L +AF I Sbjct: 237 AGSASSASSALDEGELRRVALTAGGRYFRAGTSASLAEAFESI 279 >gi|149188658|ref|ZP_01866950.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] gi|148837568|gb|EDL54513.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] Length = 346 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 80/238 (33%), Gaps = 34/238 (14%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 WC T + ++ +G D+M+ +D+S SM++ D Sbjct: 69 ILIVSWCLLVIAMTKP-TVLGEPQTREKLGRDVMVAVDLSGSMSEMDFSSSD------GQ 121 Query: 199 IREMLDIIKSIPDV----NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGS 251 LD +KS+ R GL+ F P +++ G Sbjct: 122 AVSRLDAVKSVLHEFVATREGDRLGLILFGDAAYLQTPFTADHDVWLALLDQTEVAMAGQ 181 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T + A ++ D +K ++ LTDG ++ ++ K++ Sbjct: 182 STHLGDAIGLAIKVFEQSE--------SSKDKEKVVVVLTDGNDTGSFVEPKDAAIV--- 230 Query: 312 AKRRGAIVYAIGVQAEA--ADQFLKNCA-------SPDRFYSVQNSRKLHDAFLRIGK 360 A +G ++ I + A +Q L S + + + L A+ IG+ Sbjct: 231 AAAKGVRIHVIAMGDPATIGEQALDMATIDNIASQSGGQAFQALDQEALQQAYRTIGE 288 >gi|159036783|ref|YP_001536036.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157915618|gb|ABV97045.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 319 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 50/246 (20%), Positives = 89/246 (36%), Gaps = 36/246 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 P T+ V++ + +M+ +DVS SM P D+L A Sbjct: 60 LFLAMLALLVVGFARP---TAEVRVPRERAT---VMVAVDVSTSMLAGDVEP--DRLTAA 111 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 + R +D +PD N GLV F+ P + + E I+RL+ G+T Sbjct: 112 KEAARRFVD---GLPDEFN---VGLVAFAGSAAVLVPPDTDREALDEGIDRLVEGATGVQ 165 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + A N A + L+ A + I+ L+DG N+S + + +A Sbjct: 166 GTAIGEAINTSLGAVKALDGEAAKDPPPAR-IVLLSDGANTSG----MDPMEAATDAVAM 220 Query: 316 GAIVYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 V+ I + Q L A + +F+ ++++L + IG Sbjct: 221 DVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDEVARETGGQFHEADSAKELRAVYDDIG 280 Query: 360 KEMVKQ 365 + + Sbjct: 281 SSVGYR 286 >gi|284030499|ref|YP_003380430.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809792|gb|ADB31631.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 317 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 41/228 (17%), Positives = 85/228 (37%), Gaps = 37/228 (16%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD--KLGVATRSIREMLDIIKSIPDVNNV 215 VK+ + +++ +DVSLSM +D +L A +S + ++ + S +V Sbjct: 78 EVKVPRERAT---IVVAIDVSLSM---MATDVDPNRLEAAKKSAKNFVNQLPSKFNV--- 128 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LV F+ P +Q I+ L +T + G+ + + EH Sbjct: 129 ---ALVNFAGTASIIVPPTTDRATVQRSIDGLELAESTATGEGIFTSLQALTQVPPDPEH 185 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA------ 329 +D I+ L+DG+ + + AK + +Y I ++ Sbjct: 186 ---PNDPAPARIVLLSDGKRTVGRTAQEG----AQAAKEKNTPIYTITFGTDSGFIEMDG 238 Query: 330 --------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L++ A + Y+ +++ +L D + IG + ++ Sbjct: 239 IRQRVPPDRAELRSVAEITGGEAYTAESAGELEDVYKDIGSSVGYDKV 286 >gi|329848522|ref|ZP_08263550.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] gi|328843585|gb|EGF93154.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] Length = 486 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 32/161 (19%), Positives = 59/161 (36%), Gaps = 25/161 (15%) Query: 232 PLAWGVQHIQEKINRLIFGST-----TKSTPGLEYAYNKIFDAKEKLEHIA--KGHDDYK 284 PL+ + I L+ T G+ + N + E + + + Sbjct: 324 PLSNDATVVTNTIKGLVVNIGGYKPETYIPGGMIWGVNALTPPAPFTEGKPYDANNKEPR 383 Query: 285 KYIIFLTDGENSSPNIDN---------------KESLFYCNEAKRRGAIVYAIGVQAEAA 329 K I+ +TDG N+ + + + C+ AK + +Y IG + Sbjct: 384 KTIVLMTDGANTLYANTSGGIAVANATQVAVTYSDQIRVCDYAKSKKIEIYTIGFDVTDS 443 Query: 330 DQF--LKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 LK CA+ ++ ++S L AF IG ++ K R+ Sbjct: 444 KALSTLKACATDAQHYFDAKSSADLIKAFETIGGKLSKVRL 484 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 89/227 (39%), Gaps = 33/227 (14%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 F+N F + G+ +++ + + I +G I+ + K LD ++L A Sbjct: 10 YFVNA--FLRSRGGNTTMIFGLAIFAIMAALGTAIDFAVLQRAKRSTQDALDSAVLAAA- 66 Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 I+N N + KK D +EN A D++ + D + Sbjct: 67 -IVNNSNEGDLKKLAADV----------------FKENLGAADLDAKVTAFKY----DAK 105 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 + +A Y+ P I F + + P +TS +D L++ +VLD + SM Sbjct: 106 ARTVKATAQGSYD-PVIMQLFGF----KNLPYAVTSDAIK--AADGTLEVALVLDNTWSM 158 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + G K+ + + + ++ I + + + V+ +V ++ + Sbjct: 159 SATVN-GTPKIDILKTAAQGLVSTILTKDN-KDYVKIAVVPYADYVN 203 >gi|159045656|ref|YP_001534450.1| von Willebrand factor type A domain-containing protein [Dinoroseobacter shibae DFL 12] gi|157913416|gb|ABV94849.1| von Willebrand factor type A domain protein [Dinoroseobacter shibae DFL 12] Length = 328 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 42/257 (16%), Positives = 87/257 (33%), Gaps = 41/257 (15%) Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 + A SR ++ + P + + I+S D+++ +D+S Sbjct: 48 SPASDGALILARSRLQLVAASLVWVLLITGLARPERLGEPITITS---AARDLVLAVDIS 104 Query: 179 LSMNDHF-----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 SM+D G + +L D++ + R L+ F +K P Sbjct: 105 GSMDDRDMTAPDGTRLQRLQAVK-------DVVGAFVAEREGDRISLIVFGAKPFIQAPF 157 Query: 234 AWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + E +N++ G T + A D+ + ++ +I L Sbjct: 158 TEDLDSVVELLNQVQTGMAGPNTAIGDAIGLAIRSFEDS-----------EIEERLLILL 206 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCAS--PDR 341 +DG +++ + + A + G +Y IGV L++ A+ Sbjct: 207 SDGADTASTMTP---INAAQIAAQEGITIYTIGVGNPDGSGEERLDPATLEDIATRGGGA 263 Query: 342 FYSVQNSRKLHDAFLRI 358 FY + L + + I Sbjct: 264 FYFADDVEGLSEIYAEI 280 >gi|116251678|ref|YP_767516.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115256326|emb|CAK07407.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 329 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 49/241 (20%), Positives = 91/241 (37%), Gaps = 28/241 (11%) Query: 128 SAVSRYEMPFIFCT-FPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 S ++R P I C WC + P + ++ K + D+++ LD+S SM+ Sbjct: 53 SVIARRTWPQILCEGAAWCLVVVALARPQFVEPPIE---KVEPQRDILLALDLSQSMDTK 109 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 PG D +A + ++ R GLV F P ++ I Sbjct: 110 DFPGADGKPLARVDAVK--QVVADFVGRRPGDRIGLVAFGDAPYPLAPFTMDHALVETMI 167 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 + PG+ + DA + + +K +I LTDG +++ + Sbjct: 168 --------ADAVPGMAGPRTSLGDALGLAVKMFEKTTVPEKVLIVLTDGNDTASRMPP-- 217 Query: 305 SLFYCNEAKRRGAIVYAIGVQ---AEAADQF----LKNCA--SPDRFYSVQNSRKLHDAF 355 L AK +G +V+A+G+ A D+ L+ A + R++ + +L + Sbjct: 218 -LKAAEIAKSKGVVVHAVGIGDPLATGEDKLDTATLQKIAEKTGGRYFFGGDQAQLASIY 276 Query: 356 L 356 Sbjct: 277 Q 277 >gi|300776751|ref|ZP_07086609.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] gi|300502261|gb|EFK33401.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] Length = 330 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 82/251 (32%), Gaps = 51/251 (20%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + + P T + G+D+M+ +DVSLSM D++ Sbjct: 59 KYIILSALIIAMARPRTFT--ISQDRDDTKGVDIMLSIDVSLSMLAKDLNP-DRITALKD 115 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS---TTK 254 + + R G+V ++++ P+ Q + ++I L T Sbjct: 116 IAVKFVQ-------KRPNDRIGVVAYAAEAFTKVPVTSDHQVVIDEIKNLNSAGLEPGTA 168 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 GL A N + +K K K +I +TDG ++ N + AK Sbjct: 169 IGEGLSVAVNHLVKSKAK-----------SKVVILMTDGVSNIQNAIPPQV--AAELAKN 215 Query: 315 RGAIVYAIGVQAEA-----------------------ADQFLKNCA--SPDRFYSVQNSR 349 VYAIG+ + L+ A + +++ ++ Sbjct: 216 NNIKVYAIGIGTNGYALMPTSQDIFGDLVFTETEVTIDENTLREIAQTTGGKYFRATSNS 275 Query: 350 KLHDAFLRIGK 360 L + + I + Sbjct: 276 SLEEVYDEINQ 286 >gi|313885991|ref|ZP_07819729.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924521|gb|EFR35292.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 326 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 48/217 (22%), Positives = 81/217 (37%), Gaps = 48/217 (22%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+++ +D+S SM ++ A EM I + P+ N GLV F+ + Sbjct: 86 GIDLVLAMDLSGSMQ-ALDLKPNRFEAARDVASEM---IAARPNDN----IGLVVFAGES 137 Query: 228 VQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL IQ+ + G T GL A N + +G D+ Sbjct: 138 FTLCPLTVDHNVIQQMLETTEIGQLEDGTAIGLGLATAINTL-----------RGSDNKS 186 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------ 332 K II LTDG N++ +I A++ G +Y + +F Sbjct: 187 KVIILLTDGSNNAGDITPS---MAAELAQQYGIRIYTVAAGTNGVAKFPVQTAFGTEYVE 243 Query: 333 ---------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L++ A + ++Y + KLH+ + I Sbjct: 244 ADVQIDEGTLRHIAEQTGGKYYRATDETKLHEIYKEI 280 >gi|20089145|ref|NP_615220.1| hypothetical protein MA0247 [Methanosarcina acetivorans C2A] gi|19914014|gb|AAM03700.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 589 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 44/210 (20%), Positives = 81/210 (38%), Gaps = 26/210 (12%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 +S S + +D++ +D S SM + G+ K A +S + +D + G+ Sbjct: 73 STSTSAVPMDVVFAIDSSGSMQSNDPSGLRK--TAAKSFVDKMDSSRDTA--------GV 122 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V++ I + PL ++ I+ + +T GLE A + + + Sbjct: 123 VSWDDSIDFSLPLTNDFPLVKTNIDSVDSSGSTNLNVGLEEAIDILDANPRTENSVE--- 179 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK-NCASP 339 IIFLTDG+ + + EA +G ++Y+IG+ + Sbjct: 180 -----VIIFLTDGQGTYLH-------STAQEAADKGYVIYSIGLGGVNPTPLQDMATTTG 227 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 +YS ++ L F I E+ I Y Sbjct: 228 GAYYSSPDATSLQAIFDDIFSEVTTSTIPY 257 >gi|41407305|ref|NP_960141.1| hypothetical protein MAP1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118463234|ref|YP_882479.1| hypothetical protein MAV_3297 [Mycobacterium avium 104] gi|81414471|sp|Q740Y5|Y1207_MYCPA RecName: Full=UPF0353 protein MAP_1207 gi|41395657|gb|AAS03524.1| hypothetical protein MAP_1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118164521|gb|ABK65418.1| protein Nfa34780 [Mycobacterium avium 104] Length = 335 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 76/215 (35%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM +++ A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVAP-NRMAAAQEAAKQFADELTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + +++L F T + G+ A I + K I+ Sbjct: 152 VSPTTNREATKNALDKLQFADRTATGEGIFTALQAIATVGAVIGGGDKP---PPARIVLF 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + K + AK +G + I D+ LK Sbjct: 209 SDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ + ++L + + +++ + I Sbjct: 269 KVAQLSGGNAYNAASLQELKSVYATLQQQIGYETI 303 >gi|325268973|ref|ZP_08135594.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] gi|324988594|gb|EGC20556.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] Length = 318 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 51/207 (24%), Positives = 79/207 (38%), Gaps = 25/207 (12%) Query: 164 KSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 K G+D+M+ +DVS SM D P +++ VA E I S P+ N GL Sbjct: 83 KETEGIDIMLTMDVSASMLTDDVYP--NRMAVAKEVASEF---ISSRPNDN----IGLTI 133 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F+ + P+ + ++ + T GL I + Sbjct: 134 FAGEAFTQCPMTLDHAALLNLLHNVRPDLVT---SGLMKDGTAIGMGLANAVSRLQDSKA 190 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------LKN 335 K +I LTDG N+ +I + AK+ G VY IG E ++ L+N Sbjct: 191 KSKIVILLTDGSNNVGSISP---MTAAAIAKKFGIRVYTIGFGRETGEEIGAIDYRALQN 247 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGK 360 A + FY Q+ +L + I K Sbjct: 248 IAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|153871328|ref|ZP_02000529.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152072210|gb|EDN69475.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 280 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 46/213 (21%), Positives = 83/213 (38%), Gaps = 32/213 (15%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 ++ S + + + +++DVS SM+ L A ++ +E + Sbjct: 74 WLSLPPSPESVTLVHQSVFLLIDVSYSMDGS------ALAEAKQAAQEFVRKSDLAHTA- 126 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 GL+ F SK L +H+ + INRL +T T GL AY K+ + Sbjct: 127 ----IGLIEFGSKAKIISGLTQNAKHLYKAINRLKTNGSTNMTEGLTTAYLKLKN----- 177 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 D ++II LTDG + P + + C G + IG +A +L Sbjct: 178 -------VDDPRFIILLTDGLPNHPKNTQQIAQEIC----ADGIELITIGTG-DADKTYL 225 Query: 334 K--NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + C + F++ + + F RI + + + Sbjct: 226 QSLACYDQNSFFA--KAGTMVSTFSRIAQVLTE 256 >gi|296170658|ref|ZP_06852233.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295894647|gb|EFG74381.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 78/245 (31%), Gaps = 28/245 (11%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 A+ + + +M+V+DVS SM +++ A + + Sbjct: 69 AILLVASLVLLTIAMAGPTNDVRIPRNRAVVMLVIDVSQSMRATDVQP-NRMAAAQEAAK 127 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 + D + + GL+ ++ + +++L F T + G+ Sbjct: 128 QFADELTP------GINLGLIAYAGTATVLVSPTTNRDSTKRALDKLQFADRTATGEGIF 181 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAI 318 A I I G I+ +DG+ + P + K + AK +G Sbjct: 182 TALQAIATVG---AVIGGGDAPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVP 238 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 + I D+ LK A S Y+ ++L + + +++ Sbjct: 239 ISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAATLQELKSVYATLQQQI 298 Query: 363 VKQRI 367 + I Sbjct: 299 GYETI 303 >gi|224370037|ref|YP_002604201.1| hypothetical protein HRM2_29500 [Desulfobacterium autotrophicum HRM2] gi|223692754|gb|ACN16037.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 332 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 53/237 (22%), Positives = 91/237 (38%), Gaps = 30/237 (12%) Query: 134 EMPFI-FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPG 188 +P I +C + P T + + ++ G+++++ LD+S SM G Sbjct: 54 LLPLIKYCALVLLIIALARPQWGTRKMNVKTE---GINIILALDLSKSMAALDFKLDGAI 110 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 +++L +++K + R G+V F S+ PL I ++RL Sbjct: 111 VNRLDAVK-------NVVKDFIMKRSGDRIGMVVFGSEAFTQMPLTRDYDTIAFVLSRLK 163 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G+ ST I DA + +I LTDG+++S I + Sbjct: 164 IGAAGPST--------AIGDAMGISLKRLEDVKSKSNIVILLTDGKSNSGEITPGAA--- 212 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKN-CASPDRF-YSVQNSRKLHDAFLRIGKEMV 363 + A+ RG VY IGV FL N R+ Y + + H+A I + Sbjct: 213 ADIARERGVKVYTIGVGQRGKAPFLVNDPLFGQRYVYQMVDMD--HEALKEIADKTG 267 >gi|332299342|ref|YP_004441263.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] gi|332176405|gb|AEE12095.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] Length = 326 Score = 87.2 bits (214), Expect = 3e-15, Method: Composition-based stats. Identities = 48/217 (22%), Positives = 81/217 (37%), Gaps = 48/217 (22%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+++ +D+S SM ++ A EM I + P+ N GLV F+ + Sbjct: 86 GIDLVLAMDLSGSMQ-ALDLKPNRFEAARDVASEM---IAARPNDN----IGLVVFAGES 137 Query: 228 VQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL IQ+ + G T GL A N + +G D+ Sbjct: 138 FTLCPLTVDHNVIQQMLETTEIGQLEDGTAIGLGLATAINTL-----------RGSDNKS 186 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------ 332 K II LTDG N++ +I A++ G +Y + +F Sbjct: 187 KVIILLTDGSNNAGDITPS---MAAELAQQYGIRIYTVAAGTNGVAKFPVQTAFGTEYVE 243 Query: 333 ---------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L++ A + ++Y + KLH+ + I Sbjct: 244 ADVQIDEGTLRHIAEQTGGKYYRATDETKLHEIYKEI 280 >gi|183982301|ref|YP_001850592.1| membrane protein [Mycobacterium marinum M] gi|226701243|sp|B2HPD3|Y2288_MYCMM RecName: Full=UPF0353 protein MMAR_2288 gi|183175627|gb|ACC40737.1| membrane protein [Mycobacterium marinum M] Length = 335 Score = 87.2 bits (214), Expect = 3e-15, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 84/259 (32%), Gaps = 31/259 (11%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGP 187 + P F P + L + + I + +M+V+DVS SM Sbjct: 56 VAPQRPSRFRHIPAMLLALSLVLFTVAMAGPTHDVRIPRNRAVVMLVIDVSQSMRATDVE 115 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +++ A + ++ D + + GL+ ++ + + +++L Sbjct: 116 P-NRMVAAQEAAKQFADELTP------GINLGLIAYAGTATVLVSPTTNREATKAALDKL 168 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKES 305 F T + + A I I G I+ +DG+ + P + K + Sbjct: 169 QFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGA 225 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSR 349 AK +G + I D+ +K A S Y+ Sbjct: 226 YTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLA 285 Query: 350 KLHDAFLRIGKEMVKQRIL 368 +L+ + + +++ + I Sbjct: 286 ELNSVYASLQQQIGYETIR 304 >gi|254775742|ref|ZP_05217258.1| hypothetical protein MaviaA2_13890 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 76/215 (35%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM +++ A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVAP-NRMAAAQEAAKQFADELTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + +++L F T + G+ A I + K I+ Sbjct: 152 VSPTTNREATKNALDKLQFADRTATGEGIFTALQAIATVGAVIGGGDKP---PPARIVLF 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + K + AK +G + I D+ LK Sbjct: 209 SDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ + ++L + + +++ + I Sbjct: 269 KVAQLSGGNAYNAASLQELKSVYATLQQQIGYETI 303 >gi|218961690|ref|YP_001741465.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] gi|167730347|emb|CAO81259.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] Length = 270 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 47/177 (26%), Positives = 73/177 (41%), Gaps = 24/177 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 +K S+ G+D++M +D+S SM ++L A + D +K P+ R Sbjct: 15 IKTRDLSNKGVDIVMAIDISGSMLAMDFAPKNRLSAAVSVAK---DFVKRRPN----DRF 67 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEH 275 GLV FS + PL + + +++L S T GL A ++ Sbjct: 68 GLVAFSEYALTQVPLTFDHLAMLNSLDKLKVNEEASATAIGMGLAKAVARL--------- 118 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K K II +TDG +++ ID L AK G VY IGV ++ F Sbjct: 119 --KNSTAKSKVIILITDGVSNTGEIDP---LTAAGMAKELGIKVYPIGVGSKGLVPF 170 >gi|73542573|ref|YP_297093.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72119986|gb|AAZ62249.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 340 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 42/232 (18%), Positives = 86/232 (37%), Gaps = 27/232 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVA 195 + + P + + ++ K+ D+++ LD+S SM+ F L Sbjct: 65 LAPLAWALLVTALARPQFLEAPIE---KTQPVRDLLLALDLSQSMDTRDFRDPSGALIPR 121 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 +++R +++ S R GL+ F P +Q I L+ Sbjct: 122 VQAVR---EVVSSFVARRPGDRIGLIVFGDAPYPLAPFTLDHALVQTMIRDLL------- 171 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 PG+ + DA + +K +I LTDG +++ + + + + AK+R Sbjct: 172 -PGMAGPSTALGDAVGLGIKMFDQSPAPEKVLIVLTDGNDTASKMPPERA---ADIAKQR 227 Query: 316 GAIVYAIGVQ---AEAADQ----FLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 V+ IG+ AE + L+ A + R++ + L + + Sbjct: 228 HVTVHTIGIGDPSAEGEQRVDLGVLQRMAAQTGGRYFFGADQNSLESIYATL 279 >gi|297462925|ref|XP_608567.5| PREDICTED: collagen type VI alpha 6-like [Bos taurus] Length = 2343 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 74/199 (37%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M++++K N VR G + ++ Sbjct: 804 LDVVFVIDSSGSIDHDEYNIMKDF---------MINLVKKADVGKNHVRFGALKYADDPE 854 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I N G T + L ++ + +A+ H + Sbjct: 855 VLFYLDNLDTKWEVISVLQNDQPLGGNTYTAEALGFSDHMFTEARGSRLHKGVP-----Q 909 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 910 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIAGANPVELLAMAGSSDKYFFV 965 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 966 ETFGGLKGIFSDVSASVCN 984 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 73/216 (33%), Gaps = 18/216 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ N Sbjct: 974 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFRKMKEFLASVIQDFDI---SN 1030 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAK 270 N VR G FS FPL G + I +I N T L + Sbjct: 1031 NRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENIKQIFGYTHIGAALRQVGHYFRPDM 1090 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 H + ++ LTDG++ E E + +G +Y++G+ Sbjct: 1091 GSRIHAGTP-----QVLLVLTDGQSQD------EVAQAAEELRHKGIDIYSVGIGDVDDQ 1139 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 Q ++ + ++ +V N +L RI + + R Sbjct: 1140 QLVQITGTANKKLTVHNFDELKKVKKRIVRNICSPR 1175 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 79/202 (39%), Gaps = 29/202 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S G++ ++ ++ + D V+ G+V FS Sbjct: 613 KEMKADIMFLVDSSGS------IGLENFIKMKTFMKNLVSKSQIRADR---VQIGVVQFS 663 Query: 225 SKIVQTFPLAWGV--QHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + I + I+R+ G TT L + AK + Sbjct: 664 DVNKEEFQLNRYTSQEEISDAIDRMAHIGETTLMGSALTFVSQYFSPAKGARPN------ 717 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE D +L + G I+Y++GV Q + P+ Sbjct: 718 -VRKFLILITDGEAQDIVKDPAVAL------REEGIIIYSVGVFGSNVTQLEEISGRPEM 770 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 + V+N D I ++V Sbjct: 771 VFYVEN----FDILKHIEDDLV 788 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 84/225 (37%), Gaps = 22/225 (9%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + + + + + D++ +LD+S + + + + + + I Sbjct: 202 AAQYKDRAVNDILVEVCQGPSVADVVFLLDMSTNSSWEDFDYLKEF---------LEESI 252 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYN 264 ++ + +R GLV +S++ L+ GV + + I L + T A Sbjct: 253 SALDIKEHCMRVGLVAYSNETKVISTLSRGVNKSEVLQDIQSLAPQAGKAYTGA---ALR 309 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 KI +H ++ + + + +T SP+ DN +R+G IV+ IGV Sbjct: 310 KIRKEVFSAQHGSRKNQGVPQIAVLVT----HSPSQDN--VTKAAVNLRRQGVIVFTIGV 363 Query: 325 QAEAADQFLKNCASPDRFY--SVQNSRKLHDAFLRIGKEMVKQRI 367 + + Q K + P Y +++ L K++ Q Sbjct: 364 EGASDTQLEKIASHPAEQYVSQLRSFSDLAAHNQTFLKKLRNQIT 408 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 70/197 (35%), Gaps = 26/197 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S S + E+ + P VR G V ++ + Sbjct: 432 DIYLLIDGSGS------TQATDFQEMKTFLSEVAGMFNIAPQK---VRVGAVQYADRWDL 482 Query: 230 TFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F ++ +++ I + G + L + + AK++ H Sbjct: 483 EFEISKYTNKHDVRKAIENIRQMGGNRNTGAALNFTLGLLQRAKQQRGGRVPCH------ 536 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSV 345 ++ LT+G + N VYAIGV+ EA L+ A + R Y V Sbjct: 537 LVVLTNGASRDSVSGP------ANRLSEELIHVYAIGVR-EANQTQLREIAGEEKRVYYV 589 Query: 346 QNSRKLHDAFLRIGKEM 362 + L D ++ +E+ Sbjct: 590 HDFDALKDIRNQVVQEI 606 >gi|297471452|ref|XP_002685218.1| PREDICTED: collagen, type VI, alpha 1-like [Bos taurus] gi|296490817|gb|DAA32930.1| collagen, type VI, alpha 1-like [Bos taurus] Length = 2268 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 74/199 (37%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M++++K N VR G + ++ Sbjct: 804 LDVVFVIDSSGSIDHDEYNIMKDF---------MINLVKKADVGKNHVRFGALKYADDPE 854 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I N G T + L ++ + +A+ H + Sbjct: 855 VLFYLDNLDTKWEVISVLQNDQPLGGNTYTAEALGFSDHMFTEARGSRLHKGVP-----Q 909 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 910 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIAGANPVELLAMAGSSDKYFFV 965 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 966 ETFGGLKGIFSDVSASVCN 984 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 73/216 (33%), Gaps = 18/216 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ N Sbjct: 974 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFRKMKEFLASVIQDFDI---SN 1030 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAK 270 N VR G FS FPL G + I +I N T L + Sbjct: 1031 NRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENIKQIFGYTHIGAALRQVGHYFRPDM 1090 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 H + ++ LTDG++ E E + +G +Y++G+ Sbjct: 1091 GSRIHAGTP-----QVLLVLTDGQSQD------EVAQAAEELRHKGIDIYSVGIGDVDDQ 1139 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 Q ++ + ++ +V N +L RI + + R Sbjct: 1140 QLVQITGTANKKLTVHNFDELKKVKKRIVRNICSPR 1175 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 79/202 (39%), Gaps = 29/202 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S G++ ++ ++ + D V+ G+V FS Sbjct: 613 KEMKADIMFLVDSSGS------IGLENFIKMKTFMKNLVSKSQIRADR---VQIGVVQFS 663 Query: 225 SKIVQTFPLAWGV--QHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + I + I+R+ G TT L + AK + Sbjct: 664 DVNKEEFQLNRYTSQEEISDAIDRMAHIGETTLMGSALTFVSQYFSPAKGARPN------ 717 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE D +L + G I+Y++GV Q + P+ Sbjct: 718 -VRKFLILITDGEAQDIVKDPAVAL------REEGIIIYSVGVFGSNVTQLEEISGRPEM 770 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 + V+N D I ++V Sbjct: 771 VFYVEN----FDILKHIEDDLV 788 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 84/225 (37%), Gaps = 22/225 (9%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + + + + + D++ +LD+S + + + + + + I Sbjct: 202 AAQYKDRAVNDILVEVCQGPSVADVVFLLDMSTNSSWEDFDYLKEF---------LEESI 252 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYN 264 ++ + +R GLV +S++ L+ GV + + I L + T A Sbjct: 253 SALDIKEHCMRVGLVAYSNETKVISTLSRGVNKSEVLQDIQSLAPQAGKAYTGA---ALR 309 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 KI +H ++ + + + +T SP+ DN +R+G IV+ IGV Sbjct: 310 KIRKEVFSAQHGSRKNQGVPQIAVLVT----HSPSQDN--VTKAAVNLRRQGVIVFTIGV 363 Query: 325 QAEAADQFLKNCASPDRFY--SVQNSRKLHDAFLRIGKEMVKQRI 367 + + Q K + P Y +++ L K++ Q Sbjct: 364 EGASDTQLEKIASHPAEQYVSQLRSFSDLAAHNQTFLKKLRNQIT 408 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 70/197 (35%), Gaps = 26/197 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S S + E+ + P VR G V ++ + Sbjct: 432 DIYLLIDGSGS------TQATDFQEMKTFLSEVAGMFNIAPQK---VRVGAVQYADRWDL 482 Query: 230 TFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F ++ +++ I + G + L + + AK++ H Sbjct: 483 EFEISKYTNKHDVRKAIENIRQMGGNRNTGAALNFTLGLLQRAKQQRGGRVPCH------ 536 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSV 345 ++ LT+G + N VYAIGV+ EA L+ A + R Y V Sbjct: 537 LVVLTNGASRDSVSGP------ANRLSEELIHVYAIGVR-EANQTQLREIAGEEKRVYYV 589 Query: 346 QNSRKLHDAFLRIGKEM 362 + L D ++ +E+ Sbjct: 590 HDFDALKDIRNQVVQEI 606 >gi|298372684|ref|ZP_06982674.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275588|gb|EFI17139.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] Length = 326 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 55/280 (19%), Positives = 92/280 (32%), Gaps = 55/280 (19%) Query: 110 RSTSLSIIIDDQHKDYNLSAVS-RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 SL + + + Y P P S +++ G Sbjct: 29 NDASLQVSSTKAFEGMAKTKKIWLYYFPSTLIVVALLVIVLARPQTSDSYSNSTTE---G 85 Query: 169 LDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +++++ +D+S SM P ++L A E + +N GLV F+ + Sbjct: 86 INIVIAMDISGSMLARDLSP--NRLEAAKDVGIEFI----LSRPNDNF---GLVVFAGES 136 Query: 228 VQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P+ + ++ FG T GL A N+I DA+ K Sbjct: 137 FTQCPITSNHASLVNLFKQVDFGIIQDGTAIGLGLATAINRIKDAEGK-----------S 185 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------ 332 K II LTDG N++ +I + A+ G VY IGV + + Sbjct: 186 KVIILLTDGTNNTGDIAP---ISAAQIAQSYGIRVYTIGVGTQGIAEVPMLDQFGNIHYT 242 Query: 333 ----------LKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 L+ AS +++ N L + I K Sbjct: 243 EAEVVIDETTLQQIASTTGGKYFRATNVSSLKQIYSEIDK 282 >gi|229495742|ref|ZP_04389470.1| BatA protein [Porphyromonas endodontalis ATCC 35406] gi|229317316|gb|EEN83221.1| BatA protein [Porphyromonas endodontalis ATCC 35406] Length = 325 Score = 86.8 bits (213), Expect = 4e-15, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 74/220 (33%), Gaps = 54/220 (24%) Query: 168 GLDMMMVLDVSLSM---NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G+D+M+ +D S SM + + + VA + I + GLV F+ Sbjct: 85 GIDIMLAIDASGSMMAMDLQPNRFVAAVEVAQKFIGNRPNDN-----------IGLVMFA 133 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + PL + +++ + G T G+ A N++ K Sbjct: 134 GESFTQCPLTTDHATLLNRLSEVEIGYLEDGTAIGLGIATACNRL-----------KESH 182 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--------- 332 K I+ LTDG N++ +I + A+ G +Y + V + Sbjct: 183 AKSKIIVLLTDGTNNAGSIAPS---MAASLAESLGIRIYTVAVGTRGEAPYPHATAFGTV 239 Query: 333 ------------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 LK A + ++ ++ L+ + I Sbjct: 240 IDNVKVEIDEASLKEIAQTTGGSYFRATDNESLNQIYDEI 279 >gi|333030669|ref|ZP_08458730.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741266|gb|EGJ71748.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 328 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 49/288 (17%), Positives = 89/288 (30%), Gaps = 58/288 (20%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 + D + T + K Y + + + A +S V+ Sbjct: 27 YKIDASLQVSDTRVYEHAPKSFKVYLIHLPFILRIAALALLIIVLARPQSTDNWKSSEVE 86 Query: 161 ISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 G+D+M+ +DVS SM + P ++L A + ++ G Sbjct: 87 -------GIDIMLAIDVSGSMLAEDLQP--NRLEAAKDVAAKFIND-------RPNDNIG 130 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHI 276 + F+ + PL + + G T G+ A ++ Sbjct: 131 ITLFAGESFTQCPLTIDHTALLNLFGNIQTGVIEDGTAIGMGVSNAVARL---------- 180 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---- 332 K K II LTDG N++ +I L AK G VY +G+ + Sbjct: 181 -KDSQAKSKVIILLTDGSNNAGDISP---LTSAEIAKTYGIRVYTVGIGTRGTAPYPIQT 236 Query: 333 ------------------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 LK+ A + ++ ++ L D + I + Sbjct: 237 MTGAIQRIQVEVDIDEPTLKDIARTTGGVYFRATDNTSLQDIYQEIDQ 284 >gi|315502365|ref|YP_004081252.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408984|gb|ADU07101.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 319 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 52/265 (19%), Positives = 93/265 (35%), Gaps = 33/265 (12%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDV 177 + + L P P + LL+ + ++ + + +M+ +DV Sbjct: 35 RFTNLRLLDRVAPRRPAWRRHVPAGLFLAMLALLVVGFARPEAEVRVPRERATVMVAVDV 94 Query: 178 SLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 S SM P D+L A + R +D +PD N GLV F+ P + Sbjct: 95 STSMLAGDVDP--DRLTAAKEAGRRFVD---GLPDEFN---VGLVAFAGSAAVLVPPSTD 146 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + + I RL G T + A + A + L+ A + II L+DG N+ Sbjct: 147 REALHDGIGRLAEGITGVQGTAIGEAISTSLGAVKSLDATAAKDPPPARIII-LSDGANT 205 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNCA--SPD 340 S + + ++A V+ I Q L+ A + Sbjct: 206 SG----MDPMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVDGQTLRAVAEQTGG 261 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQ 365 F+ + +L D + IG + + Sbjct: 262 GFHEASTTAELKDVYEDIGTSVGYR 286 >gi|302865820|ref|YP_003834457.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568679|gb|ADL44881.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 319 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 52/265 (19%), Positives = 93/265 (35%), Gaps = 33/265 (12%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDV 177 + + L P P + LL+ + ++ + + +M+ +DV Sbjct: 35 RFTNLRLLDRVAPRRPAWRRHVPAGLFLAMLALLVVGFARPEAEVRVPRERATVMVAVDV 94 Query: 178 SLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 S SM P D+L A + R +D +PD N GLV F+ P + Sbjct: 95 STSMLAGDVDP--DRLTAAKEAGRRFVD---GLPDEFN---VGLVAFAGSAAVLVPPSTD 146 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + + I RL G T + A + A + L+ A + II L+DG N+ Sbjct: 147 REALHDGIGRLAEGITGVQGTAIGEAISTSLGAVKSLDATAAKDPPPARIII-LSDGANT 205 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNCA--SPD 340 S + + ++A V+ I Q L+ A + Sbjct: 206 SG----MDPMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVDGQTLRAVAEQTGG 261 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQ 365 F+ + +L D + IG + + Sbjct: 262 GFHEASTTAELKDVYEDIGTSVGYR 286 >gi|54025448|ref|YP_119690.1| hypothetical protein nfa34780 [Nocardia farcinica IFM 10152] gi|81374389|sp|Q5YU15|Y3478_NOCFA RecName: Full=UPF0353 protein NFA_34780 gi|54016956|dbj|BAD58326.1| hypothetical protein [Nocardia farcinica IFM 10152] Length = 335 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 39/252 (15%), Positives = 82/252 (32%), Gaps = 37/252 (14%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + + +++V+DVSLSM P +L VA ++ +E + Sbjct: 64 MLVGLVFLTIAAAGPTSVQKVPRNRATVVLVMDVSLSMEATDVPP-SRLEVAQQAGKEFV 122 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D + + G VTF+ + ++ I+ + T + G+ A Sbjct: 123 DGLT------QGINLGFVTFAGTASVMQSPTTNREAVKAAIDNIKLAERTATGEGILTAL 176 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN----IDNKESLFYCNEAKRRGAIV 319 I L I+ ++DG+ + P+ + + + AK +G V Sbjct: 177 QSIETLATVLGGAETP---PPARIVLMSDGKQTVPDDKDVDNPRHAFTAARLAKSKGIPV 233 Query: 320 YAIGVQAEAA---------------------DQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 I E ++ L+ A S FY+ + +L + Sbjct: 234 STISFGTEWGSVEIPDQDGQGGSQRVKVPVDNESLREIAKLSGGEFYTASSLEELTAVYD 293 Query: 357 RIGKEMVKQRIL 368 + +++ + Sbjct: 294 TLEEQIGYETTR 305 >gi|197334600|ref|YP_002156233.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197316090|gb|ACH65537.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 356 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 77/240 (32%), Gaps = 24/240 (10%) Query: 139 FCTFPWC--ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F WC + P ++ S +G D+M+V+D+S SM + T Sbjct: 69 FLVVSWCLLIFALTKPTILGEPQTRES---LGRDVMVVVDLSGSMAEQDFVSKQSSDSGT 125 Query: 197 RSIREMLDIIK----SIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 L+ K R GL+ F P + + + + Sbjct: 126 VKKISRLEATKEVLADFVKTRKGDRLGLILFGDAAFVQTPFTADQSVWLELLNQTDVAMA 185 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + A + E + +K I LTDG ++ ++ ++ Sbjct: 186 GQSTHLGDAIGLAIKVFEQSSEDKASAEENAKPREKVAIVLTDGNDTGSYVEPIDAAKVA 245 Query: 310 NEAKRRGAIVYAIGVQAEA--ADQFLK-----NCA--SPDRFYSVQNSRKLHDAFLRIGK 360 R ++ I + +Q L A S + + N +L A+ IG+ Sbjct: 246 AAKGVR---IHMIAMGDPRTVGEQALDMNIINRVAKESGGKAFQAINRDELEQAYDEIGE 302 >gi|2811055|sp|O07395|Y335_MYCAV RecName: Full=UPF0353 protein MAV335 gi|2183263|gb|AAC46199.1| MAV335 [Mycobacterium avium] Length = 335 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 78/215 (36%), Gaps = 27/215 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM +++ A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVAP-NRMAAAQEAAKQFADELTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + +++L F T + G+ A A + + G I+ Sbjct: 152 VSPTTNREATKNALDKLQFADRTATGEGIFTALQV--QAIATVGAVIAGDKPPPARIVLF 209 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + K + AK +G + I D+ LK Sbjct: 210 SDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLK 269 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ ++ ++L + + +++ + I Sbjct: 270 KVAQLSGGNAYNARSLQELKSVYATLQQQIGYETI 304 >gi|222081474|ref|YP_002540837.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] gi|221726153|gb|ACM29242.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] Length = 329 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 92/244 (37%), Gaps = 27/244 (11%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 + ++ + ++ + + P + ++ K + D+M+ LD+S SM Sbjct: 50 TEGAIIAHRTWPQLILESLAWSLLVLALARPQFVEPPIE---KVEPQRDLMLGLDLSQSM 106 Query: 182 N-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 + F L ++R++ + R GL+ F P + + Sbjct: 107 DTKDFRAPDGNLEARVDAVRKV---VGDFVARRPGDRIGLIAFGDAPYPLAPFTMDHELV 163 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 +E I+ + PG+ + DA + + +K +I LTDG +++ + Sbjct: 164 REIISGTL--------PGIAGPRTSLGDAVGLAIKMFEKTTVPEKVLIVLTDGNDTASKM 215 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQF----LKNCA--SPDRFYSVQNSRKL 351 L AKR G +V+ +G+ A D+ L+ A + R++ + +L Sbjct: 216 PP---LKAAEIAKRNGVVVHTVGIGDPQATGEDRLDATALEKIAETTGGRYFFGGDQAQL 272 Query: 352 HDAF 355 A+ Sbjct: 273 AAAY 276 >gi|189485266|ref|YP_001956207.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287225|dbj|BAG13746.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 333 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 49/230 (21%), Positives = 83/230 (36%), Gaps = 54/230 (23%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 SD G+D+++ LD S SM ++++ A + IR+ + R GLV FS Sbjct: 86 SDQGIDIIVALDTSTSMRSLDFRSLNRMEAAKKVIRDFMKE-------RKYDRIGLVIFS 138 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 PL + E IN + G T + + N++ D++ K Sbjct: 139 GLAFTQCPLTTDKDSLAEFINNINIGDTGLDGTAIGSAIMTSVNRLKDSRAK-------- 190 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---------- 330 + II +TDG N+ ID + A+ +YA+GV + Sbjct: 191 ---SRIIILVTDGNNNMGEIDPLTASKI---ARSYDIKIYAVGVGSLDGAIYEVDDPFLG 244 Query: 331 -------------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 LK A + ++ Q+ +F I K++ K Sbjct: 245 KREIKYRKDAINESVLKEVAYNTSGGYFRAQD----VKSFENIMKQIDKL 290 >gi|315185579|gb|EFU19348.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 459 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 50/226 (22%), Positives = 93/226 (41%), Gaps = 25/226 (11%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF------GPGMDKLGVATRSIREML 203 LL S++ + G+ ++VLD S SM D P ++ A R+IRE L Sbjct: 70 SWRLLPVRSLRRGVNREEGISFLLVLDASGSMWDALDGTPTEDPDRMRITHAKRAIREFL 129 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 P ++ R GL F+ P+ + EK++ + P E AY Sbjct: 130 ------PLLSERDRVGLAVFNRTYRMIQPIVDDPALVLEKLDAIE-------RPSREQAY 176 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN--SSPNIDNKESLFYCNEAKRRGAIVYA 321 +++ + E+ + ++ ++ L+DGEN P + A R G Y Sbjct: 177 TELYRSMEEALTSF-EEEGRRRVLVVLSDGENFPVDPEKSPATPGTAVDLAHRYGITCYV 235 Query: 322 IGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 I E D+ + + AS R + +N+ +L + I ++++++ Sbjct: 236 IHFGTEK-DRLIGDLASETGGRVFDARNALELASVYTAIQEQVLQE 280 >gi|258647263|ref|ZP_05734732.1| BatA protein [Prevotella tannerae ATCC 51259] gi|260852912|gb|EEX72781.1| BatA protein [Prevotella tannerae ATCC 51259] Length = 334 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 58/277 (20%), Positives = 94/277 (33%), Gaps = 60/277 (21%) Query: 117 IIDDQHKDYNLSAVSRYEMPFI--FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 D + + +PF+ TF + P + +S+K G+++MM Sbjct: 37 TTDPYRQLPRTPRTALIHLPFLLRILTFTCIIFALARPQTHNA---LSNKETEGINIMMA 93 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 +DVS SM P ++ A + E ++ GL F + PL Sbjct: 94 IDVSTSMLTPDLPP-SRIETAKQVAYEFINN-------RPDDNIGLTVFGGEAYTQCPLT 145 Query: 235 WGVQHIQEKINRL--------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + ++ + T GL A + + +K K K Sbjct: 146 TDHSALLNMFKQVNCDLQKEGVISPGTAIGMGLSSAVSHLEQSKSK-----------SKV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---------------- 330 II LTDGEN++ I L AKR G +Y I V +AA Sbjct: 195 IILLTDGENNAGEISP---LTAAEMAKRLGIRIYTISVGTDAAVNQTVATLPNGETYEAA 251 Query: 331 -------QFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + L+ A + +FY ++ KL D + I Sbjct: 252 IKQNTDPKTLEAIANSTGGKFYQARSKAKLRDIYQNI 288 >gi|149199796|ref|ZP_01876826.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] gi|149137084|gb|EDM25507.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] Length = 333 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 88/238 (36%), Gaps = 35/238 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDK 191 ++ ++ + P LI + K+ D+++ +D+S SM + G + + Sbjct: 64 VVYLSWFLLVVALARPQLIEEPLT---KTIASRDLLLAVDLSGSMETKDFKNKSGENVTR 120 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + E L R GLV F S P ++ QE ++ Sbjct: 121 LDSVKEVLSEFL-------AEREGDRVGLVFFGSAAFIQMPFTEDLEICQELMD------ 167 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 ++ + + DA I + K +I LTDG ++ + +++ Sbjct: 168 --EAQVRMAGPQTMLGDAIGLSISIFDQSELEDKVLILLTDGNDTGSLVAPEKA---AQI 222 Query: 312 AKRRGAIVYAIGVQAEAAD-------QFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 A+ +G +++ + V AA L++ +S ++Y N +L + I K Sbjct: 223 ARDKGIVIHTVAVGDPAAAGEQALDEATLRSISSLTKGKYYWAGNREELAGIYDEIDK 280 >gi|149180101|ref|ZP_01858606.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] gi|148852293|gb|EDL66438.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] Length = 931 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 44/194 (22%), Positives = 73/194 (37%), Gaps = 31/194 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L M++VLD S SM + K+ +A + I+S + G + F + Sbjct: 407 SLGMVIVLDRSGSMAGY------KIQLAKEA------AIRSAELLREKDTLGFIAFDDRP 454 Query: 228 VQ--TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 Q + + EKIN L G T P LE AY ++ + +K Sbjct: 455 WQIIDTEPIKDKEKVIEKINGLTSGGGTNIFPSLELAYEQLT-----------PLELQRK 503 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFY 343 +II LTDG+ + + + L E K + + + + L+ + RFY Sbjct: 504 HIILLTDGQ----SATSPDYLTTIQEGKENNITLSTVAIGEGSDSVLLEELSDEGGGRFY 559 Query: 344 SVQNSRKLHDAFLR 357 V +S + R Sbjct: 560 DVNDSSTIPSILSR 573 >gi|73990557|ref|XP_853279.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1634 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 51/332 (15%), Positives = 104/332 (31%), Gaps = 27/332 (8%) Query: 40 HKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYR-IIKNIWQTDFRNELRE 98 +++ + ++ +D T + + R ++ + Sbjct: 675 NRYMSQNEISNAIDRMAHIGETTLTGSALTFVSQYFSPAKGARPNVRRFLI--LITDGEA 732 Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLS---AVSRYEMPFIFCTFPWCANSSHAPLLI 155 +D R + I R EM F F + + Sbjct: 733 QDIVKDPAVALRQEGIIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQHIEDDLVFG 792 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S + K LD++ V+D S S++ M M+D++K N Sbjct: 793 ICSPREECKRIEVLDVVFVIDSSGSIDHDEYNIMKDF---------MVDLVKKADVGKNQ 843 Query: 216 VRSGLVTFSSKIVQTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 VR G + ++ F L+ + I G T + L ++ + +A+ Sbjct: 844 VRFGALKYADDPEVLFYLDDLSTKWEVISVLQKDQPMGGNTYTAEALGFSDHMFTEARGS 903 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H + +I +TDGE + D + + +G +V A+G+ + Sbjct: 904 RLHKGVP-----QVLIVITDGE----SHDADKLNDTAKALRDKGILVLAVGIAGANPVEL 954 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 L S D+++ V+ L F + + Sbjct: 955 LAMAGSSDKYFFVETFGGLKGIFSDVSASVCN 986 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 71/212 (33%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 976 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFKKMKEFLASVVQDFDVSVNR- 1034 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAK 270 VR G FS FPL G + I +I N T L + Sbjct: 1035 --VRIGAAQFSHTYRPEFPLGTFIGKKEISFQIENIQQIFGYTHIGAALREVGDYFRPDM 1092 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG++ E E + +G +Y++G+ Sbjct: 1093 GSRINAGTP-----QVLLVLTDGQSQD------EVAQAAEELRHKGIDIYSVGIGDVDDQ 1141 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q ++ + D+ +V N +L RI + + Sbjct: 1142 QLIQITGTADKKLTVHNFDELTKVKKRIVRNI 1173 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 53/314 (16%), Positives = 93/314 (29%), Gaps = 37/314 (11%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 N N G Q T LR G IE ++ + Sbjct: 319 SARNGSRKNQGVPQIAVLVTHRPSEDNVTKAAVNLRREGVTIFTMGIEGASDSQLEKIAS 378 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DM 171 H + + F IT +V + S+ L D+ Sbjct: 379 HPAEQHVSKLK---TFSELAAHNQTFLKKLRNQITLTVSVFSERTETLKSGCVDTEEADI 435 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +++D S S + E++ + P VR G V ++ F Sbjct: 436 YLLIDGSGS------TQATDFHEMKTFLSEVVGMFNIAPQK---VRVGAVQYADSWDLEF 486 Query: 232 PLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + + I + G T + L + + AK++ + H ++ Sbjct: 487 EINKYTNKHDLGKAIENIRQMGGNTNTGAALNFTLGLLQKAKKQRGNRVPCH------LV 540 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 LT+G + L N + VYAIGV+ Q + R Y V + Sbjct: 541 VLTNG------MSKDSILEPANRLREELIRVYAIGVKEANQTQLREIAGEDKRVYYVHDF 594 Query: 349 RKLHDAFLRIGKEM 362 L D ++ +E+ Sbjct: 595 DALKDIRNQVVQEI 608 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 78/226 (34%), Gaps = 22/226 (9%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + + + + + D++ +LDVS + + + + Sbjct: 204 ATQYREVATDDILVEVCQGPSVADLVFLLDVS------VNGSQENFDYLKEFLE---ESV 254 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYN 264 ++ N +R GLVT+S++ L+ GV + + I L + T A Sbjct: 255 SALDIKENCMRVGLVTYSNETKVINSLSRGVNKSEVLQNIQNLSPRAGKAYTGA---AIR 311 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 KI + ++ + + + +T P+ DN +R G ++ +G+ Sbjct: 312 KIRKEVFSARNGSRKNQGVPQIAVLVT----HRPSEDN--VTKAAVNLRREGVTIFTMGI 365 Query: 325 QAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + + Q K + P ++ +L K++ Q L Sbjct: 366 EGASDSQLEKIASHPAEQHVSKLKTFSELAAHNQTFLKKLRNQITL 411 >gi|254819550|ref|ZP_05224551.1| hypothetical protein MintA_06484 [Mycobacterium intracellulare ATCC 13950] Length = 335 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 75/215 (34%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM +++ A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVQP-NRMAAAQEAAKQFADELTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + +++L F T + G+ A I + K I+ Sbjct: 152 VSPTTNRDSTKAALDKLQFADRTATGEGIFTALQAIATVGAVIGGGDKP---PPARIVLF 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + K + AK +G + I D+ LK Sbjct: 209 SDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ + ++L + + +++ + I Sbjct: 269 KVAQLSGGNAYNAASLQELKAVYATLQQQIGYETI 303 >gi|306824220|ref|ZP_07457590.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309801684|ref|ZP_07695804.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|304552423|gb|EFM40340.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221626|gb|EFO77918.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 967 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 57/260 (21%), Positives = 96/260 (36%), Gaps = 70/260 (26%) Query: 168 GLDMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLD----IIKSIPDVNNVVRSGLV 221 +D+ +VLDVS SMND FG K+ ++ LD +I D NN V+ LV Sbjct: 243 PIDIALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNNKVKVALV 302 Query: 222 TFSSKIVQTF-----------------------PLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++++I L ++ +N L T + Sbjct: 303 KYANQIGTATGADGCRISNSRQSDTGNCTQIVQELTTDAGLLKTSVNGLQAAGATYADAA 362 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFYCNEAKR 314 +E A + + KKY+IF TDGE + + + + ++ E K Sbjct: 363 MEVAQQALAGGRAGA----------KKYVIFFTDGEPNHWSGFDDDVANAAIKKSQELKN 412 Query: 315 RGAIVYAIGV-----------QAEAADQFLKNC----------------ASPDRFYSVQN 347 G VY+IG+ A A++F+ AS D +YS + Sbjct: 413 AGTTVYSIGIFDGANPSASVSSASNANKFMHGISSNYPNATGYRSLGDRASGDYYYSASS 472 Query: 348 SRKLHDAFLRIGKEMVKQRI 367 + +L F I K + ++ + Sbjct: 473 ATQLAQIFNDIQKTITEKHV 492 >gi|59712029|ref|YP_204805.1| von Willebrand factor type A domain-containing protein [Vibrio fischeri ES114] gi|59480130|gb|AAW85917.1| von Willebrand factor type A domain protein [Vibrio fischeri ES114] Length = 356 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 43/240 (17%), Positives = 80/240 (33%), Gaps = 24/240 (10%) Query: 139 FCTFPWC--ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGV 194 F WC + P ++ S +G D+M+V+D+S SM D LG Sbjct: 69 FLVVSWCLLIFALTKPTILGEPQTRES---LGRDVMVVVDLSGSMAEQDFVSKQSSDLGA 125 Query: 195 ATRSIREML--DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 + R +++ R GL+ F P + + + + Sbjct: 126 VKKISRLEATKEVLADFVKTRKGDRLGLILFGDAAFVQTPFTADQSVWLELLNQTDVAMA 185 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + A + E + +K I LTDG ++ ++ + Sbjct: 186 GQSTHLGDAIGLAIKVFEQSSEDKASAEENAKPREKVAIVLTDGNDTGSYVEP---IDAA 242 Query: 310 NEAKRRGAIVYAIGVQAEA--ADQFLK-----NCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A + ++ I + +Q L A S + + N +L A+ IG+ Sbjct: 243 KVAAAKDVRIHMIAMGDPRTVGEQALDMNIINRVAKESGGKAFQAINRDELEQAYDEIGE 302 >gi|329963581|ref|ZP_08301060.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528570|gb|EGF55541.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 327 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 53/273 (19%), Positives = 91/273 (33%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PF T+ +S+ + G+D+M+ Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFALRIIALVLIILVLARPQTTDSWQNSEIE-GIDIMLA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 MDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGESFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + + G T G+ A ++ D+K K K II L Sbjct: 145 TVDHAVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAK-----------SKVIILL 193 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------------ 332 TDG N+ +I L AK G +Y IGV + Sbjct: 194 TDGVNNKGDISP---LTAAEIAKSFGIRIYTIGVGTNGMAPYPYPVGGTVQYVNMPVEID 250 Query: 333 ---LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 251 EKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|145593798|ref|YP_001158095.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303135|gb|ABP53717.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 319 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 51/246 (20%), Positives = 87/246 (35%), Gaps = 36/246 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 P T+ V++ + +M+ +DVS SM P D+L A Sbjct: 60 LFLAMLALLVVGFARP---TAEVRVPRERAT---VMVAVDVSTSMLAGDVDP--DRLTAA 111 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 S R +D +PD N GLV F+ P + + E I+RL+ G+T Sbjct: 112 KESARRFVD---GLPDEFN---VGLVAFAGSAAVLVPPDTDREALDEGIDRLVEGATGVQ 165 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + A N A + L+ A + I+ L+DG N+S + + +A Sbjct: 166 GTAIGEAINTSLGAVKALDGEAAKDPPPAR-IVLLSDGANTSG----MDPMEAAADAVEM 220 Query: 316 GAIVYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 V+ I + Q L A + +F+ + +L + IG Sbjct: 221 EVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDAVARETGGQFHEAVSVEELRAVYDDIG 280 Query: 360 KEMVKQ 365 + + Sbjct: 281 SSVGYR 286 >gi|313203640|ref|YP_004042297.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442956|gb|ADQ79312.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 327 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 65/292 (22%), Positives = 102/292 (34%), Gaps = 55/292 (18%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYE-MPFI--FCTFPWCANSSHAPLLITSSVKISSKS 165 + SL I K + S + +PF+ + P S +++ Sbjct: 28 KSDASLQISAHRNLKQFPQSKKIKLRHIPFVLRILVIACLIIALARPQASNSWRTQNTE- 86 Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G+D+MM LD+S +M +L A E I S P+ N GLV F+ Sbjct: 87 --GIDIMMALDISSTMLAG-DIKPTRLEAAKSVATEF---ILSRPNDN----IGLVIFAR 136 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL + N + G T GL A N+I K Sbjct: 137 ESFTQCPLTTDHAVLVNLFNGVNNGMIEDGTAIGLGLANAVNRI-----------KDGKS 185 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-------------- 328 K II LTDG N+S +I + AK G +Y IGV Sbjct: 186 KSKVIILLTDGSNNSGDIAP---ITAAEIAKTFGIRIYTIGVGTHGVINIPVSTPMGIQY 242 Query: 329 -------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + L+N A + +++ ++ KL + + I ++ K RI + Sbjct: 243 QRVQSEFDAKSLENIANLTGGKYFGATDNSKLRNIYQEI-DKLEKTRIKIQQ 293 >gi|149187720|ref|ZP_01866017.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] gi|148838600|gb|EDL55540.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] Length = 340 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 78/239 (32%), Gaps = 38/239 (15%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKL 192 I T+ + P+ + + G D+M+V+D+S SM+ D+ G D+L Sbjct: 66 IIATWLMLVIAIAKPMWLDTPKTYQL---SGRDLMLVVDLSGSMSERDFLDNSGKEQDRL 122 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---F 249 ++++ R GL+ F P + +++ Sbjct: 123 TAVKS-------VLETFAAKREGDRLGLILFGDSAYLQSPFTADHEAWLALLDQAQVGMA 175 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + D E + +K I LTDG ++ + ++ Sbjct: 176 GESTHLGDAVGLTIKTYIDNPE--------NQTVEKVAIILTDGNDTDSLVPPIDA---A 224 Query: 310 NEAKRRGAIVYAIGVQAEA--ADQFLKNCA-------SPDRFYSVQNSRKLHDAFLRIG 359 A+ G +Y + + + DQ + + + + + L + I Sbjct: 225 KVAQAYGIRLYIVAMGSPNTTGDQAIDFSTIETMATVTGGQAFLAMSQEDLDAVYQTIS 283 >gi|225010241|ref|ZP_03700713.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005720|gb|EEG43670.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 330 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 53/281 (18%), Positives = 99/281 (35%), Gaps = 49/281 (17%) Query: 106 NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL--LITSSVKISS 163 +++ SL + K +N S +R + P +F ++ L SS + Sbjct: 25 QLLKQKKSLVPLKMPSTKGFNNSIYNRLK-PMLFILRLLSISALIIGLARPQNSSENTKT 83 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++ G+D++M +DVS SM ++L IK P+ R GLV + Sbjct: 84 ETTKGIDIVMAIDVSSSMLAKDLSP-NRLVALKEVASNF---IKDRPN----DRIGLVVY 135 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAYNKIFDAKEKLEHIAKGHDD 282 + + P+ + + E + + +G+ T T G+ A + K Sbjct: 136 AGESYTKTPITSDKRLVLEALKEIKYGALTDGTAIGMGLA---------TSVNRLKDSKA 186 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD------------ 330 K II LTDG N++ I+ + A Y IG+ Sbjct: 187 LSKVIILLTDGVNNAGFIEPATA---AELAVEYDIKTYTIGLGTNGNALSPIAYNPDGSF 243 Query: 331 -----------QFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A + +++ +++KL + I Sbjct: 244 RYGMAEVQIDEALLEQIATLTGGKYFRATDNKKLEAIYNEI 284 >gi|149197908|ref|ZP_01874957.1| BatA [Lentisphaera araneosa HTCC2155] gi|149139129|gb|EDM27533.1| BatA [Lentisphaera araneosa HTCC2155] Length = 341 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 47/231 (20%), Positives = 79/231 (34%), Gaps = 32/231 (13%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDV 212 + V + G+ + +++DVS SM+ + + ++ VA + + + Sbjct: 73 MTEVTKKRQDRQGIAIQVLVDVSSSMDINMKYGEERLTRMDVAKIVVEKFIGGDGDELVG 132 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 GL+TF+ PL+ + + + + + AY Sbjct: 133 RPDDLIGLITFARYADTIAPLSLAHEALISIVQDVTINTRPNEDG---TAYGDATALAAA 189 Query: 273 LEHIAKGHDDYK-KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------ 325 + +G D K K II LTDGEN+ N L + AK G +Y I +Q Sbjct: 190 QLDLLQGDQDIKSKIIILLTDGENNCGNHLP---LQAASLAKEWGIKIYTISIQNKPTPE 246 Query: 326 --------------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A DQ LK A + F + L + I K Sbjct: 247 RKKTDQGTFFVPPTPSAGDQVLKKMAESTGGVFRLAHDYDSLKSVYKEINK 297 >gi|118617151|ref|YP_905483.1| hypothetical protein MUL_1490 [Mycobacterium ulcerans Agy99] gi|166979868|sp|A0PNU3|Y1490_MYCUA RecName: Full=UPF0353 protein MUL_1490 gi|118569261|gb|ABL04012.1| membrane protein [Mycobacterium ulcerans Agy99] Length = 335 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 38/259 (14%), Positives = 85/259 (32%), Gaps = 31/259 (11%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGP 187 + P + P + L + + I + +M+V+DVS SM Sbjct: 56 VAPQRPSRYRHIPAMLLALSLVLFTVAMAGPTHDVRIPRNRAVVMLVIDVSQSMRATDVE 115 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +++ A + ++ D + + GL+ ++ + + +++L Sbjct: 116 P-NRMVAAQEAAKQFADELTP------GINLGLIAYAGTATVLVSPTTNREATKAALDKL 168 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKES 305 F T + + A I I G I+ +DG+ + P + K + Sbjct: 169 QFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGA 225 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSR 349 AK +G + I D+ +K A S Y+ Sbjct: 226 YTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLA 285 Query: 350 KLHDAFLRIGKEMVKQRIL 368 +L+ ++ + +++ + I Sbjct: 286 ELNSVYVSLQQQIGYETIR 304 >gi|327399949|ref|YP_004340788.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315457|gb|AEA46073.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 527 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 48/208 (23%), Positives = 84/208 (40%), Gaps = 31/208 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +D++ LD S SM + G+ K A +S + L+ + ++G+V++ Sbjct: 67 TSVPIDVVFALDSSGSMGWNDPSGLRK--TAAKSFVDKLN--------STTDQAGVVSWD 116 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + I T L ++ KI+ + T GL A + + K+ Sbjct: 117 NNIDFTQTLTNNFSLVKSKIDAVDSSGGTDLNVGLNAAISLLDTGKQANSSW-------- 168 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNCA--SP 339 IIFL++G+ + S A +G VY IG+ LK+ A + Sbjct: 169 -VIIFLSNGQGTY-------SHSTAVVAANKGYTVYTIGLAISPGSTAESNLKDIANTTG 220 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++YS N+ L F I KE+V I Sbjct: 221 GKYYSSPNATNLDAVFNDIYKEVVTSTI 248 >gi|51244490|ref|YP_064374.1| hypothetical protein DP0638 [Desulfotalea psychrophila LSv54] gi|50875527|emb|CAG35367.1| conserved hypothetical membrane protein (BatA) [Desulfotalea psychrophila LSv54] Length = 328 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 45/242 (18%), Positives = 79/242 (32%), Gaps = 55/242 (22%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREML 203 + P L ++ +I S G+D+++ +DVS SM G ++L V Sbjct: 69 ALARPQLGNTTREIKS---SGIDILLAVDVSGSMQAMDFTLNGKRTNRLEVVK------- 118 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLE 260 D++ GLV F+ + P + +++ L G T + Sbjct: 119 DVMAKFISQRPNDSIGLVAFAGRPYVVCPPTLDHNWLTLRLHSLSIGMIEDGTAIGSAIG 178 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 N++ + K + II LTDG N++ + L AK VY Sbjct: 179 TGVNRLREKK-----------SPSQIIILLTDGINNAGKVPP---LIAAEAAKSFKVKVY 224 Query: 321 AIGVQAEAA----------------------DQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 IG D+ L A + R++ ++ L + Sbjct: 225 TIGAGTRGEAPIPITDAFGRRQLVRARVDIDDKTLSKVAQITGARYFRATDTESLEKVYA 284 Query: 357 RI 358 I Sbjct: 285 EI 286 >gi|169826904|ref|YP_001697062.1| hypothetical protein Bsph_1324 [Lysinibacillus sphaericus C3-41] gi|168991392|gb|ACA38932.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 825 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 86/217 (39%), Gaps = 35/217 (16%) Query: 157 SSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 ++I K + L +++VLD S SM+ KL +A + +++++ Sbjct: 354 VEMEIKGKEQLPSLGLVIVLDRSGSMSGS------KLELAKEAAARSVEMLRDEDT---- 403 Query: 216 VRSGLVTFSSK---IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 G + F + I++T PL + + I + G T+ L AY + D K + Sbjct: 404 --LGFIAFDDRPWEIIETGPL-NNKEEAVDTILSVTPGGGTEIYGSLAKAYENLADMKLQ 460 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +H II LTDG++ N D+ + K G + + + +A Sbjct: 461 RKH-----------IILLTDGQSQPGNYDD-----LIEQGKDNGITLSTVAIGQDADANL 504 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L+ + RFY+V + + + R + + I Sbjct: 505 LEALSEMGSGRFYNVIDEQTIPSILSRETAMISRTYI 541 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 60/185 (32%), Gaps = 34/185 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ ++D S SMN + + S+ + +GL +FSS + Sbjct: 27 QIVYLVDRSASMNGTEDEMVQFIQ-------------DSLQSKKDEQLAGLYSFSSTLQT 73 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + ++ + + + T L+ A I K ++ Sbjct: 74 EAIMTKTLKEVPKF-TEIKATDQTNIEQSLQLATGIIDPKKATR-------------LVL 119 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDRFYSVQNS 348 LTDG + + +L + + K V + + LK+ SP Y V Sbjct: 120 LTDGNETKGS-----ALDFATKFKGSNISVDVVPFSQPVVNDVSLKSFVSPQVAY-VGEQ 173 Query: 349 RKLHD 353 ++L Sbjct: 174 QQLVT 178 >gi|150007595|ref|YP_001302338.1| hypothetical protein BDI_0948 [Parabacteroides distasonis ATCC 8503] gi|255013876|ref|ZP_05286002.1| hypothetical protein B2_08207 [Bacteroides sp. 2_1_7] gi|256839782|ref|ZP_05545291.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|149936019|gb|ABR42716.1| conserved hypothetical protein BatA [Parabacteroides distasonis ATCC 8503] gi|256738712|gb|EEU52037.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 328 Score = 86.0 bits (211), Expect = 7e-15, Method: Composition-based stats. Identities = 53/251 (21%), Positives = 84/251 (33%), Gaps = 53/251 (21%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 P S S++ G+D+M+ +D+S SM P ++L A Sbjct: 60 LRMVAVALLIVILARPQSTNSWSNSSTE---GIDIMLAMDISGSMLAQDLKP--NRLEAA 114 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---ST 252 D+ S + GLV FS++ PL + + G Sbjct: 115 K-------DVAASFINGRPNDNIGLVVFSAESFTQCPLTTDHTVLLNLFKDIQSGMIQDG 167 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GL A ++I D+ K K II LTDG N++ I + A Sbjct: 168 TAIGLGLANAVSRIKDSHAK-----------SKVIILLTDGSNNAGEIAP---VTAAEIA 213 Query: 313 KRRGAIVYAIGVQAEAADQF---------------------LKNCAS--PDRFYSVQNSR 349 K G VY IGV + + LK AS +++ ++ Sbjct: 214 KTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVEIDEATLKQIASTTGGQYFRATDNA 273 Query: 350 KLHDAFLRIGK 360 L + + I + Sbjct: 274 SLKEIYSEIDQ 284 >gi|149922008|ref|ZP_01910450.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] gi|149817173|gb|EDM76653.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] Length = 996 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 50/231 (21%), Positives = 87/231 (37%), Gaps = 32/231 (13%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 F W ++ L + S + + L +++V+D S SM G +D + A Sbjct: 496 ENSFGVGGWGGSTIEQVLPVRFSGERQRE-QPTLALILVIDKSGSM--SSGDRLDLVKEA 552 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTF--SSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 R+ LD I G++ F S +++ A I I RL G T Sbjct: 553 ARATARTLDPSDEI---------GVIAFDNSPQVLVRLQPAANRLRISSSIRRLSAGGGT 603 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + P L AY ++ G K++I L+DGE+ I+ + + Sbjct: 604 NAMPALREAYLQLA-----------GSKALVKHVILLSDGESPENGINALLG-----DMR 647 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 + V ++GV A FL A R++ ++ + F R +E+ Sbjct: 648 QSDITVSSVGVGDGAGKDFLIRVAERGRGRYFYSEDGTDVPRIFSREAREV 698 >gi|171742038|ref|ZP_02917845.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283456833|ref|YP_003361397.1| hypothetical protein BDP_2000 [Bifidobacterium dentium Bd1] gi|171277652|gb|EDT45313.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283103467|gb|ADB10573.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium dentium Bd1] Length = 967 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 58/273 (21%), Positives = 97/273 (35%), Gaps = 70/273 (25%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLD----IIKS 208 I S+ +D+ +VLDVS SMND FG K+ ++ LD + Sbjct: 230 IGKDASTSTTDTTPIDIALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDT 289 Query: 209 IPDVNNVVRSGLVTFSSKIVQTF-----------------------PLAWGVQHIQEKIN 245 I D N+ V+ LV ++++I L ++ +N Sbjct: 290 IEDDNDKVKVALVKYANQIGTATGADGCRISNSRQSDTGNCTQIVQELTTDAGLLKTSVN 349 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN----ID 301 L T + +E A + + KKY+IF TDGE + + Sbjct: 350 GLQAAGATYADAAMEVAQQALAGGRAGA----------KKYVIFFTDGEPNHWSGFDGDV 399 Query: 302 NKESLFYCNEAKRRGAIVYAIGV-----------QAEAADQFLKNC-------------- 336 ++ E K G VY+IG+ A A++F+ Sbjct: 400 ANAAIKKSQELKNAGTTVYSIGIFDGANPSASVSSASNANKFMHGISSNYPNATGYWNLG 459 Query: 337 --ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 AS D +YS ++ +L F I K + ++ + Sbjct: 460 DRASGDYYYSASSATQLAQIFNDIQKTITEKHV 492 >gi|237716505|ref|ZP_04546986.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262408103|ref|ZP_06084651.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645097|ref|ZP_06722823.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809498|ref|ZP_06768201.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|298484179|ref|ZP_07002345.1| BatA protein [Bacteroides sp. D22] gi|229444152|gb|EEO49943.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262354911|gb|EEZ04003.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639603|gb|EFF57895.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443316|gb|EFG12080.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|295084189|emb|CBK65712.1| von Willebrand factor type A domain. [Bacteroides xylanisolvens XB1A] gi|298269683|gb|EFI11278.1| BatA protein [Bacteroides sp. D22] Length = 327 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 49/270 (18%), Positives = 90/270 (33%), Gaps = 47/270 (17%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PF+ T++ +S+ + G+D+M+ Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFLLRCIALVLVILVLARPQTTNKWQNSEIE-GIDIMLA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 IDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGETFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + I+ + G T T + K K II LTDG Sbjct: 145 TVDHAVLLDMIHNIKCGLITDGT--------AVGMGIANAVTRLKDSKAKSKVIILLTDG 196 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------LKNCA--------- 337 N+ +I + AK G VY IGV + ++ + Sbjct: 197 TNNKGDISP---MTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEIDEKT 253 Query: 338 -------SPDRFYSVQNSRKLHDAFLRIGK 360 + ++ ++ KL + + I K Sbjct: 254 LTEIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|260592520|ref|ZP_05857978.1| BatA protein [Prevotella veroralis F0319] gi|260535566|gb|EEX18183.1| BatA protein [Prevotella veroralis F0319] Length = 318 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 51/218 (23%), Positives = 87/218 (39%), Gaps = 40/218 (18%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+M+ +D+S SM +++ VA D I P+ N GL F+ + Sbjct: 87 GIDIMLTMDISASMLTEDVFP-NRIEVAKEVAS---DFISGRPNDN----IGLTIFAGEA 138 Query: 228 VQTFPLAWGVQHI--------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 P+ + + + + + T GL + +++ D+K K Sbjct: 139 FTQCPMTVDHAALLNLLHNVRTDLVVKGLIQDGTAIGMGLANSVSRLKDSKAK------- 191 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------- 332 K II LTDG N+ +I + + AK+ G +Y IG+ E+ Sbjct: 192 ----SKVIILLTDGSNNVGSISP---MTAASIAKKYGIRIYTIGLGKESEGDLGAIDYKT 244 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGK-EMVKQRI 367 L+N A + FY Q+ +L + I K E K R+ Sbjct: 245 LQNIAVSTNGEFYRAQSQAELSKIYQDIDKLEKTKLRV 282 >gi|153806291|ref|ZP_01958959.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] gi|149130968|gb|EDM22174.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] Length = 327 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 55/273 (20%), Positives = 95/273 (34%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PF+ T++ +S+ + G+D+M+ Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFLLRVIALVLVILVLARPQTTNKWQNSEIE-GIDIMLA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 IDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGETFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + I+ + G T G+ A ++ D+K K K II L Sbjct: 145 TVDHAVLLDMIHNIKCGLIEDGTAVGMGIANAVTRLKDSKAK-----------SKVIILL 193 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------------ 332 TDG N+ +I L AK G VY IGV + Sbjct: 194 TDGTNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEID 250 Query: 333 ---LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 251 EKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|121637412|ref|YP_977635.1| hypothetical protein BCG_1543 [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224989887|ref|YP_002644574.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] gi|166979775|sp|A1KIS1|Y1543_MYCBP RecName: Full=UPF0353 protein BCG_1543 gi|254800546|sp|C1ANC7|Y1518_MYCBT RecName: Full=UPF0353 protein JTY_1518 gi|121493059|emb|CAL71530.1| Probable membrane protein [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224773000|dbj|BAH25806.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] Length = 335 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM ++ A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVEP-SRMVAAQEAAKQFADELTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + +++L F T + + A I I G I+ Sbjct: 152 VSPTTNREATKNALDKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLF 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + K + AK +G + I D+ +K Sbjct: 209 SDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEIDDQRQPVPVDDETMK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ +L + + +++ + I Sbjct: 269 KVAQLSGGNSYNAATLAELRAVYSSLQQQIGYETI 303 >gi|149200157|ref|ZP_01877181.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] gi|149136798|gb|EDM25227.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] Length = 348 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 52/269 (19%), Positives = 96/269 (35%), Gaps = 57/269 (21%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 SR+ +P F T S D +D++ LD+S SM+ + P + Sbjct: 50 SRFLIPIFFEFLALSLMIFALARPRTGEENSYSYKD-SVDIVFSLDISGSMSSYDQP--E 106 Query: 191 KLGVATRSIREMLDIIKSIP-------------DVNNVVRSGLVTFSSKIVQTFPLAWGV 237 L V R I E ++ + P D R GLV F ++ P Sbjct: 107 DLAVNRRVIAEAINNKELHPRLHYAKKSIADFIDKRKSDRLGLVVFGAEAYSVCPPTNDH 166 Query: 238 QHIQEKINRLIF------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 +++Q ++ + T T + ++ +K KK II +T Sbjct: 167 EYLQNRLKEISTEYLGDYNRQTNITAAISGGLARLRKSK-----------APKKIIILVT 215 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--------------------QAEAADQ 331 DG +++ + N A + A++Y IGV ++ ++ Sbjct: 216 DGSHTANS--NLTPRMAAKAAAKSDAVIYTIGVGNEVAWNVENFFGSSRLNASNSDFDEE 273 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 LK A + ++SV+ + ++ D +I Sbjct: 274 LLKEIAEKTGGLYFSVREAEQMKDVLKKI 302 >gi|262381906|ref|ZP_06075044.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298375541|ref|ZP_06985498.1| BatA protein [Bacteroides sp. 3_1_19] gi|301310439|ref|ZP_07216378.1| BatA protein [Bacteroides sp. 20_3] gi|262297083|gb|EEY85013.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298268041|gb|EFI09697.1| BatA protein [Bacteroides sp. 3_1_19] gi|300832013|gb|EFK62644.1| BatA protein [Bacteroides sp. 20_3] Length = 328 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 53/251 (21%), Positives = 84/251 (33%), Gaps = 53/251 (21%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 P S S++ G+D+M+ +D+S SM P ++L A Sbjct: 60 LRMVAVALLIVILARPQSTNSWSNSSTE---GIDIMLAMDISGSMLAQDLKP--NRLEAA 114 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---ST 252 D+ S + GLV FS++ PL + + G Sbjct: 115 K-------DVAASFINGRPNDNIGLVVFSAESFTQCPLTTDHTVLLNLFKDIQSGMIQDG 167 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GL A ++I D+ K K II LTDG N++ I + A Sbjct: 168 TAIGLGLANAVSRIKDSHAK-----------SKVIILLTDGSNNAGEIAP---VTAAEIA 213 Query: 313 KRRGAIVYAIGVQAEAADQF---------------------LKNCAS--PDRFYSVQNSR 349 K G VY IGV + + LK AS +++ ++ Sbjct: 214 KTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVEIDEATLKQIASTTGGQYFRATDNA 273 Query: 350 KLHDAFLRIGK 360 L + + I + Sbjct: 274 SLKEIYSEIDQ 284 >gi|15608619|ref|NP_215997.1| hypothetical protein Rv1481 [Mycobacterium tuberculosis H37Rv] gi|31792676|ref|NP_855169.1| hypothetical protein Mb1517 [Mycobacterium bovis AF2122/97] gi|148661274|ref|YP_001282797.1| hypothetical protein MRA_1491 [Mycobacterium tuberculosis H37Ra] gi|148822701|ref|YP_001287455.1| hypothetical protein TBFG_11510 [Mycobacterium tuberculosis F11] gi|167968028|ref|ZP_02550305.1| hypothetical membrane protein [Mycobacterium tuberculosis H37Ra] gi|215403336|ref|ZP_03415517.1| hypothetical protein Mtub0_06533 [Mycobacterium tuberculosis 02_1987] gi|215411140|ref|ZP_03419948.1| hypothetical protein Mtub9_07385 [Mycobacterium tuberculosis 94_M4241A] gi|215426820|ref|ZP_03424739.1| hypothetical protein MtubT9_10680 [Mycobacterium tuberculosis T92] gi|215430374|ref|ZP_03428293.1| hypothetical protein MtubE_06801 [Mycobacterium tuberculosis EAS054] gi|215445676|ref|ZP_03432428.1| hypothetical protein MtubT_06934 [Mycobacterium tuberculosis T85] gi|218753198|ref|ZP_03531994.1| hypothetical protein MtubG1_07054 [Mycobacterium tuberculosis GM 1503] gi|219557390|ref|ZP_03536466.1| hypothetical protein MtubT1_08827 [Mycobacterium tuberculosis T17] gi|253799469|ref|YP_003032470.1| hypothetical protein TBMG_02500 [Mycobacterium tuberculosis KZN 1435] gi|254231712|ref|ZP_04925039.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|254364352|ref|ZP_04980398.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|254550498|ref|ZP_05140945.1| hypothetical protein Mtube_08557 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|260186427|ref|ZP_05763901.1| hypothetical protein MtubCP_10429 [Mycobacterium tuberculosis CPHL_A] gi|260204765|ref|ZP_05772256.1| hypothetical protein MtubK8_10713 [Mycobacterium tuberculosis K85] gi|289447084|ref|ZP_06436828.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289554729|ref|ZP_06443939.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289569506|ref|ZP_06449733.1| membrane protein [Mycobacterium tuberculosis T17] gi|289574162|ref|ZP_06454389.1| membrane protein [Mycobacterium tuberculosis K85] gi|289745232|ref|ZP_06504610.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289750042|ref|ZP_06509420.1| membrane protein [Mycobacterium tuberculosis T92] gi|289753564|ref|ZP_06512942.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289757593|ref|ZP_06516971.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|289761639|ref|ZP_06521017.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|294993225|ref|ZP_06798916.1| hypothetical protein Mtub2_01637 [Mycobacterium tuberculosis 210] gi|297634047|ref|ZP_06951827.1| hypothetical protein MtubK4_07987 [Mycobacterium tuberculosis KZN 4207] gi|297731033|ref|ZP_06960151.1| hypothetical protein MtubKR_08072 [Mycobacterium tuberculosis KZN R506] gi|298524990|ref|ZP_07012399.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|306775670|ref|ZP_07414007.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|306779490|ref|ZP_07417827.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|306784220|ref|ZP_07422542.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|306788587|ref|ZP_07426909.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|306792930|ref|ZP_07431232.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|306797308|ref|ZP_07435610.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|306803189|ref|ZP_07439857.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|306967588|ref|ZP_07480249.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|306971779|ref|ZP_07484440.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|307079498|ref|ZP_07488668.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|307084057|ref|ZP_07493170.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|313658366|ref|ZP_07815246.1| hypothetical protein MtubKV_08092 [Mycobacterium tuberculosis KZN V2475] gi|54040185|sp|P64856|Y1517_MYCBO RecName: Full=UPF0353 protein Mb1517 gi|54042534|sp|P64855|Y1481_MYCTU RecName: Full=UPF0353 protein Rv1481/MT1528 gi|166979870|sp|A5U2I5|Y1491_MYCTA RecName: Full=UPF0353 protein MRA_1491 gi|3261503|emb|CAA16011.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium tuberculosis H37Rv] gi|31618266|emb|CAD96184.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97] gi|124600771|gb|EAY59781.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|134149866|gb|EBA41911.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|148505426|gb|ABQ73235.1| putative membrane protein [Mycobacterium tuberculosis H37Ra] gi|148721228|gb|ABR05853.1| hypothetical membrane protein [Mycobacterium tuberculosis F11] gi|253320972|gb|ACT25575.1| membrane protein [Mycobacterium tuberculosis KZN 1435] gi|289420042|gb|EFD17243.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289439361|gb|EFD21854.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289538593|gb|EFD43171.1| membrane protein [Mycobacterium tuberculosis K85] gi|289543260|gb|EFD46908.1| membrane protein [Mycobacterium tuberculosis T17] gi|289685760|gb|EFD53248.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289690629|gb|EFD58058.1| membrane protein [Mycobacterium tuberculosis T92] gi|289694151|gb|EFD61580.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289709145|gb|EFD73161.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|289713157|gb|EFD77169.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|298494784|gb|EFI30078.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|308215767|gb|EFO75166.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|308327531|gb|EFP16382.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|308330994|gb|EFP19845.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|308334816|gb|EFP23667.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|308338604|gb|EFP27455.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|308342306|gb|EFP31157.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|308350100|gb|EFP38951.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|308354737|gb|EFP43588.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|308358644|gb|EFP47495.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|308362622|gb|EFP51473.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|308366304|gb|EFP55155.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|323719929|gb|EGB29041.1| membrane protein [Mycobacterium tuberculosis CDC1551A] gi|326903107|gb|EGE50040.1| membrane protein [Mycobacterium tuberculosis W-148] gi|328459217|gb|AEB04640.1| membrane protein [Mycobacterium tuberculosis KZN 4207] Length = 335 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM ++ A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVEP-SRMVAAQEAAKQFADELTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + +++L F T + + A I I G I+ Sbjct: 152 VSPTTNREATKNALDKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLF 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + K + AK +G + I D+ +K Sbjct: 209 SDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ +L + + +++ + I Sbjct: 269 KVAQLSGGNSYNAATLAELRAVYSSLQQQIGYETI 303 >gi|167034052|ref|YP_001669283.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166860540|gb|ABY98947.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 324 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 83/234 (35%), Gaps = 35/234 (14%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIRE 201 + P+L+ ++ D+M+ +D+S SM D G D+L +R+ Sbjct: 71 AACARPVLVEKPIQR---EQPIRDLMLAIDISQSMEATDYTDANGAKSDRLSAVKSVVRD 127 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 + R GL+ F + PL + ++ + G+ Sbjct: 128 FI-------ARRKDDRIGLIVFGTGAYPQAPLTLDHASLLLLLDEV--------GIGMAG 172 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + DA + + +K +I LTDG ++S I + A+ G +V+ Sbjct: 173 PNTALGDAIGLTIKALEKTPEQEKVLILLTDGNDTSSAITPDHAAHL---AQANGIVVHT 229 Query: 322 IGVQ-----AEAADQF--LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 IG+ +A L+ A + +F+ + + L + + + + Sbjct: 230 IGIGDPQATGDAKVDLTTLQAIARTTGGQFFRADDRQALQQVYATLDRLTPHKV 283 >gi|113867618|ref|YP_726107.1| von Willebrand factor type A domain-containing protein [Ralstonia eutropha H16] gi|113526394|emb|CAJ92739.1| von Willebrand factor (vWF) type A domain [Ralstonia eutropha H16] Length = 345 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 42/232 (18%), Positives = 84/232 (36%), Gaps = 27/232 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVA 195 + + P + + ++ K D+++ LD+S SM+ FG L Sbjct: 65 LAPLAWALVVTALARPQFLEAPIQ---KVQPARDLLIALDLSQSMDTRDFGDPSGALIPR 121 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 +++R ++ R GL+ F P Q +Q I L+ Sbjct: 122 VQAVR---QVVSGFVARRPGDRIGLIVFGDAPYPLAPFTLDHQLVQTLITGLL------- 171 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 PG+ + DA + + + +K +I LTDG +++ + + + AK R Sbjct: 172 -PGMAGPSTALGDAIGLGIKMFEHSEAPEKVLIVLTDGNDTASRMPPERAGGI---AKER 227 Query: 316 GAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 +V+ IG+ A L+ A + R++ + L + + Sbjct: 228 KVVVHTIGIGDPNASGEEKVDLGVLQRLAAQTGGRYFFGADQAGLETIYATL 279 >gi|21539497|gb|AAM53301.1| putative protein [Arabidopsis thaliana] gi|23198328|gb|AAN15691.1| putative protein [Arabidopsis thaliana] Length = 704 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 60/281 (21%), Positives = 97/281 (34%), Gaps = 40/281 (14%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP-----FIFCTFPWCANSSHAPLLI 155 F D + S +D L + E+ F F N P Sbjct: 179 FNDDEALEHQDRSAESGLDKPGVTGTLEVKTYPEISEVVRSVSFKDFAVLINLKA-PTSS 237 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 SS SS S +D++ VLDVS SM KL + R++ ++ + Sbjct: 238 KSSSNPSSSSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQNLGPFD----- 286 Query: 216 VRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R +++FSS + FPL G Q + +N L+ T GL+ + D Sbjct: 287 -RLSVISFSSTARRNFPLRLMTETGKQEALQAVNSLVSNGGTNIAEGLKKGARVLID--- 342 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL------FYCNEAKRRGAIVYAIGVQ 325 K I+ L+DG+++ S E V+A G Sbjct: 343 ---RRFKNPVSS---IVLLSDGQDTYTMTSPNGSRGTDYKALLPKEINGNRIPVHAFGFG 396 Query: 326 AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 A+ + + A S F +++ + DAF + IG + Sbjct: 397 ADHDASLMHSIAENSGGTFSFIESETVIQDAFAQCIGGLLS 437 >gi|15239414|ref|NP_200879.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|9759335|dbj|BAB09844.1| retroelement pol polyprotein-like [Arabidopsis thaliana] gi|332009986|gb|AED97369.1| C3H4 type zinc finger protein [Arabidopsis thaliana] Length = 704 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 60/281 (21%), Positives = 97/281 (34%), Gaps = 40/281 (14%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP-----FIFCTFPWCANSSHAPLLI 155 F D + S +D L + E+ F F N P Sbjct: 179 FNDDEALEHQDRSAESGLDKPGVTGTLEVKTYPEISEVVRSVSFKDFAVLINLKA-PTSS 237 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 SS SS S +D++ VLDVS SM KL + R++ ++ + Sbjct: 238 KSSSNPSSSSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQNLGPFD----- 286 Query: 216 VRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R +++FSS + FPL G Q + +N L+ T GL+ + D Sbjct: 287 -RLSVISFSSTARRNFPLRLMTETGKQEALQAVNSLVSNGGTNIAEGLKKGARVLID--- 342 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL------FYCNEAKRRGAIVYAIGVQ 325 K I+ L+DG+++ S E V+A G Sbjct: 343 ---RRFKNPVSS---IVLLSDGQDTYTMTSPNGSRGTDYKALLPKEINGNRIPVHAFGFG 396 Query: 326 AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 A+ + + A S F +++ + DAF + IG + Sbjct: 397 ADHDASLMHSIAENSGGTFSFIESETVIQDAFAQCIGGLLS 437 >gi|297560911|ref|YP_003679885.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296845359|gb|ADH67379.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 315 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 46/226 (20%), Positives = 79/226 (34%), Gaps = 32/226 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + +M+ +DVSLSM + ++L A +S + ++ + PD N Sbjct: 75 PAMPVQQPRERATIMVAVDVSLSMAANDIDP-NRLEAAKKSAQGFVETL---PDRFN--- 127 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 GLV FSS Q + I L G T G+ + I E + Sbjct: 128 VGLVAFSSTATVVSSPTHDHQAVIGSIENLQLGPGTAIGEGVFASLESISSFDEDADVDP 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------ 325 I+ L+DGEN+S ++ A + V I Sbjct: 188 PPSA-----IVLLSDGENTSG----RDISQAVAMAAEQEVPVSTIAFGTGAAMIEIDGYQ 238 Query: 326 --AEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A+ + L+ AS FY ++ +L + + IG + + + Sbjct: 239 VPADIDKEALRGLASDTGGHFYEAESETELDEVYEDIGSSLGTELV 284 >gi|116252440|ref|YP_768278.1| hypothetical protein RL2693 [Rhizobium leguminosarum bv. viciae 3841] gi|115257088|emb|CAK08182.1| conserved hypothetical exported protein [Rhizobium leguminosarum bv. viciae 3841] Length = 427 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 52/431 (12%), Positives = 131/431 (30%), Gaps = 104/431 (24%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL--------YTA 60 F + G+ I+TA+L+ +F G+ ++ +H ++ +L+ D + + A Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 + NG +NI+ + EL ++ +++ Sbjct: 68 AAMTMSGNGTI------SLGKDDARNIFMSQMSGEL---------TDVHIDLGINVTKTA 112 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + +S + F + I+ + ++ +D ++LD + S Sbjct: 113 NKLNSQVSFSATVPTTF-------MRILGRDSITISGAATAEYQTAAFMDFYILLDNTPS 165 Query: 181 MN----------------------------DHFGPGMD-----KLGVATRSIREMLDIIK 207 M +++ ++ V ++ + + D K Sbjct: 166 MGVGATANDVSKLQAKTGCAFACHQMDQSTNNYTIAKGLGVAMRIDVVRQATQALTDTAK 225 Query: 208 SIPDVNNVVRSGLVTFSSKIV-----QTFPLAWGVQHIQEKINRLIF------GSTTKST 256 + ++ R G+ TF +K + ++ + + Sbjct: 226 TERVSSDQFRMGVYTFGTKAEDAKLTTISSPTSDLTKVKNYTDTVDLMTIPYQNYNQDQL 285 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG------------ENSSPNIDNKE 304 + A ++ + + +K + F++DG + + Sbjct: 286 TSFDSALTQM-NTIIDPAGDGTSNISPEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPI 344 Query: 305 SLFYCNEAKRRGAIV---YAIGVQAEAADQF--------------LKNCASPDRFYSVQN 347 +C K RG + Y + + D + ++ CASP ++ V Sbjct: 345 DTSFCKPLKDRGVKIAVLYTTYLPLPSNDWYNKWISPFQSEIPTKMQACASPGFYFEVTP 404 Query: 348 SRKLHDAFLRI 358 + + DA + Sbjct: 405 TEGITDAMKAL 415 >gi|319956579|ref|YP_004167842.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418983|gb|ADV46093.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 560 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 77/208 (37%), Gaps = 30/208 (14%) Query: 158 SVKISSKS---DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 V S++ G + ++LD S SM + + +A + + + +D Sbjct: 66 PVTRKSETLHQARGHAVALLLDASYSMREG-----GRFDIARKVLLDFID-------RRP 113 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG--STTKSTPGLEYAYNKIFDAKEK 272 R L F+ P+++ + ++ + L G + Y +F Sbjct: 114 KDRIALEVFADYAYLAAPMSYEKKGLKTILAALEPGVVGGRDTAL-----YEALFLGARL 168 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K + +I LTDG ++ NI + ++ E KR VY +GV + Sbjct: 169 ---FKKEEGRSNRVMILLTDGIDTVGNIPLEAAIR---ELKRAHIRVYTVGVGDDFRRGV 222 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A + RFY + L + + RI Sbjct: 223 LEKIARSTGGRFYDARYPEALANIYRRI 250 >gi|115525407|ref|YP_782318.1| hypothetical protein RPE_3406 [Rhodopseudomonas palustris BisA53] gi|115519354|gb|ABJ07338.1| conserved hypothetical protein [Rhodopseudomonas palustris BisA53] Length = 580 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 57/160 (35%), Gaps = 23/160 (14%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIF 289 + + ++ L T GL + + + K Y+ I+ Sbjct: 419 TEMNNNWATMNTTVDGLFPVGGTNQPIGLVWGWQSLVGGGPFPTPPVKDEQYTYQDIIVL 478 Query: 290 LTDGEN-----------SSPNIDNK----ESLFYCNEAKRRGAIVYAIGVQAEAA--DQF 332 ++DG N ++ ++DN+ + C K G VY + V + Sbjct: 479 MSDGLNTVDRWYGNGWDTNTSVDNRMYASATTGTCVNVKAAGIKVYTVHVNTNGSPESTL 538 Query: 333 LKNCASPD-----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 LKNCASP F V ++ L+ AF I ++ R+ Sbjct: 539 LKNCASPADDGGKEFQMVTSASGLNAAFNSIATKLTDLRV 578 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 75/221 (33%), Gaps = 29/221 (13%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F + G+I++L I + +G ++ S + + LD + L A Sbjct: 15 RFIADRSGNIAVLFGIACVPLITFVGAAVDYSRAVAARTAMQSALDSTALMVAKDYSLN- 73 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + + + I ++ N + ++++ + K + Sbjct: 74 --KISASEIDGKAKSIFSALYTNKSANSVE---------------VVAVLTPNTGKGSTI 116 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 ++P F N S + +S+ S L + +VLD + SMND+ Sbjct: 117 KVDGTGKVP---TDFMKLVNISQIDIGASSTTTWGS---TRLRVALVLDTTGSMNDN--- 167 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 K+G + + +L +K V ++ FS + Sbjct: 168 --GKIGALKTATQNLLTQLKDAAGKPEDVYVSIIPFSKDVN 206 >gi|328951280|ref|YP_004368615.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451604|gb|AEB12505.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 320 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 78/223 (34%), Gaps = 38/223 (17%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 V +++ +D S SM + P +L A + RE I+++P V Sbjct: 82 PVLPFPAYRSVATVVLAIDTSRSMRAEDLEP--SRLEAAKAAAREF---IRAMPPG---V 133 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 GLV FSS P + +++ ++ L T GL A + Sbjct: 134 EVGLVAFSSYATLLQPPTTDRERLEQAVDLLDLAHRTAIGDGLVAALRVLPLEDSDAPGG 193 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA------- 329 ++ L+DG N+ + L +A+ +G VY +GV Sbjct: 194 MS--------VVLLSDGRNNYG----IDPLEAARQAEAQGVRVYTVGVGLSENTYVFANG 241 Query: 330 --------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ L+ A + +Y ++ +L + + + + Sbjct: 242 YYIRAGLDEETLQEIAALTGGAYYRASSADELRAVYQTLARAV 284 >gi|291008772|ref|ZP_06566745.1| hypothetical protein SeryN2_29978 [Saccharopolyspora erythraea NRRL 2338] Length = 324 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 78/231 (33%), Gaps = 28/231 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + + +M+ +DVSLSM ++L A + +E D + Sbjct: 69 LTVALAGPTAEQRIPRNRATVMLTVDVSLSMKATDVEP-NRLEAAKVAAKEFADQLTP-- 125 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + GLV+F+ +++ I+ L T + G+ A + I Sbjct: 126 ----GINLGLVSFAGTATVLVMPTTDRASVKQAIDNLKLSEATATGDGINAAMSAIDSFG 181 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQ--- 325 + + I+ + DG + P + + EAK+ + I Sbjct: 182 KMVGG---PSGAPPARIVLMADGGQTIPRELDAPRGAYTKAQEAKKANIPISTISFGTKH 238 Query: 326 -----------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 E D+ ++ A S F+ ++ +L + + +G+++ Sbjct: 239 GSIEIEGEQEFVEVDDEAMQEIARLSGGEFHKAASAEQLREVYATLGEQIG 289 >gi|134100328|ref|YP_001105989.1| hypothetical protein SACE_3793 [Saccharopolyspora erythraea NRRL 2338] gi|133912951|emb|CAM03064.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 327 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 78/231 (33%), Gaps = 28/231 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + + +M+ +DVSLSM ++L A + +E D + Sbjct: 72 LTVALAGPTAEQRIPRNRATVMLTVDVSLSMKATDVEP-NRLEAAKVAAKEFADQLTP-- 128 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + GLV+F+ +++ I+ L T + G+ A + I Sbjct: 129 ----GINLGLVSFAGTATVLVMPTTDRASVKQAIDNLKLSEATATGDGINAAMSAIDSFG 184 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQ--- 325 + + I+ + DG + P + + EAK+ + I Sbjct: 185 KMVGG---PSGAPPARIVLMADGGQTIPRELDAPRGAYTKAQEAKKANIPISTISFGTKH 241 Query: 326 -----------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 E D+ ++ A S F+ ++ +L + + +G+++ Sbjct: 242 GSIEIEGEQEFVEVDDEAMQEIARLSGGEFHKAASAEQLREVYATLGEQIG 292 >gi|91201135|emb|CAJ74194.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 333 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 30/166 (18%) Query: 165 SDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+D+M+ +D S SM D P ++L VA R I ++L I+ R GL+ F Sbjct: 85 EKKGIDIMIAVDTSRSMLADDVKP--NRLEVAKREIEDLLKIL-------EGDRVGLIAF 135 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + PL + +N L T + + + + Sbjct: 136 AGRAFTYCPLTSDYSAFRLFLNDLNVNIIPVGGTAIAEAIYKGIDAFGENENNH------ 189 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 K +I +TDGEN + L ++AK +G ++Y +GV Sbjct: 190 -----KAMIIITDGENHE-----TDPLKAASKAKEKGIVIYTVGVG 225 >gi|86357991|ref|YP_469883.1| hypothetical protein RHE_CH02376 [Rhizobium etli CFN 42] gi|86282093|gb|ABC91156.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 427 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 52/423 (12%), Positives = 136/423 (32%), Gaps = 88/423 (20%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N G+ I+TA+L+ + G+ ++ +H ++ +L+ D + + + + Sbjct: 8 FISNRSGNFGIMTALLMVPLMGAAGMAVDVAHALSLRTQLYAAADAAAVGSIAE------ 61 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + + ++ +TD RN + + + ++ +I + + + +S Sbjct: 62 KSGAVAAAMTMNGNGTVSLGKTDARN-IFMSQTSGELTDIHIDLGIDVTKTANKLNSQVS 120 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN------ 182 + F+ ++I+ + ++ +D ++LD + SM Sbjct: 121 FTATVPTTFMRIF-------GRDSIIISGTATAEYQTAAFMDFYILLDNTPSMGVGATAS 173 Query: 183 ----------------------DHFGPGMD-----KLGVATRSIREMLDIIKSIPDVNNV 215 +++ ++ V ++ + + D K+ ++ Sbjct: 174 DVSKLQAKTGCAFACHQMDQSTNNYTIAKSLGVTMRIDVVRQATQALTDTAKAERVSSDQ 233 Query: 216 VRSGLVTFSSKIV-----QTFPLAWGVQHIQEKINRLIF------GSTTKSTPGLEYAYN 264 R G+ TF +K L + ++ N + + + A Sbjct: 234 FRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTNAVDLMTIPYQNYNSDQLTSFDSAMT 293 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDG------------ENSSPNIDNKESLFYCNEA 312 +I + + +K + F+ DG + + +C Sbjct: 294 QI-NTIIDPAGDGTSNISPEKILFFVADGVGDSYKPSTCTKKTTGGRCQEPIDTTFCKPL 352 Query: 313 KRRGAIV---YAIGVQAEAADQF--------------LKNCASPDRFYSVQNSRKLHDAF 355 K RG + Y + + + ++ CASP ++ V + + DA Sbjct: 353 KDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQNEIPTKMQACASPGLYFEVTPTDGIADAM 412 Query: 356 LRI 358 + Sbjct: 413 KAL 415 >gi|212634559|ref|YP_002311084.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556043|gb|ACJ28497.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 328 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 75/239 (31%), Gaps = 41/239 (17%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDK 191 T+ ++ P+ + +G +MM+ +D+S SM D G + Sbjct: 64 LATLTYIALVIAAMRPIWLDEPFTR---DKVGREMMVAVDLSGSMEARDFVDLKGDKTRR 120 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---I 248 + + + L R GL+ F P + + + + Sbjct: 121 IDGVKSLLLDFL-------AQRASDRVGLIAFGDAAYLQAPFTEDKGALSLLLKEMDVRM 173 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G+ T + A N + K ++ LTDG ++S L Sbjct: 174 AGAGTALGDAIGVAVNHFSHSDTDN-----------KVLLLLTDGNDTSSEFPP---LEA 219 Query: 309 CNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 A ++G ++Y I + A + L+ A + + + Q+ + I Sbjct: 220 ARYAAQQGIVIYPIAIGDPANVGEDSLDIEMLQQIADITYGQVFEAQDGEAFTQVYSII 278 >gi|303235701|ref|ZP_07322308.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484148|gb|EFL47136.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 322 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 50/248 (20%), Positives = 86/248 (34%), Gaps = 48/248 (19%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLG 193 M + P T + G+++M+ +D+S SM +D P +++ Sbjct: 57 MLLRCLVYALVVIVLARPQTTTPIDNGQVE---GINIMLAVDISASMLSDDVDP--NRIE 111 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG--- 250 VA D+ K GL F+ + P+ + + + Sbjct: 112 VAK-------DVAKEFIASRPNDNIGLTIFAGEAFTQCPMTTDHASLLNLLAGIRADLSV 164 Query: 251 -----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 T GL A ++ D KE K +I LTDG N+ +I Sbjct: 165 NHLIQDGTAIGMGLANAVGRLKDVKEG-----------SKVVILLTDGSNNVGDISP--- 210 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAAD-----------QFLKNCA--SPDRFYSVQNSRKLH 352 L + A++ G VY IG+ + D + L++ A + FY Q+ +L Sbjct: 211 LTAASIARKFGVRVYTIGLGTDGKDIQGRPVGEIDYKTLQDIAMQTDGEFYRAQSRAELS 270 Query: 353 DAFLRIGK 360 + I K Sbjct: 271 QIYKDIDK 278 >gi|212635209|ref|YP_002311734.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556693|gb|ACJ29147.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 360 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 85/263 (32%), Gaps = 40/263 (15%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSV---KISSKSDIGLDMMMVLDVSLSMNDH 184 ++ +Y P + +T V + ++ IG D+M+V+D+S SM+ Sbjct: 50 NSALQYLKPSRWQQCALFIVWLLLVTAMTQPVILGEPQTRLQIGRDLMVVVDLSGSMDTK 109 Query: 185 -----------FGPGMDKLGVATRSIREMLDIIK----SIPDVNNVVRSGLVTFSSKIVQ 229 G + LD +K + R GL+ F Sbjct: 110 DFTLHVKQQTADGIANSSGTEISDEYISRLDAVKRVLHEFAEQRQGDRLGLILFGDAAYL 169 Query: 230 TFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P + ++ + G +T L A + K Sbjct: 170 QAPFTADLASWLRLLDESRVAMAGQSTHVGDALGLAIKVMSSD-------EIKSSQKNKV 222 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA-- 337 ++ LTDG ++ ++ L A ++G V+ I + + ++ A Sbjct: 223 VLLLTDGNDTDSSVPP---LEAAKIAAKKGIRVHVIAIGDPQTVGEQAMDMEVIEGVAAL 279 Query: 338 SPDRFYSVQNSRKLHDAFLRIGK 360 + + + ++++L+ + I K Sbjct: 280 TGGKAFKAISTQELNKVYQTISK 302 >gi|256820507|ref|YP_003141786.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256582090|gb|ACU93225.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 333 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 80/221 (36%), Gaps = 50/221 (22%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D++M +DVS SM ++ + +K P+ R GLV ++ + Sbjct: 91 GIDIVMAIDVSASMLSK-DLKPNRFEALKKVAS---QFVKDRPN----DRIGLVIYAGES 142 Query: 228 VQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P+ I ++ L +G T GL A N++ K Sbjct: 143 YTKTPVTTDKGIILNALSELTYGQIEDGTAIGMGLATAVNRL-----------KESKAKS 191 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------------- 328 + II LTDG N++ ID + + A G VY IG+ + Sbjct: 192 RVIILLTDGVNNTGFIDPQTA---AELAAEYGIRVYTIGIGSNGTALSPYALNPDGSIMY 248 Query: 329 -------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + +K A + R++ +++KL + I K Sbjct: 249 RMLQVEIDEPLMKKIAEVTHGRYFRATDNQKLQQIYDEINK 289 >gi|308375589|ref|ZP_07444436.2| membrane protein [Mycobacterium tuberculosis SUMu007] gi|308345800|gb|EFP34651.1| membrane protein [Mycobacterium tuberculosis SUMu007] Length = 327 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM ++ A + ++ D + + GL+ ++ Sbjct: 91 VMLVIDVSQSMRATDVEP-SRMVAAQEAAKQFADELTP------GINLGLIAYAGTATVL 143 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + +++L F T + + A I I G I+ Sbjct: 144 VSPTTNREATKNALDKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLF 200 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + K + AK +G + I D+ +K Sbjct: 201 SDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMK 260 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ +L + + +++ + I Sbjct: 261 KVAQLSGGNSYNAATLAELRAVYSSLQQQIGYETI 295 >gi|307721534|ref|YP_003892674.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306979627|gb|ADN09662.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 303 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 44/208 (21%), Positives = 85/208 (40%), Gaps = 24/208 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +S G D++ LD S SM + + +++E+L +S G Sbjct: 72 QKTSSKRKGRDLVFALDTSGSMAESGFNPENVQNRKFDALKELL---RSFITKRYNDNVG 128 Query: 220 LVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + F + PL++ + + + + I G +T GL A + Sbjct: 129 VSIFGTYAYPAIPLSYDMGSVAFLLDFFDVGIAGDSTAIGEGLAMALKIL---------- 178 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLK 334 K + +K II +TDG +S + KE++ +AK++ +Y IG+ +A L+ Sbjct: 179 -KKGEAKEKVIILITDGYQNSGAVSVKEAV---QKAKKQHVKIYTIGIGDRSAFDANLLQ 234 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A + + + +N + L D + I K Sbjct: 235 LIAKNTDAKMFEAKNVKMLQDIYKEIDK 262 >gi|304393172|ref|ZP_07375100.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] gi|303294179|gb|EFL88551.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] Length = 692 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 56/191 (29%), Gaps = 54/191 (28%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L Q K+ + T G+ + + + + E +D KK +I + Sbjct: 500 SDLTDNKNTTQAKLTSMQASGATNVQMGVAWGWRTLSPGEPFTEGRPYDAEDNKKIMIIM 559 Query: 291 TDGENSSPNID----------------------------------------------NKE 304 TDG N+ + ++ Sbjct: 560 TDGNNTYYPTNIYGNQYAQDNKSFYGGHGHSVKGRIFDGYDGEANPGHNSQTFTKAMDEH 619 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNCASPD-----RFYSVQNSRKLHDAFL 356 C AK G +Y+I L++CAS D ++ N+ L D F Sbjct: 620 LTETCTNAKNAGITIYSIAFDVPNGSSVKATLEDCASSDVGGGKLYFDANNNAALIDTFE 679 Query: 357 RIGKEMVKQRI 367 +I + + RI Sbjct: 680 KIAERLADLRI 690 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 76/200 (38%), Gaps = 12/200 (6%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 ++ F + +G++ + TA+ LPV+ + +G + + + + +D A Sbjct: 23 VKRFRDDERGNVFVFTALSLPVMLMAIGAGADYAELYRARVNFQSAVDA----GAIAAAK 78 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTS--LSIIIDDQHK 123 + D + ++ +RE D+ + + + ++ + Sbjct: 79 NLAATGQVQTSKDIGEEVFRSNLSHLGEKAVREGQINFDMGDGDCAVQGVITTATLPHDR 138 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 ++LS V + + ++++S + +D +++ +VLD S SM Sbjct: 139 FFSLSFVDQSQ---QKGFGANKIVKGQEEFILSASSTVECGNDT-IEIALVLDNSGSMRW 194 Query: 184 HFGPGMDKLGVATRSIREML 203 + + L A+ S+ E L Sbjct: 195 N--GKIGTLRQASNSLVETL 212 >gi|297527229|ref|YP_003669253.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256145|gb|ADI32354.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 333 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 49/247 (19%), Positives = 94/247 (38%), Gaps = 35/247 (14%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 A++ + P+ + T KIS + + ++VLD S SM Sbjct: 65 AIASIVLIMFSLALPYTIIPRYVKTTQTLEAKISLQRKPPV--VIVLDTSGSMKG----- 117 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 DK+ A ++++ +D + V GL+TF+ + P + + +K+ + Sbjct: 118 -DKIITAINAVKKFID------QTIDYVLIGLITFNDHVRIAIPPTSDQELLYKKLGEIK 170 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T + LE AY+ + E IIF+TDG + D Sbjct: 171 AFGGTIYSKPLEIAYDWLVPFAEFNLSPT---------IIFVTDGLP--YSQDAPLYREV 219 Query: 309 CNEAKRRGAIVYAIGVQAEA--------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + R +Y I ++ A Q L+ A + +FY+V+ + L + F ++ Sbjct: 220 VYKCARYNITIYPIFIETPGMSIYETMMAQQRLREIANITKGQFYNVKQTNSLINLFEKL 279 Query: 359 GKEMVKQ 365 ++ V + Sbjct: 280 AEKTVSK 286 >gi|218131126|ref|ZP_03459930.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|317476996|ref|ZP_07936238.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986646|gb|EEC52980.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|316906789|gb|EFV28501.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 327 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 55/273 (20%), Positives = 92/273 (33%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PF T++ +S+ + G+D+M+ Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFALRIIALALVILILARPQTTNSWQNSEIE-GIDIMLA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 IDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGESFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + I + G T G+ A ++ D+K K K II L Sbjct: 145 TVDHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKAK-----------SKVIILL 193 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------------ 332 TDG N+ +I L AK G VY IGV + Sbjct: 194 TDGTNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEID 250 Query: 333 ---LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 251 EKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|326382237|ref|ZP_08203929.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] gi|326198967|gb|EGD56149.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] Length = 330 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 76/219 (34%), Gaps = 33/219 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM +L A + + D+ + + GL++F+ Sbjct: 90 VMLVVDVSNSMKSTDVAP-SRLKAAQAAGKRF------ADDLTDGINLGLISFAGTASTL 142 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + ++RL T + G+ A +I + I+ L Sbjct: 143 VSPTPDHSATKNALDRLKLADKTATGEGIFAALQQIDT---LNAVLGGPSGAPPARIVLL 199 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQ-------------------AEAA 329 +DG+ + P D + +AK +G V I Sbjct: 200 SDGKQTVPESPDDPRGGFTAARKAKEKGVPVSTISFGTLTGTVDLETPGGGVERVPVPVD 259 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 D+ L+ A S F++ + +L+ + + K++ +R Sbjct: 260 DESLRKIANLSGGDFFTASSLDELNKVYSTLQKQIGYER 298 >gi|90410254|ref|ZP_01218271.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] gi|90329607|gb|EAS45864.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] Length = 370 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 37/242 (15%), Positives = 81/242 (33%), Gaps = 37/242 (15%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-----FGPGMDKLGVATRSIREM 202 + P+ + K +++ IG D+M+V+D+S SM + +G +L Sbjct: 76 AMTQPMWLG---KSQTRTLIGRDVMVVVDLSGSMAEKDFLTPYGEKHTRLDAVK------ 126 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGL 259 ++ + R GL+ F P + ++ G +T + Sbjct: 127 -QVLAQFSQHRDGDRLGLILFGDAAYLQAPFTADHETWLALLDETQVGMAGQSTHLGDAI 185 Query: 260 EYAYNKIF-------DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 A A + +K +I LTDG ++ + ++ A Sbjct: 186 GLAIKVFNDQADAAKQAATQQGSAVVTRPVKEKVVIVLTDGNDTDSLVPPIDA---AKVA 242 Query: 313 KRRGAIVYAIGVQAEAAD-------QFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMV 363 R ++ I + + ++ AS + + + +L + +IG+ Sbjct: 243 ASRDIRIHMIAMGDPRTVGEQALDMEVIEQVASLTGGQSFQALSPAELTRVYKKIGELEP 302 Query: 364 KQ 365 +Q Sbjct: 303 QQ 304 >gi|288928458|ref|ZP_06422305.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331292|gb|EFC69876.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] Length = 332 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 56/252 (22%), Positives = 84/252 (33%), Gaps = 42/252 (16%) Query: 134 EMPFIF--CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 +P + TF P TS + G+D+M+ +DVS SM ++ Sbjct: 54 NLPVLLRCATFALAVIILARPQTHTSWGNKQVE---GIDIMLAMDVSTSMLAEDLTP-NR 109 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + A E I P+ N GL F+ + P+ + +N L Sbjct: 110 MEAAKDVAAEF---IADRPNDN----IGLTIFAGEAFTQCPMTTDHTSL---LNMLQTVR 159 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + GL I K K +I LTDG N+ ++ S N Sbjct: 160 TDIAAKGLIQDGTAIGMGLANAVSRLKDSKAKSKVVILLTDGSNNMGDLSPMTS---ANI 216 Query: 312 AKRRGAIVYAIGVQAEA---------------------ADQFLKNCA--SPDRFYSVQNS 348 AK G VY IGV + LK+ A + FY N+ Sbjct: 217 AKSLGIRVYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAATTDGNFYRATNN 276 Query: 349 RKLHDAFLRIGK 360 ++L + I K Sbjct: 277 QELKQIYKDIDK 288 >gi|240172225|ref|ZP_04750884.1| hypothetical protein MkanA1_23119 [Mycobacterium kansasii ATCC 12478] Length = 335 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM +++ A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVEP-NRMVAAQEAAKQFADELTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + +++L F T + + A I I G I+ Sbjct: 152 VSPTTNRDATKNALDKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLF 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + K + AK +G + I D+ LK Sbjct: 209 SDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINGQRQPVPVDDETLK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ +L + + +++ + I Sbjct: 269 KVAQLSGGNAYNAATLAELKSVYASLQQQIGYETI 303 >gi|255693880|ref|ZP_05417555.1| BatA protein [Bacteroides finegoldii DSM 17565] gi|260620309|gb|EEX43180.1| BatA protein [Bacteroides finegoldii DSM 17565] Length = 327 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 55/273 (20%), Positives = 96/273 (35%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PF+ F T++ +S+ + G+D+M+ Sbjct: 35 ISDARVYAHTPKSYKNYLLHAPFLLRLFALTLVILVLARPQTTNKWQNSEIE-GIDIMLA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 VDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGESFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + I+ + G T G+ A ++ D+K K K II L Sbjct: 145 TVDHAVLLDMIHNIKCGLIEDGTAVGMGIANAVTRLKDSKAK-----------SKVIILL 193 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------------ 332 TDG N+ +I + AK G VY IGV + Sbjct: 194 TDGTNNKGDISP---MTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEID 250 Query: 333 ---LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 251 EKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|332285111|ref|YP_004417022.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] gi|330429064|gb|AEC20398.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] Length = 342 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 84/231 (36%), Gaps = 25/231 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + P + + ++ D+++ LD+S SM+ V+ Sbjct: 63 LNIVVWLLLVVTLARPQWVEPPLT---HTEPVRDILLALDISQSMDSQDFRDAQDRQVSR 119 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 ++ + + + D R GL+ F + PL + ++ ++ + Sbjct: 120 WTVVKAV--VADFIDKRTDDRLGLIVFGTGAFPQAPLTRDHKSLRLLLD--------HTA 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ I DA + + K +I LTDG ++ + L N A + Sbjct: 170 VGMAGPNTAIGDAIGMGIRMLDSAQERDKVLILLTDGNDTGSAVPP---LRAANLAAQHH 226 Query: 317 AIVYAIGVQ---AEAADQF----LK--NCASPDRFYSVQNSRKLHDAFLRI 358 V+ IG+ A DQ L+ + AS +F+ Q+ LHD + + Sbjct: 227 VTVHTIGIGSPTASGDDQVDFDTLRGISSASGGQFFQAQDGAALHDVYATL 277 >gi|220921017|ref|YP_002496318.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] gi|219945623|gb|ACL56015.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] Length = 324 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 71/205 (34%), Gaps = 27/205 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++M+ LD+S SM +D VA + + + R GLV F+ + Sbjct: 93 SGREIMLALDLSGSMER-VDFSLDGRNVARLAAVKRVGA--DFIRRRAGDRIGLVIFADQ 149 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L++ + ++ G +ST I D +K Sbjct: 150 ADVAASLSFDTASVAHALDEAQIGLVGRSTG--------IGDGLGLALKRLDATPAREKV 201 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------AEAADQF-----LKN 335 ++ L+DG N++ + A+ G V+ I + AE L++ Sbjct: 202 VVLLSDGANNAGQTTPHD---VAALARELGIRVHTIALGPRDLSDAEGDPDVVDTEALRD 258 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRI 358 A S RF+ V+ + L I Sbjct: 259 VATTSGGRFFRVRTTDDLAAVADSI 283 >gi|119505575|ref|ZP_01627647.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] gi|119458684|gb|EAW39787.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] Length = 316 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 34/234 (14%), Positives = 74/234 (31%), Gaps = 35/234 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-----FGPGMDK 191 I + ++ P + ++ + G D+M+ +D+S SM G D+ Sbjct: 52 LIALMWLSLVIAAAKPQWLGEPIE---QQKAGRDLMIAVDLSGSMETEDFSQADGKPADR 108 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L +++ + + R GL+ F S P + +N Sbjct: 109 LTAVKT-------VLRQLANERAGDRLGLIVFGSSAYLQSPFTEDHRTWLLLLN------ 155 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 ++ + + DA + K + + ++ LTDG ++ + ++ Sbjct: 156 --ETRIRMAGPSTALGDAVGLAIKLFKDAETEHRVLLLLTDGNDTGSLVPPVDAARVAAT 213 Query: 312 AKRRGAIVYAIGVQAEAA--------DQFLKNC-ASPDRFYSVQNSRKLHDAFL 356 +Y I V A D + + + + +S L F Sbjct: 214 ---EDIRIYPIAVGDPTAVGEEAIDLDTLARMAEVTGGQAFEALSSEDLIAVFK 264 >gi|21675084|ref|NP_663149.1| hypothetical protein CT2278 [Chlorobium tepidum TLS] gi|21648324|gb|AAM73491.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 332 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 46/224 (20%), Positives = 79/224 (35%), Gaps = 43/224 (19%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 ++ SS+ +G+D+M+ LDVS SMN G + A + +D Sbjct: 85 SAPFPPSSRDTVGIDIMIALDVSDSMNTPDFGGKSRFAGARTAAMRFIDN-------RPA 137 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAK 270 R GLV FS PL + + ++ + T + A N++ Sbjct: 138 DRIGLVVFSGGSFTRCPLTLDHE-VLGRLAETVAPGFFDEPGTAIGTAILTATNRL---- 192 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-- 328 K +K ++ +TDGEN++ + + + A G +Y + EA Sbjct: 193 -------KASSSKEKALVLITDGENNAGEVTPETA---ARLAANYGIRIYTVFAGKEARA 242 Query: 329 ------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A S R +S + L +F I Sbjct: 243 FENTSNTALNRKGRSELETVARISGGRMFSAGDVFGLMKSFRDI 286 >gi|332982109|ref|YP_004463550.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] gi|332699787|gb|AEE96728.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] Length = 948 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 49/208 (23%), Positives = 82/208 (39%), Gaps = 34/208 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L +++V+D S SM D G+ KL +A + I+S + G++ F Sbjct: 404 SLGLVLVIDKSGSMTDGQY-GITKLEMAKEA------AIRSTEALRPTDSVGVICFDDAA 456 Query: 228 VQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + A + IQ+ I + G T P L+ AY + +A KL+H Sbjct: 457 SWVVGMRQADDLAEIQDSIGTIRPGGGTNMYPALDLAYKALEEADTKLKH---------- 506 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFY 343 II LTDG++++ + D + G + ++ V +A L A R+Y Sbjct: 507 -IIVLTDGQSATGDFDG-----IAHRMAEDGITLSSVAVGMDADKNLLSRLAEIGNGRYY 560 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILYNK 371 D F I K + K+ L + Sbjct: 561 Y-------TDEFSNIPKILTKETYLATQ 581 >gi|172087820|ref|YP_001816750.1| hypothetical protein VF_A1192 [Vibrio fischeri ES114] gi|171902402|gb|ACB55718.1| conserved hypothetical protein containing von Willebrand factor type A domain [Vibrio fischeri ES114] Length = 350 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 83/241 (34%), Gaps = 33/241 (13%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-----FGPGMDK 191 ++ ++ + P L+ + G D+M+V+D+S SM + G + + Sbjct: 72 LLYLSWGMILIALTKPTLLGPP---QIREQFGRDVMVVVDLSGSMAEKDFTSTNGLKISR 128 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 L + + + + R GL+ F P + + +N+ Sbjct: 129 LDAVKKVLNDFVKT-------RKGDRLGLILFGDAAFVQTPFTADHKVWLDLLNQTHVEM 181 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G +T + + D+ +K I LTDG ++ + ++ Sbjct: 182 AGKSTHLGDAIGLTIKRFEDSNNSQ---PLSTTSREKVAIILTDGNDTDSYVPPMDA--- 235 Query: 309 CNEAKRRGAIVYAIGVQAEA--ADQFLK-------NCASPDRFYSVQNSRKLHDAFLRIG 359 AK +G ++ I + +Q L AS + + N +L +A+ I Sbjct: 236 AKVAKVKGVRIHMIAIGDPQTVGEQALDMDTINTIADASGGQAFQALNQDELINAYAEIS 295 Query: 360 K 360 K Sbjct: 296 K 296 >gi|289640775|ref|ZP_06472946.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] gi|289509351|gb|EFD30279.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] Length = 319 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 71/211 (33%), Gaps = 29/211 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM ++L A +D + P N GLV+FS Sbjct: 89 IILAIDVSNSMAATDVAP-NRLAAAKDGADAFIDQL---PPRIN---LGLVSFSGSAALL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P Q ++ I+ L G +T G+ I A E+L A G I+ L Sbjct: 142 VPPTTDRQSVRSGIHGLQLGPSTAIGEGIFAGLQAITTAGEQLA--ADGGTPPPAAIVLL 199 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--------------QFLKNC 336 +DGE + + + + G V I L+ Sbjct: 200 SDGETQRGRPNAQAAQAA----RDAGVPVDTIAYGTADGSLDVGGQEIPVPVNEDALREI 255 Query: 337 --ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A+ ++ + +L + +G + + Sbjct: 256 ARATDGSYHRAASGDELRSVYENLGSSIGYR 286 >gi|154488145|ref|ZP_02029262.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] gi|154083618|gb|EDN82663.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] Length = 835 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 85/258 (32%), Gaps = 58/258 (22%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD--------KLGVATRSIREMLDII 206 + + S ++ +++++VLD S SMN +L + +D Sbjct: 272 VGKDTRESHETTEKIEVVLVLDTSGSMNYCMDGSQRGCNKSNPKRLTALKEAATSFIDAT 331 Query: 207 ----KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 +I D N+ VR + F L ++ ++RL T + G+ A Sbjct: 332 ETTNDTIQDENSKVRIAIAQFGQTSGVVSSLTSDTAALKSSVSRLSANGATPADKGMAAA 391 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG----ENSSPNIDNKESLFYCNEAKRRGAI 318 + A+ KK +IF DG +N+ +++ K G + Sbjct: 392 QTALLRARPGA----------KKVVIFFADGVPTTQNTFSTRVANDAVTTALAMKSAGTL 441 Query: 319 VYAIGVQAEAA--------------DQFLKNCASP------------------DRFYSVQ 346 +Y+IG+ A +QF+ +S + + Sbjct: 442 IYSIGIFEGANPEQQSFGNRENDQANQFMHAVSSNYPNATAYNKTNWGTGSNLGYYKATN 501 Query: 347 NSRKLHDAFLRIGKEMVK 364 ++ L F I KE+ Sbjct: 502 SADDLTKIFDDIQKEITT 519 >gi|302382135|ref|YP_003817958.1| von Willebrand factor A [Brevundimonas subvibrioides ATCC 15264] gi|302192763|gb|ADL00335.1| von Willebrand factor type A [Brevundimonas subvibrioides ATCC 15264] Length = 560 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 61/176 (34%), Gaps = 41/176 (23%) Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L ++ +N +I T G + ++ + + G + KK II +TD Sbjct: 383 LTDNYTALRTAVNNMIASGNTNVPLGTMWGWHTLSPNAPFGDGRPYGTERLKKIIIIMTD 442 Query: 293 GEN-----SSPNIDNKESLFY----------------------------------CNEAK 313 G N +SPN L Y C K Sbjct: 443 GANVMSDTTSPNDSTYNGLGYIWQNRLGIVSGNDTTRRTRMDNRFDHATAATEDMCGNMK 502 Query: 314 RRGAIVYAIGVQAEAADQ-FLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + VY + VQ ++ Q L+ CA+ D ++ V ++ + AF RI + RI Sbjct: 503 DKDIEVYTVAVQVDSTAQTLLRRCATDTDHYFPVDSAAGIGAAFDRIAGAIENLRI 558 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 76/258 (29%), Gaps = 23/258 (8%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F + +G+++++ A+ LP + ++ ++ + V+ + + Sbjct: 17 RFSDDRRGNVAMMFALALPPMMLMTLGGVDIARVSTVRMNVQDA--------------LD 62 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + I + + L G D+ T+ + + + Sbjct: 63 AATLAAARSQYTDNPRINAVGLAALQANLAPYG---DVTLDTTQTNFRLNTATGAVEADA 119 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 R + IF P S V S+ +++ +V+D + SM+ G Sbjct: 120 KVNVRALVANIFLPPYGQFFDDQLPANAHSEVLRSNN---RIEVALVIDNTGSMD---GA 173 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 + A + L+ + ++ LV FS + + Sbjct: 174 KLTNTKTAAIDLINRLEAADGRSIEQDAIKISLVPFSMTVRVAQGGTNTPPSFMSNADTH 233 Query: 248 IFGSTTKSTPGLEYAYNK 265 G S A++ Sbjct: 234 TGGGAWNSGSNPYSAFDT 251 >gi|260912478|ref|ZP_05919014.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] gi|260633397|gb|EEX51551.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] Length = 332 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 56/252 (22%), Positives = 84/252 (33%), Gaps = 42/252 (16%) Query: 134 EMPFIF--CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 +P + TF P TS + G+D+M+ +DVS SM ++ Sbjct: 54 NLPVLLRCVTFVLAVIILARPQTYTSWGNKQVE---GIDIMLAMDVSTSMLAEDLTP-NR 109 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + A E I P+ N GL F+ + P+ + +N L Sbjct: 110 MEAAKDVAAEF---IADRPNDN----IGLTIFAGEAFTQCPMTTDHTSL---LNMLQTVR 159 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + GL I K K +I LTDG N+ ++ S N Sbjct: 160 TDIAAKGLIQDGTAIGMGLANAVSRLKDSKAKSKVVILLTDGSNNMGDLSPMTS---ANI 216 Query: 312 AKRRGAIVYAIGVQAEA---------------------ADQFLKNCA--SPDRFYSVQNS 348 AK G VY IGV + LK+ A + FY N+ Sbjct: 217 AKSLGIRVYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAASTDGNFYRATNN 276 Query: 349 RKLHDAFLRIGK 360 ++L + I K Sbjct: 277 QELKQIYKDIDK 288 >gi|293361343|ref|XP_236593.5| PREDICTED: collagen type VI alpha 6 [Rattus norvegicus] Length = 2264 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 52/332 (15%), Positives = 104/332 (31%), Gaps = 27/332 (8%) Query: 40 HKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK-NDFSYRIIKNIWQTDFRNELRE 98 + F + + +D T + + + ++ + Sbjct: 676 NTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGARPNVRKFLI--LITDGEA 733 Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLS---AVSRYEMPFIFCTFPWCANSSHAPLLI 155 +D R + I + EM F F + +L Sbjct: 734 QDIVKDPAVALRKDGVIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQHIEDDLVLG 793 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S + K LD++ V+D S S++ M M+ ++K N Sbjct: 794 ICSPREECKRIEVLDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVKKADVGKNQ 844 Query: 216 VRSGLVTFSSKIVQTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 VR G + ++ F L ++ I N G T + L ++ + +A+ Sbjct: 845 VRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGS 904 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H + +I +TDGE + D ++ + +G +V A+G+ + Sbjct: 905 RLHKGVP-----QVLIVITDGE----SHDAEKLNATAKALRDKGILVLAVGIAGANTWEL 955 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 L S D++Y V+ L F + + Sbjct: 956 LAMAGSSDKYYFVETFGGLKGIFSDVSASVCN 987 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 70/212 (33%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 977 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKEFLASVVQDFDVSLNR- 1035 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G+ FS F L G + I +I + T L + Sbjct: 1036 --VRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTHIGDALRKVKHYFRPDT 1093 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG + E E + +G +Y++G+ Sbjct: 1094 GSRINAGTP-----QVLLVLTDGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQ 1142 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q ++ + ++ +V N +L RI + + Sbjct: 1143 QLIQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 74/197 (37%), Gaps = 26/197 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S + E++D+ P VR G V ++ Sbjct: 435 DIYLLIDGSG------NTQPTDFHEMKIFLSEVVDMFNIAPHK---VRVGAVQYADTWDL 485 Query: 230 TFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F ++ + + I+ + G T + L + + AK++ + H Sbjct: 486 EFEISKYTNKPDLGKAIDNIRQMGGNTNTGAALNFTLTLLQRAKKQRGNKVPCH------ 539 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSV 345 ++ LT+G + + ++ + V+AIGV+ EA L+ A + R Y V Sbjct: 540 LVVLTNGMSQDSVLGP------AHKLREENIRVHAIGVK-EANQTQLREIAGDEKRVYYV 592 Query: 346 QNSRKLHDAFLRIGKEM 362 L D ++ +E+ Sbjct: 593 HEFDALRDIRNQVVQEI 609 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 67/198 (33%), Gaps = 16/198 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + L + + + ++ N +R GLV +S++ L Sbjct: 226 VADVVFLLDMAINGSQENLDHLKAFLG---ESVSALDIKENCMRVGLVAYSNETRVISSL 282 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + GV + ++I L T A K + ++ + + + +T Sbjct: 283 SMGVNKTEVLQRIQDLSPHVGQAYTGA---ALRKTRKEVFSAQRGSRKNQGVPQIAVLVT 339 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ--NSR 349 + +R G V+ +GV+ +Q K + P ++ + N Sbjct: 340 ------HRASDDNVTKAAVNLRREGVTVFTMGVEGANPEQLEKIASYPAEQFTSKLSNFS 393 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L K++ Q Sbjct: 394 ELATHNQTFLKKLRNQIT 411 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 77/199 (38%), Gaps = 23/199 (11%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 DV ++ GM + I+++ + S+P N R L +S + F L Sbjct: 26 DVVFLVDSSDHLGMKSFPLVKTFIQKL---VSSLPVEANKYRVALAQYSDALHNEFHLGA 82 Query: 235 -WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 83 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 137 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + ++A + G + ++GVQ + + LK A+ ++++ +R Sbjct: 138 ------ASAESEDDVEEASKALREDGVKIISVGVQKASEED-LKAMATSQFHFNLRTARD 190 Query: 351 LHD---AFLRIGKEMVKQR 366 L +I K++ + R Sbjct: 191 LSMFAPNMTQIIKDVTQYR 209 >gi|293349448|ref|XP_002727144.1| PREDICTED: collagen type VI alpha 6-like [Rattus norvegicus] Length = 2264 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 52/332 (15%), Positives = 104/332 (31%), Gaps = 27/332 (8%) Query: 40 HKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK-NDFSYRIIKNIWQTDFRNELRE 98 + F + + +D T + + + ++ + Sbjct: 676 NTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGARPNVRKFLI--LITDGEA 733 Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLS---AVSRYEMPFIFCTFPWCANSSHAPLLI 155 +D R + I + EM F F + +L Sbjct: 734 QDIVKDPAVALRKDGVIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQHIEDDLVLG 793 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S + K LD++ V+D S S++ M M+ ++K N Sbjct: 794 ICSPREECKRIEVLDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVKKADVGKNQ 844 Query: 216 VRSGLVTFSSKIVQTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 VR G + ++ F L ++ I N G T + L ++ + +A+ Sbjct: 845 VRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGS 904 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H + +I +TDGE + D ++ + +G +V A+G+ + Sbjct: 905 RLHKGVP-----QVLIVITDGE----SHDAEKLNATAKALRDKGILVLAVGIAGANTWEL 955 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 L S D++Y V+ L F + + Sbjct: 956 LAMAGSSDKYYFVETFGGLKGIFSDVSASVCN 987 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 70/212 (33%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 977 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKEFLASVVQDFDVSLNR- 1035 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G+ FS F L G + I +I + T L + Sbjct: 1036 --VRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTHIGDALRKVKHYFRPDT 1093 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG + E E + +G +Y++G+ Sbjct: 1094 GSRINAGTP-----QVLLVLTDGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQ 1142 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q ++ + ++ +V N +L RI + + Sbjct: 1143 QLIQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 74/197 (37%), Gaps = 26/197 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S + E++D+ P VR G V ++ Sbjct: 435 DIYLLIDGSG------NTQPTDFHEMKIFLSEVVDMFNIAPHK---VRVGAVQYADTWDL 485 Query: 230 TFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F ++ + + I+ + G T + L + + AK++ + H Sbjct: 486 EFEISKYTNKPDLGKAIDNIRQMGGNTNTGAALNFTLTLLQRAKKQRGNKVPCH------ 539 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSV 345 ++ LT+G + + ++ + V+AIGV+ EA L+ A + R Y V Sbjct: 540 LVVLTNGMSQDSVLGP------AHKLREENIRVHAIGVK-EANQTQLREIAGDEKRVYYV 592 Query: 346 QNSRKLHDAFLRIGKEM 362 L D ++ +E+ Sbjct: 593 HEFDALRDIRNQVVQEI 609 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 67/198 (33%), Gaps = 16/198 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + L + + + ++ N +R GLV +S++ L Sbjct: 226 VADVVFLLDMAINGSQENLDHLKAFLG---ESVSALDIKENCMRVGLVAYSNETRVISSL 282 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + GV + ++I L T A K + ++ + + + +T Sbjct: 283 SMGVNKTEVLQRIQDLSPHVGQAYTGA---ALRKTRKEVFSAQRGSRKNQGVPQIAVLVT 339 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ--NSR 349 + +R G V+ +GV+ +Q K + P ++ + N Sbjct: 340 ------HRASDDNVTKAAVNLRREGVTVFTMGVEGANPEQLEKIASYPAEQFTSKLSNFS 393 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L K++ Q Sbjct: 394 ELATHNQTFLKKLRNQIT 411 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 77/199 (38%), Gaps = 23/199 (11%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 DV ++ GM + I+++ + S+P N R L +S + F L Sbjct: 26 DVVFLVDSSDHLGMKSFPLVKTFIQKL---VSSLPVEANKYRVALAQYSDALHNEFHLGA 82 Query: 235 -WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 83 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 137 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + ++A + G + ++GVQ + + LK A+ ++++ +R Sbjct: 138 ------ASAESEDDVEEASKALREDGVKIISVGVQKASEED-LKAMATSQFHFNLRTARD 190 Query: 351 LHD---AFLRIGKEMVKQR 366 L +I K++ + R Sbjct: 191 LSMFAPNMTQIIKDVTQYR 209 >gi|194335401|ref|YP_002017195.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307878|gb|ACF42578.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 336 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 49/246 (19%), Positives = 87/246 (35%), Gaps = 41/246 (16%) Query: 134 EMP--FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 EMP + F C + P L+ + S+ G+D+++ LD+S SM G + Sbjct: 65 EMPHWLRWSAFVLCVLALTGPHLLFRQSEAESR---GIDVILALDISESMLQKDVGGTSR 121 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L A R + + R GLV F K PL + + ++RL G Sbjct: 122 LDAAREVSRNFV-------LRRSNDRIGLVVFRGKGYTQCPLTLDHEVLAMLLDRLSPGV 174 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 + A + K + K +I +TDGEN++ + + + Sbjct: 175 IQDDGTAIGTAILI-------AVNRLKASESLHKVLILVTDGENNAGEVGPGTA---ASI 224 Query: 312 AKRRGAIVYAI--GVQA---------------EAADQFLKNCA--SPDRFYSVQNSRKLH 352 A R G +Y I G + + ++ L+ A + ++ V++ Sbjct: 225 AARSGVRIYVINAGFKVVEDRIDPPEESGRYIQKDEESLQGIARTTGGGYFRVEDPAAFD 284 Query: 353 DAFLRI 358 I Sbjct: 285 QTIRSI 290 >gi|281357358|ref|ZP_06243847.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281316389|gb|EFB00414.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 342 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 40/289 (13%), Positives = 93/289 (32%), Gaps = 54/289 (18%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 ++ SL + + + + + P V I S+ G Sbjct: 27 KKRPSLVVSTVRPFGAVVAKHRPTFRLVAMMLGLAVLIVALARPRYGDEKVLIRSQ---G 83 Query: 169 LDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIRE---------MLDIIKSIPDVNNVVRS 218 +D+++ LD+S SM + P ++ +++ I+ + R Sbjct: 84 IDIVLALDMSGSMEAYDVPRNINDARTLIAAVKNKEVENRIEVAKKEIRRFIEQRPNDRI 143 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEH 275 GL+ F+ + P + + +L G T L N++ Sbjct: 144 GLIGFADQAYSFAPPTLDHAWLLAHLEQLEPGMIGQQTGIAAPLASGVNRL--------- 194 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAA----- 329 K D ++ ++ TDG N N+DN+ + K +++ +G+ + A Sbjct: 195 --KKSDAPRRVLVLFTDGRN---NVDNRLTPEQAAALGKEFDVVIHTVGIGSRNAFVLVT 249 Query: 330 ----------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 ++ L++ A + ++ ++ + I + Sbjct: 250 DPFGRQQFQGIEDEFDEKLLRSLAEITGGTYFHAADADGMKQVMDEINQ 298 >gi|194221587|ref|XP_001495285.2| PREDICTED: similar to collagen type VI alpha 6 [Equus caballus] Length = 2301 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 72/199 (36%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M D++K N VR G + ++ Sbjct: 842 LDVVFVIDSSGSIDYDEYNIMKDF---------MTDLVKKADVGKNQVRFGALKYADDPE 892 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I N G T + L ++ + +A+ + Sbjct: 893 VLFYLDTLGTKWEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGSRLQRGVP-----Q 947 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G ++ A+G+ + L S D+++ V Sbjct: 948 VLIVITDGE----SHDADKLNATAKALRDKGILILAVGIAGANPVELLAMAGSSDKYFFV 1003 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 1004 ETFGGLKGIFSDVSASVCN 1022 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 72/212 (33%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 1012 IFSDVSASVCNSSKVDCEIGQIDLVFLMDGSNSIHPDDFKKMKAFLASVVQDFDVTVNR- 1070 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G FS FPL G + I +I ++ T L Sbjct: 1071 --VRIGAAQFSHNYQPEFPLGTFTGEEEISLQIEKIQQIFGYTHIGAALRRVGRYFRPDM 1128 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG++ E + +R+G +Y++G+ Sbjct: 1129 GSRINAGTP-----QVLLVLTDGQSQD------EVAQAAEDLRRKGINIYSVGIGDVDDQ 1177 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q ++ + D+ +V + +L RI + + Sbjct: 1178 QLVQITGTADKKLTVHDFDELRKVKKRIVRHI 1209 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 29/202 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S+ M M +++ + V+ G+V FS Sbjct: 651 KEMKADIMFLVDSSGSIGPENFSKMKIF---------MKNLVSKSQIGADRVQIGVVQFS 701 Query: 225 SKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L I + I+R+ G TT + L + AK + Sbjct: 702 HVNKEEFQLNRYMSQSEISDAIDRMAHIGETTLTGHALTFVSQYFSPAKGARPN------ 755 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE D +L ++ G I+Y++GV Q + P+ Sbjct: 756 -VRKFLILITDGEAQDIVKDPAVAL------RQEGIIIYSVGVFGSNVTQLEEISGRPEM 808 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 + V+N D I ++V Sbjct: 809 VFYVEN----FDILQHIEDDLV 826 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 61/353 (17%), Positives = 106/353 (30%), Gaps = 41/353 (11%) Query: 32 MGLVIETSHKFFVKAKLHYILDHSLLYTATK--------ILNQENGNNGKKQKNDFSYRI 83 + I S L ++ A + N N G Q Sbjct: 317 LSRGINKSEVLQYIQNLSPQAGNAYTGAAIRKIRKEVFGAQNGSRKNQGVPQIAVLVTHR 376 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK--DYNLSAVSRYEMPF-IFC 140 T LR G IE ++ + H Y S ++ Sbjct: 377 PSEDNVTKAAVNLRRQGVTIFTMGIEGASDTQLEKIASHPAEQYVSKLKSFSDLAAHNQT 436 Query: 141 TFPWCANSSHAPLLITSSVKISSKSD----IGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 N L + S + KS D+ +++D S S Sbjct: 437 FLKKLRNQITHTLSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMK 490 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTT 253 + E++ + P VR G V ++ F + + + I + G T Sbjct: 491 TFLSELVGMFNIAPQK---VRFGAVQYADSWDLEFEINKYSNKHDLGKAIENIRQMGGNT 547 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + L + + AK++ + H ++ LT+G + + N + Sbjct: 548 NTGAALNFTLGLLQKAKKERGNKVPCH------LVVLTNG------VSKDSIVEPANRLR 595 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQ 365 VYAIGV+ EA L+ A + R Y V + L D ++ +E+ Q Sbjct: 596 EELIHVYAIGVR-EANQTQLREIAGEEKRMYYVHDFDALKDIRNQVVQEICAQ 647 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 74/202 (36%), Gaps = 22/202 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LDVS + + + + ++ N +R GLV FS++ Sbjct: 263 DVVFLLDVS------LNGSQENFDYLKEFLE---ESVSALDIKENCMRVGLVAFSNETKV 313 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ G+ + + I L + T A KI ++ ++ + + Sbjct: 314 INSLSRGINKSEVLQYIQNLSPQAGNAYTGA---AIRKIRKEVFGAQNGSRKNQGVPQIA 370 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY--SV 345 + +T P+ DN +R+G ++ +G++ + Q K + P Y + Sbjct: 371 VLVT----HRPSEDN--VTKAAVNLRRQGVTIFTMGIEGASDTQLEKIASHPAEQYVSKL 424 Query: 346 QNSRKLHDAFLRIGKEMVKQRI 367 ++ L K++ Q Sbjct: 425 KSFSDLAAHNQTFLKKLRNQIT 446 >gi|189465623|ref|ZP_03014408.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] gi|189437897|gb|EDV06882.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] Length = 327 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 54/273 (19%), Positives = 91/273 (33%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PF+ T++ +S+ + G+D+MM Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFMLRIIALALIIVVLARPQTTNSWQNSEIE-GIDIMMA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 IDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGESFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + G T G+ A ++ D+K K K II L Sbjct: 145 TVDHAVLLNLFQGIKCGIIEDGTAVGMGIANAVTRLKDSKAK-----------SKVIILL 193 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------------------- 328 TDG N+ +I L AK G VY IGV Sbjct: 194 TDGTNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEID 250 Query: 329 ADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGK 360 + A+ ++ ++ KL + + I K Sbjct: 251 EKTLTQIAATTEGNYFRATSNSKLKEVYEEIDK 283 >gi|257056239|ref|YP_003134071.1| hypothetical protein Svir_22360 [Saccharomonospora viridis DSM 43017] gi|256586111|gb|ACU97244.1| Mg-chelatase subunit ChlD [Saccharomonospora viridis DSM 43017] Length = 326 Score = 84.5 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 72/211 (34%), Gaps = 28/211 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVSLSM + +L A + R + + V GL++F+ Sbjct: 91 VMLVIDVSLSM-EATDVKPTRLRAAQDAARSFAEGLTP------GVNLGLISFAGTATVL 143 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + I L +T + G+ A I I I+ + Sbjct: 144 AAPTTEREGVVHAIENLKLAQSTATGEGIFAALQAIES---FSAVIGGAEGPPPARIVLM 200 Query: 291 TDGENSSP--NIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 TDG+ + P + + AK++G + I D ++ Sbjct: 201 TDGKQTVPQDEYAPRGAFTAAGVAKQKGIPITTISFGTSYGSVEIDGTRVPVEVDDASMR 260 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 A S FY + +L + +G+++ Sbjct: 261 EIARLSGGDFYKAATAEELKQVYDSLGEQIG 291 >gi|15840942|ref|NP_335979.1| hypothetical protein MT1528 [Mycobacterium tuberculosis CDC1551] gi|13881148|gb|AAK45793.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551] Length = 335 Score = 84.5 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM ++ A + ++ D + + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVEP-SRMVAAQEAAKQFADELTP------GINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + +++L F T + + A I I G I+ Sbjct: 152 VSPTTNREATKNALDKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTXPPARIVLF 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + K + AK +G + I D+ +K Sbjct: 209 SDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ +L + + +++ + I Sbjct: 269 KVAQLSGGNSYNAATLAELRAVYSSLQQQIGYETI 303 >gi|71278376|ref|YP_269691.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71144116|gb|AAZ24589.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 364 Score = 84.5 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 43/279 (15%), Positives = 87/279 (31%), Gaps = 50/279 (17%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWC--ANSSHAPLLITSSVKISSKSDIGLDMMM 173 + + + ++R + + F W + P +I + + ++ D+M+ Sbjct: 42 VDVTGEKPQSGAVLLNRNNLQRLIVAFSWLCIVTAIAKPEMIGAPI---NQEKSARDLMI 98 Query: 174 VLDVSLSMN---------------------DHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +D+S SM D L +K + + Sbjct: 99 AVDLSGSMAVEDFTLPIATNELTNRAKNDTDSSATKSSTNDTGKGEKVNRLVAVKHVLNA 158 Query: 213 ----NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 R GL+ F P + Q +N +S G+ D Sbjct: 159 FVKSREHDRLGLILFGDAPYLQAPFTDDIATWQALLN--------ESDIGMAGQSTAFGD 210 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 A + + D + +I LTDG +++ + E+ A R +Y I + + Sbjct: 211 AIGLAISVFQQSDTQNRVLIVLTDGNDTASKVPPVEA---AKVAAARDIKIYTIAIGDPS 267 Query: 329 AD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 A + L+ A + + + NS +L + I Sbjct: 268 AVGEEKVDLEVLQAMAEITQGKSFQALNSEELLKVYAEI 306 >gi|296139788|ref|YP_003647031.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296027922|gb|ADG78692.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 327 Score = 84.5 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 83/238 (34%), Gaps = 28/238 (11%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + ++ + +++ +DVSLSM D+L A + ++ + Sbjct: 65 LAIGLVLLMVALSGPQAMRKVPRNRATVVLAIDVSLSMEARDVEP-DRLTAAKEAAKKFV 123 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 ++ N V G+V+F+ +++L T + G+ + Sbjct: 124 T------ELPNGVNLGIVSFAGTASLLVSPTPDRTLALNAVDKLELAQRTATGEGIYTSI 177 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYA 321 I + ++ L + + II +DG+ + P D + +AK G + Sbjct: 178 QSIKNIRDVLGGE--DNAPPAR-IILESDGKQTVPTDLDDPRGGFTAARKAKEEGIPIST 234 Query: 322 IGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 I + D LK A S +F++ + L++A+ + E+ Sbjct: 235 ISFGTTSGSVNIGGQNIPVPVDDASLKRIAELSGGQFFAASSLNDLNEAYGSLRDEIG 292 >gi|327313515|ref|YP_004328952.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326945266|gb|AEA21151.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 318 Score = 84.5 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 76/204 (37%), Gaps = 27/204 (13%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P +++ VA E + GL F+ + Sbjct: 87 GIDIMLTMDVSASMLTEDVYP--NRMVVAKEVASEFI-------SGRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTK-STPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ + L+ G T T GL I K K Sbjct: 138 AFTQCPMTLDHAALLN----LLHGVRTDLVTSGLMQDGTAIGMGLANAVSRLKDSKAKSK 193 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------LKNCA- 337 +I LTDG N++ +I + A++ G +Y IG E ++ L++ A Sbjct: 194 IVILLTDGSNNAGSISP---MTAAAIARKFGIRIYTIGFGKETGEEIGAIDYKTLQDIAV 250 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGK 360 + FY Q+ +L + I K Sbjct: 251 STNGEFYRAQSQAELSRIYQDIDK 274 >gi|329954838|ref|ZP_08295855.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526942|gb|EGF53953.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 327 Score = 84.5 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 55/273 (20%), Positives = 90/273 (32%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PF T+ +S+ + G+D+M+ Sbjct: 35 ISDARVYAHTPKSYKNYLLHAPFALRIIALALIIIVLSRPQTTDSWQNSEIE-GIDIMLA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 IDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNVGITLFAGESFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + I + G T G+ A ++ D+K K K II L Sbjct: 145 TVDHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKAK-----------SKVIILL 193 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------------ 332 TDG N+ I L AK G VY IGV + Sbjct: 194 TDGTNNRGEISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEID 250 Query: 333 ---LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 251 EKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|29346317|ref|NP_809820.1| aerotolerance protein BatA [Bacteroides thetaiotaomicron VPI-5482] gi|29338212|gb|AAO76014.1| BatA [Bacteroides thetaiotaomicron VPI-5482] Length = 327 Score = 84.5 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 57/281 (20%), Positives = 96/281 (34%), Gaps = 54/281 (19%) Query: 110 RSTSLSIII-DDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 + T ++ I D + + + Y + PFI S+ K + Sbjct: 27 KKTEATLQISDARVYAHTPKSYKNYLLHAPFILRVIALALIIVVLARP-QSTNKWQNSEI 85 Query: 167 IGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G+D+M+ +DVS SM + P ++L A D+ + G+ F+ Sbjct: 86 EGIDIMLAIDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAG 136 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL + + I+ + G T G+ A ++ D+K K Sbjct: 137 ETFTQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGVANAVTRLKDSKAK---------- 186 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---------- 332 K II LTDG N+ +I L AK G VY IGV + Sbjct: 187 -SKVIILLTDGTNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQY 242 Query: 333 -----------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 243 INMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|291514852|emb|CBK64062.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 341 Score = 84.5 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 67/179 (37%), Gaps = 28/179 (15%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K+ + G++MM+ +DVS SM + F P ++L +I ++ + ++ R Sbjct: 80 SKLREEKAQGIEMMLTVDVSNSMLAEDFEP--NRLERTKYAIGKLFEGLQQ-------DR 130 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKL 273 GLV F+ + P+ + + R+ T LE A E+ Sbjct: 131 VGLVVFAGEPKVQLPITSDYRMARAFARRIDPSLVSVQGTAIGKALEQALLAFSGDTEQS 190 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + II +TDGEN +++ A + G ++ IG+ Sbjct: 191 HG---------RVIILITDGENHD-----DDAIAVAERAAQMGVKIFTIGIGTPEGAPI 235 >gi|253568262|ref|ZP_04845673.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298385671|ref|ZP_06995229.1| BatA protein [Bacteroides sp. 1_1_14] gi|251842335|gb|EES70415.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298261812|gb|EFI04678.1| BatA protein [Bacteroides sp. 1_1_14] Length = 327 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 57/281 (20%), Positives = 96/281 (34%), Gaps = 54/281 (19%) Query: 110 RSTSLSIII-DDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 + T ++ I D + + + Y + PFI S+ K + Sbjct: 27 KKTEATLQISDARVYAHTPKSYKNYLLHAPFILRVIALALIIVVLARP-QSTNKWQNSEI 85 Query: 167 IGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G+D+M+ +DVS SM + P ++L A D+ + G+ F+ Sbjct: 86 EGIDIMLAIDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAG 136 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL + + I+ + G T G+ A ++ D+K K Sbjct: 137 ETFTQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGVANAVTRLKDSKAK---------- 186 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---------- 332 K II LTDG N+ +I L AK G VY IGV + Sbjct: 187 -SKVIILLTDGTNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQY 242 Query: 333 -----------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 243 INMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|194289639|ref|YP_002005546.1| hypothetical protein RALTA_A1531 [Cupriavidus taiwanensis LMG 19424] gi|193223474|emb|CAQ69479.1| conserved hypothetical protein, vWA domain (Von Willebrand factor, type A); putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 359 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 84/234 (35%), Gaps = 27/234 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVA 195 + + P + + ++ K D+++ LD+S SM+ F L Sbjct: 65 LAPLAWALVVTALARPQFLEAPIE---KMQPARDLLIALDLSQSMDTRDFRDPAGALIPR 121 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 +++R ++ R GL+ F P + +Q I L+ Sbjct: 122 VQAVR---QVVSGFVARRPGDRIGLIVFGDAPYPLAPFTLDHRLVQTLIADLL------- 171 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 PG+ + DA + + + +K +I LTDG +++ + + + AK R Sbjct: 172 -PGMAGPSTALGDAIGLGIKMFEHSEAPEKVLIVLTDGNDTASRMPPERAGGI---AKER 227 Query: 316 GAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +V+ IG+ A L+ A + R++ + L + + + Sbjct: 228 KVVVHTIGIGDPNASGEEKVDLDVLQKLAAQTGGRYFFGADQAGLETIYATLDQ 281 >gi|224539999|ref|ZP_03680538.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] gi|224518389|gb|EEF87494.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] Length = 327 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 54/273 (19%), Positives = 91/273 (33%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PF+ T++ +S+ + G+D+MM Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFMLRIIALALIIVVLARPQTTNSWQNSEIE-GIDIMMA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 IDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGESFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + G T G+ A ++ D+K K K II L Sbjct: 145 TVDHAVLLNLFQGIKCGIIEDGTAVGMGIANAVTRLKDSKAK-----------SKVIILL 193 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------------------- 328 TDG N+ +I L AK G VY IGV Sbjct: 194 TDGTNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEID 250 Query: 329 ADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGK 360 + A+ ++ ++ KL + + I K Sbjct: 251 EKTLTQIAATTEGNYFRATSNSKLKEVYEEIDK 283 >gi|315223608|ref|ZP_07865462.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] gi|314946389|gb|EFS98384.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] Length = 340 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 80/221 (36%), Gaps = 50/221 (22%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D++M +DVS SM ++ + +K P+ R GLV ++ + Sbjct: 98 GIDIVMAIDVSASMLSK-DLKPNRFEALKKVAS---QFVKDRPN----DRIGLVIYAGES 149 Query: 228 VQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P+ I ++ L +G T GL A N++ K Sbjct: 150 YTKTPVTTDKGIILNALSELTYGQIEDGTAIGMGLATAVNRL-----------KESKAKS 198 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------------- 328 + II LTDG N++ ID + + A G VY IG+ + Sbjct: 199 RVIILLTDGVNNTGFIDPQTA---AELAAEYGIRVYTIGIGSNGTALSPYALNPDGSIMY 255 Query: 329 -------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + +K A + R++ +++KL + I K Sbjct: 256 RMLQVEIDEPLMKKIAEVTHGRYFRATDNQKLQQIYNEINK 296 >gi|330829742|ref|YP_004392694.1| von Willebrand factor type A domain-containing protein [Aeromonas veronii B565] gi|328804878|gb|AEB50077.1| von Willebrand factor type A domain protein [Aeromonas veronii B565] Length = 330 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 78/252 (30%), Gaps = 50/252 (19%) Query: 131 SRYEMPFIFCTFPWCANSSH--APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + +P WCA P V + + D+++ +D+S SM Sbjct: 40 PQAGLPLWRAMLLWCALILALCRPQWWGEPVI---QYEGSRDLLLAVDLSDSMRTPDMLD 96 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 + ++R IK++ R G++ F+ PL + + + L Sbjct: 97 NGEQQARLTAVR---QQIKALIAKRAGDRVGIIVFADHAYLLSPLTQEIPALLTLSDELD 153 Query: 249 F---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 F G TT + A + ++ +TDG N++ N D Sbjct: 154 FDLVGRTTALGEAILLARQHGDPGRPTA-------------LLLVTDGRNTAGNADP--- 197 Query: 306 LFYCNEAKRRGAIVYAIGVQAEA---------------------ADQFLKNCA--SPDRF 342 L A +G +Y +GV A+ + LK A R+ Sbjct: 198 LQEAKLAAAQGIRIYTLGVGADPDTFIQPYDEAGSGQADPSSELDEPLLKELAQTGQGRY 257 Query: 343 YSVQNSRKLHDA 354 + + L Sbjct: 258 FRARTQSDLDTI 269 >gi|312139646|ref|YP_004006982.1| integral membrane protein [Rhodococcus equi 103S] gi|311888985|emb|CBH48298.1| putative integral membrane protein [Rhodococcus equi 103S] Length = 326 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 39/274 (14%), Positives = 82/274 (29%), Gaps = 28/274 (10%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 I+R ++ + A +R + T + + Sbjct: 28 IQRRRQKHVLRFANMELLEKVAPNRPNITRHLPTALLLVGLVFLTVALAGPTADKRVPRN 87 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +++V+DVSLSM +L A + + D + + GLV F+ Sbjct: 88 RATVVLVIDVSLSMKATDVEP-SRLAAAQEAAKSFADGLTP------GINLGLVAFAGTA 140 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + I+ L T + + + + + I Sbjct: 141 SVLVSPTPNRDETKAAIDNLTLSERTATGEAI---FTSLQSIDTLAAVLGGSEQAPPARI 197 Query: 288 IFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQ 331 + L+DG+ + P D + +AK +G + I D Sbjct: 198 VLLSDGKQTVPESPDDPRGGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDP 257 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 L+ A S F++ + +L D + + +++ Sbjct: 258 SLREIANLSGGNFFTASSLEELRDVYDTLEEQIG 291 >gi|325676908|ref|ZP_08156581.1| von Willebrand factor [Rhodococcus equi ATCC 33707] gi|325552456|gb|EGD22145.1| von Willebrand factor [Rhodococcus equi ATCC 33707] Length = 326 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 39/274 (14%), Positives = 82/274 (29%), Gaps = 28/274 (10%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 I+R ++ + A +R + T + + Sbjct: 28 IQRRRQKHVLRFANMELLEKVAPNRPNITRHLPTALLLVGLVFLTVALAGPTADKRVPRN 87 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +++V+DVSLSM +L A + + D + + GLV F+ Sbjct: 88 RATVVLVIDVSLSMKATDVEP-SRLAAAQEAAKSFADGLTP------GINLGLVAFAGTA 140 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + I+ L T + + + + + I Sbjct: 141 SVLVSPTPNRDETKAAIDNLTLSERTATGEAI---FTSLQSIDTLAAVLGGSEQAPPARI 197 Query: 288 IFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQ 331 + L+DG+ + P D + +AK +G + I D Sbjct: 198 VLLSDGKQTVPESPDDPRGGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDP 257 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 L+ A S F++ + +L D + + +++ Sbjct: 258 SLREIANLSGGNFFTASSLEELRDVYDTLEEQIG 291 >gi|289178041|gb|ADC85287.1| Fibronectin-binding protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 2710 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 77/218 (35%), Gaps = 53/218 (24%) Query: 190 DKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 ++ +++ +D + D N R GLVT++S + L + ++ ++ Sbjct: 237 TRMYALKQAVNGFIDQTIAANAKVSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVD 296 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE----NSSPNID 301 L T++ G++ A + +A+ K +IF TDG+ N N Sbjct: 297 DLKASGATRADLGMQTANTVLGNARADA----------SKIVIFFTDGQPTKSNGFENDV 346 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAA------------------DQFLKNCAS----- 338 +++ K GA VY++G+ A + F++ +S Sbjct: 347 ANDAIGAAKTMKTNGASVYSVGIFTGANPDANVSSVTGKSDIELKSNAFMQGVSSNYPNA 406 Query: 339 ------------PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + + ++ L+ F I E+ Sbjct: 407 TTYTNLGAKAPNSNYYLAASDADTLNAVFNTIWSEVSS 444 >gi|126731955|ref|ZP_01747758.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707487|gb|EBA06550.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 318 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 70/205 (34%), Gaps = 35/205 (17%) Query: 167 IGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G D+++ LD+S SM G + +L + + R GLV Sbjct: 85 SGRDIVLALDLSGSMEREDFSLNGQTVSRLAAVQGVAADFV-------RGRTGDRVGLVV 137 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P V + I+ L G + K+T I D + D Sbjct: 138 FGDRAYVAAPQTHDVASVARLIDGLQIGVSGKAT--------AIADGLGLAIRRLRERDA 189 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------AEAAD 330 + I+ L+DG++++ +D + A+ G VY I + A AD Sbjct: 190 KSRVILLLSDGQDTTGMVDP---VAAAQTARDLGMRVYTIALGPADLSDDPGARDAVDAD 246 Query: 331 QFLKNC-ASPDRFYSVQNSRKLHDA 354 + A+ + V+ + L Sbjct: 247 TLRRIAQAAGGETFRVRTTDDLQAV 271 >gi|219682744|ref|YP_002469127.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] gi|219620394|gb|ACL28551.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] Length = 2582 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 77/218 (35%), Gaps = 53/218 (24%) Query: 190 DKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 ++ +++ +D + D N R GLVT++S + L + ++ ++ Sbjct: 237 TRMYALKQAVNGFIDQTIAANAKVSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVD 296 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE----NSSPNID 301 L T++ G++ A + +A+ K +IF TDG+ N N Sbjct: 297 DLKASGATRADLGMQTANTVLGNARADA----------SKIVIFFTDGQPTKSNGFENDV 346 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAA------------------DQFLKNCAS----- 338 +++ K GA VY++G+ A + F++ +S Sbjct: 347 ANDAIGAAKTMKTNGASVYSVGIFTGANPDANVSSVTGKSDIELKSNAFMQGVSSNYPNA 406 Query: 339 ------------PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + + ++ L+ F I E+ Sbjct: 407 TTYTNLGAKAPNSNYYLAASDADTLNAVFNTIWSEVSS 444 >gi|319952789|ref|YP_004164056.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421449|gb|ADV48558.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 332 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 81/237 (34%), Gaps = 52/237 (21%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 T + +K+ G+D++M +DVS SM P ++L + D IK P+ Sbjct: 75 RPQTEDISTKTKTTKGIDIVMAIDVSSSMLARDLKP--NRLASLKKVAA---DFIKKRPN 129 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFD 268 R GLV ++ + P+ + + + +GS T GL + N++ Sbjct: 130 ----DRIGLVVYAGESYTKTPITSDKGIVLNALKEITYGSLEDGTAIGMGLATSVNRL-- 183 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 K K II LTDG N+S I+ + + A Y IG+ Sbjct: 184 ---------KESKALSKVIILLTDGINNSGFIEPQTA---AELAVEYDIKTYTIGLGTNG 231 Query: 329 AD-----------------------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L+ A + ++ N+ L + I K Sbjct: 232 NALSPIAINSDGSFRYGMKPVEIDEGLLEQIAKTTGGAYFRATNNESLASIYDEINK 288 >gi|183601829|ref|ZP_02963198.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|241190320|ref|YP_002967714.1| hypothetical protein Balac_0261 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195726|ref|YP_002969281.1| hypothetical protein Balat_0261 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|183218714|gb|EDT89356.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|240248712|gb|ACS45652.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250280|gb|ACS47219.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295793307|gb|ADG32842.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis V9] Length = 2696 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 77/218 (35%), Gaps = 53/218 (24%) Query: 190 DKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 ++ +++ +D + D N R GLVT++S + L + ++ ++ Sbjct: 223 TRMYALKQAVNGFIDQTIAANAKVSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVD 282 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE----NSSPNID 301 L T++ G++ A + +A+ K +IF TDG+ N N Sbjct: 283 DLKASGATRADLGMQTANTVLGNARADA----------SKIVIFFTDGQPTKSNGFENDV 332 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAA------------------DQFLKNCAS----- 338 +++ K GA VY++G+ A + F++ +S Sbjct: 333 ANDAIGAAKTMKTNGASVYSVGIFTGANPDANVSSVTGKSDIELKSNAFMQGVSSNYPNA 392 Query: 339 ------------PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + + ++ L+ F I E+ Sbjct: 393 TTYTNLGAKAPNSNYYLAASDADTLNAVFNTIWSEVSS 430 >gi|83859217|ref|ZP_00952738.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] gi|83852664|gb|EAP90517.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] Length = 436 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 61/432 (14%), Positives = 128/432 (29%), Gaps = 75/432 (17%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA--TKI 63 + + +G+++I+ A+ V+ +G ++ S V ++L LD L A T+ Sbjct: 8 FSRWSDDRRGNVAIIMALCSGVLVTAVGGALDYSRSTTVSSELQSALDSGALAAASLTQD 67 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIE-------------- 109 N E+ + + + Q D ++ N + Sbjct: 68 RNPEDVVRAYVEAALADHPQLLASLQLDVVADISLNSRVVNATASVAMPTTMLGLVGINT 127 Query: 110 -------------RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 R +S+++D + + F A+++ + Sbjct: 128 LTLEHASEAIEQVRDVEISLVLDVSGSMGGSKINALQDAAIEFVEIVLAADAAERTSISV 187 Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG----VATRSIREMLDIIKSIPDV 212 ++ ++ +V S + N G +G V ++ + Sbjct: 188 IPYNGGVRTPREVNQDIV---SGNNNHRRQSGCVDMGTDYPVEMTLPYREMEFTEYYGSE 244 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 S S + F L+ ++ IN L T + + A Sbjct: 245 QTGNSSSAFCPRSNMESEF-LSQNEGRMRGLINSLRAEGNTGLDVATMWGARALDPAWRG 303 Query: 273 LEHIAKGHDDYK-------KYIIFLTDGENS---------------------------SP 298 + K ++ +TDGE + S Sbjct: 304 NLGGSFSDRPASYDDRDTIKILVVMTDGEATAQIRSEEYTYYDWWGRERTGTRSYELYSA 363 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA-SPDRFYSVQNSRKLHDAF 355 + C+ A+ G +Y I Q ++NCA P +Y V+N + +AF Sbjct: 364 RQARENMAEACDIAEGNGVQIYTIAFQLSGQTNRDLMRNCANKPQNYYQVENL-DIAEAF 422 Query: 356 LRIGKEMVKQRI 367 I ++ + R+ Sbjct: 423 SSIAADINRLRL 434 >gi|253701051|ref|YP_003022240.1| von Willebrand factor A [Geobacter sp. M21] gi|251775901|gb|ACT18482.1| von Willebrand factor type A [Geobacter sp. M21] Length = 331 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 51/250 (20%), Positives = 85/250 (34%), Gaps = 50/250 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM----NDHFGPGMDKL 192 + P + ++ S+ G+D+++ LD+S SM G G +L Sbjct: 58 LRLAVLALGIAALARPQAVERESRVRSR---GMDLVLALDLSTSMLAEEQGREGRGESRL 114 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 A R + + R GLV F+ + PL Q +Q ++RL G+ Sbjct: 115 AAAKRVLAGFI-------GGRKDDRIGLVAFAGRPYPAAPLTSDHQWLQGVVDRLDTGAV 167 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + DA + + + +I +TDG N++ E A Sbjct: 168 EDGT--------ALGDAILSGVNRLRRRPAESRALILITDGRNNAG----AEPQLAAQAA 215 Query: 313 KRRGAIVYAIGVQ----------------------AEAADQFLKNCA--SPDRFYSVQNS 348 K G V+AIG+ AE LK A + R++ ++ Sbjct: 216 KALGIRVHAIGIGSRGSAVIPVPSPLGGTIYRRLDAELDAATLKGVAELTGGRYFEAGDA 275 Query: 349 RKLHDAFLRI 358 L F I Sbjct: 276 TVLSRVFAEI 285 >gi|156742542|ref|YP_001432671.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233870|gb|ABU58653.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 547 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 49/202 (24%), Positives = 86/202 (42%), Gaps = 26/202 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + D+++V+DVS SM DKL A + L I +P+ R GLVT Sbjct: 366 SLNRKRADILLVVDVSGSMEG------DKLEAAKAGLGTFLSRI--LPE----DRVGLVT 413 Query: 223 FSSK---IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 FS++ +V PL+ + + I + T Y+ + D KE L+ + Sbjct: 414 FSTESRLVVPPAPLSDTRIRLDDAIAVMRAQGRTAL-------YDALIDGKEALDSLPST 466 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 DD + I+ L+DG ++S ++ E G ++ + A+A L+ A+ Sbjct: 467 GDDRIRAIVLLSDGLDNSSRATLEQVRLAFEE---SGISIFPVAYGADADTDALQQIATF 523 Query: 340 DRFYSVQ-NSRKLHDAFLRIGK 360 R VQ ++ + F + + Sbjct: 524 SRTILVQGDAGDIGQIFENLSR 545 >gi|301058344|ref|ZP_07199377.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447580|gb|EFK11312.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 598 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 42/172 (24%), Positives = 63/172 (36%), Gaps = 28/172 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + G+D+++ LDVS SM D+L A R I + L ++K R GLV FS Sbjct: 85 TQKGVDIVIALDVSPSMLVE-DIKPDRLERAKREITDFLKVVKG-------DRVGLVAFS 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 PL + +N L + T + A E Sbjct: 137 GAAYTQCPLTLDYGALMMFLNILHPNNIPHPGTDLGAAVLGAIKAFDPKSET-------- 188 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K I+ +TDGE++ K L EA R+G ++ G+ A Sbjct: 189 ---DKVILLITDGEDNE-----KRGLDAAREAVRKGIKIFVFGMGDPAGGPI 232 >gi|255261929|ref|ZP_05341271.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255104264|gb|EET46938.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 478 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 57/458 (12%), Positives = 142/458 (31%), Gaps = 97/458 (21%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA----- 60 ++ F + G+ I + + ++ + G+ ++ +A+L LD ++L A Sbjct: 24 VKEFARDEDGAFIIFSLFMFVLMLLTAGMALDLMRYETHRARLQGTLDRAVLAAADLDQT 83 Query: 61 ---TKILNQ-----------ENGNNGKKQKNDFSYRIIKNIWQTDF-----RNELRENGF 101 ++ + + T F + EL G Sbjct: 84 LSPAAVVTDYFAKAGLSSFLTSTTVDQGLNYRIISAQGNMTMPTTFMRLSGQTELAIRGD 143 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 A + + +S+++D + +S L+ + + Sbjct: 144 ATAEERV-SNVEISLVVDISGSMGRNNKLSTLR-TASHTFIDTVIRPETEDLISLNIIPY 201 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +++ + G D+ L V N + T ++ ++ + + Sbjct: 202 TAQVNAGPDIFDQLTVDQKHNFSHCIDFEPADFNTAALDVPPVSTRTYKQMQHFQYGWSS 261 Query: 222 TFSSKI-------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ + + P + ++ + L + T G+++ + + + Sbjct: 262 SYVNNPGCPMQSYERIVPFSQDATSLKSTVTSLRARANTAIHLGMKWGVSMLDPTFRPIV 321 Query: 275 HIAKGHDDYK----------------KYIIFLTDGEN----------------------- 295 ++ K I+ +TDG+N Sbjct: 322 TAMIANNKVDPEFAGRPVAYNDPETLKTIVLMTDGQNVDTYRISDEFYSTPSQIAHWDRY 381 Query: 296 -------SSPNIDNKESLFY---------------CNEAKRRGAIVYAIGVQAE--AADQ 331 + + D ++ +Y C+ AK G +V+ IG + AA + Sbjct: 382 QLFFFTNNYIDRDIDQNYYYKKFTATQADTMLQSICDAAKAEGILVWTIGFEVSNHAAGE 441 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 L +SP F+ V+ +L +AF I +++ + R++ Sbjct: 442 MLDCASSPSHFFRVEGV-ELSEAFASIARQINQLRLVL 478 >gi|55380211|ref|YP_138060.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] gi|55232936|gb|AAV48354.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] Length = 1562 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 70/217 (32%), Gaps = 31/217 (14%) Query: 150 HAPLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 T V+ + + +D+ +V+D S SM+ R + +LD Sbjct: 480 ATTYNATEPVRQTDDDGLRPVDVTLVMDTSGSMSSSVKLRN---TAGQRFVAGLLD---- 532 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 V R+ +V F S L ++ L G T GL A ++ Sbjct: 533 ------VDRAAVVDFDSSAYVAQDLTSDFGAANSTLDNLGSGGGTDIGSGLSTANSQFAS 586 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 +D + +I LTDG + + A + VY +G Sbjct: 587 NS---------NDSRAQVMILLTDGRGNGG-------ISEAQTAANQNTTVYTVGFDNAN 630 Query: 329 ADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 D+ + F V + +L + F RI + + Sbjct: 631 RDKLRDIANITDGEFNYVTDRSELPNVFSRIAENTTE 667 >gi|325860278|ref|ZP_08173400.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482157|gb|EGC85168.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 318 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 47/204 (23%), Positives = 76/204 (37%), Gaps = 27/204 (13%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P +++ VA E + GL F+ + Sbjct: 87 GIDIMLTMDVSASMLTEDVYP--NRMVVAKEVASEFI-------SGRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTK-STPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ + L+ G T T GL I K K Sbjct: 138 AFTQCPMTLDHAALLN----LLHGVRTDLVTSGLMQDGTAIGMGLANAVSRLKDSKAKSK 193 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------LKNCA- 337 +I LTDG N++ +I + A++ G VY IG E ++ L++ A Sbjct: 194 IVILLTDGSNNAGSISP---MTAAAIARKFGIRVYTIGFGKETGEEIGAIDYKTLQDIAV 250 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGK 360 + FY Q+ +L + I K Sbjct: 251 STNGEFYRAQSQAELSRIYQDIDK 274 >gi|257058175|ref|YP_003136063.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] gi|256588341|gb|ACU99227.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] Length = 418 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 72/214 (33%), Gaps = 31/214 (14%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 L T S + +++ ++LD S SM + + ++D + Sbjct: 25 LAISLWATGE---DSDRTLPINLGLILDRSGSMRAQ------AMETVKEAANYLVDGLG- 74 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 PD R ++TF+ P +Q ++ KINRL T G++ + Sbjct: 75 -PD----DRLSVITFNHHAEVILPNQSVEDLQGVKNKINRLTASGGTCIDEGMKLGIKEA 129 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 KE I LTDGEN DN+ L A + +G + Sbjct: 130 ALGKENRVSQ----------IFLLTDGENEHG--DNERCLKLAKVAAEYNITLNTLGFGS 177 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A + +++ + F R+ Sbjct: 178 NWNQDILEQIADSAGGMLCYIEHPEQALTEFSRL 211 >gi|288802180|ref|ZP_06407620.1| BatA protein [Prevotella melaninogenica D18] gi|288335147|gb|EFC73582.1| BatA protein [Prevotella melaninogenica D18] Length = 318 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 46/235 (19%), Positives = 77/235 (32%), Gaps = 26/235 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M + P +K G+D+M+ +D+S SM +++ V Sbjct: 57 MALRCIVYALVVIVLARPQTYN---TWDNKDTEGIDIMLTMDISASMLTEDVFP-NRMEV 112 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A E + GL F+ + P+ + ++ + T Sbjct: 113 AKEVASEFISS-------RPSDNIGLTIFAGEAFTQCPMTLDHAALLNLLHNVRTDLVTN 165 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 GL I K K +I LTDG N+ +I + AK+ Sbjct: 166 ---GLMQDGTAIGLGLANSVSRLKDSKAKSKVVILLTDGSNNVGSISP---MTAATIAKK 219 Query: 315 RGAIVYAIGVQAEAADQF-------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 G VY IG+ E + L++ A + FY Q+ +L + I K Sbjct: 220 FGIRVYTIGLGRETGEDIGAIDYKTLQDIAVLTNGEFYRAQSQAELSKIYQDIDK 274 >gi|218245149|ref|YP_002370520.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] gi|218165627|gb|ACK64364.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] Length = 418 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 72/214 (33%), Gaps = 31/214 (14%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 L T S + +++ ++LD S SM + + ++D + Sbjct: 25 LAISLWATGE---DSDRTLPINLGLILDRSGSMRAQ------AMETVKEAANYLVDGLG- 74 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 PD R ++TF+ P +Q ++ KINRL T G++ + Sbjct: 75 -PD----DRLSVITFNHHAEVILPNQSVEDLQGVKNKINRLTASGGTCIDEGMKLGIKEA 129 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 KE I LTDGEN DN+ L A + +G + Sbjct: 130 ALGKENRVSQ----------IFLLTDGENEHG--DNERCLKLAKVAAEYNITLNTLGFGS 177 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A + +++ + F R+ Sbjct: 178 NWNQDILEQIADSAGGMLCYIEHPEQALTEFSRL 211 >gi|153842534|ref|ZP_01993517.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] gi|149745366|gb|EDM56617.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] Length = 223 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 71/198 (35%), Gaps = 44/198 (22%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 G +D+L + + + + R G+V F PL Q + ++I Sbjct: 10 GEYIDRLSAVKKVLSDFV-------AKRKGDRLGVVLFGDHAYLQTPLTADRQTVIQQIK 62 Query: 246 RLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 + + G T G+ D+ D ++ +I L+DG N++ +D Sbjct: 63 QTVIGLVGQRTAIGDGIGLGTKTFVDS-----------DAPQRVMILLSDGSNTAGVLDP 111 Query: 303 KESLFYCNEAKRRGAIVYAIGVQ------------------AEAADQFLKNCA--SPDRF 342 + AK+ A +Y +GV A+ +Q L A + ++ Sbjct: 112 ---IEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQY 168 Query: 343 YSVQNSRKLHDAFLRIGK 360 + +++ +L + I K Sbjct: 169 FRARDAEQLEKIYDTINK 186 >gi|315649108|ref|ZP_07902201.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275543|gb|EFU38898.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 983 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 38/233 (16%), Positives = 76/233 (32%), Gaps = 44/233 (18%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMM-------------VLDVSLSMNDHFGPG 188 + + K + + + M + V+D S SM+ Sbjct: 367 IGFMMAGGEDSFGMGGYFKTPIEKALPVSMELEGKREIPSLGLILVIDRSGSMDG----- 421 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINR 246 K+ +A S +++++S V G+V F + P + + I Sbjct: 422 -TKIELAKESAMRTVELLRSKDTV------GVVAFDDQPWWVVPPQKLGNKEEVLSSIQS 474 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + T P + A ++ K + H II +TDG+ + N Sbjct: 475 IPSAGGTNIYPAVSSALEEMLKIKSQRRH-----------IILMTDGQ----SAMNSGYQ 519 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 + + ++ V +A L++ A+ R+Y V++ L F R Sbjct: 520 DLTDTMVENKITMSSVAVGTDADTHLLQSLAEAAKGRYYFVEDETTLPAVFSR 572 >gi|126649837|ref|ZP_01722073.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] gi|126593556|gb|EAZ87501.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] Length = 865 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 85/217 (39%), Gaps = 35/217 (16%) Query: 157 SSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 ++I K + L + +VLD S SM+ KL +A + +++++ Sbjct: 394 VEMEIKGKEQLPSLGLAIVLDRSGSMSGS------KLELAKEAAARSVEMLRDEDT---- 443 Query: 216 VRSGLVTFSSK---IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 G + F + I++T PL + + I + G T+ L AY + D K + Sbjct: 444 --LGFIAFDDRPWEIIETGPL-NNKEEAVDTILSVTPGGGTEIYGSLAKAYENLADIKLQ 500 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +H II LTDG++ N ++ + K G + + + +A Sbjct: 501 RKH-----------IILLTDGQSQPGNYED-----LIEQGKDNGITLSTVAIGQDADANL 544 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L+ + RFY+V + + + R + + I Sbjct: 545 LEALSEMGSGRFYNVIDEQTIPSILSRETAMISRTYI 581 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 59/185 (31%), Gaps = 34/185 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ ++D S SMN + + S+ + +GL +FSS + Sbjct: 67 QIVYLIDRSASMNGTEDEMVQFIQ-------------DSLQSKKDEQLAGLYSFSSTLQT 113 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + ++ + + + T L+ A I K + Sbjct: 114 EAIMTKTLKEVPKF-TEIKATDQTNIEQSLQLATGIIDPKKATR-------------FVL 159 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDRFYSVQNS 348 LTDG + N +L + + K V + + LK+ SP Y V Sbjct: 160 LTDGNETKGN-----ALDFATKFKGSNISVDVVPFSQPVVNDVSLKSFVSPQVAY-VGEQ 213 Query: 349 RKLHD 353 ++L Sbjct: 214 QQLVT 218 >gi|90424817|ref|YP_533187.1| hypothetical protein RPC_3326 [Rhodopseudomonas palustris BisB18] gi|90106831|gb|ABD88868.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 479 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 35/141 (24%), Positives = 56/141 (39%), Gaps = 11/141 (7%) Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 ++ KIN+L T GL +A+ + Y II L+DG+N+ Sbjct: 337 STLKGKINKLDAEGNTNQPIGLFWAWMSLQTGVPLNTPAKDTEYKYTDAIILLSDGDNTQ 396 Query: 298 PNIDNKES------LFYCNEAKR--RG-AIVYAIGVQAEAADQ--FLKNCASPDRFYSVQ 346 N S C+ K G ++ I V + D+ LK CAS +F+ Sbjct: 397 SGNSNSVSAIDARQKKLCDNIKDPLNGTTTIFTIQVNTDGDDESAVLKYCASDGQFFQST 456 Query: 347 NSRKLHDAFLRIGKEMVKQRI 367 + ++ AF IG + K R+ Sbjct: 457 TADQIEIAFQSIGSSLTKLRL 477 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 40/288 (13%), Positives = 88/288 (30%), Gaps = 46/288 (15%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL------YTA 60 + F G+I++L I + + +G+ ++ S ++ + D + L Sbjct: 15 KAFHAADDGNIAVLFGIAVIPLISFVGVAVDYSRATAARSAMQGAADSATLMVSKDYAAG 74 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 + K K ++ I N+ + + + Sbjct: 75 VIRASDIQATAEKYFKALYTSPGINNV-----------------------TVTATYTARS 111 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + MP F A + P +S+ + L + M LDV+ S Sbjct: 112 ANGSSTVVMNTSGSMPTSFLK---VAGFTALPFTASSTSTWGA---TRLRVAMALDVTGS 165 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP------LA 234 M+ DKL + ++++ +K+ + V ++ F+ + L Sbjct: 166 MDWD-----DKLTAMKTAAIKLVNTLKATASTDADVYISIIPFNVMVNVGTANKDAEWLD 220 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 W + K NR S + + N + G + Sbjct: 221 WDTDYGSCKSNRTTQNSCQAAGETWSWWANSCTSRYTRKSTCVAGGET 268 >gi|190892054|ref|YP_001978596.1| hypothetical protein RHECIAT_CH0002466 [Rhizobium etli CIAT 652] gi|190697333|gb|ACE91418.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 427 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 60/433 (13%), Positives = 130/433 (30%), Gaps = 108/433 (24%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL------YTATK 62 F + G+ I+TA+L+ + G+ ++ +H ++ +L+ D + + A Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 NGN + +NI+ + EL E + + + Sbjct: 68 AAMAMNGNGTISLGKTDA----RNIFMSQVSGELAE---------VHVDLGIDVTKTANK 114 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 + +S + F+ + I+ + ++ +D ++LD + SM Sbjct: 115 LNSQVSFTATVPTTFMQIF-------GRDSITISGTATAEYQTAAFMDFYILLDNTPSMG 167 Query: 183 DHFGPG-----------------MDK----------------LGVATRSIREMLDIIKSI 209 P MDK + V ++ + + D K+ Sbjct: 168 VGATPSDVSKLEAKVGCAFACHQMDKSTNNYTIAKSLGVAMRIDVVRQATQALTDTAKTE 227 Query: 210 PDVNNVVRSGLVTFSSKIV-----QTFPLAWGVQHIQEKINR----------LIFGSTTK 254 ++ R G+ TF +K L + ++ + T Sbjct: 228 RVSSDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKSYTDAVDLMTIPYQNYNSDQITN 287 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG------------ENSSPNIDN 302 + N I D + +K + F++DG + + Sbjct: 288 FDSAM-TQMNTIIDPAGD----GTSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQE 342 Query: 303 KESLFYCNEAKRRGAIV---YAIGVQAEAADQF--------------LKNCASPDRFYSV 345 +C K RG + Y + + + ++ CASP ++ V Sbjct: 343 PIDTSFCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEV 402 Query: 346 QNSRKLHDAFLRI 358 + + DA + Sbjct: 403 SPTDGITDAMKAL 415 >gi|296159241|ref|ZP_06842067.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295890500|gb|EFG70292.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 345 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 43/214 (20%), Positives = 76/214 (35%), Gaps = 33/214 (15%) Query: 168 GLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D+M+ +D+S SM D G MD+L R + + + R GLV Sbjct: 93 ARDLMLAIDLSGSMATRDFVDPAGERMDRLSAVKRVVADFV-------AKRKGDRIGLVV 145 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F PL ++ ++++ G + I DA + Sbjct: 146 FGDAAYPQAPLTLDHDSVRILLDQMQIG--------MAGPRTAIGDAIGLTVKLMADSHA 197 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKN 335 +K +I LTDG ++S I + + AK+ +V+ IG+ + L Sbjct: 198 QEKVLILLTDGNDTSSAIPPERA---AEIAKQHKLVVHTIGIGDPGTTGEDRVDLEALAR 254 Query: 336 CAS--PDRFYSV-QNSRKLHDAFLRIGKEMVKQR 366 AS R + R L + + + K ++ Sbjct: 255 IASITGGRAFRALGQQRDLAEVYATLDKLTPEKI 288 >gi|170739681|ref|YP_001768336.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193955|gb|ACA15902.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 329 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 68/205 (33%), Gaps = 27/205 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++M+ LD+S SM +D V+ + + + R GLV F+ + Sbjct: 98 SGREIMLALDLSGSMER-VDFSIDGRNVSRLAAVKQVGA--DFIRRRAGDRIGLVIFADQ 154 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L++ + + G +ST I D +K Sbjct: 155 ADVAASLSFDTAAVVHALEEAQIGLVGRSTG--------IGDGLGLALKRLDAATAREKV 206 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------AEAADQFLKNCA--- 337 +I L+DG N++ + A+ G V+ I + A+ + A Sbjct: 207 VILLSDGANNAGQTTPHD---VAGLARDLGIRVHTIALGPRDLSDADGDPDVVDTEALRD 263 Query: 338 ----SPDRFYSVQNSRKLHDAFLRI 358 S RF+ V+ + L I Sbjct: 264 VSATSGGRFFRVRTTDDLAAVADAI 288 >gi|156616286|ref|NP_001096077.1| collagen alpha-6(VI) chain isoform 1 [Mus musculus] gi|189082903|sp|Q8C6K9|CO6A6_MOUSE RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2265 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 807 LDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L ++ + N G T + L ++ + +A+ H + Sbjct: 858 VLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----Q 912 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D ++ + +G +V A+G+ + + L S D++Y V Sbjct: 913 VLIVITDGE----SHDAEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAGSSDKYYFV 968 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 969 ETFGGLKGIFSDVSASVCN 987 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 77/201 (38%), Gaps = 29/201 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D+M ++D S S+ M M +++ + V+ G+V FS Sbjct: 617 DMKADIMFLVDSSGSIGPENFSKMKMF---------MKNLVSKSQIGADRVQIGVVQFSH 667 Query: 226 KIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + F L I I+R+ G TT + L + K + Sbjct: 668 ENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGARPN------- 720 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I +TDGE D +L ++ G I+Y++GV Q + P+ Sbjct: 721 VRKFLILITDGEAQDIVRDPAIAL------RKEGVIIYSVGVFGSNVTQLEEISGKPEMV 774 Query: 343 YSVQNSRKLHDAFLRIGKEMV 363 + V+N D I ++V Sbjct: 775 FYVEN----FDILQHIEDDLV 791 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 69/212 (32%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 977 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKGFLVSVVQDFDVSLNR- 1035 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G+ FS F L G + I +I + T L Sbjct: 1036 --VRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQPDM 1093 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG + E E + +G +Y++G+ Sbjct: 1094 GSRINAGTP-----QVLLVLTDGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQ 1142 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ + ++ +V N +L RI + + Sbjct: 1143 ELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 96/312 (30%), Gaps = 43/312 (13%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N G Q T LR G IE + + H + Sbjct: 327 KNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFT 386 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVS 178 + F IT +V + S+ L D+ +++D S Sbjct: 387 SKLG---NFSELATHNQTFLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGS 443 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WG 236 S + E++ + P VR G V ++ F ++ Sbjct: 444 GS------TQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEISKYSN 494 Query: 237 VQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + I + G T + L + + AK++ +++ LT+G Sbjct: 495 KPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKER------GSKVPCHLVVLTNG-- 546 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDA 354 + L ++ + V+AIGV+ EA L+ A + R Y V DA Sbjct: 547 ----MSRDSVLGPAHKLREENIRVHAIGVK-EANQTQLREIAGEEKRVYYVHE----FDA 597 Query: 355 FLRIGKEMVKQR 366 I ++V++ Sbjct: 598 LRNIRNQVVQEI 609 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 66/198 (33%), Gaps = 16/198 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + L + + I ++ N +R GLVT+S++ L Sbjct: 226 VADVVFLLDMAINGSQEDLDHLKAFLG---ESISALDIKENCMRVGLVTYSNETRVISSL 282 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + G + ++I L T A K + ++ + + + +T Sbjct: 283 STGNNKTEVLQRIQDLSPQVGQAYTGA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVT 339 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ--NSR 349 +R G ++ +G++ D+ K + P ++ + N Sbjct: 340 ------HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFTSKLGNFS 393 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L K++ Q Sbjct: 394 ELATHNQTFLKKLRNQIT 411 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 77/199 (38%), Gaps = 21/199 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV ++ G+ + I +M I S+P N R L +S + F L Sbjct: 26 DVVFLVDSSDHLGLKSFPLVKTFIHKM---ISSLPIEANKYRVALAQYSDALHNEFQLGT 82 Query: 236 --GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 83 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 137 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + +A + G + ++GVQ +A+++ LK A+ ++++ +R Sbjct: 138 ------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHFNLRTARD 190 Query: 351 LHDAFLRIGKEMVKQRILY 369 L F E++K Y Sbjct: 191 L-SVFAPNMTEIIKDVTQY 208 >gi|91783676|ref|YP_558882.1| hypothetical protein Bxe_A2138 [Burkholderia xenovorans LB400] gi|91687630|gb|ABE30830.1| Conserved hypothetical protein containing von Willebrand factor type A domain [Burkholderia xenovorans LB400] Length = 337 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 75/214 (35%), Gaps = 33/214 (15%) Query: 168 GLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D+M+ +D+S SM D G MD+L R + + R GLV Sbjct: 85 ARDLMLAIDLSGSMATRDFVDPAGERMDRLSAVKRVVANFV-------AKRKGDRIGLVV 137 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F PL ++ ++++ G + I DA + Sbjct: 138 FGDAAYPQAPLTLDHDSVRILLDQMQIG--------MAGPRTAIGDAIGLTVKLMADSHA 189 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKN 335 +K +I LTDG ++S I + + AK+ +V+ +G+ + L Sbjct: 190 QEKVLILLTDGNDTSSAIPPERA---AEIAKQHKLVVHTVGIGDPGTTGEDRVDLEALAR 246 Query: 336 CAS--PDRFYSV-QNSRKLHDAFLRIGKEMVKQR 366 AS R + + L + + + K ++ Sbjct: 247 IASITGGRAFRALGQEKDLTEVYATLDKLTPEKI 280 >gi|218462234|ref|ZP_03502325.1| hypothetical protein RetlK5_23393 [Rhizobium etli Kim 5] Length = 66 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 20/65 (30%), Positives = 27/65 (41%), Gaps = 2/65 (3%) Query: 308 YCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQ 365 C+ AK +G +Y I A A Q L CAS D ++ + L AF IG + Q Sbjct: 1 TCDTAKSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASSQ 60 Query: 366 RILYN 370 Sbjct: 61 LTRLT 65 >gi|114707049|ref|ZP_01439948.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] gi|114537599|gb|EAU40724.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] Length = 317 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 78/212 (36%), Gaps = 41/212 (19%) Query: 167 IGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G D+++ +D+S SM D G + +L +E + R GL+ Sbjct: 86 TGRDLILAIDLSGSMEREDFDLDGRTVTRLDAVKAVAKEFVTS-------RAGDRVGLIL 138 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F+ PL + V + ++ I G +T GL A ++ K Sbjct: 139 FAEFAYTAAPLTFDVAAVSRIVDEATIGISGRSTAIAGGLGLALKRL-----------KR 187 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA----------- 328 D + +I L+DG ++S N+ ++S + G V+ I + E Sbjct: 188 SDAQSRVVILLSDGSDTSGNVLPRDSARLAEQL---GVTVHTIALGPEDMETAPQTRDAV 244 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L++ A S R + V+N+ L I Sbjct: 245 DTATLRDIAELSGGRTFRVRNTDDLRAVTAEI 276 >gi|21228580|ref|NP_634502.1| putative chloride channel [Methanosarcina mazei Go1] gi|20907073|gb|AAM32174.1| putative chloride channel [Methanosarcina mazei Go1] Length = 1004 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 45/293 (15%), Positives = 106/293 (36%), Gaps = 43/293 (14%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 K+ + + ++ + D+ + S + D + +Y+L+ +Y+ Sbjct: 241 KDNFTIEIGSKKVNDVTVSDVGEGKYKLSFNPPKQDSNGNYDLNVYVKYK---------- 290 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + + ++ ++M+V+D S SM+ G + + + ++ Sbjct: 291 --KVTLSDSELNAVRYGEDNANANANVMLVIDRSGSMS---GSPISSAKNSANLFIDYME 345 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV----QHIQEKINRLIFGSTTKSTPGLE 260 G+V+FSS + LA I++KIN + T G+ Sbjct: 346 AEDMA---------GVVSFSSSARYDYHLATLTPEVKNSIKQKINSIYASGVTAIGSGMR 396 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 Y N + + + A I+ L+DG +S + K VY Sbjct: 397 YGLNDLLNYGDPNNPWA---------IVLLSDGYQNSG----ENPNNVIPSIKASNIQVY 443 Query: 321 AIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 +G+ + L N A + ++Y +L + + I +++ + ++ + Sbjct: 444 TVGLGPAVDQKLLGNIADQTGGKYYYSPTDSQLQEIYNDIVGKIIGWKTVFKR 496 >gi|103487755|ref|YP_617316.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] gi|98977832|gb|ABF53983.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] Length = 666 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 53/159 (33%), Gaps = 28/159 (17%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK--EKLEHIAKGHDDYKKYIIFLTDG 293 + L T G+ + + A ++I+F+TDG Sbjct: 507 DRATFNTYVQSLQPLGGTYHDAGMVWGARLLSPTGLFADENATAPNDRPISRHIVFMTDG 566 Query: 294 EN--SSPNID-----------------------NKESLFYCNEAKRRGAIVYAIGVQAEA 328 + N+ N C A++RG ++ + + Sbjct: 567 AMAPNMGNLTFQGYEFLMHRVGGTSDSDLRDRHNNRFTQLCRAARQRGITIWVVSFGVGS 626 Query: 329 ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 D L NCAS + + N+ +L++ F I +++ K R+ Sbjct: 627 NDS-LNNCASSGQAFEADNAAELNEQFQAIARQISKLRL 664 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 83/244 (34%), Gaps = 29/244 (11%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 ++ + +G+ +LTA + + +G ++ + + +L D +L Sbjct: 10 CAGTKSLISDQRGNAFMLTAAAIIPVIGFVGSAVDIGRAYMTQLRLQQACDAGVLAGRRA 69 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 + + Q + + A + +++E S + + Sbjct: 70 M--GGASYDEAAQAEANKMFNFNFPEAKYGATGILFSSRALNASDVEGQASAVLPTE--- 124 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 F + T+ ++IS +D+M+VLDV+ SM Sbjct: 125 -----------------LMFMFGKEEFRLSADCTAKLEIS-----NVDVMLVLDVTGSMA 162 Query: 183 D-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHI 240 + G ++++ + + D + + + +R G+V +SS V LA + Sbjct: 163 QTNAGDSVNRITALKDATMDFFDTLTNADVGDGRLRFGVVPYSSTANVGQILLAKNPAWL 222 Query: 241 QEKI 244 + + Sbjct: 223 ADTV 226 >gi|260430630|ref|ZP_05784603.1| von Willebrand factor, type A [Citreicella sp. SE45] gi|260418659|gb|EEX11916.1| von Willebrand factor, type A [Citreicella sp. SE45] Length = 318 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 70/205 (34%), Gaps = 35/205 (17%) Query: 167 IGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G D+++ LD+S SM G + +L D+ R GLV Sbjct: 85 SGRDIVLALDLSGSMEREDFVLDGRTVSRLAAVQ-------DVAAQFVRGRTGDRVGLVV 137 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + V + + I L G + K+T I D +G + Sbjct: 138 FGDRAYVAAAPTHDVGAVAQVIGTLQIGVSGKAT--------AIADGLGLAIRRLRGREA 189 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----------ADQ 331 + II L+DG++++ +D + A+ G VY I + + Sbjct: 190 ESRVIILLSDGQDTTGAVDP---VAAAQAAQELGMRVYTIALGPADLASSPDARDAVDSE 246 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDA 354 L+ A + + V+++ L Sbjct: 247 TLRRIAEVAGGETFRVRSTEDLEAV 271 >gi|327274976|ref|XP_003222250.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2088 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 51/254 (20%), Positives = 100/254 (39%), Gaps = 25/254 (9%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 S+ + + ++ + + EM F F + + L S + LD++ Sbjct: 744 YSVGVFNANRTQLVEISGKPEMVFYVEDFDILKHLENEILFGICSPYDECRRIERLDIVF 803 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V+D S S++ M + M+ ++K ++ V+ G V +S++ F L Sbjct: 804 VIDGSGSIDPKEYDIMKEF---------MISLVKKSDVSHDRVQFGAVKYSAEPETFFYL 854 Query: 234 AWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + I N G TT + L + + EH ++ H + +I Sbjct: 855 --NRYTTKSAIIRAIQNDKSIGETTYTAKALRH-----SEGLFSEEHGSRKHRGVPQVLI 907 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 +TDG+ + D E + + G I+YAIG++ D+ L S D+++ V Sbjct: 908 VITDGD----SHDAAELDEVSKKLRANGIIIYAIGIERARPDELLTMAGSEDKYFYVNTF 963 Query: 349 RKLHDAFLRIGKEM 362 L + RI +++ Sbjct: 964 EGLKHLYPRISEKI 977 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 81/195 (41%), Gaps = 25/195 (12%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + I D++ ++D S S++D ++ D+++ +N V+ G+ Sbjct: 981 SKPECGIPADLVFLIDGSNSISDS------DFTKMKNFLQ---DVVRPFDTGHN-VQVGI 1030 Query: 221 VTFSSKIVQTFPLA-WGVQ-HIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +S + + F L + + ++ +I R+ + G T L+ N Sbjct: 1031 AQYSDRYRKEFSLNMFSHKLELETQIGRIRQMEGLQTYIGAALDRVRNFFTPEGGSRV-- 1088 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 +++ ++ ++ +TDG + + + + +++G +YAIGV Q + Sbjct: 1089 ---NENIQQILLVITDGRSHD------KVVKAAEDLRKKGVDIYAIGVGRIDHLQLSQIA 1139 Query: 337 ASPDRFYSVQNSRKL 351 S DR Y+V N +L Sbjct: 1140 GSSDRKYTVDNFSEL 1154 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 77/191 (40%), Gaps = 27/191 (14%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + + D++ ++D S S++ D + E+++ D + G Sbjct: 604 VEACKEKKSDILFLVDSSRSISP------DNFLKMKDFMSELVNKSDIGLDR---MHVGA 654 Query: 221 VTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + FSS+ + F L+ I I R+ G +T + L++ + Sbjct: 655 IQFSSRNKEEFRLSQYATKSDIIRAIGRMSLMGQSTLTGGALQFVLDYF--------RPI 706 Query: 278 KGHDDY-KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 KG Y KK +I +TDGE E+L ++ G IVY++GV Q ++ Sbjct: 707 KGSRPYVKKILILITDGEAQDDVKTPAEAL------RQEGIIVYSVGVFNANRTQLVEIS 760 Query: 337 ASPDRFYSVQN 347 P+ + V++ Sbjct: 761 GKPEMVFYVED 771 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 64/179 (35%), Gaps = 24/179 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D D+ ++DV S ++K+ + + LD+ +R GLVT+ Sbjct: 216 DADAVADVAFIVDVGSS-----QTNIEKIQIFLEKLVSSLDVKDKC------MRIGLVTY 264 Query: 224 SSKIVQTFPL--AWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S+K T L A H+ + I + G + + K+F ++ Sbjct: 265 SNKPQATSLLRMATDKTHVLQSIQSISPKGGKANLGSVIHFTKEKVFSKSA----GSRKS 320 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 ++ I +T + +R G ++AIG+ A Q + A P Sbjct: 321 QGVEQIAIVIT------HRSSEDDVSGAATALRRAGVTIFAIGIDAANTTQLAQIVAYP 373 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 69/196 (35%), Gaps = 24/196 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++ S SMN ++E++ P+ VR G+V ++ Sbjct: 427 DIYFLIGGSSSMNYF------DFADLKLFLKEVVKFFMVGPNK---VRFGVVQYAEINEL 477 Query: 230 TF-PLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F P +G ++ N + L++ + + ++ + H Sbjct: 478 EFGPEEYGKTSDILKAIENIRQLRGNPHTGKALKFIHPLLRKSQGQHSRNVPCH------ 531 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 ++ LTD + P + + K +YAIG++ Q + S DR Y V Sbjct: 532 LVVLTDQISEDPVKEP------AKKLKNEMVSIYAIGIRHANESQIYEIAESKDRAYFVN 585 Query: 347 NSRKLHDAFLRIGKEM 362 + L + +++ Sbjct: 586 DFASLKHIRNEVVRDI 601 >gi|323136144|ref|ZP_08071226.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] gi|322398218|gb|EFY00738.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] Length = 652 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 56/188 (29%), Gaps = 49/188 (26%) Query: 233 LAWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L + + I+ + T + GL +A+ + K + A KK I+ + Sbjct: 465 LTNNLSTVTAAIDSMNYWLNGGTVISEGLMWAWRTLSPQKPYADGAAYTDKKTKKVIVLM 524 Query: 291 TDGENSSPNIDNKES-------------------------------------LFYCNEAK 313 TDG N + N S C+ AK Sbjct: 525 TDGVNGLADNGNAASANISDYSAYGYMGASRLSVADGVTTYAGLQTFLDDRLKKACDNAK 584 Query: 314 RRGAIVYAIGVQ---------AEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMV 363 +G +Y + + L CAS P+ + +S L+ AF +I Sbjct: 585 AKGISIYTVMFNHNGFLSATEQARSATLLSYCASKPEYAFLATDSAALNSAFGQIASSAA 644 Query: 364 KQRILYNK 371 + + Sbjct: 645 ASPLRLTR 652 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 57/178 (32%), Gaps = 30/178 (16%) Query: 23 ILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYR 82 ++PV F+ G I+ +++ L K Sbjct: 7 AMIPVTFMA-GAAIDYGRATLLRSSLQ-----------------------KAADAGALAA 42 Query: 83 IIKNIWQTDFRNELRENGFAQDINNIERSTSLSII-IDDQHKDYNLSAVSRYEMPFIFCT 141 + R ++ +N ++ R+ SL+I + + + + I + Sbjct: 43 GARTSLTQLAREQIAKNAVLANLGAKARNLSLTITETEPSAGVFQVQIQAS-----IATS 97 Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 A P+++TS ++ S +++ + LD + SM D +++ Sbjct: 98 IMKVARFDTIPVVVTSEARVVGGSTNPIEIALALDNTGSMRDDMPALKQAAKTLAQNV 155 >gi|163848731|ref|YP_001636775.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|163670020|gb|ABY36386.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] Length = 845 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 46/252 (18%), Positives = 92/252 (36%), Gaps = 34/252 (13%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 ++E+ +L ++ + K + F + + PLL+T Sbjct: 339 SLEQMVALREVVRSEGKGLTV---IGGNQSFTLGGYAETPLADALPLLMTPP---PRPQR 392 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + ++ ++D S SM+ FG + K +A + L ++ R G++ F ++ Sbjct: 393 APVSILFIIDRSASMSATFG--ISKFDMAKEAAILSLTTLQPGD------RVGVLAFDTE 444 Query: 227 IVQTFPLAW---GVQ--HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + T P GV +Q++I + G T L + + H Sbjct: 445 TIWTVPFRTVGEGVSLVELQDQIATMSLGGGTNIERALSVGLPALANEPYSTRHA----- 499 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--P 339 + LTDG + S N + L A+ + I + +++ + L AS Sbjct: 500 ------VLLTDGRSYSNNYPRYQQL--VETARAAQITLSTIAIGSDSDTELLNQLASWGN 551 Query: 340 DRFYSVQNSRKL 351 R+Y V ++ L Sbjct: 552 GRYYFVADATDL 563 >gi|294054129|ref|YP_003547787.1| hypothetical protein Caka_0592 [Coraliomargarita akajimensis DSM 45221] gi|293613462|gb|ADE53617.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 339 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 45/249 (18%), Positives = 81/249 (32%), Gaps = 29/249 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMD 190 +P S + + G+ + M++DVS SM+ G Sbjct: 49 IPVALRRLCLLLVIVAIARPQAG-TSYSLEVNEGIAIQMLVDVSSSMDMSVKNFDGKSTT 107 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 ++ VA + + GL+TF+ PL +G + + + L Sbjct: 108 RMEVAKEMVERFIAGDGEDLQGRPHDLIGLITFARYADTRSPLTFGHDALLQIVRHLTIQ 167 Query: 251 S-----TTKSTPGLEYAYNKIFDAKEKLEHI---AKGHDDYKKYIIFLTDGENSSPNIDN 302 T L A ++ + +E A+ K II LTDGEN+S + Sbjct: 168 ERPNEDGTAYGDALALAAARLKNPQELRHGKRPDAQAEAIESKVIILLTDGENNSGSHLP 227 Query: 303 KESLFYCNEAKRRGAIVYAIGVQ-----------AEAADQFLKNCA--SPDRFYSVQNSR 349 + AK +YAI + A++ L++ + + F + Sbjct: 228 ---IEAAGLAKAWDCKIYAISLGESLDAENPLDALSPAERVLEHISIETGGVFRQAHDFE 284 Query: 350 KLHDAFLRI 358 L + I Sbjct: 285 SLLSVYEEI 293 >gi|307565272|ref|ZP_07627765.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307345941|gb|EFN91285.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 318 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 57/243 (23%), Positives = 89/243 (36%), Gaps = 27/243 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M TF P ++ S+ G+D+M+ +D+S SM + ++L V Sbjct: 57 MFLRCITFSLVVIVLARPQSYSAWDNKDSE---GIDIMLAMDISASMLTNDVIP-NRLEV 112 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A D I P+ N GL F+ + PL + +N + K Sbjct: 113 AKEVAS---DFISGRPNDN----IGLTIFAGEAFTQCPLTTDHASLINLLNSVRTDLVVK 165 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 GL I K K +I LTDG N+ +I + AK+ Sbjct: 166 ---GLIQDGTAIGMGLINAVGRLKSSKAKSKVVILLTDGSNNVGSISP---MTAAEIAKK 219 Query: 315 RGAIVYAIGVQAEAADQF-------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK-EMVK 364 VY IG+ E + + L+ A + +FYS Q+ +L + I K E K Sbjct: 220 FNIRVYTIGLGTEQNNGYSDIDYTTLRQIANVTNGKFYSAQSQTELSQIYKDINKLEKTK 279 Query: 365 QRI 367 +I Sbjct: 280 LKI 282 >gi|187251530|ref|YP_001876012.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971690|gb|ACC98675.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 373 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 44/237 (18%), Positives = 82/237 (34%), Gaps = 40/237 (16%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGV 194 I + + P +V ++ G+D+++ +D S SM F P +++ Sbjct: 117 TLITLGLIFAVLALAKPRDAQKTVLPPTE---GVDIILAIDTSGSMAAQDFDP--NRITA 171 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---- 250 A + + R G+V F+S + PL + + + + + G Sbjct: 172 AKVAAANFIAN-------RLSDRIGIVVFASDAMLQSPLTLDYESLLDFLADVRIGMVRT 224 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + + + + + K II LTDGE++S I L Sbjct: 225 DGTAIGDAIAVSSVHLERSPAR-----------SKVIILLTDGESNSGVISP---LDAAK 270 Query: 311 EAKRRGAIVYAIG-VQAEAADQF------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 A G VY I + + D L+ A + ++Y N +L + I Sbjct: 271 TAALYGIKVYTIATISKNSRDSLDFKPDDLEQIAKLTGGKYYRAYNEAELTKIYAEI 327 >gi|302346571|ref|YP_003814869.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302150280|gb|ADK96541.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 318 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 72/202 (35%), Gaps = 23/202 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D+M+ +D+S SM +++ VA E + GL F+ + Sbjct: 87 GIDIMLTMDISASMLTEDVFP-NRMEVAKEVASEFISS-------RPSDNIGLTIFAGEA 138 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 P+ + ++ + T GL I K K + Sbjct: 139 FTQCPMTLDHAALLNLLHNVRTDLVTN---GLMQDGTAIGLGLANSVSRLKDSKAKSKVV 195 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------LKNCA--S 338 I LTDG N+ +I + AK+ G VY IG+ E + L++ A + Sbjct: 196 ILLTDGSNNVGSISP---MTAATIAKKFGIRVYTIGLGRETGEDIGAIDYKTLQDIAVLT 252 Query: 339 PDRFYSVQNSRKLHDAFLRIGK 360 FY Q+ +L + I K Sbjct: 253 NGEFYRAQSQAELSKIYQDIDK 274 >gi|149018699|gb|EDL77340.1| rCG25821 [Rattus norvegicus] Length = 1513 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 52/332 (15%), Positives = 104/332 (31%), Gaps = 27/332 (8%) Query: 40 HKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK-NDFSYRIIKNIWQTDFRNELRE 98 + F + + +D T + + + ++ + Sbjct: 676 NTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGARPNVRKFLI--LITDGEA 733 Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLS---AVSRYEMPFIFCTFPWCANSSHAPLLI 155 +D R + I + EM F F + +L Sbjct: 734 QDIVKDPAVALRKDGVIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQHIEDDLVLG 793 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S + K LD++ V+D S S++ M M+ ++K N Sbjct: 794 ICSPREECKRIEVLDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVKKADVGKNQ 844 Query: 216 VRSGLVTFSSKIVQTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 VR G + ++ F L ++ I N G T + L ++ + +A+ Sbjct: 845 VRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGS 904 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H + +I +TDGE + D ++ + +G +V A+G+ + Sbjct: 905 RLHKGVP-----QVLIVITDGE----SHDAEKLNATAKALRDKGILVLAVGIAGANTWEL 955 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 L S D++Y V+ L F + + Sbjct: 956 LAMAGSSDKYYFVETFGGLKGIFSDVSASVCN 987 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 70/212 (33%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 977 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKEFLASVVQDFDVSLNR- 1035 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G+ FS F L G + I +I + T L + Sbjct: 1036 --VRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTHIGDALRKVKHYFRPDT 1093 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG + E E + +G +Y++G+ Sbjct: 1094 GSRINAGTP-----QVLLVLTDGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQ 1142 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q ++ + ++ +V N +L RI + + Sbjct: 1143 QLIQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 74/197 (37%), Gaps = 26/197 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S + E++D+ P VR G V ++ Sbjct: 435 DIYLLIDGSG------NTQPTDFHEMKIFLSEVVDMFNIAPHK---VRVGAVQYADTWDL 485 Query: 230 TFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F ++ + + I+ + G T + L + + AK++ + H Sbjct: 486 EFEISKYTNKPDLGKAIDNIRQMGGNTNTGAALNFTLTLLQRAKKQRGNKVPCH------ 539 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSV 345 ++ LT+G + + ++ + V+AIGV+ EA L+ A + R Y V Sbjct: 540 LVVLTNGMSQDSVLGP------AHKLREENIRVHAIGVK-EANQTQLREIAGDEKRVYYV 592 Query: 346 QNSRKLHDAFLRIGKEM 362 L D ++ +E+ Sbjct: 593 HEFDALRDIRNQVVQEI 609 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 67/198 (33%), Gaps = 16/198 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + L + + + ++ N +R GLV +S++ L Sbjct: 226 VADVVFLLDMAINGSQENLDHLKAFLG---ESVSALDIKENCMRVGLVAYSNETRVISSL 282 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + GV + ++I L T A K + ++ + + + +T Sbjct: 283 SMGVNKTEVLQRIQDLSPHVGQAYTGA---ALRKTRKEVFSAQRGSRKNQGVPQIAVLVT 339 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ--NSR 349 + +R G V+ +GV+ +Q K + P ++ + N Sbjct: 340 ------HRASDDNVTKAAVNLRREGVTVFTMGVEGANPEQLEKIASYPAEQFTSKLSNFS 393 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L K++ Q Sbjct: 394 ELATHNQTFLKKLRNQIT 411 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 77/199 (38%), Gaps = 23/199 (11%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 DV ++ GM + I+++ + S+P N R L +S + F L Sbjct: 26 DVVFLVDSSDHLGMKSFPLVKTFIQKL---VSSLPVEANKYRVALAQYSDALHNEFHLGA 82 Query: 235 -WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 83 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 137 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + ++A + G + ++GVQ + + LK A+ ++++ +R Sbjct: 138 ------ASAESEDDVEEASKALREDGVKIISVGVQKASEED-LKAMATSQFHFNLRTARD 190 Query: 351 LHD---AFLRIGKEMVKQR 366 L +I K++ + R Sbjct: 191 LSMFAPNMTQIIKDVTQYR 209 >gi|189461337|ref|ZP_03010122.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] gi|189431866|gb|EDV00851.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] Length = 332 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 50/225 (22%), Positives = 80/225 (35%), Gaps = 55/225 (24%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A + E ++ GL F+ + Sbjct: 87 GIDIMLAVDVSTSMLAEDLKP--NRLEAAKQVAAEFIN-------GRPNDNIGLTVFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRL--------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 PL + N + + T GL A +++ D+K K Sbjct: 138 AFTQCPLTVDHGVLLNLFNSIKGDIAQRGMIEDGTAIGMGLANAISRLKDSKAK------ 191 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------- 328 K II LTDG N+ +I L AK+ G VY IGV Sbjct: 192 -----SKVIILLTDGSNNRGDISP---LTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYA 243 Query: 329 -----------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +Q L A + ++ ++ KL + + I K Sbjct: 244 GVQYVNVPVEIDEQTLTQIAGTTNGNYFRATSNSKLEEVYREIDK 288 >gi|332298719|ref|YP_004440641.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181822|gb|AEE17510.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 333 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 47/260 (18%), Positives = 84/260 (32%), Gaps = 46/260 (17%) Query: 138 IFCTFPWCANSSHAPLLI-TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 I C +CA + + K+ + +++ VLDVS SM GM +L A Sbjct: 61 ILCIAGYCAVVAALASPVVMRQEKVYTAKGS--EILFVLDVSPSMAAKDIAGMSRLEAAK 118 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 ++ I GLV +S+ P + ++N L G + Sbjct: 119 -------QAVRVIVPEAGGTAFGLVALASEAALMVPPTLDREAFFARLNSLQAGELGDGS 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 I + KK I+ +TDGEN++ ++ + A G Sbjct: 172 --------AIGMGVSTAAYHLISSAAPKKSIVLITDGENNAGSVHPGTA---AQLAFENG 220 Query: 317 AIVYAIGVQAEAADQF-----------------------LKNCA--SPDRFYSVQNSRKL 351 +Y +GV + L+ A + R++ V++ +L Sbjct: 221 ITLYVLGVGTRGSVPLEYVDPATGKTYSGYLDSRFDESPLQEIALTAGGRYFGVESMGEL 280 Query: 352 HDAFLRIGKEMVKQRILYNK 371 A + + Y K Sbjct: 281 TAAVSAVTGREQTVQSFYLK 300 >gi|312196190|ref|YP_004016251.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227526|gb|ADP80381.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 319 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 68/212 (32%), Gaps = 29/212 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM +L A + + +D + P N GLV+FS Sbjct: 89 IVLAIDVSNSMAATDISP-SRLAAAKQGAQAFVDQL---PPKIN---LGLVSFSGTAAVL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P ++ IN L G T G+ + I + + G I+ L Sbjct: 142 VPPTTDRDAVKAGINGLQLGPATAIGEGIYAGLSAINTVS--SQFVNSGQAVPPAAIVLL 199 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----ADQF---------LKNC 336 +DGE + +N AK V I Q L Sbjct: 200 SDGETTRGRPNN----QAAQAAKDAHIPVSTIAYGTPNGTLDVGGQLIPVPVNEPALSQI 255 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A + + + +L + +G + ++ Sbjct: 256 AEQTGGSHHRATSGDELTSIYKGLGSSIGYRK 287 >gi|167763116|ref|ZP_02435243.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] gi|167699456|gb|EDS16035.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] Length = 327 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 55/273 (20%), Positives = 91/273 (33%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PF T+ +S+ + G+D+M+ Sbjct: 35 ISDARVYAHAPKSYKNYLLHAPFALRIIALALIIIVLARPQTTDSWQNSEIE-GIDIMLA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 IDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNVGITLFAGESFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + I + G T G+ A ++ D+K K K II L Sbjct: 145 TVDHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKAK-----------SKVIILL 193 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------------ 332 TDG N+ +I L AK G VY IGV + Sbjct: 194 TDGTNNRGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEID 250 Query: 333 ---LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 251 EKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|288942396|ref|YP_003444636.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897768|gb|ADC63604.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 346 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 88/242 (36%), Gaps = 52/242 (21%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREM 202 + P +T ++S+ G D+M+ +D S SM G ++++ V + Sbjct: 80 VALMRPQWLTPYTEVST---PGYDLMIAVDASHSMEALDFTVEGRQVNRMAVVKGVMGRF 136 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGL 259 +D R GL+ F S+ PL ++ ++ ++ G T + Sbjct: 137 IDA-------RQGDRVGLILFGSQAFILSPLTLDRHAARQLLDGVVPSIAGPATALGDAI 189 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 +K+ + E + +I + DG+N++ + KE+ A+ G + Sbjct: 190 ALGVSKLRERPEG-----------SRVMIVIADGDNNAGSFAPKEA---ARLARATGTRI 235 Query: 320 YAIGVQ-------------AEAADQF------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 Y IGV D L+ A + ++ ++R L + RI Sbjct: 236 YVIGVGSKQPSIPILEEGSVRYRDDLTMDEGTLQEIADLTGGGYFRATDTRALEEISSRI 295 Query: 359 GK 360 G+ Sbjct: 296 GQ 297 >gi|163731887|ref|ZP_02139334.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] gi|161395341|gb|EDQ19663.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] Length = 468 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 63/464 (13%), Positives = 138/464 (29%), Gaps = 105/464 (22%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL----------- 57 F G+++I I++ ++ +V G+ ++ ++ ++ LD ++L Sbjct: 3 FKREEDGAMTIFATIMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAASDLDQPLPA 62 Query: 58 ----------YTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF------ 101 T+ LN G + + G Sbjct: 63 DEVVDDYFAKAGMTEFLNDVRITPGSDLPTTNFRIVQAEARTRTPSIYMAMTGVRTLPVY 122 Query: 102 -----AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL--- 153 + I IE S L I ++ + + + A ++ + Sbjct: 123 VSGTAEETIEKIEISLVLDISGSMRNNGKIGNLRTAAKDFIGAVLEGNAAKTTSLNIVPY 182 Query: 154 -LITSSVKISSKSDIGLDMMMVLDVSL--------SMNDHFGPGMDKLGVATRSIREMLD 204 T+ +I + GL ++ S ++ D G +D S ++ + Sbjct: 183 AGQTNPGRIVFERAGGLPFATFIEDSNGDEILYGQTIVDDEGNSIDVPYNTMSSCLDLTN 242 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV---------------QHIQEKINRLIF 249 D+ + F + + + WG +Q+ I+ + Sbjct: 243 SDFDNIDLPSGGYDQTPYFMNWPIDAPTMDWGWCPQNNSSIRYAQNDAGRLQDFIDDMRL 302 Query: 250 GSTTKSTPGLEYAYNKIFDAKEK----------------LEHIAKGHDDYKKYIIFLTDG 293 T + G++Y + + G D +K+I+ +TDG Sbjct: 303 HDGTGTQYGMKYGVALLNPSSRNTFLALNAAGLVPDGFKNRPADFGTTDTRKFIVLMTDG 362 Query: 294 ENSSP-----------------------------NIDNKESLFYCNEAKRRGAIVYAIGV 324 + + + + CN+AK G VY I Sbjct: 363 QITDQFRPEDKNDPKNDEIALNQRTGDRDTYSTQSTNVTNFYSVCNKAKAEGITVYTIAF 422 Query: 325 QAEAAD-QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +A A ++ CA+ F+ ++ AF I +++ + R+ Sbjct: 423 EAPADAVTQMRTCATSPAFFYKVEGVQIKTAFKSIARQINELRL 466 >gi|156742544|ref|YP_001432673.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233872|gb|ABU58655.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 562 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 49/263 (18%), Positives = 94/263 (35%), Gaps = 25/263 (9%) Query: 81 YRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC 140 Y I+ W TD + E ++ ++ +L + D + A E + Sbjct: 294 YAILNAPWVTDEQREAANIFLRYLLDRPQQELALRYGYRPSNTDVAVGAPITPE-NGVDP 352 Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 P P ++++ I ++ +D+M VLDVS SM D +L A ++R Sbjct: 353 QQPQTLLEVPRPDVLSAIRSIWEQNKKRVDVMAVLDVSGSMEDE-----GRLEQAKAALR 407 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTP 257 ++ ++ GL FS + P++ + +I L T+ Sbjct: 408 IFVEQLQDDDGF------GLTIFSDQATVLTPISPIGSRRTEVLNRIAGLTPRGGTRLLD 461 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RG 316 + AY ++ A + ++ LTDG ++ ++ L + + Sbjct: 462 TVVEAYQELT---------ATPPGQRIRAVVVLTDGLDNRSQRSAEDVLDLLRQDREGYS 512 Query: 317 AIVYAIGVQAEAADQFLKNCASP 339 V+ I +A LK AS Sbjct: 513 IKVFTIAFGGDADVHLLKEIASA 535 >gi|170751925|ref|YP_001758185.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] gi|170658447|gb|ACB27502.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] Length = 568 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 52/194 (26%), Gaps = 59/194 (30%) Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK-------EKLEHIAKGHDDYKK 285 L ++ IN + +T G + + + + + K Sbjct: 373 LTNNTNALKTLINNMAPSGSTNIHEGFMWGWRTLSPNSVFADGQPYASSANSSNATNINK 432 Query: 286 YIIFLTDGENSSPNIDNKES---------------------------------------- 305 II +TDG NS + + Sbjct: 433 IIILMTDGTNSWGTNSSAPTGSLYFAAGYFRNANGTTPNPRLTTAYQNTNIADGNTARKA 492 Query: 306 -----LFYCNEAKRRGAIVYAIGVQAE------AADQFLKNCAS-PDRFYSVQNSRKLHD 353 C K +Y IG A L+NCAS PD+FY +S L Sbjct: 493 LDALTAEACANTKAVNISIYTIGFSVPTDPIDSAGQTLLRNCASSPDQFYLANSSDDLIK 552 Query: 354 AFLRIGKEMVKQRI 367 AF I + R+ Sbjct: 553 AFKSIQASIGALRL 566 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 66/210 (31%), Gaps = 33/210 (15%) Query: 16 SISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA-TKILNQENGNNGKK 74 ++++L A L + ++ G I+ ++ KL D + L T + E N Sbjct: 26 NVALLFAFLSVPMVMIGGAAIDYGFATRLETKLQTATDATALLLCQTPLTTSEAELNTLA 85 Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 Q N+ S+ I + + S Sbjct: 86 QTTMTGAMGAANLVVDRLA---------------ITSSPRKITL----TAHKQSTTFFGG 126 Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + P + PL T ++ +VLD + SM G G KL Sbjct: 127 LTGTQRINPGAVSQCATPLPKT------------FEIALVLDNTGSMAASSG-GQSKLRA 173 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + ++ + + P ++ + +V F+ Sbjct: 174 VQTAATDFVNYVYTSPAFSSATKVAIVPFA 203 >gi|308270598|emb|CBX27210.1| hypothetical protein N47_A12390 [uncultured Desulfobacterium sp.] Length = 312 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 72/200 (36%), Gaps = 23/200 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + F+ C + P + I+ G+D+M+++DVS SM ++L Sbjct: 36 VKTAFLICALGIMLFALAGPRWGSHYQDIT---QKGVDIMVLVDVSPSMMVE-DIKPNRL 91 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 A R + + L+++ R GL+ FS PL IQ ++ L Sbjct: 92 ERARREVLDFLNVV-------QGDRIGLIAFSGVAFVQCPLTLDYGAIQMFLDELKP--- 141 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + A + A E K I+ +TDGE++ L +A Sbjct: 142 ----ELIPVAGTDLGAAIEAGISSFDFKSVTDKVIMLITDGEDNEGK-----GLIAAQKA 192 Query: 313 KRRGAIVYAIGVQAEAADQF 332 K +G ++ G+ + Sbjct: 193 KEKGVKIFVFGMGDPSGGPI 212 >gi|160882770|ref|ZP_02063773.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|237720676|ref|ZP_04551157.1| BatA [Bacteroides sp. 2_2_4] gi|260170239|ref|ZP_05756651.1| aerotolerance protein BatA [Bacteroides sp. D2] gi|293373990|ref|ZP_06620331.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145608|ref|ZP_07038676.1| BatA protein [Bacteroides sp. 3_1_23] gi|315918602|ref|ZP_07914842.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111794|gb|EDO13539.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|229449511|gb|EEO55302.1| BatA [Bacteroides sp. 2_2_4] gi|292631066|gb|EFF49703.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516099|gb|EFI39980.1| BatA protein [Bacteroides sp. 3_1_23] gi|313692477|gb|EFS29312.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 327 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 73/217 (33%), Gaps = 44/217 (20%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A D+ + G+ F+ + Sbjct: 87 GIDIMLAIDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL + + I+ + G T T + K K Sbjct: 138 TFTQCPLTVDHAVLLDMIHNIKCGLITDGT--------AVGMGIANAVTRLKDSKAKSKV 189 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------LKNCA-- 337 II LTDG N+ +I + AK G VY IGV + ++ + Sbjct: 190 IILLTDGTNNKGDISP---MTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMP 246 Query: 338 --------------SPDRFYSVQNSRKLHDAFLRIGK 360 + ++ ++ KL + + I K Sbjct: 247 VEIDEKTLTEIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|119383876|ref|YP_914932.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119373643|gb|ABL69236.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 282 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 70/204 (34%), Gaps = 35/204 (17%) Query: 167 IGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G D+++ +D+S SM G + +L R+ + R GLV Sbjct: 79 SGRDIVLAIDLSGSMQKEDFQLDGQPISRLDAVKRTASRFVAA-------RRGDRIGLVI 131 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + PL + V + ++ G + ++T I D + Sbjct: 132 FGDRAYFAQPLTFDVDAVARAVDEAQIGISGRAT--------AISDGLGLAMKRLAASEA 183 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----------ADQ 331 + ++ ++DG ++S N+ +++ A G ++ I + E + Sbjct: 184 PTRVVVLMSDGVDTSGNV---QAVDAARLAAGHGIRIHTIALGPEDLENQPRSRDAVDTK 240 Query: 332 FLKNCA--SPDRFYSVQNSRKLHD 353 L+ A S + V+ L Sbjct: 241 TLREVAELSGGTAFRVRGMADLEA 264 >gi|194679013|ref|XP_608702.4| PREDICTED: matrilin 4-like [Bos taurus] gi|297490994|ref|XP_002698558.1| PREDICTED: matrilin 4-like [Bos taurus] gi|296472628|gb|DAA14743.1| matrilin 4-like [Bos taurus] Length = 788 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 52/232 (22%), Positives = 87/232 (37%), Gaps = 30/232 (12%) Query: 135 MPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLD----MMM---VLDVSLSMNDHF 185 MP C + P L V +S + + MM +DV M+ Sbjct: 1 MPSFLLLEAICIVLFAGVPPSLPLQEVHVSKEIIGKISVASKMMWCSAAVDVLFLMDGSH 60 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEK 243 G + + D + P VR G V FSS FPL A+ Q ++ K Sbjct: 61 SVGKGSFERSKHFAITVCDALDINPRK---VRVGAVQFSSVPRLEFPLDAFSTQQGVKGK 117 Query: 244 INRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 I R++F T+++ L+Y K F + + +I +TDG + Sbjct: 118 IKRMVFKGGHTETSLALKYLLRKGFPGGR--------NASVPQVLIIITDGRSQGHVA-- 167 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 + K+RG V+++G+ ++ L + AS R V + ++ DA Sbjct: 168 ----LPAKQLKQRGITVFSVGIHFPRWEE-LHSLASEPREQHVLMAEEVDDA 214 >gi|212634223|ref|YP_002310748.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212555707|gb|ACJ28161.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 342 Score = 82.6 bits (202), Expect = 8e-14, Method: Composition-based stats. Identities = 45/268 (16%), Positives = 87/268 (32%), Gaps = 40/268 (14%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSS--HAPLLITSSVKISSKSDIGLDMMM 173 + Q R + + W + P+LI + G DMM+ Sbjct: 41 VAATGQAPTSGSQIKQRKRIQALALLLSWIVITLCIARPVLIGEPIVTQ---KAGRDMMI 97 Query: 174 VLDVSLSMND------HFGPGMD---KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D+S SM P D K + + ++ S + R GL+ F Sbjct: 98 AVDLSQSMEQKDYLLPSEEPSQDGQLKAASNVSRLVALKSLLSSFSQQRDGDRLGLIVFG 157 Query: 225 SKIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S P Q + E+++ + G T + + + Sbjct: 158 SGAYLQVPFTEDVRLWQTLLEQMDTQMAGPATAIGDAVGLSIRAFERSN----------- 206 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG---VQAEAADQF----LK 334 ++ ++ +TDG ++S +D ++ A G ++ +G V DQ L Sbjct: 207 TSQRILLLVTDGSDTSSRLDPVDA---ARVAAAEGIEIFTLGMGSVDTVGDDQVDFNTLN 263 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A + R + +S + + +I K Sbjct: 264 KIAKITNGRAFEGNSSTAIAEILAQIDK 291 >gi|218528586|ref|YP_002419402.1| hypothetical protein Mchl_0543 [Methylobacterium chloromethanicum CM4] gi|218520889|gb|ACK81474.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 518 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 50/263 (19%), Positives = 86/263 (32%), Gaps = 41/263 (15%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 T P A S + + + + S D M L S + + A Sbjct: 259 TAPSQATPSTLFVPFFAPDESDNDSRAVNDYMADLPSGGSAGGASNRQLQGM-TAKYDKN 317 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + D N L ++ +I L + I + T G Sbjct: 318 AFKVSTTARQDGTNY----LFGPNAGCEIQPLTRLTTSQTQLTNAIAAMTVIGDTNIPIG 373 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY---------- 308 L + ++ + + +A G KK+I+ +TDG+N S + +Y Sbjct: 374 LAWGWHLLSPNGPFKDGVAYGEIKTKKFIVLMTDGQNQSAVSSSDNRSYYSGLGFIWQNR 433 Query: 309 ----------------------CNEAKRRGAIVYAIGVQAEAAD-QFLKNCA-SPDRFYS 344 C+ ++ V+A+ V+ D LK CA SP+ F+ Sbjct: 434 IGTTSNDNAVRTKAIDTRLTLLCDNIRKARIQVFAVRVEVNDGDSAVLKACATSPNMFFD 493 Query: 345 VQNSRKLHDAFLRIGKEMVKQRI 367 V+NS L F I ++ + RI Sbjct: 494 VKNSSGLPAVFRAIADQISELRI 516 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 77/232 (33%), Gaps = 39/232 (16%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 + F + GS+ ++ A+ L + + G+ I+ + ++ + +D + L K+ Sbjct: 11 RLARFRHTESGSVLVIFALALVPMAFLAGMTIDYAQNTNLRQQAQVAVDATAL-ALAKLP 69 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 + + +K + +R NG + + Sbjct: 70 LDTTDKDLAAKAEAQVLTALKGLPIDALTVTMRHNGDL-----------IEVAAKGATPT 118 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 MP S IS++S L++ +VLD + SM Sbjct: 119 SLTRLAGFMSMPL-------------------SVSAISNRSMTNLEIALVLDNTGSMKG- 158 Query: 185 FGPGMDKLGVATRSIREMLDII--KSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 KL + R+++ + ++ P N ++ G+V FS + A Sbjct: 159 -----TKLTNLKAAARDLVTSLFQQADPAKPNALKIGVVPFSMTVNVGSGFA 205 >gi|291228410|ref|XP_002734180.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 945 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 76/212 (35%), Gaps = 25/212 (11%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 CAN H + +++V+D S SM + G + +A + Sbjct: 160 SLIDDCANYDHRFRPWYVEAATPEPKN----VVIVIDTSGSMA-NLHSGKSLINIAIDAA 214 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--------------AWGVQHIQEKIN 245 +LD + N + G++ FS ++ + +Q++++ + Sbjct: 215 ITVLDTM------NPNDKVGVIAFSDELKLPPKIGDASCYANELALATTINIQNLKQFVL 268 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 L+ T + A+N + ++ +G + + IIFLTDGE Sbjct: 269 SLVARGGTHYGKAFDAAFNLLKESYTLDADNERGKIERDQVIIFLTDGEPLDDKTSIMRK 328 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + NE + G+ ++ FL++ A Sbjct: 329 IRSNNEEMENKVTILTFGLGLDSGINFLEDIA 360 >gi|171913221|ref|ZP_02928691.1| hypothetical protein VspiD_18615 [Verrucomicrobium spinosum DSM 4136] Length = 868 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 52/341 (15%), Positives = 107/341 (31%), Gaps = 51/341 (14%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILN------------QENGNNGKKQKNDFSYRII 84 +T + + +A L ++ + + + N G+ + I Sbjct: 264 DTRNIYKYRAVLEGFAGDAIPANNEALTLVDVRGRLRLLYVEGDMNEGQYLVQAMAKEGI 323 Query: 85 KNIWQ--TDFRNELRE-NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM---PFI 138 + + N +E +GF I + + + +DY + M P Sbjct: 324 ELELRAPNSIPNTPQELSGFDGVILSDVPAHQVGETAMVAIRDYVDKLGGGFIMLGGPNS 383 Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 F + L + + + +V+D S SM+ KL +A + Sbjct: 384 FGVGGYYRTPIEEVLPVRLKAPDEEEKQSS-ALALVIDRSGSMSGE------KLEMAKSA 436 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKST 256 +++ G+ F S+ P+ + +I L G T Sbjct: 437 AIATAEVLTRNDS------IGVYAFDSEAHVVVPMTRLTSSSAVAGQIAGLTSGGGTNLH 490 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 P A N + K K++H +I LTDG+ S + ++ + G Sbjct: 491 PAFTEARNALQRTKAKIKH-----------MIILTDGQTSGQGYE-----ALASQCRAEG 534 Query: 317 AIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAF 355 + + + A L+ AS + Y+ ++ + F Sbjct: 535 VTISTVAIGDGAHVGLLQAIASLGGGKSYTTLDAANIVRIF 575 >gi|307354884|ref|YP_003895935.1| hypothetical protein Mpet_2754 [Methanoplanus petrolearius DSM 11571] gi|307158117|gb|ADN37497.1| conserved hypothetical protein [Methanoplanus petrolearius DSM 11571] Length = 316 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 42/249 (16%), Positives = 88/249 (35%), Gaps = 41/249 (16%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 A + + ++ G+++++V+D S SM ++L Sbjct: 57 KILVILILAAVGFIFIGLADPHIPLEQTKEGVNVVLVIDDSGSMQATDYSP-NRLEATKS 115 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKST 256 + E+++ + G+V F S L+ + E + N + T Sbjct: 116 AAEELINDLDPKD------YVGIVVFESGASTASYLSPDKDSVIENLENIMEKDGATAIG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 GL N + KK +I L+DG N++ I E++ + AK Sbjct: 170 DGLSLGINMADSIP-----------NRKKVVILLSDGVNNAGVISPDEAIQF---AKDSD 215 Query: 317 AIVYAIGVQ-----------------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 V+ IG+ AE + LK A + +++ + + L++ + Sbjct: 216 IQVFTIGMGSEQPVVMGYDWFGNPQYAELDEATLKEIADETGGKYFKSVDDQTLNEIYSN 275 Query: 358 IGKEMVKQR 366 I E+ +++ Sbjct: 276 INSEIKREK 284 >gi|324997883|ref|ZP_08118995.1| hypothetical protein PseP1_03919 [Pseudonocardia sp. P1] Length = 329 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 78/244 (31%), Gaps = 28/244 (11%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 A + + + ++ +M+V+DVSLSM TR Sbjct: 58 AIAMIAALAVLTVALAGPQAMAKVPRNRATVMLVIDVSLSMKATDVAP-------TRLAA 110 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 + + V GLV+F+ ++ ++ L +T + + Sbjct: 111 AQAAAKQFADQLTPGVNLGLVSFAGTAAVLVSPTTDRNAVKNGVDNLQLAESTATGEAIF 170 Query: 261 YAYNKI-FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID----NKESLFYCNEAKRR 315 A I ++ + ++ L+DG + P D + S ++A+RR Sbjct: 171 TAMQSIDTFSRSLQGGPDQQGTPPPARVVLLSDGTQTVPGPDGENEPRGSFTAASDAQRR 230 Query: 316 GAIVYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 G V I D ++ A S RF++ +L + + Sbjct: 231 GIPVSTISFGTSYGSIELDGGRTPVAVDDASMERIASLSGGRFFTAATESELRAVYSDLS 290 Query: 360 KEMV 363 +E+ Sbjct: 291 EELG 294 >gi|319949307|ref|ZP_08023384.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] gi|319437027|gb|EFV92070.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] Length = 326 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 69/211 (32%), Gaps = 28/211 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVSLSM +L A ++ + ++ V GLV+++ Sbjct: 91 VMLVVDVSLSMESTDVSP-SRLEAAQQAATTFAN------NLTPGVNLGLVSYAGTASML 143 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + ++RL T + + A I E L I+ L Sbjct: 144 VAPTTDRGPVVRAVDRLSLDERTATGEAIYTATQAITTFTESLGG---PDQAPPARIVLL 200 Query: 291 TDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQ--------------AEAADQFLK 334 +DG+ + P + + + A G V I D L+ Sbjct: 201 SDGKETVPADPTEERGAFTAAERAAEAGIPVSTISFGTLYGTVDIQGRPQPVPVDDASLR 260 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 A S F++ +L + + +++ Sbjct: 261 TIAELSGGDFFTASTLEELDSVYRTLEEQIG 291 >gi|332716075|ref|YP_004443541.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] gi|325062760|gb|ADY66450.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] Length = 429 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 65/422 (15%), Positives = 140/422 (33%), Gaps = 66/422 (15%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILD----------HS 55 ++NF+ G+ ILTA+L+ + G+ ++ + VKA L D + Sbjct: 1 MKNFWQEKSGNFGILTALLMVPLCGAAGVALDITRGMSVKADLQQAADSAALAAVADMSA 60 Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKN---IWQTDFRNELRENGFAQDINNIERST 112 + A K+ G ++ F + T + ++G + + +++ Sbjct: 61 SVQAAKKMSGDGVIPVGNEEARAFFDGNQRGDADYTITSVDVSVIKHGNVVESSVSFKAS 120 Query: 113 SLSIIIDDQHKDYNL---SAVSRYEMPFIFCTFPWCANSSHAPLLIT----SSVKISSKS 165 + + KD+ +A ++YE + N+ + T +++ ++ Sbjct: 121 VSTTLSGLLGKDFVSVAGTATAKYETETFSDFYLLLDNTPSMGVGATPTDVATLVANTGD 180 Query: 166 DIGLDMMMVLD-----VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 +V D S ++ V ++ ++D KS +N R + Sbjct: 181 KCAFACHIVKDGVADPNSYYFKAKKLGVTTRIDVVAKATASLMDTAKSTRKSSNQYRMAV 240 Query: 221 VTFSSKIV-----QTFPLAWGVQHIQEKINRLIF------GSTTKSTPGLEYAYNKIFDA 269 TF + + L + ++K + G + A +I D Sbjct: 241 YTFGERAEDTKLLEVVSLTSDLDAAKKKAGEINLMSIPYQGYNNDQQTDFDRALIQIGDK 300 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE------------SLFYCNEAKRRGA 317 A + K I F++DG S + + C + K +G Sbjct: 301 VGSSGTGA-SSANPDKVIFFVSDGVGDSYKPSSCTKKLTGGRCQEPIDIKDCTKLKEKGF 359 Query: 318 IV---YAIGVQAEAADQF--------------LKNCASPDRFYSVQNSRKLHDAFLRIGK 360 + Y + D + +++CASP ++ V S+ + DA + K Sbjct: 360 RIAVLYTTYLPLPTNDWYNSWIKPFQAEIGSRMQSCASPGLYFEVSPSQGISDAMTVLFK 419 Query: 361 EM 362 + Sbjct: 420 KA 421 >gi|296272313|ref|YP_003654944.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296096487|gb|ADG92437.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 301 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 45/213 (21%), Positives = 87/213 (40%), Gaps = 20/213 (9%) Query: 153 LLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + S VKI + G+++++ LD S SM + D + + ++++ Sbjct: 65 VALASPVKILNNQLLKKDGINIILDLDTSGSMRERGFNPND---LQQNRWNVVNNVVQDF 121 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + GLV F + ++ PL++ +E I + G + T +FD+ Sbjct: 122 IEKRVNDNIGLVVFGTSVLTASPLSFDKNSQKEIIKYIDIGIVGEQT--------AMFDS 173 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +I K II LTDGE+++ I + L AK+ +Y IG+ E+ Sbjct: 174 LATSINILKNSKAKSNIIILLTDGEDNASKIPPQIILKL---AKKYKIKIYTIGIG-ESN 229 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 Q L + + + + + L + + I K Sbjct: 230 RQMLSTISQETGAKSFLANSKDDLVEVYNTINK 262 >gi|220908581|ref|YP_002483892.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219865192|gb|ACL45531.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 421 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 79/227 (34%), Gaps = 28/227 (12%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 C P + + + + + S + + L++ ++LD S SM L R+ Sbjct: 13 CLSPTRVSQRQLEISVAAIAQASGERNAPLNLGLILDHSGSMAGQ------PLETVKRAA 66 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTP 257 ++++D + +P R ++ F P I+ +I+ L T Sbjct: 67 QKLVD--RLLPS----DRLAVIVFDHVAKVLIPNQPVTDRDKIKTRISHLAAMGGTAIDE 120 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GL+ ++ AK I LTDGEN N N L EA + Sbjct: 121 GLQLGLTELIAAKAGAISQ----------IFLLTDGENEHGN--NSRCLQLAEEAAKENI 168 Query: 318 IVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEM 362 + +G L+ A+ ++ + + F R+ ++ Sbjct: 169 TLNTLGFGYHWNQDVLEQIADAAGGSLMFIEYPQDVLIGFERLFNQI 215 >gi|330945007|gb|EGH46785.1| von Willebrand factor, type A [Pseudomonas syringae pv. pisi str. 1704B] Length = 258 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 77/217 (35%), Gaps = 39/217 (17%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 +DVS SM+ P M + + ++ + R GL+ F ++ PL Sbjct: 2 VDVSGSMD---YPDMQWKSDEVSRLVLVQQLLGDFLEGRKGDRVGLILFGTQAFVQAPLT 58 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + + ++ ++ G K+T + DA + + ++ +TDG Sbjct: 59 YDRRTVRVWLDEARIGIAGKNT--------ALGDAIGLALKRLRMRPATSRALVLVTDGA 110 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC------------------ 336 N++ ID + A G +Y IG+ ++ L++ Sbjct: 111 NNAGQIDP---ITAARLAAEEGVKIYPIGIGSDPDKDALQSVLGLNPSLDLDEPTLKEIA 167 Query: 337 -ASPDRFYSVQNSRKL------HDAFLRIGKEMVKQR 366 S +++ ++ +L DA + ++ + R Sbjct: 168 SLSGGQYFRARDGDQLEKIRATLDALEPVAQQPTQAR 204 >gi|317502942|ref|ZP_07961034.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] gi|315665941|gb|EFV05516.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] Length = 332 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 52/217 (23%), Positives = 75/217 (34%), Gaps = 39/217 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A E I P+ N GL F+ + Sbjct: 87 GIDIMLAMDVSTSMLAEDLKP--NRLEAAKNVASEF---ISDRPNDN----IGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ + IN L T + GL I K K Sbjct: 138 AFTQCPMTTDHASL---INMLRSVRTDIAARGLISDGTAIGMGLANAVSRLKDSKAKSKV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------------------ 328 +I LTDG N+ +I L AK G VY IGV Sbjct: 195 VILLTDGSNNMGDISP---LTSAQIAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQYVNIP 251 Query: 329 ---ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + LKN A + +Y ++ +L + I K Sbjct: 252 VEIDSKTLKNIAETTDGNYYRATSNNQLKQIYKDIDK 288 >gi|223939937|ref|ZP_03631805.1| von Willebrand factor type A [bacterium Ellin514] gi|223891428|gb|EEF57921.1| von Willebrand factor type A [bacterium Ellin514] Length = 342 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 67/200 (33%), Gaps = 32/200 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + P + ++S +G D+M +LD S SM +L + Sbjct: 66 LLVVAVAGIGIAMARPQW-GETTEVS--KALGEDVMFLLDCSKSMLAADVQP-SRLSRSK 121 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGST 252 +I + + + R GLV F+ + PL + ++ + + I Sbjct: 122 YAILDFVQQ-------HGRGRVGLVAFAGQAFLQCPLTFDYDAFRDALLAIDEQTIPVGG 174 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T L+ AY + K ++ +TDGE+ K + Sbjct: 175 TDIGRALDEAYRAMEKNDRH------------KILVLITDGEDLE-----KAGIKTAQAL 217 Query: 313 KRRGAIVYAIGVQAEAADQF 332 +G +VY IGV A Sbjct: 218 AEKGIVVYTIGVGTAAGSPI 237 >gi|307720884|ref|YP_003892024.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978977|gb|ADN09012.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 304 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 44/201 (21%), Positives = 84/201 (41%), Gaps = 17/201 (8%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + +D+++ LD S SM+ + G ++ + + +++++ + R G Sbjct: 75 KIKAVKSNAVDIVLALDTSDSMSTY---GFNEKKYKQSRLNVVKEVVQNFINSRVKDRIG 131 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 LV F + PL++ + + + + G KST + D + K Sbjct: 132 LVVFGTTAGIASPLSFDKEAQKNIVGNINVGVLGKST--------ALIDGIVSSIQLLKN 183 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 K II L+DGE+S+ I + +L AK+ +Y I + +D +K A Sbjct: 184 SKSKSKIIILLSDGEDSASKIPLEFALKL---AKKYNIKIYTITIDKSYSD-MMKVIANK 239 Query: 338 SPDRFYSVQNSRKLHDAFLRI 358 + + + VQN L + I Sbjct: 240 NGAKNFEVQNKEDLVKVYKSI 260 >gi|26344185|dbj|BAC35749.1| unnamed protein product [Mus musculus] Length = 1182 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 77/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 807 LDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L ++ + N G T + L ++ + +A+ H + Sbjct: 858 VLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----Q 912 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE++ D ++ + +G +V A+G+ + + L S D++Y V Sbjct: 913 VLIVITDGESN----DAEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAGSSDKYYFV 968 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 969 ETFGGLKGIFSDVSASVCN 987 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 72/185 (38%), Gaps = 25/185 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D+M ++D S S+ M M +++ + V+ G+V FS Sbjct: 617 DMKADIMFLVDSSGSIGPENFSKMKMF---------MKNLVSKSQIGADRVQIGVVQFSH 667 Query: 226 KIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + F L I I+R+ G TT + L + K + Sbjct: 668 ENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGARPN------- 720 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I +TDGE D +L ++ G I+Y++GV Q + P+ Sbjct: 721 VRKFLILITDGEAQDIVRDPAIAL------RKEGVIIYSVGVFGSNVTQLEEISGKPEMV 774 Query: 343 YSVQN 347 + V+N Sbjct: 775 FYVEN 779 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 69/212 (32%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 977 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKGFLVSVVQDFDVSLNR- 1035 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G+ FS F L G + I +I + T L Sbjct: 1036 --VRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQPDM 1093 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG + E E + +G +Y++G+ Sbjct: 1094 GSRINAGTP-----QVLLVLTDGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQ 1142 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ + ++ +V N +L RI + + Sbjct: 1143 ELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 96/312 (30%), Gaps = 43/312 (13%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N G Q T LR G IE + + H + Sbjct: 327 KNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFT 386 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVS 178 + F IT +V + S+ L D+ +++D S Sbjct: 387 SKLG---NFSELATHNQTFLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGS 443 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WG 236 S + E++ + P VR G V ++ F ++ Sbjct: 444 GS------TQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEISKYSN 494 Query: 237 VQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + I + G T + L + + AK++ +++ LT+G Sbjct: 495 KPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKER------GSKVPCHLVVLTNG-- 546 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDA 354 + L ++ + V+AIGV+ EA L+ A + R Y V DA Sbjct: 547 ----MSRDSVLGPAHKLREENIRVHAIGVK-EANQTQLREIAGEEKRVYYVHE----FDA 597 Query: 355 FLRIGKEMVKQR 366 I ++V++ Sbjct: 598 LRNIRNQVVQEI 609 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 66/198 (33%), Gaps = 16/198 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + L + + I ++ N +R GLVT+S++ L Sbjct: 226 VADVVFLLDMAINGSQEDLDHLKAFLG---ESISALDIKENCMRVGLVTYSNETRVISSL 282 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + G + ++I L T A K + ++ + + + +T Sbjct: 283 STGNNKTEVLQRIQDLSPQVGQAYTGA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVT 339 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ--NSR 349 +R G ++ +G++ D+ K + P ++ + N Sbjct: 340 ------HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFTSKLGNFS 393 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L K++ Q Sbjct: 394 ELATHNQTFLKKLRNQIT 411 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 77/199 (38%), Gaps = 21/199 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV ++ G+ + I +M I S+P N R L +S + F L Sbjct: 26 DVVFLVDSSDHLGLKSFPLVKTFIHKM---ISSLPIEANKYRVALAQYSDALHNEFQLGT 82 Query: 236 --GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 83 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 137 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + +A + G + ++GVQ +A+++ LK A+ ++++ +R Sbjct: 138 ------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHFNLRTARD 190 Query: 351 LHDAFLRIGKEMVKQRILY 369 L F E++K Y Sbjct: 191 L-SVFAPNMTEIIKDVTQY 208 >gi|160889563|ref|ZP_02070566.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|317480055|ref|ZP_07939167.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861080|gb|EDO54511.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|316903797|gb|EFV25639.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 327 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 54/272 (19%), Positives = 89/272 (32%), Gaps = 51/272 (18%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI-GLDMMMVL 175 I D + + + Y + F L + S+I G+D+M+ + Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFVLRIIALILIILVLARPQTTDSWQNSEIEGIDIMLAM 94 Query: 176 DVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 95 DVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGESFTQCPLT 145 Query: 235 WGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G T G+ A ++ D+K K K II LT Sbjct: 146 VDHAVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAK-----------SKVIILLT 194 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------------- 332 DG N+ +I L AK G VY IGV + Sbjct: 195 DGVNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDE 251 Query: 333 --LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 252 KTLTQIAGTTEGNYFRATSNSKLKEVYEEIDK 283 >gi|189501234|ref|YP_001960704.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496675|gb|ACE05223.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 331 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 48/239 (20%), Positives = 89/239 (37%), Gaps = 38/239 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + ++ P + + + S+ G+D++ LD+S SM + G +L A Sbjct: 67 LMRSAIVLAVVATGRPQITRAVTE---ASEKGIDIVFALDISESMLEEDFEG-SRLDAAK 122 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + L I+ P R GLV F K PL + + + ++ + + Sbjct: 123 KIA---LRFIRERPQ----DRFGLVLFRGKSFTLCPLTLDHRLLGMLVRQVSVDAISDKG 175 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A + ++K ++ ++ LTDGE++S + + A+ G Sbjct: 176 TAIGSAI--LVGTNRLRASVSK-----ERVLLLLTDGEHNSGEVGPVTASEI---AQSEG 225 Query: 317 AIVYAIGVQ------------AEAAD---QFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 +Y IGV+ AE Q L A + R++ + L DAF I Sbjct: 226 IRIYVIGVRNEEEAGSPESMDAEREGVDEQVLGTVAGMTGGRYFRASDENSLKDAFGEI 284 >gi|327190622|gb|EGE57710.1| hypothetical protein RHECNPAF_409007 [Rhizobium etli CNPAF512] Length = 427 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 54/414 (13%), Positives = 127/414 (30%), Gaps = 70/414 (16%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHY----------ILDHSLLY 58 F + G+ I+TA+L+ + G+ ++ +H ++ +L+ + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQT---DFRNELRENGFAQDINNIERSTSLS 115 A + + GK D + D ++ + + +T + Sbjct: 68 AAMAMNGNGTISLGKTDARDIFMSQVSGELAEVHVDLGIDVTKTANKLNSQVSFTATVPT 127 Query: 116 IIIDDQHKDYNL---SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM- 171 + +D +A + Y+ + N+ + T S ++ +G Sbjct: 128 TFMRIFGRDSITISGTATAEYQTAAFMDFYILLDNTPSMGVGATPSDVSKLEAKVGCAFA 187 Query: 172 MMVLDVS---LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D S ++ G M ++ V ++ + + D K+ ++ R G+ TF +K Sbjct: 188 CHQMDKSTNNYTIAKSLGVAM-RIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAE 246 Query: 229 -----QTFPLAWGVQHIQEKINR----------LIFGSTTKSTPGLEYAYNKIFDAKEKL 273 L + ++ + T + I L Sbjct: 247 DAKLTTISGLTSDLTKVKNYTDAVDLMTIPYQNYNSDQITNFDSAMTQMNTII-----DL 301 Query: 274 EHIAKGHDDYKKYIIFLTDG------------ENSSPNIDNKESLFYCNEAKRRGAIV-- 319 + +K + F++DG + + +C K RG + Sbjct: 302 AGDGTSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAV 361 Query: 320 -YAIGVQAEAADQF--------------LKNCASPDRFYSVQNSRKLHDAFLRI 358 Y + + + ++ CASP ++ V + + DA + Sbjct: 362 LYTTYLPLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTDGITDAMKAL 415 >gi|261408991|ref|YP_003245232.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285454|gb|ACX67425.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 1007 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 76/233 (32%), Gaps = 44/233 (18%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMM-------------VLDVSLSMNDHFGPG 188 + + K + + + M + V+D S SM+ + Sbjct: 367 IGFMMAGGEDSFGMGGYFKTPIEKALPVSMELEGKREIPSLGLILVIDRSGSMDGN---- 422 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINR 246 K+ +A S ++++++ V G+V F + P + + I Sbjct: 423 --KIELAKESAMRTVELMRAKDTV------GVVAFDDQPWWVVPPQKLGDKEEVLSSIQS 474 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + T P + A ++ + H II +TDG+ + N Sbjct: 475 IPSAGGTNIYPAVSSALEEMLKIDAQRRH-----------IILMTDGQ----SAMNSGYQ 519 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 + + ++ V +A L++ A+ R+Y V++ L F R Sbjct: 520 DLTDTMVENKITMSSVAVGMDADTNLLQSLADAAKGRYYFVEDETTLPAVFSR 572 >gi|152993961|ref|YP_001359682.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425822|dbj|BAF73325.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 325 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 44/268 (16%), Positives = 96/268 (35%), Gaps = 35/268 (13%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 I + ++ + + +S ++ E +F + ++ P LI V Sbjct: 34 RISFKEDIDAAMESRQEVSGISRATKGEKFLLFILYVLVLSALAKPNLIGEPVTKDVSQR 93 Query: 167 IGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++++ +D+S SM + G +D+L +R+ L + GL+ Sbjct: 94 ---ELLISVDLSGSMMTKDFVNKEGKAIDRLEAVKMVLRDFLKE-------RKGEKIGLI 143 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P + ++ ++ L G + + D+ + + + Sbjct: 144 LFGNAAFVQAPFTQDLDALEHLLDSLRVG--------MAGPQTAMGDSIGLAVKMFRESN 195 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLK 334 + +I ++DG+++ + K S A + G V+ IG+ LK Sbjct: 196 VTDRMLIVMSDGDDTGSKVPPKTS---AELAAKNGVNVFTIGIGDPKNAGEHPIDTDTLK 252 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A + +FY N L D + +I K Sbjct: 253 EIAAITGGKFYYAWNLDDLQDIYKQIDK 280 >gi|319902109|ref|YP_004161837.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417140|gb|ADV44251.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 327 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 56/273 (20%), Positives = 93/273 (34%), Gaps = 53/273 (19%) Query: 117 IIDDQHKDYNLSAVSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I D + + + Y + PFI T++ +S+ + G+D+M+ Sbjct: 35 ISDARVYAHTPKSYKNYLLHAPFILRIVALILMIIVLARPQTTNSWQNSEIE-GIDIMLA 93 Query: 175 LDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 94 MDVSTSMLAEDLKP--NRLEAAK-------DVATEFINGRPNDNIGITLFAGESFTQCPL 144 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + + G T GL A ++ D+K K K II L Sbjct: 145 TVDHAVLLNLLKDMKCGFIEDGTAIGMGLANAVTRLKDSKAK-----------SKVIILL 193 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------------ 332 TDG N+ +I L AK G VY IGV + Sbjct: 194 TDGVNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEID 250 Query: 333 ---LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 251 EKTLTQIAGTTEGNYFRATSNSKLKEVYEEIDK 283 >gi|83814223|ref|YP_446667.1| putative batB protein [Salinibacter ruber DSM 13855] gi|83755617|gb|ABC43730.1| putative batB protein [Salinibacter ruber DSM 13855] Length = 350 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 69/200 (34%), Gaps = 20/200 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + A + P T + + GLD+++ LDVS SM P +L A Sbjct: 52 LVVGALLLGAVALMGPRWGTEVRTVERR---GLDLVVALDVSASMRAQDVPP-SRLRRAK 107 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----T 252 IR ++D + R GLV F+ PL + ++ Sbjct: 108 NEIRTLVDDLSG-------DRVGLVLFAGSGFVQSPLTTDYGAFRLFLDAAAPDQISTPG 160 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + ++ + + A + + ++ ++DGEN + ++D A Sbjct: 161 TDVSAAVDAGLQAFGAPRPTDDTTAAPEEPRPRALLIVSDGENHAGDLDAAR-----QRA 215 Query: 313 KRRGAIVYAIGVQAEAADQF 332 + G + GV E + Sbjct: 216 EEAGVTLLTAGVGTEDGARI 235 >gi|301784735|ref|XP_002927783.1| PREDICTED: collagen alpha-6(VI) chain-like [Ailuropoda melanoleuca] Length = 2267 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 72/199 (36%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+D++K N VR G + ++ Sbjct: 807 LDVVFVIDSSGSIDHDEYSIMKDF---------MVDLVKKADVGKNQVRFGALKYADDPE 857 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I G T + L ++ + +A+ + Sbjct: 858 VLFYLGDLGSKWEVISVLQKDQPMGGNTYTAEALGFSDHMFTEARGSRLQKGVP-----Q 912 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 913 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIAGANPVELLAMAGSSDKYFFV 968 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 969 ETFGGLKGIFSDVSASVCN 987 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 70/212 (33%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 977 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFRKMKEFLASVVQDFDVSVNR- 1035 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAK 270 VR G FS FPL G + I +I N T L + Sbjct: 1036 --VRIGAAQFSHTYRPEFPLGTFVGKKEISFQIENIQQIFGYTHIGAALRQVGHYFRPDM 1093 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG++ E + + +G +Y++G+ Sbjct: 1094 GSRINAGTP-----QVLLVLTDGQSQD------EVARAAEDLRHKGVDIYSVGIGDVDDQ 1142 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q ++ + + +V N +L RI + + Sbjct: 1143 QLIQITGTAGKKLTVHNFDELTKVKKRIVRNI 1174 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 79/202 (39%), Gaps = 29/202 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S G++ ++ ++ + D V+ G+V FS Sbjct: 616 KEMKADIMFLVDSSGS------IGLENFIKMKTFMKNLVSKSQIGADR---VQIGVVQFS 666 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L I I+R+ G TT + L + AK + Sbjct: 667 DVNKEEFQLNRYMSQNEISNAIDRMTHIGETTLTGSALTFVSQYFSPAKGARPN------ 720 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++++I +TDGE D +L ++ G I+Y++GV Q + P+ Sbjct: 721 -VRRFLILITDGEAQDIVKDPAVAL------RQEGIIIYSVGVFGSNVTQLEEISGRPEM 773 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 + V+N D I ++V Sbjct: 774 VFYVEN----FDILQHIEDDLV 791 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 55/315 (17%), Positives = 96/315 (30%), Gaps = 39/315 (12%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 N N G Q T LR G IE ++ + Sbjct: 320 SARNGSRKNQGVPQIAVLVTHRPSEDNVTKAAVNLRREGVTIFTMGIEGASDSQLEKIAS 379 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DM 171 H + + F I +V + S+ L D+ Sbjct: 380 HPAEQHVSKLK---TFSDLAAHNQTFLKKLRNQIMHTVSVFSERTETLKSGCVDTEEADI 436 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +++D S S + E++ + P VR G V ++ F Sbjct: 437 YLLIDGSGS------TQATDFHEMKTFLSEVVGMFNIAPQK---VRVGAVQYADSWDLEF 487 Query: 232 PLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + + I + G T + L + + AK++ + H ++ Sbjct: 488 EINKYTNKHDLGKAIENIRQMGGNTNTGAALNFTLGLLQKAKKQRGNRVPCH------LV 541 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQN 347 LT+G + L N + VYAIGV+ EA L+ A + R Y V + Sbjct: 542 VLTNG------MSKDSILEPANRLREELIRVYAIGVK-EANQTQLREIAGEEKRVYYVHD 594 Query: 348 SRKLHDAFLRIGKEM 362 L D ++ +E+ Sbjct: 595 FDALKDIRNQVVQEI 609 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 67/193 (34%), Gaps = 25/193 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LDVS + + + + ++ + +R GLVT+S++ Sbjct: 228 DVVFLLDVS------VNGSQENFEYLKEFLE---ESVSALDIKEHCMRVGLVTYSNETKV 278 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ GV + + I L S T A KI + ++ + + Sbjct: 279 INSLSRGVNKSEVLQNIQNLSPWSGKAYTGA---AIRKIRKEVFSARNGSRKNQGVPQIA 335 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + +T P+ DN +R G ++ +G++ + Q K + P + Sbjct: 336 VLVT----HRPSEDN--VTKAAVNLRREGVTIFTMGIEGASDSQLEKIASHP-----AEQ 384 Query: 348 SRKLHDAFLRIGK 360 F + Sbjct: 385 HVSKLKTFSDLAA 397 >gi|281345782|gb|EFB21366.1| hypothetical protein PANDA_017603 [Ailuropoda melanoleuca] Length = 2245 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 72/199 (36%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+D++K N VR G + ++ Sbjct: 787 LDVVFVIDSSGSIDHDEYSIMKDF---------MVDLVKKADVGKNQVRFGALKYADDPE 837 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I G T + L ++ + +A+ + Sbjct: 838 VLFYLGDLGSKWEVISVLQKDQPMGGNTYTAEALGFSDHMFTEARGSRLQKGVP-----Q 892 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 893 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIAGANPVELLAMAGSSDKYFFV 948 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 949 ETFGGLKGIFSDVSASVCN 967 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 70/212 (33%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 957 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFRKMKEFLASVVQDFDVSVNR- 1015 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAK 270 VR G FS FPL G + I +I N T L + Sbjct: 1016 --VRIGAAQFSHTYRPEFPLGTFVGKKEISFQIENIQQIFGYTHIGAALRQVGHYFRPDM 1073 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG++ E + + +G +Y++G+ Sbjct: 1074 GSRINAGTP-----QVLLVLTDGQSQD------EVARAAEDLRHKGVDIYSVGIGDVDDQ 1122 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q ++ + + +V N +L RI + + Sbjct: 1123 QLIQITGTAGKKLTVHNFDELTKVKKRIVRNI 1154 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 79/202 (39%), Gaps = 29/202 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S G++ ++ ++ + D V+ G+V FS Sbjct: 596 KEMKADIMFLVDSSGS------IGLENFIKMKTFMKNLVSKSQIGADR---VQIGVVQFS 646 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L I I+R+ G TT + L + AK + Sbjct: 647 DVNKEEFQLNRYMSQNEISNAIDRMTHIGETTLTGSALTFVSQYFSPAKGARPN------ 700 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++++I +TDGE D +L ++ G I+Y++GV Q + P+ Sbjct: 701 -VRRFLILITDGEAQDIVKDPAVAL------RQEGIIIYSVGVFGSNVTQLEEISGRPEM 753 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 + V+N D I ++V Sbjct: 754 VFYVEN----FDILQHIEDDLV 771 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 55/315 (17%), Positives = 96/315 (30%), Gaps = 39/315 (12%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 N N G Q T LR G IE ++ + Sbjct: 300 SARNGSRKNQGVPQIAVLVTHRPSEDNVTKAAVNLRREGVTIFTMGIEGASDSQLEKIAS 359 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DM 171 H + + F I +V + S+ L D+ Sbjct: 360 HPAEQHVSKLK---TFSDLAAHNQTFLKKLRNQIMHTVSVFSERTETLKSGCVDTEEADI 416 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +++D S S + E++ + P VR G V ++ F Sbjct: 417 YLLIDGSGS------TQATDFHEMKTFLSEVVGMFNIAPQK---VRVGAVQYADSWDLEF 467 Query: 232 PLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + + I + G T + L + + AK++ + H ++ Sbjct: 468 EINKYTNKHDLGKAIENIRQMGGNTNTGAALNFTLGLLQKAKKQRGNRVPCH------LV 521 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQN 347 LT+G + L N + VYAIGV+ EA L+ A + R Y V + Sbjct: 522 VLTNG------MSKDSILEPANRLREELIRVYAIGVK-EANQTQLREIAGEEKRVYYVHD 574 Query: 348 SRKLHDAFLRIGKEM 362 L D ++ +E+ Sbjct: 575 FDALKDIRNQVVQEI 589 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 67/193 (34%), Gaps = 25/193 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LDVS + + + + ++ + +R GLVT+S++ Sbjct: 208 DVVFLLDVS------VNGSQENFEYLKEFLE---ESVSALDIKEHCMRVGLVTYSNETKV 258 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ GV + + I L S T A KI + ++ + + Sbjct: 259 INSLSRGVNKSEVLQNIQNLSPWSGKAYTGA---AIRKIRKEVFSARNGSRKNQGVPQIA 315 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + +T P+ DN +R G ++ +G++ + Q K + P + Sbjct: 316 VLVT----HRPSEDN--VTKAAVNLRREGVTIFTMGIEGASDSQLEKIASHP-----AEQ 364 Query: 348 SRKLHDAFLRIGK 360 F + Sbjct: 365 HVSKLKTFSDLAA 377 >gi|270296687|ref|ZP_06202886.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270272674|gb|EFA18537.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 327 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 54/272 (19%), Positives = 89/272 (32%), Gaps = 51/272 (18%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI-GLDMMMVL 175 I D + + + Y + F L + S+I G+D+M+ + Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFVLRIIALILIILVLARPQTTDSWQNSEIEGIDIMLAM 94 Query: 176 DVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 DVS SM + P ++L A D+ + G+ F+ + PL Sbjct: 95 DVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGESFTQCPLT 145 Query: 235 WGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G T G+ A ++ D+K K K II LT Sbjct: 146 VDHAVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAK-----------SKVIILLT 194 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------------------- 332 DG N+ +I L AK G VY IGV + Sbjct: 195 DGVNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDE 251 Query: 333 --LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 252 KTLTQIAGTTEGNYFRATSNSKLKEVYEEIDK 283 >gi|256821839|ref|YP_003145802.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795378|gb|ACV26034.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 958 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 89/235 (37%), Gaps = 20/235 (8%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 C +C + S + T+ V + + D+M+V+D S SM+ G G K+ A S Sbjct: 350 CNLHYCPSPSVLNDIATTPVPVPDQ-----DIMLVIDRSGSMSGDAGTGQSKIDEAKDSA 404 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 + ++++ R GLV+FS+ + + ++ S + Sbjct: 405 SLFVQLVEASAGH----RMGLVSFSTSASIDEGIGNLNPGKKNQLIGPAPYSGGAVGGLI 460 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + I D +K + G + K I+ LTDG ++P + + + + Sbjct: 461 PDGWTSIGDGIDKAQSELTGGANP-KTILLLTDGLQNTPPMIETATNDIGDT------RI 513 Query: 320 YAIGVQAEA--ADQFLK--NCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 +AIG+ EA L ++ + + +L F ++ + L + Sbjct: 514 HAIGLGTEANLNGGLLSDLTQSTGGAYTRAGDGLELKKFFALAFGDIFEDGTLID 568 >gi|329928982|ref|ZP_08282792.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937234|gb|EGG33661.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 899 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 76/233 (32%), Gaps = 44/233 (18%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMM-------------VLDVSLSMNDHFGPG 188 + + K + + + M + V+D S SM+ + Sbjct: 367 IGFMMAGGEDSFGMGGYFKTPIEKALPVSMELEGKREIPSLGLILVIDRSGSMDGN---- 422 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINR 246 K+ +A S ++++++ V G+V F + P + + I Sbjct: 423 --KIELAKESAMRTVELMRAKDTV------GVVAFDDQPWWVVPPQKLGDKEEVLSSIQS 474 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + T P + A ++ + H II +TDG+ + N Sbjct: 475 IPSAGGTNIYPAVSSALEEMLKIDAQRRH-----------IILMTDGQ----SAMNSGYQ 519 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 + + ++ V +A L++ A+ R+Y V++ L F R Sbjct: 520 DLTDTMVENKITMSSVAVGMDADTNLLQSLADAAKGRYYFVEDETTLPAVFSR 572 >gi|53713710|ref|YP_099702.1| hypothetical protein BF2419 [Bacteroides fragilis YCH46] gi|60681981|ref|YP_212125.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|253565658|ref|ZP_04843113.1| BatA [Bacteroides sp. 3_2_5] gi|265764034|ref|ZP_06092602.1| BatA [Bacteroides sp. 2_1_16] gi|4838138|gb|AAD30858.1|AF116251_1 BatA [Bacteroides fragilis] gi|52216575|dbj|BAD49168.1| conserved hypothetical protein BatA [Bacteroides fragilis YCH46] gi|60493415|emb|CAH08201.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|251945937|gb|EES86344.1| BatA [Bacteroides sp. 3_2_5] gi|263256642|gb|EEZ27988.1| BatA [Bacteroides sp. 2_1_16] gi|301163419|emb|CBW22970.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 327 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 54/280 (19%), Positives = 91/280 (32%), Gaps = 52/280 (18%) Query: 110 RSTSLSIII-DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI- 167 + T ++ I D + + + Y + F L + S+I Sbjct: 27 KKTEPTLQISDARVYAHAPKSYKNYLLHVPFGLRIITLILIILVLARPQTTNSWQNSEIE 86 Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A D+ + G+ F+ + Sbjct: 87 GIDIMLAIDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + + T G+ A ++ D+K K Sbjct: 138 SFTQCPLTVDHAVLLNLFQGIQCDIIEDGTAVGMGIANAVTRLKDSKAK----------- 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF----------- 332 K II LTDG N+ +I L AK G VY IGV + Sbjct: 187 SKVIILLTDGTNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPVRVGGTTQYI 243 Query: 333 ----------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 244 NTPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|162454786|ref|YP_001617153.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] gi|161165368|emb|CAN96673.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] Length = 381 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 49/272 (18%), Positives = 89/272 (32%), Gaps = 56/272 (20%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG------P 187 ++P + ++ + S D G+D+++ LD+S SM P Sbjct: 89 DLPGVLRAVALALLVLAMGRPVSVLREQRSD-DKGIDIVVALDLSGSMRAILDARASDLP 147 Query: 188 GMDKLGVATRSIREMLDIIKSIPD----VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 G KL R LD K + R G+V F P + + Sbjct: 148 GQPKLPRGKR--LTRLDTAKLVLQDFISRRRTDRLGVVVFGKAAYVLSPPTLDYHLLTQM 205 Query: 244 INRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 ++++ GS T L A ++ + D K +I LTDG++++ Sbjct: 206 VSQMTLNVIDGSATAIGDALGTAVARL-----------RRSDAQSKVVILLTDGDSNAGA 254 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-----------------------FLKNC 336 I + + A GA VY I + + + L+ Sbjct: 255 ISPEYATHL---ATSLGAKVYTIQIGTDDEVEVEDGIDLFGQPRYVRHRFPVNPALLQEI 311 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A + Y +++ L D+ + + K R Sbjct: 312 AQKTGGASYVATDAKALADSMHDVLDRLEKTR 343 >gi|116751034|ref|YP_847721.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116700098|gb|ABK19286.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 479 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 77/202 (38%), Gaps = 26/202 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + ++ LDM++V+D S SM D KL A +++ +L ++ R LV Sbjct: 85 NVEARRELDMVVVMDRSGSMADA-----GKLTHARQAVLNLL------SRLSETDRFALV 133 Query: 222 TFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++S + + L ++ + + G T GL+ +++ + ++ Sbjct: 134 SYSDHVQRHGGLLPITPANRATLERIVRGIQPGGATNLGGGLQEGISQLAELQQNGRLSR 193 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +I ++DG + D + A RG V +GV + + + + A Sbjct: 194 ---------LILISDGLANRGVTDPSALGTMASVAAERGYAVSTVGVGLDFNEHLMTSIA 244 Query: 338 --SPDRFYSVQNSRKLHDAFLR 357 + ++++ F + Sbjct: 245 DKGAGNYTFMESASAFAQVFDK 266 >gi|254458905|ref|ZP_05072328.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084176|gb|EDZ61465.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 309 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 84/232 (36%), Gaps = 27/232 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISS---KSDIGLDMMMVLDVSLSMN----DHFGPGMD 190 F L + S VK + G ++ ++LD S SM D P + Sbjct: 50 KLLFFLKWLGIIMLILALMSPVKDEPYTLEPKDGYEIALILDASQSMKAQGFDVTNPQLT 109 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 + V DI+ + GLV F + PL + + + +++L Sbjct: 110 RFDVVK-------DIVSNFIKERQNDNIGLVVFGAYSFIASPLTYDENILNKIVSQL--- 159 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 G+ Y +F + + ++ K + K I LTDG ++ P +D + Sbjct: 160 -----YIGMAGKYTALFTSLAQGVNLLKMSESKSKVGILLTDGFST-PEVDKIPFDVALD 213 Query: 311 EAKRRGAIVYAIGVQAE--AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 A + +Y IG+ + L+ A + + + ++ +L + + I Sbjct: 214 MAIKEKIKIYPIGIGMPHEYNIEVLRKIAEKTGGKAFGAASATELKEVYKEI 265 >gi|294508602|ref|YP_003572661.1| aerotolerance-related membrane protein [Salinibacter ruber M8] gi|294344931|emb|CBH25709.1| aerotolerance-related membrane protein [Salinibacter ruber M8] Length = 358 Score = 81.8 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 69/200 (34%), Gaps = 20/200 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + A + P T + + GLD+++ LDVS SM P +L A Sbjct: 60 LVVGALLLGAVALIGPRWGTEVRTVERR---GLDLVVALDVSASMRAQDVPP-SRLRRAK 115 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----T 252 IR ++D + R GLV F+ PL + ++ Sbjct: 116 NEIRTLVDDLSG-------DRVGLVLFAGSGFVQSPLTTDYGAFRLFLDAAAPDQISTPG 168 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + ++ + + A + + ++ ++DGEN + ++D A Sbjct: 169 TDVSAAVDAGLQAFGAPRPTDDTTAAPEEPRPRALLIVSDGENHAGDLDAAR-----QRA 223 Query: 313 KRRGAIVYAIGVQAEAADQF 332 + G + GV E + Sbjct: 224 EEAGVTLLTAGVGTEDGARI 243 >gi|188994392|ref|YP_001928644.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] gi|188594072|dbj|BAG33047.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] Length = 339 Score = 81.8 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 45/256 (17%), Positives = 84/256 (32%), Gaps = 47/256 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+ + P + S K + G++ M+ LD+S SM ++L A Sbjct: 61 FLLLAIVFLIGMLARPQI--SIRVDVPKKEKGIEAMICLDISNSMLCEDVKP-NRLSFAK 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + + ++ D + + GLV F+ P+ + ++ + + T Sbjct: 118 QVLGKLFDGL-------QNDKVGLVVFAGNAYTQIPITTDLSAAKQFLADISPNMVTAQG 170 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A E + + K II LTDGEN N ++ +A G Sbjct: 171 TAIGAAI-------ELASKSFSDNKEIGKTIIVLTDGENHEGN-----AIEAAQQAHEAG 218 Query: 317 AIVYAIGVQAEAAD-----------------------QFLKNCASPDR--FYSVQNSRKL 351 V IG+ + ++ AS F+S Q++ L Sbjct: 219 IRVNVIGLGTALGAPIPIEEGYLKDETGNPVVTKFDEKMCRDIASAGEGTFFSGQSASAL 278 Query: 352 HDAFLRIGKEMVKQRI 367 A ++ K + Sbjct: 279 VRAIESQLDKLPKAVL 294 >gi|320333536|ref|YP_004170247.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319754825|gb|ADV66582.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 509 Score = 81.8 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 48/261 (18%), Positives = 94/261 (36%), Gaps = 29/261 (11%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMP-FIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 + I+ D + N + + P + P+ ++S ++++ ++ + Sbjct: 263 APAVQARIMRDTLRRPVNTTVPLTRDFPDALLLELPFPRSASTLDAVVSTYLQ---DTRQ 319 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS-IPDVNNVVRSGLVTFSS- 225 + + VLDVS SM G +D L A R + + N R L+ FSS Sbjct: 320 PANTIFVLDVSGSMR---GARIDALKTALRGLSGADTTLTGRYATFANRERVTLIPFSSA 376 Query: 226 ----KIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P G ++ ++ +++ L T L+ AY + A Sbjct: 377 PGAPRTTELTPATRGAALKQLRAQVDALTPDGGTNIYGALQAAYEQARAAPAGRYTS--- 433 Query: 280 HDDYKKYIIFLTDGENS-SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 I+ +TDGE + P+ D + + + R + + A + + Sbjct: 434 -------IVLMTDGERTEGPSADQFRATYAALPERARQVKTFTVLFGDSDATEMNRIATL 486 Query: 338 SPDRFYSVQNSRKLHDAFLRI 358 + R + QN L AF I Sbjct: 487 TGGRTFDGQN--DLRAAFKDI 505 >gi|159896929|ref|YP_001543176.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889968|gb|ABX03048.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 579 Score = 81.8 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 83/206 (40%), Gaps = 22/206 (10%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S+ L++M+V+D S SM D G + LD I +P N GL+ Sbjct: 353 SRVQDPLNIMLVIDTSGSMGPSKEGLTD--GGLDAAKIAALDFIDHLPSNAN---VGLIH 407 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + + L + +++ I+ L T L +Y ++ AK + Sbjct: 408 FGTLVTVDHSLTNDIGAVRQSISELKPEGQTAIYDALAISYTQLRRAKGQT--------- 458 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SP 339 +I+ ++DG +++ DN +S+ +A + Y IG+ + Q L++ + Sbjct: 459 ---FIVLISDGADTASKGDNYDSIVA--KATKANIPTYIIGLTSPEFDGQLLEDLQRDTK 513 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 Y + +L + + +E+ Q Sbjct: 514 AMIYQTPSKEQLGGFYTEVAQEVSGQ 539 >gi|282858824|ref|ZP_06267969.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588393|gb|EFB93553.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 318 Score = 81.8 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 79/216 (36%), Gaps = 39/216 (18%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++K G+D+M+ +D+S SM + ++L VA + ++ GL Sbjct: 81 NNKETEGIDIMLAMDISASMLTNDVTP-NRLIVAKNVASDFIN-------GRPNDNIGLT 132 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFG--------STTKSTPGLEYAYNKIFDAKEKL 273 F+ + PL + +N + T GL A ++ + K Sbjct: 133 IFAGEAFTQCPLTIDHATLINLLNNVRADLVVKGLIQDGTAIGMGLANAVGRLKASNAK- 191 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF- 332 K +I LTDG N+ +I + AK+ VY IG+ E + + Sbjct: 192 ----------SKIVILLTDGSNNVGSISP---MTAATIAKKFNIRVYTIGLGTEQSGNYN 238 Query: 333 ------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 LK A + FY Q+ +L + I K Sbjct: 239 DIDYTTLKQIALTTNGEFYRAQSQTELLQIYNDINK 274 >gi|255009407|ref|ZP_05281533.1| aerotolerance-related membrane protein [Bacteroides fragilis 3_1_12] gi|313147166|ref|ZP_07809359.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] gi|313135933|gb|EFR53293.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] Length = 327 Score = 81.8 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 55/280 (19%), Positives = 92/280 (32%), Gaps = 52/280 (18%) Query: 110 RSTSLSIII-DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI- 167 + T ++ I D + + + Y + F L + S+I Sbjct: 27 KKTEPTLQISDARVYAHAPKSYRNYLLHVPFGLRIIALILIILVLARPQTTNSWQNSEIE 86 Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A D+ + G+ F+ + Sbjct: 87 GIDIMLAIDVSTSMLAEDLKP--NRLEAAK-------DVAAEFINGRPNDNIGITLFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + + I T G+ A ++ D+K K Sbjct: 138 SFTQCPLTVDHAVLLNLFQGIKCDIIEDGTAVGMGIANAVTRLKDSKAK----------- 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF----------- 332 K II LTDG N+ +I L AK G VY IGV + Sbjct: 187 SKVIILLTDGTNNKGDISP---LTAAEIAKSFGIRVYTIGVGTNGMAPYPVPVGGTVQYI 243 Query: 333 ----------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 244 NTPVEIDEKTLTQIAGITDGNYFRATSNSKLKEVYEEIDK 283 >gi|32475925|ref|NP_868919.1| hypothetical protein RB9502 [Rhodopirellula baltica SH 1] gi|32446468|emb|CAD76304.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 368 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 81/236 (34%), Gaps = 34/236 (14%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREML 203 + P + + + D+++++D+S SM + F K +++E+L Sbjct: 94 VLTAVARPQWLEPPITKEIPTR---DLLLLVDLSGSMAQEDFKNDAGKKVSRLDAVKEVL 150 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLE 260 D R GLV F P +Q QE + G T + Sbjct: 151 D---GFLAKRKGDRVGLVVFGDAAYLQAPFTTDLQLSQELLGECEVGMAGPRTAFGDAIG 207 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 N + E+ K II LTDG ++ + E+ A +R +Y Sbjct: 208 LGVNLFDEDTERA-----------KTIIALTDGNDTKSKVPPVEA---ARVATQRDIKIY 253 Query: 321 AIGVQAEA-------ADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + +Q LK+ AS +++ + L + + ++ Q I Sbjct: 254 TVAIGDPTTVGEDKLDEQSLKDVASETGGKYFFAADREHLAGIYDEL-DKIETQTI 308 >gi|319956032|ref|YP_004167295.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418436|gb|ADV45546.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 306 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 45/205 (21%), Positives = 73/205 (35%), Gaps = 32/205 (15%) Query: 165 SDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 G D+M+V+D S SMN D P K V + + +D R GL Sbjct: 79 KKKGRDIMLVIDSSDSMNQWGFDPGDPNKSKFDVVKEVVGDFID-------KRKNDRIGL 131 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F+S PL + +++ + G T L YN + + Sbjct: 132 INFASVAFVASPLTFEKDFLRKILQMQEPGIAGKRTAINDALLQTYNILSKS-------- 183 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKN 335 D K I LTDG +++ I E + +Y IG+ + + +LK Sbjct: 184 ---DAKSKIAILLTDGIDNASRISFDEIRRLIS---DSDIKLYTIGIGSYRDFDAPYLKA 237 Query: 336 CASPDR--FYSVQNSRKLHDAFLRI 358 A F++ + R L + I Sbjct: 238 LAQAGHGRFFAASDRRSLQKIYEAI 262 >gi|305663382|ref|YP_003859670.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377951|gb|ADM27790.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 323 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 77/229 (33%), Gaps = 29/229 (12%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + + + I SS S + +++++D S SM D+ L Sbjct: 71 LLAVALAGISIVEYVRIPIESSSISSLSFNARTPVVIIVDTSGSMADNMDSVKYALRTMV 130 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 +D GLV FS I P +I I+R+ G T + Sbjct: 131 SLFNNTID-------------IGLVEFSHSIKSAIPPTPNRSYIDMVIDRMEAGGGTMYS 177 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L A + + +E + +F+TDG D ++ +E + G Sbjct: 178 FALSTALSWLRPYRELNVSA---------FTVFITDGLPG----DPQDYRPLLDEYNKLG 224 Query: 317 AIVYAIGVQAEAAD---QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 +Y + + + L + ++V++ +L D I ++ Sbjct: 225 IPIYTVFIGEDPRGIDETKLIASKTGGEQFTVESIDRLSDTLNTIASKI 273 >gi|288925756|ref|ZP_06419687.1| BatA protein [Prevotella buccae D17] gi|315608294|ref|ZP_07883284.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] gi|288337411|gb|EFC75766.1| BatA protein [Prevotella buccae D17] gi|315250075|gb|EFU30074.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] Length = 332 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 46/230 (20%), Positives = 76/230 (33%), Gaps = 41/230 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P +++ A D+ GL F+ + Sbjct: 87 GIDIMLAMDVSASMLAEDLKP--NRIEAAK-------DVAAEFISGRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ + + + T+ GL I K + Sbjct: 138 AFTQCPMTTDHASLLTLLQDVRTDMATR---GLINDGTAIGMGLANAVSRLKDSKTKSRV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--------------------- 325 +I LTDG N++ +I L AK G VY I V Sbjct: 195 VILLTDGANNAGDISP---LTAAQMAKSLGIRVYTIAVGTSKVAPYPIEVGGRVQYISRP 251 Query: 326 AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK--EMVKQRILYNK 371 A+ + L+ A + FYS N+ +L + I + + Y K Sbjct: 252 ADIDTKTLREIAAVTEGNFYSANNTAQLKQIYHDIDQLEKTKMSVTRYAK 301 >gi|300871001|ref|YP_003785873.1| aerotolerance-like membrane protein [Brachyspira pilosicoli 95/1000] gi|300688701|gb|ADK31372.1| aerotolerance-related membrane protein [Brachyspira pilosicoli 95/1000] Length = 328 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 48/254 (18%), Positives = 92/254 (36%), Gaps = 52/254 (20%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKL 192 ++PFI + T + + G+ + MV+D+S SM + P +L Sbjct: 52 DIPFILLMLGLTFSIIGLARPATVDSSANINGE-GIYISMVVDISPSMMAEDMLP--TRL 108 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN--RLIFG 250 + +++ + + N + LV F+ + P + ++++I ++ Sbjct: 109 EASKKTMADFI-------KKRNFDKISLVAFALRASVLSPSTFDYTSLEKEIGNIKIDEE 161 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+ A + + K+ E K II LTDGEN+S ID K + Sbjct: 162 GSTSIGLGIATAVDMLRSVKDDAE----------KVIILLTDGENNSGEIDPKLASEI-- 209 Query: 311 EAKRRGAIVYAIGVQAEAAD------------------------QFLKNCAS--PDRFYS 344 A +Y IG+ A + L AS ++++ Sbjct: 210 -ASNFNIKIYTIGIGDAAGSHAWVTYTDPNYGKRRIRADFTLNEKALIEIASITGGKYFN 268 Query: 345 VQNSRKLHDAFLRI 358 + S L + + I Sbjct: 269 AKTSSALDNVYNTI 282 >gi|218961689|ref|YP_001741464.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] gi|167730346|emb|CAO81258.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] Length = 331 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 45/260 (17%), Positives = 88/260 (33%), Gaps = 52/260 (20%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 + +++ + P + ++ S G+D++ LDVS SM D Sbjct: 54 TGFKLFLSILALSCIIIALARPQWDYENKELQS---SGMDIIFALDVSKSM-DATDMMPS 109 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 +L A I L+ +K+ R G++ F+ PL + ++ +N L Sbjct: 110 RLLRAILQIGSFLEQVKT-------DRIGIIAFAGTATLQCPLTDDYEAVRIVLNGLNSN 162 Query: 251 S----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + T L A N + + K ++ ++DGE+ + +L Sbjct: 163 TVEIPGTDIGSALRLAENAFPEGSK------------SKTLVLISDGEDLQHS-----AL 205 Query: 307 FYCNEAKRRGAIVYAIGVQAEAA-----------------DQFLKNCA--SPDRFYSVQN 347 K +G VY +GV + + L+ A + +Y V Sbjct: 206 REARILKTKGIRVYTMGVGSPEGTIIRHPETGEEVKSKLDEATLQEIARITEGEYYRVTP 265 Query: 348 -SRKLHDAFLRIGKEMVKQR 366 ++ RI + +R Sbjct: 266 GGEEIQLILKRIYESESTRR 285 >gi|197118196|ref|YP_002138623.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087556|gb|ACH38827.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 331 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 51/250 (20%), Positives = 85/250 (34%), Gaps = 50/250 (20%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM----NDHFGPGMDKL 192 + P + ++ S+ G+D+++ LD+S SM G G ++L Sbjct: 58 LRLAVLALGIVALARPQAVARESQVQSR---GMDLVLALDLSTSMLAEEQGREGRGENRL 114 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 A R + E + R GLV F+ + PL Q +Q + RL S Sbjct: 115 AAAKRVLSEFIGA-------RKQDRIGLVAFAGRPYPAAPLTSDHQWLQGIVERLDTNSV 167 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + DA + + + +I +TDG N++ E A Sbjct: 168 EDGT--------ALGDAILAGVNRLRQRPAEGRALILITDGRNNAG----AEPQLAAQAA 215 Query: 313 KRRGAIVYAIGVQ----------------------AEAADQFLKNCA--SPDRFYSVQNS 348 K G V+AIG+ A+ LK A + R++ ++ Sbjct: 216 KALGIRVHAIGIGSRGSAVIPVPSPLGGTIYRRLDADLDAATLKGVAEITGGRYFEAGDA 275 Query: 349 RKLHDAFLRI 358 L F I Sbjct: 276 TVLSRVFAEI 285 >gi|77465284|ref|YP_354787.1| von Willebrand factor domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389702|gb|ABA80886.1| Von Willebrand domain containing protein [Rhodobacter sphaeroides 2.4.1] Length = 328 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 73/204 (35%), Gaps = 35/204 (17%) Query: 165 SDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S G ++++ LD+S SM D G +L R R ++ R GL Sbjct: 84 SASGREIVLTLDMSGSMLIEDFDIDGVQSTRLEAVKRVARSFVEE-------RQGDRIGL 136 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V F+++ PL + + + I G T +ST I D Sbjct: 137 VLFANRAYVAAPLTFDLAAVGRAIEEASIGITGRST--------AIADGLGLALKSVTES 188 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----------A 329 + I+ L+DG++++ ID ++ A R G ++ I + + Sbjct: 189 SAASRVIVLLSDGQDNAHQIDARQ---VAGLAARHGVRIHTIALGPDDLETRPAARDAVD 245 Query: 330 DQFLKNC--ASPDRFYSVQNSRKL 351 L+ AS R Y V+ L Sbjct: 246 TATLRAIAEASGGRSYRVRGMEDL 269 >gi|309790845|ref|ZP_07685389.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308227132|gb|EFO80816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 885 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 44/315 (13%), Positives = 105/315 (33%), Gaps = 34/315 (10%) Query: 47 KLHYILDHSLLYTA---TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 ++ +++ +L N + + + + + G ++ Sbjct: 280 RVQNNEAAAIIQAYGPRQILLVAANSADAQPLASALEAANFQTAIRVPAEMPNDLIGLSE 339 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS- 162 I +T + + +S + + + +++ + Sbjct: 340 YAAVIVVNTPARALPAGAMEALEISVRDLGRGLLMIGGEQSFGAGGYRDTPVEAALPVYM 399 Query: 163 ----SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + L ++ V+D S SM + G + KL +A ++ +++I + R Sbjct: 400 DVRDREQRPDLALVFVIDRSGSMAEPAGN-VQKLDIAKEAL------VQAIRMLYGEDRV 452 Query: 219 GLVTFSSKIVQTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 G+VTF S+ T P+ GV + + + I + T GL + Sbjct: 453 GIVTFDSQAYTTMPITQGVGEEEVLQAIASVTADGGTNIGAGLSAGQRMLT--------- 503 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK-N 335 G + K++I LTDG + + L + +G + + ++ A++ Sbjct: 504 --GVEAKIKHMILLTDGW-----GEGNDQLAVVEAMRAQGITLSVVAAGSDTAEELKTLA 556 Query: 336 CASPDRFYSVQNSRK 350 A R+Y+ + Sbjct: 557 TAGGGRYYAAAIMQA 571 >gi|156616288|ref|NP_766515.2| collagen alpha-6(VI) chain isoform 2 [Mus musculus] Length = 1182 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 807 LDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L ++ + N G T + L ++ + +A+ H + Sbjct: 858 VLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----Q 912 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D ++ + +G +V A+G+ + + L S D++Y V Sbjct: 913 VLIVITDGE----SHDAEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAGSSDKYYFV 968 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 969 ETFGGLKGIFSDVSASVCN 987 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 72/185 (38%), Gaps = 25/185 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D+M ++D S S+ M M +++ + V+ G+V FS Sbjct: 617 DMKADIMFLVDSSGSIGPENFSKMKMF---------MKNLVSKSQIGADRVQIGVVQFSH 667 Query: 226 KIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + F L I I+R+ G TT + L + K + Sbjct: 668 ENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGARPN------- 720 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I +TDGE D +L ++ G I+Y++GV Q + P+ Sbjct: 721 VRKFLILITDGEAQDIVRDPAIAL------RKEGVIIYSVGVFGSNVTQLEEISGKPEMV 774 Query: 343 YSVQN 347 + V+N Sbjct: 775 FYVEN 779 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 69/212 (32%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 977 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKGFLVSVVQDFDVSLNR- 1035 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G+ FS F L G + I +I + T L Sbjct: 1036 --VRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQPDM 1093 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG + E E + +G +Y++G+ Sbjct: 1094 GSRINAGTP-----QVLLVLTDGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQ 1142 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ + ++ +V N +L RI + + Sbjct: 1143 ELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 96/312 (30%), Gaps = 43/312 (13%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N G Q T LR G IE + + H + Sbjct: 327 KNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFT 386 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVS 178 + F IT +V + S+ L D+ +++D S Sbjct: 387 SKLG---NFSELATHNQTFLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGS 443 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WG 236 S + E++ + P VR G V ++ F ++ Sbjct: 444 GS------TQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEISKYSN 494 Query: 237 VQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + I + G T + L + + AK++ +++ LT+G Sbjct: 495 KPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKER------GSKVPCHLVVLTNG-- 546 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDA 354 + L ++ + V+AIGV+ EA L+ A + R Y V DA Sbjct: 547 ----MSRDSVLGPAHKLREENIRVHAIGVK-EANQTQLREIAGEEKRVYYVHE----FDA 597 Query: 355 FLRIGKEMVKQR 366 I ++V++ Sbjct: 598 LRNIRNQVVQEI 609 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 66/198 (33%), Gaps = 16/198 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + L + + I ++ N +R GLVT+S++ L Sbjct: 226 VADVVFLLDMAINGSQEDLDHLKAFLG---ESISALDIKENCMRVGLVTYSNETRVISSL 282 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + G + ++I L T A K + ++ + + + +T Sbjct: 283 STGNNKTEVLQRIQDLSPQVGQAYTGA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVT 339 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ--NSR 349 +R G ++ +G++ D+ K + P ++ + N Sbjct: 340 ------HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFTSKLGNFS 393 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L K++ Q Sbjct: 394 ELATHNQTFLKKLRNQIT 411 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 77/199 (38%), Gaps = 21/199 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV ++ G+ + I +M I S+P N R L +S + F L Sbjct: 26 DVVFLVDSSDHLGLKSFPLVKTFIHKM---ISSLPIEANKYRVALAQYSDALHNEFQLGT 82 Query: 236 --GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 83 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 137 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + +A + G + ++GVQ +A+++ LK A+ ++++ +R Sbjct: 138 ------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHFNLRTARD 190 Query: 351 LHDAFLRIGKEMVKQRILY 369 L F E++K Y Sbjct: 191 L-SVFAPNMTEIIKDVTQY 208 >gi|110639040|ref|YP_679249.1| BatA-like protein [Cytophaga hutchinsonii ATCC 33406] gi|110281721|gb|ABG59907.1| BatA-like protein, aerotolerance-related protein [Cytophaga hutchinsonii ATCC 33406] Length = 351 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 47/227 (20%), Positives = 88/227 (38%), Gaps = 46/227 (20%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S + +++ G++M+ +DVS SM + A + ++++ + R Sbjct: 101 SNETNTQYTEGINMIFAIDVSESMKIT-DIHPSRFDAAKQICTDIIN-------KRSNDR 152 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKL 273 G+V FS + V PL ++ ++N L S T L A N++ Sbjct: 153 IGIVIFSGEAVTLSPLTNDYVLLKNQLNDLKQNKDLQSGTAIGTALGTAINRL------- 205 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF- 332 K + ++ I+ ++DGEN+S +D + C E +Y IG+ + QF Sbjct: 206 ----KNAETKERIIVLISDGENTSGLMDPITAADLCLE---YNIKIYCIGLGKDGTHQFK 258 Query: 333 -----------------LKNCASP--DRFYSVQNSRKLHDAFLRIGK 360 LKN ++ +FY + + L D I + Sbjct: 259 DDNGTIQYVESKLDENTLKNISATTKGKFYRAYDKKSLDDVIANIDQ 305 >gi|148689164|gb|EDL21111.1| RIKEN cDNA E330026B02 [Mus musculus] Length = 1482 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 807 LDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L ++ I N G T + L ++ + +A+ H + Sbjct: 858 VLFYLDELGTKLEVISVLQNDHPMGGNTYTAEALAFSNHMFTEARGSRLHKGVP-----Q 912 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D ++ + +G +V A+G+ + + L S D++Y V Sbjct: 913 VLIVITDGE----SHDAEKLNATAKALRDKGILVLAVGIAGANSWELLAMAGSGDKYYFV 968 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 969 ETFGGLKGIFSDVSASVCN 987 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 77/201 (38%), Gaps = 29/201 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D+M ++D S S+ M M +++ + V+ G+V FS Sbjct: 617 DMKADIMFLVDSSGSIGPENFSKMKMF---------MKNLVSKSQIGADRVQIGVVQFSH 667 Query: 226 KIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + F L I I+R+ G TT + L + K + Sbjct: 668 ENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGARPN------- 720 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I +TDGE D +L ++ G I+Y++GV Q + P+ Sbjct: 721 VRKFLILITDGEAQDIVRDPALAL------RKEGVIIYSVGVFGSNVTQLEEISGKPEMV 774 Query: 343 YSVQNSRKLHDAFLRIGKEMV 363 + V+N D I ++V Sbjct: 775 FYVEN----FDILQHIEDDLV 791 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 69/212 (32%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 977 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKGFLVSVVQDFDVSLNR- 1035 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G+ FS F L G + I +I + T L Sbjct: 1036 --VRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQPDT 1093 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG + E E + +G +Y++G+ Sbjct: 1094 GSRINAGTP-----QVLLVLTDGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQ 1142 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ + ++ +V N +L RI + + Sbjct: 1143 ELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 97/312 (31%), Gaps = 43/312 (13%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N G Q T LR G IE + + H + Sbjct: 327 KNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFT 386 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVS 178 + F IT +V + S+ L D+ +++D S Sbjct: 387 SKLG---NFSELATHNQTFLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGS 443 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WG 236 S + E++ + P VR G V ++ F ++ Sbjct: 444 GS------TQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEISKYSN 494 Query: 237 VQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + I + G T + L + + AK++ +++ LT+G Sbjct: 495 KPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKER------GSKVPCHLVVLTNG-- 546 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDA 354 + L ++ + V+AIGV+ EA L+ A + R Y V + DA Sbjct: 547 ----MSRDSVLGPAHKLREENIRVHAIGVK-EANQMQLREIAGEEKRVYYVHD----FDA 597 Query: 355 FLRIGKEMVKQR 366 I ++V++ Sbjct: 598 LRNIRNQVVQEI 609 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 66/198 (33%), Gaps = 16/198 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + L + + I ++ N +R GLVT+S++ L Sbjct: 226 VADVVFLLDMAINGSQEDLDHLKAFLG---ESISALDIKENCMRVGLVTYSNETRVISSL 282 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + G + ++I L T A K + ++ + + + +T Sbjct: 283 STGNNKTEVLQRIQDLSPQVGQAYTGA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVT 339 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ--NSR 349 +R G ++ +G++ D+ K + P ++ + N Sbjct: 340 ------HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFTSKLGNFS 393 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L K++ Q Sbjct: 394 ELATHNQTFLKKLRNQIT 411 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 77/199 (38%), Gaps = 21/199 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV ++ G+ + I +M I S+P N R L +S + F L Sbjct: 26 DVVFLVDSSDHLGLKSFPLVKTFIHKM---ISSLPIEANKYRVALAQYSDALHNEFQLGT 82 Query: 236 --GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 83 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 137 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + +A + G + ++GVQ +A+++ LK A+ ++++ +R Sbjct: 138 ------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHFNLRTARD 190 Query: 351 LHDAFLRIGKEMVKQRILY 369 L F E++K Y Sbjct: 191 L-SVFAPNMTEIIKDVTQY 208 >gi|261415414|ref|YP_003249097.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371870|gb|ACX74615.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325633|gb|ADL24834.1| BatA protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 367 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 50/283 (17%), Positives = 84/283 (29%), Gaps = 74/283 (26%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN------ 182 V + + + C + P S+ G+D+M+ LDVS SM Sbjct: 60 IVPAFRLAALVCFV----VALARPQNAMEVEYTSTD---GVDIMLALDVSGSMGTLDMLT 112 Query: 183 DHFGPGMDKLGVAT----------RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 + + + D+I + R GL F ++ P Sbjct: 113 RTEQAKLGVMNAEKILKRGEYWKYSRLGYAQDVIAEFIGKRHSDRIGLSAFGARSFTQCP 172 Query: 233 LAWGVQHIQEKINR-------LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + E + + + T GL A ++ K D + Sbjct: 173 LTMDYGSLLEILKASDDLARDTLVNNRTAIGDGLMNALARL-----------KMSDAKSR 221 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--------------- 330 +I LTDG +++ + + AK G VY +GV ++ Sbjct: 222 VVILLTDGRDNASVVPPVRA---AEVAKSLGVKVYTVGVGKKSGKILAFQQNPWTGEISW 278 Query: 331 -------------QFLKNCAS--PDRFYSVQNSRKLHDAFLRI 358 LK AS RFY +N +L + I Sbjct: 279 GERDITPEEGIDEDVLKAIASKTGGRFYRAENKAELEKIYSEI 321 >gi|126341666|ref|XP_001379908.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2347 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 42/252 (16%), Positives = 93/252 (36%), Gaps = 21/252 (8%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 S+ + ++ + + EM F TF + + + + K LD++ Sbjct: 759 YSVGVYGANETQLVEISGKPEMIFYVETFDILKHIEDDLVFGICNPREECKQIEVLDIVF 818 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V+D S S++ + M + M+ ++K + VR G + +S F L Sbjct: 819 VIDSSGSIDYNEYNIMKEF---------MIKLVKKADVAKDRVRFGALKYSYDPTILFYL 869 Query: 234 A--WGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + N G T + L ++ + +A+ + + +I + Sbjct: 870 DEFDTRSKVISLLQNDSPKGGDTYTAKALAFSEHMFTEARGSRI-----NQKVPQVLIVI 924 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 TDGE + D + + +G ++ A+G+ ++ L S D+++ V+ Sbjct: 925 TDGE----SHDANQLNATAKALRDKGILILAVGIAGANTEELLAMAGSTDKYFFVETFGG 980 Query: 351 LHDAFLRIGKEM 362 L F + + Sbjct: 981 LKGIFQNVSDSI 992 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 72/198 (36%), Gaps = 28/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S S + E++++ P VR G V +S Sbjct: 442 DIYLLIDGSGS------IYPTDFQEMKAFLSEVIEMFTIAPYK---VRVGAVQYSHIQEL 492 Query: 230 TFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + + I+ + G T + L++ + AK + + H Sbjct: 493 EFEINKYSNKNDLGKAIDNIWQLGGNTNTGAALDFTLGLLQRAKTQRGNKVPCH------ 546 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG + + L + K VYAIGV+ Q L+ + R Y V Sbjct: 547 LIVLTDG------MSDDNVLEPAKKLKDENINVYAIGVKEANRTQLLEIAGTEKRVYYVY 600 Query: 347 NSRKLHDAFLRIGKEMVK 364 N L I ++V+ Sbjct: 601 NFDSL----KDIKNQVVQ 614 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 86/212 (40%), Gaps = 29/212 (13%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + + D+ D+M ++D S S G++ G ++ +++ + D Sbjct: 613 VQGICSKEACKDVKADVMFLVDSSRS------IGLENFGKMKTFMKNLVNKSQIGEDQ-- 664 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKE 271 V+ G+V FS + F L W I + I+R+ T + L++ + +K Sbjct: 665 -VQVGIVQFSDVNKEEFQLNRYWTQHEIFDAIDRMSNIDRETLTGSALKFVSDYFHPSKG 723 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 +K++I +TDGE+ P D +L ++ G I+Y++GV Q Sbjct: 724 ARPG-------VRKFLILITDGESQDPVKDPAMAL------RQDGVIIYSVGVYGANETQ 770 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 ++ P+ + V+ D I ++V Sbjct: 771 LVEISGKPEMIFYVET----FDILKHIEDDLV 798 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 74/229 (32%), Gaps = 24/229 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+ + I K+ K + D++ +LD S S + Sbjct: 974 FVETFGGLKGIFQNVSDSICGPSKVECKMEKA-DLVFLLDGSNS------IYPENFKKMK 1026 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKIN-RLIFGSTT 253 + ++D P V GL FS F L + +I T Sbjct: 1027 DFLVSVVDDFDIGPSR---VHIGLAQFSHVYRAEFFLGSFTSEGEVSTQIEMTQQVFGNT 1083 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 L+ ++ ++ ++ LTDG++ E + + Sbjct: 1084 HIGAALKQVEQYFRPEMGSRINVGI-----QQVLLVLTDGQSQD------EVAKAAEDLR 1132 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 R+G +Y++G+ Q ++ + D+ ++ N +L RI + + Sbjct: 1133 RKGIDIYSLGIGDVDEQQLIQISGTSDKKLTIDNFDELKKIKKRIVRNI 1181 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++++ + L + + + S N +R GLVT++ + L Sbjct: 233 VADVVFLLDEYVNGTQENLEHLKGFLE---ESVSSFDVKENCMRIGLVTYTDETKVIHSL 289 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + G + ++I +L + T A K+ ++ ++ + I + Sbjct: 290 STGTNKSEVLQEIQKLSPKAGRAYTGA---AMTKVRKEVFSVQKGSRRMQGVPQIAILVA 346 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 P+ DN + +R G V+ +G++ Q + + P Y Sbjct: 347 ----HRPSEDNVS--EAALDLRREGVTVFTVGLEGSDDTQLGQISSHPPEKY 392 >gi|126464748|ref|YP_001045861.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126106559|gb|ABN79089.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 328 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 73/204 (35%), Gaps = 35/204 (17%) Query: 165 SDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S G ++++ LD+S SM D G +L R R ++ R GL Sbjct: 84 SASGREIVLTLDMSGSMLIEDFDIDGVQSTRLEAVKRVARSFVEE-------RQGDRIGL 136 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V F+++ PL + + + I G T +ST I D Sbjct: 137 VLFANRAYVAAPLTFDLAAVGRAIEEASIGITGRST--------AIADGLGLALKRVTES 188 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----------A 329 + I+ L+DG++++ ID ++ A R G ++ I + + Sbjct: 189 AAASRVIVLLSDGQDNAHQIDARQ---VAGLAARHGVRIHTIALGPDDLETRPAARDAVD 245 Query: 330 DQFLKNC--ASPDRFYSVQNSRKL 351 L+ AS R Y V+ L Sbjct: 246 TATLRAIAEASGGRSYRVRGMEDL 269 >gi|329848392|ref|ZP_08263420.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328843455|gb|EGF93024.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 434 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 60/446 (13%), Positives = 133/446 (29%), Gaps = 102/446 (22%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 ++ FF + +G++ ++ + LPV+F+ +G ++ S +K +L D + + + Sbjct: 5 LKRFFRDTRGNVIMIIGLALPVVFLAIGGAVDFSRVMQLKKELQDAADVASVGSVAVNSY 64 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 N I+ ++ + D+NNI+ + + + Sbjct: 65 AYKANTKGHSSFKTGENQALAIFNSNVKKH-------NDLNNIKVKAKIKKQSTNLVSEI 117 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--- 182 ++A R + + + IT SS +D ++LD S SM Sbjct: 118 GVTADYRPYLLGLMGM---------NTMPITIKSTSSSTFPPYIDFYLLLDNSPSMGVGA 168 Query: 183 ---------------------------DHFGPGMDKLGVATR------SIREMLDIIKSI 209 + + KL V TR + + ++ K+ Sbjct: 169 TTKDIDTMVANTSDKCAFACHQMDKAGNDYYALAKKLKVTTRIDVVRQATQNLMTTAKNT 228 Query: 210 PDVNNVVRSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + + R + F + +I P + V + ++ S + Y Sbjct: 229 QTLTDQYRMAIYHFGMAADQIDSKNPAPYEVSALTTNLS--TSASNAAKIDLMTIPYQNY 286 Query: 267 FDAKEKL--------------EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF----- 307 ++ ++ + F++DG N + Sbjct: 287 NSDRQTNFPSYLLGMNKVIPSSGDGSSSSKPQQVLFFVSDGANDGYDCAYSNGASCRRIS 346 Query: 308 -----YCNEAKRRGAIV---YA--------------IGVQAEAADQF---LKNCASPDRF 342 C K RG + Y + Q ++ CA+ + Sbjct: 347 PLDTPQCKAMKARGVKIAVLYTTYLPLPTNAFYNSHLAKYVSPTSQLAAKMQECATEGLY 406 Query: 343 YSVQNSRKLHDAFLRI-GKEMVKQRI 367 + V + + +A + K + RI Sbjct: 407 FEVGPNEGISEAMNALFAKVISTVRI 432 >gi|34541235|ref|NP_905714.1| batB protein [Porphyromonas gingivalis W83] gi|34397551|gb|AAQ66613.1| batB protein [Porphyromonas gingivalis W83] Length = 339 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 45/256 (17%), Positives = 84/256 (32%), Gaps = 47/256 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+ + P + S K + G++ M+ LD+S SM ++L A Sbjct: 61 FLLLAIVFLIGMLARPQI--SIRVDVPKEEKGIEAMICLDISNSMLCEDVKP-NRLSFAK 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + + ++ D + + GLV F+ P+ + ++ + + T Sbjct: 118 QVLGKLFDGL-------QNDKVGLVVFAGNAYTQIPITTDLSAAKQFLADISPNMVTAQG 170 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A E + + K II LTDGEN N ++ +A G Sbjct: 171 TAIGAAI-------ELASKSFSDNKEIGKTIIVLTDGENHEGN-----AIEAAQQAHEAG 218 Query: 317 AIVYAIGVQAEAAD-----------------------QFLKNCASPDR--FYSVQNSRKL 351 V IG+ + ++ AS F+S Q++ L Sbjct: 219 IRVNVIGLGTALGAPIPIEEGYLKDETGNPVVTKFDEKMCRDIASAGEGTFFSGQSASAL 278 Query: 352 HDAFLRIGKEMVKQRI 367 A ++ K + Sbjct: 279 VRAIESQLDKLPKAVL 294 >gi|170739508|ref|YP_001768163.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193782|gb|ACA15729.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 342 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 35/259 (13%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 +++ + H ++ S + + + + P I + + D+++ Sbjct: 42 VALTGRNPHAGGLVAPRSLPRLATLVACWLLALAALMRPQWIEAPLHR---DQPTRDLLL 98 Query: 174 VLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 ++D+S SM+ D G +D+L + + L R+G+V F Sbjct: 99 LVDLSGSMDTKDFTDASGRTVDRLTAVKAVLDDFL-------SRRKGDRAGVVVFGDAPF 151 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P + +E + + G+ + DA + K II Sbjct: 152 ALVPFTTDLDLCREMLR--------DTVVGMAGPRTALGDAIGLGIALFDRSTVKAKTII 203 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE---AADQF----LKNCAS--P 339 LTDG +++ + E+ AK +G +++ + + D+ LK+ AS Sbjct: 204 ALTDGNDTASQVPPTEA---AGVAKDKGIVIHTVAIGDPSTVGEDKLDETALKDVASATG 260 Query: 340 DRFYSVQNSRKLHDAFLRI 358 F+ + +L + R+ Sbjct: 261 GGFFRALDRDELARIYGRL 279 >gi|117921993|ref|YP_871185.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117614325|gb|ABK49779.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 335 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 88/245 (35%), Gaps = 37/245 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + ++ ++ P ++ ++ ++ D G D++M++D+S SM++ A Sbjct: 67 MLILSWMLIVSALAKPSILG---EVQTREDFGRDVLMLVDLSGSMDEA------DFTTAD 117 Query: 197 RSIREMLDI----IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 S L+ +K+ + R GL+ F P + E+ + Sbjct: 118 GSTLTRLNAAKNVLKTFIAKRSGDRFGLILFGDAAFIQTPFTADQQVWLSLLEEAQTGMA 177 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + + E ++ +I LTDG ++ ++ ++ Sbjct: 178 GQSTHLGDAIGLGIKVFEQNPQPSE---------QQVMIVLTDGNDTGSFVEPVDA---A 225 Query: 310 NEAKRRGAIVYAIGVQ-------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A RG +Y I + + ++ + + R + + +L A+ I K Sbjct: 226 KIAAARGIKIYTIAMGDPTHVGEQPMDMEVVQRVSQLTQARAFIAIDQAELDKAYQLIDK 285 Query: 361 EMVKQ 365 +Q Sbjct: 286 LEPQQ 290 >gi|218261917|ref|ZP_03476585.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] gi|218223693|gb|EEC96343.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] Length = 328 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 50/276 (18%), Positives = 85/276 (30%), Gaps = 53/276 (19%) Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM 171 S + D + P S S++ G+D+ Sbjct: 35 VSSTEGFDAPGASSWKVWLRHVPFILRMAAVAVLIVILARPQSTNSWQNSSTE---GIDI 91 Query: 172 MMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +D+S SM P ++L + D+ + + GLV F+++ Sbjct: 92 VLAMDISTSMMAQDLKP--NRLEASK-------DVASAFINGRPNDNIGLVVFAAESFTQ 142 Query: 231 FPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + G T GL A ++I K K I Sbjct: 143 CPLTTDHTVLLNLFKDVQPGIIQDGTAIGLGLANAVSRI-----------KDSQAKSKVI 191 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--------------- 332 I LTDG N+ I + AK G VY IGV + + Sbjct: 192 ILLTDGVNNQGEIAP---VTAAEIAKTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDV 248 Query: 333 ------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 LK A + +++ ++ L + + I K Sbjct: 249 EIDEPTLKQIAATTGGQYFRATDNASLKEIYSEIDK 284 >gi|91977525|ref|YP_570184.1| hypothetical protein RPD_3057 [Rhodopseudomonas palustris BisB5] gi|91683981|gb|ABE40283.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 464 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 46/208 (22%), Positives = 75/208 (36%), Gaps = 32/208 (15%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNV----VRSGLVTFSSKIVQTFPLA-----WGVQHI 240 K + D K P ++ +R+ L+ +S Q FP+ Q I Sbjct: 257 SKWKGCVTDRDQPADTTKDAPTSDDTRFPALRT-LLGTTSCPAQIFPMTSAYAATDAQKI 315 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS---- 296 ++ I+ L+ T G+ +A+ + + Y II L+DG N+ Sbjct: 316 KDVIDDLVADGGTNQPIGMAWAWMSLQQGNPLNTPAKDPNYKYTDAIILLSDGLNTMDRW 375 Query: 297 ----------SPNIDNKESLFYCNEAK---RRGAI--VYAIGVQA--EAADQFLKNCASP 339 ID ++ L C+ K G VY I V + LK CA Sbjct: 376 PDYGDGQRQFDGKIDARQKLL-CDNIKLPDSNGKRPVVYTIQVNTTGDPESTILKYCADG 434 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F++ + + AF +IG + K RI Sbjct: 435 GNFFATTTASGIGTAFAQIGSSLSKLRI 462 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 37/255 (14%), Positives = 83/255 (32%), Gaps = 35/255 (13%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F + G+I+++ A+ L I +G I+ S + + LD + L + + + Sbjct: 16 RFVGDDGGNIAVIFALTLLPILGFIGAAIDYSRASRARTAMQAALDSTALMVSKDLGADK 75 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + +K + + + + +T+ + D + Sbjct: 76 IKTSEVSEKAQTYFNSLYT---------------GTEARGVTLTTNYTAKDDSGSSTVVV 120 Query: 128 SAVSRYEMPFIFCT-FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + F+ FP A S A + L + M LDV+ SM Sbjct: 121 NGDGAVSTHFMKMFGFPSLAIGSAATATWGGT---------RLRVAMALDVTGSM---VL 168 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP------LAWG-VQH 239 G KL ++ ++D +++ + + +V F+ + + W Sbjct: 169 NGSTKLAEMKKAASALVDTLRASAQSKDDLYISVVPFAQMVNVGSSNIDASWIKWDVWDE 228 Query: 240 IQEKINRLIFGSTTK 254 + ++ F + T Sbjct: 229 TEGSCSKSKFKTKTD 243 >gi|304382530|ref|ZP_07365025.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] gi|304336361|gb|EFM02602.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] Length = 332 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 49/222 (22%), Positives = 72/222 (32%), Gaps = 41/222 (18%) Query: 164 KSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S G+D+M+ +DVS SM + P +++ A D+ GL Sbjct: 83 NSVEGIDIMLAMDVSTSMLAEDLKP--NRMEAAK-------DVAAEFISGRPNDNIGLTI 133 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK-STPGLEYAYNKIFDAKEKLEHIAKGHD 281 F+ + P+ Q + L+ T S GL + K Sbjct: 134 FAGEAFTQCPMTTDHQSLLN----LLQNVRTDLSARGLIEDGTAVGMGLANAVSRLKDSK 189 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--------- 332 K +I LTDG N+ ++ S N AK G VY IGV + Sbjct: 190 AKSKVVILLTDGSNNRGDLSPMTS---ANIAKSLGIRVYTIGVGTNKVAPYPMPVAGGIQ 246 Query: 333 ------------LKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 L AS FY N+ +L + I K Sbjct: 247 YVNIPVEIDTKTLSGIASVTHGNFYRATNNNELKQIYKDIDK 288 >gi|297565996|ref|YP_003684968.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296850445|gb|ADH63460.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 717 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 90/240 (37%), Gaps = 36/240 (15%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P W +S + + + D G+ +++VLDVS SM + DKLG+A Sbjct: 277 PKGLFFGGWERSSLADAIPLE-----PLQEDGGVGLVLVLDVSGSMLEA-----DKLGLA 326 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---IVQTFPLA-WGVQHIQEKINRLIFGS 251 ++ I + G+V FSS + + P+ G + + + + G Sbjct: 327 VAG------SLELIRSARDQDYIGVVAFSSSARWVFRPRPMTPQGRREAESLLLSVRAGG 380 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T+ A + +G K ++ LTDG P + +L ++ Sbjct: 381 GTEIGEAYAEALQAL-----------RGLKTEDKQVLVLTDGLVQDPTL---PTLQAAHQ 426 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 A+ A+ + ++A FL+ A FY V + + L FL + ++ L Sbjct: 427 AQANKIRTNAVALGSDADRAFLRELAKQGGGTFYDVPSPKDLPRFFLEEAQRAFQREALV 486 >gi|299534564|ref|ZP_07047896.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] gi|298729937|gb|EFI70480.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] Length = 864 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 87/217 (40%), Gaps = 35/217 (16%) Query: 157 SSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 ++I K + L +++VLD S SM KL +A + +++++ Sbjct: 394 VEMEIKGKEQLPSLGLVIVLDRSGSMQGS------KLELAKEAAARSVEMLRDEDT---- 443 Query: 216 VRSGLVTFSSK---IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 G + F + I++T PL+ + + I + G T+ L AY + D K + Sbjct: 444 --LGFIAFDDRPWEIIETGPLS-SKEEAVDTILSVTPGGGTEIYSSLAKAYENLADLKLQ 500 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +H II LTDG++ + N ++ + E K G + + + +A Sbjct: 501 RKH-----------IILLTDGQSQAGNYEDLIT-----EGKEDGITLSTVAIGQDADANL 544 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L+ + RFY V + + + R + + I Sbjct: 545 LEALSDMGSGRFYDVIDEQTIPSILSRETAMISRTYI 581 >gi|260433775|ref|ZP_05787746.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] gi|260417603|gb|EEX10862.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] Length = 327 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 87/234 (37%), Gaps = 25/234 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + P + + I D+++ +D+S SM+ D G Sbjct: 65 LAILIWVLLVLAMARPERLGDPIFI---EKSARDVVLAVDISGSMDQRDFKAAD--GTPK 119 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + + + D++++ + R L+ F ++ P +Q + + + + Sbjct: 120 QRLEAVKDVLRAFIAARDGDRMALIIFGTRAFVQAPFTEDLQSLNGFLEQ--------TA 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+ + DA + + ++ +I L+DG ++S + + + A +G Sbjct: 172 VGMAGPNTALGDAIGLGIRTFESSEVDQRMMIVLSDGADTSSRMTP---VIAASIAADKG 228 Query: 317 AIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKE 361 ++Y IGV A LK+ A + +++ + L + + +I + Sbjct: 229 VVIYTIGVGDPDATGEDRVDLDALKDIANKTQGQYFFADDEAALTEVYRQIDAQ 282 >gi|226314609|ref|YP_002774505.1| hypothetical protein BBR47_50240 [Brevibacillus brevis NBRC 100599] gi|226097559|dbj|BAH46001.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 947 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 77/195 (39%), Gaps = 28/195 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-- 225 L + +V+D S SM+ G DK+ +A + I++ +N G++ F Sbjct: 405 SLGLQLVIDKSGSMSSD-ARGADKMALAREA------AIRATTMMNAQDYIGVIAFDDTP 457 Query: 226 -KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 +V + + IQ++I+R+ T P L+ Y ++ + +H Sbjct: 458 WDVVAPQSVT-KLDEIQQQISRIQADGGTDIFPALQLGYERVKAMNTQRKH--------- 507 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 +I LTDG+ S D+ E L + V + + ++ L+ A R+ Sbjct: 508 --VILLTDGQ--SALDDDYEGLL--QQMTAENITVSTVALGDDSDRGLLEMIAELGKGRY 561 Query: 343 YSVQNSRKLHDAFLR 357 Y ++ + F + Sbjct: 562 YFANDAESIPKIFSK 576 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 63/196 (32%), Gaps = 41/196 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM D + L A K D V+ G + Sbjct: 68 IVFVVDRSASMKDD-PRVLSFLREAVGQ--------KQAADKYAVIAIGA-----EAAVD 113 Query: 231 FPLAWGVQHIQEKINRLIFGST---TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 P+ I++++ L T G+ A I + Sbjct: 114 QPMT-----IRQEVQPLGVDVNRNATNLAEGIRLASAMIPTNARGK-------------V 155 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPDRFYSVQ 346 + LTDG +S + + L A+ RG V A+ +Q D+ L + P R Y+ + Sbjct: 156 VLLTDGLETSGDAARQTRL-----ARERGIAVEAVSLQQPNGDEVVLTSVQVPQRLYAGE 210 Query: 347 NSRKLHDAFLRIGKEM 362 D I E Sbjct: 211 EYGITVDVESTITTEA 226 >gi|6469599|gb|AAF13350.1|AF121336_1 unknown [Eufolliculina uhligi] Length = 494 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 81/213 (38%), Gaps = 28/213 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + + S + S S G+D++ V+DVS SM K+ + ++ M++ + Sbjct: 72 TINLESPAQTSEASRSGVDIVCVIDVSGSMQGE------KIQLVQTTLNFMVERLSPAD- 124 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R L++FS+ + L G + ++ I RL+ T GLEY + Sbjct: 125 -----RICLISFSNDATKISRLVQMSPKGKKQLKSMIPRLVASGGTNIVGGLEYGLQALR 179 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE-AKRRGAIVYAIGVQA 326 + + + II L+DG++++ + + + R V+ G Sbjct: 180 QRRTINQLSS---------IILLSDGQDNNGTTVLQRAKATMDSIVIRDDYSVHTFGYGH 230 Query: 327 EAADQFLKNCASPDR--FYSVQNSRKLHDAFLR 357 L A P FY V++ + AF Sbjct: 231 GHDSTLLNALAEPKNGAFYYVKDEETIATAFAN 263 >gi|149773091|emb|CAO01895.1| collagen type VI alpha 6 [Mus musculus] Length = 1162 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 787 LDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 837 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L ++ + N G T + L ++ + +A+ H + Sbjct: 838 VLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP-----Q 892 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D ++ + +G +V A+G+ + + L S D++Y V Sbjct: 893 VLIVITDGE----SHDAEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAGSSDKYYFV 948 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 949 ETFGGLKGIFSDVSASVCN 967 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 72/185 (38%), Gaps = 25/185 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D+M ++D S S+ M M +++ + V+ G+V FS Sbjct: 597 DMKADIMFLVDSSGSIGPENFSKMKMF---------MKNLVSKSQIGADRVQIGVVQFSH 647 Query: 226 KIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + F L I I+R+ G TT + L + K + Sbjct: 648 ENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGARPN------- 700 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I +TDGE D +L ++ G I+Y++GV Q + P+ Sbjct: 701 VRKFLILITDGEAQDIVRDPAIAL------RKEGVIIYSVGVFGSNVTQLEEISGKPEMV 754 Query: 343 YSVQN 347 + V+N Sbjct: 755 FYVEN 759 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 69/212 (32%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 957 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKGFLVSVVQDFDVSLNR- 1015 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G+ FS F L G + I +I + T L Sbjct: 1016 --VRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQPDM 1073 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG + E E + +G +Y++G+ Sbjct: 1074 GSRINAGTP-----QVLLVLTDGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQ 1122 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ + ++ +V N +L RI + + Sbjct: 1123 ELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1154 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 96/312 (30%), Gaps = 43/312 (13%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N G Q T LR G IE + + H + Sbjct: 307 KNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFT 366 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVS 178 + F IT +V + S+ L D+ +++D S Sbjct: 367 SKLG---NFSELATHNQTFLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGS 423 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WG 236 S + E++ + P VR G V ++ F ++ Sbjct: 424 GS------TQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEISKYSN 474 Query: 237 VQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + I + G T + L + + AK++ +++ LT+G Sbjct: 475 KPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKER------GSKVPCHLVVLTNG-- 526 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDA 354 + L ++ + V+AIGV+ EA L+ A + R Y V DA Sbjct: 527 ----MSRDSVLGPAHKLREENIRVHAIGVK-EANQTQLREIAGEEKRVYYVHE----FDA 577 Query: 355 FLRIGKEMVKQR 366 I ++V++ Sbjct: 578 LRNIRNQVVQEI 589 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 66/198 (33%), Gaps = 16/198 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + L + + I ++ N +R GLVT+S++ L Sbjct: 206 VADVVFLLDMAINGSQEDLDHLKAFLG---ESISALDIKENCMRVGLVTYSNETRVISSL 262 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + G + ++I L T A K + ++ + + + +T Sbjct: 263 STGNNKTEVLQRIQDLSPQVGQAYTGA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVT 319 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ--NSR 349 +R G ++ +G++ D+ K + P ++ + N Sbjct: 320 ------HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFTSKLGNFS 373 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L K++ Q Sbjct: 374 ELATHNQTFLKKLRNQIT 391 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 77/199 (38%), Gaps = 21/199 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV ++ G+ + I +M I S+P N R L +S + F L Sbjct: 6 DVVFLVDSSDHLGLKSFPLVKTFIHKM---ISSLPIEANKYRVALAQYSDALHNEFQLGT 62 Query: 236 --GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 63 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 117 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + +A + G + ++GVQ +A+++ LK A+ ++++ +R Sbjct: 118 ------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHFNLRTARD 170 Query: 351 LHDAFLRIGKEMVKQRILY 369 L F E++K Y Sbjct: 171 L-SVFAPNMTEIIKDVTQY 188 >gi|126465452|ref|YP_001040561.1| von Willebrand factor, type A [Staphylothermus marinus F1] gi|126014275|gb|ABN69653.1| von Willebrand factor, type A [Staphylothermus marinus F1] Length = 416 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 85/222 (38%), Gaps = 26/222 (11%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + SV+ + + ++V+D S SM+ K+ A ++ +LDI+ Sbjct: 23 IPFVLSVEGVYSAHPPIAFLIVIDTSYSMDGE------KIFRAKQAALRLLDIL------ 70 Query: 213 NNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + G+ F+ K + A +++ I L GS T Y+ + Sbjct: 71 RDKDYVGVYGFAGKFYKVLEPVPATNRNEVEKAIIGLKLGSGTNI-------YDTLKKLV 123 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E+ + + + IIF+TDGE ++ ++ L + + GA IGV E + Sbjct: 124 EETKKVLESGAISLVRIIFITDGEPTTGQKKPEKILEMAKKLREAGASALIIGVGTEYNE 183 Query: 331 QFLKNCASP-----DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + L A + + KL + + +E+ + + Sbjct: 184 KLLSRMAMVLNGEFEHVSDPASLEKLISEYAKSTQEISAKNV 225 >gi|39936212|ref|NP_948488.1| hypothetical protein RPA3149 [Rhodopseudomonas palustris CGA009] gi|39650067|emb|CAE28590.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 455 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 33/150 (22%), Positives = 57/150 (38%), Gaps = 19/150 (12%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 I+ KI+ L T G+ +A+ + + Y II L+DG N Sbjct: 305 NATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTGDPLNTPAKDSNYKYTDAIILLSDGLN 364 Query: 296 S-----------SPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEA--ADQFLKNCA 337 + SP +D ++ + C+ + ++Y I V + LK CA Sbjct: 365 TVDRWYGNGRDWSPQVDARQRIL-CDNIRASATNTNPVVIYTIQVNTDGDPESAVLKYCA 423 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F++ S + AF +IG + K R+ Sbjct: 424 DSGNFFATTTSSGIGTAFAQIGSSLSKLRV 453 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 74/226 (32%), Gaps = 28/226 (12%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F G+I+++ A+ L + +G+ ++ S + L LD + L + + Sbjct: 15 RRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAALMLSRDL--- 71 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 G Q + + +++ +E G + + + D Sbjct: 72 GVGTITPDQVSSKAQTYFNSLYTN------KETG--------AVTVTATYTAKDGSGSST 117 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 ++ + + F S ++ L + M LDV+ SM Sbjct: 118 IAMSGQGAVQTQFMKILGFQTM------AIGSSTTTTWGGTRLRVAMALDVTGSMA---- 167 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 K+ + + ++D +++ + V +V F+ + Sbjct: 168 -SAGKMSAMKTAAKNLVDSLRASAQTVDDVYISVVPFAQMVNVGSS 212 >gi|300786826|ref|YP_003767117.1| hypothetical protein AMED_4949 [Amycolatopsis mediterranei U32] gi|299796340|gb|ADJ46715.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 326 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 71/211 (33%), Gaps = 28/211 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVSLSM +L A + ++ + GL++F+ Sbjct: 91 VMLVIDVSLSMEATDVLP-TRLQAAQEAATSF------ARNMTPGINLGLISFAGTATVL 143 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + I L +T + G+ A + + I+ + Sbjct: 144 VNPTTDRNGVIKAIENLKLAQSTATGEGIFAALQSVESFSSLVGGADGP---PPARIVLM 200 Query: 291 TDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 +DG+ + P + AK+ G + +I D+ L+ Sbjct: 201 SDGKQTVPEDLYAARGGYTAAQAAKQAGVPISSISFGTTHGSVTIDDKPQPVSVDDESLR 260 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 A S FY ++ +L + +G+++ Sbjct: 261 EIARLSGGDFYKAASAEELKKVYADLGEQIG 291 >gi|192291928|ref|YP_001992533.1| hypothetical protein Rpal_3558 [Rhodopseudomonas palustris TIE-1] gi|192285677|gb|ACF02058.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 455 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 33/150 (22%), Positives = 57/150 (38%), Gaps = 19/150 (12%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 I+ KI+ L T G+ +A+ + + Y II L+DG N Sbjct: 305 NATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTGDPLNTPAKDSNYKYTDAIILLSDGLN 364 Query: 296 S-----------SPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEA--ADQFLKNCA 337 + SP +D ++ + C+ + ++Y I V + LK CA Sbjct: 365 TVDRWYGNGRDWSPQVDARQRIL-CDNIRASATNTNPVVIYTIQVNTDGDPESTVLKYCA 423 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F++ S + AF +IG + K R+ Sbjct: 424 DSGNFFATTTSSGIGTAFAQIGSSLSKLRV 453 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 74/226 (32%), Gaps = 28/226 (12%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F G+I+++ A+ L + +G+ ++ S + L LD + L + + Sbjct: 15 RRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAALMLSRDL--- 71 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 G Q + + +++ +E G + + + D Sbjct: 72 GVGTITPDQVSSKAQTYFNSLYTN------KETG--------AVTVTATYTAKDGSGSST 117 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 ++ + + F S ++ L + M LDV+ SM Sbjct: 118 IAMSGQGAVQTQFMKILGFQTM------AIGSSTTTTWGGTRLRVAMALDVTGSMA---- 167 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 K+ + + ++D +++ + V +V F+ + Sbjct: 168 -SAGKMSAMKTAAKNLVDSLRASAQTADDVYISVVPFAQMVNVGSS 212 >gi|325279871|ref|YP_004252413.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311680|gb|ADY32233.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 341 Score = 81.0 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 72/210 (34%), Gaps = 29/210 (13%) Query: 132 RYEMPFIFC------TFPWCANSSHAPLLITSS---VKISSKSDIGLDMMMVLDVSLSMN 182 MP + + + K+ G+++M+ LDVS SM Sbjct: 44 GPLMPLLSFKRGTWKFVMLMLALLFVIVGVAGPQFGSKLQQVKKKGVELMIALDVSNSMM 103 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 +L A +I M++ + + + GL+ F+ P+ + Sbjct: 104 AQ-DIKPSRLEKAKMAISRMVEKLSN-------DKIGLIVFAGDAYVQLPITTDYSSAKL 155 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 ++ + +T P A I A + + K II +TDGEN Sbjct: 156 FLSNI----STDIVPVQGTA---IGSAIDLAARSFTPETETSKAIIVITDGENHQ----- 203 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +++ +A +G +++ IG+ E Sbjct: 204 DDAVAAAKQAHEKGIVIHTIGMGLEQGAPI 233 >gi|87308177|ref|ZP_01090319.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] gi|87289259|gb|EAQ81151.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] Length = 1032 Score = 81.0 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 78/199 (39%), Gaps = 32/199 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +M+VLD S SM K+ + + + + + G++ F S+ + Sbjct: 458 ALMLVLDKSGSMQGE------KMQMTQGAALAAIRAMGAADFA------GVIGFDSQAQR 505 Query: 230 TFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 P+ ++ +L T TPG+ + + + D K++ Sbjct: 506 IVPIRKVDNPGMFVAQVRKLSASGGTNMTPGVALGFRDL-----------QNVDAGVKHM 554 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV 345 I L+DG+ N+ ++ K+ G V A+ V ++A + + A +FY+V Sbjct: 555 IVLSDGQTEPGNVA-----QIASDMKKMGMTVSAVAVGSDADQKLMATVARNGGGKFYAV 609 Query: 346 QNSRKLHDAFLRIGKEMVK 364 N + + F+R + + + Sbjct: 610 NNPKAIPRIFMREARRVAQ 628 >gi|307941972|ref|ZP_07657325.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307945282|ref|ZP_07660618.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307771155|gb|EFO30380.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307774878|gb|EFO34086.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 412 Score = 81.0 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 73/416 (17%), Positives = 148/416 (35%), Gaps = 85/416 (20%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 I+ N KGSI+I A+L +I + + I+ S + + + + D L+ T Sbjct: 8 RIQALKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSLFAVT--- 64 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 K + S + +TD R L + D + S ++I +D + K Sbjct: 65 -----TFRKYVADGMSKNQARKRAETDARKFLTARTKSLDGTTEKFSIKINI-VDREAKV 118 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-- 182 + + + S I S + + ++DVS SM Sbjct: 119 VKANVNISGKHE---SYMTHAMGFDNIDYTADSESTI-SFGQGKYEFIFLVDVSPSMGIG 174 Query: 183 --------------------DHFGPGMD---------KLGVATRSIREMLDIIKSIPDVN 213 + + + ++ V +++ ++ ++ +V+ Sbjct: 175 ASNRDRQIMQRAIGCQFACHEPWYSSVSRAKSAGARLRIDVVKDALKSLVTQLEEATEVD 234 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF------GSTTKSTPGLEYAYNKIF 267 +R+GL +FS+ + L G+ + + N++ G T ++ +F Sbjct: 235 --LRTGLYSFSNYLHIQTGLNKGISKFKREANKIAIHREYLRGGGTN--------FHGVF 284 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGE--------------NSSPNIDNKESLF---YCN 310 + K D K++II ++DG N +PN F +C+ Sbjct: 285 SDFNGVLRSLKPKADVKQHIIIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCD 344 Query: 311 EAKRRGA-IVYAIGVQAEAADQF------LKNCA-SPDRFYSVQNSRKLHDAFLRI 358 E K+ V+ + V+ + A ++ CA S D FYS ++ ++ AF + Sbjct: 345 EFKKGEVRTVHTMLVEPDRAHYVRASTSSMRACATSADFFYSANSAAEIDKAFKDL 400 >gi|260437096|ref|ZP_05790912.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] gi|292810406|gb|EFF69611.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] Length = 623 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 80/211 (37%), Gaps = 27/211 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSM-----NDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +K + ++I D ++++D S SM + + ++ I + S V Sbjct: 86 EIKYPNSNNIVFDTVILIDCSGSMRTNDPDFEYSVKNTLYPGSSYQITTCYRKLASKNYV 145 Query: 213 N---NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 N R+G+V F+S+ L + I+++ T ++ + + + Sbjct: 146 KAQGNDDRTGIVLFTSEANTVCELTNSEYVLMNAIDKIYSNGGTNFNNAIKESIRILTNT 205 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + E K I+ ++DGE+ + S + A + + + + Sbjct: 206 RNDSE----------KRILLVSDGES-------ELSSSVIDLAIENNIKINTVYIGGQNN 248 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 ++ LKN A + +++ + +L + + I Sbjct: 249 NELLKNVAERTGGKYFKAVTADELINIYSEI 279 >gi|219847249|ref|YP_002461682.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541508|gb|ACL23246.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 842 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 44/259 (16%), Positives = 89/259 (34%), Gaps = 34/259 (13%) Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 + + + E+ T+L ++ + K + F + + PLL+ Sbjct: 332 DVSANQLSFEQMTALREVVRSEGKGLTV---IGGNQSFTLGGYARTPLAEALPLLMEPP- 387 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +++++D S SM+ FG + K +A + L +++ R G Sbjct: 388 --PRPQRAPISLLLIIDRSASMSASFG--VSKFDLAKEAAILALTALQAGD------RIG 437 Query: 220 LVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ F + + P V +Q +I + G T L + + Sbjct: 438 VLAFDTDTIWVIPFQAVGEGAAVAELQTRIATMAIGGGTNIERALAVGLPALAAEPHSVR 497 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + LTDG + S N + L A+ + I + +A L+ Sbjct: 498 HA-----------VLLTDGRSYSNNYPRYQQL--VETARAAQITLSTIAIGTDADTDLLE 544 Query: 335 NCA--SPDRFYSVQNSRKL 351 A R+Y V ++ L Sbjct: 545 QLARWGNGRYYFVPDAADL 563 >gi|261880541|ref|ZP_06006968.1| BatA protein [Prevotella bergensis DSM 17361] gi|270332764|gb|EFA43550.1| BatA protein [Prevotella bergensis DSM 17361] Length = 332 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 53/262 (20%), Positives = 84/262 (32%), Gaps = 45/262 (17%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLG 193 M TF + P S + + G+D+M+ +DVS SM + P ++L Sbjct: 57 MLLRCLTFVLIVCALARPQTHNSWDNRTVE---GIDIMLAMDVSTSMLAEDLKP--NRLE 111 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 A D+ GL F+ + PL + +N + T Sbjct: 112 AAK-------DVASEFVSGRPSDNIGLTIFAGESFTQCPLTIDHGSLLNLLNNVR----T 160 Query: 254 KSTP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 GL + KG K +I LTDG N+ +I L A Sbjct: 161 DIAARGLIQDGTAVGMGLANAVSRLKGSKAKSKVVILLTDGSNNMGDISP---LTAAQIA 217 Query: 313 KRRGAIVYAIGVQ----------AEAADQFLK-------------NCASPDRFYSVQNSR 349 + G VY IGV Q++ + FY N+R Sbjct: 218 RSLGIRVYTIGVGTNKVAPYPMPVAGGVQYVNMPVEIDTKTLSDIAAITEGNFYRATNNR 277 Query: 350 KLHDAFLRIGKEMVKQRILYNK 371 +L + I ++ K ++ K Sbjct: 278 ELKQIYRDI-DKLEKTKMNVTK 298 >gi|296125842|ref|YP_003633094.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296017658|gb|ADG70895.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 328 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 52/251 (20%), Positives = 91/251 (36%), Gaps = 54/251 (21%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 I + P + I+ + G+ + +V+DVS SM + P +L + Sbjct: 57 LIILALLFSIIGLARPAKVDHLSDINGE---GIYISLVVDVSPSMMAEDMMP--TRLEAS 111 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN--RLIFGSTT 253 + M+D IK N + LV F+ + P + ++E+I ++ +T Sbjct: 112 KK---TMIDFIKK----RNFDKISLVAFALRASVLSPSTFDYTLLEEEIKNIKIDEEGST 164 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 G+ A + + K E K II LTDGEN+S ID K + A Sbjct: 165 SIGLGIATAVDMLRSVKGDNE----------KIIILLTDGENNSGEIDPKLASEI---AS 211 Query: 314 RRGAIVYAIGVQAEAAD------------------------QFLKNCAS--PDRFYSVQN 347 +Y IG+ + L + AS ++++ QN Sbjct: 212 NFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIRADFSLNEEALIDIASTTGGKYFNAQN 271 Query: 348 SRKLHDAFLRI 358 + L + + I Sbjct: 272 ASALDNVYNTI 282 >gi|150005795|ref|YP_001300539.1| hypothetical protein BVU_3288 [Bacteroides vulgatus ATCC 8482] gi|149934219|gb|ABR40917.1| conserved hypothetical protein BatA [Bacteroides vulgatus ATCC 8482] Length = 332 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 72/217 (33%), Gaps = 39/217 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A + E ++ GL F+ + Sbjct: 87 GIDIMLAVDVSTSMLAEDLKP--NRLEAAKQVASEFIN-------GRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL + N + + GL I K K Sbjct: 138 SFTQCPLTVDHGVLLNLFNSIK---GDIAQRGLIEDGTAIGMGIANAVTRLKDSKAKSKV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------------- 332 II LTDG N+ +I L AK+ G +Y IGV + Sbjct: 195 IILLTDGSNNRGDISP---LTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVP 251 Query: 333 -------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 252 VEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|332560892|ref|ZP_08415210.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] gi|332274690|gb|EGJ20006.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] Length = 341 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 73/204 (35%), Gaps = 35/204 (17%) Query: 165 SDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S G ++++ LD+S SM D G +L R R ++ R GL Sbjct: 97 SASGREIVLTLDMSGSMLIEDFDIDGVQSTRLEAVKRVARSFVEE-------RQGDRIGL 149 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V F+++ PL + + + I G T +ST I D Sbjct: 150 VLFANRAYVAAPLTFDLAAVGRAIEEASIGITGRST--------AIADGLGLALKRVTES 201 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----------A 329 + I+ L+DG++++ ID ++ A R G ++ I + + Sbjct: 202 GAASRVIVLLSDGQDNAHQIDARQ---VAGLAARHGVRIHTIALGPDDLETRPAARDAVD 258 Query: 330 DQFLKNC--ASPDRFYSVQNSRKL 351 L+ AS R Y V+ L Sbjct: 259 TATLRAIAEASGGRSYRVRGMEDL 282 >gi|223462569|gb|AAI50696.1| RIKEN cDNA E330026B02 gene [Mus musculus] Length = 1182 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 807 LDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 229 QTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L ++ I N G T + L ++ + +A+ H + Sbjct: 858 VLFYLDELGTKLEVISVLQNDHPMGGNTYTAEALAFSNHMFTEARGSRLHKGVP-----Q 912 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D ++ + +G +V A+G+ + + L S D++Y V Sbjct: 913 VLIVITDGE----SHDAEKLNATAKALRDKGILVLAVGIAGANSWELLAMAGSGDKYYFV 968 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 969 ETFGGLKGIFSDVSASVCN 987 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 72/185 (38%), Gaps = 25/185 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D+M ++D S S+ M M +++ + V+ G+V FS Sbjct: 617 DMKADIMFLVDSSGSIGPENFSKMKMF---------MKNLVSKSQIGADRVQIGVVQFSH 667 Query: 226 KIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + F L I I+R+ G TT + L + K + Sbjct: 668 ENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGARPN------- 720 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I +TDGE D +L ++ G I+Y++GV Q + P+ Sbjct: 721 VRKFLILITDGEAQDIVRDPALAL------RKEGVIIYSVGVFGSNVTQLEEISGKPEMV 774 Query: 343 YSVQN 347 + V+N Sbjct: 775 FYVEN 779 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 69/212 (32%), Gaps = 18/212 (8%) Query: 155 ITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I S V S + +D + +D+ M+ D + ++ + Sbjct: 977 IFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKGFLVSVVQDFDVSLNR- 1035 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 VR G+ FS F L G + I +I + T L Sbjct: 1036 --VRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQPDT 1093 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG + E E + +G +Y++G+ Sbjct: 1094 GSRINAGTP-----QVLLVLTDGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQ 1142 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ + ++ +V N +L RI + + Sbjct: 1143 ELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 97/312 (31%), Gaps = 43/312 (13%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N G Q T LR G IE + + H + Sbjct: 327 KNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFT 386 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVS 178 + F IT +V + S+ L D+ +++D S Sbjct: 387 SKLG---NFSELATHNQTFLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGS 443 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WG 236 S + E++ + P VR G V ++ F ++ Sbjct: 444 GS------TQPTDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADTWDLEFEISKYSN 494 Query: 237 VQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + I + G T + L + + AK++ +++ LT+G Sbjct: 495 KPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKER------GSKVPCHLVVLTNG-- 546 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDA 354 + L ++ + V+AIGV+ EA L+ A + R Y V + DA Sbjct: 547 ----MSRDSVLGPAHKLREENIRVHAIGVK-EANQMQLREIAGEEKRVYYVHD----FDA 597 Query: 355 FLRIGKEMVKQR 366 I ++V++ Sbjct: 598 LRNIRNQVVQEI 609 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 65/198 (32%), Gaps = 16/198 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + L + + I ++ N +R GLVT+S++ L Sbjct: 226 VADVVFLLDMAINGSQEDLDHLKAFLG---ESISALDIKENCMRVGLVTYSNETRVISSL 282 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + G + + I L T A K + ++ + + + +T Sbjct: 283 STGNNKTEVLQLIQDLSPQVGQAYTGA---ALRKTRKEIFSAQRGSRKNQGVPQIAVLVT 339 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ--NSR 349 +R G ++ +G++ D+ K + P ++ + N Sbjct: 340 ------HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFTSKLGNFS 393 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L K++ Q Sbjct: 394 ELATHNQTFLKKLRNQIT 411 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 77/199 (38%), Gaps = 21/199 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV ++ G+ + I +M I S+P N R L +S + F L Sbjct: 26 DVVFLVDSSDHLGLKSFPLVKTFIHKM---ISSLPIEANKYRVALAQYSDALHNEFQLGT 82 Query: 236 --GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 83 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 137 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + +A + G + ++GVQ +A+++ LK A+ ++++ +R Sbjct: 138 ------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHFNLRTARD 190 Query: 351 LHDAFLRIGKEMVKQRILY 369 L F E++K Y Sbjct: 191 L-SVFAPNMTEIIKDVTQY 208 >gi|254882023|ref|ZP_05254733.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294776174|ref|ZP_06741663.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|319640969|ref|ZP_07995677.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] gi|254834816|gb|EET15125.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294449997|gb|EFG18508.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|317387414|gb|EFV68285.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] Length = 332 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 72/217 (33%), Gaps = 39/217 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A + E ++ GL F+ + Sbjct: 87 GIDIMLAVDVSTSMLAEDLKP--NRLEAAKQVASEFIN-------GRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL + N + + GL I K K Sbjct: 138 SFTQCPLTVDHGVLLNLFNSIK---GDIAQRGLIEDGTAIGMGIANAVTRLKDSKAKSKV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------------- 332 II LTDG N+ +I L AK+ G +Y IGV + Sbjct: 195 IILLTDGSNNRGDISP---LTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVP 251 Query: 333 -------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 252 VEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|154492261|ref|ZP_02031887.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] gi|154087486|gb|EDN86531.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] Length = 328 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 50/276 (18%), Positives = 85/276 (30%), Gaps = 53/276 (19%) Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM 171 S + D + P S S++ G+D+ Sbjct: 35 VSSTEGFDAPGASSWKVWLRHIPFILRMAAVAVLIVILARPQSTNSWQNSSTE---GIDI 91 Query: 172 MMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +D+S SM P ++L + D+ + + GLV F+++ Sbjct: 92 VLAMDISTSMMAQDLKP--NRLEASK-------DVASAFINGRPNDNIGLVVFAAESFTQ 142 Query: 231 FPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + G T GL A ++I K K I Sbjct: 143 CPLTTDHTVLLNLFKDVQPGIIQDGTAIGLGLANAVSRI-----------KDSQAKSKVI 191 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--------------- 332 I LTDG N+ I + AK G VY IGV + + Sbjct: 192 ILLTDGVNNQGEIAP---VTAAEIAKTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDV 248 Query: 333 ------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 LK A + +++ ++ L + + I K Sbjct: 249 EIDEPTLKQIAATTGGQYFRATDNASLKEIYSEIDK 284 >gi|332232509|ref|XP_003265447.1| PREDICTED: collagen alpha-6(VI) chain [Nomascus leucogenys] Length = 2264 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 75/199 (37%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 809 LDVVFVIDSSGSIDYDEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 859 Query: 229 QTFPL-AWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +G + I N G T + L ++ + +A+ + + Sbjct: 860 VLFYLEDFGTKLEVISVLQNDQAMGGNTYTAEALGFSDHMFTEARGSRLNKGVP-----Q 914 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 915 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIAGANPAELLAMAGSSDKYFFV 970 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 971 ETFGGLKGIFSDVTASVCN 989 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 73/211 (34%), Gaps = 20/211 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+SV SSK D +D +D+ M+ + + ++ + Sbjct: 983 VTASVCNSSKVDCEID---KVDLVFLMDGSTSIQPNDFKKMKEFLASVVQDFDVSLNR-- 1037 Query: 215 VVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKE 271 VR G FS FPL G + I +I N T L + Sbjct: 1038 -VRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALRKVEHYFRPDMG 1096 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + ++ LTDG++ E + RG +Y++G+ Q Sbjct: 1097 SRINTGTP-----QVLLVLTDGQSQD------EVAQAAEALRHRGIDIYSVGIGDVDDQQ 1145 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ + ++ +V N +L RI + + Sbjct: 1146 LIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1176 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 61/353 (17%), Positives = 106/353 (30%), Gaps = 47/353 (13%) Query: 32 MGLVIETSHKFFVKAKLHYILDHSLLYTATK--------ILNQENGNNGKKQKNDFSYRI 83 + + I S L + A K N N G Q Sbjct: 284 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 343 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 T LR G IE ++ + H + + F Sbjct: 344 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLK---TFADLAAH 400 Query: 144 WCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVSLSMNDHFGPGMDKLG 193 IT +V + S+ L D+ +++D S S Sbjct: 401 NQTFLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFH 454 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFG 250 + ++ + P VR G V ++ F + Q + + I + G Sbjct: 455 EMKTFLSAVVGMFDIAPHK---VRVGAVQYADSWDLEFEINKYSNRQDLGKAIENIRQMG 511 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + L + + + AK++ + H ++ LT+G + L N Sbjct: 512 GNTNTGAALNFTLSLLQKAKKQRGNKVPCH------LVVLTNG------MSKDSILEPAN 559 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEM 362 + VYAIGV+ EA L+ A + R Y V + L D ++ +E+ Sbjct: 560 RLREEHIRVYAIGVK-EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEI 611 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 79/202 (39%), Gaps = 29/202 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S+ M ++ ++ + PD V+ G+V FS Sbjct: 618 KEMKADIMFLVDSSGSIGPENFSKM------KTFMKNLVSKSQIGPDQ---VQIGVVQFS 668 Query: 225 SKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L I I+++ G TT + L + K +I Sbjct: 669 DINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGARPNI----- 723 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE + L ++ G I+Y++GV Q + P+ Sbjct: 724 --RKFLILITDGEAQDIVKEPAVVL------RQEGVIIYSVGVFGSNVTQLEEISGRPEM 775 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 + V+N D RI ++V Sbjct: 776 VFYVEN----FDILQRIEDDLV 793 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 64/202 (31%), Gaps = 26/202 (12%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV ++ + + + + ++ N +R GLV +S++ L Sbjct: 228 VADVVFLLDMSINGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKVINSL 284 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + G+ + + I L + T A K+ + ++ + + + +T Sbjct: 285 SMGINKSEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIAVLVT 341 Query: 292 --DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 D E++ +R G ++ +G++ + Q K + P + Sbjct: 342 HRDSEDNVTKAAV--------NLRREGVTIFTLGIEGASDTQLEKIASHP--------AE 385 Query: 350 KLHDAFLRIGKEMVKQRILYNK 371 + + K Sbjct: 386 QYVSKLKTFADLAAHNQTFLKK 407 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 83/203 (40%), Gaps = 29/203 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S + P + I +M++ S+P + R L +S K+ Sbjct: 28 DVVFLVDSSDRLGSRSFPFV------KMFITKMIN---SLPIEADKYRVALAQYSDKLHS 78 Query: 230 TFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L+ G + + + + G + + L+ A+ F A K Sbjct: 79 EFHLSTFKGRSPMLNHLRKNVGFIGGSLQIGKALQEAHRTYFSAPAD--GRDKKQFPPIL 136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ + ++++ + ++A ++ G + +IGVQ +A+++ LK A+ ++ Sbjct: 137 VVV---------ASSESEDDVVEASKALQKDGVKIISIGVQ-KASEENLKAMATSQFHFN 186 Query: 345 VQNSRKL---HDAFLRIGKEMVK 364 ++ R L I K++ K Sbjct: 187 LRTVRDLSMFSQNMTHIIKDVTK 209 >gi|226366409|ref|YP_002784192.1| hypothetical protein ROP_70000 [Rhodococcus opacus B4] gi|226244899|dbj|BAH55247.1| hypothetical membrane protein [Rhodococcus opacus B4] Length = 328 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 75/247 (30%), Gaps = 32/247 (12%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + + + + +++V+DVSLSM +L Sbjct: 61 IPALLMVLALVF----FSVALAGPTEDKRVPRNRATVILVIDVSLSMKATDVEP-TRLAA 115 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + + D + + GLV F+ + + I+ L T Sbjct: 116 AQDAAKSFADGLTP------GINLGLVAFAGTASVLVSPTTNREATKVAIDNLQLSERTA 169 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEA 312 + + + + + I+ L+DG+ + P D + +A Sbjct: 170 TGEAI---FTSLQSIDTLAAVLGGSDQAPPARIVLLSDGKQTVPENPDDPRGGFTAARQA 226 Query: 313 KRRGAIVYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 K + + I D L+ A S F++ + +L D + Sbjct: 227 KDKDVPISTISFGTSYGKVEIEDERIPVPVDDPSLREIANLSGGSFFTASSLEELRDVYD 286 Query: 357 RIGKEMV 363 + +++ Sbjct: 287 TLEEQIG 293 >gi|291399639|ref|XP_002716220.1| PREDICTED: collagen, type VI, alpha 6 [Oryctolagus cuniculus] Length = 2273 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 49/330 (14%), Positives = 104/330 (31%), Gaps = 23/330 (6%) Query: 40 HKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN 99 F ++ + +D T + + R + + ++ Sbjct: 676 DTFMSQSDISNAIDRMAHIGETTLTGGALTFVSQYFSPAKGARPNVRKFLILITDGEAQD 735 Query: 100 GFAQDINNIERS--TSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 + + + S+ + + R EM F F + Sbjct: 736 VVKEPAVALRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQRIEDDLVFGIC 795 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S + K LD++ V+D S S++ M M+ ++K VR Sbjct: 796 SPREECKRIEVLDVVFVIDSSGSIDYDEYNIMKDF---------MIGLVKKADVGKAQVR 846 Query: 218 SGLVTFSSKIVQTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G + ++ F L ++ I N G T + L ++ + +A+ Sbjct: 847 FGALKYADDPEVLFYLGDLDTKMEVISMLQNDQPMGGNTYTAEALAFSDHMFTEARGSRL 906 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + +I +TDGE + D ++ + +G +V A+G+ + L Sbjct: 907 HKGVP-----QVLIVITDGE----SHDAEKLNGTTKALRDKGILVLAVGIAGANPVELLA 957 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 S D+++ V+ L F + + Sbjct: 958 MAGSSDKYFFVETFGGLQGIFSDVSASVCN 987 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 66/214 (30%), Gaps = 18/214 (8%) Query: 153 LLITSSVKISSKSDIGLDMMM-VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 I S V S + +D + +D+ M+ + ++ + Sbjct: 975 QGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIDPSDFKKMKEFVASVVQDFDVSLN 1034 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFD 268 V+ G FS FPL + + I N T L Sbjct: 1035 R---VQIGAAQFSHTYQPEFPLGTFTDEKEVSFHIENIQQIFGYTHIGAALHQVGRYFQP 1091 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + + ++ LTDG++ E E + G +Y++G+ Sbjct: 1092 DMGSRINTGTP-----QVLLVLTDGQSQD------EVAQAAEELRHNGVDIYSVGIGNVD 1140 Query: 329 ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q ++ + D+ +V N +L RI + + Sbjct: 1141 HQQLIQITGTADKKLTVDNFDELKKIKKRIVRNI 1174 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 46/203 (22%), Positives = 83/203 (40%), Gaps = 31/203 (15%) Query: 165 SDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D+ D+M ++D S S+ ++FG K+ +S+ + + V+ G+V F Sbjct: 616 KDLKADIMFLVDSSGSIGLENFG----KMKTFMKSL------VSKSQIGAHRVQIGVVQF 665 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S + F L I I+R+ G TT + L + AK + Sbjct: 666 SHINKEEFQLDTFMSQSDISNAIDRMAHIGETTLTGGALTFVSQYFSPAKGARPN----- 720 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 +K++I +TDGE + +L ++ G I+Y++GV Q + P+ Sbjct: 721 --VRKFLILITDGEAQDVVKEPAVAL------RQEGVIIYSVGVFGSNVTQLEEISGRPE 772 Query: 341 RFYSVQNSRKLHDAFLRIGKEMV 363 + V+N D RI ++V Sbjct: 773 MVFYVEN----FDILQRIEDDLV 791 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 70/197 (35%), Gaps = 26/197 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S S + E+ + P VR G V ++S Sbjct: 435 DIYLLIDGSGS------TQATDFQEMKTFLSEVASMFHIGPHK---VRVGAVQYASSWDL 485 Query: 230 TFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + I + G T + L + + AK++ + H Sbjct: 486 EFEIGKYSNKHDLGRAIENIRQLGGDTNTGAALNFTLRLLQKAKQQRGNKVPCH------ 539 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSV 345 ++ LT+G + L + VYAIGV+ EA L+ A + R Y V Sbjct: 540 LVVLTNG------MSKDSILEPAKRLREENIRVYAIGVK-EANQTQLREIAGDEKRVYYV 592 Query: 346 QNSRKLHDAFLRIGKEM 362 + L D ++ +E+ Sbjct: 593 HDFDALKDIRNQVVQEI 609 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 65/178 (36%), Gaps = 24/178 (13%) Query: 170 DMMMVLDVS--LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ +LDV+ S+ D ++ + + ++ + +R GLV +S++ Sbjct: 228 DVVFLLDVAVNGSLED--------FDHLKGFLQ---ESVSALDIKESCMRVGLVAYSNET 276 Query: 228 VQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L+ G + +I L G+ T A KI ++ + + Sbjct: 277 EVIGSLSEGVNRSAVLHRIQSLSPGAGEAYTGA---ALRKIRKEVFGARGGSRKNQGVPQ 333 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ +T +R+G V+ +GV+ +A Q K + P Y Sbjct: 334 IVVLVT------HRASEDNVTRAAVNLRRQGVTVFTLGVEGASATQLEKIASHPAEQY 385 >gi|149632101|ref|XP_001514410.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 2392 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 80/199 (40%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ + M M+D++K N V+ G + +S Sbjct: 806 LDIVFVIDSSGSIDSNEYNIMKAF---------MIDLVKKADVGKNQVQFGALKYSDFPE 856 Query: 229 QTFPLA--WGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I I N G +T + L ++ + ++ H + Sbjct: 857 VLFNLNEFSSKSEIISFIQNDHPRGGSTYTAKALAHSAHLFSESLGSRMHRGVP-----Q 911 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE+ ++ N + + +G +V A+G++ ++ L S DR++ V Sbjct: 912 VLIVITDGESHDAHLLNATAR----ALRDKGILVLAVGIEGANHEELLSMAGSTDRYFFV 967 Query: 346 QNSRKLHDAFLRIGKEMVK 364 +N L F + + Sbjct: 968 ENFEGLKGIFENVSASVCN 986 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 74/186 (39%), Gaps = 25/186 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S+ D ++ +++ K N V+ GLV FS Sbjct: 615 KEMSADIMFLVDSSGSIGG------DNFEKMKTFMKNVVNRTKI---GANQVQVGLVQFS 665 Query: 225 SKIVQTFPLA-WGVQH-IQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + + I + I+ L T L + + +K + Sbjct: 666 DINKEGFQLNQYDTKTKISDAIDGLSLIGRGTLIGGALTFVSDYFSVSKGARPN------ 719 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK+++ LTDG++ + +L ++ G I+Y++GV Q + D Sbjct: 720 -VKKFLVLLTDGKSQDAVKEAAVAL------RQDGVIIYSVGVFGSEYSQLEEISGRSDM 772 Query: 342 FYSVQN 347 + V+N Sbjct: 773 VFYVEN 778 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 72/210 (34%), Gaps = 24/210 (11%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T +K D+ +++D S S+ + R + E++ + P Sbjct: 420 TERLKSGCADTEAADIYLLIDGSGSIQ------VADFQEMKRFLAEVIGMFNIGPHK--- 470 Query: 216 VRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEK 272 VR G V +S F + + + + + G T + L+ A+++ Sbjct: 471 VRFGAVQYSHLWEWEFEMDRYSNKNDLVKAVENIRQLGGNTDTGAALDKMLPLFQRARQQ 530 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++++ LTDG + + + VYAIGV+ Q Sbjct: 531 RARK------VPQHLVVLTDG------LSHDSVREPAGRLRGDNINVYAIGVKEANHTQL 578 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + S R Y V N L D R+ + + Sbjct: 579 EEIAGSDSRVYYVHNFDSLKDIKNRVVRSI 608 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 80/215 (37%), Gaps = 26/215 (12%) Query: 154 LITSSVKISSKSDIGL---DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 +++SV +SK D L D++ ++D S S+ + + +++ P Sbjct: 979 NVSASVCNTSKVDCELGMADLVFLIDGSTSILEE------DFKKMKDFLVTIVNDFDIRP 1032 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIF 267 V GL FS + F L ++ +I R+ T L + + Sbjct: 1033 GK---VHVGLAQFSHEYRPEFSLIPFRDKIEVKNQIGRIQQIFGNTLIGAALRNVGSYFW 1089 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + ++ ++ LTDG++ E + + +G +Y++GV Sbjct: 1090 PDFGSRINAG-----VQQVLLVLTDGQSQD------EVAQAAEDLRNKGIDIYSLGVGQV 1138 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q ++ S + +V N +L R+ +++ Sbjct: 1139 NDQQLIQISGSAKKKLTVDNFSELDKIKKRVVRDV 1173 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 67/191 (35%), Gaps = 15/191 (7%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V+D+ +++ + + ++ I S N +R GLV +S++ L Sbjct: 225 VVDIVFLVDESVNGTDENFEHLKGFL---VETIDSFDVKENCMRIGLVMYSNETKLVSRL 281 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 G I ++I+ L + T A N ++ + + +T Sbjct: 282 GTGTNKSDILQQIDGLSPKAGRALTGA---AINVTRKEIFSRGAGSRKSQGVLQITVLIT 338 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYSVQNSRK 350 +R G V+A+G++ Q + + P +++ S+ S Sbjct: 339 ------HRSSEDNVSEAALSLRREGVTVFAVGIEGANETQLDQIASYPREQYVSMVKSYS 392 Query: 351 LHDAFLRIGKE 361 A+ RI ++ Sbjct: 393 DMGAYYRIFQK 403 >gi|281420095|ref|ZP_06251094.1| BatA protein [Prevotella copri DSM 18205] gi|281405895|gb|EFB36575.1| BatA protein [Prevotella copri DSM 18205] Length = 332 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 68/217 (31%), Gaps = 39/217 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P +++ A D+ GL F+ + Sbjct: 87 GIDIMLAMDVSTSMLAEDLRP--NRMEAAK-------DVATEFISGRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ + L T + GL + K K Sbjct: 138 AFTQCPMTTDHASLLRL---LQATRTDIAARGLIDDGTAVGMGLANAVSRLKDSKSKSKV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------------- 332 +I LTDG N+ I + AK G VY IGV + Sbjct: 195 VILLTDGSNNMGEISP---MTAAEIAKSYGIRVYTIGVGTNKVAPYPMPVAGGVQYVNIP 251 Query: 333 -------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L + A + FY N+ +L + I K Sbjct: 252 VEIDTKTLSDIAQTTDGNFYRATNNNELKKIYRDIDK 288 >gi|284163331|ref|YP_003401610.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284012986|gb|ADB58937.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 1446 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 70/193 (36%), Gaps = 29/193 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D + V D S SM+ +L R + + D R+G V ++S Sbjct: 535 DFVFVNDESGSMSGS-PTHYAELAG-KRFVGALTDSE----------RAGRVGYASGANL 582 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 PL + + RL T + GL N + + G ++ +I Sbjct: 583 DQPLTTDHDAVNSSLERLSASGGTNTRAGLRVGLNHLEEE---------GWENRSAVMIL 633 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQN 347 L+DG++ S + L +A G + +G+ + L+ A + FY V+ Sbjct: 634 LSDGKSGS------DPLPVAEDAAEAGVEISTVGLGNNINENELREIAAITGGDFYHVER 687 Query: 348 SRKLHDAFLRIGK 360 L D F R+ + Sbjct: 688 EEDLPDTFERVAE 700 >gi|332654605|ref|ZP_08420348.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] gi|332516569|gb|EGJ46175.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] Length = 472 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 42/228 (18%), Positives = 79/228 (34%), Gaps = 29/228 (12%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 C + L +T++ I S D+++VLD S SM G + + + ++ +++D Sbjct: 32 CDGTLRVTLALTAAPDIVSN---PTDIVLVLDRSGSMT---GTPLADMKLGAKTFIDLID 85 Query: 205 IIKSIPDVNNV---VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 + R G+V+FS+ V L V ++ ++ L G +T Sbjct: 86 EATDSSQDGQIGSGSRMGVVSFSNTAVADTQLITSVDALKAAVDNLSAGGSTNHADAFAK 145 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A A K ++ TDG + + + G I+Y Sbjct: 146 AIQLFDPASANA-----------KVMVMFTDGNTTIGAPPAPVAAAARAQ----GIIIYC 190 Query: 322 IGVQAEAADQF--LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 IG+ L + A+ N+ L + F + + K Sbjct: 191 IGLIGSDGLDITALNDWATDPDASHVAVTPNAADLEELFAELAANISK 238 >gi|126341668|ref|XP_001379927.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2568 Score = 80.2 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 82/199 (41%), Gaps = 22/199 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+++H + +++ + V+ G + +S Sbjct: 753 LDIVFVLDDSSSISEHQYESLINFTK---------HLVEKADVGRDRVQFGALKYSDTPE 803 Query: 229 QTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L Q+ + I++L F G T + L ++ +H ++ K+ Sbjct: 804 ILFYL-NDYQNKKSIIDKLKFQRKGGNTYTAKALNRSHELFT-----EQHGSRIKRGVKQ 857 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG+ + D E + + I+Y +GV + + L S D ++ V Sbjct: 858 MLIVITDGK----SHDYLELETVGKALRAKNIIIYGVGVAEASDKELLDITGSKDNYFMV 913 Query: 346 QNSRKLHDAFLRIGKEMVK 364 N KL + +L I K+ + Sbjct: 914 DNFEKLKEIYLPIEKKACE 932 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 82/205 (40%), Gaps = 25/205 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ +F + E++ + P+ VR G V +SS Q Sbjct: 383 DIYFLIDGSGSVRPYFHEI-------KNFMIEVIQMFNVAPEK---VRFGAVQYSSTHQQ 432 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F ++ + + +N G TT + L + +I +K+ K HD + Sbjct: 433 EFGISTYSNVVDLSKAILNIRQLGHTTNTGAALRFMLEEIKSSKK------KQHDSVPCH 486 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 ++ +TDG ++ + + L + I++AIGV+ + S DR V Sbjct: 487 LLVVTDGMSNDEVSEPAKRL------REEDVIIHAIGVKGANIIELKDIAGSEDRVKFVY 540 Query: 347 NSRKLHDAFLRIGKEMVKQRILYNK 371 N L I + + ++R +K Sbjct: 541 NFESLKGIKNEIVQTICRERDCKDK 565 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 64/201 (31%), Gaps = 33/201 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 T S D+ +D+ +LD S +M + I +LD PD Sbjct: 2208 STLESGSPSLRDVFMDVAFLLDSSWNMGSS------EFQEIKSFIASVLDYFYITPDPLT 2261 Query: 215 ---VVRSGLVTFSSKIVQTFP-----------LAWGVQH-IQEKI--NRLIFGSTTKSTP 257 R L+++S + + H ++ I + + + Sbjct: 2262 SPVGDRVALLSYSPPGYMPNGEECPVLLEFDLMTYNNAHQMKSYIWESLQQLNGNSFTGH 2321 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 L++ + IF + + I ++ GE + +D K A+ G Sbjct: 2322 ALKWTIDNIFSG--------TPNLRKNRVIFVISAGETNY--LDRKTLKEESQRARCLGY 2371 Query: 318 IVYAIGVQAEAADQFLKNCAS 338 ++ + D+ L+ AS Sbjct: 2372 AIFVLSFGPRHNDKELEELAS 2392 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 70/198 (35%), Gaps = 30/198 (15%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 I + D D++ +LD SL + + +++ L+ I + D+ Sbjct: 167 VGEFSILCERDSLADIVFLLDESLKVRRNLEN-----------LQKFLENITASMDIKAG 215 Query: 216 -VRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKE 271 R G+++FS + L I + I L ++ A +E Sbjct: 216 CTRIGILSFSDETEVVASLERDMTHDEIIQAIRSLSLRDGKAYIGAAMKKA-------RE 268 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 H K ++ + +T + + E L +G ++ IG++ Q Sbjct: 269 LFSHSRK-AQGVRQIAVLVTHRSSEDDMHEAAEDLLL------QGVSIFGIGIKGYNITQ 321 Query: 332 FLKNCA-SPDRFYSVQNS 348 F + + PD + S QNS Sbjct: 322 FKQMVSYPPDHYISEQNS 339 >gi|110634434|ref|YP_674642.1| hypothetical protein Meso_2084 [Mesorhizobium sp. BNC1] gi|110285418|gb|ABG63477.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 549 Score = 80.2 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 60/181 (33%), Gaps = 45/181 (24%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL+ ++ +++R T G+ + + +E + D +K +I L Sbjct: 368 TPLSNDYAALKREVSRFTADGNTNIMEGVAWGMRVL-SPREPFTEGKEPASDVEKIMIVL 426 Query: 291 TDGENSSPNIDNKE----------------------------------SLFYCNEAK--- 313 TDG N+ +N+ +L C AK Sbjct: 427 TDGANNMGLSNNRNHALGSSYSSFGYLVEDRLTRERSQRRVTEEMNRRTLAACENAKREY 486 Query: 314 ----RRGAIVYAIGVQAE--AADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 +Y I ++ A L+ CA+ P ++ + +L+ F I + K R Sbjct: 487 TPSKEDDVTIYTIRLEEPDVATGTLLQECATGPGYYFDSPSRTQLNAIFKEIRDGITKLR 546 Query: 367 I 367 + Sbjct: 547 L 547 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/260 (13%), Positives = 87/260 (33%), Gaps = 55/260 (21%) Query: 4 LNIRNFF-------YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSL 56 L+IR + +G+ +++ A+ +F GL ++ ++ +++L LD ++ Sbjct: 2 LSIRRLRSACSALCRDRRGNFAVILALSALPVFGAAGLAVDYTNMSRTRSELQNALDAAV 61 Query: 57 LYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 L A + G K + + I + + + + ++ +++ S + Sbjct: 62 LAVAQR---------GDKISDAEARSIAASFLTGNLSSAYKNMAVERNGTSVKLSAEAT- 111 Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 MP F + + + ++ +VLD Sbjct: 112 ------------------MPLSFG------GLIGRKEATVGASSTADMAFAYYEIALVLD 147 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ--TFPLA 234 + SM KL ++ ++D + S ++ LV F+S + F Sbjct: 148 TTGSMRG------GKLQAMKEAVNGLIDDLSSRVTDKERLKFALVPFASFVNVGPQFGPE 201 Query: 235 WGVQHIQEKINRLIFGSTTK 254 + R++ G+ Sbjct: 202 FDRNG------RIVPGTGAD 215 >gi|327542784|gb|EGF29248.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 264 Score = 80.2 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 43/258 (16%), Positives = 88/258 (34%), Gaps = 47/258 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--------------- 181 F N P + S + + +D+ +V+D S SM Sbjct: 26 FSPLFPTMGTNLEIRPQRVAVSTQST------MDVALVIDRSGSMAYASDETPDPYVNPA 79 Query: 182 ----NDHFG---PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 +G P + S+ + P + L T+SS + L Sbjct: 80 SAPPGWTYGDPVPPNSRWLDLVASVNAFNGFLVDSPQYEKLC---LATYSSTASRDCDLT 136 Query: 235 WGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 I +++ + + G T GLE+ + DA + + ++ +T Sbjct: 137 HTYAEISNELDAISYQFDGGGTSVGYGLEHGLAVLTDATHARKFAV-------RVMVLMT 189 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSR 349 DG +++ + ES+ Y + G ++ I +A + N A+ + ++ Sbjct: 190 DGHHNTGK--SPESMMY--HLQNHGVTLFTITFSDDADQSRMSNLANACGGENFHATDAS 245 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L +AF +I K++ Sbjct: 246 QLQNAFQKIAKKLPSLMT 263 >gi|114764812|ref|ZP_01443994.1| hypothetical protein 1100011001322_R2601_10469 [Pelagibaca bermudensis HTCC2601] gi|114542698|gb|EAU45721.1| hypothetical protein R2601_10469 [Roseovarius sp. HTCC2601] Length = 477 Score = 80.2 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 67/459 (14%), Positives = 148/459 (32%), Gaps = 104/459 (22%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA---- 60 ++ F GS++I+ L ++ I G+ I+ + + K+ LD ++L A Sbjct: 25 QLQGFRRAESGSMTIMAVALSLLMMIFGGIGIDMMYAELQRTKIQNTLDRAVLAAADLDN 84 Query: 61 ---------------------TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN 99 + E N + + + E + Sbjct: 85 ELDAQGVVEDYMSKMSLADALVSVNVDEGLNYRTVTADGYRTMPSN--FMQLIGIENMQA 142 Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 G +S+++D + ++ + + L+ S V Sbjct: 143 GGHSQAMERINKVEVSMVLDISGSMDDGDKMAELQ-TAASDFVDTLLDDGSEDLVSISLV 201 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S + G +++ L+V+ +D + A + D ++ Sbjct: 202 PYSEHVNAGPEILSYLNVNYMHDDSYCLEMPNSAFNSAALDLSLTYDQMQHF----QWNY 257 Query: 218 SGLVTFSSKI------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 SG + + + Q P + ++ +I++L + T G+++A + + Sbjct: 258 SGSNSLTDTVCPRYAYEQIRPWSQDAGALKTQISQLQPRAGTSIFMGMKWASALLDPSTR 317 Query: 272 KLEH----------------IAKGHDDYKKYIIFLTDGE--------------------- 294 + +A D K I+ +TDG+ Sbjct: 318 PIASGMIADGTVDAVFEGRPVAYSDTDVLKTIVLMTDGQHDRSFRIQNWAYNDENEVEHW 377 Query: 295 ---------NSSPNIDNKESLFY---------------CNEAKRRGAIVYAIGVQA-EAA 329 N N N+ S +Y C AKR+G ++++IG + + Sbjct: 378 SQYNLWHYLNYYVNSWNRSSFYYQKYDAATGDTLLSSVCTAAKRQGILIWSIGFEVSDHG 437 Query: 330 DQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +++CA SP F+ V+ ++ +AF I + + + R+ Sbjct: 438 ANVMESCASSPAHFFRVEGV-EISEAFSTIAQTLNQLRL 475 >gi|163846842|ref|YP_001634886.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222524662|ref|YP_002569133.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163668131|gb|ABY34497.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222448541|gb|ACM52807.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 545 Score = 80.2 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 89/241 (36%), Gaps = 38/241 (15%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKS------------DIGLDMMMVLDVSLSMND 183 P + T P P + + + + S D+++V+D S SM Sbjct: 325 PNVPLTDPISPAFGADPQGVQTVLAVPSAEVIVAVKDSWKLNRKRADILLVVDTSGSMEG 384 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---IVQTFPLAWGVQHI 240 DK+ + I L ++ +P+ R GL+TF S+ +V PL+ + Sbjct: 385 ------DKMTMVKAGIETFL--MRILPE----DRLGLITFDSQARLVVPMAPLSENRIDL 432 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 Q + + T L+ A + E + DD + I+ L+DG +++ + Sbjct: 433 QIAVQEMRASGRTALFDALDLARQTL-------EALPPAEDDRIRAIVLLSDGADNASRL 485 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIG 359 +E +E G ++ + ++A Q L A V +S + F + Sbjct: 486 TLEEVRRQFDE---SGITIFPVAYGSDADRQVLDAIAEFSRTIVVVGDSGDIAQIFENLS 542 Query: 360 K 360 + Sbjct: 543 R 543 >gi|224372482|ref|YP_002606854.1| von Willebrand factor, type A [Nautilia profundicola AmH] gi|223588580|gb|ACM92316.1| von Willebrand factor, type A [Nautilia profundicola AmH] Length = 288 Score = 80.2 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 79/218 (36%), Gaps = 35/218 (16%) Query: 151 APLLITSSVK---ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + S VK I++ G ++++ LD S SM + +K+ A + Sbjct: 55 LSVALASPVKSKIITNTHKKGYNIVIDLDTSGSMAEF-----NKIDAAKAVSLD------ 103 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYN 264 GLV F + PL + + ++ + R+ I G T L + N Sbjct: 104 -FAKKRKNDALGLVVFGNIAYIASPLTFDKKTFEDILKRIYVSIAGGKTAIYDALFLSSN 162 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 +A +K II LTDG ++ + + K+ VY+I + Sbjct: 163 LFKNANG------------EKIIILLTDGMDNMSITPLDVVIK---KLKKEHIKVYSIAI 207 Query: 325 QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +A LK + + +FY + L + I K Sbjct: 208 GGDADLSVLKKISKETNGKFYIASSLEDLKKIYSDINK 245 >gi|325297739|ref|YP_004257656.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317292|gb|ADY35183.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 332 Score = 80.2 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 70/217 (32%), Gaps = 39/217 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A + + GL F+ + Sbjct: 87 GIDIMLAVDVSTSMLAEDLKP--NRLEAAK-------QVAAQFINGRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL + + + GL I K K Sbjct: 138 AFTQCPLTIDHGVLLNLFGSIK---GDIAQRGLIEDGTAIGMGLANSISRLKDSKAKSKV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------------------ 328 II LTDG N+ +I L AK+ G VY IGV Sbjct: 195 IILLTDGSNNRGDISP---LTAAEIAKQFGIRVYTIGVGTNGTAPYPMPTYAGVQYVNVP 251 Query: 329 ---ADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGK 360 +Q L AS ++ ++ KL + + I K Sbjct: 252 VEIDEQTLIQIASTTNGNYFRATSNSKLKEVYEEIDK 288 >gi|301755498|ref|XP_002913610.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like [Ailuropoda melanoleuca] Length = 765 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 47/221 (21%), Positives = 84/221 (38%), Gaps = 22/221 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+F + + KIS+ S + + +DV ++ G + Sbjct: 13 FLFSGVHLSHPLQEVHVSRETVGKISAASKM-IQCFAAVDVLFLIDGSHSVGKGSFERSK 71 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST-T 253 + D + P+ VR G + FSS FPL Q ++ KI R++F T Sbjct: 72 HFAIMVCDALDINPER---VRVGALQFSSAPRLEFPLDSFSSQQEVKAKIKRMVFKGGRT 128 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ L+Y + F + + ++ +TDG + P + K Sbjct: 129 ETGLALKYLLRRGFPGGR--------NASVPQILVVITDGRSQGPVE------LPAKQLK 174 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 RG V+A+GV+ ++ L AS R V + ++ DA Sbjct: 175 ERGVTVFAVGVRFPRWEE-LHTLASEPREQHVLMAEQVDDA 214 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 25/128 (19%), Positives = 46/128 (35%), Gaps = 16/128 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ +R + PDV GLV + S++ Sbjct: 529 SLDLLFMLDASASVGSE------NFAQMQSFVRSLTLQFDVNPDVTQ---VGLVVYGSRV 579 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L G + +++ + S A ++D ++ A+ K Sbjct: 580 QTAFGLDTHLGRAAVLRAMSQAPYLGGVGSAG---TALLHVYDKVMTVQRGARPGVP--K 634 Query: 286 YIIFLTDG 293 ++ LT G Sbjct: 635 AVVLLTGG 642 >gi|156616290|ref|NP_001096078.1| collagen alpha-6(VI) chain precursor [Homo sapiens] gi|189082902|sp|A6NMZ7|CO6A6_HUMAN RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2263 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 808 LDVVFVIDSSGSIDYDEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 858 Query: 229 QTFPLA-WGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +G + I N G +T + L ++ + +A+ + + Sbjct: 859 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRLNKGVP-----Q 913 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 914 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIDGANPVELLAMAGSSDKYFFV 969 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 970 ETFGGLKGIFSDVTASVCN 988 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 73/211 (34%), Gaps = 20/211 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+SV SSK D +D +D+ M+ + + ++ + Sbjct: 982 VTASVCNSSKVDCEID---KVDLVFLMDGSTSIQPNDFKKMKEFLASVVQDFDVSLNR-- 1036 Query: 215 VVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKE 271 VR G FS FPL G + I +I N T L + Sbjct: 1037 -VRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFRPDMG 1095 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + ++ LTDG++ E + RG +Y++G+ Q Sbjct: 1096 SRINTGTP-----QVLLVLTDGQSQD------EVAQAAEALRHRGIDIYSVGIGDVDDQQ 1144 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ + ++ +V N +L RI + + Sbjct: 1145 LIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1175 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 59/355 (16%), Positives = 106/355 (29%), Gaps = 51/355 (14%) Query: 32 MGLVIETSHKFFVKAKLHYILDHSLLYTATK--------ILNQENGNNGKKQKNDFSYRI 83 + + I S L + A K N N G Q Sbjct: 283 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 342 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 T LR G IE ++ + H + + F Sbjct: 343 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLK---TFADLAAH 399 Query: 144 WCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVSLSMNDHFGPGMDKLG 193 IT +V + S+ L D+ +++D S S Sbjct: 400 NQTFLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFH 453 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----I 248 + E++ + P VR G V ++ F + ++ + + Sbjct: 454 EMKTFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEI--NKYSNKQDLGKAIENIRQ 508 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + L + + + AK++ + H ++ LT+G + L Sbjct: 509 MGGNTNTGAALNFTLSLLQKAKKQRGNKVPCH------LVVLTNG------MSKDSILEP 556 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEM 362 N + VYAIG++ EA L+ A + R Y V + L D ++ +E+ Sbjct: 557 ANRLREEHIRVYAIGIK-EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEI 610 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 82/216 (37%), Gaps = 29/216 (13%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ + ++ D+M ++D S S+ M ++ ++ + P Sbjct: 603 RNQVVQEICTEEACKEMKADIMFLVDSSGSIGPENFSKM------KTFMKNLVSKSQIGP 656 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIF 267 D V+ G+V FS + F L I I+++ G TT + L + Sbjct: 657 DR---VQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFS 713 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K +I +K++I +TDGE + L ++ G I+Y++GV Sbjct: 714 PTKGARPNI-------RKFLILITDGEAQDIVKEPAVVL------RQEGVIIYSVGVFGS 760 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 Q + P+ + V+N D RI ++V Sbjct: 761 NVTQLEEISGRPEMVFYVEN----FDILQRIEDDLV 792 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 67/206 (32%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD+S + + + + ++ N +R GLV +S++ Sbjct: 229 DVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKV 279 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ G+ + + I L + T A K+ + ++ + + Sbjct: 280 INSLSMGINKSEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIA 336 Query: 288 IFLT--DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +T D E++ +R G ++ +G++ + Q K + P Sbjct: 337 VLVTHRDSEDNVTKAAV--------NLRREGVTIFTLGIEGASDTQLEKIASHP------ 382 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + K Sbjct: 383 --AEQYVSKLKTFADLAAHNQTFLKK 406 >gi|299135165|ref|ZP_07028356.1| conserved hypothetical protein [Afipia sp. 1NLS2] gi|298590142|gb|EFI50346.1| conserved hypothetical protein [Afipia sp. 1NLS2] Length = 601 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 34/155 (21%), Positives = 62/155 (40%), Gaps = 18/155 (11%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK-GHDDYKKYIIF 289 P++ ++ +IN + T GL + + + + + AK + YK YI+ Sbjct: 445 TPMSNQWSTLKSQINAMTPSGNTNQAVGLFWGWQTLNTTNDPFKAPAKDPNWVYKDYIVL 504 Query: 290 LTDGENSSP------NIDNKESLFYCNEAKR------RGAIVYAIGVQAEAAD---QFLK 334 L+DG N+ + + C K V++I V + D + L+ Sbjct: 505 LSDGLNTQNRWTQTVSDIDARQELLCKNIKDPAQNGGNQITVFSIQVNISSKDPTSKVLQ 564 Query: 335 NCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +CA+P F + S + DAF + + K RI Sbjct: 565 DCATPGAGYFQMITQSSQTADAFNNVLATIAKLRI 599 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 84/226 (37%), Gaps = 29/226 (12%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + +G+++I+ AI+ + ++G ++ + + L LD + L ++++ Sbjct: 14 RRFQTDARGNVAIIFAIVSIPLVALVGAAVDYTRAVSDRTALQSALDSAAL-----MISK 68 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + Q + + + +++ + +N A N S+ + + Y Sbjct: 69 DAATMSASQITTRARQYVDSLYTAT--DAPIQNFTATYTPNSGSGASILLSANGTMPTYF 126 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + +N + P+ +S+ K S + + +VLD + SM + Sbjct: 127 MRV--------------LGSNFNTLPVATSSTTKWGS---TRMRVALVLDNTGSMAQN-- 167 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 K+ + +M+ + + V +V F+ + + Sbjct: 168 ---GKMAALQSAATDMITKLSAFNTTTGDVYISIVPFAKDVNVSTS 210 >gi|212693197|ref|ZP_03301325.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|237709939|ref|ZP_04540420.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725394|ref|ZP_04555875.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753591|ref|ZP_06088946.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664302|gb|EEB24874.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|229436081|gb|EEO46158.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456032|gb|EEO61753.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235305|gb|EEZ20829.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 332 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 72/217 (33%), Gaps = 39/217 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A + E ++ GL F+ + Sbjct: 87 GIDIMLAVDVSTSMLAEDLKP--NRLEAAKQVASEFIN-------GRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL + N + + GL I K K Sbjct: 138 SFTQCPLTVDHGVLLNLFNSIK---GDIAQRGLIEDGTAIGMGIANAVTRLKDSKAKSKV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------------- 332 II LTDG N+ +I L AK+ G +Y IGV + Sbjct: 195 IILLTDGSNNRGDISP---LTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVP 251 Query: 333 -------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + ++ ++ KL + + I K Sbjct: 252 VEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|297671963|ref|XP_002814093.1| PREDICTED: collagen alpha-6(VI) chain-like [Pongo abelii] Length = 2291 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 75/199 (37%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 836 LDVVFVIDSSGSIDYDEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 886 Query: 229 QTFPLA-WGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +G + I N G T + L ++ + +A+ + + Sbjct: 887 VLFYLDDFGTKLEVISVLQNDQAMGGNTYTAEALGFSDHMFTEARGSRLNKGVP-----Q 941 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 942 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIAGANPVELLAMAGSSDKYFFV 997 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 998 ETFGGLKGIFSDVTASVCN 1016 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 73/211 (34%), Gaps = 20/211 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+SV SSK D +D +D+ M+ + + ++ + Sbjct: 1010 VTASVCNSSKVDCEID---KVDLVFLMDGSTSIQQNDFKKMKEFLASVVQDFDVSLNR-- 1064 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKE 271 VR G FS FPL G + I +I N T L + Sbjct: 1065 -VRIGAAQFSDTYHPEFPLGAFIGEKEISFQIENIKQIFGNTHIGAALRKVEHYFRPDMG 1123 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + ++ LTDG++ E + RG +Y++G+ Q Sbjct: 1124 SRINTGTP-----QVLLVLTDGQSQD------EVAQAAEALRHRGIDIYSVGIGDVDDQQ 1172 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ + ++ +V N +L RI + + Sbjct: 1173 LIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1203 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 75/200 (37%), Gaps = 25/200 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ + ++ D+M ++D S S+ M ++ ++ + P Sbjct: 631 RNQVVQEICTEEACKEMKADIMFLVDSSGSIGPENFSKM------KTFMKNLVSKSQIGP 684 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIF 267 D V+ G V FS + F L I I+++ G TT + L + Sbjct: 685 DR---VQIGAVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFS 741 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K + +K++I +TDGE + +L ++ G I+Y++GV Sbjct: 742 PTKGARPN-------VRKFLILITDGEAQDIVKEPAIAL------RQEGVIIYSVGVFGS 788 Query: 328 AADQFLKNCASPDRFYSVQN 347 Q + P+ + V+N Sbjct: 789 NVTQLEEISGRPEMVFYVEN 808 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 60/355 (16%), Positives = 106/355 (29%), Gaps = 51/355 (14%) Query: 32 MGLVIETSHKFFVKAKLHYILDHSLLYTATK--------ILNQENGNNGKKQKNDFSYRI 83 + + I S L + A K N N G Q Sbjct: 311 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 370 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 T LR G IE ++ + H + + F Sbjct: 371 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLK---TFADLAAH 427 Query: 144 WCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVSLSMNDHFGPGMDKLG 193 IT +V + S+ L D+ +++D S S Sbjct: 428 NQTFLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFH 481 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----I 248 + E++ + P VR G V ++ F + ++ + + Sbjct: 482 EMKTFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEI--NKYSNKQDLGKAIENIRQ 536 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + L + + + AK++ + H ++ LT+G + L Sbjct: 537 MGGNTNTGAALNFTLSLLQKAKKQRGNKVPCH------LVVLTNG------MSKDSILEP 584 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEM 362 N + VYAIGV+ EA L+ A + R Y V + L D ++ +E+ Sbjct: 585 ANRLREEHIRVYAIGVK-EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEI 638 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 67/206 (32%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD+S + + + + ++ N +R GLV +S++ Sbjct: 257 DVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKV 307 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ G+ + + I L + T A K+ + ++ + + Sbjct: 308 INSLSMGINKSEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIA 364 Query: 288 IFLT--DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +T D E++ +R G ++ +G++ + Q K + P Sbjct: 365 VLVTHRDSEDNVTKAAV--------NLRREGVTIFTLGIEGASDTQLEKIASHP------ 410 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + K Sbjct: 411 --AEQYVSKLKTFADLAAHNQTFLKK 434 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 83/203 (40%), Gaps = 29/203 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S + P + I +M I S+P + R L +S K+ Sbjct: 55 DVVFLVDSSDRLGSKSFPFV------KMFITKM---ISSLPIEADKYRVALAQYSDKLHS 105 Query: 230 TFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L+ G + + + G + + L+ A+ F A + + Sbjct: 106 EFHLSTFKGRSPMLNHLRKNFGFIGGSLQIGKALQEAHRTYFSAPAN----GRDKKQFPP 161 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ L + ++++ + ++A ++ G + ++GVQ +A+++ LK A+ ++ Sbjct: 162 ILVVL-------ASSESEDDVEEASKALQKDGVKIISVGVQ-KASEENLKAMATSQFHFN 213 Query: 345 VQNSRKL---HDAFLRIGKEMVK 364 ++ R L I K++ K Sbjct: 214 LRTVRDLSMFSQNMTHIIKDVTK 236 >gi|15827966|ref|NP_302229.1| hypothetical protein ML1808 [Mycobacterium leprae TN] gi|221230443|ref|YP_002503859.1| hypothetical protein MLBr_01808 [Mycobacterium leprae Br4923] gi|81536900|sp|Q9CBL9|Y1808_MYCLE RecName: Full=UPF0353 protein ML1808 gi|254800638|sp|B8ZS82|Y1808_MYCLB RecName: Full=UPF0353 protein MLBr01808 gi|13093519|emb|CAC30761.1| possible membrane protein [Mycobacterium leprae] gi|219933550|emb|CAR71903.1| possible membrane protein [Mycobacterium leprae Br4923] Length = 335 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM +++ A + ++ ++ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRATDVEP-NRMAAAQEAAKQF------AGELTPGINLGLIAYAGTATVL 151 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + +++L F T + + A I I G I+ Sbjct: 152 VSPTTNRYATKNALDKLQFADRTATGEAIFTALQAIATVG---AVIGGGEMPPPARIVLF 208 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQ--------------AEAADQFLK 334 +DG+ + P + K + AK +G + I D+ +K Sbjct: 209 SDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTVYGFVEINGQRQPVPVDDETMK 268 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ +L + + +++ + I Sbjct: 269 KVAQLSGGNSYNAATLAELKAVYASLQQQIGYETI 303 >gi|218781309|ref|YP_002432627.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218762693|gb|ACL05159.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 336 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 70/196 (35%), Gaps = 21/196 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + P ++S K G+D+M+ +D+S SM ++L A Sbjct: 60 MFISAVALMVFALAGPQWGEHYQEVSRK---GVDIMVCVDISNSMMVEDAQP-NRLERAK 115 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R + +++ V R GLV FS PL Q IQ +++L T Sbjct: 116 REVADLI-------RVATGDRLGLVAFSGVAFTQCPLTLDYQAIQMFLDQL-----TVDL 163 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L + + A E K I+ +TDGE++ + L +A G Sbjct: 164 LPLRFQGTDLGAAIEMGMTAFDPKSSTDKVILLITDGEDNE-----EAGLKAAEKASDEG 218 Query: 317 AIVYAIGVQAEAADQF 332 ++ +G+ A Sbjct: 219 IRIFVLGIGDPAGGPV 234 >gi|313159754|gb|EFR59111.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 340 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 65/200 (32%), Gaps = 31/200 (15%) Query: 141 TFPWCANSSHAPLLITSS---VKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVAT 196 +C + L K+ + G++MM+ +DVS SM + F P ++L Sbjct: 59 FILFCTAVTLLILAAARPQFGSKLREEKTQGVEMMLAVDVSNSMLAEDFEP--NRLERTK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----ST 252 +I ++ D + R GL+ F+ + P+ + + R+ Sbjct: 117 YAINKLFDGLHQ-------DRVGLIVFAGEPKVQLPITSDYRMAKAFAKRIDPSLVPVQG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T L A E+ + +I +TDGEN + Sbjct: 170 TAIGKALSQALMSFSGETEENH---------SRVVILITDGENHEDDALAAARHAA---- 216 Query: 313 KRRGAIVYAIGVQAEAADQF 332 G +Y IG+ Sbjct: 217 -EMGIRIYTIGIGTPEGAPI 235 >gi|22299719|ref|NP_682966.1| hypothetical protein tlr2176 [Thermosynechococcus elongatus BP-1] gi|22295903|dbj|BAC09728.1| tlr2176 [Thermosynechococcus elongatus BP-1] Length = 415 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 68/205 (33%), Gaps = 28/205 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L++ ++LD S SM + + A S+ + L R ++ F Sbjct: 34 QQRAPLNLCLILDHSGSMAWQ---PLAMVKQAAASLVDRLLPSD---------RLSVIAF 81 Query: 224 SSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K P W + I+ +I L G T G++ +I K+ Sbjct: 82 DHKAKVLVPNQTVWDKEAIKAQIATLEPGGGTAIDEGMKLGLKEIAAGKQGTISQ----- 136 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASP 339 I LTDGEN DN+ L A + A+G L+ A+ Sbjct: 137 -----IFLLTDGENEHG--DNQRCLELAKLAAEYNITLNALGFGVHWNQDVLEQIADAAG 189 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 R ++ + + F + + Sbjct: 190 GRLVFIEYAEQAIACFQSLFSHISS 214 >gi|332879552|ref|ZP_08447247.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682518|gb|EGJ55420.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 332 Score = 79.9 bits (195), Expect = 5e-13, Method: Composition-based stats. Identities = 59/250 (23%), Positives = 90/250 (36%), Gaps = 42/250 (16%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGV 194 PFI F + TS+ +++ + G+D+M+ +DVS SM + P ++L Sbjct: 56 PFILRLFTFVMVIIILARPQTSNSWKNTQVE-GIDIMLAVDVSTSMLAEDLKP--NRLEA 112 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + E I P+ N GL F+++ P+ + L G T Sbjct: 113 AKQVASEF---IAGRPNDN----IGLTIFAAEAFTQCPMTTDHAVLLN----LFHGIKTD 161 Query: 255 STP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 G+ + K K II LTDG N++ +I L AK Sbjct: 162 MAQRGMIQDGTAVGMGIANAVSRLKDSKAKSKVIILLTDGTNNAGDISP---LTAAEIAK 218 Query: 314 RRGAIVYAIGVQ----------AEAADQFL--------KNCA-----SPDRFYSVQNSRK 350 G VY IGV Q+L K A + FY +++K Sbjct: 219 SFGIRVYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATDNKK 278 Query: 351 LHDAFLRIGK 360 L D + I K Sbjct: 279 LEDVYKDIDK 288 >gi|330995094|ref|ZP_08319011.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576670|gb|EGG58173.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 332 Score = 79.9 bits (195), Expect = 5e-13, Method: Composition-based stats. Identities = 59/250 (23%), Positives = 90/250 (36%), Gaps = 42/250 (16%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGV 194 PFI F + TS+ +++ + G+D+M+ +DVS SM + P ++L Sbjct: 56 PFILRLFTFVMVVIILARPQTSNSWKNTQVE-GIDIMLAVDVSTSMLAEDLKP--NRLEA 112 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + E I P+ N GL F+++ P+ + L G T Sbjct: 113 AKQVASEF---IAGRPNDN----IGLTIFAAEAFTQCPMTTDHAVLLN----LFHGIKTD 161 Query: 255 STP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 G+ + K K II LTDG N++ +I L AK Sbjct: 162 MAQRGMIQDGTAVGMGIANAVSRLKDSKAKSKVIILLTDGTNNAGDISP---LTAAEIAK 218 Query: 314 RRGAIVYAIGVQ----------AEAADQFL--------KNCA-----SPDRFYSVQNSRK 350 G VY IGV Q+L K A + FY +++K Sbjct: 219 SFGIRVYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATDNKK 278 Query: 351 LHDAFLRIGK 360 L D + I K Sbjct: 279 LEDVYKDIDK 288 >gi|139439379|ref|ZP_01772820.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] gi|133775158|gb|EBA38978.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] Length = 2432 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 52/285 (18%), Positives = 97/285 (34%), Gaps = 81/285 (28%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG--MDKLGVATRSIREMLDII--- 206 L SS ++ S LD++MVLD S SM+D G G ++ + +D I Sbjct: 95 TLSAISSTSDTTISGKPLDIVMVLDASGSMDDPMGTGDNTKRIDALKTAANTFIDAIAAQ 154 Query: 207 -KSIPDVNNVVRSGLVTFSSK---------IVQTFPLAWGVQ---------------HIQ 241 +SI D + R +V F+ K + + ++ Sbjct: 155 NQSITDASKQHRVAIVKFAGKKKTDKVGNDTYRDGRYTYNYSQTMKNLTSCKGKDADSLK 214 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-- 299 + + + +T++ GLE A N ++ KK ++F TDG +S + Sbjct: 215 DTVGNINPAGSTQADYGLELAENITINSGRADA---------KKIVVFFTDGSPTSSSGF 265 Query: 300 --IDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------------DQFLKNCAS------ 338 ++ K GA +Y IG+ + A ++F+ +S Sbjct: 266 QASVADSAIASAKSLKANGADIYTIGIFSGANPSADPTAEGTSKVNKFMHAVSSNYPGAT 325 Query: 339 -------------------PDRFYSVQNSRKLHDAFLRIGKEMVK 364 D + S ++ +L F I +++ Sbjct: 326 SSISFWGEWVIDYGTRAENSDYYKSATSASELEKIFEEISGSIIQ 370 >gi|209549601|ref|YP_002281518.1| hypothetical protein Rleg2_2008 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209535357|gb|ACI55292.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 429 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 48/411 (11%), Positives = 120/411 (29%), Gaps = 62/411 (15%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKL----------HYILDHSLLY 58 F + G+ I+TA+L+ + G+ ++ +H ++ +L + Sbjct: 8 FISDRSGNFGIMTALLMVPLLGTAGMAVDFAHAMSLRTQLFAAADAAAVGSIAEKSGAVA 67 Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNI---WQTDFRNELRENGFAQDINNIERSTSLS 115 A + + GK + D ++ + + + + Sbjct: 68 AAMTMTGNGTISLGKTDARSIFLSQVSGELADVNVDLGIDVTKTANKLNSQVSFTAVVPT 127 Query: 116 IIIDDQHKDYNL---SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 + KD +A + Y + N+ + T+ + + + Sbjct: 128 TFMRVLGKDSITISGTATAEYLTASFMDFYILLDNTPSMGVGATAKDVATMEKNTSDSCA 187 Query: 173 MVLDVSLSMNDHFGPGMD-----KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + + N+++ ++ V ++ +E+ KS N R G+ TF +K Sbjct: 188 FACHETENKNNYYNLAKTLGVSMRIDVVRQATKELTLTAKSTRVSTNQFRMGVYTFGTKA 247 Query: 228 V-----QTFPLAWGVQHIQEKINRLIF------GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + ++ + + G + A ++ D Sbjct: 248 EDANLTTISDPTDDLDKVRTYTDAVDLMTIPKQGYNNDQQTSFDNALTQMKDII-TTPGD 306 Query: 277 AKGHDDYKKYIIFLTDG------------ENSSPNIDNKESLFYCNEAKRRGAIV---YA 321 +K + F++DG + + +C K +G + Y Sbjct: 307 GSTATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCKPLKDKGIRIAVLYT 366 Query: 322 IGVQAEAADQF--------------LKNCASPDRFYSVQNSRKLHDAFLRI 358 + + ++ CASP ++ V + + DA + Sbjct: 367 TYLPLPKNSWYNTWISPFQSQIPTKMQECASPGLYFEVTPTEGIADAMKAL 417 >gi|327193254|gb|EGE60160.1| hypothetical protein RHECNPAF_1700073 [Rhizobium etli CNPAF512] Length = 457 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 62/367 (16%), Positives = 130/367 (35%), Gaps = 86/367 (23%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 R+ + G+++I+ A+ L + + +G + + V+ ++ LD +L+ +I N Sbjct: 29 FRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINN 88 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 E+ + K++ D+ + ++N + + IE T ++ Sbjct: 89 SEDTDALKQKVYDWFHAQVENSYA---------------LGEIEIDT----------TNH 123 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM---- 181 N++A + +P F AN P+ + S+VK + S L++ +V+D S SM Sbjct: 124 NITATASGTVPTTFMKI---ANIDTVPVSVGSAVKGPATS--YLNVYIVIDRSPSMLLAA 178 Query: 182 ----NDHFGPGMD--------------------------------KLGVATRSIREMLDI 205 G+ + VA ++RE+LD+ Sbjct: 179 TTSGQSTMYSGIGCQFACHTGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLDM 238 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 I + ++ GL + + + +++++ T +T + Y Y Sbjct: 239 IDESDSNHERIKVGLYSLGDTTKEVLAPTLDTSNARKRLSD-DSYGLTSATS-MNYTYFD 296 Query: 266 IFDAKEKL----EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF----------YCNE 311 + A + + K ++ LTDG S K S +C Sbjct: 297 VALAALQKIVGTGGDGTSSANPLKLVLLLTDGVQSQRGWVVKNSSNLKKVAPLNPDWCGY 356 Query: 312 AKRRGAI 318 K + A Sbjct: 357 VKNKSAT 363 >gi|332817903|ref|XP_003310057.1| PREDICTED: collagen alpha-6(VI) chain [Pan troglodytes] Length = 2263 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 808 LDVVFVIDSSGSIDYDEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 858 Query: 229 QTFPLA-WGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +G + I N G +T + L ++ + +A+ + + Sbjct: 859 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRLNKGVP-----Q 913 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 914 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIDGANPMELLAMAGSSDKYFFV 969 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 970 ETFGGLKGIFSDVTASVCN 988 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 73/211 (34%), Gaps = 20/211 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+SV SSK D +D +D+ M+ + + ++ + Sbjct: 982 VTASVCNSSKVDCEID---KVDLVFLMDGSTSIQPNDFKKMKEFLASVVQDFDVSLNR-- 1036 Query: 215 VVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKE 271 VR G FS FPL G + I +I N T L + Sbjct: 1037 -VRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFRPDMG 1095 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + ++ LTDG++ E + RG +Y++G+ Q Sbjct: 1096 SRINTGTP-----QVLLVLTDGQSQD------EVAQAAEALRHRGIDIYSVGIGDVDDQQ 1144 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ + ++ +V N +L RI + + Sbjct: 1145 LIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1175 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 60/355 (16%), Positives = 106/355 (29%), Gaps = 51/355 (14%) Query: 32 MGLVIETSHKFFVKAKLHYILDHSLLYTATK--------ILNQENGNNGKKQKNDFSYRI 83 + + I S L + A K N N G Q Sbjct: 283 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 342 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 T LR G IE ++ + H + + F Sbjct: 343 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLK---TFADLAAH 399 Query: 144 WCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVSLSMNDHFGPGMDKLG 193 IT +V + S+ L D+ +++D S S Sbjct: 400 NQTFLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFH 453 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----I 248 + E++ + P VR G V ++ F + ++ + + Sbjct: 454 EMKTFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEI--NKYSNKQDLGKAIENIRQ 508 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + L + + + AK++ + H ++ LT+G + L Sbjct: 509 MGGNTNTGAALNFTLSLLQKAKKQRGNKVPCH------LVVLTNG------MSKDSILEP 556 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEM 362 N + VYAIG++ EA L+ A + R Y V N L D ++ +E+ Sbjct: 557 ANRLREEHIRVYAIGIK-EANQTQLREIAGEEKRVYYVHNFDALKDIRNQVVQEI 610 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 82/216 (37%), Gaps = 29/216 (13%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ + D+ D+M ++D S S+ M ++ ++ + P Sbjct: 603 RNQVVQEICTEEACKDMKADIMFLVDSSGSIGPENFSKM------KTFMKNLVSKSQIGP 656 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIF 267 D V+ G+V FS + F L I I+++ G TT + L + Sbjct: 657 DR---VQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFS 713 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K +I +K++I +TDGE + L ++ G I+Y++GV Sbjct: 714 PTKGARPNI-------RKFLILITDGEAQDIVKEPAVVL------RQEGVIIYSVGVFGS 760 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 Q + P+ + V+N D RI ++V Sbjct: 761 NVTQLEEISGRPEMVFYVEN----FDILQRIEDDLV 792 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 67/206 (32%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD+S + + + + ++ N +R GLV +S++ Sbjct: 229 DVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKV 279 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ G+ + + I L + T A K+ + ++ + + Sbjct: 280 INSLSMGINKSEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIA 336 Query: 288 IFLT--DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +T D E++ +R G ++ +G++ + Q K + P Sbjct: 337 VLVTHRDSEDNVTKAAV--------NLRREGVTIFTLGIEGASDTQLEKIASHP------ 382 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + K Sbjct: 383 --AEQYVSKLKTFADLAAHNQTFLKK 406 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 83/203 (40%), Gaps = 29/203 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S + P + I +M I S+P + R L +S K+ Sbjct: 27 DVVFLVDSSDRLGSKSFPFV------KMFITKM---ISSLPIEADKYRVALAQYSDKLHS 77 Query: 230 TFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L+ G + + + G + + L+ A+ F A + + Sbjct: 78 EFHLSTFKGRSPMLNHLRKNFGFIGGSLQIGKALQEAHRTYFSAPTN----GRDKKQFPP 133 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ L + ++++ + ++A ++ G + ++GVQ +A+++ LK A+ ++ Sbjct: 134 ILVVL-------ASSESEDDVEEASKALRKDGVKIISVGVQ-KASEENLKAMATSQFHFN 185 Query: 345 VQNSRKL---HDAFLRIGKEMVK 364 ++ R L I K++ K Sbjct: 186 LRTVRDLSMFSQNMTHIIKDVTK 208 >gi|297526263|ref|YP_003668287.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297255179|gb|ADI31388.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 416 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 84/223 (37%), Gaps = 26/223 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + S+K + + ++V+D S SM+ K+ A ++ +LDI+ Sbjct: 22 TIPFVLSIKGVYSAHPPIAFLIVIDTSYSMDGE------KIFRAKQAALGLLDIL----- 70 Query: 212 VNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + G+ F+ K + A ++ I L GS T Y+ + Sbjct: 71 -RDKDYVGVYGFAGKFYKVLEPVPATKRGEVERAIISLKLGSGTNI-------YDTLKKL 122 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 E+ + + + IIF+TDGE + + K+ L + + GA IGV E Sbjct: 123 VEETKKVLQNGALSLVRIIFITDGEPTVGKKNPKKILEMAKKLREAGASALIIGVGTEYN 182 Query: 330 DQFLKNCASP-----DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ L A + + KL + + +E+ + + Sbjct: 183 EKLLSRMAMALNGEFEHISDPASLEKLISEYAKSTQEVSAKNV 225 >gi|114589213|ref|XP_516745.2| PREDICTED: hypothetical protein [Pan troglodytes] Length = 1859 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 1040 LDVVFVIDSSGSIDYDEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 1090 Query: 229 QTFPLA-WGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +G + I N G +T + L ++ + +A+ + + Sbjct: 1091 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRLNKGVP-----Q 1145 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 1146 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIDGANPMELLAMAGSSDKYFFV 1201 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 1202 ETFGGLKGIFSDVTASVCN 1220 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 73/211 (34%), Gaps = 20/211 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+SV SSK D +D +D+ M+ + + ++ + Sbjct: 1214 VTASVCNSSKVDCEID---KVDLVFLMDGSTSIQPNDFKKMKEFLASVVQDFDVSLNR-- 1268 Query: 215 VVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKE 271 VR G FS FPL G + I +I N T L + Sbjct: 1269 -VRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFRPDMG 1327 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + ++ LTDG++ E + RG +Y++G+ Q Sbjct: 1328 SRINTGTP-----QVLLVLTDGQSQD------EVAQAAEALRHRGIDIYSVGIGDVDDQQ 1376 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ + ++ +V N +L RI + + Sbjct: 1377 LIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1407 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 62/364 (17%), Positives = 109/364 (29%), Gaps = 51/364 (14%) Query: 32 MGLVIETSHKFFVKAKLHYILDHSLLYTATK--------ILNQENGNNGKKQKNDFSYRI 83 + + I S L + A K N N G Q Sbjct: 503 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 562 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 T LR G IE ++ + H + + F Sbjct: 563 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLK---TFADLAAH 619 Query: 144 WCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVSLSMNDHFGPGMDKLG 193 IT +V + S+ L D+ +++D S S Sbjct: 620 NQTFLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFH 673 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----I 248 + E++ + P VR G V ++ F + ++ + + Sbjct: 674 EMKTFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEI--NKYSNKQDLGKAIENIRQ 728 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + L + + + AK++ + H ++ LT+G + L Sbjct: 729 MGGNTNTGAALNFTLSLLQKAKKQRGNKVPCH------LVVLTNG------MSKDSILEP 776 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 N + VYAIG++ EA L+ A + R Y V N L D ++ +E+ + Sbjct: 777 ANRLREEHIRVYAIGIK-EANQTQLREIAGEEKRVYYVHNFDALKDIRNQVVQEICTEEA 835 Query: 368 LYNK 371 NK Sbjct: 836 DLNK 839 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 79/202 (39%), Gaps = 29/202 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ M ++ ++ + PD V+ G+V FS Sbjct: 849 KDMKADIMFLVDSSGSIGPENFSKM------KTFMKNLVSKSQIGPDR---VQIGVVQFS 899 Query: 225 SKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L I I+++ G TT + L + K +I Sbjct: 900 DINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGARPNI----- 954 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE + L ++ G I+Y++GV Q + P+ Sbjct: 955 --RKFLILITDGEAQDIVKEPAVVL------RQEGVIIYSVGVFGSNVTQLEEISGRPEM 1006 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 + V+N D RI ++V Sbjct: 1007 VFYVEN----FDILQRIEDDLV 1024 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 67/206 (32%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD+S + + + + ++ N +R GLV +S++ Sbjct: 449 DVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKV 499 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ G+ + + I L + T A K+ + ++ + + Sbjct: 500 INSLSMGINKSEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIA 556 Query: 288 IFLT--DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +T D E++ +R G ++ +G++ + Q K + P Sbjct: 557 VLVTHRDSEDNVTKAAV--------NLRREGVTIFTLGIEGASDTQLEKIASHP------ 602 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + K Sbjct: 603 --AEQYVSKLKTFADLAAHNQTFLKK 626 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 95/243 (39%), Gaps = 36/243 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGL-------DMMMVLDVSLSMNDHFGPGM 189 + P +S H + +V S K G+ D++ ++D S + P + Sbjct: 207 LLRGASPNTVSSQHRAWRVACTVDHSRKQMNGVNQGPEYADVVFLVDSSDRLGSKSFPFV 266 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL 247 I +M I S+P + R L +S K+ F L+ G + + + Sbjct: 267 ------KMFITKM---ISSLPIEADKYRVALAQYSDKLHSEFHLSTFKGRSPMLNHLRKN 317 Query: 248 I--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 G + + L+ A+ F A + + ++ L + ++++ Sbjct: 318 FGFIGGSLQIGKALQEAHRTYFSAPTN----GRDKKQFPPILVVL-------ASSESEDD 366 Query: 306 LFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL---HDAFLRIGKE 361 + ++A ++ G + ++GVQ +A+++ LK A+ ++++ R L I K+ Sbjct: 367 VEEASKALRKDGVKIISVGVQ-KASEENLKAMATSQFHFNLRTVRDLSMFSQNMTHIIKD 425 Query: 362 MVK 364 + K Sbjct: 426 VTK 428 >gi|170079352|ref|YP_001735990.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887021|gb|ACB00735.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7002] Length = 414 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 71/223 (31%), Gaps = 30/223 (13%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + I + + + L++ +VLD S SM L + I Sbjct: 21 SQRQLSFSIGAIASDFQDATLPLNLCLVLDHSGSMAGQ------PLRTVKEA------AI 68 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + + + R ++ F K P HI+ +I+RL T G++ Sbjct: 69 QLVDQLREGDRLSVIAFDHKAKVIVPNQDVTDKAHIKAQIDRLEAAGGTCIDDGIKLGLQ 128 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFL-TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + K +F+ TDGEN DN L A G + ++G Sbjct: 129 ELASSPGKRAAQ-----------VFMLTDGENEHG--DNGRCLEIAAVAAEHGVTLNSLG 175 Query: 324 VQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 L+ A+ ++ + F R+ + Sbjct: 176 FGENWNQDVLEKIADAANGSLAYIETPNQALTEFERLLQRAQS 218 >gi|73669697|ref|YP_305712.1| BatA [Methanosarcina barkeri str. Fusaro] gi|72396859|gb|AAZ71132.1| BatA [Methanosarcina barkeri str. Fusaro] Length = 317 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 46/249 (18%), Positives = 93/249 (37%), Gaps = 41/249 (16%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + + S + + ++ G+++++V+DVS SM +L A Sbjct: 58 VHLFYLSLVAISLMIIGFANPHIPLEQTKEGVNVVLVMDVSGSMQAQDYTP-SRLEAAKS 116 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKST 256 S +++ +KS +G+VTF S L+ + + EK+ N +T Sbjct: 117 SAEILINSLKSKD------YAGIVTFESGATTAAYLSPYKEKVIEKLRNVAPKEGSTAIG 170 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 GL + + KK II L+DG N++ I E++ Y AK Sbjct: 171 DGLSLGIDMASSIP-----------NKKKVIILLSDGVNNAGYISPDEAIQY---AKANN 216 Query: 317 AIVYAIGVQAEAADQF-----------------LKNCA--SPDRFYSVQNSRKLHDAFLR 357 VY IG+ + L+ A + +++ + + L + + Sbjct: 217 IQVYTIGMGSNGNVLLGYDWFGNPQYAELDEATLQAIANDTGGKYFKSIDDKTLDEIYKN 276 Query: 358 IGKEMVKQR 366 I + + +++ Sbjct: 277 ISENIKREK 285 >gi|169629808|ref|YP_001703457.1| hypothetical protein MAB_2724c [Mycobacterium abscessus ATCC 19977] gi|169241775|emb|CAM62803.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 336 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 72/216 (33%), Gaps = 28/216 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+DVS SM ++LG A + +E ++ + GL+ ++ Sbjct: 100 VMLVIDVSRSMESTDVAP-NRLGAAKEAGKEF------ARNLTPGINLGLIAYAGTATVL 152 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ L T + G+ A I + K I+ + Sbjct: 153 VSPTTNRDATVNALDNLQLADRTATGEGIFTALQAIATVGAVIGGGDKP---PPARIVLM 209 Query: 291 TDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGV--------------QAEAADQFLK 334 +DG+ + P+ + K + AK + + I A ++ Sbjct: 210 SDGKETVPSNPDNPKGAYTAARTAKDQQVPISTIAFGTKDGYVEINGQRQNVPYAPDMME 269 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A S Y+ +L + + + +++ + I Sbjct: 270 KVAKLSGGETYTASTLGQLKEVYANLQQQIGYETIR 305 >gi|254776724|ref|ZP_05218240.1| hypothetical protein MaviaA2_18936 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 80/243 (32%), Gaps = 31/243 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 P ++ LL T+ +S I L+ +M+V+DVS SM P ++L A + Sbjct: 67 VPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMASTDVPP-NRLAAAKEA 125 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ D + + GLV F++ P ++ I+ L T + G Sbjct: 126 GKQFADQLTPAIN------LGLVEFAANATLLVPPTTNRSAVKAGIDSLQPAPKTATGEG 179 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFYCNEAKRRG 316 + A I + G I+ +DG + P + + AK G Sbjct: 180 IFTALQAIATVGSVMGG---GEGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEG 236 Query: 317 AIVYAIGVQAEAA---------------DQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGK 360 + I K C + + + + L + + + + Sbjct: 237 VQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQR 296 Query: 361 EMV 363 ++ Sbjct: 297 QIG 299 >gi|167752251|ref|ZP_02424378.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] gi|167660492|gb|EDS04622.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] Length = 344 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 71/209 (33%), Gaps = 29/209 (13%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGP 187 + R + FI + + S +S G++MM+V+DVS SM + F P Sbjct: 51 STGRRRLKFILYLTAFALLTLALARPQLGSKLREVESR-GIEMMLVVDVSNSMLAEDFQP 109 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 ++L +I ++ D +K R GLV F+ V P+ + + R+ Sbjct: 110 --NRLERTKYAIDKLFDGLKQ-------DRVGLVVFAGDAVVQLPITSDYRMAKAFARRI 160 Query: 248 IFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 T L A + KG + + I+ +TDGE Sbjct: 161 SPSMVSVQGTDIGQALSLATMSFSE---------KGDNPAGRVIVLITDGEGHDSGAIEA 211 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +G ++ IG+ Sbjct: 212 AERAA-----EQGIRIFTIGIGTPEGAPI 235 >gi|226306560|ref|YP_002766520.1| hypothetical protein RER_30730 [Rhodococcus erythropolis PR4] gi|226185677|dbj|BAH33781.1| conserved hypothetical membrane protein [Rhodococcus erythropolis PR4] Length = 326 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 81/264 (30%), Gaps = 31/264 (11%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDV 177 + ++ L P + P + ++ + + +++V+DV Sbjct: 38 RFTNFALLEKVAPSRPGRWRHIPAILMVVALVFFTVALAGPTADKKVPRNRATVILVIDV 97 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 SLSM +L A + + D + + GLV F+ Sbjct: 98 SLSMQATDVEP-TRLAAAQEAAKSFADGLTP------GINLGLVAFAGTASVLVSPTTNR 150 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + I+ L T + + + + + I+ L+DG+ + Sbjct: 151 DATKVAIDNLKLSERTATGEAI---FTSLQSIDTLSAVLGGSDQAPPARIVLLSDGKQTV 207 Query: 298 P--NIDNKESLFYCNEAKRRGAIVYAIGVQ--------------AEAADQFLKNCA--SP 339 P + D + +AK +G + I D LK A S Sbjct: 208 PENSDDPRGGFTAARQAKDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIANLSG 267 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMV 363 F++ + +L + + +++ Sbjct: 268 GSFFTASSLEELRQVYDTLEEQIG 291 >gi|16331837|ref|NP_442565.1| hypothetical protein sll0103 [Synechocystis sp. PCC 6803] gi|2496792|sp|Q55874|Y103_SYNY3 RecName: Full=Uncharacterized protein sll0103 gi|1208467|dbj|BAA10635.1| sll0103 [Synechocystis sp. PCC 6803] Length = 420 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 67/205 (32%), Gaps = 28/205 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L++ +VLD S SM+ L + ++D ++ R ++ F Sbjct: 37 DRRLPLNLCLVLDHSGSMDGQ------PLETVKSAALGLIDRLE------EDDRLSVIAF 84 Query: 224 SSK--IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + IV I + I RL T GL+ + KE Sbjct: 85 DHRAKIVIENQQVRNGAAIAKAIERLKAEGGTAIDEGLKLGIQEAAKGKEDRVS------ 138 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +I LTDGEN DN L A V+ +G L+ A+ Sbjct: 139 ----HIFLLTDGENEHG--DNDRCLKLGTVASDYKLTVHTLGFGDHWNQDVLEAIAASAQ 192 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 ++N + F ++ + M Sbjct: 193 GSLSYIENPSEALHTFRQLFQRMSN 217 >gi|194205647|ref|XP_001498059.2| PREDICTED: similar to von Willebrand factor A domain-containing protein 2 precursor (A domain-containing protein similar to matrilin and collagen) (AMACO) (Colon cancer secreted protein 2) (CCSP-2) [Equus caballus] Length = 784 Score = 79.5 bits (194), Expect = 7e-13, Method: Composition-based stats. Identities = 48/221 (21%), Positives = 83/221 (37%), Gaps = 22/221 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+F P + + KIS+ S + + +D+ M+ G + Sbjct: 13 FLFAGVPPSLPLQEVHVSRDTIGKISAASKM-MWCSAAVDILFLMDGSHSVGKGSFERSK 71 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGST-T 253 + D + VR G FSS FPL ++ Q ++ KI R++F T Sbjct: 72 HFAITVCDALDINLKR---VRVGAFQFSSAPHLEFPLDSFSTQQEVKAKIKRMVFKGGRT 128 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ L+Y K F + + +I +TDG + + K Sbjct: 129 ETGLALKYLLRKGFPGGR--------NASVPQILIIITDGRSQGHVA------LPAKQLK 174 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 RG V+A+GV+ ++ L AS R V + ++ DA Sbjct: 175 ERGITVFAVGVRFPRWEE-LHTLASEPREQHVLMAEQVEDA 214 >gi|289442929|ref|ZP_06432673.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] gi|289415848|gb|EFD13088.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] Length = 246 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 70/213 (32%), Gaps = 28/213 (13%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +V+DVS SM ++ A + ++ D + + GL+ ++ Sbjct: 12 VVIDVSQSMRATDVEP-SRMVAAQEAAKQFADELTP------GINLGLIAYAGTATVLVS 64 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + +++L F T + + A I I G I+ +D Sbjct: 65 PTTNREATKNALDKLQFADRTATGEAIFTALQAIATVG---AVIGGGDTPPPARIVLFSD 121 Query: 293 GENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 G+ + P + K + AK +G + I D+ +K Sbjct: 122 GKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKV 181 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S Y+ +L + + +++ + I Sbjct: 182 AQLSGGNSYNAATLAELRAVYSSLQQQIGYETI 214 >gi|224054051|ref|XP_002190865.1| PREDICTED: collagen, type VI, alpha 1 [Taeniopygia guttata] Length = 1023 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 84/218 (38%), Gaps = 19/218 (8%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F W + + P +IT ++SS D +D+ VLD S S+ P D + + Sbjct: 16 FLWGDSCAQRPEIIT---QVSSAEDCPVDLFFVLDTSESVALRVKPFGDLVTQVKDFTNQ 72 Query: 202 MLDIIKS--IPDVNNVV-RSGLVTFSSKIVQTFPLA---WGVQHIQEKINRLI-FGSTTK 254 +D + N+V +G + +S ++V L G ++ +++ + G T Sbjct: 73 FIDKLTQRYYRCDRNLVWNAGALHYSDEVVLIKSLTPMPSGQSELKNRVSAINYIGKGTY 132 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAK 313 + ++ ++ H KY+I +TDG + L NEAK Sbjct: 133 TDCAIKRGIEELLIGG--------SHHKENKYLIVVTDGHPLEGYKEPCGGLDDAANEAK 184 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 G V+++ + DQ L A+ + + L Sbjct: 185 LLGIKVFSVAISPNHLDQRLNIIATDHAYRRNFTATSL 222 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 61/168 (36%), Gaps = 19/168 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S S+ K + L+ K + VR +V +S + Q Sbjct: 828 DITLLVDSSTSVGSKNFETTKKF--VKQLSGRFLEASK---PTDESVRISVVQYSGRNQQ 882 Query: 230 TFP--LAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + I + I+ + F T L++ + K+ Sbjct: 883 KVEAQFQYNYTVIAKAIDNMEFMNDATDVNSALQFITELYRRSARAAAK--------KRV 934 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++F +DG + I + +A++ G +Y + V ++A + ++ Sbjct: 935 LVF-SDGHSQG--ITARAIERAVQDAQKAGIEIYVLAVGSQANEPNIR 979 >gi|84502751|ref|ZP_01000870.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] gi|84389146|gb|EAQ01944.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] Length = 470 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 61/462 (13%), Positives = 141/462 (30%), Gaps = 109/462 (23%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 ++ +F G + L +L + V G+ ++ + ++ ++D S L A Sbjct: 21 HVVDFARAEDGVMLALVMFMLLTMMTVAGIGVDVMRTEMERTRIQQVIDASTLAAA---- 76 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 +++N + K+ D+ + + + + + +++ + Sbjct: 77 HKDNALDPKQVVLDYFD-------KAALASYISADDILVGGGETSTAVEVNLTAQVKTPF 129 Query: 125 YNLSAVSRYEMPFI-FCTFPWCANSSHAPLLITSSV-------KISSKSDIGLDMMMVLD 176 + +P + + L I+ S+ ++ ++ +D ++ D Sbjct: 130 IRHLGNESFNVPARGRAEQAYGNSEVSLVLDISGSMDDNRRMSRLHRAANEFVDTVLTPD 189 Query: 177 ----VSLSM---------NDHFGPGMDKLG----------VATRSIREMLDIIKSIPDVN 213 VS+S+ M+ +D + Sbjct: 190 SVDRVSVSLIPYTGDVNVGWDIFSRMNVRQLHDYSYCVQFTPDDFSTTAIDPEDAYIQGQ 249 Query: 214 NV----VRSGLVT-FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + R ++ + P + ++ +INRL T G+++ + + Sbjct: 250 HFSHVDARFNYISCPTQSYETVTPFSQNNAALEAQINRLTGRERTSIHIGIKWGAAMLDE 309 Query: 269 AKEKLEHIAKGHDDYK---------------KYIIFLTDGENSSPN-------------- 299 A L + + K I+ +TDG N+ Sbjct: 310 AFRPLVNDLVDNSIVDEAFRDRPAPFTSNTLKVIVVMTDGMNTETKRIKEFAYDTPDMRA 369 Query: 300 -------IDNKESL-----------------------FYCNEAKRRGAIVYAIGV--QAE 327 D + CN AK G I+Y+IG + Sbjct: 370 HWARHAMDDWDNDVDGSVEDHLFDTYYDTAIGNALLQNICNAAKANGIIIYSIGFEINND 429 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 AA + +SP FY V+ ++ +AF I +++ + R+ Sbjct: 430 AAQEMEDCASSPSHFYRVEGV-QISEAFSSIAQQLKQLRLTL 470 >gi|317055486|ref|YP_004103953.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315447755|gb|ADU21319.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1311 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 76/219 (34%), Gaps = 34/219 (15%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + +S + +D+ +V+D S SM + D + + +E +D + SI Sbjct: 625 WDKDFAGTSVDNSGKTVAMDIALVIDSSGSMTWN-----DPKNLRKDAAKEFVDKLSSID 679 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + ++ F S L + I+ + T T G+ + + Sbjct: 680 EA------AIIDFDSSSKINRNLTSNRTLLYSAIDDIDSSGGTSLTAGVSKGLEALSKSN 733 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EA 328 + KK +I LTDG+ +++ G +Y IG+ + Sbjct: 734 D------------KKIMILLTDGKGPYDKSLTTQAI-------NAGVTIYTIGLGTNNDI 774 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 L + A + ++Y + + +F + ++ + Sbjct: 775 DQPLLNSIATETGGKYYHAKKDIDIQGSFDNVSGDLGNK 813 >gi|221369290|ref|YP_002520386.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] gi|221162342|gb|ACM03313.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] Length = 328 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 72/204 (35%), Gaps = 35/204 (17%) Query: 165 SDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S G ++++ LD+S SM D G +L R R ++ R GL Sbjct: 84 SASGREIVLTLDMSGSMLIEDFDIDGVQSTRLEAVKRVARSFVEE-------RQGDRIGL 136 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 F+++ PL + + + I G T +ST I D Sbjct: 137 ALFANRAYVAAPLTFDLAAVGRAIEEASIGITGRST--------AIADGLGLALKRVTES 188 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----------A 329 + I+ L+DG++++ ID ++ A R G ++ I + + Sbjct: 189 GAASRVIVLLSDGQDNAHQIDARQ---VAGLAARHGVRIHTIALGPDDLETRPAARDAVD 245 Query: 330 DQFLKNC--ASPDRFYSVQNSRKL 351 L+ AS R Y V+ L Sbjct: 246 TATLRAIAEASGGRSYRVRGMEDL 269 >gi|116625272|ref|YP_827428.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228434|gb|ABJ87143.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 323 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 48/225 (21%), Positives = 75/225 (33%), Gaps = 40/225 (17%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + S D L + +V D S SM +S + + K + Sbjct: 84 QEISQFSSEDAPLSVGVVFDCSGSMG----------QKLDKSRQAVSQFFKLANPEDEFF 133 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 LV F+ P ++ IQ + T + DA H Sbjct: 134 ---LVQFNDSASLIQPFTRNLEEIQNHLAFTQSKGRT-----------ALLDAVYLALHE 179 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ----- 331 K + +K ++ ++DG ++S E N K +YAIG+ AA + Sbjct: 180 MKKAKNPRKALLLISDGGDNSSRYTEPEIK---NLVKEADVQIYAIGIYESAAGRGRTPE 236 Query: 332 ------FLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 L A + R Y V N +L D +IG E+ Q IL Sbjct: 237 ESSGPALLTEIAEQTGGRQYQVDNLNELPDVAAKIGVELRNQYIL 281 >gi|332828718|gb|EGK01410.1| hypothetical protein HMPREF9455_02243 [Dysgonomonas gadei ATCC BAA-286] Length = 330 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 48/251 (19%), Positives = 84/251 (33%), Gaps = 53/251 (21%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 P + SS +S+ G+D++M LD+S +M F P +L A Sbjct: 62 LRVIAIALVIIVLARPQSVNSSDVSNSE---GIDIVMALDISGTMMAQDFSP--TRLEAA 116 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---ST 252 + E ++ R GLV F + PL + + + + FG Sbjct: 117 KKVAAEFIND-------RPNDRIGLVIFGGESFTQCPLTTDHKVLLNLLTEVKFGMIEDG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GL + N++ D+K K + +I LTDG N++ I L A Sbjct: 170 TAIGLGLANSVNRLKDSKSK-----------SRVVILLTDGSNNAGQIAP---LTAAELA 215 Query: 313 KRRGAIVYAIG----------VQAEAADQFLK-------------NCASPDRFYSVQNSR 349 VY IG + Q ++ + +++ ++ Sbjct: 216 ASYDIRVYTIGIGSRGTSTARIMTPYGLQTMQVSGDFDERTLTEIAAITKGQYFRATDNT 275 Query: 350 KLHDAFLRIGK 360 L + I + Sbjct: 276 SLSAIYDEIDQ 286 >gi|212276002|ref|NP_001130333.1| hypothetical protein LOC100191428 [Zea mays] gi|194688870|gb|ACF78519.1| unknown [Zea mays] Length = 704 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 65/336 (19%), Positives = 117/336 (34%), Gaps = 47/336 (13%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSY-RIIKNIWQTDFRNELRENGFAQDINNIERST 112 SL T+ + +Q + + + N + R E D ++R Sbjct: 134 RSLSSTSPHGRIGGDHTRSPQQDPHLALHQQVSNRRREVRRLRTSEPADYNDDEPLQRME 193 Query: 113 SLSIIIDDQHKDYNLSAVSRYE-MPFIFCT------FPWCANSSHAPLLITSSVKISS-- 163 + + K +S+ ++ +P C A +S + S V SS Sbjct: 194 AFDDLNFGSSKTAEISSYPEFQAVPQSTCLDGFDILIHVKAPTSSSDDATGSLVNGSSLR 253 Query: 164 -KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + +D++ VLDVS SM K+ + +++ ++ + R ++ Sbjct: 254 LSRRVPIDIVTVLDVSGSMAG------TKMALLKQAMGFVIQ------HLRPSDRLSVIA 301 Query: 223 FSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FSS + FPL G Q + IN L G T L+ A I D K + Sbjct: 302 FSSTARRLFPLQRMSHHGRQQALQAINSLGAGGGTNIADALKKAVKVIADRSYKNSVCS- 360 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFY--------CNEAKRRGAIVYAIGVQAEAAD 330 II L+DG+++ N + N + ++ G A+ Sbjct: 361 --------IILLSDGQDTYNISSNFQGTSAGRRSLVPSANPNELHMVPLHTFGFGADHDS 412 Query: 331 QFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 L + AS F +++ + DAF + IG + Sbjct: 413 DTLHSISEASGGTFSFIEDEGVMQDAFAQCIGGLLS 448 >gi|224024929|ref|ZP_03643295.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] gi|224018165|gb|EEF76163.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] Length = 332 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 49/225 (21%), Positives = 79/225 (35%), Gaps = 55/225 (24%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A + E ++ GL F+ + Sbjct: 87 GIDIMLAVDVSTSMLAEDLKP--NRLEAAKQVAAEFIN-------GRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRL--------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 PL + + + T GL A +++ D+K K Sbjct: 138 AFTQCPLTVDHGVLLNLFQSIKCDIAQKGLIMDGTALGMGLANAVSRLKDSKAK------ 191 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------- 328 K II LTDG N+ +I L AK+ G VY IGV Sbjct: 192 -----SKVIILLTDGVNNRGDISP---LTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYA 243 Query: 329 -----------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +Q L A + ++ ++ KL + + I K Sbjct: 244 GVQYVQVPVEIDEQTLTQIAGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|327277464|ref|XP_003223484.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like, partial [Anolis carolinensis] Length = 750 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 75/207 (36%), Gaps = 24/207 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++ S + ++ +DV ++ + G + ++ D + P+ V Sbjct: 32 ETIAKISAAGQLMNCSAPVDVMFLLDGSYSIGKGSFERSKHLAIKLCDALDISPEK---V 88 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKL 273 + G + FS+ F L + I++K+ R++F T++ L+Y K F Sbjct: 89 KVGAIEFSNTAYLEFSLDAYFTKHQIKDKLKRIVFKGGRTETGLALKYILRKGFHGSR-- 146 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + K +I LTDG++ + K G V+ +GV ++ L Sbjct: 147 ------NSTVPKILIILTDGKSQGNIAAP------AKQLKEMGITVFVVGVSFPRWEE-L 193 Query: 334 KNCASPD---RFYSVQNSRKLHDAFLR 357 AS +++ + F Sbjct: 194 HILASDPTEWHLLFAEDTDDAVNGFYT 220 Score = 42.5 bits (98), Expect = 0.100, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 70/197 (35%), Gaps = 27/197 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D++ ++D S S ++ ++ L + S N G+ +S + Sbjct: 341 SVDLLFLVDSSSS------TTLEGFLRYKAFLKRFLQAVWSSETSGN---VGVAQYSDDV 391 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 T + V + + I+ + F T + L Y F + + DD Sbjct: 392 EMTVQVGDYKDVLSLVKVIDSMQFNGGSTLTGKALRYVTYNGFQSAPIFADVP---DDLP 448 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 + +I LTDG AK RG V+ IGV +E L+ + Sbjct: 449 RVVILLTDGNAQD---------SVVEAAKYSRGQDVFLIGVGSEFLRAELEEITGSSKRT 499 Query: 344 SVQNSRKLHDAFLRIGK 360 + ++ + D F +I + Sbjct: 500 VIYSTPQ--DLFNKITE 514 >gi|218458490|ref|ZP_03498581.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 220 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 32/234 (13%), Positives = 63/234 (26%), Gaps = 52/234 (22%) Query: 24 LLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRI 83 + PV+ G+ + K +L D NG + ++ Sbjct: 1 MAPVLLGAAGMAVHVGDMLLSKQQLQEAADS---AALATATALANGKIQTSEAEAYA--- 54 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK--DYNLSAVSRYEMPFIFCT 141 +N N L Q +I+ TS+++ K Y ++ Y++ Sbjct: 55 -RNFVAGQMANYL------QSGVDIKGGTSVNVQTSTSGKSTSYQVTVSPSYDLSVNPLM 107 Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-------------- 187 + H ++V S++ + M + LD S SM + Sbjct: 108 QAVGFKTQHLS-TSGTTVGGHSQTQGSISMFLALDKSGSMGESTATVNEDDPTETFTYDC 166 Query: 188 ----------------------GMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+ + + + S VR+G Sbjct: 167 NLHYNSKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLFSQLNSADPNAQYVRTG 220 >gi|116623631|ref|YP_825787.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226793|gb|ABJ85502.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 589 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 46/322 (14%), Positives = 97/322 (30%), Gaps = 37/322 (11%) Query: 4 LNIRNFFYNCKGS--ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 + ++N +G I + T ++ V+ ++GL I+ + V+ KL +D L A Sbjct: 1 MRVKNTSKRRQGGQAIVMFTLLVSSVLIPMVGLAIDGGRGYLVRLKLSSAVDGGALAAAR 60 Query: 62 KILNQENGNNGKKQKNDFSYRIIK-NIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 + + N + + + N F L + S + Sbjct: 61 LLGSGSNAAQQLSMAKATAAQFVNANFPAKFFGASLSGAANVCVDPGTDSSDPCGVGNGS 120 Query: 121 QHKDY---NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ + MP +F + + + S + +++V+D Sbjct: 121 GISTYKVRTVAVKATATMPTLFMRI--------IGMPTVTVSGSGTASRRDVRVILVMDR 172 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS------------ 225 S SM ++ + ++ + GLV + Sbjct: 173 SSSMGTYYSGINQTPPSINDMALKFVNSFSGAGEFGGRDEVGLVVYGGSGIVAYPPRDIT 232 Query: 226 ----KIVQTFPLAWGVQ---HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + P + +I + I + GS T + L AY + +A Sbjct: 233 KDYTDYTKFTPPDNNFKASGNIPKYIADITSGSNTGTAEALYLAYMTLRADAATNPDLAT 292 Query: 279 GHDDYKKYIIFLTDGENSSPNI 300 + I+ TDG + Sbjct: 293 KLN----VIVLFTDGIPNGVTA 310 >gi|297626137|ref|YP_003687900.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921902|emb|CBL56462.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 324 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 33/252 (13%), Positives = 78/252 (30%), Gaps = 38/252 (15%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + C+ + P+ VK+ + +++VLD+S SM Sbjct: 60 QWRRHVAVAMALCSLAAITGAWAVPV---GDVKVPRERAT---IVLVLDISQSMMATDVS 113 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +L + + + + + +V+ +VT S P + + I L Sbjct: 114 P-SRLAAEKDAATKFVAALPAQYNVS------VVTLSGHPNTLVPPTTDRAPVNQGIKTL 166 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T ++ + A + ++ L+DG + + + Sbjct: 167 ELADGTAIASSIDVGLEALKQAPAGDDGKQAPGL-----MVLLSDGSETGGG----DPVA 217 Query: 308 YCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC--ASPDRFYSVQNSRKL 351 ++AK++ +Y I + LK AS + ++ +L Sbjct: 218 SADKAKQQNVPIYTIAFGTQNGYVDLDGQRFNVAPDTDMLKRIADASSGKALDAASASQL 277 Query: 352 HDAFLRIGKEMV 363 D + + ++ Sbjct: 278 DDVYKTLTSDVG 289 >gi|296228118|ref|XP_002759733.1| PREDICTED: collagen alpha-6(VI) chain [Callithrix jacchus] Length = 2267 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 75/199 (37%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 807 LDVVFVIDSSGSIDHDEYNIMKDF---------MIGLVKKADVGKNRVRFGALKYADDPE 857 Query: 229 QTFPL-AWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + + I N G T ++ L ++ + +A+ + + Sbjct: 858 VLFYLGDFDTKLEVISVLQNDQPMGGNTYTSEALGFSDHMFTEAQGSRLNKGVP-----Q 912 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 913 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIAGANPVELLAMAGSSDKYFFV 968 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 969 ETFGGLKGIFSDVTASVCN 987 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 74/211 (35%), Gaps = 20/211 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+SV SSK D +D +D+ M+ + ++ + Sbjct: 981 VTASVCNSSKVDCEID---KVDLVFLMDGSNSIHSSDFTKMKEFLVSVVQDFDVSLNR-- 1035 Query: 215 VVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKE 271 VR G FS Q FPL G + I +I N G T L + Sbjct: 1036 -VRIGAAQFSHNYRQEFPLGTFIGEKEISFQIENIQQLGGNTHIGDALRQVGHYFRPDMG 1094 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + ++ LTDG++ E + RG +Y++G+ Q Sbjct: 1095 SRINTGTP-----QVLLVLTDGQSQD------EVAQAAEALRHRGIDIYSVGIGDVDDQQ 1143 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ + ++ +V N +L RI + + Sbjct: 1144 LIQITGAAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 61/355 (17%), Positives = 107/355 (30%), Gaps = 51/355 (14%) Query: 32 MGLVIETSHKFFVKAKLHYILDHSLLYTATK--------ILNQENGNNGKKQKNDFSYRI 83 + + I S L ++ A K N N G Q Sbjct: 282 LSMGINKSEILRNIQNLSPQTGNAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 341 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 T LR G A IE + + H + + F Sbjct: 342 ASEDNVTKAAVNLRREGVAIFTLGIEGARDSQLEKIASHPAEQYVSKLK---TFADLAAH 398 Query: 144 WCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVSLSMNDHFGPGMDKLG 193 IT +V + S+ L D+ +++D S S Sbjct: 399 NQTFLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFH 452 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----I 248 + E++ + P VR G V ++ F + ++ + + Sbjct: 453 EMKSFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEI--NKYSNKQDLGKAIENIRQ 507 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + L + + + AK++ + H ++ LT+G + L Sbjct: 508 MGGNTNTGAALNFTLSLLQKAKQQRGNKVPCH------LVVLTNG------MSKDSILEP 555 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEM 362 N + VYAIGV+ EA L+ A + R Y V + L D ++ +E+ Sbjct: 556 ANRLREEHIRVYAIGVK-EANKTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEI 609 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 77/202 (38%), Gaps = 29/202 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S+ M M +++ N V+ G+V FS Sbjct: 616 KEMKADIMFLVDSSGSIGPENFSKMKTF---------MKNLVSKSQIGANRVQIGVVQFS 666 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L I I++++ G TT + L + K + Sbjct: 667 GVNKEEFQLNRFMSQSDISNAIDQMVHIGETTLTGSALSFVSQYFSPTKGARPN------ 720 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE + +L ++ G I+Y++GV Q + P+ Sbjct: 721 -VRKFLILITDGEAQDVVKEPAVAL------RQEGIIIYSVGVFGSNVTQLEEISGRPEM 773 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 + V+N D I ++V Sbjct: 774 VFYVEN----FDILQHIEDDLV 791 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 61/204 (29%), Gaps = 28/204 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LDVS + + + + ++ N +R GLV +S++ Sbjct: 228 DVVFLLDVS------VNGSEENFDFLKEFLE---ESVSALDIKENCMRVGLVAYSNETKV 278 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ G+ I I L + T A K+ + ++ + + Sbjct: 279 INSLSMGINKSEILRNIQNLSPQTGNAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIA 335 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + +T +R G ++ +G++ Q K + P Sbjct: 336 VLVT------HRASEDNVTKAAVNLRREGVAIFTLGIEGARDSQLEKIASHP-------- 381 Query: 348 SRKLHDAFLRIGKEMVKQRILYNK 371 + + + K Sbjct: 382 AEQYVSKLKTFADLAAHNQTFLKK 405 >gi|115374996|ref|ZP_01462267.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|310820519|ref|YP_003952877.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|115368023|gb|EAU66987.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|309393591|gb|ADO71050.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 476 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 82/214 (38%), Gaps = 26/214 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++ +++ +++D S SM+ + KL A ++ R ++ ++K + R Sbjct: 86 EVPGARRSPVNLALIIDRSGSMSGY------KLEQAKQAARHLVTLLK------DDDRLA 133 Query: 220 LVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +V + S + L + + + I + T + GL ++ A+ Sbjct: 134 IVHYGSDVKSLPGLQATPANRERMIQYIEGIWDEGGTNISAGLLAGQAQVETARSDYRVN 193 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 +I ++DG+ + + D + + RG V +IGV + + ++ Sbjct: 194 R---------LILISDGQPTEGSTDEGSLKQVVKDIRTRGITVSSIGVGTDFNEDLMQAF 244 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A + ++++ KL F + ++ Q Sbjct: 245 AEYGAGSYGFLEDAGKLATLFQKDLQQASTQVAR 278 >gi|20093632|ref|NP_613479.1| Mg-chelatase subunit ChlI /Chld [Methanopyrus kandleri AV19] gi|19886501|gb|AAM01409.1| Mg-chelatase subunit ChlI and Chld (MoxR-like ATPase and vWF domain) [Methanopyrus kandleri AV19] Length = 818 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 75/194 (38%), Gaps = 24/194 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + ++ LD++ V+D S SM+ +D A ++ V R G+V Sbjct: 630 REEEVCLDIVYVIDTSGSMSGD---RIDAAKRAAIALAHF--------SVKAGDRVGIVG 678 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F++K + V+ I K+ L G T + E + D Sbjct: 679 FNTKAEIVVDITSDVEEIITKVMSLKPGGATDIGDAIRVGT-------ELFRRCGRPDRD 731 Query: 283 YKKYIIFLTDGENSSPNIDNK-ESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCA--S 338 + ++I LTDG + D + ++L A R G + IG++ + +++ A S Sbjct: 732 W--HMILLTDGVPTKGEPDPETKALSEATAASRMGVTISTIGIKLPEEGIRLIEHIAGIS 789 Query: 339 PDRFYSVQNSRKLH 352 R + + + +L Sbjct: 790 GGRSHHITDPEELT 803 >gi|114046077|ref|YP_736627.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113887519|gb|ABI41570.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 335 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 86/245 (35%), Gaps = 37/245 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + ++ + P ++ ++ ++ G D++M++D+S SM++ A Sbjct: 67 MLILSWLLIVTALAKPSILG---EVQTREAFGRDVLMLVDLSGSMDEA------DFTTAD 117 Query: 197 RSIREMLDI----IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 S L+ +K+ + R GL+ F P + E+ + Sbjct: 118 GSTLTRLNAAKNVLKTFIAKRSGDRFGLILFGDAAFIQTPFTADQQVWLSLLEEAQTGMA 177 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + + E ++ +I LTDG ++ ++ ++ Sbjct: 178 GQSTHLGDAIGLGIKVFEQNPQPSE---------QQVMIVLTDGNDTGSFVEPVDA---A 225 Query: 310 NEAKRRGAIVYAIGVQ-------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A RG +Y I + + ++ + + R + + +L A+ I K Sbjct: 226 KIAAARGIKIYTIAMGDPTHVGEQPMDMEVVQRVSQLTQARAFIAIDQAELDKAYQLIDK 285 Query: 361 EMVKQ 365 +Q Sbjct: 286 LEPQQ 290 >gi|149410544|ref|XP_001506183.1| PREDICTED: similar to protocadherin 9, partial [Ornithorhynchus anatinus] Length = 588 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 45/204 (22%), Positives = 78/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS LD++ V+D S S+ H I +L + PDV R GL+ Sbjct: 3 SSCDSKRLDLIFVIDSSRSVRPH------DFEKVKEFIVTILQFLDVAPDV---TRVGLI 53 Query: 222 TFSSKIVQTFPL-AWGVQ-HIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L +G + ++ + + G+ T + ++YA N F E + Sbjct: 54 QYGSTVKNEFSLKTYGRKSEVERAVKVMKRLGTGTMTGLAIQYAVNIAFSESEGARPLR- 112 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG P +A+ G +++AIGV + + Sbjct: 113 --ENVPRIIMIVTDGRPQDPVA------EVAAKARNSGILIFAIGVGQVDYNTLKSIGSK 164 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 165 PHQDHVFLVANFSQIESLTSVFQN 188 >gi|229493542|ref|ZP_04387327.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229319503|gb|EEN85339.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 326 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 74/247 (29%), Gaps = 32/247 (12%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P I + + +++V+DVSLSM +L Sbjct: 59 IPAILMVIALVF----FTVALAGPTADKKVPRNRATVILVIDVSLSMQATDVEP-TRLAA 113 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + + D + + GLV F+ + I+ L T Sbjct: 114 AQEAAKSFADGLTP------GINLGLVAFAGTASVLVSPTTNRDATKVAIDNLKLSERTA 167 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFYCNEA 312 + + + + + I+ L+DG+ + P + D + +A Sbjct: 168 TGEAI---FTSLQSIDTLSAVLGGSDQAPPARIVLLSDGKQTVPENSDDPRGGFTAARQA 224 Query: 313 KRRGAIVYAIGVQ--------------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 K +G + I D LK A S F++ + +L + Sbjct: 225 KDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIANLSGGSFFTASSLEELRQVYD 284 Query: 357 RIGKEMV 363 + +++ Sbjct: 285 TLEEQIG 291 >gi|114571147|ref|YP_757827.1| hypothetical protein Mmar10_2603 [Maricaulis maris MCS10] gi|114341609|gb|ABI66889.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 520 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 68/204 (33%), Gaps = 37/204 (18%) Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + E L + + R G S PL + + I + TT G Sbjct: 317 VSERLQALDKYDNGR-PNREGPNR-SCTTTPVTPLTSTERTVLNAIGDMGASGTTNIPNG 374 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN--SSPNID--------------- 301 + + I E A D+Y K ++ LTDG+N N D Sbjct: 375 VGWGIRLISPGAPFTEGSAWDDDEYIKAMVILTDGDNVMRGRNTDQMSDYEAYGFVADGR 434 Query: 302 ---------------NKESLFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCAS-PDRFY 343 + + C A+ G VY I Q ++ ++NCAS P ++ Sbjct: 435 LGRRSSSSNVLSNELDDRTEAACAYARSLGIRVYTITFQVNSSSTRSLMQNCASNPSLYF 494 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRI 367 +S L DAF I ++ R+ Sbjct: 495 DSPSSEALEDAFEMIAGDLTNLRL 518 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 86/237 (36%), Gaps = 43/237 (18%) Query: 8 NFFYNCKGSISILTAI-LLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 +G+++ + A+ L+PV + G ++ S +++L LD + L Sbjct: 12 RLCRETRGNVATIFALTLVPVALLSGG-AVDLSQSMNARSRLAQALDAAALAVGV----- 65 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 N N + + I N +++ ++ +++ IDD+ Sbjct: 66 -NTNLSSSEATGIANDFI------------AANYPGRELGVVQN---VNVYIDDETDTVT 109 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 +S +R + + + S V+ + + L+++MVLD + SM Sbjct: 110 VSGEARVRTTML-----GMIGLDYITVHWESEVQRARQ---RLELVMVLDNTGSMGGS-- 159 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 K+ S + I+ D + V+ GLV F++ + G H + Sbjct: 160 ----KIRNLRESAELLTGILFDAADDPSDVKIGLVPFAATVNV------GTNHARAW 206 >gi|317508725|ref|ZP_07966378.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316252973|gb|EFV12390.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 350 Score = 79.1 bits (193), Expect = 9e-13, Method: Composition-based stats. Identities = 42/246 (17%), Positives = 91/246 (36%), Gaps = 38/246 (15%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIRE 201 + + ++ ++ +++V+D+SLSM +D +D A Sbjct: 82 FLVGLLLLTVALSGPTTLAKVPKNRATVVLVIDISLSMMCDDVRPTRVDAARQA------ 135 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTP 257 IK + ++ ++ GLVTF+ + + ++ ++ I + T + Sbjct: 136 ---AIKFVDEMEPTLQLGLVTFAGTAQTLIAPSSDHEVVKRALDDAIRPDKLAARTATGE 192 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRR 315 G+ A +I K L +K I+ +DG+ + P+ + + EAK + Sbjct: 193 GIYTALQQIETLKGILGGASK---APPARIVLESDGKETVPDDLNAPRGAFTAAKEAKAK 249 Query: 316 GAIVYAIGVQ----------------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 +Y+I A D L+ A S +F++ + +L D + Sbjct: 250 EVPIYSISFGTASPIPYVNIQGSRVPVPADDASLQKVAELSGGKFFTASSLDQLTDVYSS 309 Query: 358 IGKEMV 363 + E+ Sbjct: 310 LNAEIG 315 >gi|159043014|ref|YP_001531808.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] gi|157910774|gb|ABV92207.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] Length = 320 Score = 79.1 bits (193), Expect = 9e-13, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 75/231 (32%), Gaps = 33/231 (14%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + A S L S++K+S G D+ +VLD+S SM D+ Sbjct: 65 LIWALVLLAISGPRDLAPVSALKVS-----GRDLAIVLDLSGSMVRDDFNLDDRAVTRLE 119 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 +++ + R LV F S+ P + + + +I G + ++T Sbjct: 120 AVKAV---GADFARRRAGDRLALVVFGSEAYFASPFTFDTESVARRIEEATIGISGRATS 176 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 I D + +I L+DG N++ + + A R G Sbjct: 177 --------ISDGLGLALKRLSTSTATSRVVILLSDGINNAGATNPRG---VAELAARYGV 225 Query: 318 IVYAIGVQAEA------------ADQFLKNCA--SPDRFYSVQNSRKLHDA 354 V+ I + + L+ + S + V+ + L Sbjct: 226 RVHTIALGPKDLTTAEVGERGVVDAATLRAISQISGGESFRVRTTEDLVAV 276 >gi|255557538|ref|XP_002519799.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223541038|gb|EEF42595.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 79.1 bits (193), Expect = 9e-13, Method: Composition-based stats. Identities = 49/202 (24%), Positives = 77/202 (38%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS GLD++ VLDVS SM DK+ + ML +IK + ++ R +V Sbjct: 55 SSNDRPGLDLVAVLDVSGSMAG------DKIEKVKTA---MLFVIKKLSPID---RLSVV 102 Query: 222 TFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TFS+ + PL + +++ IN L T T GL+ + D + Sbjct: 103 TFSADANRLCPLRQITENSQKDLEKLINGLNADGATNITAGLQTGLKVLSDRSLSGGRVV 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 I+ ++DGE ++ + + VY G + LK A Sbjct: 163 G--------IMLMSDGEQNAGGDAAQVPV--------GNVPVYTFGFGINHEPRVLKAIA 206 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 F VQ++ L AF Sbjct: 207 HNSIGGTFSDVQDTNNLSKAFS 228 >gi|315080701|gb|EFT52677.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] Length = 320 Score = 79.1 bits (193), Expect = 9e-13, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 33/245 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + +++ + + + ++ D +++ +DVS SM +L A + Sbjct: 59 LAMGLSVLSMAIIVLAFAQPKAYHEVPRDRATVVVAIDVSRSMVATDVEP-SRLSAAKTA 117 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ L +P N LV F++ P + I L +T G Sbjct: 118 AKDFL---GDLPPRFN---VSLVKFAASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEG 171 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + N + + +H I+ L+DG + + SL EA R+ Sbjct: 172 IYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGATNVG----RPSLEAAKEAGRQHVP 224 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 VY I L A S +S ++ +L D + I + + Sbjct: 225 VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSV 284 Query: 363 VKQRI 367 +++ Sbjct: 285 GYEKV 289 >gi|289425049|ref|ZP_06426826.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154027|gb|EFD02715.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|313764512|gb|EFS35876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772105|gb|EFS38071.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313801850|gb|EFS43084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807459|gb|EFS45946.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809969|gb|EFS47690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313812999|gb|EFS50713.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816053|gb|EFS53767.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818504|gb|EFS56218.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820270|gb|EFS57984.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822921|gb|EFS60635.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825147|gb|EFS62861.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827718|gb|EFS65432.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830298|gb|EFS68012.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833672|gb|EFS71386.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|314915506|gb|EFS79337.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314920024|gb|EFS83855.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925157|gb|EFS88988.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932038|gb|EFS95869.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955908|gb|EFT00308.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958220|gb|EFT02323.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960059|gb|EFT04161.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962858|gb|EFT06958.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967774|gb|EFT11873.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973303|gb|EFT17399.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975981|gb|EFT20076.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978482|gb|EFT22576.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314988184|gb|EFT32275.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] gi|314989987|gb|EFT34078.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078073|gb|EFT50124.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315084373|gb|EFT56349.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085714|gb|EFT57690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088866|gb|EFT60842.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096218|gb|EFT68194.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098476|gb|EFT70452.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101164|gb|EFT73140.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315105440|gb|EFT77416.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] gi|315108385|gb|EFT80361.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326130|gb|EGE67920.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA2] gi|327330198|gb|EGE71947.1| von Willebrand factor, type A [Propionibacterium acnes HL097PA1] gi|327331995|gb|EGE73732.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA3] gi|327443197|gb|EGE89851.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445982|gb|EGE92636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448038|gb|EGE94692.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450840|gb|EGE97494.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453083|gb|EGE99737.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453814|gb|EGF00469.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753528|gb|EGF67144.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754259|gb|EGF67875.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754490|gb|EGF68106.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760648|gb|EGF74215.1| von Willebrand factor, type A [Propionibacterium acnes HL099PA1] Length = 320 Score = 79.1 bits (193), Expect = 9e-13, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 33/245 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + +++ + + + ++ D +++ +DVS SM +L A + Sbjct: 59 LAMGLSVLSMAIIVLAFAQPKAYHEVPRDRATVVVAIDVSRSMVATDVEP-SRLSAAKTA 117 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ L +P N LV F++ P + I L +T G Sbjct: 118 AKDFL---GDLPPRFN---VSLVKFAASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEG 171 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + N + + +H I+ L+DG + + SL EA R+ Sbjct: 172 IYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGATNVG----RPSLEAAKEAGRQHVP 224 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 VY I L A S +S ++ +L D + I + + Sbjct: 225 VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSV 284 Query: 363 VKQRI 367 +++ Sbjct: 285 GYEKV 289 >gi|254820232|ref|ZP_05225233.1| hypothetical protein MintA_09906 [Mycobacterium intracellulare ATCC 13950] Length = 327 Score = 79.1 bits (193), Expect = 9e-13, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 79/216 (36%), Gaps = 31/216 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+D+S SM ++L A ++ + + + GLV F+ Sbjct: 92 IMLVIDMSQSMRATDVEP-NRLKAAEQAASQF------ASQLTPGINLGLVGFAGTPYLL 144 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P Q + + +L F +T + + A + I + G + I+ L Sbjct: 145 VPPTPQHQATIDALKKLDFADSTATGQAIFTALHAIGATA-----VTGGDNPPPARIVLL 199 Query: 291 TDGENSSPN--IDNKESLFYCNE-AKRRGAIVYAIGVQAEAAD--------------QFL 333 +DG + P+ D + ++ AK G + I + + + Sbjct: 200 SDGRENKPSNPSDPHDGVYTAARLAKDEGVPISTISFGTKGGEIEMDGQRVAVPVSTDQM 259 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 K A S + Y+ N +L+ ++ I E+ + + Sbjct: 260 KTIARLSGGQPYTATNIGELNKSYNAIENEIGYRTV 295 >gi|50842462|ref|YP_055689.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427042|ref|ZP_06428758.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130539|ref|YP_003581202.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840064|gb|AAT82731.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159511|gb|EFD07699.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375600|gb|ADD99454.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|332675379|gb|AEE72195.1| hypothetical protein PAZ_c10200 [Propionibacterium acnes 266] Length = 318 Score = 79.1 bits (193), Expect = 9e-13, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 33/245 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + +++ + + + ++ D +++ +DVS SM +L A + Sbjct: 57 LAMGLSVLSMAIIVLAFAQPKAYHEVPRDRATVVVAIDVSRSMVATDVEP-SRLSAAKTA 115 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ L +P N LV F++ P + I L +T G Sbjct: 116 AKDFL---GDLPPRFN---VSLVKFAASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEG 169 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + N + + +H I+ L+DG + + SL EA R+ Sbjct: 170 IYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGATNVG----RPSLEAAKEAGRQHVP 222 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 VY I L A S +S ++ +L D + I + + Sbjct: 223 VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSV 282 Query: 363 VKQRI 367 +++ Sbjct: 283 GYEKV 287 >gi|312621140|ref|YP_004022753.1| yd repeat protein [Caldicellulosiruptor kronotskyensis 2002] gi|312201607|gb|ADQ44934.1| YD repeat protein [Caldicellulosiruptor kronotskyensis 2002] Length = 2994 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 56/325 (17%), Positives = 108/325 (33%), Gaps = 44/325 (13%) Query: 43 FVKAKLHYILDHSLLYTA---TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN 99 K L +++ K L+ N Q S + N + + Sbjct: 630 LAKNPLTSAEKYAVSEDGKVFVKALSDANILIAPLQVKR-SDNVFINSLKGIVGKAIEIT 688 Query: 100 GFAQDINNIE-----RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCAN-----SS 149 DI E L+ + ++ Y ++ + P N + Sbjct: 689 AGGFDIKRAEIVVNYDEAELNGVDENNLMLYYVNYDKKILEPLEDVVVDTVYNRVSGKTE 748 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 H + + +D++ VLD S SM+ + P ++ + I+ I Sbjct: 749 HFSTFLLGDKNMPVDL-SKVDIVFVLDNSGSMSSN-DPNYYRIEATKKFIQN-------I 799 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFD 268 ++NN R GLV F S + L + + +N + G +T GL+ A Sbjct: 800 DELNN--RVGLVDFDSSVYVRSNLTSDKNKLLQALNAMRWTGGSTNIGGGLKAALELF-- 855 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK I+ L+DG +++ N E ++ +V I + + Sbjct: 856 ----------DQEQSKKIIVLLSDGYHNTGIHPND----VLPELIKQEIVVNTIALGKDC 901 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKL 351 + L + A + ++ V N+ L Sbjct: 902 DRELLHDIADKTKGDYFYVDNTGGL 926 >gi|47208180|emb|CAF89812.1| unnamed protein product [Tetraodon nigroviridis] Length = 1636 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 79/196 (40%), Gaps = 24/196 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ P + + I E L + P N VR G+V ++ Sbjct: 459 DIFFLIDQSGSI---HPPDFYDM---KKFILEFLQTFRVGP---NHVRIGVVKYADSPTL 509 Query: 230 TFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ +++ I + G T++ L++ + A H K Y Sbjct: 510 EFDLHTYTDVKSLEKAITNIHQVGGGTETGKALDFMRPQFDRAVTTRGHKVKE------Y 563 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 ++ +TDG ++ D ++ + +G +VYAIGV+ + L+ P R + V Sbjct: 564 LVVITDGNSTDKVKDP------ADKLRAQGVVVYAIGVKDAVEKELLEISGEPQRTFYVN 617 Query: 347 NSRKLHDAFLRIGKEM 362 N L I ++ Sbjct: 618 NFDALKPIKDDIITDI 633 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 82/233 (35%), Gaps = 24/233 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+ + ++ + K + D++ +LD S S+ M K Sbjct: 1014 FVDTFDALETLYKNISKVLCNHTKPVCEKQKA-DLVFLLDQSGSIQSDDYTTMKKFT--- 1069 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTT 253 +D+I ++V GL FSS F L + Q I I + T Sbjct: 1070 ------IDLINKFQISRDLVHVGLAQFSSTFKDEFYLNKFFDEQAISAHIKDMQQEEGGT 1123 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 L + H ++ + + ++ +TDG++ + L + Sbjct: 1124 LIGLALNSIRKYF-----EASHGSRKAEGISQNLVLITDGDSQDDVEEAARLL------R 1172 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 G V+AIG+ + L+ +P+ ++V+N KL ++ + + + Sbjct: 1173 GLGVEVFAIGIGNVHDLELLQIAGTPENVFTVKNFDKLEGIHQKVVDTICQSK 1225 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 80/201 (39%), Gaps = 25/201 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D++ ++D S S+ + K+ +S+ ++ + V G++ +S Sbjct: 649 KDVPGDLIFLIDSSGSI---YPEDYQKMKDFMKSL------VQKSNIGKDQVHVGVLQYS 699 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++ FPL + + + I+ + G T + + G Sbjct: 700 TEQKLVFPLIQYYTKDQLSKAIDDMQQIGGGTHTGEAIAVVSKYFDAQNG-------GRP 752 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 D K+ ++ +TDGE+ + + +G IVY+IGV A Q L+ DR Sbjct: 753 DLKQRLVVVTDGESQD------DVKLPAEALRAKGVIVYSIGVVAANTSQLLEISGDADR 806 Query: 342 FYSVQNSRKLHDAFLRIGKEM 362 Y+ ++ L D ++ E+ Sbjct: 807 MYAERDFDALKDLEKQMALEI 827 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 79/202 (39%), Gaps = 23/202 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVT 222 K D++ ++DVS S+ A S+ ++ + + V + R G++T Sbjct: 847 KKTAQADIIFLVDVSTSI---------LKEKAFPSVTVFMESVVNQSSVGPELTRFGVIT 897 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 FS+ + F L + + + + + G T + L+Y+ EH + Sbjct: 898 FSTGVQSIFTLKQYSSKRDVLQAVGAVTAPGGNTNTGDALDYSLQYFG-----KEHGGRA 952 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCAS 338 + ++ +TDG P+ S +++G V++IGV+ + +Q + Sbjct: 953 ALKVPQILMVITDGAAQEPSKLPGPS----EALRKQGVSVFSIGVKNASREQLDIMAGND 1008 Query: 339 PDRFYSVQNSRKLHDAFLRIGK 360 P R + V L + I K Sbjct: 1009 PSRVFFVDTFDALETLYKNISK 1030 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 49/138 (35%), Gaps = 16/138 (11%) Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNK 265 + R G V+ S + Q F L + ++ + + T++ A N Sbjct: 41 QPRQHPDRLGSVS-SDEPHQEFLLKYHIEKM-NLLAAFESFPYRNGGTETG----KAINF 94 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + ++ + + +TDG+++ + + E ++ G IV+AIGV Sbjct: 95 LRKQYFTKKAGSRADQRVPQIAVVITDGDSTD------DVVVPARELRKHGVIVFAIGVG 148 Query: 326 AEAADQFLKNCASPDRFY 343 + P + Sbjct: 149 NANQGELKSIANRPSERF 166 >gi|241204947|ref|YP_002976043.1| hypothetical protein Rleg_2227 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240858837|gb|ACS56504.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 429 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 51/425 (12%), Positives = 128/425 (30%), Gaps = 90/425 (21%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + G+ I+TA+L+ +F G+ ++ +H ++ +L+ D + + + + Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVGSIAE------ 61 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 +G D + + + ++ +++ + + + +S Sbjct: 62 -KSGAVAAAMTMSGNGTISLGKDDARSIFMSQISGELTDVQVDLGIDVTKTANKLNSQVS 120 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN------ 182 + F + I+ + ++ +D ++LD + SM Sbjct: 121 FSATVPTTF-------MRVLGRDSITISGTATAEYQTASFMDFYILLDNTPSMGVGATAT 173 Query: 183 -----------------------DHFGPGMDKLGVATR------SIREMLDIIKSIPDVN 213 +++ KLGV+ R + +E+ KS + Sbjct: 174 DVSTMEKNTSDTCAFACHETQNNNNYYNLAKKLGVSMRIDVVRQATKELTVTAKSTRVSS 233 Query: 214 NVVRSGLVTFSSKIV-----QTFPLAWGVQHIQEKINRLIF------GSTTKSTPGLEYA 262 N R G+ TF +K + ++ + + G + A Sbjct: 234 NQFRMGVYTFGTKAEDAKLTTISDPTDDLDKVRSYTDAVDLMTIPFQGYNNDQQTSFDSA 293 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG------------ENSSPNIDNKESLFYCN 310 ++ +K + F++DG + + +C Sbjct: 294 LTQMKTII-TTPGDGSTATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCQ 352 Query: 311 EAKRRGAIV---YAIGVQAEAADQF--------------LKNCASPDRFYSVQNSRKLHD 353 K + + Y + + ++ CASP ++ V + + D Sbjct: 353 PLKDKSIRIAVLYTTYLPLPKNSWYNTWIKPFQGEIPTKMQACASPGLYFEVTPTEGIAD 412 Query: 354 AFLRI 358 A + Sbjct: 413 AMKAL 417 >gi|118464548|ref|YP_883428.1| hypothetical protein MAV_4290 [Mycobacterium avium 104] gi|118165835|gb|ABK66732.1| conserved hypothetical protein [Mycobacterium avium 104] Length = 335 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 80/243 (32%), Gaps = 31/243 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 P ++ LL T+ +S I L+ +M+V+DVS SM P ++L A + Sbjct: 67 VPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMASTDVPP-NRLAAAKEA 125 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ D + + GLV F++ P ++ I+ L T + G Sbjct: 126 GKQFADQLTPAIN------LGLVEFAANATLLVPPTTNRAAVKAGIDSLQPAPKTATGEG 179 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFYCNEAKRRG 316 + A I + G I+ +DG + P + + AK G Sbjct: 180 IFTALQAIATVGSVMGG---GEGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEG 236 Query: 317 AIVYAIGVQAEAA---------------DQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGK 360 + I K C + + + + L + + + + Sbjct: 237 VQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQR 296 Query: 361 EMV 363 ++ Sbjct: 297 QIG 299 >gi|41409533|ref|NP_962369.1| hypothetical protein MAP3435c [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570936|sp|Q73UD3|Y3435_MYCPA RecName: Full=UPF0353 protein MAP_3435c gi|41398364|gb|AAS05985.1| hypothetical protein MAP_3435c [Mycobacterium avium subsp. paratuberculosis K-10] Length = 335 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 80/243 (32%), Gaps = 31/243 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 P ++ LL T+ +S I L+ +M+V+DVS SM P ++L A + Sbjct: 67 VPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMASTDVPP-NRLAAAKEA 125 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ D + + GLV F++ P ++ I+ L T + G Sbjct: 126 GKQFADQLTPAIN------LGLVEFAANATLLVPPTTNRAAVKAGIDSLQPAPKTATGEG 179 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFYCNEAKRRG 316 + A I + G I+ +DG + P + + AK G Sbjct: 180 IFTALQAIATVGSVMGG---GEGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEG 236 Query: 317 AIVYAIGVQAEAA---------------DQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGK 360 + I K C + + + + L + + + + Sbjct: 237 VQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQR 296 Query: 361 EMV 363 ++ Sbjct: 297 QIG 299 >gi|78776855|ref|YP_393170.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497395|gb|ABB43935.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 309 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 80/217 (36%), Gaps = 27/217 (12%) Query: 153 LLITSSVKISS---KSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDI 205 + + S +K + G ++ ++LD S SM D + + V +I Sbjct: 65 VALMSPIKDEPYELEPKDGYEIALILDASESMKAQGFDVQNQHLSRFDVVK-------EI 117 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + GLV F + PL + V + + +++L G + Y Sbjct: 118 VSDFISQRKNDNMGLVVFGAYSFIASPLTYDVNILNKILSQLQIG--------MAGKYTA 169 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + + ++ K K I LTDG ++ P +D + K+ G VY IG+ Sbjct: 170 LNTSLAQGANLLKQSKSKTKIAILLTDGYST-PQVDTITLDIALDMIKKEGIKVYPIGIG 228 Query: 326 AE---AADQFLKNC-ASPDRFYSVQNSRKLHDAFLRI 358 + LK S + ++ +L + + +I Sbjct: 229 MPHEYNTEALLKIANESGGVAFGASSAAELQEVYKKI 265 >gi|111024162|ref|YP_707134.1| hypothetical protein RHA1_ro07212 [Rhodococcus jostii RHA1] gi|110823692|gb|ABG98976.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 75/247 (30%), Gaps = 32/247 (12%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + + + + +++V+DVSLSM +L Sbjct: 59 IPALLMVLALVF----FSVALAGPTEDKRVPRNRATVILVIDVSLSMKATDVEP-TRLAA 113 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + + D + + GLV F+ + + I+ L T Sbjct: 114 AQDAAKSFADGLTP------GINLGLVAFAGTASVLVSPTTNREASKVAIDNLQLSERTA 167 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEA 312 + + + + + I+ L+DG+ + P D + +A Sbjct: 168 TGEAI---FTSLQSIDTLAAVLGGSDQAPPARIVLLSDGKQTVPENPDDPRGGFTAARQA 224 Query: 313 KRRGAIVYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 K + + I D L+ A S F++ + +L D + Sbjct: 225 KDKDVPISTISFGTSYGKVEIEDERIPVPVDDPSLREIANLSGGSFFTASSLEELRDVYD 284 Query: 357 RIGKEMV 363 + +++ Sbjct: 285 TLEEQIG 291 >gi|256426121|ref|YP_003126774.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256041029|gb|ACU64573.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 462 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 75/197 (38%), Gaps = 22/197 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 SK + L++ +VLD S SM+ + A + + + L+ + VN R + + Sbjct: 75 SKPRVPLNISLVLDRSGSMSGD---KIKYARQAAKFLIDQLNSTDHLSIVNYDDRVEVTS 131 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 S + + ++ I+++ +T + G+ Y ++ +++ Sbjct: 132 PSQSVK-------NKEALKAAIDKIHDRGSTNLSGGMLEGYTQVKSTRKEGYVNR----- 179 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCASPD- 340 ++ LTDG + D E N+ K G + GV A+ + L A Sbjct: 180 ----VLLLTDGLANQGITDPLELKRLAENKYKEDGIALSTFGVGADYNEDLLTMLAENGR 235 Query: 341 -RFYSVQNSRKLHDAFL 356 +Y + + K+ F Sbjct: 236 ANYYFIDSPDKIPQIFA 252 >gi|255566346|ref|XP_002524159.1| protein binding protein, putative [Ricinus communis] gi|223536577|gb|EEF38222.1| protein binding protein, putative [Ricinus communis] Length = 514 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 49/202 (24%), Positives = 74/202 (36%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS GLD++ VLDVS SM G + K+ A + + L I R +V Sbjct: 55 SSNDRPGLDLVAVLDVSGSME---GEKISKVKTAMLFMIKKLSSID---------RLSIV 102 Query: 222 TFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TFS + PL + ++ IN L T T GL+ N + D + + Sbjct: 103 TFSGDARRLCPLRQISETSQRELENLINGLKAEGATNITAGLKTGLNVLNDRRLSGGRVV 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 I+ ++DGE ++ + + V+ G + LK A Sbjct: 163 G--------IMLMSDGEQNAGGDAAQVPV--------GNVPVHTFGFGINHEPRVLKAIA 206 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 F VQN+ L AF Sbjct: 207 QNSVGGTFSDVQNTDNLSKAFS 228 >gi|314918209|gb|EFS82040.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] Length = 320 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 33/245 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + +++ + + + ++ D +++ +DVS SM +L A + Sbjct: 59 LAMGLSVLSMAIIVLAFAQPKAYHEVPRDRATVVVAIDVSRSMVATDVEP-TRLSAAKTA 117 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ L +P N LV F++ P + I L +T G Sbjct: 118 AKDFL---GDLPPRFN---VSLVKFAASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEG 171 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + N + + +H I+ L+DG + + SL EA R+ Sbjct: 172 IYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGATNVG----RPSLEAAKEAGRQHVP 224 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 VY I L A S +S ++ +L D + I + + Sbjct: 225 VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSV 284 Query: 363 VKQRI 367 +++ Sbjct: 285 GYEKV 289 >gi|110678222|ref|YP_681229.1| hypothetical protein RD1_0875 [Roseobacter denitrificans OCh 114] gi|109454338|gb|ABG30543.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 320 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 81/226 (35%), Gaps = 35/226 (15%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREM 202 A + L S++K++ G D+ +VLD+S SM D F +D V R Sbjct: 71 LLAAAGPRDLAPVSALKVT-----GRDLAIVLDLSGSMVRDDFD--LDGRQVTRRDAVAT 123 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + R LV F S+ P ++ V+ I +I G + ++T Sbjct: 124 VGA--DFARRRGGDRVALVVFGSEAYFAAPFSFDVEAIARQIEGAQIGISGRATS----- 176 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I D + + + +I L+DG N++ + + + G V+ I Sbjct: 177 ---ISDGLGLALKRMENSEAASRVVILLSDGVNNAGATNPRGVAELAAQM---GVRVHTI 230 Query: 323 GVQ------AEAADQFLKNCA--------SPDRFYSVQNSRKLHDA 354 + A+ ++ + + A S + V+ + L Sbjct: 231 ALGPKDLSTADPGERGVVDAATLRAISEISGGESFRVRTTEDLVAV 276 >gi|257052678|ref|YP_003130511.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691441|gb|ACV11778.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 1100 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 40/195 (20%), Positives = 68/195 (34%), Gaps = 28/195 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+ V+D S SM D A R + + + R+ LV+F+ Sbjct: 512 PIDLAFVIDESGSMGGAR--IQDAKASAKRFVGGLYED----------DRAALVSFAGGA 559 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L + I++L G T + GL+ A +++ E I Sbjct: 560 TLGQSLTTDHGAVNASIDQLNAGGGTNTGAGLQKAVDELTSNGEGDTQE----------I 609 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSV 345 I L DG + + A + IG+ Q L + A+ FY V Sbjct: 610 ILLADG----GTGLGPDPVTIAQTADEHRITINTIGMGTGIDAQELTSIADATGGEFYQV 665 Query: 346 QNSRKLHDAFLRIGK 360 +S +L + F R+ + Sbjct: 666 SDSSELPEVFDRVEQ 680 >gi|51244491|ref|YP_064375.1| hypothetical protein DP0639 [Desulfotalea psychrophila LSv54] gi|50875528|emb|CAG35368.1| hypothetical membrane protein (BatB) [Desulfotalea psychrophila LSv54] Length = 566 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 74/214 (34%), Gaps = 42/214 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D++ +D S SM ++L A ++ + + + R GL+ F+ Sbjct: 94 GIDILFAIDTSRSML-SQDLKPNRLERARYAVMDFVATLGG-------DRVGLIPFAGSS 145 Query: 228 VQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL Q + + L I T + A + D+ Sbjct: 146 YLMCPLTLDYQAFTDSLKALDTKIIPRRGTNIAKVIALAEKTVADSSNH----------- 194 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--------ADQFLKN 335 K +I LTDGEN + L + AK+ G +Y IGV A F+++ Sbjct: 195 -KILIILTDGENLQG-----DVLKAADLAKKNGLTIYTIGVGTAAGELIPGGPGGAFIRD 248 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 S ++ + + I ++ +L Sbjct: 249 --SSGKYVKSKLDEETL---QEIAEKTGGISVLL 277 >gi|52548788|gb|AAU82637.1| cell surface protein [uncultured archaeon GZfos18H11] Length = 1359 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 54/290 (18%), Positives = 104/290 (35%), Gaps = 41/290 (14%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSS--------- 149 N F ++I N I + DY + E +FC A S Sbjct: 872 NNFRENIGNDYLPLPTDITYEGLFYDYYFDTGEKAECQNLFCPSYSYALSKDPVSEVLGY 931 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS- 208 + + + S + S L++ +VLD+S SM F ++ + D KS Sbjct: 932 YLSVGLNSGIIESDFQRKKLNLALVLDISGSMGSSFDEYYYDRFGNHVAVNDTEDAEKSK 991 Query: 209 -----------IPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTT 253 + + + R GLV F++ P++ +Q ++ + + T Sbjct: 992 IEIAAAAIVALLDHLEDDDRLGLVLFNTGAELAEPVSLVGAKNMQKLKGDVLEISATGGT 1051 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE--SLFYCNE 311 + + G++ A ++D E + +Y+ IIFLTD +S + + N Sbjct: 1052 RLSAGMQMATE-LYD-----EFLEVNQSEYENRIIFLTDAMPNSGQTSEESLLGMIEANA 1105 Query: 312 AKRRGAIVYA--IGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLR 357 K VY IG+ + + ++ +YSV ++ + + Sbjct: 1106 NKN----VYTTFIGIGVDFNTELVEYITKIRGANYYSVHSATQFKERMDD 1151 >gi|78189841|ref|YP_380179.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172040|gb|ABB29136.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 329 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 85/224 (37%), Gaps = 38/224 (16%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +++ ++ + G+DM++ +D+S SM + +A ++ R +++ + R Sbjct: 86 TLRSTTAAARGIDMVLAIDISESMMQSQTDTQSRFEIARQAARNVVEQ-------RSNDR 138 Query: 218 SGLVTFSSKIVQTFPLAWGVQ----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 GLV F + PL + +R+I T L A N++ ++ +L Sbjct: 139 IGLVVFRGEAYTLSPLTRDHTVLSLLLDNLSSRIIQDDGTAIGSALLVALNRLQASESEL 198 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGV------Q 325 + +I LTDGEN++ + L A RRG Y + Sbjct: 199 -----------QMVILLTDGENNAGEVSP---LTAAALAARRGVRFYVLNVAFESVKDEN 244 Query: 326 AEAADQF---LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A + + L+ A + +++V N +L I Sbjct: 245 APRSALYAAELQEVARRTGGSYFTVNNKTELETTIASIAARAKN 288 >gi|190894968|ref|YP_001985261.1| hypothetical protein RHECIAT_PC0000634 [Rhizobium etli CIAT 652] gi|190700629|gb|ACE94711.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 444 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 62/367 (16%), Positives = 131/367 (35%), Gaps = 86/367 (23%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 R+ + G+++I+ A+ L + + +G + + V+ ++ LD +L+ +I N Sbjct: 16 FRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINN 75 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 E+ + K++ +D+ + ++N + + IE T ++ Sbjct: 76 SEDTDALKQKVSDWFHAQVENSYA---------------LGEIEIDT----------TNH 110 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM---- 181 N++A + +P F AN P+ + S+VK + S L++ +V+D S SM Sbjct: 111 NITATASGTVPTTFMKI---ANIDTVPVSVGSAVKGPATS--YLNVYIVIDRSPSMLLAA 165 Query: 182 ----NDHFGPGMD--------------------------------KLGVATRSIREMLDI 205 G+ + VA ++RE+LD+ Sbjct: 166 TTSGQSTMYSGIGCQFACHTGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLDM 225 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 I + ++ GL + + + +++++ T +T + Y Y Sbjct: 226 IDESDSNHERIKVGLYSLGDTTKEVLAPTLDTSNARKRLSD-DSYGLTSATS-MNYTYFD 283 Query: 266 IFDAKEKL----EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF----------YCNE 311 + A + + K ++ LTDG S K S +C Sbjct: 284 VALAALQKIVGTGGDGTSSANPLKLVLLLTDGVQSQRGWVVKNSSNLKKVAPLNPDWCGY 343 Query: 312 AKRRGAI 318 K + A Sbjct: 344 VKNKSAT 350 >gi|329850248|ref|ZP_08265093.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328840563|gb|EGF90134.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 575 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 34/144 (23%), Positives = 56/144 (38%), Gaps = 8/144 (5%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L + + R+ T T G+++ + E A KY+I L Sbjct: 431 MALTQDIAAARTYAARMAPAGNTNVTIGVQWGMEVLSPTAPFSEGGAFTDKAVLKYMIVL 490 Query: 291 TDGENSSPNIDNKES------LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFY 343 TDG N+ S C AK G V+ + V+ + L+NCAS +Y Sbjct: 491 TDGINTQNRWTTNNSQINARLALACTNAKNLGITVFTVRVE-QGDSTTLQNCASQTAYYY 549 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRI 367 ++ N+ +L +I K + K R+ Sbjct: 550 NLSNADQLPATMSKIMKSIRKVRL 573 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 32/269 (11%), Positives = 86/269 (31%), Gaps = 23/269 (8%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F + +G+++++ A + I +G ++ ++ +AKL +D + Sbjct: 8 LRKFSADIRGNVTMIVAFSVIPIVAAVGGGLDFANIQAARAKLQDAVDAGAIAATIDPTA 67 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + IK L+ + + + S ++ + Sbjct: 68 TPTQTTREAVAKKAFCGNIKQ------SGGLQNSFCNTTTLDTLGTASATLSTATSNNIM 121 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 ++ + +P + + +++ + VLD + SM+ + Sbjct: 122 TVTYSATAHVPTYLLGLVGIDTVDIDAVAKSGVSTSTAE------VAFVLDNTGSMSSN- 174 Query: 186 GPGMDKLGVATRSIREML-DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 +K+ S+ +L ++ S + LV F +++ + V + + Sbjct: 175 ----NKMTYLKSSLDAVLASMLDSTGKNYAKTKVALVPFDTQVSLS-----NVAGMVDYA 225 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + T S G A ++ Sbjct: 226 GDFSTVTPTYSCSGYSSAQCQVISENASA 254 >gi|242074986|ref|XP_002447429.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] gi|241938612|gb|EES11757.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] Length = 519 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 71/202 (35%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ GLD++ VLDVS SM G +DK+ A + + + L I R +V Sbjct: 55 STSDRSGLDLVAVLDVSGSMQ---GEKIDKMKTAMKFVVKKLSSID---------RLSIV 102 Query: 222 TFSSKIVQTFPLAWGVQ----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TF + PL + I+ L G T T GL+ + D + Sbjct: 103 TFMDTATRICPLRQVTDASQPELLGLIDALNPGGNTNITDGLQTGLKVLADRNLSSGRVV 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 ++ ++DG+ + + K A VY G A+ L A Sbjct: 163 G--------VMLMSDGQQNRGGN--------AADVKIGNAPVYTFGFGADYDPTVLNAVA 206 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 F V + KL AF Sbjct: 207 RNSMGGTFSVVNDVDKLSMAFS 228 >gi|171912901|ref|ZP_02928371.1| hypothetical protein VspiD_17015 [Verrucomicrobium spinosum DSM 4136] Length = 339 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 78/236 (33%), Gaps = 40/236 (16%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREML 203 + P + S ++ S+ G+ +++ DVSLSM F G ++ T + R ++ Sbjct: 73 AIIALARPQKVISYDELKSE---GIGIVVAFDVSLSMRIRDFYIGNRQVDRMTAAKRVLV 129 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D IK P+ R G+V F P + ++R+ G T Sbjct: 130 DFIKGRPN----DRIGIVAFGGAPYNPCPPTLDHDWLLNNMDRIQTGIMEDGT------- 178 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 I + K I+ +TDG N+S + +++ G ++AI Sbjct: 179 -AIGSGIAAAARRLDQLEVKSKVILLMTDGANNSGKLSPQDAARLAATL---GIRIHAIS 234 Query: 324 VQAEAADQF-------------------LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + L+ A F+ ++ L F + Sbjct: 235 IGTPGMHPIYMPNGPPINSGRQEFDPETLQEVANIGSGSFFRAEDLSTLERIFKTV 290 >gi|73998866|ref|XP_535021.2| PREDICTED: similar to A-domain containing protein similar to matrilin and collagen [Canis familiaris] Length = 788 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 77/201 (38%), Gaps = 21/201 (10%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++ S + + +D+ ++ G + + D + P+ V Sbjct: 50 ETIGKISAASKMMQCSAAVDILFLIDGSHSVGKGSFERSKHFAIMVCDALDINPER---V 106 Query: 217 RSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKL 273 R G + FSS FPL Q ++ KI R++F T++ L+Y + F Sbjct: 107 RVGAIQFSSAPHLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLLRRGFPGGR-- 164 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + + +I +TDG + + L K RG ++A+GV+ ++ L Sbjct: 165 ------NASVPQILIVITDGRSQGHVAVPTKQL------KERGVTIFAVGVRFPRWEE-L 211 Query: 334 KNCASPDRFYSVQNSRKLHDA 354 AS R V + ++ DA Sbjct: 212 HRLASEPREQHVLMAEQVEDA 232 >gi|52548946|gb|AAU82795.1| conserved hypothetical protein [uncultured archaeon GZfos1C11] Length = 438 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 63/362 (17%), Positives = 129/362 (35%), Gaps = 48/362 (13%) Query: 32 MGLVIETSHKFFVKAKLHYILDHSLLY----TATKILNQENGNNGKKQKNDFSYRIIKNI 87 + + + + + + +LLY + I ++E + +S KNI Sbjct: 28 ITICADYQNTVTELNETNNCRSKTLLYPPPTSGAGITHEEVYCPSPSYFSGYSTSFSKNI 87 Query: 88 -WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCA 146 + T ++ N F ++I N I + DY + E +FC A Sbjct: 88 GFSTGGAKDV--NNFRENIGNDYLPLPTDITYEGLFYDYYFDTGEKAECQNLFCPSYSYA 145 Query: 147 NSS---------HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 S + + + + + S L++ +VLD+S SM F Sbjct: 146 LSKDPVSEVLGYYLSVGLNTGIIESDFQRKKLNLALVLDISGSMGSSFDEYYYDRFGNHV 205 Query: 198 SIREMLDIIKS------------IPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQ 241 ++ + D KS + + + R GLV F++ P++ +Q ++ Sbjct: 206 AVNDTEDAEKSKIEIAAAAIVALLDHLEDDDRLGLVLFNTGAELAEPVSLVGAKNMQKLK 265 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T+ + G++ A ++D E + +Y+ IIFLTD +S Sbjct: 266 GDVLEISATGGTRLSAGMQMATE-LYD-----EFLEVNQSEYENRIIFLTDAMPNSGQTS 319 Query: 302 NKE--SLFYCNEAKRRGAIVYA--IGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAF 355 + + N K VY IG+ + + ++ +YSV ++ + + Sbjct: 320 EESLLGMIEANANKN----VYTTFIGIGVDFNTELVEYITKIRGANYYSVHSATQFKERM 375 Query: 356 LR 357 Sbjct: 376 DD 377 >gi|261409634|ref|YP_003245875.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261286097|gb|ACX68068.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 968 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 85/219 (38%), Gaps = 29/219 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGP--GMDKLGVATRSIREMLDIIKSIPDVNNV 215 + + + D+++++D S SM +GP G DK+ A + + +D++ Sbjct: 60 TGTPPANVVMPNDVVLIIDKSGSMAPTYGPNNGEDKMTNAKEAAKGFVDLMDMTKH---- 115 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 R +V S +FP + IN + G T + ++ A + D + Sbjct: 116 -RVAVVD-FSSSASSFPFTVDKDAAKSYINTINSGGGTATGNAIDAAVALLADHR----- 168 Query: 276 IAKGHDDYKKYIIFLTDGENSSP--NIDNKE-SLFYCNEAKRRGAIVYAIGVQAEAADQ- 331 + + I+ +TDG + N D + +L AK G I Y I + D Sbjct: 169 -----TEAQPVIVLMTDGAATESPKNTDPFDYALQRAQAAKDAGVIFYTIALLNPNEDPI 223 Query: 332 ------FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEMV 363 +KN A+ + V S+ L+ + I KE+ Sbjct: 224 TSAPNVLMKNMATTATHHHFVLGSKGLNQIYAAIVKEIG 262 >gi|320106407|ref|YP_004181997.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924928|gb|ADV82003.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 305 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 80/217 (36%), Gaps = 33/217 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ L M+M +D S S+ F D A R +++ML + L+ Sbjct: 70 ERQATTPLSMVMAIDTSESVITQFQTERD---AAKRFVKQMLREQDEMD---------LI 117 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +FS + + P + I L G T + A ++ +AK Sbjct: 118 SFSDTVDEIVPFTNDAGRMNAGIGNLHKGDATSLYDAIYLASQRLTEAKRDATRR----- 172 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI---------GVQAEAADQF 332 K ++ +TDG N++ + ++++ A GA +Y I G Sbjct: 173 ---KILVIVTDGGNTTKGMRYQQAVEAAERA---GAAIYPIIMVPIEADAGRNTGGEHAL 226 Query: 333 LKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 ++ + +++ V + L AF + ++ Q +L Sbjct: 227 IQMAQDTGGKYFYVLDKHDLDKAFAHLSDDLRTQYLL 263 >gi|307591433|ref|YP_003900232.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986287|gb|ADN18166.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 491 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 18/207 (8%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + ++++++D S SM+ G + ++ A S + D+I + R + Sbjct: 44 PPTLEKTPQEIVLLIDCSGSMD---GNKLSEVKTAATSFVQRQDLITN--------RIAV 92 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + F S + PL V +Q I L G T L A +++ +A LE Sbjct: 93 MGFGSGVQLGTPLTSDVNVLQTAIANLYDGGGTMMDQALTAATDQLHNASASLESAIPSG 152 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 ++ ++I+ TDG D +L A+ + A+ + + P+ Sbjct: 153 EN--QHILLFTDGV----AADPYNTLVAGQTAQNAQINIVAVATGDADTNFLSQLTGDPN 206 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + N+ AF K + +++ Sbjct: 207 LVFYA-NTGNFDAAFQAAEKAIYSKQL 232 >gi|162457601|ref|YP_001619968.1| hypothetical protein sce9315 [Sorangium cellulosum 'So ce 56'] gi|161168183|emb|CAN99488.1| conserved hypothetical protein with a vWF type A domain [Sorangium cellulosum 'So ce 56'] Length = 617 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 79/221 (35%), Gaps = 26/221 (11%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 P H + + S+ + + + L + +VLDVS SM G M + A R++ + Sbjct: 186 VPPGGGPVHVRVALRSTAQAPA-ARPHLSVHLVLDVSGSMQ---GEPMARARDAARALVD 241 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTF---PLAWGVQHIQEKINRLIFGSTTKSTPG 258 LD LVTFSS P+ I+ I+ + G T G Sbjct: 242 RLDANDDFS---------LVTFSSGADVRIDDGPVGPRRAAIKATIDGIREGGGTNIGQG 292 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L Y + + K ++ L+DG ++ ++ +A + G Sbjct: 293 LALGYAQ--------AARPGIPEGAVKVVLLLSDGRANAGITSSERLSRLALDAFQGGVQ 344 Query: 319 VYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLR 357 G+ A+ + AS +Y +++ ++ A Sbjct: 345 TSTFGLGADYDGALMSAIASDGAGGYYYLRDPDQIAPALAT 385 >gi|126731725|ref|ZP_01747530.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707891|gb|EBA06952.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 321 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 77/238 (32%), Gaps = 46/238 (19%) Query: 141 TFPWCANSSHAPLLITSSVK---ISSKSDIGLDMMMVLDVSLSM--NDHF--GPGMDKLG 193 PW + L ++ + + G D+ + LD+S SM +D + G + +L Sbjct: 61 LLPWATWALFV-LALSGPSTLEPVPALRVSGRDLAIALDLSGSMVRDDFYLDGQPITRLE 119 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFG 250 T R L+ F S+ P + + I +I I G Sbjct: 120 AVTTVG-------AEFARRRAGDRVALIVFGSEAYYAAPFTFDTEAIARRIEEATIGISG 172 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + L A ++ + + +I L+DG N++ + + Sbjct: 173 RATNISDALGLALKRMAGSDADT-----------RVVILLSDGANNAGATNPRGVAQLAA 221 Query: 311 EAKRRGAIVYAIGVQAEAAD------------QFLKNCA--SPDRFYSVQNSRKLHDA 354 + G V+ I + ++ D + L + S + V+ + L Sbjct: 222 QM---GVRVHTIAMGPKSVDEAEEGERGVVDAETLDAISKVSGGETFRVRTTEDLIAV 276 >gi|229495775|ref|ZP_04389503.1| BatB protein [Porphyromonas endodontalis ATCC 35406] gi|229317349|gb|EEN83254.1| BatB protein [Porphyromonas endodontalis ATCC 35406] Length = 338 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 71/197 (36%), Gaps = 24/197 (12%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 + + + P L K + G++ M+ LD+S SM + P ++L A Sbjct: 60 LMLLSIACLVAALARPQL--QERKENPGEAKGIEAMIALDISNSMLAEDLSP--NRLQFA 115 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 +I +LD + + G+V F+ P+ + ++ ++ + Sbjct: 116 KLTIHRLLDYLAES-------KVGVVVFAGNAYMQLPITTDLAMAKKMVDDANPDMLSNQ 168 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + A + + D K II TDGEN ++L +AK + Sbjct: 169 GTAIASAIDLSLGSFSDRH-------DVGKAIILFTDGENHEG-----DALEAAKKAKSQ 216 Query: 316 GAIVYAIGVQAEAADQF 332 G VY I V +E Sbjct: 217 GVKVYTIAVGSEEGAPI 233 >gi|154089854|emb|CAO81739.1| collagen type VI alpha 6 [Homo sapiens] Length = 631 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 51 LDVVFVIDSSGSIDYDEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 101 Query: 229 QTFPLA-WGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +G + I N G +T + L ++ + +A+ + + Sbjct: 102 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRLNKGVP-----Q 156 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 157 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIDGANPVELLAMAGSSDKYFFV 212 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 213 ETFGGLKGIFSDVTASVCN 231 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 73/211 (34%), Gaps = 20/211 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+SV SSK D +D +D+ M+ + + ++ + Sbjct: 225 VTASVCNSSKVDCEID---KVDLVFLMDGSTSIQPNDFKKMKEFLASVVQDFDVSLNR-- 279 Query: 215 VVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKE 271 VR G FS FPL G + I +I N T L + Sbjct: 280 -VRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFRPDMG 338 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + ++ LTDG++ E + RG +Y++G+ Q Sbjct: 339 SRINTGTP-----QVLLVLTDGQSQD------EVAQAAEALRHRGIDIYSVGIGDVDDQQ 387 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ + ++ +V N +L RI + + Sbjct: 388 LIQITGTAEKKLTVHNFDELKKVNKRIVRNI 418 >gi|167644155|ref|YP_001681818.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] gi|167346585|gb|ABZ69320.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] Length = 562 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 64/176 (36%), Gaps = 44/176 (25%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---KGHDDYKKYIIFLTD 292 ++ K++++I T GL + ++ + + + K I+ LTD Sbjct: 385 DRDTVKGKLDQMIASGNTNVAMGLIWGWHTLSKNAPFADGVDPATTVGKRTTKVIVLLTD 444 Query: 293 GENSSPNIDNKESLFY---------------------------------------CNEAK 313 G+N++ +N + Y C AK Sbjct: 445 GDNTNDTYNNPNASIYTGYGYITQGRLLNASNSPLGATSTATNRRDAIDSREARACTNAK 504 Query: 314 RRGAIVYAIGVQAEAADQ-FLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 G +YAIGV + + L++CAS P+ +Y V ++ +L F I + RI Sbjct: 505 AAGVQIYAIGVGVSSHSRGILQDCASKPEMYYDVTDAAQLASVFNTIAGSIQNLRI 560 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 72/201 (35%), Gaps = 30/201 (14%) Query: 8 NFFYNCKGSISILTAIL-LPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 + +G+I++ A+L +P+ + GL I+ S K +L LD + L A Sbjct: 16 RLGADERGAIAVQFALLLIPIAVLTFGL-IDISRASVQKRQLQDALDAATLMAAR----- 69 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + I E+ G N + + ++ Sbjct: 70 ---------STATTNADLDTIGDAALATEMAGLGVTFGPGNSSFVLGDNNTVVGTIQNVV 120 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + + +S++ P+ T++V S L++ +VLD + SM G Sbjct: 121 IKPIIS-----------NLWSSTNTPVSATATVMRSINH---LEVALVLDNTGSMASSLG 166 Query: 187 PGMDKLGVATRSIREMLDIIK 207 G K+ + + ++D++ Sbjct: 167 SGGSKITALITASKSLVDVLS 187 >gi|315498202|ref|YP_004087006.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416214|gb|ADU12855.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 489 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 62/171 (36%), Gaps = 33/171 (19%) Query: 231 FPLAWGVQHIQEKINRLIFGST-----TKSTPGLEYAYNKIFDAKEKLEHIA--KGHDDY 283 PL+ + I L+ T GL + N + E +A + + Sbjct: 318 QPLSNDKTVVTNSIKGLVNSIGSYKPDTFIPGGLHWGVNTLSPPAPFKEGMAYDSKNKEP 377 Query: 284 KKYIIFLTDGENSSPNIDNKESLFY-----------------------CNEAKRRGAIVY 320 KK I+ +TDG N+ + + + C AK + V+ Sbjct: 378 KKVIVLMTDGANTLYTNSSGQIVSAATGSPPTISSSLVAPTYTAQDNACKYAKGKNIEVF 437 Query: 321 AIGVQAEAADQF--LKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 IG+ LK+CA+ ++ QN+ L +AF IG ++ R++ Sbjct: 438 VIGLGVTDPTALSALKSCATDAQHYFDAQNANDLIEAFEIIGGKLSVVRLM 488 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 79/220 (35%), Gaps = 30/220 (13%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 FF + +G+ +++ + ++ + M ++ S+ ++K LD + L A Sbjct: 6 FFRDRRGNTAVMFGLFFSILIVSMAGAVDYSNVISRRSKAQDALDAATLAVAV----LRP 61 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + Q + EL +N I D + + Y ++ Sbjct: 62 ATVEQAQA----------AVKLRLDKELGDNPDKVVIG--------QFNYDTKTRTYYVT 103 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 A Y+ PF+ + + + ++ L++ +VLD + SM Sbjct: 104 AKGTYK-PFLLG------VVNIKEIPYEVISETIQAANGTLELALVLDNTDSMGQILNGS 156 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +L V + +++ + + + + V+ +V ++ + Sbjct: 157 STRLDVLKTAATNLVNTVMTSAN-KDYVKVAVVPYADYVN 195 >gi|90420796|ref|ZP_01228702.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] gi|90335087|gb|EAS48848.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] Length = 320 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 73/209 (34%), Gaps = 35/209 (16%) Query: 167 IGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G D+++ +D+S SM G + +L R+ + R GLV Sbjct: 88 SGRDLVLAIDLSGSMEQEDFVLDGRTVSRLDAVKAVARDFVRA-------RAGDRVGLVI 140 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F+ PL + V+ + I++ G + +ST I D D Sbjct: 141 FAETAYFAAPLTFDVEAVGRLIDQATIGISGRST--------AISDGLGLALKRLARSDA 192 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----------ADQ 331 + ++ L+DG N++ + +++ A+R G V+ I + Sbjct: 193 RSRVVVLLSDGVNNAGAVQPRDAGSL---AERLGIRVHTIALGPADLETDPKSRDAVDTA 249 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A S + V+ + L I Sbjct: 250 TLRAIAETSGGETFRVRTTDDLRQVARAI 278 >gi|90408685|ref|ZP_01216835.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] gi|90310199|gb|EAS38334.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] Length = 349 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 90/267 (33%), Gaps = 41/267 (15%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 +L+I + +Y + F + + + P L+ + IG D+M Sbjct: 51 ALNISTTQGASQLVATRWQKYLLCFTWIMILF---ALTKPTLLGPP---QVREQIGRDIM 104 Query: 173 MVLDVSLSMNDH-----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +V+D+S SM G + +L + + + R GL+ F Sbjct: 105 VVVDLSGSMQKKDFVNQQGHKISRLDAVKEVLTDFIKT-------RQGDRLGLILFGDAA 157 Query: 228 VQTFPLAWGVQH-----IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P +Q ++N + G +T + + +A + Sbjct: 158 FVQTPFTADHDVWLDLLMQTRVN--MAGKSTHLGDAIGLTIKRFNEATKNQ----TSEKT 211 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLK------ 334 +K I L+DG ++ + ++ AK +Y I + +Q L Sbjct: 212 REKVAIILSDGNDTGSYVPPIDAAMV---AKVNAVRIYMIAIGDPKSVGEQSLDMQTINK 268 Query: 335 -NCASPDRFYSVQNSRKLHDAFLRIGK 360 S + Y N ++L +A+ +I K Sbjct: 269 IASVSGGQAYQALNQQELLNAYAKIDK 295 >gi|314981157|gb|EFT25251.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091980|gb|EFT63956.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 320 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 33/245 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + +++ + + + ++ D +++ +DVS SM +L A + Sbjct: 59 LAMGLSVLSMAIMVLAFAQPKAYHEVPRDRATVVVAIDVSRSMVATDVEP-SRLSAAKTA 117 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ L +P N LV F++ P + I L +T G Sbjct: 118 AKDFL---GDLPPRFN---VSLVKFAASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEG 171 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + N + + +H I+ L+DG + + SL EA R+ Sbjct: 172 IYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGATNVG----RPSLEAAKEAGRQHVP 224 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 VY I L A S +S ++ +L D + I + + Sbjct: 225 VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSV 284 Query: 363 VKQRI 367 +++ Sbjct: 285 GYEKV 289 >gi|314923047|gb|EFS86878.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966819|gb|EFT10918.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093261|gb|EFT65237.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103481|gb|EFT75457.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327645|gb|EGE69421.1| von Willebrand factor, type A [Propionibacterium acnes HL103PA1] Length = 320 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 33/245 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + +++ + + + ++ D +++ +DVS SM +L A + Sbjct: 59 LAMGLSVLSMAIMVLAFAQPKAYHEVPRDRATVVVAIDVSRSMVATDVEP-SRLSAAKTA 117 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ L +P N LV F++ P + I L +T G Sbjct: 118 AKDFL---GDLPPRFN---VSLVKFAASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEG 171 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + N + + +H I+ L+DG + + SL EA R+ Sbjct: 172 IYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGATNVG----RPSLEAAKEAGRQHVP 224 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 VY I L A S +S ++ +L D + I + + Sbjct: 225 VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSV 284 Query: 363 VKQRI 367 +++ Sbjct: 285 GYEKV 289 >gi|301604858|ref|XP_002932077.1| PREDICTED: collagen alpha-1(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 1025 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 81/222 (36%), Gaps = 17/222 (7%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 + P +++ + D +D+ VLD S S+ P + Sbjct: 37 GFPAVGPTITVRPGPGPEPERVTFQ-DCPVDVFFVLDTSESVALRVKPFKTLVTQVKDFT 95 Query: 200 REMLDIIKS--IPDVNNVV-RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKS 255 ++ +D + S N+V +G + +S +++ L ++ +++ + + G T + Sbjct: 96 KKFIDKLTSRYYRCDRNLVWNAGALHYSDEVIMISSLTRDMKTLRDDVETVEYIGKGTHT 155 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKR 314 ++ ++ H KY+I +TDG + L NEAK Sbjct: 156 DCAIKRGIEEVLIGG--------SHQKENKYLIVVTDGHPLEGYKEPCGGLEDAANEAKH 207 Query: 315 RGAIVYAIGVQAEAADQFLKNCASPDRF---YSVQNSRKLHD 353 G V+++ + + L AS ++ ++ L D Sbjct: 208 LGIKVFSVAISPNHLEPRLSVIASDASHRRNFTATSAAGLTD 249 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 70/215 (32%), Gaps = 36/215 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFS 224 + D+ +++D S + + + ++ + +K+ + R +V +S Sbjct: 825 EGPADITLLVDSSTRVGNQH------FQTSKSFVKLLAQRFLKAKAPPSGSARVSVVQYS 878 Query: 225 SKIVQTFPLAWGVQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 Q + + ++ N T L A +++ Sbjct: 879 GLNQQKVEAQFVSNYTVLEVPVDNMQFINGATNVVSALR-AVTELYREDSLAG------- 930 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV----QAEAADQFLKNCA 337 KK ++F +DG N+ D K+ L +A+ G +Y + V L A Sbjct: 931 VSKKLLVF-SDG-NTQ---DEKDMLKAVQDARAAGIEIYVLAVGSRLNYPNLQVMLTGSA 985 Query: 338 SP-------DRFYSVQNSRKLHDA--FLRIGKEMV 363 + +R + V + L + I + + Sbjct: 986 ADITAPFPEERLFRVPDYPSLLQGVRYQSISRRIS 1020 >gi|296221263|ref|XP_002756657.1| PREDICTED: von Willebrand factor A domain-containing protein 2 [Callithrix jacchus] Length = 725 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 87/221 (39%), Gaps = 22/221 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+F P + + + KIS+ S + + +D+ ++ G + Sbjct: 13 FLFFRVPPSLSLQEVHVSRETIGKISAASKM-MWCSAAVDILFLLDGSHSIGKGSFERSK 71 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGST-T 253 + D + P+ VR G FSS FPL ++ Q ++ KI R+IF T Sbjct: 72 HFAITVCDALDISPER---VRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGRT 128 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ L+ ++ F + + +I +TDG++ + K Sbjct: 129 ETGLALKNLLHRGFPGGR--------NASVPQILIIITDGKSQGHVA------LPAKQLK 174 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 +G V+A+GV+ ++ L AS R V + ++ DA Sbjct: 175 EKGVTVFAVGVRFPRWEE-LHMLASEPREQHVLLAEQVEDA 214 >gi|282854077|ref|ZP_06263414.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583530|gb|EFB88910.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] Length = 318 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 33/245 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + +++ + + + ++ D +++ +DVS SM +L A + Sbjct: 57 LAMGLSVLSMAIMVLAFAQPKAYHEVPRDRATVVVAIDVSRSMVATDVEP-SRLSAAKTA 115 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ L +P N LV F++ P + I L +T G Sbjct: 116 AKDFL---GDLPPRFN---VSLVKFAASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEG 169 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + N + + +H I+ L+DG + + SL EA R+ Sbjct: 170 IYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGATNVG----RPSLEAAKEAGRQHVP 222 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 VY I L A S +S ++ +L D + I + + Sbjct: 223 VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSV 282 Query: 363 VKQRI 367 +++ Sbjct: 283 GYEKV 287 >gi|320160918|ref|YP_004174142.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] gi|319994771|dbj|BAJ63542.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] Length = 486 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 74/203 (36%), Gaps = 29/203 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + + K + +VLD S SM+ ++ + S +L + ++ Sbjct: 106 TAEYNVKEAPPFHICLVLDRSTSMHGA------RMDMVKSSALNLLKQFRKQDLIS---- 155 Query: 218 SGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +V FS + P + +I+ L G T+ GL+ ++ + Sbjct: 156 --VVAFSDRAEVVIPPTRVPDLAKDDHRISMLQVGGGTEIYQGLQLGIEQLRSIDPR--- 210 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + + +I LTDG D++ + EA + G + +G+ E D+ L Sbjct: 211 -------FMRQLILLTDG---HTYGDDEACIELAEEAAQDGIQINTMGIGHEWNDELLDK 260 Query: 336 CA--SPDRFYSVQNSRKLHDAFL 356 A S V + + L+ F Sbjct: 261 IATISGANSIFVTSPKDLNKFFE 283 >gi|90409149|ref|ZP_01217268.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] gi|90309757|gb|EAS37923.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] Length = 226 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 76/194 (39%), Gaps = 15/194 (7%) Query: 110 RSTSLSIII-DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + T + I + + + S++ + + + P+ ++ +S Sbjct: 29 QHTQVWIPLATGSKQSFLQQTQSKFGAFILLLAWTLVIVALAKPIYYGEPIRAQQQSR-- 86 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +M++ LD+S SMN+ +D V S+ + L +K R GL+ F+ Sbjct: 87 -NMILSLDLSGSMNE-VDMRLDGQSVTRLSLVKSL--LKKFVATRQGDRLGLILFADHAY 142 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL + ++ I ++I G G+ A I ++ + + ++ +I Sbjct: 143 LQTPLTFDLKTIAQRIEETQIGLV-----GIRTA---IGESIAIAIKRFVKNKNKQRILI 194 Query: 289 FLTDGENSSPNIDN 302 LTDG N++ I Sbjct: 195 LLTDGSNTAGRIKP 208 >gi|310829014|ref|YP_003961371.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] gi|308740748|gb|ADO38408.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] Length = 684 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 48/252 (19%), Positives = 93/252 (36%), Gaps = 61/252 (24%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K SS + D+++V+D S SM + + G K+ V + ++ + + + + Sbjct: 61 AKASSSAVGAADVVLVIDRSGSMGERYDGSRQTKMEVLKDTAKDFITQLSAQSPASQ--- 117 Query: 218 SGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +V+++S L +Q + I++L T+S GLE AY+ + A Sbjct: 118 VSVVSYASDSKTNIGLTSLDTQENIQSLNRAIDKLWASGATRSDLGLEDAYSVLGAA--- 174 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC----------NEAKRRG------ 316 K+++IFLTDGE +S + ++E + KR G Sbjct: 175 -------DSGNKQFVIFLTDGEPNSYSGFDREIAARAESTASIIKGEDLIKRDGRIFGDY 227 Query: 317 ----------------------AIVYAIGV----QAEAADQFLKNCASPDRFYSVQNSRK 350 A +++IG+ ++ +L S ++ Sbjct: 228 DGSLDDGSSHNPDWEFEGDPLSAEIFSIGILKSWSSQRVHDYLNYIDSQHSAALADTAQA 287 Query: 351 LHDAFLRIGKEM 362 L D F I ++ Sbjct: 288 LQDIFEAITHQI 299 >gi|90420284|ref|ZP_01228192.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90335618|gb|EAS49368.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 593 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 58/203 (28%), Gaps = 66/203 (32%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL I IN + T G+ + + + + + A D K ++ + Sbjct: 389 TPLTDNQATINAAINAMDADGETNIPEGIAWGWRLLSAREPFTQGRANDAKDNLKVLVLM 448 Query: 291 TDGENSSPNID------------------------------------------------N 302 TDG+N+ + + N Sbjct: 449 TDGDNNYGSDENDYNESGYGTFGYASTYDAYGNHSWGRIFDDTSTTSKRANRSSFVSAMN 508 Query: 303 KESLFYCNEAKR--------RGAIVYAIGVQAEAA---DQFLKNCASPD-------RFYS 344 ++ C K G +++ I + ++ CAS +Y Sbjct: 509 EKVAAICQNIKDDGRKATGEDGIVIFTIAFDLNDGSSVKKLMEQCASYGITDPTKKLYYD 568 Query: 345 VQNSRKLHDAFLRIGKEMVKQRI 367 ++S L AF I +++ RI Sbjct: 569 AKSSSDLMAAFDSITEQVSSLRI 591 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 87/228 (38%), Gaps = 24/228 (10%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 + F+ G+++++ + LP++ MG ++ S + K L + +D + L + N Sbjct: 13 KRFWTAKSGNVAVVFGLTLPILACCMGAAVDISGIYASKRNLQHSVDIAALAAGREYSNN 72 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQD-INNIERSTSLSIIIDDQHKDY 125 + ++ K + ++ + F+ D I N + ST L + +H Sbjct: 73 QQDSHLSKVAEGY-------FFENAGADARANTDFSYDGIFNEDGSTVLQVSAARRHPTI 125 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-- 183 + + PL S + + ++S ++++MVLD S SM Sbjct: 126 FGD--------LLSFVTAGELDWRAFPLAARSQIVVQNQS---IELVMVLDNSGSMTGRP 174 Query: 184 HFGPGMDKLGVATRSIREMLDII---KSIPDVNNVVRSGLVTFSSKIV 228 G G K+ + + + + V+ G+V F++ + Sbjct: 175 KSGGGKRKIDTIKEAAIGLTGQFLKGAASSTLKLPVQFGVVPFAAAVN 222 >gi|332524448|ref|ZP_08400660.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] gi|332107769|gb|EGJ08993.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] Length = 343 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 73/235 (31%), Gaps = 50/235 (21%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM ++L A + R ++ + VR G+V+F+ Sbjct: 88 IILAMDVSGSMRAEDVKP-NRLVAAQEAARAFVESLP------REVRVGVVSFAGTAAVV 140 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY---- 286 + I R T G+ + IF G + Sbjct: 141 QAPTTSRDDVFAAIERFQLQRGTAIGSGIVLSLATIFPDAGIDIQQITGQRTMPRMLGDP 200 Query: 287 ------------------IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 +I LTDG+ ++ + + A RG VY +GV Sbjct: 201 EKKAEFTPVPPGSYASAAMILLTDGQRTTG----PDPIDAAKMAADRGIRVYTVGVGTTQ 256 Query: 329 A---------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + L+ A + ++ + L + R+G MV +R Sbjct: 257 GEIIGFEGWSMRVRLDEDTLRQIAQMTTGEYFYAGTAEDLKKVYQRLGSRMVVER 311 >gi|323345325|ref|ZP_08085548.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] gi|323093439|gb|EFZ36017.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] Length = 332 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 52/252 (20%), Positives = 79/252 (31%), Gaps = 46/252 (18%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKL 192 M TF P + K S + G+D+M+ +DVS SM D ++ Sbjct: 57 MLLRCITFTLIIIVLARPQTHNAWDKKSVE---GIDIMLAMDVSTSMLAEDLHPNRIEAA 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + I P+ N GL F+ + P+ + L+ Sbjct: 114 KAVAA------EFIAGRPNDN----IGLTIFAGEAFTQCPMTTDHASLLN----LLQNVR 159 Query: 253 TKSTP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GL + K K +I LTDG N+ ++ S Sbjct: 160 TDIAARGLIQDGTAVGMGLANAVSRLKDSKAKSKVVILLTDGSNNMGDLSPMTS---AQI 216 Query: 312 AKRRGAIVYAIGVQAEAADQF---------------------LKNCA--SPDRFYSVQNS 348 AK G VY IGV ++ L + A + FY N+ Sbjct: 217 AKSLGIRVYTIGVGTNKVARYPMPVTGGIQYVNIPVEIDTKTLSDIAATTDGNFYRATNN 276 Query: 349 RKLHDAFLRIGK 360 R+L + I K Sbjct: 277 RELKQIYNDIDK 288 >gi|297620568|ref|YP_003708705.1| hypothetical protein wcw_0325 [Waddlia chondrophila WSU 86-1044] gi|297375869|gb|ADI37699.1| putative membrane protein [Waddlia chondrophila WSU 86-1044] Length = 374 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 56/232 (24%), Positives = 89/232 (38%), Gaps = 31/232 (13%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDH---FGPGMDKLGVATRSIREM-LDIIKSIPDV 212 V I S+ G+ + +VLD S SM++ F + K+ + + L + Sbjct: 90 QQVSIPSE---GIAIYLVLDQSGSMSEEVKVFRKTITKMDLLKEVTKGFVLGNKQEGLTG 146 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIF 267 GLVTF+ PL Q I +++++L + + T + N I Sbjct: 147 RPQDMMGLVTFARGAQVLAPLTLDHQAIIDQLSKLQYTTDLEQDGTAIGYAIYKTANLIA 206 Query: 268 DAK---EKLEHIAKGHDDYKK-YIIFLTDGENSSPNID------NKESLFYCNEAKRRGA 317 + E+LE K K +I +TDG + +D N E L AK+ G Sbjct: 207 ATRHYAEELEGAGKPAYTIKNSIMILVTDGLQAPNPLDQGKEFRNVELLDAAVYAKKLGV 266 Query: 318 IVYAIGVQAEAADQ-------FLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 VY I V+ A + +K + RFY V NS L + I + Sbjct: 267 KVYIINVEPRIASEEFSAHRLLMKKITELTGGRFYMVDNSLNLSSIYSEIDQ 318 >gi|302143246|emb|CBI20541.3| unnamed protein product [Vitis vinifera] Length = 630 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 57/299 (19%), Positives = 110/299 (36%), Gaps = 50/299 (16%) Query: 94 NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL 153 N + + + N ST + ++ ++ + E+P I A+ S Sbjct: 130 NGMGQAHDEPLVVNSAESTDPTSLVSLSRPQL-VTVKALPELPAI------SASESFRTF 182 Query: 154 LITSSVKISSKSD-------IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + +K + D +D++ VLDVS SM KL + R++ ++ + Sbjct: 183 AVLVGIKAPALLDDAHLLDRAPIDLVAVLDVSGSMAGS------KLSLLKRAVCFLIQNL 236 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 R +V+FSS + FPL G + IN L T GL+ Sbjct: 237 GPSD------RLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKKG 290 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFYCNEAKRRG---- 316 + + E + II L+DG+++ N++ +++ + R+G Sbjct: 291 VRVLEERSE---------QNPVASIILLSDGKDTYNCDNVNRRQTSHCASSNPRQGRQAI 341 Query: 317 AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGK--EMVKQRILYN 370 V+ G ++ + + S F +++ + DAF IG +V Q + Sbjct: 342 IPVHTFGFGSDHDSTAMHAISDESGGTFSFIESVATVQDAFAMCIGGLLSVVAQELRLT 400 >gi|288800165|ref|ZP_06405624.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333413|gb|EFC71892.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] Length = 323 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 51/220 (23%), Positives = 76/220 (34%), Gaps = 41/220 (18%) Query: 164 KSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 KS G+D+M+ +DVS SM + P +++ A + E + GL Sbjct: 74 KSVEGIDIMLAMDVSTSMLAEDLKP--NRMEAAKKVAAEFISD-------RANDNIGLTI 124 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAYNKIFDAKEKLEHIAKGHD 281 F+ + P+ + L+ G T GL + K Sbjct: 125 FAGEAFTQCPMTTDHASLLN----LLQGVRTDIASRGLIADGTAVGMGLANAVSRLKESK 180 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---------------- 325 K II LTDG N+ +I L AK G VY IGV Sbjct: 181 AKSKVIILLTDGSNNMGDISP---LTAAQIAKSLGIRVYTIGVGTNTVAPYPVTVGGTTQ 237 Query: 326 -----AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 AE + LK+ A + FY N+ +L + + I Sbjct: 238 YVNVPAEIDTKTLKDIAQSTDGGFYRATNNAELKEIYNDI 277 >gi|222616426|gb|EEE52558.1| hypothetical protein OsJ_34813 [Oryza sativa Japonica Group] Length = 517 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 87/227 (38%), Gaps = 37/227 (16%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSM-------NDHFGPGMDKLGVATRSIREMLDI 205 + + + ++ + +D++ VLDVS SM H +D L +A + I ++ Sbjct: 48 VTVEAPKVVAPEKRAPIDLVAVLDVSGSMNKEEFVRGKHMSSRLDLLKIAMKYIIKL--- 104 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG----VQHIQEKINRLIFGSTTKSTPGLEY 261 V + R +V+F+ +V + L + ++ +++L T P L+ Sbjct: 105 ------VRDADRLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASGNTDFRPALKK 158 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN----SSPNIDNKESLFYCNE----AK 313 A + ++ + G +I+ L+DG + S N + + A Sbjct: 159 AVEILDGRGKEEKKKRVG------FILLLSDGVDQFQYSRINWEKVAKSTDVDHSEVGAM 212 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA--SPDRF-YSVQNSRKLHDAFLR 357 R V+ G A L+ + S + + +N + +AF R Sbjct: 213 LRKYAVHTFGFSASHDPVPLRQISALSYGLYSFVCKNLDNITEAFAR 259 >gi|294673502|ref|YP_003574118.1| BatA protein [Prevotella ruminicola 23] gi|294472594|gb|ADE81983.1| putative BatA protein [Prevotella ruminicola 23] Length = 332 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 50/253 (19%), Positives = 76/253 (30%), Gaps = 42/253 (16%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKI-SSKSDIGLDMMMVLDVSLSM-NDHFGPGMDK 191 MP L + SK+ G+D+M+ +DVS SM + P ++ Sbjct: 52 LMPLSMLLRLLVFVMIVMVLARPQTRNSWDSKTVEGIDIMLAMDVSTSMLAEDLRP--NR 109 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + A + E + GL F+ + P+ + L+ Sbjct: 110 IEAAKQVASEFI-------IGRPNDNIGLAIFAGESFTQCPMTTDHASLLN----LLQNV 158 Query: 252 TTKSTP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T GL I K K +I LTDG N+ +I + Sbjct: 159 RTDIAARGLIEDGTAIGMGLANAVSRLKDSKAKSKVVILLTDGSNNRGDISPSTA---AE 215 Query: 311 EAKRRGAIVYAIGVQAEAADQF---------------------LKNCAS--PDRFYSVQN 347 AK G VY IGV + L AS FY N Sbjct: 216 IAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQYVNVPVEIDTKTLSEIASITEGDFYRATN 275 Query: 348 SRKLHDAFLRIGK 360 + +L + I + Sbjct: 276 TNELRKIYKEIDQ 288 >gi|312197712|ref|YP_004017773.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311229048|gb|ADP81903.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 372 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 45/189 (23%), Positives = 69/189 (36%), Gaps = 22/189 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++V I+S S +M+ LDVS SM P +++ A ++ IK+ P + Sbjct: 77 ATVPITSNSTT---IMLALDVSGSMCSTDVPP-NRITAAEKAATAF---IKAQPAGS--- 126 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE-- 274 R GLVTFS P Q + + + L T G+ + + I DA + Sbjct: 127 RIGLVTFSGIAGLLVPPTTDSQKLLDALQNLTTSRGTAIGQGILTSIDAIADADPSVAPT 186 Query: 275 ----HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 I+ LTDG N+ + +A R VY IG Sbjct: 187 GSAVSGNGTGPYAADVIVVLTDGANTQGV----DPQTAAKQAAARRLRVYTIGFGTTTPA 242 Query: 331 QFLKNCASP 339 + C S Sbjct: 243 PMV--CGSS 249 >gi|109090611|ref|XP_001091779.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 3 [Macaca mulatta] Length = 781 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 47/223 (21%), Positives = 85/223 (38%), Gaps = 34/223 (15%) Query: 150 HAPLLITSSVKISSKS---DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 H + ++SK +D++ +LD S S+ G + + D + Sbjct: 28 HVSRETIGKISVASKMMWCSAAVDILFLLDGSNSV------GKGSFERSKHFAITVCDAL 81 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGST-TKSTPGLEYAY 263 + VR G FSS FPL ++ Q ++ KI R+IF T + L+Y Sbjct: 82 DISTER---VRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGHTDTGLALKYLL 138 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N+ F + + ++ +TDG++ + K +G V+A+G Sbjct: 139 NRGFPGGR--------NASVPQILVIVTDGKSQGHVA------LPAKQLKEKGVTVFAVG 184 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDA----FLRIGKEM 362 V+ ++ L AS R V + ++ DA F + Sbjct: 185 VRFPRWEE-LHALASEPREQHVLLAEQVEDATNGLFSTLSNSA 226 >gi|109090613|ref|XP_001091550.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 1 [Macaca mulatta] Length = 725 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 47/223 (21%), Positives = 85/223 (38%), Gaps = 34/223 (15%) Query: 150 HAPLLITSSVKISSKS---DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 H + ++SK +D++ +LD S S+ G + + D + Sbjct: 28 HVSRETIGKISVASKMMWCSAAVDILFLLDGSNSV------GKGSFERSKHFAITVCDAL 81 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGST-TKSTPGLEYAY 263 + VR G FSS FPL ++ Q ++ KI R+IF T + L+Y Sbjct: 82 DISTER---VRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGHTDTGLALKYLL 138 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N+ F + + ++ +TDG++ + K +G V+A+G Sbjct: 139 NRGFPGGR--------NASVPQILVIVTDGKSQGHVA------LPAKQLKEKGVTVFAVG 184 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDA----FLRIGKEM 362 V+ ++ L AS R V + ++ DA F + Sbjct: 185 VRFPRWEE-LHALASEPREQHVLLAEQVEDATNGLFSTLSNSA 226 >gi|242097078|ref|XP_002439029.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] gi|241917252|gb|EER90396.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] Length = 607 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 88/259 (33%), Gaps = 42/259 (16%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY--EMPFIFCTFPWCANSSH 150 ++ F D I +++ + S Y E P +S Sbjct: 36 KSSAPNKMFNDDEEPIAPASNAGKQVRGFSDVGKASVKPYYPKEAPL---------GAST 86 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 +L+ S S+ LD+++VLDVS SM D +DKL A R I + L + Sbjct: 87 VRVLLDVSSSSSTAGRAALDLVVVLDVSGSMRDF--GRLDKLKSAMRFIIKKLAPMD--- 141 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R +VTF+ + PL V + + ++ L+ T GL+ + Sbjct: 142 ------RLSVVTFNGGATRECPLRAMSEDAVPVLTDIVDGLVARGGTNIEAGLKMGLQVL 195 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + A +I ++DGE +S D + VY + + Sbjct: 196 DGRRYTGARTAG--------VILMSDGEQNSG--DATRVRNP------QNYPVYTLSFGS 239 Query: 327 EAADQFLKNCASPDRFYSV 345 A L+ A Y+ Sbjct: 240 NADMNLLQKLAGGGGTYNP 258 >gi|256787646|ref|ZP_05526077.1| lipoprotein [Streptomyces lividans TK24] gi|289771539|ref|ZP_06530917.1| lipoprotein [Streptomyces lividans TK24] gi|289701738|gb|EFD69167.1| lipoprotein [Streptomyces lividans TK24] Length = 532 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 73/216 (33%), Gaps = 29/216 (13%) Query: 151 APLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 L+ ++ + V+D+S SM + +L +A ++ M D + Sbjct: 163 WSLVRVGLATRPAERQSERPPAALTFVIDISGSMGEP-----GRLDLAQEALGTMTDRL- 216 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + LVTFS + P+ + + I+ L +T G+E Y Sbjct: 217 -----RDDDSVALVTFSDEAETVLPMTRLGDHRGRVHDAIDGLEPTDSTNLGAGMETGYE 271 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIG 323 + + + ++ ++D ++ + D L E + G ++ +G Sbjct: 272 TAVEGRREGATNR---------VVLVSDALANTGDTDADTILERIATERREHGITLFGVG 322 Query: 324 VQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 V ++ D ++ A V + + F Sbjct: 323 VGSDYGDALMERLADKGDGHTTYVSTTEDAREVFSE 358 >gi|21221175|ref|NP_626954.1| lipoprotein [Streptomyces coelicolor A3(2)] gi|6969217|emb|CAB75310.1| putative lipoprotein [Streptomyces coelicolor A3(2)] Length = 532 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 73/216 (33%), Gaps = 29/216 (13%) Query: 151 APLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 L+ ++ + V+D+S SM + +L +A ++ M D + Sbjct: 163 WSLVRVGLATRPAERQSERPPAALTFVIDISGSMGEP-----GRLDLAQEALGTMTDRL- 216 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + LVTFS + P+ + + I+ L +T G+E Y Sbjct: 217 -----RDDDSVALVTFSDEAETVLPMTRLGDHRGRVHDAIDGLEPTDSTNLGAGMETGYE 271 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIG 323 + + + ++ ++D ++ + D L E + G ++ +G Sbjct: 272 TAVEGRREGATNR---------VVLVSDALANTGDTDADTILERIATERREHGITLFGVG 322 Query: 324 VQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 V ++ D ++ A V + + F Sbjct: 323 VGSDYGDALMERLADKGDGHTTYVSTTEDAREVFSE 358 >gi|163735880|ref|ZP_02143308.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] gi|161390816|gb|EDQ15157.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] Length = 320 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 44/226 (19%), Positives = 81/226 (35%), Gaps = 35/226 (15%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREM 202 A S L S++K++ G D+ +VLD+S SM D F +D V R Sbjct: 71 LLAASGPRDLAPVSALKVT-----GRDLAIVLDLSGSMVRDDFD--LDGRQVTRREAVAT 123 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + R LV F S+ P ++ V+ I +I G + ++T Sbjct: 124 VGA--DFARRRGGDRVALVVFGSEAYFAAPFSFDVEAIARQIESAQIGVSGRATS----- 176 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I D + + + +I L+DG N++ + + + G V+ I Sbjct: 177 ---ISDGLGIALKRMENSEAASRVVILLSDGVNNAGATNPRGVAELAAQM---GVRVHTI 230 Query: 323 GVQ------AEAADQFLKNCA--------SPDRFYSVQNSRKLHDA 354 + A+ ++ + + A S + V+ + L Sbjct: 231 ALGPKDLSSADPGERGVVDAATLRAISEISGGESFRVRTTEDLVAV 276 >gi|91975399|ref|YP_568058.1| hypothetical protein RPD_0919 [Rhodopseudomonas palustris BisB5] gi|91681855|gb|ABE38157.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 435 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 64/417 (15%), Positives = 136/417 (32%), Gaps = 61/417 (14%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F + G+I+++ AI L I +G ++ ++ V+AKL D ++L + + Sbjct: 16 RFASDRSGNIAVIFAIALLPILGFIGAAVDYTNASRVRAKLESAQDAAVLLAVSNSAINK 75 Query: 68 NGNNGKKQKNDFSYRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH-KDY 125 + + F + + EN + + S+ + +D Sbjct: 76 TVADAQADAVQFFNATLDGYGLSATIDLSVSENDGKRSAVSSFSSSVKTHFLDMIGYPTL 135 Query: 126 NL--SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL-DVSLS-- 180 + + S +P + NS + T+S + ++ D+S S Sbjct: 136 AIGNRSTSTVSLPVYVDFYLLLDNSPSMGVAATTSDIATMVANTSDQCAFACHDLSTSNN 195 Query: 181 ---MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 + G M ++ V ++++ + ++ V N R G+ TF S L Sbjct: 196 YYNLAKKLGVTM-RIDVVRQAVQRLTTTATAMSAVTNQFRMGVYTFGSSC-TAIGLTTVA 253 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYN----KIFDAKEKLEHIAKG------HDDYKKYI 287 + T YN FD + A +K++ Sbjct: 254 NLSSSMSSVQTSVGTIDLMTIPYQGYNNDQCTDFDGSLTAINSAIPSPGSGISTQPQKWL 313 Query: 288 IFLTDG--ENSSPNIDNKESLF-----------YCNEAKRRGAIV---YAIGVQAEAADQ 331 F++DG + + P+ K ++ C K RG + Y + Sbjct: 314 FFVSDGVADANYPSTCTKPTVSGGRCQEPLTVAQCTAIKSRGIQIAVLYTTYLALPTNSW 373 Query: 332 F-----------------------LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +++CASP ++ V ++ + +A + K+ V + Sbjct: 374 YNTYIAPFNPGPYGPSTNSQIAANMQSCASPGFYFEVSPTQGIAEAMDALFKKAVAK 430 >gi|311106403|ref|YP_003979256.1| von Willebrand factor type A domain-containing protein 2 [Achromobacter xylosoxidans A8] gi|310761092|gb|ADP16541.1| von Willebrand factor type A domain protein 2 [Achromobacter xylosoxidans A8] Length = 340 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 79/218 (36%), Gaps = 39/218 (17%) Query: 159 VKISSKSDIGL-DMMMVLDVSLSMNDHF-----GPGMDKLGVATRSIREMLDIIKSIPDV 212 V+ + + D+++ +D+S SM G D+L + + +D Sbjct: 81 VEPPLEHRQPVRDLLLAIDISQSMETEDFVAPDGRREDRLSGVKAVVADFID-------R 133 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R GL+ F + PL ++ +++ + G T + A + A Sbjct: 134 RQDDRLGLIVFGTAAYPQAPLTQDHATLKLLLGQVSTRMAGPNTAIGDAIGVAIKQFEHA 193 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---A 326 E + +I LTDG ++ + + + A R +V+ +G+ A Sbjct: 194 GEH-----------DQVLILLTDGNDTGSAVPPDRA---ASMAAARHIVVHTVGIGDPQA 239 Query: 327 EAADQF----LKNCA--SPDRFYSVQNSRKLHDAFLRI 358 E ++ L+ A + RF+ Q+ L + + Sbjct: 240 EGEEKVDFDALRAIAAKTGGRFFPAQDQASLRQVYAEL 277 >gi|281347736|gb|EFB23320.1| hypothetical protein PANDA_001404 [Ailuropoda melanoleuca] Length = 708 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 41/189 (21%), Positives = 73/189 (38%), Gaps = 21/189 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + +DV ++ G + + D + P+ VR G + FSS Sbjct: 2 IQCFAAVDVLFLIDGSHSVGKGSFERSKHFAIMVCDALDINPER---VRVGALQFSSAPR 58 Query: 229 QTFPLAW--GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 FPL Q ++ KI R++F T++ L+Y + F + + Sbjct: 59 LEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLLRRGFPGGR--------NASVPQ 110 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ +TDG + P + K RG V+A+GV+ ++ L AS R V Sbjct: 111 ILVVITDGRSQGPVE------LPAKQLKERGVTVFAVGVRFPRWEE-LHTLASEPREQHV 163 Query: 346 QNSRKLHDA 354 + ++ DA Sbjct: 164 LMAEQVDDA 172 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 25/128 (19%), Positives = 46/128 (35%), Gaps = 16/128 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ +R + PDV GLV + S++ Sbjct: 487 SLDLLFMLDASASVGSE------NFAQMQSFVRSLTLQFDVNPDVTQ---VGLVVYGSRV 537 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L G + +++ + S A ++D ++ A+ K Sbjct: 538 QTAFGLDTHLGRAAVLRAMSQAPYLGGVGSAG---TALLHVYDKVMTVQRGARPGVP--K 592 Query: 286 YIIFLTDG 293 ++ LT G Sbjct: 593 AVVLLTGG 600 >gi|313126713|ref|YP_004036983.1| mg-chelatase subunit chld [Halogeometricum borinquense DSM 11551] gi|312293078|gb|ADQ67538.1| Mg-chelatase subunit ChlD [Halogeometricum borinquense DSM 11551] Length = 785 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 45/261 (17%), Positives = 89/261 (34%), Gaps = 43/261 (16%) Query: 106 NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS------SHAPLLITSSV 159 ++ ++ + + N+ A+ Y + NS + L V Sbjct: 304 EDLSPYYAVVVQDMPASRIGNVDALQEYVIDGNGLLVVGGPNSFENGNYESSSLASMLPV 363 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++++ +DVS S ++ A L +K + D N G Sbjct: 364 TTGDGRGQSTNIVLSIDVSGSSKGGM-----RVQKAVS-----LSALKQLGDENE---VG 410 Query: 220 LVTFSSKIVQ---TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +V F+ + PL + + ++I RL G T GL A + D Sbjct: 411 IVGFNHRTYSVAERQPLGPNREALADRIRRLQAGGATDIAGGLRGAGKMLGDDPGT---- 466 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 +I ++DG + +ES+ Y + + G + AIG ++ L+ Sbjct: 467 ----------VILISDGHDR-----VEESISYAKQLRSEGKRIIAIGAGKNPNEKNLRTI 511 Query: 337 --ASPDRFYSVQNSRKLHDAF 355 AS ++ + +L+ F Sbjct: 512 ARASGGSYFRATETNRLNILF 532 >gi|86361153|ref|YP_473040.1| hypothetical protein RHE_PF00423 [Rhizobium etli CFN 42] gi|86285255|gb|ABC94313.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 545 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 50/328 (15%), Positives = 116/328 (35%), Gaps = 26/328 (7%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R+ + G++ I+ A+ L + + +G + + V+ ++ LD +L+ +I N Sbjct: 107 LRSLERDRGGNVGIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINN 166 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQ-TDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 E+ + K++ +D+ + + N + D + + N +T + I + Sbjct: 167 TEDTDALKEKVSDWFHAQVDNSYTLGDIDIDTVNHNITATANGTVPTTFMKI---ANIET 223 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 +S S + P + + +L+ ++ S G+ D Sbjct: 224 VPVSVASAVKGPATSYLNVYVVIDTSPSMLLAATTSGQSTMYSGIGCQFACHT----GDA 279 Query: 185 FGPGMDK----------------LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 G K VA ++RE+LD+I + + ++ GL + + Sbjct: 280 HTVGKTKYANNYAYSTAKKIKLRADVAGDAVREVLDMIDESDENHERIKVGLYSLGDTLS 339 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK--LEHIAKGHDDYKKY 286 + + ++ +G T+ ++ Y + K+K K Sbjct: 340 EVLAPTLSTDTARTRLADASYGLTSATSKAATYFDVSLATLKQKVGAGGDGTSSGSPLKL 399 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKR 314 ++ LTDG S ++ +A Sbjct: 400 VLLLTDGVQSKREWVTDGVVWSSGKAIS 427 >gi|307153048|ref|YP_003888432.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306983276|gb|ADN15157.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 413 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 73/216 (33%), Gaps = 28/216 (12%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 NSS L + S + + L++ +VLD S SM L ++ I Sbjct: 19 NSSQRQLSLAISAQGEPGRTLPLNLCLVLDHSGSMGG------RPLETVKKA------AI 66 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + + +N R ++ F + P + + E+I L T GL+ Sbjct: 67 ELVKQLNPEDRVSVIAFDHRAKVIVPNQGIEDLNTVIEQIKALRAAGGTAIDEGLKLGIK 126 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + A G + I LTDGEN DN+ L A + +G Sbjct: 127 E----------SALGKQERVSQIFLLTDGENEHG--DNERCLKLAQVASDYNITLNTLGF 174 Query: 325 QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A + ++N K + F R+ Sbjct: 175 GNHWNQDVLEKIADSAGGSLSYIENPEKALEEFSRL 210 >gi|254460794|ref|ZP_05074210.1| conserved hypothetical protein [Rhodobacterales bacterium HTCC2083] gi|206677383|gb|EDZ41870.1| conserved hypothetical protein [Rhodobacteraceae bacterium HTCC2083] Length = 480 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 66/487 (13%), Positives = 141/487 (28%), Gaps = 162/487 (33%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + +F + GS+ I ++ +I + G+ ++ + L + LD ++L A + Sbjct: 29 LASFKDDESGSLVIFAVFMVLMILTIGGIGVDLMRSERDRTVLQHTLDRAILSAA----D 84 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + +D+ G ++N+ ++ Sbjct: 85 LDQTQTPQAVVDDY----------------FETAGLESFLSNVTVDQGINYKTVGAEAQS 128 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + F A + + +++ MVLD+S SM Sbjct: 129 ITTT-----------AFMKMAGVDTLNATAAGVAE---ERIANVEISMVLDISGSMG--I 172 Query: 186 GPGMDKLGVATRSIREML------DIIK-SIPDVNNVVRSGLVTFS-------------- 224 G M +L A S + D++ S+ + V +G ++ Sbjct: 173 GSKMTQLRSAATSFVNTVLSPENEDLVSVSLVPYSQHVNAGPKIYNELNTNHRHNYSHCV 232 Query: 225 ---SKIVQTFPLA------------WGVQH--------------------------IQEK 243 L W + + Sbjct: 233 EMADSAYSETELDLSVTYDQMQHFQWNYSGANQLTDTICPRYSYERITAFSQDASALNAQ 292 Query: 244 INRLIFGSTTKSTPGLEYAYNKI----------------FDAKEKLEHIAKGHDDYKKYI 287 I +L + T+ G+++A + D+ A + K + Sbjct: 293 IAQLQPRAGTQIFMGMKWAAAMLDPAFNPVVNALVTSNDIDSVFDNRPAAFDDTETLKTV 352 Query: 288 IFLTDGENSSP-----------------------------------------NIDNKESL 306 + +TDG+NSS D + Sbjct: 353 VLMTDGKNSSSMRIKSWAYDSSSDYYHWSRYNLWYYLRRNVNRHYHSRYYWFTHDAAQGD 412 Query: 307 F----YCNEAKRRGAIVYAIGVQAEA-ADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGK 360 CN +K G ++++IG + + + NCAS P F+ V+ ++ +AF I + Sbjct: 413 ALLDDICNASKDAGIVIWSIGFEVDDHGADVMANCASSPSHFFRVEGI-EISEAFDAIAR 471 Query: 361 EMVKQRI 367 ++ + R+ Sbjct: 472 QINQLRL 478 >gi|158316887|ref|YP_001509395.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158112292|gb|ABW14489.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 319 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 73/211 (34%), Gaps = 29/211 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM +L A + + +D + P N GLV+F+ Sbjct: 89 IILAIDVSNSMAATDITP-TRLEAAKQGAQAFVDQL---PPRIN---LGLVSFAGSAAVL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ I L G T G+ + I A ++L +G I+ L Sbjct: 142 VPASTDRESVRSGIRGLQLGPATAVGEGIYASLQAIATAGQRLS--DEGQPPPPAAIVLL 199 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DGE + + A+ V I ++ L+ Sbjct: 200 SDGETTRGRPNT----QAATAARDAEIPVDTIAYGTSDGTLDVGGQQIPVPVNEEALREL 255 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A + ++ + +L + +G + + Sbjct: 256 ADQTGGSYHRATSGDELQSVYRGLGSSIGYR 286 >gi|291295702|ref|YP_003507100.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470661|gb|ADD28080.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 318 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 79/216 (36%), Gaps = 33/216 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++D +++ +D+SLSM + A ++R I+ +P+ +R LVTF Sbjct: 80 QADPKAAVVLAVDISLSMQATDVQP-SRFEAARAALRTF---IRELPEG---LRLALVTF 132 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + PL + E ++ L T + + + E+ E Sbjct: 133 ARDAHLVVPLTTDRGRLLEAVDFLQLNLGTAIGDAILESIQALPPLSERAEDPDPRRLAT 192 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF----------- 332 II LTDG + + + EA R+ V+ IG+ + Sbjct: 193 ---IILLTDGRSLGGV----DPVVAAQEAARQQIRVHTIGIGRTTSGPVPGLPEVYAQAA 245 Query: 333 ------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 LK A +++ V ++ KL +A+ + + Sbjct: 246 LFDEETLKEVARVGDGQYFYVDSAEKLKEAYRDLTR 281 >gi|255570576|ref|XP_002526245.1| protein binding protein, putative [Ricinus communis] gi|223534439|gb|EEF36142.1| protein binding protein, putative [Ricinus communis] Length = 540 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 60/281 (21%), Positives = 90/281 (32%), Gaps = 46/281 (16%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC----------TF 142 R L G A+ + + + D + SR P + Sbjct: 15 RANLLAGGEAEQKLPLVPLLPPPLKMSSNDDDEKIVTRSRPTPPIVPARVKLRSINNDMA 74 Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P + L +T SS GLD++ VLDVS SM M+K+ A I + Sbjct: 75 PLEESKLKVMLELTG-GDSSSYGRPGLDLVAVLDVSRSMEGD---KMEKMKTAMLFIIKK 130 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV----QHIQEKINRLIFGSTTKSTPG 258 L R +VTFS + PL + + IN L T T G Sbjct: 131 LGPTD---------RLSIVTFSGGANRLCPLRQTTGKSQEEFENLINGLNADGATNITAG 181 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L+ A + E + I+ ++DGE ++ + S+ Sbjct: 182 LQTALKVLKGRSFNGERVVG--------IMLMSDGEQNAGSDATGVSV--------GNVP 225 Query: 319 VYAIGVQAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFL 356 ++ G + LK A F VQN L AF Sbjct: 226 IHTFGFGINHEPKGLKAIAHNSIGGTFSDVQNIDSLTKAFA 266 >gi|89098674|ref|ZP_01171556.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] gi|89086636|gb|EAR65755.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] Length = 920 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 64/318 (20%), Positives = 120/318 (37%), Gaps = 43/318 (13%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 F L ++ + K+L E + + D S + +I L +GF Sbjct: 276 FIENNSLQSAVN---IKGTPKVLIVEQEKSQLENILDGSGLLADSIVPEKLPTSL--SGF 330 Query: 102 AQDINNIERSTSLSIIIDDQH---KDYNLSAVSRYEMPFIFCTFPWCANSSH-APLLITS 157 + I + +++ ++Q + S + M +F L+ Sbjct: 331 LPYQSIIFNNIPATVVSENQMMLIEKAVKEFGSGFIMAGGENSFGLGGYFKTPIEKLLPV 390 Query: 158 SVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++ I K ++ L +M+V+D S SM KL +A + +S+ + Sbjct: 391 NMDIKGKKEMPSLGLMIVMDRSGSMAGS------KLELAKEA------AARSVELLREKD 438 Query: 217 RSGLVTFSSK---IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 G + F + IV+T PL + +KI + G T+ LE AY ++ + K + Sbjct: 439 TLGFIAFDDRPWVIVETGPLE-DKKDAVDKIGSVTPGGGTEIFTSLEKAYEELENLKLQR 497 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 +H II LTDG+ S+ + D + + K + + + ++A L Sbjct: 498 KH-----------IILLTDGQ-SARSTDYESMI---ETGKENNITLSTVALGSDADRNLL 542 Query: 334 KNCA--SPDRFYSVQNSR 349 + A RFY V +S Sbjct: 543 EELAGLGAGRFYDVTDSS 560 >gi|332706285|ref|ZP_08426352.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] gi|332354933|gb|EGJ34406.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] Length = 413 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 74/204 (36%), Gaps = 30/204 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 ++ L++ +VLD S SM G ++ + A + E L R +V F Sbjct: 39 NVPLNLCLVLDHSGSM---HGQPLETVKQAAVGLIERLQP---------DDRLSIVAFDH 86 Query: 226 KI---VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + V+ P+ + I+ KINRL T GL+ ++ AK+ Sbjct: 87 RAKVLVRNQPMG-NLDQIKRKINRLGADGGTAIDEGLKLGVKELIKAKQDTVSQ------ 139 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 + LTDGEN N N+ + A + ++G A L+ A + Sbjct: 140 ----VFLLTDGENEHGN--NESCIKLAELAAENNLTINSLGFGANWNQDILEKIADIATG 193 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 ++ + F R+ M Sbjct: 194 SLSYIEEPEQALSEFARLFNRMQS 217 >gi|166033217|ref|ZP_02236046.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] gi|166027574|gb|EDR46331.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] Length = 1465 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 77/203 (37%), Gaps = 33/203 (16%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG--MDKLGVATRSIREMLDIIKSIPD 211 ++S+ KI+ ++ + LD+++VLDVS SM+D G G ++ ++ +D + D Sbjct: 107 ALSSTAKITGQTTVPLDIVLVLDVSGSMDDPMGSGDNTKRIDALKAAVNSFIDGSAKVND 166 Query: 212 VNNVV----RSGLVTFSSKIVQTFP-----------------------LAWGVQHIQEKI 244 V R +V F+ + + + Sbjct: 167 QRADVNKQNRIAVVKFAGNKTDKIGNDQYSQNRYWYNYTQVVSGYKAYTSGNKSEWETTV 226 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNK 303 N L T + ++ + D + + + K+ +IF TDGE + + D+ Sbjct: 227 NALKPAGCTAADYAMDL-TKTLVDQSKTDANNNADRKNVKRVVIFFTDGEPNHQSGFDDD 285 Query: 304 ESLFYCNEAK--RRGAIVYAIGV 324 + AK + A +Y IG+ Sbjct: 286 VANDAITSAKTIKTDADIYTIGI 308 >gi|118094354|ref|XP_422360.2| PREDICTED: similar to calcium-activated chloride channel [Gallus gallus] Length = 928 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 58/262 (22%), Positives = 93/262 (35%), Gaps = 41/262 (15%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 ++ + S +E+ F + + ++ ++ D + +VLDVS Sbjct: 259 PNMQNKMCNYKSTWEIIMESDDFRNSSVVNSLVPPFETTFELLQTQDRAVS--LVLDVSG 316 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGV 237 SM + + L A + I S R G+VTF S + PL V Sbjct: 317 SM--NTNNRITNLRTAAEVFLIQIIEIGS--------RVGIVTFESSAYEKSPLLQITSV 366 Query: 238 QHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 Q + L G TK G+E I +A I+ LTDGE+ Sbjct: 367 ATRQRLVQNLPTTAGGGTKICAGIEKGLEIITNAIGTTYGSE---------IVLLTDGED 417 Query: 296 SSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY--SVQNSRK 350 S ++ C + K GAI++ I + AA + N + Y V K Sbjct: 418 S--------TMSLCREKVKESGAIIHTIALGPSAAKELEEFSNITGGLQLYAVDVDVPSK 469 Query: 351 LHDAFLRIGK---EMVKQRILY 369 L +AF I ++ +Q I Sbjct: 470 LVEAFSEITTGSGDISEQSIQL 491 >gi|296168868|ref|ZP_06850540.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896485|gb|EFG76135.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 79/243 (32%), Gaps = 31/243 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 P ++ LL T+ +S I L+ +M+V+DVS SM P D+L A + Sbjct: 67 VPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMAATDVPP-DRLTAAKEA 125 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ D + + GLV F++ ++ I+ L T + G Sbjct: 126 GKQFADELTPAIN------LGLVEFAANASLLVSPTTNRAAVKAAIDSLKPAPKTATGEG 179 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFYCNEAKRRG 316 L A I + G I+ +DG + P + + AK G Sbjct: 180 LFTALQAIATVGSVMGG---GDGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEG 236 Query: 317 AIVYAIGVQAEAA---------------DQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGK 360 + I K C + + + + L + + + + Sbjct: 237 VQISTISFGTPYGTVEYEGATIPVPVDDQTLQKICEITDGQAFHADSLESLKNVYSTLQR 296 Query: 361 EMV 363 ++ Sbjct: 297 QIG 299 >gi|254420933|ref|ZP_05034657.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] gi|196187110|gb|EDX82086.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] Length = 646 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 64/183 (34%), Gaps = 42/183 (22%) Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA----KEKLEHIAKGHDD 282 Q PL + + + ++ + +T GL + + + A Sbjct: 462 ASQIVPLTNVKKTLTDAVDGMTAVGSTAGHIGLAWGWYLVSPNFGLWSGLGAPAAYDSSK 521 Query: 283 YKKYIIFLTDGE---------------NSSPNIDN------------KESLFYCNEAKRR 315 K ++ +TDGE N S D +++ C K Sbjct: 522 TLKAVVLMTDGEFNTPYFRGVIASDAGNGSGGADTHINQPATNGSSFEQAYRLCENMKAA 581 Query: 316 GAIVYAIGVQA----------EAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVK 364 IVY +G ++A + + CA+ PDR + +S L DAF IG+++ + Sbjct: 582 DVIVYTVGFDIGAARNMTGPIDSAGELMARCATNPDRAFQASSSTDLSDAFRDIGRDITR 641 Query: 365 QRI 367 RI Sbjct: 642 LRI 644 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/290 (14%), Positives = 89/290 (30%), Gaps = 39/290 (13%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + + +G+++++ + LPVI ++ ++ +++ LD + L A Sbjct: 17 VSRLRDDRRGNVAMIFGLSLPVIVMLALGGVDLHRITTARSQFQDALDAATLAAARSSET 76 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 G +E + + K Sbjct: 77 TPAGLKSVALATLHGNIQ---------------------GTEVEPINDADVEVAMNDKSV 115 Query: 126 NL-SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND- 183 + +A R + P L I++ +++ S +++ +VLD++ SMND Sbjct: 116 VIATAQGRVKTLVANIVLPPYGQLLDDTLPISARSEVNRSSR-DVEVALVLDITGSMNDC 174 Query: 184 -HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV--------QTFPLA 234 K+ + +E++DI+ + L +S + L Sbjct: 175 ADSCSSGRKIDNLKSAAKELIDIVVQTNQSPFYSKVALAPYSMGVNVGDTYASRARGSLD 234 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q I + K+ + AY + A + H K D Sbjct: 235 SNTQSITA---ATWLTGSVKTITSISRAYTAVVTASK---HGFKTGDIVT 278 >gi|229596191|ref|XP_001012539.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|225565545|gb|EAR92294.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 703 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 52/295 (17%), Positives = 104/295 (35%), Gaps = 32/295 (10%) Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA-VS 131 +Q S+ + N + + E I + S + + + K L Sbjct: 110 PQQLYQPSFYQLPNFYDNLQAQKYAEMQIEDSIQKVVNSKNHKKQMQNLEKGLVLDVETL 169 Query: 132 RYEMPFIF---CTFPWCANSSHAPLLITSSVKISS-KSDIGLDMMMVLDVSLSMNDHFGP 187 + F T P + VK + + LD++ V+D S SMND Sbjct: 170 QKHFQFNKNQDQTIPVMVSVKTLDQTNDMEVKSNPLEGRPNLDLICVIDNSGSMNDF--- 226 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEK 243 K+ +I ++L+++ N R L+TF++K Q L + +Q Sbjct: 227 --SKIENVKNTILQLLEML------NENDRLSLITFNTKAKQLCGLKNVNNQNKKSLQTI 278 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + T G+E A+ + K+K + I L+DG+++ + K Sbjct: 279 TKSIKADGGTDIIRGIEIAFQILQSRKQKNSVSS---------IFLLSDGQDNLADAGIK 329 Query: 304 ESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 L + + +++ G + ++ A FY V+ + ++ + F Sbjct: 330 NLLKTTYKQLQEESFTIHSFGFGNDHDGPLMQKIAQIKDGSFYFVEKNDQVDEFF 384 >gi|224048603|ref|XP_002193071.1| PREDICTED: collagen, type XXI, alpha 1 [Taeniopygia guttata] Length = 945 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 90/242 (37%), Gaps = 30/242 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 + H + +S D++ +LD S S+ + Sbjct: 3 QIVIFFQMLLVLLLHDYISAEDGETRASCRTAPADLVFILDGSYSVGPENFEIIKSW--- 59 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGST 252 +++I ++ ++ G+V +S V PL +++ ++ + G Sbjct: 60 ------LVNITRNFDIGPKFIQVGVVQYSDYPVLEIPLGTHESTENLIREMESIHYLGGN 113 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T++ +++A++ +F AK K + LTDG++ D EA Sbjct: 114 TRTGRAIQFAFDHLF---------AKSSRFLTKIAVVLTDGKSQDEVKD------VAAEA 158 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 ++ ++AIGV +E + LK A S + V++ + I +++ ++ + Sbjct: 159 RKNKITLFAIGVGSEIEEDELKAIANKPSSTYVFYVEDYIAISRIKEVIKQKLCEESVCP 218 Query: 370 NK 371 + Sbjct: 219 TR 220 >gi|162454179|ref|YP_001616546.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] gi|161164761|emb|CAN96066.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] Length = 940 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 42/192 (21%), Positives = 67/192 (34%), Gaps = 28/192 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + M +V+D S SM AT + D+I+ I + R Sbjct: 473 SVAMALVMDRSGSMTGLPLEMAKAAAKATAGVLSSDDLIEVIAFDSAPTR---------- 522 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 A I +I R+ G T+ L+ AY + KK++ Sbjct: 523 YVKMQPARNRSRIAGEIARIQPGGGTEIFSALDAAYQDMT-----------VTQARKKHV 571 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV 345 I LTDG+ S+ I + S V +G+ + +Q LK A RF++V Sbjct: 572 ILLTDGKASTGGIRDLVSAMIAES-----ITVTTVGLGNDLDEQLLKMIADVGGGRFHAV 626 Query: 346 QNSRKLHDAFLR 357 + L F + Sbjct: 627 PDPNNLPRIFTK 638 >gi|281423276|ref|ZP_06254189.1| BatA protein [Prevotella oris F0302] gi|281402612|gb|EFB33443.1| BatA protein [Prevotella oris F0302] Length = 332 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 76/217 (35%), Gaps = 39/217 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A E I P+ N GL F+ + Sbjct: 87 GIDIMLAMDVSTSMLAEDLKP--NRLEAAKNVASEF---IADRPNDN----IGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ + IN L T + GL + K K Sbjct: 138 AFTQCPMTTDHVSL---INLLQSVRTDIAARGLISDGTAVGMGLANAVSRLKDSKAKSKV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------------------ 328 +I LTDG N+ +I S A+ G VY IG+ Sbjct: 195 VILLTDGSNNMGDISPMTS---AQIARSFGIRVYTIGIGTNKVAPYPMPVAGGIQYVNIP 251 Query: 329 ---ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + LK+ A + FY N+R+L + I + Sbjct: 252 VEIDSKTLKDIAATTEGNFYRATNNRQLKQIYKDIDQ 288 >gi|86747937|ref|YP_484433.1| hypothetical protein RPB_0811 [Rhodopseudomonas palustris HaA2] gi|86570965|gb|ABD05522.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 435 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 63/419 (15%), Positives = 130/419 (31%), Gaps = 63/419 (15%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G+I+++ AI L I +G I+ + ++ KL D ++L + Sbjct: 15 RRFGRDRSGNIAVIFAIALLPILGFIGAAIDYATANRIRTKLQSAQDAAVLLAVSNSEIN 74 Query: 67 ENGNNGKKQKNDFSYRIIKNI-WQTDFRNELRENGFAQDINNIERST-SLSIIIDDQHKD 124 K F I + E+ EN + ST + + + + Sbjct: 75 RTTAQAKADAEQFFNATIGAYGLTATIKIEVTENDGKRSATADFTSTVTTNFLNLIGYPT 134 Query: 125 YNL--SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL-DVSLS- 180 + + S P + NS + T++ + + D+S S Sbjct: 135 LAIGNRSTSTVSRPIYQDFYLLLDNSPSMGVAATTADIATMVGNTSDKCAFACHDLSDSN 194 Query: 181 ----MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-----SSKIVQTF 231 + G M ++ V ++++++ + VNN R + T S + Sbjct: 195 NYYNLAKKLGVKM-RIDVVRQAVQQLTSTATLMTAVNNQFRMAVYTLGGSCASLGLTTIA 253 Query: 232 PLAWGVQHIQEKINRLIFGS------TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L+ + +Q + S A + +K Sbjct: 254 SLSSAMSSVQTAAGAIDLMSIPKQNYNNDQCTDFNSALAAMNTTIPSSGTG--TAAQPQK 311 Query: 286 YIIFLTDGENSSPNIDNKE-------------SLFYCNEAKRRGAIV---YAIGVQAEAA 329 ++ F++DG N ++ C K RG + Y + Sbjct: 312 WLFFVSDGVADFNNPSGCTQPTVSGGRCQEPLTVTQCKAMKDRGIQIAVLYTTYLALPTN 371 Query: 330 DQF-----------------------LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+CASPD ++ V ++ + +A + K+ V + Sbjct: 372 QWYNDHIAPFNAGPYGPSVNSQIAAKMKSCASPDFYFEVSPTQGISEAMDALFKKAVAK 430 >gi|27365660|ref|NP_761188.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] gi|27361808|gb|AAO10715.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] Length = 465 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 65/153 (42%), Gaps = 10/153 (6%) Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA-------YNKIFDAKEKLEHIAK 278 + P +H + + RL+ G T + G+ +A + I+D Sbjct: 312 HVNPIVPFITERRHFESTVQRLVPGMNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRY 371 Query: 279 GHDDYKKYIIFLTDGENS-SPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKN 335 + KY++ +DG + P +K+ C + K+ RG V + A+++ +++ Sbjct: 372 SDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGGAASERLMQS 431 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 CAS +Y V + + F +I ++++ ++ Sbjct: 432 CASGPEYYHVASLFSVEKVFEQIAEQVISSSLI 464 >gi|114571146|ref|YP_757826.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] gi|114341608|gb|ABI66888.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] Length = 500 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 60/172 (34%), Gaps = 35/172 (20%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL I + I +I TT G+ + + E ++ + K ++ L Sbjct: 327 TPLTNQRNVIDDAIEDMIASGTTNIPIGISWGVRVLSPGMPFTEGVSYDEEGTIKAMVVL 386 Query: 291 TDGEN-----SSPNID---------------------------NKESLFYCNEAKRRGAI 318 TDGEN ++PN N + C AK G Sbjct: 387 TDGENYLDGRNNPNYSHYSGYGYMRDGRLGIQTSSDSTIRNALNDRTEAACEYAKSLGIR 446 Query: 319 VYAIGVQAEAA--DQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 VY I Q ++ +++CA+ P ++ + L AF I ++ R+ Sbjct: 447 VYTITFQVNSSSTRDMMRDCATHPTLYFDSPSDDALRSAFEMIAGDLTNLRL 498 >gi|118394228|ref|XP_001029494.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89283721|gb|EAR81831.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 406 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 52/282 (18%), Positives = 101/282 (35%), Gaps = 32/282 (11%) Query: 86 NIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE-MPFIF---CT 141 N + + E I + S + I + K L + ++ F T Sbjct: 73 NFYDNLQPQKYAEMQIEDSIQEVVNSKNHKKQIYNLEKGLVLDVKTLHKHFQFNKNQDQT 132 Query: 142 FPWCANSSHAPLLITSSVKISS-KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 P + V+ + + LD++ V+D S SM G K+ +I Sbjct: 133 IPVMVSVKTLDQTNDMEVESNPLEGRPNLDLICVIDNSGSM-----SGCSKIENVKNTIL 187 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKST 256 ++L+++ N R L+TF++K Q L + +Q + T T Sbjct: 188 QLLEML------NENDRLSLITFNTKAKQLCGLKKVNNQNKESLQTITKSIKADGGTDIT 241 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRR 315 GLE A+ + K+K + I L+DG++ +I K L + + Sbjct: 242 SGLEIAFQILQSRKQKNSVSS---------IFLLSDGQDDGADIKIKNLLKTTYQQLQEE 292 Query: 316 GAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 +++ G + ++ A FY V+ + ++ + F Sbjct: 293 SFTIHSFGFGNDHDGPLMQKIAQIKDGSFYFVEKNDQVDEFF 334 >gi|299140484|ref|ZP_07033622.1| BatA protein [Prevotella oris C735] gi|298577450|gb|EFI49318.1| BatA protein [Prevotella oris C735] Length = 332 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 76/217 (35%), Gaps = 39/217 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A E I P+ N GL F+ + Sbjct: 87 GIDIMLAMDVSTSMLAEDLKP--NRLEAAKNVASEF---IADRPNDN----IGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ + IN L T + GL + K K Sbjct: 138 AFTQCPMTTDHVSL---INLLQSVRTDIAARGLISDGTAVGMGLANAVSRLKDSKAKSKV 194 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------------------ 328 +I LTDG N+ +I S A+ G VY IG+ Sbjct: 195 VILLTDGSNNMGDISPMTS---AQIARSFGIRVYTIGIGTNKVAPYPMPVAGGIQYVNIP 251 Query: 329 ---ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 + LK+ A + FY N+R+L + I + Sbjct: 252 VEIDSKTLKDIAATTEGNFYRATNNRQLKQIYKDIDQ 288 >gi|41409532|ref|NP_962368.1| hypothetical protein MAP3434 [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570937|sp|Q73UD4|Y3434_MYCPA RecName: Full=UPF0353 protein MAP_3434 gi|41398363|gb|AAS05984.1| hypothetical protein MAP_3434 [Mycobacterium avium subsp. paratuberculosis K-10] Length = 330 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 78/216 (36%), Gaps = 31/216 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+D+S SM ++L A ++ + + + GLV F+ Sbjct: 95 IMLVIDMSQSMRATDVEP-NRLKAAEQAASQF------ASQLTPGINLGLVGFAGTPYLL 147 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P Q + + +L F +T + + A + I +A G I+ L Sbjct: 148 VPPTPQHQATIDALKKLDFADSTATGEAIFTALHAISATA-----VAGGDTPPPARIVLL 202 Query: 291 TDGENSSPN--IDNKESLFYCNE-AKRRGAIVYAIGVQAEAAD--------------QFL 333 +DG + P+ D + ++ AK G + I + + + Sbjct: 203 SDGGENKPSNPSDPHDGVYTAARLAKDEGVPISTITFGTKGGEIEMDGQKVAVPVSTDQM 262 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 K A S + Y+ N +L ++ I E+ + + Sbjct: 263 KMVAKLSGGQSYTATNLGELQKSYNAIENEIGYRTV 298 >gi|198274642|ref|ZP_03207174.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] gi|198272089|gb|EDY96358.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] Length = 332 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 47/225 (20%), Positives = 80/225 (35%), Gaps = 55/225 (24%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P +++ A + E ++ GL F+ + Sbjct: 87 GIDIMLAVDVSTSMLAEDLKP--NRIEAAKQVAAEFIN-------GRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEK--------INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 PL + + R + T GL A +++ D+K K Sbjct: 138 AFTQCPLTVDHGVLLNLFQSVSCDMVQRGMIEDGTALGMGLANAVSRLKDSKAK------ 191 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------- 328 K +I LTDG N+ +I L AK+ G VY IGV Sbjct: 192 -----SKVVILLTDGVNNRGDISP---LTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYA 243 Query: 329 -----------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 +Q + A + ++ ++ KL + + I K Sbjct: 244 GVQYVQMPVEIDEQTMSQIAGTTNGNYFRATSNTKLKEVYREIDK 288 >gi|320156062|ref|YP_004188441.1| hypothetical protein VVM_02402 [Vibrio vulnificus MO6-24/O] gi|319931374|gb|ADV86238.1| hypothetical protein VVMO6_01216 [Vibrio vulnificus MO6-24/O] Length = 465 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 65/153 (42%), Gaps = 10/153 (6%) Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA-------YNKIFDAKEKLEHIAK 278 + P +H + + RL+ G T + G+ +A + I+D Sbjct: 312 HVNPIVPFITERRHFESTVQRLVPGMNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRY 371 Query: 279 GHDDYKKYIIFLTDGENS-SPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKN 335 + KY++ +DG + P +K+ C + K+ RG V + A+++ +++ Sbjct: 372 SDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGGAASERLMQS 431 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 CAS +Y V + + F +I ++++ ++ Sbjct: 432 CASGPEYYHVASLFSVEKVFEQIAEQVISSSLI 464 >gi|225435353|ref|XP_002285265.1| PREDICTED: hypothetical protein isoform 1 [Vitis vinifera] Length = 729 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 52/292 (17%), Positives = 102/292 (34%), Gaps = 50/292 (17%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYN-LSAVSRYEMPFIFCTFPWCANSSHA 151 R+ + I IE T + + +N + + + P N ++ Sbjct: 207 RSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPLTSGRQNSGTNQTN- 265 Query: 152 PLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ +S+S +D++ VLDVS SM KL + R++ ++ + Sbjct: 266 -------MQPTSQSCRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQSLGPCD 312 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R +++FSS + FPL G Q + +N L+ T GL + Sbjct: 313 ------RLSVISFSSTARRLFPLRRMTDTGRQQALQAVNSLVSNGGTNIAEGLRKGAKVM 366 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA--------KRRG-- 316 D + + II L+DG+++ + + + + G Sbjct: 367 LD---------RKWKNPVSSIILLSDGQDTYTVCSPGGAHSRTDYSLLLPFSIHRNGGTG 417 Query: 317 --AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 V+A G + + + S F ++ + DAF + IG + Sbjct: 418 FQIPVHAFGFGTDHDATSMHAISETSGGTFSFIEAEGVIQDAFAQCIGGLLS 469 >gi|147834997|emb|CAN61381.1| hypothetical protein VITISV_037547 [Vitis vinifera] Length = 1324 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 53/292 (18%), Positives = 102/292 (34%), Gaps = 50/292 (17%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYN-LSAVSRYEMPFIFCTFPWCANSSHA 151 R+ + I IE T + + +N + + + P N ++ Sbjct: 207 RSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPLTSGRQNSGTNQTN- 265 Query: 152 PLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ +S+S +D++ VLDVS SM KL + R++ ++ + Sbjct: 266 -------MQPTSQSCRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQSLGPCD 312 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R +++FSS + FPL G Q + +N LI T GL + Sbjct: 313 ------RLSVISFSSTARRLFPLRRMTDTGRQQALQAVNSLISNGGTNIAEGLRKGAKVM 366 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA--------KRRG-- 316 D + + II L+DG+++ + + + + G Sbjct: 367 LD---------RKWKNPVSSIILLSDGQDTYTVCSPGGAHSRTDYSLLLPFSIHRNGGTG 417 Query: 317 --AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 V+A G + + + S F ++ + DAF + IG + Sbjct: 418 FQIPVHAFGFGTDHDATSMHAISETSGGTFSFIEAEGVIQDAFAQCIGGLLS 469 >gi|323135758|ref|ZP_08070841.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398849|gb|EFY01368.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 588 Score = 77.2 bits (188), Expect = 3e-12, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 66/208 (31%), Gaps = 64/208 (30%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L I+ KI++L+ T G + + I + Sbjct: 379 DHTTQRLLQLTTSQTTIKNKIDQLVANGNTNLQEGFMWGWRTISPNGPFAAGRPYATSNN 438 Query: 284 KKYIIFLT-----------------------------------DGE-------------- 294 +K ++F+T DG Sbjct: 439 RKVMVFMTDGFNHWGAYPNTVVGSDYEALGYYTYNGEKNLRLPDGSRGDRVDYQNALKAA 498 Query: 295 ---NSSPNIDNKE-----SLFYCNEAKRRGAIVYAIGVQA------EAADQFLKNCASP- 339 NSS ++ +L C AK G V+ IG + LK+CA+ Sbjct: 499 RNSNSSYLATARDAQDELTLQACTNAKNAGVEVFTIGFSTSTDPIDAQGLELLKSCATNV 558 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 D +++V+N+ +L+ AF IG + K R+ Sbjct: 559 DHYFAVENANQLNAAFSSIGIGLGKLRL 586 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 72/211 (34%), Gaps = 29/211 (13%) Query: 7 RNFFYNCKGSISILTAI-LLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 ++F + G++ ++ + L+PV+F++ + + ++ L D ++L A+K+ Sbjct: 14 KSFGADESGNVGMVFGLGLVPVMFMLGATA-DYTRYATTRSALRQATDVAVLTVASKLTA 72 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 Q + I+ + + T+ SI Q Sbjct: 73 ----TTTDAQAKAQAQVILNAQPRMSTASI----------------TTASIATTKQ---- 108 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 A S + F + + + + + + + ++ +V+D S SM Sbjct: 109 TFCATSEVTIQNSFMQMARVTSLTP-SVTSCADLAWGANPNATYEVALVVDNSGSMLSSD 167 Query: 186 GPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 G + K+ + + +D PD Sbjct: 168 GS-VTKISALKTAAKSFVDTMFAKAPDRVQF 197 >gi|254421496|ref|ZP_05035214.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] gi|196188985|gb|EDX83949.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] Length = 410 Score = 77.2 bits (188), Expect = 3e-12, Method: Composition-based stats. Identities = 47/217 (21%), Positives = 78/217 (35%), Gaps = 31/217 (14%) Query: 158 SVKISSKSD---IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 SV +++K L++ VLD S SM G + + A I + L +N Sbjct: 26 SVSVAAKGGGVSAPLNVCFVLDRSGSM---MGTPLQTVKQAASRIVDRL---------SN 73 Query: 215 VVRSGLVTFSSKIVQ--TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 R ++ F K + LA Q I+ +IN L G T GL+ ++ KE Sbjct: 74 RDRISIIAFDHKAEVLISNELASDPQAIKRRINSLRAGGGTCIDDGLKAGIEQLASGKEG 133 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ LTDGEN DN ++ + A V +G Sbjct: 134 YISQ----------LLLLTDGENEHG--DNSRAIKLADVAIGYNLTVNTLGFGDHWNQDV 181 Query: 333 LKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L+ A ++++ + F R+ M + Sbjct: 182 LEQIADAGGGSLSYIEHAEEAIATFGRLFTRMQSVSL 218 >gi|34534804|dbj|BAC87116.1| unnamed protein product [Homo sapiens] Length = 725 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 51/226 (22%), Positives = 90/226 (39%), Gaps = 32/226 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSD-----IGLDMMMVLDVSLSMNDHFGPGMDK 191 F+F P + + KIS+ S +D+M +LD S S+ G Sbjct: 13 FLFSRVPPSLPLQEVHVSKETIGKISAASKMMWCSAAVDIMFLLDGSNSV------GKGS 66 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIF 249 + + D + P+ VR G FSS FPL ++ Q ++ +I R++F Sbjct: 67 FERSKHFAIAVCDGLDISPER---VRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVF 123 Query: 250 GST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T++ L+Y ++ + + +I +TDG++ + L Sbjct: 124 KGGRTETGLALKYLLHRGLPGGR--------NASVPQILIIVTDGKSQGDVALPSKQL-- 173 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 K RG V+A+GV+ ++ L AS R V + ++ DA Sbjct: 174 ----KERGVTVFAVGVRFPRWEE-LHALASEPRGQHVLLAEQVEDA 214 >gi|163761157|ref|ZP_02168234.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] gi|162281708|gb|EDQ32002.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] Length = 444 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 63/448 (14%), Positives = 129/448 (28%), Gaps = 87/448 (19%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 M ++ + + G+ +L +++ + V GL ++ S+ VK I+D ++L Sbjct: 1 MQMHSVSQYLRSRDGNFGLLAGLVMVALVWVAGLAVDFSNALRVKTTAQDIVDATVLRAT 60 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 I+ +E + + + Y + + + E+ Q ++ I + + + Sbjct: 61 RDII-EEGKTLAEAELSARKYFDAELAFSSGVGLEVSTFTLTQGVDGI-----VKLGVSG 114 Query: 121 QHKDYNLSAVSRYEMPFIF-CTFPWCANSSHAPLLIT-------SSVKISSKSDIGLDMM 172 + L AV R E+P S + + + S I + Sbjct: 115 KTSTSLLKAVGREEIPVSVDAAAHVGGGSVEIAIAFDVTNSMGFGTTWGEATSVIASALN 174 Query: 173 MVLDVSLSMNDHFGPGMDKLGV-------------------------------------- 194 + S SM F P D++ V Sbjct: 175 ALKANSGSMALTFIPFTDRVNVGMGRANLLNPGDQTAVKKGGWGGCVDVRATKKKNKGET 234 Query: 195 ---ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 S E D + + V + ++ +L G Sbjct: 235 EYFMPDSAPEKGDRFTKFDNGTPAAHKSGYKLACNPQSIIGPTSNVSDVTSQLGKLTKGG 294 Query: 252 TTKSTPGLEYAYNKIF-------------DAKEKLEHIAKGHDDYKKYIIFLTDGE---- 294 T + G + + + D L + +K + TDG Sbjct: 295 TGRFDLGFAWLWYALSPNWKGFWSGGAPADNGVNLADYPTASTNTRKIAVLATDGLTNAY 354 Query: 295 ------------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCAS--P 339 N+ + + C + V+ + V + A+ + + CAS Sbjct: 355 VYEYGKTNLAGWNTGSKDHFENVVAICKSMAAQKIEVHVMHVNGNDKAEPYFRECASATG 414 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +Y V + + L DA I R+ Sbjct: 415 GGYYKVASKQTLVDALTGITNGGGNLRL 442 >gi|297671961|ref|XP_002814089.1| PREDICTED: collagen alpha-5(VI) chain-like [Pongo abelii] Length = 2586 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 45/197 (22%), Positives = 82/197 (41%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ + M I + ++K N V+ G + +S + Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRNRVQFGALKYSDQPN 863 Query: 229 QTFPLAW--GVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L++A N +F K H ++ + K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHA-NALFTEK----HGSRIKQNVKQ 918 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG+ + D+ + +E + +G ++A+GV + + + V Sbjct: 919 VLIVITDGK----SHDHDQLNDTASELRDKGITIFAVGVGKANQKELEGMAGNKNNTIYV 974 Query: 346 QNSRKLHDAFLRIGKEM 362 N KL D F + + M Sbjct: 975 DNFDKLKDVFTLVQERM 991 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 68/190 (35%), Gaps = 25/190 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + ++ +L I+ D G+V FS Sbjct: 623 EDMKADIMFLVDSSWSIGNE------NFRKMKIFMKNLLTKIQVGADKTQ---IGVVQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K + F L+ + Q I + I+R+ T + L + H Sbjct: 674 DKTKEEFQLSRYFTQQEISDAIDRMSLINEGTLTGKALNFVSQYFT-------HSKGARL 726 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 727 GAKKFLILITDGVAQDDVRDPARIL------RGKDVTIFSVGVYNANRSQLEEISGDGSL 780 Query: 342 FYSVQNSRKL 351 + V+N L Sbjct: 781 VFHVENFDHL 790 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 44/266 (16%), Positives = 88/266 (33%), Gaps = 33/266 (12%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +T L I+ + + S ++ F + + T + + + Sbjct: 376 ANNTQLEEIVSYPPEQTISTLKSYADLETYSTKFLKKLQNEIWSQISTYAEQRNLDKTGC 435 Query: 169 LD-----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D + ++D S S+ + + R + E+ ++ PD VR G+V + Sbjct: 436 VDTKEADIHFLIDGSSSIQEK------QFEQIKRFMLEVTEMFSIGPDK---VRVGVVQY 486 Query: 224 SSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F + I + N T + L+Y I + + Sbjct: 487 SDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGTKDRMSK---- 542 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 Y+I LTDG ++ + + V+A+G+ A + + + Sbjct: 543 --VPCYLIVLTDGMSTD------RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEE 594 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQR 366 R QN DA I E+V++ Sbjct: 595 RVSFGQN----FDALKSIKNEVVREI 616 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 64/179 (35%), Gaps = 23/179 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NV 215 IS + D D++ ++D SL + R ++ L+ I S DV N Sbjct: 223 VPFPISCQKDSLADLVFLVDESLGTRGNL-----------RHLQTFLENITSSTDVKENC 271 Query: 216 VRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +R GL+++S+ L + Q++I L +T A +++ Sbjct: 272 MRLGLMSYSNSAKTISFLKSSTTQSEFQQQIKNLSIQVGKSNTGA---AIDQMRRDGFSE 328 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ + + +T + D +L G V+A+ +Q Q Sbjct: 329 SYGSRRAQGVPQIAVLVTHRPSDDEVHDAALNLRL------EGVTVFALSIQGANNTQL 381 >gi|72162079|ref|YP_289736.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71915811|gb|AAZ55713.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 315 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 51/213 (23%), Positives = 80/213 (37%), Gaps = 32/213 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM D+L A S + ++ S+P N GLV FSS Sbjct: 88 ILVAIDVSPSMAATDVAP-DRLTSAKESAQSFIE---SLPPRFN---VGLVAFSSVATVV 140 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 Q + + I L S T G+ + I EK D I+ L Sbjct: 141 ASPTQDHQAVADSIANLTISSGTAIGEGVFASLQAIRSFDEKA-----TDDPPPAAIVLL 195 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--------------AEAADQFLKNC 336 +DGEN+S + +EA+ G V I A + LK Sbjct: 196 SDGENTSG----RPVAAAADEARAAGVPVSTIAFGTGVSIIEIEGHYVPANIDKETLKEL 251 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A + RFY +++ +L D + IG + + + Sbjct: 252 AMTTGGRFYEAESTGELKDVYADIGSSLGTETV 284 >gi|284029341|ref|YP_003379272.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808634|gb|ADB30473.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 315 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 32/239 (13%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 F + A + I V +M+ +D+S SM D+ VA + Sbjct: 58 AFAFLAAIVVLTIAIARPVADVRVPRERATVMVAMDISNSMAATDVSP-DRFTVAKEAAT 116 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 E ++++P+ N GLV+F+ P + Q + I +L +T + Sbjct: 117 EF---VRNLPEQFN---VGLVSFARTATVVAPPSTNHQAAVDAIEQLTLTDSTAIGEAVL 170 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 + + + I+ L+DG N+S ++ + A G V Sbjct: 171 TSLQAVRSLDAQAAEDPPPAR-----IVLLSDGGNTSGRPIDEGAR----AATEAGVPVS 221 Query: 321 AIGVQ--------------AEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 I A + L+ A+ FY+ ++ +L D + + + Sbjct: 222 TIAYGTPEGTIDLEGRSIPVPADTESLRGLADATSGSFYAAESDEELRDVYSDLQSSIG 280 >gi|296120496|ref|YP_003628274.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296012836|gb|ADG66075.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 396 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 53/391 (13%), Positives = 122/391 (31%), Gaps = 59/391 (15%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G+I+IL A ++ + + G + S+ +A+L D + +++ ++ +G Sbjct: 18 RRGAIAILAAFVMVALLALAGFFLSLSYVELTRAELRAATDAAARSAVIRLVETQSTTSG 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + D + R + + G + +N S +++ + + + + Sbjct: 78 RAAARDIASRFEVGGKALSLNDNDIQFGRSTRQSNGSYSFAINGTPTNAARVFGRKTKTS 137 Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF------- 185 P + ++ L ++ + D+++VLD S SM Sbjct: 138 AAGPVELPFGGFVGAPEYSTELNAVAM------RLDYDIVIVLDRSGSMGWDLSGVEFEY 191 Query: 186 -----------------GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI- 227 P + + + S+ + L I+ R GLVT++ Sbjct: 192 PEAVRQRPLVENYFSPPDPTGSRWAILSASVNDFLTILN---QRQVAARVGLVTYAGDYT 248 Query: 228 ---------VQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 L I K+ + T G+ A + + + Sbjct: 249 FGKYSSVKLTVESDLTSTFSTITSKLTAIGQVPLIGGTDIGAGITAAQTMLTTSSQARLK 308 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + II +DG N + + + I++++ A A + N Sbjct: 309 TGQP------IIIVFSDG---MFNQGTEPVSLAASAYSQSSTIIHSVTFGATAQGRATMN 359 Query: 336 C----ASPDRFYSVQNSRKLHDAFLRIGKEM 362 A + +L ++F I + Sbjct: 360 SVTATAGKGLSLHANTAAELAESFRSIANAI 390 >gi|37680183|ref|NP_934792.1| hypothetical protein VV1999 [Vibrio vulnificus YJ016] gi|37198930|dbj|BAC94763.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 481 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 65/153 (42%), Gaps = 10/153 (6%) Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA-------YNKIFDAKEKLEHIAK 278 + P +H + + RL+ G T + G+ +A + I+D Sbjct: 328 HVNPIVPFITERRHFESTVQRLVPGMNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRY 387 Query: 279 GHDDYKKYIIFLTDGENS-SPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKN 335 + KY++ +DG + P +K+ C + K+ RG V + A+++ +++ Sbjct: 388 SDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGGAASERLMQS 447 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 CAS +Y V + + F +I ++++ ++ Sbjct: 448 CASGPEYYHVASLFSVEKVFEQIAEQVISSSLI 480 >gi|326328639|ref|ZP_08194979.1| LigA [Nocardioidaceae bacterium Broad-1] gi|325953600|gb|EGD45600.1| LigA [Nocardioidaceae bacterium Broad-1] Length = 871 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 76/215 (35%), Gaps = 29/215 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SMND PG K+ + I + + + TF S + + Sbjct: 650 QVILLIDNSGSMNDEVAPGAAKIDRVQSAANA---AIGLLAPKDE---LAVWTFGSSVHK 703 Query: 230 T--FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 T P+ + ++ +I + G TT P + A + L + K + Sbjct: 704 TALAPMGNRISQVRAEIGAIEAGGTTTQLPA------AVQAAHDALAQTNDPDNPKTKAV 757 Query: 288 IFLTDGE-NSSPNIDNKESLFYCNEA-------KRRGAIVYAIGVQAEAADQFLKNC--A 337 + LTDG N +P+ ++E N+A +Y I A L+ A Sbjct: 758 VLLTDGATNLTPDGADEEENKAANDALVADIRGSESHVRIYTIPYGNSADKCLLEKVAAA 817 Query: 338 SPDRFYSVQNSRKL-----HDAFLRIGKEMVKQRI 367 S R+Y L F G + + Sbjct: 818 SGARYYGAGARESLINDVMLAVFGNFGTQAAAANL 852 >gi|258652510|ref|YP_003201666.1| hypothetical protein Namu_2300 [Nakamurella multipartita DSM 44233] gi|258555735|gb|ACV78677.1| conserved hypothetical protein [Nakamurella multipartita DSM 44233] Length = 320 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 74/229 (32%), Gaps = 27/229 (11%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + + + + +M+ +D SLSM ++L A + + +D Sbjct: 68 LTVALAGPTAQAKEPRNRAVVMLAVDTSLSMEATDVAP-NRLDAAKEAAQSFVD------ 120 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 D+ V G+V+F+ ++ I+ L T + + + I Sbjct: 121 DLTPGVNLGIVSFAGIATVLVSPTTDRTVAKQAIDGLTLDERTATGEAIISSLQTIELFS 180 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + L I+ +TDG+ + + + G V I + Sbjct: 181 KTLPPDGTDTGPPPARIVLMTDGKRTVGRTEQDAAQRA----ADAGVPVSVIAFGTDNGS 236 Query: 331 --------------QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 + ++ A S F+ ++ +L + ++G+++ Sbjct: 237 ITVNDEVIPVPLDTEAMQQIAQISGGDFHQAASAEELKSIYAQLGEQIG 285 >gi|167759260|ref|ZP_02431387.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] gi|167663134|gb|EDS07264.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] Length = 800 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 46/232 (19%), Positives = 76/232 (32%), Gaps = 48/232 (20%) Query: 169 LDMMMVLDVSLSMNDHF-------GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +D+++++D S SMN MD L + D I + + +V Sbjct: 179 IDVLLIVDKSGSMNWKMDTDKVGKPSRMDVLKQVVTGTGGLTDSI--FGNTQIDAQMAVV 236 Query: 222 TFS----------SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 T+S + +N + T GL + ++E Sbjct: 237 TYSGSNDFLDQRYDDAEIIQEWTKQKDTVNNAVNNIQAKGGTNCEAGLRTGATALEGSRE 296 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENS-------------SPNIDNKESLFYCNEAKRRGAI 318 KK++IFL+DG+ + S + K G Sbjct: 297 NA----------KKFVIFLSDGDATFYYGDDGYTKGPGSGSSPTAREKAIAQVQKITGLE 346 Query: 319 -VYAIGVQAEAADQFLKNCA-----SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 Y IG+ + ++ +FL N A S RFY N+ L AF I E + Sbjct: 347 GFYTIGMTSSSSSEFLTNLANNSKASEKRFYPANNTEALEKAFQEIVGETTE 398 >gi|297583258|ref|YP_003699038.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] gi|297141715|gb|ADH98472.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] Length = 978 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 80/209 (38%), Gaps = 30/209 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + GLD+M VLD S ++N + V+ + ++ R G+++F Sbjct: 56 QPGAGLDLMFVLDNSGTVNLDDTDSIRSSTVSDYA-----------ENMLPGDRGGIISF 104 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +++ ++ + + ++ L T + G+ A + K Sbjct: 105 NTEADMLQEMSDNRYDLLDALSALPDPSGGTDLSQGMRAANEQFVQTKGAN--------- 155 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---QFLKNCA-- 337 K+ ++ +TDG ++ I+ E EA+ G ++ +G+ + A L++ A Sbjct: 156 -KQIMVLITDGADT---INLAEVYNQVREARMNGITIFTLGLGSLATGLDEALLQDIADQ 211 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ V N+ + I + R Sbjct: 212 TRGQYRQVPNATVIESVLQDIRSSLEGMR 240 >gi|87200512|ref|YP_497769.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] gi|87136193|gb|ABD26935.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] Length = 631 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 50/158 (31%), Gaps = 29/158 (18%) Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE---HIAKGHDDYKKYIIFLTDGE 294 + L +T G+ + + + ++IIF+TDG+ Sbjct: 473 SAFYAYADALSANGSTYHDLGMLWGLRLSSPDGPWQAMVNETPENGGEVSRHIIFMTDGQ 532 Query: 295 NS--------------------SPNID-----NKESLFYCNEAKRRGAIVYAIGVQAEAA 329 D C+ AK +G V+ I ++ Sbjct: 533 MDTNYKVMSTYGIEWHDRRITDDGVTDQDARHTLRFRALCDAAKAKGFRVWVIAFASDLN 592 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 D L CAS + N+ +L+ AF I K + + R+ Sbjct: 593 DD-LSYCASASSTFPATNATELNTAFQEIAKNVAELRV 629 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 41/305 (13%), Positives = 89/305 (29%), Gaps = 51/305 (16%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK 76 + L A +PV+ +++G ++ + + +L D L + + Sbjct: 1 MLPLAATCVPVLILLIGSGLDMGRLYKARNRLQSACDAGALAGRRSVSSAG--------- 51 Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP 136 + + + A + +T + + ++ ++ Sbjct: 52 -----------YDDAAKAQAAAFFNANFNEDDLGATETNFATSSADGGSLVEGIATTDVE 100 Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVA 195 + + I + D+ MVLD + SM+ G ++ Sbjct: 101 MVLMNL-----FGVISVPINVECSATMDIGNT-DVTMVLDTTGSMSQTLSGTTTKRIDAL 154 Query: 196 TRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 +++ D + + +N VR V +SS + ++ + I L Sbjct: 155 RTAMKNFYDTVSAATTGSNARVRYSFVPYSSSV-----------NVGQLIYDLDPDYLVD 203 Query: 255 S------TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 + TP +I + + T G + S N SL Sbjct: 204 TWAIQSRTPVFNTVTEQILTGYDTPVTTTASSYSNE------TTGNDQSYNSTRYNSLSA 257 Query: 309 CNEAK 313 CN AK Sbjct: 258 CNTAK 262 >gi|313792199|gb|EFS40300.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|314984000|gb|EFT28092.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] Length = 320 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 33/245 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + +++ + + + ++ D +++ +DVS SM +L A + Sbjct: 59 LAMGLSVLSMAIIVLAFAQPKAYHEVPRDRATVVVAIDVSRSMVATDVEP-SRLSAAKTA 117 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ L +P N LV F++ P + I L +T G Sbjct: 118 AKDFL---GDLPPRFN---VSLVKFAASAQVVVPPTPDRAAVSTAITNLQVLPSTAIGEG 171 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + N + + +H I+ L+DG + + SL EA R+ Sbjct: 172 IYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGATNVG----RPSLEAAKEAGRQHVP 224 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 VY I L A S +S ++ +L D + I + + Sbjct: 225 VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSV 284 Query: 363 VKQRI 367 +++ Sbjct: 285 GYEKV 289 >gi|119599629|gb|EAW79223.1| hCG1743181 [Homo sapiens] Length = 1211 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+ ++K N VR G + ++ Sbjct: 786 LDVVFVIDSSGSIDYDEYNIMKDF---------MIGLVKKADVGKNQVRFGALKYADDPE 836 Query: 229 QTFPLA-WGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +G + I N G +T + L ++ + +A+ + + Sbjct: 837 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRLNKGVP-----Q 891 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + + +G +V A+G+ + L S D+++ V Sbjct: 892 VLIVITDGE----SHDADKLNATAKALRDKGILVLAVGIDGANPVELLAMAGSSDKYFFV 947 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L F + + Sbjct: 948 ETFGGLKGIFSDVTASVCN 966 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 60/187 (32%), Gaps = 17/187 (9%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 +D+ M+ + + ++ + VR G FS FPL Sbjct: 994 VDLVFLMDGSTSIQPNDFKKMKEFLASVVQDFDVSLNR---VRIGAAQFSDTYHPEFPLG 1050 Query: 235 W--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 G + I +I N T L + + + ++ LT Sbjct: 1051 TFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFRPDMGSRINTGTP-----QVLLVLT 1105 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DG++ E + RG +Y++G+ Q ++ + ++ +V N +L Sbjct: 1106 DGQSQD------EVAQAAEALRHRGIDIYSVGIGDVDDQQLIQITGTAEKKLTVHNFDEL 1159 Query: 352 HDAFLRI 358 RI Sbjct: 1160 KKVNKRI 1166 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 54/309 (17%), Positives = 98/309 (31%), Gaps = 35/309 (11%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 N N G Q T LR G IE ++ + Sbjct: 274 SARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFTLGIEGTSDTQLEKIAS 333 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL--DMMMVLDVSL 179 H + + F IT +V + S+ L D+ +++D S Sbjct: 334 HPAEQYVSKLK---TFADLAAHNQTFLKKLRNQITHTVSVFSERTETLKSDIYLLIDGSG 390 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 S + E++ + P VR G V ++ F + Sbjct: 391 S------TQATDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEI--NKYS 439 Query: 240 IQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 ++ + + G T + L + + + AK++ + H ++ LT+G Sbjct: 440 NKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQRGNKVPCH------LVVLTNG- 492 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHD 353 + L N + VYAIG++ EA L+ A + R Y V + L D Sbjct: 493 -----MSKDSILEPANRLREEHIRVYAIGIK-EANQTQLREIAGEEKRVYYVHDFDALKD 546 Query: 354 AFLRIGKEM 362 ++ +E+ Sbjct: 547 IRNQVVQEI 555 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 73/186 (39%), Gaps = 25/186 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S+ M ++ ++ + PD V+ G+V FS Sbjct: 595 KEMKADIMFLVDSSGSIGPENFSKM------KTFMKNLVSKSQIGPDR---VQIGVVQFS 645 Query: 225 SKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L I I+++ G TT + L + K +I Sbjct: 646 DINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGARPNI----- 700 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE + L ++ G I+Y++GV Q + P+ Sbjct: 701 --RKFLILITDGEAQDIVKEPAVVL------RQEGVIIYSVGVFGSNVTQLEEISGRPEM 752 Query: 342 FYSVQN 347 + V+N Sbjct: 753 VFYVEN 758 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 67/206 (32%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD+S + + + + ++ N +R GLV +S++ Sbjct: 182 DVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKV 232 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ G+ + + I L + T A K+ + ++ + + Sbjct: 233 INSLSMGINKSEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIA 289 Query: 288 IFLT--DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +T D E++ +R G ++ +G++ + Q K + P Sbjct: 290 VLVTHRDSEDNVTKAAV--------NLRREGVTIFTLGIEGTSDTQLEKIASHP------ 335 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + K Sbjct: 336 --AEQYVSKLKTFADLAAHNQTFLKK 359 >gi|152993581|ref|YP_001359302.1| von Willebrand factor A [Sulfurovum sp. NBC37-1] gi|151425442|dbj|BAF72945.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 305 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 52/230 (22%), Positives = 86/230 (37%), Gaps = 33/230 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATR 197 A I + I+SK + G D+++V+D S SM D P +K V Sbjct: 57 VGIVAAVVALASPILTKNYINSKKE-GRDIVLVIDSSDSMRQMGFDPKDPYKNKFDVVK- 114 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE--KINRL-IFGSTTK 254 +++ R G+VTF+ PL + + ++ +L + G T Sbjct: 115 ------EVVADFIKKRKNDRIGMVTFADVAFIASPLTFEKDFLTNITEMQKLGMAGKRTA 168 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 L AYN + +K K K II LTDG ++ I + + Sbjct: 169 INDALVQAYNLMSKSKAK-----------SKIIILLTDGRDNMSKIPLSDVKHMIE---K 214 Query: 315 RGAIVYAIGVQAEA--ADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGK 360 R +Y IG+ Q+LK A + Y+ +++ L + I K Sbjct: 215 RDVKLYTIGIGGPRDYDAQYLKTLAKAGKGQAYAARSAAMLSKIYDEINK 264 >gi|294055720|ref|YP_003549378.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293615053|gb|ADE55208.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 326 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 87/255 (34%), Gaps = 37/255 (14%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPW--CANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 + ++ + +F W + P + + + + + D+++++D+ Sbjct: 46 PKQGGKPINLPGNHRATRLFAWMSWILLVLALARPQYLEAPLTRTVPTR---DLLLIVDL 102 Query: 178 SLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 S SM+ + G +D+L + E L R GL+ F S P Sbjct: 103 SGSMDARDFTNPEGERIDRLSAVKGVLDEFLT-------RREGDRVGLIVFGSAAFVQVP 155 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + + ++T + DA + + + ++ II LTD Sbjct: 156 FTQDLNACRILL--------EETTVRMAGPRTVFGDALGLGITLFERSEVEERVIIALTD 207 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-------ADQFLKNCAS--PDRFY 343 G ++ + E+ N V+ +GV ++ L AS R++ Sbjct: 208 GNDTGSRVPPAEAAKIAN---DNTVKVHVVGVGDPTTTGEDVLDEEALNAVASTTGGRYF 264 Query: 344 SVQNSRKLHDAFLRI 358 N +L D + + Sbjct: 265 HANNREELEDIYTEL 279 >gi|156741949|ref|YP_001432078.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233277|gb|ABU58060.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 847 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 70/200 (35%), Gaps = 28/200 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V +++++D S SM G+ K +A + + +++ R Sbjct: 382 VPPPRPERSDTTLLLIIDQSASMGPE--TGLSKFTMAKEAAIMATESLRAED------RI 433 Query: 219 GLVTFSSK---IVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 G++ F +V P+ G+ IQ +I+ L G T L+ ++ ++ Sbjct: 434 GVLAFDVSTRWVVDFQPVGTGLSLADIQRRISTLPLGGGTDIYNALQTGLPELARQPGRV 493 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 H + LTDG S D + EA+ R + I + +A L Sbjct: 494 RHA-----------VLLTDGR--SFTDDRQAYQALIEEARSRNITLSTIAIGTDADIDLL 540 Query: 334 KNCA--SPDRFYSVQNSRKL 351 + A R+Y + Sbjct: 541 QTLARWGAGRYYFAAEPGDI 560 >gi|117618496|ref|YP_856674.1| von Willebrand factor type A domain-containing protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559903|gb|ABK36851.1| von Willebrand factor type A domain protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 337 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 44/241 (18%), Positives = 81/241 (33%), Gaps = 28/241 (11%) Query: 131 SRYEMPFIFCTFPWCANSSH--APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 P WCA P S D+++ +D+S SM Sbjct: 45 PHAGQPLWRMALCWCALVLALCRPQWQESPQVSY---QSSRDLILAVDLSDSMRTQDMLD 101 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ---HIQEKIN 245 + ++R+ +D + + + R L+ F+ PL + +++ Sbjct: 102 EGEQKDRLSAVRQQIDRLIAARPGD---RIALIVFADHAYLLSPLTQETNALLGLTRELD 158 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + G TT + A +H A ++ +TDG N++ + D Sbjct: 159 FELVGRTTALGEAILLA----------RQHKAPERSTA---LLLVTDGRNTAGSADP--- 202 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 L +A G +Y +GV A+ D F + S +L +A L+ E+ Sbjct: 203 LREAKQAAAAGIRLYTLGVGADP-DTFAEAMTPAQTPAQSDPSAELDEALLQQLAEVGHG 261 Query: 366 R 366 R Sbjct: 262 R 262 >gi|320352592|ref|YP_004193931.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121094|gb|ADW16640.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 798 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 76/213 (35%), Gaps = 38/213 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + +++VLD S SMN +++L VA ++ + + N G+V+++S Sbjct: 327 VRVVLVLDESGSMNAETPKRIERLKVAAKNFVSLAE---------NGTELGIVSYASDAA 377 Query: 229 QTF--------PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 PL I+ L + T GL+ A + I A G Sbjct: 378 VASGRTEVAIAPLGANRAAWNNAIDGLGPSTRTNIGAGLQKARDLITAA---------GG 428 Query: 281 DDYKKYIIFLTDGENSSP----NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 YI+ ++DG N+ P N D + G VY ++ C Sbjct: 429 VTANTYIVLMSDGLNNEPAPQANADADLNGKIA-MLLADGIPVYVTCTGSDLG--LASQC 485 Query: 337 A-----SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + + +S +L +AF + +V Sbjct: 486 SEIGTGTGGHYVDSADSARLPEAFADFHERIVA 518 >gi|327543524|gb|EGF29943.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 274 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 42/228 (18%), Positives = 77/228 (33%), Gaps = 33/228 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVA 195 + + P + + + D+++++D+S SM + F K Sbjct: 63 LAVAIWACVLTAVARPQWLEPPITKEIPTR---DLLLLVDLSGSMAQEDFQNDAGKKVSR 119 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGST 252 +++E+LD R GLV F P +Q QE + G Sbjct: 120 LDAVKEVLD---GFLAKRKGDRVGLVVFGDAAYLQAPFTTDLQLSQELLGECEVGMAGPR 176 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + N + E+ K II LTDG ++ + E+ A Sbjct: 177 TAFGDAIGLGVNLFDEDTERA-----------KTIIALTDGNDTKSKVPPVEA---ARVA 222 Query: 313 KRRGAIVYAIGVQAEA-------ADQFLKNCAS--PDRFYSVQNSRKL 351 +R +Y + + +Q LK+ AS +++ + R L Sbjct: 223 TQRDIKIYTVAIGDPTTVGEDKLDEQSLKDVASETGGKYFFRRGPRAL 270 >gi|87306401|ref|ZP_01088548.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] gi|87290580|gb|EAQ82467.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] Length = 578 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 76/191 (39%), Gaps = 19/191 (9%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G I +L A+L+ V+ M L ++ + F ++++L +D + L A ++ E+ G Sbjct: 19 RRGVIVVLAAVLMIVMMGFMALSVDVGYMFTMQSQLQRSVDSAALAGAGTLIEGEDVATG 78 Query: 73 KKQKNDFSYRIIKNIWQT--------DFRNELRENGFAQDINNIE-RSTSLSIIIDDQHK 123 ++ + W+ + L + G + E TS ++ +++ Sbjct: 79 TVH-EYLTHNPVGLQWKEFTEGNTADNVDKFLTKYGDGLQLTIGEWNDTSGQVVAAEKNP 137 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 ++ MPF F + S S D+M+VLD+S SMND Sbjct: 138 TTVSVRMTYENMPF---FFGHLLGRDSFDITAESIATYQS-----RDIMLVLDLSGSMND 189 Query: 184 HFG-PGMDKLG 193 + KLG Sbjct: 190 DSEFNSIGKLG 200 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 61/203 (30%), Gaps = 33/203 (16%) Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS---SKIVQTFPLA 234 S S D + + S+ + + + + R GL ++ + L Sbjct: 379 SNSTPDLWKASAQPITAVKNSVDLFIHF---MQEGDGRDRIGLAVYNAPNGDGLLESTLT 435 Query: 235 WGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + I + + G + T G+ ++ K K ++ LT Sbjct: 436 ENLPFIMTQSRQRQAGHYHNYTNIGGGMTVGREELQTRGRKGA---------VKMMVLLT 486 Query: 292 DGE------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFY 343 DG+ + N L +G + I + A A + A + + Sbjct: 487 DGQANWVNGGVNNNAAKNYVLNEAYLCADQGFTIITISLGAGADKALMDQVAEITGGVHF 546 Query: 344 SV-------QNSRKLHDAFLRIG 359 +V + S L + F ++ Sbjct: 547 NVPGGQTVDEYSEDLTEIFRQVA 569 >gi|332232505|ref|XP_003265445.1| PREDICTED: collagen alpha-5(VI) chain [Nomascus leucogenys] Length = 2526 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 44/197 (22%), Positives = 83/197 (42%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ + M I + ++K + V+ G + +S + Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 229 QTFPLAW--GVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L++A N +F EH ++ + + K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHA-NALF----TEEHGSRINQNVKQ 918 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG+ + D+ + +E + +G ++A+GV + + + V Sbjct: 919 MLIVITDGK----SHDHDQLNDTASELRDKGITIFAVGVGKANQKELEGMAGNKNNAIYV 974 Query: 346 QNSRKLHDAFLRIGKEM 362 N KL D F + + M Sbjct: 975 DNFDKLKDVFTLVQESM 991 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 70/200 (35%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ + + R + E+ ++ PD VR G+V +S Sbjct: 442 DIHFLIDGSSSIQEK------QFEQIKRFMLEVTEMFSIGPDK---VRVGVVQYSDDTEV 492 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + I + +N T + L+Y I + + Y Sbjct: 493 EFYITDYSNDIDLRKAILNIKQITGGTYTGRALDYILQIIKNGMKDRMSK------VPCY 546 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG + + + ++A+G+ A + + +R Q Sbjct: 547 LIVLTDG------MSADRVVEPAKRLRAEQITIHAVGIGAANKIELQEIAGKEERVSFGQ 600 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N DA I E+V++ Sbjct: 601 N----FDALKSIKNEVVREI 616 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 25/190 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + ++ +L I+ D G+V FS Sbjct: 623 EDMKADIMFLVDSSWSIGNE------NFRKMKIFMKNLLTKIQIGADKTQ---IGVVQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K + F L + Q I + I+R+ T + L + H Sbjct: 674 DKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFT-------HSKGARL 726 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 727 GAKKFLILITDGVARDDVRDPARIL------RGKDVTIFSVGVYNANRSQLEEISGDGSL 780 Query: 342 FYSVQNSRKL 351 + V+N L Sbjct: 781 VFHVENFDHL 790 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 85/205 (41%), Gaps = 27/205 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S DH GP I +M++ S+P N R L +S + Sbjct: 30 DVVFLVDSS----DHLGPK--SFPFVKTFINKMIN---SLPIEANKYRVALAQYSDEFHS 80 Query: 230 TFPLAW--GVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L+ G + + + G + + L A+ F A + + Sbjct: 81 EFHLSTFKGRSPMLNHLKKNFQFIGGSLQIGKALREAHRTYFSAHTN----GRDKKQFPP 136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ L + ++++ + ++A K+ G + ++GVQ EA+++ LK A+ ++ Sbjct: 137 ILVVL-------ASAESEDEVEEASKALKKDGVKIISVGVQ-EASEENLKAMATSHFHFN 188 Query: 345 VQNSRKLHDAFLRIGKEMVKQRILY 369 ++ R L F + +++K Y Sbjct: 189 LRTIRDL-STFSQNMTQIIKDVTKY 212 >gi|288919019|ref|ZP_06413360.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349559|gb|EFC83795.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 319 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 71/211 (33%), Gaps = 29/211 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM +L A + + +D + P N GLV+F+ Sbjct: 89 IILAIDVSNSMAAT-DIQPTRLEAAKQGAQAFVDQL---PPRIN---LGLVSFAGSAAVL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ I L G T G+ + I A E++ +G I+ L Sbjct: 142 VPASTDRESVRSGIRGLQLGPATAVGEGIFASLQAITTAGERMS--DEGQPPPPAAIVLL 199 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DGE + + A+ V I + L+ Sbjct: 200 SDGETTRGRPNT----QAATAARDAEVPVDTIAYGTSDGTLDVGGQQIPVPVNEDALREL 255 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A + ++ +L + +G + + Sbjct: 256 AEQTGGSYHRATTGDELQSVYRGLGSSIGYR 286 >gi|91216720|ref|ZP_01253685.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] gi|91185189|gb|EAS71567.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] Length = 349 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 74/205 (36%), Gaps = 29/205 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 + + FI +C K+ + G+D++ LDVS SM ++ Sbjct: 54 KANLKFIVLALAFCCFVMALVNPKLG-TKMETIKREGVDIVFALDVSKSMLAEDIAP-NR 111 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---- 247 L + R I E+++ + + R GLV ++ P+ + + + Sbjct: 112 LEKSKRIITEIVNKLTA-------DRVGLVGYAGSAFPQVPITTDYASTKTFLQSMNTDM 164 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + T + ++ A + D + K +I L++GE+ N+++ Sbjct: 165 VSSQGTAISQAIDLAKSYYNDDDQTN-----------KVLIILSEGEDHDSNVES----- 208 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF 332 A G +Y IGV E D Sbjct: 209 MAETAAAEGIKIYTIGVGTERGDPI 233 >gi|307942638|ref|ZP_07657986.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] gi|307774277|gb|EFO33490.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] Length = 403 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 72/416 (17%), Positives = 147/416 (35%), Gaps = 85/416 (20%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 I+ N KGSI+I A+L +I + + I+ S + + + + D L+ T Sbjct: 8 RIQALKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSLFAVT--- 64 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 K + S + +TD R L + D + S ++I +D + K Sbjct: 65 -----TFRKYVADGMSKNQARKRAETDARKFLTARTKSLDGTTEKFSIKINI-VDREAKV 118 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-- 182 + + + S I S + + ++DVS SM Sbjct: 119 VKANVNISGKHE---SYMTHAMGFDNIDYTADSESTI-SFGQGKYEFIFLVDVSPSMGIG 174 Query: 183 --------------------DHFGPGMD---------KLGVATRSIREMLDIIKSIPDVN 213 + + + ++ V +++ ++ ++ +V+ Sbjct: 175 ASNRDRQIMQRAIGCQFACHEPWYSSVSRAKSAGARLRIDVVKDALKSLVTQLEEATEVD 234 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF------GSTTKSTPGLEYAYNKIF 267 +R+GL +FS+ + L G+ + + N++ G T ++ +F Sbjct: 235 --LRTGLYSFSNYLHIQTGLNKGISKFKREANKIAIHREYLRGGGTN--------FHGVF 284 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGE--------------NSSPNIDNKESLF---YCN 310 + K D K++II ++DG N +PN F +C+ Sbjct: 285 SDFNGVLRSLKPKADVKQHIIIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCD 344 Query: 311 EAKRRGA-IVYAIGVQAEAADQF------LKNCA-SPDRFYSVQNSRKLHDAFLRI 358 E K+ V+ + V+ + A ++ CA S D FYS ++ ++ A + Sbjct: 345 EFKKGEVRTVHTMLVEPDRAHYVRASTSSMRACATSADFFYSANSAAEIDKASKTV 400 >gi|254292617|ref|YP_003058640.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] gi|254041148|gb|ACT57943.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] Length = 514 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 41/267 (15%), Positives = 87/267 (32%), Gaps = 27/267 (10%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 ++ F ++ + A+ L VI ++G I+ K L D ++L A L Sbjct: 8 LKQFLNATNAGVAPMFALFLTVILFIIGFTIDFRRMDSAKMHLQAATDSAVLAAARAYLT 67 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 K++ D S +I + + + N F + + I+ + K Sbjct: 68 SSVQVKETKRQED-SQKIASDYLTANLLSS--SNNFENNQIQLVFKEDGEIVGNASTK-- 122 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 I F S L ++ + L++++VLD S SM Sbjct: 123 ------------IKLIFGGLFGKSDVVLPALAAATVGDSRK--LEIVLVLDTSGSM---- 164 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEK 243 +++ + ++ + V+ G+V +++ I P W Sbjct: 165 -SSQNRMKQLRTASINFVNSVFDNAVYERTVQVGVVPWNATVNINMDRPGTWDASP-GPA 222 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAK 270 I+ +G+ T + ++ Sbjct: 223 IHNSNYGNGTNQVTSFQDFTENLYPPG 249 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 57/191 (29%), Gaps = 57/191 (29%) Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAY---------NKIFDAKEKLEHIAKGHDDY 283 ++ I +K+N+L T + GL + Y N F + + Sbjct: 323 MSQSRPQIIKKLNQLNPSGNTHADIGLMWGYRMFSQQANWNNFFGYNSDTKPDSFHSTKS 382 Query: 284 KKYIIFLTDGENSSPNID------------------------------------------ 301 +K +I LTDGEN++ N + Sbjct: 383 RKIMIMLTDGENTATNSEGYSYYGWCTYTNHYNKWGRYTGSTKDCEVPKGINKDEISNND 442 Query: 302 -NKESLFYCNEAKRRGAIVYAIGVQAEA-----ADQFLKNCASPDRFYSVQNSRKLHDAF 355 N L C + + ++ I + + A L+ CA D +L + F Sbjct: 443 LNSLMLDACEVIRSKDVELFTIALDLHSYYDSTAIALLRECAGSDSHAYNIKGNELDETF 502 Query: 356 LRIGKEMVKQR 366 + + ++ Sbjct: 503 QELASKALRLS 513 >gi|116626306|ref|YP_828462.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116229468|gb|ABJ88177.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 310 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 42/215 (19%), Positives = 84/215 (39%), Gaps = 27/215 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S ++ +++SD+ L + +++D S S+ D F D + IK + N Sbjct: 75 SIMEFNAESDLPLRLGILIDTSNSIRDRFKFEQDAAS----------EFIKGVVHANQ-D 123 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ LV+F +K L + + I L G T Y+ IF A Sbjct: 124 KAMLVSFDTKAELVSDLIGDTEKLDHAIRSLRPGGGTAL-------YDAIFFACRDKLSQ 176 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA----ADQF 332 + +++ I+ ++DG+++ ++L +A ++Y+I D+ Sbjct: 177 DQPKHKFRRAIVIVSDGDDNQSQYTRDQALEMAQKAD---VVLYSISTNISKIESDGDKV 233 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 LK A + + + L +F I E+ Q Sbjct: 234 LKYYAAETGGKAFFPFKVEDLEQSFENIANELRHQ 268 >gi|311030436|ref|ZP_07708526.1| hypothetical protein Bm3-1_07816 [Bacillus sp. m3-13] Length = 921 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 67/204 (32%), Gaps = 32/204 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L +++VLD S SM G D A E+L + G + F ++ Sbjct: 405 SLGLIIVLDRSGSM---MGEKFDLAKEAAARSVELLKEEDTF---------GFIAFDTEA 452 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V + E I G T P L AY ++ + D +K Sbjct: 453 WTVVETEPIKNKDEVIETIRSTALGGGTDIFPALNQAYQQLNEM-----------DLKRK 501 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFY 343 +II LTDG+++ + E + + + +A L+ A RFY Sbjct: 502 HIILLTDGQSNDGPYE-----EIIEEGLTNNVTLSTVAIGGDADTSLLEELAEIGTGRFY 556 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRI 367 V + + R K I Sbjct: 557 EVYEASAVPSILSRETALTTKTYI 580 >gi|331694297|ref|YP_004330536.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948986|gb|AEA22683.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 332 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 72/215 (33%), Gaps = 38/215 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ DVS SM +L A + R + + VR G+V F + + T Sbjct: 89 VVLAFDVSGSMAATDIAP-TRLEAAKAAARGFVQRQPAA------VRIGIVAFGATGLVT 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI--------FDAKEKLEHIAKGHDD 282 + I+RL T GL+ A I + Sbjct: 142 QQPTSDRASVVAAIDRLSPQGGTALGGGLQTALGAIVGKPVVVPGSDPGGGPEPSGPDLG 201 Query: 283 Y--KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----------- 329 Y ++ LTDGEN++ + L + A G VY IG+ + A Sbjct: 202 YHGSAAVVLLTDGENTA----QPDPLQVADIASTAGVKVYPIGLGSPAGTVLQIDGFQIA 257 Query: 330 ----DQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + L+ A + R+++ + L + + Sbjct: 258 TRLDEPLLQQIADRTDGRYFAASDPAALAAVYDAV 292 >gi|168699403|ref|ZP_02731680.1| hypothetical protein GobsU_07777 [Gemmata obscuriglobus UQM 2246] Length = 354 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 77/231 (33%), Gaps = 46/231 (19%) Query: 167 IGLDMMMVLDVSLSMNDHF------GPGMDKLGVATRSIREMLDIIKSIPDVNNVV-R-- 217 G+ +++ LDVS SM P + +L A R+++ L + PD R Sbjct: 87 KGIALVVALDVSGSMGAEDVVWTPGAPSVSRLEAARRALKLFL-AGGAAPDGTAFDPRPG 145 Query: 218 --SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 GLV F++ P + + + L + I D+ + Sbjct: 146 DAVGLVAFAAVPETVCPATLNHSVLFKVADALQPKGGADAG-------TNIGDSLAEAVI 198 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKES-----------LFYCNEAKRRGAIVYAIGV 324 D + +I L+DGE++ D +++ A G VY I Sbjct: 199 RLDAADQKSRVLILLSDGEHNILKEDVRDAQRPGIDRTLKPREAAQLAANLGVRVYTIDA 258 Query: 325 QAE--------------AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 + A + LK+ A + + + + +L A+ I Sbjct: 259 GGDPPLGAPPDAVAQRFAGRKALKDVAEMTGGKSFQATSGAELLSAYREIS 309 >gi|146337718|ref|YP_001202766.1| hypothetical protein BRADO0587 [Bradyrhizobium sp. ORS278] gi|146190524|emb|CAL74523.1| conserved hypothetical protein; putative vWFA domain [Bradyrhizobium sp. ORS278] Length = 442 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 59/445 (13%), Positives = 128/445 (28%), Gaps = 98/445 (22%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 L + F N G++++ AI+ + +G ++ S ++AKL +D + + ++ Sbjct: 6 LLLSRFRRNAGGNVAVTFAIVCVPVITAVGCGVDYSRTNQMRAKLQAAVDAASVGAVSR- 64 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 + + N+ F +++ T S+ + + Sbjct: 65 ----------TSPAFIAAGAMTTDGVIAAGNDDARKIFNGNMSGTTGYTLDSLTPEVKKT 114 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 L+A + + + + + S + +D ++LD S SM Sbjct: 115 GSVLTATVSFSATVPTLF---MSIVGYKTMSLQGSSTAKASMPKYIDFYLLLDNSPSMGV 171 Query: 184 HFGPG-----------------------------------MDKLGVATRSIREMLDIIKS 208 P ++ V + ++++D + Sbjct: 172 AATPADVTKMVSATSDKCAFACHDYNDANNYYNLAKTLGVTTRIDVLRSATQQLMDTAQQ 231 Query: 209 IPDVNNVVRS-----GLVTFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPG-L 259 +N R G + + + F L+ + + I+ + S Sbjct: 232 TQTYSNQFRMAIYDFGASSKTIGLRALFALSSSLTSAKSAAGNIDLMGVYGNNDSFTADK 291 Query: 260 EYAYNKIFDA--KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF---------- 307 + Y A E KY+ F++DG N + Sbjct: 292 DTPYTTALPAINNEIATPGDGTSGSPLKYLFFVSDGVADESNAACLKPKASGNRCQSPIN 351 Query: 308 --YCNEAKRRGAIV---YAIGVQAEAA-----------------------DQFLKNCASP 339 C K RG + Y +Q Q ++ CAS Sbjct: 352 PALCTALKNRGIKIAVLYTTYLQLPTNSWYMSWIDPFNKGPFGPSPNSEIAQNMQACASD 411 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ V ++ + DA + K+ V Sbjct: 412 GFYFEVSPTQGIADAMNALFKKAVA 436 >gi|33592721|ref|NP_880365.1| hypothetical protein BP1639 [Bordetella pertussis Tohama I] gi|33572367|emb|CAE41926.1| putative exported protein [Bordetella pertussis Tohama I] gi|332382136|gb|AEE66983.1| hypothetical protein BPTD_1619 [Bordetella pertussis CS] Length = 336 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 85/245 (34%), Gaps = 41/245 (16%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHF 185 R ++ + A + P + + + D+++V+D+S SM+ D Sbjct: 57 GRAQLWLNVAVWLLLALALARPQWVEPPLT---HVEPMRDILLVVDISQSMDSEDFRDAQ 113 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 G + + + +D R GL+ F + PL ++ + Sbjct: 114 GRPASRWQAVQAVVGDFID-------KRPDDRLGLIVFGAGAYPQAPLTRDHAALRLLLQ 166 Query: 246 RL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 R + G T + + A E+ K +I LTDG +++ + Sbjct: 167 RTAVGMAGPNTALGDAIGLGIRMLDHAGER-----------DKILILLTDGNDTASAVPP 215 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHD 353 + + + +V+ IG+ AA L++ A + RF+ ++ L + Sbjct: 216 ARAAELAAQHR---VVVHTIGIGDPAASGEDRVDFDALRDIARIAGGRFFRARDQASLQE 272 Query: 354 AFLRI 358 + + Sbjct: 273 VYATL 277 >gi|109071570|ref|XP_001110086.1| PREDICTED: collagen alpha-1(XXI) chain-like isoform 2 [Macaca mulatta] Length = 959 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 86/216 (39%), Gaps = 30/216 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + K +++I K+ ++ G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGPENFEIVKKW---------LVNITKNFDIGPKFIQVGVV 79 Query: 222 TFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S V PL ++ +H+ + L G T++ +++A + +F AK Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 K + LTDG++ D A+ ++AIGV +E D L+ A Sbjct: 131 SSRFLTKIAVVLTDGKSQDDVKD------AAEAARDSKITLFAIGVGSETEDAELRAIAN 184 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 S + V++ + I +++ ++ + + Sbjct: 185 KPSSTYVFYVEDYIAISKIREVIKQKLCEESVCPTR 220 >gi|313838674|gb|EFS76388.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] Length = 320 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 81/245 (33%), Gaps = 33/245 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + +++ + + + ++ D +++ +DVS SM +L A + Sbjct: 59 LAMGLSVLSMAIIVLAFAQPKAYHEVPRDRATVVVAIDVSRSMVATDVEP-SRLSAAKTA 117 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ L +P N LV F++ + I L +T G Sbjct: 118 AKDFL---GDLPPRFN---VSLVKFAASAQVVVAPTTDRAAVSTAITNLQVLPSTAIGEG 171 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + N + + +H I+ L+DG + + SL EA R+ Sbjct: 172 IYSSLNALKLVPDDPKH---PGQKPPAAIVLLSDGATNVG----RPSLEAAKEAGRQHVP 224 Query: 319 VYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 VY I L A S +S ++ +L D + I + + Sbjct: 225 VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSV 284 Query: 363 VKQRI 367 +++ Sbjct: 285 GYEKV 289 >gi|297626138|ref|YP_003687901.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921903|emb|CBL56463.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 321 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 72/209 (34%), Gaps = 29/209 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM ++L A ++ +D + S +V LVTF+ Sbjct: 91 IVVTIDVSRSMEATDVTP-NRLDAAKSGAKDFVDSLPSAFNV------ALVTFAGTANVK 143 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P ++ I+ + +T G+ + + + + I+ L Sbjct: 144 MPPTTDRTQLKAAIDAIRLAPSTAIGEGIYTSLDVLEKLAPQDPDHPDDPAPGA--IVLL 201 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG + ++S EAK++ +Y I L Sbjct: 202 SDGATNMG----RDSADAATEAKKKNVPIYTIAYGTSTGYVVENGQRQTVAVNHAELSQV 257 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 A S + YS + + L + I +++ Sbjct: 258 AKLSGGKKYSADSMKNLQAVYQTISRQIG 286 >gi|119493582|ref|ZP_01624246.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452572|gb|EAW33755.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 414 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 46/223 (20%), Positives = 80/223 (35%), Gaps = 29/223 (13%) Query: 147 NSSHAPLLITSSVKISS-KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 SS L ++ S +S + L++ ++LD S SM+ L ++ E++D Sbjct: 19 GSSQRQLSMSVSAIPNSVDRHVPLNLCLILDHSGSMSGS------PLETVKKAAGELIDR 72 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + N R +V F + P + I+++INRL T GL+ Sbjct: 73 L------NPGDRISVVVFDHRAKVLIPNQDIDDPESIKKQINRLRTSGGTSIDEGLKLGI 126 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ K + A LTDGEN DN L A + ++G Sbjct: 127 EELGKGKVERISQA----------FLLTDGENEHG--DNNRCLKLAKLATDYNLTLNSLG 174 Query: 324 VQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + L+ A ++ ++ D F R+ M Sbjct: 175 FGNDWNQDILEKIADEGGGTLAYIEYPEQVIDEFSRLFNRMQS 217 >gi|163747459|ref|ZP_02154811.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] gi|161379312|gb|EDQ03729.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] Length = 476 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 61/456 (13%), Positives = 145/456 (31%), Gaps = 94/456 (20%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT--KI 63 +R F G +++ +++ ++ ++ G+ ++ +A++ + D ++L A + Sbjct: 19 LRRFAREEDGLVTLFAILMILLMILLGGVGVDLMRHERERARVQAVADRAVLAAADLDQT 78 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN--IERSTSLSIIIDDQ 121 L+ E K+ + I + ++D+ I++ + + + Sbjct: 79 LSPEAVARDYFDKSGLADYISSVTVEEGLNYRRVTVDASRDLKTMFIDKFGQEKLHVPAK 138 Query: 122 HKDYNLSAVSRYEMPFIFC------------------TFPWCANSSHAPLLITSSVKISS 163 A M + L+ S V ++ Sbjct: 139 ATAEEKVAKVEISMVLDISGSMRENDKMNNLHDASNVFIDTVIQTDTEDLISISVVPYTA 198 Query: 164 KSDIGLDMMMVLDV----SLSMNDHFGPGMDKLGV-ATRSIREMLDIIKS--IPDVNNVV 216 + ++G D+M L+V S S F L + E + ++ + N+ Sbjct: 199 QVNVGKDIMDELNVTQLHSYSHCVDFEDSDFNLTTISQTRSYEHMQHFEAGYYWNGNDRD 258 Query: 217 RSGLVT-------FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R+G + + ++ +I + T GL++ + + Sbjct: 259 RTGHYDNISNPGCPKQSYEEIETFSQNAAALKSRIANFQPRANTAIHLGLKWGVALLDPS 318 Query: 270 KEKLEHIAKGHDDYK------------KYIIFLTDGEN---------------------- 295 + G ++ K +I +TDG N Sbjct: 319 FRAINEAIGGDAVFRGRPAEYNDIDTLKTVILMTDGVNVTTRRIAPEAYSNRDHYRHWSD 378 Query: 296 ---------------------SSPNIDNKESLF--YCNEAKRRGAIVYAIGVQA-EAADQ 331 + ++L C+ AK +G ++++IG + + Sbjct: 379 YPFYWWLGRNVRSSEHYRWYRTKYTAGQADNLLDNICDAAKAKGIVIWSIGFEVTDHGAA 438 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +KNCAS D + ++ DAF I +++ + R+ Sbjct: 439 VMKNCASSDSHFFRVEGVEIVDAFEAIARQINQLRL 474 >gi|296232325|ref|XP_002761549.1| PREDICTED: hypothetical protein LOC100408376 [Callithrix jacchus] Length = 912 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 81/217 (37%), Gaps = 19/217 (8%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNN 214 S K + D +D+ VLD S S+ P + + +D ++ N Sbjct: 24 ESPKAVAFQDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRN 83 Query: 215 VV-RSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDA 269 +V +G + +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 84 LVWNAGALHYSDEVEIIQGLTRMPGGRDSLKSSVDAVKYFGKGTYTDCAIKKGLEQLLVG 143 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEA 328 H KY+I +TDG + L NEAK G V+++ + + Sbjct: 144 G--------SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDH 195 Query: 329 ADQFLKNCASPDRF---YSVQNSRKLHDAFLRIGKEM 362 + L A+ + ++ + + DA I + + Sbjct: 196 LEPRLSIIATDHTYRRNFTAADWGQSRDAEEAISQTI 232 >gi|255598079|ref|XP_002536925.1| conserved hypothetical protein [Ricinus communis] gi|223518102|gb|EEF25458.1| conserved hypothetical protein [Ricinus communis] Length = 451 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 54/322 (16%), Positives = 105/322 (32%), Gaps = 51/322 (15%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R+F + +G IS++ + L + V+GL + V++KL+ D + L A I N Sbjct: 3 RSFPHAQRGGISLMVIVSLTTLLAVVGLAFSAGLSYLVRSKLNAATDAAGLAAARAISNG 62 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + R N L N DI+ + ++I + Sbjct: 63 TTQADQIANAKAAGQRFFH---ANFPSNYLMSNATLNDISVTFSGSEVTIGVS------- 112 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + +P + ++ ++ LDM++V+D S S++ Sbjct: 113 ----ASASLPAA-LFGGFGTSALA-------PAVVTETKRKDLDMIVVMDTSGSLSPSAA 160 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-----SKIVQTFPLAWGVQHIQ 241 S L+ + D R GLV F+ ++ + + Sbjct: 161 N-------VRSSAITFLNQFNATRD-----RVGLVHFAFGAIVDDAIRQTARGFDRASMT 208 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI- 300 I F +T S G+ A +I + + + I+F +DG +S Sbjct: 209 NHIKAYAFSGSTASAEGMYTARQQINSVPTANLNRSNM-----RVIVFFSDGAPNSFGAY 263 Query: 301 -DNKESLFYCNEAKRRGAIVYA 321 + K + + +Y Sbjct: 264 LNWKPGVACADPG-----TIYT 280 >gi|310814568|ref|YP_003962532.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] gi|308753303|gb|ADO41232.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] Length = 1160 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 76/205 (37%), Gaps = 27/205 (13%) Query: 152 PLLITSSVKIS-SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 PL S + G+ M+ VLD S SM+ G + +L VA +++ +++ Sbjct: 731 PLETLSPLSARLPHEGPGIAMVFVLDRSGSMSQTVGD-VTRLDVAKQAVSAAANLLDP-- 787 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 G+V F S+ PL I + L G T PGL+ A+ + Sbjct: 788 ---QTGSLGVVMFGSEAEVALPLGPLPDAAGIAAALGHLQPGGGTNIYPGLQLAFQALRA 844 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + H I+ +TDG + D + + G V ++ + + + Sbjct: 845 SDADARH-----------IVVMTDGMS-----DEADFPGLLAAIRAEGITVSSVAIGSTS 888 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKL 351 ++ A RF++ ++ L Sbjct: 889 ETSIAEDIALLGGGRFHNTRDFGAL 913 >gi|38348304|ref|NP_940898.1| von Willebrand factor A domain-containing protein 2 [Homo sapiens] gi|34527908|dbj|BAC85505.1| unnamed protein product [Homo sapiens] Length = 725 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 51/226 (22%), Positives = 90/226 (39%), Gaps = 32/226 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSD-----IGLDMMMVLDVSLSMNDHFGPGMDK 191 F+F P + + KIS+ S +D+M +LD S S+ G Sbjct: 13 FLFSRVPPSLPLQEVHVSKETIGKISAASKMMWCSAAVDIMFLLDGSNSV------GKGS 66 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIF 249 + + D + P+ VR G FSS FPL ++ Q ++ +I R++F Sbjct: 67 FERSKHFAITVCDGLDISPER---VRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVF 123 Query: 250 GST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T++ L+Y ++ + + +I +TDG++ + L Sbjct: 124 KGGRTETELALKYLLHRGLPGGR--------NASVPQILIIVTDGKSQGDVALPSKQL-- 173 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 K RG V+A+GV+ ++ L AS R V + ++ DA Sbjct: 174 ----KERGVTVFAVGVRFPRWEE-LHALASEPRGQHVLLAEQVEDA 214 >gi|74722595|sp|Q5GFL6|VWA2_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; AltName: Full=Colon cancer secreted protein 2; Short=CCSP-2; Flags: Precursor gi|50429312|gb|AAT77225.1| colon cancer secreted protein-2 [Homo sapiens] gi|50429314|gb|AAT77226.1| colon cancer secreted protein-2 [Homo sapiens] Length = 755 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 51/226 (22%), Positives = 90/226 (39%), Gaps = 32/226 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSD-----IGLDMMMVLDVSLSMNDHFGPGMDK 191 F+F P + + KIS+ S +D+M +LD S S+ G Sbjct: 13 FLFSRVPPSLPLQEVHVSKETIGKISAASKMMWCSAAVDIMFLLDGSNSV------GKGS 66 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIF 249 + + D + P+ VR G FSS FPL ++ Q ++ +I R++F Sbjct: 67 FERSKHFAITVCDGLDISPER---VRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVF 123 Query: 250 GST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T++ L+Y ++ + + +I +TDG++ + L Sbjct: 124 KGGRTETELALKYLLHRGLPGGR--------NASVPQILIIVTDGKSQGDVALPSKQL-- 173 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 K RG V+A+GV+ ++ L AS R V + ++ DA Sbjct: 174 ----KERGVTVFAVGVRFPRWEE-LHALASEPRGQHVLLAEQVEDA 214 >gi|51893456|ref|YP_076147.1| hypothetical protein STH2318 [Symbiobacterium thermophilum IAM 14863] gi|51857145|dbj|BAD41303.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 414 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 76/217 (35%), Gaps = 29/217 (13%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + + + + + + L++ V+D S SM L +++R ++D Sbjct: 21 GEVYLLVTVKAPRMPAPEGRPPLNLAAVVDRSGSMAGA------ALYFTKQALRFLVD-- 72 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + R +VT+ ++ FP ++ ++ + G TT + GL Sbjct: 73 ----QMAEEDRLAIVTYDDQVHVPFPSQPVVQKDAVRLLVDGITAGGTTNLSGGLATGMQ 128 Query: 265 KIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 +I + H + ++ +TDG + D + + +G V + Sbjct: 129 QI-----------RPHAGPGRVSRVLLMTDGLANVGVTDPDVLAGWARAWREKGLAVSTM 177 Query: 323 GVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 GV ++ L A F+ + N ++ F Sbjct: 178 GVGPHFSEDLLVALAEAGGGNFHYIANPDQIPRIFQE 214 >gi|27657363|emb|CAD60276.1| AMACO [Homo sapiens] Length = 755 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 51/226 (22%), Positives = 90/226 (39%), Gaps = 32/226 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSD-----IGLDMMMVLDVSLSMNDHFGPGMDK 191 F+F P + + KIS+ S +D+M +LD S S+ G Sbjct: 13 FLFSRVPPSLPLQEVHVSKETIGKISAASKMMWCSAAVDIMFLLDGSNSV------GKGS 66 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIF 249 + + D + P+ VR G FSS FPL ++ Q ++ +I R++F Sbjct: 67 FERSKHFAITVCDGLDISPER---VRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVF 123 Query: 250 GST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T++ L+Y ++ + + +I +TDG++ + L Sbjct: 124 KGGRTETELALKYLLHRGLPGGR--------NASVPQILIIVTDGKSQGDVALPSKQL-- 173 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 K RG V+A+GV+ ++ L AS R V + ++ DA Sbjct: 174 ----KERGVTVFAVGVRFPRWEE-LHALASEPRGQHVLLAEQVEDA 214 >gi|254519993|ref|ZP_05132049.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226913742|gb|EEH98943.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 960 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 53/277 (19%), Positives = 97/277 (35%), Gaps = 40/277 (14%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 K+I D + NGF +I + +I Y A+ Y+ + P Sbjct: 337 KSIVLNDVHRDDLSNGFMDNIEAYVKDYGGGLITFGGEDSY---ALGGYKDTSLEKVLPV 393 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + K + + +++D S SM+ G G+ KL +A + + L+ Sbjct: 394 YMD------------KRGKNEVPAISINLIIDKSGSMSAE-GGGVSKLTLAKEAAMKALE 440 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ + +++ ++ F + PL + I+E I+ + T P LE Sbjct: 441 NLREVDEIS------VIAFDDTYDEVVPLQKVGDKEAIKELISGIQIRGGTSIYPALEQG 494 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 YN + K++H I LTDG++ +DN +L + + Sbjct: 495 YNMQMQSSAKIKHT-----------ILLTDGQDGYG-LDNYATLL--QNFIDNNITLSTV 540 Query: 323 GVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLR 357 V A L AS R Y + F + Sbjct: 541 AVGEGANAGLLNQLASIGKGRSYYTDIYTDIPRIFAK 577 Score = 36.7 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 55/175 (31%), Gaps = 40/175 (22%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +LDVS S +D G D + A I+S+P N G+V F Sbjct: 61 KGRNISTVFLLDVSESASDFEESGKDFISTA----------IESMPRGNKA---GVVLFG 107 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + + I+ + T +E A Sbjct: 108 DNSKIDKVL--NKKKEYKSIDEKPVVTATNIQEAVESALGLFERGG-------------S 152 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAK----------RRGAIVYAIGVQAEAA 329 K I+ +TDGE + D +S NE K +G +Y V+ Sbjct: 153 KRIVLITDGEENQG--DILKSTPLINEQKIDFKVYKITGEKGEEIYVDNVKVPDN 205 >gi|118355471|ref|XP_001010995.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292762|gb|EAR90750.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 787 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 103/266 (38%), Gaps = 33/266 (12%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVS---RYEMPFIF-CTFPWCANSSHAPLLITS 157 +I + +S + D K +L + ++ T P + Sbjct: 170 EDNIQEVVKSKQNNKNSYDLEKGLSLDVKTFQKHFQFNNSQDQTIPIMVSVKTLEQTSDM 229 Query: 158 SVKISS-KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++ + + LD++ V+D S SM+ K+ +I +++D++ N+ Sbjct: 230 EIQSNLLEGRPNLDLICVIDNSGSMDGE------KIENVKNTILQLIDML------NDHD 277 Query: 217 RSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 R ++TF+S Q L +++Q+ + T T GL+ A++ + + K++ Sbjct: 278 RLSIITFNSYAKQLCGLRKVNKDNKENLQKITKSIQADGGTNITSGLQTAFSILQNRKQR 337 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQ 331 + + L+DG++++ + + L + + +++ G + Sbjct: 338 NSVSS---------VFLLSDGQDNNSDSRIRNLLQTTYQQLQEECFTIHSFGFGNDHDGP 388 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAF 355 ++ A FY V+ + ++ + F Sbjct: 389 LMQRIAQIKDGSFYYVERNDQVDEFF 414 >gi|223939936|ref|ZP_03631804.1| von Willebrand factor type A [bacterium Ellin514] gi|223891427|gb|EEF57920.1| von Willebrand factor type A [bacterium Ellin514] Length = 346 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 40/195 (20%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 T + P + + G+D+M+V D+S SM M Sbjct: 61 LRIMTVALLVVALAGPRFAHDRTETQA---SGVDIMLVFDLSWSM---MVLDMGGHDETG 114 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----ST 252 +++ + R GL+ FS PL + E ++RL G Sbjct: 115 TRFGIASAVLEDFVNKRPNDRIGLIVFSGVPYLASPLTLNHDWLVENLHRLHIGIIRELG 174 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T A ++ +K+ + II LTDG+N+ I+ + A Sbjct: 175 TAIGDATAAATKRLQMSKDSK----------SRIIILLTDGDNNQGEIEPVPAAQL---A 221 Query: 313 KRRGAIVYAIGVQAE 327 GA +Y IG+ E Sbjct: 222 AAIGAKIYTIGLGIE 236 >gi|255566338|ref|XP_002524155.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223536573|gb|EEF38218.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 49/202 (24%), Positives = 76/202 (37%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ GLD++ VLDVS SM DK+ + ML +IK + ++ R +V Sbjct: 55 STNDRPGLDLVAVLDVSGSMAG------DKIAKVKTA---MLFVIKKLSPID---RLSVV 102 Query: 222 TFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 FS+ + PL + ++ IN L T T GL+ + D + Sbjct: 103 KFSADASRLCPLRQITEDSQKDLENLINGLNADGATNITAGLQTGLKVLNDRSLSSGRVV 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 II ++DGE ++ + + VY G + LK A Sbjct: 163 G--------IILMSDGEQNAGGDAAQVPI--------GNVPVYTFGFGINHEPRVLKAIA 206 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 + F VQN+ L AF Sbjct: 207 NNSMGGTFSDVQNTDNLSLAFS 228 >gi|147898761|ref|NP_001080437.1| collagen alpha-1(VI) chain precursor [Xenopus laevis] gi|82210072|sp|Q801S8|CO6A1_XENLA RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|28703819|gb|AAH47255.1| Col6a1 protein [Xenopus laevis] Length = 1045 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 75/196 (38%), Gaps = 16/196 (8%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + ++ +D + S N+V +G + Sbjct: 61 DCPVDIFFVLDTSESVALRVKPFKTLVTQVKEFTKKFIDKLTSRYYRCDRNLVWNAGALH 120 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +S +++ L ++ +++ + + G T + ++ ++ H Sbjct: 121 YSDEVILINSLTRDMKTLRDNVETVEYIGKGTHTDCAIKRGIEEVLIGG--------SHQ 172 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 KY+I +TDG + L NEAK G V+++ + + L AS Sbjct: 173 KENKYLIVVTDGHPLEGYKEPCGGLEDAANEAKHLGIKVFSVAISPNHLEPRLSVIASDA 232 Query: 341 RF---YSVQNSRKLHD 353 ++ ++ L D Sbjct: 233 SHRRNFTATSAVGLTD 248 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 72/214 (33%), Gaps = 36/214 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFS 224 + D+ M++D S + + + ++ + + +K+ P + R +V +S Sbjct: 845 EGPADITMLVDSSTRVGNQH------FQTSKSFVKLLAERFLKAKPPPSGSARVSVVQYS 898 Query: 225 SKIVQTFPLAWGVQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q + + ++ N T L A +++ Sbjct: 899 GQNQQIVEAQFLTNYTVLEVPVDNMQFINGATNVVSALR-AVTELYREDSLAG------- 950 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV----QAEAADQFLKNCA 337 KK ++F +DG N+ + K L +A+ G +Y + V L A Sbjct: 951 VNKKLLVF-SDG-NTQ---EEKGLLKVVQDAQSAGIEIYVLAVGSRLNYPNLQVMLTGSA 1005 Query: 338 SP-------DRFYSVQNSRKLHDA--FLRIGKEM 362 + +R + V + L + I + + Sbjct: 1006 ADIAGPFPEERLFRVPDYTSLLQGVRYQSISRRI 1039 >gi|118088945|ref|XP_419902.2| PREDICTED: similar to collagen XXI [Gallus gallus] Length = 964 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 83/212 (39%), Gaps = 32/212 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + +++I ++ ++ G+V +S Sbjct: 41 APADLVFILDGSYSVGPENFEIIKSW---------LVNITRNFDIGPKFIQVGVVQYSDY 91 Query: 227 IVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 V PL + + I + G TK+ +++AY+ +F AK Sbjct: 92 PVLEIPLGT-HESTENLIKEMESIHYLGGNTKTGRAIQFAYDHLF---------AKSSRF 141 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA---SP 339 K + LTDG++ D EA++ ++AIGV +E + LK A S Sbjct: 142 LTKIAVVLTDGKSQDEVKD------VAAEARKNKITLFAIGVGSEIEEDELKAIANKPSS 195 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + V++ + I +++ ++ + + Sbjct: 196 TYVFYVEDYIAISRIKEVIKQKLCEESVCPTR 227 >gi|219849077|ref|YP_002463510.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219543336|gb|ACL25074.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 546 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 53/296 (17%), Positives = 99/296 (33%), Gaps = 49/296 (16%) Query: 92 FRNELRENGFAQDINNIERSTSLSIIIDDQHKDY-----------NLSAVSRYEMPFIFC 140 E F D I +++ S D + Y +S R P + Sbjct: 271 VAIYPAEGTFWHDNPFIIMASANSDERDAAERFYEFLLSEESQRAAMSFGFRPANPNVPL 330 Query: 141 TFPWCANSSHAPLLITSSVKISSKS------------DIGLDMMMVLDVSLSMNDHFGPG 188 T P P + + + + + D+++V+D S SM Sbjct: 331 TDPISPAFGVDPQGVQTVLAVPTAEVIVAIKNSWSLNRKRADIVLVVDTSGSMEG----- 385 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---IVQTFPLAWGVQHIQEKIN 245 DKL + I L ++ +P+ R GL+TF+S +V PL+ +Q+ + Sbjct: 386 -DKLTMVKAGIETFL--MRILPE----DRLGLITFASAARLVVPMAPLSDNRIALQDAVQ 438 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + T L + E + DD + I+ L+DG ++S + Sbjct: 439 AMRASGRTALFDALVLGKQVL-------EQLPPADDDRIRAIVLLSDGADNSSQASLDQI 491 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGK 360 +E G ++ + +A Q L A V ++ + F + + Sbjct: 492 RTLFDE---SGISIFPVAYGNDADRQVLDAIAEFSRTIVVVGDTGDIAQIFENLSR 544 >gi|332817900|ref|XP_526306.3| PREDICTED: collagen alpha-5(VI) chain isoform 2 [Pan troglodytes] Length = 2526 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 45/197 (22%), Positives = 81/197 (41%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ + M I + ++K + V+ G + +S + Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 229 QTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L++A N +F EH ++ + K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 918 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D+ + E + +G ++A+GV + + + V Sbjct: 919 MLIVITDGE----SHDHDQLNDTALELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYV 974 Query: 346 QNSRKLHDAFLRIGKEM 362 N KL D F + + M Sbjct: 975 DNFDKLKDVFTLVQERM 991 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 25/190 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + ++ +L I+ D G+V FS Sbjct: 623 EDMKADIMFLVDSSWSIGNE------NFRKMKIFMKNLLTKIQIGADKTQ---IGVVQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K + F L + Q I + I+R+ T + L + H Sbjct: 674 DKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFT-------HSKGARL 726 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 727 GAKKFLILITDGVAQDDVRDPARIL------RGKDVTIFSVGVYNANRSQLEEISGDSSL 780 Query: 342 FYSVQNSRKL 351 + V+N L Sbjct: 781 VFHVENFDHL 790 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ + + R + E+ ++ PD VR G+V +S Sbjct: 442 DIHFLIDGSSSIQEK------QFEQIKRFMLEVTEMFSIGPDK---VRVGVVQYSDDTEV 492 Query: 230 TF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + + N T + L+Y I + + Y Sbjct: 493 EFYVTDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK------VPCY 546 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG ++ + + V+A+G+ A A L+ A + Sbjct: 547 LIVLTDGMSTD------RVVEPAKRLRAEQITVHAVGIGA-ANKTELQEIAGKE---ERV 596 Query: 347 NSRKLHDAFLRIGKEM 362 + + DA I E+ Sbjct: 597 SFGQNFDALKSIKNEV 612 >gi|183583553|ref|NP_694996.5| collagen alpha-5(VI) chain [Homo sapiens] Length = 2526 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 45/197 (22%), Positives = 81/197 (41%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ + M I + ++K + V+ G + +S + Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 229 QTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L++A N +F EH ++ + K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 918 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D+ + E + +G ++A+GV + + + V Sbjct: 919 MLIVITDGE----SHDHDQLNDTALELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYV 974 Query: 346 QNSRKLHDAFLRIGKEM 362 N KL D F + + M Sbjct: 975 DNFDKLKDVFTLVQERM 991 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 25/190 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + ++ +L I+ D G+V FS Sbjct: 623 EDMKADIMFLVDSSWSIGNE------NFRKMKIFMKNLLTKIQIGADKTQ---IGVVQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K + F L + Q I + I+R+ T + L + H Sbjct: 674 DKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFT-------HSKGARL 726 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 727 GAKKFLILITDGVAQDDVRDPARIL------RGKDVTIFSVGVYNANRSQLEEISGDSSL 780 Query: 342 FYSVQNSRKL 351 + V+N L Sbjct: 781 VFHVENFDHL 790 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 44/266 (16%), Positives = 88/266 (33%), Gaps = 33/266 (12%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +T L I+ + + S ++ F + + T + + + Sbjct: 376 ANNTQLEEIVSYPPEQTISTLKSYADLETYSTKFLKKLQNEIWSQISTYAEQRNLDKTGC 435 Query: 169 LD-----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D + ++D S S+ + + R + E+ ++ PD VR G+V + Sbjct: 436 VDTKEADIHFLIDGSSSIQEK------QFEQIKRFMLEVTEMFSIGPDK---VRVGVVQY 486 Query: 224 SSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F + I + N T + L+Y I + + Sbjct: 487 SDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK---- 542 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 Y+I LTDG ++ + + V+A+G+ A + + + Sbjct: 543 --VPCYLIVLTDGMSTD------RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEE 594 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQR 366 R QN DA I E+V++ Sbjct: 595 RVSFGQN----FDALKSIKNEVVREI 616 >gi|189082691|sp|A8TX70|CO6A5_HUMAN RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; AltName: Full=von Willebrand factor A domain-containing protein 4; Flags: Precursor gi|158828630|gb|ABW81241.1| collagen XXIX alpha 1 [Homo sapiens] Length = 2615 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 45/197 (22%), Positives = 81/197 (41%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ + M I + ++K + V+ G + +S + Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 229 QTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L++A N +F EH ++ + K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 918 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D+ + E + +G ++A+GV + + + V Sbjct: 919 MLIVITDGE----SHDHDQLNDTALELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYV 974 Query: 346 QNSRKLHDAFLRIGKEM 362 N KL D F + + M Sbjct: 975 DNFDKLKDVFTLVQERM 991 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 25/190 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + ++ +L I+ D G+V FS Sbjct: 623 EDMKADIMFLVDSSWSIGNE------NFRKMKIFMKNLLTKIQIGADKTQ---IGVVQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K + F L + Q I + I+R+ T + L + H Sbjct: 674 DKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFT-------HSKGARL 726 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 727 GAKKFLILITDGVAQDDVRDPARIL------RGKDVTIFSVGVYNANRSQLEEISGDSSL 780 Query: 342 FYSVQNSRKL 351 + V+N L Sbjct: 781 VFHVENFDHL 790 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 44/266 (16%), Positives = 88/266 (33%), Gaps = 33/266 (12%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +T L I+ + + S ++ F + + T + + + Sbjct: 376 ANNTQLEEIVSYPPEQTISTLKSYADLETYSTKFLKKLQNEIWSQISTYAEQRNLDKTGC 435 Query: 169 LD-----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D + ++D S S+ + + R + E+ ++ PD VR G+V + Sbjct: 436 VDTKEADIHFLIDGSSSIQEK------QFEQIKRFMLEVTEMFSIGPDK---VRVGVVQY 486 Query: 224 SSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F + I + N T + L+Y I + + Sbjct: 487 SDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK---- 542 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 Y+I LTDG ++ + + V+A+G+ A + + + Sbjct: 543 --VPCYLIVLTDGMSTD------RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEE 594 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQR 366 R QN DA I E+V++ Sbjct: 595 RVSFGQN----FDALKSIKNEVVREI 616 >gi|332817898|ref|XP_003310056.1| PREDICTED: collagen alpha-5(VI) chain isoform 1 [Pan troglodytes] Length = 2615 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 45/197 (22%), Positives = 81/197 (41%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ + M I + ++K + V+ G + +S + Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 229 QTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L++A N +F EH ++ + K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 918 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D+ + E + +G ++A+GV + + + V Sbjct: 919 MLIVITDGE----SHDHDQLNDTALELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYV 974 Query: 346 QNSRKLHDAFLRIGKEM 362 N KL D F + + M Sbjct: 975 DNFDKLKDVFTLVQERM 991 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 25/190 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + ++ +L I+ D G+V FS Sbjct: 623 EDMKADIMFLVDSSWSIGNE------NFRKMKIFMKNLLTKIQIGADKTQ---IGVVQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K + F L + Q I + I+R+ T + L + H Sbjct: 674 DKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFT-------HSKGARL 726 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 727 GAKKFLILITDGVAQDDVRDPARIL------RGKDVTIFSVGVYNANRSQLEEISGDSSL 780 Query: 342 FYSVQNSRKL 351 + V+N L Sbjct: 781 VFHVENFDHL 790 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ + + R + E+ ++ PD VR G+V +S Sbjct: 442 DIHFLIDGSSSIQEK------QFEQIKRFMLEVTEMFSIGPDK---VRVGVVQYSDDTEV 492 Query: 230 TF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + + N T + L+Y I + + Y Sbjct: 493 EFYVTDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK------VPCY 546 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG ++ + + V+A+G+ A A L+ A + Sbjct: 547 LIVLTDGMSTD------RVVEPAKRLRAEQITVHAVGIGA-ANKTELQEIAGKE---ERV 596 Query: 347 NSRKLHDAFLRIGKEM 362 + + DA I E+ Sbjct: 597 SFGQNFDALKSIKNEV 612 >gi|297816770|ref|XP_002876268.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297322106|gb|EFH52527.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] Length = 672 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 45/211 (21%), Positives = 82/211 (38%), Gaps = 32/211 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +IS +D++ VLDVS SM KL + R++ ++ + S R Sbjct: 233 QISQYRRAPVDLVTVLDVSGSMGG------TKLALLKRAMGFVIQNLGSSD------RLS 280 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ FSS + FPL G Q + +N L+ T GL + D +E+ Sbjct: 281 VIAFSSTARRLFPLTRMSDAGRQQALQAVNSLVANGGTNIFDGLRKGAKVMEDRRERNSV 340 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + II L+DG ++ S +A V++ G ++ + + Sbjct: 341 AS---------IILLSDGRDTYTTNHPDPSY----KAMLPQIPVHSFGFGSDHDASVMHS 387 Query: 336 CA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + S F +++ + DA + IG + Sbjct: 388 VSEFSGGTFSFIESESVIQDALAQCIGGLLS 418 >gi|282863310|ref|ZP_06272369.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282561645|gb|EFB67188.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 624 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 79/227 (34%), Gaps = 46/227 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 ++ P+ +V D G ++MVLD S SM + G G ++ A R++ ++ Sbjct: 6 LVLSAGALPVAAVPAVT----DDAGGSLVMVLDSSGSMGEDDGTGSTRMESARRAVGAVV 61 Query: 204 DII-KSIPDVNNVVRSGLVTF-SSKIVQTF---------PLAWGVQHIQEKINRLIFGST 252 D + P +GL + + + PL ++ + + Sbjct: 62 DALPDGYP-------TGLRVYGADRPQGCADTRLVRPVRPL--DRAAVKSAVAGVRPTGD 112 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T L A + ++ + I+ ++DGE++ + C A Sbjct: 113 TPIGLSLRKAAEDLPAPRDGAAR--------TRTIVLVSDGEDTCG------TPPPCEVA 158 Query: 313 -----KRRGAIVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKL 351 + G + +G Q + A + C A R+Y ++ L Sbjct: 159 ARLAGQGAGLRIDTVGFQVKGAAREQLECVAEAGNGRYYDAPDADAL 205 >gi|32394600|gb|AAM93998.1| proximal thread matrix protein 1 [Griffithsia japonica] Length = 218 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 37/175 (21%), Positives = 60/175 (34%), Gaps = 29/175 (16%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + LD S S+ + P + + V D K D V F+S + Sbjct: 32 ICFSLDQSGSIVSPGLYPNIREFTV---------DAAKEFDDRTKDSYFSAVGFASGVKL 82 Query: 230 TFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + +N + T GL Y ++ K + +I Sbjct: 83 IQAPTQSLSTFNTAVNTVSPLNGGTNIFRGLRGCYQQL-----------KTKPMTDRVLI 131 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 +TDG P + YCN K +G ++ +G+ FLKNCA+ + FY Sbjct: 132 LVTDGFGGQP-------INYCNFIKSKGILLVTVGIGTSINQNFLKNCATSEEFY 179 >gi|294054316|ref|YP_003547974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293613649|gb|ADE53804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 678 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 68/181 (37%), Gaps = 29/181 (16%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 ++ + + GLD++ VLD S SM +L A +I ++++ ++S R Sbjct: 80 IEWTERKTRGLDIVFVLDSSKSMLAS-DLRPTRLERAKLAILDLVEQLES-------DRI 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 GLV F+ P +E ++ ++ + L A Sbjct: 132 GLVAFAGSAFLQTPPTLDYGAFRESLDATAPDMMSRGGSDLGVALREATKAF-------- 183 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ K ++ LTDGE+ ++ +A + G V+AIG+ D + Sbjct: 184 ----PVENNYKAVVLLTDGEDLGG-----HAIDEAKKASKEGVKVFAIGLGTPEGDYLRQ 234 Query: 335 N 335 Sbjct: 235 T 235 >gi|224072363|ref|XP_002303700.1| predicted protein [Populus trichocarpa] gi|222841132|gb|EEE78679.1| predicted protein [Populus trichocarpa] Length = 587 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 50/276 (18%), Positives = 97/276 (35%), Gaps = 31/276 (11%) Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL 154 ++ F+ D + S + + + + Y +F Sbjct: 85 QIEAEHFSDD--EVLSDVSPDQSLSSRPHAITVKTFTEYPAVSASESFSNFGVLVRILAP 142 Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + ++ +D++ VLDVS SM KL + R++ ++ + Sbjct: 143 PLDNTLPHHRARAPIDVVTVLDVSGSMA-------SKLILLKRAVNFIIQNLGPSD---- 191 Query: 215 VVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 R +VTFSS + PL G + ++ + T GL+ + + Sbjct: 192 --RLSIVTFSSSARRMLPLRRMSGSGREDATSVVDSISAIGGTNIVAGLKKGVQVL---E 246 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E+ +H + II L+DG ++ + + E+K+ VY G ++ Sbjct: 247 ERRQHNSVAT------IILLSDGCDTQSHNAQNRLDYLKEESKQPTFPVYTFGFGSDHDS 300 Query: 331 QFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + AS F +++ L DAF R IG + Sbjct: 301 AAMHAISDASRGTFSFIESINILQDAFARCIGGLIS 336 >gi|108760959|ref|YP_630756.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464839|gb|ABF90024.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 476 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 83/217 (38%), Gaps = 26/217 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S ++ +++ +V+D S SM+ + KL A ++ R ++ ++ N+ Sbjct: 83 SGAQVPGAQRSPVNLALVIDRSGSMSGY------KLAQAKQAARHLIGLL------NDQD 130 Query: 217 RSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R ++ + S + L A + + + ++ + T GL ++ A+ Sbjct: 131 RLAIIHYGSDVKSLPSLEATAANRERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQRTY 190 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 +I ++DG+ + ++E E + G + AIGV + + + Sbjct: 191 GVNR---------LILMSDGQPTEGLTADEELTRMARELRATGLTLSAIGVGTDFNEDLM 241 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + A + ++++ +L F + ++ Sbjct: 242 QAFAEYGAGAYGFLEDAAQLSTLFQKDLQQAGTTVAR 278 >gi|238011090|gb|ACR36580.1| unknown [Zea mays] Length = 516 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 71/202 (35%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ GLD++ VLDVS SM G ++K+ A + + + L I R +V Sbjct: 55 STSDRSGLDLVAVLDVSGSMQ---GEKIEKMKTAMKFVVKKLSSID---------RLSIV 102 Query: 222 TFSSKIVQTFPLAWGVQ----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TF + PL + + + I+ L G T + GL+ + D K + Sbjct: 103 TFLDTANRICPLRQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVV 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 ++ ++DG+ + K VY G A+ L A Sbjct: 163 G--------VMLMSDGQQNRG--------EPAANVKIGNVPVYTFGFGADYDPTVLNAVA 206 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 F V + L AF Sbjct: 207 RNSMGGTFSVVNDVNLLSMAFS 228 >gi|119599630|gb|EAW79224.1| hypothetical protein FLJ35880 [Homo sapiens] Length = 2531 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 45/197 (22%), Positives = 81/197 (41%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ + M I + ++K + V+ G + +S + Sbjct: 758 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 808 Query: 229 QTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L++A N +F EH ++ + K+ Sbjct: 809 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 863 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D+ + E + +G ++A+GV + + + V Sbjct: 864 MLIVITDGE----SHDHDQLNDTALELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYV 919 Query: 346 QNSRKLHDAFLRIGKEM 362 N KL D F + + M Sbjct: 920 DNFDKLKDVFTLVQERM 936 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 25/190 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + ++ +L I+ D G+V FS Sbjct: 568 EDMKADIMFLVDSSWSIGNE------NFRKMKIFMKNLLTKIQIGADKTQ---IGVVQFS 618 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K + F L + Q I + I+R+ T + L + H Sbjct: 619 DKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFT-------HSKGARL 671 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 672 GAKKFLILITDGVAQDDVRDPARIL------RGKDVTIFSVGVYNANRSQLEEISGDSSL 725 Query: 342 FYSVQNSRKL 351 + V+N L Sbjct: 726 VFHVENFDHL 735 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 44/266 (16%), Positives = 88/266 (33%), Gaps = 33/266 (12%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +T L I+ + + S ++ F + + T + + + Sbjct: 321 ANNTQLEEIVSYPPEQTISTLKSYADLETYSTKFLKKLQNEIWSQISTYAEQRNLDKTGC 380 Query: 169 LD-----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D + ++D S S+ + + R + E+ ++ PD VR G+V + Sbjct: 381 VDTKEADIHFLIDGSSSIQEK------QFEQIKRFMLEVTEMFSIGPDK---VRVGVVQY 431 Query: 224 SSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F + I + N T + L+Y I + + Sbjct: 432 SDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK---- 487 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 Y+I LTDG ++ + + V+A+G+ A + + + Sbjct: 488 --VPCYLIVLTDGMSTD------RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEE 539 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQR 366 R QN DA I E+V++ Sbjct: 540 RVSFGQN----FDALKSIKNEVVREI 561 >gi|284990593|ref|YP_003409147.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] gi|284063838|gb|ADB74776.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] Length = 318 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 72/211 (34%), Gaps = 33/211 (15%) Query: 171 MMMVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 ++M +DVSLSM D + + VA + + + +PD N GLV+F+ Sbjct: 88 VVMAVDVSLSMQATDIEPSRFEAMQVAAK------EFVDVLPDRIN---LGLVSFAGTAT 138 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + I+ L +T + + I + + L A G + I+ Sbjct: 139 TVVTPTTDRGQVSTAIDNLELAESTAIGEAVFTSLTAIENFQSSL--DADGEEVPPARIV 196 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLK 334 L+DG N+ D + A G V I + L+ Sbjct: 197 LLSDGYNTVGRPDT----QAVSAALDAGIPVSTIAFGTDYGTLDLDGERVPVPVDRATLE 252 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 A + + ++ +L + +G ++ Sbjct: 253 EIADQTGGSYSEAASAAELEQVYQDLGSQIG 283 >gi|223974345|gb|ACN31360.1| unknown [Zea mays] Length = 516 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 71/202 (35%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ GLD++ VLDVS SM G ++K+ A + + + L I R +V Sbjct: 55 STSDRSGLDLVAVLDVSGSMQ---GEKIEKMKTAMKFVVKKLSSID---------RLSIV 102 Query: 222 TFSSKIVQTFPLAWGVQ----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TF + PL + + + I+ L G T + GL+ + D K + Sbjct: 103 TFLDTANRICPLRQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVV 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 ++ ++DG+ + K VY G A+ L A Sbjct: 163 G--------VMLMSDGQQNRG--------EPAANVKIGNVPVYTFGFGADYDPTVLNAVA 206 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 F V + L AF Sbjct: 207 RNSMGGTFSVVNDVNLLSMAFS 228 >gi|115450663|ref|NP_001048932.1| Os03g0142500 [Oryza sativa Japonica Group] gi|108706121|gb|ABF93916.1| zinc finger family protein, putative, expressed [Oryza sativa Japonica Group] gi|113547403|dbj|BAF10846.1| Os03g0142500 [Oryza sativa Japonica Group] gi|125584872|gb|EAZ25536.1| hypothetical protein OsJ_09360 [Oryza sativa Japonica Group] gi|215712380|dbj|BAG94507.1| unnamed protein product [Oryza sativa Japonica Group] Length = 694 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 44/208 (21%), Positives = 72/208 (34%), Gaps = 34/208 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S S LD++ VLDVS SM+ G + L A + + L R +V Sbjct: 239 SVSSRAPLDLVTVLDVSGSMS---GIKLSLLKRAMSFVIQTLGPND---------RLSVV 286 Query: 222 TFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 FSS + FPL G Q + I+ L+ T L+ + D + K Sbjct: 287 AFSSTAQRLFPLRRMTLTGRQQALQAISSLVASGGTNIADALKKGAKVVKDRRRK----- 341 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFY-------CNEAKRRGAIVYAIGVQAEAAD 330 + II L+DG+++ + + + Y ++ G + Sbjct: 342 ----NPVSSIILLSDGQDTHSFLSGEADINYSILVPPSILPGTSHHVQIHTFGFGTDHDS 397 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFL 356 + A S F + + DAF Sbjct: 398 AAMHAIAETSNGTFSFIDAEGSIQDAFA 425 >gi|118349484|ref|XP_001008023.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289790|gb|EAR87778.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 646 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 49/297 (16%), Positives = 106/297 (35%), Gaps = 35/297 (11%) Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL-SAVSRY 133 QK+D S + I + D + L+E D+ + + ++ + + + + Sbjct: 106 QKSDISIEKQEKIQKLDTKTMLQEQEIKPDLQQMVKDAKKPSYDLEKGLTFEIKTLNKHF 165 Query: 134 EMPFIF-CTFPWCANSSHAPLLIT------SSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + C P + VK +S +D++ V+D S SM Sbjct: 166 QFNNEQDCNIPIMVSVKTQDSTNDILEEQKEQVKQVEQSRPSIDLVCVIDNSGSMQGE-- 223 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQE 242 K+ ++ ++LD++ N+ R L+ F+S L +IQ Sbjct: 224 ----KIQNVKTTLLQLLDML------NSNDRLSLILFNSYPTLLCNLRKVDDENTPNIQS 273 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 IN + T G+ A+N + + + I L+DG+++ + Sbjct: 274 IINSITADGGTDINSGMLMAFNIL---------QKRQFFNPVSSIFLLSDGQDNGADEKI 324 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFL-KNCA-SPDRFYSVQNSRKLHDAFLR 357 K+ + K +++ G ++ + + C FY V+ ++ + F+ Sbjct: 325 KKYINSNQSLKNECFSIHSFGFGSDHDGPLMNRICQLKDGNFYYVEKINQVDEFFVD 381 >gi|326918160|ref|XP_003205359.1| PREDICTED: hypothetical protein LOC100539194 [Meleagris gallopavo] Length = 1584 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 44/198 (22%), Positives = 77/198 (38%), Gaps = 29/198 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D++ +LD S S+ G + + + +++ + PD R G+V +S Sbjct: 33 KNVHYDLVFILDASSSV------GKEDFEKVRQWVSNLVETFEIGPDK---TRVGVVRYS 83 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + F L G +E+I +G T + L Y N +KE + Sbjct: 84 DRPTTEFDL--GKYKTREEIKEAARKIRYYGGNTNTGDALRY-INTYSFSKEAGGRL--S 138 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 KK I LTDG + +D N A++ G ++A+GV EA + L AS Sbjct: 139 DRTVKKVAILLTDGRSQDYVLDP------ANAARQAGIRIFAVGVG-EALKEELDEIASE 191 Query: 339 --PDRFYSVQNSRKLHDA 354 + V + + Sbjct: 192 PKSAHVFHVSDYNAIDKI 209 >gi|111020122|ref|YP_703094.1| hypothetical protein RHA1_ro03133 [Rhodococcus jostii RHA1] gi|110819652|gb|ABG94936.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 33/230 (14%), Positives = 72/230 (31%), Gaps = 28/230 (12%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + + + + +++ +DVSLSM P +L A + + Sbjct: 72 TVAMAGPLAQTRVPRNRATVILAIDVSLSMRATDVPP-SRLAAAQAGAKTF------ADN 124 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + + GL F+ ++ L T + + A I Sbjct: 125 LTPGINLGLEAFAGTASMLVSPITDHTATDNALDHLQLAERTATGEAIFTALQAIDTLAG 184 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQ---- 325 L G I+ +DG+ + P D + + AK +G + I Sbjct: 185 VLGG---GSTPPPARIVLESDGKQTVPADLNDPRGAFTAARLAKEQGVPISTISFGTTHG 241 Query: 326 ----------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 D+ L+ A S F++ ++ +L ++ + +++ Sbjct: 242 AIDLNGSHIPVPVDDESLRRIAELSGGSFFTATSADELQASYQNLQQQIG 291 >gi|330466229|ref|YP_004403972.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328809200|gb|AEB43372.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 319 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 49/246 (19%), Positives = 84/246 (34%), Gaps = 34/246 (13%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 P T+ V++ + +M+ +DVS SM D+L A Sbjct: 60 LFLAMLALLVVGFARP---TAEVRVPRERAT---VMVAVDVSTSMLATDVDP-DRLSAAK 112 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + R D + +P N GLV F+ P + + + + I RL G T Sbjct: 113 SAAR---DFVDGLPREFN---VGLVAFAGSAAVLVPPSTDREALHDGIRRLAEGITGVQG 166 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A + A + L+ A + II L+DG N+S + + EA Sbjct: 167 TAIGEAISTSLGAVKALDEQATTQPPPAR-IIVLSDGANTSG----MDPMEAAAEAVAFE 221 Query: 317 AIVYAIGVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 V+ I + L+ A + F+ S +LH + IG Sbjct: 222 VPVHTISFGTPGGFVDRGGRPIQVPVDGETLQAVAEQTGGAFHQADTSDELHAVYDDIGS 281 Query: 361 EMVKQR 366 + ++ Sbjct: 282 SVGWRK 287 >gi|254458660|ref|ZP_05072084.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084426|gb|EDZ61714.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 308 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 41/228 (17%), Positives = 86/228 (37%), Gaps = 21/228 (9%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISS---KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + W + S + + S VK + G ++ ++LD S SM G D+ Sbjct: 51 LLLFLKWLSISMLI-IALMSPVKDEPYEIEPKKGYEIALILDASESMKAK---GFDEKNR 106 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 + +I+ + G+V F + PL + ++ ++ Sbjct: 107 DLTRFDVVKEIVSNFISSRKNDNMGIVVFGAYSFIASPLTYDSNILKGVVS--------N 158 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 G+ + +F++ + ++ K K I LTDG N+ + ++ + A + Sbjct: 159 LYIGMAGKFTALFESLAQGVNLLKTSKSKTKIAILLTDGYNTPDSEFPFDA--AIDFANK 216 Query: 315 RGAIVYAIGVQA--EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 +G VY IG+ E + L+ A + + N+ +L + +I Sbjct: 217 QGVKVYPIGIGKSDEYNQKMLEKIAEQTGGVAFGASNASELAIVYAKI 264 >gi|172039857|ref|YP_001799571.1| hypothetical protein cur_0177 [Corynebacterium urealyticum DSM 7109] gi|171851161|emb|CAQ04137.1| hypothetical protein cu0177 [Corynebacterium urealyticum DSM 7109] Length = 675 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 33/241 (13%), Positives = 78/241 (32%), Gaps = 37/241 (15%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 + + + + DI M ++LD S SM G +L A + + Sbjct: 46 AIALLLSLAGIVVPFGQPQAAAEAKDIPPTM-LILDASGSMMARDAGGQTRLDAAKEASK 104 Query: 201 EMLDIIKSIPDVNNVVRSGLV--TFSSKIVQTFPLAW-------GVQHIQEKINRLIFGS 251 + ++ +V V + + + I +++++ Sbjct: 105 NFSRSVSEESELGFMVYGTKVGNSPEEREAGCKDVTTLLPVGKGNAGKISGEVDKVNASG 164 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T P L+ A ++ + E+ I+ ++DGE++ C+ Sbjct: 165 HTPMGPALKQAAKELPNEGERS-------------IVLVSDGEDTCA------PPPVCDV 205 Query: 312 AK---RRGA--IVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 AK ++G + +G + A + C A + Q++ L ++ + Sbjct: 206 AKDLHKQGIDLTINTVGFLVDPAARKELQCIAEAGGGEYLDAQDAESLAESMKVLATRTA 265 Query: 364 K 364 + Sbjct: 266 Q 266 >gi|145299122|ref|YP_001141963.1| von Willebrand factor type A domain-containing protein [Aeromonas salmonicida subsp. salmonicida A449] gi|142851894|gb|ABO90215.1| von Willebrand factor type A domain protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 331 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 73/253 (28%), Gaps = 55/253 (21%) Query: 131 SRYEMPFIFCTFPWCANSSH--APLLITSSVKISSKSDIGLDMMMVLDVSLSMND----H 184 P T WCA P + D+++ +D+S SM Sbjct: 45 PHAGQPLWRMTLCWCALVLALCRPQWQEPPLVTY---QSSRDLILAVDLSDSMRTQDMLD 101 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 G D+L + I +++ R LV F+ PL + + Sbjct: 102 EGEQRDRLSAVRQQISRLIET-------RPGDRIALVVFADHAYLLSPLTQETKALLTLT 154 Query: 245 NRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 L F G TT + A + ++ +TDG N++ + D Sbjct: 155 RELDFDLVGRTTALGEAILLARQHADPKRPTA-------------LLLVTDGRNTAGSAD 201 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEA------------------ADQFLKNCA--SPDR 341 L A G +Y +GV A+ + L+ A R Sbjct: 202 P---LSEARRAAASGMTLYTLGVGADPDTFVEALQPAQSDPSAELDEALLQQLAKVGQGR 258 Query: 342 FYSVQNSRKLHDA 354 ++ + L Sbjct: 259 YFRARTQGDLDAI 271 >gi|328907235|gb|EGG27001.1| von Willebrand factor type A domain protein [Propionibacterium sp. P08] Length = 318 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 72/213 (33%), Gaps = 30/213 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM +L A + ++ L +P N LV F++ Sbjct: 89 VVVAIDVSRSMVATDVDP-SRLSAAKTAAKDFL---GDLPPRFN---VSLVKFAASSQVV 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + I L +T G+ + N + + +H I+ L Sbjct: 142 VPPTTDRAVVSTAIANLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLL 198 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG + + SL EA R+ VY I L Sbjct: 199 SDGATNVG----RPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAV 254 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S +S ++ +L D + I + + +++ Sbjct: 255 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 287 >gi|313837214|gb|EFS74928.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927768|gb|EFS91599.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314971985|gb|EFT16083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 320 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 72/213 (33%), Gaps = 30/213 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM +L A + ++ L +P N LV F++ Sbjct: 91 VVVAIDVSRSMVATDVDP-SRLSAAKTAAKDFL---GDLPPRFN---VSLVKFAASSQVV 143 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + I L +T G+ + N + + +H I+ L Sbjct: 144 VPPTTDRAVVSTAIANLQVLPSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVLL 200 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG + + SL EA R+ VY I L Sbjct: 201 SDGATNVG----RPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAV 256 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S +S ++ +L D + I + + +++ Sbjct: 257 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 289 >gi|309791336|ref|ZP_07685859.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226646|gb|EFO80351.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 853 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 73/198 (36%), Gaps = 28/198 (14%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + +++++D S SM FG + K +A + + + ++ R GL Sbjct: 389 PPRPQRSDVALLLIMDRSASMLASFG--VSKFDMAKEAAQLATESLQPED------RIGL 440 Query: 221 VTFSSKI--VQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + F ++ V F L G V IQE+I L G T+ LE + + K+ H Sbjct: 441 LAFDTETLWVVPFQLISGGLSVAQIQEQIASLPSGGGTRIERALEVGLPALAEQPTKVRH 500 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + LTDG S DN A+ + + I + ++ LK Sbjct: 501 A-----------VLLTDGR--SFMNDNALYQRLVETARSQQITLSTIAIGLDSDTALLKQ 547 Query: 336 CAS--PDRFYSVQNSRKL 351 A+ R+Y + Sbjct: 548 LAAWGGGRYYYADQPADI 565 >gi|222147837|ref|YP_002548794.1| hypothetical protein Avi_1104 [Agrobacterium vitis S4] gi|221734825|gb|ACM35788.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 483 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 61/171 (35%), Gaps = 31/171 (18%) Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY----KKYI 287 PL +Q I + +T+ G+ + + DY KK I Sbjct: 313 PLTSSFSTLQSAIADMTSEGSTRLDAGMLAGWYTLSPKWRSAWGGGTAPADYSEKVKKVI 372 Query: 288 IFLTDGENS--SPNIDNKES-------------------------LFYCNEAKRRGAIVY 320 +F+TDGE + + D +S L C+ K +Y Sbjct: 373 VFMTDGEMNVKFGSTDPAKSSTEKLDWICDKNRTKSCNDTATNALLTTCDSIKSNNIEIY 432 Query: 321 AIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 AI +EA Q L+ C+S ++Y ++ + D + I K ++ + + Sbjct: 433 AISYSSEADVQNLQTCSSGTKYYFSASTTNIKDVYTAISKNIIGSTVRLTQ 483 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/221 (14%), Positives = 77/221 (34%), Gaps = 24/221 (10%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F + G+ +I++AILL + + +G ++ S + + D ++L A+ + Sbjct: 7 LSKFIADDNGNFAIMSAILLMPLLLAVGAAVDYSSARDHRNDIQVTADSAILAAASSYSS 66 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 ++ + + + + D +R + ++ D + Sbjct: 67 SSGVDSLAAGIDSYLDSKLTDQGSNDV----------DTAAVPKRLSGPTLSADGKE--- 113 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + V +P F A + S + D+ + +VLDVS SM + Sbjct: 114 -ICIVVGEGVPTSFMQL---AGVKTVDVSAKSCAALPGNIDLEVS--LVLDVSSSMIEE- 166 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + +++ L + V + + FSS+ Sbjct: 167 ----GRFVPMQTAVKSFLTSFANDATVAKRSKIAIAPFSSR 203 >gi|254784286|ref|YP_003071714.1| matrixin family protein [Teredinibacter turnerae T7901] gi|237683907|gb|ACR11171.1| matrixin family protein [Teredinibacter turnerae T7901] Length = 877 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 79/215 (36%), Gaps = 30/215 (13%) Query: 170 DMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D+++V+D S SMN P + K+ + +D + D++ R+GLV F ++ Sbjct: 421 DVVLVMDRSGSMNLSSAPDPSVSKMDALKYAANVFMDFL----DLDAGHRAGLVQF-HEV 475 Query: 228 VQTFPLAWGVQHI--------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 V F A+ +Q + Q IN + G T G+ ++ A + + Sbjct: 476 VVPFSPAFNLQPVNAASLSAAQTAINSMTAGGMTNIIDGVNEGIAQLTTAVDPSDR---- 531 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + ++ LTDG ++ P + + +Y++G + L A Sbjct: 532 -----QIMLLLTDGLHNRPVGTSVTDITA--PLLASEVTLYSVGFGTSTNEAELTPLALS 584 Query: 338 SPDRFYSVQNSRKLH--DAFLRIGKEMVKQRILYN 370 + ++ L FL I L + Sbjct: 585 TGGVHLENKDVSDLQLRKHFLSIAASAADSTTLID 619 >gi|296228120|ref|XP_002759672.1| PREDICTED: collagen alpha-5(VI) chain [Callithrix jacchus] Length = 2614 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 44/197 (22%), Positives = 80/197 (40%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ + M I + ++K + V+ G + +S Sbjct: 813 LDVVFVLDHSGSIAEQSQDHM---------INLTMHLVKKADVGRDRVQFGALKYSDNPE 863 Query: 229 QTFPLAW--GVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L++A N +F EH ++ + + K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHA-NALF----TEEHGSRINQNVKQ 918 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D E + + +G ++A+GV + + + V Sbjct: 919 MLIVITDGE----SDDRVELNDTAAKLRDKGITIFAVGVGKADQKELEGMAGNKNNTIYV 974 Query: 346 QNSRKLHDAFLRIGKEM 362 N KL D + + + M Sbjct: 975 DNFDKLKDIYAPVQESM 991 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 67/192 (34%), Gaps = 25/192 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + G ++ +L I+ D G+V FS Sbjct: 623 EDMKADIMFLVDSSGSIGNE------NFGKMKIFMKNLLTKIQIGADKTQ---IGVVQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + Q I + I+R+ T L + +K Sbjct: 674 DDPEEEFQLNTYFTQQEISDAIDRMSLIDKGTLMGKALNFVDQYFTHSKGARFGA----- 728 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 729 --KKFLILITDGVAQDDVRDPARIL------RGKDVTIFSVGVYGADRSQLEEISGDGSL 780 Query: 342 FYSVQNSRKLHD 353 + V+N L Sbjct: 781 VFYVENFDHLQA 792 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 75/198 (37%), Gaps = 28/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ + + K ML++ + + VR G+V FS K+ Sbjct: 442 DIHFLIDGSTSIREEQFEQIKKF---------MLEVTEMFSIGPDKVRVGVVQFSDKMRV 492 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + I + +N T + L++ +I + ++ Y Sbjct: 493 EFSITDYSNDIDLRKAILNIQQLTGDTHTGEALDFILPRIKNGIKERM------SQVPCY 546 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG++ ++ + V+AIG+ EA + L+ A + Sbjct: 547 LIVLTDGKSQYSVVEP------AKRVRAEQITVHAIGIG-EANKKELQEIAGKE---ERV 596 Query: 347 NSRKLHDAFLRIGKEMVK 364 + + DA I E+V Sbjct: 597 SFGQNFDALKSIKNEVVH 614 >gi|226496057|ref|NP_001151334.1| LOC100284967 [Zea mays] gi|195645892|gb|ACG42414.1| protein binding protein [Zea mays] Length = 516 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 71/202 (35%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ GLD++ VLDVS SM G ++K+ A + + + L I R +V Sbjct: 55 STSDRSGLDLVAVLDVSGSMQ---GEKIEKMKTAMKFVVKKLSSID---------RLSIV 102 Query: 222 TFSSKIVQTFPLAWGVQ----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TF + PL + + + I+ L G T + GL+ + D K + Sbjct: 103 TFLDTANRICPLQQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVV 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 ++ ++DG+ + K VY G A+ L A Sbjct: 163 G--------VMLMSDGQQNRG--------EPAANVKIGNVPVYTFGFGADYDPTVLNAVA 206 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 F V + L AF Sbjct: 207 RNSMGGTFSVVNDVNLLSMAFS 228 >gi|323493494|ref|ZP_08098616.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] gi|323312317|gb|EGA65459.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] Length = 393 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 56/382 (14%), Positives = 130/382 (34%), Gaps = 30/382 (7%) Query: 4 LNIRNFFYNCKGSISILT-AILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 ++ +GS++I A+L+P+I ++ + +D + L A + Sbjct: 1 MHSSRLLSKQRGSVAISYLAMLIPMIIAAASTIVIGYQVLLSNRAMQ-AVDTASL--ACE 57 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN----ELRENGFAQDINNIERSTSLSIII 118 + + + + + + +I K + EL + + + S + Sbjct: 58 FRGEYDRSIAQGYLDYYKPKIDKVTATLGASSGCKVELGYSYSSIFTSLTFSDASYVAGV 117 Query: 119 DDQHKDYNLSAVSRYEMPFIF---------CTFPWCANSSHAPLLITSSVKISSKSDIGL 169 K Y + + + + L S + + + Sbjct: 118 TASQKVYVTEVTDSDPIELVLVLDISGSMMGALDELKSILNRGLTTLRSQQANVAGQDHI 177 Query: 170 DMMMVLDVSL-SMND-----HFGPGMDKLGVATRSIREMLDIIKSIP---DVNNVVRSGL 220 + +V + S+ D G V + + + ++ D V S Sbjct: 178 KVSIVPFSNGVSVTDAPWLKSGGTLCVDATVNSGGSFSPANTVANLDVTHDQAPVTTSSS 237 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI--AK 278 + S PL + + + +NRL +T S GL + ++ + + + Sbjct: 238 SSDCSLTSVILPLTSNLNDVVDAVNRLQTIGSTASYQGLLWGLRQLTPNWQSAWRVGPNR 297 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE--AADQFLKNC 336 D+ ++ ++ +TDG + + ++D + C AK G + IG + +QF + Sbjct: 298 NQDNVQRKLVLMTDGMDDNSHLDELINAGLCTRAKDLGIELNFIGFGVQSWRLEQFTRCA 357 Query: 337 ASPDRFYSVQNSRKLHDAFLRI 358 S +S N++ L D F ++ Sbjct: 358 GSAGAVFSANNTQDLDDYFSQL 379 >gi|285808587|gb|ADC36107.1| putative chloride channel [uncultured bacterium 126] Length = 869 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 80/212 (37%), Gaps = 33/212 (15%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 V + + +++VLD S SMN + ++ + + + Sbjct: 422 PVTFDRDDEPTVALVIVLDRSWSMNG------TAMELSKSAAEGAANALAPSQM------ 469 Query: 218 SGLVTFSSKIVQTFPLAWGVQ---HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G++TF+ PL + + + I R+ T P L AY+ + + + + + Sbjct: 470 LGVLTFNDASNWDIPLGRVRESRPELHDAIGRIKASGPTAIFPALRNAYDALANVRVRAK 529 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H +I L+DG++ D ++ + V + + +A L+ Sbjct: 530 H-----------VILLSDGQS-----DPEDFEGLVRKMSAAHITVSTVALGPDADAALLR 573 Query: 335 NCAS--PDRFYSVQNSRKLHDAFLRIGKEMVK 364 N AS R Y VQ+++++ + F+ + Sbjct: 574 NLASWGGGRSYVVQDAQQIPEIFVTEARNAAT 605 >gi|304406204|ref|ZP_07387861.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304344788|gb|EFM10625.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 762 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 75/208 (36%), Gaps = 21/208 (10%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ + +V+DVS SM++ +KL + + L + GL++ Sbjct: 347 TEKPQAISTNLVIDVSDSMSED-----NKLTKVKDAATQFLSHASFASN----DVVGLMS 397 Query: 223 FSSKIV-QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FS + ++ I+ I + T L A + A Sbjct: 398 FSDASNIRQSDFTTEIESIKSSIAGMQTSGCTALYEALNQAV----------SNTAYNSV 447 Query: 282 DYKKYIIFLTDGENS-SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + KY++ TDG+N+ + N A + G +YAIGV+ +A Q + + Sbjct: 448 EGSKYVVVFTDGKNTICDGTNWVSPSTVINNALQWGVPIYAIGVEEDADLQQIAEQTNGQ 507 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + L+ + I KQ ++ Sbjct: 508 YHVLGNDFTDLNAIYSDIYTNKKKQYVI 535 >gi|118349482|ref|XP_001008022.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289789|gb|EAR87777.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 632 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 52/323 (16%), Positives = 118/323 (36%), Gaps = 35/323 (10%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNI 108 + + + AT++ + + QK+D S + + I + D + LRE D+ + Sbjct: 81 QSLAQNPIKAKATQLKSAVSKQLTSFQKSDISIQKEEKIQKLDTKTMLREQEIKPDLQQM 140 Query: 109 ERSTSLSIIIDDQHKDYNL-SAVSRYEMPFIF-CTFPWCANSSHAPLLIT------SSVK 160 + ++ + + + ++ C P + VK Sbjct: 141 IKDAKKPSYDLEKGLTFEIKTLNKHFQFNNEQDCNIPIMVSVKTQDSTNDILEEQKEQVK 200 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + +S +D++ V+D S SM K+ ++ ++LD++ N+ R L Sbjct: 201 QAEQSRPSIDLVCVIDNSGSMQGE------KIQNVKTTLLQLLDML------NSNDRLSL 248 Query: 221 VTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + F+S L +IQ+ IN + T G+ A+N + Sbjct: 249 ILFNSYPTLLCNLRKVDDKNTPNIQKIINSITAEEYTDINSGMLMAFNIL---------Q 299 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-KN 335 + + I L+DG+++ + K+ + K +++ G ++ + + Sbjct: 300 KRQFFNPVSSIFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFGSDHDGPLMNRI 359 Query: 336 CA-SPDRFYSVQNSRKLHDAFLR 357 C FY V+ ++ + F+ Sbjct: 360 CQLKDGNFYYVEKINQVDEFFVD 382 >gi|312886236|ref|ZP_07745850.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301261|gb|EFQ78316.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 335 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 44/210 (20%), Positives = 74/210 (35%), Gaps = 51/210 (24%) Query: 179 LSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 SM + F P ++L +D IK+ PD R GLV FS + PL Sbjct: 104 GSMLAEDFKP--NRLEAGKNIA---IDFIKNRPD----DRIGLVIFSGESFTQCPLTIDH 154 Query: 238 QHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + I + T GL A N++ +G + K +I LTDG Sbjct: 155 DVLINLYHDIKNGMIEDGTAIGMGLATAVNRL-----------RGSEAKSKVVILLTDGV 203 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---------------------- 332 N++ +I + AK+ G VY +G+ + + Sbjct: 204 NNAGSIPP---ITAAEIAKQFGIRVYTVGIGTQGYAPYPVPSPYGGVVYQRMEVQIDEPT 260 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 L A + +++ N+ L + +I + Sbjct: 261 LTKIAAITGGKYFRATNNDALTRIYKQIDQ 290 >gi|254776723|ref|ZP_05218239.1| hypothetical protein MaviaA2_18931 [Mycobacterium avium subsp. avium ATCC 25291] Length = 309 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 78/216 (36%), Gaps = 31/216 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+D+S SM ++L A ++ + + + GLV F+ Sbjct: 74 IMLVIDMSQSMRATDVEP-NRLKAAEQAASQF------ASQLTPGINLGLVGFAGTPYLL 126 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P Q + + +L F +T + + A + I +A G I+ L Sbjct: 127 VPPTPQHQATIDALKKLDFADSTATGEAIFTALHAISATA-----VAGGDTPPPARIVLL 181 Query: 291 TDGENSSPN--IDNKESLFYCNE-AKRRGAIVYAIGVQAEAAD--------------QFL 333 +DG + P+ D + ++ AK G + I + + + Sbjct: 182 SDGGENKPSNPSDPHDGVYTAARLAKDEGVPISTITFGTKGGEIEMDGQKVAVPVSTDQM 241 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 K A S + Y+ N +L ++ I E+ + + Sbjct: 242 KMVAKLSGGQSYTATNLGELQKSYNAIENEIGYRTV 277 >gi|111221591|ref|YP_712385.1| hypothetical protein FRAAL2157 [Frankia alni ACN14a] gi|111149123|emb|CAJ60806.1| conserved hypothetical protein; putative membrane protein [Frankia alni ACN14a] Length = 319 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 73/211 (34%), Gaps = 29/211 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM ++L A + +D + P N GLV+F+ Sbjct: 89 IILAIDVSNSMAATDIAP-NRLEAAKQGAEAFVDQL---PPRIN---LGLVSFAGSATVL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ I L G T G+ + I A ++ G I+ L Sbjct: 142 VPASTDRESVRAGIRGLQLGPATAIGEGIFASLQAINTAGKRFS--DAGQSPPPAAIVLL 199 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DGE + + + + + A++ V I +Q L Sbjct: 200 SDGETTRGRPNTQAT----DAARQAHVPVDTIAYGTSDGTLDVGGQEVPVPVNEQALNEI 255 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A + ++ +L + +G + + Sbjct: 256 ADQTEGSYHRAATGDELRSVYKGLGSSIGYR 286 >gi|77920224|ref|YP_358039.1| von Willebrand factor type A domain-containing protein [Pelobacter carbinolicus DSM 2380] gi|77546307|gb|ABA89869.1| von Willebrand factor type A domain protein [Pelobacter carbinolicus DSM 2380] Length = 442 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 78/221 (35%), Gaps = 25/221 (11%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 A + + + + + +++ +VLD S SM+ + K+ A + Sbjct: 38 LLAGPAQKTVIKIALDAPRAPRTAQRPPVNLALVLDRSGSMSGN------KIAKAREAA- 90 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ ++ + D + LV + + P + I+ +I R+ G +T Sbjct: 91 --IEAVRRLSDGDLF---SLVVYDDSVETLVPAQPVSDIGDIEARIRRIRPGGSTALFGA 145 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + A E +H + + ++ L+DG + + + G Sbjct: 146 VSQG------AAEVRKHSDAPYVNR---VVLLSDGLANVGPSRPADLARLGAALLKEGIS 196 Query: 319 VYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 V +GV + + + A S Y V++SR L F Sbjct: 197 VTTVGVGTDFNEDLMTQLAERSDGNHYFVESSRDLPRIFAA 237 >gi|46199004|ref|YP_004671.1| hypothetical protein TTC0696 [Thermus thermophilus HB27] gi|46196628|gb|AAS81044.1| hypothetical membrane associated protein [Thermus thermophilus HB27] Length = 706 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 53/234 (22%), Positives = 83/234 (35%), Gaps = 35/234 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F P + + + G +++VLDVS SM KL +A Sbjct: 272 LLFTATPKGLFFGGWDRALPEELPLKPLGREGAALVLVLDVSGSMAGE------KLSMAV 325 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGST 252 +++ R G+V FSS FP A + + + L G Sbjct: 326 AGALALVESAAPED------RLGVVVFSSGHRVLFPPRPMTAQAKKEAESLLLSLRAGGG 379 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T A + + K ++ LTDG D KE + + A Sbjct: 380 TVLGGAFREAVRLLHGVPGER-----------KAVLVLTDGL----IADAKEPIL--DLA 422 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + G V A+ + +A FLK A RFY + R+L FLR G+E+ + Sbjct: 423 QTSGVEVSALALGPDADAPFLKELARRGGGRFYQAPSPRELPRLFLREGQEVFR 476 >gi|219850571|ref|YP_002465004.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544830|gb|ACL26568.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 90/221 (40%), Gaps = 36/221 (16%) Query: 148 SSHAPLLITSSVKISSKSDI------GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + + P L+ + +++S++S L++ +V+D S SM +L ++ + Sbjct: 17 ALNEPQLLYALIELSAQSGATKMPRLPLNLCLVIDRSSSMRGE------RLQQVKQAAMQ 70 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSK--IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +LD++ LVTF+ + +V + LA I+ +I+ + T+ GL Sbjct: 71 ILDLLGDNESF------ALVTFNDRAEVVVSSQLARARAEIKRQISAIEAAGGTEMATGL 124 Query: 260 EYAYNKIFDA-KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++ A + H ++ LTDG D + A+ RG Sbjct: 125 ALGVQELQRAMMPRAIHR----------LLLLTDGRTYG---DESRCVEIARRAQARGIG 171 Query: 319 VYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLR 357 + A+G+ +E + L+ A+ + + + ++ + F Sbjct: 172 ITALGIGSEWNEDLLETIAARENSRTHYITSAADITKIFTA 212 >gi|254409659|ref|ZP_05023440.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196183656|gb|EDX78639.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 413 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 68/210 (32%), Gaps = 28/210 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + S + L++ ++LD S SM G ++ + A + E L R Sbjct: 32 ITQSQDQSLPLNLCLILDHSGSM---HGRPLETVKKAAMQLIERL---------KEGDRI 79 Query: 219 GLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ F + P + I+ +I +L T GL+ ++ K Sbjct: 80 CVIAFDHRAKVLVPNQAIDNLNTIKSQIRQLSADGGTAIDEGLKLGIEEVAKGKADAVSQ 139 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + LTDGEN DN+ L + A + +G A L+ Sbjct: 140 ----------VFLLTDGENEHG--DNERCLKLAHFAVEHKLTINTLGFGASWNQDVLEKI 187 Query: 337 ASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 A ++ + F R+ + Sbjct: 188 ADSGSGTLCYIEQPEQAVQEFGRLFNRIQA 217 >gi|111025338|ref|YP_707758.1| hypothetical protein RHA1_ro08556 [Rhodococcus jostii RHA1] gi|110824317|gb|ABG99600.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 32/230 (13%), Positives = 72/230 (31%), Gaps = 28/230 (12%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + + + + +++ +DVSLSM P +L A + + Sbjct: 72 TVAMAGPLAQTRVPRNRATVILAIDVSLSMRATDVPP-SRLAAAQAGAKTF------ADN 124 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + + GL F+ ++ L T + + A I Sbjct: 125 LTPGINLGLEAFAGTASMLVSPITDHTATDNALDHLQLAERTATGEAIFTALQAIDT--- 181 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQ---- 325 + G I+ +DG+ + P D + + AK +G + I Sbjct: 182 LAGVVGGGGTPPPARIVLESDGKQTVPTDLNDPRGAFTAARLAKEQGVPISTISFGTTHG 241 Query: 326 ----------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 D+ L+ A S F++ ++ +L ++ + +++ Sbjct: 242 AIDLNGSHIPVPVDDESLRRIAELSGGSFFTATSADELQASYQNLQQQIG 291 >gi|257387423|ref|YP_003177196.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257169730|gb|ACV47489.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 788 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 76/209 (36%), Gaps = 37/209 (17%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ V++ + + ++++++DVS S + A LD++ + D Sbjct: 358 PIGSMLPVQVGNATGGESNIVVLVDVSSSAESGLS-----IQKAVA-----LDVLDQLGD 407 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 N R G+V F+ + L EKI +L G T GL+ A + D Sbjct: 408 EN---RVGVVAFNHNAYRVSELRTLGQNRAETAEKIRQLESGGATDIAVGLQGADELLGD 464 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + II L+DG++ N+ R G V ++GV Sbjct: 465 REGT--------------IILLSDGQDRLG-----PPAAVANQLGREGTRVVSVGVGKRV 505 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 ++ A S +++ + +L F Sbjct: 506 GVPTMRQIAGESGGSYFAADETERLRLLF 534 >gi|255526268|ref|ZP_05393185.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186262|ref|ZP_06854666.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510048|gb|EET86371.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049063|gb|EFG88493.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 422 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 42/215 (19%), Positives = 86/215 (40%), Gaps = 30/215 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K D++ +D S SM + P ++ A + I +++ R + F Sbjct: 109 KFKKADDIVFAIDTSGSMK-NTDPNNERFSAA----------LNLIDNMDKNNRFSMYKF 157 Query: 224 SSKIVQTFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + P++ + +E+++ T LE AY +I ++ K ++ Sbjct: 158 DDTAEKIIPMSQVTKQSREEVSGKLKDMQNPKGNTNMRDALEKAYEEIKSSETKDKNAM- 216 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 +I L+DG ++ +K+ K + +Y IG+ LK A Sbjct: 217 --------VIMLSDGGDTY--DLSKKFDETLKPFKEKNISIYTIGMSNGNNFSMLKEIAK 266 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 S +Y+V+ + L + F +I ++ +QR+L +K Sbjct: 267 ESGGNYYNVKEIKDLKNVFNKIYRD-RQQRLLVDK 300 >gi|113971716|ref|YP_735509.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113886400|gb|ABI40452.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 335 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 86/245 (35%), Gaps = 37/245 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + ++ + P ++ ++ ++ G D++M++D+S SM++ A Sbjct: 67 MLILSWLLIVTALAKPSILG---EVQTREAFGRDVLMLVDLSGSMDEA------DFTTAD 117 Query: 197 RSIREMLDI----IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 S L+ +K+ + R GL+ F P + E+ + Sbjct: 118 GSTLTRLNAAKNVLKTFIAKRSGDRFGLILFGDAAFIQTPFTADQQVWLSLLEEAQTGMA 177 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + + E ++ +I LTDG ++ ++ ++ Sbjct: 178 GQSTHLGDAIGLGIKVFEQNPQPSE---------QQVMIVLTDGNDTGSFVEPVDA---A 225 Query: 310 NEAKRRGAIVYAIGVQ-------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 A RG +Y I + + ++ + + R + + +L A+ I K Sbjct: 226 KIAAARGIKIYIIAMGDPTHVGEQPMDMEVVQRVSQLTQARAFIAIDQAELDKAYQLIDK 285 Query: 361 EMVKQ 365 +Q Sbjct: 286 LEPQQ 290 >gi|326931809|ref|XP_003212016.1| PREDICTED: matrilin-4-like, partial [Meleagris gallopavo] Length = 465 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 44/197 (22%), Positives = 80/197 (40%), Gaps = 26/197 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ M + M+DII ++ N R G++ +SS+ Sbjct: 31 GPLDIVFVIDSSRSVRPFEFETMRRF---------MIDIIGNLDVGPNATRVGVIQYSSQ 81 Query: 227 IVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + F L + ++ IN ++ T + ++YA N F +E H Sbjct: 82 VQNIFSLKTFFTRAEMERAINSIVPLAQGTMTGLAIQYAMNVAFTVQE---GARPPHKKI 138 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + I +TDG A+ G +YA+G+Q + L+ ASP + Sbjct: 139 PRIAIIVTDGRPQD------RVSEVAAHARNAGIEIYAVGIQRADMNS-LRAMASPPLEE 191 Query: 341 RFYSVQNSRKLHDAFLR 357 + V++ +L F + Sbjct: 192 HVFLVESF-ELIQQFGK 207 Score = 43.3 bits (100), Expect = 0.057, Method: Composition-based stats. Identities = 30/134 (22%), Positives = 56/134 (41%), Gaps = 19/134 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++MV+D S S+ + + + ++D+++ PD R GLV +SS++ Sbjct: 344 VDLVMVIDGSKSVRPQ------NFELVKQFVNRIVDLLEVSPDG---TRVGLVQYSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL H E+I + T + L++ F E H+ Sbjct: 395 TEFPL--NKYHSAEEIKEAVMKMEYMEKGTMTGLALKHMVEHSFS--ELEGARPLSHNVP 450 Query: 284 KKYIIFLTDGENSS 297 + ++F TDG + Sbjct: 451 RIGLVF-TDGRSQD 463 >gi|118100589|ref|XP_425698.2| PREDICTED: similar to matrilin-4 [Gallus gallus] Length = 564 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 44/197 (22%), Positives = 80/197 (40%), Gaps = 26/197 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ M + M+DII ++ N R G++ +SS+ Sbjct: 31 GPLDIVFVIDSSRSVRPFEFETMRRF---------MIDIIGNLDVGPNATRVGVIQYSSQ 81 Query: 227 IVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + F L + ++ IN ++ T + ++YA N F +E H Sbjct: 82 VQNIFSLKTFFTRAEMERAINSIVPLAQGTMTGLAIQYAMNVAFTVQE---GARPPHKKI 138 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + I +TDG A+ G +YA+G+Q + L+ ASP + Sbjct: 139 PRIAIIVTDGRPQD------RVSEVAAHARNAGIEIYAVGIQRADMNS-LRAMASPPLEE 191 Query: 341 RFYSVQNSRKLHDAFLR 357 + V++ +L F + Sbjct: 192 HVFLVESF-ELIQQFGK 207 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 47/209 (22%), Positives = 84/209 (40%), Gaps = 33/209 (15%) Query: 160 KISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K SK G +D++MV+D S S+ + + + ++D+++ PD + Sbjct: 334 KTCSKCGAGHVDLVMVIDGSKSVRPQ------NFELVKQFVNRIVDLLEVSPDGTH---V 384 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKL 273 GLV +SS++ FPL H E+I + T + L++ F E Sbjct: 385 GLVQYSSRVRTEFPL--NKYHSAEEIKAAVMKMEYMEKGTMTGLALKHMVEHSFS--ELE 440 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 H+ + ++F TDG + + + AK G +++A+GV ++ L Sbjct: 441 GARPLSHNVPRIGLVF-TDGRSQD------DISEWAQRAKESGIVMFAVGVGKAVEEE-L 492 Query: 334 KNCAS---PDRF-YSVQ--NSRKLHDAFL 356 + AS F YS L + F Sbjct: 493 RAIASEPVEQHFSYSADFTTMTHLVENFK 521 >gi|300853773|ref|YP_003778757.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] gi|300433888|gb|ADK13655.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] Length = 419 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 82/209 (39%), Gaps = 30/209 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S SM + P ++ S+ ++D ++N R + F + Sbjct: 115 DIVFVIDTSGSMA-NTDPQNERF----SSVLNLMD------NMNTQNRVSIYKFDDTSKR 163 Query: 230 TFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P+ + +++ I T ++ AYN+I K A Sbjct: 164 IIPMTEVSESLKKNAEEELKQYEIPAGNTNMGEAIDSAYNEINSTKRPGRKAA------- 216 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--F 342 +I L+DGE++ NK+ K +Y IG+ E LK A + Sbjct: 217 --VILLSDGEDNFGL--NKKFDETLKPFKDSNISIYTIGMSNENNFTTLKKIAKDTHGEY 272 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 Y+V+N+ L F +I +QR+L ++ Sbjct: 273 YNVKNASDLKGTFSKI-YYATQQRLLVDR 300 >gi|149922245|ref|ZP_01910682.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] gi|149816878|gb|EDM76364.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] Length = 370 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 77/210 (36%), Gaps = 28/210 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S + +++ +D S SM P +++ A R + + + + N Sbjct: 126 SAVRPAPGGRPTQVVIDIDGSSSMRRS-DPKRERVRAAKRFV----ETLSRVDKRNQF-- 178 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G++ F++ + + P+ G++ + I + +T Y + + + LE Sbjct: 179 -GVIEFNTTVEERAPMGSGMKATSDAIQAVDAVGSTAL-------YTSLIRSIDALEGSG 230 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K Y++ I+ LTDG++++ + + N AK +Y + + + L Sbjct: 231 KTG--YRRAILVLTDGKDTASSHGVATVI---NRAKAAKVRIYVVSLGGAGDQKGLGYVG 285 Query: 338 --------SPDRFYSVQNSRKLHDAFLRIG 359 + F V + L F I Sbjct: 286 PMQRLTTETGGVFTHVDRADDLVARFDAIA 315 >gi|12052774|emb|CAB66559.1| hypothetical protein [Homo sapiens] Length = 957 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +FD K K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLFD---------KSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|156396520|ref|XP_001637441.1| predicted protein [Nematostella vectensis] gi|156224553|gb|EDO45378.1| predicted protein [Nematostella vectensis] Length = 177 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 72/192 (37%), Gaps = 26/192 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ VLD S S+ + + + I ++++ P R G+V +S++ Sbjct: 1 DIGFVLDASGSVRAN------RFKMCLNFINKLVNSFHIGPHN---TRIGIVRYSTRPSG 51 Query: 230 TFPLA--WGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + ++NR+ + T++ + YA ++ + +K Sbjct: 52 IFRFTSYRNKHSTKHRVNRIRYTGGWTRTGAAINYARRYLYQHNRRRG--------VRKV 103 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I +TDG++ + S+ KR G V+AIG+ L A+ Sbjct: 104 LIVMTDGKSQDSVVGASRSV------KRMGIEVFAIGIGRGYRRSELNQMATDRNHVLTA 157 Query: 347 NSRKLHDAFLRI 358 R LH +I Sbjct: 158 RFRDLHKIIGKI 169 >gi|327260888|ref|XP_003215265.1| PREDICTED: collagen alpha-1(VI) chain-like [Anolis carolinensis] Length = 1026 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 77/200 (38%), Gaps = 16/200 (8%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSM---NDHFGPGMDKLGVATRSIREMLDIIK 207 S + + D +D+ VLD S S+ FG +D + T + L+ Sbjct: 31 LAQGSDVSSNVVTFQDCPVDLFFVLDTSESVALREKPFGALVDNIKQFTTQFIDKLNERY 90 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA---WGVQHIQEKINRLIF-GSTTKSTPGLEYAY 263 D N + +G + +S ++ L G +++++++++ G T + ++ Sbjct: 91 YRCDRNLMWNAGALHYSDEVQLISGLTSMRTGRSGLKDQVSKVVSIGKGTYTDCAIKRGI 150 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAI 322 ++ H KY+I +TDG + L NEAK +G V+++ Sbjct: 151 EELLIGG--------SHHKENKYMIVVTDGHPLEGYKEPCGGLEDAANEAKHQGIKVFSV 202 Query: 323 GVQAEAADQFLKNCASPDRF 342 + + L A+ + Sbjct: 203 AISPNHLESRLSVIATDQAY 222 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 75/211 (35%), Gaps = 34/211 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFSSKIV 228 D+ +V+D S S+ + ++ + + + + + VR +V +S + Sbjct: 831 DITLVVDSSTSVGS------RNFNTTKKFVKRLAERFLSAAKPTEDAVRVSVVQYSGRTQ 884 Query: 229 Q--TFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 Q P I + ++++ F T L Y + + KK Sbjct: 885 QKLEVPFEQNYTVIADSVDKMQFINDATDVNAALNYVTSLFRRSSRSGA---------KK 935 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK----------N 335 ++ +DG +S I EA++ G +Y + V +A + ++ + Sbjct: 936 RMLIFSDG--NSQGITQSAIERAVQEARQAGIEIYVLVVGTQANEPNVRVLVTGKTAEYD 993 Query: 336 CASPDRF-YSVQNSRKLHDA--FLRIGKEMV 363 A +R + V + L + + +++ Sbjct: 994 VAFGERHLFRVPDYESLLRGVFYQTVSRKIS 1024 >gi|94498567|ref|ZP_01305122.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] gi|94422010|gb|EAT07056.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] Length = 678 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 62/181 (34%), Gaps = 36/181 (19%) Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK--EKLEHIA 277 L +++++ + I+ LI T G+ + + + A Sbjct: 499 LTSYTNRTSTPTGQS---SSFNSYIDNLIAVGGTYHDIGMLWGARFLSPKGIFASDNNSA 555 Query: 278 KGHDDYKKYIIFLTDGENSSPN---------------------------IDNKESLFYCN 310 + ++I+F+TDG+ S+ I N CN Sbjct: 556 PNGFNISRHIVFMTDGDMSAYQQVYGAYGYQQLDARVAPGNTSDTDLTAIHNTRLQMLCN 615 Query: 311 EAKRRGAIVYAIGVQAEAADQF---LKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 K +G ++ IG + ++ L+NCA S + + ++ L F I K + R Sbjct: 616 AIKAKGITIWVIGFRNQSEGNIQTPLQNCATSSNHWTMAYDATSLSQKFKDIAKNIGGLR 675 Query: 367 I 367 + Sbjct: 676 V 676 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 38/293 (12%), Positives = 88/293 (30%), Gaps = 46/293 (15%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 N KG++ + A + + ++G ++ + +A++ D + L + Sbjct: 14 RLARNQKGNVMAMVAAAIIPLAALIGGGLDMGRAYMARARMQQACDAAALAGRRAMTTSS 73 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 K + F + +T +I + + Sbjct: 74 MTQANKDEAKKFFDFNFPQ-------------------GTFQAATFTPVIRSKPGETTTV 114 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM----ND 183 + MP + L ++ + + D+M+VLD + SM +D Sbjct: 115 QVTASTTMPTTVMKI-----FRYETLPLSVTCEARFDIGNT-DVMLVLDTTGSMAYAISD 168 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL---------- 233 G +L +++++ D + + + +R G + +SS + + L Sbjct: 169 GKGGSTTRLAALKQAVKDFYDTLGAGSNATGRIRYGFMPYSSTVNVGYQLPTNYLVGGIS 228 Query: 234 --AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 W Q RL+ T + + Y + G + Sbjct: 229 GETWDYQT-----RRLLTTYGTATNETGSWIYTSGSVSAPTTYSSTSGGSASQ 276 >gi|47219516|emb|CAG09870.1| unnamed protein product [Tetraodon nigroviridis] Length = 1259 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 48/265 (18%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 TS+++ Y ++ Y + + + P A P+ +T+ + Sbjct: 340 THTSINVRDLSPETVYEIAL---YALKGLTPSEPIMATERTQPVKVTTECSLGVDVQA-- 394 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D S S G+ + +++ P V+ LV +S Sbjct: 395 DVVLLVDGSYS------IGLQNFAKVRAFLEVLVNSFDIGPSK---VQISLVQYSRDPHT 445 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + + G +T + ++Y +KIF A + + Sbjct: 446 EFALNTHHDINAVVRAVRTFPYRGGSTNTGKAMKYVKDKIFVASRGAR------QNVPRV 499 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 ++ +TDG++S D + ++A+GV+ +A L+ A+P + + Sbjct: 500 MVLITDGKSSDSFKD------AATNLRNIDVEIFAVGVK-DAVRSELEAIANPPADNHVF 552 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 V++ DAF RI KE+ + L Sbjct: 553 EVED----FDAFQRISKELTQSICL 573 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 89/234 (38%), Gaps = 28/234 (11%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 +P NSS T ++K S + D++ ++D S S+ + Sbjct: 61 SIPIFGQLTIQSGNSSDRVRRPTDTIKCSVSAIT--DLVFLVDGSWSVGRENFKHIRSF- 117 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIF-G 250 S+ DI + R +V +S+ FPL + + + IN L + G Sbjct: 118 --IASLAGAFDI------GEDKTRVAVVQYSTDTRTEFPLTRYTRRGDLLQAINSLPYKG 169 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + ++Y IF + K + +TDG++ P ++ L Sbjct: 170 GNTMTGDAIDYLLQNIFTEAGGSRK------SFPKVAMIITDGKSQDPVEEHARRL---- 219 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 + G ++ +G++ D+ + ++P Y+V N K+ + +I +E+ Sbjct: 220 --RNIGVEIFVLGIKGADEDELREIASTPHSKHMYNVPNFDKIQEVQKKIIREV 271 >gi|86740090|ref|YP_480490.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86566952|gb|ABD10761.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 319 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 73/211 (34%), Gaps = 29/211 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM +L A + +D + P N GLV+F+ Sbjct: 89 IILAIDVSNSMAATDIAP-TRLAAAKQGASAFVDQL---PPRIN---LGLVSFAGSATVL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ I L G T G+ + I A ++ G I+ L Sbjct: 142 VPASADRESVRAGIRGLQLGPATAVGEGIFASLQAITTAGKRFS--DTGQSAPPAAIVLL 199 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DGE + +N A++ V I +Q L++ Sbjct: 200 SDGETTRGRPNN----QAIEAARQARIPVDTIAYGTADGTLDVGGQEVPVPVNEQALRDI 255 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A + ++ + +L + +G + + Sbjct: 256 AEQTGGSYHRATSGDELRSVYRGLGSSIGYR 286 >gi|171914502|ref|ZP_02929972.1| von Willebrand factor type A domain protein [Verrucomicrobium spinosum DSM 4136] Length = 424 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 81/215 (37%), Gaps = 27/215 (12%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 +++ + +T +S +++ +V+D S SM DK+ A + ++ LD + Sbjct: 19 TTYLKVGLTGQELEASAKRAPVNVTIVIDKSGSMGG------DKMVHAREAAKQALDRLG 72 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + V+ +V + + P ++ I+R+ G +T G Sbjct: 73 AGDMVS------VVAYDDAVSLISPATDLTDRDRVKAAIDRIQAGGSTALFSG------- 119 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 I E+L + + + ++ L+DG N P+ + AK G V +G+ Sbjct: 120 ISKGAEELRRNKRPNQVNR--VVLLSDGMANVGPSSPQDLGRLGASLAKE-GITVTTLGL 176 Query: 325 QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + + A S ++NS+ L F Sbjct: 177 GLGYNEDLMTELALRSDGNHAFIENSQNLAGIFQT 211 >gi|223973011|gb|ACN30693.1| unknown [Zea mays] Length = 481 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 71/202 (35%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ GLD++ VLDVS SM G ++K+ A + + + L I R +V Sbjct: 20 STSDRSGLDLVAVLDVSGSMQ---GEKIEKMKTAMKFVVKKLSSID---------RLSIV 67 Query: 222 TFSSKIVQTFPLAWGVQ----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TF + PL + + + I+ L G T + GL+ + D K + Sbjct: 68 TFLDTANRICPLRQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVV 127 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 ++ ++DG+ + K VY G A+ L A Sbjct: 128 G--------VMLMSDGQQNRG--------EPAANVKIGNVPVYTFGFGADYDPTVLNAVA 171 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 F V + L AF Sbjct: 172 RNSMGGTFSVVNDVNLLSMAFS 193 >gi|327261941|ref|XP_003215785.1| PREDICTED: hypothetical protein LOC100567114 [Anolis carolinensis] Length = 1225 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 72/202 (35%), Gaps = 26/202 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D+S SM K+ +++ +L D++ +VTFS + Sbjct: 277 DVVFVIDISGSM------YGTKMKQTKKAMHVIL------SDLHQDDFFNIVTFSDTVNV 324 Query: 230 TFP------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P ++ ++ ++++ T L A + + I + Sbjct: 325 WKPSQSIQATPQNIKKAKDYVSKMEADGWTDINAALLAAASVFNHSSPMAGKIMRDQRIP 384 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-- 341 IIFLTDGE +S L +A + ++ + +A L+ + +R Sbjct: 385 --LIIFLTDGEPTSGVTTGSRILSNAQQALKGTISLFGLAFGDDADYGLLRRLSLENRGV 442 Query: 342 ----FYSVQNSRKLHDAFLRIG 359 + + +L + I Sbjct: 443 ARRIYEDADATLQLKGFYDEIA 464 >gi|255578117|ref|XP_002529928.1| protein binding protein, putative [Ricinus communis] gi|223530558|gb|EEF32436.1| protein binding protein, putative [Ricinus communis] Length = 731 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 58/329 (17%), Positives = 111/329 (33%), Gaps = 49/329 (14%) Query: 55 SLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL 114 + L T I + + +D + I + + ++ + + I+ T Sbjct: 173 ARLDTNRHISSLFHAQEPPIFDDDEALDQQHEIAHRNLSTK-NDSSDSHSLGTIDVKTYP 231 Query: 115 SIIIDDQHKDY-NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 + + + N + + P +N P ++S S +D++ Sbjct: 232 EVSAVSRSASHDNFCVLIHLKAPVTSIRHNSSSNHMELP-------QMSQNSRAPVDLVT 284 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 VLDVS SM KL + R++ ++ + R ++ FSS + FPL Sbjct: 285 VLDVSGSMAG------TKLALLKRAMGFVIQNLGPSD------RLSVIAFSSTARRLFPL 332 Query: 234 ----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 G Q +N L+ T GL I D + + II Sbjct: 333 RCMTEAGRQEALLSVNSLVSNGGTNIAEGLRKGAKVIVD---------RKWKNPVASIIL 383 Query: 290 LTDGENSSPNIDNKE--------SLFYCNEAKRRG----AIVYAIGVQAEAADQFLKNCA 337 L+DG+++ SL + + G V++ G A+ + + + Sbjct: 384 LSDGQDTYTVTSPSGMNPRADYKSLLPISIHRNGGTGLKIPVHSFGFGADHDAASMHSIS 443 Query: 338 --SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 S F ++ + DAF + IG + Sbjct: 444 EISGGTFSFIEAEGVIQDAFAQCIGGLLS 472 >gi|269926840|ref|YP_003323463.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] gi|269790500|gb|ACZ42641.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] Length = 918 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 81/214 (37%), Gaps = 38/214 (17%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMN----------DHFGPGMDKLGVATRSIRE 201 L + S ++ + + + ++M +D S SM + + G+ K+ +A S Sbjct: 389 TLPVDSQIR-NPDEEPQVAVVMAIDKSGSMAACHCEGSKLLEQYPGGIPKVDIAKESAIL 447 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 + + G+V F + + V I EK+ + T GL Sbjct: 448 SSETLGPNDIF------GVVAFDTAPRWVVRPEPVTDKSSIAEKVAGIQGSGGTNIYGGL 501 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + K K +H +I LTDG ++ N D ++A+R G + Sbjct: 502 AEAIDSLIKVKAKNKH-----------VILLTDGWSNVGNYD-----ELISKARRHGITI 545 Query: 320 YAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKL 351 + A + Q L++ A FY+ ++S + Sbjct: 546 STVS-AAGGSAQLLRSIAEKGGGTFYNTRDSADI 578 >gi|298491708|ref|YP_003721885.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233626|gb|ADI64762.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 418 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 46/225 (20%), Positives = 78/225 (34%), Gaps = 29/225 (12%) Query: 145 CANSSHAPLLITSSVKISS-KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 C SS L ++ S + + L++ ++LD S SMN L +++ ++ Sbjct: 17 CQPSSQRQLAVSISAVGETLDRRVPLNLCLILDHSGSMNG------RALETVKKAVSLLV 70 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 D + S R +V F + P + I+++INRL T GL Sbjct: 71 DQLSSED------RLSIVVFDHRAKILVPNQIISDRNQIKQQINRLTADGGTAIDEGLRL 124 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 ++ K+ A LTDGEN DN L + A + Sbjct: 125 GIEELAKGKKDTISQA----------FLLTDGENEHG--DNNRCLKFAQLAASYNLTLNT 172 Query: 322 IGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVK 364 +G L+ A +++ + D F R+ M Sbjct: 173 LGFGDNWNQDILEKIADAGLGNLSHIEHPNQAVDKFSRLFSRMQT 217 >gi|332210152|ref|XP_003254169.1| PREDICTED: collagen alpha-1(XXI) chain [Nomascus leucogenys] Length = 957 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 RTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|297678422|ref|XP_002817079.1| PREDICTED: collagen alpha-1(XXI) chain-like [Pongo abelii] Length = 612 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 RTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|331090683|ref|ZP_08339532.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] gi|330400097|gb|EGG79748.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] Length = 3699 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 56/313 (17%), Positives = 102/313 (32%), Gaps = 66/313 (21%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY----EMPFIFCTFPWCA 146 D L +G D S + +++ D + + Y Y + +F F Sbjct: 133 DIVFVLDTSGSMSDPMEYIYSPTYNVVTDGRVE-YYAEVEGNYVKIDRITGLFGFF---- 187 Query: 147 NSSHAPLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 H + ++SD G+ K+G ++ + Sbjct: 188 --KHWEVAGKEVTPKKNESDTNGIQFYT--------RREKPNSQSKMGALKIAVNQFAQE 237 Query: 206 I----KSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTP 257 SI D R +VTFSS+ L + V + IN L T + Sbjct: 238 TAKRNDSITDAAKQHRMSIVTFSSESYIRQSLKAYNSNTVSEFERTINGLNANGATYANL 297 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKR 314 G+E A + + +EK +K +IF TDG D+ ++ K Sbjct: 298 GMEKAKESLKNVREKA----------QKVVIFFTDGTPGRSGFDDDTANNTIQAAKSLKD 347 Query: 315 RGAIVYAIGVQAEAA--------DQFLKNCAS-----------------PDRFYSVQNSR 349 +Y+IGV +A + ++ +S + + + Q++ Sbjct: 348 DLTKIYSIGVFDQANPDNTSSSFNAYMHGVSSNYPNATKWTELGERAENSNYYKAAQDAD 407 Query: 350 KLHDAFLRIGKEM 362 +L+ F I +EM Sbjct: 408 ELNKIFEEIFEEM 420 >gi|328545070|ref|YP_004305179.1| von Willebrand factor type A [polymorphum gilvum SL003B-26A1] gi|326414812|gb|ADZ71875.1| von Willebrand factor type A [Polymorphum gilvum SL003B-26A1] Length = 552 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 64/185 (34%), Gaps = 40/185 (21%) Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA---KEKLEHIAKG 279 F K L Q + + + + T G+ + + + K Sbjct: 366 FLCKTQPITDLTNDKQALLDAVAAMRADGYTNIHQGVVWGWRVLTPQEPFSRGRSPDQKR 425 Query: 280 HDDYKKYIIFLTDGENSS----------------------------------PNIDNKES 305 D+++ +I +TDG N+ +D + + Sbjct: 426 EKDHRRIMIVMTDGANTYQDKSSSHNRTEYNAYGYGTEQRLGSGIDTAGEIAAKMDERTA 485 Query: 306 LFYCNEAKRRGAIVYAIGVQAE--AADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEM 362 L N A VY I Q A + L++CAS P+ + ++ +L AF RIGKE+ Sbjct: 486 LACRNAATYEATQVYTIAFQVGDYATRKLLRDCASSPEMAFDAGSNSELVTAFERIGKEI 545 Query: 363 VKQRI 367 + R+ Sbjct: 546 SRLRL 550 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 45/258 (17%), Positives = 83/258 (32%), Gaps = 39/258 (15%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 + I F + +G+I+I ++ G+ I+ S K++L Sbjct: 1 MKIFRFLSDRRGNIAIAFGSFAFLLTAGSGVGIDMSRVVTEKSRLQSAA-----DATALA 55 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 N ++G +Q + ++ R + N D Sbjct: 56 ANYKSGTYTAEQIRQHAEAYFDGLYTAPERGSVSRNVTVGDGT----------------- 98 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 +S + MP F + + K+ + S D+++VLD S SM Sbjct: 99 ---ISVEAGVTMP---TFFAPLLGVEEISFAVMAESKVGTAS---FDVVLVLDNSGSMA- 148 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----SSKIVQTFPLAWGVQH 239 G M L A + L I I ++ V GLV F + + W + Sbjct: 149 --GSRMTTLKQAASDLIRTLMSINEISTEDDRVMVGLVPFTAFVNIGADKATQP-WMDRE 205 Query: 240 IQEKINRLIFGSTTKSTP 257 + ++ F + + TP Sbjct: 206 GRSPVHWTNFQTGSDGTP 223 >gi|224077994|ref|XP_002192008.1| PREDICTED: similar to matrilin 4 [Taeniopygia guttata] Length = 580 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 45/197 (22%), Positives = 82/197 (41%), Gaps = 26/197 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ M + M+DII ++ N R G++ +SS+ Sbjct: 31 GPLDIVFVIDSSRSVRPFEFETMRRF---------MMDIIGNLDVGPNATRVGVIQYSSQ 81 Query: 227 IVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + F L + ++ IN +I T + ++YA N F +E + K Sbjct: 82 VQNIFSLKTFFTRADMERAINSIIPLAQGTMTGLAIQYAMNVAFTTQEGARPLHK---RI 138 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + I +TDG +A+ G +YA+G+Q + L+ ASP + Sbjct: 139 PRIAIVVTDGRPQD------RVTEVATQARNAGIEIYAVGIQRADMNS-LRAMASPPLEE 191 Query: 341 RFYSVQNSRKLHDAFLR 357 + V++ +L F + Sbjct: 192 HVFLVESF-ELIQQFAK 207 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 44/209 (21%), Positives = 82/209 (39%), Gaps = 33/209 (15%) Query: 160 KISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K SK G +D++MV+D S S+ + + + ++D+++ P R Sbjct: 334 KTCSKCGAGHVDLVMVIDGSKSVRPQ------NFELVKQFVNRIVDLLEVSPHG---TRV 384 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKL 273 GLV +SS++ FPL H ++I + + T + L++ F E Sbjct: 385 GLVQYSSRVRTEFPL--NKYHSADEIKKAVMDVEYMEKGTMTGLALKHMVEHSFSELEGA 442 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ + + TDG + + + AK G +++A+GV ++ L Sbjct: 443 RPLSYN---IPRIGLVFTDGRSQD------DISEWARRAKESGIVMFAVGVGKAVEEE-L 492 Query: 334 KNCAS---PDRF-YSVQ--NSRKLHDAFL 356 + AS F YS L + F Sbjct: 493 RAIASEPVEQHFSYSADFTTMTHLVENFS 521 >gi|226358120|ref|YP_002787859.1| hypothetical protein Deide_2p00900 [Deinococcus deserti VCD115] gi|226319763|gb|ACO47757.1| Conserved hypothetical protein [Deinococcus deserti VCD115] Length = 418 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 77/227 (33%), Gaps = 26/227 (11%) Query: 136 PFIFCTFPWCANSSHAPLLI-TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 P + + + + V L++ V+D S SM+ L + Sbjct: 10 PLRAGLTAGQTTTLTLLIRVHPAPVTTQVSQRPPLNLAFVIDRSGSMSGL------PLQM 63 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGST 252 A ++ I ++ R +V F ++ P LA + + + I + + Sbjct: 64 AKQA------AIAAVRQARPDDRVSVVAFDDRVDVIVPSQLATSREAVIQAIGTIDDRGS 117 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G + A + +H+ G + +I L+DG+ + D +E Sbjct: 118 TNLHGGW------LEGATQVAQHLTPGALNR---VILLSDGQANVGVTDRREIARQVRGL 168 Query: 313 KRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 RG IG+ + ++ L A F V++ +L F Sbjct: 169 TERGISTTTIGLGSHYDEELLLAIANAGDGNFEHVEDPSRLPTFFEE 215 >gi|302336993|ref|YP_003802199.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301634178|gb|ADK79605.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 333 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 70/219 (31%), Gaps = 49/219 (22%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D++ VLD S SM G + A +IR ++ GLV F + Sbjct: 90 GVDIVFVLDQSPSMIVRDFGGDTRFDAAKHAIRTFVE-------GREHDPLGLVIFGDEA 142 Query: 228 VQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P +++ + G + G+ A + + + Sbjct: 143 ALVTPPTLDYTSFLSRMDAVRVMKLGRGSALGLGMAVATVHLEKSSAER----------- 191 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------------- 330 K ++ ++DGEN++ I + + A G +YA+GV E + Sbjct: 192 KVMVIVSDGENNAGEITPESA---ARVAASLGIRIYAVGVGGEGSVATEFTDPETGKSYR 248 Query: 331 ---------QFLKNCASP--DRFYSVQNSRKLHDAFLRI 358 + LK + + + L F I Sbjct: 249 GTYEGKIDMELLKAVTESTRGQAFLAGSPGALSQVFREI 287 >gi|148656915|ref|YP_001277120.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569025|gb|ABQ91170.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 561 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 74/218 (33%), Gaps = 28/218 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P ++ + I ++ +D+M VLDVS SM D +L A ++R ++ ++ Sbjct: 362 RPDVLAAIRSIWVENKKRVDVMAVLDVSGSMADE-----ARLEQAKTALRIFIEQLQDDD 416 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 GL FS P++ I +I L T+ AY ++ Sbjct: 417 GF------GLTIFSDSATVLTPVSPIGPKRAEILNRIAGLTPRGGTRLLDTTVEAYQEMS 470 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQA 326 A + ++ LTDG ++ + ++ L + + V+ + Sbjct: 471 ---------ATPPGQRIRAVVVLTDGLDNKSQRNAQDVLNLLRQDREGYSIKVFTVAFGG 521 Query: 327 EAADQFLKNCA----SPDRFYSVQNSRKLHDAFLRIGK 360 +A LK A + + + I Sbjct: 522 DADVNLLKEIAEATGAKSYVGKPGERGSIERVYQDIAT 559 >gi|86143680|ref|ZP_01062056.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] gi|85829723|gb|EAQ48185.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] Length = 349 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 66/176 (37%), Gaps = 24/176 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 K+ + G+D++ +DVS SM+ D +DK I L Sbjct: 81 TKLETVKREGVDVVFAVDVSKSMDAEDIAPSRIDKAKQLVTQIINNLGS----------D 130 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G++ ++ P+ + +N + T A I DA E + Sbjct: 131 RVGIIAYAGSAYPQLPITTDYSSAKMFLNAM----NTDMLSSQGTA---IRDAIELAKTY 183 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + ++ ++DGE+ + + + A +G ++ IGV +EA D+ Sbjct: 184 YNDEEQTNRVLVIISDGEDHAGEVAS-----IAESATEQGIRIFTIGVGSEAGDRI 234 >gi|282879637|ref|ZP_06288368.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306585|gb|EFA98614.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 332 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 68/218 (31%), Gaps = 41/218 (18%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D+M+ +DVS SM + P ++L A E + GL F+ + Sbjct: 87 GIDIMLAMDVSTSMLAEDLKP--NRLEAAKNVAAEFI-------SGRPNDNIGLTIFAGE 137 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ + L+ T GL + K K Sbjct: 138 AFTQCPMTTDHTSLLN----LLRNVRTDIAARGLISDGTAVGMGLANAVSRLKDSKTKSK 193 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ----------AEAADQFLK- 334 +I LTDG N+ +I S AK VY IGV Q++ Sbjct: 194 VVILLTDGSNNMGDISPMTS---AQIAKSLDIRVYTIGVGTNKVAPYPMSVGGGTQYINI 250 Query: 335 ------------NCASPDRFYSVQNSRKLHDAFLRIGK 360 + FY N+++L + I K Sbjct: 251 PVEIDSKTLSDIAAVTEGNFYRATNNQQLKQIYKDIDK 288 >gi|297286916|ref|XP_002808380.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-5(VI) chain-like [Macaca mulatta] Length = 2604 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 46/199 (23%), Positives = 84/199 (42%), Gaps = 21/199 (10%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 GLD++ VLD S S+ + M I + ++K N V+ G + +S + Sbjct: 811 TGLDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRNGVQFGALKYSDQ 861 Query: 227 IVQTFPLAW--GVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L I E + R G +T + L++A N +F EH ++ ++ Sbjct: 862 PNILFYLNTYSNRSAIIENLRMRRDTGGSTYTAKALKHA-NALF----TEEHGSRIKENV 916 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 K+ +I +TDG+ + D+ + +E + +G + A+GV + + + Sbjct: 917 KQMLIVITDGK----SHDHDQLNDTASELRDKGITILAVGVGKANQKELEGMAGNKNNTI 972 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V N KL D F + + M Sbjct: 973 YVDNFDKLKDVFTLVQESM 991 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 25/190 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + ++ +L I+ D G+V FS Sbjct: 623 EDMKADIMFLVDSSWSIGNE------NFRKMKIFMKNLLTKIQIGADKTQ---IGVVQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + Q I + I+R+ T + L + H Sbjct: 674 DTTKEEFQLNRYFTRQEISDAIDRMSLINKGTLTGKALNFVGQYFT-------HSKGARL 726 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 727 GAKKFLILITDGVARDYVRDPARIL------RGKNVTIFSVGVYNANRSQLEEISGDGSL 780 Query: 342 FYSVQNSRKL 351 + V+N L Sbjct: 781 VFHVENFDHL 790 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 72/200 (36%), Gaps = 24/200 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ + + R + E+ ++ PD VR G+V +S Sbjct: 442 DIHFLIDGSSSIQEK------EFEQIKRFMLEVTEMFSIGPDK---VRVGVVQYSDNAEV 492 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F ++ I + N T + L+Y + + Y Sbjct: 493 EFYISDYSNDIGLRKAIFNIKQLTGRTYTGKALDYILQIXKNGSKDRMSK------VPCY 546 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG + ++ + L + ++A+G+ A + + +R + Q Sbjct: 547 LIVLTDGMSMDRVVEPGKRL------RAEQITIHAVGIGAANKIELQEIAGKEERVHFGQ 600 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N L + +E+ ++ Sbjct: 601 NFDALKSIKNEVVREICTEK 620 >gi|310286822|ref|YP_003938080.1| von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|309250758|gb|ADO52506.1| conserved hypothetical protein containing von Willebrand factor type A domain [Bifidobacterium bifidum S17] Length = 1156 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 52/325 (16%), Positives = 108/325 (33%), Gaps = 49/325 (15%) Query: 80 SYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIF 139 S +K I + NG + + S + + + ++ + Sbjct: 522 SKYKVKEINVDQDTYAVSANGGQVKVTQEKDSATTEPVSVGEVPRTTVTNTVVTAPRYRK 581 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSD-IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + L +T + SS++ D+++V D S SM++ G +L VA + Sbjct: 582 YIKANNDGTYDLSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMG-HNSRLEVAKTA 640 Query: 199 IREMLDIIKSIPD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + M + + + ++ +R LV FS+ I +N L T Sbjct: 641 VNSMAQHLLTSENQGKDSNIRMALVPFSTTAGNVSNFTDNAMDIVSAVNGLGADGGTN-- 698 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-------- 308 + A + KKYI+F++DG+ + + + Sbjct: 699 ------WEAALKAANAKLTSGRKG--VKKYIVFMSDGDPTYRTSSVRTGTDWLGRPIYDA 750 Query: 309 --------------------------CNEAKRRG-AIVYAIGVQAEAADQFLKNCASPDR 341 EA RRG A ++++GV ++ + Sbjct: 751 DDGWGLPAGVHGSGLSDRYGANLSSAVAEANRRGDATLFSVGVSSDPTKMRGFADQTKGS 810 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQR 366 +YS ++ +L+ AF I ++ ++ Sbjct: 811 YYSATSTDELNKAFADIIGQINRKS 835 >gi|209809179|ref|YP_002264717.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] gi|208010741|emb|CAQ81132.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] Length = 422 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 64/430 (14%), Positives = 132/430 (30%), Gaps = 80/430 (18%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 + KG +IL A+++P +F + L + + KA++ + + L + N Sbjct: 2 KLRRHQKGHAAILFAMMIPALFGIFTLASDGARAIQTKARIEDAAEVATLAVSAH--NDP 59 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 N + G + + I + + +++ D N + + + K Sbjct: 60 NQDYGGGGSPSSANQQIVTDYINAYISDV-------DSINEIKVYKRNCEEIPECKAGLA 112 Query: 128 SAVSRY---EMPFIFCTFPWCANSSHA-----PLLITSSVKISSKSDIGLDMMMVLDVSL 179 RY E+ W + + +D+M D S Sbjct: 113 VGEPRYFEHEVGVTTSQKSWFPGNDAIVGMGDSFSTSGHSLARKYQSEAVDVMFAADFSG 172 Query: 180 SMNDHFGPGMDKLGVA---TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-- 234 SM D + G K SI + L + +N G+ ++ + + Sbjct: 173 SMGDRWTGGNKKYEDLIDIIDSISKELQKFNDLEHNDNDNTMGITAYNEYTYSQYSGSSG 232 Query: 235 -----------------WGVQHIQEKINRL-----------IFGSTTKSTPGLEYAYNKI 266 WG I + I+ L + S + L ++ + Sbjct: 233 GWWGDDCYLSQAESDGFWGGVSISKTIDGLWNEKSKDHCNNSYNSGRFNDIPLTSNFDVV 292 Query: 267 FDAKE---------------KLEHIAKGHDDYKKYIIFLTDGENSSPNI----------- 300 + + + ++ +I L+DG ++ N+ Sbjct: 293 NQDVSRFWPEGGTSSYQALIRGAQLLTYGTNSRRLLIVLSDGMDTDNNLTSSLVNAGMCR 352 Query: 301 DNKESLFYCNEAKRRGAI--VYAIGVQAEAA-DQFLKNCASPDRFYSVQNSRK-LHDAFL 356 D ++ L R + IG E + +Q LK+C + Y +NS L+ Sbjct: 353 DIQQGLESDKTLDNRPIRAQMAVIGFDYEPSENQALKDCVGAENVYKAENSDDILNTILE 412 Query: 357 RIGKEMVKQR 366 I +E+ + Sbjct: 413 LISEEIGHLK 422 >gi|310694574|gb|ADP05359.1| collagen type VI alpha 1 protein [Bubalus bubalis] Length = 1027 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 78/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D +K N+V +G + Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLKDRYYRCDRNLVWNAGALH 92 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 93 YSDEVEIIRGLTRMPSGRDELKSSVDAVKYFGKGTYTDCAIKKGLEELLVGG-------- 144 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY++ +TDG + L NEAK G V+++ + + + L A Sbjct: 145 SHLKENKYLVVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSIIA 204 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 205 TDHTYRRNFTAADWGQSRDAEEVISQTI 232 >gi|219520386|gb|AAI43866.1| COL21A1 protein [Homo sapiens] Length = 957 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 50/237 (21%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D LK A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELKAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|313675311|ref|YP_004053307.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312942009|gb|ADR21199.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 322 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 69/172 (40%), Gaps = 23/172 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +G D+M+ +D+S SM+ + +L ++ ++D N R GL+ FS Sbjct: 74 KAVGKDIMISVDLSASMDANDVAP-SRLEKIKYELKNIVDAF-------NSDRIGLIIFS 125 Query: 225 SKIVQTFPLAWGVQHIQEKI----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S+ PL + + I L+ GS+T L A+ K+ A Sbjct: 126 SEAFVQCPLTYDQNALNLFIETLNTGLVPGSSTDFGSALNMAHEKLTSEA------APSS 179 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K II ++DGE D + ++ N G ++++GV E + Sbjct: 180 QQKSKIIILISDGE--DFGDDTEGAVSKIN---DSGIRLFSLGVGTEQGSKI 226 >gi|13473479|ref|NP_105046.1| hypothetical protein mll4092 [Mesorhizobium loti MAFF303099] gi|14024228|dbj|BAB50832.1| mll4092 [Mesorhizobium loti MAFF303099] Length = 477 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 37/174 (21%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK------ 284 PL Q + + I T +++ Y + + A+ Sbjct: 302 IPLTADKQKLLDTIADFKAAGVTAGGIAVQWGYYMLSPSWRSTIVNARLGSGPANFDNRK 361 Query: 285 --KYIIFLTDGE--------------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 K I +TDG+ ++ + + C+ KR G ++ IG + Sbjct: 362 VGKVAILMTDGQFNTAFAAGRGAPRSQNAGQMSRSNAESICDNMKRDGIEIFTIGFDLDD 421 Query: 329 ----------ADQFLKNCASPD-----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A L++C++ D +Y +L +AF I + + + I Sbjct: 422 PSMTSTERDQAKSVLQDCSTADTSTLKHYYEAATGPELDEAFNAIVQNIERLTI 475 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 84/225 (37%), Gaps = 30/225 (13%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + G+ +IL V+ + G ++ S + K+ L ++D ++ TA + Sbjct: 14 FARHSGGNFAILFGFAASVLALAAGFSVDISQLYNAKSGLQGVVDAAVTSTARDLTTG-- 71 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 K + + ++N + + + + T+ ++ D Sbjct: 72 -----VIKEADASKAVQNFLVANSMAGILQPDQIVLDRLVVDRTANTVQAD--------- 117 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + ++ F F + + S+ + S I + MM LDV+ SM ++ Sbjct: 118 --AHVDVALFFPVF----GMGNTQRVTASTTSLYSDKTIEVAMM--LDVTGSMAANWWAK 169 Query: 189 MDKL----GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DK+ A+ ++ +LD +I N VR +V ++ + Sbjct: 170 TDKIGDLQAAASTAVENLLDN--NIDPNNPRVRVAIVPYAEAVNT 212 >gi|317505805|ref|ZP_07963650.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316255887|gb|EFV15112.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 343 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 39/269 (14%), Positives = 93/269 (34%), Gaps = 40/269 (14%) Query: 130 VSRYEMPFIFCTFP---WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 P + P A + + +S + ++++LD+S SM Sbjct: 55 KIAPRRPKLVRYIPSAVLVAGLLFLTVALAGPTAVSQVAKNQATVILLLDISESMRATDV 114 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 ++ A + + +D + V+ G+VTF+ + + ++ I++ Sbjct: 115 KP-SRVEAARAAAIKFVDGMAP------TVQLGVVTFAGNAQPLVRPSTDHETAKKVIDQ 167 Query: 247 LI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN--I 300 +I T + G+ A +I L K H + I+ ++DG+ + P+ Sbjct: 168 MIRPDKLEKQTATGEGIYTALQQIETIAGALGG--KNHAPPAR-IVLVSDGKETVPDDLN 224 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAAD--------------QFLKNCA----SPDR- 341 + + AK + V + +A + LK + SP Sbjct: 225 APRGAYAAARTAKEKHVPVCTVAFGTKAGKITLDNQVDEVPVDLESLKKISDLSNSPGNS 284 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRIL 368 F+ ++ +L + + +++ + + Sbjct: 285 CRFFPAESQGELAQIYQSLNEDIGYENVR 313 >gi|328541712|ref|YP_004301821.1| hypothetical protein SL003B_0088 [polymorphum gilvum SL003B-26A1] gi|326411464|gb|ADZ68527.1| hypothetical protein SL003B_0088 [Polymorphum gilvum SL003B-26A1] Length = 454 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 63/460 (13%), Positives = 143/460 (31%), Gaps = 130/460 (28%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + + SI + +L+ ++ ++ G ++ ++A + + LD ++L A ++ Sbjct: 16 FGRDARASILPMVGVLVALMVVIGGAGLDYGRAIMLRASISHALDAAVLAVARQLSVS-- 73 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + + IK+ + + + D+ + +D + + Sbjct: 74 -----IMTDSELDKAIKDAFAANMASAGLSGATLGDLTYV---------LDPDAGTISAT 119 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 A + + F + I +S + S +++ MV+DV+ SM + Sbjct: 120 ATA-----LVPTYFIHVGGLGPENVAIAASADATY-SRFDVELAMVVDVTGSMRNSMAS- 172 Query: 189 MDKLGVATRSIREML-----------------------DIIKSIPDVNNVV--------- 216 L A +S+ ++L ++ + P V+N Sbjct: 173 ---LRTAAQSVVDILIPDGTKKSASKVRIALVPYSQGVNLGEYAPKVSNGDAGTQNCVTE 229 Query: 217 RSGLVTFSSKIV---------------------QTFPLAWGVQHIQEKINRLIFGSTTKS 255 R G ++ Q PL + I++L T Sbjct: 230 RMGNEKYTDATYNYNGTSSEFFGGGSNSCASTPQMEPLTSKRNTLTSAISKLKDNGRTAG 289 Query: 256 TPGLEYAYNKIFDA-----KEKLEHIAKGHDDYKKYIIFLTDGENS-------------- 296 G+ + + + + D K+ + +TDG+ + Sbjct: 290 QTGIAWGWYALSPKWSNLWPNDSVPGSYTDSDILKFALIMTDGDFNEYYDKATAQSNCKW 349 Query: 297 ---------------------------SPNIDNKESLFYCNEAKRRGAIVYAIGVQA--- 326 N+ + + C K+ G VY+I + Sbjct: 350 QFNWSTFKWEQVCDSSYVWTAYSEAAGYSNVSSTRAKTLCAAIKQTGIQVYSIYFGSNAN 409 Query: 327 EAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVK 364 A + +K+CAS + F+ + +L AF +I ++ Sbjct: 410 SAGAKVMKDCASSTKETFFMATSDSELIAAFAKIANKIQN 449 >gi|296393889|ref|YP_003658773.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296181036|gb|ADG97942.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 87/234 (37%), Gaps = 38/234 (16%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + ++ +++V+D+SLSM +D +D A IK + ++ Sbjct: 87 SGPTTLARVPKNRATVVLVVDISLSMVCDDVRPTRVDAARQA---------AIKFVDEME 137 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDA 269 ++ GLVTF+ + + ++ ++ I + T + G+ A +I Sbjct: 138 PTLQLGLVTFAGTAQTLIAPSSDHEIVKHALDEAIRPDKLAARTATGEGIYTALQQIETL 197 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQ-- 325 L +K I+ +DG+ + P+ + + EAK + +Y+I Sbjct: 198 SSILGGKSKAPSAR---IVLESDGKETVPDDLNAPRGAFTAAKEAKAKEVPIYSISFGTT 254 Query: 326 --------------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 A D L+ A S +F++ + +L D + + E+ Sbjct: 255 RPIPYVNIQGSRVPVPADDASLQKVAELSGGKFFTAGSLDQLSDVYSSLNAEIG 308 >gi|55981030|ref|YP_144327.1| hypothetical protein TTHA1061 [Thermus thermophilus HB8] gi|55772443|dbj|BAD70884.1| conserved hypothetical protein [Thermus thermophilus HB8] Length = 706 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 53/234 (22%), Positives = 83/234 (35%), Gaps = 35/234 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F P + + + G +++VLDVS SM KL +A Sbjct: 272 LLFTATPKGLFFGGWDRALPEELPLKPLGREGAALVLVLDVSGSMAGE------KLSMAV 325 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGST 252 +++ R G+V FSS FP A + + + L G Sbjct: 326 AGALALVESAAPED------RLGVVVFSSGHRVLFPPRPMTAQAKKEAESLLLSLRAGGG 379 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T A + + K ++ LTDG D KE + + A Sbjct: 380 TVLGGAFREAVRLLQGVPGER-----------KAVLVLTDGL----IADAKEPIL--DLA 422 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + G V A+ + +A FLK A RFY + R+L FLR G+E+ + Sbjct: 423 QTSGVEVSALALGPDADAPFLKELARRGGGRFYQAPSPRELPRLFLREGQEVFR 476 >gi|302527162|ref|ZP_07279504.1| von Willebrand factor [Streptomyces sp. AA4] gi|302436057|gb|EFL07873.1| von Willebrand factor [Streptomyces sp. AA4] Length = 326 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 78/237 (32%), Gaps = 28/237 (11%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + +M+V+DVSLSM +L A + ++ Sbjct: 65 VLSLLFLTVSLAGPTAEQKVPRNRATVMLVIDVSLSMEATDVAP-TRLKAAQDAAKQF-- 121 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 ++ V GL++F+ + + I+ L +T + G+ A Sbjct: 122 ----AQNMTPGVNLGLISFAGTATVLVNPTTDRAGVTKAIDNLKLAQSTATGEGIYAAMQ 177 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAI 322 I + I+ ++DG+ + P + + AK+ + +I Sbjct: 178 SIQSFSAVVGGADGP---PPARIVLMSDGKQTVPEDLYAPRGAYTAAQAAKQAQMPISSI 234 Query: 323 GVQAEAA--------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 E D+ L+ A S FY ++ +L + +G+++ Sbjct: 235 SFGTEHGSVDIEGKQQDVRVDDESLREIARLSGGEFYKAASADELKRVYADLGEQIG 291 >gi|163849338|ref|YP_001637382.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527332|ref|YP_002571803.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670627|gb|ABY36993.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451211|gb|ACM55477.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 418 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 91/221 (41%), Gaps = 36/221 (16%) Query: 148 SSHAPLLITSSVKISSKSDI------GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 +S P L+ + V++S++S L++ +V+D S SM +L ++ + Sbjct: 17 ASQEPQLLYALVELSAQSGATKMPRLPLNLCLVIDRSSSMRGE------RLQQVKQAAMQ 70 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSK--IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +LD++ LVTF+ + +V + LA I+ +I+ + T+ GL Sbjct: 71 ILDLLGDHESF------ALVTFNDRAEVVVSAQLARARAEIKRQISAIEAAGGTEMATGL 124 Query: 260 EYAYNKIFDA-KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++ A + H ++ LTDG D + A+ RG Sbjct: 125 ALGVQELQRAMMPRAVHR----------LLLLTDGRTYG---DEGRCVEIARRAQSRGIG 171 Query: 319 VYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLR 357 + A+G+ +E + L+ A+ + + + ++ ++ F Sbjct: 172 ITALGIGSEWNEDLLETIAARENSRTHYITSAAEITKIFTA 212 >gi|320353059|ref|YP_004194398.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121561|gb|ADW17107.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 336 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 76/212 (35%), Gaps = 29/212 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + + GLD+M+V+DVS SM+ KL + +++ + N Sbjct: 54 ITGQAGASGMTGGTGLDLMLVIDVSGSMSGS------KLTAVKAAAVALVNSL-----PN 102 Query: 214 NVVRSGLVTFSS---KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 N + G+V +SS + L + IN L +T + ++ A ++ ++ Sbjct: 103 NTTQVGIVKYSSSANMVEMLQDLTSNKSDLIATINGLSASGSTATGTAIQVATAELLSSR 162 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 H K + L+DGE + + +A +G V+ +GVQ Sbjct: 163 AIAGHA--------KMEVVLSDGEYNVG----IDPKIAAAQAHAQGITVHTVGVQLYGTG 210 Query: 331 QF---LKNCASPDRFYSVQNSRKLHDAFLRIG 359 A F +V N L F G Sbjct: 211 YTSMQQTAVAGGGIFTNVNNLNDLVALFSGTG 242 >gi|109009638|ref|XP_001105446.1| PREDICTED: epithelial chloride channel protein-like [Macaca mulatta] Length = 829 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 48/202 (23%), Positives = 72/202 (35%), Gaps = 36/202 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKL-GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +VLD S SMN D+L + + ++ II+ V G+VTF S Sbjct: 310 VCLVLDKSGSMNRE-----DRLFRMNQAAELYLIQIIEKGSLV------GMVTFDSSAEI 358 Query: 230 TFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L T GL+ + I + + Sbjct: 359 QNNLTKIIDENTYQKITANLPQKPSGGTSICGGLKAGFQAISQSNQSTSGSE-------- 410 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 II LTDGE++ + C E K+ GAI++ I + A + L N RF Sbjct: 411 -IILLTDGEDNQMSS--------CFEEVKQSGAIIHTIALGPSADRELETLSNMTRGRRF 461 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 Y+ ++ L DAF RI Sbjct: 462 YAHKDINGLIDAFSRISSRSGN 483 >gi|297567412|ref|YP_003686384.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851861|gb|ADH64876.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 319 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 42/246 (17%), Positives = 83/246 (33%), Gaps = 37/246 (15%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + I + +D +++ LDVS SM + Sbjct: 56 LPAVLFLLALILGIVALARPIIPIL----HADPRTTIVLALDVSRSMRATDVLP-SRFEA 110 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A +++ I+ +P R GLVTFS + Q + + + + T Sbjct: 111 AREALKVF---IRELPQGA---RIGLVTFSRAATEVVAPTTNRQRLLDSVELIGLEFGTA 164 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 G+ + + E+ + + II LTDG +I + L A Sbjct: 165 IGEGILTSLQAL-PPLEQRKDAKDPSELAT--IILLTDGR----SISGIDPLEAARIAAE 217 Query: 315 RGAIVYAIGVQ-----------------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 + ++ IGV A + LK A + +++ V ++ KL + + Sbjct: 218 QKVRIHTIGVGRVTEGPVPGLESVYQWAAYFDEDVLKQIAAITGGKYFFVNSAGKLRETY 277 Query: 356 LRIGKE 361 ++ + Sbjct: 278 QQLSQS 283 >gi|296394903|ref|YP_003659787.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296182050|gb|ADG98956.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 42/278 (15%), Positives = 95/278 (34%), Gaps = 41/278 (14%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 + + A +R + A + + +S + +++VLD+S Sbjct: 48 ANTELLDSIAPTRPGLTRHIPAAVLLAGLVFLTVALAGPTAVSQVAKNQATVILVLDISK 107 Query: 180 SMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 SM D +D A IK + + V+ G+VTF+ + Sbjct: 108 SMAATDVKPSRVDAARAA---------AIKFVDGMAPTVQLGVVTFAGSAQPLVRPSTDH 158 Query: 238 QHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + ++ I+++I T + G+ A +I L K H + I+ ++DG Sbjct: 159 ETAKKVIDQMIRADKLEKQTATGEGIYTALQQIETIAGALGG--KNHTPPAR-IVLVSDG 215 Query: 294 ENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--------------QFLKNCA 337 + + P+ + + AK + V + ++ LK + Sbjct: 216 KETVPDDLNAPRGAYAAARTAKEKHIPVCTVAFGTKSGKITIDNQVDEVPVDLDSLKKIS 275 Query: 338 ----SPDR---FYSVQNSRKLHDAFLRIGKEMVKQRIL 368 SP F+ ++ +L + + +++ + + Sbjct: 276 DLSNSPGNSCRFFPAESQAELAQIYQSLNEDIGYENVR 313 >gi|113475004|ref|YP_721065.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166052|gb|ABG50592.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 412 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 71/203 (34%), Gaps = 28/203 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L++ ++LD S SM L ++ ++++ +K R +V F Sbjct: 37 ERTVPLNLCLILDHSGSMEG------RPLETVKQAAVQLVEKLK------EGDRLSVVAF 84 Query: 224 SSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + P + I+ KIN+L T GL+ ++ +++ A Sbjct: 85 DHQAQVIVPNQMINDSASIKGKINKLRASGGTAIDKGLKLGIEELNKGRKESISQA---- 140 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASP 339 LTDGEN DN L A + ++G + L+ A Sbjct: 141 ------FILTDGENEHG--DNDLCLKLAKLATDYNITLNSLGFGDDWNQDVLEKIADAGG 192 Query: 340 DRFYSVQNSRKLHDAFLRIGKEM 362 +Q + + F ++ + Sbjct: 193 GNLSYIQQPEQAIEEFSKLFNRI 215 >gi|290960274|ref|YP_003491456.1| lipoprotein [Streptomyces scabiei 87.22] gi|260649800|emb|CBG72916.1| putative lipoprotein [Streptomyces scabiei 87.22] Length = 537 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 74/218 (33%), Gaps = 28/218 (12%) Query: 146 ANSSHAPLLITSSVKISS--KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + L+ ++ + + V+DVS SM + +L +A ++R M Sbjct: 161 TGPGNWSLVRVGLATRAAGDRQRPPAALTFVIDVSGSMAEP-----GRLDLAQDALRTMT 215 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLE 260 + + + +VTFS + P+ I+E + L +T G+E Sbjct: 216 NRL------RDDDSVAVVTFSDEAETVLPMTRLDGNRGEIREAVAGLEPTDSTNLAAGVE 269 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIV 319 Y + K ++ L+D ++ + D L E + G + Sbjct: 270 TGYETAVEGLRKGATNR---------VVLLSDALANTGSTDADTILERIAGERREHGITL 320 Query: 320 YAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 + +GV ++ D ++ A V + F Sbjct: 321 FGVGVGSDYGDALMEQLADKGDGHTTYVSTEEEAEKVF 358 >gi|148258759|ref|YP_001243344.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] gi|146410932|gb|ABQ39438.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] Length = 449 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 58/450 (12%), Positives = 127/450 (28%), Gaps = 105/450 (23%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F + +G++++ AI+ + +G ++ S ++AKL +D + + ++ Sbjct: 8 LARFRSDIQGNVAVTFAIVCVPLITAVGCGVDYSRANQLRAKLQSAVDAASVGAVSR--- 64 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + I D RN N + S++ + Sbjct: 65 ---TSPAFIAAGAMTADGIITAGNDDARNIFNGN---MNGTTGYTLNSVTPEVKKTGSVL 118 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + +P +F + + K ++ +D ++LD S SM Sbjct: 119 TATVSFSASVPMMFMNI-----VGIKTMTLQGMSKATASMPKYIDFYLLLDNSPSMGVAA 173 Query: 186 GPG------------------------------------------MDKLGVATRSIREML 203 P ++ V + ++++ Sbjct: 174 TPDDVTKMVNATSDAKYGSNRYCAFACHDYNDSNNFYNLAKSIGVTTRIDVLRSATQQLM 233 Query: 204 DIIKSIPDVNNVVRS-----GLVTFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKS 255 D N R G + + + F L+ + + I+ + + Sbjct: 234 DTATQTQTYPNQFRMAIYDFGAASKTIGLRALFALSANLSSAKSAAGNIDLMGVYGNNDA 293 Query: 256 TPG-LEYAYNKIFDAKEKLEHIAKGHDD--YKKYIIFLTDGENSSPNIDNKESLF----- 307 + + +F A KY+ F++DG N + Sbjct: 294 YTADKDTPFTAVFPAVNNEISTPGDGTTGSPLKYLFFVSDGVADESNAACLKPKASGNRC 353 Query: 308 -------YCNEAKRRGAIV---YAIGVQAEAA-----------------------DQFLK 334 C K RG + Y +Q Q ++ Sbjct: 354 QSPINPALCTTLKNRGIKIAVLYTTYLQLPTNSWYMSWIDPFNKGPFGPSPNSEIAQNMQ 413 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 CASP ++ V ++ + DA + K+ V Sbjct: 414 ACASPGFYFEVSPTQGIADAMNALFKKAVA 443 >gi|189066649|dbj|BAG36196.1| unnamed protein product [Homo sapiens] Length = 957 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|119624855|gb|EAX04450.1| collagen, type XXI, alpha 1, isoform CRA_c [Homo sapiens] Length = 552 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|119624854|gb|EAX04449.1| collagen, type XXI, alpha 1, isoform CRA_b [Homo sapiens] Length = 567 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|114607957|ref|XP_001157649.1| PREDICTED: collagen alpha-1(XXI) chain isoform 3 [Pan troglodytes] Length = 957 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|114607959|ref|XP_001157475.1| PREDICTED: collagen, type XXI, alpha 1 isoform 1 [Pan troglodytes] gi|114607961|ref|XP_518554.2| PREDICTED: collagen, type XXI, alpha 1 isoform 4 [Pan troglodytes] gi|114607963|ref|XP_001157591.1| PREDICTED: collagen, type XXI, alpha 1 isoform 2 [Pan troglodytes] Length = 954 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|18780273|ref|NP_110447.2| collagen alpha-1(XXI) chain precursor [Homo sapiens] gi|74752071|sp|Q96P44|COLA1_HUMAN RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|15593270|gb|AAL02227.1|AF414088_1 collagen XXI [Homo sapiens] gi|19310967|gb|AAL86699.1|AF438327_1 alpha 1 type XXI collagen precursor [Homo sapiens] gi|55665071|emb|CAH73913.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202573|emb|CAI22496.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202937|emb|CAI22395.1| collagen type XXI alpha 1 [Homo sapiens] gi|116496597|gb|AAI26109.1| Collagen, type XXI, alpha 1 [Homo sapiens] gi|215434893|gb|ACJ66843.1| alpha 1 type XXI collagen precursor [Homo sapiens] Length = 957 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|17974510|gb|AAL50033.1|AF330693_1 alpha 1 chain-like collagen COLA1L precursor [Homo sapiens] gi|55665070|emb|CAH73912.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202574|emb|CAI22497.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202938|emb|CAI22396.1| collagen type XXI alpha 1 [Homo sapiens] Length = 954 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|120437735|ref|YP_863421.1| von Willebrand factor (vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579885|emb|CAL68354.1| membrane protein containing von Willebrand factor (vWA) type A domain [Gramella forsetii KT0803] Length = 354 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 71/206 (34%), Gaps = 29/206 (14%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S+ + + + K+ + G+D++ +DVS SM+ + Sbjct: 53 SKPVIKLVLILLALASLVIALVNPKMG-TKMETVKREGVDIVFAIDVSKSMDAEDIAP-N 110 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--- 247 +L + + + ++L + S R G++ ++ P+ + + L Sbjct: 111 RLEKSKQLVSQILSSLGS-------DRVGIIAYAGGAYPQLPITTDFSAAKMFLQALNTD 163 Query: 248 -IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 I T + +E A D + + ++DGE+ N+++ Sbjct: 164 MISSQGTAISDAIELATTYYDDD-----------QQTNRVLFIISDGEDHEGNVED---- 208 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 +A +G ++ IGV E Sbjct: 209 -IAEQAAEKGIRIFTIGVGTEKGGPI 233 >gi|110679843|ref|YP_682850.1| hypothetical protein RD1_2614 [Roseobacter denitrificans OCh 114] gi|109455959|gb|ABG32164.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 488 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 59/179 (32%), Gaps = 46/179 (25%) Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL----------------EHIAK 278 +Q+ I+ + T + G++Y + + Sbjct: 308 NNAGQLQDFIDDMRLHDGTGTQYGMKYGVALLNPSSRDTFVALNAAGLVPDGFKDRPADF 367 Query: 279 GHDDYKKYIIFLTDGENSSP-----------------------------NIDNKESLFYC 309 G D +K+I+ +TDG+ + + + C Sbjct: 368 GTTDTRKFIVLMTDGQITDQFRPEDKNDPKNDEIALNQRIGDRDTYATQSTNVANFYSIC 427 Query: 310 NEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 N+AK G VY I +A A ++ CA+ F+ ++ AF I +++ + R+ Sbjct: 428 NKAKAAGITVYTIAFEAPANAITQMRTCATSPAFFYKVEGVEIKTAFKSIARQINELRL 486 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 72/204 (35%), Gaps = 32/204 (15%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 + F G+++I +++ ++ +V G+ ++ ++ ++ LD ++L + Sbjct: 16 FCRRLMGFRREEDGAMTIFATMMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAASD 75 Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 + ++ T+F ++ Sbjct: 76 L--------DQPLPADEVVDDYFAKAGMTEF----------------LDDVQITPGAHLP 111 Query: 122 HKDYNL-SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 ++ + A +R P I+ P+ + + + + ++ ++ +VLD+S S Sbjct: 112 TTNFRVVQAEARTRTPSIYMA---MTGVRSLPVYVAGTAEETIENT---EISLVLDISGS 165 Query: 181 MNDHFGPGMDKLGVATRSIREMLD 204 M ++ G + A I +L+ Sbjct: 166 MRNNGKIGNLR-TAAKDFIGAVLE 188 >gi|219804724|ref|NP_001137337.1| collagen alpha-1(VI) chain [Bos taurus] gi|296490826|gb|DAA32939.1| collagen, type VI, alpha 1 [Bos taurus] Length = 1027 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 81/208 (38%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPG---MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D +D+ VLD S S+ P +DK+ T+ + L+ D N V +G + Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLNDRYYRCDRNLVWNAGALH 92 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 93 YSDEVEIIRGLTRMPSGRDELKSSVDAVKYFGKGTYTDCAIKKGLEELLVGG-------- 144 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY++ +TDG + L NEAK G V+++ + + + L A Sbjct: 145 SHLKENKYLVVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSIIA 204 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 205 TDHTYRRNFTAADWGQSRDAEEVISQTI 232 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 55/162 (33%), Gaps = 21/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H + A R L ++ P VR +V +S Q Sbjct: 828 DITILLDGSASVGSHNFDITKRF--AKRLAERFLTASRTDPG--QDVRVAVVQYSGTGQQ 883 Query: 230 TFP---LAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + ++ + F T L Y +A Sbjct: 884 RPERAALQFLQNYTVLANTVDSMDFFNDATDVMDALGYVTRFYREASSNAA--------- 934 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG +S EA+R G ++A+ V Sbjct: 935 KKRLLLFSDG--NSQGATPAAIEKAVQEAQRAGVEIFAVVVG 974 >gi|326326039|ref|YP_004250848.1| hypothetical protein VIBNI_0107 [Vibrio nigripulchritudo] gi|323669090|emb|CBJ93137.1| Protein of unknown function (exported) [Vibrio nigripulchritudo] Length = 1081 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 64/197 (32%), Gaps = 30/197 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + L++ VLD S SM + ++ + + V R ++ Sbjct: 306 SNNVRPLELAFVLDSSGSMRWSDPDNIR-----------IVGSKHLVDRLKEVDRGAVID 354 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F S L I+ ++ + T G+ A + +A+ + Sbjct: 355 FDSTAQLLQSLTDNKAVIKSALDLIDASGGTDIGDGVSKALEEFANARSASDWA------ 408 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 ++ LTDG S + E + + V I + + A ++ + Sbjct: 409 ----VVLLTDGSGSYNHALTTELVQ-------KNIRVLGITMGSGANQSLIRGISDSTYG 457 Query: 342 -FYSVQNSRKLHDAFLR 357 + V + +L + F R Sbjct: 458 IYQHVNTADELIEVFER 474 >gi|224282379|ref|ZP_03645701.1| hypothetical protein BbifN4_00972 [Bifidobacterium bifidum NCIMB 41171] Length = 1153 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 46/254 (18%), Positives = 92/254 (36%), Gaps = 49/254 (19%) Query: 151 APLLITSSVKISSKSD-IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 L +T + SS++ D+++V D S SM++ G +L VA ++ M + + Sbjct: 590 LSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMG-HNSRLEVAKTAVNSMAQHLLTS 648 Query: 210 PD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + ++ +R LV FS+ + I +N L T + Sbjct: 649 ENQGKDSNIRMALVPFSTTVGNVSNFTDNAMDIVSAVNGLRADGGTN--------WEAAL 700 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY------------------- 308 A + KKYI+F++DG+ + + + Sbjct: 701 KAANAKLTSGRKG--VKKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVH 758 Query: 309 ---------------CNEAKRRG-AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLH 352 EA RRG A ++++GV ++ + +YS ++ +L+ Sbjct: 759 GSGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPTKMRGFADQTKGSYYSATSTDELN 818 Query: 353 DAFLRIGKEMVKQR 366 AF I ++ ++ Sbjct: 819 KAFADIIGQINRKS 832 >gi|33601708|ref|NP_889268.1| hypothetical protein BB2732 [Bordetella bronchiseptica RB50] gi|33576145|emb|CAE33224.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 336 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 85/245 (34%), Gaps = 41/245 (16%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHF 185 R ++ + A + P + + + D+++ +D+S SM+ D Sbjct: 57 GRAQLWVNVAVWLLLALALARPQWVEPPLT---HVEPMRDILLAVDISQSMDSEDFRDAQ 113 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 G + + + +D R GL+ F + PL ++ + Sbjct: 114 GRPASRWQAVQAVVGDFID-------KRPDDRLGLIVFGAGAYPQAPLTRDHAALRLLLQ 166 Query: 246 RL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 R + G T + + A+E+ K +I LTDG +++ + Sbjct: 167 RTAVGMAGPNTALGDAIGLGIRMLDHARER-----------DKILILLTDGNDTASAVPP 215 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYSVQNSRKLHD 353 + + + +V+ IG+ AA L++ A + RF+ ++ L + Sbjct: 216 ARAAELAAQHR---VVVHTIGIGDPAASGEDRVDFDALRDIARIAGGRFFRARDQASLQE 272 Query: 354 AFLRI 358 + + Sbjct: 273 VYATL 277 >gi|194387934|dbj|BAG61380.1| unnamed protein product [Homo sapiens] Length = 589 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 78/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 92 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 93 YSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG-------- 144 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 145 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIA 204 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 205 TDHTYRRNFTAADWGQSRDAEEAISQTI 232 >gi|194386506|dbj|BAG61063.1| unnamed protein product [Homo sapiens] Length = 404 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 78/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 92 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 93 YSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG-------- 144 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 145 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIA 204 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 205 TDHTYRRNFTAADWGQSRDAEEAISQTI 232 >gi|119629725|gb|EAX09320.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] gi|119629726|gb|EAX09321.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] Length = 726 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 78/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 92 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 93 YSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG-------- 144 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 145 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIA 204 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 205 TDHTYRRNFTAADWGQSRDAEEAISQTI 232 >gi|114684807|ref|XP_001158390.1| PREDICTED: collagen, type VI, alpha 1 isoform 1 [Pan troglodytes] gi|114684809|ref|XP_001158445.1| PREDICTED: collagen alpha-1(VI) chain isoform 2 [Pan troglodytes] Length = 1028 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 78/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 92 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 93 YSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG-------- 144 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 145 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIA 204 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 205 TDHTYRRNFTAADWGQSRDAEEAISQTI 232 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 21/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H + A R L ++ P + VR +V +S Q Sbjct: 829 DITILLDGSASVGSHNFDTTKRF--AKRLAERFLTAGRTDPAHD--VRVAVVQYSGTGQQ 884 Query: 230 TFP---LAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + ++ + F T L Y +A Sbjct: 885 RPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYVTRFYREASSGAA--------- 935 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG +S EA+R G ++ + V Sbjct: 936 KKRLLLFSDG--NSQGATPAAIEKAVQEAQRAGIEIFVVVVG 975 >gi|87196339|ref|NP_001839.2| collagen alpha-1(VI) chain precursor [Homo sapiens] gi|125987811|sp|P12109|CO6A1_HUMAN RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|30851190|gb|AAH52575.1| Collagen, type VI, alpha 1 [Homo sapiens] gi|119629727|gb|EAX09322.1| collagen, type VI, alpha 1, isoform CRA_b [Homo sapiens] Length = 1028 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 78/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 92 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 93 YSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG-------- 144 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 145 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIA 204 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 205 TDHTYRRNFTAADWGQSRDAEEAISQTI 232 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 21/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H + A R L ++ P + VR +V +S Q Sbjct: 829 DITILLDGSASVGSHNFDTTKRF--AKRLAERFLTAGRTDPAHD--VRVAVVQYSGTGQQ 884 Query: 230 TFP---LAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + ++ + F T L Y +A Sbjct: 885 RPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYVTRFYREASSGAA--------- 935 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG +S EA+R G ++ + V Sbjct: 936 KKRLLLFSDG--NSQGATPAAIEKAVQEAQRAGIEIFVVVVG 975 >gi|30032|emb|CAA33888.1| precursor polypeptide (AA -19 to 237) [Homo sapiens] Length = 256 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 78/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 92 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 93 YSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG-------- 144 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 145 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIA 204 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 205 TDHTYRRNFTAADWGQSRDAEEAISQTI 232 >gi|147815707|emb|CAN70517.1| hypothetical protein VITISV_016246 [Vitis vinifera] Length = 715 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 56/292 (19%), Positives = 104/292 (35%), Gaps = 49/292 (16%) Query: 93 RNELRENGFAQD-INNIERSTSLSIIIDDQHKDY-NLSAVSRYEMPFIFCTFPWCANSSH 150 RN N + + +E T + + K Y N + + + N S+ Sbjct: 195 RNSSNGNAAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNIQRNMSN 254 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 +PL + +D++ VLD+S SM KL + R++ ++ + S Sbjct: 255 SPLNSHNP-------RAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSSD 301 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R ++ FSS + FPL G Q + +N L+ T GL + Sbjct: 302 ------RLSVIAFSSTARRLFPLRRMTDAGRQQALQAVNSLVANGGTNIAEGLRKGAKVM 355 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----------AKRRG 316 D KE+ + II L+DG+++ + + N ++ G Sbjct: 356 EDRKER---------NPVSSIILLSDGQDTYTVNGSSGNXPQPNYQLLLPLSMHGSQNTG 406 Query: 317 --AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 V++ G + + + S F ++ + DAF + IG + Sbjct: 407 FQIPVHSFGFGTDHDASSMHTISEISGGTFSFIETESVIQDAFAQCIGGLLS 458 >gi|162452621|ref|YP_001614988.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] gi|161163203|emb|CAN94508.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 73/206 (35%), Gaps = 21/206 (10%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 ++S ++ +V+D S SM G + A + L+ + V R+ + Sbjct: 98 SPARSLAQANLSLVIDRSGSMK---GTRLTNAVQAATTAVSRLNDGDVVSVVTFDTRTSV 154 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V + + P G I + + G T + G+E + + + Sbjct: 155 VVPPTTV---GPETRGR--ILASVRGISLGGDTCISCGIEEGLSLLGQTSAGVSR----- 204 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--S 338 ++ L+DG+ + D A+ RG + IGV + ++ L A S Sbjct: 205 ------MLVLSDGDANHGVRDVPGFRAMAQRARDRGVAITTIGVDVDYNEKILSAIALDS 258 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVK 364 R Y V+N L F +++ Sbjct: 259 NGRHYFVENDAALARIFEAEAEQLTT 284 >gi|329888464|ref|ZP_08267062.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] gi|328847020|gb|EGF96582.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] Length = 650 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 65/171 (38%), Gaps = 44/171 (25%) Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDA-----KEKLEHIAKGHDDYKKYIIFLTDGEN 295 + KI+++ G +T G+ +A+ + + + A D K I +TDGE Sbjct: 478 KSKIDQMSAGGSTAGQIGIAWAWYALSPDFASLFSGEGQPGAYAPSDTLKVAILMTDGEF 537 Query: 296 SSPNIDN---------------------------KESLFYCNEAKRRGAIVYAIGVQ--- 325 ++P D +S+ C + +G +VY +G Sbjct: 538 NTPFRDGVIALDAGTGSGGLDSHIDLNSSNGDPFAQSVALCQAMQAKGVVVYTVGFDLGS 597 Query: 326 -------AEAADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + A ++ CA+ + F+ + L +AF IG+++ + RI Sbjct: 598 ATGREGVVDTALDVMRECATNEQTHFFQADDGTDLKEAFRAIGRDITRLRI 648 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 79/243 (32%), Gaps = 32/243 (13%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F +G+++++ A+ LPV+ ++ I+ VKA + Sbjct: 19 RFLRQTQGNVAMMFAMALPVLLMITLGAIDIHQASKVKA--------------QLQDALD 64 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + I + + + + + S ++ + Sbjct: 65 AAALAAARSTFTDDVNINKVGLAALKANMPSYFGEASGD------TASFVLLNNRVTGEA 118 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG- 186 + + + + P+ S V +S++ +++ M LD++ SM++ Sbjct: 119 TVNVKVLVANVVLPPYGKLLDDFLPVSSRSEVLRASRN---VEVAMALDITGSMDNCSRN 175 Query: 187 -PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-------SSKIVQTFPLAWGVQ 238 P KL + +E++DI+ + L+ + SS I PL Sbjct: 176 CPPTSKLEDLQAAAKELVDIVVQDQQTPFYSKVALIPYAAGVNVGSSAISARGPLDTTTH 235 Query: 239 HIQ 241 +I Sbjct: 236 NIS 238 >gi|260463263|ref|ZP_05811464.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030853|gb|EEW32128.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 661 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 95/233 (40%), Gaps = 33/233 (14%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + +G+ +++TA+ + + + L I+ + K + LD + TA ++ Sbjct: 6 RQFRRDRRGNYALMTAVAMIPLMGGLALAIDFTEMNREKQMVTNALDAANFATARRLTEG 65 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS-TSLSIIIDDQHKDY 125 + + DF F ++N++ + +L++ + Sbjct: 66 ATDDQLRAYALDF---------------------FNANLNDLNPANATLNLTLPSNTAGG 104 Query: 126 NL---SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI--GLDMMMVLDVSLS 180 L +A Y+ P+ + F S + S ++S+ + L++ +VLD S S Sbjct: 105 GLLKMTARLNYK-PYFYPAFAQLVGKSATDANQSISFDVTSQVRLKNTLEVALVLDNSGS 163 Query: 181 MNDH-FGPGMDKLGVATRSIREMLDIIKS----IPDVNNVVRSGLVTFSSKIV 228 M G G ++ + + ++++D + I V+ V+ GLV F++ + Sbjct: 164 MTTLGTGSGQKRIDLLKTASKQLVDTLAQQAVMIKQVDRPVQFGLVPFAASVN 216 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 62/224 (27%), Gaps = 79/224 (35%) Query: 223 FSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +S PL G I+ I+ + T G+ + + + + + Sbjct: 436 YSCTTNAITPLTDVSVTDGATAIKAAIDLMQPNGGTNVPEGMAWGWRVVSSGEPFTQGRL 495 Query: 278 KGHDDYKKYIIFLTDGENSSPN------IDNKESLFY----------------------- 308 + K +I LTDG N+ D +S Sbjct: 496 ETEKGNDKVVIVLTDGANTYYTPSSLSYSDPADSKSTYASYGYLNPGYNGTSVGRMFMGT 555 Query: 309 -------------------------CNEAKRRGAIVYAIGVQ---AEAADQF----LKNC 336 CN AK +V + + +A+D+ LK+C Sbjct: 556 STAIGQFDYSNGNYTNALNEQMATLCNNAKAANIMVMTVALDLSTTKASDKLAIDALKSC 615 Query: 337 ASPDRF-------------YSVQNSRKLHDAFLRIGKEMVKQRI 367 +S RF + L + F IG E+ R+ Sbjct: 616 SSESRFRKDPTDPSKPAKLFWNATGASLSNDFKEIGNELSNLRV 659 >gi|13476808|ref|NP_108377.1| hypothetical protein mll8241 [Mesorhizobium loti MAFF303099] gi|14027569|dbj|BAB53838.1| mll8241 [Mesorhizobium loti MAFF303099] Length = 678 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 91/232 (39%), Gaps = 31/232 (13%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + +G+ +++T + + + + + ++ + K + LD + TA ++ Sbjct: 23 RQFRRDRRGNYALMTVVAMVPLMGGLAIAVDFTEMNREKQMVTNALDAANFATARRLTEG 82 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + K DF F ++N+I+ +++ + + Sbjct: 83 ATDDQLKAYALDF---------------------FNANLNDIDPASATLNVTLPSNTSGG 121 Query: 127 LSAVSRYEMPFIFCTFPWCA-----NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 ++ + +P A +++ A I SV + L++ +VLD S SM Sbjct: 122 GLLTMTAQLAYKPYFYPAFAQLVGKSATDANQKINFSVTSQVRLKNTLEVALVLDNSGSM 181 Query: 182 NDH-FGPGMDKLGVATRSIREMLDIIKS----IPDVNNVVRSGLVTFSSKIV 228 G G ++ + + ++++D + I V+ V+ GLV F++ + Sbjct: 182 TTLGTGSGQKRIDLLKTASKQLVDTLAQQAVMIKQVDKPVQFGLVPFAASVN 233 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 61/224 (27%), Gaps = 79/224 (35%) Query: 223 FSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +S PL G I+ I+ + T G+ + + + + + Sbjct: 453 YSCTTNPITPLTDVSVADGATSIKAAIDLMQPNGGTNVPEGMAWGWRVVSSGEPFTQGRR 512 Query: 278 KGHDDYKKYIIFLTDGENSSPN------IDNKESLFY----------------------- 308 + K +I LTDG N+ D S Sbjct: 513 ETEKGNDKVVIVLTDGANTYYTPSSLGYSDPANSKSTYASYGYLNPGYNGTSVGRMFMGT 572 Query: 309 -------------------------CNEAKRRGAIVYAIGVQ---AEAADQF----LKNC 336 CN AK +V + + +A+D+ LK+C Sbjct: 573 SSAIGQLDYSNGNYTNALNEQMATLCNNAKAANIMVMTVALDLSTTKASDKLAIDALKSC 632 Query: 337 ASPDRF-------------YSVQNSRKLHDAFLRIGKEMVKQRI 367 +S RF + L + F IG E+ R+ Sbjct: 633 SSDSRFRKDPTDPSKPAKLFWNATGASLSNDFKEIGNELSNLRV 676 >gi|331694298|ref|YP_004330537.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948987|gb|AEA22684.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 362 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 61/183 (33%), Gaps = 22/183 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++ LDVS SM ++L A ++R ++ + ++ R GLV FS Sbjct: 87 AVIVALDVSGSMCSTDVVP-NRLAAAQEAVRRFVE------EQDSGTRIGLVVFSGFAEL 139 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA-----------K 278 I ++ L G T + + + I + + Sbjct: 140 AVAPTTDRDAITRALDGLTTGRGTTVGSAILKSVDAISEIDPDVAPSDPAPGTVPPPPRA 199 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + ++ LTDG N++ A RG VY IG + + + A Sbjct: 200 PGTYAPEIVVLLTDGANTTGVTPED----AAKTAAERGVRVYPIGFGTDEPSSMVCSAAQ 255 Query: 339 PDR 341 R Sbjct: 256 LGR 258 >gi|323488845|ref|ZP_08094085.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] gi|323397543|gb|EGA90349.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] Length = 857 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 84/212 (39%), Gaps = 32/212 (15%) Query: 151 APLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 L+ +++ K ++ L +M+V+D S SM K+ +A + +++++S Sbjct: 387 IERLLPVEMEVKGKHELPSLGLMIVMDRSGSMMGL------KMELAKEAAARSVELLRSD 440 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 G++ F + + P + +KI + G T+ LE AY ++ Sbjct: 441 DT------LGVIAFDDQPWEILPTGKVDDPKKAADKILSITPGGGTEIYRSLEQAYTELE 494 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D + + +H II LTDG++S+ N + K + + + + Sbjct: 495 DLELQRKH-----------IILLTDGQSSTSNDYDA----LIENGKDHNITLSTVSIGQD 539 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 A L+ A RFY V ++ + R Sbjct: 540 ADRNLLEQLAGTGSGRFYDVTDATTIPAILSR 571 >gi|313139523|ref|ZP_07801716.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313132033|gb|EFR49650.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 835 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 46/254 (18%), Positives = 92/254 (36%), Gaps = 49/254 (19%) Query: 151 APLLITSSVKISSKSD-IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 L +T + SS++ D+++V D S SM++ G +L VA ++ M + + Sbjct: 272 LSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMG-HNSRLEVAKTAVNSMAQHLLTS 330 Query: 210 PD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + ++ +R LV FS+ + I +N L T + Sbjct: 331 ENQGKDSNIRMALVPFSTTVGNVSNFTDNAMDIVSAVNGLRADGGTN--------WEAAL 382 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY------------------- 308 A + KKYI+F++DG+ + + + Sbjct: 383 KAANAKLTSGRKG--VKKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVH 440 Query: 309 ---------------CNEAKRRG-AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLH 352 EA RRG A ++++GV ++ + +YS ++ +L+ Sbjct: 441 GSGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPTKMRGFADQTKGSYYSATSTDELN 500 Query: 353 DAFLRIGKEMVKQR 366 AF I ++ ++ Sbjct: 501 KAFADIIGQINRKS 514 >gi|148656823|ref|YP_001277028.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568933|gb|ABQ91078.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 851 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 70/213 (32%), Gaps = 31/213 (14%) Query: 149 SHAPLLITSSVK---ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + PL T V+ +++++D S SM G+ K +A + Sbjct: 369 KNTPLEETLPVEMTPPPRPERSDTTLLLIIDQSASMGPE--TGISKFTMAKEA------A 420 Query: 206 IKSIPDVNNVVRSGLVTFSSKI-----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 I + + R G++ F Q + + +Q +I+ L G T L+ Sbjct: 421 IMATESLRQEDRIGVLAFDVSTRWVVDFQPVGVGLSLADVQRRISTLPLGGGTDIYNALQ 480 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 + ++ H + LTDG S D + EA+ + + Sbjct: 481 EGLPALAQQPGRVRHA-----------VLLTDGR--SFTDDRQAYRMLLEEARSQNITLS 527 Query: 321 AIGVQAEAADQFLKNCA--SPDRFYSVQNSRKL 351 I + +A L+ A R++ + Sbjct: 528 TIAIGTDADINLLQELARWGAGRYHYAAEPNDI 560 >gi|296168869|ref|ZP_06850541.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896486|gb|EFG76136.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 327 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 79/216 (36%), Gaps = 31/216 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D+S SM P ++L A + + + + GLV F+ Sbjct: 92 VVLVIDMSQSMRATDVPP-NRLKAAEEAASQF------AAQLTPGINLGLVGFAGTPYLL 144 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P Q + + +L F +T + + A + + I G I+ L Sbjct: 145 VPPTPQHQATIDALKKLDFADSTATGEAIFTALHAVSATA-----ITGGDTPPPARIVLL 199 Query: 291 TDGENSSPN--IDNKESLFYCNE-AKRRGAIVYAIGVQAEAAD--------------QFL 333 +DG + P+ D + ++ A+ G + I + + + Sbjct: 200 SDGRENKPSNPSDPHDGVYTAARLARDEGVPISTISFGTKTGEIEMDGQRVAVPVSTDQM 259 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 K A S + Y+ N +L+ ++ I K++ + + Sbjct: 260 KTIAKLSGGQSYTAGNLAELNKSYNAIEKDIGYRTV 295 >gi|296198464|ref|XP_002746714.1| PREDICTED: collagen alpha-1(XXI) chain isoform 2 [Callithrix jacchus] Length = 954 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 46/212 (21%), Positives = 89/212 (41%), Gaps = 35/212 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + K +++I K+ ++ G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGPENFEIVKKW---------LINITKNFDIGPKFIQVGVV 79 Query: 222 TFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S V PL ++ +H+ + L G T++ +++A + +F AK Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLMAAVESILYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 K + LTDG++ D A+ ++AIGV +E D L+ A Sbjct: 131 SSRFLTKIAVVLTDGKSQDEVKD------AAEAARDSKITLFAIGVGSETEDAELRAIAN 184 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 S + V++ + A +I +E++KQ++ Sbjct: 185 KPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|296198462|ref|XP_002746713.1| PREDICTED: collagen alpha-1(XXI) chain isoform 1 [Callithrix jacchus] Length = 957 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 46/212 (21%), Positives = 89/212 (41%), Gaps = 35/212 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + K +++I K+ ++ G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGPENFEIVKKW---------LINITKNFDIGPKFIQVGVV 79 Query: 222 TFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S V PL ++ +H+ + L G T++ +++A + +F AK Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLMAAVESILYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 K + LTDG++ D A+ ++AIGV +E D L+ A Sbjct: 131 SSRFLTKIAVVLTDGKSQDEVKD------AAEAARDSKITLFAIGVGSETEDAELRAIAN 184 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 S + V++ + A +I +E++KQ++ Sbjct: 185 KPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|212720733|ref|NP_001132911.1| collagen, type XXII, alpha 1 [Gallus gallus] Length = 1599 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 44/198 (22%), Positives = 76/198 (38%), Gaps = 29/198 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D++ +LD S S+ G + + + +++ + PD R G+V +S Sbjct: 33 KNVHYDLVFILDASSSV------GKEDFEKVRQWVSNLVETFEIGPDK---TRVGVVRYS 83 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + F L G E+I +G T + L Y N +KE + Sbjct: 84 DRPTTEFDL--GKYKTCEEIKEAARKIRYYGGNTNTGDALRY-INTYSFSKEAGGRL--S 138 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 KK I LTDG + +D N A++ G ++A+GV EA + L AS Sbjct: 139 DRTVKKVAILLTDGRSQDYVLDP------ANAARQAGIRIFAVGVG-EALKEELDEIASE 191 Query: 339 --PDRFYSVQNSRKLHDA 354 + V + + Sbjct: 192 PKSAHVFHVSDYNAIDKI 209 >gi|119624853|gb|EAX04448.1| collagen, type XXI, alpha 1, isoform CRA_a [Homo sapiens] Length = 429 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 49/237 (20%), Positives = 96/237 (40%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 K+ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 KTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDDVKD------AAQAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|307943460|ref|ZP_07658804.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307773090|gb|EFO32307.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 320 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 65/201 (32%), Gaps = 60/201 (29%) Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG--- 279 F+S++ + ++ ++RL T GL + + + G Sbjct: 122 FASRMYLN---RNNLGGLKAAVDRLTLSDGTGMDIGLLWEAKALSPKLRTAAALDGGLLP 178 Query: 280 -------HDDYKKYIIFLTDG--------------------------------------E 294 +K I+ +TDG Sbjct: 179 GHPTDWSDKQTQKVIVLMTDGGITAQYRPKDPWKGLNPKDMRRGIVNARRNVQYVTTRGN 238 Query: 295 NSSPNIDNKESLFY----CNEAKRRGAIVYAIGVQAEAA---DQFLKNCA-SPDRFYSVQ 346 +SP S+ Y C++AK +G I+Y +G Q D L CA SP +Y V+ Sbjct: 239 MNSPANSKHNSVAYMKTMCDQAKAKGIIIYTVGFQIRRNTLPDLSLSYCATSPSHYYFVE 298 Query: 347 NSRKLHDAFLRIGKEMVKQRI 367 +S L AF I + RI Sbjct: 299 SS-DLSAAFKAIASSIKSLRI 318 >gi|209527269|ref|ZP_03275780.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492336|gb|EDZ92680.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 414 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 72/201 (35%), Gaps = 28/201 (13%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P ++ + S+V + + +++ ++LD S SMN G ++ + A + + + Sbjct: 16 PNQESNQRQLSISVSAVPDPFEGQVPMNLCLILDHSGSMN---GQPLETVKQAAKELIDR 72 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFP---LAWGVQHIQEKINRLIFGSTTKSTPGL 259 L++ R +V F + P L I++KI+ L T G+ Sbjct: 73 LNV---------GDRISVVAFDHRAKVLVPNQDLT-DPDGIKKKIDGLRCSGGTAIDEGI 122 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + ++ K+ LTDGEN DNK L A + Sbjct: 123 KLGIEELGKGKQDRISQG----------FLLTDGENEHG--DNKRCLKLAKLATEYKLTI 170 Query: 320 YAIGVQAEAADQFLKNCASPD 340 ++G + L+ A Sbjct: 171 NSLGFGDDWNQDILEKIADAG 191 >gi|288921031|ref|ZP_06415322.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347549|gb|EFC81835.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 401 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 38/175 (21%), Positives = 60/175 (34%), Gaps = 19/175 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LDVS SM ++L A ++ D I++ P + R GLVTF+ Sbjct: 88 ILLALDVSGSMCSTDVSP-NRLTAAEKAAT---DFIRAQPGGS---RIGLVTFAGSAGLL 140 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD------AKEKLEHIAKGHDDYK 284 P + + L T G+ + + I + G + Sbjct: 141 VPPTDDTDKLLAALKSLTTSRGTAIGQGILTSIDAIAEVDPSVPPTGADVPGGTGGEYAA 200 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 I+ LTDG N+ + EA R VY IG + C S Sbjct: 201 DVIVVLTDGANTVGV----DPRTAAGEAATRRLRVYTIGFGTTTPAPMV--CDSS 249 >gi|166367777|ref|YP_001660050.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] gi|166090150|dbj|BAG04858.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] Length = 420 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 75/220 (34%), Gaps = 29/220 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +L ++ +++ +++ +VLD S SM L ++ +++ + Sbjct: 26 MLSIAATSEQINTNLPINLCLVLDHSGSMQGK------PLETVKKAALSLIESLGVND-- 77 Query: 213 NNVVRSGLVTFSSKIVQTFP---LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R ++ F + P + I+ KI +L G T G++ + Sbjct: 78 ----RLSVIAFDHRAKVILPSQSREDDLTLIRSKIQQLQAGGGTAIDEGIKLGIQE---- 129 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + G Y +I LTDGEN N N+ L A G + G Sbjct: 130 ------SSTGSKGYVSHIFLLTDGENEHGN--NQRCLKLAEVAAEYGITLNTFGFGDHWN 181 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L+ A + ++ + F R+ + R+ Sbjct: 182 QDILEKIADIAGGSLSYIERPEQALIEFTRLFNRLQSVRL 221 >gi|225449026|ref|XP_002273050.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 710 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 56/292 (19%), Positives = 104/292 (35%), Gaps = 49/292 (16%) Query: 93 RNELRENGFAQD-INNIERSTSLSIIIDDQHKDY-NLSAVSRYEMPFIFCTFPWCANSSH 150 RN N + + +E T + + K Y N + + + N S+ Sbjct: 190 RNSSNGNAAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNIQRNMSN 249 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 +PL + +D++ VLD+S SM KL + R++ ++ + S Sbjct: 250 SPLNSHNP-------RAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSSD 296 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R ++ FSS + FPL G Q + +N L+ T GL + Sbjct: 297 ------RLSVIAFSSTARRLFPLRRMTDAGRQQALQAVNSLVANGGTNIAEGLRKGAKVM 350 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----------AKRRG 316 D KE+ + II L+DG+++ + + N ++ G Sbjct: 351 EDRKER---------NPVSSIILLSDGQDTYTVNGSSGNQPQPNYQLLLPLSMHGSQNTG 401 Query: 317 --AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 V++ G + + + S F ++ + DAF + IG + Sbjct: 402 FQIPVHSFGFGTDHDASSMHTISEISGGTFSFIETESVIQDAFAQCIGGLLS 453 >gi|242076422|ref|XP_002448147.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] gi|241939330|gb|EES12475.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] Length = 697 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 48/233 (20%), Positives = 89/233 (38%), Gaps = 36/233 (15%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 ++S+ + + + S + +D++ VLDVS SM KL + +++ ++ Sbjct: 241 SSSNDVTGSLVNESSMRSSRRVPIDLVTVLDVSGSMAG------TKLALLKQAMGFVIQ- 293 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEY 261 + R ++ FSS + FPL G Q + I+ L G T L+ Sbjct: 294 -----HLRPSDRLSVIAFSSTARRLFPLQRMSHHGRQQALQAISSLGAGGGTNIADALKK 348 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS-------SPNIDNKESLFYCNEAKR 314 A I D K + II L+DG+++ + SL + Sbjct: 349 AVKVIEDRNYKNSVCS---------IILLSDGQDTFNISSNFQGTSAGRRSLVPPSILNE 399 Query: 315 RGAI-VYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + ++ G A+ L + AS F +++ + DAF + IG + Sbjct: 400 LHMVPLHTFGFGADHDSDTLHSISEASGGTFSFIEDEGVMQDAFAQCIGGLLS 452 >gi|222528069|ref|YP_002571951.1| YD repeat-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222454916|gb|ACM59178.1| YD repeat protein [Caldicellulosiruptor bescii DSM 6725] Length = 3027 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 55/325 (16%), Positives = 108/325 (33%), Gaps = 44/325 (13%) Query: 43 FVKAKLHYILDHSLLYTA---TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN 99 K L +++ + L+ N Q S + N + + Sbjct: 630 LTKNPLTSAEKYAVSEDGKVFVRALSDANILIAPLQVKR-SDNVFINSLKGIVGKAIEIT 688 Query: 100 GFAQDINNIE-----RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCAN-----SS 149 DI E L+ + ++ Y ++ + P N + Sbjct: 689 AGGFDIKKAEIVVNYDEAELNGVEENNLMLYYVNYDKKILEPLEDVVVDTVYNRVSGKTE 748 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 H + + +D++ VLD S SM+ + P ++ + I+ I Sbjct: 749 HFSTFLLGDKNMPVDL-SKVDIVFVLDNSGSMSSN-DPNYYRIEATKKFIQN-------I 799 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFD 268 ++NN R GLV F S + L + + +N + G +T GL+ A Sbjct: 800 DELNN--RVGLVDFDSSVSVRSNLTSDKSKLLQALNAMRWTGGSTNIGGGLKAALGLF-- 855 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK I+ L+DG +++ N E ++ +V I + + Sbjct: 856 ----------DQEQSKKIIVLLSDGYHNTGIHPND----VLPELIKQEIVVNTIALGKDC 901 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKL 351 + L + A + ++ V N+ L Sbjct: 902 DRELLHDIADKTKGGYFYVDNTGGL 926 >gi|73973418|ref|XP_532180.2| PREDICTED: similar to alpha 1 type XXI collagen precursor [Canis familiaris] Length = 961 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 46/237 (19%), Positives = 92/237 (38%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L ++ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLWIVLVLLLQNSVLAEDGEIRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTT 253 +++I K+ ++ G+V +S V PL G + + G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSGKNLVAAMESIHYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 RTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDEVKD------AAEAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|323138635|ref|ZP_08073702.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] gi|322396123|gb|EFX98657.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] Length = 547 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 61/205 (29%), Gaps = 64/205 (31%) Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + L +Q KIN+L+ T G + + + A +K Sbjct: 341 TQRILQLTQKKSDVQNKINQLVANGATNLHEGFMWGWRTLSPNAPFSGGRAYQAPKNRKI 400 Query: 287 IIFLTD--------------------GENSSPNIDNKE---------------------- 304 ++F+TD G S +N+ Sbjct: 401 MVFMTDGFNSWNSRVNTATGSTYDTLGYYSYNGAENERFPDGSQGNGVNYRSLLAAAANN 460 Query: 305 ---------------SLFYCNEAKRRGAIVYAIGVQAEA------ADQFLKNCASP-DRF 342 + C AK G V+ IG +K CA+ D + Sbjct: 461 SSSYQTISRAMQDELTRQACTNAKTAGIEVFTIGFSVSGDPIDAQGLALMKECATNEDHY 520 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRI 367 + +++ +L+ AF +IG + K R+ Sbjct: 521 FKAEDASQLNAAFSQIGIGLGKLRL 545 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 51/294 (17%), Positives = 106/294 (36%), Gaps = 42/294 (14%) Query: 24 LLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRI 83 L+PV+ ++ + + +A L D ++L A+K+ + Q D + + Sbjct: 7 LMPVMLMLGATA-DYTRFTTTRAALQQAADSAVLTVASKMTE----STTNAQAKDQAQVV 61 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 + + ++D + + ++I S + ++ + T Sbjct: 62 LNAQPRMTTAIVTGAT-VSEDKRTVCATAKVTIQN---------SFMQMAQLATLTPTVK 111 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 CAN + + ++ +VLD S SMN G K+ + + + Sbjct: 112 SCANLA-----------GGADPGTTYEIALVLDNSGSMNSSSD-GQSKISILKSAANSFV 159 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFP-------LAWGVQHIQEKINRLIFGSTTKST 256 D + +N V+ +V FSS + P AW ++ + + FG T + Sbjct: 160 DT---MFSKSNNVKFSVVPFSSGVAAVDPSEPSSRNAAWVDKNGANSQHWIAFGGKTAAN 216 Query: 257 PGLEYAYNKIFD--AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 + IFD +G + + Y + + D ++PN + E+LF Sbjct: 217 AAGFTSRFDIFDKLKARNSALDWRGCFEPQVYPLNVND---TTPNPSDAETLFV 267 >gi|149026142|gb|EDL82385.1| rCG29121 [Rattus norvegicus] Length = 905 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 73/204 (35%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G + +L + ++ L I + GLVTF S + Sbjct: 309 VCLVLDKSGSM--SSGDPITRLTLMNQAAELYLIQILEKESL-----VGLVTFDSIAIVQ 361 Query: 231 FPLAWGVQ-----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + I K+ T GL + I + + Sbjct: 362 NNLIRMINDSSYLEISAKL-PQEAAGGTSICNGLRKGFETITSSDQSTCGSE-------- 412 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE+ + C E K GA+++ I + +AA + L + RF Sbjct: 413 -IVLLTDGEDDQ--------ISSCFEEVKHSGAVIHTIALGPDAARELETLSDMTGGRRF 463 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 Y+ + L DAF I + Sbjct: 464 YASEGINGLIDAFSGISSKSGSLS 487 >gi|116487355|ref|NP_001070824.1| chloride channel calcium activated 4-like [Rattus norvegicus] gi|116013527|dbj|BAF34587.1| calcium-activated chloride channel [Rattus norvegicus] Length = 905 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 73/204 (35%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G + +L + ++ L I + GLVTF S + Sbjct: 309 VCLVLDKSGSM--SSGDPITRLTLMNQAAELYLIQILEKESL-----VGLVTFDSIAIVQ 361 Query: 231 FPLAWGVQ-----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + I K+ T GL + I + + Sbjct: 362 NNLIRMINDSSYLEISAKL-PQEAAGGTSICNGLRKGFETITSSDQSTCGSE-------- 412 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE+ + C E K GA+++ I + +AA + L + RF Sbjct: 413 -IVLLTDGEDDQ--------ISSCFEEVKHSGAVIHTIALGPDAARELETLSDMTGGRRF 463 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 Y+ + L DAF I + Sbjct: 464 YASEGINGLIDAFSGISSKSGSLS 487 >gi|297287373|ref|XP_001118050.2| PREDICTED: hypothetical protein LOC721855 [Macaca mulatta] Length = 2077 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 77/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 1100 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 1159 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S + L ++ +++ + FG T + ++ ++ Sbjct: 1160 YSDDVEIIQGLTRMPGDRDTLKSRVDAIKYFGKGTYTDCAIKKGLEQLLVGG-------- 1211 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 1212 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIA 1271 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 1272 TDHTYRRNFTAADWGQSRDAEEVISQTI 1299 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 21/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H + A R L ++ P + VR +V +S Q Sbjct: 1878 DITILLDSSASVGSHNFDTTKRF--AKRLAERFLMAGRTDPAHD--VRVAVVQYSGTGQQ 1933 Query: 230 TFP---LAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + ++ + F T L Y +A Sbjct: 1934 RPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYVTRFYREASSGAA--------- 1984 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG +S EA+R G ++ + V Sbjct: 1985 KKRLLLFSDG--NSQGATPAAIEKAVQEAQRAGIEIFVVVVG 2024 >gi|156741348|ref|YP_001431477.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232676|gb|ABU57459.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 972 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 47/214 (21%), Positives = 79/214 (36%), Gaps = 32/214 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +K L ++MV+D S SM++ G ++L +A ++ + + I V GLV Sbjct: 403 TKQQPDLALVMVIDRSGSMSELVGGSRRNRLDLAKEAVYQASLGLTPIDQV------GLV 456 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F PL V I+ + G T PG+ E+ Sbjct: 457 VFDDAANWVLPLQRLPSVVEIERALGSFGIGGGTNIRPGI-----------EQAAQALAS 505 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-AS 338 D K++I LTDG S D + + G + + + +A + A Sbjct: 506 ADAKVKHVILLTDGIAESNYSD------LIAQMRAAGVTISTVAIGEDANPNLVDVANAG 559 Query: 339 PDRFYSVQNSRKLHDAFL-----RIGKEMVKQRI 367 R Y V + FL G+++V++RI Sbjct: 560 GGRSYRVTRIEDVPRIFLQETIIAAGRDIVEERI 593 >gi|90422080|ref|YP_530450.1| hypothetical protein RPC_0556 [Rhodopseudomonas palustris BisB18] gi|90104094|gb|ABD86131.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 453 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 62/453 (13%), Positives = 135/453 (29%), Gaps = 114/453 (25%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN--Q 66 F ++ +G+I+IL A L + + +G I+ + +++K+ D + + + +K Sbjct: 12 FHHDRRGNIAILFAFSLIPLLVAIGCAIDYARATQIRSKMQSAADAASVGSVSKASPAFL 71 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 G+ NI+ + + ++G+ + L + Sbjct: 72 AAGSMTTDGPIAVGSTDATNIFNGNMAS---QSGYTL--------SKLDAAVTKSGATLT 120 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN---- 182 + + F T L I + +S + +D ++LD S SM Sbjct: 121 STVTFSASVATTFLTI-----IGKTALAIGGTSVSTSSMPVYIDFYLLLDNSPSMGVGAT 175 Query: 183 --------DHFGPGM-----------------------DKLGVATRSIREMLDIIKSIPD 211 D+ ++ V + ++++D + Sbjct: 176 PTDVATMVDNTSDKCAFACHDVNDEHNYYELAKTLGVKTRIDVLRDATQQLMDTAAATAT 235 Query: 212 VNNVVRSGLVTFSSKIVQTF--------PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 N R + F + + I+ + + + +Y Sbjct: 236 YPNQFRMAIYDFGASAQSAALRRLFALSSSLSSAKTAAGAIDLMTVKGQNDNDD-RDTSY 294 Query: 264 NKIFDA--KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-------------- 307 +K+ A K+ A D +KY++F++DG N +++ Sbjct: 295 SKLLPAIDKQITAAGAGTSDAPQKYLLFVSDGVADETNAGCAKTMKNAFWGNKSPRCQSP 354 Query: 308 ----YCNEAKRRGAIV----------------------------YAIG-VQAEAADQF-- 332 C RG V + +G + Sbjct: 355 IDPALCKAMTDRGVKVAVLYTTYLALPLKQANGDPSWYASWIAPFNVGPYGPSPNSEIAN 414 Query: 333 -LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +K CASP ++ V + + DA I ++ V Sbjct: 415 NMKACASPGFYFEVSPTDGIADAMNAIFRKAVA 447 >gi|296124353|ref|YP_003632131.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296016693|gb|ADG69932.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 390 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 50/371 (13%), Positives = 114/371 (30%), Gaps = 65/371 (17%) Query: 34 LVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFR 93 ++ ++ V+ +L D S + ++ + + Sbjct: 33 FTVDVAYMQLVRTELRAATDASAKAGMEALRRTQD--TEAAIDAAIATAAANKVGGRSLT 90 Query: 94 NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPF-IFCTFPWCANSSHAP 152 + F N++ S S + SA++ + F + Sbjct: 91 LTADQIEFGLAFRNVDNSVSFNAGQLPYTAVRVNSAMTESSAAGAVPLFFGSIFGTGQF- 149 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSM-------NDHFGPG---------------MD 190 + + + +++ +D S SM + + PG + Sbjct: 150 ----EPTRSAVSASTEVEICFAIDRSHSMCFDLTGVDWSYPPGTPRNPDPVAFPPHPTLS 205 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT-----------------FPL 233 + +R+++ + I S R +VT++SKI Q+ PL Sbjct: 206 RWASLSRAMQTFVSITASQEPKP---RVAMVTWASKITQSNYEGKLTKTNSPEVFVDVPL 262 Query: 234 AWGVQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + + I + + T G++ A + K + + II + Sbjct: 263 TTNLADLNQAIKGRSEKVMLGATNMAAGIDEARKILNATKSTRPYAH-------RIIILM 315 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-YSVQNSR 349 TDG + + L +A G +++++ + + D + ++ Y NS Sbjct: 316 TDGLWNQG----RNPLLAAQDAANEGIVIHSVSLLPRSGDITPQVSSTTGGVNYPATNSA 371 Query: 350 KLHDAFLRIGK 360 L AF I + Sbjct: 372 ALEAAFADIAR 382 >gi|296269770|ref|YP_003652402.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092557|gb|ADG88509.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 315 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 41/262 (15%), Positives = 86/262 (32%), Gaps = 35/262 (13%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDV 177 + + L ++ E P P LLI + + + + + D +++ +DV Sbjct: 35 RFTNLALLSLVAPERPEWRRHVPAALFLVMMSLLIVGAARPAGEVRVPRDRATIIIAVDV 94 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 SLSM + ++L A + + ++ +P+ N G+V F+ Sbjct: 95 SLSM-EARDVAPNRLIAAKEAAQ---QFVRDLPERFN---VGVVAFARTAAVVISPTTDH 147 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + I L T + A + I + I+ L+DG+N+S Sbjct: 148 AAVTNAIAGLTTRPGTSIGEAVFNALDSIRSFDREAATDPPPAA-----IVLLSDGDNTS 202 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNCA--SPDR 341 + A V I L+ + + R Sbjct: 203 G----RPVSEAIEAAANAKVPVSTIAYGTPDGYVMIDNRPVQVPVNKAALQELSEGTGGR 258 Query: 342 FYSVQNSRKLHDAFLRIGKEMV 363 Y+ +++ +L + + +IG + Sbjct: 259 AYTAESASELREVYQQIGTSLG 280 >gi|242042269|ref|XP_002468529.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] gi|241922383|gb|EER95527.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] Length = 698 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 59/313 (18%), Positives = 108/313 (34%), Gaps = 43/313 (13%) Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELR-----ENGFAQDINNIERSTSLSIIIDDQHKD 124 N + + + R + + + + + L E G D +I+ + ++ + Sbjct: 139 NWPQDEGHMAVVRRLSHTYSGNLQEHLPFFRTLEAGIFNDDEHIDLQSDMNDEHNAITGS 198 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + A S + P I + + L S +S S LD++ VLDVS SM Sbjct: 199 VKIKAYSEF--PAIEQSVTKEIFAILIHLRAPKSSHSAS-SRAPLDLVTVLDVSGSMA-- 253 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHI 240 G + L A + + L R ++ FSS + FPL G Q Sbjct: 254 -GTKIALLKNAMSFVIQTLGPND---------RLSVIAFSSTARRLFPLRRMTLAGRQQA 303 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 + ++ L+ T GL+ I D + K + II L+DG+++ Sbjct: 304 LQAVSSLVASGGTNIADGLKKGAKVIEDRRLKNPVCS---------IILLSDGQDTYTLP 354 Query: 301 DNKESLFYCN-------EAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKL 351 ++ L Y ++ G ++ + A S F + + Sbjct: 355 SDRNLLDYSALVPPSILPGTGHHVQIHTFGFGSDHDSAAMHAIAEISSGTFSFIDAEGSI 414 Query: 352 HDAFLR-IGKEMV 363 D F + IG + Sbjct: 415 QDGFAQCIGGLLS 427 >gi|115379114|ref|ZP_01466238.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|310823566|ref|YP_003955924.1| Batb protein [Stigmatella aurantiaca DW4/3-1] gi|115363897|gb|EAU63008.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|309396638|gb|ADO74097.1| BatB protein [Stigmatella aurantiaca DW4/3-1] Length = 352 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 66/179 (36%), Gaps = 28/179 (15%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K G+D+++VLD S SM P +L A + +LD +K R Sbjct: 94 SKSEMTKRKGIDVVVVLDASKSMLARDVQP--SRLERAKLELNTLLDELKG-------DR 144 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKL 273 GLV F+ PL ++ + + + + L+ A + +A Sbjct: 145 VGLVVFAGDAFIQSPLTSDYSAVKLFLRAVDPEQMPQGGSNIGAALKLANQVLSNADRGA 204 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ ++ L+DGE+ + K G V A+GV +E+ + Sbjct: 205 K---------ERAVVLLSDGEDLFGEVGEAT-----EALKDGGVQVLAVGVGSESGEPI 249 >gi|327400025|ref|YP_004340864.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315533|gb|AEA46149.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 790 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 56/157 (35%), Gaps = 22/157 (14%) Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++ + + L I+ L T G++ A ++ + A G+ Sbjct: 534 GYTPRYDVSQTLTNDTLSANNSIDDLWAYGGTPMGGGIKVARQELV------ANTAPGNI 587 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG------------AIVYAIGVQAEAA 329 +I L+DG + + EA ++Y IG +A Sbjct: 588 P---VMIVLSDGNPTLTSDGTASETLAIQEAIEEAETTKQTTIGGEQILIYTIGFGNDAN 644 Query: 330 DQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + LK A SPD +Y S +L + +I KE+ ++ Sbjct: 645 ETLLKQIATSPDYYYFAATSEELSSIYRQIAKELKEK 681 >gi|291544120|emb|CBL17229.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ruminococcus sp. 18P13] Length = 1117 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 67/201 (33%), Gaps = 31/201 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D+ + +DVS SM D+L A ++ +D + + +V+FS Sbjct: 580 PYDIGLCVDVSGSM------YGDRLEKAKTALNTFIDAMLPQDNAC------MVSFSDNA 627 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + ++ + N+L T + GL + + D K I Sbjct: 628 YLVAGYGASKEVMRSRTNQLRDLYGTNTDVGLSKTISILADQGRSDA---------SKMI 678 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV 345 I + DG+ + + AK G VY I V + L+ A + +Y Sbjct: 679 IMICDGDVNYIQG-------TVDAAKAAGIAVYTINV-VSGDNDLLQKIADETGGEYYYA 730 Query: 346 QNSRKLHDAFLRIGKEMVKQR 366 + ++ I E V Sbjct: 731 ATTEEVVSQVEAIRGETVSAV 751 >gi|328951307|ref|YP_004368642.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451631|gb|AEB12532.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 744 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 51/221 (23%), Positives = 84/221 (38%), Gaps = 29/221 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + + + G+ +++VLDVS SM D KL +A +++ Sbjct: 319 PLADEIPVRPAGRSGVALVLVLDVSGSMADGNPS---KLALAVAGALSLVET------AR 369 Query: 214 NVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R G+VTFSS FP A G + ++RL G +T+ + AY + +A Sbjct: 370 PEDRLGIVTFSSGPRWLFPPRPMTARGKLEAKTLLDRLRPGGSTR----MLEAYRQAIEA 425 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 E LE K I+ LTDG+ D + A+ +G ++ + +A Sbjct: 426 LEALELETKQ-------ILVLTDGQ---VEEDPAALVALAEAARAQGIRTNSVALGGDAD 475 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 L + RF+ V L FL + + L Sbjct: 476 RALLARMSRVGEGRFWDVPTPEDLPRLFLEEAERTFGREAL 516 >gi|163848161|ref|YP_001636205.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526064|ref|YP_002570535.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669450|gb|ABY35816.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449943|gb|ACM54209.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 905 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 71/194 (36%), Gaps = 26/194 (13%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S + L + +V+D S SM++ +L +A + ++ ++ + + Sbjct: 404 PSREERFDLALTLVIDRSGSMSELVDGLRTQLDLAREA------AFQASLGLSRQDQLSI 457 Query: 221 VTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F S PL + I++ ++RL G T G+ A I A ++ H Sbjct: 458 IAFDSVADVILPLQPLPDLATIEDALSRLSAGGGTNIRSGMALAAETIATADARIRH--- 514 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-A 337 +I LTDG + + D + +G V + + + + Sbjct: 515 --------VILLTDGVSETEYAD------LVANLRAQGVTVSTVAIGLNTDPELERVAQI 560 Query: 338 SPDRFYSVQNSRKL 351 ++Y V+ + L Sbjct: 561 GGGKYYVVRQAEAL 574 >gi|224046544|ref|XP_002198814.1| PREDICTED: matrilin 2 [Taeniopygia guttata] Length = 902 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 43/208 (20%), Positives = 81/208 (38%), Gaps = 26/208 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +S ++ LD++ ++D S S++ + I +L + PD + Sbjct: 3 EAIENSCNNKHLDLVFIIDSSRSVSHY------DFEKVKEFILTILQFLDISPDATH--- 53 Query: 218 SGLVTFSSKIVQTFPLAWGVQH--IQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLE 274 GL+ + S + Q F L + I+ + R+ G+ T + L+YA N F E Sbjct: 54 VGLIQYGSTVKQEFSLKTFRRKQDIERAVKRMMHLGTGTMTGLALQYAVNIAFSETEGAR 113 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + + + I+ +TDG P +A+ G +++AIGV + Sbjct: 114 PLR---QNVPRIIMIVTDGRPQDPVA------EIAAKARNSGILIFAIGVGRVDMNTLKS 164 Query: 335 NCASP--DRFYSVQNSRK---LHDAFLR 357 + P + + V N + L AF Sbjct: 165 IGSEPHEEHVFLVANFSQIETLTSAFQT 192 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 84/210 (40%), Gaps = 33/210 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + D + + + +LD ++ P R GL+ +S Sbjct: 608 TEGPIDLVFVIDGSKSLGE------DNFEIVKQFVSGILDTLEISP---KAARVGLLQYS 658 Query: 225 SKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L A ++ ++ + GS T A ++ + A+ Sbjct: 659 TEVRTEFTLRQFSSAKDMKKAVSQMKYMGRGSMT------GLALRQMSERSFTETEGARP 712 Query: 280 -HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + I TDG E + AK+RG I+YAIG+ ++ L+ + Sbjct: 713 FSANVPRISIVFTDGRAQD------EVSEWATRAKQRGIIIYAIGIGKAIEEELLEIASE 766 Query: 339 PD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 P + ++ A I +E+ Q Sbjct: 767 PSYKHLFYAED----FTALEDISEELRAQI 792 >gi|296128023|ref|YP_003635273.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] gi|296019838|gb|ADG73074.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] Length = 500 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 72/206 (34%), Gaps = 25/206 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 SS S+ S D+++V+D S SM++ K+ ++R + + + Sbjct: 147 SSAPASAVSRADADLVLVVDCSGSMDEA-----GKMETTKYALRTL------VSSLRRTD 195 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R +V +S++ + + I+RL +T + GL Y L Sbjct: 196 RVAMVCYSTEADVYLEPTPVAEREGVLAAIDRLAPRDSTNAAAGLALGY--------DLA 247 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFL 333 + + ++ ++DG + D + L + + G + ++GV D L Sbjct: 248 MSMRTEGRLTR-VVLVSDGVANVGETDPEGILARISSQAKAGISLISVGVGITTYNDHLL 306 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLR 357 + A V + F Sbjct: 307 EQLADQGDGWHVYVDGEAEAERVFAT 332 >gi|297199802|ref|ZP_06917199.1| lipoprotein [Streptomyces sviceus ATCC 29083] gi|197710264|gb|EDY54298.1| lipoprotein [Streptomyces sviceus ATCC 29083] Length = 506 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 71/188 (37%), Gaps = 24/188 (12%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 L T S ++ + V+D+S SM++ +L +A RS+ M + + Sbjct: 146 LATRSAGENADERPPAALTFVIDISGSMSEP-----GRLDLAQRSLDTMTERL------R 194 Query: 214 NVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + LVTFS + + P+ I E I+ L +T G+E Y + Sbjct: 195 DDDSVALVTFSDRARKVLPMTRLGGHRDRIHEAIDGLEPTYSTNLGAGVETGYKTAVEGL 254 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEAA 329 K ++ ++D + D L + A+ G ++ +GV ++ Sbjct: 255 RKGATNR---------VVLISDALANDGETDPDAILERIDTARREHGITLFGVGVGSDYG 305 Query: 330 DQFLKNCA 337 D ++ A Sbjct: 306 DALMERLA 313 >gi|301133566|gb|ADK63405.1| C3HC4 type zinc finger protein [Brassica rapa] Length = 677 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 53/276 (19%), Positives = 95/276 (34%), Gaps = 34/276 (12%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 F D + + + L + E+ + + + S L Sbjct: 177 FNDDEALEPQHHNPAESTKPGGVSGKLEVKTYPEISEVVRSVSFKDFSVLINLKAPPVSS 236 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 SS S +D++ VLDVS SM KL + R++ ++ + R + Sbjct: 237 SSSSSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQNLGPFD------RLSV 284 Query: 221 VTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++FSS ++FPL G Q + +N + T GL + D Sbjct: 285 ISFSSTSRRSFPLRLMTETGKQEALQAVNSFVSNGGTNIAEGLTKGAKVLID------RR 338 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR------RGAIVYAIGVQAEAAD 330 K I+ L+DG+++ S + K V+A G A+ Sbjct: 339 FKNSVSS---IVLLSDGQDTYTMTSPTGSNTKGADYKTLLPKEVNRIPVHAFGFGADHDA 395 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + + A S F +++ + DAF + IG + Sbjct: 396 SLMHSIAENSGGTFSFIESETVIQDAFAQCIGGLLS 431 >gi|225435355|ref|XP_002285271.1| PREDICTED: hypothetical protein isoform 2 [Vitis vinifera] Length = 670 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 80/223 (35%), Gaps = 35/223 (15%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYN-LSAVSRYEMPFIFCTFPWCANSSHA 151 R+ + I IE T + + +N + + + P N ++ Sbjct: 207 RSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPLTSGRQNSGTNQTN- 265 Query: 152 PLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ +S+S +D++ VLDVS SM KL + R++ ++ + Sbjct: 266 -------MQPTSQSCRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQSLGPCD 312 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R +++FSS + FPL G Q + +N L+ T GL + Sbjct: 313 ------RLSVISFSSTARRLFPLRRMTDTGRQQALQAVNSLVSNGGTNIAEGLRKGAKVM 366 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 D + + II L+DG+++ + + Sbjct: 367 LD---------RKWKNPVSSIILLSDGQDTYTVCKAEGVIQDA 400 >gi|74203017|dbj|BAE26210.1| unnamed protein product [Mus musculus] Length = 791 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGM 189 Y + F + H + ++SK +D++ +LD S S G Sbjct: 11 YMLLFFRASPTISLQEVHVNRETMGKIAVASKLMWCSAAVDILFLLDGSHS------IGK 64 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL 247 + R D + P VR G + F S FPL Q ++E I + Sbjct: 65 GSFERSKRFAIAACDALDISPGR---VRVGALQFGSTPHLEFPLDSFSTRQEVKESIKGI 121 Query: 248 IFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 +F T++ L+ + + + +I +TDG++ P Sbjct: 122 VFKGGRTETGLALKRLSRGFPGGR---------NGSVPQILIIITDGKSQGPVA------ 166 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + RG +V+A+GV+ D+ L + P Sbjct: 167 LPAKQLRERGIVVFAVGVRFPRWDELLTLASEP 199 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 54/161 (33%), Gaps = 22/161 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ IR+ PDV GLV + S++ Sbjct: 528 SLDLVFLLDASASVGRE------NFAQMQSFIRKCTLRFDVNPDVTQ---VGLVVYGSRV 578 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + +++ + S A I D ++ A+ K Sbjct: 579 QTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAG---TALLHIEDKVMTVQRGARPGVP--K 633 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ LT G +++ + + G V + V A Sbjct: 634 AVVMLTGG------SGAEDAAVPAQKLRGNGISVLVMSVGA 668 >gi|311063719|ref|YP_003970444.1| cell surface protein [Bifidobacterium bifidum PRL2010] gi|310866038|gb|ADP35407.1| Cell surface protein with gram positive anchor domain [Bifidobacterium bifidum PRL2010] Length = 1176 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 46/254 (18%), Positives = 91/254 (35%), Gaps = 49/254 (19%) Query: 151 APLLITSSVKISSKSD-IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 L +T + SS++ D+++V D S SM++ G +L VA ++ M + + Sbjct: 613 LSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMG-HNSRLEVAKTAVNSMAQHLLTS 671 Query: 210 PD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + ++ +R LV FS+ I +N L T + Sbjct: 672 ENQGKDSNIRMALVPFSTTAGNVSNFTDNAMDIVSAVNGLGADGGTN--------WEAAL 723 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY------------------- 308 A + KKYI+F++DG+ + + + Sbjct: 724 KAANAKLTSGRKG--VKKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVH 781 Query: 309 ---------------CNEAKRRG-AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLH 352 EA RRG A ++++GV ++ + +YS ++ +L+ Sbjct: 782 GSGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPTKMRGFADQTKGSYYSATSTDELN 841 Query: 353 DAFLRIGKEMVKQR 366 AF I ++ ++ Sbjct: 842 KAFADIIGQINRKS 855 >gi|163754424|ref|ZP_02161546.1| aerotolerance-related membrane protein [Kordia algicida OT-1] gi|161325365|gb|EDP96692.1| aerotolerance-related membrane protein [Kordia algicida OT-1] Length = 344 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 73/204 (35%), Gaps = 32/204 (15%) Query: 136 PFIFCTFPWCANSSHAPLLITSS---VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 P + T WC + + + + K+ + G+D++ +DVS SM +L Sbjct: 53 PILKLTI-WCLAIASLVVALVNPKIGTKLETVKREGVDIVFAVDVSKSMLAEDISP-SRL 110 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---- 248 + R + E+++ + S R G++ ++ + V P+ + +N L Sbjct: 111 DKSKRIVSEIINNLGS-------DRIGIIAYAGRAVPQLPITTDFSAAKMFLNNLNTNML 163 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T + A +D E+ + ++ ++DGE+ + Sbjct: 164 SSQGTAIDDAIRLA-KTYYDDVEQT----------NRVLVIISDGEDHTGGAG-----QL 207 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF 332 EA + G Y IGV Sbjct: 208 AEEATKEGIKTYTIGVGTTKGGPI 231 >gi|125548980|gb|EAY94802.1| hypothetical protein OsI_16587 [Oryza sativa Indica Group] Length = 708 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 50/224 (22%), Positives = 85/224 (37%), Gaps = 39/224 (17%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S SS++ +D++ VLDVS SM KL + R++ ++ + Sbjct: 262 ESSAGSSRNRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVVQHLGPSD------ 309 Query: 217 RSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 R ++ FSS + F L G Q + IN L T L+ A I D K Sbjct: 310 RLSVIAFSSSARRLFHLQRISHHGRQQALQAINSLGASGGTNIADALKKAMKVIEDRSYK 369 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES----------LFYCNEAKRRGAIVYAI 322 + II L+DG+++ + + N+A+ ++A Sbjct: 370 NSVCS---------IILLSDGQDTYNISSSVQGASPDYKSLVPSSIINDARHT-VPLHAF 419 Query: 323 GVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 G A+ L + AS F +++ + DAF + IG + Sbjct: 420 GFGADHDSDSLHSIAQASGGTFSFIEDEGVMQDAFAQCIGGLLS 463 >gi|115459346|ref|NP_001053273.1| Os04g0508800 [Oryza sativa Japonica Group] gi|32489531|emb|CAE04734.1| OSJNBa0043L24.22 [Oryza sativa Japonica Group] gi|113564844|dbj|BAF15187.1| Os04g0508800 [Oryza sativa Japonica Group] gi|116310776|emb|CAH67569.1| OSIGBa0101P20.12 [Oryza sativa Indica Group] gi|125590953|gb|EAZ31303.1| hypothetical protein OsJ_15416 [Oryza sativa Japonica Group] Length = 708 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 50/224 (22%), Positives = 85/224 (37%), Gaps = 39/224 (17%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S SS++ +D++ VLDVS SM KL + R++ ++ + Sbjct: 262 ESSAGSSRNRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVVQHLGPSD------ 309 Query: 217 RSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 R ++ FSS + F L G Q + IN L T L+ A I D K Sbjct: 310 RLSVIAFSSSARRLFHLQRISHHGRQQALQAINSLGASGGTNIADALKKAMKVIEDRSYK 369 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES----------LFYCNEAKRRGAIVYAI 322 + II L+DG+++ + + N+A+ ++A Sbjct: 370 NSVCS---------IILLSDGQDTYNISSSVQGASPDYKSLVPSSIINDARHT-VPLHAF 419 Query: 323 GVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 G A+ L + AS F +++ + DAF + IG + Sbjct: 420 GFGADHDSDSLHSIAQASGGTFSFIEDEGVMQDAFAQCIGGLLS 463 >gi|226504618|ref|NP_001148048.1| protein binding protein [Zea mays] gi|195615516|gb|ACG29588.1| protein binding protein [Zea mays] Length = 696 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 53/279 (18%), Positives = 98/279 (35%), Gaps = 42/279 (15%) Query: 103 QDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS 162 + NI S ++ I + S+ + + I P+ + ++ Sbjct: 192 MEPANIGSSRTVEIKTYSEFSAIQQSSQDDFAV-LIHLKAPYANPEQVIGRPVNATSVGY 250 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + +D++ VLDVS SM KL + R++ ++ + R ++ Sbjct: 251 PTARAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQHLGPSD------RLSVIA 298 Query: 223 FSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FSS + F L G Q + +N L+ T L+ A I D + Sbjct: 299 FSSTARRLFHLQRMSHSGRQQALQTVNSLVASGGTNIADALKKAAKVIED---------R 349 Query: 279 GHDDYKKYIIFLTDGENSS----------PNIDNKESLFYCNEAKRRG-AIVYAIGVQAE 327 H + II L+DG+++ P + N + G V+ G + Sbjct: 350 SHQNPVCSIILLSDGQDTYNIPSNIRGARPEYSSLVPSSILN--RTFGLVPVHGFGFGVD 407 Query: 328 AADQFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 L + AS F +++ + DAF + IG + Sbjct: 408 HDSDALHSIAEASGGTFSFIEDEGVIQDAFAQCIGGLLS 446 >gi|161086980|ref|NP_631887.2| chloride channel calcium activated 4 [Mus musculus] Length = 1044 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 44/193 (22%), Positives = 70/193 (36%), Gaps = 33/193 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G + +L + ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMR--LGSPITRLTLMNQAAELYLIQIIEKESL-----VGLVTFDSTATIQ 361 Query: 231 FPLAWGVQHIQEKIN-----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + T GL+ + I + + Sbjct: 362 TNLIRIIND-SSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE-------- 412 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C E K GAI++ I + AA + L + RF Sbjct: 413 -IVLLTDGEDNR--------ISSCFQEVKHSGAIIHTIALGPSAARELETLSDMTGGLRF 463 Query: 343 YSVQNSRKLHDAF 355 Y+ ++ L DAF Sbjct: 464 YAKEDVNGLIDAF 476 >gi|311747444|ref|ZP_07721229.1| putative BatB protein [Algoriphagus sp. PR1] gi|126574803|gb|EAZ79174.1| putative BatB protein [Algoriphagus sp. PR1] Length = 321 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 48/203 (23%), Positives = 80/203 (39%), Gaps = 28/203 (13%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 +M F + P + TS +I + D+ + +D+S SMN G +L Sbjct: 46 KMVLRTTYFVLFLVAFAGPSIGTSVKEIKEEGK---DIFLAVDLSQSMNAT-DIGPSRLQ 101 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-- 251 E+ ++ KS P R GL+ FSS+ PL + +Q I+ L G Sbjct: 102 RIK---FELKELTKSFPS----DRIGLIIFSSEAFMQCPLTFDQSVLQLYIDGLNTGLVP 154 Query: 252 --TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T L A ++ + + + K +I ++DGEN ++N S Sbjct: 155 NFGTDLNAPLRIALDRFQNDE--------SQEVKSKSVILISDGENFGDELENIGS---- 202 Query: 310 NEAKRRGAIVYAIGVQAEAADQF 332 E K G V+A+G+ E+ Sbjct: 203 -ELKNLGVKVFALGIGTESGSTI 224 >gi|77552603|gb|ABA95400.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 574 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 90/238 (37%), Gaps = 42/238 (17%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSM-------NDHFGPGMDKLGVATRSIREMLDI 205 + + + ++ + +D++ VLDVS SM H +D L +A + I ++ Sbjct: 88 VTVEAPKVVAPEKRAPIDLVAVLDVSGSMNKEEFVRGKHMSSRLDLLKIAMKYIIKL--- 144 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG----VQHIQEKINRLIFGSTTKSTPGLEY 261 V + R +V+F+ +V + L + ++ +++L T P L+ Sbjct: 145 ------VRDADRLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASGNTDFRPALKK 198 Query: 262 AYNKIFDAKEKLEHI--------AKGHDDYKK---YIIFLTDGEN----SSPNIDNKESL 306 A + K +G ++ KK +I+ L+DG + S N + Sbjct: 199 AVEDMNIQNIKNSSAYNNFQILDGRGKEEKKKRVGFILLLSDGVDQFQYSRINWEKVAKS 258 Query: 307 FYCNE----AKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF-YSVQNSRKLHDAFLR 357 + A R V+ G A L+ + S + + +N + +AF R Sbjct: 259 TDVDHSEVGAMLRKYAVHTFGFSASHDPVPLRQISALSYGLYSFVCKNLDNITEAFAR 316 >gi|322378392|ref|ZP_08052846.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] gi|322380073|ref|ZP_08054329.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321147480|gb|EFX42124.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321149148|gb|EFX43594.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] Length = 236 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 82/206 (39%), Gaps = 12/206 (5%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKI 227 + + ++LD S SM+ + G ++G ++ M+D++K NV + ++TF + + Sbjct: 15 IPIFLLLDTSSSMSTNMNGGQTRIGCLNDCVQTMIDLLKEEAKRENVSKLAVITFGAGGV 74 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL+ + + + L G T LE + I + Y Y+ Sbjct: 75 KLQTPLS---KIESIQFSPLGTGGNTPLGMALELTRDYIQN------KDTFPGKFYTPYV 125 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNCASPDRFYSV 345 + ++DGE + + E + ++ Y++ + E SP++ Y Sbjct: 126 VMVSDGEPNDDWQGPLHDFIHNKENRSSKSVRYSVFIGNEGEEPQAVHDFSGSPNQVYYA 185 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + + L + F I + + R + K Sbjct: 186 NDVQSLINCFKAITASVTQGRKITAK 211 >gi|262196446|ref|YP_003267655.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079793|gb|ACY15762.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 903 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 43/200 (21%), Positives = 71/200 (35%), Gaps = 33/200 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + + +V+D S SM+ K+ A S R +++ + +V F ++ Sbjct: 458 VAIALVVDRSGSMSGL------KIEAAKESARATAEVLSPSDLIT------VVAFDNQPT 505 Query: 229 QTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L A I I RL G T P L AY + A K++H Sbjct: 506 TIVRLQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGANAKVKH----------- 554 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYS 344 +I L+DG+ I + C E + V A+G+ +A L R Y Sbjct: 555 VIVLSDGQAPYDGIAD-----LCQEMRSARITVSAVGIG-DADRNLLNLITDNGDGRLYM 608 Query: 345 VQNSRKLHDAFLRIGKEMVK 364 + L F++ E + Sbjct: 609 TDDLAALPRIFMKETTEAQR 628 >gi|33596464|ref|NP_884107.1| hypothetical protein BPP1839 [Bordetella parapertussis 12822] gi|33566233|emb|CAE37141.1| putative exported protein [Bordetella parapertussis] Length = 571 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 46/280 (16%), Positives = 104/280 (37%), Gaps = 38/280 (13%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT----FPWCANSSHAPLLITSSV 159 + + + ++ + DY A + +PF T PW + I Sbjct: 141 NDGRLPPADAVRAEAFINYFDYGYPAPATPAVPFSLTTEIAPAPWNPQRQLLLVGIQG-Y 199 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +++ + +++++++D S SM D KL + ++R+++ +++ R Sbjct: 200 RVAPQDIPAVNLVLLIDTSGSMADRA-----KLPLLKSALRQLVTQMRAQD------RVA 248 Query: 220 LVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +V ++ P G I I+ L +T GLE AY E + + Sbjct: 249 IVAYAGSAGLVLPSTPGDRHAQILAAIDGLQASGSTNGGAGLELAY------AEAAKGLV 302 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK--- 334 K + I+ +DG+ + D + Y ++RG + +G+ + + + Sbjct: 303 KDGVNR---IVLASDGDFNVGRTDLAQLKDYVGSQRKRGIALTTLGLGSGNYNDAMAMQL 359 Query: 335 NCASPDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 A ++ + + + +L L I K++ Q Sbjct: 360 ANAGDGSYHYIDSLLQARKVFASELSATLLTIAKDVKVQV 399 >gi|33602243|ref|NP_889803.1| hypothetical protein BB3267 [Bordetella bronchiseptica RB50] gi|33576682|emb|CAE33759.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 571 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 47/280 (16%), Positives = 105/280 (37%), Gaps = 38/280 (13%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT----FPWCANSSHAPLLITSSV 159 + + + ++ + DY A + +PF T PW A + I Sbjct: 141 NDGRLPPADAVRAEAFINYFDYGYPAPATPAVPFSLTTEIAPAPWNAQRQLLLVGIQG-Y 199 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +++ + +++++++D S SM D KL + ++R+++ +++ R Sbjct: 200 RVAPQDIPAVNLVLLIDTSGSMADRA-----KLPLLKSALRQLVTQMRAQD------RVA 248 Query: 220 LVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +V ++ P G I I+ L +T GLE AY E + + Sbjct: 249 IVAYAGSAGLVLPSTPGDRHAQILAAIDGLQASGSTNGGAGLELAY------AEAAKGLV 302 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK--- 334 K + I+ +DG+ + D + Y ++RG + +G+ + + + Sbjct: 303 KDGVNR---IVLASDGDFNVGRTDLAQLKDYVGSQRKRGIALTTLGLGSGNYNDAMAMQL 359 Query: 335 NCASPDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 A ++ + + + +L L I K++ Q Sbjct: 360 ANAGDGSYHYIDSLLQARKVFASELSATLLTIAKDVKVQV 399 >gi|194226345|ref|XP_001488401.2| PREDICTED: similar to Collagen alpha-1(VI) chain [Equus caballus] Length = 1027 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 84/218 (38%), Gaps = 19/218 (8%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVN 213 T++V+ + D +D+ VLD S S+ P + + +D ++ Sbjct: 23 TAAVRTVAFQDCPVDLFFVLDTSESVALRLKPYGALVDKVKAFTKRFIDNLRDRYYRCDR 82 Query: 214 NVV-RSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFD 268 N+V +G + +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 83 NLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLV 142 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAE 327 H KY+I +TDG + L NEAK G V+++ + + Sbjct: 143 GG--------SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPD 194 Query: 328 AADQFLKNCASPDRF---YSVQNSRKLHDAFLRIGKEM 362 + L A+ + ++ + + DA I + + Sbjct: 195 HLEPRLSIIATDHTYRRNFTAADWGQSRDAEEIISQTI 232 >gi|146337717|ref|YP_001202765.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] gi|146190523|emb|CAL74522.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] Length = 418 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 56/433 (12%), Positives = 127/433 (29%), Gaps = 95/433 (21%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F ++ +G+I++L AI + +G I+ S ++ KL +D ++L Sbjct: 8 LRKFGHDQRGNIAVLFAIACVPVLAFVGAGIDYSMANKLRTKLQMAIDEAVLAGVAAGKA 67 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + + + + I+ + +++ + Sbjct: 68 ALDSGATQAAAIAMAQAASSSYFTGNTAK-------------IDATPTINFTTMGRTLSG 114 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM---- 181 SA S + + + +S S+ L++ +++D+S SM Sbjct: 115 TGSATS-------VMNTSFMRLVGFPTMTLNASSASSATMQPYLNVYLLVDISSSMLLPA 167 Query: 182 ------------------------NDHFGPGMD-----KLGVATRSIREMLDIIKSIPDV 212 D + + + V + ++ +L + S Sbjct: 168 TQAGITQMRNGTGCALACHETTNGTDSYSYALKNNVLLRYQVVNQGVQNLLTYLNSSAVY 227 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD---A 269 N V+ GL +F +++ Q L + + ++ + + Sbjct: 228 KNYVKVGLWSFDNQLTQLSSLTSSFSSVAAN---FPAPGLAYNDAAAATPFDSLIGSFVS 284 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF----------YCNEAKRRGAIV 319 +K +I TDG N ++ +CN K G V Sbjct: 285 SVGTAGDGSTSATPQKLVIIATDGVNDPTRAWTSQTSLRSQVRVFNTAFCNTFKSNGVTV 344 Query: 320 YAI-------------------------GVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 I + + LK+CA F + + +A Sbjct: 345 AIINTPYYPMTWDWGYNATLGQPGSLGGATRVDDIPIALKSCAGS-NFIIASDVATIQNA 403 Query: 355 FLRIGKEMVKQRI 367 F + + R+ Sbjct: 404 FTTLFNKASPVRL 416 >gi|261876471|dbj|BAI47561.1| collagen type VI alpha 1 subunit [Mesocricetus auratus] Length = 1026 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 80/214 (37%), Gaps = 19/214 (8%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV- 216 K + D +D+ VLD S S+ P + + +D ++ N+V Sbjct: 26 KPIAYQDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVW 85 Query: 217 RSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEK 272 +G + +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 86 NAGALHYSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG-- 143 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQ 331 H KY+I +TDG + L NEAK G V+++ + + + Sbjct: 144 ------SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEP 197 Query: 332 FLKNCASPDRF---YSVQNSRKLHDAFLRIGKEM 362 L A+ + ++ + + DA I + + Sbjct: 198 RLSIIATDHTYRRNFTAADWGQSRDAEETISQTI 231 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 77/225 (34%), Gaps = 34/225 (15%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP--DV 212 T + SS D+ ++LD S S+ H + + + S D Sbjct: 816 YTCPITFSS----PTDITILLDGSASVGSH------NFETTKVFAKRLAERFLSADRTDP 865 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY--NKIFDAK 270 + VR +V +S Q G +Q N + S+ S + A N Sbjct: 866 SQDVRVAVVQYSGLGQQQP----GRTALQFLQNYTVLASSVDSMDFINDATDVNDALSYV 921 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + K+ ++F +DG + + E EA+R G ++ + V + + Sbjct: 922 TRFYRENSLGATKKRVLLF-SDGNSQGATAEAIE--KAVQEAQRAGIEIFVVVVGPQVNE 978 Query: 331 QFLK----------NCASPDRF-YSVQNSRKLHDA--FLRIGKEM 362 ++ + A +R + V N + L + + +++ Sbjct: 979 PHIRVLVTGKTAEYDVAFGERHLFRVPNYQALLRGVFYQTVSRKV 1023 >gi|301767086|ref|XP_002918971.1| PREDICTED: collagen alpha-1(XXI) chain-like [Ailuropoda melanoleuca] Length = 957 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 46/237 (19%), Positives = 92/237 (38%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L ++ S ++ D++ +LD S S+ + K Sbjct: 5 ISFLWIVLVLLLQNSVLAEDGEIRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTT 253 +++I K+ ++ G+V +S V PL G + + G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSGENLVAAMESIHYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 RTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDEVKD------AAEAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 DSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|311260227|ref|XP_003128387.1| PREDICTED: collagen alpha-1(XXI) chain-like [Sus scrofa] Length = 957 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 93/238 (39%), Gaps = 36/238 (15%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 F ++ L ++ S ++ D++ +LD S S+ + K Sbjct: 4 QITFLRMVLVLLLQNSVLAEDGEIRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW--- 59 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV--QHIQEKINRLI-FGST 252 +++I K+ ++ G+V +S V PL + + + + G Sbjct: 60 ------LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGRHESGEKLLAAVGSIHYLGGN 113 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T++ +++A + +F AK K + LTDG++ D A Sbjct: 114 TRTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDEVKD------AAEAA 158 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + ++AIGV +E D L+ A S + V++ + A +I +E++KQ++ Sbjct: 159 RDSKITLFAIGVGSETEDAELRAIANKPSSTYVFHVED----YIAISKI-REVMKQKL 211 >gi|293339632|gb|ADE44108.1| collagen type XXI alpha 1 [Sus scrofa] Length = 895 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 86/212 (40%), Gaps = 35/212 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + K +++I K+ ++ G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGPENFEIVKKW---------LVNITKNFDIGPKFIQVGVV 79 Query: 222 TFSSKIVQTFPLAWGV--QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S V PL + + + + G T++ +++A + +F AK Sbjct: 80 QYSDYPVLEIPLGRHESGEKLLAAVGSIHYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 K + LTDG++ D A+ ++AIGV +E D L+ A Sbjct: 131 SSRFLTKIAVVLTDGKSQDEVKD------AAEAARDSKITLFAIGVGSETEDAELRAIAN 184 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 S + V++ + A +I +E++KQ++ Sbjct: 185 KPSSTYVFHVED----YIAISKI-REVMKQKL 211 >gi|126310411|ref|XP_001373784.1| PREDICTED: similar to collagen type XII alpha 1 [Monodelphis domestica] Length = 3116 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 56/265 (21%), Positives = 103/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++TSL++ +Y +S Y M + + P P+ + Sbjct: 377 QTTSLNVRDLSADTEYQISV---YSMKGLTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 431 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 432 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 482 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF A + + K Sbjct: 483 EFTLKKFTRVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVASKGSR------SNVPKV 536 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 537 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPPETHVF 589 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 590 TVED----FDAFQRISFELTQSICL 610 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 75/199 (37%), Gaps = 28/199 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 132 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDVGEGKTRVGVVQYSSDTRT 182 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + I ++ + G T + ++Y F A + K Sbjct: 183 EFNLNQYYQRKDLLAAIKKIPYKGGNTMTGDAIDYLIKNTFTES------AGARVGFPKV 236 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 I +TDG++ E + G V+++G++A A + L+ ASP + Sbjct: 237 AIIITDGKSQDEVEIPAR------ELRNIGVEVFSLGIKAADAKE-LRQIASPPSLKHVF 289 Query: 344 SVQNSRKLHDAFLRIGKEM 362 +V N + D I ++ Sbjct: 290 NVANFDAIVDIQNEIISQV 308 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 68/191 (35%), Gaps = 24/191 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ + P V+ L +S Sbjct: 1186 TRAEADIVLLVDGSWSIGRA------NFKTIRSFISRIVEVFEIGPKR---VQIALAQYS 1236 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I + + + Sbjct: 1237 GDPRTEWQLNAHKDRRSLLDSVANLPYKGG-NTLTGM--ALNFIRQNNFRPQAGMRP--R 1291 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ + PD Sbjct: 1292 ARKIGVLITDGKSQDDVEAPSK------KLKDDGVELFAIGIKNADESELKMIATDPDDT 1345 Query: 341 RFYSVQNSRKL 351 Y+V + L Sbjct: 1346 HAYNVADFDSL 1356 >gi|149721558|ref|XP_001490961.1| PREDICTED: matrilin 2 isoform 1 [Equus caballus] Length = 956 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADVVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHEDHVFLVANFSQMESLTSVFQN 234 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 79/208 (37%), Gaps = 33/208 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEIVKQFVTGIIDSLAISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVRTEFTLRNFGSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQVEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 756 SARVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASE 808 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 809 PIDKHLFYAED----FSTMGEINEKLKK 832 >gi|149721562|ref|XP_001490991.1| PREDICTED: matrilin 2 isoform 2 [Equus caballus] Length = 915 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADVVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHEDHVFLVANFSQMESLTSVFQN 234 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 79/208 (37%), Gaps = 33/208 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + + ++D + P R GL+ +S Sbjct: 609 TEGPIDLVFVIDGSKSLGEE------NFEIVKQFVTGIIDSLAISP---KAARVGLLQYS 659 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 660 TQVRTEFTLRNFGSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQVEGARPL--- 714 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 715 SARVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASE 767 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 768 PIDKHLFYAED----FSTMGEINEKLKK 791 >gi|116624819|ref|YP_826975.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227981|gb|ABJ86690.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 837 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 76/197 (38%), Gaps = 32/197 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 +S G +++++D S SM K+ +A + +++ ++ I V G+ Sbjct: 385 APPRSPEGTAVVLIIDKSSSMEG------RKIELARLAAIGVVENLRPIDSV------GV 432 Query: 221 VTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F + P+ A I++ I+ + T+ P L AY +I Sbjct: 433 LIFDNSFQWAVPIRKAEDRATIKKLISGITPDGGTQIAPALTEAYQRIL----------- 481 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K+I+ LTDG + +S+ EA+ + +G+ + FL+ AS Sbjct: 482 PQTAMYKHIVLLTDGISEEG-----DSMTLTKEAQANHVTISTVGLGQDVNRAFLEKVAS 536 Query: 339 --PDRFYSVQNSRKLHD 353 + Y + + L Sbjct: 537 NADGKAYFLNDPSGLEQ 553 >gi|118355467|ref|XP_001010993.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292760|gb|EAR90748.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2033 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 42/266 (15%), Positives = 100/266 (37%), Gaps = 33/266 (12%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVS---RYEMPFIF-CTFPWCANSSHAPLLITS 157 I + +S D K +L + ++ T P + Sbjct: 1532 EDSIQEVVKSKQNKKNSYDLEKGLSLDVKTLQKHFQFSNSKDQTIPIMISVKTLEQTSDM 1591 Query: 158 SVKISS-KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++ + + LD++ V+D S SM+ K+ +I +++D++ N+ Sbjct: 1592 EIESNLLEGRPNLDLICVIDNSGSMSGQ------KIENVKNTILQLIDML------NDND 1639 Query: 217 RSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 R ++TF+S Q L +++Q+ + T T GL+ A++ + K++ Sbjct: 1640 RLSIITFNSHAQQLCGLRKVNKDNKENLQKITKSIYANGGTNITSGLQTAFSILQSRKQR 1699 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQ 331 + I L+DG++++ + + L + + +++ G + Sbjct: 1700 NSVSS---------IFLLSDGQDNNSDSRIRNLLQTTYQQLQEECFTIHSFGFGNDHDGP 1750 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAF 355 ++ A FY V+ + ++ + F Sbjct: 1751 LMQRIAQIKDGSFYYVERNDQVDEFF 1776 >gi|300716700|ref|YP_003741503.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] gi|299062536|emb|CAX59653.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] Length = 325 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 72/210 (34%), Gaps = 33/210 (15%) Query: 171 MMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 M+++LDVS SM + G+ +L RS+R + R GLV F+S Sbjct: 98 MVLILDVSGSMEKNDAQDGITRLQAVQRSVRAFV-------AQRKTDRIGLVIFASSAWP 150 Query: 230 TFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P++ Q + +IN+L G T L A + + ++ + Sbjct: 151 FAPISEDKQALLARINQLAPGMIGQQTAIGDALGVAVKLLDSSLDRDA---------SRL 201 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA-- 337 I LTDG +++ + A V+ I + LK A Sbjct: 202 AILLTDGNDTASQLSPA---LAAQLAASHHVQVHTIAFGDINSSGEDKVDTALLKQIAQL 258 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + + + L + +I Q Sbjct: 259 TGGEALQASTSGKALDSVWQQIDAMTPSQV 288 >gi|218442094|ref|YP_002380423.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218174822|gb|ACK73555.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 412 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 67/210 (31%), Gaps = 28/210 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + L++ +VLD S SM L ++ I+ + +N R Sbjct: 31 ATGDQDKTLPLNLCLVLDHSGSMAGK------PLETVKQA------AIELVKQLNVEDRL 78 Query: 219 GLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ F + P + I E+IN L T GL+ + + K+ Sbjct: 79 SIIAFDHRAKVLVPNQGIDNLNTIIEQINSLKPAGGTAIDEGLKLGIQESANGKKDRVSQ 138 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 I LTDGEN DN+ L + A + +G L+ Sbjct: 139 ----------IFLLTDGENEHG--DNERCLKLAHVASDYNITLNTLGFGNHWNQDVLEKI 186 Query: 337 --ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ ++ K + F R+ Sbjct: 187 SDSAGGTLCYIETPDKAIEEFSRLFNRAQS 216 >gi|241767791|ref|ZP_04765389.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] gi|241361168|gb|EER57806.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] Length = 277 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 40/258 (15%), Positives = 77/258 (29%), Gaps = 60/258 (23%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ PL + + + S+ +M+ +DVS SM D++ A + + Sbjct: 1 AAARPLAV---ITLPSEQQT---IMLAMDVSGSMRATDVQP-DRITAAQNAAKAF----- 48 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 I ++ VR G+V F+ + + + I+ T + G+ A +F Sbjct: 49 -IAELPRHVRVGIVAFAGSAQLAQLPTQSHEDLAKAIDSFQLQRGTATGNGIMLALATLF 107 Query: 268 DAKE-----------------KLEHIAKGHDDYKKY---------IIFLTDGENSSPNID 301 + II LTDG+ ++ Sbjct: 108 PDAGIDIAALGGRQAMHPRPLDEVTRQDPAKPFTPVAPGSYTSAAIIMLTDGQRTTGV-- 165 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQF---------------LKNCASPDR--FYS 344 + L A RG VY +GV + LK A ++ Sbjct: 166 --DPLEAAQWAADRGVRVYTVGVGTVQGETIGFEGWSMRVRLDEETLKAVAGRTHAEYFH 223 Query: 345 VQNSRKLHDAFLRIGKEM 362 + L + + + Sbjct: 224 AATAADLKKVYETLSSRL 241 >gi|332256727|ref|XP_003277467.1| PREDICTED: collagen alpha-1(VI) chain, partial [Nomascus leucogenys] Length = 1104 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 78/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 92 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 93 YSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG-------- 144 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 145 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIA 204 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 205 TDHTYRRNFTAADWGQSRDAEEVISQTI 232 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 54/162 (33%), Gaps = 21/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H + R L ++ P + VR +V +S Q Sbjct: 922 DITILLDGSASVGSHNFDTTRRFT--KRLAERFLTAGRTDPAHD--VRVAVVQYSGTGQQ 977 Query: 230 TFP---LAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + I+ + F T L Y DA Sbjct: 978 RPERASLQFLQNYTALASAIDAMDFINDATDVNDALGYVTRFYRDASSGAA--------- 1028 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG +S EA+R G ++ + V Sbjct: 1029 KKRLLLFSDG--NSQGATPAAIEKAVQEAQRAGIEIFVVVVG 1068 >gi|197105075|ref|YP_002130452.1| hypothetical protein PHZ_c1612 [Phenylobacterium zucineum HLK1] gi|196478495|gb|ACG78023.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 521 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 62/178 (34%), Gaps = 43/178 (24%) Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L +++ ++ L+ +T GL + ++ + + + + +KK ++ +TD Sbjct: 342 LTTDFDGLRDAVDDLVADGSTNIPMGLVWGWHTLAPMAPFPDGVPYLTEKHKKIVVLMTD 401 Query: 293 GENSSPNIDNKES---------------------------------------LFYCNEAK 313 GEN+ D L C K Sbjct: 402 GENTILYKDTPNGSDYSGVGHARQGRVLDPAGRPITESSSQRERTAALDDRLLKLCANMK 461 Query: 314 R--RGAIVYAIGVQAEAADQF-LKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + +YAI V+ + L+ CAS D +Y VQN+ + AF I ++ + Sbjct: 462 APAKDIEIYAIRVEVSSGSSSVLQTCASSADHYYDVQNAADMTMAFQSIAGQIAALHL 519 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 44/253 (17%), Positives = 84/253 (33%), Gaps = 39/253 (15%) Query: 6 IRNFFYNCKGSISILTA-ILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 +R + G+++I A ++P+ +G I+ + K +L LD + L A Sbjct: 13 LRRL-ADDGGNVAITVAFAMVPLAIGTLG-AIDLARGASAKVQLQDALDAAALGAARSSA 70 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 N + Q DF F D + R+ + ++ Sbjct: 71 NSPD----TLQAAGERLLRQNLALGGDFELVSSSFTFGPDNKVLARA---QVRVEP---- 119 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 A ++ + + V + + L++ +VLD + SM + Sbjct: 120 ----------------YVAGLAGVNNMDIAAATEVVR---AGMQLEIALVLDNTGSM--N 158 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP----LAWGVQHI 240 + L A ++ ++ V N ++ LV FS + AW Q+ Sbjct: 159 QNNKLYHLKTAAKAFVTAMETAAEGNTVPNSIKISLVPFSHTVRVDSDAYRNAAWIDQNG 218 Query: 241 QEKINRLIFGSTT 253 IN IF + T Sbjct: 219 SSPINNEIFPTAT 231 >gi|73990553|ref|XP_853265.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1798 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 41/195 (21%), Positives = 78/195 (40%), Gaps = 19/195 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ M L + ++K ++ VR G + +S Sbjct: 802 LDIVFVLDHSGSIGTQEQESMMNLT---------IHLVKKADVDSDRVRVGALKYSDYPE 852 Query: 229 QTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L+ + E + R + S T + LE+A E+ ++ + K+ + Sbjct: 853 VLFYLSGNKSAVIEHLRRRRYTSGHTYTARALEHANIMFT-----EEYGSRIQQNVKQML 907 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I +TDG + D ++ + +G +YA+GV + + + V N Sbjct: 908 IIITDGV----SHDRDNLSDTASKLRNKGINIYAVGVGQANQLELETMAGNKSNTFHVDN 963 Query: 348 SRKLHDAFLRIGKEM 362 L D +L + ++M Sbjct: 964 FSNLKDIYLPLQEKM 978 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 72/193 (37%), Gaps = 25/193 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ D G ++ +L I+ PD G+V FS Sbjct: 612 EDMKADIMFLVDSSGSIGH------DNFGKMKTFMKNLLAKIQIGPDSTQ---IGVVQFS 662 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + + I+R+ T + L + K Sbjct: 663 DINQEEFQLNKYFTQNETSDAIDRMSLINRGTLTGSALTFVGQYFTPTKGAR-------T 715 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDGE P D ++L + +G +++++GV Q + Sbjct: 716 KVKKFLILITDGEAQDPVRDPAKAL------RDKGVVIFSVGVYGANRTQLEEISGDSSL 769 Query: 342 FYSVQNSRKLHDA 354 + V+N L Sbjct: 770 VFQVENFDDLKTV 782 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 74/200 (37%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+N + + ML + ++ V++G V +S KI Sbjct: 431 DIYFLIDGSTSINTEGFEQIKQF---------MLAVTGMFSIGSDKVQAGAVQYSDKIRV 481 Query: 230 TFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + + + +N T + L++ + I ++ + + Sbjct: 482 EFYINASSNDMDLRKAILNIEQLQGNTHTGKALDFMLSIIKKDRKHRI------SEIPCH 535 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG++ E L + ++A+G+ Q L+ A + Sbjct: 536 LIVLTDGKSQD------EVLKPAERLRDEQITIHAVGIGEADKIQ-LQQIAGEE---ERV 585 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N + D+ I E+V + Sbjct: 586 NFGQNFDSLRNIKNEVVHRI 605 >gi|295092462|emb|CBK78569.1| von Willebrand factor type A domain. [Clostridium cf. saccharolyticum K10] Length = 2061 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 57/225 (25%), Positives = 89/225 (39%), Gaps = 20/225 (8%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM-DKLGVATRSIREMLDIIKSIPDVNN 214 SV K +M V+D S SM+ FG G D S E+ + D + Sbjct: 1091 VGSVTTGQKDPTPTAVMFVIDKSGSMDQSFGSGNSDARREVVNSALELF--FNQLSDGDY 1148 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTK-STPGLEYAYNKIFDAK 270 ++ G FS + WG Q + N L T T G Y + A Sbjct: 1149 NIQFGGYKFSDSGERVNFNDWGWQDKYWETDTSNALSHLKLTSWETDGSTYPSQTLRSAI 1208 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNID-NKESLFYCNEAKRR---GAIVYAIGVQA 326 LE++ G ++ K+Y+IFLTDGE + +KE C A + G YAI V Sbjct: 1209 SALENVELG-ENGKRYLIFLTDGEPGQNSYSFSKEEAENCYSAIKNLDSGTTFYAIQVAN 1267 Query: 327 EAADQFLKNCASPDRF--------YSVQNSRKLHDAFLRIGKEMV 363 + F+++ S F + ++ +L+ AF ++ E+ Sbjct: 1268 SDSHGFMESMVSNANFVDGVTAQKFVGNSADELNAAFSQMAAEIS 1312 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 54/330 (16%), Positives = 111/330 (33%), Gaps = 35/330 (10%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 N+E G + K +TD + ++ + + ++ Sbjct: 1409 RNEEGGYILADGEEPSEENQYKGDEKTDDPSVPEDSQTSSGKPGFPANKKATLQYTYDGG 1468 Query: 124 DYNLSA------VSRYEMP--FIFCTFPWCANSSHAPLLITSSVKISSK--SDIGLDMMM 173 + +P + P + L +T + + +D++ Sbjct: 1469 TGRFEYPHPVLQIPEPVLPDEYNKRIEPNDDGTYSLTLDVTGIEGNPATVTTKYPVDLVF 1528 Query: 174 VLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDII-KSIPDVNNVVRSGLVTFSSKI 227 V+D SLSM+ D D+ + + LD I + ++ FS Sbjct: 1529 VIDKSLSMDYDIDGDEIKWWEDETESRKDIVNDALDEIIPDLCSQQYDIQIAGYQFSGSS 1588 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + + Q + + T+ + L A + + + + + KKY+ Sbjct: 1589 TRVLDWSREEQQVLNNLKISNTSYNTEPSQALADALDMLKTGSQAHQ----NQSNVKKYL 1644 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCASPDR---- 341 IF+TDGE + E L Y +K GA +Y IGV ++A+ ++ S Sbjct: 1645 IFMTDGEPTE-----SEELSYYAISKNPVPGASIYTIGVSSDASTDLMEGIRSTAEGNGM 1699 Query: 342 ----FYSVQNSRKLHDAFLRIGKEMVKQRI 367 + +++ + DAF +I E++ Sbjct: 1700 TAPATFKGTSAQLIKDAFTQIKDEIISTST 1729 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/170 (21%), Positives = 63/170 (37%), Gaps = 37/170 (21%) Query: 161 ISSKSDIGLDMMMVLDVSLSMND--HFGPGMDKLGVATRSIREMLDI------------- 205 I+ GLD+++V+D+S SM+D +L V ++ + Sbjct: 633 ITGGDQQGLDIVLVIDLSNSMDDGISEDSSDSRLKVLKDTLGYYKESYNSRPGKPTIDEK 692 Query: 206 ---IKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--------GVQHIQEKINRLIFGSTTK 254 I + + + R +VT+S+ + L W G Q I+E I L T Sbjct: 693 SGFIDDLFEQSPNSRFSIVTYST--YASTELDWTEYGMNGSGQQTIKEAIGELQANGGTN 750 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 GL A + +G+ +IFL+DG+ + D E Sbjct: 751 YEAGLYQAMEVL---------KERGNSSNIPVVIFLSDGKPTYYYSDVDE 791 >gi|115447343|ref|NP_001047451.1| Os02g0619600 [Oryza sativa Japonica Group] gi|47847560|dbj|BAD21612.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|47847788|dbj|BAD21564.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113536982|dbj|BAF09365.1| Os02g0619600 [Oryza sativa Japonica Group] gi|215701433|dbj|BAG92857.1| unnamed protein product [Oryza sativa Japonica Group] gi|222623257|gb|EEE57389.1| hypothetical protein OsJ_07557 [Oryza sativa Japonica Group] Length = 709 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 52/279 (18%), Positives = 98/279 (35%), Gaps = 39/279 (13%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP-LLITSSVK 160 + D + S ++ I + + S + + CAN ++ Sbjct: 198 SGDSRDARCSRAVEIKTYPEF-SAIPQSSSEDDFAVLIHLKAPCANPEQVTGRPFNATSI 256 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S +D++ VLDVS SM KL + R++ ++ + R + Sbjct: 257 GYPTSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQHLGPSD------RLSV 304 Query: 221 VTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + FSS + F L G Q + +N L G T L+ A I D Sbjct: 305 IAFSSTARRLFHLRRMSHSGRQQALQAVNLLGAGGGTNIADALKKAAKVIED-------- 356 Query: 277 AKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAI--------GVQAE 327 + + + II L+DG+++ + + + + + + I G A+ Sbjct: 357 -RNYKNPVCSIILLSDGQDTYNISSNVRGTRPDYRSLVPSSILNHTICTVPVHGFGFGAD 415 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 L + A S F +++ + DAF + IG + Sbjct: 416 HDSDALHSIAESSGGTFSFIEDESVIQDAFAQCIGGLLS 454 >gi|162454087|ref|YP_001616454.1| hypothetical protein sce5811 [Sorangium cellulosum 'So ce 56'] gi|161164669|emb|CAN95974.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 907 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 72/194 (37%), Gaps = 25/194 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L + +VLD S SM G +D A +++ + L L TFSS Sbjct: 502 LSVHLVLDTSGSMA---GAPIDSARRAAQALVDRLAPADDFS---------LTTFSSDAE 549 Query: 229 ---QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + P+ I+ I L G T GL Y + +D + Sbjct: 550 VVIEDGPVGPRRAAIRRAIEGLREGGGTNIGAGLSLGYAQ--------ASRPGIPEDAVR 601 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FY 343 ++ ++DG +S ++ + +A +RG A+G+ + Q + AS +Y Sbjct: 602 VVLLVSDGRATSGLTHSERLAWLALDAFQRGIQTSALGLGDDFDGQLMSAIASDGAGGYY 661 Query: 344 SVQNSRKLHDAFLR 357 +++ ++ A Sbjct: 662 YLRHPEQIAPALST 675 >gi|296268803|ref|YP_003651435.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296091590|gb|ADG87542.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 607 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 47/229 (20%), Positives = 83/229 (36%), Gaps = 29/229 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIRE 201 P S AP ++ ++ + ++MV+DVS SM D G G KL +A ++ Sbjct: 391 PAKTLSVPAPRVLDRILRSWDELRKPAHVLMVIDVSGSMGADVPGTGQTKLELAKQAAIN 450 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIV------QTFPL-AWGVQHIQEKINRLIFGSTTK 254 L P + + GL FS+ + P+ ++++I LI G T Sbjct: 451 AL------PQLGPNDQVGLWMFSTNQDGGKDYRELVPMGRNNRDLLKKRIQGLIPGGGTG 504 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 AY + E+ + ++ LTDG+N N + E L Sbjct: 505 LYDTTRAAYRTVL---ERHSNDVINA------VVVLTDGKNEDDNSISLEDLL-AELRTE 554 Query: 315 RG---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 G V+ I +A + L+ + + Y + + F + Sbjct: 555 TGQETVRVFTIAYGNDADLEVLRQISQVTDAAAYDSREPGSIDQVFTAV 603 >gi|91789735|ref|YP_550687.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91698960|gb|ABE45789.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 346 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 38/250 (15%), Positives = 75/250 (30%), Gaps = 56/250 (22%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + + + S +M+ +DVS SM +L A + + L ++ Sbjct: 76 MAVITLPSNQQT---IMLAIDVSGSMRATDVQP-SRLVAAQTAAKAFLT------ELPRT 125 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 V+ GLV F+ + + I+R T G+ + +F Sbjct: 126 VKVGLVAFAGSAQVAQIPTVNREDLVSAIDRFQLQRGTAIGNGIVMSLATLFPDAGIDLQ 185 Query: 276 IAKGHDDYKK-------------------------YIIFLTDGENSSPNIDNKESLFYCN 310 + + ++ II LTDG+ ++ +SL Sbjct: 186 SMQSGRERQRGFAIDQEKKEAKEFTPVAPGSYNSAAIILLTDGQRTTGV----DSLDAAK 241 Query: 311 EAKRRGAIVYAIGVQAEAADQF---------------LKNC--ASPDRFYSVQNSRKLHD 353 A RG VY +G+ + LK A+ ++ + L Sbjct: 242 LAADRGVRVYTVGIGTVDGETIGFEGWSMRVRLDEETLKGIARATQAEYFYAGTATDLKK 301 Query: 354 AFLRIGKEMV 363 + + + Sbjct: 302 VYETLSSRLT 311 >gi|295132199|ref|YP_003582875.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980214|gb|ADF50679.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 347 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 72/209 (34%), Gaps = 29/209 (13%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + + + I + + K + G+D++ +DVS SM+ Sbjct: 50 KSNFKPMLKLILVCLALACLVVALVNPKMGTTMETVKRE-GVDIVFAIDVSKSMDAEDIA 108 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +L + + +R++L + S R G++ ++ P+ + + L Sbjct: 109 P-SRLEKSKQIVRQILGGLGS-------DRVGIIAYAGSAFPQLPITTDYAAAKMFLQAL 160 Query: 248 ----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 I T ++ A D + + + ++DGE+ N+++ Sbjct: 161 NTDMISSQGTAIGDAIDLATTYYDDDNQTN-----------RVLFIISDGEDHEGNVES- 208 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +EA +G +Y IGV E Sbjct: 209 ----IADEAAEKGIRIYTIGVGTEKGGPI 233 >gi|218191186|gb|EEC73613.1| hypothetical protein OsI_08104 [Oryza sativa Indica Group] Length = 709 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 45/228 (19%), Positives = 83/228 (36%), Gaps = 37/228 (16%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 ++ S +D++ VLDVS SM KL + R++ ++ + Sbjct: 248 GRPFNATSIGYPTSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQHLGPSD- 300 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R ++ FSS + F L G Q + +N L G T L+ A I Sbjct: 301 -----RLSVIAFSSTARRLFHLRRMSHSGRQQALQAVNLLGAGGGTNIADALKKAAKVIE 355 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAI---- 322 D + + + II L+DG+++ + + + + + + I Sbjct: 356 D---------RNYKNPVCSIILLSDGQDTYNISSNVRGTRPDYRSLVPSSILNHTICTVP 406 Query: 323 ----GVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 G A+ L + A S F +++ + DAF + IG + Sbjct: 407 VHGFGFGADHDSDALHSIAESSGGTFSFIEDESVIQDAFAQCIGGLLS 454 >gi|73974062|ref|XP_548552.2| PREDICTED: similar to matrilin 2 isoform a precursor [Canis familiaris] Length = 978 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 45/226 (19%), Positives = 83/226 (36%), Gaps = 28/226 (12%) Query: 142 FPWCANSSHAPLLITSSVKI--SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 PW + S + SS + D++ ++D S S+N H + I Sbjct: 63 TPWRSISRGRHAWTHPQTALLESSCENKQADLVFIIDSSRSVNTHDYAKV------KEFI 116 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKST 256 ++L + PD + R GL+ + S I F L ++ + R+ + T + Sbjct: 117 LDILQFLDISPD---LTRVGLLQYGSTIKNEFSLKTFKKKSEVERAVKRMRHLSTGTMTG 173 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 ++YA N F E + ++ + I+ +TDG +A+ G Sbjct: 174 LAIQYALNIAFSEAEGARPLR---ENVLRVIMIVTDGRPQDSVA------EVAAKARDTG 224 Query: 317 AIVYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 +++AIGV + + P D + V N L F + Sbjct: 225 ILIFAIGVGQVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQK 270 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 81/208 (38%), Gaps = 33/208 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ LD++ V+D S S+ + + + + ++D + P R GL+ +S Sbjct: 686 TEGPLDLVFVIDGSKSLGEE------NFEIVKQFVAGIIDSLAVSP---KAARVGLLQYS 736 Query: 225 SKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L A ++ + + G + + L++ + + F+ E ++ G Sbjct: 737 TQVRTEFTLGDFSSARDMKKAVAHMKYM--GKGSMTGLALKHMFERSFNPVEGARPVSPG 794 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + I TDG + + A+ G +YA+GV ++ L+ AS Sbjct: 795 ---VSRVAIVFTDGRAQD------DVSAWARRARAGGITMYAVGVGKAIEEE-LQEIASE 844 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 845 PTDKHLFYAED----FSTMGEISEKLKK 868 >gi|118353830|ref|XP_001010180.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291947|gb|EAR89935.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 544 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 47/289 (16%), Positives = 108/289 (37%), Gaps = 29/289 (10%) Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 Q+N S N ++ G ++ LS+ + HK + S+++ Sbjct: 27 QENQVSEPTKINNDDEPIDQNFKKLGDGKNKQKYNLEKGLSLDVKTLHKHFQFSSLTNQS 86 Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + + + S ++ LD++ V+D S SM+ K+ Sbjct: 87 IPVMVSVKTLDKTEDAPKGDQEAVKQESLENRPNLDLICVIDNSGSMSGE------KIQN 140 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGVQH--IQEKINRLIFG 250 +++ +L+++ R L+ F+S + L +E IN++ Sbjct: 141 VKKTLEYLLELLGDND------RLCLILFNSYSTRLCHLMKTNNSNKPAFKEIINKIQAT 194 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T G+E A+ + + + + I L+DG++ S ++ ++SL Sbjct: 195 GGTDINSGMELAFRVL---------KERKYQNPVSSIFLLSDGQDGSADLRVRQSLE--R 243 Query: 311 EAKRRGAIVYAIGVQAEAADQFL-KNCA-SPDRFYSVQNSRKLHDAFLR 357 + +++ G ++ + K C+ FY V+ ++ + F+ Sbjct: 244 HLPQECFTIHSFGFGSDHDGPLMNKICSLKDGNFYYVEKINQVDEFFVD 292 >gi|77457690|ref|YP_347195.1| VCBS [Pseudomonas fluorescens Pf0-1] gi|77381693|gb|ABA73206.1| putative secreted protein, hemolysin [Pseudomonas fluorescens Pf0-1] Length = 2887 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 48/266 (18%), Positives = 95/266 (35%), Gaps = 14/266 (5%) Query: 76 KNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM 135 +Y ++ + T + +L +G T+ + + + + + Sbjct: 1989 SGAITYTLVGSATGTYGQIQLNADGTYTYTLTSAPKTTPNANDGPNTLSESFTYKATDAL 2048 Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGV 194 + P + S + + +I ++++VLD+S SM D G PG+ +L + Sbjct: 2049 GNSTTSTIVVNIVDDVPKAVASERSV-AAVEIDSNILIVLDISGSMADASGVPGLSRLEL 2107 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A ++I +LD + D V+ LVTFSS + V + + L G T Sbjct: 2108 AKQAISALLDKYDDLGD----VKVQLVTFSSNATDRTSVWVDVATAKTLLAGLSAGGGTN 2163 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + YN + + G+ F +DG+ + +I + Sbjct: 2164 YDAAVATMYNAFNTSGKLTGAQNVGY--------FFSDGKPNEGDIGTADEATLKAFLDA 2215 Query: 315 RGAIVYAIGVQAEAADQFLKNCASPD 340 YAIG+ + ++ L A Sbjct: 2216 NNIKNYAIGLGSGVSNANLDPLAYDG 2241 >gi|327269503|ref|XP_003219533.1| PREDICTED: collagen alpha-1(XXII) chain-like [Anolis carolinensis] Length = 1601 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 73/198 (36%), Gaps = 29/198 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D++ +LD S S+ G + + + ++D + PD R G+V +S Sbjct: 33 KNVHYDLVFILDTSSSV------GKEDFEKVRQWVANLVDTFEIGPDK---TRVGVVRYS 83 Query: 225 SKIVQTFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + F L G +E+I +G T + L Y + E+ Sbjct: 84 DRPTTEFDL--GRYQTREQIKEAAKNIKYYGGNTNTGDALR--YINTYSFSEEAGGRPTD 139 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 KK I LTDG + +D A + G ++A+GV EA + L AS Sbjct: 140 SAI-KKVAILLTDGRSQDHVLDP------ATAAHKAGIRIFAVGVG-EALKEELDEIASE 191 Query: 339 --PDRFYSVQNSRKLHDA 354 + V + + Sbjct: 192 PKSAHVFHVSDYNAIDKI 209 >gi|118353828|ref|XP_001010179.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291946|gb|EAR89934.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 511 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 44/267 (16%), Positives = 100/267 (37%), Gaps = 29/267 (10%) Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 + G ++ LS+ + HK + S+ + +P + Sbjct: 16 KNVGDGKNKQKYNLDKGLSLDVKTLHKHFQFSSSTNQSIPIMVSVKTLDKTEDAPKGNQE 75 Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + + ++ LD++ V+D S SM K+ +++ +L+++ Sbjct: 76 AVKQDILENRPNLDLICVIDNSGSMEGE------KIQNVKKTLEYLLELLGDND------ 123 Query: 217 RSGLVTFSSKIVQTFPL--AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 R L+ F+SK + L +E IN++ T G+E A+ + D + Sbjct: 124 RLCLILFNSKATKLCHLMRTNNSNKPAFKEIINKIEANGGTDINSGMELAFRVLKD---R 180 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H + L+DG++ S ++ ++SL + +++ G ++ Sbjct: 181 KYHNPVSS------VFLLSDGQDGSADLKVRQSLE--RHLPQECFTIHSFGFGSDHDGPL 232 Query: 333 L-KNCA-SPDRFYSVQNSRKLHDAFLR 357 + K C+ FY V+ ++ + F+ Sbjct: 233 MNKICSLKDGNFYYVEKINQVDEFFVD 259 >gi|297286914|ref|XP_001113364.2| PREDICTED: collagen alpha-6(VI) chain-like [Macaca mulatta] Length = 2262 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 75/211 (35%), Gaps = 20/211 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+SV SSK D +D +D+ M+ M+ +++ N Sbjct: 983 VTASVCNSSKVDCEID---KVDLVFLMDGSTSIQQTDFKKMKEF---MVSVVQDFDVSNK 1036 Query: 215 VVRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKE 271 VR G FS FPL G + I +I N T L + Sbjct: 1037 RVRIGAAQFSDAYRPEFPLGTFIGAKEISIQIENITQIFGNTHIGAALRKVEHYFRPDMG 1096 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + ++ LTDG++ E +RRG +Y++G+ Q Sbjct: 1097 SRINTGTP-----QVLLVLTDGQSQD------EVAQAAEALRRRGIDIYSVGIGDVDDQQ 1145 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ + ++ +V N +L RI + + Sbjct: 1146 LMQITGTAEKKLTVHNFDELKKVNKRIVRNI 1176 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 74/199 (37%), Gaps = 21/199 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ V+D S S++ M M+DI++ N VR G + ++ Sbjct: 809 LDVVFVIDSSGSIDYDEYNIMKDF---------MIDIVRKADVGMNRVRFGALKYADDPE 859 Query: 229 QTFPLA-WGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +G + I G T + L ++ + +A+ + + Sbjct: 860 VLFYLDNFGTKPEVISVLQTDQAMGGNTYTAEALGFSDHMFTEARGSRLNKGVP-----Q 914 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG+ + D + + +G +V A+G+ + L S D+++ V Sbjct: 915 VLIVITDGD----SHDADKLNATAKALRDKGILVLAVGIADANPVELLAMAGSSDKYFFV 970 Query: 346 QNSRKLHDAFLRIGKEMVK 364 + L + + Sbjct: 971 ETFGGLKGIISDVTASVCN 989 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 72/186 (38%), Gaps = 25/186 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S+ M ++ ++ + PD V+ G+V FS Sbjct: 618 KEMKADIMFLVDSSGSIGPENFSKM------KTFMKNLVSKSQIGPDR---VQIGVVQFS 668 Query: 225 SKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L I I+++ G TT + L + K Sbjct: 669 DINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGSRP------- 721 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE + +L ++ G I+Y++GV Q + P+ Sbjct: 722 SVRKFLILITDGEAQDIVKEPAVAL------RQEGVIIYSVGVFGSNVTQLEEISGRPEM 775 Query: 342 FYSVQN 347 + V+N Sbjct: 776 VFYVEN 781 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 58/355 (16%), Positives = 106/355 (29%), Gaps = 51/355 (14%) Query: 32 MGLVIETSHKFFVKAKLHYILDHSLLYTATK--------ILNQENGNNGKKQKNDFSYRI 83 + + I S L + A K N N G Q Sbjct: 284 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 343 Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 T LR G I+ ++ + H + + F Sbjct: 344 DSEDNVTKAAVNLRREGVTIFTLGIKGASDTQLEKIASHPAEQYVSKLK---TFADLAAH 400 Query: 144 WCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVSLSMNDHFGPGMDKLG 193 IT +V + S+ L D+ +++D S S Sbjct: 401 NQTFLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFH 454 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----I 248 + E++ + P VR G V ++ F + ++ + + Sbjct: 455 EMKIFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEI--NKYSNKQDLGKAIENIRQ 509 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + L + + + AK++ + H ++ LT+G + L Sbjct: 510 LGGNTNTGAALNFTLSLLQKAKKQRGNKVPCH------LVVLTNG------MSKDSILEP 557 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDAFLRIGKEM 362 N + VYAIG++ EA L+ A + R Y V + L D ++ +E+ Sbjct: 558 ANRLREEHIRVYAIGIK-EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEI 611 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 71/206 (34%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD+S++ +D L + + + ++ N +R GLV +S++ Sbjct: 230 DVVFLLDMSINGSDE------NLDYLKGFLE---ESVSALDIKENCMRIGLVAYSNETKV 280 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ G+ + + I L + T A K+ + ++ + + Sbjct: 281 INSLSMGINKSEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIA 337 Query: 288 IFLT--DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +T D E++ +R G ++ +G++ + Q K + P Sbjct: 338 VLVTHRDSEDNVTKAAV--------NLRREGVTIFTLGIKGASDTQLEKIASHP------ 383 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + K Sbjct: 384 --AEQYVSKLKTFADLAAHNQTFLKK 407 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 79/187 (42%), Gaps = 26/187 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S + P + I +M++ S+P + R L +S K+ Sbjct: 28 DVVFLVDSSDRLGSRSFPFV------KMFITKMIN---SLPIEADKYRVALAQYSDKLHS 78 Query: 230 TFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L+ G + + + G + + L+ A+ F A + + Sbjct: 79 EFHLSTFKGRSPMLNHLKKNFGFIGGSLQIGKALQEAHRTYFSATTN----GRDKKQFPP 134 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ L + ++++ + ++A ++ G + ++GVQ +A+++ LK A+ ++ Sbjct: 135 ILVVL-------ASSESEDDVEKASKALQKDGVKIISVGVQ-KASEENLKAMATSQFHFN 186 Query: 345 VQNSRKL 351 ++ R L Sbjct: 187 LRTVRDL 193 >gi|89889806|ref|ZP_01201317.1| BatB [Flavobacteria bacterium BBFL7] gi|89518079|gb|EAS20735.1| BatB [Flavobacteria bacterium BBFL7] Length = 343 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 60/180 (33%), Gaps = 32/180 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 KI + G+D++ +D+S SM D ++K I L Sbjct: 80 TKIETIKREGVDIVFAVDISKSMLAEDIAPSRLEKSQQLVTQIINNLAS----------D 129 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEK 272 R GL+ ++ V P+ + + L+ T ++ A + + E Sbjct: 130 RIGLIAYAGSAVPQLPITTDYSSAKMFLQSMNTDLVSSQGTAIAEAIQLAESYYSEDTE- 188 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K ++ ++DGE+ E+L Y A G + IGV E Sbjct: 189 ----------ASKVLVIISDGEDHEG-----EALDYAEAAAENGIRIITIGVGTEKGGTI 233 >gi|120616160|gb|ABG80452.1| collagen [Hydra vulgaris] Length = 2439 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 46/203 (22%), Positives = 81/203 (39%), Gaps = 22/203 (10%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 DI D+ LD S S + I+ ++D I+ ++ VR G++T+S Sbjct: 1836 DIPTDIAFALDASASFYEE------GFQQEKDFIKSVIDKIEL---SSSGVRVGVLTYSD 1886 Query: 226 K--IVQTFPLAWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + I F ++ + +++ I+ + + S T+ GLE A KE + G Sbjct: 1887 EAKIRIRFDYSFDKEDVKKAIDNIPYDSMGTRIDLGLEAA-------KELFLEKSGGRGS 1939 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDR 341 KK +I LTDG+ + D K+ + Y E G ++AIG+ E + P Sbjct: 1940 SKKVLILLTDGQQTYI-PDAKDPVDYAKELAEYGVDIFAIGIGDEINKVDLEDLISKPQH 1998 Query: 342 FYSVQNSRKLHDAF-LRIGKEMV 363 + + L I + Sbjct: 1999 IFLSDDINSLITDLSKDISTALS 2021 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 79/204 (38%), Gaps = 23/204 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--SSKI 227 D+ +LD S S+ + + P + ++G++ + ++ I Sbjct: 1254 DIAFILDSSRSVTRDHFNRQKEFVKTI---------LGEFPLGEELTQAGIIKYGRTADI 1304 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKS--TPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F + I+++ +S L+ A+ ++F ++ A+ D +K Sbjct: 1305 EINFGEFLTQTDLFNAIDKIKHSQADESRLDLALKKAHEELFTSQG-----ARSDKDIEK 1359 Query: 286 YIIFLTDGENSSPNIDNKE----SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I+ L DG S +++ + + + G +++ IGV AE L+N AS Sbjct: 1360 AIVILGDGYISGGGNRSRDLIESAKKEAAKLRELGVLIFTIGVGAEPNSLLLQNFASKKT 1419 Query: 342 FY-SVQNSRKLHDAFLRIGKEMVK 364 +Y +V++ +L + + Sbjct: 1420 YYITVKDYGQLIGKIGALKTSISS 1443 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/185 (22%), Positives = 71/185 (38%), Gaps = 27/185 (14%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +LD S S+ D DK+ +SI D+ N + R G++ F S L Sbjct: 1576 LLDSSGSVGDE---NFDKMKEFVKSIVLNFDV------DNQLTRIGIIRFDSDAEIIIQL 1626 Query: 234 AWGVQHIQEKIN-----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + +++ +N R G T+ LE A + D K ++ Sbjct: 1627 S-DHKTLKDLLNDIDSIRYNEGIQTRIDKALERAMEAFSEKNGGRA-------DATKALV 1678 Query: 289 FLTDGENS--SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 L DG+NS + D E L +AK Y IG+ E + L++ A+ + Sbjct: 1679 LLADGQNSFIEGSQDLNEELKPLIDAK---VFRYVIGIGRELDLKELEDIATNNIAIYAD 1735 Query: 347 NSRKL 351 + +L Sbjct: 1736 SFDEL 1740 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 68/196 (34%), Gaps = 21/196 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ + D S S+ + M K ++ S R + + S Sbjct: 3 VDVALAFDSSSSVGELAYEEMKKFAH---------QVVDSFSISQQNARFAALVYGSNAS 53 Query: 229 QTFPL-AWGVQ-HIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F + I++ I L S T+ LE A + +F L+ + Sbjct: 54 VEFNFVRYDSALEIKQAIQSLSYLKSNTRIDKALEVAKSDLFS----LQGKVRSRRPMIL 109 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 Y+ F DG + D + + K G + AIGV E LK + + +S Sbjct: 110 YVFF--DGTVTRSMSDLESVVQP---LKDYGVKIIAIGVGPEVNRYQLKKISEDNAIFSG 164 Query: 346 QNSRKLHDAFLRIGKE 361 ++ ++L I ++ Sbjct: 165 KSFKELAPLLYSIVEQ 180 >gi|311268548|ref|XP_003132103.1| PREDICTED: collagen alpha-5(VI) chain-like [Sus scrofa] Length = 2519 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 48/196 (24%), Positives = 83/196 (42%), Gaps = 21/196 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ M I + ++K N V+ G + +S++ Sbjct: 811 LDVVFVLDHSSSIQPRDQESM---------INLTMHLVKKSDVGPNRVQFGALRYSNEPD 861 Query: 229 QTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L I E + L T + LE A N +F EH ++ + K+ + Sbjct: 862 IIFYLNSNRSAIMEYLRSLSAKGGDTYTAKALERA-NILF----TEEHGSRIKQNVKQML 916 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-YSVQ 346 I +TDG+ + D+ ++ + +G I+YA+GV EA + L+ A + V Sbjct: 917 IIITDGK----SHDHIHLSDKASKLRAKGIIIYAVGVG-EANQEELETMAGNKHYTIHVS 971 Query: 347 NSRKLHDAFLRIGKEM 362 N L D + + + M Sbjct: 972 NFDSLKDVYQPLQESM 987 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 75/199 (37%), Gaps = 28/199 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ ++D S S+ + K ML++I++ + VR G V +S Sbjct: 439 VDIYFLIDGSGSIRSDHFEQIKKF---------MLEVIENFDIGPDKVRVGAVQYSDTRE 489 Query: 229 QTFPLAWGV--QHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + F + + +++ I+ + G T + L++ I +E+ + Sbjct: 490 KEFDITDYTTDETLRKAISNIRQLGGGTYTGEALDFILQIIKKGREQRINK------VPC 543 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 Y+I LTDG + + L + + V+AIG+ Q + +R Sbjct: 544 YLIVLTDGMSMD------DVLEPAEKLRAENIAVHAIGIGEANRTQLQQIAGKEERVSFG 597 Query: 346 QNSRKLHDAFLRIGKEMVK 364 QN L I E++ Sbjct: 598 QNFDSL----KNIKNEVLH 612 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 71/193 (36%), Gaps = 25/193 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ ++ +L I+ D R G+V FS Sbjct: 621 EDMKADIMFLVDSSGSIGPE------NFRTMKSFMKNLLAKIQIGLDK---TRIGVVQFS 671 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + + I + I R+ T + L ++K Sbjct: 672 DVTKEEFKLDTYFTQKEISDAIERMSPIEQNTLTGKALTSIEPYFTESKGAR-------S 724 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +K++I +TDGE + + L + +G +++A+GV Q + Sbjct: 725 MVRKFLILITDGEAQDDVRNPAKVL------RDKGVVIFAVGVFRANRTQLEEISGDSSL 778 Query: 342 FYSVQNSRKLHDA 354 + V++ L + Sbjct: 779 VFQVESFSDLQEI 791 Score = 39.4 bits (90), Expect = 0.90, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 72/200 (36%), Gaps = 28/200 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVT 222 + D D++ ++D S+ + + R+++ L I S DV N +R GL++ Sbjct: 228 EKDSLADVIFLVDESVGTSQNL-----------RNLQNFLKNITSYMDVKDNCLRLGLMS 276 Query: 223 FSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +S + L Q++I +L + + A K+ ++ Sbjct: 277 YSDRAETLSVLKSSTSQAEFQKQIQKLSLRAGKSNVGA---AVEKMRREGFSASSGSRRA 333 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + +T + P+ D + V+A+G+ Q + + Sbjct: 334 LGVPQIAVLVT----NRPSDDEVREATL--NLRLDDVTVFAMGIHGANKTQLEEIVS--- 384 Query: 341 RFYSVQNSRKLHDAFLRIGK 360 Y + + + +++ +G Sbjct: 385 --YPPRQTISMLESYADLGN 402 >gi|148680072|gb|EDL12019.1| mCG141954, isoform CRA_b [Mus musculus] Length = 972 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 44/193 (22%), Positives = 70/193 (36%), Gaps = 33/193 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G + +L + ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMR--LGSPITRLTLMNQAAELYLIQIIEKESL-----VGLVTFDSTATIQ 361 Query: 231 FPLAWGVQHIQEKIN-----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + T GL+ + I + + Sbjct: 362 TNLIRIIND-SSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE-------- 412 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C E K GAI++ I + AA + L + RF Sbjct: 413 -IVLLTDGEDNR--------ISSCFQEVKHSGAIIHTIALGPSAARELETLSDMTGGLRF 463 Query: 343 YSVQNSRKLHDAF 355 Y+ ++ L DAF Sbjct: 464 YAKEDVNGLIDAF 476 >gi|187251529|ref|YP_001876011.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971689|gb|ACC98674.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 335 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 75/200 (37%), Gaps = 31/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F + + P + ++++S + + +DVS SM +L A Sbjct: 62 LLFAGLIFIFIALAGPQWGVEKINVTAQSSHSV---IAVDVSDSMK-ARDLKPTRLENAK 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GST 252 ++K + R+G+V F+SK P+ V+ ++ +N+L Sbjct: 118 T-------MLKMLISAKGEQRTGIVAFTSKAYTQCPITNDVEALKYFVNQLRPEMLNAKG 170 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T P ++ A + KK +I LTDGE+ + ++ A Sbjct: 171 TALAPAVQRAAEMLSK------------YPGKKALILLTDGED----HEPEQIEEAIKTA 214 Query: 313 KRRGAIVYAIGVQAEAADQF 332 ++ G + A+G+ E + Sbjct: 215 QKEGIKIIAVGIGTEEGEPI 234 >gi|221128149|ref|XP_002161198.1| PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial [Hydra magnipapillata] Length = 464 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 49/254 (19%), Positives = 92/254 (36%), Gaps = 42/254 (16%) Query: 121 QHKDYNLSAVSRYE-MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 K ++ + Y+ PF W S AP L + + + +D+++V+D S Sbjct: 2 NDKKLTVACSTEYKDYPFKEKLDIWTLISLKAPSLGMTLDEKEHRKRAPIDLVVVIDKSG 61 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----W 235 SM KL + +++ + + +N R L+TF + + F L Sbjct: 62 SMAGE------KLALVKKTLEFV------VSQLNEKDRLCLITFDTSVYLDFKLTPMTPM 109 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + I + GS T GL ++ D ++ ++ ++ TDG Sbjct: 110 NKYQTLKIIKDISPGSMTNLCGGLMKGLCEVIDRADEEKNEVAS-------VLLFTDGFA 162 Query: 296 SSPNIDNKESLFYCNEA------------KRRGAIVYAIGVQAEAADQFLKNC--ASPDR 341 + + N YC+ + K A +Y G + Q LK A Sbjct: 163 NKGGLTN----IYCSSSQTAKYTIGIVGPKTADASIYTFGFGSNHNAQMLKEISDAGSGM 218 Query: 342 FYSVQNSRKLHDAF 355 +Y ++N + +AF Sbjct: 219 YYYIENVDMIAEAF 232 >gi|194224466|ref|XP_001500626.2| PREDICTED: matrilin 4 [Equus caballus] Length = 542 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 43/195 (22%), Positives = 76/195 (38%), Gaps = 26/195 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ + + + +L + P N R G++ +SS+ Sbjct: 33 GPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRSLDVGP---NATRVGVIQYSSQ 83 Query: 227 IVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + FPL+ + ++ I L+ T + ++YA N F E Sbjct: 84 VQSVFPLSAFSRREDMERAIRALVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEARV 140 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + +TDG +A+ RG +YA+GVQ L+ ASP + Sbjct: 141 PRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDE 193 Query: 341 RFYSVQNSRKLHDAF 355 + V +S L F Sbjct: 194 HVFLV-DSFDLIQEF 207 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 68/176 (38%), Gaps = 25/176 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 305 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 355 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 356 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 410 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + TDG + + + AK G ++YA+GV ++ K + P Sbjct: 411 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEEELRKIASEP 460 >gi|316933619|ref|YP_004108601.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] gi|315601333|gb|ADU43868.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] Length = 483 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 57/152 (37%), Gaps = 19/152 (12%) Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + ++ +IN L T G+ +A+ + Y I+ L+DG Sbjct: 331 STDATTLKGRINTLDAQGGTNQGIGMFWAWMTLQATAPLYTPAKDSEYKYTDAIVLLSDG 390 Query: 294 ENS-----------SPNIDNKESLFYCN--EAKRRGA---IVYAIGVQA--EAADQFLKN 335 N+ SP +D+++ + C+ K G +Y I V + LK Sbjct: 391 MNTKNRWYGNGSNWSPQVDDRQKIL-CDNITTKVNGVPETTIYTIQVNTSGDPESSVLKY 449 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 C S F+S + + AF +G + K RI Sbjct: 450 CGSTGGFFSTTTASGIQSAFQEVGASLTKLRI 481 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 77/237 (32%), Gaps = 34/237 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F G+I+++ I L + +G ++ S + + LD + L A + + Sbjct: 17 FPAASGGNIAVIFGIALLPLLGFVGAAVDYSRASRARTAMQSALDSTALMVAKDLTS-GK 75 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 Q + + ++N D +I+ + + + ++ + Sbjct: 76 ITAENVQSAA------NTYFTSLYKN--------TDAPSIDVTATYTPKTSSENAKLTVG 121 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 F+ + + + ++ L + + LDV+ SM+ Sbjct: 122 GTGSINTEFMKVM--------NISQMSLGASSTTTWGGTRLRVALALDVTGSMD-----S 168 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ------TFPLAWGVQH 239 KL + ++++D +K+ V +V F+ + L W + Sbjct: 169 AGKLSAMKTAAKQLIDTLKATSTTKEDVYISIVPFNVMVNVGPGNKNATWLDWDTSY 225 >gi|115373770|ref|ZP_01461063.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115369169|gb|EAU68111.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 420 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 79/212 (37%), Gaps = 25/212 (11%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + +++ +V+D S SM KL A R+ RE + + + R L Sbjct: 38 PQERRRVPVNVALVIDRSGSMRGQ------KLDDAKRAAREFITRV------SEEDRVAL 85 Query: 221 VTF-SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V + + V LA + + +N + +T + GLE A ++ ++ Sbjct: 86 VHYGTDVTVFPSTLATPETREQMLTFVNAIEDEGSTNISGGLEAAAQQLQKNADQFRVSR 145 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 II L+DG+ ++ ++ + +G V A+GV + + ++ A Sbjct: 146 ---------IILLSDGQPTAGLTREEQLTALARNLRSQGMAVSALGVGEDFNENLMQGIA 196 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 F S +L + F R ++ Sbjct: 197 DQGGGFSGFLRSDQLAEVFTRELEQATSTVAR 228 >gi|310817544|ref|YP_003949902.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309390616|gb|ADO68075.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 470 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 79/212 (37%), Gaps = 25/212 (11%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + +++ +V+D S SM KL A R+ RE + + + R L Sbjct: 88 PQERRRVPVNVALVIDRSGSMRGQ------KLDDAKRAAREFITRV------SEEDRVAL 135 Query: 221 VTF-SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V + + V LA + + +N + +T + GLE A ++ ++ Sbjct: 136 VHYGTDVTVFPSTLATPETREQMLTFVNAIEDEGSTNISGGLEAAAQQLQKNADQFRVSR 195 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 II L+DG+ ++ ++ + +G V A+GV + + ++ A Sbjct: 196 ---------IILLSDGQPTAGLTREEQLTALARNLRSQGMAVSALGVGEDFNENLMQGIA 246 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 F S +L + F R ++ Sbjct: 247 DQGGGFSGFLRSDQLAEVFTRELEQATSTVAR 278 >gi|291569213|dbj|BAI91485.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 412 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 71/200 (35%), Gaps = 26/200 (13%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P ++ + S++ + + +++ ++LD S SMN G ++ + A + + + Sbjct: 16 PNQESNQRQLSISVSAIPDPFEGQVPMNLCLILDHSGSMN---GQPLETVKQAAKELIDR 72 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLE 260 L++ R +V F + P I++KI+ L T GL+ Sbjct: 73 LNV---------GDRISVVAFDHRAKVLVPNQDIADPDGIKKKIDGLRCSGGTAIDEGLK 123 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 ++ K+ LTDGEN DNK L A + Sbjct: 124 LGIEELGKGKQDRISQG----------FLLTDGENEHG--DNKRCLKLAKLATEYKLTIN 171 Query: 321 AIGVQAEAADQFLKNCASPD 340 ++G + L+ A Sbjct: 172 SLGFGNDWNQDILEKIADAG 191 >gi|109733269|gb|AAI16637.1| Von Willebrand factor A domain containing 2 [Mus musculus] Length = 791 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGM 189 Y + F + H + ++SK +D++ +LD S S G Sbjct: 11 YMLLFFRVSPTISLQEVHVNRETMGKIAVASKLMWCSAAVDILFLLDGSHS------IGK 64 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL 247 + R D + P VR G + F S FPL Q ++E I + Sbjct: 65 GSFERSKRFAIAACDALDISPGR---VRVGALQFGSTPHLEFPLDSFSTRQEVKESIKGI 121 Query: 248 IFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 +F T++ L+ + + + +I +TDG++ P Sbjct: 122 VFKGGRTETGLALKRLSRGFPGGR---------NGSVPQILIIVTDGKSQGPVA------ 166 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + RG +V+A+GV+ D+ L + P Sbjct: 167 LPAKQLRERGIVVFAVGVRFPRWDELLTLASEP 199 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 54/161 (33%), Gaps = 22/161 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ IR+ PDV GLV + S++ Sbjct: 528 SLDLVFLLDASASVGRE------NFAQMQSFIRKCTLRFDVNPDVTQ---VGLVVYGSRV 578 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + +++ + S A I D ++ A+ K Sbjct: 579 QTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAG---TALLHIEDKVMTVQRGARPGVP--K 633 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ LT G +++ + + G V + V A Sbjct: 634 AVVMLTGG------SGAEDAAVPAQKLRGNGISVLVMSVGA 668 >gi|26325252|dbj|BAC26380.1| unnamed protein product [Mus musculus] Length = 721 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGM 189 Y + F + H + ++SK +D++ +LD S S G Sbjct: 11 YMLLFFRVSPTISLQEVHVNRETMGKIAVASKLMWCSAAVDILFLLDGSHS------IGK 64 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL 247 + R D + P VR G + F S FPL Q ++E I + Sbjct: 65 GSFERSKRFAIAACDALDISPGR---VRVGALQFGSTPHLEFPLDSFSTRQEVKESIKGI 121 Query: 248 IFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 +F T++ L+ + + + +I +TDG++ P Sbjct: 122 VFKGGRTETGLALKRLSRGFPGGR---------NGSVPQILIIVTDGKSQGPVA------ 166 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + RG +V+A+GV+ D+ L + P Sbjct: 167 LPAKQLRERGIVVFAVGVRFPRWDELLTLASEP 199 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 54/161 (33%), Gaps = 22/161 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ IR+ PDV GLV + S++ Sbjct: 528 SLDLVFLLDASASVGRE------NFAQMQSFIRKCTLRFDVNPDVTQ---VGLVVYGSRV 578 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + +++ + S A I D ++ A+ K Sbjct: 579 QTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAG---TALLHIEDKVMTVQRGARPGVP--K 633 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ LT G +++ + + G V + V A Sbjct: 634 AVVMLTGG------SGAEDAAVPAQKLRGNGISVLVMSVGA 668 >gi|42741661|ref|NP_766428.2| von Willebrand factor A domain-containing protein 2 precursor [Mus musculus] gi|81893914|sp|Q70UZ7|VWA2_MOUSE RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|27657433|emb|CAD60277.1| AMACO [Mus musculus] gi|74225702|dbj|BAE21683.1| unnamed protein product [Mus musculus] Length = 791 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGM 189 Y + F + H + ++SK +D++ +LD S S G Sbjct: 11 YMLLFFRVSPTISLQEVHVNRETMGKIAVASKLMWCSAAVDILFLLDGSHS------IGK 64 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL 247 + R D + P VR G + F S FPL Q ++E I + Sbjct: 65 GSFERSKRFAIAACDALDISPGR---VRVGALQFGSTPHLEFPLDSFSTRQEVKESIKGI 121 Query: 248 IFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 +F T++ L+ + + + +I +TDG++ P Sbjct: 122 VFKGGRTETGLALKRLSRGFPGGR---------NGSVPQILIIVTDGKSQGPVA------ 166 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + RG +V+A+GV+ D+ L + P Sbjct: 167 LPAKQLRERGIVVFAVGVRFPRWDELLTLASEP 199 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 54/161 (33%), Gaps = 22/161 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ IR+ PDV GLV + S++ Sbjct: 528 SLDLVFLLDASASVGRE------NFAQMQSFIRKCTLRFDVNPDVTQ---VGLVVYGSRV 578 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + +++ + S A I D ++ A+ K Sbjct: 579 QTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAG---TALLHIEDKVMTVQRGARPGVP--K 633 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ LT G +++ + + G V + V A Sbjct: 634 AVVMLTGG------SGAEDAAVPAQKLRGNGISVLVMSVGA 668 >gi|319761860|ref|YP_004125797.1| von willebrand factor type a [Alicycliphilus denitrificans BC] gi|330826288|ref|YP_004389591.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] gi|317116421|gb|ADU98909.1| von Willebrand factor type A [Alicycliphilus denitrificans BC] gi|329311660|gb|AEB86075.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] Length = 348 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 70/237 (29%), Gaps = 55/237 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM D+L A + + + D+ VR G+V F+ Sbjct: 88 IILAMDVSGSMRATDVQP-DRLTAAQNAAKAFIQ------DLPRHVRVGVVAFAGTAQLA 140 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE------------------- 271 + + + I+ T + G+ A +F Sbjct: 141 QLPTQSHEDLLKAIDSFQLQRGTATGNGIMMALATLFPDAGIDIAALGGRQSMRVRPIDE 200 Query: 272 ----KLEHIAKGHDDYKKY----IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 Y II LTDG+ ++ + L A RG VY +G Sbjct: 201 VGRADPAKKPFTPVAPGSYRSAAIIMLTDGQRTTGV----DPLEAAQWAADRGVRVYTVG 256 Query: 324 VQAEAA---------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 V + LK A + ++ ++ L + + + Sbjct: 257 VGTVQGELIGFEGWSMRVRLDEDTLKAVALRTNAEYFHAATAQDLRKVYETLSSRLT 313 >gi|239987768|ref|ZP_04708432.1| hypothetical protein SrosN1_10718 [Streptomyces roseosporus NRRL 11379] Length = 527 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 75/205 (36%), Gaps = 26/205 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + V+D+S SM + +L + +S+ + D ++ ++ Sbjct: 156 TAPPTSERPPAALTFVVDISGSMAE-----TGRLDLVRKSLTILADELRDDDSLS----- 205 Query: 219 GLVTFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LVTFS + P+ I++ ++ + +T G++ Y + + + Sbjct: 206 -LVTFSDEAETRLPMTRVKDNRNRIKDVVSEMQPAQSTNVEAGIKLGYEESVEGHREGAT 264 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEAADQFLK 334 ++ L+D ++ + + L + A+ G ++ +GV ++ D F++ Sbjct: 265 NR---------VVLLSDALANTGETEAEGILKKIDSARREYGITLFGVGVGSDYGDAFME 315 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLR 357 V + + F+ Sbjct: 316 QLTNKGDGNTTYVGDETQARKVFVD 340 >gi|222478562|ref|YP_002564799.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451464|gb|ACM55729.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 491 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 69/203 (33%), Gaps = 34/203 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++ + + +V+D S SM ++ +++ + I N G+ F Sbjct: 221 EAGDSVCVQLVIDTSGSMGGS------RIANTKSGAKQLAETILDANPDNQ---VGVTRF 271 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 ++ L + ++ I+ L T + G++ ++ + Sbjct: 272 NNGASTPQQLTDDLDDVEAAIDGLSASGGTNAQAGVDAGQAELENCPHDN---------- 321 Query: 284 KKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 + ++ DG+ N+ + AK G ++AIGV + + P + Sbjct: 322 -RVMVVFGDGDINTDGSA-----------AKVAGTEIFAIGVGGASFSDLEDLASDPADE 369 Query: 341 RFYSVQNSRKLHDAFLRIGKEMV 363 + + + F ++ + + Sbjct: 370 HVFFAIDDGAIEQIFGQVAETIT 392 >gi|149197810|ref|ZP_01874859.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] gi|149139031|gb|EDM27435.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] Length = 833 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 47/268 (17%), Positives = 91/268 (33%), Gaps = 40/268 (14%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS----- 163 +T LS D + Y M +F + I + ++S Sbjct: 350 IEATHLSARQMDNVRKYVTEFGGGLIMTGSENSFGLG---GYYKTPIEEVLPVTSRYEKE 406 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K L +++V+D S SMN G + A+++ E+L + G++ F Sbjct: 407 KEQPSLALVLVIDKSGSMN---GQPIVLAREASKAAAELLSSRD---------QVGVIAF 454 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 L + +I+ + G T P + + + A K++H Sbjct: 455 DGSAKLVTDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDMLGIASAKIKH------ 508 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 +I L+DG++ + + S E + G + + + AA + A Sbjct: 509 -----MIVLSDGQSQGGDFEGISS-----ELAQMGVTISTVSLGQGAAVDLMAAIAQIGN 558 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 R Y N+ ++ F + E + I Sbjct: 559 GRAYVTNNAEEMPRIFTKETMEASRSAI 586 >gi|291398577|ref|XP_002715569.1| PREDICTED: Epithelial chloride channel protein-like [Oryctolagus cuniculus] Length = 958 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 44/192 (22%), Positives = 71/192 (36%), Gaps = 34/192 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ L + + ++ II+ V G+VTF S Sbjct: 364 VCLVLDKSGSMDSED----RLLRMNQAAALYLIQIIERESLV------GMVTFESTAKIQ 413 Query: 231 FPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + G T GL+ + I + + Sbjct: 414 NNLTKITDDDTYQKITANLPQVAGGGTSICSGLKAGFQAITYSNQNTSGSE--------- 464 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 I+ LTDGE++ + C E K+ GAI++ I + AA + L + RFY Sbjct: 465 IVLLTDGEDN--------GIHSCFEEVKQSGAIIHTIALGPSAAKELEILSSMTGGYRFY 516 Query: 344 SVQNSRKLHDAF 355 + ++ L DAF Sbjct: 517 ANKDINGLIDAF 528 >gi|242091866|ref|XP_002436423.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] gi|241914646|gb|EER87790.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] Length = 636 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 76/196 (38%), Gaps = 26/196 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMND------HFGPGMDKLGVATRSIREMLDIIKSIPD 211 + + ++ + +D++ VLDVS SMND P +L + + + M+ ++ Sbjct: 70 TAAVRPEARVPIDVVAVLDVSGSMNDPAAVPPERRPTTSRLDLLKTAAKFMVAKLE---- 125 Query: 212 VNNVVRSGLVTFSSKIVQ--TFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + R +V F+ + V+ + L A G + + +++L T P E A Sbjct: 126 --DGDRLSIVAFNDRPVKELSSGLLYMSADGRRKAMKSVDQLEARGGTALVPAFEEAVKV 183 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +I+ LTDGE++S ++ A R V+ +G+ Sbjct: 184 LDGRVGDGRNRLG-------FIVLLTDGEDTSGFTLSERRREVIRGALGR-YPVHTLGLG 235 Query: 326 AEAADQFLKNCASPDR 341 + L A Sbjct: 236 RAHDPEVLLYLAQESH 251 >gi|255531386|ref|YP_003091758.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344370|gb|ACU03696.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 344 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 43/198 (21%), Positives = 73/198 (36%), Gaps = 23/198 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSS-VKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGV 194 F F + KI G D+M++LDVS SM P ++L Sbjct: 57 LKFVFFVLAYAALVLGAANPQIGTKIEEAKRSGSDLMILLDVSNSMLAGDLAP--NRLEN 114 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A R+I +++D + + R G++ F+ + P+ + +N + TT Sbjct: 115 AKRAISQLIDNLHN-------DRIGIIIFAGEAYVQLPITTDYSAAKLFLNNI----TTD 163 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 P A D K + G K +I +TDGEN +++ A Sbjct: 164 IVPTQGTAIGAAIDMGMKSFNFVNG---TSKAMILMTDGENHE-----DDAVSAAKRASA 215 Query: 315 RGAIVYAIGVQAEAADQF 332 + ++ IGV +E Sbjct: 216 KDVAIHVIGVGSEEGAPV 233 >gi|33321021|gb|AAQ06268.1| unknown [Sorghum bicolor] Length = 610 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 76/196 (38%), Gaps = 26/196 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMND------HFGPGMDKLGVATRSIREMLDIIKSIPD 211 + + ++ + +D++ VLDVS SMND P +L + + + M+ ++ Sbjct: 62 TAAVRPEARVPIDVVAVLDVSGSMNDPAAVPPERRPTTSRLDLLKTAAKFMVAKLE---- 117 Query: 212 VNNVVRSGLVTFSSKIVQ--TFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + R +V F+ + V+ + L A G + + +++L T P E A Sbjct: 118 --DGDRLSIVAFNDRPVKELSSGLLYMSADGRRKAMKSVDQLEARGGTALVPAFEEAVKV 175 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +I+ LTDGE++S ++ A R V+ +G+ Sbjct: 176 LDGRVGDGRNRLG-------FIVLLTDGEDTSGFTLSERRREVIRGALGR-YPVHTLGLG 227 Query: 326 AEAADQFLKNCASPDR 341 + L A Sbjct: 228 RAHDPEVLLYLAQESH 243 >gi|258544594|ref|ZP_05704828.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] gi|258520172|gb|EEV89031.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] Length = 563 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 85/236 (36%), Gaps = 34/236 (14%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW A++ + I + ++ + +++ ++D S SM+D DKL + +++ Sbjct: 178 PWQADAKLIRIAI-QAADLAPEKRPPANLVFLIDTSGSMDDP-----DKLPLVKKTVCHF 231 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLE 260 + +++ R L+T+S + P G + I + L T L Sbjct: 232 AEALRADD------RISLITYSGSTAEILPPTAGDQKETIIAALKPLRAHGATAGGEALR 285 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY+ K I+ TDG+ + D Y + ++ G + Sbjct: 286 MAYDAAAKNYRKDGINR---------ILLATDGDFNVGISDPATLKNYVADKRKSGISLT 336 Query: 321 AIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQ 365 +G + D+ ++ A + + + R+L + +++ Q Sbjct: 337 TLGYGSGNYNDEMMEQLADAGDGNYSYIDSEAEAKKVLVRQLTSTLATVARDIKIQ 392 >gi|291404848|ref|XP_002718766.1| PREDICTED: von Willebrand factor A domain containing 2 [Oryctolagus cuniculus] Length = 787 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 82/221 (37%), Gaps = 22/221 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+F P + + KIS+ S + + +D+ ++ G + Sbjct: 13 FLFARVPPAHPLQEMHVSRETIGKISAASKM-MRCSAAMDILFLLDGSHSVGKASFERSK 71 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST-T 253 R + D + + VR G + F S FPL Q ++ ++ R+IF T Sbjct: 72 RFAITVCDALDISQER---VRVGAMQFGSAPHLEFPLDAFPTRQEVKAQVKRMIFKGGRT 128 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ L+ + F + + ++ +TDG++ + K Sbjct: 129 ETGLALKRLLHGGFPGGR--------NGSVPQLLVIVTDGKSQGHVASP------AKQLK 174 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 RG V+A+GV+ ++ L AS V + ++ DA Sbjct: 175 ERGVTVFAVGVRFPRWEE-LHALASEPWEQHVLLAEQVEDA 214 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 59/174 (33%), Gaps = 23/174 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ + RS D+ PDV GLV + S++ Sbjct: 529 SLDLVFLLDASSSVGPENFAH---MQSFLRSCALRFDVN---PDVTQ---VGLVVYGSRV 579 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + +++ + S A + D ++ A+ K Sbjct: 580 QVAFGLDAHPTRAAVLRAMSQAPYLGGVGSAG---TALLHVHDQVMTVQRGARPGVP--K 634 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 ++ LT G +++ + + G V +G L+ A P Sbjct: 635 ALVMLTGGR------GVEDATVPAQKLRDNGVSVLVVG-MGPVLRDALRRLAGP 681 >gi|291444745|ref|ZP_06584135.1| lipoprotein [Streptomyces roseosporus NRRL 15998] gi|291347692|gb|EFE74596.1| lipoprotein [Streptomyces roseosporus NRRL 15998] Length = 531 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 75/205 (36%), Gaps = 26/205 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + V+D+S SM + +L + +S+ + D ++ ++ Sbjct: 172 TAPPTSERPPAALTFVVDISGSMAE-----TGRLDLVRKSLTILADELRDDDSLS----- 221 Query: 219 GLVTFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LVTFS + P+ I++ ++ + +T G++ Y + + + Sbjct: 222 -LVTFSDEAETRLPMTRVKDNRNRIKDVVSEMQPAQSTNVEAGIKLGYEESVEGHREGAT 280 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEAADQFLK 334 ++ L+D ++ + + L + A+ G ++ +GV ++ D F++ Sbjct: 281 NR---------VVLLSDALANTGETEAEGILKKIDSARREYGITLFGVGVGSDYGDAFME 331 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLR 357 V + + F+ Sbjct: 332 QLTNKGDGNTTYVGDETQARKVFVD 356 >gi|307719357|ref|YP_003874889.1| batA protein [Spirochaeta thermophila DSM 6192] gi|306533082|gb|ADN02616.1| putative batA protein [Spirochaeta thermophila DSM 6192] Length = 332 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 49/261 (18%), Positives = 96/261 (36%), Gaps = 48/261 (18%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGM 189 Y + + S L+++ + + + D +++ LDVS SM PG Sbjct: 51 YRIASLLRDAALWGMLSLMVLILSGPYLVEREQVVVSDPPTIVIALDVSPSMGAMDIPGR 110 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 + VA IR ++S P + GLV F + + P V++ E++ + Sbjct: 111 QRFQVAREVIRGF---VRSYPHMA----VGLVLFGKEALLEVPPTIDVEYFLERLEAVRL 163 Query: 250 GSTTKSTP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 S T G+ + + H+++ + + + ++ LTDG+N++ I + + Sbjct: 164 FSLGDGTALGMGVGTSLL--------HLSRVNASF-RAVVILTDGKNTTGEILPETA--- 211 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF-----------------------LKNCA--SPDRFY 343 A+ G ++ +GV ++ L+ A S +F+ Sbjct: 212 AEMARELGIPLFTVGVGSDRPVSLDVIDPSTGTRYAGVLEEGYDEETLRRIAEISGGQFF 271 Query: 344 SVQNSRKLHDAFLRIGKEMVK 364 S LH F IG Sbjct: 272 SGYTPTSLHRIFQYIGATATA 292 >gi|301756398|ref|XP_002914036.1| PREDICTED: matrilin-2-like isoform 2 [Ailuropoda melanoleuca] Length = 938 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 50 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 100 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 101 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 159 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 160 --ENVLRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDLNTLKAIGSE 211 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 212 PHEDHVFLVANFSQMESLTSVFQN 235 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 79/208 (37%), Gaps = 33/208 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + + ++D + P R GL+ +S Sbjct: 651 TEGPVDLVFVIDGSKSLGEE------NFEIVKQFVTGIIDSLAVSP---KAARVGLLQYS 701 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 702 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQVEGARPL--- 756 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + I TDG + + ++A+ G +YA+GV ++ L+ AS Sbjct: 757 STRVPRVAIVFTDGRAQD------DVSEWASKAQANGITMYAVGVGKAIEEE-LQEIASE 809 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 PTDKHLFYAED----FSTMGEISEKLKK 833 >gi|301756396|ref|XP_002914035.1| PREDICTED: matrilin-2-like isoform 1 [Ailuropoda melanoleuca] Length = 957 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 50 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 100 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 101 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 159 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 160 --ENVLRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDLNTLKAIGSE 211 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 212 PHEDHVFLVANFSQMESLTSVFQN 235 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 79/208 (37%), Gaps = 33/208 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + + ++D + P R GL+ +S Sbjct: 651 TEGPVDLVFVIDGSKSLGEE------NFEIVKQFVTGIIDSLAVSP---KAARVGLLQYS 701 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 702 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQVEGARPL--- 756 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + I TDG + + ++A+ G +YA+GV ++ L+ AS Sbjct: 757 STRVPRVAIVFTDGRAQD------DVSEWASKAQANGITMYAVGVGKAIEEE-LQEIASE 809 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 PTDKHLFYAED----FSTMGEISEKLKK 833 >gi|281350435|gb|EFB26019.1| hypothetical protein PANDA_001886 [Ailuropoda melanoleuca] Length = 942 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 50 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 100 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 101 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 159 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 160 --ENVLRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDLNTLKAIGSE 211 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 212 PHEDHVFLVANFSQMESLTSVFQN 235 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 79/208 (37%), Gaps = 33/208 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + + ++D + P R GL+ +S Sbjct: 651 TEGPVDLVFVIDGSKSLGEE------NFEIVKQFVTGIIDSLAVSP---KAARVGLLQYS 701 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 702 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQVEGARPL--- 756 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + I TDG + + ++A+ G +YA+GV ++ L+ AS Sbjct: 757 STRVPRVAIVFTDGRAQD------DVSEWASKAQANGITMYAVGVGKAIEEE-LQEIASE 809 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 PTDKHLFYAED----FSTMGEISEKLKK 833 >gi|327269537|ref|XP_003219550.1| PREDICTED: matrilin-2-like [Anolis carolinensis] Length = 809 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 42/201 (20%), Positives = 77/201 (38%), Gaps = 26/201 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 SD LD++ ++D S S+ + I +L + PDV R GL+ + Sbjct: 14 SDKRLDLVFIIDSSRSVRPY------DFEKVKEFILNILQFLDIRPDV---TRVGLIQYG 64 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + F L Q ++ + R++ + T + +++A N F E + Sbjct: 65 STVKNEFSLKTFARKQDMERAVRRMMYLSTGTMTGLAIQFAVNIAFSETEGARPL---SQ 121 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + I+ +TDG P +A+ G +++AIGV + + P Sbjct: 122 NVPRVIMIVTDGRPQDPVA------EIAAKARNSGILIFAIGVGRVDMNTLKSIGSEPYE 175 Query: 340 DRFYSVQNSRK---LHDAFLR 357 D + V N + L F Sbjct: 176 DHVFLVANFSQIETLTSVFQN 196 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 84/209 (40%), Gaps = 33/209 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ G++ + + +LD + P R GL+ +S Sbjct: 572 TEGPVDLVFVIDGSKSL------GVNNFEIVKEFVLGILDSLTISP---KAARIGLLQYS 622 Query: 225 SKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L A ++ ++ + G + + L+ + F E H+ Sbjct: 623 TQVRTEFTLKQFSTATDMKKAVSQMKYM--GKGSMTGLALKQMTERSFTEAEGARHL--- 677 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + TDG E + +AK+RG +YAIG+ ++ L+ AS Sbjct: 678 SAKVPRVCVVFTDGRAQD------EVSEWAAKAKQRGITMYAIGIGKAIEEE-LREIASD 730 Query: 340 ---DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + ++ A I +++ K+ Sbjct: 731 PPEKHLFYAED----FSAMGEITEKLQKR 755 >gi|301064778|ref|ZP_07205158.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441153|gb|EFK05538.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 625 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 62/197 (31%), Gaps = 32/197 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + P + ++ K G+D++M +D S SM Sbjct: 61 ILILAVGLIFVAMARPQVGYEWKEVKRK---GIDILMAVDTSKSMLAEDVRP----NRLE 113 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGST 252 RS ++D + + R GL+ F+ PL + + L I Sbjct: 114 RSKFGIMDFVSKL----EGDRVGLLPFAGTAFLMCPLTLDYDAFRNSLEALDTNIIPQGG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + A + K ++ ++DGE+ E+L A Sbjct: 170 TDIASAIYEAEAAFNNDANH------------KILVLVSDGEDLEG-----EALSAAQAA 212 Query: 313 KRRGAIVYAIGVQAEAA 329 K R +Y +GV + Sbjct: 213 KERDLTIYTVGVGTPSG 229 >gi|291295619|ref|YP_003507017.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470578|gb|ADD27997.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 412 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 78/241 (32%), Gaps = 27/241 (11%) Query: 122 HKDYNLSAVSRYE-MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 H D + +A + +P L I + + L++ +VLD S S Sbjct: 2 HPDSSPNARPHLDLIPLKPGVSATRPTRQQVLLRIHTPTPQARPERPLLNLALVLDRSGS 61 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ---TFPLAWGV 237 M + A ++ +L R +V + + + P+A G Sbjct: 62 MGGSKLKYTKE--AAIYAVHNLLPE----------DRVAVVIYDDAVEVLVPSTPVADGR 109 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 I I + G +T L + + A + + G + ++ L+DG + Sbjct: 110 AAIANLIRTIRTGGST----ALHAGW--LEGATQVAAYQEAGRLNR---VVLLSDGLANR 160 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAF 355 + E RRG +GV + + + A +Y +++ L F Sbjct: 161 GETNPGVIAEQVRELARRGVSTSTLGVGLDYNEDLMTTMADAGEGNYYFIESPADLPRIF 220 Query: 356 L 356 Sbjct: 221 A 221 >gi|326335929|ref|ZP_08202106.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691893|gb|EGD33855.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 348 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 82/239 (34%), Gaps = 57/239 (23%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+D++ +DVS SM ++ A R I E+++++ + R Sbjct: 80 TKLETVKREGVDIVFAIDVSKSMLAEDVKP-SRIEKAKRIISELIELL-------HGDRI 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + ++++ PL + + ++ T ++ A N D+ + Sbjct: 132 AFIPYAAQAYPQLPLTSDYSAAKIFLEAINTDMLSSQGTAIGEAIQTAINYFEDSNQS-- 189 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----- 329 K +I L+DGE+ + + E K +G ++ IG+ Sbjct: 190 ---------SKILIILSDGEDHQ-----QGATEMIQEVKEKGIRIFTIGLGTTQGTTIPI 235 Query: 330 ----------------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 L+ A +++ N++++ D ++ + K Sbjct: 236 KENGQTFPKRDKDGEVVITKLNQALLEEIAQEGDGKYFDGSNTQQVIDNLQKVLNNIEK 294 >gi|225465131|ref|XP_002271188.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 768 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 57/317 (17%), Positives = 107/317 (33%), Gaps = 72/317 (22%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 + + N ST + ++ ++ + E+P I A+ S + Sbjct: 214 FSDDEPLVVNSAESTDPTSLVSLSRPQL-VTVKALPELPAI------SASESFRTFAVLV 266 Query: 158 SVKISSKSD-------IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 +K + D +D++ VLDVS SM KL + R++ ++ + Sbjct: 267 GIKAPALLDDAHLLDRAPIDLVAVLDVSGSMAGS------KLSLLKRAVCFLIQNLGPSD 320 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R +V+FSS + FPL G + IN L T GL+ + Sbjct: 321 ------RLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKKGVRVL 374 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-------------PNIDNKESLFY----- 308 + E + II L+DG+++ + + ++ L Y Sbjct: 375 EERSE---------QNPVASIILLSDGKDTYNCDNVNRRQTSHCASSNPRQVLEYLNLLP 425 Query: 309 ---CNEAKRRG-------AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 C + G V+ G ++ + + S F +++ + DAF Sbjct: 426 ASICPRNRESGDEGRQAIIPVHTFGFGSDHDSTAMHAISDESGGTFSFIESVATVQDAFA 485 Query: 357 R-IGK--EMVKQRILYN 370 IG +V Q + Sbjct: 486 MCIGGLLSVVAQELRLT 502 >gi|119512059|ref|ZP_01631153.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] gi|119463285|gb|EAW44228.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] Length = 435 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + L++ ++LD S SM G + + A + + L S + R +V F+ + Sbjct: 40 LPLNLCLILDKSGSM---HGQPIATVIQAVEQLLDRLQPSDSETPTS-GDRISVVAFAGE 95 Query: 227 IVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P I+ +I +L T GL+ ++ + A Sbjct: 96 AQVIIPNQTLQDTASIKAQIHKKLKASGGTAIAEGLQLGITELMKGTKGAVSQA------ 149 Query: 284 KKYIIFLTDGENSS---------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 LTDG S DNK L + ++A + + +G + L+ Sbjct: 150 ----FLLTDGHGESSLKIWKFEIGKDDNKRCLEFAHKATKINLTINTLGFGNDWNQDLLE 205 Query: 335 NC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A ++ + + F + + + R+ Sbjct: 206 KIADAGGGTLAYIERPEQALEQFRHLLQRIQSVRL 240 >gi|283850951|ref|ZP_06368236.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283573597|gb|EFC21572.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 330 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 65/176 (36%), Gaps = 20/176 (11%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 ++ GLD+M+V+D+S SM M + + D + R GL Sbjct: 77 TTAYQGRGLDIMLVVDLSESMA---AMDMRLADRTVTRLDAVADAAARFAANHPGDRIGL 133 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V F S+ P + + + RL G T GL A ++ DA Sbjct: 134 VAFGSRAYAVMPPSADRAALTGALARLAVGAAGKRTAMGDGLGLAVKRLSDAPG------ 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + + DG +++ + +++ A RG VY++GV + FL Sbjct: 188 -----LSRLAVVFGDGRSNAGEVSPEDA---AKAASERGVTVYSVGVGGDEPAPFL 235 >gi|86741605|ref|YP_482005.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86568467|gb|ABD12276.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 534 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 63/196 (32%), Gaps = 22/196 (11%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS-IPDVNNVVRSGLVTFSSKIVQTF- 231 +LDVS SM G + L A R + D + + ++TF+ + Sbjct: 346 LLDVSGSMA---GSRIAALQAALRGLTGADDTLSGRFARFRGREKITMITFAGRANDPVD 402 Query: 232 -------PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P + + + ++ L T LE Y A E Sbjct: 403 FAVNDPRPGSADLAGVNTFVDGLRLQDGTAIYSALEAGYRAAGAAVEADPGYLTS----- 457 Query: 285 KYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRF 342 I+ +TDGEN+S + + S + A R + I A + Sbjct: 458 --IVLMTDGENNSGISAADFRSSYQRLPAAARAVRTFTIAFGEADPAALRDISADTGGAV 515 Query: 343 YSVQNSRKLHDAFLRI 358 + + + L DAF I Sbjct: 516 FDAR-TSSLADAFKDI 530 >gi|309812068|ref|ZP_07705828.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] gi|308433947|gb|EFP57819.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] Length = 597 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 80/235 (34%), Gaps = 22/235 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + P A +S A + + + K D M++++DVS SM G ++ + Sbjct: 363 VLPASLPMMAPASQATTVSADTAWTNLKKDAR--MLVLIDVSGSMQTKIDGGQSRIELME 420 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ----TFPLAWGVQHIQ--EKINRLIFG 250 + LD++ R G FSS + + PL G Q I N LI Sbjct: 421 STAIAALDVL------PKTTRLGAWAFSSNLQKNHVDYLPLTNGEQPILDDTYRNGLIAK 474 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPN----IDNKE 304 + T + ++D D ++ LTDG N PN +D Sbjct: 475 AHTLPGLAAKNGDTALYDTIAAAYKSVTDTYDPNYVNSVVVLTDGTNDDPNGGLALDQLL 534 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 + + + + I + LK A+ Y + ++ F+ Sbjct: 535 ARLKSQYSADKPVKIVTISLGTGTDPDALKRIAKATDGLSYQTKTPEQISGVFVD 589 >gi|229825750|ref|ZP_04451819.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] gi|229790313|gb|EEP26427.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] Length = 1659 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 82/216 (37%), Gaps = 26/216 (12%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +++ D+ +K+ + + + + + L + S ++ +++ Sbjct: 28 INPSAVKAGPDEYYKNGSEKQENGVTISKKVTRYNAADGTYDIELKVKGSTEV-VQNNKI 86 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++V+D S SM L A ++ +D + NN V G+V+F+ K Sbjct: 87 LDIVLVMDTSGSMEGK------SLENAKKAANNFVD---KLLPQNNNVNIGIVSFAEKGE 137 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L V ++ I L T + GLE A + A ++KK ++ Sbjct: 138 IKSGLTRNVTTLKNAIKGLKADGGTYTQQGLEKAATVLNGAP----------AEHKKVMV 187 Query: 289 FLTDGENS-----SPNIDNKESLFYCNEA-KRRGAI 318 + DGE + PN D N A K+ G Sbjct: 188 VIGDGEPTYANGEHPNFDKGGFYRIYNPATKKEGYE 223 >gi|126310280|ref|XP_001371684.1| PREDICTED: similar to collagen XXI [Monodelphis domestica] Length = 957 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 81/216 (37%), Gaps = 30/216 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + + + + + P ++ G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGPE------NFEIVKKWLVNISNNFDIGP---KFIQVGVV 79 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S V PL + E + + G T++ +++A + +F AK Sbjct: 80 QYSDYPVLEIPLGSHHSGENLMEAMESIQYLGGNTRTGKAIQFALDHLF---------AK 130 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 K + LTDG++ D A+ ++AIGV +E D L+ A Sbjct: 131 SSRFLTKIAVVLTDGKSQDEVKD------AAEAARDNRITLFAIGVGSETEDAELRAIAN 184 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 S + V++ + I +++ ++ + + Sbjct: 185 KPSSTYVFYVEDYIAISKIREVIKQKLCEESVCPTR 220 >gi|291388325|ref|XP_002710627.1| PREDICTED: matrilin 3-like [Oryctolagus cuniculus] Length = 1109 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 83/223 (37%), Gaps = 27/223 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P+ + T+ ++ S + D++ ++D S S+N H + I ++ Sbjct: 184 PFSRGRTARTHPQTALLE-GSCGNKRADLVFIIDSSRSVNTHDYAKV------KEFIVDI 236 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGL 259 L + PDV R GL+ + S + F L ++ + R+ + T + + Sbjct: 237 LQFLDIGPDV---TRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAI 293 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 +YA N F E + ++ + I+ +TDG +A+ G ++ Sbjct: 294 QYALNIAFSEAEGARPLR---ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILI 344 Query: 320 YAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 +AIGV + + P D + V N L F Sbjct: 345 FAIGVGQVDLNTLKAIGSEPHEDHVFLVANFSQIESLTSVFQN 387 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 81/196 (41%), Gaps = 25/196 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ LD++ V+D S S+ + D + + + ++D + P R GL+ +S Sbjct: 803 TEGPLDLVFVIDGSKSLGE------DNFEIVKQFLTAIIDSLAVSP---KAARVGLLQYS 853 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +++ F L ++ +++ + + G + + L++ + + F E + Sbjct: 854 TQVRAEFTLRSFSTAKDMKKAVAHMKYMGKGSMTGQALKHMFERSFTQLEGARPL---SA 910 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + I TDG + + N+AK G +YA+GV ++ L+ AS Sbjct: 911 RVPRVAIVFTDGRAQD------DVSEWANKAKANGITMYAVGVGKAIEEE-LQEIASEPT 963 Query: 339 PDRFYSVQNSRKLHDA 354 + ++ +++ Sbjct: 964 DKHLFYAEDFSTMNEI 979 >gi|126660809|ref|ZP_01731904.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] gi|126617906|gb|EAZ88680.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] Length = 416 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 72/217 (33%), Gaps = 28/217 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 +N+ + S+V SS + L++ ++LD S SM + + + Sbjct: 16 ANQSNTQRQVAISLSAVSESSDRTLPLNLGLILDHSGSMTGK------PIKTVKEAAIRL 69 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++ + S R +V F K P ++ + ++I RL T G++ Sbjct: 70 VESLGSGD------RLSVVAFDHKAKVIVPNQPIDDIKTVNQQIQRLEPAGGTCIDEGMK 123 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 ++ K+ I LTDGEN DN+ L A + Sbjct: 124 LGIKEVALGKDDRVSQ----------IFLLTDGENEHG--DNERCLKLAQVAAEYNITLN 171 Query: 321 AIGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAF 355 +G L++ A ++ + F Sbjct: 172 TLGFGNHWNQDVLESIADAVGGTLCYIEQPEQAITEF 208 >gi|159040640|ref|YP_001539892.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] gi|157919475|gb|ABW00902.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] Length = 474 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 78/234 (33%), Gaps = 35/234 (14%) Query: 131 SRYEMPFIFCTFPWCANS-SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 Y + + + + P+ V +D+++ LD S SM + G M Sbjct: 258 PAYRIDLDKTSMNMVRKTFLNKPMSTRDIVVREYADVKLMDIVLCLDTSGSMKEFSGAYM 317 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI- 248 K+ +A +I + + + R +V F+ + + WG +++ IN + Sbjct: 318 -KMDIAKEAIVKYIRYLS-----RTNDRLSMVLFNFRAD----ILWGPHSVKKYINEMEE 367 Query: 249 ------FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 G T LE A + + K+II +TDG ++ Sbjct: 368 MSRYIYPGGGTNIANALEKARIILSKSN-----------YPNKHIICITDGR----TVNA 412 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDA 354 + + +R G + + V + L + F + + L A Sbjct: 413 SSCIKEAVKLRRMGVTLSTVAVGDNSDFDLLMRLSKIGNGLFIKINDISNLDKA 466 >gi|239941305|ref|ZP_04693242.1| hypothetical protein SrosN15_09946 [Streptomyces roseosporus NRRL 15998] Length = 516 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 75/205 (36%), Gaps = 26/205 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + V+D+S SM + +L + +S+ + D ++ ++ Sbjct: 157 TAPPTSERPPAALTFVVDISGSMAE-----TGRLDLVRKSLTILADELRDDDSLS----- 206 Query: 219 GLVTFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LVTFS + P+ I++ ++ + +T G++ Y + + + Sbjct: 207 -LVTFSDEAETRLPMTRVKDNRNRIKDVVSEMQPAQSTNVEAGIKLGYEESVEGHREGAT 265 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEAADQFLK 334 ++ L+D ++ + + L + A+ G ++ +GV ++ D F++ Sbjct: 266 NR---------VVLLSDALANTGETEAEGILKKIDSARREYGITLFGVGVGSDYGDAFME 316 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLR 357 V + + F+ Sbjct: 317 QLTNKGDGNTTYVGDETQARKVFVD 341 >gi|148656912|ref|YP_001277117.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569022|gb|ABQ91167.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 543 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 76/195 (38%), Gaps = 26/195 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK--- 226 D+++V+DVS SM KL A + L I +P+ R GL+ FS+ Sbjct: 369 DILLVVDVSGSMEGE------KLEAAKSGLGTFLSRI--LPE----DRVGLIVFSTDARV 416 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 +V PL+ + + I +L T L D + ++ + Sbjct: 417 VVPPAPLSEARIALDDAIAQLNARGKTALYDALITGKQVFDD-------LPPPDEERIRA 469 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 I+ L+DG +++ + +E G ++ + ++A L+ A+ R VQ Sbjct: 470 IVLLSDGLDNASRTTLDQVRLAFDET---GISIFPVAYGSDADLAALEQIATFSRTIVVQ 526 Query: 347 -NSRKLHDAFLRIGK 360 ++ + F + + Sbjct: 527 GDTGDIGQIFENLSR 541 >gi|330901495|gb|EGH32914.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 218 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 62/155 (40%), Gaps = 14/155 (9%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ P + + +++ G D+++ +DVS SM+ P M + + ++ Sbjct: 72 ATARPQWLGEPLPVAA---SGRDLLVAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLG 125 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + R GL+ F S+ PL + + ++ ++ G K+T + Sbjct: 126 DFLEGRKGDRVGLILFGSQAFVQAPLTYDRRTVRVWLDEARIGIAGKNT--------ALG 177 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 DA + + ++ +TDG N++ ID Sbjct: 178 DAIGLALKRLRMRPATSRALVLVTDGANNAGQIDP 212 >gi|149922129|ref|ZP_01910569.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] gi|149817066|gb|EDM76548.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] Length = 546 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 74/191 (38%), Gaps = 19/191 (9%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 GLD+ +VLD S SM DKL A ++ ++++ + R L+++ + Sbjct: 130 GLDLAIVLDRSGSMGG------DKLRFAKQAGLDLVNRLD------EQDRVTLISYDDTV 177 Query: 228 VQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-D 282 L G++ ++ ++ + G TT P L ++ + D Sbjct: 178 TPLSNLQRVDDDGIEVLRRQLLDIQVGGTTALGPALFMGLQRLAAPEPFGPQTRTEARHD 237 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 +++I L+DG + + E G V +G+ + + + A Sbjct: 238 RLRHVILLSDGIANVGETRPEVIGGRVAEHFGGGVSVSTLGMGLDYNEDLMTRIADEGGG 297 Query: 341 RFYSVQNSRKL 351 R++ ++++ + Sbjct: 298 RYHFIEDAESI 308 >gi|224106794|ref|XP_002314287.1| predicted protein [Populus trichocarpa] gi|222850695|gb|EEE88242.1| predicted protein [Populus trichocarpa] Length = 688 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 49/246 (19%), Positives = 88/246 (35%), Gaps = 40/246 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 I P + + S + S S +D++ VLDVS SM+ KL + Sbjct: 232 LIHLKAPLTSGRQNRNWNHAESPQSSQDSRAPVDLVTVLDVSGSMSG------TKLALLK 285 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGST 252 R++ ++ + R ++ FSS + FPL G + +N L+ Sbjct: 286 RAMGFVIQNLGPSD------RLSVIAFSSTARRHFPLRRMTETGKLEALQAVNSLVSSGG 339 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN-- 310 T GL + + D K K + II L+DG+++ + + Sbjct: 340 TNIAEGLRKGFKVVVDRKWKNPVCS---------IILLSDGQDTYTISGTSMTRPQADYK 390 Query: 311 -----EAKRRG-----AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR- 357 R G V+A G ++ + + + S F ++ + DAF + Sbjct: 391 SLLPTSIHRNGSSGFRIPVHAFGFGSDHDAASMHSISEISGGTFSFIEAEGVIQDAFAQC 450 Query: 358 IGKEMV 363 IG + Sbjct: 451 IGGLLS 456 >gi|291295701|ref|YP_003507099.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470660|gb|ADD28079.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 298 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 64/169 (37%), Gaps = 17/169 (10%) Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R + + + + ++ G+VTFS P + I++ I+ L G T Sbjct: 105 RMVATQMAAKALVDKLPRHIKVGVVTFSGYGTLLLPPTTDRKAIRQAIDNLDLGGGFSFT 164 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 GL A + I+ + G + S N + L ++A RG Sbjct: 165 YGLLAALEALPQTP--------PEGSRPGVIVLFSHGHDVSGN----DPLKIADQALERG 212 Query: 317 AIVYAIGVQAEA---ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 V+AIGV ++ LK A + R+Y + ++ L A +G+ Sbjct: 213 IQVHAIGVGTHGHNFDEEMLKKVADRTGGRYYPIFSASDLSKAHADLGR 261 >gi|255587116|ref|XP_002534143.1| protein binding protein, putative [Ricinus communis] gi|223525789|gb|EEF28236.1| protein binding protein, putative [Ricinus communis] Length = 728 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 56/291 (19%), Positives = 101/291 (34%), Gaps = 48/291 (16%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDY-NLSAVSRYEMPFIFCTFPWCANSSHA 151 R+ + +IE T + + Y N + + + P N + Sbjct: 200 RSSGNKTPDHNSQKSIEIKTYPEVPSASRSCAYDNFTVLVHLKAPATVTMQNPRINQASL 259 Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P L S +D++ VLD+S SM KL + R++ ++ + S Sbjct: 260 PQLSQSP-------RAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND- 305 Query: 212 VNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R ++ FSS + FPL G Q + +N L+ T GL + Sbjct: 306 -----RLSVIAFSSTARRLFPLRRMSDTGRQQALQAVNSLVAHGGTNIAEGLRKGAKVME 360 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN------------EAKRR 315 D +EK + II L+DG+++ + + N + Sbjct: 361 DRREKNPVAS---------IILLSDGQDTYTVSSSGANQPQPNYHLLLPLSIHGGDTSGF 411 Query: 316 GAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 V+A G A+ + + + S F ++ + DAF + IG + Sbjct: 412 QIPVHAFGFGADHDASSMHSISEVSGGTFSFIETEAVIQDAFAQCIGGLLS 462 >gi|282896313|ref|ZP_06304335.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281198809|gb|EFA73688.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 336 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + S++ + S + L++ ++LD S SM G ++ + A + + Sbjct: 27 ISVSAIGETIDSRVPLNLCLILDHSGSMK---GQPVENVKRA---------AWLLVDKLR 74 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + R +V F+ + HI+++INRL T GL ++ + Sbjct: 75 DQDRLSIVVFNHRAEVLLSNQNVVDRDHIKQQINRLSANGGTSIDEGLRLGIEELAKGRR 134 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 A LTDGEN DN L + A V +G Sbjct: 135 DTISQA----------FLLTDGENEHG--DNNRCLKFAQLAADYNLTVNTLGFGNNWNQH 182 Query: 332 FLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 L+ A +++ + D F + M Sbjct: 183 ILEKISDAGLGSLSHIEHPDQAVDKFNSLLMRMQT 217 >gi|262193846|ref|YP_003265055.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077193|gb|ACY13162.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 344 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 72/204 (35%), Gaps = 32/204 (15%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKL 192 + + + P + + ++S S++ D+M+VLDVS SM D P +L Sbjct: 61 RLALMLGCLVLAVLALMQPQIPGRTETVTS-SEVSADIMVVLDVSRSMLADDVAP--TRL 117 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLI 248 A + E+ ++ R GLV F+ + PL + ++ + + Sbjct: 118 ARAKAEVAELSSALRGH-------RIGLVAFAGRASVLAPLTPDYGFFRMILDGVDTKSV 170 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T+ L A K I+ +TDGE+ ++ Sbjct: 171 SRGGTEIGQALRKAVRSFDPGPGA------------KMILLITDGEDHGGYAEDA----- 213 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF 332 EA G V AIG +E Q Sbjct: 214 AREALEAGVRVVAIGFGSEQGSQI 237 >gi|258624851|ref|ZP_05719779.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582849|gb|EEW07670.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 189 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 50/126 (39%), Gaps = 17/126 (13%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSIREMLDII 206 P+ V S+ D+M+V+D+S SM+ + G +D+L + + E + Sbjct: 69 RPVWYGEPVSTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVLSEFIT-- 123 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAY 263 R GL+ F+ PL Q + ++N+ + G+ T G+ A Sbjct: 124 -----KREGDRMGLILFADHAYLQTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLAT 178 Query: 264 NKIFDA 269 D+ Sbjct: 179 KTFIDS 184 >gi|326532158|dbj|BAK01455.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 674 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 52/245 (21%), Positives = 87/245 (35%), Gaps = 41/245 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 I P + L+ S S +D++ V+D+S SM KL + Sbjct: 230 LIHLKAPSASPDQATCRLVNESSVRSPSGRAPVDLVTVIDISGSMAG------TKLALLK 283 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGST 252 R++ ++ + R ++ FSS + F L +G Q + IN L G Sbjct: 284 RAMGFVIQHLGPSD------RLSVIAFSSSARRLFHLQRMSHYGRQQALQAINSLGAGGG 337 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES------- 305 T L+ A I D K + II L+DG+++ N Sbjct: 338 TNIADALKKATKVIEDRSYKNSVCS---------IILLSDGQDTYNICSNVRGGSKDYSS 388 Query: 306 ----LFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR-I 358 + RR ++A G A+ L + AS F +++ + DAF + I Sbjct: 389 LVPPSILSDT--RRMLPIHAFGFGADHDSDSLHSIAEASGGTFSFIEDEGVMQDAFAQCI 446 Query: 359 GKEMV 363 G + Sbjct: 447 GGLLS 451 >gi|326513050|dbj|BAK03432.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326519604|dbj|BAK00175.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326532408|dbj|BAK05133.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 700 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 52/245 (21%), Positives = 87/245 (35%), Gaps = 41/245 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 I P + L+ S S +D++ V+D+S SM KL + Sbjct: 230 LIHLKAPSASPDQATCRLVNESSVRSPSGRAPVDLVTVIDISGSMAG------TKLALLK 283 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGST 252 R++ ++ + R ++ FSS + F L +G Q + IN L G Sbjct: 284 RAMGFVIQHLGPSD------RLSVIAFSSSARRLFHLQRMSHYGRQQALQAINSLGAGGG 337 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES------- 305 T L+ A I D K + II L+DG+++ N Sbjct: 338 TNIADALKKATKVIEDRSYKNSVCS---------IILLSDGQDTYNICSNVRGGSKDYSS 388 Query: 306 ----LFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR-I 358 + RR ++A G A+ L + AS F +++ + DAF + I Sbjct: 389 LVPPSILSDT--RRMLPIHAFGFGADHDSDSLHSIAEASGGTFSFIEDEGVMQDAFAQCI 446 Query: 359 GKEMV 363 G + Sbjct: 447 GGLLS 451 >gi|301626452|ref|XP_002942405.1| PREDICTED: collagen alpha-6(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2615 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 45/210 (21%), Positives = 80/210 (38%), Gaps = 29/210 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + ++ D++ ++D S S+N D + M+ + PD V+ G Sbjct: 1015 TPEACKNMKADIVFLVDSSASINS------DDYETMKEFMESMVKQAEIGPDR---VQIG 1065 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 L+ FSS+ + FPL +++I T L+Y +K Sbjct: 1066 LIQFSSETKEEFPL--NRYKRKDEIQSAIRGIQQLSQGTLMGEALKYTLPYFSASKG--- 1120 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 G + K+Y+I +TDGE + +++ + G I+YAIGVQ Q L+ Sbjct: 1121 ----GRVNTKQYLIVITDGEAQDAVGNPAKAI------RDHGVIIYAIGVQQANNTQLLE 1170 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ Y + L I E+ Sbjct: 1171 IAGKQEQVYYEDSFDSLAFLNKNIMFEICN 1200 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 84/206 (40%), Gaps = 25/206 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S LD++ ++D S S+ A ++E++D + VR G Sbjct: 1393 SKPACSHEQLDLVFLIDGSASITSS------NFTSAKTFMKEIVDSFTISENR---VRIG 1443 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + +S+ + F L + ++++I+ + +TT + GL + A Sbjct: 1444 VAQYSANPKKEFFLNEYYSSSDMKKQIDSISQLKATTYTGKGLRFVKQFFDPANG----- 1498 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 G + +Y+I +TDG ++ ++ +L + G +++IG+ + + + Sbjct: 1499 --GRKNVPQYLIVMTDGMSNDSVNEDAAAL------RSSGVKIFSIGIGLRNSFELVMIA 1550 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEM 362 SP Y V+ + L +I ++ Sbjct: 1551 GSPKNVYEVETFQALDSIKRQIVAQV 1576 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 51/315 (16%), Positives = 107/315 (33%), Gaps = 36/315 (11%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDF---RNELRENG---FAQDINNIERSTSLS 115 K+ ++NG + + + +F +LR +G FA NI T L Sbjct: 721 KMFTKQNGGRPHQGVQQIAVVMTNGQSMDNFTKPAAKLRRSGVEVFAVGFQNI-NDTELD 779 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD----- 170 II + + + S ++ + + + + +D Sbjct: 780 IIASHPPRKHVTNVESFLQLSNLEFRIQKRLCNEIVVKSFAVPAIARAVKEGCVDTEEAD 839 Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIV 228 + ++D S S+ M K M+++I N VR G+V +S + Sbjct: 840 IYFLIDGSGSIYPEDFEDMKKF---------MIELISMFQVGANRVRFGVVQYSDVRRTE 890 Query: 229 QTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + +++ I+++ G T + L +A + H + + Sbjct: 891 FFISEHNTQKMLKDAISQIEQLGGGTLTGEALTSMKQLFVNAAKDRPHK------VPQSL 944 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + +TDGE+ E + G ++AIGV+ ++ S ++ + V N Sbjct: 945 VVITDGESQD------RVTEAAAEIRNDGITIFAIGVKNAVEEEIRDIAGSNEKMFFVNN 998 Query: 348 SRKLHDAFLRIGKEM 362 L + +E+ Sbjct: 999 FDSLKVIKNDLAREL 1013 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 77/212 (36%), Gaps = 25/212 (11%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 C A ++ ++ D++++++ + M D A + +++ Sbjct: 211 LCTTVEEAARQTGQIAQVCRTANQA-DIVLLVESTTRMGDA------TFEKAKNFLYDLV 263 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLE 260 + N +R GLVT++ + F L + + ++ N T + LE Sbjct: 264 SNLDV---GINKIRIGLVTYNDETNPEFLLNSYSSKTEILESIQNMKYVEGYTYTGRALE 320 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 Y F ++ + + +I +T+G++S + E K RG VY Sbjct: 321 YVNTTYF----TQAAGSRFEESVAQILIIVTEGDSSDTLTEP------AKELKSRGISVY 370 Query: 321 AIGVQAEAADQFLKNCASPDR--FYSVQNSRK 350 +G + Q + + PD FY + + Sbjct: 371 VVGTNIKYDRQLQEASSKPDEKFFYQLDDFDD 402 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 41/325 (12%), Positives = 107/325 (32%), Gaps = 21/325 (6%) Query: 40 HKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN 99 +++ K ++ + + ++ + R+ + + ++ Sbjct: 1080 NRYKRKDEIQSAIRGIQQLSQGTLMGEALKYTLPYFSASKGGRVNTKQYLIVITDGEAQD 1139 Query: 100 GFAQDINNIERS----TSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLI 155 I ++ + + + ++ + ++ + + + I Sbjct: 1140 AVGNPAKAIRDHGVIIYAIGVQQANNTQLLEIAGK-QEQVYYEDSFDSLAFLNKNIMFEI 1198 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + + K+++ D++ +LD S S+ + + R + +++ + Sbjct: 1199 CNPQESCKKTEVA-DIIFLLDASASITR------GEFRLMQRFVEAVVNDSLV---GKDN 1248 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 V+ G V + + + F L K++ L + G Y + + + Sbjct: 1249 VQFGAVVYGTNPAEQFSL----NTYSTKLDILKAVFSLPQVSGYTYTAKALEYTRIRFGT 1304 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 G +I +TDG + D K G IV+A+GV + + Sbjct: 1305 SYGGRPGISHILILVTDGATTE--ADRPNLPIVSKALKDDGIIVFAVGVGKAVPQELQQI 1362 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGK 360 PDR++ VQN + L + I + Sbjct: 1363 AGYPDRWFLVQNYKGLDNIHDNITQ 1387 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 71/194 (36%), Gaps = 22/194 (11%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ M+ + G + + +++ D ++ GL+ +S F L Sbjct: 33 DIVFLMDGSWSIGTENFITMKNFLYTLVNGFDVGLDK---IQIGLIQYSDNARTEFFLNS 89 Query: 236 --GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + + I L + G TK+ LE+ + F ++ + + + +TD Sbjct: 90 YSNKEDVLKYIQNLKYKGGGTKTGLSLEFMLTQHFS----EAAGSRAAEGVPQIAVVITD 145 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRK 350 G+ K G I+YAIG++ + + + PD YSV + Sbjct: 146 GQAQD------SIREPAIAVKNAGIILYAIGIKDAVLSELNEIASDPDDKHVYSVADFNA 199 Query: 351 LHDAFLRIGKEMVK 364 L I + M++ Sbjct: 200 L----QSISQNMIQ 209 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 69/172 (40%), Gaps = 23/172 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVTFSSKIV 228 D++ ++D S S+ GP +L L + S D++ + VR GLV +S + Sbjct: 632 DIVFLIDESSSI----GPINFQLTRV------FLHKVVSALDISLSNVRVGLVLYSDEPR 681 Query: 229 QTFPL-AWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + ++ I + I +L + T A + + ++ + H ++ Sbjct: 682 LELKLNTFNEKYEILDFITKLPYRGGKAHTGA---ALDFLRKKMFTKQNGGRPHQGVQQI 738 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + +T+G++ + +R G V+A+G Q D L AS Sbjct: 739 AVVMTNGQSMD------NFTKPAAKLRRSGVEVFAVGFQ-NINDTELDIIAS 783 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 70/201 (34%), Gaps = 20/201 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S SM F I +++ + + + R GL +S Sbjct: 431 DVVFLVDSSTSMGTIF------FQKMKDFIIHIINQLNVGINKH---RIGLAQYSGLPQT 481 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY--AYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L +E+I + I + T L+ A + E ++ + +++ Sbjct: 482 EFLL--NHYETKEEILKHIKETFTYRGGPLKTGHALEFVRSTFFIEEAGSRINYGNPQFL 539 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + +T + + E K G ++G+ + L+ A+ + Sbjct: 540 VVIT------SSKSEDAVRRHAEELKSVGVTTISVGIGNSDRKE-LEKIATDPFVFQTTG 592 Query: 348 SRKLHDAFLRIGKEMVKQRIL 368 + + + + ++ + +L Sbjct: 593 LQHISNLQQDVANVIIAEDML 613 >gi|148657485|ref|YP_001277690.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569595|gb|ABQ91740.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 459 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 85/235 (36%), Gaps = 37/235 (15%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIG-------LDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P + L + S K+ + + L ++ VLDVS SM+ KL A Sbjct: 58 PGQNVDRYLLLTLCSPAKVPPEHALPREQHRPPLHLVAVLDVSGSMSG------TKLASA 111 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGS 251 ++R+ L ++ + LVTFS ++ ++ ++ + Sbjct: 112 KEALRQALHFLQDGDVFS------LVTFSDQVQTHLKAESYAQRKRDKMENLLDEIRASG 165 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GL + ++ ++ L+DG+ + D ++ + Sbjct: 166 MTALDGGLAQGIDLGQKKRQATT-----------LVLLLSDGQANVGETDLEKIGLRAQK 214 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDA-FLRIGKEMV 363 A++ G IV +GV + + + A RFY +Q ++ A +G + Sbjct: 215 ARQSGLIVSTLGVGLDYNEALMVEIANQGGGRFYHIQEGSQIPAALMQELGSAAM 269 >gi|224046761|ref|XP_002188607.1| PREDICTED: collagen, type XXII, alpha 1 [Taeniopygia guttata] Length = 1598 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 42/193 (21%), Positives = 70/193 (36%), Gaps = 29/193 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + +++ + PD R G+V +S + Sbjct: 40 DLVFLLDTSSSV------GKEDFEKVRQWVSNLVETFEIGPDK---TRVGVVRYSDRPST 90 Query: 230 TFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L G +E+I +G T + L Y F K K Sbjct: 91 EFDL--GKYKTREEIKEAARKIQYYGGNTNTGDALRYITTYSFS---KEAGGRLSDRTVK 145 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDR 341 K I LTDG + +D A + G ++A+GV EA + L AS Sbjct: 146 KVAILLTDGRSQDFVLDP------ATAAHQAGIRIFAVGVG-EALKEELDEIASEPKSAH 198 Query: 342 FYSVQNSRKLHDA 354 + V + + Sbjct: 199 VFHVSDYNAIDKI 211 >gi|187736264|ref|YP_001878376.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426316|gb|ACD05595.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 754 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 66/191 (34%), Gaps = 27/191 (14%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P IF + ++ + ++++ +D S SM +LG Sbjct: 56 LPVIFAVLASIFAILSIARPVDGYTEVK-EIPKSRNILIAIDCSRSMLSKDASP-TRLGR 113 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---- 250 A + ++LD + +N G++ FS V PL ++E I +L FG Sbjct: 114 AKTAAYDLLDAL----PGDNF---GIIIFSGDAVLLMPLTHDHNALKETIEQLQFGWVSQ 166 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + A KE A ++ L+DGE++ + Sbjct: 167 GGTNLENVVRLALQTFKRDKEADAKNA---------LVILSDGEDT-----VNITYKTAE 212 Query: 311 EAKRRGAIVYA 321 A++ I+ Sbjct: 213 AARQHQLIIVT 223 >gi|15822539|gb|AAG23712.1| calcium-activated chloride channel CLCA4 [Mus musculus] gi|148680071|gb|EDL12018.1| mCG141954, isoform CRA_a [Mus musculus] gi|162317876|gb|AAI56643.1| Chloride channel calcium activated 4 [synthetic construct] Length = 909 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 44/193 (22%), Positives = 70/193 (36%), Gaps = 33/193 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G + +L + ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMR--LGSPITRLTLMNQAAELYLIQIIEKESL-----VGLVTFDSTATIQ 361 Query: 231 FPLAWGVQHIQEKIN-----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + T GL+ + I + + Sbjct: 362 TNLIRIIND-SSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE-------- 412 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C E K GAI++ I + AA + L + RF Sbjct: 413 -IVLLTDGEDNR--------ISSCFQEVKHSGAIIHTIALGPSAARELETLSDMTGGLRF 463 Query: 343 YSVQNSRKLHDAF 355 Y+ ++ L DAF Sbjct: 464 YAKEDVNGLIDAF 476 >gi|312958282|ref|ZP_07772803.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] gi|311287346|gb|EFQ65906.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] Length = 546 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 79/225 (35%), Gaps = 26/225 (11%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S + + PW + + I +S + ++ +++ ++DVS SM+ G Sbjct: 153 SPFGVTTEVAATPWNPRTQLLRIGIKASDRAVAEL-APANLVFLVDVSGSMDRREGLP-- 209 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV--QHIQEKINRLI 248 +S ++L + + R LV ++ + P G I+ I++L Sbjct: 210 ----LVQSTLKLL-----VDQLREQDRVSLVVYAGESRVVLPPTSGRDKAKIRTAIDQLT 260 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G +T G++ AY + I+ TDG+ + D Sbjct: 261 AGGSTAGASGIQLAYQMAREGFIDKGINR---------ILLATDGDFNVGISDFDSLKQM 311 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLK---NCASPDRFYSVQNSRK 350 E ++ G + +G + ++ L A + + R+ Sbjct: 312 AAEQRKSGVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDTLRE 356 >gi|28210485|ref|NP_781429.1| membrane-associated protein [Clostridium tetani E88] gi|28202922|gb|AAO35366.1| membrane-associated protein [Clostridium tetani E88] Length = 842 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 55/307 (17%), Positives = 107/307 (34%), Gaps = 37/307 (12%) Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN---GFAQDINNIERSTSLSIIIDDQ 121 EN K N S + N D N D N+ + ++ + Sbjct: 306 ELENIYRLLKNVNIDSQKYFSNEVSGDVNFLSDFNEIILVNTDYKNLPKDFDTNLEKVVK 365 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 L + N+ LL S + + +++++D S SM Sbjct: 366 EFGSGLMVIGGEN----SFALGSYENTKFEELLPVSCNVKNKRKQGDAGIVLLIDCSGSM 421 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGVQH 239 +D G G+ K+ +A + E + ++S G++ FS I P A + Sbjct: 422 DDESG-GVKKIELAKQGAIETIKALESED------YIGILGFSDTIDWVVPFQKAENKEK 474 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + +++ +L T PGL + AK K++H +I LTDG+ Sbjct: 475 LIKEVGKLKPKGGTLIIPGLIEGVKTLSSAKTKVKH-----------MILLTDGQAEKNG 523 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 D Y K+ + +G+ ++ + L + + + R Y + + + F + Sbjct: 524 FD-----KYLENMKKNNMTLSTVGLGEDSDREVLTHLSDFTGGRKYFSNDFKSVPIIFAK 578 Query: 358 ---IGKE 361 I ++ Sbjct: 579 ETRISQK 585 >gi|295691296|ref|YP_003594989.1| TadE family protein [Caulobacter segnis ATCC 21756] gi|295433199|gb|ADG12371.1| TadE family protein [Caulobacter segnis ATCC 21756] Length = 531 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 52/176 (29%), Gaps = 39/176 (22%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA------KEKLEHIAKGHDDYK 284 PL+ ++ +IN L G +T G + + + D Sbjct: 354 TPLSSDRVTLKGQINALSIGGSTAGQIGFAWGWYMVSPNFGYLWPNATQRPAPYNSKDLV 413 Query: 285 KYIIFLTDGENSSP---------------------------NIDNKESLFYCNEAKR--R 315 K ++ +TDG ++P ++ C+ K Sbjct: 414 KVVVLMTDGAFNTPYCKGVIAKDAGSGSGAVDDHINCVATNGDAFTQTRKLCDAMKDPSL 473 Query: 316 GAIVYAIGVQAEAADQ---FLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +G LK CA+ Y +L AF I +E+ RI Sbjct: 474 KLTIFTVGFDVGGDANAVNMLKYCATDAQHVYFPATGSELKTAFKSIAQEISSLRI 529 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 80/221 (36%), Gaps = 39/221 (17%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA--TKI 63 R + +G+I+I A+L + I++ +I+ + ++ LD + L A T + Sbjct: 8 FRRLGRDDRGAIAIQFALLAIPMSILVFALIDLGRISLQRHQMQDALDAATLMAARSTAV 67 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 + E + G + I + F N T+ Sbjct: 68 TDAELESVGDP-------AFLAEIAGLNLGLSASNASFKAGAGNHIIGTA---------- 110 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 +A + + ++ T + L TS V SSK+ L++ +VLD++ SM+ Sbjct: 111 ----TATVKPIIANLWTTDDF-------NLTATSDVVRSSKN---LEVAVVLDITGSMSG 156 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +++DI+ + +V +S Sbjct: 157 S------RITDLKTGASDLVDIVVKDQQAPFYSKVAIVPYS 191 >gi|118087212|ref|XP_424219.2| PREDICTED: similar to matrilin 2 [Gallus gallus] Length = 1799 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 75/200 (37%), Gaps = 26/200 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + LD++ ++D S S+ + I +L + PD R GL+ + S Sbjct: 28 NKRLDLVFIIDSSRSVRPY------DFEKVKEFILTILQFLDVSPDA---TRVGLIQYGS 78 Query: 226 KIVQTFPLAWGV--QHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L Q I+ + R+ + T + ++YA N F E + + + Sbjct: 79 TVKHEFSLKTFRRKQEIERAVRRMMHLATGTMTGLAIQYAVNIAFSESEGARPL---NQN 135 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 + I+ +TDG P +A+ G +++AIGV + + P + Sbjct: 136 VPRIIMIVTDGRPQDPVG------EIAAKARNSGILIFAIGVGRVDMNTLKSIGSEPHEE 189 Query: 341 RFYSVQNSRK---LHDAFLR 357 + V N + L F Sbjct: 190 HIFLVANFSQIETLTSVFQT 209 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 86/207 (41%), Gaps = 27/207 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + D + + + +LD ++ P R GL+ +S Sbjct: 1484 TEGPVDLVFVIDGSKSLGE------DNFEIVKQFVSGILDTLEISP---KAARVGLLQYS 1534 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ F L + +++ ++++ G + + L+ + + F E ++ Sbjct: 1535 SEVRTEFTLRRFSTAKDMKKAVSQMKYMGRGSMTGLALKQMFERSFTETEGARPLSAN-- 1592 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD- 340 + I TDG E + AKR G I+YAIG+ ++ L+ + P Sbjct: 1593 -IPRISIVFTDGRAQD------EVSEWAARAKRSGIIIYAIGIGKAIEEELLEIASEPSY 1645 Query: 341 -RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ A I +E+ Q Sbjct: 1646 KHLFYAED----FTAMEDISEELKVQI 1668 >gi|52549995|gb|AAU83844.1| cell surface protein [uncultured archaeon GZfos34G5] Length = 1357 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 54/290 (18%), Positives = 104/290 (35%), Gaps = 41/290 (14%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSS--------- 149 N F ++I N I + DY + E +FC A S Sbjct: 870 NNFRENIGNEYLPLPTDITYEGLFYDYYFDTGEKAECQKLFCPSYSYALSKDPVSEVLGY 929 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS- 208 + + + S + S L++++VLD+S SM F + E D KS Sbjct: 930 YLSVGLNSGIIESDFQRKKLNLVIVLDISGSMGSSFDEYYYDRFGNRVDVNETEDAEKSK 989 Query: 209 -----------IPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTT 253 + + + R GLV F++ P++ +Q ++ + + T Sbjct: 990 IEIAAAAIVALLDHLEDDDRLGLVLFNTGAELAEPVSLIGAKNMQKLKGDVLEISATDGT 1049 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE--SLFYCNE 311 + + G++ A ++D E + +Y+ IIFLTD + + + N Sbjct: 1050 RLSAGMQMATE-LYD-----EFLEVNQSEYENRIIFLTDAMPNLGQTSEESLLGMTEANA 1103 Query: 312 AKRRGAIVYA--IGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLR 357 K VY IG+ + + ++ +YSV ++++ + Sbjct: 1104 NKN----VYTTFIGIGVDFNTELVEYITKIRGANYYSVHSAKQFKERMDD 1149 >gi|257093736|ref|YP_003167377.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257046260|gb|ACV35448.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 452 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 67/205 (32%), Gaps = 26/205 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + K+ + +V+D S SM+ GP + + + I + L+ Sbjct: 36 PLATEKKARKPYHLALVIDRSGSMS---GPPLAEAVRCAKHIADQLEPTDIAS------- 85 Query: 218 SGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LV F ++ P Q + ++R+ G +T G + + + A + Sbjct: 86 --LVVFDDRVQTLVPPRPVGDRQALHLALSRVHSGGSTNLHGGWQAGADGLLPAAGQAAL 143 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNI-DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 +I L+DG + I D C +A RG G+ + + + Sbjct: 144 AR---------VILLSDGNANVGEITDPAGIAALCAQAAERGVSTSTYGLGSHFNEDLMV 194 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLR 357 A Y + L + F Sbjct: 195 EMAKRGGGNHYYGDTAADLFEPFAA 219 >gi|125829720|ref|XP_698253.2| PREDICTED: collagen alpha-1(VI) chain [Danio rerio] Length = 1000 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 79/212 (37%), Gaps = 20/212 (9%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG---MDKLGVATRSIREMLDIIK 207 + V SS + +D+ VLD S S+ P ++++ T+ + L ++ Sbjct: 15 WAGIFAQDVSRSSFRECPVDLFFVLDTSESVALRAKPPEFYINQIKTFTKLFIDELKDLR 74 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLI-FGSTTKSTPGLEYAY 263 D + SG + +S L ++ I+R+ G T + ++ Sbjct: 75 QPCDRDVTWNSGALHYSDDTELVMGLVDLNTKRADLKAAIDRIKYIGKGTYTDCAIKEGI 134 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL-FYCNEAKRRGAIVYAI 322 ++ A H KYI+ +TDG + + + NEA++ V+A+ Sbjct: 135 AELLRAG--------SHYHENKYIVVVTDGHPITGYKEPCGGIQEAANEARQHAIKVFAV 186 Query: 323 GVQAEAADQFLKNCASP----DRFYSVQNSRK 350 + + D L A+ F + NSR Sbjct: 187 AISPDQEDTRLSVIATDINYRQNFTAADNSRS 218 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV + M+ G + R + S N VR G+ +S+ + Sbjct: 808 DVLIMMDSSASVGAKNFEMTKDFSRMLAKRFLSAERGNFQVRIGVGQYSNNANLEAEFST 867 Query: 236 G-VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 Q + + + + T+ T L +A +G KK ++ +DG Sbjct: 868 NATQVVAQIADAKFQNAGTQVTNALNFAIE-----------RFRGGRTRKKKLLVFSDGR 916 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD----------RFYS 344 + N E + G +Y + V + + L+ S + Sbjct: 917 SQGVNSIQIE--KAVEQVSNAGIELYVLAVGNQVNEAHLRTLVSRGRPYDNTYAYRHLFK 974 Query: 345 VQNSRKLHDA--FLRIGKEMV 363 V + R L + + +++ Sbjct: 975 VPDYRSLVTGVFYQTVSRKIS 995 >gi|24375056|ref|NP_719099.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349804|gb|AAN56543.1|AE015791_7 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 621 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 64/341 (18%), Positives = 117/341 (34%), Gaps = 49/341 (14%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST- 112 + A LN + ++N F ++ I E+ + F+ D++ +T Sbjct: 104 AASSSVAAPGLNDDWQGAVLPERNQFEKQVQNGI---MVAGEIPVSTFSIDVDTGSYTTL 160 Query: 113 --SLSIIIDDQHKDYNLSAVSRY-----------EMPFIFCT--FPWCANSSHAPLLITS 157 L Q + + Y E PF T P N L I Sbjct: 161 RRMLKEGRLPQKDTLRVEEMLNYFSYNYPQPNKNEAPFSVTTELAPSPYNDDMMLLRIGL 220 Query: 158 SVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 SK+++G ++ +LDVS SM DKL + +++ + + V+ VV Sbjct: 221 KGYEQSKAELGASNLVFLLDVSGSMASD-----DKLPLLQTALKMLTQQLDEQDKVSIVV 275 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +G +V ++ + + +L G +T G++ AY +H Sbjct: 276 YAGAAG----VVLDGAAGNDIKILTYALEQLTAGGSTNGAEGIQLAYQL------AQKHF 325 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKN 335 KG + +I TDG+ + + E + K+ G + +G D ++ Sbjct: 326 VKGGINR---VILATDGDFNVGTTNLDELVDLVEVQKKHGIGLTTLGFGMGNYNDHLMEQ 382 Query: 336 CASPDR----FYSVQNS------RKLHDAFLRIGKEMVKQR 366 A+ + N +L L I KE+ Q Sbjct: 383 LANKGNGQYAYIDSVNEARKVLVEQLGATLLTIVKEVKVQV 423 >gi|159027742|emb|CAO89612.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 416 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 75/220 (34%), Gaps = 29/220 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +L ++ +++ +++ +VLD S SM L ++ +++ + Sbjct: 26 MLSIAATSEQINTNLPINLCLVLDHSGSMQGK------PLETVKKAALSLIESLGVND-- 77 Query: 213 NNVVRSGLVTFSSKIVQTFP---LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R ++ F + P + I+ KI +L G T G++ + Sbjct: 78 ----RLSVIAFDHRAKVILPSQSRQDDLTLIRSKIQQLRAGGGTAIDEGIKLGIQE---- 129 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + G Y +I LTDGEN DN+ L A G + G Sbjct: 130 ------SSSGSKGYVSHIFLLTDGENEHG--DNQRCLKLAAVAAEYGITLNTFGFGDHWN 181 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L+ A + ++ + F R+ + R+ Sbjct: 182 QDILEKIADIAGGSLSYIERPEQALIEFTRLFNRLQSVRL 221 >gi|301617277|ref|XP_002938060.1| PREDICTED: matrilin-4-like [Xenopus (Silurana) tropicalis] Length = 721 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 45/189 (23%), Positives = 78/189 (41%), Gaps = 25/189 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D++ ++D S S+ M K M+DII S+ + R G+V +S Sbjct: 27 KSGPMDLVFIIDSSRSVRPFEFETMRKF---------MIDIINSLEVGLSTTRVGVVQYS 77 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ F L +++ IN +I T + ++YA N F +E ++K Sbjct: 78 SQVQTVFSLKTFSNKSDMEKAINEIIPLAQGTMTGLAIQYAMNVAFTEEEGARPLSKN-- 135 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + I +TDG +A+ G +YA+GVQ A L+ AS Sbjct: 136 -IPRVAIIVTDGRPQD------RVTEVAVQAREAGIEIYAVGVQ-RADVSSLRAMASHPL 187 Query: 339 PDRFYSVQN 347 D + V++ Sbjct: 188 DDHVFHVES 196 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 68/184 (36%), Gaps = 27/184 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ V+D S S+ + + ++D + GLV +SS++ Sbjct: 465 IDLVFVIDGSKSVRPQ------NFELVKEFVINIVDSSAISAQGTH---IGLVQYSSRVR 515 Query: 229 QTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 FPL+ Q I+ + + T + L++ + F E + K Sbjct: 516 TEFPLSQYTNGQDIKTAVKNIQYMEKGTMTGLALKHMVEQSFSEAEGARK------NVPK 569 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---F 342 + TDG + + + +AK G +YA+GV D+ + + P F Sbjct: 570 IGLVFTDGRSQD------DISEWAKKAKEAGITMYAVGVGKAVEDELNEIASDPVNKHSF 623 Query: 343 YSVQ 346 Y+ Sbjct: 624 YTAD 627 >gi|92117939|ref|YP_577668.1| hypothetical protein Nham_2418 [Nitrobacter hamburgensis X14] gi|91800833|gb|ABE63208.1| conserved hypothetical protein [Nitrobacter hamburgensis X14] Length = 483 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 33/156 (21%), Positives = 55/156 (35%), Gaps = 24/156 (15%) Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + ++ KIN L+ T ++ A+ + Y II L+DG Sbjct: 328 STDDSTLKGKINNLVANGATNQAIAMQMAWMMLQPTAPFPAPAKDEKYKYTDAIILLSDG 387 Query: 294 ENSSPNI-----------DNKESLFYCNEAKRRGA---------IVYAIGVQAEA--ADQ 331 N+ D +++L CN K +Y I V + Sbjct: 388 LNTQDRWYGNGSDWSSQVDTRQALL-CNNIKNDPISKTDPTRRTRIYTIQVNTDGDPEST 446 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 LKNCA+ F+ + + AF +IG + + RI Sbjct: 447 VLKNCATDG-FFPTSTASGIASAFAQIGASLSQLRI 481 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/221 (14%), Positives = 73/221 (33%), Gaps = 26/221 (11%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F + +G+I+ + AI L + +G ++ + ++ + +D + L A Sbjct: 16 RFGQDLRGNIAPIFAIALLPMLGFVGAAVDYTRANAARSSMQAAMDSAALMVA------- 68 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + + + Q F N L N AQ + T + + Sbjct: 69 --KDANAASPQMTADQVTAAAQKYF-NALYHNTDAQGASVSAVYTPYN-----NGTPATV 120 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + F P + + ++ + L + M LDV+ SM Sbjct: 121 VLSGSGNVQTDFM------KVVGFPQISFKTNSTATWGNTKLRVAMALDVTGSM-----S 169 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 KL + ++++D +K+ V ++ F+ + Sbjct: 170 SAGKLVQMKIAAKKLIDTLKASATAEGDVYISIIPFNVMVN 210 >gi|328954590|ref|YP_004371924.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] gi|328454914|gb|AEB10743.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] Length = 543 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 80/213 (37%), Gaps = 27/213 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S++ S + L +++ +DVS SM + A ++ LD ++ V Sbjct: 76 SLEPISSAKSPLSVVLAIDVSGSMKGE------PMAEARKAAAIFLDELEKDDHV----- 124 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 L+TF + A ++E++ L + Y +A E+ Sbjct: 125 -ALITFGQGVYHLSDFAAKKHEVREQLQHLEAKEQ------WTWLYQATLEAMERAVQAP 177 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +I LTDG++ + + L + K +YA+G ++A +L+ A Sbjct: 178 TTRAA----VILLTDGKDEGSPVSEEAVL---DRIKGAQVPIYAMGFGSKAQVDYLQKVA 230 Query: 338 SPDR--FYSVQNSRKLHDAFLRIGKEMVKQRIL 368 S + F S + L + + + + Q IL Sbjct: 231 SASQGAFLSTPQAADLTNLYQTVLDYLKNQYIL 263 >gi|320102039|ref|YP_004177630.1| VWFA-like domain-containing protein [Isosphaera pallida ATCC 43644] gi|319749321|gb|ADV61081.1| VWFA-related domain protein [Isosphaera pallida ATCC 43644] Length = 784 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 86/207 (41%), Gaps = 41/207 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM +++G ++ L ++P + ++ F+S + Sbjct: 531 VVLVLDTSGSMLQD-----NRIGALKEAVGVFL---GTLPPGSK---VAVIEFNSFVN-- 577 Query: 231 FPLAWGVQH---------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 PL +G + ++ ++NR T ++ A I + + Sbjct: 578 -PLVFGPANEIFTTRFDDVKSQVNRFRANGGTSYYDAVDRALELIANQTGRRA------- 629 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCA-- 337 ++ LTDGE++S + +S+ +A+ G V+ +GV E + L+ A Sbjct: 630 -----VLALTDGEDTSSRLAGLDSVIL--KARNLGLPVHTLGVGREDEIEVGELQRLARE 682 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + R++ +++ KL F + + + + Sbjct: 683 TRGRYFPARDATKLRVIFAELAQSLRE 709 >gi|149909538|ref|ZP_01898192.1| TadG-like protein [Moritella sp. PE36] gi|149807443|gb|EDM67394.1| TadG-like protein [Moritella sp. PE36] Length = 405 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 60/414 (14%), Positives = 120/414 (28%), Gaps = 71/414 (17%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 G ++L A+++P F + L + + KA+L + ++L A + + Sbjct: 5 RKQSGHAAMLFAMMIPAFFGIFTLASDGARALQSKARLEDAAEAAVLAIAAHNADNSGSS 64 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 +G + I Q + N+I + ++ Y + A Sbjct: 65 SGSAINKKIASDWIGQYMQDMQAIS-DIKITKLNCNDIAECKEGLENGESRYFQYEILAK 123 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPG 188 + + F S + ++ K S+S +D+M V D S SM G Sbjct: 124 TNHLSWFPGNNSTAGFGESFDVVGSATARKFQSES---VDVMFVSDFSGSMNNKWSGGSN 180 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVV--RSGLVTFSSKI------------------- 227 + + I +++ + + + R G F++ Sbjct: 181 SRRYKDLIKIIGDVIKELDKFNNAHTTTTNRVGFTGFNTYTRKTADNSCYQDQYDRSAGR 240 Query: 228 --------------------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + + I G T S G+ + Sbjct: 241 TVNKIFEVKGCKSRSSGGAKFHDIAMTDNYNEFKNTIKYFKPGGGTASYQGIIRGAQMMD 300 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEA------------KR 314 A E ++ +I L+DG +S + NK C++ K Sbjct: 301 AAPEPR---------PRRIMIILSDGIDSKRSRANKLVEEGMCSKILLKLGNANTSDGKA 351 Query: 315 RGAIVYAIGVQA-EAADQFLKNCASPDRFYSVQNSRK-LHDAFLRIGKEMVKQR 366 + +G A++ L C Y N L+ I +E+ + Sbjct: 352 IKTKMAVVGFDYNPASNPSLAKCVGEHNVYGANNPEDVLNKILELISEEIGHLK 405 >gi|182414211|ref|YP_001819277.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841425|gb|ACB75677.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 611 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 76/215 (35%), Gaps = 40/215 (18%) Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLL--ITSSVKISSKSDIGLDMMMVLDVSLSM 181 + SA SR P++ + + P + V D ++++ +D+S SM Sbjct: 51 THRTSAPSRAR-PWLSLALAFGVIAVARPQWGRLEEPV-----FDQAREILIAIDLSRSM 104 Query: 182 N--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 D +D+ + T+S+ E L + R GL+ FS PL+ + Sbjct: 105 LAPDVKPSRLDRAKLLTQSLLEKL----------SGERVGLIVFSGTAFLQSPLSSDYEI 154 Query: 240 IQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 ++E + L T + A +++I L+DGE Sbjct: 155 LREFLPALDPTFLPEGGTNYDALINTALTAFGATG-----------AADRFLIILSDGEA 203 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + + E K RG V A+GV A Sbjct: 204 TEDDW-----RSHVAELKNRGIRVIALGVGTTAGA 233 >gi|148669822|gb|EDL01769.1| von Willebrand factor A domain containing 2 [Mus musculus] Length = 748 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 62/172 (36%), Gaps = 21/172 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +D+ ++ G + R D + P VR G + F S Sbjct: 3 CSAAVDILFLLDGSHSIGKGSFERSKRFAIAACDALDISPGR---VRVGALQFGSTPHLE 59 Query: 231 FPLAW--GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 FPL Q ++E I ++F T++ L+ + + + + Sbjct: 60 FPLDSFSTRQEVKESIKGIVFKGGRTETGLALKRLSRGFPGGR---------NGSVPQIL 110 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 I +TDG++ P + + RG +V+A+GV+ D+ L + P Sbjct: 111 IIVTDGKSQGPVA------LPAKQLRERGIVVFAVGVRFPRWDELLTLASEP 156 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 54/161 (33%), Gaps = 22/161 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ IR+ PDV GLV + S++ Sbjct: 449 SLDLVFLLDASASVGRE------NFAQMQSFIRKCTLRFDVNPDVTQ---VGLVVYGSRV 499 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + +++ + S A I D ++ A+ K Sbjct: 500 QTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAG---TALLHIEDKVMTVQRGARPGVP--K 554 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ LT G +++ + + G V + V A Sbjct: 555 AVVMLTGG------SGAEDAAVPAQKLRGNGISVLVMSVGA 589 >gi|72162840|ref|YP_290497.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916572|gb|AAZ56474.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 609 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 79/217 (36%), Gaps = 36/217 (16%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ ++++V+D S SM + G G +L +A + LD ++ R GL Sbjct: 404 AELRKPANVLLVIDTSGSMQESVPGTGSTRLELAKEAAITSLDEF------SDSDRVGLW 457 Query: 222 TFSSKIVQT-------FPL---------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 FS+ + PL + + E+I+ L G T A+ Sbjct: 458 MFSTDLEDNGQDWRELVPLGPLGASVNGTPRREELAERISNLPPGGGTGLYDTALAAHTL 517 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF--YCNEAKRRGAIVYAIG 323 + + A ++FLTDG+N N + E L E ++G ++ I Sbjct: 518 VAEHSRPDAINA---------VVFLTDGKNEDLNGISLEKLLDSITPEPGQQGVRIFTIS 568 Query: 324 VQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRI 358 +A + + A Y + + + + F + Sbjct: 569 YGEDADLKTMTQIAEATNAAAYDASDPQSIDEVFEAV 605 >gi|229587743|ref|YP_002869862.1| hypothetical protein PFLU0165 [Pseudomonas fluorescens SBW25] gi|229359609|emb|CAY46451.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25] Length = 551 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 76/225 (33%), Gaps = 26/225 (11%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S + + PW + + I +S +++ ++DVS SM+ G Sbjct: 158 SPFGVTTEVAATPWNPRTQLLRIGIKAS-DRPVAELAPANLVFLVDVSGSMDRREGLP-- 214 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ--EKINRLI 248 +S ++L + + R LV ++ + G ++ I++L Sbjct: 215 ----LVKSTLKLL-----VDQLREQDRVSLVVYAGESRVVLKPTSGRDKVKIRNAIDQLT 265 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G +T G+E AY + I+ TDG+ + D Sbjct: 266 AGGSTAGASGIELAYQMAREGFIDNGINR---------ILLATDGDFNVGISDFDSLKQM 316 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLK---NCASPDRFYSVQNSRK 350 E ++ G + +G + ++ L A + + N R+ Sbjct: 317 AVEQRKSGVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDNLRE 361 >gi|91205150|ref|YP_537505.1| hypothetical protein RBE_0335 [Rickettsia bellii RML369-C] gi|91068694|gb|ABE04416.1| unknown [Rickettsia bellii RML369-C] Length = 516 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 53/325 (16%), Positives = 107/325 (32%), Gaps = 43/325 (13%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-------S 115 ++N+E+ QK + + NE F + I +S + Sbjct: 161 LVNEEDIT--PFQKAIYHPTDFSQLITQISSNEENSLNFIMNNGAIAQSVQVYTADGKAP 218 Query: 116 IIIDDQHKDYNL--SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 II D + + + +Y +P + + +++ ++ + Sbjct: 219 IIASDLKDGFIIDKQYLLKYLLPIFNGFIWNEEGKFPIMFAPKNPKVLDGENNYAHNISL 278 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK--IVQTF 231 ++D+S SM F V +I ++LD + IP+ + +V F+ + Sbjct: 279 LIDISGSMEKDFS-------VYKNNILKILDKLAEIPNW----QINIVVFNDESTARSFS 327 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 ++ I+ IN L TK ++ A KG D +I T Sbjct: 328 NQENNIEDIKVYINNLKANGYTKLYGTIKEALESF-----------KGKIDESSTLIVFT 376 Query: 292 DGENSSPNIDNKE---SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV--- 345 DG++ N + E K +Y +G +F + A+ F V Sbjct: 377 DGKDEGTNSNVTEKDVVDVTSEVIKNPQFNMYTVGFGQYYNQEFFEQVATRGGFTHVSLN 436 Query: 346 --QNSRKLHDAFLRIGKEMVKQRIL 368 +L I ++++ I+ Sbjct: 437 DPTGMHQLQQYIDNIEQKVMTFEII 461 >gi|26349121|dbj|BAC38200.1| unnamed protein product [Mus musculus] Length = 280 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 76/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D+ ++D S S+N + + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLAFIIDSSRSVNTYHYAKV------KEFILDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 >gi|301756400|ref|XP_002914037.1| PREDICTED: matrilin-2-like isoform 3 [Ailuropoda melanoleuca] Length = 957 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 76/203 (37%), Gaps = 26/203 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 51 SCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLLQ 101 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 102 YGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR-- 159 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 ++ + I+ +TDG +A+ G +++AIGV + + P Sbjct: 160 -ENVLRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDLNTLKAIGSEP 212 Query: 340 --DRFYSVQN---SRKLHDAFLR 357 D + V N L F Sbjct: 213 HEDHVFLVANFSQMESLTSVFQN 235 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 79/208 (37%), Gaps = 33/208 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + + ++D + P R GL+ +S Sbjct: 651 TEGPVDLVFVIDGSKSLGEE------NFEIVKQFVTGIIDSLAVSP---KAARVGLLQYS 701 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 702 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQVEGARPL--- 756 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + I TDG + + ++A+ G +YA+GV ++ L+ AS Sbjct: 757 STRVPRVAIVFTDGRAQD------DVSEWASKAQANGITMYAVGVGKAIEEE-LQEIASE 809 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 PTDKHLFYAED----FSTMGEISEKLKK 833 >gi|22761666|dbj|BAC11648.1| unnamed protein product [Homo sapiens] Length = 451 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHPSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 >gi|67922256|ref|ZP_00515770.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] gi|67855959|gb|EAM51204.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] Length = 416 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 75/226 (33%), Gaps = 28/226 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 +NS + S+V SS + L++ ++LD S SM+ + + + Sbjct: 16 ANQSNSQRQVAISLSAVSESSDRSLPLNLGLILDHSGSMSGK------PMKTVKEAASYL 69 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++ + PD R +V F + P + +++ I L T G++ Sbjct: 70 VEGLG--PD----DRLSVVAFDHRAKVIVPNQPVDEIDGVKDAIASLKAEGGTSIDEGMK 123 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 ++ KE I LTDGEN DN+ L A + Sbjct: 124 LGIKQVALGKEDRVSQ----------IFLLTDGENEHG--DNERCLKLAQVAGEYNITLN 171 Query: 321 AIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVK 364 +G L++ A ++ + F R+ M Sbjct: 172 TLGFGNHWNQDVLESIADSVGGTLCYIEQPEQALTEFSRLFTRMQS 217 >gi|47219688|emb|CAG12610.1| unnamed protein product [Tetraodon nigroviridis] Length = 717 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 ++K + + + +D+ M+ + G + ++ + PD Sbjct: 5 QTSHENIKKITAAGQLMQCSAAMDILFLMDGSYSVGKGSFERSKHYALKLCQALDIRPDK 64 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDA 269 VR GL+ F S F L Q + + ++ + G +T++ L+Y K Sbjct: 65 ---VRVGLIQFGSAPRLEFALDLHATKQELMRHMKKISYRGGSTQTGLALKYVLRKGLPG 121 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 D + +I L+DG++ + + K G +++A+G++ Sbjct: 122 GR-------NSSDAAQIVIVLSDGKSQG------NVMQAAAQLKETGVVLFAVGLRYPRW 168 Query: 330 DQFLKNCAS---------PDRFYSVQN 347 ++ L AS + FY N Sbjct: 169 EE-LHALASEPVENHVFFAEHFYDAVN 194 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 60/180 (33%), Gaps = 16/180 (8%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 LD+ +++ G G D +R + +V + LV +S + F L Sbjct: 455 ALDLVFALDASDGVGRDNFLTLCDFVRSLSVQFDI---NRDVAQLALVAYSRRATTVFNL 511 Query: 234 AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + I + ST A + ++ A+ K ++ +T Sbjct: 512 DTHDSGSAVLTAIGEASYMGGVASTG---TALLHVHSDVLTVDKGARLG--VNKAVVVVT 566 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DG +++ + + G V+ +G+ ++ L+ S + V + L Sbjct: 567 DG------SGGTDAVVPAQKLRDNGVSVFVVGIGDMQREKLLQIAGSEEHLILVPSYEDL 620 >gi|332879903|ref|ZP_08447588.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682114|gb|EGJ55026.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 345 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 76/239 (31%), Gaps = 57/239 (23%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 KI + G+D++ +DVS SM ++L A R E ++ +K R Sbjct: 80 TKIETVKREGVDIVFAIDVSKSMLAEDVAP-NRLEKAKRIAFETINQLKG-------DRV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLE 274 G+V +++ L + + + T + A N D Sbjct: 132 GIVAYAASAYPQLALTTDHSAAKMFLQSMNTNMLSSQGTAIQEAIRMATNYFDDK----- 186 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----- 329 + + ++DGE+ + EA+ +G +Y IGV E Sbjct: 187 ------STTSRLLFIISDGEDHE-----MGATEIAAEAQEKGIHIYTIGVGTEKGSPIPM 235 Query: 330 ----------------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + L+ A + ++ + N+++ +I + K Sbjct: 236 RELGEQSYKRDRNGEVVITRLNKELLQQIAINAGGQYLNGDNTQEAVSQIEKILESTEK 294 >gi|163749961|ref|ZP_02157205.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161330235|gb|EDQ01216.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 648 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 54/277 (19%), Positives = 98/277 (35%), Gaps = 32/277 (11%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT--FPWCANSSHAPLLITSSVKI 161 + + ++ + + +Y S S E PF T P N L I Sbjct: 175 NQGRLPEKGTVRVEEMINYFNYQYSTPSTVEQPFSVNTELAPSPYNEHKMLLRIGLKGYE 234 Query: 162 SSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 KS +G ++ +LDVS SM DKL + S++ + + V+ VV +G Sbjct: 235 VDKSQLGASNLVFLLDVSGSM-----NSRDKLPLLKTSLKMLSQQLSEQDHVSIVVYAGA 289 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +V Q I + +N L G +T G++ AY +H +G Sbjct: 290 SG----VVLDGVKGNDTQAINQALNSLKAGGSTNGGAGIQQAYRL------AQKHFIQGG 339 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA-- 337 + +I TDG+ + D++ + + +G + +G D ++ A Sbjct: 340 VNR---VILATDGDFNVGTTDHQALMDLIAAKRDQGIALTTLGFGQGNYNDHLMEQLADK 396 Query: 338 SPDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L L I K++ Q Sbjct: 397 GNGHYAYIDTLNEARKVLVDELSSTLLTIAKDVKIQV 433 >gi|218296567|ref|ZP_03497295.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] gi|218243109|gb|EED09641.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] Length = 706 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 49/234 (20%), Positives = 83/234 (35%), Gaps = 35/234 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F P + + + G +++VLDVS SM Sbjct: 275 LLFTATPKGLFFGGWDRALPEDLPLKPLGRKGAALVLVLDVSGSMEGE---------KLA 325 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGST 252 ++ L++++S + G+V FSS FP A G + + + L G Sbjct: 326 MAVAGALELVRSAAPED---YLGVVLFSSSPRVLFPPRPMTAQGKKEAESLLLSLRAGGG 382 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T A + D +K ++ L+DG D KE + A Sbjct: 383 TVLGGAFREALRLLQDVP-----------VERKALLVLSDG----IIFDPKEPILA--LA 425 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 G V A+ + +A FL+ A RFY ++L FL+ G+E+ + Sbjct: 426 ATAGVEVSALALGPDADAAFLEALAQRGGGRFYRAATPKELPRLFLKEGQEVFQ 479 >gi|148652289|ref|YP_001279382.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] gi|148571373|gb|ABQ93432.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] Length = 571 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 94/256 (36%), Gaps = 30/256 (11%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLS 180 A + + + PW + + I + +++K +++ ++DVS S Sbjct: 167 FTAAKKQANAPFLVSTEVVNSPWHPTNQIVKVGIKAEDLLTAKQKQPPANLVFLVDVSGS 226 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 M+ DKL +A S++ + +++ + + +G ++K+V Q I Sbjct: 227 MDTE-----DKLQLAKSSLKMLTKQLRAQDSITLITYAG----NTKVVLPSTPGNQTQKI 277 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 I+ L +T ++ AY + EH K + I+ LTDG+ + Sbjct: 278 LNAIDNLTASGSTNGEAAIKLAYQQAT------EHFKKDGINR---ILMLTDGDFNVGVS 328 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--SPDRFYSVQNSR-------- 349 K+ L + +G + +G D ++ A + + + Sbjct: 329 SVKDMLQIIRSNRDKGISLSTLGFGQGNYNDHMMEQVADNGNGNYSYIDSLSEAKKVLID 388 Query: 350 KLHDAFLRIGKEMVKQ 365 ++ F + K++ Q Sbjct: 389 EMSATFNTVAKDVKIQ 404 >gi|332830871|ref|XP_003311907.1| PREDICTED: matrilin-2 isoform 2 [Pan troglodytes] Length = 915 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 609 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 659 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 660 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 714 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 715 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 768 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 769 TNKHLFYAED----FSTMDEISEKLKK 791 >gi|332830869|ref|XP_528309.3| PREDICTED: matrilin-2 isoform 3 [Pan troglodytes] Length = 937 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|332830867|ref|XP_003311906.1| PREDICTED: matrilin-2 isoform 1 [Pan troglodytes] Length = 956 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|119612173|gb|EAW91767.1| matrilin 2, isoform CRA_c [Homo sapiens] Length = 451 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 >gi|119612172|gb|EAW91766.1| matrilin 2, isoform CRA_b [Homo sapiens] gi|119612174|gb|EAW91768.1| matrilin 2, isoform CRA_b [Homo sapiens] Length = 922 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|119612170|gb|EAW91764.1| matrilin 2, isoform CRA_a [Homo sapiens] gi|119612171|gb|EAW91765.1| matrilin 2, isoform CRA_a [Homo sapiens] Length = 941 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|119889916|ref|XP_001252289.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|297473018|ref|XP_002686328.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|296489229|gb|DAA31342.1| Epithelial chloride channel protein-like [Bos taurus] Length = 903 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 45/201 (22%), Positives = 71/201 (35%), Gaps = 34/201 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + G+VTF S Sbjct: 309 VCLVLDKSGSMSSE-----DRLFRMNQAAELFL-----IQIIEKGSLVGMVTFDSVAEIR 358 Query: 231 FPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L V T GL+ + I +++ Sbjct: 359 NNLTKITDDNVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQSTSGSE--------- 409 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 II LTDGE++ + C E K+ G I++ I + AA + L + RFY Sbjct: 410 IILLTDGEDNE--------IHSCIEEVKQSGVIIHTIALGPSAAKELETLSDMTGGHRFY 461 Query: 344 SVQNSRKLHDAFLRIGKEMVK 364 + ++ L +AF RI Sbjct: 462 ANKDINGLTNAFSRISSRSGN 482 >gi|62548862|ref|NP_085072.2| matrilin-2 isoform b precursor [Homo sapiens] Length = 937 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|62548860|ref|NP_002371.3| matrilin-2 isoform a precursor [Homo sapiens] Length = 956 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|37182908|gb|AAQ89254.1| MATN2 [Homo sapiens] Length = 915 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 609 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 659 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 660 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 714 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 715 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 768 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 769 TNKHLFYAED----FSTMDEISEKLKK 791 >gi|14714613|gb|AAH10444.1| Matrilin 2 [Homo sapiens] gi|261858984|dbj|BAI46014.1| matrilin 2 [synthetic construct] Length = 937 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|11125762|gb|AAC51260.2| matrilin-2 precursor [Homo sapiens] Length = 956 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 77/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E Sbjct: 701 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPF--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|62298084|sp|O00339|MATN2_HUMAN RecName: Full=Matrilin-2; Flags: Precursor Length = 956 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|21739491|emb|CAD38787.1| hypothetical protein [Homo sapiens] Length = 1016 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 109 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 159 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 160 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 218 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 219 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKSIGSE 270 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 271 PHEDHVFLVANFSQIETLTSVFQK 294 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 710 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 760 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 761 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 815 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 816 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 869 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 870 TNKHLFYAED----FSTMDEISEKLKK 892 >gi|242066912|ref|XP_002454745.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] gi|241934576|gb|EES07721.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] Length = 737 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 48/216 (22%), Positives = 82/216 (37%), Gaps = 36/216 (16%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +D++ VLDVS SM KL + R++ ++ + S R ++ Sbjct: 287 STTRAPVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIA 334 Query: 223 FSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FSS + FPL + +++ +N L T GL I E AK Sbjct: 335 FSSSARRLFPLRRMTESGRQQSLLAVNSLTSNGGTNIAEGLRKGSKVI------EERQAK 388 Query: 279 GHDDYKKYIIFLTDGENSSPNIDN----KESLFYC----NEAKRRGAIVYAIGVQAEAAD 330 II L+DG+++ K + YC + + V+ G A+ Sbjct: 389 NPVCS---IILLSDGQDTYTVSPTAGVHKGAPEYCALLPSTNGNQQIPVHVFGFGADHDS 445 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 L + + S F ++ + DAF + IG + Sbjct: 446 VSLHSISQTSGGTFSFIETEAAIQDAFAQCIGGLLS 481 >gi|225010242|ref|ZP_03700714.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005721|gb|EEG43671.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 351 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 75/208 (36%), Gaps = 28/208 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPL-LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 S ++ P F + L + K+ + G+D++ +DVS SM Sbjct: 50 KSAFKAPLKFSLQLLGVAAIVIALVNPKAGTKLETVKREGVDIVFAVDVSKSMLAEDIAP 109 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 +++ A R + E+++ + S R G++ ++++ P+ + + + Sbjct: 110 -NRMEKAKRLVSEIINELAS-------DRIGIIAYAAQAYPQLPITTDFGAAKMFLQGMN 161 Query: 249 ----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 T + +E A DA + + + ++DGE+ Sbjct: 162 TDMLSSQGTAISDAIELATTYYNDAAQTN-----------RVLFIVSDGED----HSEGG 206 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ ++A G ++ IGV E Sbjct: 207 AVNAVSKATEAGIKIFTIGVGTEKGAPI 234 >gi|83312059|ref|YP_422323.1| hypothetical protein amb2960 [Magnetospirillum magneticum AMB-1] gi|82946900|dbj|BAE51764.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 1171 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 43/219 (19%), Positives = 76/219 (34%), Gaps = 24/219 (10%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 C + +T + S LD++M+LD S SM G L + R+ L Sbjct: 39 CTVGQRVKVSLTLTAPPSRHPPPPLDVVMLLDHSSSMGAAPGSP---LQMMLRAAGNFLR 95 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + PD R +V F+ LA + + + G T L A Sbjct: 96 QLS--PDS----RVAVVGFNQVPSVHCTLAATPAQARSALQAISPGGATSIAAALNQAVE 149 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + + + K ++ +DG++ I + + + V A+G Sbjct: 150 LLAHGRPGM----------DKVVVLCSDGQDDIAEIADALARLKAIPS----VRVLAVGF 195 Query: 325 QAEA-ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 E FL A ++ + +R + D F R+ KE+ Sbjct: 196 GDEVIHATFLAMVADRQDYFHLTRARDMDDVFQRLAKEV 234 >gi|172037673|ref|YP_001804174.1| hypothetical protein cce_2760 [Cyanothece sp. ATCC 51142] gi|171699127|gb|ACB52108.1| unknown [Cyanothece sp. ATCC 51142] Length = 423 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 69/224 (30%), Gaps = 28/224 (12%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + S + + L++ ++LD S SM + + ++D Sbjct: 26 AISLSAVTESSAPQSRSLRDRTLPLNLGLILDHSGSMTGK------PIKTVKEAAMRLVD 79 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + + R +V F + P ++ +++ I RL T G++ Sbjct: 80 GLGASD------RLSVVAFDHRAKVIVPNQPVDDIERVKQAIERLKPEGGTSIDEGMKLG 133 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ K+ I LTDGEN DN+ L A V + Sbjct: 134 IKEVALGKDDRVSQ----------IFLLTDGENEHG--DNERCLKLAQVAAEYNITVNTL 181 Query: 323 GVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVK 364 G L++ A ++ + F R+ + Sbjct: 182 GFGNHWNQDVLESIADAVGGTLCYIEQPEQALTEFSRLFTRIQS 225 >gi|307353371|ref|YP_003894422.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307156604|gb|ADN35984.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 317 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 90/234 (38%), Gaps = 41/234 (17%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + ++ G++++ LD S SM D++ A +I +++ + Sbjct: 73 IGLAGPQFPLEQTKEGVNIVFALDTSGSMEAADYQP-DRITAAKEAIGTLINQLDLKD-- 129 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKE 271 +G++TF S L+ Q + EK+ + +T GL A + Sbjct: 130 ----YAGIITFDSGASTAAYLSPDKQRVIEKLGMIAASDDSTAIGDGLALAVDM------ 179 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------ 325 +K + K +I L+DGE+++ + + + AK G V+ + + Sbjct: 180 -----SKSIPNRKSVVILLSDGESNAGYVSPETA---AEFAKESGVQVFTVAMGSSEKVL 231 Query: 326 -----------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A ++ L+ A + FYS + + L + + ++ +V ++ Sbjct: 232 VGYDWANNPQYATVDEETLEYIADSTGGGFYSSVDEKTLGNIYSQLDDAIVHEK 285 >gi|223936328|ref|ZP_03628240.1| von Willebrand factor type A [bacterium Ellin514] gi|223894846|gb|EEF61295.1| von Willebrand factor type A [bacterium Ellin514] Length = 657 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 60/207 (28%), Gaps = 32/207 (15%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ M + + + P + GLD+++ +D S SM Sbjct: 56 KFRMALVVAAVVFLILTLARPQW---GFTLEEARQRGLDILVAIDTSNSMLAEDIQPNRL 112 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---- 247 ++ M R GLV F+ PL + I+ L Sbjct: 113 ARARLAALDLM--------HRARTDRMGLVAFAGTAFLQCPLTLDDAAFSQSIDSLDTRT 164 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 I T + A + K+ K ++ TDGE+ + Sbjct: 165 ISEGGTALAEAINTARETFKNEKDNH-----------KVLVLFTDGEDQDMGAVSAAEKA 213 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLK 334 G +++ IG+ + L+ Sbjct: 214 A-----AEGMLIFTIGIGTPDG-ELLR 234 >gi|148657647|ref|YP_001277852.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569757|gb|ABQ91902.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 966 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 48/215 (22%), Positives = 82/215 (38%), Gaps = 34/215 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +K L ++MV+D S SM + G +KL +A ++ + + I V GLV Sbjct: 403 TKQQPDLALVMVIDRSGSMAEPVAGGRRNKLDLAKEAVYQASLGLTPIDQV------GLV 456 Query: 222 TFSSKIV---QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 F Q PL + I+ + G T PG+E A + K++H Sbjct: 457 VFDDTANWVLQLQPLP-SMVEIERALGSFGIGGGTNIRPGIEQAALALASTDAKIKH--- 512 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-A 337 ++ LTDG S D + + G + + V +A + A Sbjct: 513 --------VLLLTDGIAESNYSD------LIAQMRASGITISTVAVGLDANPNLVDVANA 558 Query: 338 SPDRFYSVQNSRKLHDAFL-----RIGKEMVKQRI 367 R Y V + ++ FL G+++++Q I Sbjct: 559 GGGRSYRVTSIDEVPRIFLQETIIAAGRDIIEQPI 593 >gi|323699770|ref|ZP_08111682.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323459702|gb|EGB15567.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 2034 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 40/257 (15%), Positives = 80/257 (31%), Gaps = 35/257 (13%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 L++ ++ ++ + S Y P P A+ + + V ++ + LD+++ Sbjct: 361 LAVNAVERKGNHVDATFSLYG-PDAEGMVPLAADLTVREGGSPARVVSVERTRLPLDIVL 419 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 ++D S SM +D ++ + VR V F +K Sbjct: 420 LVDSSGSMKGQMRNALDATRKFIAAL-----------PADARVRV--VDFDTKPRALPGE 466 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + + T + + + AK ++ TDG Sbjct: 467 TRD--AALKGLAGIKANGATCLNDAVLLGLHMLAGAKRPA-------------LLVFTDG 511 Query: 294 ENSSPNIDNKES----LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQN 347 +++ N S + K G V+ IG L A S R+Y + Sbjct: 512 FDANFNDTGPGSKATRREVLDAVKTGGVPVFTIGFGKGHDVSTLDRIASLSGGRYYPASD 571 Query: 348 SRKLHDAFLRIGKEMVK 364 L AF + + Sbjct: 572 PGALDKAFAVVNANLAN 588 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 63/193 (32%), Gaps = 20/193 (10%) Query: 175 LDVSLSMND--HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +D+S SM+ F ++ + + L ++PD V + +TFS + V Sbjct: 612 VDISGSMDKTPDFSGCNYRMDKVKAILHDFL---AALPDE---VLAQGMTFSDQNVIEQV 665 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + +N L T+ + A ++Y++F+TD Sbjct: 666 TTANTGEMLAAMNDLYADGGTEIAGAVAAVLET---------QRAIPST--RRYLLFITD 714 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYSVQNSRKL 351 D + + G +G+ F + S + ++ +L Sbjct: 715 AALDVEPEDKLFFETTLAKLRDEGVYCLWVGIGELDPAPFKRAAEISGGSYVLTEDPAEL 774 Query: 352 HDAFLRIGKEMVK 364 AF + ++ K Sbjct: 775 GRAFDGLVADIRK 787 >gi|315223476|ref|ZP_07865333.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] gi|314946649|gb|EFS98640.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] Length = 347 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 60/174 (34%), Gaps = 20/174 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 KI + G+D++ +DVS SM ++L A R E + +K R Sbjct: 80 TKIETVKREGVDIVFAIDVSKSMLAEDVAP-NRLEKAKRIAFETISQLKG-------DRV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G+V +++ L + + + + ++ A + Sbjct: 132 GIVAYAASAYPQLALTTDHSAAKMFLQGMNTDMLSSQGTAIQEAI-------RMASNYFD 184 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + + LTDGE+ + EA+ +G +Y IG+ E Sbjct: 185 ENTPTARLLFILTDGEDHE-----MGATEIATEAQEKGVHIYTIGIGTEKGAPI 233 >gi|282900569|ref|ZP_06308511.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194369|gb|EFA69324.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 418 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 74/223 (33%), Gaps = 28/223 (12%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + S S++ + S + L++ ++LD S SM G ++ + A Sbjct: 19 SGSQRLMATSVSAIGETIDSRVPLNLCLILDHSGSMK---GQPVENVKRA---------A 66 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + + + R +V F+ + HI+++INRL T GL Sbjct: 67 WLLVDKLRDQDRLSIVVFNHRAEVLLSNQNVVDRDHIKQQINRLSANGGTSIDEGLRLGI 126 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ ++ A LTDGEN DN L + A V +G Sbjct: 127 EELAKGRKDTISQA----------FLLTDGENEHG--DNNRCLKFAQLAADYNLTVNTLG 174 Query: 324 VQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 L+ A +++ + D F + M Sbjct: 175 FGNNWNQHILEKISDAGLGSLSHIEHPDQAMDKFDSLLTRMQT 217 >gi|282877523|ref|ZP_06286341.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300347|gb|EFA92698.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 332 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 78/251 (31%), Gaps = 44/251 (17%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLG 193 M TF P S + S + G+D+M+ +DVS SM + P ++L Sbjct: 57 MVLRCLTFILVVCVLARPQTRNSWDQRSVE---GIDIMLAMDVSTSMLAEDLRP--NRLE 111 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 A E + GL F+ + P+ + L+ T Sbjct: 112 AAKNVAAEFI-------SGRPNDNIGLTIFAGESFTQCPMTTDHASLLN----LLRNVRT 160 Query: 254 KSTP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 GL + K K +I +TDG N+ +I S A Sbjct: 161 DIAARGLISDGTAVGMGLANAVSRLKDSKAKSKVVILITDGSNNMGDISPMTS---AQIA 217 Query: 313 KRRGAIVYAIGVQAEA---------------------ADQFLKNCA--SPDRFYSVQNSR 349 + G VY IGV + L + A + FY N++ Sbjct: 218 QSLGIRVYTIGVGTNKVAPYPMNVGGTTQYVNIPVEIDSKTLSDIAAVTEGNFYRATNNK 277 Query: 350 KLHDAFLRIGK 360 +L + I K Sbjct: 278 ELKQIYNDIDK 288 >gi|302868694|ref|YP_003837331.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504835|ref|YP_004083722.1| von willebrand factor type a [Micromonospora sp. L5] gi|302571553|gb|ADL47755.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315411454|gb|ADU09571.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 316 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 67/211 (31%), Gaps = 32/211 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+ +DVSLSM ++L A + + + +P+ N GLV+F+ Sbjct: 89 VMLAIDVSLSMQADDVSP-NRLEAAQEAAK---QFVGELPESYN---LGLVSFAKSANVL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + I+ L+ T + + I I+ L Sbjct: 142 VPPTKDRAAVTTAIDGLVLAEATATGEAVFTCLEAIRSVPADGAAGIPPAR-----IVLL 196 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----ADQF---------LKNC 336 +DG +S + + + V I ++ Q L Sbjct: 197 SDGYRTSGRSVEEAAAAA----QAANVPVSTIAFGTDSGQVDIGGQLQRVPVDRTALSQL 252 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A + FY ++ +L + +G + + Sbjct: 253 AETTQGFFYEAASASELKQVYQDMGSSIGYR 283 >gi|310657870|ref|YP_003935591.1| hypothetical protein CLOST_0560 [Clostridium sticklandii DSM 519] gi|308824648|emb|CBH20686.1| exported protein of unknown function [Clostridium sticklandii] Length = 873 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 44/227 (19%), Positives = 83/227 (36%), Gaps = 40/227 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +D+++V+D S SM D + ++ SI D N R G++ FS Sbjct: 397 EKPVDVILVIDTSGSMGTR--IPGDSKAPLYYAKLAAINFANSIIDENPDSRVGVIEFSG 454 Query: 226 KIVQTFP-------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L ++ IN L + T G AYNKI Sbjct: 455 GYYGYASDASTVINLTNNKANLASSINGLTTHNMTNIQAGFRLAYNKIS--------AIS 506 Query: 279 GHDDYKKYIIFLTDG---------ENSSPNIDNKESLFYCNEAKRRGAIV----YAIGVQ 325 D K ++FLTDG +S+P + N ++ E + + + + IG+ Sbjct: 507 STRDSVKSVVFLTDGVANVSIGNWSSSNPVVHNTHTIAAYTEGQSLYSYINGNLFTIGLF 566 Query: 326 AEAADQFLKNCASP----------DRFYSVQNSRKLHDAFLRIGKEM 362 ++ +K+ A +++Y ++ L + I +++ Sbjct: 567 GAISNSSVKSIARDTLQKAVYDDLEKYYEASSAVDLGPVYETISQKL 613 >gi|30687725|ref|NP_850306.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|330254526|gb|AEC09620.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 692 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 84/219 (38%), Gaps = 37/219 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +IS +D++ VLD+S SM KL + R++ ++ + S R Sbjct: 243 QISRYPRAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND------RLS 290 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ FSS + FPL G Q + +N ++ T GL + D ++K Sbjct: 291 VIAFSSTARRLFPLTKMSDAGRQRALQAVNSVVANGGTNIAEGLRKGVKVMEDRRDKNPV 350 Query: 276 IAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFY------CNEAKRRGAIVYAIGVQAE 327 + II L+DG ++ D L C KR V++ G ++ Sbjct: 351 AS---------IILLSDGRDTYTMNQADPNYKLLLPLSMHGCES-KRFQIPVHSFGFGSD 400 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + + + S F +++ + DA + IG + Sbjct: 401 HDASLMHSVSETSGGTFSFIESESVIQDALAQCIGGLLS 439 >gi|3928084|gb|AAC79610.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 689 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 84/219 (38%), Gaps = 37/219 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +IS +D++ VLD+S SM KL + R++ ++ + S R Sbjct: 240 QISRYPRAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND------RLS 287 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ FSS + FPL G Q + +N ++ T GL + D ++K Sbjct: 288 VIAFSSTARRLFPLTKMSDAGRQRALQAVNSVVANGGTNIAEGLRKGVKVMEDRRDKNPV 347 Query: 276 IAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFY------CNEAKRRGAIVYAIGVQAE 327 + II L+DG ++ D L C KR V++ G ++ Sbjct: 348 AS---------IILLSDGRDTYTMNQADPNYKLLLPLSMHGCES-KRFQIPVHSFGFGSD 397 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + + + S F +++ + DA + IG + Sbjct: 398 HDASLMHSVSETSGGTFSFIESESVIQDALAQCIGGLLS 436 >gi|326672754|ref|XP_002664126.2| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 572 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 43/208 (20%), Positives = 87/208 (41%), Gaps = 29/208 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S++D + K +++I S + G+V +S Sbjct: 38 APSDLVFILDGSWSVDDINFEIVKKW---------LVNITMSFNIGQKFTQVGVVQYSDD 88 Query: 227 IVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL + I + G T + +++A +K+F E +G + Sbjct: 89 PFLHIPLGKHFSS-SDLIKAMESIEYMGGNTNTGRAIKFANDKLFALSE------RGPNG 141 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA---SP 339 K + LTDG++ E L A+++G I++AIGV +E + L+ A S Sbjct: 142 IAKIAVVLTDGKSQD------EVLAAAEAARKKGIILFAIGVGSETEEAQLRAIANKPSS 195 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +SV++ + + I +++ ++ + Sbjct: 196 TYVFSVKDYKAIAKIREVIRQKLCEETV 223 >gi|299139026|ref|ZP_07032203.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599180|gb|EFI55341.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 318 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 85/215 (39%), Gaps = 37/215 (17%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +S L++++ +D S S+ F + A + + +L I L+ Sbjct: 84 RESSTPLEIVLAIDASESV---FNDEHLEREAAKKFMASLLRKQDQID---------LMD 131 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F+ + + VQ I + R+ G T + A ++ + Sbjct: 132 FADDVDELVSFTSDVQKIDSGLGRIHHGDATALYDAVYLASQRLGETPTSAGQR------ 185 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA---IGVQAEAAD--------- 330 + ++ +TDGEN++ + +L +A+R GA++YA + V A+A Sbjct: 186 --RVLVLITDGENTTHHGSYDAAL---EQAQRAGAMIYALIIVPVSADAGRNTGGEHALI 240 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 Q ++ + ++Y V++ L AF + ++ Q Sbjct: 241 QLARD--TGGKYYYVEDKHDLAPAFQHVSDDLRTQ 273 >gi|294781746|ref|ZP_06747079.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] gi|294481856|gb|EFG29624.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] Length = 229 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 76/203 (37%), Gaps = 15/203 (7%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + L ++++LDVS SM+ K+ + EM+ + ++ ++ Sbjct: 9 KPATAKHLPVVLLLDVSGSMSGE------KIENLYDATNEMIKVFSDAVSKEKIIDIAII 62 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 TF + P V +N + T L A + I D + +I + Sbjct: 63 TFGENVELHTPYTSVVDFKSRGLNPFLASGMTPLGTALRMAKDMIEDKETTPSNIYRPA- 121 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PD 340 ++ ++DG + ++ N + +A+ + +A +Q LK+ A + Sbjct: 122 -----VVLVSDGVPTDEWRGPLDNFK--NNGRSSKCQRFAVAIGNDADNQMLKSFAECNE 174 Query: 341 RFYSVQNSRKLHDAFLRIGKEMV 363 F+ +N + D F +I + Sbjct: 175 NFFIAENVSDIVDKFKQISMSVS 197 >gi|150389538|ref|YP_001319587.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] gi|149949400|gb|ABR47928.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] Length = 551 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/310 (13%), Positives = 101/310 (32%), Gaps = 29/310 (9%) Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 T ++ +N + + + + + + IN ++ S I + Sbjct: 68 TNLVKDKNVLEEIPVSQLTGDKSSEEVLSVTPESFIAKEAIDLKINQLDTSQFPKISL-- 125 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 N + + + + S +++ + L + ++LD S S Sbjct: 126 YFSALNAQGIPILNLTRDSFEIYEERIINSDTKPVKSDTFLANLQQVPLSVSLILDNSGS 185 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 M+ G M + A + +D +N + ++ F+S + P ++ + Sbjct: 186 MS---GNPMTQAKSAAKQFLNYVDF-------SNGDQVEIIEFNSDVYIRIPYGSDIKSL 235 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 I+ + S T L + + K I+ TDGE N Sbjct: 236 NTAIDTMESNSQTALYDALYTGLVRAYS------------QSGPKCILAFTDGEE---NA 280 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + ++ ++ IGV + ++ LK A + ++ + +L + + Sbjct: 281 SIRSVSEVTELSRATSIPIFIIGVGSLIDEESLKEIAEQTGGEYFYSPTAVELEQIYKTV 340 Query: 359 GKEMVKQRIL 368 + +Q +L Sbjct: 341 YDQQKEQYVL 350 >gi|126303381|ref|XP_001379571.1| PREDICTED: similar to matrilin-4 [Monodelphis domestica] Length = 623 Score = 71.8 bits (174), Expect = 1e-10, Method: Composition-based stats. Identities = 42/195 (21%), Positives = 78/195 (40%), Gaps = 26/195 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ + R + ++ + P N R G++ +SS+ Sbjct: 33 GPLDLVFVIDSSRSVRPF------EFETMRRFLVNIIRGLDIGP---NATRVGVIQYSSQ 83 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + FPL A+ + ++ I+ ++ T + ++YA N F E Sbjct: 84 VQSVFPLGAFSRREDMERAIHAIVPLAQGTMTGLAIQYAMNVAFSVAE---GARPSQARV 140 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + +TDG +A+ RG +YA+GVQ L+ ASP + Sbjct: 141 PRVAVIVTDGRPQD------RVTEVAAQARNRGIEIYAVGVQRADVGS-LRAMASPPLDE 193 Query: 341 RFYSVQNSRKLHDAF 355 + V++ L F Sbjct: 194 HVFLVESF-DLIQQF 207 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 36/176 (20%), Positives = 71/176 (40%), Gaps = 26/176 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V+D S S+ + R + +++D + P+ R GLV +SS++ Sbjct: 387 VDLVLVIDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQYSSRVR 437 Query: 229 QTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G +++ + + T + L + F + A + Sbjct: 438 TEFPL--GRYGTADEVKQAVLAVEYMEKGTMTGLALRHLVEHSFSEAQGARPRA---QNV 492 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + TDG + + + AK G I+YA+GV ++ L+ AS Sbjct: 493 PRVGLVFTDGRSQD------DISVWAARAKEEGIIMYAVGVGKAVEEE-LREIASD 541 >gi|115482404|ref|NP_001064795.1| Os10g0464500 [Oryza sativa Japonica Group] gi|110289213|gb|AAP54178.2| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113639404|dbj|BAF26709.1| Os10g0464500 [Oryza sativa Japonica Group] Length = 719 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/230 (20%), Positives = 86/230 (37%), Gaps = 45/230 (19%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 +S ++S +D++ VLDVS SM KL + R++ ++ + Sbjct: 249 SSPATVTS--RAPIDLVTVLDVSWSMAG------TKLALLKRAMSFVIQALGPGD----- 295 Query: 216 VRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R +VTFSS + FPL G Q ++++ L+ T L A + D +E Sbjct: 296 -RLSVVTFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRRE 354 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE---------------AKRRG 316 + + I+ L+DG ++ + ++ R Sbjct: 355 RNPVCS---------IVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARH 405 Query: 317 AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 V+A G A+ + + A S F + + + DAF + IG + Sbjct: 406 VQVHAFGFGADHDSPAMHSIAEMSGGTFSFIDAAGSIQDAFAQCIGGLLS 455 >gi|22758319|gb|AAN05523.1| unknown protein [Oryza sativa Japonica Group] Length = 731 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/230 (20%), Positives = 86/230 (37%), Gaps = 45/230 (19%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 +S ++S +D++ VLDVS SM KL + R++ ++ + Sbjct: 249 SSPATVTS--RAPIDLVTVLDVSWSMAG------TKLALLKRAMSFVIQALGPGD----- 295 Query: 216 VRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R +VTFSS + FPL G Q ++++ L+ T L A + D +E Sbjct: 296 -RLSVVTFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRRE 354 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE---------------AKRRG 316 + + I+ L+DG ++ + ++ R Sbjct: 355 RNPVCS---------IVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARH 405 Query: 317 AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 V+A G A+ + + A S F + + + DAF + IG + Sbjct: 406 VQVHAFGFGADHDSPAMHSIAEMSGGTFSFIDAAGSIQDAFAQCIGGLLS 455 >gi|224113057|ref|XP_002316375.1| predicted protein [Populus trichocarpa] gi|222865415|gb|EEF02546.1| predicted protein [Populus trichocarpa] Length = 714 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 48/217 (22%), Positives = 82/217 (37%), Gaps = 40/217 (18%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D++ VLD+S SM KL + R++ ++ + S R ++ FSS Sbjct: 262 RAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND------RLSVIAFSS 309 Query: 226 KIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + FPL G QH + +N L+ T GL + D +EK + Sbjct: 310 TARRLFPLRRMSDTGRQHALQAVNALVANGGTNIAEGLRKGAKVMEDRREKNPVAS---- 365 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNE----------AKRRG--AIVYAIGVQAEAA 329 II L+DG+++ N + N G V+A G A+ Sbjct: 366 -----IILLSDGQDTYTVSGNGGNQPQPNYQLLLPVSIHGGDNAGFQIPVHAFGFGADHD 420 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + + + S F ++ + DAF + IG + Sbjct: 421 ASSMHSISEISGGTFSFIETEAVIQDAFAQCIGGLLS 457 >gi|125532269|gb|EAY78834.1| hypothetical protein OsI_33939 [Oryza sativa Indica Group] gi|125575070|gb|EAZ16354.1| hypothetical protein OsJ_31816 [Oryza sativa Japonica Group] Length = 654 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/230 (20%), Positives = 86/230 (37%), Gaps = 45/230 (19%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 +S ++S +D++ VLDVS SM KL + R++ ++ + Sbjct: 172 SSPATVTS--RAPIDLVTVLDVSWSMAG------TKLALLKRAMSFVIQALGPGD----- 218 Query: 216 VRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R +VTFSS + FPL G Q ++++ L+ T L A + D +E Sbjct: 219 -RLSVVTFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRRE 277 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE---------------AKRRG 316 + + I+ L+DG ++ + ++ R Sbjct: 278 RNPVCS---------IVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARH 328 Query: 317 AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 V+A G A+ + + A S F + + + DAF + IG + Sbjct: 329 VQVHAFGFGADHDSPAMHSIAEMSGGTFSFIDAAGSIQDAFAQCIGGLLS 378 >gi|298291248|ref|YP_003693187.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296927759|gb|ADH88568.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 313 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 59/175 (33%), Gaps = 22/175 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSM-NDHF---GPGMDKLGVATRSIREMLDIIKSIPDVN 213 + G ++++ LD+S SM + F G + +L R + Sbjct: 74 PATKDVVTASGREIVLALDLSGSMVKEDFVLDGKPLSRLDAVRRVASRFVAA-------R 126 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GLV F + P + V + I G + +ST I D Sbjct: 127 RGDRIGLVIFGDRAYVAQPPTFDVGSVAHAIEAAQIGISGRST--------AISDGLGLA 178 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 D K ++ L+DG ++S + ++ A G V+ I + E Sbjct: 179 TRRLLQSDATSKVVVLLSDGVDTSGKV---QAGDAARLAASHGIRVHTIALGPED 230 >gi|323450885|gb|EGB06764.1| hypothetical protein AURANDRAFT_71955 [Aureococcus anophagefferens] Length = 1008 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 67/211 (31%), Gaps = 23/211 (10%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 MP F + G+D+ +VLDVS SM D G Sbjct: 323 MPAAPDAFDLRLVDLGGGEAMVEVHTSGGAQRSGVDIEIVLDVSGSMATESEV-QDAAGN 381 Query: 195 ATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSSKIVQTFPLAWGV----QHIQEKIN 245 R LD+ K +++ R GLV + ++ LA + + Sbjct: 382 VQRHGFSTLDVCKHAARCVACSLDDTCRLGLVAYDAQARVVVGLARVTPAHVAKVHAALE 441 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 +L G++T GLE +++ ++ LTDG ++ + + + Sbjct: 442 KLAPGTSTNLWGGLELGVDELVGGAGDNARA----------VLLLTDGVPNNSPPEGEVA 491 Query: 306 LFYCNEAKRRG---AIVYAIGVQAEAADQFL 333 + G V+A G L Sbjct: 492 ALRAKRLTKDGSETVAVFAAGFGYALRSDLL 522 >gi|315186712|gb|EFU20470.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 332 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/220 (20%), Positives = 86/220 (39%), Gaps = 45/220 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LDVS SM PG + VA IR + ++S P + GLV F + + Sbjct: 92 IVIALDVSPSMGAMDIPGKQRFQVARDVIR---EFVRSYPHMA----VGLVLFGKEAMLE 144 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P V++ E++ + S T G+ + + H+++ + + + ++ Sbjct: 145 VPPTIDVEYFLERLEAVRLFSLGDGTALGMGVGISLL--------HLSRVNASF-RAVVI 195 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE---------------------- 327 LTDG+N++ I + + AK V+ +GV ++ Sbjct: 196 LTDGKNTTGEILPETA---AEMAKDLDIPVFTVGVGSDLPVSLDVIDPSTGTRYAGVLEE 252 Query: 328 -AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ L+ A S +F+S LH F IG Sbjct: 253 GYDEETLRRMAEMSGGQFFSGYTPTSLHRIFQYIGATATA 292 >gi|225377140|ref|ZP_03754361.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] gi|225211045|gb|EEG93399.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] Length = 1406 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 72/204 (35%), Gaps = 26/204 (12%) Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-----SSKIVQTF 231 S+S +D K+ + + I + G+ TF Sbjct: 809 QSISDSDVMAVWTSKISALKDAASGFVTGISDTSPDSL---VGIATFYGIGNGWNSSTEG 865 Query: 232 PLAWGVQHIQEK-----INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L G+ + + +N L T GLE+AY+++ A++ KKY Sbjct: 866 KLNHGLSKVNKNEMLKSVNALFADGGTSPQKGLEHAYSELQKAEDGN----------KKY 915 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCASPDRFYS 344 +I +DGE S N D E+ + K G V +G+ + AS ++ Sbjct: 916 VILFSDGEPSDSN-DKMETEASAVKLKEAGYTVITVGLGLNNETATWLGEKVASAGCAFT 974 Query: 345 VQNSRKLHDAFLRIGKEMVKQRIL 368 + +L+ F I + + R L Sbjct: 975 ADTAEELNKIFQNIQSTITQSRSL 998 >gi|121606137|ref|YP_983466.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120595106|gb|ABM38545.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 354 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/300 (15%), Positives = 96/300 (32%), Gaps = 65/300 (21%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 R SLSI+ + ++ + + ++ PL + + + S+++ Sbjct: 39 RYASLSIVREAIGTGQSVRRHIPPLL--FLLSLAVMLVAASRPLAV---ITLPSQNET-- 91 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++ +DVS SM ++L + + + L D+ VR G+V F+ Sbjct: 92 -IILAMDVSGSMRATDVLP-NRLVASQNAAKAFL------ADLPRNVRVGVVAFAGTAAV 143 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF---------------------- 267 P + + I++ T G+ + ++F Sbjct: 144 VQPPTVSREDLTAAIDKFQLQRGTAIGNGIIVSLAELFPEAGIDLESMENNRERKHGLSL 203 Query: 268 -DAKEKLEHIAK------GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + + K II LTDG+ ++ +SL A RG VY Sbjct: 204 DQAGKDDGNGKKAFTPVAPGSYTSAAIILLTDGQRTTG----IDSLDAAKVAADRGIRVY 259 Query: 321 AIGVQAEAADQF---------------LKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 +GV + LK A+ ++ + L + + + Sbjct: 260 TVGVGTVEGETIGFEGWSMRVKLDEETLKGIARATQAEYFYAGTATDLKKVYQTLSSRLT 319 >gi|118375014|ref|XP_001020694.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89302461|gb|EAS00449.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 610 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 83/207 (40%), Gaps = 27/207 (13%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T + IS+ S LD++ ++D S SM G K+ +I ++L+++ N Sbjct: 143 TQKLYISTSSRPNLDLVCIIDNSESM-----SGCSKIENVKNTILQLLEML------NEN 191 Query: 216 VRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R L+TF+S Q L + +Q N + T T GLE A+ + K+ Sbjct: 192 DRLSLITFNSYAKQLCGLKKVSNLNKETLQAITNSIKAYGGTNITSGLEIAFQILQSRKK 251 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAAD 330 K + I L+DG++ + K L + + +++ ++ Sbjct: 252 KNSVSS---------IFLLSDGQDDGADTKIKNLLKITYQQLQEESFTIHSFSFGSDHDC 302 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAF 355 ++ A FY V+ + ++ + F Sbjct: 303 PLMQKIAQIKDGSFYFVEKNDQVDEFF 329 >gi|32476015|ref|NP_869009.1| inter-alpha-trypsin inhibitor domain-containing protein [Rhodopirellula baltica SH 1] gi|32446559|emb|CAD76394.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Rhodopirellula baltica SH 1] Length = 764 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 86/251 (34%), Gaps = 38/251 (15%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA-----------PLLITSSV 159 T +I + +Q + + Y + T + P Sbjct: 279 DTHAAITLKNQSTVADKDFIIEYRLAGDDSTLASLTHRESDAEDGYVMLALQPKWSIEPT 338 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +I+ + +++VLD S SMN GP + +L + + + L+ + R+ Sbjct: 339 EITPRE-----VILVLDTSGSMN---GPAISQLRLFADHVLDHLNPNDEFRVIAFSNRTT 390 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ +Q ++ + L T P L+ A + ++ Sbjct: 391 AFQPNAVSATDA----NIQSAKQFVRGLRASGGTNLLPALKLA---LGGEADESAR---- 439 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +Y+I +TD + ++ L Y + + + A V+ I A D + A Sbjct: 440 ----PRYMILMTD----ALVGNDHSILRYLRQPEFQDARVFPIAFGAAPNDYLISRAAEM 491 Query: 340 DRFYSVQNSRK 350 R +S+Q + + Sbjct: 492 GRGFSMQVTNQ 502 >gi|226314068|ref|YP_002773964.1| hypothetical protein BBR47_44830 [Brevibacillus brevis NBRC 100599] gi|226097018|dbj|BAH45460.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 677 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 55/257 (21%), Positives = 101/257 (39%), Gaps = 50/257 (19%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 V +P +F + P A + + ++ G+D + V+D S SMN PG Sbjct: 18 VVIGLLLPMLFLSAPLTAGA-------------NGTAEAGVDAVFVVDTSNSMN-KTDPG 63 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQEK 243 V + I +M + ++ R G V ++ +IVQ A + ++ Sbjct: 64 KTAAEVMSMFI-DMSEATRT--------RIGFVAYNDRIVQAQSPASMAEARNREQLKRT 114 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-----ENSSP 298 I L + + GL I AK+ K ++I L+DG +N+ Sbjct: 115 IQGLRYSGYSDLGLGLRRGAEMIEKAKDPA---------RKPFLILLSDGGTDLRQNAGG 165 Query: 299 N---IDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNC--ASPDRFYSVQNSRKL 351 NK+ ++AK +G +Y IG+ + + + LK A+ + Q++ L Sbjct: 166 RSVAASNKDVETVISKAKAQGYPIYTIGLNNDGSVQKEQLKKIAEATGGTSFVTQSTDDL 225 Query: 352 HDAFLRI-GKEMVKQRI 367 + F +I K + Q + Sbjct: 226 PEIFNQIFAKHIQSQLV 242 >gi|255066322|ref|ZP_05318177.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] gi|255049532|gb|EET44996.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] Length = 538 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 91/238 (38%), Gaps = 36/238 (15%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW + + I + ++K D+ +++ ++DVS SM++ +KL + +++R Sbjct: 147 PWQPEAKLIKIGI--QAQDTAKKDLPPANLVFLVDVSGSMDEE-----NKLPLVQKTLRI 199 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGL 259 + ++ V L+T++S P G + I I++L G T L Sbjct: 200 LTQQLRPQDKVT------LITYASGEDLVLPPTSGADKETILSAIDKLRAGGATDGESAL 253 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + AY E+ + + + I+ TDG+ + D + E ++ G + Sbjct: 254 QMAY-------EQAQKAFVPNGINR--ILLATDGDFNVGVSDTETLKSMVAEKRKSGVSL 304 Query: 320 YAIGV-QAEAADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 +G + ++ A + + N ++L + +++ Q Sbjct: 305 STLGFGMGNYNEDMMEQIADAGDGNYSYIDNEKEAKKVLQQQLTSTLATVAQDVKIQV 362 >gi|291299992|ref|YP_003511270.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290569212|gb|ADD42177.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 316 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 39/266 (14%), Positives = 79/266 (29%), Gaps = 35/266 (13%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDV 177 + + +L A + P P S ++ T + + + + +++ LD+ Sbjct: 36 RFSNVDLLAKIAAKGPGWRRHLPAAVLLSALVVMSTGMARPAVDTQEPTERATVVLTLDL 95 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 SLSM D+ ++ L+ + +P N GLVTF+ Sbjct: 96 SLSMKAKDVSP-DRFSAMKKAS---LEFVDELPKNYN---LGLVTFAKSASVAVSPTKDR 148 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 ++ I + T G+ A I I+ L+DG +S Sbjct: 149 NQVKSAIKSMKLDRATAIGEGIFSALQAIQSVPPDGASEPAPAR-----ILLLSDGYRTS 203 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNCA--SPDR 341 + + V I + + L A + + Sbjct: 204 GRLVEDGAKAAKAAK----VPVSTIAFGTDTGTVEIEGETQEVPVDRETLSQTAETTGGK 259 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRI 367 FY + L + +G + + + Sbjct: 260 FYEAASVDDLKGVYEDMGSSIGHRTV 285 >gi|160894031|ref|ZP_02074810.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] gi|156864409|gb|EDO57840.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] Length = 1391 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 44/313 (14%), Positives = 101/313 (32%), Gaps = 40/313 (12%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTS 113 ++ L A + + + + + T + + ++ + S Sbjct: 443 NAELNAAVNDMVDKQSSQVVPVSEETINGSFNSYVATTLKYDRINIHISRIDTSAYPSIQ 502 Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 I I+ KD ++ F + L + + + + + Sbjct: 503 AYINING-TKDSKEELADQFTKE-DFTVIDTQYEITDFTLNSGAESE-------AVSIGI 553 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V+D S SM + + ++ I ++ V++ ++ L Sbjct: 554 VMDKSGSMEGAAIANAKQAAT---------EAVEHITSEKMMI----VSYDNEAYLEQSL 600 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 ++ I + G T + GL A + + K +I ++DG Sbjct: 601 TSRSGTLKNSIAAISDGGGTNISAGLNLALDNLEAEKGSRA------------VILMSDG 648 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKL 351 ++ D + + + A + G VY +G E D +++ A + +F S +L Sbjct: 649 QDGGSEEDMQAAT---DRAAKLGISVYTVGFG-ECDDAYMQAIAEVTGGKFVKASASTEL 704 Query: 352 HDAFLRIGKEMVK 364 D +L + K +V Sbjct: 705 SDIYLYLQKYIVN 717 >gi|19033105|gb|AAL83537.1|AF414454_1 proximal thread matrix protein 1 variant a [Mytilus edulis] Length = 441 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 80/198 (40%), Gaps = 19/198 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V D S S+N + + + I + + K+ PD +VTF+ + + Sbjct: 241 DIAFVFDASSSINANNPNNYGLMKDFMKDIVDRFN--KTGPDGTQF---AVVTFADRATK 295 Query: 230 TFPLAW--GVQHIQEKINRLIFG--STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I+ I+++ T GLE A ++F + G ++ +K Sbjct: 296 QFGLKDYSSKAEIKGAIDKVTPSIIGQTAIGDGLENARLEVFPNR-----NGGGREEVQK 350 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-YS 344 +I LTDG+N+ ES ++ G ++ AIGV L N AS + + ++ Sbjct: 351 VVILLTDGQNNGHKSPEHESSLL----RKEGVVIVAIGVGTGFLKSELINIASSEEYVFT 406 Query: 345 VQNSRKLHDAFLRIGKEM 362 + KL + K Sbjct: 407 TSSFNKLSKIMENVVKLA 424 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 43/219 (19%), Positives = 91/219 (41%), Gaps = 17/219 (7%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 MP+ + +S ++ K + + D+ D++++ D S S+ + + Sbjct: 7 MPYKAVPYESPVATSPTKYKPGNTGKDAEECDVQADIIVLFDDSSSIQ---YDNKENYQM 63 Query: 195 ATRSIREMLDIIKSIP-DVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-- 249 ++E++D ++ + N + G+V FS + FPL + + I++ I ++ Sbjct: 64 MKNFVKELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPLNKFKTKEDIKKGIQDMVPRN 123 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T+ GL++ F E + K +K +I +TDG++++ Sbjct: 124 GGQTEIGTGLQHVRENSFSGAEGGGNPDK-----QKIVILMTDGKSNAG----APPQHEA 174 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 ++ K G V AIG+ L+ A+ + NS Sbjct: 175 HKLKAEGVTVIAIGIGQGFVKTELEQIATMKNYVLTTNS 213 >gi|13529371|gb|AAH05429.1| Matn2 protein [Mus musculus] Length = 956 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFILDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 76/207 (36%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFETVKHFVTGIIDSLAVSP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G +E + G + + L++ + + F E Sbjct: 701 TQVRTEFTLR-GFSSAKEMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVE---GARPPS 756 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 757 TQVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASEP 809 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ + Sbjct: 810 IDKHLFYAED----FSTMGEISEKLKE 832 >gi|218509981|ref|ZP_03507859.1| hypothetical protein RetlB5_22275 [Rhizobium etli Brasil 5] Length = 448 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/314 (14%), Positives = 114/314 (36%), Gaps = 19/314 (6%) Query: 3 FLN-IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 L+ +R+ + G+++I+ A+ L + + +G + + V+ ++ LD +L+ Sbjct: 9 CLHTLRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVK 68 Query: 62 KILNQENGNNGKKQKNDFSYRIIKN-IWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 +I N E+ + K++ +D+ + ++N + + + + +T + I Sbjct: 69 QINNTEDTDALKQKVSDWFHAQVENSYALGEIEIDTTNHNITATASGTVPTTFMKI---A 125 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL----- 175 +S S + P + + +L+ ++ S G+ Sbjct: 126 NIDTVPVSVASAVKGPATSYLNVYIVVDTSPSMLLAATTAGQSTMYSGIKCQFACHTGDT 185 Query: 176 ----DVSLSMNDHFGPGMD---KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + + N + + VA ++RE+LD+I + ++ GL + Sbjct: 186 HTIGKKTYANNYDYSTEKGIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYGLGDTLT 245 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK--LEHIAKGHDDYKKY 286 + + ++ +G T+ ++ Y + K+K K Sbjct: 246 EVLAPTLSTDIARTRLADSSYGLTSATSKAATYFDVSLATLKQKVGAGGDGTTSGTPLKL 305 Query: 287 IIFLTDGENSSPNI 300 ++ LTDG S Sbjct: 306 VLLLTDGVQSQREW 319 >gi|19031199|gb|AAL17973.1| proximal thread matrix protein 1b [Mytilus edulis] Length = 444 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 80/198 (40%), Gaps = 19/198 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V D S S+N + + + I + + K+ PD +VTF+ + + Sbjct: 244 DIAFVFDASSSINANNPNNYGLMKDFMKDIVDRFN--KTGPDGTQF---AVVTFADRATK 298 Query: 230 TFPLAW--GVQHIQEKINRLIFG--STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I+ I+++ T GLE A ++F + G ++ +K Sbjct: 299 QFGLKDYSSKAEIKGAIDKVTPSIIGQTAIGDGLENARLEVFPNR-----NGGGREEVQK 353 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-YS 344 +I LTDG+N+ ES ++ G ++ AIGV L N AS + + ++ Sbjct: 354 VVILLTDGQNNGHKSPEHESSLL----RKEGVVIVAIGVGTGFLKSELINIASSEEYVFT 409 Query: 345 VQNSRKLHDAFLRIGKEM 362 + KL + K Sbjct: 410 TSSFNKLSKIMENVVKLA 427 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 85/198 (42%), Gaps = 17/198 (8%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-DVNN 214 ++ K + + D+ D++++ D S S+ + + ++E++D ++ + N Sbjct: 31 GNTGKDAEECDVQADIIVLFDDSSSIQ---YDNKENYQMMKNFVKELVDSFTTVGVNGRN 87 Query: 215 VVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF--GSTTKSTPGLEYAYNKIFDAK 270 + G+V FS + FPL + + I++ I ++ G T+ GL++ F Sbjct: 88 GSQFGVVQFSQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEIGTGLQHVRENSFSGA 147 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + K +K +I +TDG++++ ++ K G V AIG+ Sbjct: 148 EGGGNPDK-----QKIVILMTDGKSNAG----APPQHEAHKLKAEGVTVIAIGIGQGFVK 198 Query: 331 QFLKNCASPDRFYSVQNS 348 L+ A+ + NS Sbjct: 199 TELEQIATMKNYVLTTNS 216 >gi|233142080|gb|ACQ91095.1| complement factor B-like protein [Ruditapes decussatus] Length = 697 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 48/220 (21%), Positives = 83/220 (37%), Gaps = 22/220 (10%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD---V 212 T + S GLD+++++DVS S+ D + A + ++ ++DI + Sbjct: 182 TGAQSRLSPGKSGLDVVLLVDVSSSIGD------RSMESAKKFMKLLVDIFGVSNETSGG 235 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIF 267 N R L+TFS++ F L G +E++ R G T L IF Sbjct: 236 KNGTRFALLTFSNEADIVFNLNDGTARSKEEVKRRIDEIQNTGGGTNFRAALLKVVGGIF 295 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK--ESLFYCNEAKRRG-AIVYAIGV 324 K E H + + LTD E +S ++ N+ K G ++ IGV Sbjct: 296 FNVIKKESQRLNHAT--RAVFLLTDAEETSTLEKDRLPRIRQAANDLKNEGHFEIFCIGV 353 Query: 325 QAEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKE 361 + L AS + +++ L I ++ Sbjct: 354 GQNIDETTLAEIASTPHIEHVFTLSKFDDLEKVGDIIAEK 393 >gi|256419952|ref|YP_003120605.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256034860|gb|ACU58404.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 345 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 73/210 (34%), Gaps = 29/210 (13%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFG 186 + SR F F L K + G+D+++ LDVS SM Sbjct: 49 TGYSRRRFTFRFLLIFIAFLFGAIGLANLQKGSRMEKITRKGVDVVIALDVSKSMLAGDV 108 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 D+L A + I ++ D + + R GLV F+ PL + + Sbjct: 109 KP-DRLTRAKQLISKLADKLDN-------DRVGLVVFAGNAYLQMPLTIDYSAAKMYLTT 160 Query: 247 ----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 +I T ++ A N F+ KE+ K +I ++DGE+ Sbjct: 161 VSPDMIPTQGTAIGQAIQVA-NDAFNKKERKH----------KSLIIISDGEDHD----- 204 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ A G ++ IG+ + Sbjct: 205 EAAISKARAAFEDGVVINTIGIGSPTGSPL 234 >gi|125527010|gb|EAY75124.1| hypothetical protein OsI_03018 [Oryza sativa Indica Group] Length = 589 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 44/206 (21%), Positives = 76/206 (36%), Gaps = 35/206 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 SS GLD++ V+DVS SM+ G+DK A + + L +++ R Sbjct: 59 SSSSTDRAGLDLVAVIDVSGSMDGD---GIDKAKTALQFVIRKL---------SDLDRLC 106 Query: 220 LVTFSSKIVQTFPLAWGV----QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +VTFS+ + PL + ++ ++ L T GLE + + + D + Sbjct: 107 IVTFSTNATRLCPLRFVTAAAQAELKALVDGLKADGMTNMKAGLETSMSVV-DGRRLAAG 165 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 A ++ ++DG + L + VY G A L+ Sbjct: 166 RAVS-------VMLMSDGYQNDGGDARDVHL--------KNVPVYTFGFGASHDSNLLEA 210 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRI 358 A F V +S L F ++ Sbjct: 211 IARKSLGGTFNYVADSANLTGPFSQL 236 >gi|258651542|ref|YP_003200698.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554767|gb|ACV77709.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 681 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 73/204 (35%), Gaps = 39/204 (19%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 M+VLD S SMN PG+ ++ A ++ ++L + + V +V G T S+ + Sbjct: 60 MIVLDASGSMNQDDAPGL-RIDAAKAAVTDLLGTLPAPTQVGLMVY-GTSTGSTDAERAA 117 Query: 232 PL----------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + ++ + T L A + + + Sbjct: 118 GCQDIKTLAPVGTLNAATLTSQVAGITASGYTPIGNALRAAAQALPNEGPRS-------- 169 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAK---RRGA--IVYAIGVQAEAADQFLKNC 336 I+ ++DGE++ C+ A+ +G V+ +G + +A + +C Sbjct: 170 -----IVLVSDGEDTCAPPAP------CDVARELHEQGVDLTVHTVGFKVDATARDQLSC 218 Query: 337 ---ASPDRFYSVQNSRKLHDAFLR 357 A+ + N+ L DA Sbjct: 219 VAQATGGTYSDAGNATGLTDALQA 242 >gi|242065788|ref|XP_002454183.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] gi|241934014|gb|EES07159.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] Length = 703 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 54/278 (19%), Positives = 98/278 (35%), Gaps = 40/278 (14%) Query: 103 QDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS 162 + NI S ++ I + S+ + + I P+ ++++ Sbjct: 196 MEAANIGSSRTVEIKTYSEFSAIQQSSQDDFAV-LIHLKAPYANPEQVTGRSVSATSVGY 254 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S +D++ VLDVS SM KL + R++ ++ + R ++ Sbjct: 255 PTSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQHLGPSD------RLSVIA 302 Query: 223 FSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FSS + F L G Q + +N L T L+ A I D + Sbjct: 303 FSSTARRLFHLRRMSHSGRQQALQAVNSLGASGGTNIADALKKAAKVIED---------R 353 Query: 279 GHDDYKKYIIFLTDGENSS----------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 H + II L+DG+++ P+ + N R V+ G + Sbjct: 354 SHQNPVCSIILLSDGQDTYNIPSNIRGARPDYSSLVPSSILNHTFRL-VPVHGFGFGVDH 412 Query: 329 ADQFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 L + AS F +++ + DAF + IG + Sbjct: 413 DSDALHSIAEASGGTFSFIEDEGVIQDAFAQCIGGLLS 450 >gi|257052324|ref|YP_003130157.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691087|gb|ACV11424.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 592 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 80/228 (35%), Gaps = 35/228 (15%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF----------------GPGM 189 + + + + S++ S+ LD+++VLD+S SM F G Sbjct: 166 STGRYFTVGLNSTLDTSTFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEGDSR 225 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKIN 245 K+ VA ++ + ++ R G+V F+++ PL + I+ I Sbjct: 226 SKMAVAKDAL------VALTEQLHPDDRVGVVLFNNEPTVAKPLRDVETTDMDAIRGHIR 279 Query: 246 R-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 + G T G+ A + + + + A+ I +TD ++ D++ Sbjct: 280 EDIEAGGGTNIADGMAEAADMLGEYADSDPTEAETRQ------IVITDAMPNTGQTDDQA 333 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRK 350 G +GV + + + + + SV ++ Sbjct: 334 LQDRLAGYAEDGIHTSFVGVGVDFNPELVDEITAVRGANYRSVHSAED 381 >gi|262198293|ref|YP_003269502.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262081640|gb|ACY17609.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 419 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 77/203 (37%), Gaps = 27/203 (13%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S + + +++ +V+D S SM GP + VA R + E LD R + Sbjct: 27 TESSARMPVNLALVIDRSSSMR---GPRLASAIVAARQVVEQLDERD---------RLSV 74 Query: 221 VTFSSKIVQTF-PLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + F + F P++ Q +++ + L G T G++ + + Sbjct: 75 IAFDATARTIFGPMSVTDEARQTLEQALAGLRTGVGTNLAAGMKKGAEAVRSGFVRGALS 134 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 ++ LTDG+ S DN + RG + +G+ D+ L + Sbjct: 135 R---------LVLLTDGQPSLGITDNDRLCALAQKEADRGVTITTMGLGQGFDDELLADL 185 Query: 337 ASPDR--FYSVQNSRKLHDAFLR 357 A R F+ + ++ + AF R Sbjct: 186 AHSGRGGFHYLASAADIPGAFGR 208 >gi|326922309|ref|XP_003207392.1| PREDICTED: collagen alpha-1(VI) chain-like [Meleagris gallopavo] Length = 998 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 70/214 (32%), Gaps = 26/214 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + ++ D +D+ VLD S S+ P D + +D + Sbjct: 18 WAQQAEVNARVFRAQDCPVDLFFVLDTSESVALRVKPFGDLVAQVKDFTNRFIDKLTE-- 75 Query: 211 DVNNVVR--------SGLVTFSSKIVQTFPLA---WGVQHIQEKINRLI-FGSTTKSTPG 258 R +G + +S +V L G ++ ++ + G T + Sbjct: 76 ---RYFRCDRFLAWNAGALHYSDSVVIIKDLTAMPSGRAELKNSVSAINYIGKGTHTDCA 132 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGA 317 ++ ++ H KY+I +TDG + L NEAK G Sbjct: 133 IKQGIERLLVGG--------SHLKENKYLIVVTDGHPLEGYKEPCGGLDDAANEAKHLGI 184 Query: 318 IVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 V+++ + DQ L A+ + + L Sbjct: 185 KVFSVAISPHHLDQRLNIIATDHAYRRNFTATSL 218 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 74/210 (35%), Gaps = 34/210 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFSSKIV 228 D+M+++D S S+ ++ + + +++ + VR +V +S + Sbjct: 803 DIMLLVDSSTSVGSK------NFDTTKSFVKRLAERFLEASKPAEDSVRVSVVQYSGRNQ 856 Query: 229 Q--TFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 Q P I + ++ + F T + L+Y + KK Sbjct: 857 QKVEVPFQRNYTVIAKAVDNMEFMNEATDVSAALQYVTGLYQRSSRAGAK--------KK 908 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD----- 340 ++F +DG +S I + E ++ G VY + V ++ + ++ + Sbjct: 909 VLVF-SDG--NSQGITARAIERTVQEVQQAGIEVYVLAVGSQVNEPNIRVLVTGKTANYD 965 Query: 341 ------RFYSVQNSRKLHDA--FLRIGKEM 362 + V + L + + +++ Sbjct: 966 VVYGERHLFRVPDYTSLLRGVFYQTVSRKI 995 >gi|55378019|ref|YP_135869.1| hypothetical protein rrnAC1219 [Haloarcula marismortui ATCC 43049] gi|55230744|gb|AAV46163.1| unknown [Haloarcula marismortui ATCC 43049] Length = 788 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 79/209 (37%), Gaps = 37/209 (17%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ V++ + + ++++++DVS S G + K LD++ + D Sbjct: 358 PVASMLPVRVGNATGGESNIVILVDVSGS--AESGLSVQKAVA--------LDVLDQLGD 407 Query: 212 VNNVVRSGLVTFSSKIVQTFP---LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 N G+V F+ + L +KI RL G T GL+ A +++ D Sbjct: 408 ENQ---VGVVAFNQNAYRVSEMQALGQNRAETADKIRRLESGGATDIAVGLQGA-DELLD 463 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 +E II L+DG++ N+ R G V ++GV Sbjct: 464 DREGT-------------IILLSDGQDRLG-----PPAAVANQLGREGTRVVSVGVGKRV 505 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 ++ A S +++ + +L F Sbjct: 506 GVATMRQIASESGGSYFAADETERLRLLF 534 >gi|186681467|ref|YP_001864663.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463919|gb|ACC79720.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 418 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 42/215 (19%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + S++ + + L++ ++LD S SMN L ++ ++D + N Sbjct: 27 ISISAIAETQDRHVPLNLCLILDHSGSMNG------RSLETVKKAANRLVDRL------N 74 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R +V F + P + I+ +INRL T GL ++ K+ Sbjct: 75 PSDRLSVVVFDHRAKVLVPSQSVEDPEKIKNQINRLAADGGTAIDEGLRLGIEELAKGKK 134 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 A LTDGEN DN L + A + +G Sbjct: 135 DTVSQA----------FLLTDGENEHG--DNNRCLKFAQLAASYNLTLNTLGFGDNWNQD 182 Query: 332 FLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVK 364 L+ A +Q + D F R+ + Sbjct: 183 VLEKIADAGLGTLSYIQKPEEAVDEFNRLFSRIQT 217 >gi|293334601|ref|NP_001168718.1| hypothetical protein LOC100382510 [Zea mays] gi|223950381|gb|ACN29274.1| unknown [Zea mays] Length = 629 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 48/218 (22%), Positives = 81/218 (37%), Gaps = 38/218 (17%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +D++ VLDVS SM KL + R++ ++ + S R ++ Sbjct: 177 STTRAPVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIA 224 Query: 223 FSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FSS + FPL + +++ +N L T GL I E +K Sbjct: 225 FSSSARRLFPLRRMTESGRQQSLLAVNSLTANGGTNIAEGLRKGSKVI------EERQSK 278 Query: 279 GHDDYKKYIIFLTDGENSSPNIDN----KESLFYCN---EAKRRG---AIVYAIGVQAEA 328 II L+DG+++ K + YC G V+ G A+ Sbjct: 279 NPVCS---IILLSDGQDTYTVSPTAGVHKGATEYCALLPSTTTNGSQQVPVHVFGFGADH 335 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 L + + S F ++ + DAF + IG + Sbjct: 336 DSVSLHSISQTSGGTFSFIETEATIQDAFAQCIGGLLS 373 >gi|66805723|ref|XP_636583.1| hypothetical protein DDB_G0288697 [Dictyostelium discoideum AX4] gi|60464969|gb|EAL63080.1| hypothetical protein DDB_G0288697 [Dictyostelium discoideum AX4] Length = 585 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 52/267 (19%), Positives = 93/267 (34%), Gaps = 21/267 (7%) Query: 107 NIERSTSL-SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK- 164 +IE STS+ + D+ + +S L T +K + Sbjct: 95 SIEPSTSVPNQASDNSSGENLISTQLNKLDLNNNNNNNQQQQQQKDSLNSTEPIKKQVEL 154 Query: 165 ---SDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 LD+ +VLD+S SM PG ++ + + V Sbjct: 155 YESEIKQLDV-IVLDLSGSMKSAAFKGSRVPGELEMSRIELAQTLFQTFTDKAISLEVPV 213 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEH 275 GLVTF +I TF L ++ ++ T+ ++ A + KE + Sbjct: 214 AVGLVTFGERIELTFDLTRNFDSFSTELGEVVANQCKTRLFEAIQLAAETLVKFKESCDA 273 Query: 276 IAKGH----DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 A G + + LTDGE++S N D Y K+ I+ +I + + ++ Sbjct: 274 AATGGLKLSSNPMLRVFCLTDGEDNS-NFDPYPVYQY---MKKHNIILDSIPIGLDGRER 329 Query: 332 FLKNC-ASPDRFYSVQNSRKLHDAFLR 357 A+ + +S + + F R Sbjct: 330 LSSFSQATGGSCFIADSSLEGVELFER 356 >gi|75907530|ref|YP_321826.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701255|gb|ABA20931.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 418 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 73/207 (35%), Gaps = 28/207 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + L++ ++LD S SMN L + ++ ++D +K+ R +V Sbjct: 35 PQDRTVPLNLCLILDHSGSMNG------RPLEIVKQAAIRLVDRLKTGD------RLSVV 82 Query: 222 TFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F + P + + I+++INRL T GL ++ K++ A Sbjct: 83 AFDHRAKVLVPNQVIDNPEQIKKQINRLAADGGTAIDEGLRLGIEELAKGKKETISQA-- 140 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 LTDGEN DN L + A + +G L+ A Sbjct: 141 --------FLLTDGENEHG--DNNRCLKFAQLAAGYNLTLNTLGFGDNWNQDVLEKIADA 190 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 +Q + + D F R+ + Sbjct: 191 GLGSLSYIQKAEQAVDEFGRLFSRIQT 217 >gi|296227520|ref|XP_002759384.1| PREDICTED: matrilin-2 [Callithrix jacchus] Length = 973 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIMDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLK- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDFNTLKAIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 79/208 (37%), Gaps = 33/208 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 756 STRVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASE 808 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 809 PTDKHLFYAED----FSTMDEISEKLKK 832 >gi|148699893|gb|EDL31840.1| procollagen, type VI, alpha 1, isoform CRA_b [Mus musculus] Length = 643 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 77/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 64 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 123 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 124 YSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG-------- 175 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 176 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSIIA 235 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + DA I + + Sbjct: 236 TDHTYRRNFTAADWGHSRDAEEVISQTI 263 >gi|7258382|emb|CAB77598.1| putative protein [Arabidopsis thaliana] Length = 676 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 79/211 (37%), Gaps = 32/211 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +IS +D++ VLD+S SM KL + R++ ++ + S R Sbjct: 234 QISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD------RLS 281 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ FSS + FPL G Q + +N L+ T GL + D E+ Sbjct: 282 VIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSV 341 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + II L+DG ++ S V++ G ++ + + Sbjct: 342 AS---------IILLSDGRDTYTTNHPDPSYKVMLPQ----ISVHSFGFGSDHDASVMHS 388 Query: 336 CA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + S F +++ + DA + IG + Sbjct: 389 VSEVSGGTFSFIESESVIQDALAQCIGGLLS 419 >gi|26343093|dbj|BAC35203.1| unnamed protein product [Mus musculus] Length = 266 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 77/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 32 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 91 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 92 YSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG-------- 143 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 144 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSIIA 203 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + DA I + + Sbjct: 204 TDHTYRRNFTAADWGHSRDAEEVISQTI 231 >gi|28393354|gb|AAO42101.1| unknown protein [Arabidopsis thaliana] Length = 650 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 79/211 (37%), Gaps = 32/211 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +IS +D++ VLD+S SM KL + R++ ++ + S R Sbjct: 208 QISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD------RLS 255 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ FSS + FPL G Q + +N L+ T GL + D E+ Sbjct: 256 VIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSV 315 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + II L+DG ++ S V++ G ++ + + Sbjct: 316 AS---------IILLSDGRDTYTTNHPDPSYKVMLPQ----ISVHSFGFGSDHDASVMHS 362 Query: 336 CA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + S F +++ + DA + IG + Sbjct: 363 VSEVSGGTFSFIESESVIQDALAQCIGGLLS 393 >gi|30694117|ref|NP_191038.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645763|gb|AEE79284.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 675 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 79/211 (37%), Gaps = 32/211 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +IS +D++ VLD+S SM KL + R++ ++ + S R Sbjct: 233 QISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD------RLS 280 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ FSS + FPL G Q + +N L+ T GL + D E+ Sbjct: 281 VIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSV 340 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + II L+DG ++ S V++ G ++ + + Sbjct: 341 AS---------IILLSDGRDTYTTNHPDPSYKVMLPQ----ISVHSFGFGSDHDASVMHS 387 Query: 336 CA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + S F +++ + DA + IG + Sbjct: 388 VSEVSGGTFSFIESESVIQDALAQCIGGLLS 418 >gi|6753484|ref|NP_034063.1| collagen alpha-1(VI) chain precursor [Mus musculus] gi|543913|sp|Q04857|CO6A1_MOUSE RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|50479|emb|CAA47032.1| collagen alpha1 type VI-precursor [Mus musculus] gi|148699892|gb|EDL31839.1| procollagen, type VI, alpha 1, isoform CRA_a [Mus musculus] gi|162318378|gb|AAI56501.1| Collagen, type VI, alpha 1 [synthetic construct] gi|225000678|gb|AAI72708.1| Collagen, type VI, alpha 1 [synthetic construct] Length = 1025 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 77/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 32 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 91 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 92 YSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG-------- 143 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 144 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSIIA 203 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + DA I + + Sbjct: 204 TDHTYRRNFTAADWGHSRDAEEVISQTI 231 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 76/208 (36%), Gaps = 26/208 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H A R L ++ P + VR +V +S + Q Sbjct: 826 DITILLDSSASVGSH--NFETTKVFAKRLAERFLSAGRADP--SQDVRVAVVQYSGQGQQ 881 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY--NKIFDAKEKLEHIAKGHDDYKKYI 287 G +Q N + S+ S + A N + A K+ + Sbjct: 882 QP----GRAALQFLQNYTVLASSVDSMDFINDATDVNDALSYVTRFYREASSGATKKRVL 937 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK----------NCA 337 +F +DG + + E EA+R G ++ + V + + ++ + A Sbjct: 938 LF-SDGNSQGATAEAIE--KAVQEAQRAGIEIFVVVVGPQVNEPHIRVLVTGKTAEYDVA 994 Query: 338 SPDRF-YSVQNSRKLHDA--FLRIGKEM 362 +R + V N + L + + +++ Sbjct: 995 FGERHLFRVPNYQALLRGVLYQTVSRKV 1022 >gi|291295671|ref|YP_003507069.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470630|gb|ADD28049.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 744 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/253 (18%), Positives = 88/253 (34%), Gaps = 40/253 (15%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 ++ +L + P W S L + + G+ +++VLDVS SM Sbjct: 297 NQGGSLLWTAT---PQGLFFGGWERTSLADSLPVE-----PVEEPGGVGIVLVLDVSGSM 348 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI---VQTFPLA-WGV 237 + DKLG+A + L++I+S + G+V FS + + P+ G Sbjct: 349 LED-----DKLGLA---VTGSLELIRSARPQD---YIGVVVFSDRPRWLFRPRPMTEQGR 397 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + + G T A + K +I LTDG + Sbjct: 398 KEAESLLLSTQAGGGTMIRRAYLEALEALEQVP-----------TESKQVIALTDGLAAD 446 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAF 355 D ++ + + + A+A +FL+ A ++ V L F Sbjct: 447 VTPDLFDAAREASP----RIKTNTVAIGADADGRFLRELAQAGDGTYWDVPRPEDLPRFF 502 Query: 356 LRIGKEMVKQRIL 368 L + + ++ L Sbjct: 503 LEEAQRVFRREAL 515 >gi|225872598|ref|YP_002754053.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] gi|225793914|gb|ACO34004.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] Length = 313 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 78/214 (36%), Gaps = 40/214 (18%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + ++ + L +++ +D S S+ L R+ RE L + R Sbjct: 60 IAVLERQTGLPLSIVLAIDTSGSVRKD-------LDEEKRAAREFLRAT-----LRPEDR 107 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +V F++++ + P ++ I +NRL G T + Y ++ + Sbjct: 108 VEIVNFNTRVHEVVPFTNNLKKIDRGLNRLSEGPATALYAAIAYGSEELAQRPGR----- 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI---GVQAEAADQF-- 332 K ++ ++DG+N+ N +++L + A R +++++ V +A Sbjct: 163 -------KVLVVISDGDNTVANSSYQQAL---DRAVRAETMIFSVIDLPVINDAGRDVGG 212 Query: 333 ------LKNCASPDRFYSVQNSRKLHDAFLRIGK 360 L + +Y L F R+ Sbjct: 213 EHAMIALSEATGGEYYYEAD--GNLQGVFKRLST 244 >gi|171741586|ref|ZP_02917393.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] gi|171277200|gb|EDT44861.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] Length = 1256 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 49/257 (19%), Positives = 91/257 (35%), Gaps = 63/257 (24%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP---GMDKLGVATRSIREMLDIIKSI-- 209 ++ + + +D +VLDVS SM+D + K+ ++ L I Sbjct: 577 TGAANSSTITTTQSVDFTLVLDVSSSMSDEMDSDQGSIKKMTALKSAVNNFLGEAAEINE 636 Query: 210 PDVNNVVRSGLVTFSS-------------------KIVQTFPLAWGVQHIQEKINRLIFG 250 + ++R GLV F+ PL + ++ K++ L Sbjct: 637 QSGSELIRVGLVKFAGKESSKVGNETYTEGRFVYNYSQIVSPLTADMSDLKNKVSALRHN 696 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SL 306 T++ G ++A + A+ K+ +IF TDG + + +K+ ++ Sbjct: 697 GATRADLGFKHASTVMSGARTDA----------KRVVIFFTDGTPTKVSDFDKDVANSAV 746 Query: 307 FYCNEAKRRGAIVYAIGV---------QAEAADQFLKNCAS----------------PDR 341 Y K GA VY+IGV + + +QF+ +S Sbjct: 747 TYAKSLKDSGATVYSIGVFDGANPSSIEEDQKNQFMNAVSSNYPHATAYDKLGTGSNAGY 806 Query: 342 FYSVQNSRKLHDAFLRI 358 + V N L F +I Sbjct: 807 YKVVSNVSDLKSIFEKI 823 >gi|257456195|ref|ZP_05621392.1| BatA protein [Treponema vincentii ATCC 35580] gi|257446281|gb|EEV21327.1| BatA protein [Treponema vincentii ATCC 35580] Length = 332 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/237 (19%), Positives = 76/237 (32%), Gaps = 52/237 (21%) Query: 157 SSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 V+ +S+ S G +M V+D S SM +L A R IIKS + Sbjct: 75 EPVRQTSEAMYSSSGQALMFVIDTSPSMAAQDMGTETRLEAAKR-------IIKSFAEKY 127 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAK 270 GL S P ++++L G T GL A + Sbjct: 128 EGDSLGLTALGSSAAVLIPPTIDRHTFLTRLDQLQVGELGDGTAIGMGLASAVLHLTQYS 187 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-- 328 H II TDG+N++ I + + + K + Y IG+ Sbjct: 188 TLPSH-----------IILFTDGDNNTGEIHPRAA---ADIIKHKKIGFYIIGLGKSGYA 233 Query: 329 ---------------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 + L+ A R++S ++ L D F R +++ Sbjct: 234 PVKYIDPIQKKEISGTLNTVFNETELQKIAGYGNGRYFSAKSPELLTDIFNRFIQKI 290 >gi|147776143|emb|CAN69721.1| hypothetical protein VITISV_014218 [Vitis vinifera] Length = 686 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/242 (19%), Positives = 84/242 (34%), Gaps = 58/242 (23%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D++ VLDVS SM KL + R++ ++ + R +V+FSS Sbjct: 200 RAPIDLVAVLDVSGSMAGS------KLSLLKRAVCFLIQNLGPSD------RLSIVSFSS 247 Query: 226 KIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + FPL G + IN L T GL+ + + E Sbjct: 248 TARRIFPLRRMSDNGREAAGLAINSLXSSGGTNIVEGLKKGVRVLEERSE---------Q 298 Query: 282 DYKKYIIFLTDGENSS-------------PNIDNKESLFY--------CNEAKRRG---- 316 + II L+DG+++ + + ++ L Y C + G Sbjct: 299 NPVASIILLSDGKDTYNCDNVNRRQTSHCASSNPRQVLEYLNLLPASICPRNRESGDEGR 358 Query: 317 ---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGK--EMVKQRIL 368 V+ G ++ + + S F +++ + DAF IG +V Q + Sbjct: 359 QAIIPVHTFGFGSDHDSTAMHAISDESGGTFSFIESVAXVQDAFAMCIGGLLSVVAQELR 418 Query: 369 YN 370 Sbjct: 419 LT 420 >gi|311274909|ref|XP_003134506.1| PREDICTED: matrilin-4-like [Sus scrofa] Length = 721 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 43/195 (22%), Positives = 77/195 (39%), Gaps = 26/195 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ + + + +L + P N R G++ +SS+ Sbjct: 171 GPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRSLDVGP---NATRVGVIQYSSQ 221 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 222 VQSVFPLGAFSRREDMEGAIRALVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEARV 278 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + +TDG +A+ RG +YA+GVQ L+ ASP + Sbjct: 279 PRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDE 331 Query: 341 RFYSVQNSRKLHDAF 355 + V++ L F Sbjct: 332 HVFLVESF-DLIQEF 345 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 69/175 (39%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 484 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 534 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 535 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 589 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV ++ L+ AS Sbjct: 590 PRVGLVFTDGRSQD------DVSVWAARAKEEGIVMYAVGVGKAVEEE-LREIAS 637 >gi|322436225|ref|YP_004218437.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321163952|gb|ADW69657.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 304 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 77/215 (35%), Gaps = 33/215 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +S L ++M +D S S+ + + R++ D + F Sbjct: 72 ESATPLSIVMAIDGSESVVTNDRLEKEAGKKFVRALLREQDEFDLMD------------F 119 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S + + + I+ +N L G T + A ++ + Sbjct: 120 SDTVREVVSFTNDKKRIENGLNELRKGDATAVYDAVYLASQRLGETNAGGGRR------- 172 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI---------GVQAEAADQFLK 334 + ++ +TDG+N+ + +++ +A+R G +VYA+ G ++ Sbjct: 173 -RVLVLITDGDNTVHGVGYDQAV---EQAQRAGVMVYALIVVPIEADAGRNTGGEHALIQ 228 Query: 335 NCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + +Y V + R L + ++ ++ Q +L Sbjct: 229 MATDTGGNYYYVNDPRDLAKVYAKVSDDLRTQYVL 263 >gi|118353832|ref|XP_001010181.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291948|gb|EAR89936.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 542 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 43/264 (16%), Positives = 101/264 (38%), Gaps = 29/264 (10%) Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 G ++ LS+ + HK + S+ + +P + + Sbjct: 64 GDGKNKQKYNLEKGLSLDVKTLHKHFQFSSSTNQSIPVMVSVKTLDKTEDAPKGDQEAVK 123 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S ++ LD++ V+D S SM+ K+ +++ +L+++ R Sbjct: 124 QESLENRPNLDLICVIDNSGSMSGE------KIQNVKKTLEYLLELLGDND------RLC 171 Query: 220 LVTFSSKIVQTFPL--AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 L+ F+S + L +E IN++ T G+E A+ + D Sbjct: 172 LILFNSYATRLCHLMKTNNSNKPAFKEIINKIYSTGGTDINSGMELAFRVLKD------- 224 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-K 334 + + + + L+DG++ S ++ ++SL + +++ G ++ + K Sbjct: 225 --RKYQNPVSSVFLLSDGQDGSADLRVRQSLE--RHLPQECFTIHSFGFGSDHDGPLMNK 280 Query: 335 NCA-SPDRFYSVQNSRKLHDAFLR 357 C+ FY V+ ++ + F+ Sbjct: 281 ICSLKDGNFYYVEKINQVDEFFVD 304 >gi|260781661|ref|XP_002585922.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] gi|229270990|gb|EEN41933.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] Length = 4065 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 44/245 (17%), Positives = 79/245 (32%), Gaps = 33/245 (13%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLI---TSSVKISSK-----SDIGLDMMM 173 + ++ + C + + + S G D++ Sbjct: 1786 FEGGTVTWGCFNGFNLVGALTAVCLGNGSWSEPVPECMAPTTPPPPGCDELSFGGWDLVF 1845 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +LD S S+ + + P +N + GLV FS I + F L Sbjct: 1846 LLDGSGSVGSNNFLNVKNFTKLIT---------DLFPVGDNATKVGLVQFSDTIQKEFDL 1896 Query: 234 -AWGVQ-HIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + I I+ + G T + ++Y F+ I + +I L Sbjct: 1897 RDYDTKAEILSAIDNISYLGGGTYTGNAIDYVRQVSFN------TINGNRGSHPDMLIVL 1950 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQNSR 349 TDGE+ P +S A+ +G ++AIGV L+ A P + V + Sbjct: 1951 TDGESFDPVTFASQS------ARDQGITIFAIGVGTGVDYATLEEIAGDPQKVQQVTDFA 2004 Query: 350 KLHDA 354 L Sbjct: 2005 DLTSV 2009 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 43/208 (20%), Positives = 76/208 (36%), Gaps = 29/208 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + LD++ +LD S S+ D L T I D+ N+ R G+V Sbjct: 1582 SNRTLNLDVVFLLDGSGSVG---SANFDLLKTFTTRIATNFDVST------NLTRVGVVQ 1632 Query: 223 FSSKIVQTFPL-AWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S + F L + + + I + + T + L+Y +F + A Sbjct: 1633 YSDQTNSEFVLNTFSTEAEVLAAIAAISYQNGGTSTGAALDYVRQNVFISASGDRPDAAN 1692 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 +I LTDG +S + F A+ G +Y++G+ L+ A Sbjct: 1693 ------ILIVLTDGVSSD------DVSFPAMAARNAGITIYSVGIGDGVDYNTLQQIAGD 1740 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 P++ L D IG ++ + Sbjct: 1741 PNKVLQATGFSSLDD----IGGQLEELV 1764 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 73/191 (38%), Gaps = 25/191 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 GLD++ +LD S S+ G + E++ D N G+V +S Sbjct: 1018 GGLDLVFLLDGSGSV------GTTNFELVKDFTSEVVLNFNISADTTN---VGVVQYSDT 1068 Query: 227 IVQTFPLA-WGVQ-HIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + F L+ + + + + IN++ T + ++Y F + + Sbjct: 1069 VRNEFFLSSYDTKLPLIDAINQISYLTGGTLTGFAIDYVRQSSFSRPAGARN------TF 1122 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPDRF 342 ++ LTDG++ + + A+ +G ++A+G+ +E L+ P R Sbjct: 1123 PDVLVVLTDGQSQD------DVVSSAAAARSQGITIFAVGIGSEVDFTTLLQISGYPSRI 1176 Query: 343 YSVQNSRKLHD 353 +Q+ L Sbjct: 1177 LQIQDFATLVT 1187 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 60/181 (33%), Gaps = 25/181 (13%) Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWG 236 S S+ D + + ++ N R G+V +S F L A+ Sbjct: 2707 SGSVGS------DNFNLLKAFTQNIVGNFDIAV---NNTRVGVVQYSDFNNIEFNLNAYA 2757 Query: 237 VQ-HIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + + I + + T + +++ +F D ++ LTDGE Sbjct: 2758 TEAEVLAAIGAISYQRGGTFTGAAIDFVRQDVFTTAGG------NRADKPDILLVLTDGE 2811 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQNSRKLHD 353 +S ++ G +YA+G+ + L+ A P R V + + L Sbjct: 2812 SSDSVAGPAQNTL------NAGITIYAVGIGSGVNADTLQEIAGDPGRVLQVADFQGLAA 2865 Query: 354 A 354 Sbjct: 2866 I 2866 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 65/179 (36%), Gaps = 19/179 (10%) Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKIN 245 G D + + + ++D + + G+V +SS F L A+ Q + + IN Sbjct: 2425 GADNFNLVKQFAKRLVDNFEISQ---TDTKVGVVQYSSSSNVEFYLNAFSTKQAVLDAIN 2481 Query: 246 RLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 + + T + + Y +IF +Y +I +TDGE+S Sbjct: 2482 AVTYQQGGTNTGAAITYTMQEIF------ASANGARANYPDVLIVVTDGESSDDVAVPAL 2535 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 S A+ G ++YA+GV L+ + + + L + + + Sbjct: 2536 S------ARNAGTLIYAVGVGNGVNQATLLQIAGNAGQVLQAADFAGLTTVVQSLQQNL 2588 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 72/205 (35%), Gaps = 29/205 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 I LD++ +LD S S+ P + + T S+ D+ N R G+ + Sbjct: 1297 NHTIALDLIFLLDGSGSIT---APNFELVKSFTYSVSRNFDV------SPNATRIGVAQY 1347 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F L + +N + + G T + L++ + ++ Sbjct: 1348 SDTNSLEFNLNRYSTKDEVLNAVNGISYQGGGTYTGAALDFVRQTMMVESAGDRTMSPN- 1406 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASP 339 ++ TDGE+S + + G +VYA+G+ A ++ L Sbjct: 1407 -----ILVVATDGESSD------DQRTPAEVLRNAGTLVYAVGIGAGVSSTTLLDIAGYN 1455 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 R + L IG+E+ + Sbjct: 1456 SRVLQATDFASL----EVIGRELQE 1476 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 59/192 (30%), Gaps = 23/192 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ LD++ +LD S S+ + R + N+ + G+V +S Sbjct: 748 DVPLDIVFLLDGSGSVGSA------NFDLVKDFTRTLARNFDIAA---NMTQIGVVQYSD 798 Query: 226 KIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + F L Q + I+ + + T A + + + Sbjct: 799 TVNREFGLGDFHNRQDVLNAISAVSYQQGGTLTGA---AIDFVRQTSFTTGDGDRPDVP- 854 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASPDRF 342 +I +TDG + + A+R G + +G+ L+ R Sbjct: 855 -NMLIVVTDGVSGDSVQGP------ADAARREGITTFGVGIGNGIDFGTLLEIAGDSARV 907 Query: 343 YSVQNSRKLHDA 354 + L Sbjct: 908 LQADDFGALATV 919 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 67/193 (34%), Gaps = 25/193 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S LD++ ++D S S+ + A + E+ ++ R G+V +S Sbjct: 131 SGFALDLVFLVDGSSSVGSDNFETIKVFLEAITAGFEV---------SSSQTRVGVVQYS 181 Query: 225 SKIVQTFPL-AWGVQ-HIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + I F L ++ + + I L +T + G+ F E + Sbjct: 182 TGINTEFDLNSFATEAEVINAIRGLSHQRGSTFTGAGIT------FTRLESFTGASGDRP 235 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPD 340 D +I +TDG ++ A+ Y+IG+ E L + Sbjct: 236 DAPNVLIVITDGISADSVDAP------AEAARADNITTYSIGIGDEINYLTLLSIAGMRE 289 Query: 341 RFYSVQNSRKLHD 353 R +V L+D Sbjct: 290 RVLNVTTFGDLND 302 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 70/198 (35%), Gaps = 27/198 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + +D++ ++D S S++ + + ++ I +++ R Sbjct: 343 TTVEPCERLEIDVIFLIDGSSSIS------LLNFDLLKTFLQN---ITMKFDVSSDITRI 393 Query: 219 GLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYA-YNKIFDAKEKLE 274 G+V +S+ + F L A + I N +T G+ + N A Sbjct: 394 GVVQYSTDVNTEFELKTYATEAEVIHAISNITRQRGSTFIGAGINFVRTNSFTVAAGDRP 453 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ +TDG ++ A+ +G + Y+IG+ E L Sbjct: 454 -------LAPNILVTITDGISADDVAGP------AQAARDQGILTYSIGIGEEIQWPTLL 500 Query: 335 NCASPDR-FYSVQNSRKL 351 + A ++V + +L Sbjct: 501 SIAGARHRVFNVTSFSEL 518 >gi|293360567|ref|XP_216941.5| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 900 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PD+ R GL+ Sbjct: 48 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFILDILQFLDIGPDI---TRVGLL 98 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L + ++ + R+ + T + ++YA N F E + Sbjct: 99 QYGSTVKNEFSLKTFKRKSDVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 157 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG ++A+ G +++AIGV + + Sbjct: 158 --ENVPRVIMIVTDGRPQDSVA------EVASKARNTGILIFAIGVGQVDLNTLKAIGSE 209 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 210 PHKDHVFLVANFSQIESLTSVFQN 233 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 76/205 (37%), Gaps = 31/205 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + ++D + P R GL+ +S Sbjct: 608 TEGPIDLVFVIDGSKSLGEE------NFEIVKHFVTGIIDSLAVSP---KAARVGLLQYS 658 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G ++ + G + + L++ Y + F E + Sbjct: 659 TQVRTEFTLK-GFSSAKDMKKAVAHMKYMGKGSMTGLALKHMYERSFTQVEGARPL---S 714 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + +AK G +YA+G+ ++ L+ AS Sbjct: 715 TRVPRAAIVFTDGRAQD------DVSEWARKAKANGITMYAVGIGKAIEEE-LQEIASEP 767 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ I +++ Sbjct: 768 IDKHLFYAED----FSTMGEISEKL 788 >gi|293348660|ref|XP_001058523.2| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 922 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PD+ R GL+ Sbjct: 48 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFILDILQFLDIGPDI---TRVGLL 98 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L + ++ + R+ + T + ++YA N F E + Sbjct: 99 QYGSTVKNEFSLKTFKRKSDVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 157 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG ++A+ G +++AIGV + + Sbjct: 158 --ENVPRVIMIVTDGRPQDSVA------EVASKARNTGILIFAIGVGQVDLNTLKAIGSE 209 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 210 PHKDHVFLVANFSQIESLTSVFQN 233 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 76/205 (37%), Gaps = 31/205 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + ++D + P R GL+ +S Sbjct: 636 TEGPIDLVFVIDGSKSLGEE------NFEIVKHFVTGIIDSLAVSP---KAARVGLLQYS 686 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G ++ + G + + L++ Y + F E + Sbjct: 687 TQVRTEFTLK-GFSSAKDMKKAVAHMKYMGKGSMTGLALKHMYERSFTQVEGARPL---S 742 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + +AK G +YA+G+ ++ L+ AS Sbjct: 743 TRVPRAAIVFTDGRAQD------DVSEWARKAKANGITMYAVGIGKAIEEE-LQEIASEP 795 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ I +++ Sbjct: 796 IDKHLFYAED----FSTMGEISEKL 816 >gi|149066552|gb|EDM16425.1| matrilin 2 (predicted) [Rattus norvegicus] Length = 898 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PD+ R GL+ Sbjct: 48 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFILDILQFLDIGPDI---TRVGLL 98 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L + ++ + R+ + T + ++YA N F E + Sbjct: 99 QYGSTVKNEFSLKTFKRKSDVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 157 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG ++A+ G +++AIGV + + Sbjct: 158 --ENVPRVIMIVTDGRPQDSVA------EVASKARNTGILIFAIGVGQVDLNTLKAIGSE 209 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 210 PHKDHVFLVANFSQIESLTSVFQN 233 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 76/205 (37%), Gaps = 31/205 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + ++D + P R GL+ +S Sbjct: 608 TEGPIDLVFVIDGSKSLGEE------NFEIVKHFVTGIIDSLAVSP---KAARVGLLQYS 658 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G ++ + G + + L++ Y + F E + Sbjct: 659 TQVRTEFTLK-GFSSAKDMKKAVAHMKYMGKGSMTGLALKHMYERSFTQVEGARPL---S 714 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + +AK G +YA+G+ ++ L+ AS Sbjct: 715 TRVPRAAIVFTDGRAQD------DVSEWARKAKANGITMYAVGIGKAIEEE-LQEIASEP 767 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ I +++ Sbjct: 768 IDKHLFYAED----FSTMGEISEKL 788 >gi|117921591|ref|YP_870783.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117613923|gb|ABK49377.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 613 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 62/355 (17%), Positives = 125/355 (35%), Gaps = 50/355 (14%) Query: 40 HKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN 99 +++ + + + A + N + ++N F ++ I E+ + Sbjct: 81 RTMSAESRAYIAQPTASISAAPAL-NGDWPGAVPPERNRFEKQVQNGI---MVAGEIPVS 136 Query: 100 GFAQDINNIERST---SLSIIIDDQHKDYNLSAVSRY-----------EMPFIFCT--FP 143 FA D++ +T L Q + + Y + PF T P Sbjct: 137 TFAIDVDTGSYTTLRRMLKEGRLPQKDTLRVEEMLNYFSYDYPLPGKNDAPFSVTTELAP 196 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 N L I SK+++G ++ +LDVS SM DKL + +++ + Sbjct: 197 SPYNDDMMLLRIGLKGYEQSKAELGASNLVFLLDVSGSMA-----SPDKLPLLQTALKML 251 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + + V+ VV +G +V Q + + +L G +T G++ A Sbjct: 252 TQQLDAQDKVSIVVYAGAAG----VVLDGAAGNDTQTLNYALEQLSAGGSTNGAQGIQLA 307 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y +H +G + +I TDG+ + + E + + K++G + + Sbjct: 308 YQL------AQKHFVEGGINR---VILATDGDFNVGTTNLDELIDLVSARKQQGIGLTTL 358 Query: 323 GV-QAEAADQFLKNCASPDR--FYSVQNSRK--------LHDAFLRIGKEMVKQR 366 G + D ++ A + + + + L L I KE+ Q Sbjct: 359 GFGMGDYNDHLMEQLADKGNGQYAYIDSINEARKVLVEHLSATLLTIAKEVKVQV 413 >gi|170727657|ref|YP_001761683.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169813004|gb|ACA87588.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 640 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 80/210 (38%), Gaps = 21/210 (10%) Query: 164 KSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 K+D+G ++ +LDVS SM DKL + +++ + + ++ VV +G Sbjct: 246 KADLGASQLVFLLDVSGSM-----SSQDKLPLLKNALKMLSQQLDEGDRISIVVYAGASG 300 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V I + +++L G +T G+E AY +H G + Sbjct: 301 ----VVLDGVKGNDTLAISQALDKLKAGGSTNGGAGIELAYQL------AQKHFIAGGVN 350 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDR 341 +I TDG+ + D + E +++G + +G D ++ A Sbjct: 351 R---VILATDGDFNVGVSDQQALEDMIEEKRKQGIALTTLGFGQGNYNDHLMEQLADKGN 407 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + L++A + E+ + K Sbjct: 408 GHYAY-IDTLNEARKVLVDEISATLLTIAK 436 >gi|297482040|ref|XP_002692521.1| PREDICTED: matrilin 4 [Bos taurus] gi|296480952|gb|DAA23067.1| matrilin 4 [Bos taurus] Length = 584 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 44/199 (22%), Positives = 79/199 (39%), Gaps = 26/199 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S LD++ V+D S S+ + + + +L + P N R G++ +S Sbjct: 31 SSGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRSLDVGP---NATRVGVIQYS 81 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ I ++ T + ++YA N F E Sbjct: 82 SQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEA 138 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 139 HVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 191 Query: 340 -DRFYSVQNSRKLHDAFLR 357 + + V++ L F R Sbjct: 192 NEHVFLVESF-DLIQEFGR 209 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 346 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 396 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 397 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 451 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + AK G ++YA+GV ++ L+ AS Sbjct: 452 PRVGLVFTDGRSQD------NISVWAARAKEEGIVMYAVGVGKAVEEE-LREIAS 499 >gi|300796915|ref|NP_001178240.1| matrilin-4 [Bos taurus] Length = 584 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 44/199 (22%), Positives = 79/199 (39%), Gaps = 26/199 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S LD++ V+D S S+ + + + +L + P N R G++ +S Sbjct: 31 SSGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRSLDVGP---NATRVGVIQYS 81 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ I ++ T + ++YA N F E Sbjct: 82 SQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEA 138 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 139 HVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 191 Query: 340 -DRFYSVQNSRKLHDAFLR 357 + + V++ L F R Sbjct: 192 NEHVFLVESF-DLIQEFGR 209 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 346 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 396 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 397 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 451 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + AK G ++YA+GV ++ L+ AS Sbjct: 452 PRVGLVFTDGRSQD------NISVWAARAKEEGIVMYAVGVGKAVEEE-LREIAS 499 >gi|193216292|ref|YP_001997491.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193089769|gb|ACF15044.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 346 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 65/166 (39%), Gaps = 20/166 (12%) Query: 165 SDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+++++ LDVS SM D P +L + +I L+ + + R GLV F Sbjct: 85 KRKGIEVVIALDVSNSMLADDIQP--SRLQKSKYTISNFLERLGN-------DRVGLVVF 135 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P+ ++ ++ + I T + + + + +E E K Sbjct: 136 AGQSFVQCPITSDKSALKLFMDIVSTDAIPTQGTNFSSAIRESIRALERIEEGAEAEEKN 195 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 K I+ +DGE+ ID EA + +Y +GV Sbjct: 196 -RVRNKVILIFSDGEDHEAGID-----EVLEEAASKNIRIYTVGVG 235 >gi|320450208|ref|YP_004202304.1| von Willebrand factor, type A [Thermus scotoductus SA-01] gi|320150377|gb|ADW21755.1| von Willebrand factor, type A [Thermus scotoductus SA-01] Length = 414 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 82/245 (33%), Gaps = 30/245 (12%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK----SDIGLDMMMVL 175 K A R E+ + P + + + VKI L++ VL Sbjct: 2 KNKKKAVSQANPRPELELLPLK-PGVRATGPTRMPMLLRVKIPPVQAEVERPPLNLAFVL 60 Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--L 233 D S SM DKL A +++ ++ ++ R +V + ++ P L Sbjct: 61 DRSGSMAG------DKLKFAKKAVAYAVE------NLRPHDRVAVVIYDHQVEVVVPSTL 108 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 A + I ++ + +T G + + + H+ + +I L+DG Sbjct: 109 AENKEEILRRLRPVRPRGSTNLHAGW------LEGSTQVAAHLDAKRLNR---VIVLSDG 159 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKL 351 ++ + +RG +GV + + + A +Y +++ L Sbjct: 160 LANTGETNPNVIAEQVRGLSQRGVSTSTLGVGLDYNEDLMMAMAEAGQGNYYFIESPDDL 219 Query: 352 HDAFL 356 F Sbjct: 220 PGIFA 224 >gi|295398785|ref|ZP_06808791.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] gi|294972971|gb|EFG48792.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] Length = 516 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 42/237 (17%), Positives = 76/237 (32%), Gaps = 46/237 (19%) Query: 166 DIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D+++VLD S SMN +L + + + P+ R +V+FS Sbjct: 77 QAPVDVVLVLDRSGSMNFVETPNSPTRLDYGKLAAINFAERV-LGPNGIPGSRVSVVSFS 135 Query: 225 S-------------------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + L+ ++ + + INR+ T + G E + Sbjct: 136 GPAYATGVRNNPQRHYGQQNQATTDLDLSSDLRAVTDSINRITAFGGTNTEAGFEQGRSV 195 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE-----AKRRG---- 316 I + + + K +I LTDG ++ N + N A G Sbjct: 196 I------EGTTSNQNPNSNKVVIMLTDGLPTASNGNPYAETTDINHVHIQRAINAGKNIY 249 Query: 317 ----AIVYAIGV---QAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 A V+ IG+ N A +Y ++ L F I + + Sbjct: 250 QNDIADVFTIGLTTGMNATEKALADNILTQAQNKGYYPAPSATDLDAIFEEISQRLG 306 >gi|283455087|ref|YP_003359651.1| fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] gi|283101721|gb|ADB08827.1| Fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] Length = 1256 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 50/257 (19%), Positives = 90/257 (35%), Gaps = 63/257 (24%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP---GMDKLGVATRSIREMLDIIKSI-- 209 ++ + + +D +VLDVS SM+D + K+ ++ L I Sbjct: 577 TGAANSSTITTTQSVDFTLVLDVSSSMSDEMDSDQGSIKKMTALKSAVNNFLGEAAEINE 636 Query: 210 PDVNNVVRSGLVTFSS-------------------KIVQTFPLAWGVQHIQEKINRLIFG 250 + ++R GLV F+ PL + ++ K++ L Sbjct: 637 QSGSELIRVGLVKFAGKESSKVGNETYTEGRFVYNYSQIVSPLTADMSDLKNKVSALRHN 696 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SL 306 T++ G ++A + A+ K+ +IF TDG + + +K+ ++ Sbjct: 697 GATRADLGFKHASTVMSGARTDA----------KRVVIFFTDGTPTKVSDFDKDVANSAV 746 Query: 307 FYCNEAKRRGAIVYAIGVQAEAA---------DQFLKNCAS----------------PDR 341 Y K GA VY+IGV A +QF+ +S Sbjct: 747 TYAKSLKDSGATVYSIGVFDGANPSSIEENQKNQFMNAVSSNYPHATAYDKLGTGSNAGY 806 Query: 342 FYSVQNSRKLHDAFLRI 358 + V N L F +I Sbjct: 807 YKVVSNVSDLKSIFEKI 823 >gi|281338025|gb|EFB13609.1| hypothetical protein PANDA_007564 [Ailuropoda melanoleuca] Length = 901 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 71/208 (34%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L++FS Q Sbjct: 244 VIFVIDKSGSMSG------RKMQQTREALIKILDDLSPKDQFN------LISFSGDAAQW 291 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL A V + + T + A + AK+K Sbjct: 292 KPLLVPASAENVNQARSYAAGIQAHGGTDINEAVLMAVQLLNSAKQKELMPEGTVS---- 347 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + + FL+ A + Sbjct: 348 LIILLTDGDPTMGETNPARIQRNVKEAIDGQYSLFCLGFGFDVSYAFLEKLALDNGGLAR 407 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 + ++ +L D + + ++ Sbjct: 408 RIYEDSDSALQLQDFYEEVANPLLTAVT 435 >gi|256820365|ref|YP_003141644.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256581948|gb|ACU93083.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 347 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 60/174 (34%), Gaps = 20/174 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 KI + G+D++ +DVS SM ++L A R E + +K R Sbjct: 80 TKIETVKREGVDIVFAIDVSKSMLAEDVAP-NRLEKAKRIAFETISQLKG-------DRV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G+V +++ L + + + + ++ A + Sbjct: 132 GIVAYAASAYPQLALTTDHSAAKMFLQDMNTDMLSSQGTAIQEAI-------RMASNYFD 184 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + + LTDGE+ + EA+ +G +Y IG+ E Sbjct: 185 ENTPTARLLFILTDGEDHE-----MGATEIATEAQEKGVHIYTIGIGTEKGAPI 233 >gi|70733679|ref|YP_257319.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68347978|gb|AAY95584.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 582 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 77/213 (36%), Gaps = 26/213 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW ++ + I +S + ++ +++ ++DVS SM+ G +S ++ Sbjct: 175 PWNPHTRLLRIGIKASDRAVAEL-APANLVFLVDVSGSMDRREGLP------LVKSTLKL 227 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLE 260 L + + + R LV ++ + G I+ I++L G +T G++ Sbjct: 228 L-----VDQLRDQDRVSLVVYAGESRVVLEPTSGRDKAKIRTAIDQLTAGGSTAGASGIQ 282 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY I+ TDG+ + D E ++ G + Sbjct: 283 LAYQMAQQGFIDQGINR---------ILLATDGDFNVGVSDFDSLKAMAAEKRKSGVSLT 333 Query: 321 AIGVQAEAADQFLK---NCASPDRFYSVQNSRK 350 +G + ++ L A + + N R+ Sbjct: 334 TLGFGVDNYNEHLMEQLADAGDGNYAYIDNLRE 366 >gi|332238409|ref|XP_003268390.1| PREDICTED: matrilin-2 isoform 3 [Nomascus leucogenys] Length = 915 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKAIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 609 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 659 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 660 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 714 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 715 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 768 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 769 TNKHLFYAED----FSTMDEISEKLKK 791 >gi|332238407|ref|XP_003268389.1| PREDICTED: matrilin-2 isoform 2 [Nomascus leucogenys] Length = 937 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKAIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|332238405|ref|XP_003268388.1| PREDICTED: matrilin-2 isoform 1 [Nomascus leucogenys] Length = 956 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKAIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|325919992|ref|ZP_08181973.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] gi|325549526|gb|EGD20399.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] Length = 142 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 44/127 (34%), Gaps = 11/127 (8%) Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + D R GL+ F + PL + +++++ S GL Sbjct: 23 LSDFLDRREGDRVGLLVFGQRAYALTPLTADLTSVRDQL--------ADSVVGLAGRETA 74 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 I DA + ++ ++ LTDG N++ ++ L AK G V+ I Sbjct: 75 IGDAIALSVKRLREQKQGQRVVVLLTDGVNTAGVLNP---LKAAELAKAEGVRVHTIAFG 131 Query: 326 AEAADQF 332 D Sbjct: 132 GSGGDSL 138 >gi|297683362|ref|XP_002819353.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-2-like [Pongo abelii] Length = 935 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKAIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 648 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 698 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 699 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 753 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 754 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 807 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 808 TTKHLFYAED----FSTMDEISEKLKK 830 >gi|224048789|ref|XP_002188138.1| PREDICTED: matrilin 3 [Taeniopygia guttata] Length = 284 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 76/202 (37%), Gaps = 28/202 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + LD++ ++D S S+ + + EM+D + R ++ ++ Sbjct: 28 KNQPLDLVFIVDSSRSVRPE------EFEKVKIFLSEMIDTLDV---GERTTRVAVMNYA 78 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + FPL + ++E ++R+ + T + ++ A ++F + A Sbjct: 79 STVKVEFPLRTYFDKASMKEAVSRIEPLSAGTMTGLAIQTAMEEVFTEEMGTRPAAFN-- 136 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + +I +TDG D A+ G +Y +GV A Q L+ AS Sbjct: 137 -IPRVVIVVTDGRPQDQVQD------VAASARTAGIEIYTVGVG-RADMQALRIMASEPL 188 Query: 339 PDRFYSVQN---SRKLHDAFLR 357 + + V+ KL F Sbjct: 189 DEHVFYVETYGVIEKLTSRFRE 210 >gi|308070278|ref|YP_003871883.1| hypothetical protein PPE_03528 [Paenibacillus polymyxa E681] gi|305859557|gb|ADM71345.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 695 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 44/208 (21%), Positives = 71/208 (34%), Gaps = 36/208 (17%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G D + VLD S SM D G+ E++++ + D + R G V ++ + Sbjct: 45 GYDAVFVLDTSYSMRDTDPEGISA---------EVINMFMDLSDADRT-RVGFVAYNHNV 94 Query: 228 VQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 V + PL IQ+ I L T GL + Sbjct: 95 VASKPLTSIAVAAQKSQIQQDIRTLNRSGYTDLGLGLRRGSELLAAGA---------SQG 145 Query: 283 YKKYIIFLTDGENSSPNIDNKESL--------FYCNEAKRRGAIVYAIGVQAEA--ADQF 332 + ++I L+DGE S A+ +G VY IG+ + Q Sbjct: 146 RQPFMILLSDGETDFGASSGSRSKGDSNNDVSSVIKSAQTKGYPVYTIGLNHDGTVNRQE 205 Query: 333 LKNCAS--PDRFYSVQNSRKLHDAFLRI 358 L+ AS + ++ L + RI Sbjct: 206 LERIASQTGGASFITSSAEDLPEILNRI 233 >gi|260810222|ref|XP_002599902.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] gi|229285186|gb|EEN55914.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] Length = 1201 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 51/279 (18%), Positives = 88/279 (31%), Gaps = 31/279 (11%) Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII 118 + NGN + + S I N+ + + + S S Sbjct: 561 GCGEQYAINNGNYARCRTEYNSDPYINNVLNCNPSSRIAAVANLAFSAGATSSRSFGFHT 620 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 + S + FP + ++ SS LD+ +LD S Sbjct: 621 CGTNNCQRGIRESNLGLYNTQGAFPNGDRIVYTACRGSA---GSSSCAAPLDLFFLLDGS 677 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 S+N + + V +++ + R G+V +S + F L V Sbjct: 678 GSVNAANFVKVKQFAV---------NVVNTFDVSLTATRVGVVQYSDRNTLVFNLGNKVN 728 Query: 239 ---HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + N + T + L+Y ++ A + K II LTDG++ Sbjct: 729 KPSTVSAINNIVYQSGGTNTGAALQY----------VRQYAAWRGGNVPKVIIVLTDGKS 778 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 S ++L G VYAIGV + Q L+ Sbjct: 779 SDSVSGPSQNLVAA------GVEVYAIGVGSFDHGQLLQ 811 >gi|114562801|ref|YP_750314.1| vault protein inter-alpha-trypsin subunit [Shewanella frigidimarina NCIMB 400] gi|114334094|gb|ABI71476.1| Vault protein inter-alpha-trypsin domain protein [Shewanella frigidimarina NCIMB 400] Length = 722 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 45/339 (13%), Positives = 120/339 (35%), Gaps = 35/339 (10%) Query: 39 SHKFFVKAKLHYILDHSL---LYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNE 95 ++ + LD + + ++ + ++ + KQ N + + Sbjct: 216 ANTATSAKHVRPALDVKMQVNIDAGFELTSLDSLYHPIKQSNV-----GNHYSVNFAGKQ 270 Query: 96 LRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLI 155 + + F + + K + P N +L+ Sbjct: 271 IADRDFVLQWQANVGAVPKAATFYQTGKTHLADNSDERSETAQRQPNPVDNNMYSLVMLM 330 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 SV++S + I ++++V+D S SM+ + A ++++ L ++ I N Sbjct: 331 PPSVEVSEQHLIARELILVIDTSGSMSGQ------SITQAKQALQFALAGLRDIDSFN-- 382 Query: 216 VRSGLVTFSSKIV--QTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ F+S + PL + + I L T+ L+ A + D+ Sbjct: 383 ----IIEFNSDVTMLSATPLSANSRNIGKANRFIQSLDADGGTEMRSALQTA---LVDSV 435 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 ++ H + + +IF+TDG + +++ ++ ++ +G+ + Sbjct: 436 QQDSDQTDAHSEMLRQVIFMTDG---AVGNEHELYQLINDQLGDS--RLFTVGIGSAPNS 490 Query: 331 QFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVKQRI 367 F++ A+ R F + N ++ ++ ++ + + Sbjct: 491 DFMRRAATMGRGTFTYIGNESEVQQKIEQLLNKIEQPVL 529 >gi|77456411|ref|YP_345916.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77380414|gb|ABA71927.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 563 Score = 71.0 bits (172), Expect = 2e-10, Method: Composition-based stats. Identities = 41/225 (18%), Positives = 84/225 (37%), Gaps = 26/225 (11%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S + + PW ++ + I +S + ++ +++ ++DVS SM+ G Sbjct: 156 SPFGVTTELAASPWNPHTRLLRIGIKASDRAVAEL-APANLVFLVDVSGSMDRREGLP-- 212 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLI 248 +S ++L + + R LV ++ + G + I+ I RL Sbjct: 213 ----LVKSTLKLL-----VDQLREQDRVSLVVYAGESSVVLEPTSGREKAKIRTAIERLT 263 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G +T G+E AY + I KG + I+ TDG+ + D Sbjct: 264 AGGSTAGASGIELAYQMA-----QQAFIPKGINR----ILLATDGDFNVGTSDFDSLKQM 314 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLK---NCASPDRFYSVQNSRK 350 + ++ G + +G + ++ L A + + N R+ Sbjct: 315 AVDKRKTGISLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDNLRE 359 >gi|26352386|dbj|BAC39823.1| unnamed protein product [Mus musculus] Length = 902 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 410 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 461 YANKDLNSLIDAFSRIS 477 >gi|2623767|gb|AAB86531.1| Lu-ECAM-1 [Bos taurus] Length = 820 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 46/193 (23%), Positives = 77/193 (39%), Gaps = 36/193 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKL-GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +VLD S SM+ D+L + + ++ +I+ V G+VTF S Sbjct: 310 VCLVLDKSGSMSAE-----DRLFQMNQAAELYLIQVIEKGSLV------GMVTFDSVAEI 358 Query: 230 TFPLAWGVQ-HIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L ++ +KI + T GL+ + I + + Sbjct: 359 QNHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQSTSGSE-------- 410 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 II LTDGE++ N C + KR GAI++ I + AA + L N RF Sbjct: 411 -IILLTDGEDNEINS--------CFEDVKRSGAIIHTIALGPSAAKELETLSNMTGGYRF 461 Query: 343 YSVQNSRKLHDAF 355 ++ ++ L +AF Sbjct: 462 FANKDITGLTNAF 474 >gi|159897645|ref|YP_001543892.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159890684|gb|ABX03764.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 562 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 71/185 (38%), Gaps = 24/185 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + ++ + +D+ +++D S SM ++L A ++ + +DI +V + Sbjct: 372 AITQLWQQHKKQVDVALIIDTSGSMRQE-----NRLREAKTALGDFIDIFADQDNVQVTI 426 Query: 217 RSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 FS+ + L+ + +I+ L+ T+ + Y I E Sbjct: 427 ------FSTNATELSDLSPIGPKRADLHTRIDGLVADGETRLYSTIGEVYTDIQQQTEVQ 480 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF 332 A ++ LTDGE+++ ++ ++ + ++ I ++A + Sbjct: 481 RIRA---------LVVLTDGEDTASSLSLEQLNEQIRQDESGTSIKIFTIAYGSDANQEV 531 Query: 333 LKNCA 337 L+ A Sbjct: 532 LQRIA 536 >gi|330447678|ref|ZP_08311326.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328491869|dbj|GAA05823.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 257 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 46/252 (18%), Positives = 94/252 (37%), Gaps = 21/252 (8%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 I + K + VS +P + + + A ++ P + ++ SS + + + +++ Sbjct: 17 INKSMKDKPTSQQLVSAPSLPDVSTGYGFDALKNNWP-TLNNTQTSSSDNWLAANYLLIF 75 Query: 176 DVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQTFPL 233 D S SM++ + G G K+ A +++ ++ I +V GL F ++ PL Sbjct: 76 DGSGSMDNTNCGNGQRKIVAAKEAMQTFINDIPQDANV------GLYVFDNADSSLRVPL 129 Query: 234 A-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 +++ I + G TT L Y + EK G+ +Y I+ TD Sbjct: 130 GINNRATLKQAIYDVKAGGTTPLKSSLTSGYTAL----EKQAEKQLGYGEYNVVIV--TD 183 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLH 352 G+ ++ + + + ++ IG N + S N KL Sbjct: 184 GD---ASVGEEPEVAISRIYQNSPVTIHTIGFCIGNRHAL--NAEGITYYQSANNPEKLL 238 Query: 353 DAFLRIGKEMVK 364 + E + Sbjct: 239 AGLQSVLAESAQ 250 >gi|291409921|ref|XP_002721255.1| PREDICTED: matrilin 4 [Oryctolagus cuniculus] Length = 346 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 43/199 (21%), Positives = 79/199 (39%), Gaps = 26/199 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ V+D S S+ + R + ++ + P N R G++ +S Sbjct: 31 HTGPLDLVFVIDSSRSVRPL------EFETMRRFLVGLVRSLDVGP---NATRVGVIQYS 81 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ + L+ T + ++YA N F E + Sbjct: 82 SQVQSVFPLGAFSRREDMERALRTLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEE 138 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 139 RVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 191 Query: 340 -DRFYSVQNSRKLHDAFLR 357 + + V++ L F R Sbjct: 192 DEHVFLVESF-DLIQEFGR 209 >gi|115375477|ref|ZP_01462737.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|310821370|ref|YP_003953728.1| von willebrand factor, type a [Stigmatella aurantiaca DW4/3-1] gi|115367520|gb|EAU66495.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|309394442|gb|ADO71901.1| Von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] Length = 562 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 79/221 (35%), Gaps = 27/221 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P+ + + + V S+ + +V D S SM+ DKL +A +++ Sbjct: 189 PFDPSRHFLRVGVQGKVVSRSQRKPAHLVFLV-DTSGSMH-----SQDKLPLAKEAMKV- 241 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLE 260 ++ ++N +VT++ P VQ I I+ L G T G+E Sbjct: 242 -----AVRNLNENDTVAIVTYAGSTQDVLPPTPATEVQRIHTAIDLLQSGGGTAMGSGME 296 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIV 319 AY + + + I+ LTDG+ N PN+ + L + G + Sbjct: 297 LAY--------RHAVKKASGNAISRVIV-LTDGDANIGPNLSAESMLSGIEKYVAEGVTL 347 Query: 320 YAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 IG D ++ A + V + ++ F Sbjct: 348 STIGFGMGNYRDDLMERLADKGNGNCFYVDSYQEAKKVFEA 388 >gi|320334211|ref|YP_004170922.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319755500|gb|ADV67257.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 609 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 64/199 (32%), Gaps = 25/199 (12%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + + L++ +V+D S SM L A ++ + +D + ++ + Sbjct: 30 VPNATRRPLNVALVIDRSGSMAGS------PLRYALKAAADFVDRLTETDVLS------I 77 Query: 221 VTFSSKIV--QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 V + + I++ + + G T + G + A+ Sbjct: 78 VVYDDDVDTLLDAQPVRDKAAIKDLLKGVRAGGITNLSGGWLRGCELVAGARRADAVNR- 136 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-- 336 ++ LTDG+ + D + G +G + + L Sbjct: 137 --------VLLLTDGQANHGVTDTGVLIKTAASKAEAGVSTTTLGFGSSFEEDLLIGMAR 188 Query: 337 ASPDRFYSVQNSRKLHDAF 355 AS FY +Q+ D F Sbjct: 189 ASGGNFYFIQSMDDAADVF 207 >gi|113971308|ref|YP_735101.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113885992|gb|ABI40044.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 624 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 66/340 (19%), Positives = 120/340 (35%), Gaps = 49/340 (14%) Query: 55 SLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST-- 112 + +A LN + ++N F ++ I E + FA D++ +T Sbjct: 107 AATISAAPALNGDWPGAVPPERNRFEKQVQNGI---MVAGETPVSTFAIDVDTGSYTTLR 163 Query: 113 -SLSIIIDDQHKDYNLSAVSRY-----------EMPFIFCT--FPWCANSSHAPLLITSS 158 L Q + + Y E PF T P N L I Sbjct: 164 RMLKEGRLPQKDTLRVEEMLNYFSYDYPLPSKNEAPFSVTTELAPSPYNYDMMLLRIGLK 223 Query: 159 VKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 SK+++G ++ +LDVS SM DKL + +++ + + + V+ VV Sbjct: 224 GYEQSKAELGASNLVFLLDVSGSMA-----SPDKLPLLQTALKMLTQQLGAQDKVSIVVY 278 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +G +V Q + + +L G +T G++ AY +H+ Sbjct: 279 AGAAG----VVLDGAAGNDSQTLNYALEQLSAGGSTNGAQGIQLAYQL------AKKHLV 328 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNC 336 +G + +IF TDG+ + + E + + K+ G + +G + D ++ Sbjct: 329 EGGINR---VIFATDGDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFGMGDYNDHLMEQL 385 Query: 337 ASPDR----FYSVQNS------RKLHDAFLRIGKEMVKQR 366 A + N +L L I KE+ Q Sbjct: 386 ADKGNGQYAYIDSLNEARKVLVEQLSATLLTIAKEVKVQV 425 >gi|293396639|ref|ZP_06640915.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] gi|291420903|gb|EFE94156.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] Length = 325 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 71/203 (34%), Gaps = 33/203 (16%) Query: 170 DMMMVLDVSLSMNDHFGP-GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++++LDVS SM + P G+ +L S+ + + R GLV F+++ Sbjct: 97 DLVLILDVSGSMAKNDVPGGITRLQAVKNSVSKFVAA-------RQSDRIGLVIFANQAW 149 Query: 229 QTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P++ Q +Q +I +L G T L A + + + D K Sbjct: 150 PFAPVSEDKQALQTRITQLSPGMVGEQTAIGDALGVAVKLLDSSA---------NQDASK 200 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----------QFLKN 335 I LTDG +++ + + A V+ I + Q + Sbjct: 201 LAILLTDGNDTASQLAPPLAAQL---AAAHHVQVHTIAFGDSNSAGSDHVDLTQLQEIAR 257 Query: 336 CASPDRFYSVQNSRKLHDAFLRI 358 + + + L + I Sbjct: 258 ITGGKSWTAANSGASLDSVWQEI 280 >gi|226326038|ref|ZP_03801556.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] gi|225205580|gb|EEG87934.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] Length = 275 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 66/189 (34%), Gaps = 23/189 (12%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW---GVQHIQE 242 D+ ++ + + LVTF+ + F +I E Sbjct: 71 NDANDRFYYLKQAATNFTTQLAQSSPNSE---IALVTFNKTATEQFDFKNVGKDSAYITE 127 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 IN + T GL+ AY + + + + K+Y++ LTDG + D Sbjct: 128 TINAMETSGGTHQNEGLDRAYKILNNDQ--------NTSNLKRYVVLLTDGCPNGVTYDQ 179 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAE-------AADQFLKNCASPDRFYSVQNSRKLHDAF 355 + N+ K + +GV + AA +L+ A + Y+ ++ L+ F Sbjct: 180 ITTSI--NKIKSTNTKLITVGVGLDETNTGLKAAKDYLQANADDNMAYNANDASHLNTIF 237 Query: 356 LRIGKEMVK 364 +I + Sbjct: 238 TQILGQTTN 246 >gi|116249091|ref|YP_764932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253741|emb|CAK12134.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 465 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 44/312 (14%), Positives = 107/312 (34%), Gaps = 17/312 (5%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F R + G+++I+ A+ L + + +G + + V+ ++ LD +L+ + Sbjct: 25 FKAFRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAAVKE 84 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + +K + F ++ + D + + + +T + I Sbjct: 85 IDTDDTDALKEKVADWFHAQVENSYTLGDIDIDTSNHKITATASGTVPTTLMKI---ANI 141 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM--------- 173 ++S S + P + + +L+ ++ S G+ Sbjct: 142 DTVDVSVASAVKGPATSYLNVYIVIDTSPSMLLAATTAGQSAMYSGIGCQFACHTGDAHT 201 Query: 174 VLDVSLSMNDHFGPGMD---KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 V + N + + VA ++R++LD+I + ++ GL + + + Sbjct: 202 VGKTKYANNYEYSAAKTIKLRADVAGDAVRDVLDMIDDSDSNHQRIKVGLYSLGDTLTEV 261 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK--LEHIAKGHDDYKKYII 288 + ++ +G T+ ++ Y + K+K K ++ Sbjct: 262 LTPTLSTDTARNRLADASYGLTSATSKAATYFDVSLATLKQKVGTGGDGTSSGSPLKLVL 321 Query: 289 FLTDGENSSPNI 300 LTDG S Sbjct: 322 LLTDGVQSQREW 333 >gi|84515372|ref|ZP_01002734.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] gi|84510655|gb|EAQ07110.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] Length = 485 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 67/468 (14%), Positives = 140/468 (29%), Gaps = 118/468 (25%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + GS+ I+T +LL + I+ G+ ++ +A L + D ++L A+ + Sbjct: 25 FGRDEDGSVIIMTILLLVTMLIMGGMAVDFMRYEARRATLQSVSDRAVLAAAS---LNQT 81 Query: 69 GNNGKKQKNDFSYRIIKNIWQT-------------DFRNELRENGFAQDINNIERSTSLS 115 ++ ++ F+ N R+ L N F + ++R T+ + Sbjct: 82 LDSRDVVEDYFAKAGFPNALVGAPIVVDNGNSRTVTVRSALDVNTFYLRLAGMDRLTAPA 141 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + ++ N A + V + + + + + Sbjct: 142 -RSSATEGVGKVEISLVLDISGSMRFSNRFVNMQAAAIAFAEEVLDPANGGT-VSLTI-I 198 Query: 176 DVSLS----------MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV--------- 216 + + M P D L I D P V++ V Sbjct: 199 PYAGATNPGPEMFAFMGGVRYP--DTLLAGDDGILGTEDDY-FFPQVSSCVEMVGSDWSS 255 Query: 217 -------RSGLVTFSSKIVQTFPLAWGV---------------QHIQEKINRLIFGSTTK 254 R+ + F + + WG + IN L T Sbjct: 256 AGLPGAGRAQVPHFQVWDIARSVMDWGWCPQDRSSIQYAMATPAQARSFINGLRMHDGTG 315 Query: 255 STPGLEYAYNKIFDAKEKL----EHIAKGHDDY--------------KKYIIFLTDGENS 296 + ++YA + + + H +G KK I+ +TDG+ + Sbjct: 316 THYAMKYALATLDPSSQPAFMHLSHPGRGLVPPQFANRPAAWDDPETKKIIVLMTDGDIT 375 Query: 297 ------------------SPNIDNKESL-FYCNEAKRRG---------------AIVYAI 322 S +I+ +++ + + A G VY + Sbjct: 376 QQERPRIAQQERDIDYIISRSINGRDNRGQFVDAATNVGRFEAICTLANQPARSVDVYTV 435 Query: 323 GVQAEAADQF---LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + ++NCAS + + +L D F I + + R+ Sbjct: 436 AFEVQPNSAADLQMRNCASDPSMFFRTSGAELIDVFSGIAERITDLRL 483 >gi|195941051|ref|ZP_03086433.1| von Willebrand factor, type A [Escherichia coli O157:H7 str. EC4024] Length = 325 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 86/236 (36%), Gaps = 36/236 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-FGPGMDKLGVA 195 + + + P +T I ++M++LDVS SM + G+ +L Sbjct: 67 MFWLVWALMVCALARPEYLTPPQHI---EKPMRNIMLILDVSGSMEKNDVAGGLTRLQAV 123 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGST 252 +S+++ + R GLV F++ P++ Q ++ +I++L G Sbjct: 124 QQSVKKFVAA-------RKSDRIGLVIFANSAWPFAPVSEDKQALETRISQLTPGMAGQQ 176 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T L + + G + K I LTDG +++ + + A Sbjct: 177 TAIGDALGVTVKLL---------DSTGDKEASKLAILLTDGNDTASQLTPR---LAAQLA 224 Query: 313 KRRGAIVYAIGV---QAEAADQ----FLKNCA--SPDRFYSVQNS-RKLHDAFLRI 358 ++ I + D+ L++ A + R ++ +NS L + I Sbjct: 225 VSHHVQLHTIAFGDVNSSGDDKVDLNLLQDLARMTGGRSWTAENSGASLDAVWKEI 280 >gi|18700173|gb|AAL77698.1| At2g38970/T7F6.14 [Arabidopsis thaliana] Length = 692 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 83/219 (37%), Gaps = 37/219 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +IS +D++ VLD+S SM KL + R++ ++ + S R Sbjct: 243 QISRYPRAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND------RLS 290 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ FSS + FPL G Q + +N ++ T GL + D ++K Sbjct: 291 VIAFSSTARRLFPLTKMSDAGRQRALQAVNSVVANGGTNIAEGLRKGVKVMEDQRDKNPV 350 Query: 276 IAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFY------CNEAKRRGAIVYAIGVQAE 327 + II L+DG + D L C KR V++ G ++ Sbjct: 351 AS---------IILLSDGRATYTMNQADPNYKLLLPLSMHGCES-KRFQIPVHSFGFGSD 400 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + + + S F +++ + DA + IG + Sbjct: 401 HDASLMHSVSETSGGTFSFIESESVIQDALAQCIGGLLS 439 >gi|49225581|ref|NP_990438.1| collagen alpha-1(VI) chain precursor [Gallus gallus] gi|115314|sp|P20785|CO6A1_CHICK RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|62875|emb|CAA41062.1| collagen alpha 1 type VI [Gallus gallus] gi|63302|emb|CAA45788.1| collagen type VI alpha 1 subunit [Gallus gallus] gi|211354|gb|AAB59954.1| alpha-1 type VI collagen precursor [Gallus gallus] Length = 1019 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 69/206 (33%), Gaps = 26/206 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR- 217 ++ D +D+ VLD S S+ P D + +D + R Sbjct: 26 ARVLRAQDCPVDLFFVLDTSESVALRVKPFGDLVAQVKDFTNRFIDKLTE-----RYFRC 80 Query: 218 -------SGLVTFSSKIVQTFPLA---WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKI 266 +G + +S +V L G ++ ++ + G T + ++ ++ Sbjct: 81 DRFLAWNAGALHYSDSVVIIKDLTAMPSGRAELKNSVSAINYIGKGTHTDCAIKQGIERL 140 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQ 325 H KY+I +TDG + L NEAK G V+++ + Sbjct: 141 LLGG--------SHLKENKYLIVVTDGHPLEGYKEPCGGLDDAANEAKHLGIKVFSVAIS 192 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKL 351 DQ L A+ + + L Sbjct: 193 PHHLDQRLNIIATDHAYRRNFTATSL 218 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 73/210 (34%), Gaps = 34/210 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFSSKIV 228 D+M+++D S S+ ++ + + +++ + VR +V +S + Sbjct: 824 DIMLLVDSSTSVGSK------NFDTTKNFVKRLAERFLEASKPAEDSVRVSVVQYSGRNQ 877 Query: 229 Q--TFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 Q P I + ++ + F T L+Y + KK Sbjct: 878 QKVEVPFQRNYTVIAKAVDNMEFMNEATDVNAALQYIMGLYQRSSRSGAK--------KK 929 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA---EAADQFLKNCASPD-- 340 ++F +DG +S I + E ++ G VY + V + E + L S + Sbjct: 930 VLVF-SDG--NSQGITARAIERTVQEVQQAGIEVYVLAVGSQVNEPNVRVLVTGKSTNYD 986 Query: 341 ------RFYSVQNSRKLHDA--FLRIGKEM 362 + V + L + + +++ Sbjct: 987 VAYGERHLFRVPDYTSLLRGVFYQTVSRKI 1016 >gi|3560547|gb|AAC35003.1| chloride channel CaCC [Mus musculus] Length = 901 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 410 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 461 YANKDLNSLIDAFSRIS 477 >gi|149909171|ref|ZP_01897828.1| hypothetical protein PE36_09171 [Moritella sp. PE36] gi|149807695|gb|EDM67641.1| hypothetical protein PE36_09171 [Moritella sp. PE36] Length = 402 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 61/418 (14%), Positives = 123/418 (29%), Gaps = 107/418 (25%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G+I++ +LP I ++ + + + V + D ++L A + Sbjct: 5 QRQRGAITLTFTFMLPAIVSLLAITVFFAMYSQVVIRAGQAADSAVLACAYQ-------- 56 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 +N I + ++ + +K L++ Sbjct: 57 ---------------------------QNDTGVVTEGILDYYRPNFVLPELNKSVKLNSN 89 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI-------------GLDMMMVLDV 177 + ++ + P N+ + + V +S+S +D +VLD+ Sbjct: 90 NGCQISAQYRFEPAMVNALPVAIDSDTEVVSNSQSSAKLVQNVNVNGIQNPVDFSLVLDI 149 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIK-SIPDVNNVVRS-----------------G 219 S SM H + I + ++ SI V G Sbjct: 150 SGSMTWHLPELKKIITDVISDIVPSSNQVRFSIVPFQTGVGVSGAPWLLSSEASPKCVDG 209 Query: 220 LVTFSSKIVQTF---------------------------------PLAWGVQHIQEKINR 246 LV + + PL + + + Sbjct: 210 LVYRNGNLDADKTVQSLNYSSDRLDFNEVTPGRWLDRCSETSFILPLTNNLNRVIRYVES 269 Query: 247 L-IFGSTTKSTPGLEYAYNKIFDAKEKLEHI-AKGHDDYKKYIIFLTDGENSSPN-IDNK 303 L G +T S G + + D +K + + +I TDG+++ + ++ Sbjct: 270 LDTSGGSTASYQGFIWGVRTLTDQWQKEWQVTPVQSSSLTQRLILFTDGDDNRRDYFNDL 329 Query: 304 ESLFYCNEA-KRRGAIVYAIGVQ--AEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLR 357 S C+ + V IG A+ QF K CA + N+ +L D F Sbjct: 330 MSAGLCDVIQQDLNIQVSFIGFGVSADRIKQF-KQCAGRNGSVFDANNTAELADYFED 386 >gi|2623765|gb|AAB86530.1| Lu-ECAM-1 [Bos taurus] Length = 794 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 46/193 (23%), Positives = 77/193 (39%), Gaps = 36/193 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKL-GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +VLD S SM+ D+L + + ++ +I+ V G+VTF S Sbjct: 310 VCLVLDKSGSMSAE-----DRLFQMNQAAELYLIQVIEKGSLV------GMVTFDSVAEI 358 Query: 230 TFPLAWGVQ-HIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L ++ +KI + T GL+ + I + + Sbjct: 359 QNHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQSTSGSE-------- 410 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 II LTDGE++ N C + KR GAI++ I + AA + L N RF Sbjct: 411 -IILLTDGEDNEINS--------CFEDVKRSGAIIHTIALGPSAAKELETLSNMTGGYRF 461 Query: 343 YSVQNSRKLHDAF 355 ++ ++ L +AF Sbjct: 462 FANKDITGLTNAF 474 >gi|330469087|ref|YP_004406830.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812058|gb|AEB46230.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 316 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 65/209 (31%), Gaps = 32/209 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+ +DVSLSM P +L A + + ++ +P+ N GLV+F+ Sbjct: 89 IMLAIDVSLSMQADDVPP-TRLEGAQEAAK---QFVRELPETYN---VGLVSFAKSANVL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + I+ L+ T + + I I+ L Sbjct: 142 VPPTKDRPAVTNAIDGLVLAEATATGEAVFTCLEAIRSVPADGAAGIPPAR-----IVLL 196 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----DQFLKNC---------- 336 +DG ++ + + + V I +A L+ Sbjct: 197 SDGFRTAGRSVEEAAAAA----QAANVPVSTIAFGTDAGHVAIGGQLQRVPVDRMALAAL 252 Query: 337 --ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + FY + +L + +G + Sbjct: 253 AETTEGYFYEAASVSELKQVYQDMGSSIG 281 >gi|310643461|ref|YP_003948219.1| protein [Paenibacillus polymyxa SC2] gi|309248411|gb|ADO57978.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 696 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 47/238 (19%), Positives = 79/238 (33%), Gaps = 36/238 (15%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 F + + ++ + + S G D + VLD S SM D G+ A Sbjct: 15 FVLFFQTGLSGVICTVNQANAASLGTASIEGYDAVFVLDTSYSMRDTDPEGI-----AAE 69 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGST 252 I +D+ + + R G V ++ +V + PL IQ++I L Sbjct: 70 VISMFMDLSDA-----DRTRVGFVAYNHHVVASKPLTSIGVAAQKSQIQQEIRMLNRSGY 124 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL------ 306 T GL + + ++I L+DGE S Sbjct: 125 TDLGLGLRKGSELLAAGA---------SQGRQPFMILLSDGETDFGVSSGSRSKGDSNND 175 Query: 307 --FYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCAS--PDRFYSVQNSRKLHDAFLRI 358 A+ +G VY IG+ + Q L+ AS + ++ L + RI Sbjct: 176 VSSVIKSAQTKGYPVYTIGLNHDGTVNRQELERIASQTGGASFITSSAEDLPEILNRI 233 >gi|293335787|ref|NP_001168683.1| hypothetical protein LOC100382472 [Zea mays] gi|223948855|gb|ACN28511.1| unknown [Zea mays] gi|223949305|gb|ACN28736.1| unknown [Zea mays] gi|223949981|gb|ACN29074.1| unknown [Zea mays] gi|223950189|gb|ACN29178.1| unknown [Zea mays] gi|224028553|gb|ACN33352.1| unknown [Zea mays] Length = 731 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 47/216 (21%), Positives = 81/216 (37%), Gaps = 36/216 (16%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +D++ VLDVS SM KL + R++ ++ + S R ++ Sbjct: 281 STARAPVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIA 328 Query: 223 FSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FSS + FPL + +++ +N L T L I E AK Sbjct: 329 FSSSARRLFPLRRMTESGRQQSLLAVNSLTSNGGTNIAEALRKGSKVI------EERQAK 382 Query: 279 GHDDYKKYIIFLTDGENSSPNIDN----KESLFYC----NEAKRRGAIVYAIGVQAEAAD 330 II L+DG+++ K + YC + + V+ G A+ Sbjct: 383 NPVCS---IILLSDGQDTYTVSPTAGVHKGAPEYCALLPSTNGNQQVPVHVFGFGADHDS 439 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 L + + S F ++ + DAF + IG + Sbjct: 440 VSLHSISQTSGGTFSFIETEAAIQDAFAQCIGGLLS 475 >gi|198435216|ref|XP_002126368.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1274 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 57/334 (17%), Positives = 110/334 (32%), Gaps = 32/334 (9%) Query: 53 DHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQT--DFRNELRENGFAQDIN-NIE 109 D ++++ ++ + + + ++DI+ + Sbjct: 42 DSAVIHFGASVVIKTSPMKSIAYVGAPADGNKNGSVYKCSFSGKSYGNTSCSKDISFDNN 101 Query: 110 RSTSLSIIIDDQHKDYNL----SAVSRYEMPFIFCTFPWC----ANSSHAPLLITSSVKI 161 + +SI D + + S P C SS A + T V Sbjct: 102 AAIGMSIGADSSFSNLYICGNQHTTSCPTTPLQKRMVGACYKKPMTSSTATMFKTPCVPG 161 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 K + D+M VLD S S++D A I +++ S D ++ R G+ Sbjct: 162 CPKIILIADIMFVLDDSSSVDD------TAFRSALNWIIQVVSYFSSYIDSGDL-RVGVY 214 Query: 222 TFS---SKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 FS + L W ++++I L+ +T + + +A + E Sbjct: 215 GFSNDDHRSGIRIGLRKWTSATLKKQIGELLNVKSTGAGTYISHAIKETVKVFEANGRKG 274 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNC 336 K II LTDG D + A+ +G ++ ++GV DQ L Sbjct: 275 I-----SKEIILLTDG----GASDWWLLKGEADTARDKGIVLVSVGVGTSVNNDQLLAIA 325 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 + R + + L + + + +I N Sbjct: 326 GNKSRVFQATDYNTLDEVVNNVVSTIDAIKITVN 359 >gi|188580137|ref|YP_001923582.1| hypothetical protein Mpop_0869 [Methylobacterium populi BJ001] gi|179343635|gb|ACB79047.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 477 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 63/465 (13%), Positives = 133/465 (28%), Gaps = 124/465 (26%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 + GSI+I+ A+ L ++GL I+ K +L D ++L Sbjct: 18 LASDRGGSINIMFALALLPTLGLVGLGIDYGMAITSKTRLDNAADAAVLAGVVTAKEYIA 77 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N + +N F + FA + + + + + Sbjct: 78 SNAKQGDATAAGLTAGRNQATKAFAINTGKVPFATVSVS-------RLDVTRSGQTLTAT 130 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN------ 182 + + F ++++ T+++ S+ LD +++DVS SM Sbjct: 131 VIYTATIQNTFGKILGLSSTT-----FTNTITASADLASYLDFYLMVDVSGSMGLPTAAA 185 Query: 183 -------------------------DHFGPGMDKLGVATRSIRE-MLDIIKSI--PDVNN 214 + K+ + + ++ + +++K P V N Sbjct: 186 DAEKLASITKEDQGNCQFACHFPGRKGWNNAAGKIQLRSDAVNNAVCELLKRAATPVVPN 245 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-------------LIFGSTTKSTPG--- 258 R G F +++ PL+ + L+ +T+ G Sbjct: 246 QYRIGFYPFINRLATLSPLSDTTTSMTALRTAAQCDKTWPLAFTNLLDTGSTQLFTGNNP 305 Query: 259 ----------LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN--------- 299 E A ++ A + + K ++ +TDG +S + Sbjct: 306 TTGTGSGGTHFEKALPQM-KATIQPYGDGSSTTNSKPFVFLITDGMQNSQSYSTNNDART 364 Query: 300 ---------------IDNKESLFY----CNEAKRRGAIV------------YA------- 321 D + C E K GAI+ Y Sbjct: 365 FPGSPSLFKGYGNAGWDGSQPAQIDPSKCKELKDAGAIISILYIPYNQVKNYTNDSYIVW 424 Query: 322 ----IGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + + L+ CAS FY+ ++ + + + + Sbjct: 425 ENNRVNGFSPTLADPLRKCASQGFFYTANSADDITASLGAMFDQA 469 >gi|148680077|gb|EDL12024.1| mCG3350, isoform CRA_d [Mus musculus] Length = 902 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 410 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 461 YANKDLNSLIDAFSRIS 477 >gi|74209191|dbj|BAE24978.1| unnamed protein product [Mus musculus] Length = 902 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 410 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 461 YANKDLNSLIDAFSRIS 477 >gi|317419404|emb|CBN81441.1| von Willebrand factor A domain-containing protein 2 [Dicentrarchus labrax] Length = 761 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 77/198 (38%), Gaps = 22/198 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 VKI+S ++ + +D+ M+ + G + ++ + PD VR Sbjct: 3 IVKINSAGEM-MQCSAAMDILFLMDGSYSMGKGSFERSKHYAIKLCQALDVSPDK---VR 58 Query: 218 SGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLE 274 GL+ F S F L Q +++ + ++ + G +T++ L+Y K F+ Sbjct: 59 VGLIQFGSVPRLEFALDSHTTKQDLKKHMKKVSYRGGSTQTGLALKYVLRKGFEGGR--- 115 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + I L+DG + ++ + K G +++A+G++ ++ Sbjct: 116 ----NSSAAARIAILLSDGRSQG------NAVQAAAQLKETGVVLFAVGLRYPRWEELHA 165 Query: 335 NCASP--DRFYSVQNSRK 350 ++P + ++ Sbjct: 166 LASAPMESHVFFAEHFHD 183 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 60/187 (32%), Gaps = 20/187 (10%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D++ LD S S++ D +R + +V + LV + Sbjct: 497 GQAVDLVFALDASGSVSP------DNFATMRDFVRGLSVQFDI---NRDVAQMALVAYGR 547 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-YK 284 + F L I L G+ + + +AKG Sbjct: 548 RATTVFNL--DTHDTGSAI--LKAVGDANYMGGVASTGTALLHVHSDILTVAKGARPGVN 603 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K ++ +TDG ++ + + G ++ IG+ ++ L+ S + S Sbjct: 604 KAVVVVTDG------SGGDDAAVPAQKLRDNGVSLFVIGIGDIQKERLLQIAGSEEHMIS 657 Query: 345 VQNSRKL 351 V + L Sbjct: 658 VLSYEDL 664 >gi|239908149|ref|YP_002954890.1| hypothetical protein DMR_35130 [Desulfovibrio magneticus RS-1] gi|239798015|dbj|BAH77004.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 328 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 62/176 (35%), Gaps = 14/176 (7%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + G+DMM +D+S SM P + + + + K++ R Sbjct: 73 PPDAPTYAGRGVDMMFAVDLSPSMAAMDIPAEGR---TITRLAAVAEAAKTLALSRPGDR 129 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 GLV F ++ P + + + L G G + A + DA Sbjct: 130 IGLVAFGARAYLVVPPTTDRAALVQALASLDTG-----AAGRKTA---MGDAVGLAAKRL 181 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 K ++ DG +++ D + A R G V+A+GV + FL Sbjct: 182 DESPGQAKAVVVFGDGRSNAGETDP---VPAAQAAVRHGVAVFAVGVGGDGPAPFL 234 >gi|19031201|gb|AAL17974.1| proximal thread matrix protein 1 [Mytilus galloprovincialis] Length = 453 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 80/198 (40%), Gaps = 19/198 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V D S S+N + + + I + + K+ PD +VTF+ + + Sbjct: 253 DIAFVFDASSSINANNPNNYQLMKNFMKDIVDRFN--KTGPDGTQF---AVVTFADRATK 307 Query: 230 TFPLAWGVQH--IQEKINRLIFG--STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I+ I+++ T GLE A ++F + G ++ +K Sbjct: 308 QFGLKDYSSKADIKGAIDKVSPSIIGQTAIGDGLENARLEVFPNR-----NGGGREEVQK 362 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-YS 344 +I LTDG+N+ ES ++ G ++ AIGV L N AS + + ++ Sbjct: 363 VVILLTDGQNNGHKSPEHESSLL----RKEGVVIVAIGVGTGFLKSELINIASSEEYVFT 418 Query: 345 VQNSRKLHDAFLRIGKEM 362 + KL + K Sbjct: 419 TSSFDKLSKIMEDVVKLA 436 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 85/198 (42%), Gaps = 17/198 (8%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-DVNN 214 ++ K + + D+ D++++ D S S+ + + ++E++D ++ + N Sbjct: 40 GNTGKDAEECDVQADIIVLFDDSSSIQ---YDNKENYQMMKDFVKELVDSFTTVGVNGRN 96 Query: 215 VVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF--GSTTKSTPGLEYAYNKIFDAK 270 + G+V FS + FPL + + I++ I ++ G T+ GL++ F Sbjct: 97 GSQFGVVQFSQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEIGTGLKHVRENSFSGA 156 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + K +K +I +TDG++++ ++ K G V AIG+ Sbjct: 157 EGGGNPDK-----QKIVILMTDGKSNAG----APPQHEAHKLKAEGVTVIAIGIGQGFVK 207 Query: 331 QFLKNCASPDRFYSVQNS 348 L+ A+ + NS Sbjct: 208 TELEQIATMKNYVLTTNS 225 >gi|326505554|dbj|BAJ95448.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326515132|dbj|BAK03479.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 49/261 (18%), Positives = 91/261 (34%), Gaps = 38/261 (14%) Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 ++ ++ + E+ I + S L S++ S S LD++ VLDVS Sbjct: 203 ENNEVTGSVEIKTYAEVQAIQQSVTQKVFSILIHLKAPKSLESVS-SRAPLDLVTVLDVS 261 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA---- 234 SM KL + +++ ++ + R ++ FSS + FPL Sbjct: 262 GSMKGA------KLALLKKAMCFVIQTLGPND------RLSVIAFSSTARRLFPLRQMNV 309 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 G + +N L+ G T + GL+ I EH + II L+DG+ Sbjct: 310 NGRMQAIQAVNSLVDGGGTNISDGLKKGAKVI-------EHRRLKNPVCS--IILLSDGQ 360 Query: 295 NSSPNIDNKESLFYCNEA---------KRRGAIVYAIGVQAEAADQFLKNCA--SPDRFY 343 ++ + L + A ++ G + + A S F Sbjct: 361 DTYSVPTFDDELQTNHSALVPPSILPGTGNHVQIHTFGFGMDHDSAAMHAIAETSSGTFS 420 Query: 344 SVQNSRKLHDAFLR-IGKEMV 363 + + + F + IG + Sbjct: 421 FIDAEGSIQNGFAQCIGGLLS 441 >gi|297299828|ref|XP_002805471.1| PREDICTED: matrilin-2-like [Macaca mulatta] Length = 897 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N H + + ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAKV------KEFLMDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKAIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F R Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQR 234 >gi|330808169|ref|YP_004352631.1| hypothetical protein PSEBR_a1432 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327376277|gb|AEA67627.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 2855 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/180 (23%), Positives = 72/180 (40%), Gaps = 14/180 (7%) Query: 171 MMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++VLDVS SM D G PG+ +L +A ++I +LD + D V+ LVTFSS Sbjct: 2054 LLIVLDVSGSMADDSGVPGLSRLDLAKQAISALLDKYDDLGD----VKVQLVTFSSSATD 2109 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + V + ++ L T + A + + G+ F Sbjct: 2110 QTSVWVDVATAKSLLSSLSADGGTNYDAAVATAKTAFVTSGQLTGAQNIGY--------F 2161 Query: 290 LTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 +DG+ NS + + G YAIG+ ++ +L A ++ N+ Sbjct: 2162 FSDGKPNSGLETGTADEAAWKAFLDANGIKNYAIGLGDGVSNDYLDPLAYDGSAHTNTNA 2221 >gi|73992503|ref|XP_543015.2| PREDICTED: similar to Matrilin-4 precursor [Canis familiaris] Length = 624 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 48/225 (21%), Positives = 83/225 (36%), Gaps = 35/225 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F PW S A LD++ V+D S S+ + Sbjct: 12 LLFLLQPWETLSQFAG---------PRCRTGPLDLVFVIDSSRSVRPF------EFETMR 56 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTT 253 + + +L + P N R G++ +SS++ FPL A+ + ++ I L+ T Sbjct: 57 QFLVGLLRGLDVGP---NATRVGVIQYSSQVQSVFPLGAFSRREDMERAIRALVPLAQGT 113 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + ++YA N F E + + +TDG +A+ Sbjct: 114 MTGLAIQYAMNVAFSVAE---GARPPEARVPRIAVIVTDGRPQD------RVAEVAAQAR 164 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASP---DRFYSVQNSRKLHDAF 355 RG +YA+GVQ L+ ASP + + V++ L F Sbjct: 165 ARGIEIYAVGVQRADVGS-LRAMASPPLDEHVFLVESFN-LIQEF 207 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 68/176 (38%), Gaps = 25/176 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 387 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 437 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 438 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 492 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + TDG + + + AK G ++YA+GV ++ + + P Sbjct: 493 PRVGLVFTDGRSQD------DISVWARRAKEEGIVMYAVGVGKAVEEELRQIASEP 542 >gi|301764008|ref|XP_002917404.1| PREDICTED: matrilin-4-like [Ailuropoda melanoleuca] Length = 594 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 48/225 (21%), Positives = 81/225 (36%), Gaps = 35/225 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F PW A LD++ V+D S S+ + Sbjct: 12 LLFLLQPWETQLQLAG---------PRCRTGPLDLVFVIDSSRSVRPF------EFETMR 56 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQHIQEK-INRLIF-GSTT 253 + + +L + P N R G++ +SS++ FPL A+ + E I L+ T Sbjct: 57 QFLVGLLRGLDVGP---NATRVGVIQYSSQVQSVFPLGAFARREDMEHAIRALVPLAQGT 113 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + ++YA N F E + + +TDG +A+ Sbjct: 114 MTGLAIQYAMNVAFSVAE---GARPPEARVPRIAVIVTDGRPQD------RVAEVAAQAR 164 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASP---DRFYSVQNSRKLHDAF 355 RG +YA+GVQ L+ ASP + + V++ L F Sbjct: 165 ARGIEIYAVGVQRADVGS-LRAMASPPLDEHVFLVESF-DLIQEF 207 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 68/176 (38%), Gaps = 25/176 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 346 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 396 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 397 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 451 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + TDG + + + AK G ++YA+GV ++ + + P Sbjct: 452 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEEELRQIASEP 501 >gi|124004754|ref|ZP_01689598.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] gi|123989877|gb|EAY29406.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] Length = 354 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 73/206 (35%), Gaps = 25/206 (12%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +++P F + P I+ +G D+ + +D+SLSM P +L Sbjct: 72 FKLPLRIIYFALLIIALLGPSFGFGKKSIAV---VGKDIFIAVDLSLSMKATDIPP-SRL 127 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 + +++ +KS R GLV FSS PL + + L Sbjct: 128 EKIKYELSNIINTLKS-------DRIGLVIFSSSAFMHCPLTYDKGALNLFTQILNTNLM 180 Query: 251 ----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + T LE + +E + K ++Y K ++ +DGE Sbjct: 181 PIGNAGTDFYAPLEL---VLKKYQEANKSNRKQQNEYAKVVVLFSDGEEFG-----DRYT 232 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ K+ V+ +GV + + Sbjct: 233 AIVDQYKQNNIRVFTVGVGSLQGGKI 258 >gi|32964827|ref|NP_034029.2| chloride channel calcium activated 1 [Mus musculus] gi|3925281|gb|AAC79982.1| calcium-sensitive chloride conductance protein-1 [Mus musculus] gi|74183411|dbj|BAE36582.1| unnamed protein product [Mus musculus] gi|124376304|gb|AAI32343.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 410 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 461 YANKDLNSLIDAFSRIS 477 >gi|148680076|gb|EDL12023.1| mCG3350, isoform CRA_c [Mus musculus] Length = 907 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 314 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 363 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 364 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 415 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 465 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 466 YANKDLNSLIDAFSRIS 482 >gi|153874442|ref|ZP_02002664.1| von Willebrand factor, type A [Beggiatoa sp. PS] gi|152069095|gb|EDN67337.1| von Willebrand factor, type A [Beggiatoa sp. PS] Length = 478 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 82/234 (35%), Gaps = 23/234 (9%) Query: 136 PFIFCTFP--WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P +F F C + + + + S D+++++D S SM + KL Sbjct: 8 PILFGIFAAIGCLMGAILGEVFLKTTQPPSIPLPPHDVILLIDTSGSMAEG-----TKLQ 62 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 + + + + + N + +V F + L + ++++ I +L T Sbjct: 63 EVQAAAIQFIQRRHGLTHLANN-KIAVVGFGGRAYLVANLTSDLMNLEQPIQKLRAVGGT 121 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 GL+ A N++ + G D ++ I+ TDG+ + + +L K Sbjct: 122 PMDRGLQSAMNQL----------SAGSDSEQRSILLFTDGKPD----NQRTTLNASQLVK 167 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AI +A L + AF + + + +Q + Sbjct: 168 NANIQIVAIATD-DADIGLLTQVTGDAALVFPTSVGNFDQAFQKAEQAIYEQNL 220 >gi|327538509|gb|EGF25172.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 764 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 89/256 (34%), Gaps = 48/256 (18%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA-----------PLLITSSV 159 T +I + +Q + + Y + T + P Sbjct: 279 DTHAAITLKNQSTIADKDFIIEYRLAGDDSTLASLTHRESDAEDGYVMLALQPKWSIEPT 338 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +I+ + +++VLD S SMN GP + +L + + + L+ Sbjct: 339 EITPRE-----VILVLDTSGSMN---GPAISQLRLFADHVLDHLNPNDEFR--------- 381 Query: 220 LVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ FS++ P A +Q ++ + L T P L+ A + ++ Sbjct: 382 VIAFSNRTTAFQPDAIAATDANIQSAKQFVRGLRASGGTNLLPALKLA---LGGEADESA 438 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 +Y++ +TD + ++ L Y + + + A V+ I A D + Sbjct: 439 R--------PRYMVLMTD----ALVGNDHSILRYLRQPEFQDARVFPIAFGAAPNDYLIS 486 Query: 335 NCASPDRFYSVQNSRK 350 A R +S+Q + + Sbjct: 487 RAAEMGRGFSMQVTNQ 502 >gi|108759903|ref|YP_633800.1| BatB protein [Myxococcus xanthus DK 1622] gi|108463783|gb|ABF88968.1| batB protein [Myxococcus xanthus DK 1622] Length = 343 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 66/180 (36%), Gaps = 30/180 (16%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K G+D+++ LD S SM P +L A + +LD +K R Sbjct: 84 TKSELTKRRGIDVVVALDASKSMLARDIQP--SRLERAKLELTTLLDELKG-------DR 134 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEKL 273 GLV F+ PL ++ + ++ T L + + +A Sbjct: 135 VGLVVFAGDAFIQSPLTSDYSAVKLFLRAVDPEVMPQGGTNVGAALRLSRQVLENADRGS 194 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQF 332 + ++ ++ LTDGE D + EA K G V A+GV +E+ + Sbjct: 195 K---------ERVVVLLTDGE------DLVGDVAEATEALKDSGVQVLAVGVGSESGEPI 239 >gi|219518504|gb|AAI45058.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 410 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 461 YANKDLNSLIDAFSRIS 477 >gi|26328325|dbj|BAC27903.1| unnamed protein product [Mus musculus] Length = 902 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 410 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 461 YANKDLNSLIDAFSRIS 477 >gi|320450000|ref|YP_004202096.1| hypothetical protein TSC_c09220 [Thermus scotoductus SA-01] gi|320150169|gb|ADW21547.1| conserved hypothetical protein [Thermus scotoductus SA-01] Length = 691 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 54/296 (18%), Positives = 105/296 (35%), Gaps = 36/296 (12%) Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 + + + ++ E F + + L ++ + L + R Sbjct: 199 RALVLGDPALARYLEAQ-GFQVEEGPFRLPLEADLVAVGLGVLDLPEGAPEALRSYLRQG 257 Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +F P + + + G +++V+DVS SM Sbjct: 258 GGLLFTATPKGLFFGGWDRALPEDLPLKPLGRKGAALVLVMDVSGSMEGE---------K 308 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFG 250 + ++ L++++S + + G+V FSS FP G + + + + G Sbjct: 309 LSLAVAGALELVRSAAEED---YLGVVLFSSTHRVLFPPRPMTEQGKKEAESLLLSVRAG 365 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T A + +G +K I+ L+DG S P D +L Sbjct: 366 GGTVLGSAFREAVRLL-----------QGVPVERKGILVLSDGLISDPQ-DPILAL---- 409 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A+ G V A+ + A+A FLK A R+Y +++L FL+ G+E+ + Sbjct: 410 -AEASGLEVSAMALGADADRAFLKVLAERGGGRYYQAATAQELPRLFLKEGQEVFQ 464 >gi|254458848|ref|ZP_05072272.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084614|gb|EDZ61902.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 629 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 88/227 (38%), Gaps = 26/227 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F +S P+L +V++ + IG D+++ LD+S SM ++L +A Sbjct: 57 LLFLAGILIVIASAEPVLKDGTVRVKA---IGGDILIALDISDSMLCE-DIYPNRLELAK 112 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + E+++ R G++ F+ P+++ + + +++L ++ Sbjct: 113 KKALELINKATK-------DRVGVIAFAKNSYLVSPISFDTKTVSFLLSKL------DTS 159 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + N + KKY++ L+DG D + + AK +G Sbjct: 160 SITQKGTNILTMLGTVE---KTNTSTDKKYLLILSDG------GDETDFSAEIDFAKEKG 210 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 IV+ +G+ E S ++ KL+++ + + Sbjct: 211 IIVFVLGIGTEVGASIKNKDGSLIKYNDKVVISKLNESISELAIKTG 257 >gi|119899150|ref|YP_934363.1| hypothetical protein azo2860 [Azoarcus sp. BH72] gi|119671563|emb|CAL95476.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 343 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 39/271 (14%), Positives = 81/271 (29%), Gaps = 60/271 (22%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 Y + S+ L +T +++ +D+S SM ++L Sbjct: 62 YLLGLAAMLVAVARPSALVTLPVTDQT-----------ILLAMDISGSMRATDIAP-NRL 109 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 A + R ++++ S R G+V F++ + I+R+ Sbjct: 110 AAAQAAARSFVEVLPS------DTRVGVVAFAATAALIQAPTRNHDDVLAAIDRVQLQRG 163 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK---------------------YIIFLT 291 T G+ + + + G I+ LT Sbjct: 164 TAIGSGMVLSLATLLPEAGIDLRLLAGDGSPPADKPPPGEPTHAPVPPGSHAYGAIVLLT 223 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---------------LKNC 336 DGE ++ + + + G VY +GV LK+ Sbjct: 224 DGERTTGPPLDFATRLAADH----GVRVYTVGVGTAEGGVVGYEGWSMRVRLDEAALKSI 279 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A + ++ Q++ L + ++G + Q Sbjct: 280 ADETRGEYFHAQSAEALRTIYRKLGTRLTLQ 310 >gi|120609754|ref|YP_969432.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] gi|120588218|gb|ABM31658.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] Length = 355 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 70/244 (28%), Gaps = 62/244 (25%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+ +DVS SM D+L A + + I D+ VR G+V F+ Sbjct: 88 IMLAMDVSGSMR-AADVHPDRLTAAQDAAKAF------IADLPRHVRVGIVAFAGSAQLA 140 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK----- 285 + + I+ T + G+ + +F G + Sbjct: 141 QLPTQNHEDLFRAIDSFQLQRGTATGNGILLSLATLFPDTGIDVSALGGRQAMPRPQSMD 200 Query: 286 -----------------------------YIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 II LTDG+ ++ + + A RG Sbjct: 201 EIGRPPHRGGNGKGADRPAPVAPGSYTSAAIIMLTDGQRTTGV----DPMEAAQWAADRG 256 Query: 317 AIVYAIGVQAEAADQF---------------LKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 VY +GV A + LK A + ++ + L + + Sbjct: 257 VRVYTVGVGTVAGETIGFEGWSMRVRLDEDTLKAVAQRTNAEYFHAATAADLKKVYETLS 316 Query: 360 KEMV 363 + Sbjct: 317 SRLT 320 >gi|281345581|gb|EFB21165.1| hypothetical protein PANDA_005644 [Ailuropoda melanoleuca] Length = 581 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 45/202 (22%), Positives = 77/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K LD++ V+D S S+ + + + +L + P N R G Sbjct: 2 KGPRCRTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLDVGP---NATRVG 52 Query: 220 LVTFSSKIVQTFPL-AWGVQHIQEK-INRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + E I L+ T + ++YA N F E Sbjct: 53 VIQYSSQVQSVFPLGAFARREDMEHAIRALVPLAQGTMTGLAIQYAMNVAFSVAE---GA 109 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 110 RPPEARVPRIAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 162 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 163 ASPPLDEHVFLVESF-DLIQEF 183 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 68/176 (38%), Gaps = 25/176 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 363 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 413 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 414 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 468 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + TDG + + + AK G ++YA+GV ++ + + P Sbjct: 469 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEEELRQIASEP 518 >gi|261414506|ref|YP_003248189.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261370962|gb|ACX73707.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] Length = 227 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 62/175 (35%), Gaps = 15/175 (8%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +VLD S SM D + +R D ++S +VTF Sbjct: 14 NPSSRVPVCLVLDTSGSMEG------DSINELNEGVRLFYDAVRSDETALYAAEISVVTF 67 Query: 224 SSKIVQTFPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 G ++ + + T + A + + K K E+ A G D Sbjct: 68 GGHASCQA----GFSTLEHQPDAPQFYADGGTPMGEAMNMALDML--EKRKSEYKASGVD 121 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAEAADQFLKN 335 Y+ +I+ +TDG + + S+ C+ R ++ IG+ +A L Sbjct: 122 YYQPWIVLMTDGMPNGSQAELSRSIQRTCDMINDRKLTIFPIGIGEDADMDVLAR 176 >gi|168698099|ref|ZP_02730376.1| von Willebrand factor type A domain protein [Gemmata obscuriglobus UQM 2246] Length = 311 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 70/237 (29%), Gaps = 40/237 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + P + + ++ + +DVS SMN+ FG Sbjct: 55 LLPLALAVAVLLCAGPRRLGEPI----DKRKLTNIEICVDVSGSMNNPFGRATR------ 104 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG------ 250 ++ + + GL F ++++ PL V I + G Sbjct: 105 --YDGAMEAVTAFTSYRQGDAFGLTFFGNEVLHWCPLTTDVSAINCATPFMRPGQLPPWF 162 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L ++ E + I+ +TDG++ N Sbjct: 163 GGTLIAKALRACKAELIKRPEG-----------DRMIVLITDGDSQD--FANGADAEVAE 209 Query: 311 EAKRRGAIVYAIGVQAEAADQ--FLKNCA-------SPDRFYSVQNSRKLHDAFLRI 358 E K G V+A+ + + Q ++N + + + + L F RI Sbjct: 210 ELKAEGITVFAVVIGNDRQFQNPIIRNGSVQTVTARTGGESFEAGDPNALATVFKRI 266 >gi|258654082|ref|YP_003203238.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258557307|gb|ACV80249.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 618 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 46/202 (22%), Positives = 74/202 (36%), Gaps = 29/202 (14%) Query: 171 MMMVLDVSLSMNDHFGPGM-DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-- 227 +++V+DVS SM G G KL +A ++ L + + + GL F++ + Sbjct: 428 VLVVMDVSGSMASESGYGSESKLDLAKKAATSALGQLT------DTDQMGLWAFTTDLPT 481 Query: 228 --------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 V PLA Q I + I+ L + T A + K+ A Sbjct: 482 PDTITADLVGVGPLAQTRQPIIDAISSLTPLNGTPLYAATREAAKAMNAQKDPNSINA-- 539 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNE-AKRRGAIVYAIGVQAEAADQFLKNC-- 336 ++ LTDG N + D L N A+ G V+ I +A L+ Sbjct: 540 -------VVVLTDGRNEYTDNDLDGLLRELNASAEEDGVRVFTIAYGPDADLATLQEISE 592 Query: 337 ASPDRFYSVQNSRKLHDAFLRI 358 AS Y +N + F + Sbjct: 593 ASRAAAYDARNPTSIDKVFSDV 614 >gi|209546922|ref|YP_002278840.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538166|gb|ACI58100.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 462 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 55/353 (15%), Positives = 124/353 (35%), Gaps = 72/353 (20%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 F +R + G+++I+ A+ L + + +G + + V+ K+ LD +L+ Sbjct: 20 YFHTLRGLRRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVK 79 Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 +I N + + K + +D+ + + N + ID Sbjct: 80 QINNTADADALKAKVSDWFHAQVDNSYTLG-------------------------EIDID 114 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 ++N++A + +P F AN P+ + S+VK + S L++ +V+D S SM Sbjct: 115 TANHNITATASGTVPTTFMKI---ANIDTVPVSVGSAVKGPATS--YLNVYIVIDTSPSM 169 Query: 182 --------NDHFGPGMD--------------------------------KLGVATRSIRE 201 G+ + VA ++++ Sbjct: 170 LLAATTSGQSTMYSGIGCQFACHTGDAHTVGKKTYANNYEYSTAKNIKLRADVAGDAVKD 229 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +L +I + + ++ GL + + + + +++ +G T+ ++ Y Sbjct: 230 VLSLIDTSDSNHERIKVGLYSLGDTLTEVLAPTLSTDTARTRLSTASYGLTSATSKAATY 289 Query: 262 AYNKIFDAKEK--LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + K+K K ++ LTDG S ++ +A Sbjct: 290 FDVSLATLKQKVGAGGDGTTSGTPLKLVLLLTDGVQSKREWVTDSVVWKSGQA 342 >gi|301620566|ref|XP_002939640.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Xenopus (Silurana) tropicalis] Length = 1179 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 83/261 (31%), Gaps = 34/261 (13%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 + + V +Y++ + + + D++ Sbjct: 241 TPTEQAAHSRPGVTADFVLQYDVSLKDLAGDVQIYNGYFVHYFAPRGLPPIQK----DVI 296 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 V+DVS SM K+ ++ +L+ + N ++TFS + P Sbjct: 297 FVIDVSGSMFG------TKIKQTKSAMHVILNDLHRDDSFN------IITFSDVVHVWRP 344 Query: 233 ------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 A + ++ +N++ T L A + + H + KK Sbjct: 345 GQSIPATAQNKKSAKDYVNKIEADGWTDINAALMAAASIFN----QTSHKPEKETSTKKI 400 Query: 287 --IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 IIFLTDGE +S + L +A ++ + +A ++ + +R Sbjct: 401 PLIIFLTDGEATSGVLATSRILSNAQKAMGGTISLFCLAFGEDADYNLMRRLSLENRGIA 460 Query: 342 ---FYSVQNSRKLHDAFLRIG 359 + + +L + I Sbjct: 461 RRIYEYSDATLQLKGFYDEIA 481 >gi|47228042|emb|CAF97671.1| unnamed protein product [Tetraodon nigroviridis] Length = 1071 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 74/199 (37%), Gaps = 23/199 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-----GLVT 222 +D+ VLD S S+ P + +D +K+I + R G + Sbjct: 3 PVDLFFVLDTSESVALRQKPPGYYIDQIKTFTTNFIDELKNI--RHQCDRILTWNSGALH 60 Query: 223 FSSKIVQTFPL---AWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S +++ L ++ I+ + G T + ++ +++ Sbjct: 61 YSDEVILVGELMDMQTQRSTLKTSISGIEYIGKGTYTDCAIKRGLSELLIGG-------- 112 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESL-FYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KYI+ +TDG + + + NEA++ G V+A+ + + D L A Sbjct: 113 SHYHENKYIVVVTDGHPLTGYKEPCGGVQEAANEARQHGVKVFAVAISPDQEDTRLSLIA 172 Query: 338 SPDRF---YSVQNSRKLHD 353 + + ++ + K+ Sbjct: 173 TDHNYRQNFTAADDSKITK 191 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 55/184 (29%), Gaps = 12/184 (6%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-DVNNVVRS 218 K + D+ + M+ G + + +R + + S VR Sbjct: 863 KKCPDYKCPISFTRSADILVMMDSSASVGQKNFETSKKFVRLLAERFLSAERQGGASVRV 922 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 GL +S L + H+ + F T +EYA + + Sbjct: 923 GLAQYSRIARLEAELTNNLTHLVHHTEQAAFQNDGTNVLGAMEYAIRSLEGRGDTSGGRK 982 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K ++ +DG + + E E G V+ I V + + L+ Sbjct: 983 K--------LVLFSDGRSQAITEALLEKR--AREVADAGVEVFVIAVGNQVNEANLRTLV 1032 Query: 338 SPDR 341 S R Sbjct: 1033 SRGR 1036 >gi|257454382|ref|ZP_05619644.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] gi|257448148|gb|EEV23129.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] Length = 550 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 40/239 (16%), Positives = 84/239 (35%), Gaps = 35/239 (14%) Query: 143 PWCANSSHAPLLI--TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 PW + + I + +++ ++DVS SM+D+ DKL + S++ Sbjct: 163 PWRTANRIIKVAIKADDPTITKQSTLPPANLVFLVDVSGSMSDN-----DKLPLVKSSLK 217 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPG 258 + ++ ++ +VT++ + T P G I I+ L +T Sbjct: 218 MLTKQLRPQDTIS------IVTYAGRTQVTLPATRGSDTDKILAAIDSLDASGSTNGEAA 271 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++ AY + H K + I+ +TDG+ + D E L + G Sbjct: 272 IKLAYQ------QAKIHYKKDGINR---ILMMTDGDFNVGVSDVDEMLDIIRRERDSGVS 322 Query: 319 VYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + G D ++ A + + + ++ F + K++ Q Sbjct: 323 LSTFGFGEGNLNDHMMEQVADNGNGNYSYIDSLSEAKKALVDEMSATFNTVAKDVKVQV 381 >gi|59857769|gb|AAX08719.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 76/215 (35%), Gaps = 27/215 (12%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S S I +++ V+D S SM K+ ++ ++LD + + L Sbjct: 266 PDSLSTIPKNVIFVIDKSGSMMG------RKIKQTREALIKILDDLSPHDQFD------L 313 Query: 221 VTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++FSS+ PL V + + T + A + A + E Sbjct: 314 ISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQGGTNINDAMLMAVQLLEKANQ-EEL 372 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + +G II LTDG+ + + +A ++ +G + + FL+ Sbjct: 373 LPEGSITL---IILLTDGDPTVGETNPSNIQKNVRKAINGQHSLFCLGFGFDVSYAFLEK 429 Query: 336 CASPDR------FYSVQNSRKLHDAFLRIGKEMVK 364 A + + ++ +L D + + ++ Sbjct: 430 MALENGGLARRIYEDSDSALQLQDFYQEVANPLMT 464 >gi|116622501|ref|YP_824657.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225663|gb|ABJ84372.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 337 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 77/227 (33%), Gaps = 40/227 (17%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + + + D+ + +++V D S SM GP + K A + + Sbjct: 96 VEQEITTFASEDVPVSIVIVFDCSGSM----GPKLAKSRAAVAAFLSSANPEDEFS---- 147 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 LV F+ + +Q K+ T + A +++ AK Sbjct: 148 -----LVLFNDRAQLVSGFNRQTDELQSKLFYAQSKGRTALLDAIYLAMDQMKHAKHSR- 201 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE------- 327 K ++ ++DG ++ +E N K A +Y+IG+ Sbjct: 202 ----------KAVLVISDGGDNCSRYSMREVK---NRVKEGDAQIYSIGILEAMGFRGRS 248 Query: 328 ----AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A L + A S R + + N +L D +IG + Q +L Sbjct: 249 AEELAGPALLDDIASQSGGRLFEIDNLNELSDVASKIGMALRNQYML 295 >gi|89068023|ref|ZP_01155440.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] gi|89046262|gb|EAR52319.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] Length = 987 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 70/208 (33%), Gaps = 30/208 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S L +++ LD S SM + S+ + R+ ++ Sbjct: 91 SDQVEPLALVLALDSSGSMLEALPQTQAAARALVGSL-------------ADGDRAHILN 137 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F I + + + I+ L T+ + + + A+ + Sbjct: 138 FGDSINVAVGMTADRNRLDQAISGLRAWGATRLNDAVFASAGALAGAEGRGA-------- 189 Query: 283 YKKYIIFLTDGENSSPN---IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--A 337 I+ L++G ++ P+ + +S GA VYA+G+ A L+ A Sbjct: 190 ----IVLLSEGPDADPSGAPLSVVDSEAALAAVVESGAPVYAVGLGPGADAALLRRLAEA 245 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + ++ V ++ L F I + Q Sbjct: 246 TGGAYFPVADAAALPATFSDIATRLRHQ 273 >gi|297562484|ref|YP_003681458.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296846932|gb|ADH68952.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 505 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 69/217 (31%), Gaps = 25/217 (11%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + IT+ + + +VLD S SM +L A R++ +++ + Sbjct: 19 AVSVLVDITAPEREEETERPPATLQVVLDRSGSMGG------GRLDGAVRALLSLVERLA 72 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + GLV+F+ + P ++ I+ L T + GL + Sbjct: 73 PSDNF------GLVSFNDQARVEVPCGPLEDKARVRRLISGLHASGGTDLSSGLLRGVQE 126 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 A ++ ++DG + D+ +A G ++G Sbjct: 127 ARRAGADRGGT----------LLLISDGHANQGVTDHDLLRQVAADAYAHGVTTTSLGYG 176 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++ L A S + A I +E Sbjct: 177 LGYDEELLGAVADGGA-GSALFAEDPDTAGGLIAREA 212 >gi|156743215|ref|YP_001433344.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234543|gb|ABU59326.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 419 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 48/242 (19%), Positives = 93/242 (38%), Gaps = 27/242 (11%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 R + + T ++ + +T +I ++ +++ V+D S SM G Sbjct: 4 EVAIRASLARPYLTAATMPQVAYLLIEVT-PGQIMTQVRAPVNVCFVIDRSGSMK---GE 59 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +D++ AT EMLD + V R+ ++ ++ + + LA +++NR+ Sbjct: 60 KIDRVRRATIRAIEMLDAQDVVSVVIFDHRTEVLIPATPVAKPAELA-------DRVNRV 112 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T+ P +E +I KG + +I LTDG+ S + + L Sbjct: 113 RDSGGTRIAPAIEAGLREI----------DKGPSHMVRRLILLTDGQTESES----DCLR 158 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +A RR + A+GV + + L A S + K+ D F + Sbjct: 159 RAEDAGRRNVPITALGVGKDWNEDLLIEMANRSGGTADYIDRPEKIVDYFQNTIQRAQAT 218 Query: 366 RI 367 + Sbjct: 219 TV 220 >gi|109088926|ref|XP_001104056.1| PREDICTED: anthrax toxin receptor-like isoform 1 [Macaca mulatta] Length = 557 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 78/199 (39%), Gaps = 29/199 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ +LD S S+N+++ +D ++ ++ +R +T+S+ Sbjct: 70 QGSFDLYFILDKSGSVNNNW---IDLYMWVEETVARF---------QSSDIRMCFITYST 117 Query: 226 KIVQTFPLAWGVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL I+ ++ +++ T G A +I E G+ Sbjct: 118 DGQTVLPLTSDKNRIKNGLDQLRKIVPDGHTFMQAGFRKAIQQI-------ETFNSGNKV 170 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II +TDGE + +++L +A++ GA VY +GV DQ SP+ Sbjct: 171 PS-MIIAMTDGELVAHAF--QDTLREAQKARKLGANVYTVGVADYKLDQITAIADSPEHV 227 Query: 343 YSVQN----SRKLHDAFLR 357 ++V+N R DA Sbjct: 228 FAVENGFKAMRDTVDALTS 246 >gi|109088928|ref|XP_001104141.1| PREDICTED: anthrax toxin receptor-like isoform 2 [Macaca mulatta] Length = 564 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 78/199 (39%), Gaps = 29/199 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ +LD S S+N+++ +D ++ ++ +R +T+S+ Sbjct: 70 QGSFDLYFILDKSGSVNNNW---IDLYMWVEETVARF---------QSSDIRMCFITYST 117 Query: 226 KIVQTFPLAWGVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL I+ ++ +++ T G A +I E G+ Sbjct: 118 DGQTVLPLTSDKNRIKNGLDQLRKIVPDGHTFMQAGFRKAIQQI-------ETFNSGNKV 170 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II +TDGE + +++L +A++ GA VY +GV DQ SP+ Sbjct: 171 PS-MIIAMTDGELVAHAF--QDTLREAQKARKLGANVYTVGVADYKLDQITAIADSPEHV 227 Query: 343 YSVQN----SRKLHDAFLR 357 ++V+N R DA Sbjct: 228 FAVENGFKAMRDTVDALTS 246 >gi|301767168|ref|XP_002919014.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Ailuropoda melanoleuca] Length = 849 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 71/208 (34%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L++FS Q Sbjct: 277 VIFVIDKSGSMSG------RKMQQTREALIKILDDLSPKDQFN------LISFSGDAAQW 324 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL A V + + T + A + AK+K Sbjct: 325 KPLLVPASAENVNQARSYAAGIQAHGGTDINEAVLMAVQLLNSAKQKELMPEGTVS---- 380 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + + FL+ A + Sbjct: 381 LIILLTDGDPTMGETNPARIQRNVKEAIDGQYSLFCLGFGFDVSYAFLEKLALDNGGLAR 440 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 + ++ +L D + + ++ Sbjct: 441 RIYEDSDSALQLQDFYEEVANPLLTAVT 468 >gi|146292146|ref|YP_001182570.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|145563836|gb|ABP74771.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 633 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 56/336 (16%), Positives = 118/336 (35%), Gaps = 51/336 (15%) Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST---SLSI 116 A I + + ++N F +I I E+ + F+ D++ ST + Sbjct: 122 AAPIASDAWYGIKQPERNRFEKQIQNGI---MVAGEIPISTFSIDVDTGSYSTLRRMIKE 178 Query: 117 IIDDQHKDYNLS-----AVSRYEMPFIFCTFPWCANSSHAPLLITSSV----------KI 161 + + Y +P P+ A + AP + ++ Sbjct: 179 GSLPEKGTIRIEEMLNYFTYDYPLPNKNAA-PFSATTELAPSPYNDDMMLLRIGLKGYEL 237 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + +++ +LDVS SM DKL + +++ + + + V+ VV +G Sbjct: 238 TKSELGASNLVFLLDVSGSMA-----SADKLPLLQTALKMLTQQLSAQDKVSIVVYAGAA 292 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +V +Q + + +L G +T + G+ AY +H +G Sbjct: 293 G----VVLDGASGDDIQALTYALEQLRAGGSTNGSQGILQAYQL------AQKHFIQGGI 342 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--S 338 + +I TDG+ + + + + + K+RG + +G + DQ ++ A Sbjct: 343 NR---VILATDGDFNVGVTNFDQLISLIEKEKQRGIGLTTLGFGMDNYNDQLMEQLADKG 399 Query: 339 PDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L L I K++ Q Sbjct: 400 NGHYAYIDTLNEARKVLVDELSSTLLTIAKDVKVQI 435 >gi|170591769|ref|XP_001900642.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591794|gb|EDP30397.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 381 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 43/200 (21%), Positives = 74/200 (37%), Gaps = 27/200 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 SD LD+++++D S S+ F + ++ ++ I PD VR L+T+S Sbjct: 187 SDKPLDLILIIDSSESVAHLFDEQIR------FAVERIVRNINVHPDA---VRLALITYS 237 Query: 225 SKIVQTFPLAW----GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + F + +N L TT + L AY + D E+ + Sbjct: 238 GQAYIHFKFNDPQIGNNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLTD--TDNENGVRE 295 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQFLKNC 336 KK II TDG + D L K +G ++AI + + L Sbjct: 296 G--VKKMIIIFTDGHSQRSPQDMALRL------KDKGVEIFAITLTPAPYADEGELLSIT 347 Query: 337 ASPDRFYSVQNSRKLHDAFL 356 + D ++ N + L + Sbjct: 348 QNTDHIFTPVNLKVLITTYK 367 >gi|118093056|ref|XP_421769.2| PREDICTED: similar to AMACO [Gallus gallus] Length = 800 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 40/175 (22%), Positives = 72/175 (41%), Gaps = 27/175 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D++ +LD S S+ + ++ D + P VR G+V FSS Sbjct: 47 ASVDILFLLDGSYSIGR------GSFERSKHFAGKLCDALDIHPGR---VRVGMVQFSSA 97 Query: 227 IVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L Q ++E+I R F G +T++ L+Y +K F + Sbjct: 98 PHLEFSLDSYLTKQEVKERIKRTAFRGGSTETGRALKYILHKGFPGGR--------NLTV 149 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K +I ++DG++ + + K RG V+A+G++ ++ L AS Sbjct: 150 PKILIIISDGKSQG------STAVPAMQVKERGTTVFAVGIKFPRWEE-LHAVAS 197 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 54/155 (34%), Gaps = 13/155 (8%) Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 +V + LV + S+ F L + + IN++ F S A I Sbjct: 566 RDVTQIALVAYGSRARTVFALDTYTNNSALLQAINQMPFLGDVASASS---ALLHIHSDV 622 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 ++ A+ K ++ LT+G +++ + G +V+ + + D Sbjct: 623 MTVQKGARPG--VSKVVVLLTNG------GGMEDAAAPARHLRDNGVMVFVVVIGDAERD 674 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 L+ SP + + L I + + ++ Sbjct: 675 TLLRVAGSPSYLVHISSYEDLQRYQDLITERICEE 709 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 48/139 (34%), Gaps = 9/139 (6%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L + +D+ M+ G ++ ++ L + N G+ + + I Sbjct: 334 LSLECGVDLLFLMDSSAGVTLEGFLRYKAFLKRFLQAVMGRDSPAN---VGVAQYDTSIR 390 Query: 229 QTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + + I+ L F G T + L Y F + DD + Sbjct: 391 IPIEVGQHKDVFGLMKSIDALNFSGGGTLTGRALRYVAQHGFRRTPVFADVL---DDLPR 447 Query: 286 YIIFLTDGENSSPNIDNKE 304 ++ LTD ++ P + + Sbjct: 448 VVVLLTDSKSQDPVAEAAK 466 >gi|30794326|ref|NP_851361.1| epithelial chloride channel protein [Bos taurus] gi|2623763|gb|AAB86529.1| Lu-ECAM-1 [Bos taurus] Length = 905 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 46/193 (23%), Positives = 77/193 (39%), Gaps = 36/193 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKL-GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +VLD S SM+ D+L + + ++ +I+ V G+VTF S Sbjct: 310 VCLVLDKSGSMSAE-----DRLFQMNQAAELYLIQVIEKGSLV------GMVTFDSVAEI 358 Query: 230 TFPLAWGVQ-HIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L ++ +KI + T GL+ + I + + Sbjct: 359 QNHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQSTSGSE-------- 410 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 II LTDGE++ N C + KR GAI++ I + AA + L N RF Sbjct: 411 -IILLTDGEDNEINS--------CFEDVKRSGAIIHTIALGPSAAKELETLSNMTGGYRF 461 Query: 343 YSVQNSRKLHDAF 355 ++ ++ L +AF Sbjct: 462 FANKDITGLTNAF 474 >gi|302551540|ref|ZP_07303882.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] gi|302469158|gb|EFL32251.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] Length = 518 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 77/214 (35%), Gaps = 29/214 (13%) Query: 151 APLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 L+ S++ + + V+DVS SM++ +L +A S+ M D + Sbjct: 147 WSLVRVGLATRSAERTGERLPAALTFVIDVSGSMSEP-----GRLDLAQESLSVMTDRL- 200 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + +VTFS + P+ IQ+ I+ L +T G+E Y Sbjct: 201 -----RDDDSVAIVTFSDEAETVLPMTRLDGNRDEIQDVISDLATQDSTNLGAGVETGYE 255 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIG 323 + + ++ ++D ++ + D L E + G ++ +G Sbjct: 256 TAVEGLREGATNR---------VVLVSDALANTGDTDADTILERIAGERREHGITLFGVG 306 Query: 324 VQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 V ++ D ++ A V + H+ F Sbjct: 307 VGSDYGDALMERLADRGDGHTVYVSGPDEAHEVF 340 >gi|220933243|ref|YP_002512142.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] gi|219994553|gb|ACL71155.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] Length = 325 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 71/217 (32%), Gaps = 35/217 (16%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 S + + G D+++V+D S SM G + ++ V + + Sbjct: 85 SPGEFIPEPVAGRDLVLVVDTSGSMLVRDYRAEGRPVSRIEVLQGVVTRFVRA------- 137 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDA 269 R L+ + + PL + + ++ RL G T G+ A ++ + Sbjct: 138 REGDRFALIPMAEEAATLVPLTGDRELVASQLARLRAGMLGDDTAIGDGIALALRQLQAS 197 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---- 325 + +I +DGE+++ + E+L A+ G +Y + + Sbjct: 198 GAERRPA----------LILFSDGESNAGLLRPSEALAL---ARAAGVALYTVEITGGQA 244 Query: 326 ---AEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLRI 358 E + R + V S L I Sbjct: 245 LAPVEGEPSLADMAETTGGRHFHVTRSADLEAVIATI 281 >gi|291407509|ref|XP_002720066.1| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Oryctolagus cuniculus] Length = 1320 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 70/201 (34%), Gaps = 26/201 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ V+DVS SM K+ +++ +L +++ N +++FS + Sbjct: 284 VVFVIDVSGSMFG------TKMQQTKKAMNVILSDLQANDYFN------IISFSDTVSVW 331 Query: 229 ----QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + V + ++ + T L A + + + ++ Sbjct: 332 RAGGSIQATSQNVHSAKNYLDHMEAAGWTDINAALLEAASVLNHSNQEPGRSPGVGRTP- 390 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 IIFLTDGE ++ L +A ++++ +A L+ + +R Sbjct: 391 -LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFPLLRRLSLENRGVA 449 Query: 342 ---FYSVQNSRKLHDAFLRIG 359 + + +L + I Sbjct: 450 RRIYEDTDAALQLEGLYEEIS 470 >gi|74011918|ref|XP_848765.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Canis familiaris] Length = 826 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 69/208 (33%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD +K N L++FS + Sbjct: 270 VIFVIDKSGSMSG------RKIQQTREALIKILDDLKPNDQFN------LISFSGDVTHW 317 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL V + + T + A + A +K Sbjct: 318 KPLLVPASPENVDQAKRYAANIEAHGGTNINDAMLTAVRLLQSANQKELLSDGSVS---- 373 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + +A ++ +G + + FL+ A + Sbjct: 374 LIILLTDGDPTVGETSPARIQKNVQKAIDGQYSLFCLGFGFDVSYVFLEKLALDNGGLAR 433 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 + ++ +L D + + ++ Sbjct: 434 RIYEDSDSALQLQDFYQEVANPLLTAVT 461 >gi|86356688|ref|YP_468580.1| hypothetical protein RHE_CH01044 [Rhizobium etli CFN 42] gi|86280790|gb|ABC89853.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 445 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 60/451 (13%), Positives = 140/451 (31%), Gaps = 109/451 (24%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F ++ G+ I+TA+L+ + G ++ + ++ +L+ D + + + T E Sbjct: 4 RFIFDRSGNFGIMTALLVVPLLGAAGTAVDFASALSLRTELYAAADAAAVGSITP--TSE 61 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 D S + K+ Q F +++ + + + I + + + Sbjct: 62 AAAQANTMSGDGSLTLGKSEAQKIFFSQMSKKQGDAPVT-------VDISVQKKGDTLSS 114 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----- 182 + MP F + +T + ++ +D M+LD + SM Sbjct: 115 TVSFNATMPTTFM-----QVMGFDEIAVTGAATAQYQTPSYMDFFMLLDNTPSMGVAATT 169 Query: 183 -----------DHFGPGMDK--------------------------------LGVATRSI 199 + G DK + V ++ Sbjct: 170 DDITAMKKATANGHDGGKDKNCAFACHIVSEKGVEDKNSYYNVARNNGVTIRIDVVASAV 229 Query: 200 REMLDIIKSIPDVNNVVRSGLVTF--------SSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + ++ K + + R T ++K+ + L + + + N + S Sbjct: 230 KALMAKAKDTQSMPSQFRVAAYTSGKTAQDAKAAKLFKVSDLNYDLGAVAAAANMIKLMS 289 Query: 252 T------TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG------------ 293 + + A I + + D +K + F+ DG Sbjct: 290 IPYQNYYSDQQTSFDEALKGIEGEIKGNIGTGTSNADRQKIVFFVADGVGDSYKPTGCTS 349 Query: 294 ---ENSSPNIDNKESLFYCNEAKRRGAIV---YAIGVQAEAADQF--------------L 333 N I+ ++ YC + K RG V Y + + + Sbjct: 350 PKGANGGRCIEPIDT-TYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDWVKPFETRIAAKM 408 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + CA+P +++V + + +A + +++V Sbjct: 409 EECATPGFYFAVSPTEGIEEAMEALFRKIVS 439 >gi|148676905|gb|EDL08852.1| matrilin 2, isoform CRA_a [Mus musculus] Length = 836 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N + + I ++L + PDV R GL+ Sbjct: 75 SSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLDIGPDV---TRVGLL 125 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 126 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 184 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 185 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTLKAIGSE 236 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 237 PHKDHVFLVANFSQIESLTSVFQN 260 >gi|148676906|gb|EDL08853.1| matrilin 2, isoform CRA_b [Mus musculus] Length = 941 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N + + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 76/207 (36%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFETVKHFVTGIIDSLAVSP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G +E + G + + L++ + + F E Sbjct: 701 TQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARPPS 756 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 757 TQVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASEP 809 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ + Sbjct: 810 IDKHLFYAED----FSTMGEISEKLKE 832 >gi|120407045|ref|NP_058042.2| matrilin-2 [Mus musculus] Length = 937 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N + + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 76/207 (36%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFETVKHFVTGIIDSLAVSP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G +E + G + + L++ + + F E Sbjct: 701 TQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARPPS 756 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 757 TQVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASEP 809 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ + Sbjct: 810 IDKHLFYAED----FSTMGEISEKLKE 832 >gi|20136122|gb|AAM11539.1| matrilin-2 [Mus musculus] Length = 956 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N + + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 76/207 (36%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFETVKHFVTGIIDSLAVSP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G +E + G + + L++ + + F E Sbjct: 701 TQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARPPS 756 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 757 TQVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASEP 809 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ + Sbjct: 810 IDKHLFYAED----FSTMGEISEKLKE 832 >gi|62185620|gb|AAH92298.1| Matrilin 2 [Mus musculus] Length = 937 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N + + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 76/207 (36%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFETVKHFVTGIIDSLAVSP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G +E + G + + L++ + + F E Sbjct: 701 TQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARPPS 756 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 757 TQVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASEP 809 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ + Sbjct: 810 IDKHLFYAED----FSTMGEISEKLKE 832 >gi|74202868|dbj|BAE37504.1| unnamed protein product [Mus musculus] Length = 928 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N + + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 76/207 (36%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFETVKHFVTGIIDSLAVSP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G +E + G + + L++ + + F E Sbjct: 701 TQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARPPS 756 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 757 TQVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASEP 809 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ + Sbjct: 810 IDKHLFYAED----FSTMGEISEKLKE 832 >gi|7387906|sp|O08746|MATN2_MOUSE RecName: Full=Matrilin-2; Flags: Precursor gi|2072792|gb|AAC53163.1| matrilin-2 precursor [Mus musculus] Length = 956 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N + + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTLKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 76/207 (36%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFETVKHFVTGIIDSLAVSP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L G +E + G + + L++ + + F E Sbjct: 701 TQVRTEFTLR-GFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARPPS 756 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 757 TQVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASEP 809 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ + Sbjct: 810 IDKHLFYAED----FSTMGEISEKLKE 832 >gi|320010752|gb|ADW05602.1| Protein of unknown function DUF3520 [Streptomyces flavogriseus ATCC 33331] Length = 528 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 85/225 (37%), Gaps = 37/225 (16%) Query: 153 LLITSSVKISSKSDI--GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + + + + +S + + V+D+S SM + +L +A S+ + D ++ Sbjct: 152 VRVGLATRAASNTGERPPAALTFVVDISGSMAEP-----GRLDLAKTSLGILADELRDDD 206 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 V+ LVTFS + P+ +++ I + +T G+E Y + Sbjct: 207 SVS------LVTFSEEAETRLPMTRLRGNRTKLRDAIEEMEPADSTNVAAGVERGYEEAV 260 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQA 326 + K ++ L+D ++ + L +A+ G ++ +GV + Sbjct: 261 EGHRKGATNR---------VVLLSDALANTGETEADAILERVGDARQEYGITLFGVGVGS 311 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + D+ ++ + N + + I E +++ ++ Sbjct: 312 DYGDELMER---------LTNKGDGNTTY--IADEAQARKVFVDQ 345 >gi|255570578|ref|XP_002526246.1| protein binding protein, putative [Ricinus communis] gi|223534440|gb|EEF36143.1| protein binding protein, putative [Ricinus communis] Length = 513 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 52/202 (25%), Positives = 75/202 (37%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS GLD++ VLDVS SM G + KL +A + + L I R +V Sbjct: 55 SSNDRPGLDLVAVLDVSGSMA---GEKIAKLKMAMLFMIKKLSPID---------RLSIV 102 Query: 222 TFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TFS+ + PL + + INRL T T GLE A + D + Sbjct: 103 TFSTDSTRLCPLRQITENSQKEFENLINRLKADGWTNITAGLETALKVLNDRSFNGGRVV 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 I+ ++DGE+++ + L V+ G + LK A Sbjct: 163 G--------IMLMSDGEHNTDGDPAEVPL--------GNVPVHTFGFGRNYEPRVLKAVA 206 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 VQN+ L AF Sbjct: 207 HKSIGGTLSDVQNTNNLGKAFS 228 >gi|213965586|ref|ZP_03393780.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] gi|213951745|gb|EEB63133.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] Length = 330 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 85/229 (37%), Gaps = 27/229 (11%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + + + + +M+V+DVSLSM+ D++ A + +E ++ ++ Sbjct: 75 ISLAGPISETKVARNRATVMLVVDVSLSMSATDVAP-DRITAAKEAGQEFVE------NL 127 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + + GLVTFS + + + T + + A + I + Sbjct: 128 PDDLNIGLVTFSGRARTAVSPTTNHDTVNRALQAAELDQATATGDAIAAALDAINQFSDS 187 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQ----- 325 ++ +G I+ L+DG+ + P D + + EA + G V I Sbjct: 188 VQGGGEGAPPAT--IVLLSDGKQTVPQELDDPRGAYTAAAEAAKAGVPVNTISFGTAQGA 245 Query: 326 ---------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 D L+ A + F+S + +L DA+ + ++ Sbjct: 246 ITVQGELIPVPNDDDSLREIARRTKGEFFSAGSLEQLRDAYGSLEDDIG 294 >gi|171058998|ref|YP_001791347.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] gi|170776443|gb|ACB34582.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] Length = 350 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 74/235 (31%), Gaps = 50/235 (21%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +DVS SM ++L + + + + D+ VR G+V+F+ Sbjct: 95 LLLAMDVSGSMRATDVEP-NRLVASQVAAKNF------VKDLPRHVRVGVVSFAGTAAVV 147 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI--- 287 + I+R T G+ + +F G K I Sbjct: 148 QAPTHSRDDVFAAIDRFQLQRGTAIGSGIVLSLATLFPEAGIDLSDITGERRMPKGIGES 207 Query: 288 -------------------IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 I LTDG+ ++ + + A RG VY +G + Sbjct: 208 DKQDDFKPVAPGSYGSAAVILLTDGQRTTG----PDPMDAAKMAADRGVKVYTVGFGTTS 263 Query: 329 A---------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + LKN A + ++ ++ L + + +V +R Sbjct: 264 GEIIGFEGWSMRVRLDEATLKNIANLTQAEYFYAGSATDLQKVYDTLSSRLVFER 318 >gi|149709406|ref|XP_001496048.1| PREDICTED: similar to calcium-activated chloride channel [Equus caballus] Length = 904 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 49/192 (25%), Positives = 74/192 (38%), Gaps = 34/192 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ L + + ++ II+ V G+VTF S Sbjct: 310 VCLVLDKSGSMDSDD----RLLRMNQAAELYLIQIIEKESLV------GMVTFDSSAEIQ 359 Query: 231 FPLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L Q I +L G T GL+ + I + + Sbjct: 360 NNLTKITDDNAYQNIIAKLPQFAGGGTSICNGLKAGFQAIVYSNQSTSGSE--------- 410 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 II LTDGE++ + C EAK GAI+++I + AA + L N RF Sbjct: 411 IILLTDGEDNQMSS--------CFEEAKASGAIIHSIALGPSAAKELETLSNMTGGLRFS 462 Query: 344 SVQNSRKLHDAF 355 + ++ L DAF Sbjct: 463 ANKDINGLIDAF 474 >gi|293344916|ref|XP_001079629.2| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|293356747|ref|XP_215375.5| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|149043683|gb|EDL97134.1| procollagen, type VI, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1034 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 77/208 (37%), Gaps = 19/208 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 41 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 100 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ I+ + FG T + ++ ++ Sbjct: 101 YSDEVEIIRGLMRMPSGRDELKASIDAVKYFGKGTYTDCAIKKGLEELLIGG-------- 152 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 153 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSIIA 212 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + DA I + + Sbjct: 213 TDHTYRRNFTAADWGHSRDAEETISQTI 240 >gi|116695550|ref|YP_841126.1| hypothetical protein H16_B1611 [Ralstonia eutropha H16] gi|113530049|emb|CAJ96396.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 354 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 89/263 (33%), Gaps = 66/263 (25%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T+++ + S + + +++ +D S SM ++G + ++ R+++ + + Sbjct: 76 TATITLPSDT---ITLVLAMDTSRSMAATDVAP-TRIGASKQAARDLIVGLPAS------ 125 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA------ 269 VR G+V+F++ P Q + + I+R T + GL A +F Sbjct: 126 VRLGMVSFAATATVVLPPTDNRQDMLDAIDRFQLQLGTATGSGLIQALAVLFPDDGIDLE 185 Query: 270 -----------------------------KEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 +E+ A+ +I L+DG ++ Sbjct: 186 AILFSGESLAPGPGGRSLTEAAAADAVRKREQERPAAQPGSYRHGAVILLSDGRRTTG-- 243 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAAD-----------QF----LKNCA--SPDRFY 343 + L A +RG VY +G Q L+ A + ++ Sbjct: 244 --PDPLDAARMAAQRGVRVYTVGFGTPQGGAAAESGLSYYMQLDEPALRAVAAITNGEYF 301 Query: 344 SVQNSRKLHDAFLRIGKEMVKQR 366 ++ L + ++ +R Sbjct: 302 QAGSAADLSQVYRQLSARFALER 324 >gi|296489197|gb|DAA31310.1| epithelial chloride channel protein [Bos taurus] Length = 905 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 46/193 (23%), Positives = 77/193 (39%), Gaps = 36/193 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKL-GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +VLD S SM+ D+L + + ++ +I+ V G+VTF S Sbjct: 310 VCLVLDKSGSMSAE-----DRLFQMNQAAELYLIQVIEKGSLV------GMVTFDSVAEI 358 Query: 230 TFPLAWGVQ-HIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L ++ +KI + T GL+ + I + + Sbjct: 359 QNHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQSTSGSE-------- 410 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 II LTDGE++ N C + KR GAI++ I + AA + L N RF Sbjct: 411 -IILLTDGEDNEINS--------CFEDVKRSGAIIHTIALGPSAAKELETLSNMTGGYRF 461 Query: 343 YSVQNSRKLHDAF 355 ++ ++ L +AF Sbjct: 462 FANKDITGLTNAF 474 >gi|255039218|ref|YP_003089839.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951974|gb|ACT96674.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 320 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 42/201 (20%), Positives = 75/201 (37%), Gaps = 28/201 (13%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKL 192 I F S + + ++ DI G D+ MV+D+S SM+ +L Sbjct: 42 TLIIKLFLRSITFSLLIISLLGPSFGEAERDIQAKGKDIFMVVDLSKSMDAADVTP-SRL 100 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---- 248 + ++ ++ R G++ FS+ PL + ++ I L Sbjct: 101 EKVKFELNRFIENERAN-------RIGIIIFSNDAYIHVPLTYDAAALELFIQSLQTDLL 153 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 + T +E AYNK+ ++ + K ++ TDGENSS + Sbjct: 154 PTNGTNVCGAIEMAYNKLMNSADPTSRA--------KMMVLFTDGENSSSCTNALF---- 201 Query: 309 CNEAKRRGAIVYAIGVQAEAA 329 N +R G VY++ V + Sbjct: 202 -NNLRRFGIGVYSVAVGTKVG 221 >gi|219848163|ref|YP_002462596.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542422|gb|ACL24160.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 914 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 42/219 (19%), Positives = 81/219 (36%), Gaps = 34/219 (15%) Query: 143 PWCANSSHAPLLITSSVKISS------KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 P + + + + + L +++V+D S SM + G +L +A Sbjct: 379 PRSFGAGGWRRTLLEPILPVALDPPLREERPDLALVLVIDRSGSMRELVDDGRTQLDLAR 438 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTK 254 ++ ++ + + L+ F S PL I++ ++RL+ G T Sbjct: 439 EAVY------QASRGLTQRDQIALIAFDSIADTLLPLQPLPGLFTIEDALSRLVAGGGTN 492 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 G+ A I ++ ++ H +I LTDG + + D L + Sbjct: 493 IRSGIALAAETIATSQARIRH-----------VILLTDGVSETEYADLVADL------RA 535 Query: 315 RGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKL 351 +G V AI + + D L+ A ++Y VQ L Sbjct: 536 QGITVSAIAIGLD-TDPALERVAQIGGGKYYLVQRVPDL 573 >gi|119899154|ref|YP_934367.1| hypothetical protein azo2864 [Azoarcus sp. BH72] gi|119671567|emb|CAL95480.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 339 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 43/246 (17%), Positives = 83/246 (33%), Gaps = 49/246 (19%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 ++ + + S+ +++ +DVSLSM+ D+L A + R D +++ P Sbjct: 76 SAVITLPSEQRT---IILAIDVSLSMSAPDVLP-DRLSAAQAAAR---DFVRNQPP---D 125 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD------- 268 VR G+V+F+ + + I+RL T G+ A + +F Sbjct: 126 VRIGIVSFAGTATVVQAPTDNREDLLGAIDRLQLARHTAIGSGIIVALSALFPEESFDPD 185 Query: 269 -----------AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 A + +I LTDG +S E + A RG Sbjct: 186 PTMMSSAEPGRAPNAPREEVAPGSNGSAAVILLTDGRRTSG----PEPVDAARMAAVRGI 241 Query: 318 IVYAIGVQAEAADQF---------------LKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 V+ +G L+ A + +++ + +L + + Sbjct: 242 RVFTVGFGTAEGATIQNEGWSVFMRFDEGTLRAIADLTQAKYFHAGTAAELQQIYHDLNA 301 Query: 361 EMVKQR 366 V +R Sbjct: 302 RYVLER 307 >gi|71061058|dbj|BAE16255.1| calcium activated chloride channel [Rattus norvegicus] Length = 902 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 45/204 (22%), Positives = 71/204 (34%), Gaps = 36/204 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 ICLVLDKSGSMDTE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSTAQIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + I + T GLE + I + + Sbjct: 359 NYLIKITNTGDYKKITGNL-PQQAVGGTSICRGLEAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE+ + + C K GA+++ I + +AA + L + RF Sbjct: 410 -IVLLTDGED--------DLISSCFEVVKHSGAVIHTIALGPKAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 Y+ ++ L DAF I Sbjct: 461 YANKDVNSLMDAFSGISSASGNLS 484 >gi|300869050|ref|ZP_07113652.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] gi|300332961|emb|CBN58846.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] Length = 411 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 68/199 (34%), Gaps = 28/199 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L++ ++LD S SM L ++ ++D +K R ++ F Sbjct: 37 DRSVPLNLCLILDHSGSMGG------RPLDTVKQAAGRLVDRLKPGD------RLSVIAF 84 Query: 224 SSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K P I+++I++L T GL+ ++ K + A Sbjct: 85 DHKAKVIVPNQFIDDPGSIKKQIDKLRSSGGTAIDEGLKLGIEEMGKGKSETVSQA---- 140 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASP 339 LTDGEN DN L A + ++G + L+ A+ Sbjct: 141 ------FLLTDGENEHG--DNNRCLKLAKLAADYNMTLNSLGFGDDWNQDILEKIADAAG 192 Query: 340 DRFYSVQNSRKLHDAFLRI 358 +Q + D F R+ Sbjct: 193 GTLAYIQRPEQAIDEFSRL 211 >gi|162448748|ref|YP_001611115.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] gi|161159330|emb|CAN90635.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 78/213 (36%), Gaps = 25/213 (11%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 + + + S V + + L +++ +D S SM + ++++I + Sbjct: 111 TLIQIGMNSPVDLGALERPPLHLVIAVDTSGSMEGDPIAYVRA---------GLVEMIDA 161 Query: 209 IPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + R LV +S ++V + + E L +T GL AY Sbjct: 162 LQP---TDRISLVRYSDAAEVVLEQAEGSDREALTEAFEGLTARGSTNLYEGLFTAYAL- 217 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +H+ + +IFL+DG ++ + + +G + AIGV A Sbjct: 218 -----AEQHLDPAWQNR---VIFLSDGVATAGLTSPQRLVSLAAGYAEKGIGLTAIGVGA 269 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 E ++ + FY +++ + + + F Sbjct: 270 EFDVDAMRGISEVGAGNFYFLEDPKAVEEVFAE 302 >gi|325473816|gb|EGC77004.1| BatA protein [Treponema denticola F0402] Length = 282 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 51/263 (19%), Positives = 90/263 (34%), Gaps = 48/263 (18%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 C F W + ++ + +K +D G +M +LD+S SM G ++ Sbjct: 6 FLCYFFWYCGIIFLIIALSEPIVFKNKQVYTDAGSSIMFLLDISPSMAAKDMNGETRIAA 65 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + IR+ + P + GL SS P + +++ L G Sbjct: 66 AKKIIRKF---VAKYPGDS----FGLTALSSSAALILPPTIDHKVFLSRLDSLSIGELGD 118 Query: 255 STP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 T G+ A + + + KL YI+ LTDGEN++ I+ K + Sbjct: 119 GTAIGMGLAVSSAYMTRTKLNSS---------YIVLLTDGENNTGEINPKTA---AEVLV 166 Query: 314 RRGAIVYAIGVQAEAADQF-----------------------LKNCA--SPDRFYSVQNS 348 + Y IG+ + LK A ++ S + Sbjct: 167 NKNIGFYVIGIGSSGYTTLEYTDRKTGKTYSGSIFSKFDEVELKKIAQYGNGKYASASSP 226 Query: 349 RKLHDAFLRIGKEMVKQRILYNK 371 L D F I K++ + + + Sbjct: 227 EILEDIFNTISKQVPAAQSNFTR 249 >gi|290999945|ref|XP_002682540.1| predicted protein [Naegleria gruberi] gi|284096167|gb|EFC49796.1| predicted protein [Naegleria gruberi] Length = 502 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 87/236 (36%), Gaps = 27/236 (11%) Query: 138 IFCTFPWCANSSHAPLL-ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM--DKLGV 194 + C P+ L + ++ +++ +VLD+S SM++ KL Sbjct: 41 VQCEIPFLVRLLSGNLPPQEEEAETTNVLKTPVNICLVLDISGSMDEPLKNRSKGSKLTA 100 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 +IRE++ + D L+T+S F I+++ +T Sbjct: 101 CKSAIRELVTNFLTYKDT-----IHLITYSDSPKTVFTEKNKESVNLNDIDKISTEGSTN 155 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-------ESLF 307 L A + + ++ + K I F +DG+ + + + L Sbjct: 156 IASALHSAVDLLHNS----------NAPGTKLIAFFSDGQCNVGETNLNIFGSGLLKKLK 205 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKE 361 +E K + + GV ++ + +L+ A + +Y +++ DAF R K+ Sbjct: 206 DYSEGKDDQIHISSYGVGSDYDELWLQAIARTGKGEYYYLEDETYAKDAFERSLKK 261 >gi|148261962|ref|YP_001236089.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|326405471|ref|YP_004285553.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] gi|146403643|gb|ABQ32170.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|325052333|dbj|BAJ82671.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] Length = 431 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 64/450 (14%), Positives = 138/450 (30%), Gaps = 109/450 (24%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F+ + + +G+I+I+TA++ + ++G+ I+ K+++ D + L T Sbjct: 5 FIKFKALCGDRRGNIAIITALVSLTLIFILGMGIDYGLAIDRKSQMESYADAAALAAVTP 64 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 + ++ N++ L G + N++ + S++ D + Sbjct: 65 AMVAAGQSSAITTAQ--------NVFNAQA---LTMTGVTYNANDV--TVSIATSGDKRT 111 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 A S+ +P + + I ++ +D ++LD S SM Sbjct: 112 ATVQYQAQSQAMLPDVMGFG---------SIKIGGQATATTTIAPNIDFYLLLDDSPSMA 162 Query: 183 --------------------------------DHFGPGMD--------------KLGVAT 196 D G ++ + Sbjct: 163 IAATQSGINTMVANTTAQGGCAFGCHEENPSADKLGNPYGEDNYALARSLGVTLRIDMLR 222 Query: 197 RSIREMLDIIKSIPDVNNVV-RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF------ 249 ++ ++++ ++ R + TF + L + Q + + Sbjct: 223 QATQDLMTTAQTTETQKGTTYRMAIYTFDIGLNTIGNLTSDLSQAQTEAGNIQLLEVYSN 282 Query: 250 ----------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSP 298 T L N I A D ++ + F+TDG E+ Sbjct: 283 NWLTQNDYNDDEDTNYDTALN-GINAIMPNPGNGTGAA--GDTPQEVLFFVTDGVEDEDV 339 Query: 299 NIDNKESLF---YCNEAKRRGAIV---YA--------------IGVQAEAADQFLKNCAS 338 N + ++SL C K RG + Y I + L+ CAS Sbjct: 340 NGNRQQSLLNTDLCTAIKNRGIRIAVLYTEYLPLPTNSWYNTYIAPFQNSIAPTLQQCAS 399 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 P ++ V++ + A + + V+ L Sbjct: 400 PGLYFEVKSGGDISAAMSALFQTAVQSSYL 429 >gi|115438797|ref|NP_001043678.1| Os01g0640200 [Oryza sativa Japonica Group] gi|20805117|dbj|BAB92788.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113533209|dbj|BAF05592.1| Os01g0640200 [Oryza sativa Japonica Group] gi|125527021|gb|EAY75135.1| hypothetical protein OsI_03030 [Oryza sativa Indica Group] gi|125571342|gb|EAZ12857.1| hypothetical protein OsJ_02777 [Oryza sativa Japonica Group] Length = 589 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 47/261 (18%), Positives = 90/261 (34%), Gaps = 36/261 (13%) Query: 106 NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA-PLLITSSVKISSK 164 N+ E++ S + ++ ++ + + H LL+ SS Sbjct: 4 NDDEQTAPTSTATEPARPTVGITGQLVKQVRLNKYHNDVASMAPHDQELLLELRGSSSST 63 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 GLD++ V+DVS SM+ +DK+ A + + L +++ R +VTF Sbjct: 64 DRAGLDLVAVIDVSGSMDGD---RIDKVKTALQFVIRKL---------SDLDRLCIVTFC 111 Query: 225 SKIVQTFPLAWGV----QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + PL + ++ ++ L T GLE + + D + A Sbjct: 112 TNATRLCPLRFVTAAAQAELKALVDGLKAYGDTNMKGGLETGMSVV-DGRSLAAGRAVS- 169 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-- 338 ++ ++DG + L + VY A L+ A Sbjct: 170 ------VMLMSDGYQNHGGDARDVHL--------KNVPVYTFSFGASHDSNLLEAIARKS 215 Query: 339 -PDRFYSVQNSRKLHDAFLRI 358 F V +S L F ++ Sbjct: 216 LGGTFNYVADSANLTGPFSQL 236 >gi|328882566|emb|CCA55805.1| putative exported protein [Streptomyces venezuelae ATCC 10712] Length = 543 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 75/221 (33%), Gaps = 29/221 (13%) Query: 146 ANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 A S L+ ++ + V+D+S SM + +L + S+ + Sbjct: 146 AGSDGWSLVRVGLATRAADRTGERPPAALTFVVDISGSMAEP-----GRLDLVKESLGLL 200 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGL 259 ++ + LVTFS + P+ ++E +N L S+T G+ Sbjct: 201 ------ADELRDDDSIALVTFSDEAETRLPMTRVGEARGRVREVVNSLATTSSTNVEAGV 254 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAI 318 Y+ D K ++ L+D N+ E K G Sbjct: 255 RTGYDVAVDGHRKDATNR---------VVLLSDALANTGATEAGAILERIEEERKAYGIT 305 Query: 319 VYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 ++ +GV ++ D F++ A + V S + F+ Sbjct: 306 LFGVGVGSDYGDAFMERLADRGDGQTTYVSTSAQARKVFVD 346 >gi|296208411|ref|XP_002751094.1| PREDICTED: epithelial chloride channel protein-like [Callithrix jacchus] Length = 904 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 44/193 (22%), Positives = 68/193 (35%), Gaps = 36/193 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKL-GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +VLD S SM D+L + + ++ II+ V G+VTF S Sbjct: 309 VCLVLDKSGSMAAE-----DRLFRMNQAAELYLIQIIEKGSLV------GMVTFDSYAQI 357 Query: 230 TFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L T GL+ + I + + Sbjct: 358 QNNLIKITEDNTYQKITANLPQEASGGTSICNGLKAGFQAISQSNQSTLGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 II LTDGE++ + C E ++ GAI++ I + A + L N RF Sbjct: 410 -IILLTDGEDNQ--------ISLCFEEVRQSGAIIHTIALGPSAEKELETLSNMTRGHRF 460 Query: 343 YSVQNSRKLHDAF 355 Y+ + L DAF Sbjct: 461 YAHNDINGLIDAF 473 >gi|291297006|ref|YP_003508404.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290471965|gb|ADD29384.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 313 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 79/211 (37%), Gaps = 36/211 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D+SLSM + A + + ++S+PD ++ GLV+F+ Sbjct: 87 VIVTIDISLSMRAQ-DIQPTRFEAAKQEAKNF---VRSLPDG---IKVGLVSFAGYATLE 139 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 Q + ++I L T GL + + G ++ L Sbjct: 140 AEPTTDHQRVIDQIELLQMARRTAIGDGL---LESLRAIPKDENGKPLGPST----VVLL 192 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----------------DQFLK 334 +DG +S + + A+ G +V+ IG+ + ++ L+ Sbjct: 193 SDGRTNSGV----DPMEVAPFARDMGVVVHTIGLGRRSNPGDPDQYWGGYWMQFDEETLR 248 Query: 335 NC--ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 A+ ++Y+ ++ L A+ +G+ + Sbjct: 249 AIAEATGGQYYAAGSAEALRQAYRNLGRMVG 279 >gi|158334872|ref|YP_001516044.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305113|gb|ABW26730.1| von Willebrand factor type A domain protein [Acaryochloris marina MBIC11017] Length = 419 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 68/201 (33%), Gaps = 28/201 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + L++ ++LD S SM G + + A +S+ + L+ R +V F Sbjct: 43 NAPLNLCLILDHSGSMT---GRPLTTVKEAAQSLIDRLNP---------GDRIAVVAFDH 90 Query: 226 KIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P L + I+ I RL T G++ ++ + K Sbjct: 91 HAKVLVPNQLVEDPEQIKALIQRLEPKGGTAIDDGMKLGIEELA--------VGKQGTIS 142 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 + + LTDGEN DN+ + A + +G + + L+ A Sbjct: 143 QAF--LLTDGENEHG--DNQRCQQFAELAAGYNITLNTLGFGSHWNEDVLEGIADSGGGS 198 Query: 342 FYSVQNSRKLHDAFLRIGKEM 362 ++ D F + + Sbjct: 199 LSFIEKPENAVDVFNSLFTRI 219 >gi|282900974|ref|ZP_06308907.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] gi|281194065|gb|EFA69029.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] Length = 487 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 69/198 (34%), Gaps = 28/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM+D KL + + + V + +V F S++ Sbjct: 51 AIVLLIDTSSSMSD------GKLTEVKTAASQFMQRRNL-----EVDQIAVVNFGSEVAT 99 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 PL + + IN+L+ +T G++ A ++ K II Sbjct: 100 PAPLTNDINILNNAINQLLENGSTPMGEGIDTAQGQL------------QATTLNKNIIL 147 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 TDG PN +L + G + A+ + + + NS Sbjct: 148 FTDGIPDDPNFAYNSALSV----RNAGIKLIAVATGGADTNYLTQITGDRSLVFYA-NSG 202 Query: 350 KLHDAFLRIGKEMVKQRI 367 + AF + + KQ I Sbjct: 203 QFDQAFSQAEAVIYKQLI 220 >gi|118348690|ref|XP_001007820.1| U-box domain containing protein [Tetrahymena thermophila] gi|89289587|gb|EAR87575.1| U-box domain containing protein [Tetrahymena thermophila SB210] Length = 790 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 71/197 (36%), Gaps = 16/197 (8%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-----SIPD 211 S + D+ V+DVS SM+D K G + +LD++K I + Sbjct: 124 SIKTPEGQQRSACDICCVIDVSGSMSDEAKIKNSK-GDIESNGLTILDLVKHSVKTIINN 182 Query: 212 VNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 ++ R LV F + + L G H +++ +LI +T G+ A + Sbjct: 183 LDERDRLSLVAFHTNAYKITDLTPMNENGRNHAIKELEKLIPLDSTNIWDGIYQALEVVK 242 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR---GAIVYAIGV 324 +++ + + + I+ TDG+ + I + L + K + G Sbjct: 243 AGQQQSIQKGEQRVAFSQ-ILLFTDGQPN--VIPPRGHLPMLKKYKEENDVNCSISTFGF 299 Query: 325 QAEAADQFLKNCASPDR 341 + L A R Sbjct: 300 GYNLDSELLDQLAIEGR 316 >gi|297303947|ref|XP_002808579.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Macaca mulatta] Length = 1313 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 71/206 (34%), Gaps = 36/206 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+DVS SM K+ +++ +L +++ N +++FS I Sbjct: 284 VVFVIDVSGSMFG------TKMEQTKKAMNVILSDLRANDYFN------IISFSDTINV- 330 Query: 231 FPLAW-----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 W V ++ ++R+ T L A + + + ++ Sbjct: 331 ----WKAGGSIQATIQNVHSAKDYLHRMEADGWTDINSALLAAASVLNHSNQEPGRGPSV 386 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 IIFLTDGE ++ L +A ++ + +A L+ + Sbjct: 387 GRIP--LIIFLTDGEPTAGVTTPSVILSNVRQAVGHRVSLFTLAFGDDADFTLLRRLSLE 444 Query: 340 DR------FYSVQNSRKLHDAFLRIG 359 +R + + +L + I Sbjct: 445 NRGIARRIYEDTDAALQLEGLYEEIS 470 >gi|86134840|ref|ZP_01053422.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821703|gb|EAQ42850.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 349 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 67/202 (33%), Gaps = 29/202 (14%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + I S K+ + G+D++ LDVS SM ++L Sbjct: 57 LKLIMLLLGISFLILSLVNPKMGS-KLKTVKREGVDVVFALDVSKSMLAEDIAP-NRLEK 114 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFG 250 A + I +++D + S R G++ ++ P+ + ++ Sbjct: 115 AKQIISKIIDKLGS-------DRVGVIIYAGNSYPLLPITTDHAAANMFLQNANPDMVSS 167 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T LE A + ++ ++++ ++DGE+ +E+ Sbjct: 168 QGTAINEALELAKTYYNNDEQTN-----------RFLVIISDGEDHQ-----EETKQVAQ 211 Query: 311 EAKRRGAIVYAIGVQAEAADQF 332 G +Y IGV E Sbjct: 212 NLSNDGVKIYTIGVGTEKGGPI 233 >gi|326675803|ref|XP_002665305.2| PREDICTED: collagen alpha-1(XII) chain [Danio rerio] Length = 3039 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 57/346 (16%), Positives = 122/346 (35%), Gaps = 48/346 (13%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 F + ++ L + + LN +G++ S + +I R + + Sbjct: 270 FDMINQVQKALITEVCSGVEEQLNS--LVSGEEVVEPASNLRVTDISSKSMR--IAWDSS 325 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVS-------RYEMPFIFCTFPWCANSSHAPLL 154 DI + +I + + Y ++ EM + F + P + Sbjct: 326 PGDITGYKLQVFPTIASAKKQELYTGPTMTLANVRDLTPEMEYEISLFALKGLTPSVPQV 385 Query: 155 ITSSVKI-------SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 +T + S + D+ D+++++D S S G+ + +++ Sbjct: 386 VTEKTQPVKVFTECSLEVDVQADIVLLVDGSYS------IGLPNFAKVRAFLEVLVNSFD 439 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 P+ V+ LV +S F L ++ G +T + + Y Sbjct: 440 IGPNK---VQISLVQYSRDPHTEFALNKFDDNAAMVKAVRTFPYRGGSTNTGKAMTYVRE 496 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 KIF + D+ + ++ +TDG++S D N+ + ++A+GV Sbjct: 497 KIFVSGRGAR------DNVPRVMVLITDGKSSDSFKDP------ANKLRDTDVEIFAVGV 544 Query: 325 QAEAADQF--LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + + + N + + + V++ DAF RI KE+ L Sbjct: 545 KDAVRSELEAIANVPADNHVFEVED----FDAFERISKELTASICL 586 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 88/251 (35%), Gaps = 29/251 (11%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ MP ++++ L T K+ D+++++D Sbjct: 1118 GTKYTVNVFGMFDGGESMPLAGEEKTTLSDAASENLPYT-PSDAQCKTTAQADIVLLVDG 1176 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I M+ + PD V+ GL +S + L Sbjct: 1177 SWSIGRL------NFKTIRNFIARMVGVFDIGPDR---VQIGLAQYSGDPKTEWHLNAHR 1227 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + L + T A N I K + + +K + +TDG++ Sbjct: 1228 TRTQLLDAVANLPYKGGNTLTG---LALNYILQNNFKPNVGMRPNS--RKIGVLVTDGKS 1282 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + N ++L + +G +YAIGV+ ++ + PD Y+V + L D Sbjct: 1283 QDDIVANSQNL------RDQGIELYAIGVKNADENELRTIASDPDDIHMYNVADFSFLLD 1336 Query: 354 AFLRIGKEMVK 364 + + Sbjct: 1337 IVDDLTNNLCN 1347 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 43/275 (15%), Positives = 86/275 (31%), Gaps = 46/275 (16%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM--- 171 ++ + + + Y + T S A TS ++ D + + Sbjct: 29 TVQMSWRKPSSQIQ---GYRIQVTSDTDDSKELSLPASSTSTSITDLTPDVDYSVSINSY 85 Query: 172 ------------------MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 V D+ ++ + G + I M D Sbjct: 86 DGAEESIPILGQITKCSVSAVADLVFLVDGSWSVGRENFRFIRSFIGAMAGAFDIEEDK- 144 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 R +V +SS F L + + I L + G T + ++Y + + Sbjct: 145 --TRVAVVQYSSDTRTEFSLNTHFRRPDVLRAIKNLPYKGGNTMTGDAMDYL---VKNTF 199 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + KG + K + +TDG++ P Y + G ++ +G++ D Sbjct: 200 TQAAGARKG---FPKVAMIITDGKSQDPVE------EYAERLRNIGVEIFVLGIKGADED 250 Query: 331 QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEM 362 + LK AS Y+V N ++ + E+ Sbjct: 251 E-LKEIASRPHSKHVYNVPNFDMINQVQKALITEV 284 >gi|312886237|ref|ZP_07745851.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301262|gb|EFQ78317.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 348 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 71/200 (35%), Gaps = 31/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + ++ P K+ G D+M++LDVS SM ++L A Sbjct: 61 LFIIAYGFLIVAAADPQ---VGSKMEEVKRKGADLMILLDVSNSMLSQDLSP-NRLENAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGST 252 R+I +++D + + R G++ F+ + P+ + +N ++ Sbjct: 117 RAISQLIDNL-------HDDRIGIIVFAGQAYVQLPITTDYSAAKLFLNTINTNMVPTQG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T ++ K +I +TDGEN +++ N A Sbjct: 170 TAIGAAIDLGMQSFDFKNGM-----------SKAMIVITDGENHE-----DDAVSAANHA 213 Query: 313 KRRGAIVYAIGVQAEAADQF 332 + + V IGV +E Sbjct: 214 RDKDVTVNVIGVGSEEGAPI 233 >gi|300120207|emb|CBK19761.2| unnamed protein product [Blastocystis hominis] Length = 474 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 79/209 (37%), Gaps = 45/209 (21%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN----NVVRSGLV 221 +D++ ++D S S+ D R+ +D +KSI D N ++ Sbjct: 112 QGVMDVVFLIDSSSSITDENY-------------RKEIDFVKSILDYYYLHPNYTLVSIL 158 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FS+ + L + +++ I+ T +E A+ + +++ D Sbjct: 159 EFSTDVRVLQELTYDACDVRKAIDSDRMSGLTNIAKAIEEAHRILKNSR---------SD 209 Query: 282 DYKKYIIFLTDGE---------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--AD 330 + I+ +TDG N P+ N ++ AK +Y IGV A + D Sbjct: 210 IPDQ-IVLITDGFQTVHSSINCNDHPHDCNAYAIEKARAAKADDIQIYTIGVGAASYYED 268 Query: 331 QFLKNCASP-DRFYSVQNSRKLHDAFLRI 358 + +SP D+++ L D + I Sbjct: 269 DLRQIASSPSDQYF------SLVDDYSSI 291 >gi|238060066|ref|ZP_04604775.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881877|gb|EEP70705.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 316 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 63/209 (30%), Gaps = 32/209 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+ +DVSLSM ++L A + + + +P N GLV+F+ Sbjct: 89 VMLAIDVSLSMQADDVAP-NRLEAAQEAAK---QFVAELPQTYN---LGLVSFAKSANVL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + I+ L+ T + + I I+ L Sbjct: 142 VPPTKDRDAVTTAIDGLVLAEATATGEAVFTCLEAIRSVPADGAAGIPPAR-----IVLL 196 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----DQFLKNC---------- 336 +DG +S + + + V I +A L+ Sbjct: 197 SDGFRTSGRSVEEAAAAA----QAANVPVSTIAFGTDAGQVDIGGQLQRVPVDRMALAEL 252 Query: 337 --ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + FY + +L + +G + Sbjct: 253 AETTEGYFYEAASVSELKQVYQDMGSSIG 281 >gi|227820127|ref|YP_002824098.1| transmembrane protein [Sinorhizobium fredii NGR234] gi|227339126|gb|ACP23345.1| putative transmembrane protein [Sinorhizobium fredii NGR234] Length = 451 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 53/369 (14%), Positives = 126/369 (34%), Gaps = 89/369 (24%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 + G++++ AI + + + +G ++ + + V++++ LD +L+ +I + Sbjct: 19 MVSDRGGNVALTVAICIIPMILAVGAGLDYTRAYNVQSRMQSDLDAALVAAIKEIDEYDE 78 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSII-IDDQHKDYNL 127 +K K+ F D + ++S + + I + + Sbjct: 79 DEIAEKIKDWF------------------------DAQSEKQSATYDLTEITVDKSGHTI 114 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM------ 181 +A + +P T A+ P+ + S+++ + S L++ +V+D S SM Sbjct: 115 TASASGTVPTTLMTL---ADIKTVPVGVISAIEGPATS--YLEVYIVIDKSPSMLLAATS 169 Query: 182 --------------------------------NDHFGPGMDKL------GVATRSIREML 203 + + L VA ++ E+L Sbjct: 170 EDQAMLRADANITCEFACHDTKDPVKKNGTVIASTYYNYIKSLGVKLRTDVALDAVEEVL 229 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG---LE 260 D++ + + + ++ GL + I + + ++K++ G T+ ++ + Sbjct: 230 DMVDAADEDHARIKVGLYSLGETISEVLEPTYSTSTARKKLSDDSSGLTSATSMSATYFQ 289 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-----------C 309 A + K ++ LTDG S+ + K S Y C Sbjct: 290 TALKALKKKVG-TAGDGTSAASPLKLVLLLTDGVQSNRDWVIKWSGKYWGRVTPLNPDWC 348 Query: 310 NEAKRRGAI 318 + K A Sbjct: 349 DYLKDNDAT 357 >gi|114048546|ref|YP_739096.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113889988|gb|ABI44039.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 625 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 65/340 (19%), Positives = 119/340 (35%), Gaps = 49/340 (14%) Query: 55 SLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST-- 112 + +A LN + ++N F ++ I E + FA D++ +T Sbjct: 107 AATISAAPALNGDWPGAVPPERNRFEKQVQNGI---MVAGETPVSTFAIDVDTGSYTTLR 163 Query: 113 -SLSIIIDDQHKDYNLSAVSRY-----------EMPFIFCT--FPWCANSSHAPLLITSS 158 L Q + + Y E PF T P N L I Sbjct: 164 RMLKEGRLPQKDTLRVEEMLNYFSYDYPLPSKNEAPFSVTTELAPSPYNDDMMLLRIGLK 223 Query: 159 VKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 SK+++G ++ +LDVS SM DKL + +++ + + + V+ VV Sbjct: 224 GYEQSKAELGASNLVFLLDVSGSMA-----SPDKLPLLQTALKMLTQQLGAQDKVSIVVY 278 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +G +V Q + + +L G +T G++ AY +H+ Sbjct: 279 AGAAG----VVLDGAAGNDSQTLNYALEQLSAGGSTNGAQGIQLAYQL------AKKHLV 328 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNC 336 +G + +I TDG+ + + E + + K+ G + +G + D ++ Sbjct: 329 EGGINR---VILATDGDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFGMGDYNDHLMEQL 385 Query: 337 ASPDR----FYSVQNS------RKLHDAFLRIGKEMVKQR 366 A + N +L L I KE+ Q Sbjct: 386 ADKGNGQYAYIDSLNEARKVLVEQLSATLLTIAKEVKVQV 425 >gi|228472734|ref|ZP_04057492.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] gi|228275785|gb|EEK14551.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] Length = 353 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 37/262 (14%), Positives = 89/262 (33%), Gaps = 60/262 (22%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 ++ F F + + P K+ + G+D++ +DVS SM Sbjct: 53 FKPWVKLVFFALVFVFIVIALANPK---VGTKLETVKREGVDIVFAIDVSKSMLAEDVKP 109 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-- 246 +++ A I ++++++ + R + ++++ PL + + Sbjct: 110 -NRIEKAKHIISQLIEVL-------HGDRVAFIPYAAQAYPQLPLTSDYSSAKIFLEGIN 161 Query: 247 --LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 ++ T ++ A N ++ + K +I L+DGE+ +D Sbjct: 162 TNMLSSQGTAIGEAIQMAINYFEESSQT-----------SKILIILSDGEDHQQGVDT-- 208 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAA---------------------------DQFLKNCA 337 EAK +G ++ IG+ L+ A Sbjct: 209 ---VIQEAKDKGIRLFTIGLGTAQGATIPVSENGQIVAKRDNNGQVVITKLNQALLEEIA 265 Query: 338 --SPDRFYSVQNSRKLHDAFLR 357 ++++ N++++ DA + Sbjct: 266 QEGGGKYFNGANTKEVLDALQK 287 >gi|149559056|ref|XP_001512734.1| PREDICTED: similar to collagen, type XXI, alpha 1, partial [Ornithorhynchus anatinus] Length = 225 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 44/239 (18%), Positives = 86/239 (35%), Gaps = 35/239 (14%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 Y+M + S H SS D++ +LD S S+ + Sbjct: 13 VAYKMKLLQAFL---ILSLHNYTSTGDGEIRSSCRTAPTDLVFILDGSYSVGPENFEIVK 69 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-- 248 K +++I K+ ++ G+V +S V PL ++ I + Sbjct: 70 KW---------LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLG-SHDSLENLIQAMESI 119 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 G T++ +++A + +F AK K + LTDG++ D Sbjct: 120 QYLGGNTRTGKAIQFALDHLF---------AKSQRFLTKIAVVLTDGKSQDEVKD----- 165 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF---YSVQNSRKLHDAFLRIGKEM 362 A+ ++AIGV +E D L+ A+ + + V++ + I +++ Sbjct: 166 -AAEAARDSKITMFAIGVGSETEDAELRAIANKPSYTYVFYVEDYIAISKIREVIKQKL 223 >gi|295688686|ref|YP_003592379.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] gi|295430589|gb|ADG09761.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] Length = 583 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 78/218 (35%), Gaps = 24/218 (11%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 SA + + PW + + + S++ L+++ ++D S SM G Sbjct: 178 TSAQTPFRATVAIAPSPWSSQRQILHIGLQGYAAPRSEA-PPLNLVFLVDTSGSM---MG 233 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK--I 244 P D+L +A +++ ++D ++ R +V ++ G ++ + + Sbjct: 234 P--DRLPLAQKALNVLIDQLRPQD------RVAMVAYAGSAGAVLAPTDGRSKLKMRCAL 285 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 L G +T GLE AY +K +I +TDG+ + D Sbjct: 286 GALQAGGSTAGGRGLELAYGLAKQNFDKKAVNR---------VILITDGDFNVGIADPSR 336 Query: 305 SLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDR 341 + + ++ G + G D ++ A Sbjct: 337 LKDFVADQRKSGVYLSVYGFGRGNYNDTMMQALAQNGN 374 >gi|260828797|ref|XP_002609349.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] gi|229294705|gb|EEN65359.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] Length = 421 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 73/207 (35%), Gaps = 26/207 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + + S D+M VLD S S++ D A I ++D D Sbjct: 18 GPSFVLAASGDAGDIMFVLDGSGSIS------ADDFVSAKSFISRVVDAFDIAAD---FT 68 Query: 217 RSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKL 273 R G+V FSS + FPL +++ I + G T + Y N F + Sbjct: 69 RVGVVQFSSFFTEEFPLDRYSDKASLKQAIGNIPQRGGGTLLGQVINYLVNTSFTEAKGA 128 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF 332 ++ G + + +TDG S + + L + + G I ++IGV Sbjct: 129 RPLSDG---IPRIAVLMTDG---SAHDNPTTVLAPAIDALRASGIIAFSIGVGPSVNRDQ 182 Query: 333 LKNCASP-------DRFYSVQNSRKLH 352 L+ A + + + R L Sbjct: 183 LEAVAGDTDRVFLVGAYSVIDDIRDLL 209 >gi|284166763|ref|YP_003405042.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284016418|gb|ADB62369.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 853 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 41/189 (21%), Positives = 74/189 (39%), Gaps = 22/189 (11%) Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 S+ + PG D + R ++D + D R G+ F+S PL+ ++ Sbjct: 644 GSIGIYPHPGNDPTNQRVEATRNVIDELDPSAD-----RVGVYDFASSGRALHPLSDDLE 698 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 +E + T GLE A N + +G DD ++ +I L+DG+NS+ Sbjct: 699 SAKESVVG-TAYGGTNMAAGLEAALND---------YATRGTDDRERIVILLSDGKNSNT 748 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF----LKNCA--SPDRFYSVQNSRKLH 352 D + + + ++ +G+ A D L+ A + +Y + +L Sbjct: 749 ANDERMD-ELADRSDDLDYTLHTVGLDALEHDSIPEDKLEGWATETGGNYYQTADPDELL 807 Query: 353 DAFLRIGKE 361 D F I E Sbjct: 808 DLFEEIVDE 816 >gi|222623880|gb|EEE58012.1| hypothetical protein OsJ_08791 [Oryza sativa Japonica Group] Length = 759 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 51/243 (20%), Positives = 87/243 (35%), Gaps = 46/243 (18%) Query: 146 ANSSHAPLLITSSVKISSK---------SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 H + +++ SSK + +D++ VLDVS SM KL + Sbjct: 285 TVLVHLKAPLAQTLQTSSKLEDGNSLGTTRAPVDLITVLDVSGSMAG------TKLALLK 338 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGST 252 R++ ++ + S R ++ FSS + FPL G Q + + L Sbjct: 339 RAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVYSLTSNGG 392 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFY 308 T GL I D + K + II L+DG+++ + Y Sbjct: 393 TNIAEGLRKGSKVIEDRQAKNPVCS---------IILLSDGQDTYTVSPTAGVHKAAPEY 443 Query: 309 CNEA--KRRG---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGK 360 C+ G V+ G A+ L + + S F ++ + DAF + IG Sbjct: 444 CSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDAFAQCIGG 503 Query: 361 EMV 363 + Sbjct: 504 LLS 506 >gi|218191772|gb|EEC74199.1| hypothetical protein OsI_09355 [Oryza sativa Indica Group] Length = 723 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 51/243 (20%), Positives = 87/243 (35%), Gaps = 46/243 (18%) Query: 146 ANSSHAPLLITSSVKISSK---------SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 H + +++ SSK + +D++ VLDVS SM KL + Sbjct: 249 TVLVHLKAPLAQTLQTSSKLEDGNSLGTTRAPVDLITVLDVSGSMAG------TKLALLK 302 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGST 252 R++ ++ + S R ++ FSS + FPL G Q + + L Sbjct: 303 RAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVYSLTSNGG 356 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFY 308 T GL I D + K + II L+DG+++ + Y Sbjct: 357 TNIAEGLRKGSKVIEDRQAKNPVCS---------IILLSDGQDTYTVSPTAGVHKAAPEY 407 Query: 309 CNEA--KRRG---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGK 360 C+ G V+ G A+ L + + S F ++ + DAF + IG Sbjct: 408 CSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDAFAQCIGG 467 Query: 361 EMV 363 + Sbjct: 468 LLS 470 >gi|167758708|ref|ZP_02430835.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] gi|167663904|gb|EDS08034.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] Length = 1865 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 48/269 (17%), Positives = 87/269 (32%), Gaps = 54/269 (20%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL----REN 99 K ++ ++D + + E +NGK +K +F E N Sbjct: 444 EKGQVKKVVDMVADDGSKA--SVETTDNGKVKKAEFVTDSFSTFTLAWQEYEPLLTDYAN 501 Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 G + +N + + I KD +T V Sbjct: 502 GSVRTSDNSLGAPEHNKRIKYNEKD--------------------------KDYTLTLDV 535 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK-----LGVATRSIREMLDIIKSIPDVNN 214 G+D+++V+D S SM + D + +++ ++D I +PD ++ Sbjct: 536 TGKRGKKAGVDVLLVIDKSGSMGLNDNGRTDSNYFNLMPTLKKTVPTLVDTI--LPDSDS 593 Query: 215 VVRSGLVTFSSKIVQTFPLAWGV------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 V R ++FSS ++ KI L T + A K+ Sbjct: 594 VNRVAAISFSSDDYTGNDISTDWVDYNGKSGFNRKIEGLGTKGGTNWQLAMRNADKKLKP 653 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 E KK ++FL+DGE + Sbjct: 654 RAESQN---------KKVVVFLSDGEPTY 673 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 46/234 (19%), Positives = 75/234 (32%), Gaps = 44/234 (18%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +++VLD S SM ++ G + + A ++ I ++ + G SS I Sbjct: 1066 AASIVLVLDASASMQEN-GKKLKDIQDAAKAFVNTTKEKSPISEIAVIWYQGSEGSSSTI 1124 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + + INR I + DA E+ I G + KY Sbjct: 1125 TDSGFYTLDTSDNVDAINRFISNKNASGGTPM-------GDALEEANSILSGRPNSSKYA 1177 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRG------AIVYAIGVQAEA------------- 328 + TDG + +N + N A A +Y IG + Sbjct: 1178 LLFTDGMPGYNSSNNSFNCMVANHANNEAKEIKEYAKLYTIGYKLSGSFKWEEGHSQDST 1237 Query: 329 -----------ADQFLKN--CASPD----RFYSVQNSRKLHDAFLRIGKEMVKQ 365 A FLKN +SP+ Y+ N+ L F I ++ Sbjct: 1238 NNHGSHKTETKAADFLKNYLASSPEGDRTYAYTTDNTDGLTKIFEDIAGQIGDL 1291 >gi|15451571|gb|AAK98695.1|AC069158_7 Hypothetical protein protein containing a von Willebrand factor type A domain [Oryza sativa Japonica Group] Length = 714 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 51/243 (20%), Positives = 87/243 (35%), Gaps = 46/243 (18%) Query: 146 ANSSHAPLLITSSVKISSK---------SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 H + +++ SSK + +D++ VLDVS SM KL + Sbjct: 240 TVLVHLKAPLAQTLQTSSKLEDGNSLGTTRAPVDLITVLDVSGSMAG------TKLALLK 293 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGST 252 R++ ++ + S R ++ FSS + FPL G Q + + L Sbjct: 294 RAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVYSLTSNGG 347 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFY 308 T GL I D + K + II L+DG+++ + Y Sbjct: 348 TNIAEGLRKGSKVIEDRQAKNPVCS---------IILLSDGQDTYTVSPTAGVHKAAPEY 398 Query: 309 CNEA--KRRG---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGK 360 C+ G V+ G A+ L + + S F ++ + DAF + IG Sbjct: 399 CSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDAFAQCIGG 458 Query: 361 EMV 363 + Sbjct: 459 LLS 461 >gi|115449371|ref|NP_001048450.1| Os02g0806700 [Oryza sativa Japonica Group] gi|47497349|dbj|BAD19389.1| zinc finger-like [Oryza sativa Japonica Group] gi|113537981|dbj|BAF10364.1| Os02g0806700 [Oryza sativa Japonica Group] gi|215701428|dbj|BAG92852.1| unnamed protein product [Oryza sativa Japonica Group] Length = 723 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 51/243 (20%), Positives = 87/243 (35%), Gaps = 46/243 (18%) Query: 146 ANSSHAPLLITSSVKISSK---------SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 H + +++ SSK + +D++ VLDVS SM KL + Sbjct: 249 TVLVHLKAPLAQTLQTSSKLEDGNSLGTTRAPVDLITVLDVSGSMAG------TKLALLK 302 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGST 252 R++ ++ + S R ++ FSS + FPL G Q + + L Sbjct: 303 RAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQSLQAVYSLTSNGG 356 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----SLFY 308 T GL I D + K + II L+DG+++ + Y Sbjct: 357 TNIAEGLRKGSKVIEDRQAKNPVCS---------IILLSDGQDTYTVSPTAGVHKAAPEY 407 Query: 309 CNEA--KRRG---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGK 360 C+ G V+ G A+ L + + S F ++ + DAF + IG Sbjct: 408 CSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDAFAQCIGG 467 Query: 361 EMV 363 + Sbjct: 468 LLS 470 >gi|45384196|ref|NP_990403.1| matrilin-3 precursor [Gallus gallus] gi|14548115|sp|O42401|MATN3_CHICK RecName: Full=Matrilin-3; Flags: Precursor gi|2326444|emb|CAA03885.1| matrilin-3 [Gallus gallus] Length = 452 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 82/205 (40%), Gaps = 28/205 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ + LD++ ++D S S+ + + +M+D + R ++ Sbjct: 46 TACKNRPLDLVFIIDSSRSVRPE------EFEKVKIFLSKMIDTLDV---GERTTRVAVM 96 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 ++S + FPL + ++E ++R+ + T + ++ A +++F E++ Sbjct: 97 NYASTVKVEFPLRTYFDKASMKEAVSRIQPLSAGTMTGLAIQAAMDEVFT--EEMGTRPA 154 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + K +I +TDG + A+ G +YA+GV A Q L+ AS Sbjct: 155 NFNIP-KVVIIVTDGRPQD------QVENVAANARTAGIEIYAVGVG-RADMQSLRIMAS 206 Query: 339 ---PDRFYSVQN---SRKLHDAFLR 357 + + V+ KL F Sbjct: 207 EPLDEHVFYVETYGVIEKLTSKFRE 231 >gi|75812635|ref|YP_320253.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75705391|gb|ABA25064.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 592 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 75/239 (31%), Gaps = 29/239 (12%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 ++ + I+ + + + K S +++++D S SM+D Sbjct: 10 TNKPLLFGIYGAIGCLIAAILLGEPLLALTKLAPSSQQTPQAIVLLIDASSSMSD----- 64 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 KL + + ++ D +V+F I PL ++ I L Sbjct: 65 -GKLTEVKTAATKFVERRNLTQDK-----LAVVSFGLDIQTATPLTDNADTLESAIASLS 118 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T GL+ A ++ + I+ TDG S + + Sbjct: 119 EAGGTPMAQGLDAAIGEL------------QATFLSRNILLFTDGVPDSQALAS----LS 162 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A+ + + A+ + + A P + NS + AF + KQ + Sbjct: 163 AQSARSQRINLIAVATGDADTNYLAQLTADPSLVFYA-NSGQFDQAFRNAEAAIYKQLV 220 >gi|134093095|gb|ABO52955.1| matrilin 4 isoform 1 precursor [Gorilla gorilla gorilla] Length = 581 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETVRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 344 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 395 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 449 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 450 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 497 >gi|225418703|ref|ZP_03761892.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] gi|225041758|gb|EEG52004.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] Length = 1360 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 95/270 (35%), Gaps = 26/270 (9%) Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL 154 + + +G+ D +++RS +++I D ++A + E P + ++ + Sbjct: 475 QTQFSGYMTDYVSMQRS-AVNISSLDASAFSEITAYVQIETPVDYSIDELKSHITVEDCG 533 Query: 155 I-TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 S + +M++ DVS SM G ++ A I ++ Sbjct: 534 AQISEYNLEKVEYSSANMLLCCDVSGSMQ---GRPIEDSRAAV---------ISMAESMS 581 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R G++ F+S + I+ + T + + Sbjct: 582 GNARLGVILFNSSVQGLTDFTVQPDVIRSTAESMTANGGTNIFDTVVHGLESF------- 634 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 K + ++ ++DG+ ++ + + AK + +V+ +G+ +E +L Sbjct: 635 ---PKNGPEVLNTLVVMSDGQENNAHSAEEIQTAIGQAAKDKSILVHCLGLGSEVDANYL 691 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGKE 361 + A + + V +S L + + + Sbjct: 692 QTIAQSAGGTYQYVTDSSSLAVFYQNLASQ 721 >gi|319780897|ref|YP_004140373.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166785|gb|ADV10323.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 492 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 53/170 (31%), Gaps = 36/170 (21%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---KGHDDYK--- 284 PL + E I T +++ Y + A KG D Sbjct: 318 IPLTADSDALLESIEDFRANGFTAGAIAIQWTYYMLSPQWRTAIRNAGLGKGASDADPKK 377 Query: 285 --KYIIFLTDGE------------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K I +TDG+ N + + C+ K G ++ IG + D Sbjct: 378 IAKVAILMTDGQFNTAFAGAGDSYNRQGTLARGNAETLCDNMKNDGIEIFTIGFDLDDKD 437 Query: 331 ----------QFLKNCASPD------RFYSVQNSRKLHDAFLRIGKEMVK 364 LK+C+S D F+ V +L DAF I + K Sbjct: 438 MSTTERDQAKAVLKDCSSKDTSGAKRHFFDVSTGAELDDAFQEIIRNTEK 487 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 82/221 (37%), Gaps = 27/221 (12%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + G+ +IL + V+ + +G + S + ++ L ++D ++ TA + Sbjct: 14 FARDRGGNFAILFGLSASVLALAVGFSVNVSQLYNARSSLQGVVDAAVTSTARDLTTG-- 71 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 K + + ++ + + + + I T+ ++ D Sbjct: 72 -----AIKEADANKSVQAFLDANSQAGILQADQIVLDRLIVNRTAKTVQAD--------- 117 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + ++ F F + + S+ + S + + MM LD++ SM Sbjct: 118 --AHVDVGLYFPIF----GTGDMKRVAASTTALYSDKTVEVAMM--LDITGSMAKR--GK 167 Query: 189 MDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIV 228 +DK+G + + + ++ N +R +V ++S + Sbjct: 168 VDKIGDLKTAAKNAVQTMLQKQDPQNPRIRVAIVPYASGVN 208 >gi|226315301|ref|YP_002775197.1| hypothetical protein BBR47_57160 [Brevibacillus brevis NBRC 100599] gi|226098251|dbj|BAH46693.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 597 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 51/223 (22%), Positives = 86/223 (38%), Gaps = 35/223 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S +D ++V+DVS SM DK V+ +++ +D+ + G+V ++ Sbjct: 35 SGNNMDAVLVVDVSNSMTQS-----DKNKVSNEAMKMFVDMTSIQANK-----VGVVAYT 84 Query: 225 SKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 KI + L I+ I+ L G+ T G+ A KI DA + Sbjct: 85 DKIEREKALLEINSEEDKNDIKAFIDSLQKGAYTDIAVGVTEAV-KILDAGRNPNNAP-- 141 Query: 280 HDDYKKYIIFLTDGENSSPNIDNK-------ESLFYCNEAKRRGAIVYAIGVQAEA--AD 330 I+ L DG N ++ E EAK +G VY IG+ A+ Sbjct: 142 ------IIVLLADGNNFLNKASSRTQAKSDQELQQAVKEAKDKGYPVYTIGLNADGQLNR 195 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 L+ A + +F+ + KL I +K +++ K Sbjct: 196 TTLQQIAAETNGKFFETSTADKLPQILSEIFANHLKLKVVPVK 238 >gi|292655414|ref|YP_003535311.1| von Willebrand factor type A domain-containing protein [Haloferax volcanii DS2] gi|291372503|gb|ADE04730.1| von Willebrand factor type A domain protein [Haloferax volcanii DS2] Length = 818 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 72/209 (34%), Gaps = 37/209 (17%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 L V + +++ +DVS S +S+ LD + + D Sbjct: 387 SLASMLPVTTGEGASQQTNLVFAIDVSGSAESGM--------RVQKSV--ALDALDQLGD 436 Query: 212 VNNVVRSGLVTFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 N R G+V F+ + PL + + I RL G T GL+ A ++ D Sbjct: 437 EN---RVGIVGFNYRAYDVAPLRPLGPNRESAADLIRRLESGGATDIAVGLDGAAQQLGD 493 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + II ++DG + +++ ++ R G V IG Sbjct: 494 RRGT--------------IILISDGHDRF-----QDAATLADQLGRDGVSVITIGTGPNP 534 Query: 329 ADQFLKNC--ASPDRFYSVQNSRKLHDAF 355 ++ L+ AS + + +L F Sbjct: 535 NERTLRAIARASGGNYLRADETDRLRILF 563 >gi|156523168|ref|NP_001095998.1| matrilin-2 [Bos taurus] gi|146186873|gb|AAI40517.1| MATN2 protein [Bos taurus] gi|296480475|gb|DAA22590.1| matrilin 2 [Bos taurus] Length = 958 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 79/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ + D++ ++D S S+N H + I ++L + PDV R GL+ Sbjct: 51 SACENKRADVVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 101 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 102 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 160 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 161 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDTGILIFAIGVGQVDFNTLKAIGSE 212 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 213 PHEDHVFLVANFSQIETLTSVFQK 236 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 77/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + D + + + ++D + P R GL+ +S Sbjct: 652 TEGPVDLVFVIDGSKSLGE------DNFEIVKQFVTGIIDSLAISP---KAARVGLLQYS 702 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + F L ++ + + G + + L++ + + F E + Sbjct: 703 TLVRTEFTLRNFSSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQVEGARPL--- 757 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++A+ G +YA+GV ++ + + P Sbjct: 758 SARVPRVAIVFTDGRAQD------DVSEWASKAQASGITMYAVGVGKAIEEELQEIASEP 811 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I ++ K Sbjct: 812 TEKHLFYAED----FSTMGEISDKLQK 834 >gi|61557272|ref|NP_001013220.1| chloride channel calcium activated 2 [Rattus norvegicus] gi|38175219|dbj|BAD01114.1| Ca(2+)-activated chloride channel [Rattus norvegicus] Length = 903 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 45/204 (22%), Positives = 71/204 (34%), Gaps = 36/204 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 ICLVLDKSGSMDTE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSTAQIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + I + T GLE + I + + Sbjct: 359 NYLIKITNTGDYKKITGNL-PQQAVGGTSICRGLEAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE+ + + C K GA+++ I + +AA + L + RF Sbjct: 410 -IVLLTDGED--------DLISSCFEVVKHSGAVIHTIALGPKAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 Y+ ++ L DAF I Sbjct: 461 YANKDVNSLMDAFSGISSASGNLS 484 >gi|73974730|ref|XP_539177.2| PREDICTED: similar to collagen, type XXII, alpha 1 [Canis familiaris] Length = 1628 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 76/205 (37%), Gaps = 30/205 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 46 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 96 Query: 230 TFPLA-WGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + + R + G T + L + F + G +K+ Sbjct: 97 AFELGLFGSREAVKAAARHLAYHGGNTNTGDALRFITRHSFS---RQAGGRPGDRAFKQV 153 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I LTDG + +D A R G ++A+GV A ++ + + P + Sbjct: 154 AILLTDGRSQDLVLD------AAATAHRAGIRIFAVGVGAALREELEEIASEPKSAHVFH 207 Query: 345 VQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 208 VSD-------FDAIDKIRGKLRRRL 225 >gi|150019021|ref|YP_001311275.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] gi|149905486|gb|ABR36319.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] Length = 962 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 31/140 (22%), Positives = 59/140 (42%), Gaps = 21/140 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM D++ KL ++ + + + ++ ++ +V F ++ Sbjct: 82 IVLVLDSSGSMADNY-----KLTNLKKAATDFITKMSTVKNLK----IAIVDFDTQATII 132 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L V ++ IN L G T + GL A + ++ E+ +K Sbjct: 133 NKLTDVSSSTNVTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSEQNPLASKN------ 186 Query: 286 YIIFLTDGENSSPNIDNKES 305 IIF++DGE + N S Sbjct: 187 -IIFMSDGEPTYYNWQTANS 205 >gi|301059316|ref|ZP_07200243.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300446545|gb|EFK10383.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 527 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 72/212 (33%), Gaps = 27/212 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + + + S +++ +VLD S SM+ KL A ++ E L + Sbjct: 133 VKVTLDAPAPPSRMERPPVNIAIVLDRSGSMSGQ------KLEKAKQAAIEALRRLGQKD 186 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEY-AYNKIF 267 + ++ + + P A V+ I+ +I + G T G+ A Sbjct: 187 MFS------VIVYDHNVKTIVPAQSARNVEWIESRIRGIGPGGNTALFGGVSQGASEVRK 240 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + K H II L+DG + ++ + V IGV + Sbjct: 241 NLSNKYVHR----------IILLSDGLANVGPSSPEDLGRLGAALIKESISVTTIGVGTD 290 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + + + S Y V++SR L F Sbjct: 291 YNEDLMARLSQNSDGNTYFVESSRDLPKIFAA 322 >gi|296474801|gb|DAA16916.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] Length = 916 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 76/215 (35%), Gaps = 27/215 (12%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S S I +++ V+D S SM K+ ++ ++LD + + L Sbjct: 266 PDSLSTIPKNVIFVIDKSGSMMG------RKIKQTREALIKILDDLSPHDQFD------L 313 Query: 221 VTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++FSS+ PL V + + T + A + A + E Sbjct: 314 ISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQGGTNINDAMLMAVQLLEKANQ-EEL 372 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + +G II LTDG+ + + +A ++ +G + + FL+ Sbjct: 373 LPEGSITL---IILLTDGDPTVGETNPLNIQKNVRKAINGQHSLFCLGFGFDVSYAFLEK 429 Query: 336 CASPDR------FYSVQNSRKLHDAFLRIGKEMVK 364 A + + ++ +L D + + ++ Sbjct: 430 MALENGGLARRIYEDSDSALQLQDFYQEVANPLMT 464 >gi|296200542|ref|XP_002747689.1| PREDICTED: matrilin-4 [Callithrix jacchus] Length = 770 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 40/197 (20%), Positives = 77/197 (39%), Gaps = 26/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ V+D S S+ + + + ++ + P N R G++ +S Sbjct: 28 HTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLVRGLNVGP---NATRVGVIQYS 78 Query: 225 SKIVQTFPL-AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ + + T + ++YA N F E + Sbjct: 79 SQVQSVFPLRAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEE 135 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 136 RVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 188 Query: 340 -DRFYSVQNSRKLHDAF 355 + + V++ L F Sbjct: 189 EEHVFLVESF-DLIQEF 204 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 384 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 434 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 435 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 489 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 490 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 537 >gi|163848654|ref|YP_001636698.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526590|ref|YP_002571061.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669943|gb|ABY36309.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450469|gb|ACM54735.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 947 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 80/217 (36%), Gaps = 29/217 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-----IIKSIPDVNNVV 216 + + L ++ V+D S SM+ D+ T S +D I+++ + Sbjct: 403 NRELRPDLAIVFVIDKSGSMDACHCADPDRGAPITSSSERKIDIAKDAIVQAAALLGPQD 462 Query: 217 RSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G+VTF TFP G V+ + + ++ + T GL A + +++ Sbjct: 463 TVGVVTFDGAASATFPATRGATVEQVMDAVSGVEPRGPTNIRAGLLRAEEMLQQVDARIK 522 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H +I LTDG S + L + +G + + + +A + Sbjct: 523 H-----------MILLTDGWGSGG-----DQLDLAARLREQGITLTVVAAGSGSAAYLKQ 566 Query: 335 NCA-SPDRFYSVQNSRKLHDAF-----LRIGKEMVKQ 365 A R+Y + ++ F IG +V+Q Sbjct: 567 LAAEGGGRYYPAADMAEVPQIFVQETITAIGNYIVEQ 603 >gi|149176866|ref|ZP_01855476.1| BatB [Planctomyces maris DSM 8797] gi|148844303|gb|EDL58656.1| BatB [Planctomyces maris DSM 8797] Length = 798 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 65/161 (40%), Gaps = 21/161 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G+++M +LDVS SM +L A + I++M+D + R GLV F+ Sbjct: 88 QKGIEVMFLLDVSRSMLAEDVSP-SRLDRAKQQIKDMVDEMSG-------DRVGLVVFAG 139 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + Q+ PL + ++ ++ + S + + DA + K Sbjct: 140 ETRQSVPLTSHYEDFKQSLDAVGPHSVRRGGS-------LLGDAIRSATAGFIDKTNDHK 192 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQ 325 I+ TDGE+ + + EA + G ++ +G+ Sbjct: 193 AIVVFTDGEDQE-----SKPVEAAKEAFTKNGIRIFTVGLG 228 >gi|134093121|gb|ABO52981.1| matrilin 4 isoform 1 precursor [Callithrix jacchus] Length = 580 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 40/197 (20%), Positives = 77/197 (39%), Gaps = 26/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ V+D S S+ + + + ++ + P N R G++ +S Sbjct: 28 HTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLVRGLNVGP---NATRVGVIQYS 78 Query: 225 SKIVQTFPL-AWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ + + T + ++YA N F E + Sbjct: 79 SQVQSVFPLRAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEE 135 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 136 RVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 188 Query: 340 -DRFYSVQNSRKLHDAF 355 + + V++ L F Sbjct: 189 EEHVFLVESF-DLIQEF 204 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 63/162 (38%), Gaps = 25/162 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 343 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 393 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 394 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 448 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + TDG + + + AK G ++YA+GV Sbjct: 449 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVG 484 >gi|75832116|ref|NP_001015590.2| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] gi|122140331|sp|Q3T052|ITIH4_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; Flags: Precursor gi|74267794|gb|AAI02562.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 76/215 (35%), Gaps = 27/215 (12%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S S I +++ V+D S SM K+ ++ ++LD + + L Sbjct: 266 PDSLSTIPKNVIFVIDKSGSMMG------RKIKQTREALIKILDDLSPHDQFD------L 313 Query: 221 VTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++FSS+ PL V + + T + A + A + E Sbjct: 314 ISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQGGTNINDAMLMAVQLLEKANQ-EEL 372 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + +G II LTDG+ + + +A ++ +G + + FL+ Sbjct: 373 LPEGSITL---IILLTDGDPTVGETNPLNIQKNVRKAINGQHSLFCLGFGFDVSYAFLEK 429 Query: 336 CASPDR------FYSVQNSRKLHDAFLRIGKEMVK 364 A + + ++ +L D + + ++ Sbjct: 430 MALENGGLARRIYEDSDSALQLQDFYQEVANPLMT 464 >gi|310817054|ref|YP_003965018.1| hypothetical protein EIO_2641 [Ketogulonicigenium vulgare Y25] gi|308755789|gb|ADO43718.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 733 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 45/234 (19%), Positives = 85/234 (36%), Gaps = 28/234 (11%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSS---HAPLLITSSVKISSKSDIG- 168 + ++ I+ + V+ Y + + P PL S + + Sbjct: 206 TPALEIEPYAQRMLQRMVNEYGLGLLIMGGPQSFGPGGYFETPLEELSPLSARVPREAPE 265 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + M+ VLD S SM G ++LGVA + L+++ G++ F ++ Sbjct: 266 VTMVFVLDRSGSMQQAVGDS-NRLGVAKNATLSALELLNPQSQ------IGVIVFDTEET 318 Query: 229 QTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL+ + Q ++R+ G T PGL AY ++ + + K+I Sbjct: 319 TVVPLSTLDIPAAQIALDRVDTGGGTAIYPGLVAAYREL-----------QRSESPAKHI 367 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I +TDG + + + + G V A+ + A +N A Sbjct: 368 IVMTDGLSQPGDWEGILRQITAD-----GTTVSAVAIGVGADTGAAENIARLGN 416 >gi|239926966|ref|ZP_04683919.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291435315|ref|ZP_06574705.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291338210|gb|EFE65166.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] Length = 527 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 82/250 (32%), Gaps = 33/250 (13%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSH----APLLITSSVKISSKSDI---GLDMMM 173 + +++ S YE P + L+ S+ + Sbjct: 112 RPEEFVNSFRQEYERPGGDGFTVTVDGARTDEDGWSLVRVGLATRSAAPGGERPPAALTF 171 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V+D S SM + +L +A +S+ M D + + LVTFS + P+ Sbjct: 172 VVDTSGSMAEP-----GRLDLARKSLAAMTDRL------RDDDSVALVTFSDEAETVLPM 220 Query: 234 AW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + ++RL +T G+E Y + + ++ + Sbjct: 221 TRLGGNRDEVHDAVDRLEPDRSTNLGAGVEAGYETAVEGLREGATNR---------VVLV 271 Query: 291 TDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQN 347 +D ++ D L + ++R G ++ +GV ++ D ++ A V Sbjct: 272 SDALANTGETDADAILERISRSRRAHGITLFGVGVGSDYGDALMERLADRGDGHTVYVSG 331 Query: 348 SRKLHDAFLR 357 + F Sbjct: 332 EEDAREVFGE 341 >gi|153954292|ref|YP_001395057.1| hypothetical protein CKL_1667 [Clostridium kluyveri DSM 555] gi|219854893|ref|YP_002472015.1| hypothetical protein CKR_1550 [Clostridium kluyveri NBRC 12016] gi|146347173|gb|EDK33709.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555] gi|219568617|dbj|BAH06601.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 580 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 46/225 (20%), Positives = 85/225 (37%), Gaps = 37/225 (16%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S ++++S +D++ VLD S SM + + + +I+ LD+ + + Sbjct: 25 SGVLAAQSGTSMDVIFVLDSSGSMAESDPEKIRE-----EAIKMFLDMGQIQGNKA---- 75 Query: 218 SGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 GLV +S IV+ L + I+ + + G T + GL A + Sbjct: 76 -GLVAYSDSIVREHNLDSINSSEDKDRIKSMASDISLGQRTDTGRGLLEAVKLMESGH-- 132 Query: 273 LEHIAKGHDDYKKYIIFLTDGEN------SSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 K ++ II L+DG+N D K +L C K +G VY IG+ Sbjct: 133 -----KSGNNP--VIILLSDGKNDPERSQDESLNDLKNALQIC---KSKGYPVYTIGLNY 182 Query: 327 EAADQFLK----NCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + ++ + Y + L I + K ++ Sbjct: 183 NGTVDKTQLGDISSSTGGKDYITNTASDLPAILTDIYADNSKLKV 227 >gi|6465945|gb|AAF12731.1|AF108501_1 Ca(2+)-sensitive chloride channel 2 [Mus musculus] Length = 902 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQATGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 410 -IVLLTDGEDN--------GISSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 461 YANKHVSSLIDAFSRIS 477 >gi|13447394|ref|NP_085104.1| chloride channel calcium activated 2 [Mus musculus] gi|12043705|gb|AAG47626.1|AF115852_1 endothelial chloride channel [Mus musculus] gi|14198178|gb|AAH08147.1| Chloride channel calcium activated 2 [Mus musculus] gi|74208910|dbj|BAE21205.1| unnamed protein product [Mus musculus] gi|148680073|gb|EDL12020.1| mCG120735 [Mus musculus] Length = 902 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 49/197 (24%), Positives = 74/197 (37%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQATGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ + C EA R GAI++ I + AA + L + RF Sbjct: 410 -IVLLTDGEDN--------GISSCFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRF 460 Query: 343 YSVQNSRKLHDAFLRIG 359 Y+ ++ L DAF RI Sbjct: 461 YANKHVSSLIDAFSRIS 477 >gi|78484443|ref|YP_390368.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362729|gb|ABB40694.1| Hypothetical protein; predicted integral membrane protein with a von Willebrand factor type A domain [Thiomicrospira crunogena XCL-2] Length = 363 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 73/222 (32%), Gaps = 29/222 (13%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + ++ D++ V++ S+SM + + I+ +LD S N Sbjct: 104 VPLPPEPQTKTVRDIVFVVETSVSMVLEDYQIDGEPQSRIKVIQTVLDQFISGLAGN--- 160 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G + ++ PL + + RL ++ A EK Sbjct: 161 RFGFILYADDAYTLMPLTSDATTARLMLKRLKPYLAGRTDEATGEALGLALQQAEKS--- 217 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ----- 331 + ++ ++DG + E++ Y A+ +Y IGV A + D Sbjct: 218 --TDSTENRIVVLISDGSTRDSRLPIAEAINY---AQGLNIPIYTIGVGANSKDADKREF 272 Query: 332 -----------FLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 LK A + R+Y + + + L I + Sbjct: 273 RGLLYEALESSSLKQIADQTQGRYYQIGSGQDLQKVLQAIDQ 314 >gi|327282764|ref|XP_003226112.1| PREDICTED: collagen alpha-1(XXI) chain-like [Anolis carolinensis] Length = 956 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 84/218 (38%), Gaps = 34/218 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + R + + P ++ G+V Sbjct: 29 SSCRTAPTDLVFILDGSWSVGPE------NFEIVKRWLVNITSNFNIGP---KFIQVGVV 79 Query: 222 TFSSKIVQTFPLAW---GVQHIQ--EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +S V PL + ++ E I L G T++ +++A + +F Sbjct: 80 QYSDYPVLEIPLGFHDSNENLVRGMEYIQYL--GGNTQTGKAIQFALDHLF--------- 128 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 AK K + LTDG++ + EA++ ++AIGV +E + L+ Sbjct: 129 AKSSRFLTKIAVVLTDGKSQD------DVKEVAAEARKNRITLFAIGVGSETEEDELRAI 182 Query: 337 A---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 A S + V++ + I +++ ++ + + Sbjct: 183 ANKPSSTYVFYVEDYIAISRIREVIKQKLCEESVCPTR 220 >gi|47205231|emb|CAG06181.1| unnamed protein product [Tetraodon nigroviridis] Length = 427 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 45/215 (20%), Positives = 79/215 (36%), Gaps = 28/215 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P S SS + +D++ ++D S S+ + A +++++D ++ D Sbjct: 36 PSSTVSPASGSSCRNGPIDLVFIVDSSRSVRP------TEFEKAKEFLQDLVDSLEVGLD 89 Query: 212 VNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFD 268 R GLV ++S + FPL + ++ + R+ S T + + A K F Sbjct: 90 S---TRVGLVNYASTVRMEFPLKAHFSKPALKGALARVEPLASGTMTGLAIRTAVEKAFA 146 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 A + + +TDG E A+ G +YA+GV A Sbjct: 147 A---EAGARLNSTKVARVAVVVTDGRPQD------EVERVSAAARESGIEIYAVGVD-RA 196 Query: 329 ADQFLKNCAS---PDRFYSVQN---SRKLHDAFLR 357 L+ AS D + V+ KL F Sbjct: 197 DRTSLRLMASQPHEDHVFYVETYGVIEKLTSRFRE 231 >gi|260914303|ref|ZP_05920772.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] gi|260631404|gb|EEX49586.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] Length = 584 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 44/271 (16%), Positives = 95/271 (35%), Gaps = 44/271 (16%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL----- 57 F +++F+ KG +++TA+L + +++ ++ + K +L D + L Sbjct: 21 FQKLKDFYQEEKGVYAVMTALLSFPLLVLIAFTVDGTGIILDKVRLAQATDQAALLLVAE 80 Query: 58 ----------------YTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + + L++ +G+ QK+ + +I+ + + R+E + Sbjct: 81 NNAYRKNPMHDDVTKQSVSKEELSKFSGDKLSAQKDKRNQELIQGLAKMYLRSENKAQKD 140 Query: 102 AQDINNIERSTSLSII----IDDQHKDYNLSAVSRYE---------MPFIFCTFPWCANS 148 I++ I+ +++ V+ Y +P + Sbjct: 141 NHLPVTIDQPFDYKCEELDLINPKNQYSRRKPVTCYVQGSVNREFWIPLSADLVKTHTKN 200 Query: 149 SHAPLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHF---------GPGMDKLGVATRS 198 P+ S + K+ I +D+M+V D S SM +D L Sbjct: 201 GRLPINSGISYAVKEKAIVIPVDLMLVSDFSGSMLWDLKNNENAQYPNRKIDILRSVVSD 260 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 I+ +L K D + R G F+ Q Sbjct: 261 IQNILFPTKLSEDASPYNRMGFAAFAGGTRQ 291 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 73/207 (35%), Gaps = 31/207 (14%) Query: 177 VSLSMNDHFGPGMD---KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +S SM F +D + ++ K + + G + + + Sbjct: 367 LSTSMKLIFEDVLDVDKTIKQVENFDGNRVNDYKLTYNNPDHCLGG----NEGVETSQ-- 420 Query: 234 AWGVQ---HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE-KLEHIAKGHDDYKKYIIF 289 AW + + E ++++ +T ++ G N + D AK + ++ ++ Sbjct: 421 AWFTKSKPKVAEALSKIKPTGSTAASSGFIIGANLLMDKNTVPEAQPAKLGTNTQRILMV 480 Query: 290 LTDGENSSPNIDNKESLF---YCNEAKRR--------------GAIVYAIGVQAEAADQF 332 L+DGE++ P D +L C+ +++ A G Q + Sbjct: 481 LSDGEDNRPTFDTLTTLLNAGLCDNIRKKADSLQDPKFNTLPTKIAFAAFGFQPPPEQKA 540 Query: 333 L-KNCASPDRFYSVQNSRKLHDAFLRI 358 + C + +Y + L DAF +I Sbjct: 541 AWQKCVGENNYYEPSSKEALLDAFKQI 567 >gi|309791847|ref|ZP_07686333.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226108|gb|EFO79850.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 542 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 41/195 (21%), Positives = 79/195 (40%), Gaps = 26/195 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---K 226 D+++V+D+S SM DKL + + L ++ +PD R G++TFSS + Sbjct: 368 DIILVVDISGSMEG------DKLEMTRAGLESFL--MRILPD----DRVGMITFSSSATE 415 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 +V L+ +Q I+ + T +E A + E + +D K Sbjct: 416 VVAPAALSENRMQLQMAISEMSATGKTAVFDAVELARQSL-------EALPSTGEDRMKA 468 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSV 345 I+ L+DG +++ I + +E G ++ + A+A L A V Sbjct: 469 IVLLSDGADNASRITLADLERNFDET---GVSIFPVAYGADADRSILDAIAEFSRTIVVV 525 Query: 346 QNSRKLHDAFLRIGK 360 ++ + F + + Sbjct: 526 GDTGDIAQIFENLSR 540 >gi|167647386|ref|YP_001685049.1| von Willebrand factor type A [Caulobacter sp. K31] gi|167349816|gb|ABZ72551.1| von Willebrand factor type A [Caulobacter sp. K31] Length = 592 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 74/204 (36%), Gaps = 28/204 (13%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + + L+++ ++D S SM G D+L +A +++ + Sbjct: 205 PWSQDRQLMHIGVQGYAT-PRAGQPPLNLVFLIDTSGSM-----SGPDRLPLAKKALNVL 258 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK--INRLIFGSTTKSTPGLE 260 +D ++ R +V ++ G ++ + + L G +T GLE Sbjct: 259 IDQLRPQD------RVSMVAYAGSAGAVLSPTDGKSKLKMRCALTALRSGGSTAGGQGLE 312 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 AY +A+ + D K +I +TDG+ + D + + ++ G Sbjct: 313 LAYA-----------LARQNLDPKAVNRVILMTDGDFNVGIADPTRLKDFVADQRKSGVY 361 Query: 319 VYAIGVQ-AEAADQFLKNCASPDR 341 + G D ++ A Sbjct: 362 LSVYGFGRGNYNDTMMQALAQNGN 385 >gi|291087243|ref|ZP_06571866.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291076088|gb|EFE13452.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 2012 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 87/221 (39%), Gaps = 21/221 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGM-----DKLGVATRSIREMLDII-KSIPDVNN 214 + + +D++ V+D SLSM+ D+ + + L+ I + Sbjct: 1467 ATVTTKYPVDLVFVIDKSLSMDYDIDGNEIKWWDDETESRKDIVNDALEEIIPDLCSQQY 1526 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ FS + + Q + + S+T+ + L A + + Sbjct: 1527 DIQIAGYQFSGSSTRVLDWSREEQQVLSGLKIARTSSSTEPSQALADALDMLKTGSPAH- 1585 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL- 333 + + KKY+IF+TDGE + P + ++ N A GA +Y IGV ++A+ + Sbjct: 1586 ---RNQSNVKKYLIFMTDGEPTEPEDWSYNAVR--NHA-VPGASIYTIGVSSDASTNLME 1639 Query: 334 ---KNCASPDRF----YSVQNSRKLHDAFLRIGKEMVKQRI 367 S + + +++ + DAF +I E++ Sbjct: 1640 GIRSTALSNGMYAPATFKGTSAQLIRDAFTQIKDEIISTST 1680 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 53/225 (23%), Positives = 91/225 (40%), Gaps = 20/225 (8%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM-DKLGVATRSIREMLDIIKSIPDVNN 214 SV K +M V+D S SM+ FG G D S E+ + D + Sbjct: 1042 VGSVTTGQKDPTPTAVMFVIDKSGSMDQSFGSGNSDARREVVNSALELF--FNQLSDGDY 1099 Query: 215 VVRSGLVTFSS--KIVQTFPLAWGVQHIQEKI-NRL-IFGSTTKSTPGLEYAYNKIFDAK 270 ++ G FS + V W ++ + N L T++ T G Y + A Sbjct: 1100 NIQFGGYKFSDSGERVNFNDQGWETEYWETDTSNALSHLKLTSRETDGSTYPSQTLRSAI 1159 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNID-NKESLFYCNEAKRR---GAIVYAIGVQA 326 LE++ G ++ K+Y+IFLTDGE + +++ C A + G YAI V Sbjct: 1160 SALENVELG-ENGKRYLIFLTDGEPGQNSYSFSEKEAENCYSAIKNLDSGTTFYAIQVAN 1218 Query: 327 EAADQFLKNCASPDRF--------YSVQNSRKLHDAFLRIGKEMV 363 + F+++ S + ++ +L+ AF ++ E+ Sbjct: 1219 SDSHGFMESMVSNANSVDGVTAQKFVGNSADELNAAFSQMAAEIS 1263 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 62/169 (36%), Gaps = 36/169 (21%) Query: 161 ISSKSDIGLDMMMVLDVSLSMND--HFGPGMDKLGVATRSIREMLDI------------- 205 I+ GLD+++V+D+S SM+D +L V ++ Sbjct: 586 ITGGDQQGLDIVLVIDLSNSMDDGISEDSSDSRLKVLKDTLGYYRTSYGSHGRPSTEGKE 645 Query: 206 --IKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--------GVQHIQEKINRLIFGSTTKS 255 I + + + R +VT+S+ + L W G Q I++ I L T Sbjct: 646 GFIDELFEQSPNSRFSIVTYSTDA--STELGWTEYGRNGSGQQTIKKAIGDLQANGGTNY 703 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 GL A + +G+ +IFL+DG+ + D E Sbjct: 704 EAGLYQAVEVL---------KERGNSSNIPVVIFLSDGKPTYYYSDVDE 743 >gi|17231851|ref|NP_488399.1| hypothetical protein alr4359 [Nostoc sp. PCC 7120] gi|17133495|dbj|BAB76058.1| alr4359 [Nostoc sp. PCC 7120] Length = 418 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 72/207 (34%), Gaps = 28/207 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + L++ ++LD S SMN L + ++ ++D +K+ R +V Sbjct: 35 PQDRTVPLNLCLILDHSGSMNG------RPLEIVKQAAIRLVDRLKTGD------RLSVV 82 Query: 222 TFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F + P + + I+++I+RL T GL ++ K++ A Sbjct: 83 AFDHRAKVLVPNQVIDNPEQIKKQISRLAADGGTAIDEGLRLGIEELAKGKKETISQA-- 140 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 LTDGEN DN L + A + +G L+ A Sbjct: 141 --------FLLTDGENEHG--DNSRCLKFAQLAAGYSLTLNTLGFGDNWNQDILEKIADA 190 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 +Q + D F R+ + Sbjct: 191 GLGSLSYIQKPEQAVDEFGRLFSRIQT 217 >gi|170076505|ref|YP_001733144.1| hypothetical protein SYNPCC7002_G0035 [Synechococcus sp. PCC 7002] gi|169887367|gb|ACB01075.1| conserved hypothetical protein (von Willebrand factor type A domain) [Synechococcus sp. PCC 7002] Length = 420 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 84/235 (35%), Gaps = 32/235 (13%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISS-------KSDIGLDMMMVLDVSLSMNDHFG 186 +MP I A + + + +KI + L++ VLD S SM+ + Sbjct: 2 KMPMISLIPMHGAIAQGRSVTLDVLIKIEPPLVELDNNARPPLNLGFVLDKSGSMHGN-- 59 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKI 244 KL A ++I ++ + +P R L F +++ P LA Q + E I Sbjct: 60 ----KLDYAKQAIAYAIEQL--LPS----DRLSLTLFDTQVETKIPSTLATDKQRLLETI 109 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 + GS+T L + + + + + +I L+DG + + Sbjct: 110 KLIRSGSST----ALHDGWVQGGIQVGQYLNNDHLNR-----VILLSDGLANVGETNPDV 160 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + + G A+GV + + L+ A F+ + + L F Sbjct: 161 IASDVHGLMKTGISTSALGVGRDYDEDLLEAIARSGDGNFFHIASPEDLPQIFET 215 >gi|193786651|dbj|BAG51974.1| unnamed protein product [Homo sapiens] Length = 581 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 344 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 395 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 449 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 450 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 497 >gi|13699836|ref|NP_085095.1| matrilin-4 isoform 3 precursor [Homo sapiens] Length = 499 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 262 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 312 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 313 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 367 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 368 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 415 >gi|119596273|gb|EAW75867.1| matrilin 4, isoform CRA_b [Homo sapiens] Length = 620 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 383 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 433 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 434 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 488 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 489 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 536 >gi|119512060|ref|ZP_01631154.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119463286|gb|EAW44229.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 418 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 70/197 (35%), Gaps = 27/197 (13%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 ++ + I++ +I +I L++ ++LD S SM+ L ++ ++D Sbjct: 20 SSQRQLAVSISAIAEIQ-DRNIPLNLCLILDHSGSMHGL------PLETVKQAAIGLVDK 72 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 +K R +V F + P I+++IN L T GL Sbjct: 73 LKPGD------RLSVVAFDHRATVLVPNQTITNPGQIKKQINSLTADGGTAIDEGLRLGI 126 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ K++ A LTDGEN DN+ L + A + +G Sbjct: 127 EELAKGKKETVSQA----------FLLTDGENEHG--DNQRCLKFAQLATGYNLTLNTLG 174 Query: 324 VQAEAADQFLKNCASPD 340 + L+ A Sbjct: 175 FGDKWNQDVLEKIADAG 191 >gi|13699830|ref|NP_003824.2| matrilin-4 isoform 1 precursor [Homo sapiens] gi|4499937|emb|CAB39280.1| matrilin 4 [Homo sapiens] Length = 581 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 344 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 395 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 449 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 450 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 497 >gi|13699834|ref|NP_085080.1| matrilin-4 isoform 2 precursor [Homo sapiens] gi|119596274|gb|EAW75868.1| matrilin 4, isoform CRA_c [Homo sapiens] Length = 540 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 303 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 353 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 354 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 408 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 409 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 456 >gi|73920229|sp|O95460|MATN4_HUMAN RecName: Full=Matrilin-4; Flags: Precursor gi|5419632|emb|CAB46380.1| matrilin 4 [Homo sapiens] Length = 622 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 385 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 435 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 436 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 490 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 491 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 538 >gi|326673138|ref|XP_001334803.4| PREDICTED: collagen alpha-4(VI) chain-like [Danio rerio] Length = 1356 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/199 (21%), Positives = 82/199 (41%), Gaps = 24/199 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+D + + +++ + PD VR GLV +S Sbjct: 32 DIVFLVDGSAS------IGLDNFQQIRQFLSSLVENFEVAPDK---VRIGLVQYSDTPRT 82 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + I + I L + T + GLE+ I E ++ + + Sbjct: 83 EFSLNTYQNKEEILDYIRNLRYKTGGTHTGQGLEF----ILKQHFIEEAGSRAQQNVPQI 138 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 I +TDG++ E E ++RG ++AIG++ +A + L+ A+ V Sbjct: 139 AIVITDGDSQD------EVDLQAQELRQRGIKIFAIGIK-DADVRLLRQIANEPYDQYVY 191 Query: 347 NSRKLHDAFLRIGKEMVKQ 365 + A I + +V++ Sbjct: 192 SVSD-FAALQGISQSVVRE 209 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/188 (20%), Positives = 78/188 (41%), Gaps = 31/188 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D S S+ D+ + + +D PD +VR GL FS + Q Sbjct: 231 DIVLLVDSSGSIGDN------DFEEVKKFLHAFVDRFNLRPD---LVRLGLAQFSDRPYQ 281 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + +K+N LI+ T++ + +E +A+ + Sbjct: 282 EFLLGDYADKKDLHQKLNNLIYRKGGTQTG-------QALTFIRENYFSLARPNVPG--I 332 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I +TDGE+ + + L + G ++ I V + + L+ A+ + + Sbjct: 333 AIVITDGESRDDVEEPAQRL------RNTGVSLFVIRVG-KGNMEKLRAIANIPHEEFLF 385 Query: 344 SVQNSRKL 351 S+ N ++L Sbjct: 386 SINNYQEL 393 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 75/203 (36%), Gaps = 31/203 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ P D L + M+ I+K + + R G+ FS ++ Sbjct: 1167 DLVFLIDGSESIK---PPSWDILK------QTMIGIVKELDIAKDKWRVGVAQFSDILLH 1217 Query: 230 TFPLAWGVQ--HIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L ++E IN + T + L+ + Sbjct: 1218 QFYLNTYTSFAEVEEAINNIKQRKQGTNTWDALKLIKYYFTKENGSRIEGG-----VAQN 1272 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSV 345 ++ +TDGE + ++ L + K + + IG+ E L+ A SPDR Sbjct: 1273 LLLITDGEAND-----EKDLNALADLKNKKIAITVIGIGNEIKKSELREIAGSPDRV--- 1324 Query: 346 QNSRKLHDAFLRIGKEMVKQRIL 368 L + F + + +++L Sbjct: 1325 -----LIETFESLELKTTIRKVL 1342 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 38/252 (15%), Positives = 89/252 (35%), Gaps = 37/252 (14%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMP----FIFCTFPWCANSSHAPLLITSSVKISSKS 165 R+T +S+ + K + +P L + ++++S Sbjct: 353 RNTGVSLFVIRVGKGNMEKLRAIANIPHEEFLFSINNYQELQGLKESLRSKVCLTVTAQS 412 Query: 166 DIGL----DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + L D+ +++D S S +L + +++++ + + N R GL Sbjct: 413 EAFLPKFADLFILVDSSAS--------KQELSIIKNFLQKLIGQLNVGINGN---RVGLA 461 Query: 222 TFSSKIVQTFPLAWGVQHIQEKIN------RLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 FS + + F L + +++ +L + +E+A + F+ Sbjct: 462 QFSENVKEEFLL--NTHRTRNEMSTSIRNLQLTPTGERRIGHAIEHARSNFFN--RDAGS 517 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 A YK++++ + GE++ I + K+ V+A G+ A + Sbjct: 518 RAAEG--YKQFLLVIAAGESADGVIQASR------KIKKDAVTVFAAGLNRADAYEMKDI 569 Query: 336 CASPDRFYSVQN 347 + + V N Sbjct: 570 ASQSHNYKLVGN 581 >gi|73960091|ref|XP_537088.2| PREDICTED: similar to chloride channel calcium activated 4 [Canis familiaris] Length = 905 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 44/192 (22%), Positives = 73/192 (38%), Gaps = 34/192 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + +G+VTF S Sbjct: 310 VCLVLDKSGSMSSE-----DRLFQMNQAAELFL-----IQIIEKGSLTGMVTFESSATIQ 359 Query: 231 FPLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L ++ + L G T GL + I + + Sbjct: 360 NYLTEITDHNAYEKILANLPQAAGGGTSICSGLRAGFQAIIHSNQNTSGSE--------- 410 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 I+ LTDGE+ + + C E K+ G++++ I + AA + L N RFY Sbjct: 411 IVLLTDGEDDN--------ISLCFEEVKKSGSVIHTIALGPSAAKELEILSNMTGGHRFY 462 Query: 344 SVQNSRKLHDAF 355 + ++ L DAF Sbjct: 463 ANKDINGLIDAF 474 >gi|149638912|ref|XP_001511941.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Ornithorhynchus anatinus] Length = 3176 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 55/265 (20%), Positives = 100/265 (37%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 R++SL++ +Y +S Y M + + P+ + Sbjct: 437 RTSSLNVRDLSSDTEYQISV---YAMKGLTSSEAVSIMEKTQPMKVQVECSRGVDIKA-- 491 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + + + P+ V+ LV +S Sbjct: 492 DIVFLVDGSYS------IGIGNFIKVRAFLEVLAKSFEISPNR---VQISLVQYSRDPHT 542 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF A + + K Sbjct: 543 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVASKGSR------SNVPKV 596 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S + + ++A+GV+ +A L+ ASP Y Sbjct: 597 MILITDGKSSDA------FKEPAIKLRNSDVEIFAVGVK-DAVRSELETIASPPAETHVY 649 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 650 TVED----FDAFQRISFELTQSICL 670 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 78/235 (33%), Gaps = 39/235 (16%) Query: 145 CANSSHAPLLITSSVKISSKS-----------DIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + K K D++ ++D S S+ + + Sbjct: 156 SPVLGQLTIQTGGPTKPGEKKPGRPEIQKCSVSAWTDLIFLVDGSWSVGRNNFKYILDFI 215 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-G 250 A ++ + R G+V +SS F L + + I ++ + G Sbjct: 216 AA---------LVSAFDVGEEKTRVGVVQYSSDTRTEFNLNQYYQRNELLSAIKKIPYKG 266 Query: 251 STTKSTPGLEYAY-NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T + ++Y N DA + K I +TDG++ Sbjct: 267 GNTMTGDAIDYLIKNTFTDAAGARIG-------FPKVAIVITDGKSQDEVEIPAR----- 314 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEM 362 E + RG V+++G++A A + + ++P ++V N + D I ++ Sbjct: 315 -ELRNRGVEVFSLGIKAADAKELKQIASTPSLQHVFNVANFDAIVDIQNEIISQV 368 >gi|77463970|ref|YP_353474.1| hypothetical protein RSP_0399 [Rhodobacter sphaeroides 2.4.1] gi|77388388|gb|ABA79573.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 566 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 91/232 (39%), Gaps = 37/232 (15%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F + GSI I +L ++ ++ GL ++ F +A+L LD ++L A+ Sbjct: 13 LRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS---- 68 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 Q + + + + + L E + + +L++ Sbjct: 69 -------LTQSRSPAEVVRDYVTKAGLADYLDE--------PVVNANTLNVR-------- 105 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +++A + Y MP +F L +V + + +++ +VLD+S SM Sbjct: 106 SVTATAAYSMPTVFM------KLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDG 159 Query: 186 GPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPL 233 D+L + R+ +DI+ S D V+ +V ++ ++ L Sbjct: 160 TNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADL 211 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPDRFYSVQNSRKLHDAF 355 P + N+ + C+ A+ +G VY++ +AEA Q L+ CAS Y ++ F Sbjct: 493 DPTVKNERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGPQIRTVF 552 Query: 356 LRIGKEMVKQRI 367 I + + R+ Sbjct: 553 HSIASHITQLRL 564 >gi|294141918|ref|YP_003557896.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578715|dbj|BAG66042.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328387|dbj|BAJ03118.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 689 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 57/339 (16%), Positives = 111/339 (32%), Gaps = 33/339 (9%) Query: 43 FVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFA 102 + + + L + + +N + I + + D Sbjct: 138 LSRNHIMGQMSAPGLPAFREASSSDNFKRQTANGIMVAGEIPVSTFSIDTDTGSYTTLRR 197 Query: 103 -QDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT--FPWCANSSHAPLLITSSV 159 + + ++ + + +Y S S E PF T P N L I Sbjct: 198 WINQGRLPEKGTVRVEEMINYFNYQYSTPSTVEQPFSVNTELAPSPYNDHKMLLRIGLKG 257 Query: 160 KISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 KS +G ++ +LDVS SM DKL + S++ + + V+ VV + Sbjct: 258 YEVDKSQLGASNLVFLLDVSGSM-----NSRDKLPLLKTSLKMLSQQLSEQDHVSIVVYA 312 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G +V + I + +N L G +T G++ AY +H + Sbjct: 313 GASG----VVLDGVKGNDIYAINQALNNLKAGGSTNGGAGIQQAY------GLAQKHFIQ 362 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA 337 G + +I TDG+ + D++ + + +G + +G D ++ A Sbjct: 363 GGVNR---VILATDGDFNVGTTDHQALMDLIASKRDQGIALTTLGFGQGNYNDHLMEQLA 419 Query: 338 --SPDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L L I K++ Q Sbjct: 420 DKGNGHYAYIDTLNEARKVLVDELSSTLLTIAKDVKIQV 458 >gi|319425442|gb|ADV53516.1| lipoprotein with VWA and DUF3520 domains [Shewanella putrefaciens 200] Length = 638 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 55/336 (16%), Positives = 116/336 (34%), Gaps = 51/336 (15%) Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST---SLSI 116 A I + + ++N F +I I E+ + F+ D++ ST + Sbjct: 127 AAPIASDAWYGIKQPERNRFEKQIQNGI---MVAGEIPISTFSIDVDTGSYSTLRRMIKE 183 Query: 117 IIDDQHKDYNLS-----AVSRYEMPFIFCTFPWCANSSHAPLLITSSV----------KI 161 + + Y +P P+ + AP + ++ Sbjct: 184 GSLPEKGTIRIEEMLNYFTYDYPLPNKNAA-PFSVTTELAPSPYNDDMMLLRIGLKGYEL 242 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + +++ +LDVS SM DKL + +++ + + + V+ VV +G Sbjct: 243 TKSELGASNLVFLLDVSGSMA-----SADKLPLLQTALKMLTQQLSAQDKVSIVVYAGAA 297 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +V +Q + + +L G +T + G+ AY +H +G Sbjct: 298 G----VVLDGASGDDIQALTYALEQLRAGGSTNGSQGILQAYQL------AQKHFIQGGI 347 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--S 338 + +I TDG+ + + + + + K+RG + +G DQ ++ A Sbjct: 348 NR---VILATDGDFNVGVTNFDQLISLIEKEKQRGIGLTTLGFGMGNYNDQLMEQLADKG 404 Query: 339 PDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L L I K++ Q Sbjct: 405 NGHYAYIDTLNEARKVLVDELSSTLLTIAKDVKVQI 440 >gi|327271798|ref|XP_003220674.1| PREDICTED: matrilin-4-like [Anolis carolinensis] Length = 592 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 46/208 (22%), Positives = 86/208 (41%), Gaps = 27/208 (12%) Query: 157 SSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + SK G LD++ V+D S S+ M + M+DII ++ N Sbjct: 32 GTGSQQSKCKTGPLDIVFVIDSSRSVRPFEFETMRRF---------MIDIIHNLDIGPNA 82 Query: 216 VRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEK 272 R G++ +SS++ F L + +++ IN ++ T + ++Y N F +E Sbjct: 83 TRVGVIQYSSQVQNVFSLKSFFTRAEMEKAINNIVPLAQGTMTGLAIQYVMNVAFTTQEG 142 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K + + +TDG + +A+ G +YA+GVQ + Sbjct: 143 ARPLHKK---IPRVAVIVTDGRPQDRVTEVSA------QARAAGIEIYAVGVQRADMNS- 192 Query: 333 LKNCASP---DRFYSVQNSRKLHDAFLR 357 L+ ASP + + V++ L F + Sbjct: 193 LRAMASPALEEHVFLVESF-DLIQQFGK 219 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 74/182 (40%), Gaps = 26/182 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V+D S S+ + + + +++D + P R GLV +SS++ Sbjct: 356 IDLVLVIDGSKSVRPQ------NFELVKQFVNQIVDFLDVSPHG---TRVGLVQYSSRVR 406 Query: 229 QTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL A ++ +++ + T + L++ F E + + Sbjct: 407 TEFPLNKFTTAADLKKAVQRVQYME--KGTMTGLALKHMLEHSFTEAEGARPL---SQNV 461 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRF 342 + + TDG + + + +K G I++A+GV + + + P D+ Sbjct: 462 PRIGLVFTDGRSQD------DISEWARRSKEAGIIMFAVGVGKAVESELREIASEPVDKH 515 Query: 343 YS 344 +S Sbjct: 516 FS 517 >gi|239833540|ref|ZP_04681868.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] gi|239821603|gb|EEQ93172.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] Length = 637 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/262 (16%), Positives = 73/262 (27%), Gaps = 81/262 (30%) Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----- 235 M+ + P L +R+ LD I G +S + PL Sbjct: 380 MSLAYSPRQSDLK--KYYLRDSLDKI----YRKGRSEGGGPNYSCTTLPLTPLTDVTTEQ 433 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 G++ +Q I ++ T + + + I E A K +I LTDG N Sbjct: 434 GMKTVQTAIKAMVPNGGTNVPEAMAWGWRTIVQGAPFTEARASTERGNDKVVIVLTDGAN 493 Query: 296 SSPNID-----------------------------------------------------N 302 + D N Sbjct: 494 TYYKYDGLAGSGPDRAGNLSYYSTHGYTARITKKYSQSRLFQESGVSVSQNNTTYTKALN 553 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCAS----------PDRFYSV 345 C+ AK IV + + A+ L++C+S P + + Sbjct: 554 ARFAKLCDNAKAANIIVMTVALDLNEANSTEKAQIDLLRSCSSNSRVRMEGGKPAKLFWN 613 Query: 346 QNSRKLHDAFLRIGKEMVKQRI 367 +L + F +IG E+ R+ Sbjct: 614 STGGELSETFRQIGDELSNLRL 635 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 91/246 (36%), Gaps = 31/246 (12%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 L I F + +G+ +++ A++L + + + ++T++ V+ + LD + L + Sbjct: 32 LTILRFGKDERGNFAMIAALVLVPLLLAGMVAVDTANLMRVRNNVQASLDAAALAVGKRF 91 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 E+ + Y + + D N + + D Q Sbjct: 92 STGESHTVVQDYGARIFYANV--------------TALSADAINFQIAFPQDKTTDQQ-- 135 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + A + + +F + ++ S + +++ +VLD S SM++ Sbjct: 136 ---VQATAAFTYKSLFGVVASRLTGDNWDK-HQYTLTASVRLKNTIEVALVLDNSKSMDE 191 Query: 184 HF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF--PLAWGVQHI 240 G ++ + + ++++ + S +S L+T+ K VQ P A V Sbjct: 192 TRSGSSKKRIDLLKDAASQLVETMAS--------QSALITYVEKPVQFSLVPFAGSVNVG 243 Query: 241 QEKINR 246 + +N Sbjct: 244 PQYLNA 249 >gi|149042955|gb|EDL96529.1| matrilin 4 (predicted), isoform CRA_a [Rattus norvegicus] Length = 637 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 75/195 (38%), Gaps = 26/195 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ ++D S S+ + + + +L + N R G++ +SS+ Sbjct: 46 GPLDLVFMIDSSRSVRPF------EFETMRQFLVGLLHSLDV---GLNATRVGVIQYSSQ 96 Query: 227 IVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + FPL + ++ I ++ T + ++YA N F E + Sbjct: 97 VQSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAE---GARPSEERV 153 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + ++ +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 154 PRVLVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDQ 206 Query: 341 RFYSVQNSRKLHDAF 355 + V++ L F Sbjct: 207 HVFLVESF-DLIQEF 220 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 67/177 (37%), Gaps = 25/177 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 400 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 450 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + Sbjct: 451 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEVQGARPRDLN---V 505 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + TDG + + + AK G ++YA+GV ++ + + P Sbjct: 506 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEEELREIASEPS 556 >gi|157818269|ref|NP_001100009.1| matrilin-4 [Rattus norvegicus] gi|149042956|gb|EDL96530.1| matrilin 4 (predicted), isoform CRA_b [Rattus norvegicus] Length = 624 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 75/195 (38%), Gaps = 26/195 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ ++D S S+ + + + +L + N R G++ +SS+ Sbjct: 33 GPLDLVFMIDSSRSVRPF------EFETMRQFLVGLLHSLDV---GLNATRVGVIQYSSQ 83 Query: 227 IVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + FPL + ++ I ++ T + ++YA N F E + Sbjct: 84 VQSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAE---GARPSEERV 140 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + ++ +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 141 PRVLVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDQ 193 Query: 341 RFYSVQNSRKLHDAF 355 + V++ L F Sbjct: 194 HVFLVESF-DLIQEF 207 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 67/177 (37%), Gaps = 25/177 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 387 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 437 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + Sbjct: 438 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEVQGARPRDLN---V 492 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + TDG + + + AK G ++YA+GV ++ + + P Sbjct: 493 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEEELREIASEPS 543 >gi|94969085|ref|YP_591133.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551135|gb|ABF41059.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 349 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 74/211 (35%), Gaps = 31/211 (14%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +S++ L +++ +D S S D A R R++L R + Sbjct: 116 QRESELPLSIVIAIDASGSTKKDLKLETD---SAKRFARDIL---------RPQDRLSVY 163 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FS + + P ++ I I+ +I GS T + A + + Sbjct: 164 AFSETVEEIVPFTSDLRRIDRGISEIIAGSATAMYDTIFLASKALMKHDGR--------- 214 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FLKN 335 K ++ +TDG ++ + +++ +++ + + V A ++ Sbjct: 215 ---KVMVLITDGGDTFSSTSYEQAARAATQSETLLYSIIVVPVANSAGRDTGGEHALIQI 271 Query: 336 CA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + Y + L AF +I E+ Q Sbjct: 272 SQDTGGKHYYATDMGSLDVAFKQISDELRTQ 302 >gi|282897673|ref|ZP_06305672.1| von Willebrand factor, type A Precursor [Raphidiopsis brookii D9] gi|281197352|gb|EFA72249.1| von Willebrand factor, type A Precursor [Raphidiopsis brookii D9] Length = 474 Score = 69.5 bits (168), Expect = 7e-10, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 79/237 (33%), Gaps = 29/237 (12%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKI-SSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 + + I+ T + + K+ S + +++++D S SM+D Sbjct: 12 KPLLFAIYGTSGCLTAAILLGEPFLALTKLGKSSTIKPQAIVLLIDTSSSMSD------G 65 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 KL + + + D + +V F + + PL + + I++L+ Sbjct: 66 KLAEVKTAASQFIQRRNLESD-----QIAVVNFGATVQTPAPLTNDINTLNNAIDQLLEI 120 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+ A +++ K II TDG PN +L Sbjct: 121 GSTPMGEGINTAQDQL------------QATTLNKNIILFTDGLPDDPNFAYNSALSV-- 166 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + G + A+ + + + NS + AF + + KQ I Sbjct: 167 --RNAGIKLIAVATGGADTNYLTQITGDRSLVFYA-NSGQFDQAFSQAEAVIYKQLI 220 >gi|134093103|gb|ABO52963.1| matrilin 4 isoform 1 precursor [Lemur catta] Length = 583 Score = 69.5 bits (168), Expect = 7e-10, Method: Composition-based stats. Identities = 43/203 (21%), Positives = 78/203 (38%), Gaps = 26/203 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V LD++ V+D S S+ + + + +L + P N R Sbjct: 25 VAGPRCHTGPLDLVFVIDSSRSVRPF------EFETVRQFLVGLLHGLNVGP---NATRV 75 Query: 219 GLVTFSSKIVQTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEH 275 G++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 76 GVIQYSSQVQSVFPLGAFSRPEDMERAIRALVPLAQGTMTGLAIQYAMNVAFSVAE---G 132 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 133 ARPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRA 185 Query: 336 CAS---PDRFYSVQNSRKLHDAF 355 AS + + V++ L F Sbjct: 186 MASHPLDEHVFLVESF-DLIQEF 207 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 69/175 (39%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + PD R GLV FSS++ Sbjct: 346 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPDG---TRVGLVQFSSRVR 396 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 397 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 451 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV ++ L+ AS Sbjct: 452 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEEE-LREIAS 499 >gi|320101795|ref|YP_004177386.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319749077|gb|ADV60837.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 764 Score = 69.5 bits (168), Expect = 7e-10, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 70/219 (31%), Gaps = 31/219 (14%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L + +K+ ++++LD S SM+ G +++ A + + E L+ Sbjct: 268 LASPTIETTPNKTPPAKTVVLILDRSGSMS---GKKIEQARAAMKFVVENLNQDDLFN-- 322 Query: 213 NNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 L+ + + P A I + G +T GL I Sbjct: 323 -------LILYDDTVEMFKPELLRCNAENRAEALRFIEGVRPGGSTDIDQGLRAGLKLIA 375 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D Y+IFLTDG +S + + A A ++ GV + Sbjct: 376 DESRPN------------YVIFLTDGLPTSGETNELKIAEAARAANPLKAKLFVFGVGYD 423 Query: 328 AADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + L + + V+ L A R + + Sbjct: 424 VNARLLDRLSGENGGVSFYVKPDDNLEVAVSRFYERIST 462 >gi|332558842|ref|ZP_08413164.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] gi|332276554|gb|EGJ21869.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] Length = 566 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 91/232 (39%), Gaps = 37/232 (15%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F + GSI I +L ++ ++ GL ++ F +A+L LD ++L A+ Sbjct: 13 LRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS---- 68 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 Q + + + + + L E + + +L++ Sbjct: 69 -------LTQSRSPAEVVRDYVAKAGLEDYLDE--------PVVNANTLNVR-------- 105 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +++A + Y MP +F L +V + + +++ +VLD+S SM Sbjct: 106 SVTATAAYSMPTVFM------KLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDG 159 Query: 186 GPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPL 233 D+L + R+ +DI+ S D V+ +V ++ ++ L Sbjct: 160 TNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADL 211 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPDRFYSVQNSRKLHDAF 355 P + N+ + C+ A+ +G VY++ +AEA Q L+ CAS Y ++ F Sbjct: 493 DPTVKNERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGPQIRTVF 552 Query: 356 LRIGKEMVKQRI 367 I + + R+ Sbjct: 553 HSIASHITQLRL 564 >gi|262197272|ref|YP_003268481.1| hypothetical protein Hoch_4090 [Haliangium ochraceum DSM 14365] gi|262080619|gb|ACY16588.1| Myxococcales GC_trans_RRR domain protein [Haliangium ochraceum DSM 14365] Length = 602 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 72/230 (31%), Gaps = 59/230 (25%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD+++V+D S SM ++ + + ++D + R LV++ S Sbjct: 130 PLDLVVVVDTSGSMATDA-----RMDYVRQGLHLLVDAVD------EDDRLALVSYQSFA 178 Query: 228 VQTFPL--------------------------------------AWGVQHIQEKINRLIF 249 L AW + ++ L Sbjct: 179 EVHAELPALPVEETPEEPTEPTDPVGEPTDPPADPDEDPVDEREAW-RSEMHALVDTLQP 237 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T GLE + +A+ H D + +I L+DG + D+ + Sbjct: 238 GGGTNIYEGLERGFEIAKEAR-------VNHPDRAQRVILLSDGLATEGITDSASIIALS 290 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 G + +GV A + ++ A FY V++ + + F Sbjct: 291 EAFIEGGMGLTTVGVGASFNVELMRGLAERGAGNFYFVEDPEAVREVFTE 340 >gi|326315855|ref|YP_004233527.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] gi|323372691|gb|ADX44960.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] Length = 355 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 73/244 (29%), Gaps = 62/244 (25%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+ +DVS SM D+L A + + I ++ VR G+V F+ Sbjct: 88 IMLAMDVSGSMR-AADVHPDRLTAAQDAAKAF------IAELPRHVRVGIVAFAGSAQLA 140 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF----------------------D 268 + + + I+ T + G+ + +F D Sbjct: 141 QLPTQNHEDLFKAIDSFQLQRGTATGNGILLSLATLFPDTGIDVSALGGRQAMPRPQSMD 200 Query: 269 AKEKLEHIAKGHDDYKK------------YIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + H + II LTDG+ ++ + + A RG Sbjct: 201 EIGRPPHRGSNGRGADRPAPVAPGSYSSAAIIMLTDGQRTTGV----DPMEAAQWAADRG 256 Query: 317 AIVYAIGVQAEAADQF---------------LKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 VY +GV A + LK A + ++ + L + + Sbjct: 257 VRVYTVGVGTVAGETIGFEGWSMRVRLDEDTLKAVAQRTNAEYFHAATAADLKKVYETLS 316 Query: 360 KEMV 363 + Sbjct: 317 SRLT 320 >gi|261409463|ref|YP_003245704.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285926|gb|ACX67897.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 421 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 79/209 (37%), Gaps = 36/209 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SMN+ P D+ A I M D +N R ++ F Sbjct: 114 DIVLVIDNSGSMNET-DPNQDRYTAAKNLINRM--------DRDN--RVSVMVFDHATTL 162 Query: 230 TFPLAW-----GVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P I +I+ L T + LE + I ++++ Sbjct: 163 LQPFTRVKNQETKDEIIAEIDGLATNDGGTDISLALEDTMSHIQESRDAGRSAM------ 216 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKNCA--SP 339 +I L+DG D+ L E K++ V IG+ Q L+ A + Sbjct: 217 ---VIMLSDG---FSETDHDRVLA---EYKQQQIAVNTIGLSLVNPDGAQLLQTIAAETG 267 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 ++Y VQ++ L F +I ++ + +L Sbjct: 268 GQYYDVQHAEDLSFVFQKIYDDVGDRSLL 296 >gi|126462813|ref|YP_001043927.1| hypothetical protein Rsph17029_2052 [Rhodobacter sphaeroides ATCC 17029] gi|126104477|gb|ABN77155.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029] Length = 566 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 51/288 (17%), Positives = 107/288 (37%), Gaps = 40/288 (13%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F + GSI I +L ++ ++ GL ++ F +A+L LD ++L A+ Sbjct: 13 LRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS---- 68 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 Q + + + + + L E + + +L++ Sbjct: 69 -------LTQSRSPAEVVRDYVAKAGLEDYLDE--------PVVNANTLNVR-------- 105 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +++A + Y MP +F L +V + + +++ +VLD+S SM Sbjct: 106 SVTATAAYSMPTVFM------KLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDG 159 Query: 186 GPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPL--AW-GVQ 238 D+L + R+ +DI+ S D V+ +V ++ ++ L + V Sbjct: 160 TNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLATYPNVS 219 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 H Q + + F ++ +T L +L + Y Sbjct: 220 HRQPYSSCVEFAASDFTTTALANGATLTGSGNSELFSSSSSTQTPTYY 267 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPDRFYSVQNSRKLHDAF 355 P + N+ + C+ A+ +G VY++ +AEA Q L+ CAS Y ++ F Sbjct: 493 DPTVKNERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGPQIRTVF 552 Query: 356 LRIGKEMVKQRI 367 I + + R+ Sbjct: 553 HSIASHITQLRL 564 >gi|300782091|ref|YP_003762382.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299791605|gb|ADJ41980.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 602 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 75/205 (36%), Gaps = 34/205 (16%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++V+DVS SM D G G K+ +A ++ +D + + GL F++ + Sbjct: 411 VLLVVDVSGSMGDEVKGTGKSKIDLAKQAA---IDSLGQFVPRDQ---VGLWQFATHLDG 464 Query: 230 T---------FPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 PL + G + + +++ L S T AY + H+ Sbjct: 465 DKDYQELLPVQPLGSNGKETLASRLSGLTPQSGTGLYDSSLAAYEYL------KAHLDPS 518 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG----AIVYAIGVQAEAADQFLKN 335 + ++ LTDG N P + + L + + G ++ I +A LK Sbjct: 519 AINA---VVVLTDGRNEDPGGVDLDHLVP--QLRPEGNAESVRLFTIAYGGDADQNVLKQ 573 Query: 336 C--ASPDRFYSVQNSRKLHDAFLRI 358 A+ Y ++ F + Sbjct: 574 IAEATAGSEYDSSKPDSINQVFTSV 598 >gi|253584082|ref|ZP_04861280.1| batA protein [Fusobacterium varium ATCC 27725] gi|251834654|gb|EES63217.1| batA protein [Fusobacterium varium ATCC 27725] Length = 325 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 49/222 (22%), Positives = 82/222 (36%), Gaps = 36/222 (16%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD---IGLDMM 172 I + +++ ++ + +Y I T + + + S K+ + GL++ Sbjct: 27 IGMSKRNRILDILKLKKYNSVQIIKTILLTLGAIMVVIALLSPQKLLDEDTVEVKGLNIY 86 Query: 173 MVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++D S SM + P ++L A R++ +L +K R G + FS Sbjct: 87 ALIDTSRSMMAEDVYP--NRLEAAKRTLENLLQGLKG-------DRIGFIPFSDSAYIQM 137 Query: 232 PLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + IN LI G T+ LE A E K + K I Sbjct: 138 PLTDDYSIGKNYINALDTNLISGGGTELYQALELA-----------EKSFKEINSDNKTI 186 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 I L+DG + D K F K V++IG+ E Sbjct: 187 IILSDG----GDFDEKSLKFV----KDNKMNVFSIGIGTEEG 220 >gi|330806846|ref|YP_004351308.1| lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327374954|gb|AEA66304.1| Putative lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 557 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 83/227 (36%), Gaps = 29/227 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW ++ + I +S + + +++ ++DVS SM+ G +S ++ Sbjct: 170 PWNPHTRLLRIGIKASDRAVADL-APANLVFLVDVSGSMDRREGLP------LVKSTLKL 222 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI--QEKINRLIFGSTTKSTPGLE 260 L + + + R LV ++ + G + + I++L G +T G+E Sbjct: 223 L-----VDQLRDQDRVSLVVYAGESRVVLKPTSGRDKVTIRNAIDQLDAGGSTAGASGIE 277 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY A+E I+ TDG+ + D + ++ G + Sbjct: 278 LAYQM---ARESFIDKGINR------ILLATDGDFNVGVSDFDSLKQMAVDQRKSGVSLT 328 Query: 321 AIGVQAEAADQFLK---NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +G + ++ L A + + N L +A + ++ Sbjct: 329 TLGFGVDNYNEHLMEQLADAGDGNYAYIDN---LLEARKVLVDQLSS 372 >gi|325267447|ref|ZP_08134103.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] gi|324981088|gb|EGC16744.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] Length = 238 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 83/210 (39%), Gaps = 17/210 (8%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K ++ + L ++++LDVS SM+ DK+ +++ MLD + + Sbjct: 4 PKKFTTPTAKPLPVVLLLDVSSSMSG------DKIDNLNKAVENMLDTFAQEEKMETEIL 57 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++TF K+ P + Q + + L T L+ A I D + Sbjct: 58 VSVITFGGKVDLHVPFT---KASQVQWHGLQVNGDTPMGTALKMAKAMIEDKETTPSRAY 114 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + I+ ++DG+ + NI + + +E + A+ + +A + LK Sbjct: 115 RPT------IVLVSDGQPTDGNIWKQAMADFISEGRSSKCDRMAMAIGHDADETVLKRFI 168 Query: 338 SPDRF--YSVQNSRKLHDAFLRIGKEMVKQ 365 + +N+ +LH+ F R+ + + Sbjct: 169 EGTAHDLFYAENAGQLHEFFQRVTMSVTMR 198 >gi|261415941|ref|YP_003249624.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372397|gb|ACX75142.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326143|gb|ADL25344.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 228 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 35/169 (20%), Positives = 65/169 (38%), Gaps = 15/169 (8%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +VLD S SM G + +L I D ++S +VTF Sbjct: 15 NPSTRVPVCLVLDTSGSME---GQPISELN---EGINCFYDAVRSDETALYAAEIAVVTF 68 Query: 224 SSKIVQTFPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 V L ++ + + T + A + + K K E+ A G D Sbjct: 69 GGSAV----LKTDFSTLEHQPDSPNFFANGGTPMGEAMNMALDLL--EKRKGEYKASGVD 122 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAEAA 329 Y+ +I+ +TDG+ + + + ++ C K R ++ IG+ +A Sbjct: 123 YYQPWIVLMTDGKPNGDSSEYARAVQRTCEMIKNRKLTIFPIGIGEDAD 171 >gi|296228122|ref|XP_002759734.1| PREDICTED: collagen alpha-4(VI) chain-like [Callithrix jacchus] Length = 2348 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 78/209 (37%), Gaps = 28/209 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + + D++ ++D S S+ + ++ + D VR Sbjct: 224 ISPACREGAVADVVFLVDSSTSIGPQ------NFQKVKNFLYSVVSGLDVSSDR---VRV 274 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKL 273 GL ++ I F L ++ + I T + LE+ I + Sbjct: 275 GLAQYNDDIYPAFQL--NQHPLKSMVLEQIQNLPYRTGGTNTGNALEF----IRTSYLTE 328 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ D + +I +TDGE++ E ++ K G +VY +G+ + + Sbjct: 329 GSGSRAKDRVPQIVILVTDGESND------EVQEAADQLKEDGIVVYVVGINVQDVQELQ 382 Query: 334 KNCASP-DRF-YSVQNSRKLHDAFLRIGK 360 K + P ++F ++++N L D I + Sbjct: 383 KIASEPFEKFLFNIENFNILQDFSGSILQ 411 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 45/195 (23%), Positives = 76/195 (38%), Gaps = 27/195 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S + + E++ + + PD V+ G++ +S +I Sbjct: 843 DIYFLIDGSGS------IYPEAFLDMKVFMNEVIKMFQIGPDR---VQFGVIQYSDEIKS 893 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 F L+ + E + A N + I + +Y+I Sbjct: 894 KFVLS-QYPTVAELKVAIDNIQQGGGGTTTGEALNNMTQVFADTARI-----NVARYLIV 947 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 +TDG++S P D E L + G I+YAIGV+ EA LK A F+ + Sbjct: 948 ITDGKSSDPVADAAEGL------RASGVIIYAIGVR-EANIDELKEIAKDKIFFVYE--- 997 Query: 350 KLHDAFLRIGKEMVK 364 D I KE+V+ Sbjct: 998 --FDLLKDIQKEVVR 1010 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 72/207 (34%), Gaps = 27/207 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + D++ ++D S S++ R + M++ + ++ G Sbjct: 1014 SSEACKNSKADIVFLIDGSESISPE------DFERMKRFVESMVNQSNI---GTDSIQIG 1064 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEH 275 L+ FSS + F L + N + T++ L + +K Sbjct: 1065 LLQFSSIPKEEFRLNRYSSKVDIY-NAIFAVQQMRDGTRTGKALNFTLPFFESSKG---- 1119 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 G ++Y+I +TDG + + + + I++AIGV Q L+ Sbjct: 1120 ---GRPSVQQYLIVITDG------VAQDNVILPAKALRDKNIIIFAIGVGEAKKSQLLEI 1170 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEM 362 DR Y + L + I ++ Sbjct: 1171 TNDEDRVYYDVDFEVLQNLEKEILSKV 1197 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 67/179 (37%), Gaps = 19/179 (10%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF 249 ++ ++ ++ PD VR GLV +S + F L I E +++L + Sbjct: 649 FQQVISFLKTIVSLLSIRPDA---VRFGLVFYSEEPRLEFSLDTFQNPAEILEHLDKLTY 705 Query: 250 G---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 TK+ L++ N++F E ++ ++ + +T+ N Sbjct: 706 RERRGRTKTGAALDFLRNEVF----IQEKGSRSSHGVQQIAVVITE------NFSQDSVS 755 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +R G +YA+G Q + + L+ AS + IG ++ Q Sbjct: 756 GPASRLRRAGVTIYAVGTQDVSESKDLEKMASYPPWKHSVPLESFLQ-LSIIGSKLTNQ 813 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 56/151 (37%), Gaps = 12/151 (7%) Query: 196 TRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGST 252 TRS+R L I + S + +R GL +S F L+ + + I + F Sbjct: 48 TRSVRNFLYILVNSFNVSSKTIRVGLAQYSDVPHSEFLLSTYHRKNDVLRHIRQFQFKPG 107 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + A I D + ++ + + ++ G E+L Sbjct: 108 ---GKKMGLALQFILDHHFQEAAGSRASQGVPQIAVVMSSGPAEDHVHGPGEAL------ 158 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 +R G +VYA+GV+ + + +SP + Sbjct: 159 RRAGILVYAVGVKDAVWAELREIASSPQENF 189 >gi|297473020|ref|XP_002686329.1| PREDICTED: chloride channel accessory 1 [Bos taurus] gi|296489230|gb|DAA31343.1| chloride channel accessory 1 [Bos taurus] Length = 911 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 51/213 (23%), Positives = 76/213 (35%), Gaps = 51/213 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G + +L A + + V G+VTF S Sbjct: 309 VCLVLDKSGSM--TIGNRLKRLNQAGKLFL--------LQTVEQGSWVGMVTFDSAAYVQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L A + + + T GL A+ I Sbjct: 359 SELVQINSATERDTLTKSL-PTTASGGTSICSGLRSAFTVIKKKYPTDGAE--------- 408 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF---------LKN 335 II LTDGE++ ++ C +E K+ GAI++ + + AA + L+ Sbjct: 409 -IILLTDGEDN--------TISACFDEVKQSGAIIHTVALGPSAAQELEQMSKMTGGLQT 459 Query: 336 CASPDRFYSVQNSRKLHDAFLRI--GKEMVKQR 366 AS VQN L DAF + G + V QR Sbjct: 460 YASD----QVQN-NGLVDAFAALSSGNKAVSQR 487 >gi|257469960|ref|ZP_05634052.1| von Willebrand factor (vWA) type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] Length = 322 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 82/221 (37%), Gaps = 34/221 (15%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD---IGLDMM 172 I + +++ ++ + +Y I T + + + S ++ ++ GL++ Sbjct: 24 IGMSKRNRILDILKLKKYNFVQIIKTILMTLGAFMVVIALLSPQELLNEDTVEVKGLNIY 83 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 ++D S SM ++L A R++ +L +K R G + FS P Sbjct: 84 ALIDTSRSMMTE-DVYPNRLEAAKRTLENLLQGLKG-------DRIGFIPFSDSAYIQMP 135 Query: 233 LAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + IN LI G T+ LE A E K + K II Sbjct: 136 LTDDYSIGKNYINALDTNLISGGGTELYQALELA-----------EKSFKEINSDNKTII 184 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 L+DG + D+K F K V++IG+ + Sbjct: 185 VLSDG----GDFDDKSLKFV----KDNKMNVFSIGIGTDEG 217 >gi|309789848|ref|ZP_07684427.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308228152|gb|EFO81801.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 420 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 78/219 (35%), Gaps = 30/219 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L+ + ++ +++ VLD S SM +D++ A + LD Sbjct: 28 LIEVQPSAVMAQVRTPVNVSFVLDRSGSMKGD---KIDRVRQAISLAVDRLDAQDIAS-- 82 Query: 213 NNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 LV F + P A + I+++++R+ TK P +E +I + Sbjct: 83 -------LVIFDHRNEVLIPAAPVTDRRMIKDRVSRIRDAGGTKIAPAVEKGLREIEKDR 135 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 ++ LTDG+ + E L ++A R G + A+GV + + Sbjct: 136 SGAIRR----------LVLLTDGQ----TENEDECLRRADDAGRIGVPITALGVGQDWNE 181 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L A S + + ++ + F + I Sbjct: 182 DLLIEMANRSGGTADYIARADEITEYFQNTVQRAQNSAI 220 >gi|126352405|ref|NP_001075268.1| calcium-activated chloride channel regulator 1 precursor [Equus caballus] gi|122142874|sp|Q2TU62|CLCA1_HORSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=eCLCA1; Flags: Precursor gi|46578151|gb|AAT01505.1| putative calcium activated chloride channel-like protein 1 [Equus caballus] Length = 913 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 43/199 (21%), Positives = 73/199 (36%), Gaps = 39/199 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM D+L T++ + L + V G+VTF S Sbjct: 307 VCLVLDKSGSMA-----IGDRLKRLTQAGKLFL-----LQTVEQGSWVGMVTFDSAAYVQ 356 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + + T GL A+ I + Sbjct: 357 SALRQIKGGTDRDALTKSL-PTVASGGTSICSGLRSAFTVIRKKYKTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISSCFNEVKQSGAIIHTVALGPSAAAELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRIG 359 Y+ ++ L DAF + Sbjct: 458 YASDQAQNNGLIDAFGALS 476 >gi|84498180|ref|ZP_00996977.1| putative membrane protein [Janibacter sp. HTCC2649] gi|84381680|gb|EAP97563.1| putative membrane protein [Janibacter sp. HTCC2649] Length = 654 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 62/214 (28%), Gaps = 31/214 (14%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S V + ++++D S SM + ++++ L V Sbjct: 75 SPVTSKPATRAQRTTVLLIDTSGSMGRS------GMATVRTAVKDFL------ASAPKDV 122 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G+V+F + +Q ++ L T G+ A + ++ Sbjct: 123 RIGVVSFGNTAGPEIAPTTARAAVQAVVDDLRADGNTALFSGVTQAVRMLGSTGDRS--- 179 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKN 335 I+ L+DG+N+ D L +A V + D Sbjct: 180 ----------IVLLSDGKNTVG--DRASGLAAAGKALTASQVRVEVVRFTTGENDPEALA 227 Query: 336 C---ASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A ++ + AF K + Q Sbjct: 228 AFAKAGGGSVVQATDAEGVRTAFQTAAKVLESQV 261 >gi|240169099|ref|ZP_04747758.1| hypothetical protein MkanA1_07284 [Mycobacterium kansasii ATCC 12478] Length = 741 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 79/227 (34%), Gaps = 42/227 (18%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 + + + SS D+++VLD S SM K+ A R+ ++D++ + Sbjct: 279 GTWSVTLVPPAEPSS---APRDVVVVLDRSGSMGGW------KMVAARRAAGRIVDMLDT 329 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLA--------WGVQHIQEKINRLIFGSTTKSTPGLE 260 + R ++ F +I +A + L T+ L Sbjct: 330 VD------RFCVLAFDDRIDTPTDMAPGLVEGSDQNRFAAASWLGSLRSRGGTEMAEPLR 383 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIV 319 A + + E + ++ +TDG+ + + + A+ G + Sbjct: 384 RAVELLAGSDEGRQAS----------VVLVTDGQITGEDHLLRSL------AQSLGRIRI 427 Query: 320 YAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 Y +G+ FL A R V++ +L +A R+ + + + Sbjct: 428 YCVGIDRAVNAGFLDRLARLGRGRAELVESEARLDEAMSRLARTIGR 474 >gi|224057976|ref|XP_002299418.1| predicted protein [Populus trichocarpa] gi|222846676|gb|EEE84223.1| predicted protein [Populus trichocarpa] Length = 595 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 48/256 (18%), Positives = 90/256 (35%), Gaps = 39/256 (15%) Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + ++ + E P + + + +L +D++ VLDVS SM Sbjct: 111 HAITVKTLPEYPAVSASESFSKFGVLVRVLAPPLDNTLPHHRAPIDIVNVLDVSGSMA-- 168 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHI 240 KL + R++ ++ + R +VTFSS + PL G + Sbjct: 169 -----GKLILLKRAVNFIIQNLGPSD------RLSIVTFSSSARRILPLRTMSGSGREDA 217 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-----EN 295 +N L T GL + +E+ +H + II L+DG + Sbjct: 218 ISVVNSLSATGGTNIVAGLRKGVRVL---EERRQHNSVAS------IILLSDGCDTQSHS 268 Query: 296 SSPNIDNKESLFYCNEAKRRGAI-----VYAIGVQAEAADQFLKNCA--SPDRFYSVQNS 348 + ++ + +F N A + ++ G + + + S F +++ Sbjct: 269 THNRLEYLKLIFPSNNASGEESRQPTFPIHTFGFGLDHDSAAMHAISDVSGGTFSFIESI 328 Query: 349 RKLHDAFLR-IGKEMV 363 L DAF R IG Sbjct: 329 DILQDAFARCIGGLTS 344 >gi|158260465|dbj|BAF82410.1| unnamed protein product [Homo sapiens] Length = 937 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 78/204 (38%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D + ++D S S+N H + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADPVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRVIMIVTDGRPQDSVA------EVAAKARDMGILIFAIGVGQVDFNTLKSIGSE 210 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P D + V N + L F + Sbjct: 211 PHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 701 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TNKHLFYAED----FSTMDEISEKLKK 832 >gi|281182610|ref|NP_001162037.1| matrilin-4 [Pongo abelii] gi|134093133|gb|ABO52993.1| matrilin 4 isoform 1 precursor [Pongo abelii] Length = 581 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 77/197 (39%), Gaps = 26/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ V+D S S+ + + + +L + P N R G++ +S Sbjct: 29 HTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVGVIQYS 79 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ ++ I L+ T + ++YA N F E + Sbjct: 80 SQVQSVFPLRAFSRSEDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEE 136 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 137 RVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 189 Query: 340 -DRFYSVQNSRKLHDAF 355 + + V++ L F Sbjct: 190 DEHVFLVESF-DLIQEF 205 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 67/176 (38%), Gaps = 25/176 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 344 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 395 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 449 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + TDG + + + AK G ++YA+GV + + + P Sbjct: 450 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAELREIASKP 499 >gi|114682165|ref|XP_001154021.1| PREDICTED: matrilin 4 isoform 4 [Pan troglodytes] gi|114682167|ref|XP_001154256.1| PREDICTED: matrilin 4 isoform 7 [Pan troglodytes] gi|114682175|ref|XP_514674.2| PREDICTED: matrilin 4 isoform 9 [Pan troglodytes] gi|114682177|ref|XP_001154315.1| PREDICTED: matrilin-4 isoform 8 [Pan troglodytes] Length = 581 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLPGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 344 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 395 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 449 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G +VYA+GV + L+ AS Sbjct: 450 PRVGLVFTDGRSQD------DISVWAARAKEEGIVVYAVGVGKAVEAE-LREIAS 497 >gi|114682171|ref|XP_001153893.1| PREDICTED: matrilin 4 isoform 2 [Pan troglodytes] gi|114682181|ref|XP_001153957.1| PREDICTED: matrilin-4 isoform 3 [Pan troglodytes] Length = 499 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLPGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 262 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 312 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 313 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 367 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G +VYA+GV + L+ AS Sbjct: 368 PRVGLVFTDGRSQD------DISVWAARAKEEGIVVYAVGVGKAVEAE-LREIAS 415 >gi|114682169|ref|XP_001154082.1| PREDICTED: matrilin 4 isoform 5 [Pan troglodytes] gi|114682179|ref|XP_001154207.1| PREDICTED: matrilin-4 isoform 6 [Pan troglodytes] Length = 540 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLPGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 303 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 353 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 354 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 408 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G +VYA+GV + L+ AS Sbjct: 409 PRVGLVFTDGRSQD------DISVWAARAKEEGIVVYAVGVGKAVEAE-LREIAS 456 >gi|146327011|gb|AAI41812.1| Matrilin 4 [Homo sapiens] Length = 540 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 77/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ G +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARASGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 303 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 353 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 354 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 408 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 409 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 456 >gi|3927992|emb|CAA07569.1| matrilin-4 [Homo sapiens] gi|124297520|gb|AAI31764.1| Matrilin 4 [Homo sapiens] gi|153217472|gb|AAI51220.1| Matrilin 4 [Homo sapiens] Length = 581 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 77/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ G +YA+GVQ L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARASGIEIYAVGVQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 344 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 395 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 449 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 450 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 497 >gi|329664002|ref|NP_001193105.1| calcium-activated chloride channel regulator 1 [Bos taurus] Length = 909 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 51/213 (23%), Positives = 76/213 (35%), Gaps = 51/213 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G + +L A + + V G+VTF S Sbjct: 307 VCLVLDKSGSM--TIGNRLKRLNQAGKLFL--------LQTVEQGSWVGMVTFDSAAYVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L A + + + T GL A+ I Sbjct: 357 SELVQINSATERDTLTKSL-PTTASGGTSICSGLRSAFTVIKKKYPTDGAE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF---------LKN 335 II LTDGE++ ++ C +E K+ GAI++ + + AA + L+ Sbjct: 407 -IILLTDGEDN--------TISACFDEVKQSGAIIHTVALGPSAAQELEQMSKMTGGLQT 457 Query: 336 CASPDRFYSVQNSRKLHDAFLRI--GKEMVKQR 366 AS VQN L DAF + G + V QR Sbjct: 458 YASD----QVQN-NGLVDAFAALSSGNKAVSQR 485 >gi|325969627|ref|YP_004245819.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] gi|323708830|gb|ADY02317.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] Length = 498 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 76/195 (38%), Gaps = 34/195 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++ LDVS SM + GM K+ +A ++ + + + D R +V F+ + Sbjct: 320 IDIVLCLDVSGSMRE-LSSGMPKIEIAKDAVSQYIQFLSKTND-----RLAMVLFNFRAD 373 Query: 229 QTFPLAWGVQHIQEK-------INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 WG+ ++ + + G T LE + + +K Sbjct: 374 VL----WGLHQVRRYWQQMNYMLKYVYAGGGTNLANALERSREVLTRSKSN--------- 420 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K++I +TDG + ++ KE++ +R G + I + + D+ L + Sbjct: 421 --SKHVICVTDGRTVNSSMCVKEAV----RLRRNGTTISTIAIGENSDDELLMRLSKIGG 474 Query: 340 DRFYSVQNSRKLHDA 354 F + + L A Sbjct: 475 GLFIKISSIHDLGKA 489 >gi|274320027|ref|NP_001162099.1| matrilin-4 [Macaca mulatta] gi|134093113|gb|ABO52973.1| matrilin 4 isoform 1 precursor [Macaca mulatta] Length = 581 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 78/197 (39%), Gaps = 26/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ V+D S S+ + + + +L + P N R G++ +S Sbjct: 29 HTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLNVGP---NATRVGVIQYS 79 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ I L+ T + ++YA N F E + Sbjct: 80 SQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEE 136 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 137 RVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 189 Query: 340 -DRFYSVQNSRKLHDAF 355 + + V++ L F Sbjct: 190 DEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 67/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 344 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 395 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 449 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G +YA+GV + L+ AS Sbjct: 450 PRVGLVFTDGRSQD------DISVWAARAKEEGIAMYAVGVGKAVEAE-LREIAS 497 >gi|281183022|ref|NP_001162498.1| matrilin-4 [Papio anubis] gi|134093054|gb|ABO52914.1| matrilin 4 isoform 1 precursor [Papio anubis] Length = 581 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 78/197 (39%), Gaps = 26/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ V+D S S+ + + + +L + P N R G++ +S Sbjct: 29 HTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLNVGP---NATRVGVIQYS 79 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ I L+ T + ++YA N F E + Sbjct: 80 SQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEE 136 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 137 RVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 189 Query: 340 -DRFYSVQNSRKLHDAF 355 + + V++ L F Sbjct: 190 DEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 344 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 395 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 449 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 450 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 497 >gi|332667371|ref|YP_004450159.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332336185|gb|AEE53286.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 425 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 81/213 (38%), Gaps = 26/213 (12%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + L + S +S + L++ +V+D S SM DK+ A ++ + ++D Sbjct: 25 YLYLELQGSEAPASADRVPLNLSLVIDRSGSMAG------DKIAYAKKAAQFIVD----- 73 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 +++ R +V + + P A Q ++++I + + T + G+ Y ++ Sbjct: 74 -NLSPEDRVSIVQYDDIVEVLSPSAPVLNKQELRQRIALMEARNMTNLSGGMLAGYEQV- 131 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQA 326 K + + L+DG + D E + G V GV A Sbjct: 132 -------ERTKQARFVNRVL-LLSDGLANHGITDPTVLQQMVQEKFRNAGIAVSTFGVGA 183 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + + + + + +Y +++ K+ F Sbjct: 184 DFNELLMTSLSEYGGANYYFIESPDKIPGIFAE 216 >gi|27754463|gb|AAO22679.1| unknown protein [Arabidopsis thaliana] Length = 641 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 50/281 (17%), Positives = 96/281 (34%), Gaps = 36/281 (12%) Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYN-LSAVSRYEMPFIFCTFPWCANSSHAPL 153 + + GF D + T I D D+ L E+ + + L Sbjct: 130 FVAQRGFEDDEPLPQGDTQ--IHSDGHRSDHQALEIKLFPEVSALAKPVSRADFAVLVHL 187 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + ++ LD++ VLDVS SM+ G M+ + A + I ++ Sbjct: 188 KAEGVSDDARRARAPLDLITVLDVSGSMD---GVKMELMKNAMSFV---------IQNLG 235 Query: 214 NVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R +++FSS + FPL G Q + +N L+ T GL+ I Sbjct: 236 ETDRLSVISFSSMARRLFPLRLMSETGKQAAMQAVNSLVADGGTNIAEGLKIGARVI--- 292 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----AKRRGAIVYAIGVQ 325 K ++ L+DG+++ + + ++ G Sbjct: 293 ---EGRRWKNPVSG---MMLLSDGQDN-FTFSHAGVRLRTDYESLLPSSCRIPIHTFGFG 345 Query: 326 AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 ++ + + + S F ++ + DAF + IG + Sbjct: 346 SDHDAELMHTISEVSSGTFSFIETETVIQDAFAQCIGGLLS 386 >gi|15223093|ref|NP_172283.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|8778841|gb|AAF79840.1|AC026875_20 T6D22.13 [Arabidopsis thaliana] gi|332190114|gb|AEE28235.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 641 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 50/281 (17%), Positives = 96/281 (34%), Gaps = 36/281 (12%) Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYN-LSAVSRYEMPFIFCTFPWCANSSHAPL 153 + + GF D + T I D D+ L E+ + + L Sbjct: 130 FVAQRGFEDDEPLPQGDTQ--IHSDGHRSDHQALEIKLFPEVSALAKPVSRADFAVLVHL 187 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + ++ LD++ VLDVS SM+ G M+ + A + I ++ Sbjct: 188 KAEGVSDDARRARAPLDLITVLDVSGSMD---GVKMELMKNAMSFV---------IQNLG 235 Query: 214 NVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R +++FSS + FPL G Q + +N L+ T GL+ I Sbjct: 236 ETDRLSVISFSSMARRLFPLRLMSETGKQAAMQAVNSLVADGGTNIAEGLKIGARVI--- 292 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----AKRRGAIVYAIGVQ 325 K ++ L+DG+++ + + ++ G Sbjct: 293 ---EGRRWKNPVSG---MMLLSDGQDN-FTFSHAGVRLRTDYESLLPSSCRIPIHTFGFG 345 Query: 326 AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 ++ + + + S F ++ + DAF + IG + Sbjct: 346 SDHDAELMHTISEVSSGTFSFIETETVIQDAFAQCIGGLLS 386 >gi|194291599|ref|YP_002007506.1| hypothetical protein RALTA_B0833 [Cupriavidus taiwanensis LMG 19424] gi|193225503|emb|CAQ71449.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative exported protein [Cupriavidus taiwanensis LMG 19424] Length = 356 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 75/238 (31%), Gaps = 53/238 (22%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D+S SM +L A ++ +L+ V G+V + Sbjct: 98 VLLVMDLSGSMRAQDVKP-SRLRAAQQAATTLLEA------QPAGVSVGVVAMAGTAAVA 150 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN----KIFDAKEKLEHIA--------- 277 + + I RL T G+ A ++ E+L + Sbjct: 151 QAPTRAREAVATAIERLQPQGGTALGNGMLIALTTLLPELTPDAERLMNDDTPPPRKPRA 210 Query: 278 -----------KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 K I+ +DGE+++ + + G +Y +GV Sbjct: 211 LANPPADTEPVKPGSYTSGAIVLFSDGESNAGPAALRAAQLAAEH----GVRIYTVGVGT 266 Query: 327 EAA---------------DQFLKNC--ASPDRFYSVQNSRKLHDAFLRI-GKEMVKQR 366 ++ LK A+ ++ ++++ +L + + + +R Sbjct: 267 PEGVVLSVDGWSARVRLDEKVLKEVADATSAEYFRLEDAAELKRVYRALNARLAFDKR 324 >gi|196250158|ref|ZP_03148852.1| von Willebrand factor type A [Geobacillus sp. G11MC16] gi|196210342|gb|EDY05107.1| von Willebrand factor type A [Geobacillus sp. G11MC16] Length = 668 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 38/168 (22%), Positives = 67/168 (39%), Gaps = 29/168 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + +D++ V+DVS SM KL A +++ ++ K+ + + R L Sbjct: 189 VPAPVRPPIDVVFVMDVSGSMTTM------KLQSAKSALQAAVNYFKT--NYHPNDRFAL 240 Query: 221 VTFSSKIVQTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + FS + T + +G + I ++ NRL T + L A + D + Sbjct: 241 IPFSDDVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLAQSYFNDPER- 299 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIV 319 KKYIIFLTDG + N + + + K G + Sbjct: 300 -----------KKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDGEKI 336 >gi|213964310|ref|ZP_03392536.1| BatB protein [Capnocytophaga sputigena Capno] gi|213953052|gb|EEB64408.1| BatB protein [Capnocytophaga sputigena Capno] Length = 345 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 67/207 (32%), Gaps = 29/207 (14%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSS-VKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 SR+++ + L KI + G+D++ +DVS SM Sbjct: 51 SRFKLWVKWSVLAVVFILLSIALANPKIGTKIETVKREGVDIVFAIDVSKSMLAEDVAP- 109 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI- 248 ++L A R E + +K R G+V +++ L + + + Sbjct: 110 NRLEKAKRIAFETISQLKG-------DRVGIVAYAASAYPQLALTTDHSAAKMFLQGMNT 162 Query: 249 ---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 T + A N D + + L+DGE+ + Sbjct: 163 DMLSSQGTAIQEAIRMASNYFDDK-----------TPTARLLFILSDGEDHE-----MGA 206 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQF 332 +EA+ +G +Y IG+ E Sbjct: 207 TEIASEAQEKGVHIYTIGIGTEKGAPI 233 >gi|138896202|ref|YP_001126655.1| hypothetical protein GTNG_2565 [Geobacillus thermodenitrificans NG80-2] gi|134267715|gb|ABO67910.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] Length = 668 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 38/168 (22%), Positives = 67/168 (39%), Gaps = 29/168 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + +D++ V+DVS SM KL A +++ ++ K+ + + R L Sbjct: 189 VPAPVRPPIDVVFVMDVSGSMTTM------KLQSAKSALQAAVNYFKT--NYHPNDRFAL 240 Query: 221 VTFSSKIVQTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + FS + T + +G + I ++ NRL T + L A + D + Sbjct: 241 IPFSDDVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLAQSYFNDPER- 299 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIV 319 KKYIIFLTDG + N + + + K G + Sbjct: 300 -----------KKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDGEKI 336 >gi|88801582|ref|ZP_01117110.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] gi|88782240|gb|EAR13417.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] Length = 330 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 66/202 (32%), Gaps = 29/202 (14%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + S K+ + G+D++ LD+S SM ++L Sbjct: 39 LKVTILLLGMTFLIISLTNPKMGS-KLKTIKREGVDIVFALDISKSMLAEDIAP-NRLEK 96 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH----IQEKINRLIFG 250 + + I +++D + S R G++ ++ P+ +Q ++ Sbjct: 97 SKQIISKIIDRLGS-------DRVGIIVYAGNSYPLLPITTDHAAANMFLQNANPDMVSS 149 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T LE A + ++ +++I L+DGE+ +E+ Sbjct: 150 QGTAINEALELAKTYYNNDEQTN-----------RFLIILSDGEDHQ-----EETKQVAQ 193 Query: 311 EAKRRGAIVYAIGVQAEAADQF 332 G +Y IGV Sbjct: 194 NLANNGVKIYTIGVGTARGGPI 215 >gi|301784737|ref|XP_002927782.1| PREDICTED: collagen alpha-5(VI) chain-like [Ailuropoda melanoleuca] Length = 2524 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 41/193 (21%), Positives = 77/193 (39%), Gaps = 25/193 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S G++ G ++ +L I+ PD + G++ FS Sbjct: 623 EDMKADIMFLVDSSGS------IGLENFGKMKTFMKSLLAKIQIGPDRTH---IGVIQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K + F L + I + I+R+ T + L AK I Sbjct: 674 DKTREEFQLNKYFTQNEISDAIDRMSLIDKNTLTGNALISVDQYFTPAKGARIGI----- 728 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDGE D ++L + +G +++++GV Q + + Sbjct: 729 --KKFLILITDGEAQDAVRDPAKAL------RDKGVVIFSVGVYGANRTQLEEISGDGNL 780 Query: 342 FYSVQNSRKLHDA 354 + V++ L Sbjct: 781 VFQVESFDDLKAI 793 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 81/197 (41%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+N M L + ++K ++ VR G + +S Sbjct: 813 LDIVFVLDHSGSINTQEQENMMALT---------IHLVKKADVGSDRVRIGALKYSDYPE 863 Query: 229 QTFPLA--WGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + E + R T + L++ + E+ ++ + K+ Sbjct: 864 ILFHLGKYSNRSSVIEHLRRRRSTGGDTYTARALDH-----TNMMFTEEYGSRIQQNVKQ 918 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG + N+ N+ +L + + +G +YA+GV + + + + V Sbjct: 919 MLIVITDGVSHDRNLLNETAL----KLRNKGIDIYAVGVGQADQLELEAMAGNKSKTFHV 974 Query: 346 QNSRKLHDAFLRIGKEM 362 N KL D +L + + M Sbjct: 975 DNFNKLKDIYLPLQESM 991 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 68/200 (34%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ + + +A + PD V+ G V +S K+ Sbjct: 442 DIYFLIDGSTSIQGKHFEQIKEFMLAVTG------MFSIGPDK---VQVGAVQYSDKMRV 492 Query: 230 TFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + ++ +N T + L + + I + ++ Y Sbjct: 493 EFYINDNSNNVNLKNAILNIEQLQGNTYTGEALNFTLSIIKEDRKLRT------SQVPCY 546 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG ++ + L + ++A+G+ Q + +R Q Sbjct: 547 LIVLTDGRSTD------DVLEPAERLRAEQVTIHAVGIGEAIKVQLQQIAGGEERVSFGQ 600 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N L I E+V + Sbjct: 601 NFDSL----RSIKNEVVHRI 616 >gi|218461471|ref|ZP_03501562.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 459 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 54/166 (32%), Gaps = 26/166 (15%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE----KLEHIAKGHDDYKKY 286 PL +++ + L +T+ G+ + + + A D K Sbjct: 295 TPLTGDFAYLKSVVKNLTSEGSTRLDAGVVAGWYTLSPKWQGVWGDQSSPAPVSDSVHKV 354 Query: 287 IIFLTDGENSSPNIDNKESLFY---------------------CNEAKRRGAIVYAIGVQ 325 ++F+TDGE + D + + C K+ G +Y + Sbjct: 355 MVFMTDGEMN-TKYDPNDKFDWICSQTQSSACNAFATAARQTACTAMKKSGIEIYTLSYS 413 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 A+A ++NCA+ + + + + I + + + Sbjct: 414 ADADVVNIRNCATNTAHFFTASPATIKTVYETIAAAIRGDTLRLTQ 459 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 81/238 (34%), Gaps = 37/238 (15%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + + G+++I+ ++ + + +G I+ + + +L D ++L A K Sbjct: 1 LSRLIDDRDGAVAIIVILVAVPMLLAVGASIDYIRAYNGRTELQAAADSAVLAAAAK--- 57 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + L NG E + + + + Sbjct: 58 -------------YKSGMPEATIAKTINAFLSANG------EFETAVAGKPQVASDESEL 98 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 L + TF AN P+ I S + L++ +VLDVS SM + Sbjct: 99 CLDVADA-----VPTTFMKLANIQSVPISIRSCAALPGVKQ--LEIALVLDVSSSMIEE- 150 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 ++ ++ L S + + + +V FSS++ F LA ++ Sbjct: 151 ----NRFTPMQTAVAGFLQAFSSNTSLVDKTKISIVPFSSRVN--FGLA-NTAWLKSY 201 >gi|315266493|gb|ADT93346.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 627 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 77/216 (35%), Gaps = 30/216 (13%) Query: 163 SKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 KS +G ++ +LDVS SM +DKL + +++ + + + V+ VV +G Sbjct: 216 PKSQLGASNLVFLLDVSGSMA-----SVDKLPLLQTALKLLTAQLSAQDKVSIVVYAGAA 270 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +V Q + + +L G +T G+ AY +H Sbjct: 271 G----VVLDGASGNDTQTLNYALEQLSAGGSTNGGQGITQAYQL------AKKHFIPNGI 320 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--S 338 + +I TDG+ + D + + + K G + +G DQ ++ A Sbjct: 321 NR---VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKG 377 Query: 339 PDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L I K++ Q Sbjct: 378 NGNYAYIDTLNEARKVLVDELSSTLFTIAKDVKVQV 413 >gi|149732296|ref|XP_001503293.1| PREDICTED: collagen, type XXI, alpha 1 [Equus caballus] Length = 957 Score = 69.1 bits (167), Expect = 9e-10, Method: Composition-based stats. Identities = 46/237 (19%), Positives = 93/237 (39%), Gaps = 36/237 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLWIVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTT 253 +++I ++ ++ G+V +S V PL G + + G T Sbjct: 60 -----LVNITRNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSGENLMAAMESIHYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ +++A + +F AK K + LTDG++ D A+ Sbjct: 115 RTGKAIQFALDYLF---------AKSSRFLTKIAVVLTDGKSQDEVKD------AAEAAR 159 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 I++AIGV +E + L+ A S + V++ + A +I +E++KQ++ Sbjct: 160 ESKIILFAIGVGSETEEAELRAIANKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|42526759|ref|NP_971857.1| batA protein, putative [Treponema denticola ATCC 35405] gi|41817074|gb|AAS11768.1| batA protein, putative [Treponema denticola ATCC 35405] Length = 332 Score = 69.1 bits (167), Expect = 9e-10, Method: Composition-based stats. Identities = 50/263 (19%), Positives = 90/263 (34%), Gaps = 48/263 (18%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + C W + ++ V +K +D G +M +LD+S SM G ++ Sbjct: 56 LLCYLLWYCGIIFLIIALSEPVIFKNKQVYTDAGSSIMFLLDISPSMAAKDMSGETRIAA 115 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 A + IR+ + P + GL SS P + +++ L G Sbjct: 116 AKKIIRKF---VAKYPGDS----FGLTALSSSAALILPPTIDHKVFLSRLDSLSIGELGD 168 Query: 255 STP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 T G+ A + + + KL YI+ LTDGEN++ I+ K + Sbjct: 169 GTAIGMGLAVSSAYMTRTKLNSS---------YIVLLTDGENNTGEINPKTA---AKVLV 216 Query: 314 RRGAIVYAIGVQAEAADQF-----------------------LKNCA--SPDRFYSVQNS 348 + Y IG+ + LK A ++ S + Sbjct: 217 NKNIGFYVIGIGSSGYTTLEYTDRKTGKTYSGSIFSKFDELELKKIAQYGNGKYASASSP 276 Query: 349 RKLHDAFLRIGKEMVKQRILYNK 371 L + F I K++ + + + Sbjct: 277 EILENIFNTISKQVPAAQSNFTR 299 >gi|310641808|ref|YP_003946566.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246758|gb|ADO56325.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 600 Score = 69.1 bits (167), Expect = 9e-10, Method: Composition-based stats. Identities = 43/237 (18%), Positives = 84/237 (35%), Gaps = 35/237 (14%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 S + +S S +D ++V+DVS SMN + A + +ML Sbjct: 19 IMSSILAWQPQTANAASPSASKVDAVLVVDVSNSMNTSDPGKIG--NEAMKMFIDMLST- 75 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEY 261 + G+V ++ + + L Q ++ I+ L G+ T ++ G++ Sbjct: 76 -------QNDKVGIVAYTDVVQREKALLNISSEADKQELKTFIDGLNRGAYTDTSVGVKE 128 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN-------EAKR 314 A + + H I+ L DG N + EAK+ Sbjct: 129 AIRIL-QDGKTAGHAP--------MIVMLADGNNDFNKTTGRTESQSAQDMAQAVAEAKK 179 Query: 315 RGAIVYAIGVQAEA--ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 G +Y IG+ A+ L + A + + + ++ L + I +K ++ Sbjct: 180 SGVPIYTIGLNADGKLNKNKLADIAQQTGGKSFITSSADDLPNILSEIFASNLKLKV 236 >gi|332208765|ref|XP_003253479.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-4-like [Nomascus leucogenys] Length = 448 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 78/197 (39%), Gaps = 26/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ V+D S S+ + + + +L + P N R G++ +S Sbjct: 29 HTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVGVIQYS 79 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ I L+ T + ++YA N F E + Sbjct: 80 SQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEE 136 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 137 RVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 189 Query: 340 -DRFYSVQNSRKLHDAF 355 + + V++ L F Sbjct: 190 DEHVFLVESF-DLIQEF 205 >gi|317064189|ref|ZP_07928674.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313689865|gb|EFS26700.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 325 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 82/221 (37%), Gaps = 34/221 (15%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD---IGLDMM 172 I + +++ ++ + +Y I T + + + S ++ ++ GL++ Sbjct: 27 IGMSKRNRILDILKLKKYNFVQIIKTILMTLGAFMVVIALLSPQELLNEDTVEVKGLNIY 86 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 ++D S SM ++L A R++ +L +K R G + FS P Sbjct: 87 ALIDTSRSMMTE-DVYPNRLEAAKRTLENLLQGLKG-------DRIGFIPFSDSAYIQMP 138 Query: 233 LAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + IN LI G T+ LE A E K + K II Sbjct: 139 LTDDYSIGKNYINALDTNLISGGGTELYQALELA-----------EKSFKEINSDNKTII 187 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 L+DG + D+K F K V++IG+ + Sbjct: 188 VLSDG----GDFDDKSLKFV----KDNKMNVFSIGIGTDEG 220 >gi|218672731|ref|ZP_03522400.1| hypothetical protein RetlG_14377 [Rhizobium etli GR56] Length = 323 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 55/166 (33%), Gaps = 26/166 (15%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE----KLEHIAKGHDDYKKY 286 PL +++ + L +T+ G+ + + + A+ D K Sbjct: 159 TPLTGDFAYLKSVVKNLTSEGSTRLDAGVVAGWYTLSPKWQGVWGDETSPAEVSDSVHKV 218 Query: 287 IIFLTDGENSSPNIDNKESLFY---------------------CNEAKRRGAIVYAIGVQ 325 ++F+TDGE + D + + C K+ G +Y + Sbjct: 219 MVFMTDGEMN-TKYDPNDKFDWICSQTQSSACNAFATAAMQTACTAMKKSGIEIYTLSYS 277 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 A+A ++NCA+ + + + + I + + + Sbjct: 278 ADADVVNIRNCATNTAHFFTASPATIKTVYETIAAAIRGDTLRLTQ 323 >gi|68536401|ref|YP_251106.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|260577533|ref|ZP_05845473.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|68264000|emb|CAI37488.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|258604337|gb|EEW17574.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 663 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 38/241 (15%), Positives = 84/241 (34%), Gaps = 49/241 (20%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLD 204 AN+ S + + + +VLD S SM + G G ++ A ++ + +D Sbjct: 41 ANAEDDNNGSDGSTSSNDATGGDSKVAVVLDASDSMAEKDTGDGGTRMDAAKKAANDTID 100 Query: 205 IIKSIPDVNNVVRSGLVTFSSK--------------IVQTFPLAWGV-QHIQEKINRLIF 249 + + ++ ++ + S+ I L + +++KIN L Sbjct: 101 TLA------DSAQTAVIAYGSEESNAPDNRDKGCQDITTLASLGNNKPEDLEDKINGLEP 154 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T ++ A ++ + ++ II ++DG ID C Sbjct: 155 KGYTPIGNAIKKAAEELGSSGKRN-------------IILVSDG------IDTCAPPPVC 195 Query: 310 NEAKR---RGA--IVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKE 361 + A+ G ++ +G + + Q C S + S ++ L +A + Sbjct: 196 DVAEDIAGDGIDLAIHTVGFKVDDKAQKELECISEVSGGTYTSADDTEALTEALTDAAQR 255 Query: 362 M 362 + Sbjct: 256 V 256 >gi|323135950|ref|ZP_08071033.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] gi|322399041|gb|EFY01560.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] Length = 432 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 65/453 (14%), Positives = 133/453 (29%), Gaps = 117/453 (25%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R+F N +G ++I + L + ++ G ++ S K+ L+ D +L + Sbjct: 3 RSFLQNRRGGVAIFFGLALMPLALMAGGAVDFSQISRQKSALNQAADAGVLTALKEAR-- 60 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 E GK + + + + +G I + SLS + +Y Sbjct: 61 EQLKQGKPDWQSIAEKQGGKAFTNNASKIGGVSGTGATI-----NLSLSGGVLSGSLNYA 115 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN---- 182 +A + + + + + S + + D+ V+DVS SM Sbjct: 116 ANAPTHFL-----------RIAGLNTINLKGSASATMSAAQYRDIHFVIDVSASMGIGAT 164 Query: 183 --------------------DHFGPGMDKLGVATRSIREML----------DIIKSIPDV 212 + P D L A R+I L D + IP+ Sbjct: 165 KADQQAMQNSVGCAVACHHAEAADPATDNL-AAVRAIGATLRIDVVRKAVMDALAKIPN- 222 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIF 267 + R + +FS+ + FPL+ + + T L N + Sbjct: 223 DGSTRVAIHSFSNSLKTVFPLSTNIAGAISATQSIDLTNENGQGGTNFHYSLNQLNNLLA 282 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY------------------- 308 A + +++ TD S + + Sbjct: 283 SAG-----NGLTASQPRGFVLLATDAVEDSSLFFYADGVAPPFARQWVEPNFVVGNPSYF 337 Query: 309 --------------CNEAKRRGAIVYAI-------------------GVQAEAADQFLKN 335 C+ K +G + + G A + + + Sbjct: 338 AWGLHYVQAPDAANCSAIKAKGYTMMTLETEYLIPDGVYNPTFDAVRGDMGPAMTKSMTD 397 Query: 336 CAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 CAS PD ++ ++ +++ A + + V + Sbjct: 398 CASAPDYYFHAESPQEIDRAVQTMVSKTVNLSL 430 >gi|296156498|ref|ZP_06839336.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295893097|gb|EFG72877.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 446 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 52/312 (16%), Positives = 102/312 (32%), Gaps = 49/312 (15%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +GS+SI+ A+ L + ++GL +++ + +KA+L D +++ + N Sbjct: 9 RRQRGSVSIIVAVSLIALLGILGLAVDSGLGYVIKARLDAATDGAVIAAGEAVTRGSNQT 68 Query: 71 NGKKQKNDFSYRII-KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + N + + + SI+ D + Sbjct: 69 EQTNNAQQAATAFFTANYPAGFLGSSVAAG-------------TPSIVFDA--GTVTIGM 113 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 ++ +P F S + + S LDM V+D + S+N P Sbjct: 114 TAQASVPVSF--------SKVLGFKVLNVSSTSQAIRKTLDMAFVIDTTGSLNTSGVP-- 163 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQEKI 244 + L+ D R L+ F+ V P + + + I Sbjct: 164 ---AAVRSNAVAFLNNFDVTND-----RVALMHFAYGTVVDVPFSGNARGFDRTTMTADI 215 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 N+ F +T S + A N++ + + I+F +DG +S + Sbjct: 216 NKYTFNGSTNSAEAIWNARNQL--------NTVISQPSSLRVIVFFSDGAPNSFSSFFTT 267 Query: 305 SLFYCNEAKRRG 316 + CN K G Sbjct: 268 NQSGCN--KSAG 277 >gi|149031330|gb|EDL86328.1| rCG38899 [Rattus norvegicus] Length = 1029 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 73/202 (36%), Gaps = 26/202 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ V+DVS SM KL +++ ++L +++ N ++TFS + Sbjct: 248 VVFVIDVSGSMFG------TKLQQTKKAMDKILSDLQTSDSFN------IITFSDTVNIW 295 Query: 229 ----QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + ++R+ T L A + + + ++ Sbjct: 296 KAEGSIQATVQNIHNAKNYVSRMEANGWTDINAALLAAASVLNHSNQEPGKGRGVGQIP- 354 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 I+FLTDGE ++ L +A ++++ +A L+ + ++ Sbjct: 355 -LIMFLTDGEPTAGETTPSVILSNVRQALAHRVSLFSLAFGDDADFSLLRRLSLENQGEA 413 Query: 342 ---FYSVQNSRKLHDAFLRIGK 360 + + +L + I + Sbjct: 414 RRIYEDADAALQLEGLYAEISR 435 >gi|26350291|dbj|BAC38785.1| unnamed protein product [Mus musculus] Length = 810 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + D++ ++D S S+N + + I ++L + PDV R GL+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLDIGPDV---TRVGLL 99 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L ++ + R+ + T + ++YA N F E + Sbjct: 100 QYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLR- 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + I+ +TDG +A+ G +++AIGV + + Sbjct: 159 --ENVPRIIMIVTDGRPQDSVA------EVAAKARNTGILIFAIGVGQVDLNTVKAIGSE 210 Query: 339 P--DRFYSVQN---SRKLHDAFLR 357 P D + V N L F Sbjct: 211 PHKDHVFLVANFSQIESLTSVFQN 234 >gi|228471033|ref|ZP_04055877.1| BatB protein [Porphyromonas uenonis 60-3] gi|228307253|gb|EEK16276.1| BatB protein [Porphyromonas uenonis 60-3] Length = 342 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 64/196 (32%), Gaps = 21/196 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F + + P L T +S++ +G+D+ +DVS SM D++G A Sbjct: 61 FKLLSIALLIVALARPQLYTH-APVSAQQTVGVDLAFCIDVSNSMA-ARDVKPDRIGFAK 118 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + + + + R ++ F+ PL + + + + G + Sbjct: 119 QIVTHTMQQLAGS-------RIAMIVFAGGAYIRLPLTPDLPTARTFLADIQPGMVSNQG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L A E+ K +I LTDGE+ ++ K +G Sbjct: 172 TNLGQAL-------ERSAQALSAPSRAGKAVILLTDGEDHEGGLE-----EGIKRLKEQG 219 Query: 317 AIVYAIGVQAEAADQF 332 Y + + Sbjct: 220 IKAYVVTIGLPEGATI 235 >gi|110636839|ref|YP_677046.1| outer membrane protein [Cytophaga hutchinsonii ATCC 33406] gi|110279520|gb|ABG57706.1| possible outer membrane protein [Cytophaga hutchinsonii ATCC 33406] Length = 1313 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 76/210 (36%), Gaps = 27/210 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + V ++ + +++ +D+S SM + P +A + +++ + Sbjct: 72 IEQQVVKVVNPAAVKPKSISLVLTIDISESMQKQYMP------LAKNAAAAIVNKLPL-- 123 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ + +F+ + + I L+ T YNK F Sbjct: 124 ---DISECAVTSFNDVSFINTDFTRDRFKLLQSIQTLVPAGGTD--------YNKGFIKS 172 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 ++K +IFLTDG + +AK GA VY I + A + Sbjct: 173 NAGGLDILKKGLHEKVLIFLTDGY------GDVNPTEIIQQAKSIGAKVYVITLGMSAPE 226 Query: 331 QFLKNC-ASPDRFYS-VQNSRKLHDAFLRI 358 + + A+ +Y V + ++++ ++ I Sbjct: 227 ELKRIVTATNGSYYENVISEQEINAVYMSI 256 >gi|327265811|ref|XP_003217701.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Anolis carolinensis] Length = 914 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 74/204 (36%), Gaps = 17/204 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM K+ +++++L+ + N VV SG ++ + Sbjct: 270 IIFVIDKSGSMIGK------KIQQTIEALQKILEDLNPEDHFNLVVFSGEIS-EWQSSLL 322 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 V+ ++ + ++ T L A N + I+ L Sbjct: 323 KATEENVELAKQYVRTIMAQGGTDINGALLTAINSL---DRATSAELLPEQSIS-MIVLL 378 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR------FYS 344 TDG+ + + +A +Y +G + + FL+ A +R + Sbjct: 379 TDGQPTVGETNVNSIQTNIKKANDGNYFLYCLGFGFDVSYTFLEKLALENRGIARRIYED 438 Query: 345 VQNSRKLHDAFLRIGKEMVKQRIL 368 + +L D + + ++K+ ++ Sbjct: 439 SDAALQLQDFYQEVATPILKEIVM 462 >gi|118443684|ref|YP_877685.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] gi|118134140|gb|ABK61184.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] Length = 1252 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 72/211 (34%), Gaps = 45/211 (21%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 ++ + + +K + N + GLV+F ++ ++ L + ++ IN Sbjct: 169 QSSYNEKNRLKHAQESAIKFVKKFENDKN-ISIGLVSFDTRAIEQKELTSSLSEVKSSIN 227 Query: 246 RLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG---------- 293 L T GL+ A + KG++D KY+I ++DG Sbjct: 228 NLKVAYNGATNIEAGLKSAQKIL----------KKGNEDADKYVILMSDGFPTAFDYAGE 277 Query: 294 --------------------ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 N K S+ + K+ G + IG A + L Sbjct: 278 KFEENFNEHEVQDNTFINFGYNDYRGYAMKHSINQADSLKKVGINSFIIGFSDGANSEKL 337 Query: 334 KNC--ASPDRFYSVQNSRKLHDAFLRIGKEM 362 A+ + +N+ L+ A+ +I ++ Sbjct: 338 NKIAKAAGGEYEEARNTDALNGAYNKIETKV 368 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 48/130 (36%), Gaps = 23/130 (17%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-------GVQHIQE 242 ++ + + +D K + +V +SSK + + I++ Sbjct: 724 SRIDSVKKVANDFVDKFKD----DENTEIAIVRYSSKADVVLDNSNKVFLSSKDNETIKK 779 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN--I 300 +IN L T G+ +Y+ + K D +KY+I +TDG ++ Sbjct: 780 RINSLKADVATNIGDGIRKSYSIL----------DKCDKDSEKYMILMTDGVPTAYTCYA 829 Query: 301 DNKESLFYCN 310 + ++L C Sbjct: 830 NTIKTLNNCK 839 >gi|255557532|ref|XP_002519796.1| protein binding protein, putative [Ricinus communis] gi|223541035|gb|EEF42592.1| protein binding protein, putative [Ricinus communis] Length = 477 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 50/202 (24%), Positives = 76/202 (37%), Gaps = 35/202 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS GLD+++VLD+S SM G ++KL A + + L I R +V Sbjct: 55 SSNDRPGLDLVVVLDLSGSME---GEKIEKLKAAILFMIKKLSSID---------RLSIV 102 Query: 222 TFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TFS + PL + ++ IN L T GL+ + D + +A Sbjct: 103 TFSRDARRLCPLRQITENSQKDLENLINGLHAYGAANITAGLQTGLKVLNDRRFTGGRVA 162 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 I+ ++ E + N D+ + + N V+ G A LK A Sbjct: 163 T--------IMLVSSSEQN--NGDDADQILVGN------VPVHTFGFGAYHEPGVLKAIA 206 Query: 338 S---PDRFYSVQNSRKLHDAFL 356 F VQN L+ AF Sbjct: 207 HNSIGGTFSDVQNMDNLNKAFS 228 >gi|221108467|ref|XP_002170770.1| PREDICTED: similar to collagen, type XXIX, alpha 1, partial [Hydra magnipapillata] Length = 592 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 102/267 (38%), Gaps = 27/267 (10%) Query: 92 FRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA 151 NELR G + I + ++S + D + N A +E F + Sbjct: 135 IANELRSAGVTIIVIGITNAVNVSELSDIAGGEENAYATESFE-KLKDVNFLDVIKTK-- 191 Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + + K+ + +D++ +LD S S+ ++ D L + Sbjct: 192 ---MCETAKVEPTCEAIVDIVFLLDSSGSLRKYYQNEKDFLKSVISAFGVSF-------- 240 Query: 212 VNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFD 268 N R+ ++TFS + L + +E ++ ++ GSTT+ L A ++F+ Sbjct: 241 --NGTRAAVITFSYHAQLSIKLNKYSNLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFE 298 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 LE+ A+ K + LTDG + ++ + NE + G + IG+ Sbjct: 299 ----LENGARPGVA--KILFLLTDGSQTQ-ERGSENPVAIANELRSAGVTIIVIGITNAV 351 Query: 329 ADQFLKNCASP-DRFYSVQNSRKLHDA 354 L + A + Y+ ++ KL D Sbjct: 352 NVSELSDIAGGEENAYAAESFEKLKDV 378 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 52/267 (19%), Positives = 99/267 (37%), Gaps = 27/267 (10%) Query: 92 FRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA 151 NELR G + I + ++S + D + N A +E F + Sbjct: 331 IANELRSAGVTIIVIGITNAVNVSELSDIAGGEENAYAAESFE-KLKDVNFLDVIKTK-- 387 Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + + + +D++ +LD S S+ ++ D L A + + Sbjct: 388 ---MCETATFQPTCEAVVDIVFLLDSSGSLRKYYQNEKDFLKSAISAFGVSV-------- 436 Query: 212 VNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFD 268 N R+ ++TFS + L + +E ++ ++ GSTT+ L A ++F+ Sbjct: 437 --NGTRAAVITFSYHAQLSIKLNKYSNLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFE 494 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAE 327 LE+ A+ K + LTDG + ++ + NE + G + IG+ A Sbjct: 495 ----LENGARPGVA--KILFLLTDGSQTQ-ERGSENPVAIANELRSAGVTIIVIGITNAV 547 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDA 354 + + Y + KL D Sbjct: 548 DVSELFDIAGGEENAYFADSFEKLKDV 574 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 80/199 (40%), Gaps = 21/199 (10%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+ + +D++ +LD S S+ ++ D L + N R+ Sbjct: 1 KVEPTCEAIVDIVFLLDSSGSLRKYYQNEKDFLKSVISAFGVSF----------NGTRAA 50 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++TFS + L + +E ++ ++ GSTT+ L A ++F+ LE+ Sbjct: 51 VITFSYHAQLSIKLNKYSNLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFE----LENG 106 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A+ K + LTDG + ++ + NE + G + IG+ L + Sbjct: 107 ARPGVA--KILFLLTDGSQTQ-ERGSENPVAIANELRSAGVTIIVIGITNAVNVSELSDI 163 Query: 337 ASP-DRFYSVQNSRKLHDA 354 A + Y+ ++ KL D Sbjct: 164 AGGEENAYATESFEKLKDV 182 >gi|262171974|ref|ZP_06039652.1| protein BatA [Vibrio mimicus MB-451] gi|261893050|gb|EEY39036.1| protein BatA [Vibrio mimicus MB-451] Length = 335 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 74/218 (33%), Gaps = 36/218 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ ++ G D++MV+D+S SM + G + +L A + +R+ + Sbjct: 87 EVQTREAFGRDVLMVVDLSGSMEEKDFATKSGEQLSRLTAAKKVLRDFVTQ-------RQ 139 Query: 215 VVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R GL+ F P ++ ++ + G +T + + Sbjct: 140 GDRFGLILFGDAAFIQTPFTADQDVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQS-- 197 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD- 330 + ++ LTDG ++ + ++ R +Y I + Sbjct: 198 -------PSTSQDQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIR---IYVIAMGDPENVG 247 Query: 331 ------QFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 + +S R + + +L++A+ I + Sbjct: 248 EQPLDMDVVSRVSSLTQARSFVAIDQSQLNEAYQVIDQ 285 >gi|297471458|ref|XP_002685231.1| PREDICTED: collagen type VI alpha 4-like [Bos taurus] gi|296490811|gb|DAA32924.1| collagen type VI alpha 4-like [Bos taurus] Length = 780 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 46/228 (20%), Positives = 88/228 (38%), Gaps = 32/228 (14%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 S+ AP+ TS + + D+ ++D S S D + E++ + Sbjct: 149 ASAAAPVGHTSPGCVHIEKA---DIYFLIDGSGS------THQDDFLAMKVFMNEVIKMF 199 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAY 263 PD V+ G+V +S ++ F L+ V ++ ++ + G TK L Sbjct: 200 HVGPDR---VQFGVVQYSDEVSPQFTLSQHSSVAGLEVAVDSIQQKGGGTKMGEALGSMI 256 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 D+ + Y+I +TDG++ P D E+L + G +YA+G Sbjct: 257 QVFADSAR---------SNVPWYLIVVTDGQSMDPVADAAEAL------RGHGVTIYAVG 301 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 V+ +A L+ A F+ V + L + +++ N+ Sbjct: 302 VR-DANIAELQEIAEDRMFF-VHDFESLKTIQQEVVQDICSLETCKNR 347 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 62/188 (32%), Gaps = 23/188 (12%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ M+ ++ M++ D ++ GL+ FSS + F L Sbjct: 350 DIIFLMDGSESISPKDFEKMKEFMKRMVNQSNIGADE---IQIGLLQFSSDPQEEFRLNR 406 Query: 236 -----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 V + ++ T + L + ++ G +Y+I + Sbjct: 407 YSSKVDVHRAISDVKQI--NGGTYTGKALNFTLPFFGSSRG-------GRPSVHQYLIVV 457 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 TDG + ++L + R I++AIGV Q L+ + Y + Sbjct: 458 TDGVSRDNVALPAKAL------RDRNIIIFAIGVGEVKFSQLLEITNDQSKVYYEEKFES 511 Query: 351 LHDAFLRI 358 L + I Sbjct: 512 LQNLEKEI 519 >gi|319784437|ref|YP_004143913.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170325|gb|ADV13863.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 553 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 61/173 (35%), Gaps = 37/173 (21%) Query: 232 PLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 PL +++ +++ GS T + GL + + A + K ++ Sbjct: 379 PLTDDFDKLRKAASQMTEWNGSGTNVSEGLSWGMRVLSPAAPYTDGAPWKTPGISKIVLL 438 Query: 290 LTDGENSSPNIDNKE--------------------------------SLFYCNEAKRRGA 317 LTDGEN +E + C + K G Sbjct: 439 LTDGENVVYGASEQEPTKSDYTSYGYLAGGRFGSDNQTTAARNVDGWTKNVCTQLKNEGV 498 Query: 318 IVYAIGVQAE--AADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +Y + +Q++ A CAS P +Y+V + KL + FL+I K ++ Sbjct: 499 QIYTMVLQSDTAANRALYSACASDPSNYYAVNDPTKLPNVFLQIANNFTKLQL 551 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 32/203 (15%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F+ + G+ +++ + LPVI + + S K+ L LD + L ++ Sbjct: 3 LNKFWRSKSGNFTLVLGLGLPVILTAVAFATDVSTLMRAKSNLQNALDSANLASS----- 57 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + +Q + + ++ +D Sbjct: 58 ------HLGDLDITRNDAFNRYFQANIVGHGE-----------LDNAQATLTVDKGVNFV 100 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 AV+ ++ F +S H + ++ +S+ L++++VLD + SM Sbjct: 101 KTKAVASADVHLNFAF--LFGDSKHIVVDASAV-----ESNNQLEVVLVLDNTGSMA--- 150 Query: 186 GPGMDKLGVATRSIREMLDIIKS 208 G M L AT+S+ + L+ KS Sbjct: 151 GARMTALRTATKSLLDTLEAAKS 173 >gi|307595413|ref|YP_003901730.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] gi|307550614|gb|ADN50679.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] Length = 495 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 88/258 (34%), Gaps = 43/258 (16%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT--------SSVKISSKS 165 + D++ Y +S Y++ + + + V + Sbjct: 255 VRFTDIDKYPTYVVSVR-EYKIGDNYFDVDLQKTAMNLSRKSMMHKLFTNKDIVVKEYAN 313 Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++ LDVS SM + GM K+ +A ++ + + + D R +V F+ Sbjct: 314 VKTIDIVLCLDVSGSMRE-LSNGMPKIEIAKDAVAQYIQFLSKTND-----RLAMVLFNF 367 Query: 226 KIVQTFPLAWGVQHIQEK-------INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + WG+ ++ + + G T LE + + ++ Sbjct: 368 RADVL----WGLHQVRRYWQQMNYMLKYVYAGGGTNLANALERSREVLTRSRSN------ 417 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 K++I +TDG +++ + +R G + I + + D+ L + Sbjct: 418 -----SKHVICVTDGR----TVNSSMCIKEAVRLRRSGTTISTIAIGENSDDELLMRLSK 468 Query: 338 -SPDRFYSVQNSRKLHDA 354 F + + L A Sbjct: 469 IGGGLFIKISSIHDLGKA 486 >gi|119569860|gb|EAW49475.1| von Willebrand factor A domain containing 2 [Homo sapiens] Length = 576 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 43/186 (23%), Positives = 76/186 (40%), Gaps = 27/186 (14%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 M +LD S S+ G + + D + P+ VR G FSS F Sbjct: 1 MFLLDGSNSV------GKGSFERSKHFAITVCDGLDISPER---VRVGAFQFSSTPHLEF 51 Query: 232 PL-AWGVQ-HIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL ++ Q ++ +I R++F T++ L+Y ++ + + +I Sbjct: 52 PLDSFSTQQEVKARIKRMVFKGGRTETELALKYLLHRGLPGGR--------NASVPQILI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 +TDG++ + L K RG V+A+GV+ ++ L AS R V + Sbjct: 104 IVTDGKSQGDVALPSKQL------KERGVTVFAVGVRFPRWEE-LHALASEPRGQHVLLA 156 Query: 349 RKLHDA 354 ++ DA Sbjct: 157 EQVEDA 162 >gi|254472518|ref|ZP_05085918.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958801|gb|EEA94001.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 479 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 65/458 (14%), Positives = 133/458 (29%), Gaps = 100/458 (21%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 + F + G +++L A L+ ++ + G+ I+ F + ++ LD ++L A K+ Sbjct: 15 QFKKFNRSEDGVVAVLVAFLMVLLIVFAGMAIDFGLGFNTRRAVNQALDAAVLAVANKLS 74 Query: 65 NQE-NGNNGKKQKNDFSYRIIKNIWQTDFR------NELRENGFAQDINNIERSTS---- 113 E + N + + +KN D + TS Sbjct: 75 TTELSSNTVDSLIDQYFEENLKNSVGGDVVHTKPVVTYDPKGDTVAATATATVKTSFLPV 134 Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 L ++ + L+ S F + S + + S LD ++ Sbjct: 135 LKLLNSESGDFGELTVTSSSTARFPKTKVEVAVVVDVTGSMSGSIGSLKTASRDMLDTLL 194 Query: 174 VLD---VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-------- 222 D + + + P + + R+ V+ VR + Sbjct: 195 PDDNTRLQSRVRISYVPYNVGVKLDKTLARKATFEKSQYGCVHARVRDLAYSGENHDYED 254 Query: 223 -------------FSSKIVQTF-PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 +S PL I+ IN L S T G+ + + + Sbjct: 255 EDDDERVDYIGTNYSWCPNAQMVPLTNDRTKIESSINALRASSATAGQIGIAWGWYTLSP 314 Query: 269 A-----KEKLEHIAKGHDDYKKYIIFLT---------------------------DGENS 296 + + ++ +KY + +T D +NS Sbjct: 315 EWRGFWPTESKPDFYDNNGVRKYAVLMTDGSFNAYYAADYSKADAEHKKLIKNKSDVQNS 374 Query: 297 SPNIDN------------------------------KESLFYCNEAKRRGAIVYAIGVQA 326 +D+ K + C K+ ++Y + + Sbjct: 375 QDPMDSGKLDADDHKKIASKVKWEYDYSSSLSGVPFKTASNLCKNMKKEDIVIYTVFFGS 434 Query: 327 E-AADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEM 362 + + ++ CAS + FY N L AF I ++ Sbjct: 435 DYKGKKIMEECASNSETFYHATNQSALIQAFSSIANDI 472 >gi|254506100|ref|ZP_05118244.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] gi|219550918|gb|EED27899.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] Length = 415 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 54/422 (12%), Positives = 123/422 (29%), Gaps = 71/422 (16%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 G ++L A+++P +F + L + + KA++ + ++L A N + Sbjct: 2 RKIKKQSGHAALLFAMIIPGLFGIFTLATDGARALQTKARIEDASEIAVLAIAAH--NDD 59 Query: 68 NGNNGKKQKNDFSYRIIKN-IWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 N ++ R I R+ + G N ++ + + Sbjct: 60 NQDSQGAGSGSRVNRQIATDYLNAYLRDSTQLTGLKVKKYNCDQIAECRAGLARGEPRF- 118 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHA-----PLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 +YE+ W + + +D++ V D S SM Sbjct: 119 ----FQYEIEVSSVQDTWFPGNDSIEGFGDTFSAKGAAVARKYQSEAVDIIFVSDYSGSM 174 Query: 182 NDHFGPGMDK----LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 ++ G ++ L + + + L + + +N GL F+ Sbjct: 175 AWNWSGGRNRKYIDLRNIIQEVTDELQKFNDLNNTDNNT-VGLTAFNYYTKTVPSNRSNH 233 Query: 238 QHIQEKIN---RLIFGST---------------------------TKSTPGLEYAYNKIF 267 + + +N R T T + + + Sbjct: 234 CFMTQLVNPNGRFSASQTVRNIFVEKNNRYCVNHGDSSRFQDLPLTDNYSSFNNSVRSFY 293 Query: 268 DAKEKL--------EHIAKGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAK- 313 + + + ++ +I L+DGE+ P+ ++ N Sbjct: 294 PNHGTASFQGIIRGAQMLRKGRNPRRLLIVLSDGEDGDPSRHMQLVNAGMCSTIVNTLSG 353 Query: 314 -------RRGAIVYAIGVQAEAAD-QFLKNCASPDRFYSVQNSRK-LHDAFLRIGKEMVK 364 + A + +G + + L+ C + Y QN L+ I +E+ Sbjct: 354 DLTPDGHKVKARLAVVGFDYDVNKNRALQKCVGAENVYKAQNRDDILNKILELITEEIGH 413 Query: 365 QR 366 + Sbjct: 414 LK 415 >gi|217974408|ref|YP_002359159.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217499543|gb|ACK47736.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 627 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 76/216 (35%), Gaps = 30/216 (13%) Query: 163 SKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 KS +G ++ +LDVS SM DKL + +++ + + + V+ VV +G Sbjct: 216 PKSQLGASNLVFLLDVSGSMA-----STDKLPLLQTALKLLTAQLSAQDKVSIVVYAGAA 270 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +V Q + + +L G +T G+ AY +H Sbjct: 271 G----VVLDGASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQL------AKKHFIPNGI 320 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--S 338 + +I TDG+ + D + + + K G + +G DQ ++ A Sbjct: 321 NR---VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKG 377 Query: 339 PDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L I K++ Q Sbjct: 378 NGNYAYIDTLNEARKVLVDELSSTLFTIAKDVKVQV 413 >gi|89094518|ref|ZP_01167457.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] gi|89081254|gb|EAR60487.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] Length = 445 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/200 (20%), Positives = 73/200 (36%), Gaps = 27/200 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +++ I ++ +VLD S SM DKL A + ++ + V+ +V Sbjct: 63 QTQARIPANIAIVLDKSGSMQG------DKLFRAKEAAIMAINRLSQNDIVS------VV 110 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ S++ P I INR+ T G+ N++ + + Sbjct: 111 SYDSRVNVVVPATKVSDTNTIARAINRIQANGNTALFAGVSKGANELRKFLDLNKVNR-- 168 Query: 280 HDDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 +I L+DG N P+ N+ + AK G V IG+ + + A Sbjct: 169 -------VILLSDGLANIGPSTPNELGKLGLSLAKE-GMSVTTIGLGLGYNEDLMTQLAG 220 Query: 338 -SPDRFYSVQNSRKLHDAFL 356 S V+N+ L F Sbjct: 221 FSDGNHAFVENADDLARVFQ 240 >gi|327412874|emb|CAX67882.1| putative Von Willebrand factor, type A [Salmonella bongori] Length = 325 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 71/203 (34%), Gaps = 33/203 (16%) Query: 170 DMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++++LDVS SM + G +L S+R+ + R GLV F+S+ Sbjct: 97 DVVLILDVSGSMAKNDVQGGATRLQAVQESVRKFVAA-------RQSDRIGLVIFASQAW 149 Query: 229 QTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P++ Q +Q +IN+L G T L + + + K Sbjct: 150 PFAPVSEDKQALQTRINQLSPGMVGQQTAIGDALGVGVKLL---------DNTTNTEASK 200 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA- 337 I LTDG +++ + + A V+ I + ++N A Sbjct: 201 LAILLTDGNDTASQLAPALAAQL---AAAHHVQVHTIAFGDINSTGDDKVDLPLMQNIAQ 257 Query: 338 --SPDRFYSVQNSRKLHDAFLRI 358 + + + L + I Sbjct: 258 ITGGQSWTAANSGAALDSVWKEI 280 >gi|256822867|ref|YP_003146830.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796406|gb|ACV27062.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 986 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 48/298 (16%), Positives = 93/298 (31%), Gaps = 68/298 (22%) Query: 135 MPFIFCTF-PWCANSSHAP-LLITSSVKISSKSDIGLDMMMVLDVSLSMND--------- 183 M C + P LL ++ +++++VLDVS SM Sbjct: 1 MCIKKCLISALLIVAGSVPALLHADDTEVYFGQSQPVNLLLVLDVSGSMAWTTDACRLNR 60 Query: 184 ------HFGPGM---DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-----SKIVQ 229 PG +L + ++ LD D+ + V+ G++T+S + + Sbjct: 61 WGQPYPSCYPGNGEKSRLDIMKEALELFLD------DLPDNVKVGILTYSAGNNIDLLHE 114 Query: 230 TFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ + I+ L T + L A + + L I Sbjct: 115 VKQLSDNNHKATLLTTIDGLEANGGTLTAGALYEAGSYFRGQYDNLPSPITPGCSNASNI 174 Query: 288 IFLTDGENSSPNIDN-------------------------------KESLFYCNEAKRRG 316 +FLTDG+ +S + + ++ + Sbjct: 175 VFLTDGQPNSMSYNGYSYRNSIINMTGSSCARSDDGKECSEKLAGFLSTVDQIEDLTPSK 234 Query: 317 AIVYAIGVQAEAAD--QFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 + I E + FL+N A Y+ ++ L DAF + ++Q ++ Sbjct: 235 VKTHTIAFALEDNNARTFLENVADAGNGQSYTADSTDGLVDAFKSSIQTDIEQSMMVT 292 >gi|196233777|ref|ZP_03132616.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196222139|gb|EDY16670.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 883 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 71/222 (31%), Gaps = 24/222 (10%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 T PW + + +I +++ ++DVS SM + + L Sbjct: 392 VATCPWAPEHRLVRVGLKGR-EIPKDERPPSNLVFLIDVSGSM--NMPNKLPLLQKCFSL 448 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + E L R +VT++S + +Q I+ L G T + G Sbjct: 449 LVEQLGPKD---------RVSIVTYASGTKLVLEPTQDKEAMQTAIDGLHAGGGTHGSSG 499 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++ AY + + +I TDG+ + + E L + G Sbjct: 500 IDLAY-------RMAQQSFIPGGTNR--VILATDGDWNIGITNQSELLSMITRKAKSGVF 550 Query: 319 VYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + +G + D L A + + ++ F+ Sbjct: 551 LTVLGFGLDNLKDSMLVKLADHGNGHYAYIDTEQEARKVFVD 592 >gi|241113143|ref|YP_002972978.1| hypothetical protein Rleg_4788 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861351|gb|ACS59017.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 461 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 42/326 (12%), Positives = 116/326 (35%), Gaps = 17/326 (5%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F +R + G+++I+ A+ L + + +G + + V+ ++ LD +L+ + Sbjct: 21 FKTLRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAAVKE 80 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + +K + F ++ + D + + + +T + I Sbjct: 81 IDTDDAVALKQKVSDWFHAQVENSYTLGDINIDTSNHKITATASGTVPTTLMRI---ANI 137 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL------- 175 ++S S + P + + +L+ ++ + G+ Sbjct: 138 DTVDVSVASAVKGPATSYLNVYIVIDTSPSMLLAATTAGQATMYSGIGCQFACHTGDAHK 197 Query: 176 --DVSLSMNDHFGPGMD---KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + + + N + + + VA +++++L +I + + ++ GL + + + Sbjct: 198 IGNKTYNNNYEYSTAKNIKLRADVAGDAVKDVLALIDTSDSNHQRIKVGLYSLGDTLTEV 257 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK--LEHIAKGHDDYKKYII 288 + ++ +G T+ ++ Y + K+K + K ++ Sbjct: 258 LAPTLSTDTARNRLTDASYGLTSATSKAATYFDVSLTTLKQKVGTGGDGTASNSPLKLVL 317 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKR 314 LTDG S + + N+A Sbjct: 318 LLTDGVQSQREWVTDKVTWSNNKAIS 343 >gi|125559999|gb|EAZ05447.1| hypothetical protein OsI_27661 [Oryza sativa Indica Group] Length = 704 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 49/235 (20%), Positives = 88/235 (37%), Gaps = 38/235 (16%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 A HA + + ++ + +D++ VLDVS SM + KL + R++ ++D Sbjct: 208 AVLVHAKAPSIAVAEATAAARAPVDLVTVLDVSGSMEGY------KLTLLKRAMGFVIDK 261 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEY 261 + R +V+FS + L G + + L G T GL Sbjct: 262 LGPGD------RLAVVSFSYNAQRVIRLTRMSDDGKASAKSAVESLAAGGGTNILKGLVE 315 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-------KR 314 A K+FD + +A +I L+DG+++ + N + KR Sbjct: 316 A-AKVFDGRRYRNAVAS--------VILLSDGQDTYNVNGGWGASNSKNYSVLVPPSFKR 366 Query: 315 RG---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 G V+ G + + A + F ++N + DAF + IG + Sbjct: 367 SGDRRLSVHTFGFGTDHDAAAMHAIAEETGGTFSFIENQAVVQDAFAQCIGGLLS 421 >gi|260823583|ref|XP_002606160.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] gi|229291499|gb|EEN62170.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] Length = 515 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 65/185 (35%), Gaps = 23/185 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+DM+ VLD S S+ G D + ++D + R G+V +S ++ Sbjct: 175 GMDMVFVLDGSGSV------GADNFETVKDFVVSVVDGFEIGQSR---TRIGVVQYSDEV 225 Query: 228 VQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L +Q I+ + T + L Y + F + Sbjct: 226 QNEFNLTEYGNKADVQSAISNITYLQGRTYTGAALRYMTDVSFSE---EAGARPPYQAIP 282 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RF 342 K I +TDGE + S A G V+AIG+ + + PD Sbjct: 283 KVGIVVTDGEATDNVQGPASS------AHEAGVNVFAIGIGGYDVRELRQIATDPDATHV 336 Query: 343 YSVQN 347 ++V N Sbjct: 337 FAVDN 341 >gi|149620125|ref|XP_001521750.1| PREDICTED: similar to Collagen, type VI, alpha 1 [Ornithorhynchus anatinus] Length = 1163 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 82/208 (39%), Gaps = 19/208 (9%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMN---DHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 K + D +D+ VLD S S+ FG ++++ + T+ + L D N Sbjct: 173 PQKAIAFQDCPVDLFFVLDTSESVALRVKPFGYLVEQVKIFTKKFIDKLTDRYYRCDRNL 232 Query: 215 VVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 V +G + +S + L G +++++++ + G T + ++ ++ Sbjct: 233 VWNAGALHYSDSVELIQGLTRMPSGQKNLKDRVEAVQYIGKGTHTDCAIKRGIEELLIGG 292 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAA 329 H KY+I +TDG + L NEAK G V++I + Sbjct: 293 --------SHQKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSIAISPGHL 344 Query: 330 DQFLKNCASPDRF---YSVQNSRKLHDA 354 + L A+ + ++V + + ++ Sbjct: 345 EPRLSIIATDHTYRRNFTVNSEENVDES 372 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 77/226 (34%), Gaps = 38/226 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 T +K +S S D+ +++D S S+ H + + ++ + + S + Sbjct: 957 YTCPIKFTSSS----DITILVDSSTSVGSH------NFKTSKQFVKRLAERFLSADKTDT 1006 Query: 215 V-VRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 VR +V +S + Q + I I+ + F T + Y + Sbjct: 1007 ADVRVSVVQYSGRNQQKLEAQFLQNYTEIASIIDDMEFINDATDVNAAIRYVTTLYQKSS 1066 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + KK ++ +DG +S I K EA+R G +Y + V A + Sbjct: 1067 PRG---------VKKRLLLFSDG--NSQGITGKAIEAAVQEAQRAGIEIYVLVVGRHANE 1115 Query: 331 QFLKNCASPD-----------RFYSVQNSRKLHDA--FLRIGKEMV 363 ++ + + V + L + + +++ Sbjct: 1116 PNIRVLVTGKTTEYDVAYGERHLFRVPDYHALLQGVFYQTVSRKIS 1161 >gi|313235286|emb|CBY10850.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 47/282 (16%), Positives = 94/282 (33%), Gaps = 33/282 (11%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 + +L + + I + ++I + +Q ++ +Y ++ Sbjct: 507 EAAKKLHGQDVSVFVIGIGNESQMNIKMMNQIATEPITKHIKY---ANTVEGINKFKNAL 563 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + S +DM +V D S S+ D E +D + Sbjct: 564 TGQICEDVRSKQSCDSASMDMAIVFDGSDSV------KADNFKKLKTWTGEFIDKLGVQE 617 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDA 269 LV +++ I++ L+ V ++EK+ ++ F T + LE A + + Sbjct: 618 YGAQ---VALVKYATSIIKVSELSSDVDELKEKLMKVPFIQGKTNTGGALERAQQMLAEG 674 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + K I+ +TDG+ + D + + K+ ++Y IGV Sbjct: 675 RP----------SVPKIILLITDGDAT----DKERLDAQIEKLKKSNILIYTIGVGDLID 720 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 L A+ + F V +R F I K K Sbjct: 721 RNELNRIATDEDF--VYETRD----FDSISKIKSSLLGRVCK 756 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 65/187 (34%), Gaps = 29/187 (15%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 DI +D+ ++D S S+ S+ ++ + V+ Sbjct: 760 PKTSGVCGDISVDLQFIVDSSSSVTRKNFGFAKNFVANVSSVFDL---------RSGDVQ 810 Query: 218 SGLVTFSSKIVQTFPLAWGVQH----IQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEK 272 G++T+S+ + + G H EK+ + + T + L Y Sbjct: 811 VGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGDTHTGTALRY----------- 859 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + ++ K +IF+TDG I + + +G ++AIGV + Sbjct: 860 ISTNNRWREEVPKILIFVTDGTPQDRAIVPAAARSL----RDKGVRIFAIGVGNAVESEL 915 Query: 333 LKNCASP 339 + + P Sbjct: 916 KEIASEP 922 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 69/201 (34%), Gaps = 30/201 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ M + + ++ N R +V ++ Sbjct: 195 DIVFVVDSSGSIGPKRFDYMKNWVKSIAASFKV---------GENFARFSVVQYTKTAKT 245 Query: 230 TFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-YKKYI 287 + I +KI+ +I+ G + N + E+ + K + K+ + Sbjct: 246 VVDFQTLDLSSISQKIDSMIYFQGRNGRGGKTFTGNAL----ERAHTLLKESEPGRKRIV 301 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + LTDG + E + ++A+GV ++ ++ R + Sbjct: 302 LLLTDGSSDD------EYGPVAKAIRDDKVDIFAVGVGRARKNELVEITGDEQRVWQ--- 352 Query: 348 SRKLHDAFLRIGKEMVKQRIL 368 F IG+ Q++L Sbjct: 353 ----TRTFNNIGQ--FNQKLL 367 Score = 38.6 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 45/123 (36%), Gaps = 24/123 (19%) Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T L+Y Y +F ++ A D ++ +I +TDG+ + P E Sbjct: 68 LNGDTCIGEALDYFYRNMFTSQ------AGQRSDVEQRVIVMTDGKRNCP----AEIAKP 117 Query: 309 CNEAKRRGAIVYAIGVQAEAA---------DQFLKNCAS---PDRFYSVQNSRKLHDAFL 356 + + A +YAIG+ + Q L AS + + N +L Sbjct: 118 AELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFEINNFDQLI--LK 175 Query: 357 RIG 359 RIG Sbjct: 176 RIG 178 >gi|258627123|ref|ZP_05721919.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] gi|258580641|gb|EEW05594.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] Length = 335 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 70/218 (32%), Gaps = 36/218 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ ++ G D++MV+D+S SM + G + +L A + +R + Sbjct: 87 EVQTREAFGRDVLMVVDLSGSMEEKDFATESGEQLSRLTAAKKVLRNFVTQ-------RQ 139 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKE 271 R GL+ F P ++ G +T + + Sbjct: 140 GDRFGLILFGDAAFIQTPFTADQNVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQS-- 197 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD- 330 + ++ LTDG ++ + ++ R +Y I + Sbjct: 198 -------PSTSQDQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIR---IYVIAMGDPENVG 247 Query: 331 ------QFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 + +S R + + +L++A+ I + Sbjct: 248 EQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVIDQ 285 >gi|297203405|ref|ZP_06920802.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197711494|gb|EDY55528.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 421 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 76/234 (32%), Gaps = 30/234 (12%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKI---SSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 + CA + I V + +VLDVS SM G + Sbjct: 2 ITRKRLAAGVCALLAALTAGIAFPVTAAAGEPTGGTAPQVDLVLDVSGSMRTADIDGGTR 61 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTF---------SSKIVQTFPLAWGVQHIQE 242 + A R+ E+LD + +V +R+ + + Sbjct: 62 MAAAKRAFNEVLDA--TPEEVQLGIRTLGANYPGDNQKTGCKDTAQLYPVSTLDRTEAKT 119 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 ++ L T P L + D+ K I+ ++DGE++ +D Sbjct: 120 QVATLSPTGWTPIGPALLKSAGDFTDSA------------SSKRIVLISDGEDTCAPLDP 167 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHD 353 E AK G + +G+ + +C A+ + SV+++ +L D Sbjct: 168 CEVAREIG-AKGIGLTIDTLGLVPNTKMRKQLSCIAEATGGTYTSVEHADELTD 220 >gi|242247116|ref|NP_081039.2| collagen alpha-4(VI) chain precursor [Mus musculus] gi|189082905|sp|A2AX52|CO6A4_MOUSE RecName: Full=Collagen alpha-4(VI) chain; Flags: Precursor Length = 2309 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 44/226 (19%), Positives = 84/226 (37%), Gaps = 29/226 (12%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 S P S + +S D+ ++D S S+ + + Sbjct: 819 MLSGMPPLMSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPN------DFIEMKDFM 872 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKST 256 +E++ + PD VR G+V +S KI+ F L + + I+ + G T + Sbjct: 873 KEVIKMFHIGPDR---VRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTG 929 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L + D +Y+I +TDG+++ P + + L + G Sbjct: 930 KALSKMVPVFQNTAR---------IDVARYLIVITDGQSTDPVAEAAQGL------RDIG 974 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 +YAIGV+ +A L+ AS F+ + L + +++ Sbjct: 975 VNIYAIGVR-DANTTELEEIASKKMFF-IYEFDSLKSIHQEVIRDI 1018 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 48/300 (16%), Positives = 97/300 (32%), Gaps = 26/300 (8%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 + L +IL+H A +Q + + I+ + R + Sbjct: 111 MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSGLTEDHIREPAEALRRAGILVYAIGV 170 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP-FIFCTFPWCANSSHAPLLITSSVKIS 162 + +S D + + +P P ++ T + Sbjct: 171 KDASQAELREISSSPKDNFTFFVPNF---PGLPGLAQKLRPELCSTLGKAAQYTERESPA 227 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ ++D S S G+ + + + + ++ V+ GLV Sbjct: 228 CSEASPADIVFLVDSSTS------IGLQNFQKVKHFLHSV---VSGLDVRSDQVQVGLVQ 278 Query: 223 FSSKIVQTFPLAWG--VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S I FPL + ++I L + T + LE + + E AK Sbjct: 279 YSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALE--FIRANSLTEMSGSRAKD 336 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + ++ +TDGE+S D ++ KR G V+ +G+ + + K P Sbjct: 337 GVP--QIVVLVTDGESSDEVQD------VADQLKRDGVFVFVVGINIQDVQELQKIANEP 388 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 56/340 (16%), Positives = 112/340 (32%), Gaps = 38/340 (11%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 V++ S K + L + L A + Q G + + +N + D Sbjct: 890 VVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQNTARIDVAR 949 Query: 95 ELRENGFAQDINNIERSTS---------LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWC 145 L Q + + + +I + D + L ++ +M FI+ Sbjct: 950 YLIVITDGQSTDPVAEAAQGLRDIGVNIYAIGVRDANTT-ELEEIASKKMFFIYEFDSLK 1008 Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + I SS S+ D++ ++D S S+ + M++ Sbjct: 1009 SIHQEVIRDICSSENCKSQKA---DIIFLIDGSESIAPK------DFEKMKDFMERMVNQ 1059 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYA 262 D ++ GL+ FSS + F L + + ++ T + L + Sbjct: 1060 SNIGADE---IQIGLLQFSSNPQEEFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALNFT 1116 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ + +Y+I +TDG + ++L + R I++AI Sbjct: 1117 LPFFDSSRGGRPRVH-------QYLIVITDGVSQDNVAPPAKAL------RDRNIIIFAI 1163 Query: 323 GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 GV Q L+ D+ + +N L I E+ Sbjct: 1164 GVGNVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEV 1203 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 56/149 (37%), Gaps = 13/149 (8%) Query: 198 SIREMLDIIK-SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTK 254 S+R L I+ S+ + +R GL +S F L+ + + + I L F Sbjct: 50 SVRNFLYILANSLQVGRDNIRVGLAQYSDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGN 109 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + A I + + ++ + + ++ G + +R Sbjct: 110 R---MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSG------LTEDHIREPAEALRR 160 Query: 315 RGAIVYAIGVQAEAADQFLKNCASP-DRF 342 G +VYAIGV+ + + + +SP D F Sbjct: 161 AGILVYAIGVKDASQAELREISSSPKDNF 189 >gi|323495646|ref|ZP_08100717.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] gi|323319281|gb|EGA72221.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] Length = 419 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 62/437 (14%), Positives = 133/437 (30%), Gaps = 101/437 (23%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 G +IL A+++P +F + L + + KA++ + ++L A N +N Sbjct: 4 LRKQSGHAAILFAMIIPGLFGLFTLASDGARAIQTKARIEDASEIAVLAIAAH--NDDNK 61 Query: 70 NNGKKQKNDFSYRIIKNIWQT------DFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 N+ R I + D N L+ + + + + I + + + Sbjct: 62 NSQGSGSGSAVNRKIATDYLEAYLHDVDSVNNLKIHKY--NCDQIPECVAGLARGEPRFF 119 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSM 181 Y + A SR+ FP ++ + +++ +D++ V D S SM Sbjct: 120 QYEVEATSRH-----VSWFPGDSSIPGFGKTFDAKGAATARKYQSEAVDILFVADYSGSM 174 Query: 182 NDHFGPGMDK----LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ-------- 229 + G ++ L + + + L + + +N G+ F+ Sbjct: 175 AGGWNGGSNRKYIDLRNIIKVVTDELQKFNDLNNTDNNT-VGMTGFNYYTKTKPTNRSNS 233 Query: 230 ------------------------------------------TFPLAWGVQHIQEKINRL 247 L +N Sbjct: 234 CFMTQLVYNNNYNINYTKTVNNIFNEKNNKYCVSHSDSSRFRDIDLTDNYSSFNTTVNGF 293 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T S G+ + K + ++ +I L+DG++S + N Sbjct: 294 YPNHGTASYQGIMRGAQML-----------KKGTNPRRLLIVLSDGDDSGTSQKNIHKQL 342 Query: 308 Y----CNEAKRR------------GAIVYAIGVQAE-AADQFLKNCASPDRFYSVQNSRK 350 C + K+ A + +G + L++CA + + QN+ Sbjct: 343 VNAGMCTKIKQELSTGISSSGQSIKARLAVVGFDYNVNNNTALRDCAGAENVFKAQNTDD 402 Query: 351 -LHDAFLRIGKEMVKQR 366 L+ I +E+ + Sbjct: 403 ILNKILELITEEIGHLK 419 >gi|161788949|dbj|BAF95091.1| double von Willebrand factor A domains [Mus musculus] Length = 2309 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 84/218 (38%), Gaps = 29/218 (13%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 S P S + +S D+ ++D S S+ + ++E++ + Sbjct: 827 MSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPN------DFIEMKDFMKEVIKMFH 880 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYN 264 PD VR G+V +S KI+ F L + + I+ + G T + L Sbjct: 881 IGPDR---VRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVP 937 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + D +Y+I +TDG+++ P + + L + G +YAIGV Sbjct: 938 VFQNTAR---------VDVARYLIVITDGQSTDPVAEAAQGL------RDIGVNIYAIGV 982 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + +A L+ AS F+ + L + +++ Sbjct: 983 R-DANTTELEEIASKKMFF-IYEFDSLKSIHQEVIRDI 1018 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 48/300 (16%), Positives = 98/300 (32%), Gaps = 26/300 (8%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 + L +IL+H A +Q + + I+ + R + Sbjct: 111 MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSGLAEDHIREPAEALRRAGILVYAIGV 170 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP-FIFCTFPWCANSSHAPLLITSSVKIS 162 + +S D + + +P P ++ T + Sbjct: 171 KDASQAELREISSSPKDNFTFFVPNF---PGLPGLAQKLRPELCSTLAKAAQYTEQESPA 227 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ ++D S S G+ + + + + ++ V+ GLV Sbjct: 228 CSEASPADIVFLVDSSTS------IGLQNFQKVKHFLHSV---VLGLDVRSDQVQVGLVQ 278 Query: 223 FSSKIVQTFPLAWG--VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S I FPL + ++I L + T + LE + + E AK Sbjct: 279 YSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALE--FIRANSLTEMSGSRAKD 336 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + ++ +TDGE+S D ++ KR G V+ +G+ + + K + P Sbjct: 337 GVP--QIVVLVTDGESSDEVQD------VADQLKRDGVFVFVVGINIQDVQELQKIASEP 388 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 57/340 (16%), Positives = 113/340 (33%), Gaps = 38/340 (11%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 V++ S K + L + L A + QE G + + +N + D Sbjct: 890 VVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQNTARVDVAR 949 Query: 95 ELRENGFAQDINNIERSTS---------LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWC 145 L Q + + + +I + D + L ++ +M FI+ Sbjct: 950 YLIVITDGQSTDPVAEAAQGLRDIGVNIYAIGVRDANTT-ELEEIASKKMFFIYEFDSLK 1008 Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + I SS S+ D++ ++D S S+ + M++ Sbjct: 1009 SIHQEVIRDICSSENCKSQKA---DIIFLIDGSESIAPK------DFEKMKDFMERMVNQ 1059 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYA 262 D ++ GL+ FSS + F L + + ++ T + L + Sbjct: 1060 SNIGADE---IQIGLLQFSSNPREEFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALNFT 1116 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ + +Y+I +TDG + ++L + R I++AI Sbjct: 1117 LPFFDSSRGGRPRVH-------QYLIVITDGVSQDNVAPPAKAL------RDRNIIIFAI 1163 Query: 323 GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 GV Q L+ D+ + +N L I E+ Sbjct: 1164 GVGNVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEV 1203 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 55/149 (36%), Gaps = 13/149 (8%) Query: 198 SIREMLDIIK-SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTK 254 S+R L I+ S+ + +R GL + F L+ + + + I L F Sbjct: 50 SVRNFLYILANSLQVGRDNIRVGLAQYGDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGN 109 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + A I + + ++ + + ++ G + +R Sbjct: 110 R---MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSG------LAEDHIREPAEALRR 160 Query: 315 RGAIVYAIGVQAEAADQFLKNCASP-DRF 342 G +VYAIGV+ + + + +SP D F Sbjct: 161 AGILVYAIGVKDASQAELREISSSPKDNF 189 >gi|148689169|gb|EDL21116.1| mCG140660 [Mus musculus] Length = 2242 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 84/218 (38%), Gaps = 29/218 (13%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 S P S + +S D+ ++D S S+ + ++E++ + Sbjct: 827 MSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPN------DFIEMKDFMKEVIKMFH 880 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYN 264 PD VR G+V +S KI+ F L + + I+ + G T + L Sbjct: 881 IGPDR---VRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVP 937 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + D +Y+I +TDG+++ P + + L + G +YAIGV Sbjct: 938 VFQNTAR---------VDVARYLIVITDGQSTDPVAEAAQGL------RDIGVNIYAIGV 982 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + +A L+ AS F+ + L + +++ Sbjct: 983 R-DANTTELEEIASKKMFF-IYEFDSLKSIHQEVIRDI 1018 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 48/300 (16%), Positives = 98/300 (32%), Gaps = 26/300 (8%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 + L +IL+H A +Q + + I+ + R + Sbjct: 111 MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSGLTEDHIREPAEALRRAGILVYAIGV 170 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP-FIFCTFPWCANSSHAPLLITSSVKIS 162 + +S D + + +P P ++ T + Sbjct: 171 KDASQAELREISSSPKDNFTFFVPNF---PGLPGLAQKLRPELCSTLGKAAQYTERESPA 227 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ ++D S S G+ + + + + ++ V+ GLV Sbjct: 228 CSEASPADIVFLVDSSTS------IGLQNFQKVKHFLHSV---VLGLDVRSDQVQVGLVQ 278 Query: 223 FSSKIVQTFPLAWG--VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S I FPL + ++I L + T + LE + + E AK Sbjct: 279 YSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALE--FIRANSLTEMSGSRAKD 336 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + ++ +TDGE+S D ++ KR G V+ +G+ + + K + P Sbjct: 337 GVP--QIVVLVTDGESSDEVQD------VADQLKRDGVFVFVVGINIQDVQELQKIASEP 388 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 57/340 (16%), Positives = 113/340 (33%), Gaps = 38/340 (11%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 V++ S K + L + L A + QE G + + +N + D Sbjct: 890 VVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQNTARVDVAR 949 Query: 95 ELRENGFAQDINNIERSTS---------LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWC 145 L Q + + + +I + D + L ++ +M FI+ Sbjct: 950 YLIVITDGQSTDPVAEAAQGLRDIGVNIYAIGVRDANTT-ELEEIASKKMFFIYEFDSLK 1008 Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + I SS S+ D++ ++D S S+ + M++ Sbjct: 1009 SIHQEVIRDICSSENCKSQKA---DIIFLIDGSESIAPK------DFEKMKDFMERMVNQ 1059 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYA 262 D ++ GL+ FSS + F L + + ++ T + L + Sbjct: 1060 SNIGADE---IQIGLLQFSSNPREEFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALNFT 1116 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ + +Y+I +TDG + ++L + R I++AI Sbjct: 1117 LPFFDSSRGGRPRVH-------QYLIVITDGVSQDNVAPPAKAL------RDRNIIIFAI 1163 Query: 323 GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 GV Q L+ D+ + +N L I E+ Sbjct: 1164 GVGNVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEV 1203 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 55/149 (36%), Gaps = 13/149 (8%) Query: 198 SIREMLDIIK-SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTK 254 S+R L I+ S+ + +R GL + F L+ + + + I L F Sbjct: 50 SVRNFLYILANSLQVGRDNIRVGLAQYGDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGN 109 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + A I + + ++ + + ++ G + +R Sbjct: 110 R---MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSG------LTEDHIREPAEALRR 160 Query: 315 RGAIVYAIGVQAEAADQFLKNCASP-DRF 342 G +VYAIGV+ + + + +SP D F Sbjct: 161 AGILVYAIGVKDASQAELREISSSPKDNF 189 >gi|291087628|ref|ZP_06346959.2| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291074491|gb|EFE11855.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 473 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 71/197 (36%), Gaps = 24/197 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV-RSGLVTFSSKIV 228 D++ +D S M G ++ ++ E L+ + P+ R GLV+FS Sbjct: 46 DIVFAIDRSAKME---GSALEAAKKGIKAFIETLERESAQPEGYAGEKRVGLVSFSDTAT 102 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L+ V+ L G + + A + +K + Sbjct: 103 VNSMLSPVVEQAARAAEGLTAGGKSNQAEAIRAAVKLL-----------DMKTPGEKMLF 151 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNCA---SPDRFY 343 +TDG+ + ++ EA++ G VY IG+ A + L++ A S Sbjct: 152 LITDGQTPFRS----QTDSAAAEARQAGVTVYCIGIAAPDGVNREALRSWASGPSDSHII 207 Query: 344 SVQNSRKLHDAFLRIGK 360 ++ + AF R+ K Sbjct: 208 EIRELGEAQTAFERLMK 224 >gi|254787807|ref|YP_003075236.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237686979|gb|ACR14243.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 689 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 70/172 (40%), Gaps = 19/172 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LDVS SM DKL + +S+ +L ++ V+ VV +G + +++ Sbjct: 329 LVFLLDVSGSMG-----SPDKLPLVKQSMELLLSGLQPTDTVSIVVYAGA---AGTVLEP 380 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P+A Q I ++RL G +T G+E AY ++ II Sbjct: 381 TPVA-EQQKILAALDRLNAGGSTAGAQGIELAYQLAEANYQRDAVNR---------IILA 430 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 TDG+ + D ++ Y + G + +G + D ++ A Sbjct: 431 TDGDFNVGIADPEQLKGYVERKRANGIELSILGFGSGNYNDALMQQLAQNGN 482 >gi|160874259|ref|YP_001553575.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160859781|gb|ABX48315.1| von Willebrand factor type A [Shewanella baltica OS195] Length = 642 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 77/216 (35%), Gaps = 30/216 (13%) Query: 163 SKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 KS +G ++ +LDVS SM +DKL + +++ + + + V+ VV +G Sbjct: 231 PKSQLGASNLVFLLDVSGSMA-----SVDKLPLLQTALKLLTAQLSAQDKVSIVVYAGAA 285 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +V Q + + +L G +T G+ AY +H Sbjct: 286 G----VVLDGASGNDTQTLNYALEQLSAGGSTNGGQGITQAYQL------AKKHFIPNGI 335 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--S 338 + +I TDG+ + D + + + K G + +G DQ ++ A Sbjct: 336 NR---VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKG 392 Query: 339 PDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L I K++ Q Sbjct: 393 NGNYAYIDTLNEARKVLVDELSSTLFTIAKDVKVQV 428 >gi|329922584|ref|ZP_08278159.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942128|gb|EGG38410.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 421 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 79/209 (37%), Gaps = 36/209 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SMN+ P D+ A I M D +N R ++ F Sbjct: 114 DIVLVIDNSGSMNET-DPNQDRYTAAKNLINRM--------DRDN--RVSVIMFDHATTL 162 Query: 230 TFPLAW-----GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P I +I+ L T + LE + I ++++ Sbjct: 163 LQPFTRVNNQETKDEIIAEIDGLATTDGGTDISLALEDTMSHIQESRDAGRSAM------ 216 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKNCA--SP 339 +I L+DG D+ L E K++ V IG+ Q L+ A + Sbjct: 217 ---VIMLSDG---FSETDHDRVLA---EYKQQQIAVNTIGLSLVNPDGAQLLQTIAAETG 267 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 ++Y VQ++ L F +I ++ + +L Sbjct: 268 GQYYDVQHAEDLSFVFQKIYDDVGDRSLL 296 >gi|32475534|ref|NP_868528.1| BatB [Rhodopirellula baltica SH 1] gi|32446076|emb|CAD75905.1| BatB [Rhodopirellula baltica SH 1] Length = 747 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 70/164 (42%), Gaps = 27/164 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G++ + VLDVS SM ++LG A + I++M+D + R GLV F+ Sbjct: 124 QRGIEAVFVLDVSRSMLAEDVSP-NRLGRAKQQIKDMVDEMPG-------DRVGLVVFAG 175 Query: 226 KIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + QT PL V+ ++ ++ + + ++ + A + D Sbjct: 176 ETRQTLPLTRHVEDFKQTLDSVGIHSVRRGGSRLGDAIRVASDAFLDKTTDH-------- 227 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 K ++ LTDGE+ + ++ Y + +G ++ IG+ Sbjct: 228 ---KAMVILTDGEDQESDPVSEAKRAY----EEQGIRIFTIGLG 264 >gi|329922540|ref|ZP_08278115.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942084|gb|EGG38366.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 595 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 42/245 (17%), Positives = 86/245 (35%), Gaps = 37/245 (15%) Query: 141 TFPWCANSSHAPLLITS--SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 A ++ +D ++V+D S SM + + + Sbjct: 8 LLSLMAAVLLIMTWSGGALPKAAAASQGSNIDAVLVMDASNSMKNSDPERIS-----GEA 62 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTT 253 ++ +D++ + D G+V+++ +I + L ++E I++L G T Sbjct: 63 MKMFIDMLATTGDK-----VGIVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYT 117 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS-SPNI------DNKESL 306 + GL+ A + H I+ L DG N PN +++ Sbjct: 118 DMSVGLDEAVKVLKQGM-DPAHAP--------MIVVLADGNNDLDPNTGRTSKEASEQLA 168 Query: 307 FYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 EAK G +Y IG+ A+ + L A + + ++ ++ L I Sbjct: 169 QAVKEAKGSGIPIYTIGLNADGKLNKETLAELANQTGGKSFTTSSADDLPQILSEIFASH 228 Query: 363 VKQRI 367 + +I Sbjct: 229 QQLKI 233 >gi|225174955|ref|ZP_03728952.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] gi|225169595|gb|EEG78392.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] Length = 841 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 85/249 (34%), Gaps = 38/249 (15%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + + S + + S + L +++V+D+S SM+ + + + Sbjct: 9 LIKLVVFAMLLSTLAGAVVQARSGRSHTTEALSVILVVDISGSMDRNDPQYLRETATLI- 67 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINR-LIFGSTT 253 ++L G++ F +I + PL A +E + L+ T Sbjct: 68 -FMDLLGPKD---------YLGVLAFDDRIEELVPLQQVADNKGTFKEAVEGNLVPRGFT 117 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN--- 310 LE A+ ++ + + ++FLTDGE + P++D + + Sbjct: 118 DYVGALEEAFEQLHSVETGDAR---------QVVVFLTDGEPN-PHLDARNDDEFMEGYL 167 Query: 311 --------EAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 E G VY + E + L+ A + F + + L F + Sbjct: 168 GELWDLTGEYAAAGVPVYPVAFSDEVGPEVLEQIAGHTGADFVLMPDPGDLVVTFFELVS 227 Query: 361 EMVKQRILY 369 + + + + Sbjct: 228 RLKNRNLFF 236 >gi|290985353|ref|XP_002675390.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284088986|gb|EFC42646.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 923 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 75/212 (35%), Gaps = 32/212 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K G+D+++V+D S SM KL + ++ M+D +K R +V F Sbjct: 686 KERKGVDLVLVVDKSGSMAGQ------KLDMVKSTLSFMVDQLK------EKDRVAIVEF 733 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ L G + ++ + + GS T + L + + +++ + Sbjct: 734 DTQVKTNLDLTKMDIEGKKKAKQVSSAISPGSCTNLSGALFTSLKLLASRQQEKNEVTS- 792 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-----RGAIVYAIGVQAEAADQFLK 334 +I TDG + I E L + ++ G + L Sbjct: 793 -------VILFTDGLANRGLISTNEILQNMQDLMDELLSTSNVTIHTFGFGQDTDANMLT 845 Query: 335 NCASPDR--FYSVQNSRKLHDAFLR-IGKEMV 363 + A + ++ + + AF IG + Sbjct: 846 SIAQKGNGLYDYLETADDIPKAFGNVIGNLVS 877 >gi|953237|gb|AAA99719.1| collagen type XII alpha-1 precursor [Mus musculus] Length = 3067 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 48/257 (18%), Positives = 98/257 (38%), Gaps = 35/257 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+L++ +Y +S + M + + P + T +K+ + G+ Sbjct: 385 QTTTLNVRDLTADTEYQISV---FAMKGLTSSEP--TSVMEKTQPKTQPMKVQVECSRGV 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ D+ ++ + G + + + P+ V+ LV +S Sbjct: 440 DIKA--DIVFLVDGSYSIGTANFVKVRAFLEVLAKSFEISPNR---VQISLVQYSRDPHT 494 Query: 230 TFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +++ I + G +T + + Y KIF + + K Sbjct: 495 EFTLK-EFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIFVPNKGSR------SNVPK 547 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRF 342 +I +TDGE+S D + + ++A+GV+ +A L+ ASP Sbjct: 548 VMILITDGESSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHV 600 Query: 343 YSVQNSRKLHDAFLRIG 359 ++V++ DAF RI Sbjct: 601 FTVED----FDAFQRIS 613 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 93/272 (34%), Gaps = 44/272 (16%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS-- 165 + ST+ +++ D + + ++ Y + K K Sbjct: 72 LAASTTETLLSDLIPETQYVVTITSYN-----EVEESVPVIGQLTIQTGGPTKPGEKKPG 126 Query: 166 ---------DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 D++ ++D S S+ + + VA ++ + Sbjct: 127 KTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFIVA---------LVSAFDIGEEKT 177 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY-NKIFDAKEK 272 R G+V +SS F L + + + + ++ + G T + ++Y N ++ Sbjct: 178 RVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLVKNTFTESAGS 237 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K I +TDG++ E + G V+++G++A A + Sbjct: 238 RAG-------FPKVGIIITDGKSQDEVEIPAR------ELRNIGVEVFSLGIKAADAKEL 284 Query: 333 LKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 + ++P + ++V N + D I ++ Sbjct: 285 KQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 84/245 (34%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ +P + ++++ P L + ++ D+++++D Sbjct: 1154 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMDCLTRAEA---DIVLLVDG 1210 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1211 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWQLNAHR 1261 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + G+ A N I K + + +K + +TDG++ Sbjct: 1262 DKKSLLQAVANLPNKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1316 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1317 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESLSK 1370 Query: 354 AFLRI 358 + Sbjct: 1371 IVDDL 1375 >gi|282892468|ref|ZP_06300802.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281497750|gb|EFB40114.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 373 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 45/256 (17%), Positives = 84/256 (32%), Gaps = 33/256 (12%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F T W + G+ + +VLD S SM D+ G + Sbjct: 69 LAFLTIAWMDPHFYLLKPDRGERARDHTPHEGIAIYLVLDQSGSMAQKIPLSSDEKGRIS 128 Query: 197 RSIREML----------DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + + D K + ++N + GL+ F+ PL Q I +++N+ Sbjct: 129 VPKIDFIVEITKDFVKGDPTKGLKGLHNDL-VGLIGFARTAQVLSPLTLDHQAIIDQLNK 187 Query: 247 LIF-----GSTTKSTPGLEYAYNKIFDAKEKLE--HIAKGHDDYKKYIIFLTDGENSSPN 299 T + N I K E A + ++ +TDG Sbjct: 188 FSIVKHQDEDGTSIGYAIFKTANLIASTKHFAEELKEASPYTIKNSIMLIVTDGFQDPNP 247 Query: 300 IDNKESLF------YCNEAKRRGAIVYAIGVQAE-------AADQFLKNCA--SPDRFYS 344 +D ++ AK +G VY I V+ + + ++ + +FY Sbjct: 248 LDREDQYRSIELEDAAKYAKEQGVRVYIINVEPRIASEEFGSERRVMQKVTEITGGKFYL 307 Query: 345 VQNSRKLHDAFLRIGK 360 + + +L + + I K Sbjct: 308 LDHIEELKNIYADIDK 323 >gi|119358219|ref|YP_912863.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355568|gb|ABL66439.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 337 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 46/233 (19%), Positives = 77/233 (33%), Gaps = 50/233 (21%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G+D++ +LDVS SM D+L A I L I K + D R L+ F+ Sbjct: 85 ERKGVDVLFMLDVSNSMLVADVSP-DRLTRAKSGI---LRISKGLRDG----RQALLLFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + P+ + + L+ T L A + Sbjct: 137 GSPLVQCPMTTDHAAFEALLGMVSTELVSDQGTAFDSALNLAMRLF---ERTEPPGDVKE 193 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI----------------GV 324 +K I+ L+DGEN S N + K+ G V+ I GV Sbjct: 194 VQGEKVIVLLSDGENHSGNF-----RAVADALKQSGVSVFTIVLGKPLPAAIPLGQSSGV 248 Query: 325 QAEAADQFLKNCASP-----------DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + +AA + +K +SP F+ + + R+ + + Sbjct: 249 KKDAAGKIVKTRSSPETMRRLAGDSGGTFFDA---SEDDAVYDRVAERISTLV 298 >gi|293361231|ref|XP_243912.5| PREDICTED: collagen alpha-1(XII) chain [Rattus norvegicus] Length = 3119 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 52/265 (19%), Positives = 101/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+L++ +Y +S + M + + P P+ + Sbjct: 385 QTTTLNVRDLSADTEYQISV---FAMKGLTSSEPVSVMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + + + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLAKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I + IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKEFNRVEDIIKAINNFPYRGGSTNTGKAMTYVREKIFVPNKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 77/199 (38%), Gaps = 28/199 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + VA ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIVA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY-NKIFDAKEKLEHIAKGHDDYKK 285 F L + + + I ++ + G T + ++Y N ++ + K Sbjct: 191 EFNLNQYYRREDLLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGSRAG-------FPK 243 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFY 343 I +TDG++ E + G V+++G++A A + + ++P + + Sbjct: 244 VAIIITDGKSQDEVEIPAR------ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVF 297 Query: 344 SVQNSRKLHDAFLRIGKEM 362 +V N + D I ++ Sbjct: 298 NVANFDAIVDIQNEIISQV 316 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 85/245 (34%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ +P + ++++ P L + + + D+++++D Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWHLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDIHAYNVADFESLSK 1366 Query: 354 AFLRI 358 + Sbjct: 1367 IVDDL 1371 >gi|293349337|ref|XP_001060689.2| PREDICTED: collagen, type XII, alpha 1 [Rattus norvegicus] Length = 3064 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 52/265 (19%), Positives = 101/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+L++ +Y +S + M + + P P+ + Sbjct: 385 QTTTLNVRDLSADTEYQISV---FAMKGLTSSEPVSVMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + + + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLAKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I + IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKEFNRVEDIIKAINNFPYRGGSTNTGKAMTYVREKIFVPNKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 77/199 (38%), Gaps = 28/199 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + VA ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIVA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY-NKIFDAKEKLEHIAKGHDDYKK 285 F L + + + I ++ + G T + ++Y N ++ + K Sbjct: 191 EFNLNQYYRREDLLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGSRAG-------FPK 243 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFY 343 I +TDG++ E + G V+++G++A A + + ++P + + Sbjct: 244 VAIIITDGKSQDEVEIPAR------ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVF 297 Query: 344 SVQNSRKLHDAFLRIGKEM 362 +V N + D I ++ Sbjct: 298 NVANFDAIVDIQNEIISQV 316 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 85/245 (34%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ +P + ++++ P L + + + D+++++D Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWHLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDIHAYNVADFESLSK 1366 Query: 354 AFLRI 358 + Sbjct: 1367 IVDDL 1371 >gi|303235711|ref|ZP_07322318.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484158|gb|EFL47146.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 341 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 65/207 (31%), Gaps = 33/207 (15%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGM 189 R + F KIS G++ M+ +D+S SM D + Sbjct: 54 RPLLKFYIMEVILALLIIVIARPQVG-TKISKDKREGIEAMIAMDISNSMLAQDVAPTRL 112 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN---- 245 D+ + + G+V F+ P+ + +N Sbjct: 113 DRSKRLVEDLVNRFTN----------DKIGIVVFAGDAFVQLPITSDYISAKMFLNNISP 162 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 LI T + E EH ++ K II +TDGEN K + Sbjct: 163 ELIGSQGTDIGKAI-----------ELSEHSFSEKANFGKAIIIITDGENHE-----KGA 206 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQF 332 EA+++G V+ +G+ + Sbjct: 207 EEMAREAQKQGIRVFILGIGSPQGAPI 233 >gi|197337036|ref|YP_002157821.1| hypothetical protein VFMJ11_A0264 [Vibrio fischeri MJ11] gi|197314288|gb|ACH63737.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 423 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 67/440 (15%), Positives = 132/440 (30%), Gaps = 96/440 (21%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +RN + +G +IL A+++P +F + L + + KA++ + + L ++ Sbjct: 1 MRNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALA-----IS 55 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 N + + + I + + DI +R L D Sbjct: 56 AHNDPDQPDNGSYTPSTRNRQIVVDYVNAYISDVDAVTDIKVAKRRCELIPECVAGLYDG 115 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHA-PLLITSSVKISSKSDI----GLDMMMVLDVSLS 180 ++ +E+ W + + T S + S + +D M D S S Sbjct: 116 DMR-YLEHEIDVTTRQNSWFPGNEAIEGMGETFSTRGKSLARKYQSEAVDAMFAADFSGS 174 Query: 181 MND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF----- 231 M D P L R+I L +P+ N G+ FS+ Sbjct: 175 MLDTWSGSSNPKYVDLIEIIRNISAELQKFNDLPENRNKSTMGISAFSTFTNSFTSDTGI 234 Query: 232 -------------PLAW-------------------------------------GVQHIQ 241 P W ++ Sbjct: 235 QCSLSQGVNGRNGPATWFRPVKAANTVANIWNPKTEDYCKSGAYAGFHDVNLTSNFNYLN 294 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNI 300 ++ G T S L + + ++ ++ +I L+DG +N + Sbjct: 295 GQVGSFYAGGGTASYQALIRG-----------AQLLRKGNNSRRLLIVLSDGMDNDTQLA 343 Query: 301 DNKESLFYCNEAKRR------------GAIVYAIGVQAEA-ADQFLKNCASPDRFYSVQN 347 D S C + + A + IG A++ LK+C Y ++ Sbjct: 344 DGLVSAGMCRDIQNGLESDRTPDRRPIAAKMAVIGFDYNPFANKALKDCVGEKNVYKAED 403 Query: 348 SRKLHDAFLR-IGKEMVKQR 366 + ++ D L I +E+ + Sbjct: 404 ADEVEDIILELINEEIGHLK 423 >gi|149019069|gb|EDL77710.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] gi|149019070|gb|EDL77711.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] Length = 2827 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 52/265 (19%), Positives = 101/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+L++ +Y +S + M + + P P+ + Sbjct: 327 QTTTLNVRDLSADTEYQISV---FAMKGLTSSEPVSVMEKTQPMKVQVECSRGVDIKA-- 381 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + + + P+ V+ LV +S Sbjct: 382 DIVFLVDGSYS------IGIANFVKVRAFLEVLAKSFEISPNR---VQISLVQYSRDPHT 432 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I + IN + G +T + + Y KIF + + K Sbjct: 433 EFTLKEFNRVEDIIKAINNFPYRGGSTNTGKAMTYVREKIFVPNKGSR------SNVPKV 486 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 487 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 539 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 540 TVED----FDAFQRISFELTQSICL 560 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 85/245 (34%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ +P + ++++ P L + + + D+++++D Sbjct: 1092 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSG---MECLTRAEADIVLLVDG 1148 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1149 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWHLNAHR 1199 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1200 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1254 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1255 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDIHAYNVADFESLSK 1308 Query: 354 AFLRI 358 + Sbjct: 1309 IVDDL 1313 >gi|219847682|ref|YP_002462115.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541941|gb|ACL23679.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 76/229 (33%), Gaps = 32/229 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 P + LL+ + S S + L++ VLD S SM G ++ + ATR + E Sbjct: 16 VPTSSTPQVVYLLVEAVAPASPTSALPLNLCFVLDRSGSMQ---GAKLESMKAATRRVIE 72 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGL 259 +L +V F + P + + + T + G+ Sbjct: 73 LLRPHDVA---------AIVIFDDTVQTLIPATPVGDRSALLAAVETITEAGGTAMSLGM 123 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 + A ++ H + ++ LTDG+ D + G Sbjct: 124 QAAQTELQK-----------HLGPDRISRMLLLTDGQTWG---DEPICRDLARTLGQAGV 169 Query: 318 IVYAIGVQAEAADQFLK--NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + A+G+ E +Q L AS + + ++ F + KE Sbjct: 170 RITALGLGTEWNEQLLDDIAAASDGYSDYIADPAQIETFFQQAVKEAQA 218 >gi|261409467|ref|YP_003245708.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285930|gb|ACX67901.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 595 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 42/245 (17%), Positives = 86/245 (35%), Gaps = 37/245 (15%) Query: 141 TFPWCANSSHAPLLITS--SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 A ++ +D ++V+D S SM + + + Sbjct: 8 LLSLMAAVLLIMTWSGGALPKAAAASQGSNIDAVLVMDASNSMKNSDPERIS-----GEA 62 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTT 253 ++ +D++ + D G+V+++ +I + L ++E I++L G T Sbjct: 63 MKMFIDMLATTGDK-----VGIVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYT 117 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS-SPNI------DNKESL 306 + GL+ A + H I+ L DG N PN +++ Sbjct: 118 DMSVGLDEAVKVLKQGM-DPAHAP--------MIVVLADGNNDLDPNTGRTSKEASEQLN 168 Query: 307 FYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 EAK G +Y IG+ A+ + L A + + ++ ++ L I Sbjct: 169 QAVKEAKGSGIPIYTIGLNADGKLNKETLAELAKQTGGKSFTTSSADDLPQILSEIFASH 228 Query: 363 VKQRI 367 + +I Sbjct: 229 QQLKI 233 >gi|293605449|ref|ZP_06687831.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] gi|292816177|gb|EFF75276.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] Length = 343 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 77/222 (34%), Gaps = 35/222 (15%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDI 205 P + + ++ D+++ +D+S SM+ D G + + + + + Sbjct: 77 RPQWVEPPLV---HTEPVRDILLAVDISQSMDSVDFNDAQGQPLSRWDAVKAVVADFI-- 131 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 R GL+ F + PL ++ ++ ++ G+ Sbjct: 132 -----AQRADDRLGLIVFGTGAYPQAPLTRDHASLKLLLD--------EAAVGMAGPNTA 178 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + DA + ++ K +I LTDG ++ + + + V+ IG+ Sbjct: 179 VGDAIGLGIRMLDAAEEQDKVLILLTDGNDTGSAVPPARAAALAAQ---HHVTVHTIGIG 235 Query: 326 AEAAD-------QFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 AA L+ A + +F+ ++ L + + + Sbjct: 236 DPAATGEDRVDFDILREVARTAGGQFFPARDLATLREVYATL 277 >gi|29828547|ref|NP_823181.1| hypothetical protein SAV_2005 [Streptomyces avermitilis MA-4680] gi|29605651|dbj|BAC69716.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 420 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 70/207 (33%), Gaps = 27/207 (13%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--------- 223 +VLDVS SM G ++ A ++ E+LD + +V +R+ + Sbjct: 42 LVLDVSGSMRARDIDGGSRMAAAKQAFNEVLDA--TPEEVRLGIRTLGANYPGDDRKTGC 99 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + L T P L A + + H Sbjct: 100 KDTAQLYPVSTLDRTEAKTAVATLSPTGWTPIGPALLKAADDL--DGGTGSHR------- 150 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---ASPD 340 I+ +TDGE++ +D E AK G + +G+ + +C A+ Sbjct: 151 ---IVLITDGEDTCAPLDPCEVAREI-AAKGVGLTIDTLGLVPNSKLSKQLSCIAEATGG 206 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + SV++ L D ++ + + Sbjct: 207 TYTSVEHKEDLTDKVNQLVDRAADKVV 233 >gi|22760140|dbj|BAC11083.1| unnamed protein product [Homo sapiens] Length = 540 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 LD++ V+D S S+ + + + +L + P N R G Sbjct: 24 TGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGP---NATRVG 74 Query: 220 LVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +SS++ FPL A+ + ++ I L+ T + ++YA N F E Sbjct: 75 VIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GA 131 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + + +TDG +A+ RG +YA+G+Q L+ Sbjct: 132 RPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGLQRADVGS-LRAM 184 Query: 337 ASP---DRFYSVQNSRKLHDAF 355 ASP + + V++ L F Sbjct: 185 ASPPLDEHVFLVESF-DLIQEF 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 303 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 353 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 354 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 408 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 409 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 456 >gi|330417948|ref|NP_001193426.1| collagen alpha-1(XII) chain [Bos taurus] Length = 3065 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 78/234 (33%), Gaps = 37/234 (15%) Query: 145 CANSSHAPLLITSSVKISSKS-----------DIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + K K D++ ++D S S+ + + Sbjct: 104 VPVIGQLTIQTGGPTKPGEKKPGKTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFI 163 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-G 250 A ++ + R G+V +SS F L + + + I ++ + G Sbjct: 164 AA---------LVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQQEELLAAIKKIPYKG 214 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + ++Y F A + K I +TDG++ Sbjct: 215 GNTMTGEAIDYLIKNTFTES------AGARVGFPKVAIIITDGKSQDEVEIPAR------ 262 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 E + G V+++G++A A + + ++P + ++V N + D I ++ Sbjct: 263 ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/251 (13%), Positives = 86/251 (34%), Gaps = 31/251 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWQLNAHK 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDIHAYNVADFDSLSR 1366 Query: 354 AFLRIGKEMVK 364 + + Sbjct: 1367 IVDDLTNNLCN 1377 >gi|296484270|gb|DAA26385.1| collagen, type XII, alpha 1 [Bos taurus] Length = 3115 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 78/234 (33%), Gaps = 37/234 (15%) Query: 145 CANSSHAPLLITSSVKISSKS-----------DIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + K K D++ ++D S S+ + + Sbjct: 104 VPVIGQLTIQTGGPTKPGEKKPGKTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFI 163 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-G 250 A ++ + R G+V +SS F L + + + I ++ + G Sbjct: 164 AA---------LVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQQEELLAAIKKIPYKG 214 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + ++Y F A + K I +TDG++ Sbjct: 215 GNTMTGEAIDYLIKNTFTES------AGARVGFPKVAIIITDGKSQDEVEIPAR------ 262 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 E + G V+++G++A A + + ++P + ++V N + D I ++ Sbjct: 263 ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/251 (13%), Positives = 86/251 (34%), Gaps = 31/251 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWQLNAHK 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDIHAYNVADFDSLSR 1366 Query: 354 AFLRIGKEMVK 364 + + Sbjct: 1367 IVDDLTNNLCN 1377 >gi|304412560|ref|ZP_07394165.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303576|ref|ZP_07583329.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349036|gb|EFM13449.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306912474|gb|EFN42897.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 627 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 76/216 (35%), Gaps = 30/216 (13%) Query: 163 SKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 KS +G ++ +LDVS SM DKL + +++ + + + V+ VV +G Sbjct: 216 PKSQLGASNLVFLLDVSGSMA-----SADKLPLLQTALKLLTAQLSAQDKVSIVVYAGAA 270 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +V Q + + +L G +T G+ AY +H Sbjct: 271 G----VVLDGASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQL------AKKHFIPNGI 320 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--S 338 + +I TDG+ + D + + + K G + +G DQ ++ A Sbjct: 321 NR---VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKG 377 Query: 339 PDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L I K++ Q Sbjct: 378 NGNYAYIDTLNEARKVLVDELSSTLFTIAKDVKVQV 413 >gi|296474583|gb|DAA16698.1| collagen, type XXI, alpha 1 [Bos taurus] Length = 507 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 86/213 (40%), Gaps = 37/213 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + K +++I K+ ++ G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGPENFEIVKKW---------LVNITKNFDIGPKFIQVGVV 79 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +S V PL + + + + G T++ +++A + +F A Sbjct: 80 QYSDYPVLEIPLG-SHESGENLMAAMESIHYLGGNTRTGKAIQFALDYLF---------A 129 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K K + LTDG++ D A+ ++AIGV +E + L+ A Sbjct: 130 KSARFLTKIAVVLTDGKSQDEVKD------AAEAARDSRITLFAIGVGSETEEAELRAIA 183 Query: 338 ---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 S + V++ + A +I +E++KQ++ Sbjct: 184 NKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|262402640|ref|ZP_06079201.1| protein BatA [Vibrio sp. RC586] gi|262351422|gb|EEZ00555.1| protein BatA [Vibrio sp. RC586] Length = 335 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 32/234 (13%), Positives = 76/234 (32%), Gaps = 39/234 (16%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRS 198 + P ++ ++ ++ G D++MV+D+S SM + G + +L A + Sbjct: 74 LMVTALAKPSVLG---EVQTREAFGRDVLMVVDLSGSMEEKDFATESGEQLSRLTAAKKV 130 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKS 255 +R+ + R GL+ F P + ++ G +T Sbjct: 131 LRDFVTQ-------RQGDRFGLILFGDAAFIQTPFTADQEVWLNLLDEAETGMAGQSTNL 183 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + + + ++ LTDG ++ + ++ R Sbjct: 184 GDAIGLGIKVFEQS---------PATSQDQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIR 234 Query: 316 GAIVYAIGVQAEAAD-------QFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 +Y I + + +S R + + +L++A+ I + Sbjct: 235 ---IYVIAMGDPENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVIDQ 285 >gi|151556217|gb|AAI49226.1| COL21A1 protein [Bos taurus] Length = 518 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 86/213 (40%), Gaps = 37/213 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + K +++I K+ ++ G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGPENFEIVKKW---------LVNITKNFDIGPKFIQVGVV 79 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +S V PL + + + + G T++ +++A + +F A Sbjct: 80 QYSDYPVLEIPLG-SHESGENLMAAMESIHYLGGNTRTGKAIQFALDYLF---------A 129 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K K + LTDG++ D A+ ++AIGV +E + L+ A Sbjct: 130 KSARFLTKIAVVLTDGKSQDEVKD------AAEAARDSRITLFAIGVGSETEEAELRAIA 183 Query: 338 ---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 S + V++ + A +I +E++KQ++ Sbjct: 184 NKPSSTYVFYVED----YIAISKI-REVMKQKL 211 >gi|315649635|ref|ZP_07902720.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275108|gb|EFU38483.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 595 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 43/228 (18%), Positives = 86/228 (37%), Gaps = 35/228 (15%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T +++ +D ++V+D S SM + + +++ +D++ + D Sbjct: 25 TLPQAMAASQGSKIDAVLVMDASNSMKNSDPDRIGS-----EAMKMFIDMLSTTGDK--- 76 Query: 216 VRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 G+V+++ +I + L ++E I++L G T + GL+ A + Sbjct: 77 --VGVVSYTDRIQREKALLEIQSEADKTALKEFIDQLDRGPYTDISVGLDEAVKVLKQGM 134 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN-------EAKRRGAIVYAIG 323 E H I+ L DG N K S + EAK G +Y IG Sbjct: 135 E-PAHAP--------MIVVLADGNNDLDPNTGKTSQEASDHLNQAVQEAKGSGIPIYTIG 185 Query: 324 VQAEA--ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + A+ + L A + + ++ ++ L I + +I Sbjct: 186 LNADGKLNKEALAELANQTGGKSFTTSSADDLPQILSEIFASHQQLKI 233 >gi|162448865|ref|YP_001611232.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] gi|161159447|emb|CAN90752.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] Length = 656 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 78/211 (36%), Gaps = 21/211 (9%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 H + ++ K + ++ ++D S SM DK+ +A +S++ + D +K Sbjct: 280 HLVRVAVQGKRVPVKERTPVHLVYLVDTSGSMQ-----SPDKIELAKKSLKMLTDTLKPG 334 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 V +G V +++ + I + L G +T + G++ AY + + Sbjct: 335 DTVALCTYAGSVR---EVLAPTGIE-SKGKILAALADLTAGGSTAMSSGIDLAY-SLAER 389 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 H+ + +I L+DG+ + + E L A+ +G + +G Sbjct: 390 TLVKGHVNR--------VIVLSDGDANVGPTSHDEILKTIKRARDKGITLSTVGFGQGNY 441 Query: 330 DQF-LKNCA--SPDRFYSVQNSRKLHDAFLR 357 ++ A + + + + F Sbjct: 442 KDLMMEQLANQGDGNYAYIDSEAQARRVFSE 472 >gi|329893975|ref|ZP_08269994.1| BatA [gamma proteobacterium IMCC3088] gi|328923374|gb|EGG30692.1| BatA [gamma proteobacterium IMCC3088] Length = 323 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 80/213 (37%), Gaps = 26/213 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +VK K G D+M+VLD S SM G +A + + + S+ D + R Sbjct: 78 TVKEQFKGLAGRDIMLVLDTSQSMEIADIEGQSGQAMALSRLDAVKQGVMSLLDASEGNR 137 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLE 274 GL+ F + L ++ +++L G +T+ G+ YA + + D + Sbjct: 138 IGLIAFGEQSFVMSDLTAYGDTVRYMVSQLETGFAGDSTRLGDGVGYAVSLLADVDSERA 197 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----- 329 ++ +TDG ++ ++ E+ AK +Y V A+ + Sbjct: 198 -----------IVVLITDGNDTGSDLPPVEA---ARLAKALDVKLYVAAVGADVSTDREP 243 Query: 330 --DQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + L+ A + F+ + + + Sbjct: 244 IDEALLRRLAERTGGAFFRIAQVIDFDAMWQTL 276 >gi|221639828|ref|YP_002526090.1| hypothetical protein RSKD131_1729 [Rhodobacter sphaeroides KD131] gi|221160609|gb|ACM01589.1| Hypothetical Protein RSKD131_1729 [Rhodobacter sphaeroides KD131] Length = 566 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 51/288 (17%), Positives = 107/288 (37%), Gaps = 40/288 (13%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F + GSI I +L ++ ++ GL ++ F +A+L LD ++L A+ Sbjct: 13 LRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS---- 68 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 Q + + + + + L E + + +L++ Sbjct: 69 -------LTQSRSPAEVVEDYVTKAGLEDYLDE--------PVVNANTLNVR-------- 105 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +++A + Y MP +F L +V + + +++ +VLD+S SM Sbjct: 106 SVTATAAYSMPTVFM------KLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDG 159 Query: 186 GPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQTFPL--AW-GVQ 238 D+L + R+ +DI+ S D V+ +V ++ ++ L + V Sbjct: 160 TNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISISIVPYTGQVNAGADLLATYPNVS 219 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 H Q + + F ++ +T L +L + Y Sbjct: 220 HRQPYSSCVEFAASDFTTTALANGATLTGSGNSELFSSSSSTQTPTYY 267 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPDRFYSVQNSRKLHDAF 355 P + N+ + C+ A+ +G VY++ +AEA Q L+ CAS Y ++ F Sbjct: 493 DPTVKNERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGPQIRTVF 552 Query: 356 LRIGKEMVKQRI 367 I + + R+ Sbjct: 553 HSIASHITQLRL 564 >gi|118359890|ref|XP_001013183.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89294950|gb|EAR92938.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2138 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 73/200 (36%), Gaps = 31/200 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D++ V+D S SMN G +D L + ++L R L+ FS+ Sbjct: 1443 RFPIDLICVIDTSGSMN---GQPLDLLKETLLFLVDLLQT---------GDRICLIQFST 1490 Query: 226 KIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + PL ++ I+ +INRL+ T G++ A++ + + K Sbjct: 1491 NAQRLTPLLSIESKDNIKSIKNEINRLVAKGGTNICQGMQLAFDVL------KQRRYKNP 1544 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI---GVQAEAADQFLKNCA 337 + L+DG N ++ L N + + I G + + + Sbjct: 1545 ITS---VFLLSDGLNDGAENKIRDLLKQLNFYQNYNEENFTIQTFGFGKDHDPNLMDKIS 1601 Query: 338 S--PDRFYSVQNSRKLHDAF 355 FY + + ++ + F Sbjct: 1602 QLMDGNFYYIGDIHRIDECF 1621 >gi|271964249|ref|YP_003338445.1| hypothetical protein Sros_2742 [Streptosporangium roseum DSM 43021] gi|270507424|gb|ACZ85702.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021] Length = 774 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 78/223 (34%), Gaps = 39/223 (17%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 +T + + D+++VLD S SM K+ A R+ ++D + Sbjct: 276 TFALTVVPDPAPATANAKDVVLVLDRSGSMTGW------KMVAARRAAARIVDTLTGGD- 328 Query: 212 VNNVVRSGLVTFSSKIVQTFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 R +++F + Q L E + RL T+ LE A Sbjct: 329 -----RFAVLSFDGVVEQPEGLGEGLSEASDRNRYRAVEHLARLEARGGTEMLAPLEQAV 383 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + ++ + ++ +TDG+ + N++ + A+ G V+ +G Sbjct: 384 ALLSESGRD------------RVLVLVTDGQ-----VGNEDQILERIGARLAGVRVHTVG 426 Query: 324 VQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + FL A R V++ +L +A I + + Sbjct: 427 IDRAVNAGFLGRLAGLGSGRCELVESEDRLDEAMEHIHRRIGA 469 >gi|224967060|ref|NP_038620.2| matrilin-4 precursor [Mus musculus] Length = 624 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 80/196 (40%), Gaps = 26/196 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + LD++ ++D S S+ + + + +L +S+ N R G++ +SS Sbjct: 32 NGPLDLVFMIDSSRSVRPF------EFETMRQFLVGLL---RSLDVGLNATRVGVIQYSS 82 Query: 226 KIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 ++ FPL A+ + ++ I ++ T + ++YA N F E + Sbjct: 83 QVQSVFPLGAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAE---GARPSEER 139 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--- 339 + ++ +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 140 VPRVLVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRTMASPPLD 192 Query: 340 DRFYSVQNSRKLHDAF 355 + V++ L F Sbjct: 193 QHVFLVESF-DLIQEF 207 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 67/177 (37%), Gaps = 25/177 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 387 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 437 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + Sbjct: 438 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRDLN---V 492 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + TDG + + + AK G ++YA+GV ++ + + P Sbjct: 493 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEEELREIASEPS 543 >gi|14548116|sp|O89029|MATN4_MOUSE RecName: Full=Matrilin-4; Short=MAT-4; Flags: Precursor gi|3766288|emb|CAA06889.1| matrilin-4 precursor [Mus musculus] gi|22477196|gb|AAH36558.1| Matrilin 4 [Mus musculus] gi|148674433|gb|EDL06380.1| matrilin 4, isoform CRA_a [Mus musculus] gi|148674434|gb|EDL06381.1| matrilin 4, isoform CRA_a [Mus musculus] Length = 624 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 80/196 (40%), Gaps = 26/196 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + LD++ ++D S S+ + + + +L +S+ N R G++ +SS Sbjct: 32 NGPLDLVFMIDSSRSVRPF------EFETMRQFLVGLL---RSLDVGLNATRVGVIQYSS 82 Query: 226 KIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 ++ FPL A+ + ++ I ++ T + ++YA N F E + Sbjct: 83 QVQSVFPLGAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAE---GARPSEER 139 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--- 339 + ++ +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 140 VPRVLVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRTMASPPLD 192 Query: 340 DRFYSVQNSRKLHDAF 355 + V++ L F Sbjct: 193 QHVFLVESF-DLIQEF 207 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 67/177 (37%), Gaps = 25/177 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 387 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 437 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + Sbjct: 438 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRDLN---V 492 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + TDG + + + AK G ++YA+GV ++ + + P Sbjct: 493 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEEELREIASEPS 543 >gi|264679151|ref|YP_003279058.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] gi|262209664|gb|ACY33762.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] Length = 1405 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 46/288 (15%), Positives = 85/288 (29%), Gaps = 23/288 (7%) Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKN-IWQTDFRNELRENGFAQDINNIERSTSL 114 + GN D ++ N I T + L G+ D + S + Sbjct: 575 ASAGLKLLTLDGPGNADASLTLDRLADLVNNPITLTTSKGTLTLTGY--DATTGKVSYTY 632 Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---- 170 H + + +++ + T+ L Sbjct: 633 QTSGQQAHTGDDTNVQDHFQITVEDKFGGKATGDLGVLITDTAPSLKPIAESSALSSHGT 692 Query: 171 -MMMVLDVSLSMNDHFGP------GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +M+ LD S SM G + +L V S+ +LD D VR ++ F Sbjct: 693 NIMLTLDTSGSMAWSSGVNNSNGWSLSRLDVLKSSVNGLLDKYGEAGD----VRVLILEF 748 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +S Q + + +N L T L A ++ ++ Sbjct: 749 NSSATQKGSGWMSLAEAKTFVNGLYADGGTNYQDALTKAMAAWNNSG----TGKLEGNNV 804 Query: 284 KKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + F TDGE +S+ ++ + + + Y IG+ A Sbjct: 805 QNISYFFTDGEPDSNRSVSSSQQTTWEKFLADNHINSYGIGLGTGATG 852 >gi|81897704|sp|Q8BVM2|ANTRL_MOUSE RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|26346064|dbj|BAC36683.1| unnamed protein product [Mus musculus] Length = 641 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 40/188 (21%), Positives = 72/188 (38%), Gaps = 18/188 (9%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + D+ +VLD S S+ D++ + P++ R Sbjct: 66 QSGDDCQGIFDLYLVLDKSGSVADNWIHIYSFAEGLVKKFTN--------PNL----RIS 113 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++T+S++ PL ++IN+ + + GL + + A E++ G Sbjct: 114 IITYSTEAEVILPLTSD----SKEINKSLLVLKSIVPQGLTHMQKGLRKANEQIRKSTLG 169 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 II LTDG +++ +A+R GAIVY +GV + Q + P Sbjct: 170 GRIVNSVIIALTDGLLLLKPY--LDTMEEAKKARRMGAIVYTVGVFMYSKQQLVNIAGDP 227 Query: 340 DRFYSVQN 347 DR + V Sbjct: 228 DRCFGVDE 235 >gi|224004848|ref|XP_002296075.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|209586107|gb|ACI64792.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 868 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 80/227 (35%), Gaps = 40/227 (17%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 CA L + S + +D+++ LDVS SM ++KL + Sbjct: 118 CATIKARDLP-----QRDSFARSPIDIVVALDVSGSM------RVEKLDLCKE------T 160 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLE 260 + + ++++ R L++FS V P+ Q I+RL T + Sbjct: 161 LHLLLRELHHDDRFALISFSEDAVIEVPMQKVNERNKQQALHAIDRLSVKGRTNIASAVS 220 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG---- 316 A + E + + + LTDG ++ + + + + Sbjct: 221 LAAQVVNGVAEPNK---------VRSVFLLTDGNANTGYTEAIDLVKLTSIFVEANRNPH 271 Query: 317 ---AIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLR 357 ++ G E + L+ A S FYSV+++ ++ AF Sbjct: 272 TPPISLHTFGYGPEPDQKLLRGMAMATSGGSFYSVRDNSQVSSAFGD 318 >gi|116253186|ref|YP_769024.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115257834|emb|CAK08932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 797 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 51/292 (17%), Positives = 104/292 (35%), Gaps = 31/292 (10%) Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY-EMPF 137 + + D ++ +QD + R+ SL KD+ L+ + +MP Sbjct: 266 LTVDLRAGFPLGDVKSSFHAVDVSQDGDQ-ARTISLKADTVPADKDFELTWKAAPGKMPS 324 Query: 138 IFCTFPWCANSSHAPLLITSSVKI-SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 ++ +T ++ +++ V+D S SM+ GP +++ + Sbjct: 325 AGLFREVIDGKTYLLAFVTPPTAPDTAAPPAKREVVFVIDNSGSMS---GPSIEQARQSL 381 Query: 197 RSIREMLDIIKSIPDVNNVVR--SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 L+ D NV+R + + +V P + + L T+ Sbjct: 382 ALAISKLNP----DDRFNVIRFDDTMTDYFKGLVAATP--DNREKAIAYVRGLTADGGTE 435 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 P L+ A +A G + ++FLTDG I N+ LF A R Sbjct: 436 MLPALQAAL-------RNQGPVATGAL---RQVVFLTDG-----AIGNERQLFQEITANR 480 Query: 315 RGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 A V+ +G+ + F+ A R F ++ ++ ++ + ++ Sbjct: 481 SDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQVASRMGELFAKLQN 532 >gi|162456414|ref|YP_001618781.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] gi|161166996|emb|CAN98301.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] Length = 507 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 70/217 (32%), Gaps = 29/217 (13%) Query: 149 SHAPLLITSSVKISSKSD--IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + + I +++ +++++D S SM GP M+ A ++ + L Sbjct: 100 TQLGVWIDVPAARAARGQPRAPAAVVLLVDASGSMQ---GPKMENARAAAQAFVDRLPDG 156 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA-WGVQH---IQEKINRLIFGSTTKSTPGLEYA 262 + + +F+ G + I L +T GL+ A Sbjct: 157 DLVS---------VASFADTAQARVAPTVLGRSTRPAVARAIAALGPDGSTNLFAGLKLA 207 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 A + ++ ++DG+ + G + +I Sbjct: 208 EQHALAA---------PSTHAVRRVVLISDGQANIGPSSPDILGALAQRGAAHGVQITSI 258 Query: 323 GVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 GV A+ ++ L A S R Y + +R++ R Sbjct: 259 GVGADYDERTLNALAVGSSGRLYHLTEAREMSSVLER 295 >gi|87310695|ref|ZP_01092823.1| BatB [Blastopirellula marina DSM 3645] gi|87286676|gb|EAQ78582.1| BatB [Blastopirellula marina DSM 3645] Length = 364 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 42/218 (19%), Positives = 76/218 (34%), Gaps = 44/218 (20%) Query: 129 AVSRYEM-PFIFCTFPWCANSSHAPLLITSSVKIS---------------SKS-DIGLDM 171 A R+ M P P I S+ ++ S++ G+++ Sbjct: 34 AAGRFAMDPLRRRLMPSGTAMRRWGSAILVSISLTMIAIALLDVRWGKAWSETPQKGIEV 93 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 M VLDVS SM ++L A + I++ML ++ R GL+ F+ + Q+ Sbjct: 94 MFVLDVSRSMLAEDVTP-NRLQRAKQQIKDMLKVMAG-------DRVGLIAFAGETRQSV 145 Query: 232 PLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + ++ ++ + ++ + A D K I Sbjct: 146 PLTSHYEDFKQTLDSVGPHTVRSGGSRLGDAIAAASKGFIDKTYDH-----------KAI 194 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + TDGE+ Y RG V+ +G+ Sbjct: 195 VVFTDGEDQESKPVEVAKSLYA----ERGIRVFTVGLG 228 >gi|332244061|ref|XP_003271190.1| PREDICTED: collagen alpha-1(XII) chain [Nomascus leucogenys] Length = 3100 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPNKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F A + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES------AGARVGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 245 AIIITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 83/238 (34%), Gaps = 31/238 (13%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPVLSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I + + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKL 351 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESL 1364 >gi|85374478|ref|YP_458540.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] gi|84787561|gb|ABC63743.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] Length = 626 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 75/217 (34%), Gaps = 23/217 (10%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK 76 ++I A ++P + V+G ++ S + K++L D + L ++ N Sbjct: 2 MAIGAASIIP-LVGVVGGGVDASRMYLAKSRLQQACDAATLAARKELAGSSISNGTIP-- 58 Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP 136 N + T+F + + + +T + + + ++ Sbjct: 59 -ANIQDKADNFFDTNFPSGMYGTTNVGYTLSAGTATQMDGAATASVPTTLMKVFNVPQID 117 Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 C+ P +D+++VLD+S SMN + G ++ Sbjct: 118 IAV----NCSAELDLP---------------NIDVVLVLDMSGSMNSNGTTGSKRITALK 158 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 ++ D++ + VR G+V ++ + L Sbjct: 159 NAVFSFYDVVMAAKPAGTRVRIGIVPYNGAVSVGDEL 195 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 50/160 (31%), Gaps = 35/160 (21%) Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK--GHDDYKKYIIFLTDGENS--- 296 +IN L T G+ +A I A D +++IF+TDGE Sbjct: 466 SRINALSPKGGTMHDIGMIWAGRLISPDGIFAADNASAPNGDPISRHVIFMTDGEMGASP 525 Query: 297 -----SPNID-----------------------NKESLFYCNEAKRRGAIVYAIGVQAEA 328 N D N C + + +++I Sbjct: 526 SNTTAYGNYDMDGRMAGFAASGSWTENQLAAIHNLRLEAICKAIRNKNVTIWSIAFGLPH 585 Query: 329 ADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + + CA+ R + NS +L F I + + R+ Sbjct: 586 S-AYTQGCATGTSRALTAANSSELDSRFRDIAGSIAELRL 624 >gi|153876525|ref|ZP_02003802.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152067011|gb|EDN66198.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 180 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 56/154 (36%), Gaps = 26/154 (16%) Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F+ PL +Q + + + G T + A K+ + E Sbjct: 2 FADHAYLQAPLTLDNLAVQSLLQKAVIGMAGRDTAIGDAIGLAVKKLRERPEG------- 54 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------DQF 332 + +I LTDGEN++ + + L AK+ +Y IGV + + Sbjct: 55 ----SRILILLTDGENNAGAL---KPLQAAELAKQYDIRIYTIGVGGKGGMFSRGLNETE 107 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 LK A + ++ N L++ + I K + K Sbjct: 108 LKKIAQLTNGAYFPATNLGALNNVYEHIDKTLQK 141 >gi|300727143|ref|ZP_07060562.1| BatB protein [Prevotella bryantii B14] gi|299775687|gb|EFI72278.1| BatB protein [Prevotella bryantii B14] Length = 340 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 63/206 (30%), Gaps = 31/206 (15%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 + P + T +IS G++ M+ LD+S SM Sbjct: 55 PSVKFWLALGALSLAIIMLARPQMGT---RISHDKRNGIETMIALDISNSMLAEDVAP-S 110 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----R 246 +L + + ++D N + GLV F+ + P+ + + Sbjct: 111 RLTKSKMLVENLVDHF-------NNDQIGLVVFAGQSYVQLPITSDYVSAKMFLQDIQPS 163 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 LI T + H +D K II +TDGE+ ++ Sbjct: 164 LIQTQGTDIAGAINT-----------CMHAFTPNDKVGKAIIVITDGEDHEG-----GAI 207 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 A RG V+ +G+ Sbjct: 208 EAAKAAHDRGINVFILGIGDSKGAPI 233 >gi|225850253|ref|YP_002730487.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] gi|225645927|gb|ACO04113.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] Length = 304 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 73/198 (36%), Gaps = 34/198 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LDVS SM + +KL ++ +R+ L + R G++ F + + Sbjct: 86 IIIALDVSNSMKEK-----NKLKISKEILRDFL------LKRDEEDRIGILVFDNLPFRL 134 Query: 231 FPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL + I+ + T GL A N + K Sbjct: 135 MPLTSDRGALLRVISIIRPAMVDVGGTAMYDGLVEALNMFMKDRRN------------KI 182 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYS 344 II LTDG + + ++ + + + GA +Y IGV + L+ A+ + + Sbjct: 183 IILLTDGGDINSKYTLEDVVRFNQDI---GAKIYTIGVSSGMNFYVLERLSEATGGKAFF 239 Query: 345 VQN--SRKLHDAFLRIGK 360 V + L F I + Sbjct: 240 VTKDYQKALRSVFDEINR 257 >gi|149918749|ref|ZP_01907236.1| batB protein [Plesiocystis pacifica SIR-1] gi|149820350|gb|EDM79766.1| batB protein [Plesiocystis pacifica SIR-1] Length = 421 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 68/206 (33%), Gaps = 38/206 (18%) Query: 168 GLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 GLD+++ +D S SM D + ++L + R G+V F+ Sbjct: 104 GLDIVLAVDYSKSMLAEDVYPSRSERLEAELTRFID--------ESGRRGDRVGVVIFAG 155 Query: 226 KIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH- 280 ++FP+ + + + R T L+ + + + + Sbjct: 156 -AARSFPVTSDMGVLSLFLAHADPRTENPGGTAIGKALDKSIDLLVAVRRDDSGARADQV 214 Query: 281 -----------------DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + I+ LTDGE++ A++ G +Y +G Sbjct: 215 EGEGEDESGAPEAAPALSEADQVIVLLTDGEDTVG-----RPEEVAARAEQLGIRIYTVG 269 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSR 349 + +++ + ++ A ++ Sbjct: 270 IGSDSGEPIMRYDADGQPAGYATDAD 295 >gi|120599917|ref|YP_964491.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|120560010|gb|ABM25937.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] Length = 638 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 55/336 (16%), Positives = 116/336 (34%), Gaps = 51/336 (15%) Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST---SLSI 116 A I + + ++N F +I I E+ + F+ D++ ST + Sbjct: 127 AAPIASDAWYGIKQPERNRFEKQIQNGI---MVAGEIPISTFSIDVDTGSYSTLRRMIKE 183 Query: 117 IIDDQHKDYNLS-----AVSRYEMPFIFCTFPWCANSSHAPLLITSSV----------KI 161 + + Y +P P+ + AP + ++ Sbjct: 184 GSLPEKGTIRIEEMLNYFTYDYPLPNKNAA-PFSVTTELAPSPYNDDMMLLRIGLKGYEL 242 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + +++ +LDVS SM DKL + +++ + + + V+ VV +G Sbjct: 243 TKSELGASNLVFLLDVSGSMA-----SADKLPLLQTALKMLTQQLSAQDKVSIVVYAGAA 297 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +V +Q + + +L G +T + G+ AY +H +G Sbjct: 298 G----VVLDGASGDDIQALTYALEQLRAGGSTNGSQGILQAYQL------AQKHFIQGGI 347 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--S 338 + +I TDG+ + + + + K+RG + +G + DQ ++ A Sbjct: 348 NR---VILATDGDFNVGVTNFDLLISLIEKEKQRGIGLTTLGFGMDNYNDQLMEQLADKG 404 Query: 339 PDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L L I K++ Q Sbjct: 405 NGHYAYIDTLNEARKVLVDELSSTLLTIAKDVKVQI 440 >gi|289607418|emb|CBI60804.1| unnamed protein product [Sordaria macrospora] Length = 814 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 54/172 (31%), Gaps = 43/172 (25%) Query: 235 WGVQHIQEKINRLIFGSTT-------------KSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +N L+ T +T + A + D + Sbjct: 466 NNRSGFVSYLNGLVARGGTYHDIGMIWGARFLSTTGLFKSATPETNDVNDPDNPAKIRGF 525 Query: 282 DYKKYIIFLTDGENS--------------------SPNIDN--------KESLFYCNEAK 313 KKY+IF+TDG+ S SP DN + CN AK Sbjct: 526 SVKKYMIFMTDGDMSPTWSDYSAYGIEYLDGRVMGSPTTDNTALLARHLQRFRMACNAAK 585 Query: 314 RRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVK 364 +G ++ I NCAS P++ + ++ L F IG ++ Sbjct: 586 AKGIDIWVIAFSTTLTADM-TNCASKPEQAAGLSSNAALIAKFKEIGSKIAT 636 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 42/291 (14%), Positives = 90/291 (30%), Gaps = 52/291 (17%) Query: 8 NFFYNCKGS-ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 + + +G+ ++++T L+P + +MG ++ + + + + D L + Sbjct: 9 RLWSDQRGNTLALMTLALIP-LVALMGSGLDMTRAYVAQNRFRQACDAGSLAGRRMLA-- 65 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID-DQHKDY 125 G K + L +S ++ + Sbjct: 66 -----GLTLPQAARDEATKYFMFDFPQGYL-------------QSAPYTLTMSVPTAGTL 107 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--- 182 +S+ + + T P+ T S + + D+M V D+S SMN Sbjct: 108 QISSQTT-----VPTTLMGLFGFDTLPISTTCSAT---QDFVNTDIMFVFDLSGSMNCAP 159 Query: 183 ---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQ--- 229 + G M L A S + L+ +S NN+ +R G V ++S + Sbjct: 160 GVTGYCGDVEQSGSRMGALRSAATSFYDTLETAQSQLAANNLRLRYGFVNYNSTVNVGRI 219 Query: 230 --TFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 W + + + + + AY A + + Sbjct: 220 LYEKNPDWMVQSWSYQSRTPDWIDATAYFNNKSACNSAYTYDSQASQADAN 270 >gi|149371022|ref|ZP_01890617.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] gi|149355808|gb|EDM44366.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] Length = 325 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 62/180 (34%), Gaps = 32/180 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 K+ + G+D++ LDVS SM D ++K I L Sbjct: 59 TKLETVKREGVDVVFALDVSKSMLAEDIAPNRLEKSKQLITQIVNEL----------GGD 108 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEK 272 R G++ ++ P+ + +N ++ T + ++ + D ++K Sbjct: 109 RVGIIGYAGSAFPQVPITTDFASTKLFLNSMDTDMVSSQGTAISEAVQMSTTYFDDEEQK 168 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + ++DGE+ N ++ A G VYAIGV E Sbjct: 169 N-----------RVLFVISDGEDHEGNFESA-----IENATENGIKVYAIGVGTEKGGPI 212 >gi|312196063|ref|YP_004016124.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227399|gb|ADP80254.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 560 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 62/198 (31%), Gaps = 24/198 (12%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS-IPDVNNVVRSGLVTFSSKIVQ--- 229 VLD S SM GP + L A + D + + ++TF+ K+ Sbjct: 370 VLDTSGSME---GPRLAALQQALTGLTGADDSLSGRFARFRAREQVTIITFNDKVTATRQ 426 Query: 230 -----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P + ++ I + L G T L+ AY + Sbjct: 427 FTVSDPTPGSADLKAISDYGAALRAGGNTAIYSALDAAYTTAAAGMKADPSALTS----- 481 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCAS-PDR 341 I+ +TDGEN+ +D+ L N RG +A+ + S Sbjct: 482 --IVLMTDGENNRG-LDSAGFLARYNTRPPDVRGVRTFAVDFGDADRAALTQIATSTGGA 538 Query: 342 FYSVQNSR-KLHDAFLRI 358 + L D F I Sbjct: 539 VFDATAPGVSLSDVFREI 556 >gi|145497681|ref|XP_001434829.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124401957|emb|CAK67432.1| unnamed protein product [Paramecium tetraurelia] Length = 648 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 78/213 (36%), Gaps = 31/213 (14%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 I VK +S + G+D++ V+D S SM K+ +S+ ++LD + Sbjct: 210 ITKDIEQYVKNNSSIEAGIDLLCVIDKSGSMEGK------KIASVQQSLVQLLDFL---- 259 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + R L+TF + PL + ++ I + T G E A+N+I Sbjct: 260 --SEKDRLCLITFDGSAQRLTPLKTLTQDNKNYFKKAIYSIRASGQTNIAKGTEIAFNQI 317 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + K I L+DG++ + + +++ G + Sbjct: 318 ------QQRKMKNQVTS---IFLLSDGQDQGAAEYIQRQKDVVEDI----VTIHSFGYGS 364 Query: 327 EAADQFL-KNC-ASPDRFYSVQNSRKLHDAFLR 357 + + K C FY +++ + L + F Sbjct: 365 DHDAALMSKICKVGQGSFYYIEDVKLLDEFFAD 397 >gi|271963053|ref|YP_003337249.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] gi|270506228|gb|ACZ84506.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] Length = 605 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 45/223 (20%), Positives = 77/223 (34%), Gaps = 32/223 (14%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIP 210 P ++ + + +++MV+DVS SM G KL +A ++ L P Sbjct: 395 PNVLDKVLSSWADLRKPANVLMVIDVSGSMGAGVPDTGRSKLDLAKQAAINAL------P 448 Query: 211 DVNNVVRSGLVTFSSKIVQT------FPLAW----GVQHIQEKINRLIFGSTTKSTPGLE 260 + GL FS+K PL + ++ +++ L T Sbjct: 449 QFGPHDKVGLWMFSTKRDGEKDHLELAPLDTVDAAQRKTLRTRLDGLTPDGGTGLYDTAL 508 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY---CNEAKRRGA 317 AY + D A +IFLTDG+N N + E+L A+ Sbjct: 509 AAYQHVRDRHSGEAINA---------VIFLTDGKNEDNNSLSLENLLPDLRAESAEES-V 558 Query: 318 IVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRI 358 ++ I +A LK + Y + S + F + Sbjct: 559 RMFTIAYGQDADLGVLKQISETTNAAAYDSRESGSIDQVFTAV 601 >gi|313886050|ref|ZP_07819788.1| double-transmembrane N-terminal domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924580|gb|EFR35351.1| double-transmembrane N-terminal domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 342 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 63/194 (32%), Gaps = 21/194 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + P L T +SS+ IG+D+ +DVS SM D++G A Sbjct: 61 LKLIAIALLIVALARPQLYTH-APVSSQQTIGVDLAFCIDVSNSMA-ARDVKPDRIGFAK 118 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + + + + R +V F+ PL + + + + G + Sbjct: 119 QIVTHTMQELAGS-------RVAMVVFAGGAYIRLPLTPDLPTARTFLADIQPGMVSNQG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L A E+ K +I LTDGE+ ++ + K++ Sbjct: 172 TNLGQAL-------ERSAQALSAPSRAGKAVILLTDGEDHEGGLE-----EGIDRLKKQE 219 Query: 317 AIVYAIGVQAEAAD 330 Y + + Sbjct: 220 IKAYVVTIGLPDGA 233 >gi|190339201|gb|AAI63867.1| Matn4 protein [Danio rerio] Length = 944 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 80/207 (38%), Gaps = 36/207 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + + + +++D + R GLV +SS++ Sbjct: 711 IDLVLLIDGSKSVRPQ------NFELVKQFVNQVVDQLDVSA---KGTRVGLVQYSSRVR 761 Query: 229 QTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL+ + H +++I + + T + L++ F E K Sbjct: 762 TEFPLS--MYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEKN---I 816 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + TDG + + + +AK G +YA+GV D+ L+ AS Sbjct: 817 PRVGLVFTDGRSQD------DIQEWAKKAKEAGITMYAVGVGKAVEDE-LREIASDPVEK 869 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F+ + F I + ++ Sbjct: 870 HFFYSAD-------FTAISQIAENLKL 889 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 51/215 (23%), Positives = 86/215 (40%), Gaps = 35/215 (16%) Query: 153 LLITSSVKISSKSD------IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 L + ++ + KS +D++ ++D S S+ H M K M+DII Sbjct: 12 LAVMAATEARPKSGEPKCKSGPVDLVFIIDGSRSVRPHEFETMRKF---------MIDII 62 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY 263 + R G+V +SS++ F L + + + IN +I T + + YA Sbjct: 63 HELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAM 122 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N F A+E A+ + + I+ TDG + + G +YA+G Sbjct: 123 NVAFSAEE----GARPNVPHVAVIV--TDGRPQDRVAEVAAAARE------SGIEIYAVG 170 Query: 324 VQAEAADQFLKNCASP---DRFYSVQNSRKLHDAF 355 V A A L+ ASP D + V++ L F Sbjct: 171 V-ARADMTSLRAMASPPFEDHVFLVESF-DLIHQF 203 >gi|56797871|emb|CAG27569.1| matrilin-4 [Danio rerio] Length = 644 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 80/207 (38%), Gaps = 36/207 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + + + +++D + R GLV +SS++ Sbjct: 411 IDLVLLIDGSKSVRPQ------NFELVKQFVNQVVDQLDVSA---KGTRVGLVQYSSRVR 461 Query: 229 QTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL+ + H +++I + + T + L++ F E K Sbjct: 462 TEFPLS--MYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEKN---I 516 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + TDG + + + +AK G +YA+GV D+ L+ AS Sbjct: 517 PRVGLVFTDGRSQD------DIQEWAKKAKEAGITMYAVGVGKAVEDE-LREIASDPVEK 569 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F+ + F I + ++ Sbjct: 570 HFFYSAD-------FTAISQIAENLKL 589 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 48/197 (24%), Positives = 79/197 (40%), Gaps = 29/197 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D++ ++D S S+ H M K M+DII + R G+V +S Sbjct: 17 KSGPVDLVFIIDGSRSVRPHEFETMRKF---------MIDIIHELDIGLAATRIGVVQYS 67 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ F L + + + IN +I T + + YA N F A+E A+ + Sbjct: 68 SQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE----GARPNV 123 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + I+ TDG + + G +YA+GV A A L+ ASP Sbjct: 124 PHVAVIV--TDGRPQDRVAEVAAAARE------SGIEIYAVGV-ARADMTSLRAMASPPF 174 Query: 340 -DRFYSVQNSRKLHDAF 355 D + V++ L F Sbjct: 175 EDHVFLVESF-DLIHQF 190 >gi|56797863|emb|CAG27565.1| matrilin-4 [Danio rerio] Length = 944 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 80/207 (38%), Gaps = 36/207 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + + + +++D + R GLV +SS++ Sbjct: 711 IDLVLLIDGSKSVRPQ------NFELVKQFVNQVVDQLDVSA---KGTRVGLVQYSSRVR 761 Query: 229 QTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL+ + H +++I + + T + L++ F E K Sbjct: 762 TEFPLS--MYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEKN---I 816 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + TDG + + + +AK G +YA+GV D+ L+ AS Sbjct: 817 PRVGLVFTDGRSQD------DIQEWAKKAKEAGITMYAVGVGKAVEDE-LREIASDPVEK 869 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F+ + F I + ++ Sbjct: 870 HFFYSAD-------FTAISQIAENLKL 889 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 51/215 (23%), Positives = 86/215 (40%), Gaps = 35/215 (16%) Query: 153 LLITSSVKISSKSD------IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 L + ++ + KS +D++ ++D S S+ H M K M+DII Sbjct: 12 LAVMAATEARPKSGEPKCKSGPVDLVFIIDGSRSVRPHEFETMRKF---------MIDII 62 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY 263 + R G+V +SS++ F L + + + IN +I T + + YA Sbjct: 63 HELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAM 122 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N F A+E A+ + + I+ TDG + + G +YA+G Sbjct: 123 NVAFSAEE----GARPNVPHVAVIV--TDGRPQDRVAEVAAAARE------SGIEIYAVG 170 Query: 324 VQAEAADQFLKNCASP---DRFYSVQNSRKLHDAF 355 V A A L+ ASP D + V++ L F Sbjct: 171 V-ARADMTSLRAMASPPFEDHVFLVESF-DLIHQF 203 >gi|47087209|ref|NP_998714.1| matrilin-2 [Danio rerio] gi|45827653|gb|AAS78465.1| matrilin-4-like protein [Danio rerio] Length = 821 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 80/207 (38%), Gaps = 36/207 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + + + +++D + R GLV +SS++ Sbjct: 588 IDLVLLIDGSKSVRPQ------NFELVKQFVNQVVDQLDVSA---KGTRVGLVQYSSRVR 638 Query: 229 QTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL+ + H +++I + + T + L++ F E K Sbjct: 639 TEFPLS--MYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEKN---I 693 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + TDG + + + +AK G +YA+GV D+ L+ AS Sbjct: 694 PRVGLVFTDGRSQD------DIQEWAKKAKEAGITMYAVGVGKAVEDE-LREIASDPVEK 746 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F+ + F I + ++ Sbjct: 747 HFFYSAD-------FTAISQIAENLKL 766 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 51/215 (23%), Positives = 86/215 (40%), Gaps = 35/215 (16%) Query: 153 LLITSSVKISSKSD------IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 L + ++ + KS +D++ ++D S S+ H M K M+DII Sbjct: 12 LAVMAATEARPKSGEPKCKSGPVDLVFIIDGSRSVRPHEFETMRKF---------MIDII 62 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY 263 + R G+V +SS++ F L + + + IN +I T + + YA Sbjct: 63 HELDIGLAATRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAM 122 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N F A+E A+ + + I+ TDG + + G +YA+G Sbjct: 123 NVAFSAEE----GARPNVPHVAVIV--TDGRPQDRVAEVAAAARE------SGIEIYAVG 170 Query: 324 VQAEAADQFLKNCASP---DRFYSVQNSRKLHDAF 355 V A A L+ ASP D + V++ L F Sbjct: 171 V-ARADMTSLRAMASPPFEDHVFLVESF-DLIHQF 203 >gi|56797869|emb|CAG27568.1| matrilin-4 [Danio rerio] Length = 685 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 79/207 (38%), Gaps = 36/207 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + + + +++D + R GLV +SS++ Sbjct: 452 IDLVLLIDGSKSVRPQ------NFELVKQFVNQVVDQLDVSA---KGTRVGLVQYSSRVR 502 Query: 229 QTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL+ + H +++I + T + L++ F E K Sbjct: 503 TEFPLS--MYHSKDEIKKAEMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEKN---I 557 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + TDG + + + +AK G +YA+GV D+ L+ AS Sbjct: 558 PRVGLVFTDGRSQD------DIQEWAKKAKEAGITMYAVGVGKAVEDE-LREIASDPVEK 610 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F+ + F I + ++ Sbjct: 611 HFFYSAD-------FTAISQIAENLKL 630 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 48/197 (24%), Positives = 79/197 (40%), Gaps = 29/197 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D++ ++D S S+ H M K M+DII + R G+V +S Sbjct: 17 KSGPVDLVFIIDGSRSVRPHEFETMRKF---------MIDIIHELDIGLAATRIGVVQYS 67 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ F L + + + IN +I T + + YA N F A+E A+ + Sbjct: 68 SQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE----GARPNV 123 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + I+ TDG + + G +YA+GV A A L+ ASP Sbjct: 124 PHVAVIV--TDGRPQDRVAEVAAAARE------SGIEIYAVGV-ARADMTSLRAMASPPF 174 Query: 340 -DRFYSVQNSRKLHDAF 355 D + V++ L F Sbjct: 175 EDHVFLVESF-DLIHQF 190 >gi|118081959|ref|XP_417299.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Gallus gallus] Length = 955 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 80/214 (37%), Gaps = 29/214 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ N ++ FS++I Sbjct: 312 VVFVLDSSASMVG------TKLRQTKEALFTILQDLRPEDHFN------IIGFSNRIKVW 359 Query: 228 --VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P+ ++ ++ I+ + T L+ + D + A+ Sbjct: 360 QQDRLVPVTPNNIRDAKKYIHNMSPTGGTNINSALQTGAKLLNDYIAQNNIDARSVS--- 416 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASP 339 IIFLTDG + + + L +A R ++ IG+ + + L+NC Sbjct: 417 -LIIFLTDGRPTVGETQSSKILSNTKDAIRDKFCLFTIGIGNDVDYKLLERMALENCGMV 475 Query: 340 DRFYSVQNSRKLHDAFLR-IGK-EMVKQRILYNK 371 F +++ F IG + RI Y++ Sbjct: 476 RHFQEDEDAASHLKGFYDEIGTPLLSDIRIDYSE 509 >gi|192288907|ref|YP_001989512.1| hypothetical protein Rpal_0477 [Rhodopseudomonas palustris TIE-1] gi|192282656|gb|ACE99036.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 443 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 58/452 (12%), Positives = 135/452 (29%), Gaps = 114/452 (25%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK--- 62 I F + +G+I+++ A++L + +G ++ S +++KL D + + ++ Sbjct: 7 IARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVSRTSP 66 Query: 63 ---ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 +G+ D + RI L S+S ++ Sbjct: 67 AYVAAGAMSGDGVISSGADDALRIFNGNLNGLTGYTLA-------------SSSATVTKA 113 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 ++ ++ + + + + + + ++ +D ++LD S Sbjct: 114 SDVVTSQVTFSAQ-------ISTMFMKVVGMSAMTVGGTSTATASMPKYIDFYLLLDNSP 166 Query: 180 SM---------------------NDHFGPGM-------------------DKLGVATRSI 199 SM +DH ++ V + Sbjct: 167 SMGVGATPTDVSAMIAATANKSSDDHCAFACHDVNNKNNYYNLAKALGITTRIDVLRSAT 226 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQT--------FPLAWGVQHIQEKINRLIFGS 251 ++++D + +N R + F + + I+ + Sbjct: 227 QQLMDTASATATYSNQFRMAIYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVKG 286 Query: 252 TTKSTPGLEYAYNKIFDA--KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-- 307 + + +Y IF A E + +KY+ F++DG N + Sbjct: 287 QNDNNDQ-DTSYTAIFPAINNEISSPGSGVSGSPQKYLFFVSDGVADEYNPSCLKPKTGN 345 Query: 308 ---------YCNEAKRRGAIV---YAIGVQAEAAD-----------------------QF 332 C K RG + Y + + D Q Sbjct: 346 RCQSPINPALCKTLKDRGIKIAVLYTTYLNLPSNDWYKKWIAPFNAGPYGPSPNSEIAQN 405 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ CASP ++ V ++ + +A + K+ V Sbjct: 406 MEACASPGFYFEVSPTQGIAEAMNALFKKAVA 437 >gi|319956033|ref|YP_004167296.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418437|gb|ADV45547.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 335 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 80/228 (35%), Gaps = 27/228 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F + P++ + + G+ ++ LD+S SM ++L A Sbjct: 57 VLFTAVFLMIVALARPVIDKGEKVVPLQ---GITLLTALDISGSMRSK-DRYPNRLEFAK 112 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 +++ LD + + GL+ F+ P +++ ++ Sbjct: 113 VKLKQFLDALPG-------DQVGLMAFARNAFVLAPFTGDTATLKQIVD------GVNED 159 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 A + + A+E +AK + K ++ TDG D + + K RG Sbjct: 160 YINMAATDFVSLAEEAARLLAKKKE---KILVVFTDG------GDPQALKGFKEALKERG 210 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNS-RKLHDAFLRIGKEMV 363 +YA+ V + L P + +L++A +I +E Sbjct: 211 ITLYAVLVGTKKGAPVLDRRGRPMTKRDGTIAITQLNEALGKIARETG 258 >gi|320105612|ref|YP_004181202.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924133|gb|ADV81208.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 335 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 93/264 (35%), Gaps = 29/264 (10%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD 170 + + +I + + + F+ +K + ++++ L Sbjct: 55 AVAPTITTFAREVSQIFTVTDKNG-RFVTGLRQQDFGLLDDNRPPERVIKFTQQTNLPLR 113 Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S S+ F D AT ++L + V + F K Sbjct: 114 VGVLMDTSGSIRQRFQFEQD---AATEFFLQVLHRGDAA-----FV----MGFDVKTDLA 161 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 V + + I++L G T ++ ++ K + ++ +I + Sbjct: 162 QDYTNSVDLLNQAIHKLRPGGGTAF-------FDALYTTCRDQMLTLKESNTVRRALIVV 214 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA----EAADQFLKNC--ASPDRFYS 344 +DG ++ +++ C A+ IVY I +AAD+ L+ A+ R + Sbjct: 215 SDGHDNQSRAQENDAIKMCQRAET---IVYTISTNISPTKDAADEVLRRIADATGGRVFF 271 Query: 345 VQNSRKLHDAFLRIGKEMVKQRIL 368 + + F I +E+ Q +L Sbjct: 272 PNRIEDVANGFHSIEEELRSQYLL 295 >gi|150251390|gb|ABR68007.1| matrilin-like 40 kDa protein [Lehmannia valentiana] Length = 390 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 76/196 (38%), Gaps = 20/196 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++VLD S SM D L A+R + L N+ R G + FS + Sbjct: 209 DIVLVLDSSGSMEDKNNELQ--LNFASRFVSHFL-------VGNSKARFGALLFSDFVEN 259 Query: 230 TFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + I R + T T A++ I G + + Sbjct: 260 LFYLNKYTSTADVSKAILRAPYHRGTTLT---NEAFDFIRTEG-VFSTPKGGRSNAPDIV 315 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYSVQ 346 + TDG+++ P + +L + KR+ + A+G+ E + + L+ AS D + Sbjct: 316 VVFTDGQSTKPAL----TLAAADNLKRQNVRIVAVGIGNEVSKEELRQVASSRDDVFEAS 371 Query: 347 NSRKLHDAFLRIGKEM 362 + L ++ K + Sbjct: 372 SFENLDYIEQKLAKNV 387 >gi|118086119|ref|XP_426008.2| PREDICTED: similar to alpha 3 type VI collagen [Gallus gallus] Length = 2533 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 43/208 (20%), Positives = 89/208 (42%), Gaps = 23/208 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S+ D++ ++D S S++++ M M D++ S + VR Sbjct: 1199 ESSKPVCSNHVADLIFLIDGSESISENSFSTMKTF---------MKDVVDSFDISRDKVR 1249 Query: 218 SGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G+V +S + + F L + I+E+INR+ S+T + GL + +F++ Sbjct: 1250 LGVVQYSQEPQREFYLNEFYSDTIIKEQINRIEQLRSSTFTGKGLRF-VQSLFES----A 1304 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + + + + ++ +TDG ++ D +L + G V+A+GV + + L+ Sbjct: 1305 NGGRKNQGVSQNLVVITDGYSADSVDDAAMAL------RSNGIHVFAVGVGIVNSFELLR 1358 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEM 362 R ++V+N L I E+ Sbjct: 1359 IAGDARRVFTVENFNALKTIKSTIINEI 1386 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 42/260 (16%), Positives = 91/260 (35%), Gaps = 41/260 (15%) Query: 89 QTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS 148 +LR+ G I+ + + + K+ F F + Sbjct: 764 VNQAARDLRQQGIVIYAIGIKDAVQQELEEIAETKNRM----------FFVNDFDSLKHI 813 Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 H ++ + ++ D++ ++D S S+ + ++ + + Sbjct: 814 KHE--IVQEVCSTNVCKNVRADIVFLVDSSNSIRAA------EFQKIKDFMQSFVIKVDV 865 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D VR GL+ FSS+I + F L VQ +++ ++ G+ T +Y Sbjct: 866 GLDN---VRIGLIQFSSEIREEFQLDRYSTIADVQRAIQEMQQIKLGTLTGKALTFAASY 922 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + G + K+Y+I +TDGE ++ + +G +YAI Sbjct: 923 ---------FDRPKGGRPELKQYLIVITDGEAQDSVKSPARAI------RDKGITIYAID 967 Query: 324 VQAEAADQFLKNCASPDRFY 343 + Q ++ + D+ + Sbjct: 968 MLQANNSQLVEITGAQDKVF 987 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 69/189 (36%), Gaps = 22/189 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 L ++ SS + D++ ++D S + + + ++++ + P Sbjct: 426 DLDSSAPAVCSSATVA--DIVFLVDESSKIGSK------NFQLIRAFLLKIVNALDIAPS 477 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 VR GLV +S++ F L I + L + T A + + Sbjct: 478 N---VRVGLVLYSNEPRLEFTLDTFKDKLEILNYLKNLPYRGGQAYTGI---AIEFLRNK 531 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 E ++ ++ + +TDG++ + + ++ +R VYA+GV+ Sbjct: 532 VFTQEAGSRKKQGVQQIAVVITDGQSLD------DYIEPASKLRRESVTVYAVGVKNITE 585 Query: 330 DQFLKNCAS 338 L A+ Sbjct: 586 GSKLDKIAT 594 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/273 (15%), Positives = 99/273 (36%), Gaps = 38/273 (13%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 +A + NI + L I +++ + ++ I + ++ Sbjct: 576 YAVGVKNITEGSKLDKIATYPPRNHVTTLKYFLQLSNIRWKIKKQLCNEIVTKTFVVPLQ 635 Query: 161 ISSKSDIGLD-----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S +D + ++D S S+ + + M ++I+ N Sbjct: 636 SRSLKKGCMDTEEADIYFLIDGSGSI---YPSDFKDMKTF------MNEVIRIFQLGANN 686 Query: 216 VRSGLVTFSSK------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 VR G+V ++S+ I Q + + I+ IN++ G T++ L + A Sbjct: 687 VRFGVVQYASESKTEIIIGQHSQMMRLTEAIEN-INQI--GGGTRTGNALRSMKSLFQMA 743 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 ++ + +I +TDG++ L +++G ++YAIG++ Sbjct: 744 YR---------ENVPQILIVITDGKSEDKVNQAARDL------RQQGIVIYAIGIKDAVQ 788 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + + + +R + V + L I +E+ Sbjct: 789 QELEEIAETKNRMFFVNDFDSLKHIKHEIVQEV 821 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 72/187 (38%), Gaps = 24/187 (12%) Query: 170 DMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ ++F + L S+ LD+I R GL +S + Sbjct: 46 DVVFIVDTSTSIAQENFQKVKNFLSSLVSSLDIGLDMI----------RVGLAQYSDEAY 95 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 Q F L + ++I L + ++ G + E AKG+ + Sbjct: 96 QVFLLNQYLLKSDVLDQIGNLPYRGG-ETYTGRALDFVSTRYFTESAGSRAKGYVP--QL 152 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 + +T GE++ E + + RG +Y +G+ + + + + P YS Sbjct: 153 AVLITSGESND------EVEQPAKKLRYRGISIYVVGIGIQNTTELQQIASKPFRRYLYS 206 Query: 345 VQNSRKL 351 + + L Sbjct: 207 IGSFDDL 213 >gi|323136279|ref|ZP_08071361.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398353|gb|EFY00873.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 577 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 58/208 (27%), Gaps = 64/208 (30%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 ++ L I KI +L T G+ +A+ I A Sbjct: 368 NAATQTALQLTPTQSTITAKIAQLTAAGDTNLHEGVMWAWRSISPNPPFSAGSAYNTAGV 427 Query: 284 KKYIIFLT-----------------------------------DGENSSPN--------- 299 +K ++ +T DG + Sbjct: 428 RKILVLMTDGYNNWTSNTNTVGGSYYEALGYYSYNGAKNRRLPDGTQGNGVDYQSQLDGA 487 Query: 300 -------------IDNKESLFYCNEAKRRGAIVYAIGVQA------EAADQFLKNCAS-P 339 ++ + C AK +G +Y+I A LK+CA+ Sbjct: 488 ANSWTDYKSVSRQAQDELTRQSCENAKAKGIEIYSIAFSVSTNPIDAAGINLLKSCATNA 547 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 D + +S ++ AF +I + K R+ Sbjct: 548 DHYLLATDSTQIDRAFSQIAMNLSKLRL 575 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 42/307 (13%), Positives = 104/307 (33%), Gaps = 41/307 (13%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F + +G++ +L + + + ++ G ++ + L D + L A++I Sbjct: 5 VNEFVSDHRGNVVMLFGLSVIPVMMMAGAAVDYARGVTTHKVLQQGADTAALAVASRITA 64 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + KQ +N + +T + I K + Sbjct: 65 ATSTADAIKQA---------------------QNVLRSASQRLAAATISNATISADRKTF 103 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + A + +P + + + A + S+ + + + LD S SMN+ Sbjct: 104 CIDA--QVSIPTMIMKIARIDSMAPAVMSCAEIGGGSTNYE----IALALDNSGSMNESA 157 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-------KIVQTFPLAW-GV 237 G G K+ + ++ + + V+ + F+ + L W V Sbjct: 158 G-GATKIQSLKTAATNFVNSMFAKSPGK--VKIAITPFAGLVIPVDPTVAANRALPWIDV 214 Query: 238 QHIQEKINRLIFGSTTKS-TPGLEYAYNKIFDAKEKLEHIAKGH-DDYKKYIIFLTDGEN 295 + + + + FG + G +N + K + G + + Y + +T+ Sbjct: 215 NGLSSQ-HWITFGGKANANAAGFTSRFNVFSNLKSQRADWDFGGCYEPQPYPMNVTETAP 273 Query: 296 SSPNIDN 302 ++ N + Sbjct: 274 TAGNAET 280 >gi|303246180|ref|ZP_07332461.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302492576|gb|EFL52447.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 329 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 68/195 (34%), Gaps = 22/195 (11%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 WC + A + + + G+D+M+ +D+S SM P D+ + + Sbjct: 61 WCVILALAGPRLVTETSVYR--GRGVDIMLAVDLSESMAALDMPLPDR---TVSRLEAVA 115 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLE 260 R GLV F S+ P + + ++RL G T + Sbjct: 116 QAAARFAADRPGDRIGLVAFGSRAYVVLPPTDDRAALTQALSRLSVGAAGRRTAMGDAVG 175 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A ++ A + ++ DG +++ + E+ A RG V+ Sbjct: 176 LAVKQLDRAPGLA-----------RLVVVFGDGLSNAGEVRPVEAAKA---AAARGIAVF 221 Query: 321 AIGVQAEAADQFLKN 335 +GV + FL N Sbjct: 222 TVGVGGDGPAPFLVN 236 >gi|262165253|ref|ZP_06032990.1| protein BatA [Vibrio mimicus VM223] gi|262024969|gb|EEY43637.1| protein BatA [Vibrio mimicus VM223] Length = 335 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 77/240 (32%), Gaps = 37/240 (15%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-----HFGPGMDKL 192 I W + ++ ++ G D++MV+D+S SM + G + +L Sbjct: 66 IALIVSWLLIVTALTKPS-VLGEVQTREAFGRDVLMVVDLSGSMEEKDFATEAGEQLSRL 124 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIF 249 A + +R+ + R GL+ F P ++ ++ + Sbjct: 125 TAAKKVLRDFVTQ-------RQGDRFGLILFGDAAFIQTPFTADQDVWLNLLDEAETGMA 177 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T + + + ++ LTDG ++ + ++ Sbjct: 178 GQSTNLGDAIGLGIKVFEQS---------PSTSQDQIMLVLTDGNDTGSFVSPVDAAKIA 228 Query: 310 NEAKRRGAIVYAIGVQAEAAD-------QFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 R +Y I + + +S R + + +L++A+ I + Sbjct: 229 AAKGIR---IYVIAMGDPENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVIDQ 285 >gi|260466792|ref|ZP_05812977.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259029404|gb|EEW30695.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 492 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 47/170 (27%), Gaps = 36/170 (21%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI--------AKGHDD 282 PL + + I+ T +++ Y + Sbjct: 318 IPLTADSDALLDSIDDFRAAGYTAGAIAIQWTYYMLSPQWRAAIKNVGLGNGASDANAKK 377 Query: 283 YKKYIIFLTDGE------------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA---- 326 K I +TDG+ N ++ + C K G ++ IG Sbjct: 378 IAKVAILMTDGQFNTAFAGAGGSYNGQGDLARGNAEALCGNMKNDGIEIFTIGFDLNDKD 437 Query: 327 ------EAADQFLKNCASPD------RFYSVQNSRKLHDAFLRIGKEMVK 364 + A LK C+S D ++ +L AF I + K Sbjct: 438 MSATERDQAKAVLKGCSSKDASAAERHYFEASTGAELDAAFQEIIRNTEK 487 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 33/221 (14%), Positives = 76/221 (34%), Gaps = 27/221 (12%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + G+ ++L V+ + G + S + K+ L ++D ++ TA + Sbjct: 14 FARDRGGNFAVLFGFAASVLALAAGFSVNISQLYNAKSSLQGVVDAAVTSTARDLTTG-- 71 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 K + +K + + + I T+ + + Sbjct: 72 -----VIKEADADNSVKAFLVANSAAGILQPDQVVLDKLIVDKTAKT-----------VQ 115 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 A ++ F F + S+ + S + + MM LD++ SM Sbjct: 116 ANVHVDVALYFPLF----GIGDMQRVAASTTALYSDKTVEVAMM--LDITGSMAKRGN-- 167 Query: 189 MDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIV 228 +DK+G + R + +++ +R +V ++S + Sbjct: 168 VDKIGDLRAAARNAVQTMLQNQDPKRPRIRVAIVPYASGVN 208 >gi|256393600|ref|YP_003115164.1| Vault protein inter-alpha-trypsin domain-containing protein [Catenulispora acidiphila DSM 44928] gi|256359826|gb|ACU73323.1| Vault protein inter-alpha-trypsin domain protein [Catenulispora acidiphila DSM 44928] Length = 1033 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 72/207 (34%), Gaps = 37/207 (17%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D+ ++LD S SM K+ A R+ ++D + + R ++TF ++ Sbjct: 316 PRDVALILDRSGSMGGW------KMTAARRAAARIVDTLTAED------RFAVLTFDDQM 363 Query: 228 VQTFPLAWGVQHIQ--------EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L G+ + + + T+ P L A + D Sbjct: 364 ETPDGLPTGLSEATDRHRFRAVQHLATVDARGGTEMEPPLRRAATLLSDDNPDR------ 417 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + +I +TDG+ + N++ L K V+ +G+ FL+ ++ Sbjct: 418 ----DRVLILITDGQ-----VGNEDRLLTTLSPKLTHIRVHTVGIDTAVNAAFLQRLSTL 468 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVK 364 V++ +L DA I + Sbjct: 469 GGGHCELVESEDRLDDAMDAIHHRIAT 495 >gi|189230272|ref|NP_001121460.1| anthrax toxin receptor 2 [Xenopus (Silurana) tropicalis] gi|183985706|gb|AAI66225.1| LOC100158556 protein [Xenopus (Silurana) tropicalis] Length = 488 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 75/209 (35%), Gaps = 27/209 (12%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ +LD S S+ ++ + + T V+ +R + FS+ Sbjct: 38 QGAFDLYFILDKSGSVASNWVEIYEFVEKLTERF------------VSPRMRLSFIVFST 85 Query: 226 KIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I + + L I T G + A +I A G Sbjct: 86 QAKIILPLTGDRYEITKGLKDLSSVIPAGETYMHEGFKLANEQIVKA---------GGKS 136 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + N A+ RGA VY +GV DQ + A+P+ Sbjct: 137 TASVIIALTDGK--LADQIPVLTEKEANIARGRGARVYCVGVLDFNFDQLKRIAAAPENV 194 Query: 343 YSVQNS-RKLHDAFLRIGKEMVKQRILYN 370 + V+ + L I ++ + + + Sbjct: 195 FRVEGGFKDLGLIINSILEKSCTEILYVD 223 >gi|326505132|dbj|BAK02953.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 521 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 71/206 (34%), Gaps = 35/206 (16%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + ++ + S+ GLD+++VLDVS SM G + + A + + L Sbjct: 34 ALTADEVTAVVELNATSSTAVREGLDLVVVLDVSGSMR---GEKLQSMKRAMQFVIMKLT 90 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLE 260 + R +V+FSS + PL Q Q + ++ L+ T GL+ Sbjct: 91 PVD---------RLSVVSFSSSATRHCPLRSVTQAAQAELKGIVDGLVANGGTNIKAGLD 141 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A I + ++DG+ + + + VY Sbjct: 142 TALAVIAGRATTKARTPN--------VFLMSDGQQTDGDARQVDPGNVA---------VY 184 Query: 321 AIGVQAEAADQFLKNCA--SPDRFYS 344 G +A L + A SP ++ Sbjct: 185 TFGFGKDADHALLSDVAKKSPGGTFN 210 >gi|156383644|ref|XP_001632943.1| predicted protein [Nematostella vectensis] gi|156220006|gb|EDO40880.1| predicted protein [Nematostella vectensis] Length = 982 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 83/214 (38%), Gaps = 24/214 (11%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T+ V +S I +D+ MVLD S S+ + + + + + Sbjct: 70 TAPVAATS-CPIPIDLAMVLDSSGSIGKKDWVKLLEFTKSV---------VDAYSVSEEA 119 Query: 216 VRSGLVTFSSKI------VQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFD 268 G++T+S++ + + ++++ I+ + + T LE A + +F Sbjct: 120 THVGVITYSTEATLDIAFDKYSGVEMNSVNLKKDIDIIPQKNNLTFMDKALELANSVLFT 179 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + K + +FLTDG + + + K RG VY +GV + Sbjct: 180 EARGMRPNKK------QVCLFLTDGIQTFDQGPYTKPSIVSQKLKDRGIDVYTVGVGDDV 233 Query: 329 ADQFLKNCASPDRF-YSVQNSRKLHDAFLRIGKE 361 L + +S D++ YS +N +L I +E Sbjct: 234 DLFELLSISSGDKYTYSAKNFDELQAKVQEILQE 267 >gi|194215132|ref|XP_001499245.2| PREDICTED: similar to Collagen alpha-1(XXII) chain [Equus caballus] Length = 1632 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 45/206 (21%), Positives = 75/206 (36%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + + R G+V +S + Sbjct: 47 DLVFLLDTSSSV------GKEDFEKVQQWVANLVDTFEV---GADHTRVGVVRYSDQPTT 97 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + ++ RL + G T + L Y F G +K+ Sbjct: 98 AFELGHFRSREAVKAAARRLAYHGGNTNTGDALRYITRHSFSP---QAGGRPGDRAFKQV 154 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D A R G ++A+GV EA + L+ AS + Sbjct: 155 AILLTDGRSQDLVLD------AAATAHRAGIRIFAVGVG-EALKEELEEIASEPKSAHIF 207 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 208 HVSD-------FNAIDKIRGKLRRRL 226 >gi|311271865|ref|XP_001927121.2| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A domain-containing protein 2 [Sus scrofa] Length = 769 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 45/232 (19%), Positives = 83/232 (35%), Gaps = 30/232 (12%) Query: 135 MPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLD----MMM---VLDVSLSMNDHF 185 MP C + P L + +S + + MM +D+ ++ Sbjct: 1 MPSFPLLEAICIFLFAGAPPSLSLQEIHVSKEIVRKISAASKMMWCSAAVDILFLIDGSH 60 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEK 243 G + + + + P VR G V F S FPL A+ Q ++ + Sbjct: 61 SIGKGSFERSKHFAITVCEALDVDPAR---VRVGAVQFGSTPRLEFPLDAFSTQQEVKAE 117 Query: 244 INRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 I R+ F T++ L+Y K F + + ++ +TDG + + Sbjct: 118 IRRMAFKGGRTETGLALKYLLRKGFPGGR--------NASVPQVLLIVTDGRSQGHVAEP 169 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 + K+R V+A+GV+ ++ L AS V + ++ DA Sbjct: 170 ------AEQLKQRDVTVFAVGVRFPRWEE-LHILASEPTEQHVLMAEQVEDA 214 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 49/129 (37%), Gaps = 16/129 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ ++ RS D+ +V++ GLV + ++ Sbjct: 529 PLDLVFMLDASASVG---PENFARMQSFLRSCTLQFDVNP------DVMQMGLVVYGGQV 579 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + +++ + S A I+D ++ A+ K Sbjct: 580 QTAFGLDTHTTRATVLRALSQAPYLGGAGSAG---TALLHIYDKVMTVQMGARPGVP--K 634 Query: 286 YIIFLTDGE 294 +I +T G+ Sbjct: 635 VVIVVTGGQ 643 >gi|1706571|sp|P54281|ECLC_BOVIN RecName: Full=Epithelial chloride channel protein; AltName: Full=Calcium-activated chloride channel gi|1184066|gb|AAC48511.1| calcium-activated chloride channel [Bos taurus] Length = 903 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 43/192 (22%), Positives = 69/192 (35%), Gaps = 34/192 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + G+VTF S Sbjct: 309 VCLVLDKSGSMSSE-----DRLFRMNQAAELFL-----IQIIEKGSLVGMVTFDSVAEIR 358 Query: 231 FPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L V T GL+ + I +++ Sbjct: 359 NNLTKITDDNVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQSTSGSE--------- 409 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 II LTDGE++ + C E K+ G I++ I + AA + L + RFY Sbjct: 410 IILLTDGEDNE--------IHSCIEEVKQSGVIIHTIALGPSAAKELETLSDMTGGHRFY 461 Query: 344 SVQNSRKLHDAF 355 + ++ L +AF Sbjct: 462 ANKDINGLTNAF 473 >gi|126341670|ref|XP_001379945.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2439 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 74/196 (37%), Gaps = 27/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D ++D S S+N D M+++I + + VR G+V +S Sbjct: 623 DFYFLIDGSGSINH------DDFAEMKTF---MIELISTFRVGADHVRFGVVQYSDSPTV 673 Query: 230 TFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + V ++ I ++ G T++ L + + ++ Sbjct: 674 EFDIRQHSSVAQLKSAITKIWQTGGGTRTGEALTFMKRLFSEVARDK---------VLRF 724 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I +TDG++ + E ++ +YAIGV++ + L+ S +R + V Sbjct: 725 LIVITDGQSQD------QVAQAAEELRQENITIYAIGVKSAVTKELLEISGSQNRMFFVN 778 Query: 347 NSRKLHDAFLRIGKEM 362 + L + +++ Sbjct: 779 DFDSLKPIQQEVIQDI 794 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 69/172 (40%), Gaps = 23/172 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + ++ ++ + VR GL F+ I + Sbjct: 17 DLVFLVDSSTSIGPE------NFQKVKSFLYSLVLGLEI---GRDQVRVGLAQFNDNIYK 67 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + E+I L + T++ L + + F E AK + + Sbjct: 68 AFLLNQFPRKSDVLEQILSLPYRTGGTRTGSALNFLRTEFFT--ESAGSRAKDNVP--QI 123 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 +I +TDGE++ E ++ K +G +Y +G+ + + LK AS Sbjct: 124 VILVTDGESND------EVAEAASKLKGQGVSIYVVGINVQDVQE-LKTIAS 168 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 70/177 (39%), Gaps = 23/177 (12%) Query: 172 MMVLDVSLSMND-------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + S S+ D G++ +A +R+++ + P+ VR GLV +S Sbjct: 404 TLAVCTSASLADIVFLVEASSRIGLENFQLAVELLRKIIHTLIIGPNK---VRVGLVLYS 460 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L I +N+L F G TK+ L++ N +F + ++ Sbjct: 461 DEPRLEFGLNTFLSQSEILSHLNKLPFIGGKTKTGAALDFLRNTVF----TQQKGSRYRQ 516 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 ++ + +T+G + L +R G V+A+G + + L AS Sbjct: 517 GVQQLAVVITEGYSQDEVDRPASLL------RRAGVTVFAVGTLKASGSRDLNKIAS 567 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 59/177 (33%), Gaps = 13/177 (7%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ ++ + +M++ P+ V+ GL+ FSS + F L Sbjct: 806 DILFLVDGSERINTRDFDKMKEFMMQMVNKSDLGPEK---VQIGLLQFSSNPQEEFRL-- 860 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 K++ L + + + + E G + +Y+I + G+ Sbjct: 861 --NTYYSKVDILRAITGMVQIRAGARVGSALSFSLPYFERSRGGRLNVPQYLIIIISGKT 918 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLH 352 + +G ++AIGV Q L+ + D+ Y +N L Sbjct: 919 GDAV------KMPAKALRDKGIKIFAIGVHKANNSQLLEITGAQDKVYYEENFDSLL 969 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 75/197 (38%), Gaps = 23/197 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ ++D S S ++++++ + VR G+ +S+ Sbjct: 1185 IDLVFLIDGSSS------IHPRNFTAMKTFMKQIVNSFTI---GKDRVRIGVAQYSTNPQ 1235 Query: 229 QTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + F L + I + I+++ + T + GL + + A + + + Sbjct: 1236 KEFYLNTFYSGAEINQHIDKITQLRTQTYTGKGLRFVKSFFEPANGSRK-----NLHVLQ 1290 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ +TDG ++ + N+ + +++IG+ + + R + V Sbjct: 1291 SLVVITDGMSND------SVVEAANDLRNEKIQIFSIGIGVINLFELQLIAGNVKRVFVV 1344 Query: 346 QNSRKLHDAFLRIGKEM 362 + +L ++ +E+ Sbjct: 1345 GDFGQLGSIERKVVREL 1361 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 53/145 (36%), Gaps = 12/145 (8%) Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFD 268 + VR G +++S F L I++ + L + L++A + + Sbjct: 1026 GKDNVRFGAISYSDNSEVLFSLDTYITKAQIRDAVFHLKPKVGKAHTATALKFAKERFSE 1085 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-CNEAKRRGAIVYAIGVQAE 327 H + + ++ +T N + K+ L + G V+AIG++ Sbjct: 1086 M-----HGGRQSLAVTQILVLIT---NKPTESEEKKYLQESAQTLQEAGIDVFAIGIKNV 1137 Query: 328 AADQFLKNCASPDRFYSVQNSRKLH 352 + DR + VQ+ +L+ Sbjct: 1138 KRPELQAITKHRDRSFMVQSYNELY 1162 >gi|45361321|ref|NP_989238.1| matrilin 2 [Xenopus (Silurana) tropicalis] gi|39645939|gb|AAH63920.1| matrilin 2 [Xenopus (Silurana) tropicalis] Length = 839 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 74/204 (36%), Gaps = 26/204 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ + LD++ ++D S S+ P + ML + PD R GL+ Sbjct: 44 SACHNKPLDLVFIIDSSRSVR----PP--DFEKVKEFLITMLKFLDIGPD---TTRVGLL 94 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L + I+ + R+ + T + ++YA N F E + Sbjct: 95 QYGSTVKNEFSLKTYKKKMDIERAVKRMMHLATGTMTGLAIQYAMNIAFSESEGARPL-- 152 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + + +TDG P +A+ G +++A+GV + Sbjct: 153 -NQHVPRIAMIVTDGRPQDPVA------EIAAKARNSGILIFAVGVGRVDMSTLKTIGSE 205 Query: 339 P--DRFYSVQNSRK---LHDAFLR 357 P + + V N + L F Sbjct: 206 PHTEHVFLVANFSQIETLTTVFQN 229 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 45/203 (22%), Positives = 81/203 (39%), Gaps = 27/203 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ V+D S S+ ++ + + + +LD ++ GL+ +S Sbjct: 566 GEGPVDLVFVIDGSKSLGEN------NFEIVKQFVNGILDSLEISQ---KAAHVGLIQYS 616 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + I F +A V+ +I + GS T L+ + K F + A G Sbjct: 617 THIRTEFTMAQYSSAKDVKKAVSQIKYMGRGSMT--GLALKLMHEKSFSEVQGARPRAMG 674 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG E Y +AK+ G +YAIGV +A D+ L+ AS Sbjct: 675 ---VPRVAIVFTDGRAQD------EVSEYAKKAKQSGITIYAIGVG-KAIDEELQEIASA 724 Query: 340 DRFYSVQNSRKLHDAFLRIGKEM 362 + V + A I +++ Sbjct: 725 PQEKHVIYAED-FSAMGYITEKL 746 >gi|326677363|ref|XP_691404.5| PREDICTED: collagen alpha-1(XII) chain-like [Danio rerio] Length = 3085 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 48/265 (18%), Positives = 104/265 (39%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 TS+++ +Y +S +F + + + T +K+S + +G+ Sbjct: 364 TQTSVNVRDLSPDTEYEIS---------LFALKGLTPSEAVMAMEKTQPLKVSLECSLGV 414 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ D+ L ++ + G+ + +++ P+ V+ LV +S Sbjct: 415 DVQA--DIVLLVDGSYSIGITNFAKVRAFLEVLVNTFDIGPNK---VQISLVQYSRDPYT 469 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + + + G +T + + Y +IF A ++ + Sbjct: 470 EFYLNTHHDLNAVVKAVRTFPYRGGSTNTGKAMTYVRERIFIATRGAR------ENVPRV 523 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 I +TDG++S D + + ++A+GV+ +A L+ A+P Y Sbjct: 524 TILITDGKSSDAFKDP------AAKLRNTDVEIFAVGVK-DAVRSELEAIANPPAETHVY 576 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 577 TVED----FDAFQRISNELTQSICL 597 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 47/299 (15%), Positives = 101/299 (33%), Gaps = 35/299 (11%) Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII 118 A + + + Q S K + + + ++ D + + TS + Sbjct: 7 AAVALFTLAALTSVRAQVMPPSDLRFKILNENTVQMTWKQPLSRVDGFRV-QVTSDTEEP 65 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA----PLLITSSVKISSKSDIGLDMMMV 174 + S + +S+ + I+ + S I D++ + Sbjct: 66 VKEFTLSPTSTKTSIR-DLTPDVDYVVTITSYLGSEESIPISGQITKCSIGAIA-DLVFL 123 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQTFPL 233 +D S S+ + IR + + D+ R G+V +S+ F L Sbjct: 124 VDGSWSVGRE----------NFKFIRNFIAALAGAFDLGEDKTRVGVVQYSTDTRTEFNL 173 Query: 234 A--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + IN L + G T + L+Y +F KG + + + + Sbjct: 174 NQHFRRVDLLRAINNLPYKGGNTMTGEALDYLLKNMFTE---AAGARKG---FPRVAVVI 227 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN 347 TDG++ P Y + K G ++ +G++ ++ + ++P Y+V N Sbjct: 228 TDGKSQDPVEG------YAKKLKNAGVELFTLGIKEADEEELKQMSSTPYRTHVYTVPN 280 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 74/205 (36%), Gaps = 24/205 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D+++++D S S+ I M+ + PD V+ GL + Sbjct: 1175 KTSAKADIVLLVDGSWSIGRL------NFKTIRAFIGRMVGVFDIGPDK---VQIGLAQY 1225 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + L + + + L + + G+ A N I + + Sbjct: 1226 SGDPKTEWHLNAHPTRASLLDAVANLPYKGG-NTMTGM--ALNYILQNNFRPNVGMRPDS 1282 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD- 340 +K + +TDG++ + N + L + G +YAIGV+ ++ PD Sbjct: 1283 --RKIGVLVTDGKSQDEIVVNSQRL------RDSGIELYAIGVKNADENELRSIATDPDE 1334 Query: 341 -RFYSVQNSRKLHDAFLRIGKEMVK 364 Y+V + L D + + + Sbjct: 1335 IHMYNVNDFSFLLDIVDDLTENLCN 1359 >gi|254486311|ref|ZP_05099516.1| conserved hypothetical protein [Roseobacter sp. GAI101] gi|214043180|gb|EEB83818.1| conserved hypothetical protein [Roseobacter sp. GAI101] Length = 476 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 64/199 (32%), Gaps = 64/199 (32%) Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK------- 284 P++ ++ +I+ + T G+++ + + + + Sbjct: 277 PISQDGPSLKAQIDLFQPRAGTAIYMGMKWGTALLDPSFRETTASLVSDSVVESTFADRP 336 Query: 285 ---------KYIIFLTDGEN---------------------------------------- 295 K I+ +TDG+N Sbjct: 337 ADYSDRETLKTIVLMTDGQNSNSQRISTAYYNSSSEVVHWSKWNFNYYLSQYIKEKDWHR 396 Query: 296 ---SSPNIDNKESLF--YCNEAKRRGAIVYAIGVQA-EAADQFLKNCAS-PDRFYSVQNS 348 + + +L C+ AK G +++ IG + + +K CAS P F+ V+ Sbjct: 397 YYYTRYTAEKGNTLMDNICSAAKDEGIVIWTIGFEVNDTGADVMKKCASSPSHFFRVEGV 456 Query: 349 RKLHDAFLRIGKEMVKQRI 367 +L DAF I ++ + R+ Sbjct: 457 -ELTDAFSAIASQINQLRL 474 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 83/225 (36%), Gaps = 40/225 (17%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 + + F + GSI+ILT ++ ++ +V G+ ++ + KL D A Sbjct: 17 VRLGRFARHEDGSITILTIFIIIMMVMVGGIQLDFMRHEMERTKLQAAADR-----AVLA 71 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 + ++ + ++ + + + I N ++++ +++ K Sbjct: 72 AADLDQTLAPA-------DVVDEYFA---KSGMSDYLSSVTIENGLNFRTVTVKANNEMK 121 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + R+ P + S + + +++ +VLDVS SM + Sbjct: 122 T---QFLGRFGFP-------------TLDVPALSKAEERVEK---VEISLVLDVSGSMKN 162 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + KL + + +D + P+ N V L+ +S ++ Sbjct: 163 N-----SKLTTMKDAAKTFIDTV-LRPETKNNVSLSLIPYSEQVN 201 >gi|327542236|gb|EGF28725.1| BatB protein [Rhodopirellula baltica WH47] Length = 700 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 69/165 (41%), Gaps = 29/165 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G++ + VLDVS SM ++LG A + I++M+D + R GLV F+ Sbjct: 77 QRGIEAVFVLDVSRSMLAEDVSP-NRLGRAKQQIKDMVDEMPG-------DRVGLVVFAG 128 Query: 226 KIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + QT PL V+ ++ ++ + + ++ + A + D Sbjct: 129 ETRQTLPLTRHVEDFKQTLDSVGIHSVRRGGSRLGDAIRVASDAFLDKTTDH-------- 180 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQ 325 K ++ LTDGE+ + + A + +G ++ IG+ Sbjct: 181 ---KAMVILTDGEDQE-----SDPVSEAKRAHEEQGIRIFTIGLG 217 >gi|39933553|ref|NP_945829.1| hypothetical protein RPA0476 [Rhodopseudomonas palustris CGA009] gi|39647399|emb|CAE25920.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 443 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 58/452 (12%), Positives = 135/452 (29%), Gaps = 114/452 (25%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK--- 62 I F + +G+I+++ A++L + +G ++ S +++KL D + + ++ Sbjct: 7 IARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVSRTSP 66 Query: 63 ---ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 +G+ D + RI L S+S ++ Sbjct: 67 AYVAAGAMSGDGVISSGADDALRIFNGNLNGLTGYTLA-------------SSSATVTKA 113 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 ++ ++ + + + + + + ++ +D ++LD S Sbjct: 114 SDVVTSQVTFSAQ-------ISTMFMKVVGMSAMAVGGTSTATASMPKYIDFYLLLDNSP 166 Query: 180 SM---------------------NDHFGPGM-------------------DKLGVATRSI 199 SM +DH ++ V + Sbjct: 167 SMGVGATPTDVSAMIAATANKSSDDHCAFACHDVNNKNNYYNLAKALGITTRIDVLRSAT 226 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQT--------FPLAWGVQHIQEKINRLIFGS 251 ++++D + +N R + F + + I+ + Sbjct: 227 QQLMDTATATATYSNQFRMAIYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVKG 286 Query: 252 TTKSTPGLEYAYNKIFDA--KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-- 307 + + +Y IF A E + +KY+ F++DG N + Sbjct: 287 QNDNNDQ-DTSYTAIFPAINNEISSPGSGVSGSPQKYLFFVSDGVADEYNPSCLKPKTGN 345 Query: 308 ---------YCNEAKRRGAIV---YAIGVQAEAAD-----------------------QF 332 C K RG + Y + + D Q Sbjct: 346 RCQSPINPALCKTLKDRGIKIAVLYTTYLNLPSNDWYKKWIAPFNAGPYGPSPNSEIAQN 405 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ CASP ++ V ++ + +A + K+ V Sbjct: 406 MEACASPGFYFEVSPTQGIAEAMNALFKKAVA 437 >gi|260578579|ref|ZP_05846489.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|258603294|gb|EEW16561.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 646 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 31/242 (12%), Positives = 74/242 (30%), Gaps = 43/242 (17%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVAT 196 + A + + + + M++LD S SM G ++ A Sbjct: 7 LLFAIALIAALVTIVVPPAQEARAEDEEGKNIPPTMLILDASGSMKTPDAGGQTRMAAAK 66 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKI------------VQTFPL---AWGVQHIQ 241 + + + S + G + + +++ L V I Sbjct: 67 DAAQLFSVAVPSDAE------LGFMVYGTEVGNSPEERDAGCKDVKTLLPVEKGNVTKIP 120 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++ ++ T P L A ++ E+ I+ ++DGE++ Sbjct: 121 AEVGKVEASGHTPMGPALRQAAEELPKDGERS-------------IVLVSDGEDTCA--- 164 Query: 302 NKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFL 356 + K+ G + +G ++ + C A + +++ L D+ Sbjct: 165 PPPVCEVAKDLKKEGIDLTINTVGFLVDSKARKELECIAEAGGGEYMDAKDTVSLADSMK 224 Query: 357 RI 358 R+ Sbjct: 225 RL 226 >gi|75076662|sp|Q4R7B7|ANTRL_MACFA RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|67969305|dbj|BAE01005.1| unnamed protein product [Macaca fascicularis] Length = 557 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 77/199 (38%), Gaps = 29/199 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ +LD S S+N+++ +D ++ ++ +R +T+S+ Sbjct: 70 QGSFDLYFILDKSGSVNNNW---IDLYMWVEETVARF---------QSSDIRMCFITYST 117 Query: 226 KIVQTFPLAWGVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL I+ ++ +++ T G A +I E G+ Sbjct: 118 DGQTVLPLTSDKNRIKNGLDQLRKIVPDGHTFMQAGFRKAIQQI-------ETFNSGNKV 170 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II +TDGE + +++L +A++ GA VY + V DQ SP+ Sbjct: 171 PS-MIIAMTDGELVAHAF--QDTLREAQKARKLGANVYTVDVADYKLDQITAIADSPEHV 227 Query: 343 YSVQN----SRKLHDAFLR 357 ++V+N R DA Sbjct: 228 FAVENGFKAMRDTVDALTS 246 >gi|120407060|ref|NP_766396.2| anthrax toxin receptor-like precursor [Mus musculus] Length = 641 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 40/188 (21%), Positives = 71/188 (37%), Gaps = 18/188 (9%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + D+ +VLD S S+ D++ + P++ R Sbjct: 66 QSGDDCQGIFDLYLVLDKSGSVADNWIHIYSFAEGLVKKFTN--------PNL----RIS 113 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++T+S++ PL ++IN+ + GL + + A E++ G Sbjct: 114 IITYSTEAEVILPLTSD----SKEINKSLLVLKNIVPQGLTHMQKGLRKANEQIRKSTLG 169 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 II LTDG +++ +A+R GAIVY +GV + Q + P Sbjct: 170 GRIVNSVIIALTDGLLLLKPY--LDTMEEAKKARRMGAIVYTVGVFMYSKQQLVNIAGDP 227 Query: 340 DRFYSVQN 347 DR + V Sbjct: 228 DRCFGVDE 235 >gi|327270786|ref|XP_003220169.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 975 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 55/276 (19%), Positives = 97/276 (35%), Gaps = 62/276 (22%) Query: 121 QHKDYNLSAVSRYEMPFIFCTF--PWCANSSHAPLLITSSVKISSKSDI-------GLDM 171 ++N+ A M C F W + + + + + + Sbjct: 248 NKSNHNIKAT---NMQNKQCNFRSTWEVIMNSSDYASSFPINSPPSAPAISLLQTHDRVV 304 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +VLDVS SM D++ ++ L I + G+VTF+S + Sbjct: 305 CLVLDVSGSM-----TTYDRIARLKQAAELFLLQI-----IETGSWVGIVTFNSYATRQI 354 Query: 232 PL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + ++ + G T G+ + ++K + KG + Sbjct: 355 GLRQITSDSVRESLKNYL-PTSAGGGTIICSGVRQGFQV---KQKKYQTSTKGCE----- 405 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF---------LKNC 336 I+ LTDGE++S + C E + G+I++ I + AA + LK Sbjct: 406 IVLLTDGEDNSVSS--------CFAEVQSSGSIIHTIALGPNAAKELEMLADMTGGLKFS 457 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKE---MVKQRILY 369 A+ +S L DAF RI E + +Q I Sbjct: 458 ATDS-----LDSNGLIDAFSRISSESGDISQQSIQL 488 >gi|242042273|ref|XP_002468531.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] gi|241922385|gb|EER95529.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] Length = 650 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 42/259 (16%), Positives = 77/259 (29%), Gaps = 44/259 (16%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL--ITSSVKISSKSDIGLDMMMVLDVS 178 ++ + Y + A H + LD++ VLDVS Sbjct: 134 SGGTLAVTTHTEYSAVARDSSRDNFAVLVHIKAPGMTDAEAAAGDAPRAPLDLVTVLDVS 193 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----A 234 SM KL + +++ ++D + R +V+FS + + L Sbjct: 194 GSMIGA------KLALLKQAMGFVIDNLGPHD------RLSVVSFSDRARRVTRLLRMSG 241 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 G + + L+ T GL A + E+ H +I L+DG+ Sbjct: 242 DGKAAAKSAVESLVARGGTNIAEGLRTAARVL----EERRHRNTVSS-----VILLSDGQ 292 Query: 295 NSSPNIDNKES---------------LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 ++ F + A V+ G + + A Sbjct: 293 DTYTAPRWSRGPGAGATPNYEALVPPSFMATSTRDWSAPVHTFGFGNDHDAAAMHVIAES 352 Query: 338 SPDRFYSVQNSRKLHDAFL 356 + F + N + DAF Sbjct: 353 TAGTFSYIGNEAVIQDAFA 371 >gi|145595544|ref|YP_001159841.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145304881|gb|ABP55463.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 316 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 67/232 (28%), Gaps = 39/232 (16%) Query: 153 LLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 +L T+ + S + L+ +M+ +DVSLSM + + + Sbjct: 68 VLATALARPSVDTRQPLERATVMLAVDVSLSMQADDVAP----NRLAAAQEAAQQFVAEL 123 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 PD N GLV+F+ P Q + IN L +T + + I Sbjct: 124 PDSYN---LGLVSFAKAANVLVPPTKDRQAVLAAINGLALAESTATGEAVFTCLEAIRSV 180 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 I+ L+DG +S + A+ V I + Sbjct: 181 PADGAAGIPPAR-----IVLLSDGYRTSG----RSVEQAAAAAQAANVAVSTIAFGTDGG 231 Query: 330 D------------------QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + + FY + +L + +G + Sbjct: 232 QVDIGGQRQRVPVDRLALADLAET--TDGYFYEAASVSELKQVYQDMGSSIG 281 >gi|301767380|ref|XP_002919113.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VI) chain-like [Ailuropoda melanoleuca] Length = 1059 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 82/240 (34%), Gaps = 29/240 (12%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDV 212 ++ + + + D +D+ VLD S S+ P + + +D ++ Sbjct: 22 VSGNARAVAFQDCPVDLFFVLDTSESVALRLKPYGALVDKVKAFTKRFIDNLRDRYYRCD 81 Query: 213 NNVV-RSGLVTFSSKIVQTFPLA---WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIF 267 N+V +G + +S ++ PL ++ ++ + FG T + ++ + Sbjct: 82 RNLVWNAGALHYSDEVEIISPLRPMPSDRDALKASVDAVKYFGKGTYTDCAIKKGLEGLX 141 Query: 268 D------------------AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 H KY+I +TDG + L Sbjct: 142 XXXXXXXXXXXXASAGGRRPARSRGCQGGSHLKENKYLIVVTDGHPLEGYKEPCGGLEDA 201 Query: 310 -NEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF---YSVQNSRKLHDAFLRIGKEMVKQ 365 NEAK G V+++ + + + L A+ + ++ + + DA I + + Sbjct: 202 VNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGQTRDAEEIITQTIDTL 261 Score = 44.0 bits (102), Expect = 0.032, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 53/162 (32%), Gaps = 20/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ K A R L ++ P + VR +V +S Q Sbjct: 859 DITILLDGSASVGSSHNFDTTK-RFAKRLAERFLTAGRTDPSHD--VRVAVVQYSGPGQQ 915 Query: 230 TFP---LAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + ++ + F T L Y + + Sbjct: 916 RPERGSLQFLQNYTVLAGTVDGMDFFNDATDVNDALSYV---------TRFYREASSREA 966 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG + E EA+R ++ + V Sbjct: 967 KKRLLLFSDGNSQGATAAAIE--KAVQEAQRADIEIFVVVVG 1006 >gi|146325834|sp|Q60847|COCA1_MOUSE RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor Length = 3120 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 49/266 (18%), Positives = 99/266 (37%), Gaps = 39/266 (14%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+L++ +Y +S + M + + P P+ + Sbjct: 385 QTTTLNVRDLTADTEYQISV---FAMKGLTSSEPTSVMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + + + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLAKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +++ I + G +T + + Y KIF + + K Sbjct: 491 EFTLK-EFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIFVPNKGSR------SNVPK 543 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRF 342 +I +TDG++S D + + ++A+GV+ +A L+ ASP Sbjct: 544 VMILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHV 596 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRIL 368 ++V++ DAF RI E+ + L Sbjct: 597 FTVED----FDAFQRISFELTQSICL 618 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 93/272 (34%), Gaps = 44/272 (16%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS-- 165 + ST+ +++ D + + ++ Y + K K Sbjct: 72 LAASTTETLLSDLIPETQYVVTITSYN-----EVEESVPVIGQLTIQTGGPTKPGEKKPG 126 Query: 166 ---------DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 D++ ++D S S+ + + VA ++ + Sbjct: 127 KTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFIVA---------LVSAFDIGEEKT 177 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY-NKIFDAKEK 272 R G+V +SS F L + + + + ++ + G T + ++Y N ++ Sbjct: 178 RVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLVKNTFTESAGS 237 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K I +TDG++ E + G V+++G++A A + Sbjct: 238 RAG-------FPKVAIIITDGKSQDEVEIPAR------ELRNIGVEVFSLGIKAADAKEL 284 Query: 333 LKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 + ++P + ++V N + D I ++ Sbjct: 285 KQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 85/245 (34%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ +P + ++++ P L + ++ D+++++D Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMDCLTRAEA---DIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESLSK 1366 Query: 354 AFLRI 358 + Sbjct: 1367 IVDDL 1371 >gi|111074529|ref|NP_031756.2| collagen alpha-1(XII) chain [Mus musculus] Length = 3061 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 49/266 (18%), Positives = 99/266 (37%), Gaps = 39/266 (14%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+L++ +Y +S + M + + P P+ + Sbjct: 385 QTTTLNVRDLTADTEYQISV---FAMKGLTSSEPTSVMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + + + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLAKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +++ I + G +T + + Y KIF + + K Sbjct: 491 EFTLK-EFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIFVPNKGSR------SNVPK 543 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRF 342 +I +TDG++S D + + ++A+GV+ +A L+ ASP Sbjct: 544 VMILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHV 596 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRIL 368 ++V++ DAF RI E+ + L Sbjct: 597 FTVED----FDAFQRISFELTQSICL 618 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 93/272 (34%), Gaps = 44/272 (16%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS-- 165 + ST+ +++ D + + ++ Y + K K Sbjct: 72 LAASTTETLLSDLIPETQYVVTITSYN-----EVEESVPVIGQLTIQTGGPTKPGEKKPG 126 Query: 166 ---------DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 D++ ++D S S+ + + VA ++ + Sbjct: 127 KTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFIVA---------LVSAFDIGEEKT 177 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY-NKIFDAKEK 272 R G+V +SS F L + + + + ++ + G T + ++Y N ++ Sbjct: 178 RVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLVKNTFTESAGS 237 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K I +TDG++ E + G V+++G++A A + Sbjct: 238 RAG-------FPKVAIIITDGKSQDEVEIPAR------ELRNIGVEVFSLGIKAADAKEL 284 Query: 333 LKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 + ++P + ++V N + D I ++ Sbjct: 285 KQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 85/245 (34%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ +P + ++++ P L + ++ D+++++D Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMDCLTRAEA---DIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESLSK 1366 Query: 354 AFLRI 358 + Sbjct: 1367 IVDDL 1371 >gi|146307722|ref|YP_001188187.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145575923|gb|ABP85455.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 566 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 87/217 (40%), Gaps = 26/217 (11%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 PW + + I +S S + +++ ++DVS SM+ G L + + Sbjct: 168 LAVTPWNPQTRLLRIAIKAS-DRSVEELPPANLVFLVDVSGSMHRREG-----LPMVQGT 221 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 ++ ++D ++ R LVT++ ++V I+ I++L G +T Sbjct: 222 LKLLVDQLRPQD------RVSLVTYAGATQVVLDSTPGSDKAKIRAAIDQLTAGGSTAGE 275 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G++ AY + +H+ +G + I+ TDG+ + D + ++ G Sbjct: 276 SGIQLAYQ------QASKHLIEGGINR---ILLATDGDFNVGISDFDSLKQLAADKRKTG 326 Query: 317 AIVYAIGVQAEA-ADQFLKNC--ASPDRFYSVQNSRK 350 + +G + ++ ++ A + + N R+ Sbjct: 327 VSLTTLGFGVDNYNERLMEQLADAGNGNYAYIDNLRE 363 >gi|327261831|ref|XP_003215731.1| PREDICTED: collagen alpha-1(XII) chain-like [Anolis carolinensis] Length = 3118 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 50/265 (18%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+ ++ +Y ++ Y M + + P P+ + Sbjct: 386 QTTAFNVKDLSADTEYQINV---YAMKGLTPSEPITIMEKTQPVKVQVECSKGVDVKA-- 440 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D S S G+ + ++ + P+ V+ LV +S Sbjct: 441 DIVLLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPEK---VQISLVQYSRDPHT 491 Query: 230 TFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + I + IN + G +T + + Y KIF A+ + + Sbjct: 492 EFTLNRYNRIDDIIQAINTFPYRGGSTNTGKAMTYVREKIF----VTGRGARPNVP--RV 545 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ A+P Y Sbjct: 546 MILITDGKSSDAFKDP------AIKLRNSDVEIFAVGVK-DAVRTELEAIATPPAETHVY 598 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 599 TVED----FDAFQRISFELTQSVCL 619 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 44/239 (18%), Positives = 91/239 (38%), Gaps = 31/239 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ +E MP I ++++ P+L T + K+ D+++++D Sbjct: 1150 GTSYKVNVFGVFEGGESMPLIGEEMTTLSDATVVPILSTG---LECKTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S S+ I ++++ PD V+ GL +S + L Sbjct: 1207 SWSIGRP------NFKTVRSFIARIVEVFDIGPDK---VQIGLAQYSGDPRTEWQLNSHK 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 Q + + + L + + G+ A N I K E + +K + +TDG++ Sbjct: 1258 TKQSLMDAVANLPYKGG-NTLTGM--ALNFILRNNFKPEAGMRPGA--RKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLH 352 + + L K G +YA+G++ ++ + + PD Y+V + L Sbjct: 1313 QDDIVAPSQRL------KDLGVELYAVGIKNADENELKQIASDPDETHAYNVGDFTLLV 1365 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 59/338 (17%), Positives = 113/338 (33%), Gaps = 36/338 (10%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 +K +L + + L TA + + E N F + + T R G+ Sbjct: 1 MKIRL-SAIAVAALCTALLLCSIEAEVNPPSDLK-FKIIDERTVQMTWTRPSDAIEGYRI 58 Query: 104 DIN----------NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL 153 + + S++ +++ D L +S Y+ + L Sbjct: 59 TVTPTTDGPSREFTLAPSSTETLLRDLTADIEYLVTISSYDNIEESISVSGQLTIQTGGL 118 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + T KI + V D+ ++ + G + I ++ D Sbjct: 119 ITTGEKKIEEIQLQRCSLSAVTDLVFLVDGSWSVGRNNFKYILDFIVALVSAFDIGEDK- 177 Query: 214 NVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 R G+V +SS F L + + + E I + + G T + +EY + Sbjct: 178 --TRVGIVQYSSDTRTEFNLNQYYRQRDLIEAIKNIPYKGGNTMTGEAIEYL------MR 229 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 A D+ K I +TDG++ E + G V+++G++A A Sbjct: 230 NTFVESAGSRKDFPKVAIIITDGKSQDEVEIPAR------ELRAAGVEVFSLGIKAADAK 283 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + P ++V N + D I E+V Q Sbjct: 284 ELKLIASQPSLTHVFNVANFDGIVD----IQNEIVSQV 317 >gi|291396486|ref|XP_002714579.1| PREDICTED: collagen, type XII, alpha 1 [Oryctolagus cuniculus] Length = 3117 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 55/265 (20%), Positives = 101/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LSI +Y +S + M + + P P+ + Sbjct: 385 QTTTLSIRDLSADTEYQISVSA---MKGLTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + + + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLAKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 75/199 (37%), Gaps = 28/199 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY-NKIFDAKEKLEHIAKGHDDYKK 285 F L + + I R+ + G T + ++Y N ++ + K Sbjct: 191 EFNLNQYYQRDELLAAIKRIPYKGGNTMTGDAIDYLVKNTFLESAGARVG-------FPK 243 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFY 343 I +TDG++ E + G V+++G++A A + + ++P + + Sbjct: 244 VAIIITDGKSQDEVEIPAR------ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVF 297 Query: 344 SVQNSRKLHDAFLRIGKEM 362 +V N + D I ++ Sbjct: 298 NVANFDAIVDIQNEIISQV 316 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 84/245 (34%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFDSLSK 1366 Query: 354 AFLRI 358 + Sbjct: 1367 IVDDL 1371 >gi|257785062|ref|YP_003180279.1| Cna B domain-containing protein [Atopobium parvulum DSM 20469] gi|257473569|gb|ACV51688.1| Cna B domain protein [Atopobium parvulum DSM 20469] Length = 863 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 46/250 (18%), Positives = 81/250 (32%), Gaps = 34/250 (13%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P + L I +S+ ++++V D S SM G G +L A R + ++ Sbjct: 62 PNTDGTYDLTLTIKGETSAASEEQKA-NVLVVFDNSSSMTAQTGGGEMRLDAAKRVVNQL 120 Query: 203 LDIIKSIPD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 I I +VV L++F+ K + Q N + F + T LE Sbjct: 121 SSTILGINRNAQKDVVEMALLSFNEKPNLECGWTADLNEFQRATNNMGFHTGTNWESALE 180 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE--------SLFYCNEA 312 A + D K + Y+IF+TDG + +L Sbjct: 181 RA-KVLADQKAANGNPT--------YVIFVTDGLPTQDRNGWVRNNQIGYEHALDEARAI 231 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCA------------SPDRFYSVQNSRKLHDAFLRIGK 360 G Y++ + +L+ +Y N+ ++ AF I Sbjct: 232 GSAGYHFYSVYMY--GGHAYLRQLTNYAYTGNPFGNPGGTYYYEANNTAQMEQAFKEIAS 289 Query: 361 EMVKQRILYN 370 + K + Sbjct: 290 VITKSITYKD 299 >gi|297678514|ref|XP_002817115.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 1 [Pongo abelii] Length = 3115 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F A + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES------AGARVGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 245 AIIITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 67/191 (35%), Gaps = 24/191 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ P V+ L +S Sbjct: 1191 TRAEADIVLLVDGSWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQVALAQYS 1241 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I + + + Sbjct: 1242 GDPRTEWQLNAHRDKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--R 1296 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ + PD Sbjct: 1297 ARKIGVLITDGKSQDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDT 1350 Query: 341 RFYSVQNSRKL 351 Y+V + L Sbjct: 1351 HAYNVADFESL 1361 >gi|297291177|ref|XP_001109727.2| PREDICTED: collagen alpha-1(XII) chain-like [Macaca mulatta] Length = 3095 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLIFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F A + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES------AGARVGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 245 AIIITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 >gi|262195558|ref|YP_003266767.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078905|gb|ACY14874.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 775 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 81/206 (39%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV- 228 D+ +VLD S SM+ G + + A +++ L + R ++ F + Sbjct: 266 DVTLVLDRSGSMS---GAPLARAKDAAKAVVARL---------GDGDRVNVMAFDDGVDA 313 Query: 229 ---QTFPLAWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P++ + E I+RL G T L A + ++ + + ++ H Sbjct: 314 LFLRPVPISAERRSQAVEYIDRLSDGGGTDLAGALAEALDAQHPSESEADTGSRPH---- 369 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKNCASP--DR 341 I+FLTDG++ D++ +L G A V+ IGV L AS R Sbjct: 370 -VILFLTDGQS-----DSQATLQVARG--DAGDARVFTIGVGDGVEKPLLARLASEKRGR 421 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRI 367 F + + ++ R+ E+ + Sbjct: 422 FTFIASPSEIERKVSRLYSEIAAPVL 447 >gi|149637338|ref|XP_001510478.1| PREDICTED: similar to ITI-like protein [Ornithorhynchus anatinus] Length = 1374 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 73/203 (35%), Gaps = 30/203 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +L+ D+++ +VTFS + Sbjct: 320 VVFVIDVSGSMFG------TKMKQTKKAMHVILN------DLHHDDYFNIVTFSDAVSVW 367 Query: 228 -----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +Q P ++ + +N++ T L A + + + Sbjct: 368 KASGSIQATPP--NIKSAKVYVNKMEADGWTDINAALLVAASVFNQSTGETGRGKGLKKI 425 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 IIFLTDGE ++ L ++ + ++ + +A ++ + +R Sbjct: 426 P--LIIFLTDGEATAGVTVASRILSNAKQSLKGNISLFGLAFGDDADYHLMRRLSLENRG 483 Query: 342 -----FYSVQNSRKLHDAFLRIG 359 + + +L + I Sbjct: 484 VARRIYEDADATLQLKGFYDEIA 506 >gi|119569135|gb|EAW48750.1| collagen, type XII, alpha 1, isoform CRA_c [Homo sapiens] Length = 3063 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F A + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES------AGARVGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 245 AIIITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 83/238 (34%), Gaps = 31/238 (13%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I + + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKL 351 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESL 1364 >gi|114608138|ref|XP_518589.2| PREDICTED: collagen, type XII, alpha 1 isoform 3 [Pan troglodytes] Length = 3119 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F A + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES------AGARVGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 245 AIIITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 83/238 (34%), Gaps = 31/238 (13%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I + + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKL 351 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESL 1364 >gi|114608140|ref|XP_001142912.1| PREDICTED: collagen alpha-1(XII) chain isoform 2 [Pan troglodytes] Length = 3063 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F A + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES------AGARVGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 245 AIIITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 83/238 (34%), Gaps = 31/238 (13%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I + + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKL 351 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESL 1364 >gi|88601902|ref|YP_502080.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] gi|88187364|gb|ABD40361.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] Length = 316 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 78/210 (37%), Gaps = 41/210 (19%) Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 VS SM +L + RS +L + +G++TF S L+ Sbjct: 96 VSGSMQATDYQP-TRLESSKRSAEILLKSLDPKD------YAGIITFESGATSAAYLSPD 148 Query: 237 VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + K+ + T GL + + KK +I L+DG N Sbjct: 149 KDRVIRKLQAIEPKEGATAIGDGLALGIDMAESMP-----------NRKKVVILLSDGVN 197 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----------------ADQFLKNCAS 338 ++ I +++ A+ +G V+ IG+ +++ + L+ A+ Sbjct: 198 NAGVIHPEQA---AGFAREKGIQVFTIGMGSDSPVVLGYDWFGNPQYATLDEAMLQQIAA 254 Query: 339 P--DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 +++ + R L + + + KE+V+++ Sbjct: 255 STNGQYFKSVDDRTLSEIYSNLNKEIVREK 284 >gi|93141047|ref|NP_004361.3| collagen alpha-1(XII) chain long isoform precursor [Homo sapiens] gi|146345397|sp|Q99715|COCA1_HUMAN RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor gi|55662663|emb|CAH71310.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203512|emb|CAI19898.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203521|emb|CAI19908.1| collagen, type XII, alpha 1 [Homo sapiens] Length = 3063 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F A + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES------AGARVGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 245 AIIITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 83/238 (34%), Gaps = 31/238 (13%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I + + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKL 351 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESL 1364 >gi|1846005|gb|AAC51244.1| collagen type XII alpha-1 [Homo sapiens] Length = 3063 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F A + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES------AGARVGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 245 AIIITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 83/238 (34%), Gaps = 31/238 (13%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I + + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKL 351 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHDYNVADFESL 1364 >gi|163848202|ref|YP_001636246.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526110|ref|YP_002570581.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669491|gb|ABY35857.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449989|gb|ACM54255.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 415 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 74/227 (32%), Gaps = 28/227 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 P LL+ + + + L++ VLD S SM G +D L ATR + E Sbjct: 16 IPSSTTPQVGYLLVEAVAPATLAPSLPLNLCFVLDRSGSMQ---GAKLDNLKAATRRVIE 72 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGL 259 L +V F + P + ++ + T + G+ Sbjct: 73 QLQPTDIA---------AIVIFDDTVQTLIPATPVGDRSALLAAVDSISEAGGTAMSLGM 123 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + ++ G D + ++ LTDG+ D + G + Sbjct: 124 QAGQVEL--------QKHLGSDRLSR-MLLLTDGQTWG---DEPLCRDLARSLGQAGVQI 171 Query: 320 YAIGVQAEAADQFLKNCASPDRFYS--VQNSRKLHDAFLRIGKEMVK 364 A G+ E +Q L + A+ YS + ++ F + E Sbjct: 172 VAFGLGTEWNEQLLDDIAAASNGYSDYIAAPEQIGTFFQQAVHEAQA 218 >gi|157827515|ref|YP_001496579.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] gi|157802819|gb|ABV79542.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] Length = 446 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 49/294 (16%), Positives = 97/294 (32%), Gaps = 38/294 (12%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-------S 115 ++N+E+ QK + + NE F + I +S + Sbjct: 161 LVNEEDIT--PFQKAIYHPTDFSQLITQISSNEENSLNFIMNNGAIAQSVQVYTADGKAP 218 Query: 116 IIIDDQHKDYNL--SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 II D + + + +Y +P + + +++ ++ + Sbjct: 219 IIASDLKDGFIIDKQYLLKYLLPIFNGFIWNEEGKFPIMFAPKNPKVLDGENNYAHNISL 278 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK--IVQTF 231 ++D+S SM F V +I ++LD + IP+ + +V F+ + Sbjct: 279 LIDISGSMEKDFS-------VYKNNILKILDKLAEIPNW----QINIVVFNDESTARSFS 327 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 ++ I+ IN L TK ++ A KG D +I T Sbjct: 328 NQENNIEDIKVYINNLKANGYTKLYGTIKEALESF-----------KGKIDESSTLIVFT 376 Query: 292 DGENSSPNIDNKE---SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 DG++ N + E K +Y +G +F + A+ F Sbjct: 377 DGKDEGTNSNVTEKDVVDVTSEVIKNPQFNMYTVGFGQYYNQEFFEQVATRGGF 430 >gi|308068881|ref|YP_003870486.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858160|gb|ADM69948.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 600 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 43/237 (18%), Positives = 83/237 (35%), Gaps = 35/237 (14%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + SS S +D ++V+DVS SMN + A + +ML Sbjct: 19 IMTSILAWQPQMANASSPSASKVDAVLVVDVSNSMNTSDPGKIG--NEAMKMFIDMLST- 75 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEY 261 + G+V ++ + + L Q ++ I+ L G+ T ++ G++ Sbjct: 76 -------QNDKVGIVAYTDVVQREKALLNITSEADKQELKTFIDGLNRGAYTDTSVGVKE 128 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-------ESLFYCNEAKR 314 A + + H I+ L DG N + + EAK Sbjct: 129 ALRIL-QDGKTAGHAP--------MIVMLADGNNDFNKTTGRTESQSDQDMAQAVAEAKN 179 Query: 315 RGAIVYAIGVQAEA--ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 G +Y IG+ A+ L + A + + + ++ L + I +K ++ Sbjct: 180 SGVPIYTIGLNADGKLNKNKLADIAQQTGGKSFITSSADDLPNILSEIFASNLKLKV 236 >gi|258623679|ref|ZP_05718665.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258584045|gb|EEW08808.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 371 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 73/218 (33%), Gaps = 36/218 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDH-----FGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ ++ G D++MV+D+S SM + G + +L A R +R + Sbjct: 123 EVQTREAFGRDVLMVVDLSGSMEEKDFSTAAGEQLSRLTAAKRVLRNFVTQ-------RQ 175 Query: 215 VVRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R GL+ F P ++ ++ + G +T + + Sbjct: 176 GDRFGLILFGDAAFIQTPFTADQDVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQS-- 233 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD- 330 + ++ LTDG ++ + ++ R +Y I + Sbjct: 234 -------PSTSQDQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIR---IYVIAMGDPENVG 283 Query: 331 ------QFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 + +S R + + +L++A+ I + Sbjct: 284 EQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVIDQ 321 >gi|182438638|ref|YP_001826357.1| hypothetical protein SGR_4845 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178467154|dbj|BAG21674.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 578 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 76/211 (36%), Gaps = 26/211 (12%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + + S+ + V+D+S SM + +L + +S+ + D ++ V Sbjct: 201 VGLATEAAPSTAERPPAALTFVVDISGSMAE-----TGRLDLVRKSLAVLTDELRDDDSV 255 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + LVTFS P+ I++ ++ + +T G+ Y + + Sbjct: 256 S------LVTFSDAAETRLPMTRLQGNRNRIKDAVDEMRPEQSTNVEAGITRGYEESVEG 309 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEA 328 K ++ L+D ++ + + L + + G ++ +GV ++ Sbjct: 310 HRKGATNR---------VVLLSDALANTGDTEADGILERIDSTRREYGITLFGVGVGSDY 360 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 D F++ V + + F+ Sbjct: 361 GDAFMERLTNKGDGNTTYVGDEAQARKVFVD 391 >gi|271966806|ref|YP_003341002.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] gi|270509981|gb|ACZ88259.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] Length = 315 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 73/213 (34%), Gaps = 32/213 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LD+SLSM +++ A + ++ ++ D+ G+V F+ Sbjct: 88 IIIALDISLSMEAADVQP-NRITAAKEAAQKFVE------DLPERFNVGVVAFARSASVV 140 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 Q + + L + T + +N + + + D I+ L Sbjct: 141 VSPTTDHQAVSASLGNLTTRAGTAIGEAV---FNSLDAVRSFDQQAVT--DPPPAAIVLL 195 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG--------------VQAEAADQFLKNC 336 +DG+N+S + + A + I V L+ Sbjct: 196 SDGDNTSG----RSVAEAVDAAMSARVPISTIAYGTQEGTVSIDGRDVNVPVNKATLQTL 251 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + R Y ++ +L + + +IG + + + Sbjct: 252 SEGTSGRAYEAESGSQLREVYEQIGTSLGYRTV 284 >gi|148694466|gb|EDL26413.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] gi|148694467|gb|EDL26414.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] Length = 2886 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 49/266 (18%), Positives = 99/266 (37%), Gaps = 39/266 (14%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+L++ +Y +S + M + + P P+ + Sbjct: 385 QTTTLNVRDLTADTEYQISV---FAMKGLTSSEPTSVMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + + + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLAKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +++ I + G +T + + Y KIF + + K Sbjct: 491 EFTLK-EFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIFVPNKGSR------SNVPK 543 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRF 342 +I +TDG++S D + + ++A+GV+ +A L+ ASP Sbjct: 544 VMILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHV 596 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRIL 368 ++V++ DAF RI E+ + L Sbjct: 597 FTVED----FDAFQRISFELTQSICL 618 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 93/272 (34%), Gaps = 44/272 (16%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS-- 165 + ST+ +++ D + + ++ Y + K K Sbjct: 72 LAASTTETLLSDLIPETQYVVTITSYN-----EVEESVPVIGQLTIQTGGPTKPGEKKPG 126 Query: 166 ---------DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 D++ ++D S S+ + + VA ++ + Sbjct: 127 KTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFIVA---------LVSAFDIGEEKT 177 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAY-NKIFDAKEK 272 R G+V +SS F L + + + + ++ + G T + ++Y N ++ Sbjct: 178 RVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLVKNTFTESAGS 237 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K I +TDG++ E + G V+++G++A A + Sbjct: 238 RAG-------FPKVAIIITDGKSQDEVEIPAR------ELRNIGVEVFSLGIKAADAKEL 284 Query: 333 LKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 + ++P + ++V N + D I ++ Sbjct: 285 KQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 85/245 (34%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ +P + ++++ P L + ++ D+++++D Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMDCLTRAEA---DIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESLSK 1366 Query: 354 AFLRI 358 + Sbjct: 1367 IVDDL 1371 >gi|68535223|ref|YP_249928.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] gi|68262822|emb|CAI36310.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] Length = 646 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 31/242 (12%), Positives = 74/242 (30%), Gaps = 43/242 (17%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVAT 196 + A + + + + M++LD S SM G ++ A Sbjct: 7 LLFAIALIAALVTIVVPPAQEARAEDEEGKNIPPTMLILDASGSMKTPDAGGQTRMAAAK 66 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKI------------VQTFPL---AWGVQHIQ 241 + + + S + G + + +++ L V I Sbjct: 67 DAAQLFSVAVPSDAE------LGFMVYGTEVGNSPEERDAGCKDVKTLLPVEKGNVTKIP 120 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++ ++ T P L A ++ E+ I+ ++DGE++ Sbjct: 121 AEVGKVEASGHTPMGPALRQAAEELPKDGERS-------------IVLVSDGEDTCA--- 164 Query: 302 NKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFL 356 + K+ G + +G ++ + C A + +++ L D+ Sbjct: 165 PPPVCEVAKDLKKEGIDLTINTVGFLVDSKARKELECIAEAGGGEYMDAKDTVSLADSMK 224 Query: 357 RI 358 R+ Sbjct: 225 RL 226 >gi|295399398|ref|ZP_06809380.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294978864|gb|EFG54460.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 929 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 40/177 (22%), Positives = 65/177 (36%), Gaps = 27/177 (15%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P + +T ++ + +D++ V DVS SM KL A +++ Sbjct: 54 PNADAQGRLDITLTPQGRVDNIIRPPIDVVFVFDVSGSM----VMPSLKLDSAKYALQSA 109 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIV--QTFPLAWGVQHIQEKIN-------RLIFGSTT 253 +D K+ + N+ R LV FS + + P G +++ +N L T Sbjct: 110 VDYFKANANPND--RFALVPFSDDVQYSKVVPFPSGTYDVKQHLNWIATVANSLRANGGT 167 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L+ A + D KKYIIFLTDG + + C Sbjct: 168 NYTQALQQAQSFFNDPAR------------KKYIIFLTDGMPTVSIAKEPITYKVCE 212 >gi|309266960|ref|XP_003086909.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] gi|309271570|ref|XP_003085348.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] Length = 1321 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 71/202 (35%), Gaps = 26/202 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ V+DVS SM KL +++ +L +++ N ++TFS + Sbjct: 284 VVFVIDVSGSMFG------TKLQQTKKAMDTILSDLQASDSFN------IITFSDTVNIW 331 Query: 229 ----QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + ++R+ T L A + + + ++ Sbjct: 332 KAEGSIQATVQNIHSAKNYVSRMEADGWTDINAALLAAASVLNHSNQEPGKGRGVGQIP- 390 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 I+FLTDGE ++ L +A ++++ +A L+ + ++ Sbjct: 391 -LIMFLTDGEPTAGETTPSVILSNIRQALAHRVSLFSLAFGDDADFSLLRRLSLENQGEA 449 Query: 342 ---FYSVQNSRKLHDAFLRIGK 360 + + +L + I + Sbjct: 450 RRIYEDADAALQLEGLYAEISR 471 >gi|24373750|ref|NP_717793.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella oneidensis MR-1] gi|24348130|gb|AAN55237.1|AE015661_7 inter-alpha-trypsin inhibitor domain protein [Shewanella oneidensis MR-1] Length = 760 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 44/237 (18%), Positives = 97/237 (40%), Gaps = 30/237 (12%) Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL-- 169 + ++ + + K + A S +N + L++ K+ + + L Sbjct: 319 SPVAWVFNQNGKTHQTQAASDDGSMVNTAASTSASNIDNYSLVMVLPPKVEASGQLNLPR 378 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++++V+D S SM D + A ++R L +K+ N ++ F+S + Sbjct: 379 ELILVIDTSGSMAG------DSIIQAKNALRYALRGLKAQDSFN------IIEFNSDVSL 426 Query: 230 TFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P+ A + ++ +NRL T+ + LE A K++ A ++ + Sbjct: 427 LSPVPLPATAENLAIARQFVNRLQADGGTEMSLALEAAL-----PKQRPSRAASENNVLQ 481 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + I F+TDG ++ N+E+LF + ++ +G+ + F++ A R Sbjct: 482 QVI-FMTDG-----SVGNEEALFELIRHQIGDNRLFTVGIGSAPNSHFMQRAAELGR 532 >gi|148675553|gb|EDL07500.1| mCG120277 [Mus musculus] Length = 1031 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 71/202 (35%), Gaps = 26/202 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ V+DVS SM KL +++ +L +++ N ++TFS + Sbjct: 250 VVFVIDVSGSMFG------TKLQQTKKAMDTILSDLQASDSFN------IITFSDTVNIW 297 Query: 229 ----QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + ++R+ T L A + + + ++ Sbjct: 298 KAEGSIQATVQNIHSAKNYVSRMEADGWTDINAALLAAASVLNHSNQEPGKGRGVGQIP- 356 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 I+FLTDGE ++ L +A ++++ +A L+ + ++ Sbjct: 357 -LIMFLTDGEPTAGETTPSVILSNIRQALAHRVSLFSLAFGDDADFSLLRRLSLENQGEA 415 Query: 342 ---FYSVQNSRKLHDAFLRIGK 360 + + +L + I + Sbjct: 416 RRIYEDADAALQLEGLYAEISR 437 >gi|218192066|gb|EEC74493.1| hypothetical protein OsI_09963 [Oryza sativa Indica Group] Length = 641 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/141 (25%), Positives = 55/141 (39%), Gaps = 25/141 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S S LD++ VLDVS SM+ G + L A + + L R +V Sbjct: 242 SVSSRAPLDLVTVLDVSGSMS---GIKLSLLKRAMSFVIQTLGPND---------RLSVV 289 Query: 222 TFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 FSS + FPL G Q + I+ L+ T L+ + D + K Sbjct: 290 AFSSTAQRLFPLRRMTLTGRQQALQAISSLVASGGTNIADALKKGAKVVKDRRRK----- 344 Query: 278 KGHDDYKKYIIFLTDGENSSP 298 + II L+DG+++ Sbjct: 345 ----NPVSSIILLSDGQDTHS 361 >gi|15893764|ref|NP_347113.1| heat shock protein DnaK [Clostridium acetobutylicum ATCC 824] gi|15023331|gb|AAK78453.1|AE007561_14 DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Clostridium acetobutylicum ATCC 824] gi|325507887|gb|ADZ19523.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Clostridium acetobutylicum EA 2018] Length = 698 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 69/196 (35%), Gaps = 30/196 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D+S SM L A + + +D I L+ F+ K+ Sbjct: 514 IVIAIDLSGSMRGK------PLEEAIEASKTFVDSIDEGS-----FSLALIGFADKVKTL 562 Query: 231 FPLAWGVQHIQEKINRL-IFGSTTKS-TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + I I+ L T + + AYN + DA +++ Sbjct: 563 INLTEDREEIFRAIDGLKKADVGTSTMSEPFSEAYNILKDAYGDC------------FVV 610 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 LTDG+ K+ + N+ K + AIG A FL A+ + Sbjct: 611 VLTDGQ----WYGKKDIMAEVNKCKEYEIEIAAIGFG-NAKKDFLDKIATCEENSIFTEV 665 Query: 349 RKLHDAFLRIGKEMVK 364 L +F RI K + + Sbjct: 666 SNLKQSFSRIAKVISR 681 >gi|171846231|ref|NP_034899.2| cartilage matrix protein precursor [Mus musculus] gi|28503027|gb|AAH47140.1| Matrilin 1, cartilage matrix protein [Mus musculus] gi|66350790|emb|CAC79633.1| cartilage matrix protein [Mus musculus] gi|123295072|emb|CAM17812.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] gi|187466486|emb|CAQ51562.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 500 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 44/209 (21%), Positives = 83/209 (39%), Gaps = 34/209 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S D++ ++D S S+ + + I +++D + + GLV +S Sbjct: 274 SGSATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSDRLAQ---VGLVQYS 324 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S I Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 325 SSIRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARP 378 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +K I TDG + D +AK G ++A+GV ++ + + P Sbjct: 379 GA--QKVGIVFTDGRSQDYINDAAR------KAKDLGFKMFAVGVGNAVEEELREIASEP 430 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 D ++ + + ++ IGK++ KQ Sbjct: 431 VADHYFYTADFKTINQ----IGKKLQKQI 455 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 73/196 (37%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R GLV ++S + Sbjct: 45 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGLVNYASTVKP 95 Query: 230 TFPL-AWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL A G + + + + R+ + T + L++A K E D K Sbjct: 96 EFPLRAHGSKASLLQAVRRIQPLSTGTMTGLALQFAITKALSDAE---GGRARSPDISKV 152 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-----DR 341 +I +TDG D E A+ G ++AIGV + + P D Sbjct: 153 VIVVTDGRPQDSVRDVSE------RARASGIELFAIGVGRVDKATLRQIASEPQDEHVDY 206 Query: 342 FYSVQNSRKLHDAFLR 357 S KL F Sbjct: 207 VESYNVIEKLAKKFQE 222 >gi|312110043|ref|YP_003988359.1| hypothetical protein GY4MC1_0934 [Geobacillus sp. Y4.1MC1] gi|311215144|gb|ADP73748.1| Ig domain protein group 2 domain protein [Geobacillus sp. Y4.1MC1] Length = 932 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 40/177 (22%), Positives = 64/177 (36%), Gaps = 27/177 (15%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P + +T ++ + +D++ V DVS SM KL A +++ Sbjct: 54 PNADAQGRLDITLTPQGRVDNIIRPPIDVVFVFDVSGSM----VMPSLKLDSAKYALQSA 109 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTF--PLAWGVQHIQEKIN-------RLIFGSTT 253 +D K+ + N+ R LV FS + P G +++ +N L T Sbjct: 110 VDYFKANANPND--RFALVPFSDGVQSDKVVPFPSGTYDVKQHLNWIATVANSLRANGGT 167 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L+ A + D KKYIIFLTDG + + C Sbjct: 168 NYTQALQQAQSFFNDPAR------------KKYIIFLTDGMPTVSIAKEPITYKVCE 212 >gi|238060187|ref|ZP_04604896.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881998|gb|EEP70826.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 779 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 80/216 (37%), Gaps = 28/216 (12%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P ++ + ++ S ++++ +D S SMN G G + VAT + + ++ S Sbjct: 573 PEVVDAILRGWSTLRKKANILLAVDTSGSMNARVG-GRTRFQVATTAADRAVGLLNSAD- 630 Query: 212 VNNVVRSGLVTFSSKIVQT--FPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYA 262 R L +FSS+ Q P + + ++ L G T + A Sbjct: 631 -----RVALWSFSSETDQRRGKPYSEEIRLGPYDRAAFTRRLTGLRVGGNTALYATVRAA 685 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYA 321 + ++ D + A ++ LTDG+N P + D L R V+ Sbjct: 686 HRRLLDNHDPDRINA---------VVVLTDGKNEYPRDNDLDRLLADIELDPNRPVKVFC 736 Query: 322 IGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAF 355 + E+ L AS + + + + +AF Sbjct: 737 VAFDRESDLAALDRIAGASAGKAFDATDPATIDEAF 772 >gi|226226934|ref|YP_002761040.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090125|dbj|BAH38570.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 565 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 64/170 (37%), Gaps = 21/170 (12%) Query: 165 SDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S G+DM + +D SLSM D ++++ R +R M + R L+ Sbjct: 95 SSRGIDMAIAIDASLSMLAQDERPSRLERVKQEVRRLRAM----------SPADRVALIA 144 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F+ + PL ++ ++ L G ++ L A ++ E + Sbjct: 145 FAGRSYILTPLTGDDGALELFLDNLDPGVVGQAGSSLSRAI------RQGSELLLASDGS 198 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ L+DGE+ D + + +EA +G V +G Sbjct: 199 ADRALVLLSDGESFDSAEDIESA---ASEAGSKGISVVTVGFGTRDGATI 245 >gi|296205952|ref|XP_002806985.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Callithrix jacchus] Length = 3176 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 76/228 (33%), Gaps = 25/228 (10%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P K+ D++ ++D S S+ + + + Sbjct: 7 LPLVAIF---CLFLSGFPTTHAQQQHADVKNGAAADIIFLVDSSWSIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYTGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T + + ++ D + I+ LTDG + L Sbjct: 115 TNQTG---KGLEYVMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKEGLALPSAGL------ 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRI 358 K V+AIGV+ + + P ++++N LHD + Sbjct: 166 KSADVNVFAIGVEGADEGALKEIASEPLNMHMFNLENFTSLHDIVENL 213 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 55/312 (17%), Positives = 106/312 (33%), Gaps = 32/312 (10%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRDADGIPKLLVLITGGKSLDE--ISQPAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ + Sbjct: 576 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHA 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ M RS ML + S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKANFFSM-------RSFVMML--VNSLDVGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPL-AWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S V F L + + I + + +L + G +Y E + H Sbjct: 684 SDTPVTEFSLNTYQTKSDILDHLRQLQLQGGSGLNTGSALSYVHANHFTEAGGSRIREHV 743 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + ++ LT G++ L N R G + + +G + + +P Sbjct: 744 P--QLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSL 795 Query: 342 FYSVQNSRKLHD 353 Y + + L D Sbjct: 796 VYLMDDFSSLPD 807 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 60/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPVGTQQIRVGVVQFSDEPRTMFSLDTYSSKAQVLSAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S D +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIGDGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHVATDDN 397 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 39/295 (13%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1533 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1592 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + D++ +LD S Sbjct: 1593 RLVFTVRDFRDLPNIEERIMNSFGPSAATPAPPGVDIPSPSLPEKKKA-DIVFLLD--GS 1649 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQTFPLAWGVQH 239 +N + L + +I+ ++ + + ++ GLV ++S + F + + Sbjct: 1650 INFRRDSFQEVLXFVS-------EIVDTVYEDGDSIQVGLVQYNSDPTDEFFXRDFSTKR 1702 Query: 240 -IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1703 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLEQRVPQIAFVVTGGKS--- 1757 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 +++ + G V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1758 ---VEDAQQVSLALTQSGVKVFAVGVRNIDSEEVGKTASNSATAFRVGNVQELSE 1809 >gi|284052945|ref|ZP_06383155.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569123|dbj|BAI91395.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 489 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 70/200 (35%), Gaps = 32/200 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR--SGLVTFSSKI 227 ++M++D S SM+ KL R+ E + N+ R +V FSS+ Sbjct: 53 AVVMLIDTSGSMSGS------KLPEVQRAASEFV-------SRQNLKRDDLAVVEFSSRA 99 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + +Q+ I RL T + G A + + ++ I Sbjct: 100 SVVADFTRDERELQQAIARLSAWGGTNLSEGFNLATSVLQNSDRPGN------------I 147 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + TDGE + + + + + + G + A+G + PD + N Sbjct: 148 LLFTDGEPN----NRRMAASIAQQIRASGINLVAVGTGDAPVNYLTALTGDPDLVFYA-N 202 Query: 348 SRKLHDAFLRIGKEMVKQRI 367 L AF K + Q++ Sbjct: 203 FGDLDSAFRGAEKAIYGQQL 222 >gi|218778178|ref|YP_002429496.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759562|gb|ACL02028.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 480 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 76/211 (36%), Gaps = 22/211 (10%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + + +DM++VLD S SM K+ A +++ +++ ++S Sbjct: 75 LTMTADEVLAPEQTKTKPVDMVIVLDRSGSMGGQ------KVRDAKAAVKGLVEGLRSQD 128 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 R LVT+S+ + L + + +N ++ S P K Sbjct: 129 ------RFSLVTYSNSVNGGDGLHYLTADKRNSLNWMV-----DSIPAGGGTNLGGGLEK 177 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAEA 328 A G D +I ++DG+ + D + L + G + V +G+ + Sbjct: 178 GVGVLRAYGAPDRMGKVILISDGQANQGVTDPNQ-LAAMAALRDDGLVYSVTTVGIGQDF 236 Query: 329 ADQFLKNCASP--DRFYSVQNSRKLHDAFLR 357 +Q + A R+Y ++N F Sbjct: 237 NEQLMATVADGGRGRYYYLENPGDFLAVFQE 267 >gi|296227162|ref|XP_002759255.1| PREDICTED: hypothetical protein LOC100397584 [Callithrix jacchus] Length = 1319 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 47/206 (22%), Positives = 77/206 (37%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDQPTT 88 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + ++ RL + G T + L Y + F G YK+ Sbjct: 89 AFELGLFGSREEVKAAARRLAYHGGNTNTGDALRYITARSFSP---HAGGRPGDRAYKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-EALKEELEEIASEPKSAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FNAIDKIRGKLRRRL 217 >gi|160899637|ref|YP_001565219.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160365221|gb|ABX36834.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 244 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 82/209 (39%), Gaps = 21/209 (10%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K ++ L ++++LDVS SM+ K+ ++R+MLD + + Sbjct: 8 PSKFTAPKAKPLPVVLLLDVSGSMSGE------KIRNVNDAVRDMLDTFSDTENGETEIH 61 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++TF S++ PLA L G T L+ A I D Sbjct: 62 VAIITFGSQVALHQPLA---SASDIHWQDLSAGGMTPLGTALQMAKAMIED------KDV 112 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFY-CNEAKRRGAIVYAIGVQAEAADQFLKNC 336 Y+ ++ ++DG PN ++ L ++ + A+ + A+A + L Sbjct: 113 IPSRAYRPTVVLVSDG---GPNDAWEKPLNAFISDGRSAKCDRLAMAIGADADEAVLGKF 169 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 + +R + +N+++L D F + + Sbjct: 170 IEGTSNRLFYAENAKQLRDFFKFVTMSVT 198 >gi|255526265|ref|ZP_05393182.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186259|ref|ZP_06854663.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510045|gb|EET86368.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049060|gb|EFG88490.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 580 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 47/221 (21%), Positives = 82/221 (37%), Gaps = 37/221 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + LD++ VLD S SM + + T +I+ LD+ + + GLV Sbjct: 29 SSNTSSNLDVVFVLDSSGSMKESDPEEIR-----TEAIKMFLDMSQVQGNK-----FGLV 78 Query: 222 TFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +S +V+ L + I+ + G T + G+ A N + ++ H Sbjct: 79 AYSDNVVREHNLDTINSNDDKERIKNMALNIPLGQKTDTGAGILEAVN-LMNSGHDKNHK 137 Query: 277 AKGHDDYKKYIIFLTDGENS------SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 II L+DG+N D K S+ C K +G VY IG+ + Sbjct: 138 P--------VIILLSDGKNDPQRKTEDSLKDLKSSISTC---KDKGYPVYTIGLNYDGTV 186 Query: 331 QFLK----NCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + + + Y + L I + K ++ Sbjct: 187 DKTQLEEMSNETKGKNYITSTAADLPKILTDIYADNSKLKV 227 >gi|218193011|gb|EEC75438.1| hypothetical protein OsI_11969 [Oryza sativa Indica Group] Length = 516 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 37/207 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 S+ S LD++ VLDVS SM +D++ A + LD + R Sbjct: 60 TSSATSRAALDLIAVLDVSTSMAGD---KLDRMKAALLFVIRKLDDVD---------RLS 107 Query: 220 LVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 +VTFS+ + PL + + ++ L T GLE + + Sbjct: 108 IVTFSNDAARLCPLRFVAGDAARADLGALVDGLAADGNTNIRAGLEIGL-AVAAGRRLTA 166 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + ++ ++DG+ + + + G V+ G+ A+ L+ Sbjct: 167 GRSVN-------VMLMSDGQQNRADATRLDP---------GGVPVHTFGLGADHDPAVLQ 210 Query: 335 NCASPDR---FYSVQNSRKLHDAFLRI 358 A R F+ V + L F ++ Sbjct: 211 AIAGKSREGMFHYVADGVNLTAPFSQL 237 >gi|157818579|ref|NP_001100919.1| calcium-activated chloride channel regulator 1 [Rattus norvegicus] gi|149026146|gb|EDL82389.1| chloride channel calcium activated 3 (predicted) [Rattus norvegicus] Length = 910 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 43/198 (21%), Positives = 68/198 (34%), Gaps = 37/198 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM ++++ A+R + V G+VTF S Sbjct: 308 VCLVLDKSGSMQSD--NRLNRMNQASRLFL--------LQTVEQGSWVGMVTFDSTAYVQ 357 Query: 231 FPLAWGVQHIQEK--INRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L I RL T GL+ A+ I Sbjct: 358 SELTQLNSGADRDLLIKRLPTVASGGTSICSGLQAAFTSIKKKYPTDGAE---------- 407 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 I+ LTDGE++ ++ C + K GAI++ + + AA + L + Y Sbjct: 408 IVLLTDGEDN--------TISSCFDLVKNSGAIIHTVALGPSAAKELEQLSKMTGGLQTY 459 Query: 344 SVQNSRK--LHDAFLRIG 359 S + L DAF + Sbjct: 460 SSDQIQNNGLVDAFAALS 477 >gi|305665950|ref|YP_003862237.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] gi|88710725|gb|EAR02957.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] Length = 349 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 63/174 (36%), Gaps = 19/174 (10%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+D++ +DVS SM ++L A R + E+++ + S R Sbjct: 80 TKLETVKREGVDIVFAVDVSKSMLAEDIAP-NRLEKAKRLVSEIINQLAS-------DRI 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G++ ++ + P+ + + + T + A + E Sbjct: 132 GIIAYAGQAFPQLPITTDYGAAKMFLQNMNTNMLTSQGTAINEAIELATTYYDDEEQTN- 190 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + ++DGE+ S +L ++A G ++ IGV Sbjct: 191 ------RVLFIISDGEDHSEGT----TLKAVDDAIEEGIQIFTIGVGKSKGAPI 234 >gi|13476511|ref|NP_108081.1| hypothetical protein mlr7847 [Mesorhizobium loti MAFF303099] gi|14027272|dbj|BAB54226.1| mlr7847 [Mesorhizobium loti MAFF303099] Length = 548 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 58/172 (33%), Gaps = 36/172 (20%) Query: 232 PLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L +++ + + GS T + GL + + A + K ++ Sbjct: 375 SLTDDFDKLRKAASEMTEWNGSGTNVSEGLSWGMRVLSPAAPYTDGAPWKTPGISKIVLL 434 Query: 290 LTDGENSSPNIDNKESLF-------------------------------YCNEAKRRGAI 318 LTDGEN + + C + K +G Sbjct: 435 LTDGENVVYGASEQPTKSDYTSYGYLAGGRFGSDDQTAAARNVDGWTKSVCTQLKNQGVQ 494 Query: 319 VYAIGVQAE--AADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +Y + +Q++ A CAS P +Y+V + KL D F I + + ++ Sbjct: 495 IYTMVLQSDTAANRALYSACASDPSGYYAVNDPAKLPDVFQHIANKFSRLQL 546 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 85/233 (36%), Gaps = 38/233 (16%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F+ + G+ ++L + LP I + ++ S K+ L LD + L ++ Sbjct: 3 LNKFWRSKSGNFALLMGLGLPAILSAVAFAVDVSTVMRAKSNLQNALDAANLASS----- 57 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + +Q + + ++ +D Sbjct: 58 ------HLGDLDISRTDAFDRYFQANIAGHGE-----------LANAQATLTVDRGVNFI 100 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 AV+ ++ F ++ H + ++ +SD L++++VLD + SM Sbjct: 101 KTKAVASADVNLNFGF--LFGHNRHIAVDASAV-----ESDNQLEVVLVLDNTGSMA--- 150 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT---FPLAW 235 G M L AT+S+ + L+ KS +R+ LV F + + F +W Sbjct: 151 GARMTALRTATKSLLDTLEATKSPTRQ---IRASLVPFVTAVNVNGDEFDPSW 200 >gi|330502932|ref|YP_004379801.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328917218|gb|AEB58049.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 566 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 87/217 (40%), Gaps = 26/217 (11%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 PW + + I +S S + +++ ++DVS SM+ G L + + Sbjct: 168 LAVTPWNPQTRLLRIAIKAS-DRSVEELPPANLVFLVDVSGSMHRREG-----LPMVQGT 221 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFS--SKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 ++ ++D ++ R LVT++ S+++ I+ I++L G +T Sbjct: 222 LKLLVDQLRPQD------RVSLVTYAGDSQVLLDSAPGSDKAKIRAAIDQLTAGGSTAGE 275 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G++ AY + +H+ G + I+ TDG+ + D + ++ G Sbjct: 276 SGIQLAYQ------QASKHLIDGGINR---ILLATDGDFNVGISDFDSLKQLAADKRKSG 326 Query: 317 AIVYAIGVQAEA-ADQFLKNC--ASPDRFYSVQNSRK 350 + +G + ++ ++ A + + N R+ Sbjct: 327 VSLTTLGFGVDNYNERLMEQLADAGNGNYAYIDNLRE 363 >gi|260797332|ref|XP_002593657.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] gi|229278884|gb|EEN49668.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] Length = 949 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 68/201 (33%), Gaps = 31/201 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+M+VLD S S+ D +++ ++ R G+V +S Sbjct: 25 AKLDLMLVLDGSGSVGDA------DFAKTLEFAENVVNAFDI---GTDLTRVGVVQYSDT 75 Query: 227 IVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A I N T + LE+A + Sbjct: 76 PTMEFNLGVHADKGSTIAAVNNIQYQNGGTATGAALEFA----------RANANWRGAPV 125 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DR 341 K +I +TDG++ ++L G VYAIGV L+ A+ + Sbjct: 126 PKVMIVVTDGKSGDDVTAAAQALA------GEGVAVYAIGVG-NYDLPELQQIANGNNNN 178 Query: 342 FYSVQNSRKLHDAFLRIGKEM 362 +Q+ L A +I ++ Sbjct: 179 VIELQDYNALTAAIDQIAGQV 199 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 54/140 (38%), Gaps = 24/140 (17%) Query: 218 SGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 G++ +SS + + F L + + I+ +++ T A + D + Sbjct: 721 VGVIQYSSTVQEEFSLNAHFTKTAVLNAIDNIVYMGGGTLTGA---AITYMKDNSQWRPG 777 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +A K I +TDG++S + A++ G ++AIGV A L Sbjct: 778 VA-------KIAIVVTDGKSSDDVGPPSSA------AQQTGITMHAIGVGANVDQTELSQ 824 Query: 336 CASPDRF------YSVQNSR 349 AS ++ Y +++ Sbjct: 825 IASTSQYVTTVADYDALDAQ 844 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 57/152 (37%), Gaps = 21/152 (13%) Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKL 273 + G++ +SS + + F L + + I+ ++ G T + A + D + Sbjct: 366 QIGVIQYSSTVQEEFSLNAHFTKTAVLNAIDNIVYMGGGTLTG----TAITYMKDNSQWR 421 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++A K I +TDG++S + G ++AIGV A L Sbjct: 422 PNVA-------KIAIVVTDGKSSDDVAAPSSAAQQ------AGITMHAIGVGANVDQTEL 468 Query: 334 KNCASPDRFY-SVQNSRKLHDAFLRIGKEMVK 364 AS ++ +V + L ++ + Sbjct: 469 SQIASTSQYVTNVADYDALDAQMAQLTASVCN 500 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 53/146 (36%), Gaps = 12/146 (8%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 DQ + +++ S+Y + + + + LD+ VLD S Sbjct: 464 DQTELSQIASTSQYVTNVADYDALDAQMAQLTASVCNGPTTPAPTCNAPLDLFFVLDGSG 523 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GV 237 S+ G DK+ T+++ DI + R G+V +S F L Sbjct: 524 SVT---GANFDKVKQFTKNVVNAFDISATA------TRVGVVQYSDSNTLEFNLGDHADK 574 Query: 238 QHIQEKINRLIF-GSTTKSTPGLEYA 262 I+ +++ G T + LE+A Sbjct: 575 PSTLAAIDSIVYQGGGTTTGSALEFA 600 >gi|194216197|ref|XP_001914777.1| PREDICTED: collagen, type XII, alpha 1 [Equus caballus] Length = 3120 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 101/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTMLSVRDLSADTEYQISVSA---MKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 78/234 (33%), Gaps = 37/234 (15%) Query: 145 CANSSHAPLLITSSVKISSKSDIGL-----------DMMMVLDVSLSMNDHFGPGMDKLG 193 + +K K D++ ++D S S+ + + Sbjct: 104 VPVIGQLTIQTGGPMKPGEKKSGKTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFI 163 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-G 250 A ++ + R G+V +SS F L + + I ++ + G Sbjct: 164 AA---------LVSAFDIGEEKTRVGVVQYSSDPRTEFNLNQYYQRDELLAAIKKIPYKG 214 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + ++Y F A + K I +TDG++ Sbjct: 215 GNTMTGDAIDYLIKNTFMES------AGARVGFPKVAIIITDGKSQDEVEIPAR------ 262 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 E + G V+++G++A A + + ++P + ++V N + D I ++ Sbjct: 263 ELRSIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 84/238 (35%), Gaps = 31/238 (13%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWQLNAHK 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 Q + E + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKQSLLEAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKL 351 + + K G ++A+G++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAVGIKNADEVELKMIATDPDDTHAYNVADFESL 1364 >gi|126662670|ref|ZP_01733669.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] gi|126626049|gb|EAZ96738.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] Length = 347 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 68/218 (31%), Gaps = 32/218 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 K+ + G+D++ +D+S SM D ++K I L Sbjct: 80 TKMETVKRQGIDIVFAVDISKSMLAEDIKPNRLEKTKQLVSQIINQLGN----------D 129 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G+V ++ P+ + + + + A D + Sbjct: 130 RVGIVGYAGSAYPILPMTTDYSIAKMYLQSMNTNMVSSQGTAFNDAIKLAVDYFDVK--- 186 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--QFLK 334 D K II ++DGE+ + + + AK +G + IGV E Sbjct: 187 -----DTSKLIILVSDGEDHG-----EGASEAIDLAKEKGVRILTIGVGTEKGALIPLKD 236 Query: 335 NCASPDRFYSVQNSRKLH-----DAFLRIGKEMVKQRI 367 N + + QN + D I + + I Sbjct: 237 NKGTISSYKKDQNGENVITKLYPDVLKNIATKTKSKYI 274 >gi|85708696|ref|ZP_01039762.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] gi|85690230|gb|EAQ30233.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] Length = 640 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 49/159 (30%), Gaps = 29/159 (18%) Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK--GHDDYKKYIIFLTDG- 293 ++ ++ L S T G+ + I A D ++I+F+TDG Sbjct: 481 RTDLETYVDGLTPRSNTYHDFGMIWGARFISPNGIFAASNATAPNGDAISRHIVFMTDGL 540 Query: 294 -------------------ENSSPNIDNKESLFY------CNEAKRRGAIVYAIGVQAEA 328 + + C A++ V+ I Sbjct: 541 LVPNQEIYSMYGIEWWDRRITNDGSGGQARDRHATRFQVACRAARQENISVWVIAFGTTL 600 Query: 329 ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + CA+P R + ++ L F +I +E+ R+ Sbjct: 601 TQNLID-CATPGRAFQANDTAALETRFEQIAQEIAALRL 638 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 76/228 (33%), Gaps = 39/228 (17%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK 76 I I A L+P++ +V G ++ S + + +L D L + + + Sbjct: 10 IVISAASLVPLMAMVGG-GVDASRYYMAETRLQAACDAGALAARRSMADDNFSRADRITG 68 Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP 136 F N +D+ +T ++ + +P Sbjct: 69 EKFFDE-----------NYPDGTFGLEDLERSFTATQ----------SGQVNGEASGTLP 107 Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--------NDHFGPG 188 P+ + + + V IS+ D++ V+DV+ SM G Sbjct: 108 TAIMA-PFGYDEFSLSVTCEADVNISNT-----DVLFVVDVTGSMNCAPDNPGGGSCGNT 161 Query: 189 MD---KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 D K+ ++ + D +++ + VR G+V ++S + L Sbjct: 162 EDPGAKIKGLRSAVLKFYDTVETSTSPSAQVRYGMVPYASNVNVGAAL 209 >gi|134093078|gb|ABO52938.1| matrilin 4 isoform 1 precursor [Colobus guereza] Length = 581 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 40/189 (21%), Positives = 76/189 (40%), Gaps = 25/189 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ V+D S S+ + + + ++ + P N R G++ +S Sbjct: 29 HTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLVRGLNVGP---NATRVGVIQYS 79 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ I L+ T + ++YA N F E + Sbjct: 80 SQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPEE 136 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 137 RVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 189 Query: 340 -DRFYSVQN 347 + + V++ Sbjct: 190 DEHVFLVES 198 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 344 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 395 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 449 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 450 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 497 >gi|325678986|ref|ZP_08158584.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109490|gb|EGC03708.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 782 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 74/190 (38%), Gaps = 11/190 (5%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++D S SM + + I ++ R G+ F+ + Sbjct: 286 VAFLIDNSGSMYPKELCPTSSENDVDFKRLDFTQSL--IDKFDDDFRIGISKFTGTYTKM 343 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 +++ +NR+ T+ + YN+ K E A G Y I+ L Sbjct: 344 CDFTDDRTELRKVLNRIR----TE-DEIFDGTYNQTALKKCINEFSAAGDGKYVNIIVML 398 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNS 348 +DGE+ + + ESL N A + IV +G+ E +L+ A + ++YS ++ Sbjct: 399 SDGESDEVDAETIESLS--NLANEKSVIVLTVGLGREIDRAWLQEVAYSTGGKYYSASDA 456 Query: 349 RKLHDAFLRI 358 L D + +I Sbjct: 457 TSLDDVYKQI 466 >gi|254453558|ref|ZP_05066995.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] gi|198267964|gb|EDY92234.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] Length = 676 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 42/184 (22%), Positives = 71/184 (38%), Gaps = 19/184 (10%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW A++ + + + S + L+++ ++D S SM DKL + +S R Sbjct: 294 TPWNADTQLVHIGLQGEM-PSIQDRPALNLVFLIDTSGSME-----SADKLPLLRQSFRL 347 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 MLD + +V V +G + + + Q I +N L G +T GLE Sbjct: 348 MLDNLAPEDEVAIVTYAGSTSIALQPTQASE----RTTILAALNALNAGGSTNGQGGLEQ 403 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 AY L K D + I TDG+ + D + Y + + G + Sbjct: 404 AY--------ALAETMKTDGDVSRVI-LATDGDFNVGLSDPRGLQAYIEDKRDTGTYLSV 454 Query: 322 IGVQ 325 +G Sbjct: 455 LGFG 458 >gi|326911082|ref|XP_003201891.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Meleagris gallopavo] Length = 951 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 78/212 (36%), Gaps = 29/212 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ N ++ FS++I Sbjct: 308 VVFVLDSSASMVG------TKLRQTKEALFTILQDLRPEDHFN------IIGFSNRIKVW 355 Query: 228 --VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P+ ++ ++ I+ + T L+ + D + A+ Sbjct: 356 QQDRLVPVTPNNIRDAKKYIHNMSPTGGTNINSALQTGAKLLNDYIAQNNIDARSVS--- 412 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASP 339 IIFLTDG + + + L +A R ++ IG+ + + L+NC Sbjct: 413 -LIIFLTDGRPTVGETQSSKILSNTKDAIRDKFCLFTIGIGNDVDYKLLERMALENCGMV 471 Query: 340 DRFYSVQNSRKLHDAFLR-IGK-EMVKQRILY 369 F +++ F IG + RI Y Sbjct: 472 RHFQEDEDAASHLKGFYDEIGTPLLSDIRIDY 503 >gi|311244457|ref|XP_001927071.2| PREDICTED: collagen alpha-1(XII) chain [Sus scrofa] Length = 1894 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 101/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T LS+ +Y +S + M + + P P+ + Sbjct: 365 QTTMLSVRDLSADTEYQISVSA---MKGLTASEPISIMEKTQPMKVQVECSRGVDIKA-- 419 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 420 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 470 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 471 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 524 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 525 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 577 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 578 TVED----FDAFQRISFELTQSICL 598 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 120 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 170 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + +++ F A + K Sbjct: 171 EFNLNQFYQRDELLAAIKKIPYKGGNTMTGDAIDFLIKNTFTES------AGARVGFPKV 224 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 225 AIIITDGKSQDEVEIPAR------ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFN 278 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 279 VANFDAIVDIQNEIISQV 296 >gi|239908812|ref|YP_002955554.1| hypothetical protein DMR_41770 [Desulfovibrio magneticus RS-1] gi|239798679|dbj|BAH77668.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 439 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 67/234 (28%), Gaps = 26/234 (11%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + + P + F + + + +K L++ + +D S SM G Sbjct: 5 NVLLTPRRPALVAGFDNTLDVLVRIQAPNTP-EGETKERTRLNLALAIDRSGSMA---GR 60 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKIN 245 +++ + + L N R L+ + S I P I Sbjct: 61 PLEEAKRCASFVVDKL---------KNTDRVSLIAYDSSIETRVPSVKVEDKAIFHRAIE 111 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + G T G +I + II L+DG+ + D E Sbjct: 112 GIDDGGCTNLHGGWLKGAEQISP--------YIDPSTISR-IILLSDGQANEGLTDEAEI 162 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLR 357 C E G G+ + + + A Y + + L D F Sbjct: 163 FKQCRELADAGVTTSTYGLGSNFNETLMIGMAKNGQGNSYYGRTADDLMDPFQE 216 >gi|237737389|ref|ZP_04567870.1| BatB protein [Fusobacterium mortiferum ATCC 9817] gi|229421251|gb|EEO36298.1| BatB protein [Fusobacterium mortiferum ATCC 9817] Length = 322 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 38/168 (22%), Positives = 60/168 (35%), Gaps = 31/168 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+++ +++D S SM ++L R + ++ +K R G + FS Sbjct: 78 KGMNIYVLIDTSRSMLTE-DVYPNRLEAGKRVLTNLIQSLKG-------DRVGFIPFSDS 129 Query: 227 IVQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL Q IN LI G T+ LE A E K Sbjct: 130 AYIQMPLTDDYNITQNYINAIDTTLISGGGTELYQALELA-----------EKSFKEIGS 178 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K +I ++DG + D K + K VY+IGV + + Sbjct: 179 ENKTVIVISDG----GDFDKKSL----DFVKENKIDVYSIGVGTKEGN 218 >gi|224097862|ref|XP_002311085.1| predicted protein [Populus trichocarpa] gi|222850905|gb|EEE88452.1| predicted protein [Populus trichocarpa] Length = 713 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 48/250 (19%), Positives = 90/250 (36%), Gaps = 41/250 (16%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + P +S+ S++ +D++ VLD+S SM KL Sbjct: 228 NFTVLVHLKAGATVARENPRGNLASLPQLSQTPRAPVDLVTVLDISGSMAG------TKL 281 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLI 248 + R++ ++ + S R ++ FSS + F L G QH + +N L+ Sbjct: 282 ALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFSLRRMSDAGRQHALQAVNSLV 335 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T GL + + +EK + II L+DG+++ + + Sbjct: 336 ANGGTNIAEGLRKGAKVMEERREKNPVAS---------IILLSDGQDTYTVSGSSGNQPQ 386 Query: 309 CNE----------AKRRG--AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDA 354 N G V+A G A+ + + + S F ++ + DA Sbjct: 387 PNYRLLLPLSIHGGDNAGFQIPVHAFGFGADHDASSMHSISEISGGTFSFIETEAVIQDA 446 Query: 355 FLR-IGKEMV 363 F + IG + Sbjct: 447 FAQCIGGLLS 456 >gi|260461186|ref|ZP_05809435.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259033220|gb|EEW34482.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 523 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 62/172 (36%), Gaps = 36/172 (20%) Query: 232 PLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 PL + ++ ++ GS T + GL + + A + + K ++ Sbjct: 350 PLTADLDKLRTAAAQMQEWNGSGTNVSEGLSWGMRVLSPAPPYTDGAPWKTPNTSKIVVL 409 Query: 290 LTDGEN--------------------------SSPNIDNKE-----SLFYCNEAKRRGAI 318 LTDGEN +S D +L C++ K + Sbjct: 410 LTDGENVVYGASAEPEKSDYTSYGYLSSGRFGTSNQTDAARSVDRWTLDVCDKLKAQQVQ 469 Query: 319 VYAIGVQAE--AADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +Y I +Q++ A CA+ P +Y+V + KL + F I + ++ Sbjct: 470 IYTITLQSDTAANRTLYGKCATNPADYYAVNDPSKLPNVFQTIAGKFTTLQL 521 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 80/233 (34%), Gaps = 38/233 (16%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 I F+ + G+ ++L A+ P I +G + S K L LD + L ++ + + Sbjct: 3 INKFWRSESGNFALLFALAAPAILAAVGFAADVSSVMRAKVNLQNSLDAATLS-SSHLSD 61 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 E +Q + N + L++ +D Sbjct: 62 DEAA----------RRLAFDGYFQANVANHPE-----------LTNAKLTLSVDKGFNYV 100 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 A++ ++ F ++ H + ++ L++++VLD + SM Sbjct: 101 KTKAIASADVNLYFAF--LFGDNQHIEVDAGGV-----EATNNLEVVLVLDNTGSMAGA- 152 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT---FPLAW 235 K+ + + +LD + + V + +V F + + F +W Sbjct: 153 -----KIKALRDATKVLLDNLDGAKSPDRKVTAAIVPFVTAVNINGDKFDPSW 200 >gi|317406818|gb|EFV86930.1| von Willebrand factor type A domain-containing protein [Achromobacter xylosoxidans C54] Length = 252 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 55/174 (31%), Gaps = 27/174 (15%) Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYA 262 + R GL+ F + PL +Q + G T + A Sbjct: 19 VADFITRRGDDRLGLIVFGTAAYPQAPLTLDHAALQLLLRHTAVGMAGPNTAIGDAIGLA 78 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + E K +I LTDG ++ + + + A + ++ I Sbjct: 79 IRMLDAVDE-----------PDKVLILLTDGNDTGSAVPPQRA---ATLAAQHHIRIHTI 124 Query: 323 GVQAEAAD-------QFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 G+ A L++ A+ RF+ + L + + ++ +R+ Sbjct: 125 GMGDPQARGDDKVDFDLLEHIAQATGGRFFQANDRESLQQVYATL-DQITPRRV 177 >gi|148699894|gb|EDL31841.1| procollagen, type VI, alpha 1, isoform CRA_c [Mus musculus] Length = 227 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 16/167 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS--IPDVNNVV-RSGLVT 222 D +D+ VLD S S+ P + + +D ++ N+V +G + Sbjct: 32 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALH 91 Query: 223 FSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S ++ L G ++ ++ + FG T + ++ ++ Sbjct: 92 YSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG-------- 143 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGV 324 H KY+I +TDG + L NEAK G V+++ + Sbjct: 144 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAI 190 >gi|79607904|ref|NP_974433.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645764|gb|AEE79285.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 632 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 37/158 (23%), Positives = 62/158 (39%), Gaps = 29/158 (18%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +IS +D++ VLD+S SM KL + R++ ++ + S R Sbjct: 233 QISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD------RLS 280 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ FSS + FPL G Q + +N L+ T GL + D E+ Sbjct: 281 VIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSV 340 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDN----KESLFYC 309 + II L+DG ++ K++L C Sbjct: 341 AS---------IILLSDGRDTYTTNHPDPSYKDALAQC 369 >gi|79315048|ref|NP_001030861.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645765|gb|AEE79286.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 633 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 37/158 (23%), Positives = 62/158 (39%), Gaps = 29/158 (18%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +IS +D++ VLD+S SM KL + R++ ++ + S R Sbjct: 234 QISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD------RLS 281 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ FSS + FPL G Q + +N L+ T GL + D E+ Sbjct: 282 VIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSV 341 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDN----KESLFYC 309 + II L+DG ++ K++L C Sbjct: 342 AS---------IILLSDGRDTYTTNHPDPSYKDALAQC 370 >gi|260061450|ref|YP_003194530.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] gi|88785582|gb|EAR16751.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] Length = 348 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 68/175 (38%), Gaps = 21/175 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K+ + G+D++ +DVS SM + P ++L A R + E+++ + S R Sbjct: 80 TKLETVKREGVDIVFAVDVSKSMLAEDIAP--NRLEKAKRLVSEIINELAS-------DR 130 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G++ ++++ P+ + + + T A I +A E Sbjct: 131 VGIIAYAAQAFPQLPITTDYGAAKMFLQSM----NTDMLSSQGTA---IHEAIELAATYF 183 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + + ++DGE+ + + +A +G ++ IGV + Sbjct: 184 DDEEQTNRILFLVSDGED----HAEDQVMDAIEQATDQGIRIFTIGVGSARGAPI 234 >gi|302532683|ref|ZP_07285025.1| predicted protein [Streptomyces sp. C] gi|302441578|gb|EFL13394.1| predicted protein [Streptomyces sp. C] Length = 248 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 66/192 (34%), Gaps = 32/192 (16%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLI---TSSVKISSKSDIGLDMMMVLDVSL 179 Y+ + ++R + C + P K K + +VLDVS Sbjct: 77 PQYDPAMITRKRLAAGACGLFVALTAGFLPSTAAAADGPAKEPPK------VELVLDVSG 130 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--------KIVQTF 231 SM + G ++ A ++ E+LD + +V +R+ T+ Q + Sbjct: 131 SMRANDIDGQSRMAAAKQAFNEVLDAV--PDEVRLGIRTLGATYPGDDRALGCKDTKQLY 188 Query: 232 PL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P+ + + L T P L+ A + + + I+ + Sbjct: 189 PVGTVNRTEAKTAVATLAPTGWTPIGPALQAAAQDL------------EGGNATRRIVLI 236 Query: 291 TDGENSSPNIDN 302 TDGE++ +D Sbjct: 237 TDGEDTCAPLDP 248 >gi|183222779|ref|YP_001840775.1| putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|189912810|ref|YP_001964365.1| hypothetical protein LBF_3320 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777486|gb|ABZ95787.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781201|gb|ABZ99499.1| Hypothetical protein; putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 550 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 75/211 (35%), Gaps = 25/211 (11%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 +H L + + + L + + +D S SM K+ + +++ + Sbjct: 22 NHLLLRFRTPANPNVEERKPLVIGLAIDKSWSMKGE------KMEAVIDASCALVNWLTR 75 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ--EKINRLIFGSTTKSTPGLEYAYNKI 266 V+ +V +S+ + P+ + + +KI + ++T + G A + Sbjct: 76 HDAVS------IVAYSADVQLIQPVTHLTEKVSVTDKIRNIQVATSTNLSGGWLSALKSL 129 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +K + K ++ LTDG +S D + + + G IGV Sbjct: 130 NQSKIPNAY---------KRVLLLTDGNPTSGIKDKEALVTIAADHLSMGISTTTIGVGN 180 Query: 327 EAADQFLKNC--ASPDRFYSVQNSRKLHDAF 355 + ++ L A FY + N D F Sbjct: 181 DFNEEMLVEIAKAGGGNFYYIDNPENASDIF 211 >gi|183981216|ref|YP_001849507.1| hypothetical protein MMAR_1194 [Mycobacterium marinum M] gi|183174542|gb|ACC39652.1| conserved hypothetical protein [Mycobacterium marinum M] Length = 772 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 75/226 (33%), Gaps = 40/226 (17%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 L + + SS D+++VLD S SM K+ A R+ ++D++ + Sbjct: 291 GTWSLTLVPPAEPSS---APRDVVVVLDRSGSMGGW------KMVAARRAAGRIVDMLDA 341 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLE 260 R ++ F +I + + L T L Sbjct: 342 GD------RFCVLAFDDRIETPPAMPDGLVPASDRNRFAASSWLGSLRSRGGTVMAQPLT 395 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + D+ E + ++ +TDG+ S + + + +Y Sbjct: 396 NAVEMLADSGEDRQAS----------VVLVTDGQISGEDHLLRSLAPVVGRTR-----IY 440 Query: 321 AIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +GV FL+ A R V++ +L + R+ + + + Sbjct: 441 CVGVDRAVNAGFLERLAGLGSGRAELVESEDRLDEVMARLARTIGR 486 >gi|170578661|ref|XP_001894496.1| Transmembrane cell adhesion receptor mua-3 precursor [Brugia malayi] gi|158598882|gb|EDP36664.1| Transmembrane cell adhesion receptor mua-3 precursor, putative [Brugia malayi] Length = 1742 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 89/235 (37%), Gaps = 23/235 (9%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y ++++ P + + K D+M ++D S S+ G + K Sbjct: 469 SYTCQCYAGFVDVSSSANLQPGRVCTVQTTCPKQKT--DLMFLIDGSGSI----GSYVFK 522 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI- 248 V R I+E +++ D R GL+ +S +I F L+ + I+++ Sbjct: 523 NEVL-RFIKEFVELFDIGLDN---TRVGLIQYSDQIRHEFDLSQYTDKASVISAISQVQY 578 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T++ +++ + F + K DD + I +TDG + + Sbjct: 579 LTGLTRTGAAIQHMVMEGFSERR---GARKEGDDVARVSIVITDGRSQDNVTEP------ 629 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEM 362 A++ ++++GV L+ A SP R++ V + L + ++ Sbjct: 630 AXNARKSHINMFSVGVTDHVLGPELEAIAGSPLRWFHVDKFKDLDTRLRSLIQKA 684 >gi|163858556|ref|YP_001632854.1| hypothetical protein Bpet4238 [Bordetella petrii DSM 12804] gi|163262284|emb|CAP44587.1| hypothetical protein Bpet4238 [Bordetella petrii] Length = 244 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 82/209 (39%), Gaps = 21/209 (10%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K ++ L ++++LDVS SM+ K+ ++R+MLD + + Sbjct: 8 PSKFTAPKAKPLPVVLLLDVSGSMSGE------KIRNVNDAVRDMLDTFSDTENGETEIH 61 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++TF S++ PLA L G T L+ A I D Sbjct: 62 VAIITFGSQVALHQPLA---SASDIHWQDLSAGGMTPLGTALQMAKAMIED------KDV 112 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFY-CNEAKRRGAIVYAIGVQAEAADQFLKNC 336 Y+ ++ ++DG PN ++ L ++ + A+ + A+A + L Sbjct: 113 VPSRAYRPTVVLVSDG---GPNDAWEKPLNAFISDGRSAKCDRLAMAIGADADEAVLGKF 169 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 + +R + +N+++L D F + + Sbjct: 170 IEGTSNRLFYAENAKQLRDFFKFVTMSVT 198 >gi|148698183|gb|EDL30130.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 456 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 44/209 (21%), Positives = 83/209 (39%), Gaps = 34/209 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S D++ ++D S S+ + + I +++D + + GLV +S Sbjct: 230 SGSATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSDRLAQ---VGLVQYS 280 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S I Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 281 SSIRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARP 334 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +K I TDG + D +AK G ++A+GV ++ + + P Sbjct: 335 GA--QKVGIVFTDGRSQDYINDAAR------KAKDLGFKMFAVGVGNAVEEELREIASEP 386 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 D ++ + + ++ IGK++ KQ Sbjct: 387 VADHYFYTADFKTINQ----IGKKLQKQI 411 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 73/196 (37%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R GLV ++S + Sbjct: 1 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGLVNYASTVKP 51 Query: 230 TFPL-AWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL A G + + + + R+ + T + L++A K E D K Sbjct: 52 EFPLRAHGSKASLLQAVRRIQPLSTGTMTGLALQFAITKALSDAE---GGRARSPDISKV 108 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-----DR 341 +I +TDG D E A+ G ++AIGV + + P D Sbjct: 109 VIVVTDGRPQDSVRDVSE------RARASGIELFAIGVGRVDKATLRQIASEPQDEHVDY 162 Query: 342 FYSVQNSRKLHDAFLR 357 S KL F Sbjct: 163 VESYNVIEKLAKKFQE 178 >gi|284029570|ref|YP_003379501.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808863|gb|ADB30702.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 654 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 70/208 (33%), Gaps = 33/208 (15%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---- 225 +M+VLD S SM G ++ A R++ M+D + + V + S Sbjct: 35 VMVVLDSSGSMTARDAGGSGTRMDAAKRAVGSMVDGLPAGAQVGLAIYGAGTGSSGAEKV 94 Query: 226 ------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++VQ ++ + T L A ++ Sbjct: 95 AGCKDVRVVQPVGPV-NKPALKRAVTATKASGYTPIGQALRTAAAQLPKEG--------- 144 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAEAADQFLKNCA 337 ++ I+ ++DGE++ + E ++G V+ IG + +A + C Sbjct: 145 ----QRSIVLVSDGEDTCA---PPQPCEVAKELSKQGVDLHVHTIGFRVDAKARAQLACI 197 Query: 338 ---SPDRFYSVQNSRKLHDAFLRIGKEM 362 + ++ ++ L R+ + Sbjct: 198 AQNTGGTYHDASDADSLLGVLGRVTERA 225 >gi|171912902|ref|ZP_02928372.1| hypothetical protein VspiD_17020 [Verrucomicrobium spinosum DSM 4136] Length = 652 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 67/175 (38%), Gaps = 28/175 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + G ++ + +D S SM ++LG A + +++L+ +P+ R G Sbjct: 81 ETRVDERSGRNIFIAIDTSKSMLADDVSP-NRLGRAKLAAQDLLE---RLPN----DRVG 132 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ F+ + PL + + E I L I + ++ A I D + EH Sbjct: 133 VIAFAGRSYLQAPLTNDHEAVIECIQSLDHTTIPRGGSSIASAIQLAVETI-DKVKGREH 191 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 ++ TDG+ + + +L A ++G IV +GV Sbjct: 192 G----------MVLFTDGQETD-----EATLTAARMAAQKGLIVIPVGVGTTEGA 231 >gi|312883763|ref|ZP_07743482.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] gi|309368512|gb|EFP96045.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] Length = 396 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 43/368 (11%), Positives = 114/368 (30%), Gaps = 36/368 (9%) Query: 22 AILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNN------GKKQ 75 A+L+P++ ++ ++ + + D + + K + K Sbjct: 20 AMLIPMVIAAASTIV-IGYQVQLSNRAMQAADAASIACEFKGEYDQALTQSYLDYYQPKI 78 Query: 76 KNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM 135 + L + +++ TS + K Y V + Sbjct: 79 DKVRGQIRTNSGCNMSLGYSLSTIFTSLTLSD----TSFVVSSTANEKAYVTEDVVSDPL 134 Query: 136 PFIF---------CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + ++ + +++S+ + + ++ + ++ Sbjct: 135 ELVIVLDISTSMYGAINDLKAILKRGIVSLKEQQNNAQSEDHIKVSIIPFSTGVSVNNAP 194 Query: 187 PGMDKLGVATRSIREMLD------IIKSIPDVNNVVRSGLVTFS------SKIVQTFPLA 234 D E D + ++ ++ + L + S T PL Sbjct: 195 WLNDARTFCVDGTTESEDKFYAARTVANLDITHDQISVKLSQPNKWRESCSAASFTLPLT 254 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI--AKGHDDYKKYIIFLTD 292 + + ++ L T S GL + ++ +K + + D ++ ++ +TD Sbjct: 255 ADLDQVTNTVDSLRTEGGTASYQGLIWGLRQLTPNWQKAWEVGPNRNVDKVERKLVLMTD 314 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--QFLKNCASPDRFYSVQNSRK 350 G + D+ + C+ AK G + +G + QF + P +S +++ Sbjct: 315 GNDYGRYFDDLINAGLCDRAKDYGIALNFVGFGVNGSRLEQFTRCAVDPKGVFSASDTQD 374 Query: 351 LHDAFLRI 358 L F ++ Sbjct: 375 LDHYFSQL 382 >gi|327313514|ref|YP_004328951.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326944388|gb|AEA20273.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 331 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 64/180 (35%), Gaps = 28/180 (15%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S +S+ G++ ++ LD+S SM +L + + +++ Sbjct: 79 GSKLATSRQREGIETIIALDISNSMLAEDVSP-SRLEKSKLLVENLMNKFSE-------D 130 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + GL+ F+ P+ + ++ LI T L+ + N Sbjct: 131 KIGLIVFAGDAFVQLPITGDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFT----- 185 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K II +TDGE++ + +A+ +G V+ +G+ ++ Sbjct: 186 ------PNSKVGKAIILITDGEDNEGGAE-----AMAKQARNKGIKVFILGIGSKEGSTI 234 >gi|126330546|ref|XP_001381755.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Monodelphis domestica] Length = 495 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 83/212 (39%), Gaps = 34/212 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + D++ ++D S S+ + R I +++D + + + GLV Sbjct: 266 GAAGGSATDLVFLIDGSKSVRPE------NFELVKRFINQIVDSLDVSD---KLAQVGLV 316 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +SS + Q FPL G ++ I + T + L+Y + D + Sbjct: 317 QYSSSVRQEFPL--GRYKTKKDIKAAVKKMSYMEKGTMTGAALKY----LIDNTFTISSG 370 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A+ +K I TDG + D +AK G ++A+GV D+ + Sbjct: 371 ARPG--AQKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIA 422 Query: 337 ASP--DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + P + ++ + + ++ IGK++ K+ Sbjct: 423 SEPVAEHYFYTADFKTINQ----IGKKLQKKI 450 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 73/196 (37%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ H + + ++++ + P N R G++ ++S + Sbjct: 40 DLVFIIDSSRSVRPH------EFEKVKVFLSQVIESLDVGP---NTTRVGVINYASAVKH 90 Query: 230 TFPLAWGVQHIQ--EKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL + + ++ + T + +++A NK F E + K Sbjct: 91 EFPLKAHRSKASLLQAVRKIEPLSTGTMTGLAIQFAINKAFSEVEGSRLKF---PEISKV 147 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG D AK+ G ++AIGV + + P D Sbjct: 148 AIVVTDGRPQDGVKDVSA------RAKQSGIELFAIGVGRVDKHTLRQIASEPLDDHVDY 201 Query: 345 VQN---SRKLHDAFLR 357 V++ KL F Sbjct: 202 VESYSVIEKLSKKFQE 217 >gi|325860337|ref|ZP_08173459.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482216|gb|EGC85227.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 331 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 64/180 (35%), Gaps = 28/180 (15%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S +S+ G++ ++ LD+S SM +L + + +++ Sbjct: 79 GSKLATSRQREGIETIIALDISNSMLAEDVSP-SRLEKSKLLVENLMNKFSE-------D 130 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + GL+ F+ P+ + ++ LI T L+ + N Sbjct: 131 KIGLIVFAGDAFVQLPITGDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFT----- 185 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K II +TDGE++ + +A+ +G V+ +G+ ++ Sbjct: 186 ------PNSKVGKAIILITDGEDNEGGAE-----AMAKQARNKGIKVFILGIGSKEGSTI 234 >gi|56797859|emb|CAG27402.1| matrilin-3b [Danio rerio] Length = 434 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 81/224 (36%), Gaps = 28/224 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 P + P + + + LD++ ++D S S+ + + E Sbjct: 178 TPRATTAPAKPPTVPAPAEPC--KSRPLDLVFIIDSSRSVRPA------EFEKVKIFLSE 229 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPG 258 M+D + ++ R LV ++S + F L + +++ +R+ + T + Sbjct: 230 MVDSLDI---GSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMA 286 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++ A +++F + KG K I +TDG + + A+ G Sbjct: 287 IKTAMDQVFTENAGARPLKKG---IGKVAIIVTDGRPQDKVEEVSAA------ARASGIE 337 Query: 319 VYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 +YA+GV + + P D + V+ KL F Sbjct: 338 IYAVGVDRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 381 >gi|149918184|ref|ZP_01906676.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] gi|149820944|gb|EDM80351.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] Length = 522 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 44/230 (19%), Positives = 78/230 (33%), Gaps = 21/230 (9%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + + I + +++ +LDVS SM+D DKL + T S+R + Sbjct: 129 PWSDEARLVHIGLQGK-SIPERELPPRNLVFLLDVSGSMHD-----QDKLPLLTDSLRVL 182 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ + V VV +G +V I I+ L G +T G++ A Sbjct: 183 VNQLGERDHVAIVVYAGASG----VVLPPTRGSDRGTILAAISELRAGGSTNGGEGIQKA 238 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y EH + +I TDG+ + + G + + Sbjct: 239 YAL------AREHFDPQGINR---VILATDGDFNVGTTTESGLENLIERERESGVFLTVL 289 Query: 323 GVQAEA-ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 G D+ ++ A L +A +G E + K Sbjct: 290 GFGRGNLGDRTMEMLADKGNGNYAY-IDSLAEARKVLGTEAGSTLVTIAK 338 >gi|47212423|emb|CAF93579.1| unnamed protein product [Tetraodon nigroviridis] Length = 688 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 44/196 (22%), Positives = 74/196 (37%), Gaps = 31/196 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D++ ++D S S+ H M + M+DI+ ++ N R G+V +SS+ Sbjct: 4 GPVDLLFLIDSSRSVRPHEFETMRRF---------MIDILNTLDIGLNSTRVGVVQYSSQ 54 Query: 227 IVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + + + T + + Y N F A E Sbjct: 55 VRSEFSLR-SHASLDSMVKAIQEMVPLAQGTMTGLAIRYTMNVAFTAAEGDRPKVPN--- 110 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--- 339 ++ +TDG EA+ RG +YA+GV A A L+ ASP Sbjct: 111 ---VVVIVTDGRPQD------RVAEVAAEARERGMEIYAVGV-ARADMTSLRAMASPPFE 160 Query: 340 DRFYSVQNSRKLHDAF 355 D + V++ L F Sbjct: 161 DHVFLVESF-DLIHQF 175 >gi|227818462|ref|YP_002822433.1| hypothetical protein NGR_b02140 [Sinorhizobium fredii NGR234] gi|227337461|gb|ACP21680.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 440 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 19/151 (12%), Positives = 52/151 (34%), Gaps = 10/151 (6%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 ++ + G+ ++TA+L+PV+F+ + + ++ +K+ LD + + Sbjct: 8 MQRLLKDQSGNFGLMTALLVPVLFLSGSVALNIANATREASKMQDALDAAAIKAVRSYGE 67 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 E+ N + + N + + + + + Q Sbjct: 68 GESENAVRTEANRLFFANFQT---------PSATDGYNSASPESPAVEFTFSETGQETRA 118 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 + S ++Y P + P+ + +T Sbjct: 119 SASYAAQYN-PVFWGLQPFVISRRSVAARLT 148 >gi|187470892|sp|A6NF34|ANTRL_HUMAN RecName: Full=Anthrax toxin receptor-like; Flags: Precursor Length = 565 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 37/185 (20%), Positives = 70/185 (37%), Gaps = 25/185 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ +LD S S+N+++ + + + N +R +T+S+ Sbjct: 72 QGSFDLYFILDKSGSVNNNW----------IDLYMWVEETVARFQSPN--IRMCFITYST 119 Query: 226 KIVQTFPLAWGVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL I+ ++ +++ T G A +I + Sbjct: 120 DGQTVLPLTSDKNRIKNGLDQLQKIVPDGHTFMQAGFRKAIQQIESFN--------SGNK 171 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II +TDGE + +++L +A++ GA VY +GV DQ SP Sbjct: 172 VPSMIIAMTDGELVAHAF--QDTLREAQKARKLGANVYTLGVADYNLDQITAIADSPGHV 229 Query: 343 YSVQN 347 ++V+N Sbjct: 230 FAVEN 234 >gi|325286052|ref|YP_004261842.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321506|gb|ADY28971.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 348 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 69/209 (33%), Gaps = 28/209 (13%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSS-VKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 S ++ FC F L K+ + G+D++ +DVS SM Sbjct: 49 TKSTFKSILKFCVFILGLTLLIVGLANPKIGTKLETVKREGVDIVFAVDVSKSMLAEDIA 108 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +L A R + E++ + S R G++ ++ + P+ + + L Sbjct: 109 P-SRLAKAKRIVSEIIAQLGS-------DRIGIIAYAGQAYPQLPITTDYGAAKMFLQGL 160 Query: 248 I----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 T L+ A D ++ + + ++DGE+ Sbjct: 161 NTNMLSSQGTAINQALDLASTYYDDDEQTN-----------RVLFIISDGED----HSEG 205 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +A +G ++ IGV E Sbjct: 206 STEGAVEKAVDQGIKIFTIGVGTEKGAPI 234 >gi|297243668|ref|ZP_06927599.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] gi|296888419|gb|EFH27160.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] Length = 560 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 64/197 (32%), Gaps = 29/197 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K G++ ++LD S SM D + R++ + L++F Sbjct: 194 KDKQGVNYEILLDKSGSMEDSIDTMKRTMSDFVRNLNYKVGDTGE-----------LISF 242 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S ++ + I+ + T L + Sbjct: 243 DSYLMYMATYTNDKDRLLTGIDNMTPYGMTALYDALYTGITNASNHPGFNC--------- 293 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 +I TDG+++ E + AK +G VY IG EA L+N A + Sbjct: 294 ---VIAFTDGQDNESTHTADEVISL---AKEKGIPVYLIGTS-EADSSTLQNIANETNGY 346 Query: 342 FYSVQNSRKLHDAFLRI 358 F+ + + + + RI Sbjct: 347 FWDMNSISDMQEVMNRI 363 >gi|147899676|ref|NP_001087858.1| collagen alpha-1(XXI) chain precursor [Xenopus laevis] gi|82234134|sp|Q641F3|COLA1_XENLA RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|51950065|gb|AAH82384.1| MGC81791 protein [Xenopus laevis] Length = 957 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 45/214 (21%), Positives = 84/214 (39%), Gaps = 39/214 (18%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + K V S + + G+V Sbjct: 29 SSCRTAPNDLVFILDGSWSVGPENFEILKKWVVNITSNFNI---------GPKFTQVGVV 79 Query: 222 TFSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +S + PL + + I L G T++ +++A + +F Sbjct: 80 QYSDYPILEIPLGSYESIDDLSRRTQSIQYL--GGNTQTGNAIQFAIDNLF--------- 128 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A+ K I LTDG++ + EA++ ++AIGV +E + L+ Sbjct: 129 ARSLRPLTKIAIVLTDGKSQD------DVKHIAEEARKNKITLFAIGVGSEIEESELRAI 182 Query: 337 A---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S + V++ + A RI +E++KQ++ Sbjct: 183 ANKPSSTYVFYVED----YIAISRI-REIMKQKL 211 >gi|73973308|ref|XP_539002.2| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 1 [Canis familiaris] Length = 3065 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISISA---MKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRG------NVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 78/235 (33%), Gaps = 39/235 (16%) Query: 145 CANSSHAPLLITSSVKISSKS-----------DIGLDMMMVLDVSLSMNDH-FGPGMDKL 192 + K K D++ ++D S S+ + F +D + Sbjct: 104 VPVIGQLTIQTGGPTKPGEKKPGKPEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFI 163 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF- 249 G + R G+V +SS F L + + I ++ + Sbjct: 164 GALVSA----------FDIGEEKTRVGVVQYSSDTRTEFNLNQYYRTDELLAAIKKIPYK 213 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T + ++Y F A + K I +TDG++ Sbjct: 214 GGNTMTGDAIDYLIKNTFTES------AGARVGFPKVAIIITDGKSQDEVEIPAR----- 262 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 E + G V+++G++A A + + ++P + ++V N + D I ++ Sbjct: 263 -ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 86/245 (35%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I + + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ D+ PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEDELKMIATDPDDTHAYNVADFESLSK 1366 Query: 354 AFLRI 358 + Sbjct: 1367 IVDDL 1371 >gi|145527514|ref|XP_001449557.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124417145|emb|CAK82160.1| unnamed protein product [Paramecium tetraurelia] Length = 606 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 111/309 (35%), Gaps = 40/309 (12%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQT------DFRNELRENGFAQDINNIERSTSLSII 117 +N N ++Q + + +K+ + ++ + D ++I + Sbjct: 72 VNISKNNVKQQQSSQGLFGQVKDYFSNLFGGDPQYQYKYEPKSQYDDDDDILQQYQKQGG 131 Query: 118 IDDQHKDYN-LSAVSRYEMPFIFCTFPWCANS--SHAPLLITSSVKISSKSDIGLDMMMV 174 + +D L ++Y+ + + + S I K +S + G+D++ V Sbjct: 132 LPVNIEDMIELKVEAQYDYCKLKKSEKQIIPAMVSIITKDIEQYAKNNSSIEAGIDLICV 191 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 +D S SM+ K+ +S+ ++L+ + + R L+ F + PL Sbjct: 192 IDKSGSMSGQ------KIESVQQSLVQLLNFL------SEKDRLCLIVFDGGAKRHTPLK 239 Query: 235 W----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ + I + G +T G + A+ +I + K I L Sbjct: 240 TLTEGNKKYFKGAIAAISAGGSTNIAAGTDIAFQQI------QQRKMKNQVTS---IFLL 290 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-KNC-ASPDRFYSVQNS 348 +DG++S E + + +++ G + + K C FY ++N Sbjct: 291 SDGQDSGA----AERIQKQKDRISDVVTIHSFGYGNDHDADLMSKICKVGQGSFYYIENV 346 Query: 349 RKLHDAFLR 357 + L + F Sbjct: 347 KLLDEFFAD 355 >gi|148226222|ref|NP_001089834.1| hypothetical protein LOC734900 [Xenopus laevis] gi|80477144|gb|AAI08519.1| MGC130922 protein [Xenopus laevis] Length = 840 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 72/201 (35%), Gaps = 28/201 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +D++ ++D S S+ + ML + PD R GL+ + S Sbjct: 48 NKPMDLVFIIDSSRSVRPA------DFEKVKEFLITMLKFLDIGPDN---TRVGLLQYGS 98 Query: 226 KIVQTFPLAWGVQH--IQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + I+ + R+ + T + ++YA N F E + + Sbjct: 99 TVKNEFSLKTYKRKPDIERAVKRMMHLATGTMTGLAIQYAMNIAFSEAEGARPL---NQY 155 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---P 339 + + +TDG P +A+ G +++AIGV LK S Sbjct: 156 VPRIAMIVTDGRPQDPVA------EIAAKARNSGILIFAIGVGRVDMST-LKTIGSQPHS 208 Query: 340 DRFYSVQNSRK---LHDAFLR 357 + + V N + L F Sbjct: 209 EHVFLVANFSQIETLTSVFQN 229 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 44/203 (21%), Positives = 80/203 (39%), Gaps = 27/203 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ V+D S S+ + D + + ++ +LD ++ R GL+ +S Sbjct: 566 GEGPVDLVFVIDGSKSLGE------DNFEIVKQFVKGILDSLEISQ---KAARVGLIQYS 616 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + F +A V+ +I + GS T GL E A+ Sbjct: 617 THVRTEFTMAQYSSAKDVKKAVSQIKYMGRGSMT----GLALKLMHEKSFSEAQGARARP 672 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG E Y +AK+ G +YAIG+ +A D+ L+ AS Sbjct: 673 MRVP-RVAIVFTDGRAQD------EVSEYAEKAKQSGITIYAIGIG-KAIDEELQEIASA 724 Query: 340 DRFYSVQNSRKLHDAFLRIGKEM 362 + V + A I +++ Sbjct: 725 PQEKHVIYAED-FSAMGYIMEKL 746 >gi|73973312|ref|XP_867438.1| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 3 [Canis familiaris] Length = 2989 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISISA---MKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRG------NVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 78/235 (33%), Gaps = 39/235 (16%) Query: 145 CANSSHAPLLITSSVKISSKS-----------DIGLDMMMVLDVSLSMNDH-FGPGMDKL 192 + K K D++ ++D S S+ + F +D + Sbjct: 104 VPVIGQLTIQTGGPTKPGEKKPGKPEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFI 163 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF- 249 G + R G+V +SS F L + + I ++ + Sbjct: 164 GALVSA----------FDIGEEKTRVGVVQYSSDTRTEFNLNQYYRTDELLAAIKKIPYK 213 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T + ++Y F A + K I +TDG++ Sbjct: 214 GGNTMTGDAIDYLIKNTFTES------AGARVGFPKVAIIITDGKSQDEVEIPAR----- 262 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 E + G V+++G++A A + + ++P + ++V N + D I ++ Sbjct: 263 -ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 86/245 (35%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I ++++ + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I + + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ D+ PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEDELKMIATDPDDTHAYNVADFESLSK 1366 Query: 354 AFLRI 358 + Sbjct: 1367 IVDDL 1371 >gi|225872657|ref|YP_002754114.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] gi|225793405|gb|ACO33495.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] Length = 410 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 78/221 (35%), Gaps = 34/221 (15%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + S D+ + M +++D S SM +KL ++ +++ S PD + Sbjct: 171 QVIASFSHEDLPVSMGILVDNSGSM-------QNKLNAVDKAALDLVRA--SNPDDEAFI 221 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V FS + + +++ + T + + +++ Sbjct: 222 ----VNFSDQAYLDQGFTSSIAKLEQGLAHTEARGGTALYDAIVASADELSKDARH---- 273 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 K+ ++ +TDGE+ + ++ ++++ G +YAIG+ + + Sbjct: 274 ------PKQVLLVVTDGEDDASTMNLQQAIQRVQAL--HGPEIYAIGLLYDDSGDEAHRA 325 Query: 337 ---------ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + Y ++ + + + K++ Q + Sbjct: 326 RKALEQLTEQTGGLAYFPRSLENVDEVAAEVAKDIRNQYTI 366 >gi|127512721|ref|YP_001093918.1| vault protein inter-alpha-trypsin subunit [Shewanella loihica PV-4] gi|126638016|gb|ABO23659.1| Vault protein inter-alpha-trypsin domain protein [Shewanella loihica PV-4] Length = 776 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 50/289 (17%), Positives = 102/289 (35%), Gaps = 29/289 (10%) Query: 88 WQTDFRNELRENGFAQDINNIERSTS--LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWC 145 W+ + FAQ E T + + +N VS P Sbjct: 320 WRPLLAEQPSAVMFAQLGKTHEFKTHEFKNEESLASSQAHNDQVVSEANHPAEAQASDKE 379 Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 A S+A +++ + ++ + ++ +V+D S SM D + A +I L Sbjct: 380 AKDSYALVMLMPP-QDKARVRLPRELTLVIDTSGSMTG------DSIAQAKSAILNALAG 432 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLE 260 + S N ++ F S + P+A + + L T+ P L Sbjct: 433 LGSQDTFN------VIAFDSSVRSLSPVALSATAANLGKANLFVQSLEADGGTEMAPALL 486 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A ++ + K K ++F+TDG + N + +L N ++R ++ Sbjct: 487 RALSQPESGVSSISSAVKPERL--KQVVFITDG--AVGNEASLFALIAANIGRQR---LF 539 Query: 321 AIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVKQRI 367 +G+ A F++ A R + V ++ + +++ +I Sbjct: 540 TVGIGAAPNGYFMERAARAGRGTYTYVGKISEVDAKIGELLEKIESPQI 588 >gi|326334019|ref|ZP_08200248.1| BatA protein [Nocardioidaceae bacterium Broad-1] gi|325948168|gb|EGD40279.1| BatA protein [Nocardioidaceae bacterium Broad-1] Length = 336 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 83/258 (32%), Gaps = 46/258 (17%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 LL ++ + ++ + + +++ +D S SM P ++L A + Sbjct: 56 LAAGLVVGALALLAVATTQPQVRAQVPYERATILVAIDTSASMTAEDMPP-NRLEAAKAA 114 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTP 257 +D + + +V GLV+FSS + I+ L T Sbjct: 115 AISFIDQLPARYNV------GLVSFSSSARVVTSPTTDHALVVRSIDGLGPPDGGTAIGE 168 Query: 258 GLEYAYNKIFDAKEKLE----------------HIAKGHDDYKKYIIFLTDGENSSPNID 301 + + + + E+ + +++ L+DG NS+ Sbjct: 169 AVYSSIDDLQQILEEAAASGPSQEPEESEGSEKSEESEEERSPAHLVLLSDGGNSAG--- 225 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAAD-------------QFLKNCA--SPDRFYSVQ 346 + + A+ G I E + L+N A + RFY Sbjct: 226 -RSPVAAAEAAREAGLPTSTIAYGTEGSSATLPGGQSVEVREDTLRNLADTTGGRFYRAS 284 Query: 347 NSRKLHDAFLRIGKEMVK 364 ++ +L + + IG + Sbjct: 285 SADELREVYDDIGTLVGH 302 >gi|118575253|ref|YP_874996.1| hypothetical protein CENSYa_0043 [Cenarchaeum symbiosum A] gi|118193774|gb|ABK76692.1| conserved hypothetical protein [Cenarchaeum symbiosum A] Length = 311 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 45/250 (18%), Positives = 85/250 (34%), Gaps = 50/250 (20%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKL 192 +PF+ + + + + S +++ G +++VLD S SM +D+ +D Sbjct: 55 LPFVL----LVSAIGTGTVGLADPLFPSGQAEGGAGIVLVLDGSGSMAADDYAPTRLDAA 110 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-S 251 A + L R G++ F S + L +I ++ G Sbjct: 111 KAAAAQLVGRL---------APGDRVGVILFGSSAITISYLTSDRAEAAGRIGEIVQGDG 161 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GL EK II L+DG ++S E+L Sbjct: 162 ATALGDGLALGVEMAAAGPEKST------------IILLSDGVHNSGRTVPGEALEL--- 206 Query: 312 AKRRGAIVYAIGVQ-----------------AEAADQFLKNCA--SPDRFYSVQNSRKLH 352 A + V+ IG+ AE + L+ A + +Y+ ++ L Sbjct: 207 AIQGNIRVHTIGMGSDEPVRVGDDIFGEPRYAELDEDTLREIADRTGGMYYTSVDNPTLD 266 Query: 353 DAFLRIGKEM 362 F + +++ Sbjct: 267 GIFEALSEDI 276 >gi|301613500|ref|XP_002936246.1| PREDICTED: collagen alpha-1(XII) chain-like [Xenopus (Silurana) tropicalis] Length = 3127 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 106/265 (40%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++TS+++ +Y ++ Y M + + P T +VKI + G+ Sbjct: 385 QTTSINVKDLSPDTEYQINL---YAMKGLTASEPISTLEK------TQAVKIKVECSGGV 435 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ DV L ++ + G+ + ++ P V+ LV +S Sbjct: 436 DIKA--DVVLLVDGSYSIGVANFAKVRAFLEVLVKSFDISPSK---VQISLVQYSRDPFT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L ++ I + +N + G +T + + Y K+F A+ + + Sbjct: 491 EFTLNRYDSIEDILKAVNTFPYRGGSTNTGKAMTYVREKVF----VEIKGARPNVP--RV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP Y Sbjct: 545 MILITDGKSSDAFKDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPSDTHVY 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 45/268 (16%), Positives = 100/268 (37%), Gaps = 21/268 (7%) Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 G ++++ +T + +Y ++A+S E+ F + + + Sbjct: 65 GPEKELDLPASATQTILTELVPDVEYVVTAISYDELEESVPVFGQLTIQTGGRVTLEEPK 124 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +SKS + V DV ++ + G + + ++ D R G Sbjct: 125 LETSKS-PRCSISSVADVVFLVDGSWSVGRNNFKYILDFMVSLVSAFDIGEDK---TRVG 180 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +V +SS F L + + + I R+ + G T + L++ I ++ K Sbjct: 181 VVQYSSDTRTEFNLNSYYKKEELVSAIKRIPYKGGNTMTGDALDFL---IKNSFVKTAGS 237 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 KG + + I +TDG++ E + G V+++G++A A + + Sbjct: 238 RKG---FPRIAIIITDGKSQDEVEIPAR------ELRSLGVEVFSLGIKAADAKELKQIA 288 Query: 337 ASPD--RFYSVQNSRKLHDAFLRIGKEM 362 + P ++V N + + I ++ Sbjct: 289 SLPSLKHVFNVANFDSIVEVQDEIITQV 316 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 88/248 (35%), Gaps = 25/248 (10%) Query: 122 HKDYNLSAVSRYEM-PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 Y ++ +E + S AP + S + K+ D+++++D S S Sbjct: 1150 GTSYKVNVFGVFEKGESVPLIGQEMTTLSDAPEVRIDSSGLECKTKAAADIVLLVDGSWS 1209 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQ 238 + I ++++ D V+ L +S + L + Sbjct: 1210 IGRP------NFRTVRSFIARLVEVFDIGSDR---VQIALAQYSGDPRSEWQLNAHSTKK 1260 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + + + L + + G+ A N I K E + KK + +TDG++ Sbjct: 1261 SLMDAVANLPYKGG-NTLTGM--ALNFILQNNFKAEAGMRAKS--KKIGVLITDGKSQDD 1315 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHDAFL 356 + + + + +G ++AIG++ ++ + + PD ++V + L + Sbjct: 1316 IVAPSK------KLRDQGIDLFAIGIKNADENELRQIGSDPDETYVFNVADFSLLVNIVD 1369 Query: 357 RIGKEMVK 364 + + Sbjct: 1370 DLTTNLCN 1377 >gi|163760702|ref|ZP_02167782.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] gi|162282024|gb|EDQ32315.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] Length = 668 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 86/238 (36%), Gaps = 21/238 (8%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 S + + PW AN+ + + + + ++++++DVS SM + Sbjct: 265 SVETPFRATVTVTPTPWNANTRLLHIGVKG-YDVKPAARPQANLVLLVDVSGSMQE---- 319 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 DKL + + R ++ ++ V+ V +G + V A I + ++ L Sbjct: 320 -TDKLPLLKSAFRLLIQKLEPEDTVSIVTYAG----DAGTVLEPTPASDKAKILDALDDL 374 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G +T G+E AY +L A+ + + + TDG+ + D+ Sbjct: 375 RPGGSTAGAAGIEEAY--------RLAEKARVNGGVNRVL-LATDGDFNVGASDDDALKS 425 Query: 308 YCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 E + G + G DQ ++ A + L +A + +E Sbjct: 426 LIEEKRESGVFLSIFGFGQGNYNDQLMQTLAQNGNGVAAY-IDTLAEAEKTLAQEATA 482 >gi|156382663|ref|XP_001632672.1| predicted protein [Nematostella vectensis] gi|156219731|gb|EDO40609.1| predicted protein [Nematostella vectensis] Length = 1235 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 69/361 (19%), Positives = 126/361 (34%), Gaps = 64/361 (17%) Query: 43 FVKAKLHYILDHSLLYTATKIL--NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENG 100 +K L + + +L K+L + E K + N S+ K + R L E Sbjct: 169 GMKRLLRN-IKNGVLKECCKLLPSSLEYHARFKSKANSKSFCEAKAKSAGNVRV-LGEGI 226 Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 ++ N++++ L + P C ++ P TS+ Sbjct: 227 YSAMSRNLKQNQRLKWQFFGSKEGL---CTIYPAAPLKECH---AYDNRLRP-WYTSAAY 279 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGM---DKLGVATRSIREMLDIIKSIPDVNNVVR 217 S+K +++VLD S SM G +L VA ++ +L + +P + Sbjct: 280 PSTKK-----LVIVLDTSSSMASRVELGTKRRTRLDVAKAALSTILSTL--LPQ----DK 328 Query: 218 SGLVTFSSKIVQT----------FPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 G+V F+SK+ LA + V ++++ INR G T+ + A+ Sbjct: 329 VGVVLFNSKVTLAGSSGVDECYSTRLAPAGRFNVNYLKDFINRSRPGGGTQYQNAFKAAF 388 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES-LFYCNEAKR-----RGA 317 + AK + + +++FLTDG P D E K Sbjct: 389 TLLKSAK------SGDGGGEQSFLLFLTDG---GPKDDALEVERLIAQNKKEMEESRERV 439 Query: 318 IVYAIGVQAEAA-DQFLKNCAS--PDRFYSVQNSRKLHDA-------FLRIGKEMVKQRI 367 + IG+ + FL + ++ V N ++ A + + K I Sbjct: 440 TIMTIGLGKDEHMKDFLGRLSKNVGSKYSQVDNEAHMYSAIHDYYSHLQAMATKATKNYI 499 Query: 368 L 368 L Sbjct: 500 L 500 >gi|326923981|ref|XP_003208211.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like [Meleagris gallopavo] Length = 776 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 37/169 (21%), Positives = 68/169 (40%), Gaps = 26/169 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D++ +LD S S+ + ++ D + P VR G+V FSS Sbjct: 67 ASVDILFLLDGSYSVGR------GSFERSKHFAGKLCDALDIHPGR---VRVGMVQFSSA 117 Query: 227 IVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L Q ++E+I R F G +T++ L+Y +K F + Sbjct: 118 PHLEFSLDSYLTKQEVKERIKRTAFRGGSTETGRALKYILSKGFPGGR--------NLTV 169 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K +I ++DG++ + + K RG V+A G++ ++ Sbjct: 170 PKILIIISDGKSQG------STAMPAMQVKERGTTVFAAGIKFPRWEEL 212 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 55/158 (34%), Gaps = 15/158 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L + +D+ M+ G ++ ++ L + N G+ + + + Sbjct: 354 LSLDCAVDLLFLMDSSAGVTLEGFLRYKAFLKRFLQAVMGWDSPAN---VGVAQYDTNVR 410 Query: 229 QTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + + I+ L F G T + L Y F + + DD + Sbjct: 411 IPIEVGQHKDVFGLMKSIDALNFSGGGTLTGRALRYIAQHGFRSTPVFADVL---DDLPR 467 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ LTD ++ P + RG ++ ++G Sbjct: 468 VVVLLTDSKSQDPVA------EAAKYVRDRGLLLISVG 499 Score = 36.7 bits (83), Expect = 5.9, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 53/143 (37%), Gaps = 17/143 (11%) Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 +V + L + S+ F L + + + IN++ F S A + + Sbjct: 587 DVTQIALAAYGSRAYTVFALDTHTNNSALLQAINQMPFLGDVAS------ASSALLHINT 640 Query: 272 KLEHIAKGHDD-YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + KG K ++ LT+G +++ + + G +V+ + + D Sbjct: 641 DVMTVQKGARPGVSKVVVVLTNG------GGMEDAAAPAQQLRDNGVVVFVVVIGDTERD 694 Query: 331 QFLKNCASPDRFYSVQNSRKLHD 353 L+ C + +Q+ + L Sbjct: 695 TLLRVCMYS--YEDLQHYQDLIT 715 >gi|332977607|gb|EGK14375.1| von Willebrand factor type A domain protein [Psychrobacter sp. 1501(2011)] Length = 556 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 46/281 (16%), Positives = 101/281 (35%), Gaps = 34/281 (12%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 N + + R L + K+ + + + PW A + + I + Sbjct: 128 NNGSLPPTDAVRVEELINYFNYDFKNAKKQGNAPFLVTTEMVKSPWHATNRIVKVGIKAE 187 Query: 159 -VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 V + ++ +++ ++DVS SMN DKL +A S++ + +++ + Sbjct: 188 DVLAAKQNQPAANLVFLVDVSGSMNSD-----DKLQLAKASLKMLTKQLRAQDTIT---- 238 Query: 218 SGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 L+T++ P G Q I I+ L +T ++ AY + + +K Sbjct: 239 --LITYAGNTEVVLPATSGNQTQKILNAIDNLSANGSTNGEAAIKLAYQQAEENFKKQGI 296 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLK 334 I+ LTDG+ + + K+ L + +G + +G D ++ Sbjct: 297 NR---------ILMLTDGDFNVGVSNVKDMLDIIRNNRDKGISLSTLGFGQGNYNDHMME 347 Query: 335 NCA--SPDRFYSVQNSR--------KLHDAFLRIGKEMVKQ 365 A + + + ++ F + K++ Q Sbjct: 348 QVADNGNGNYSYIDSLSEAKKVLIDEMSSTFNTVAKDVKIQ 388 >gi|156742365|ref|YP_001432494.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233693|gb|ABU58476.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 412 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 68/198 (34%), Gaps = 28/198 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 L+ +VLD S SM G + L ATR + + L + +V F Sbjct: 37 AEKAPLNFCLVLDRSGSMQ---GAKLAALKEATRRVIDTLTPQDIVS---------IVLF 84 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + P + ++ +++ + T + G+ ++ +H G Sbjct: 85 DDTVQTLVPATFATDRDALKAQVDAIEEAGGTAMSGGMAAGIVEL------RKHHDPGRV 138 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASP 339 ++ LTDG+ D E R V A+G+ AE ++ L + A+ Sbjct: 139 SA---MLLLTDGQTWG---DEDRCRALAQELARDHVRVTALGLGAEWNEKLLDDIADATG 192 Query: 340 DRFYSVQNSRKLHDAFLR 357 + + ++ F Sbjct: 193 GLSDYIADPSQITTFFQH 210 >gi|326529585|dbj|BAK04739.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 742 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 45/231 (19%), Positives = 77/231 (33%), Gaps = 41/231 (17%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD 170 S ++ I + + + S + A S H P + +D Sbjct: 232 SRTVEITTHPEFTE-IPESTSERSFTVLIHLKAPLAQSLHVP----GDDNGPNTGRAPVD 286 Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ VLDVS SM KL + R++ ++ + S R ++ FSS + Sbjct: 287 LITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRL 334 Query: 231 FPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL + +++ +N L T GL I E AK Sbjct: 335 FPLRRMTESGRKQSLLAVNSLTSNGGTNIAEGLRKGSKVI------EERQAKNPVCS--- 385 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 II L+DG+++ + + Y + A A + CA Sbjct: 386 IILLSDGQDTYTVSPSTGAHKP-----------YTVSPTAGAQKASAEYCA 425 >gi|309792307|ref|ZP_07686777.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225622|gb|EFO79380.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 423 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 77/212 (36%), Gaps = 28/212 (13%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + L +++ ++ S + L++ +V+D S SM +L + ++D++ Sbjct: 26 YVLLEVSAHGVPTNISKLPLNLCLVIDRSSSMRGE------RLQQVKDAANRIVDMLT-- 77 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + LVTF+ + P ++ I+ + T+ G+ A + Sbjct: 78 ----DDDYLSLVTFNDRAEVVIPAQRPQHRPDLKRMISGIEAAGGTEMATGMALALQE-- 131 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + I+ LTDG D + A+ RG + A+G+ E Sbjct: 132 ------TQRPLMGRGVSR-ILLLTDGRTYG---DEGRCVEIARRAQGRGVGLTALGIGGE 181 Query: 328 AADQFLKNCASPDR--FYSVQNSRKLHDAFLR 357 + L+ ++ + + ++ ++ F Sbjct: 182 WNEDLLETMSARENSRTQYITSAAEITQIFTD 213 >gi|317127857|ref|YP_004094139.1| von Willebrand factor A [Bacillus cellulosilyticus DSM 2522] gi|315472805|gb|ADU29408.1| von Willebrand factor type A [Bacillus cellulosilyticus DSM 2522] Length = 282 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 74/220 (33%), Gaps = 39/220 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ +++ ++LD S SM+ G + A + L + Sbjct: 33 AKVKHTERSPINLSLLLDRSGSMS---GEPLRYCKEACNFVINQLTDKDILS-------- 81 Query: 219 GLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +V F ++ ++E I R+ T + GL + Sbjct: 82 -VVVFDDQVETIIEPQKVTHKDLLKEYIQRIETRGITNLSGGLIQGCQHVLK-------- 132 Query: 277 AKGHDDYKKYI---IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + K Y+ I L+DG+ ++ D + + ++ + G ++ +GV ++ L Sbjct: 133 ----QEVKNYVNRVILLSDGQANAGITDKEALVKLADDYQSAGLVISTLGVSEHFDEELL 188 Query: 334 KNCASP--DRFYSVQNSRKLHDAFLR--------IGKEMV 363 + A F+ + + F + IG+ + Sbjct: 189 EGVADSGRGNFHFINEVENIPSIFEQELDGLLNVIGQNIT 228 >gi|238063244|ref|ZP_04607953.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885055|gb|EEP73883.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 265 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 47/196 (23%), Positives = 76/196 (38%), Gaps = 12/196 (6%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDV 177 + + L + P P + LL+ + S++ + + +M+ +DV Sbjct: 35 RFTNLRLLDRVAPDRPAWRRHVPAGLFLAMLALLVVGFARPSAEVRVPRERATVMVAVDV 94 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 S SM D+L A + R +D +PD N GLV F+ P Sbjct: 95 STSMLASDVKP-DRLSAAKDAARRFVD---GLPDEFN---VGLVAFAGSAAVLVPPGTDR 147 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + + I RL GST + A N A L+ A + II L+DG N+S Sbjct: 148 EALHDGIERLAEGSTGVQGTAIGEAINTSLGAVRGLDSQAAKDLPPAR-IILLSDGANTS 206 Query: 298 PNIDNKESLFYCNEAK 313 +D E+ +AK Sbjct: 207 G-MDPMEAAAEAVDAK 221 >gi|319638170|ref|ZP_07992933.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] gi|317400443|gb|EFV81101.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] Length = 530 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 57/345 (16%), Positives = 122/345 (35%), Gaps = 46/345 (13%) Query: 46 AKLHYILDHSLLYTATK---ILNQENGNNGKKQKNDFSYRIIKNIWQT---DFRNELREN 99 LH D +L TA + EN + Q + + + T D N Sbjct: 32 ENLHGAPDSALPATAVAEENLSLTENTERYQDQPDQPVKSVAQEPVSTFSIDVDTGSYAN 91 Query: 100 ---GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR-YEMPFIFCTFPWCANSSHAPLLI 155 + + + II+ +Y L R + + PW + + I Sbjct: 92 VRRFLNSGKQPPKDAVRIEEIINYFPYNYPLPTDGRPFAVHTETIDSPWQPEAKLIKIGI 151 Query: 156 TSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + ++K D+ +++ ++DVS SM++ +KL + +++R + ++ V Sbjct: 152 --QAQDTAKKDLPPANLVFLVDVSGSMDEE-----NKLPLVQKTLRILTQQLRPQDKVT- 203 Query: 215 VVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 L+T+SS P G + I + I++L +T L AY E+ Sbjct: 204 -----LITYSSGEELVLPPTSGSDKETILKAIDKLKAEGSTSGESALRMAY-------EE 251 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQ 331 + + + I+ TDG+ + D + E ++ G + +G + + Sbjct: 252 AQKAFVPNGINR--ILLATDGDFNVGVSDTETLKSMVAEKRKTGISLSTLGFGTDNYNED 309 Query: 332 FLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 ++ A + + N ++L + +++ Q Sbjct: 310 MMEQIADAGDGNYSYIDNEKEAKKVLQQQLTSTLATVAQDVKIQV 354 >gi|147906540|ref|NP_001087052.1| von Willebrand factor A domain-containing protein 2 precursor [Xenopus laevis] gi|82235697|sp|Q6DCQ6|VWA2_XENLA RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|50415038|gb|AAH77945.1| MGC80919 protein [Xenopus laevis] Length = 790 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 76/206 (36%), Gaps = 31/206 (15%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSD-----IGLDMMMVLDVSLSMNDHFGPGM 189 + F+ C + ++ KIS+ LD++++LD S S+ Sbjct: 10 LAFLSSQVLQCLCVQELHVNAETANKISAAGRRMRCSSPLDILILLDGSNSIGR------ 63 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRL 247 + ++ D + ++++R G + +S F L ++ I+EKI + Sbjct: 64 GSFERSKHFASKLCDALDI---GSDLIRVGAMQYSGAPQVEFRLDSSFSKAAIKEKIKSI 120 Query: 248 IF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 +F G T++ L+Y K F K +I ++DG++ Sbjct: 121 VFKGGPTETGLALKYIVWKGFPGGR--------PASVPKILIIVSDGKSQG------NIK 166 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 + K V+ +GV+ ++ Sbjct: 167 LPAAQIKGEDIEVFTVGVKFPRWEEL 192 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 71/186 (38%), Gaps = 22/186 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD+ VLD S ++ G +K + + +V + GLVT+SS+ Sbjct: 530 PLDLAFVLDASTAV------GQEKFNRLKNFVTMVSLQFDI---NRDVTQIGLVTYSSRP 580 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + + I R + + ST A ++++ ++ A+ K Sbjct: 581 ETVFGLDTHDSGSSLLQGIGRASYMGGSASTGS---ALLRVYNDVMTVQKGARPG--VNK 635 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ +TDG +++ + + G +VY IG+ + L+ S SV Sbjct: 636 AVVVITDGR------GAEDAAVPAQKLRDNGIMVYVIGIGNIQRNSLLRLAGSEKFLISV 689 Query: 346 QNSRKL 351 + L Sbjct: 690 PSYESL 695 >gi|33152377|ref|NP_873730.1| tight adherence protein G [Haemophilus ducreyi 35000HP] gi|21326716|gb|AAL92476.1| TadG [Haemophilus ducreyi] gi|33148600|gb|AAP96119.1| tight adherence protein G [Haemophilus ducreyi 35000HP] Length = 562 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 47/266 (17%), Positives = 88/266 (33%), Gaps = 32/266 (12%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK---- 62 + F N G I A+L I ++ + +E + KA+L+ L+ ++L + Sbjct: 12 KRFIQNQSGVYIIFGALLTLPIVALLFVSLEVAGIIQDKARLNDALEQAVLSLTAENNSG 71 Query: 63 ------ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 L NGK + + + I +T + L + +T + Sbjct: 72 RKSYDYALTNAEKANGKYLADSEAGKRDSQIVKTFVKLYLPQIDENTMKFEPICTTQNNA 131 Query: 117 IIDDQHKDYN-----LSAVSRYEMPFIFCTFPWCANSSHAP----LLITSSVKISSKSDI 167 I K Y ++ + S P L S + + ++ Sbjct: 132 ITPKNGKQYAYSSSHVTCTVTGSINHRSLFPMTVGKSKIIPEQVSLSSGSMAQKINNVNL 191 Query: 168 GLDMMMVLDVSLSMNDHFGPGM-------DKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 LD+M+V D+S SM+ + KL + + + E+ D + N R + Sbjct: 192 PLDLMVVADLSGSMDYNINNHKVYSNTEASKLTLLKQVLEELTDKYLLSEEANPNNRISM 251 Query: 221 VTFSSKIVQTF------PLAWGVQHI 240 + F+ P W HI Sbjct: 252 IPFAMGAQHPIRNSCVLPFEWNQSHI 277 >gi|212722920|ref|NP_001131192.1| hypothetical protein LOC100192500 [Zea mays] gi|194690832|gb|ACF79500.1| unknown [Zea mays] Length = 650 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 76/213 (35%), Gaps = 18/213 (8%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 IF P +L+ ++ I +D++ VLDVS SMN+ ++ T Sbjct: 47 IFPEIPQGQARKDFQVLVRVEAPARPEARIPIDVVAVLDVSGSMNNPAAAPTER--TRTT 104 Query: 198 SIREMLDIIKSI--PDVNNVVRSGLVTFSSKI--VQTFPL----AWGVQHIQEKINRLIF 249 S ++L + + R +V FS + + L A G ++ +++L Sbjct: 105 SRLDLLKTAAKFMVAKLEDGDRLSIVAFSDRPVRELSSGLLYMTADGRRNAIRSLDQLEA 164 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKESLFY 308 T P E A + + +I+ LTDG E++S + E Sbjct: 165 RGGTALVPAFEEAVKVLDGRQGDGGDRLG-------FIVLLTDGAEDASGSFTLSERRRE 217 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 R V+A G+ + L A R Sbjct: 218 VIRGALRKYPVHAFGLGTAHGPEVLLYLAQESR 250 >gi|327286711|ref|XP_003228073.1| PREDICTED: cartilage matrix protein-like [Anolis carolinensis] Length = 565 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 46/227 (20%), Positives = 89/227 (39%), Gaps = 30/227 (13%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 CA L + D++ ++D S S+ + + I +++D Sbjct: 318 CACREGFTLNSDGKTCNACSGGAPTDLVFLIDGSKSVRPE------NFELVKKFINQIVD 371 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEY 261 + + D N V GLV +SS + Q FPL + I+ + R+ T + L+Y Sbjct: 372 SLD-VSDRNAHV--GLVQYSSSVRQEFPLGQYKDKKDIKAAVRRMSYMEKGTMTGQALQY 428 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + D+ + A+ K I TDG + D+ +AK G ++A Sbjct: 429 ----LVDSSFAISSGARPGVP--KVAIVFTDGRSQDYINDS------AKKAKELGYKMFA 476 Query: 322 IGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 +GV D+ + + P + ++ + R ++ IGK++ + Sbjct: 477 VGVGNAVEDELKEIASEPVAEHYFYTADFRTINQ----IGKKLQNKI 519 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 73/197 (37%), Gaps = 28/197 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + + ++++ + P N R G+V ++S + Sbjct: 112 DLVFLIDSSRSVRPN------EFEQVKVFLSQVIESLDVGP---NATRVGVVNYASAVKS 162 Query: 230 TFPLAWGVQHIQ--EKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + R+ + T + +++A N+ F E + K Sbjct: 163 EFSLKTHRTKASLLQAVRRIEPLSTGTMTGLAIQFAINRAFSEGEGARVRV---PEINKV 219 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 I +TDG D A+ G ++AIGV L+ ASP + Sbjct: 220 AIIVTDGRPQDAVKD------VAARARALGIEIFAIGVGRVDMHT-LRLIASPPLEEHVD 272 Query: 344 SVQN---SRKLHDAFLR 357 V++ KL F Sbjct: 273 YVESYSVIEKLTKKFQE 289 >gi|326792960|ref|YP_004310781.1| von Willebrand factor A [Clostridium lentocellum DSM 5427] gi|326543724|gb|ADZ85583.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 903 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 43/189 (22%), Positives = 74/189 (39%), Gaps = 36/189 (19%) Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPLAWGVQHIQEKIN 245 + + + LD ++ DVN V VT++ V L G++ + Sbjct: 300 YVSRADKVKDASYTFLDTLQEKEDVNISV----VTYAGTASKVTNSNLKSGIESAYNVLG 355 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 T + G+E A + ++ K I+ L+DGE+++ N S Sbjct: 356 ----TDGTNTGRGIEIASQILSNS-----------TAPNKMIVVLSDGESNAGN-----S 395 Query: 306 LFYCNEAKRRGAIVYAIGVQAEA---ADQFLKNCASPD------RFYSVQNS-RKLHDAF 355 N AK +G IVY IG + + L +CAS D +FY ++ L++ F Sbjct: 396 RTAANSAKNKGCIVYTIGAGIASGSNGAKELFDCASVDQSTNKAKFYLADDTGNALNEIF 455 Query: 356 LRIGKEMVK 364 I E+ + Sbjct: 456 AEIAGEIQE 464 >gi|115376470|ref|ZP_01463705.1| phage/colicin/tellurite resistance cluster TerY protein, putative [Stigmatella aurantiaca DW4/3-1] gi|310819725|ref|YP_003952083.1| hypothetical protein STAUR_2452 [Stigmatella aurantiaca DW4/3-1] gi|115366538|gb|EAU65538.1| phage/colicin/tellurite resistance cluster TerY protein, putative [Stigmatella aurantiaca DW4/3-1] gi|309392797|gb|ADO70256.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 218 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 79/207 (38%), Gaps = 18/207 (8%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++++ L ++++ DVS SM K+ ++REM++ + D+ V ++ Sbjct: 8 TTQAARPLPVVVLADVSGSMGVD-----GKIQALNLAVREMIEAFQDESDLRAEVHVSVI 62 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 TF + PL G + L T + A + D A Sbjct: 63 TFGGQSRVHLPL--GRARDAAWTD-LGANGGTPMGAAFDLARTMVED------RNAVPSR 113 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN-CASPD 340 Y+ I+ ++DG+ + ESL + A A+ + A+A L+ A P+ Sbjct: 114 AYRPTIVLVSDGQPTDEWKQPLESLL--KNERGGKAFRMALAIGADADHAVLQAFLADPE 171 Query: 341 -RFYSVQNSRKLHDAFLRIGKEMVKQR 366 R Y +R++ F + + + Sbjct: 172 ARVYRADEARQIRKFFQLVTMSVSARS 198 >gi|51476525|emb|CAH18248.1| hypothetical protein [Homo sapiens] Length = 637 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 75/208 (36%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA G ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGGYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|332255311|ref|XP_003276777.1| PREDICTED: collagen alpha-1(XXII) chain-like, partial [Nomascus leucogenys] Length = 695 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 48/206 (23%), Positives = 80/206 (38%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 88 Query: 230 TFPLA-WGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + ++ RL + G T + L Y + F + G YK+ Sbjct: 89 AFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPR---AGGRPGDRAYKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-EALKEELEEIASEPKSAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FNAIDKIRGKLRRRL 217 >gi|329117975|ref|ZP_08246688.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] gi|327465863|gb|EGF12135.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] Length = 562 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 87/238 (36%), Gaps = 36/238 (15%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW + + + I + ++K D+ +++ ++DVS SM D DKL + +++R Sbjct: 171 PWQSEAKLIKIGI--QAQDTAKKDLPPANLVFLVDVSGSMTDP-----DKLPLVKKTLRI 223 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGL 259 + + ++ V L+T++S P G I +N L G T L Sbjct: 224 LTEQLRPQDKVT------LITYASGEQLVLPPTSGKDKDTILRALNALHAGGATSGERAL 277 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 AY + A K + II TDG+ + D + E ++ G + Sbjct: 278 RMAYEQAEKA------YVKNGINR---IILATDGDFNVGVSDTETLKSLVAEKRKSGISL 328 Query: 320 YAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 +G + ++ A + + + +L + +++ Q Sbjct: 329 STLGYGTGNYNEAMMEQIADAGDGNYSYIDSEKEARKVLRHQLTSTLATVAQDVKIQV 386 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 1/70 (1%) Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + +L C+ AK +G +V+ IG +A Q L+ CAS Y + ++ DAF Sbjct: 512 STKDSRTLAVCDAAKEKGIVVFTIGFEAPWRGQQVLQQCASSASHYYDVDGLEISDAFAS 571 Query: 358 IGKEMVKQRI 367 I + + R+ Sbjct: 572 IASAIRQLRL 581 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 72/203 (35%), Gaps = 25/203 (12%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F + G ++ ++L ++F + GL ++ + L Y LD ++L A Sbjct: 24 LRAFRSDESGVLAKPMIMILVLMFALGGLGMDLVRMERDRTNLQYTLDRAVLAAADL--- 80 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + ++ L + + + +D + Sbjct: 81 -----DQPLDPEAVVIDYMS-------KSGLSDYTTVVVPEVSPTAKRVKASVDTEFTAG 128 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 ++++ + + P L + +S +S +++ +VLDVS SM + Sbjct: 129 WMNSIFYED----YMRNPDTYELEPITLPLLASSTA-VESIGNVEISLVLDVSGSMRSN- 182 Query: 186 GPGMDKLGVATRSIREMLDIIKS 208 ++L R+ +E + + Sbjct: 183 ----NRLVNLKRAAKEFVQTMDD 201 >gi|149200035|ref|ZP_01877061.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] gi|149136908|gb|EDM25335.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] Length = 307 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 78/232 (33%), Gaps = 32/232 (13%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVAT 196 + P +L +K + + + + + LD S SM FG G ++ VA Sbjct: 56 LAGMLPSLLLIIVIFILAGPKIKGKPEDEKIISNIQICLDSSGSMRADFG-GKNRYEVA- 113 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG------ 250 + +K + GL F ++ + P+ I L Sbjct: 114 ------MQAVKEFTEYREGDAFGLTVFGTEYINWVPVTKDTSAIALATPFLAPDRMSKWF 167 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L + ++ +D + II ++DG + SPN +++ Sbjct: 168 GGTNIAKALRGSQQQLLQ-----------QEDGDRMIILVSDGVSGSPN----DTVDMAQ 212 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 E + + Y I + + + A + + + V N + L + F I K Sbjct: 213 ELRNNKIVAYCIYIGSGNGSPEMNALAAITGGQVFGVNNPKALDETFRFIDK 264 >gi|254692857|ref|NP_081450.1| collagen, type XXII, alpha 1 [Mus musculus] Length = 1613 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 44/206 (21%), Positives = 77/206 (37%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + P R G+V +S + Sbjct: 47 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGP---GHTRVGVVRYSDRPTT 97 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + ++ R+ + G T + L Y ++ F A G+ +K+ Sbjct: 98 AFELGHFNSREEVKAAARRITYHGGNTNTGDALRYITSRSFSA---QAGGRPGNRAFKQV 154 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + G ++A+GV A A + L AS + Sbjct: 155 AILLTDGRSQDLVLDAAAAAH------AAGIRIFAVGVGA-ALKEELDEIASEPKSAHVF 207 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 208 HVSD-------FNAIDKIRGKLRRRL 226 >gi|313207256|ref|YP_004046433.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446572|gb|ADQ82927.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023480|gb|EFT36486.1| BatB [Riemerella anatipestifer RA-YM] gi|325335297|gb|ADZ11571.1| von Willebrand factor type A [Riemerella anatipestifer RA-GD] Length = 335 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 72/211 (34%), Gaps = 31/211 (14%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 ++ + F F + + + +K+ K + ++M ++D+S SMN Sbjct: 50 KTKWYPKFFLVMYFVAFLFLIFAMMDLIGGKEEIKVQQKMN---NVMFLVDISNSMNAQD 106 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 D+L +A + + + + R GL F+ + PL + ++ Sbjct: 107 VAP-DRLSLAKNIVISSMQKMTN-------DRVGLAVFAGEAFSVMPLTTDYLAAESFVS 158 Query: 246 RL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 L + T ++ A +K + I+ ++DGE++ N Sbjct: 159 GLETSVVSTQGTDFYKAMQVAVSKFKAVSKGSGR-----------IVLISDGEDNEGNEA 207 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 EA+ G V +GV E Sbjct: 208 -----AAIKEAQSNGIQVITVGVGTEEGAPI 233 >gi|194221585|ref|XP_001495200.2| PREDICTED: collagen, type XXIX, alpha 1 [Equus caballus] Length = 2617 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 72/193 (37%), Gaps = 25/193 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ G ++ +L I+ PD G+V FS Sbjct: 624 EDMKADIMFLVDSSGSIGPE------NFGKMKTFMKNLLAKIQIGPDKTQ---IGVVQFS 674 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + F L + + I + I+R+ T + LE+ H+ Sbjct: 675 TDAKEEFQLNKYFTQKEISDAIDRMSLINQNTYTGHALEFVDQYF-------SHLKGARP 727 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDGE D +L + + +++++GV Q + Sbjct: 728 GVKKFLILITDGEARDGVRDPARAL------REKEVVIFSVGVYGANRTQLEEISGDGSL 781 Query: 342 FYSVQNSRKLHDA 354 + V+ L Sbjct: 782 VFQVEKFDDLKAI 794 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 44/192 (22%), Positives = 78/192 (40%), Gaps = 23/192 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+N M L + +++ V+ G + +S + Sbjct: 814 LDIVFVLDHSGSINSQQQESMMNLT---------IHLVEKANVGRGQVQFGALKYSKEPE 864 Query: 229 QTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L G + R T + LE+A +++ EH ++ + K+ Sbjct: 865 DLFYLNTFSKGAAITENLRRRRDTYGETYTAKALEHA-----NSQFTEEHGSRIKQNVKQ 919 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYS 344 +I +TDG + D ++ + + +G I+YA+GV EA L+ A + Sbjct: 920 MLIVITDGV----SHDREQLSDTALKLRNKGIIIYAVGVG-EADQYELEAIAGDKNNTRH 974 Query: 345 VQNSRKLHDAFL 356 V N KL D + Sbjct: 975 VDNFDKLKDIYQ 986 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 52/336 (15%), Positives = 115/336 (34%), Gaps = 39/336 (11%) Query: 45 KAKLHYILDHS--LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFA 102 K L ++ + + E+ + + Q ++ N D E + Sbjct: 303 KLSLQAGKSNAGAAIDMMRREAFSESSGSRRAQGVPQIAVLVTNRPSDDEVREAAQKLLR 362 Query: 103 QDINNIE------RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 +D+ +T L I+ + + S ++ F + + T Sbjct: 363 EDVIVFAMSIQGANNTQLEEIVSYPSRQRVSTLKSYADLERYSRIFQKKLQNEIWDQIST 422 Query: 157 SSVKISSKSDIGLD-----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + + + LD + ++D S S+ + + + ML++ + Sbjct: 423 RAEQRNLDETGCLDTKEADLYFLIDGSTSIQNEQFEQIKRF---------MLEVTEVFSI 473 Query: 212 VNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 V+ G+V +S +I + F + + V + +N T + L + KI + Sbjct: 474 GPGRVQVGVVQYSHEIREEFSIGVYSNDVDLRKAVLNIKQLTGDTHTGAALAFMLPKIRE 533 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 +++ Y+I LTDG+ + D+ L N + G ++A+G+ Sbjct: 534 GRKQR------PSKVPCYLIVLTDGQ----SQDHHRILETANRIRAEGVTIHAVGIGEAD 583 Query: 329 ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + + +R + QN L I E+V Sbjct: 584 KTELQQIAGNEERVHFGQNFDSL----KSIKSEVVH 615 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 48/317 (15%), Positives = 106/317 (33%), Gaps = 39/317 (12%) Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 +L ++++ K RII Q L+ AQ N+ T Sbjct: 136 ILVVLASAKSEDDVEEASKALRRDGVRIISVGVQKASEENLKAMATAQFHFNLR--TVRD 193 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + Q+ + ++Y+ I ++ + D D++ ++ Sbjct: 194 LSTFSQNMTQIIKDAAQYK-------------EGEDDTDIEVPFPVACQKDSLADVVFMV 240 Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-- 233 D SL ++ + L T S+ +N +R GL+++S++ L Sbjct: 241 DESLGTRENLRNLQNFLKSITTSL----------DVKDNCMRIGLMSYSNRAKTISSLKS 290 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + E+I +L + + A + + ++ + + +T Sbjct: 291 STTQSEFLEQIQKLSLQAGKSNAGA---AIDMMRREAFSESSGSRRAQGVPQIAVLVT-- 345 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL-- 351 + P+ D E + R IV+A+ +Q Q + + P R V + Sbjct: 346 --NRPSDD--EVREAAQKLLREDVIVFAMSIQGANNTQLEEIVSYPSR-QRVSTLKSYAD 400 Query: 352 HDAFLRIGKEMVKQRIL 368 + + RI ++ ++ I Sbjct: 401 LERYSRIFQKKLQNEIW 417 >gi|148976298|ref|ZP_01813022.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] gi|145964392|gb|EDK29647.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] Length = 401 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 57/144 (39%), Gaps = 4/144 (2%) Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-YKKYIIFL 290 PL +Q IN L T+S GL + ++ + ++ + ++ ++ Sbjct: 258 PLTNVFSRVQNSINSLTANGGTRSYQGLVWGVRQLIPSWQQAWGMKVSSVPETRRKLVLF 317 Query: 291 TDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAAD-QFLKNCAS-PDRFYSVQN 347 TDG + + + +C A K+ G + IG + +NCA P R +S N Sbjct: 318 TDGADEGDAFNQLVNAGFCTTAIKQYGIEMNFIGYGVSPSRITQFENCAGNPLRVFSATN 377 Query: 348 SRKLHDAFLRIGKEMVKQRILYNK 371 + +L++ F I I + Sbjct: 378 TTQLNEYFSDILAVEYSASIHLTR 401 >gi|296198593|ref|XP_002806760.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XII) chain-like [Callithrix jacchus] Length = 3113 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 101/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTMLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKXFTKVEDIIEAINNFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + + R G+V +SS Sbjct: 140 DLIFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEDKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F A + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES------AGARVGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I +TDG++ E + G V+++G++A A + + ++P ++ Sbjct: 245 AIVITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLTHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 83/238 (34%), Gaps = 31/238 (13%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P + ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S S+ I ++++ P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYSGDPRTEWQLNTHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I + + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFRTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKL 351 + + K G ++AIG++ + PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESL 1364 >gi|254786708|ref|YP_003074137.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687355|gb|ACR14619.1| von Willebrand factor, type A [Teredinibacter turnerae T7901] Length = 767 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 46/203 (22%), Positives = 78/203 (38%), Gaps = 37/203 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +V+DVS SM + + + V ++L ++ +P+ + R+G+ TF + Sbjct: 50 DVRLVIDVSGSMKRNDPNNLRQPAV------DLL--VQLLPEGS---RAGVWTFGKWVNM 98 Query: 230 TFP-----LAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P W Q K + + T LE A + D G ++ Sbjct: 99 LVPHRDVTDPW-RATAQAKASEINSVGLFTNIGEALEKATFEGAD----------GGAEF 147 Query: 284 KKYIIFLTDG---ENSSPNIDNKESLFYCNEA----KRRGAIVYAIGVQAEAADQFLKNC 336 +K II LTDG + SP + +E +E K G V+ I + A A L Sbjct: 148 RKSIILLTDGMVDIDKSPEQNKREWRRIADEVIPRLKEAGVTVHTIALSANADTNLLNKI 207 Query: 337 --ASPDRFYSVQNSRKLHDAFLR 357 A+ ++ L FL+ Sbjct: 208 SLATGGMAEVAHSADDLMRIFLK 230 >gi|171910783|ref|ZP_02926253.1| Protein containing von Willebrand factor (vWF) type A domain [Verrucomicrobium spinosum DSM 4136] Length = 917 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 91/247 (36%), Gaps = 35/247 (14%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + PW A + I +++ + + + ++DVS SM++ DKL Sbjct: 505 FGVQVDMAEAPWKPEHRLARIAIKG--RVNQQERAPANFVFLVDVSGSMDEP-----DKL 557 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFG 250 + +S+R + + + + R +VT++ P G + I E I+ L G Sbjct: 558 PLVKQSLRMLTERLSTKD------RVAIVTYAGSTAVILPSTAGTEKSRIIEAIDGLGAG 611 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+ AY + +H K + +I TDG+ + E Sbjct: 612 GSTNGAGGIRLAYE------QAQQHFQKEGVNR---VILCTDGDFNVGISSPGELQKLIE 662 Query: 311 EAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN--------SRKLHDAFLRIG 359 E + + +G A D+ ++ A + + + +++ + I Sbjct: 663 EKAKSRVFLSVLGFGAGNLKDRTMETLADKGNGNYAYIDSLSEARKVLVEQMNATLVTIA 722 Query: 360 KEMVKQR 366 K++ Q Sbjct: 723 KDVKIQV 729 >gi|219804750|ref|NP_001137338.1| cartilage matrix protein [Bos taurus] gi|296490178|gb|DAA32291.1| matrilin 1, cartilage matrix protein [Bos taurus] Length = 497 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 273 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSS 323 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 324 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 377 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 378 --QKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 429 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 430 EHYFYTADFKTINQ----IGKKLQKRI 452 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 72/196 (36%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R GLV ++S + Q Sbjct: 42 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGLVNYASSVKQ 92 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL + + + R+ + T + +++A K E D K Sbjct: 93 EFPLRAHSSKAELLQAVRRIQPLSTGTMTGLAIQFAITKALSDAEGGRPR---SPDISKV 149 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 +I +TDG D A+ G ++AIGV + + P + Sbjct: 150 VIVVTDGRPQDSVRDVSA------RARAGGIELFAIGVGRVDKATLQQIASEPQDEHVDY 203 Query: 345 VQN---SRKLHDAFLR 357 V++ KL F Sbjct: 204 VESYSVIEKLSKKFQE 219 >gi|134093165|gb|ABO53025.1| matrilin 4 isoform 1 precursor, 5 prime [Chlorocebus aethiops] Length = 214 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 77/197 (39%), Gaps = 26/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ V+D S S+ + + + +L +K N R G++ +S Sbjct: 29 HTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLKV---GANATRVGVIQYS 79 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ FPL A+ + ++ I L+ T + ++Y N F E + Sbjct: 80 SQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYVMNVAFSVAE---GARPPEE 136 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + + +TDG +A+ RG +YA+GVQ L+ ASP Sbjct: 137 RVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 189 Query: 340 -DRFYSVQNSRKLHDAF 355 + + V++ L F Sbjct: 190 DEHVFLVESF-DLIQEF 205 >gi|83312851|ref|YP_423115.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] gi|82947692|dbj|BAE52556.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] Length = 464 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 54/183 (29%), Gaps = 46/183 (25%) Query: 231 FPLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL + IN + T S G+ + + E + G + K +I Sbjct: 280 TPLTNVKATLTPAINAMEAWSRGGTLSDVGMAWGLRVLSPEPPFTEGLPWGTPKWSKAVI 339 Query: 289 FLTDGENSSPNID----------------------------------------NKESLFY 308 +TDG+N + N Sbjct: 340 LMTDGDNQFYKLTSTTGGNKVNSAVNSDYGAYGRLDELGRIGTTNATTAKTTINTRLTSV 399 Query: 309 CNEAKRRGAIVYAIGVQA---EAADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVK 364 CN K + IVY + + +A K CA+ +++ + +L AF I + Sbjct: 400 CNAMKAKNIIVYTVTFTSGINQATKDIYKACATDASKYFDSPSQDELKSAFRAIATSLSN 459 Query: 365 QRI 367 R+ Sbjct: 460 LRV 462 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 78/224 (34%), Gaps = 39/224 (17%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + KG+++I+ AI L I +GL ++ + + VK+++ L Sbjct: 6 LSRLMTCRKGNMAIILAIGLLPIITTIGLGVDVARAYAVKSRMSAA-------LDAAALA 58 Query: 66 QENGNNGKKQKNDFSYRII-KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 + + Q + + + N S ++ + D Sbjct: 59 VGSSSGTDAQLSAVAQKFFDANYPTGALGAH--------------PSVAVKVTGD----- 99 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 +S + + F + P+ S+V GL++ MVLD + SM + Sbjct: 100 ----VISASAVAEVDTVFMKVVGLNDVPVHADSTVNRQI---AGLELAMVLDNTGSMTTN 152 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + + + ++ DI+ V+ ++ LV +S+ + Sbjct: 153 -----NNIQAVRDAANQLTDILFGTATVHPYLKIALVPYSAAVN 191 >gi|206901991|ref|YP_002251775.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] gi|206741094|gb|ACI20152.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] Length = 890 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 44/191 (23%), Positives = 71/191 (37%), Gaps = 27/191 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +++VLD S SM + G M K+ +A S + +LD+++ GL+ F Sbjct: 387 KKSNVAIVIVLDASGSMGSYSGGDM-KMELAKESAQLVLDLLE------EKDYFGLIAFD 439 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL + I+++ G T P L+ A + A Sbjct: 440 HSYQWIVPLQPLTNKEETASLISKISPGGGTALYPPLKSAGEALIKAP-----------I 488 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 K+II +TDG+ + + V IG+ +A LK+ A Sbjct: 489 KSKHIIAITDGQTEGG-----DFYNLVKYLAKYKITVSTIGIGEDANIPLLKDIANWGNG 543 Query: 341 RFYSVQNSRKL 351 RFY N R L Sbjct: 544 RFYHTWNIRNL 554 >gi|238020799|ref|ZP_04601225.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] gi|237867779|gb|EEP68785.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] Length = 554 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 85/220 (38%), Gaps = 26/220 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW A++ + I + +I S + +++ ++DVS SM DKL + +++R + Sbjct: 164 PWQADAKLIKIAIKAK-EIRSSALPPANLVFLVDVSGSMQ-----AQDKLPLVKKTLRIL 217 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGLE 260 +++ V L+T++S P G I + IN+L G +T L+ Sbjct: 218 TKRLRAEDKVT------LITYASNEKLVLPPTSGKDKDTILQAINQLEAGGSTAGEQALQ 271 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + A K + I+ TDG+ + D E ++ G + Sbjct: 272 MAYAQAQKA------YIKNGINR---ILLATDGDFNVGITDFNTLKDTVAEKRKAGISLT 322 Query: 321 AIGVQAEA-ADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 +G +Q ++ A + + N + R Sbjct: 323 TLGFGTGNYNEQLMEQLADAGDGNYSYIDNETEAKKVLQR 362 >gi|84498078|ref|ZP_00996875.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381578|gb|EAP97461.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 659 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 77/224 (34%), Gaps = 31/224 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L S+V S + ++++LD S SM G+ K+ A +++ + + ++PD Sbjct: 20 LPTVSAVADVSDDPVPGKLLLMLDASGSMKAKDPSGLTKIEAAKKALTGV---VGALPDT 76 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 GL + + + + I + T P L A + I E Sbjct: 77 AQ---VGLRVYGATVDGKGKPTPAACADTQLITPI----GTIDKPALTSAISAINALGET 129 Query: 273 L-EHIAKG-----HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA-----IVYA 321 H K+ I+ ++DGE S + C K+ A + Sbjct: 130 PIAHSLTEALKDLGPTGKRNIVLVSDGEESC-------TPDPCPAVKKLTAGGVDLQIDT 182 Query: 322 IGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEM 362 +G A + C A +Y +++ L + ++ + Sbjct: 183 VGFGVNAKARSQLQCIADAGKGTYYDAKDAPALAASLSKLSQRA 226 >gi|228991799|ref|ZP_04151737.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] gi|228767939|gb|EEM16564.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] Length = 453 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 40/289 (13%), Positives = 96/289 (33%), Gaps = 24/289 (8%) Query: 90 TDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSS 149 + ++ EN + D+ N S + + + + +MP + Sbjct: 79 KEVAHDTEENVKSADLKNKSNSEQADLYVHTMYSLLKQEIIPFDKMPLQILGIGRVEDEE 138 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 K + + ++ ++LD S SM M K+ +A +I++ + + Sbjct: 139 KKSNGTKGEQK-NKEDRGNYNIEILLDASGSMAGKIDGKM-KMDIAKEAIQQFVSDLPEA 196 Query: 210 PDVNNVVRSGLVTFSSKIVQ----------TFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +V+ V G + + + + +++ ++ T + Sbjct: 197 VNVSLRVY-GHKGSNDEKDKTASCGAIENVYTLQKYNQTTLRQSLDGFQPVGWTPLAEAI 255 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + + AK K + ++DG + +E+ N + + Sbjct: 256 KRSTETFQSAKAND----------KNIMYVVSDGVETCGGNPVEEAQKVSNSNIKPIMNI 305 Query: 320 YAIGVQAEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 V EA Q + S ++ N+++L D F GK++ +R+ Sbjct: 306 IGFQVDHEAEKQLKEIAEVSKGKYVLANNAKELQDQFKETGKDITSRRL 354 >gi|168698108|ref|ZP_02730385.1| protein containing a von Willebrand factor type A domain [Gemmata obscuriglobus UQM 2246] Length = 821 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 34/197 (17%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + S + K + D+++VLD S SM+D K+ A ++++ L ++ Sbjct: 255 MFLISPQVEAEKKRVARDLVLVLDTSSSMSDI------KMQQAKKAVKFCLSQLQPED-- 306 Query: 213 NNVVRSGLVTFSSKIVQTFP--LAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIF 267 R G+V FS+ + + +A ++ + I+ L T P L A Sbjct: 307 ----RFGVVRFSTTVTKFRSELVAANTDYLDLATKWIDGLKTSGGTAIWPALNDALAMRS 362 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE--AKRRG-AIVYAIGV 324 + ++F TDG+ P +D + AK G ++ GV Sbjct: 363 SD---------PSRPFT--MVFFTDGQ---PTVDETNADKIVKNVLAKNTGNTRIFTFGV 408 Query: 325 QAEAADQFLKNCASPDR 341 + L A R Sbjct: 409 GDDVNAAMLDQLADSTR 425 >gi|325268974|ref|ZP_08135595.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] gi|324988595|gb|EGC20557.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] Length = 330 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 63/178 (35%), Gaps = 28/178 (15%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S ++K G++ ++ LD+S SM +L + + ++ Sbjct: 79 GSKLATNKHREGIETIIALDISNSMLAEDVAP-SRLEKSKLLVENLMSRFSE-------D 130 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + GL+ F+ + P+ + ++ LI T L+ A Sbjct: 131 KIGLIVFAGEAFVQLPITSDYVSAKMFLDNINPSLIGTQGTDIGKALQLA---------- 180 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + K II +TDGE++ + +A+ +G V+ +G+ + Sbjct: 181 -ANSFTPNSKAGKAIILITDGEDNEGGAE-----AMAKQARSKGIKVFILGIGSRQGA 232 >gi|301610722|ref|XP_002934910.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 957 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 84/214 (39%), Gaps = 39/214 (18%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS D++ +LD S S+ + K V S + + G+V Sbjct: 29 SSCRTAPNDLVFILDGSWSVGPENFEILKKWVVNITSNFNI---------GPKFTQVGVV 79 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +S + PL G + ++R G T++ + +A + +F Sbjct: 80 QYSDYPILEIPL--GSYESSDDLSRRMQSIQYLGGNTQTGNAIRFAIDNLF--------- 128 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A+ K + LTDG++ E EA++ ++AIGV +E + L+ Sbjct: 129 ARSLRPLTKIAVVLTDGKSQD------EVKHVAEEARKNKITLFAIGVGSEIEESELRAI 182 Query: 337 A---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S + V++ + A +I +E++KQ++ Sbjct: 183 ANKPSSTYVFYVED----YIAISKI-REIMKQKL 211 >gi|297529200|ref|YP_003670475.1| von Willebrand factor A [Geobacillus sp. C56-T3] gi|297252452|gb|ADI25898.1| von Willebrand factor type A [Geobacillus sp. C56-T3] Length = 1077 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 44/210 (20%), Positives = 69/210 (32%), Gaps = 29/210 (13%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 +D+ T + L + P + P + + Sbjct: 129 GNGKEDVYFSFPQTPYQYTRKTEASTAKLDFSLSFSQP-EYAKPPNGDAQGRLDVTLIPQ 187 Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 +S +D++ V+DVS SM KL A +++ ++ KS + N R Sbjct: 188 GAVSGIIRPPIDVVFVMDVSGSMTAM------KLQSAKSALQAAVNYFKS--NYNQNDRF 239 Query: 219 GLVTFSSKIVQTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 LV FS + + + +G I N L G T + L A + D Sbjct: 240 ALVPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFTDPT 299 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 KKYIIFLTDG + N Sbjct: 300 R------------KKYIIFLTDGMPTVLNT 317 >gi|220675931|emb|CAX12090.1| matrilin 3b [Danio rerio] Length = 434 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 80/224 (35%), Gaps = 28/224 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 P + P + + + LD++ ++D S S+ + + E Sbjct: 178 TPRATTAPAKPPTVPAPAEPC--KSRPLDLVFIIDSSRSVRPA------EFEKVKIFLSE 229 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPG 258 M++ + ++ R LV ++S + F L + +++ +R+ + T + Sbjct: 230 MVNSLDI---GSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMA 286 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++ A ++F + KG K I +TDG + + A+ G Sbjct: 287 IKTAMEQVFTENAGARPLKKG---IGKVAIIVTDGRPQDKVEEVSAA------ARASGIE 337 Query: 319 VYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 +YA+GV + + P D + V+ KL F Sbjct: 338 IYAVGVDRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 381 >gi|59939916|ref|NP_001012385.1| matrilin 3b [Danio rerio] gi|56797875|emb|CAG30518.1| matrilin-3b precursor [Danio rerio] gi|220675929|emb|CAX12088.1| matrilin 3b [Danio rerio] Length = 478 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 80/224 (35%), Gaps = 28/224 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 P + P + + + LD++ ++D S S+ + + E Sbjct: 178 TPRATTAPAKPPTVPAPAEPC--KSRPLDLVFIIDSSRSVRPA------EFEKVKIFLSE 229 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPG 258 M++ + ++ R LV ++S + F L + +++ +R+ + T + Sbjct: 230 MVNSLDI---GSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMA 286 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++ A ++F + KG K I +TDG + + A+ G Sbjct: 287 IKTAMEQVFTENAGARPLKKG---IGKVAIIVTDGRPQDKVEEVSAA------ARASGIE 337 Query: 319 VYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 +YA+GV + + P D + V+ KL F Sbjct: 338 IYAVGVDRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 381 >gi|46580532|ref|YP_011340.1| von Willebrand factor type A domain-containing protein [Desulfovibrio vulgaris str. Hildenborough] gi|46449951|gb|AAS96600.1| von Willebrand factor type A domain protein [Desulfovibrio vulgaris str. Hildenborough] Length = 420 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 59/181 (32%), Gaps = 45/181 (24%) Query: 231 FPLAWGVQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L + I I N L S T + GL++ + + E D +K I Sbjct: 242 QGLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPEAPFTEGS--SAKDIRKVI 299 Query: 288 IFLTDGENSSPNIDNKESLFY---------------------------------CNEAKR 314 I LTDG+ ++ Y +AK Sbjct: 300 IVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKLNAAMLEEARKAKE 359 Query: 315 RGAIVYAIGVQAEAADQ--FLKNCASP-----DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 G V+AI + +K+ AS D +Y ++ + D F +IG+++ + + Sbjct: 360 AGIEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDIDDVFKKIGRQLGWRLL 419 Query: 368 L 368 Sbjct: 420 R 420 >gi|242072180|ref|XP_002446026.1| hypothetical protein SORBIDRAFT_06g000656 [Sorghum bicolor] gi|241937209|gb|EES10354.1| hypothetical protein SORBIDRAFT_06g000656 [Sorghum bicolor] Length = 434 Score = 66.4 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 77/216 (35%), Gaps = 29/216 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD----KLGVATRSIREMLDIIKSIPDVNN 214 + +D++ VLDVS SM +G G +L +A ++ + + + Sbjct: 2 TTTTLHERAPVDVVAVLDVSGSMAWDYGNGTTVENHRLELAKEAMAKAIQSLGPAAAAVA 61 Query: 215 VV-----RSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 R +V FS+ + Q PL G Q ++ ++ L G L+ A Sbjct: 62 AGGARRNRLAVVPFSNVVKQVTPLTEMDMEGQQTVKNAVDALKPGGQADYLMPLKIAAKI 121 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + K + + IIF++DG++ D + + K ++A GV Sbjct: 122 LDERKAEEKDRLA-------IIIFVSDGQDHYFR-DTDDMKETLTQHKLIKYPIHAFGVS 173 Query: 326 AE------AADQFLKNC--ASPDRFYSVQNSRKLHD 353 + L+ A+ + S+ + Sbjct: 174 VSEQDSSGGGAKALRAMADATSGSYTSITQDDDVDT 209 >gi|327270778|ref|XP_003220165.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 925 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 76/199 (38%), Gaps = 46/199 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM+ + +++ ++ + + ++ + G+VTF++ Sbjct: 304 VCLVLDVSGSMSGN-----NRIARLKQAAETFI-----LQNIEDGSWVGIVTFNNAATIQ 353 Query: 231 FPLAWGVQ-HIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L V +++ +N + T G++ + E Sbjct: 354 TGLQQVVSDTVRKTLNGYLPISANGGTNICAGVQKGFQVFSSKYASTEGCE--------- 404 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF---------LKNC 336 I+ LTDGE+S L C E +R G++++ I + AA + LK Sbjct: 405 IVLLTDGEDS--------GLSSCFAEVQRSGSVIHTIALGPSAAKELEMLADMTGGLKFS 456 Query: 337 ASPDRFYSVQNSRKLHDAF 355 A+ +S L DAF Sbjct: 457 ATDS-----VDSSSLEDAF 470 >gi|303240107|ref|ZP_07326628.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302592376|gb|EFL62103.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 329 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 41/183 (22%), Positives = 64/183 (34%), Gaps = 31/183 (16%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P KI+ + GLD+ +++D S SM D+L A +II+SI D Sbjct: 69 PQSFIGLAKINKE---GLDIYVLIDTSKSMLAE-DIKPDRLSRAK-------NIIESIID 117 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIF 267 R G + FSS PL + ++ +I G T L A Sbjct: 118 NLEGDRIGFIPFSSAAYIQMPLTDDYDLARMYLDVIDTDMIAGGGTNVGTALNLA----- 172 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 E+ + + +I L+DGE + N S+ V+ IG+ Sbjct: 173 ------ENSFEETSSADRVVIILSDGEEHNSN-----SVDILKSFNDEHLKVFTIGIGTA 221 Query: 328 AAD 330 Sbjct: 222 KGG 224 >gi|301606205|ref|XP_002932732.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 1058 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 80/209 (38%), Gaps = 29/209 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ A + + P + + G+V +S Sbjct: 60 DLVFIVDGSWSVGYK------DFDTAKNWLLNITSSFDIGP---SYTQVGVVQYSDFPQL 110 Query: 230 TFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL + Q+ ++ L G T + +++A ++F ++L + K Sbjct: 111 EIPLGHNTSY-QQLLSALKSIKYLGGNTNTGRAIKFATEEVFPTSKRL------NVSKNK 163 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA---SPDRF 342 I +TDG++ ++ S A+ +G I++A+GV +E L A S Sbjct: 164 IAIVITDGKSQDNVVNISSS------ARAQGIILFAVGVGSEITKSELVAIANMPSTHYV 217 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 ++ + + +++ ++ + + Sbjct: 218 LYAEDYTTIDRIKETMRQKICEESVCPTR 246 >gi|260823577|ref|XP_002606157.1| hypothetical protein BRAFLDRAFT_92024 [Branchiostoma floridae] gi|229291496|gb|EEN62167.1| hypothetical protein BRAFLDRAFT_92024 [Branchiostoma floridae] Length = 550 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 85/238 (35%), Gaps = 24/238 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLL---ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 +P + NS+ +T+ +++ LD++ LD S S+ D+ Sbjct: 171 PSIPAQAAFWKNYCNSALDKSPNDYVTAVEEVAGCQATSLDLVFALDGSASVGDN----- 225 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRL 247 ++ I ++ PD + R G+V +S + L + + + + + Sbjct: 226 -NFQLSKDFINTIIQSFTIGPD---LTRVGVVQYSFSVNLELELKDHLSAAPLIQAVEEI 281 Query: 248 IFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + T + L+Y F A E + K I LTDGE+ D + Sbjct: 282 EYDEGLFTHTGEALQYITTHSF-AVENGAREVRKGVP--KVSILLTDGESG----DYHDV 334 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEM 362 + A+ G V+ +GV Q L+ A + Y V R L + + + Sbjct: 335 TVWSQRAQEAGITVFPVGVTNNVDQQELETMAGHAGKVYRVGWFRDLEKILTDMEESL 392 >gi|293361347|ref|XP_576462.3| PREDICTED: collagen type VI alpha 5 [Rattus norvegicus] Length = 1730 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 75/190 (39%), Gaps = 21/190 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ M I + +++ + V+ G + +S+ Sbjct: 839 LDIVFVLDHSGSIGPREQESM---------INLTIHLVEKADVGRDRVQFGALMYSNNPE 889 Query: 229 QTFPLAW--GVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R T + L++A N +F EH ++ + ++ Sbjct: 890 ILFYLNTYSSRSAITEHLKRPRDTRGDTYTAKALQHA-NILF----MEEHGSRLKQNVRQ 944 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + E + +G ++A+GV D+ + V Sbjct: 945 LMIVITDGE----SHDRDKLNDTARELRDKGITIFAVGVGRANQDELETMAGKKENTIHV 1000 Query: 346 QNSRKLHDAF 355 N KL D + Sbjct: 1001 DNFDKLRDIY 1010 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 75/200 (37%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ +++ + S+ +M P N VR G+V +S + Sbjct: 468 DIYFLIDGSSSIRRK---EFEQIQIFMSSVVDM------FPIGPNNVRVGVVQYSHRNEV 518 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FP++ + I + N T + L++ I K + H + Y Sbjct: 519 EFPVSQYTKGIELRKAVWNIKQLKGGTFTGKALDFILPIIKKGKSERIH------EVPCY 572 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG+++ L N + ++A+G+ Q + +R Q Sbjct: 573 LIVLTDGKSND------SVLEPANRLRAEHITIHAVGIGEANKTQLQQIAGKDERVSFGQ 626 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N L I E+V + Sbjct: 627 NFDSL----KYIKNEIVHRI 642 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 71/191 (37%), Gaps = 27/191 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S G+ ++ ++ I+ D + G+V FS Sbjct: 649 EDMKADIMFLVDSSGS------IGLTNFEKMKTFMKNLVGKIEIGADRSQ---VGVVQFS 699 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + + I+ + T + L N+ FD I+KG Sbjct: 700 DYNKEEFQLNKYSTREEVYAAIDGMSPINRNTLTGSAL-TFVNEYFD-------ISKGGR 751 Query: 282 D-YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 +K++I LTDGE +L + + ++++GV Q + Sbjct: 752 PQVRKFLILLTDGEAQDEVGGPAMAL------RSKSVTIFSVGVYGANRTQLEEISGEGS 805 Query: 341 RFYSVQNSRKL 351 + V+N L Sbjct: 806 LVFHVENFDHL 816 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 45/301 (14%), Positives = 95/301 (31%), Gaps = 32/301 (10%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRE----NGFAQ 103 L + A+K L ++ S +K + F LR + FA Sbjct: 141 LASAESEDDVEEASKALREDGVKIISVGVQKASEEDLKAMATPQFHFNLRTARDLSLFAP 200 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 D+ I + + + + P P + + T S Sbjct: 201 DMAQIIQ----DVTQYREGTTVAVVTDVAPTTP-EPLITPAALTTPANNVDETVPFLASC 255 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVT 222 + D D++ ++D S+ + R ++ L+ + S DV N +R GL++ Sbjct: 256 QKDSLADLIFLVDESVGTTQNL-----------RDLQNFLENVTSSVDVKDNCMRLGLMS 304 Query: 223 FSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS + V + + Q++I +L + + A + ++ Sbjct: 305 FSDKARTVSSLKSSTSQSDFQQQIQKLSLRTGASNVGA---AIEHMRTEGFSESSGSRKA 361 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + +T + + + +G ++A+ VQ Q + P Sbjct: 362 QGVPQIAVLVT------HRASDDAVREAALDLRLQGVTMFAMSVQGANNTQLEDIVSYPS 415 Query: 341 R 341 R Sbjct: 416 R 416 >gi|293349450|ref|XP_001073278.2| PREDICTED: collagen type VI alpha 5-like [Rattus norvegicus] Length = 2640 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 75/190 (39%), Gaps = 21/190 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ M I + +++ + V+ G + +S+ Sbjct: 839 LDIVFVLDHSGSIGPREQESM---------INLTIHLVEKADVGRDRVQFGALMYSNNPE 889 Query: 229 QTFPLAW--GVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R T + L++A N +F EH ++ + ++ Sbjct: 890 ILFYLNTYSSRSAITEHLKRPRDTRGDTYTAKALQHA-NILF----MEEHGSRLKQNVRQ 944 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE + D + E + +G ++A+GV D+ + V Sbjct: 945 LMIVITDGE----SHDRDKLNDTARELRDKGITIFAVGVGRANQDELETMAGKKENTIHV 1000 Query: 346 QNSRKLHDAF 355 N KL D + Sbjct: 1001 DNFDKLRDIY 1010 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 75/200 (37%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ +++ + S+ +M P N VR G+V +S + Sbjct: 468 DIYFLIDGSSSIRRK---EFEQIQIFMSSVVDM------FPIGPNNVRVGVVQYSHRNEV 518 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FP++ + I + N T + L++ I K + H + Y Sbjct: 519 EFPVSQYTKGIELRKAVWNIKQLKGGTFTGKALDFILPIIKKGKSERIH------EVPCY 572 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG+++ L N + ++A+G+ Q + +R Q Sbjct: 573 LIVLTDGKSND------SVLEPANRLRAEHITIHAVGIGEANKTQLQQIAGKDERVSFGQ 626 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N L I E+V + Sbjct: 627 NFDSL----KYIKNEIVHRI 642 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 71/191 (37%), Gaps = 27/191 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S G+ ++ ++ I+ D + G+V FS Sbjct: 649 EDMKADIMFLVDSSGS------IGLTNFEKMKTFMKNLVGKIEIGADRSQ---VGVVQFS 699 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + + I+ + T + L N+ FD I+KG Sbjct: 700 DYNKEEFQLNKYSTREEVYAAIDGMSPINRNTLTGSAL-TFVNEYFD-------ISKGGR 751 Query: 282 D-YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 +K++I LTDGE +L + + ++++GV Q + Sbjct: 752 PQVRKFLILLTDGEAQDEVGGPAMAL------RSKSVTIFSVGVYGANRTQLEEISGEGS 805 Query: 341 RFYSVQNSRKL 351 + V+N L Sbjct: 806 LVFHVENFDHL 816 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 63/181 (34%), Gaps = 28/181 (15%) Query: 165 SDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + +D+ +LD S ++ +D F + S+ + I + + R L+++ Sbjct: 1990 ENSYMDVAFLLDNSRNIASDDFQ----AVKGLVSSVIDGFHITSNPSTSESGDRVALLSY 2045 Query: 224 ----SSKIVQTFPLAWGVQHI------QEKI--NRLIFGSTTKSTPGLEYAYNKIFDAKE 271 +S+ + +G + I + LE+A +F Sbjct: 2046 SPSENSRRKGSVKTEFGFTTFDSESIMKNYIHTSLQQLNGDAAIGLALEWAMEGLFLG-- 2103 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + K II ++ GEN + + AK +G I++ + + + D+ Sbjct: 2104 ------TPNPRKHKVIIVISAGENHE---EKEFVKTVALRAKCQGYIIFVVSLGSTRRDE 2154 Query: 332 F 332 Sbjct: 2155 M 2155 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 45/301 (14%), Positives = 95/301 (31%), Gaps = 32/301 (10%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRE----NGFAQ 103 L + A+K L ++ S +K + F LR + FA Sbjct: 141 LASAESEDDVEEASKALREDGVKIISVGVQKASEEDLKAMATPQFHFNLRTARDLSLFAP 200 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 D+ I + + + + P P + + T S Sbjct: 201 DMAQIIQ----DVTQYREGTTVAVVTDVAPTTP-EPLITPAALTTPANNVDETVPFLASC 255 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVT 222 + D D++ ++D S+ + R ++ L+ + S DV N +R GL++ Sbjct: 256 QKDSLADLIFLVDESVGTTQNL-----------RDLQNFLENVTSSVDVKDNCMRLGLMS 304 Query: 223 FSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS + V + + Q++I +L + + A + ++ Sbjct: 305 FSDKARTVSSLKSSTSQSDFQQQIQKLSLRTGASNVGA---AIEHMRTEGFSESSGSRKA 361 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + +T + + + +G ++A+ VQ Q + P Sbjct: 362 QGVPQIAVLVT------HRASDDAVREAALDLRLQGVTMFAMSVQGANNTQLEDIVSYPS 415 Query: 341 R 341 R Sbjct: 416 R 416 >gi|327272012|ref|XP_003220780.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Anolis carolinensis] Length = 955 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 75/197 (38%), Gaps = 27/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ N ++ FS++I Sbjct: 308 VVFVLDSSASMVG------TKLRQTKDALFTILQDLRPEDHFN------IIGFSNRIKVW 355 Query: 228 --VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q P+ ++ + I+ + T L+ + + D + + A+ Sbjct: 356 QHDQLVPVTPNNIRDAKVYIHNMSPSGGTNINGALQISTKILNDYIAQNDIEARSVS--- 412 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASP 339 IIFLTDG + I+ + + EA R ++ IG+ + + L+NC Sbjct: 413 -LIIFLTDGRPTFGEIEPAKIINNTKEAIRNKFCLFTIGIGNDVDYKLLERLALENCGMM 471 Query: 340 DRFYSVQNSRKLHDAFL 356 R +++ + F Sbjct: 472 RRVREEEDAAEQLKGFY 488 >gi|262199272|ref|YP_003270481.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082619|gb|ACY18588.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 430 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 88/243 (36%), Gaps = 34/243 (13%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + ++ +Y++ P A + + + + + + LD+ +V+D S S Sbjct: 10 RAGSVAVTVTPQYDL------LPSNARELNLMVRLEGTGDAPA-TRAPLDLALVIDRSGS 62 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI----VQTFPLAWG 236 M+ DKL + E+L+ ++ + LV++SS + ++T G Sbjct: 63 MSG------DKLSDVKTAALELLETLQPEDTIT------LVSYSSDVSMHLMRTRADDAG 110 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + + L T PGL A + A ++ +++ +DG + Sbjct: 111 QREARRALLALQARGGTALGPGLFRALEALEGASDRTRMS---------HLMLFSDGIAN 161 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDA 354 + + A G V +GV + + + A R++ +Q+S + Sbjct: 162 AGEVRPSVLGARAAGAFGAGVSVSTMGVGVDYNEDLMTRLADQGGGRYHFIQDSEAIASI 221 Query: 355 FLR 357 Sbjct: 222 LDD 224 >gi|1705570|sp|P51942|MATN1_MOUSE RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1163179|gb|AAB06521.1| cartilage matrix protein precursor [Mus musculus] Length = 500 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 44/209 (21%), Positives = 83/209 (39%), Gaps = 34/209 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S D++ ++D S S+ + + I +++D + + GLV +S Sbjct: 274 SGSATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSDRLAQ---VGLVQYS 324 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S I Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 325 SSIRQEFPL--GRFHSKKDIKARVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARP 378 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +K I TDG + D +AK G ++A+GV ++ + + P Sbjct: 379 GA--QKVGIVFTDGRSQDYINDAAR------KAKDLGFKMFAVGVGNAVEEELREIASEP 430 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 D ++ + + ++ IGK++ KQ Sbjct: 431 VADHYFYTADFKTINQ----IGKKLQKQI 455 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 73/196 (37%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R GLV ++S + Sbjct: 45 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGLVNYASTVKP 95 Query: 230 TFPL-AWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL A G + + + + R+ + T + L++A K E D K Sbjct: 96 EFPLRAHGSKASLLQAVRRIQPLSTGTMTGLALQFAITKALSDAE---GGRARSPDISKV 152 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-----DR 341 +I +TDG D E A+ G ++AIG+ + + P D Sbjct: 153 VIVVTDGRPQDSVRDVSE------RARASGIELFAIGLGRVDKATLRQIASEPQDEHVDY 206 Query: 342 FYSVQNSRKLHDAFLR 357 S KL F Sbjct: 207 VESYNVIEKLAKKFQE 222 >gi|156742135|ref|YP_001432264.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233463|gb|ABU58246.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 425 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 64/196 (32%), Gaps = 28/196 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + L++ +VLD S SM +L + + + + LV F+ Sbjct: 42 KLPLNLCLVLDRSSSMRGE------RLMQVKDA------AARIVDQLGQDDYFSLVVFND 89 Query: 226 KIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P ++ ++ I ++ T+ G+ A ++ Sbjct: 90 RADVVIPAQRAIKKADLKAAIAQIEAAGGTEMAQGMALALQEVQRPFLTRGISR------ 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-- 341 II LTDG D + + RG + A+G+ E + L+ + + Sbjct: 144 ---IILLTDGRTYG---DESRCVEIARRGQSRGIGLTALGIGTEWNEDLLETMTASENSR 197 Query: 342 FYSVQNSRKLHDAFLR 357 + ++ + F Sbjct: 198 AQYIATAQDVVKVFAD 213 >gi|109087573|ref|XP_001097885.1| PREDICTED: collagen alpha-1(XXII) chain-like [Macaca mulatta] Length = 232 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 46/206 (22%), Positives = 77/206 (37%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 88 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + ++ RL + G T + L Y F + G YK+ Sbjct: 89 AFELGLFGSREEVKAAARRLAYHGGNTNTGDALRYITALSFSPR---AGGRPGDRAYKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV +A + L+ AS + Sbjct: 146 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-KALKEELEEIASEPKSAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FNAIDKIRGKLRRRL 217 >gi|59713412|ref|YP_206187.1| TadG-like protein [Vibrio fischeri ES114] gi|59481660|gb|AAW87299.1| TadG-like protein [Vibrio fischeri ES114] Length = 423 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 65/444 (14%), Positives = 134/444 (30%), Gaps = 104/444 (23%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +RN + +G +IL A+++P +F + L + + KA++ + + L ++ Sbjct: 1 MRNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALA-----IS 55 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII------- 118 N + + + I + + DI +R L Sbjct: 56 AHNDPDQPDNGSYTPSTRNRQIVVDYVNAYISDIDAVTDIKVAKRRCELISGCVAGLYKG 115 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLD 176 D ++ ++ + +R FP ++ K ++ +D+M D Sbjct: 116 DARYLEHEIDVTTRQN-----SWFPGNEAIEGMGETFSTRGKSLARKYQSEAVDVMFAAD 170 Query: 177 VSLSMND----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 S SM D P L R+I L +P+ + G+ FS+ Sbjct: 171 FSGSMLDTWSGSSNPKYIDLIEIIRNISVELQKFNDLPENRDKSTMGISAFSTFTNSFTS 230 Query: 233 LA--------------------------------WG-----------------------V 237 W Sbjct: 231 DTGIQCSLSQGVNSKNKPGNWFRPVKPANTVANIWNEKTEDYCKSGAYAGFHDVNLTSNF 290 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + ++ G T S Y + + L+ + ++ +I L+DG ++ Sbjct: 291 NSLNGQVGSFYAGGGTAS-------YQALIRGAQLLDR----GRNSRRLLIVLSDGMDND 339 Query: 298 PN-IDNKESLFYCNEAKRR------------GAIVYAIGVQAEA-ADQFLKNCASPDRFY 343 N + S C E + A + IG + A++ LK+C Y Sbjct: 340 RNLANGLVSNGMCREIQAGLESDRTPDGRPIAAKMAVIGFDYDPFANKALKDCVGEKNVY 399 Query: 344 SVQNSRKLHDAFLR-IGKEMVKQR 366 +++ ++ D L I +E+ + Sbjct: 400 KAEDADEVEDIILELINEEVGHLK 423 >gi|2689175|emb|CAA06010.1| hypothetical protein [Borrelia burgdorferi] Length = 328 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 46/239 (19%), Positives = 88/239 (36%), Gaps = 31/239 (12%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + S K G D+++VLD+S SM Sbjct: 49 KDYRLNLMYFFTYSFLYLAAMVMVFALAGP---SVSKKKMIHLSAGADIVIVLDISPSMG 105 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + ++I+ GLV F+ P+ + + Sbjct: 106 AVEFSSKNRLEFSK-------ELIRGFISQRENDNIGLVAFAKDASIVVPITTDREFFNK 158 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 159 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 207 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP----DRFYSVQNSRKLHDA 354 ID + + N A+ +Y+IG+ + S F V + L + Sbjct: 208 IDKDQVI---NLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVEI 263 >gi|152999639|ref|YP_001365320.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151364257|gb|ABS07257.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 642 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 75/215 (34%), Gaps = 30/215 (13%) Query: 164 KSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 KS +G ++ +LDVS SM DKL + +++ + + + V+ VV +G Sbjct: 232 KSQLGASNLVFLLDVSGSMA-----STDKLPLLQTALKLLTAQLSAQDKVSIVVYAGAAG 286 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V Q + + +L G +T G+ AY +H + Sbjct: 287 ----VVLDGASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQL------AKKHFIPNGIN 336 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SP 339 +I TDG+ + D + + K G + +G DQ ++ A Sbjct: 337 R---VILATDGDFNVGVTDFDDLTALIEKEKAHGIGLTTLGFGLGNYNDQLMEQLADKGN 393 Query: 340 DRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L I K++ Q Sbjct: 394 GNYAYIDTLNEARKVLVDELSSTLFTIAKDVKVQV 428 >gi|320537259|ref|ZP_08037219.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145887|gb|EFW37543.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 332 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 46/288 (15%), Positives = 92/288 (31%), Gaps = 53/288 (18%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-- 164 I + S S+ + + + ++ + F+ + + + + + + VK+ ++ Sbjct: 26 AIIQPLSFSLTLHNWNDTIIKTSSQMRGLSFLSKSLVFLSLIALI-VAASEPVKLKTEYV 84 Query: 165 -SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++ +M V+D+S SM ++ A I + + + GL Sbjct: 85 YTETANSIMFVIDISPSMAAKDINEKTRIQAAKDIITDFVQTYPADA-------FGLTAL 137 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +S P + ++N L G T GL A H AK Sbjct: 138 ASTAALVIPPTIQHEQFFARLNSLQIGELGEGTALGMGLAVA----------AAHFAKN- 186 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------- 332 + II LTDGE+++ I K + Y IG+ + Sbjct: 187 TVKTQSIILLTDGESNTGEIHPN---LAAELIKSKKIGFYIIGIGKDGYANLEYVDPSTG 243 Query: 333 ---------------LKNCASPDR--FYSVQNSRKLHDAFLRIGKEMV 363 L+ A + S ++ L + F I + + Sbjct: 244 EKREGTLQTIFNERELRELAHRGNGIYVSAKSFASLQEIFKNISQNIS 291 >gi|16124454|ref|NP_419018.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|221233138|ref|YP_002515574.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] gi|13421322|gb|AAK22186.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|220962310|gb|ACL93666.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] Length = 626 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 57/177 (32%), Gaps = 40/177 (22%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA------KEKLEHIAKGHDDYK 284 PL+ ++ +IN L G T GL + + + A D Sbjct: 448 TPLSTDKTALKAQINGLTVGGATAGQIGLAWGWYMVAPNFGYLWPNASQRPAAYKARDLM 507 Query: 285 KYIIFLTDG----------------------ENSSPNIDNK------ESLFYCNEAKR-- 314 K +I +TDG ++ N D ++ C+ K Sbjct: 508 KVVILMTDGGFNMTYCNSVVARNIGSGTNIGDDERINCDATNGSSFDQAAELCDSIKASA 567 Query: 315 RGAIVYAIGVQAEAADQ---FLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +Y +G FL NCAS D+ Y +L +F I +E+ RI Sbjct: 568 NDITLYTVGFTVGNDQTARNFLTNCASSTDKAYFPATGSELKASFQAIAQEISNLRI 624 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 42/264 (15%), Positives = 91/264 (34%), Gaps = 40/264 (15%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + +G+I+I A+L + I++ +++ + ++ LD + L A Sbjct: 20 RRLRRDDRGAIAIQFALLALPLSILLFGLLDVGRLSLQRRQMQDALDAATLMAARSTATS 79 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + I + + F+ NN T+ Sbjct: 80 SADLDTTGDAAFLAE-----IAGMNLGLTASSSTFSAGTNNRVIGTA------------- 121 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 +A R + ++ S + + +S V +SK+ L++ +VLD++ SM + G Sbjct: 122 -TATLRPIIANLW-------QSGNFTVTASSEVVRASKN---LEIALVLDITGSMGN--G 168 Query: 187 PGMDKLGVATRSIREMLDIIKSIP--------DVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 + L VA + ++L P + V G T++ + P+ Sbjct: 169 TRIADLKVAAADLVDVLVRDTQTPFYSKMALVPYSAGVNVGA-TYADAVRGPVPVKTITG 227 Query: 239 HIQEKINRLIFGSTTKSTPGLEYA 262 + T++ P + A Sbjct: 228 AAWASGSARSITGITRANPAVVTA 251 >gi|301780200|ref|XP_002925521.1| PREDICTED: collagen alpha-1(XII) chain-like [Ailuropoda melanoleuca] Length = 3172 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 101/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTILSVRDLSADTEYQISVSA---MKGLTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLIKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRG------NVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 79/236 (33%), Gaps = 41/236 (17%) Query: 145 CANSSHAPLLITSSVKISSKS-----------DIGLDMMMVLDVSLSMNDH-FGPGMDKL 192 + K K D++ ++D S S+ + F +D + Sbjct: 104 VPVIGQLTIQTGGPTKPGEKKPGKPEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFI 163 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF- 249 G + R G+V +SS F L + + I ++ + Sbjct: 164 GALVSA----------FDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 250 GSTTKSTPGLEYAY-NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + ++Y N ++ + K I +TDG++ Sbjct: 214 GGNTMTGDAIDYLIKNTFTESAGARAG-------FPKVAIIITDGKSQDEVEIPAR---- 262 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 E + G V+++G++A A + + ++P + ++V N + D I ++ Sbjct: 263 --ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQV 316 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 86/245 (35%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P I ++++ P+L + + + D+++++D Sbjct: 1150 GTTYKVNVFGMFDGGESSPLIGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1206 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I +++I + P V+ L +S + L Sbjct: 1207 SWSIGRA------NFRTVRSFISRIVEIFEIGPKR---VQIALAQYSGDPRTEWQLNAHR 1257 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1258 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1312 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ D+ PD Y+V + L Sbjct: 1313 QDDVEAPSK------KLKDEGVELFAIGIKNADEDELKMIATDPDDTHAYNVADFESLSK 1366 Query: 354 AFLRI 358 + Sbjct: 1367 IVDDL 1371 >gi|281348096|gb|EFB23680.1| hypothetical protein PANDA_015036 [Ailuropoda melanoleuca] Length = 3047 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 101/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T LS+ +Y +S + M + + P P+ + Sbjct: 369 QTTILSVRDLSADTEYQISVSA---MKGLTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 423 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 424 DIVFLVDGSYS------IGIANFVKVRAFLEVLIKSFEISPNR---VQISLVQYSRDPHT 474 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 475 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRG------NVPKV 528 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 529 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 581 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 582 TVED----FDAFQRISFELTQSICL 602 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 79/236 (33%), Gaps = 41/236 (17%) Query: 145 CANSSHAPLLITSSVKISSKS-----------DIGLDMMMVLDVSLSMNDH-FGPGMDKL 192 + K K D++ ++D S S+ + F +D + Sbjct: 88 VPVIGQLTIQTGGPTKPGEKKPGKPEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFI 147 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF- 249 G + R G+V +SS F L + + I ++ + Sbjct: 148 GALVSA----------FDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 197 Query: 250 GSTTKSTPGLEYAY-NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + ++Y N ++ + K I +TDG++ Sbjct: 198 GGNTMTGDAIDYLIKNTFTESAGARAG-------FPKVAIIITDGKSQDEVEIPAR---- 246 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEM 362 E + G V+++G++A A + + ++P + ++V N + D I ++ Sbjct: 247 --ELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQV 300 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 86/245 (35%), Gaps = 31/245 (12%) Query: 122 HKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 Y ++ ++ P I ++++ P+L + + + D+++++D Sbjct: 1134 GTTYKVNVFGMFDGGESSPLIGQEMTTLSDTTVMPILSSG---MECLTRAEADIVLLVDG 1190 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--W 235 S S+ I +++I + P V+ L +S + L Sbjct: 1191 SWSIGRA------NFRTVRSFISRIVEIFEIGPKR---VQIALAQYSGDPRTEWQLNAHR 1241 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + + L + + G+ A N I K + + +K + +TDG++ Sbjct: 1242 DKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--RARKIGVLITDGKS 1296 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + + K G ++AIG++ D+ PD Y+V + L Sbjct: 1297 QDDVEAPSK------KLKDEGVELFAIGIKNADEDELKMIATDPDDTHAYNVADFESLSK 1350 Query: 354 AFLRI 358 + Sbjct: 1351 IVDDL 1355 >gi|159038794|ref|YP_001538047.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157917629|gb|ABV99056.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 316 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 29/209 (13%), Positives = 58/209 (27%), Gaps = 32/209 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+ +DVSLSM + ++ +P N GLV+F+ Sbjct: 89 VMLAIDVSLSMQADDVSP----NRLAAAQEAAQQFVEELPASYN---LGLVSFAKAANVL 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P Q + ++ L+ +T + + I I+ L Sbjct: 142 VPPTKDRQAVVTAVDGLVLAESTATGEAVFTCLEAIRSVPADGAAGIPPAR-----IVLL 196 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----ADQFLK----------- 334 +DG +S + + I + Q + Sbjct: 197 SDGYRTSGRSVEEAAAAAQAANVAVS----TIAFGTDGGQVDIGGQLQRVPVDRFALAEL 252 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + FY + +L + +G + Sbjct: 253 AATTEGHFYEAASVNELKQVYQDMGSSIG 281 >gi|329963582|ref|ZP_08301061.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528571|gb|EGF55542.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 342 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 65/196 (33%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F A P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 LVFVAIGLFAVLLARPQFGS---KLETVKRQGVEVMIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R + +++D + + G++ F+ P+ + + + +K Sbjct: 117 RLVAQLVDKM-------QNDKVGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + II +TDGEN ++ A +G Sbjct: 170 TAIGAAIN-------LATRSFTPQEGVGRAIIVITDGENHEG-----GAVEAAKAATEKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVNVLGVGMPDGAPI 233 >gi|78776856|ref|YP_393171.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497396|gb|ABB43936.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 595 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 90/248 (36%), Gaps = 28/248 (11%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 + D+ + ++ + + F + P++ + I +KS D+M+ LD Sbjct: 37 EVMDRLRVHSNTLTPKARNALFFLIGFLIIIALAQPVIKEAKTLIKAKSA---DIMIALD 93 Query: 177 VSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 +S SM + P +L A R L K R G+V F+ PL++ Sbjct: 94 ISDSMLAEDVYPK--RLESAKRKALAFLKEAKDE-------RVGVVAFAKDSYLVSPLSF 144 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + +L S T+ + A EK++ K KK ++ L+DG Sbjct: 145 DKHSVSFLLEQLDTTSITEQGSDF----LSVIGAVEKIQKDEK-----KKVLLILSDG-- 193 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAF 355 D + AK+ G ++ +G+ + + + ++ KL++ Sbjct: 194 ----GDKSDFSEEIALAKKSGITIFILGIATKQGAPIKREDGTFIKYNDEIIISKLNENI 249 Query: 356 LRIGKEMV 363 + + Sbjct: 250 ASLATKTG 257 >gi|297191182|ref|ZP_06908580.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197718543|gb|EDY62451.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 424 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 44/231 (19%), Positives = 83/231 (35%), Gaps = 42/231 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + FP ++ P+ S + +VLDVS SM G ++ A Sbjct: 18 LVAGLFPTGIAAADEPVAKESP-----------KVELVLDVSGSMRARDIDGKSRMAAAK 66 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTF-----------SSKIVQTFPLAWGVQHIQEKIN 245 ++ E+LD + +V +R+ + + ++ PL + + Sbjct: 67 QAFNEVLDAV--PEEVRLGIRTLGADYPGDDRKRGCKDTRQLYPVGPL--DRTEAKAAVA 122 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 L T P L A + D + I+ +TDGE++ +D E Sbjct: 123 TLAPTGWTPIGPALLGAAEDL------------EGGDATRRIVLITDGEDTCAPLDPCEV 170 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHD 353 AK ++ +G+ +A + C A+ + SVQ++ +L D Sbjct: 171 AREI-AAKGIHLVIDTLGLVPDAKTRTQLRCIAEATGGTYTSVQHTDELSD 220 >gi|148656885|ref|YP_001277090.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568995|gb|ABQ91140.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 412 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 67/196 (34%), Gaps = 28/196 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + L+ +VLD S SM KL +++ +++ + V +V F Sbjct: 39 KMPLNFCLVLDRSGSMQGA------KLAALKDAVKRVIETLTPQDIV------AIVLFDD 86 Query: 226 KIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + + +++ + T + G+ ++ + A Sbjct: 87 TVQTLVPATFATDKATLIAQVDAIEEAGGTAMSGGMAAGIVELRKNHDPGRVGA------ 140 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDR 341 ++ LTDG+ D E R G + A+G+ AE + L + A+ Sbjct: 141 ---MLLLTDGQTWG---DEDRCRALAQELARDGVRITALGLGAEWNEALLDDIAEATGGI 194 Query: 342 FYSVQNSRKLHDAFLR 357 + + ++ F Sbjct: 195 SDYIADPAQITTFFQH 210 >gi|90019771|ref|YP_525598.1| inter-alpha-trypsin inhibitor domain-containing protein [Saccharophagus degradans 2-40] gi|89949371|gb|ABD79386.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 763 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 47/300 (15%), Positives = 105/300 (35%), Gaps = 31/300 (10%) Query: 50 YILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIE 109 + D L+ + + G+ +Q + I F E+ + Q N Sbjct: 261 QVPDAHLISPPMVLAQGQYGDGQYEQTGKDNRATISIQLDAGFNVANIESLYHQITINKP 320 Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD--- 166 S++ ++ + + + V ++ + A + + + Sbjct: 321 PSSAYNVELTNGSTLMDRDFVLQWRATASSAPQAAVFKETLAGEDYLLLMLLPPQGQQQH 380 Query: 167 ---IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML------DIIKSIPDVNNVVR 217 + D++ V+D S SM + A RS++ L D I + R Sbjct: 381 TQSLSRDIVFVVDTSGSMQG------TSIQQAKRSLQFALRGLNPSDTFNIIEFDTSFSR 434 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F S+ V A VQ +N L + T+ LE A++++ + Sbjct: 435 -----FRSRPVSAT--ASNVQAAVSWVNNLNADNGTEMYAALEEAFDQLASINPNGTENS 487 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K ++ ++ ++F+TDG + N+++L + A ++ + + + F++ A Sbjct: 488 KSSNNLQQ-VVFITDG-----AVGNEQALLSLIHRRLNNARLFTVAIGSAPNSYFMRKAA 541 >gi|304347707|gb|ADM25314.1| MIC2-like protein 1 [Neospora caninum] gi|325118031|emb|CBZ53582.1| hypothetical protein NCLIV_033690 [Neospora caninum Liverpool] Length = 756 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 79/208 (37%), Gaps = 31/208 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + K +LD I +P + V +VTF+ Sbjct: 67 MDLLLLVDDSASIGSRNFEQVRKF---------VLDFIDLVPISSEEVHLSVVTFADSPQ 117 Query: 229 ----QTFPLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P A Q +E L + G +T + GL A + + + Sbjct: 118 DVFTFKQPQATNKQLAKEAFKYLRYRRGGSTATDKGLIRARRYLTRP------VYGTRAN 171 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 K ++ +TDGE+ +++ ++A+ G V+ +GV + C R+ Sbjct: 172 VPKVLVLMTDGESDRH----YDTIQAADQARAEGISVFVVGVGMANPVECRGVC-GCGRY 226 Query: 343 -----YSVQNSRKLHDAFLRIGKEMVKQ 365 + + N +L I E+ K+ Sbjct: 227 GPCPQFIMSNWNELVQTVDSIMGEVCKK 254 >gi|47216147|emb|CAG10021.1| unnamed protein product [Tetraodon nigroviridis] Length = 1453 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 81/215 (37%), Gaps = 30/215 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S+ D++ ++D S S+ + + I +++D + + + GLV +S Sbjct: 768 SNAATDVVFLIDGSKSVRPE------NFELVKKWINQIIDKLDVSDNKAH---VGLVQYS 818 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + Q FPL + ++E + ++ T + L Y + D A+ Sbjct: 819 SAVKQEFPLGRYNNKKDLKEAVKKMAYMERGTMTGQALRY----LTDNSFGPGQGARPGV 874 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 K I TDG + D +AK +G +YA+GV D+ + + P Sbjct: 875 T--KVGIVFTDGRSQDYIGD------AAKKAKDQGFKMYAVGVGNAVEDELKEIASEPTA 926 Query: 340 DRFYSV----QNSRKLHDAFLRIGKEMVKQRILYN 370 + ++ + ++ D I E + Y Sbjct: 927 EHYFYTADFKTDGQRREDELKEIASEPTAEHYFYT 961 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 61/176 (34%), Gaps = 20/176 (11%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRL 247 + + ++++ + P N R G+V ++S++ L + + + ++ Sbjct: 558 SEFEQVKVFLAKVIEGLDVGP---NATRVGVVNYASRVKNEVSLKTHRTKAGLIKAVTKI 614 Query: 248 IF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + T + +++A N F E D K I +TDG D Sbjct: 615 EPLSTGTMTGLAIQFAMNVAFSEAEGARLR---SPDISKVAIVVTDGRPQDNVKD----- 666 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 A+ G ++AIGV + + P D V++ KL F Sbjct: 667 -VAQRARDAGIEIFAIGVGRVEMSTLRQMASDPLDDHVDYVESYSVIEKLTKKFQE 721 >gi|325673437|ref|ZP_08153128.1| type II secretion system protein [Rhodococcus equi ATCC 33707] gi|325555458|gb|EGD25129.1| type II secretion system protein [Rhodococcus equi ATCC 33707] Length = 623 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 76/213 (35%), Gaps = 29/213 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +++ + D+++ +DVS M+ GP +D + A D ++ P + Sbjct: 74 EIEVRQQPGSEQDIVLAIDVSGGMS---GPALDDVKRAAS------DFVRQAPTGAH--- 121 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G+V SS L + + +I+ L G + + A + Sbjct: 122 IGIVAISSTPQVLSELTTDSEDLLRRIDGLKAGGNSAIADSVVTAAEML----------- 170 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNC 336 + + ++ LTDG ++S E + ++ +YA+ + L+ Sbjct: 171 ERGEAANNILLLLTDGADTSSAHSMSELPSVLSRSRAS---LYAVQMSTPETNSALLQQV 227 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S ++ S ++ L + + + + Sbjct: 228 ARESRGQYASAGDTAALGAIYQSAARALGNLYV 260 >gi|144898053|emb|CAM74917.1| conserved hypothetical protein, secreted [Magnetospirillum gryphiswaldense MSR-1] Length = 460 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 55/183 (30%), Gaps = 46/183 (25%) Query: 231 FPLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL + I L T G+ + + E +A + K +I Sbjct: 276 TPLTGVKATVDSAIQALRAWSRGGTMGDIGMAWGLRVLSPEPPFTEGLAWNTPKWAKAVI 335 Query: 289 FLTDGEN----------------------------------------SSPNIDNKESLFY 308 +TDG+N ++ ++ N Sbjct: 336 LMTDGDNQFYKLTSTTGPNKVNSAVNSDYSGYGRLDQYGALGTTSTTTAKSVINTRLTQV 395 Query: 309 CNEAKRRGAIVYAIGVQA---EAADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVK 364 C K +G VY I + +A K CAS +++ + L +F I E+ + Sbjct: 396 CQAMKDKGITVYTITFTSGINQATKDIYKACASSTAKWFDSPSQADLRASFRAIATELSQ 455 Query: 365 QRI 367 R+ Sbjct: 456 LRV 458 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 106/296 (35%), Gaps = 54/296 (18%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R ++ +G+++I+ A+ L + + +GL ++T+ + VK+KL LD + L + Sbjct: 3 LRRLMHDRRGTVAIIFALALIPLSLSVGLAVDTARAYAVKSKLSQALDAAALAVGSS--- 59 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 G + Q + + F + S S+S+ D + Sbjct: 60 --TGTAAELQ---------------QIGQKFFDANFKDSGLDAAGSFSVSVTGDVVSANG 102 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + + + ++ S ++ S GL++ +VLD + SM Sbjct: 103 SAQVQTT-----------LMQLVGIDTIAVSESAQVIR-SIKGLELALVLDNTGSMTTS- 149 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS---------------SKIVQT 230 D +G + +E++DI+ + +R +V +S + Sbjct: 150 ----DNIGALRDAAQELVDILFGGRADHPTLRVAVVPYSASVNPGPIAPTLISGNDAYAP 205 Query: 231 FPLAWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L G + + E++ R + S + P L Y + D + D + Sbjct: 206 TNL-LGWKGCVIERVGRAMEDSPASTAPWLRYQWLPAIDNYYDATKASTVRADPSQ 260 >gi|73541336|ref|YP_295856.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72118749|gb|AAZ61012.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 354 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 91/256 (35%), Gaps = 66/256 (25%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T+++ + S + + +++ +DVS SM ++G A ++ R++ I +P Sbjct: 76 TATLTLPSDT---VTLVLAMDVSRSMAASDVAP-TRIGAAQQAARDL---IVGLPPG--- 125 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE---- 271 VR G+V+F++ + P + + I+R + T + GL + +F Sbjct: 126 VRLGIVSFAATAIVVLPPTDNRLRMLDAIDRFELQNGTATGSGLIQSLAVLFPDDGIDLE 185 Query: 272 ---------------------------KLEHIAKGHDDYKKY----IIFLTDGENSSPNI 300 + + + Y +I L+DG ++ Sbjct: 186 GILFGGESLAPGTGGRSLTEAAAADAVRKRDLEQPGAAPGSYRHGAVILLSDGRRTTG-- 243 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQA--EAADQ-------------FLKNCA--SPDRFY 343 + L A +RG VY +G A +A + L+ A + ++ Sbjct: 244 --PDPLDAARMAAQRGLRVYTVGFGAVQDAGTEGSSLSYEMQVDEPTLRQIATLTDGEYF 301 Query: 344 SVQNSRKLHDAFLRIG 359 ++ L + ++ Sbjct: 302 QAGSAADLTRVYRQLS 317 >gi|149923979|ref|ZP_01912364.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149815157|gb|EDM74708.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 785 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 78/201 (38%), Gaps = 20/201 (9%) Query: 143 PWCANSSHAPLLITSSVKISSKSD--IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 PW + + + ++ ++ +++ +++ +LDVS SM KL + Sbjct: 327 PWAPDHRLVQIGVQATRELPAQAQELRTRNLVFLLDVSGSM-----SSRGKLPLIKHGFT 381 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++++ + + V+ VV +G +V + I ++RL G T + G+ Sbjct: 382 QLVEQLGAEDHVSIVVYAGAAG----VVLPPTSGDQKETILGALDRLEAGGGTNGSAGIV 437 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + A + + +I TDG+ + D+ + + + G + Sbjct: 438 EAYE-LAQANFVDGGVNR--------VILGTDGDFNVGLSDHDALVELIEQKRESGVFLS 488 Query: 321 AIGVQAEAADQFLKNCASPDR 341 +GV D+ ++ A Sbjct: 489 VLGVGGHYDDELMEQLADHGN 509 >gi|326779287|ref|ZP_08238552.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] gi|326659620|gb|EGE44466.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] Length = 575 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 75/211 (35%), Gaps = 26/211 (12%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + + S+ + V+D+S SM + +L + +S+ + D ++ V Sbjct: 198 VGLATEAAPSTAERPPAALTFVVDISGSMAE-----TGRLDLVRKSLAVLTDELRDDDSV 252 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + LVTFS P+ I++ + + +T G+ Y + + Sbjct: 253 S------LVTFSDAAETRLPMTRLQGNRNRIKDVVEEMRPEQSTNVEAGITRGYEESVEG 306 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEA 328 K ++ L+D ++ + + L + + G ++ +GV ++ Sbjct: 307 HRKGATNR---------VVLLSDALANTGDTEADGILERIDSTRREYGITLFGVGVGSDY 357 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 D F++ V + + F+ Sbjct: 358 GDAFMERLTNKGDGNTTYVGDEAQARKVFVD 388 >gi|219850594|ref|YP_002465027.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544853|gb|ACL26591.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 958 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 76/217 (35%), Gaps = 29/217 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-----V 216 + + L ++ V+D S SM+ D+ G T S +DI K Sbjct: 403 NRELRPDLAIVFVIDKSGSMDACHCANPDRGGPITSSSERKIDIAKDAVAQATALLSPQD 462 Query: 217 RSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G+VTF TF G V+ + + ++ + T GL A + +++ Sbjct: 463 TVGVVTFDGAAFPTFVATRGATVEQVMDAVSGVEPRGPTNIRAGLLRAEEMLQQVDARIK 522 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H +I LTDG S + L + +G + + + +A + Sbjct: 523 H-----------MILLTDGWGSGG-----DQLDIAARLREQGITLTVVAAGSGSATYLQQ 566 Query: 335 NCA-SPDRFYSVQNSRKLHDAF-----LRIGKEMVKQ 365 A R+Y + + F IG +V+Q Sbjct: 567 LAAEGGGRYYPAADMADVPQIFVQETITAIGNYIVEQ 603 >gi|332808244|ref|XP_001147912.2| PREDICTED: cartilage matrix protein [Pan troglodytes] Length = 717 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 272 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSS 322 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 323 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 376 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 377 --QKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 428 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 429 EHYFYTADFKTINQ----IGKKLQKKI 451 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 66/173 (38%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 41 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGMVNYASTVKQ 91 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V + + + R+ + T + +++A K F E H D K Sbjct: 92 EFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAE-GGHSRSP--DISKV 148 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIG + + P Sbjct: 149 VIVVTDGRPHDSVQDVSA------RARASGVELFAIGFGRVDKATLRQIASEP 195 >gi|296207278|ref|XP_002750612.1| PREDICTED: cartilage matrix protein [Callithrix jacchus] Length = 496 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 272 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSS 322 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 323 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 376 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 377 --QKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 428 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 429 EHYFYTADFKTINQ----IGKKLQKKI 451 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 67/173 (38%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 41 DLVFVVDSSRSVRPI------EFEKVKVFLSQVIESLDVGP---NATRVGVVNYASTVKQ 91 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL V + + + R+ + T + +++A K F E D K Sbjct: 92 EFPLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSR---SPDISKV 148 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIGV + + P Sbjct: 149 VIVVTDGRPQDSVRDVSA------RARASGVELFAIGVGRVDKATLQQIASEP 195 >gi|261418348|ref|YP_003252030.1| von Willebrand factor A [Geobacillus sp. Y412MC61] gi|319767693|ref|YP_004133194.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] gi|261374805|gb|ACX77548.1| von Willebrand factor type A [Geobacillus sp. Y412MC61] gi|317112559|gb|ADU95051.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] Length = 1077 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 68/210 (32%), Gaps = 29/210 (13%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 +D+ T L + P + P + + Sbjct: 129 GNGKEDVYFSFPQTPYQYTRQTGVSTAKLDFSLSFSQP-EYAKPPNGDAQGRLDVTLVPQ 187 Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 +S +D++ V+DVS SM KL A +++ ++ KS + N R Sbjct: 188 GAVSGIIRPPIDVVFVMDVSGSMTAM------KLQSAKSALQAAVNYFKS--NYNQNDRF 239 Query: 219 GLVTFSSKIVQTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 L+ FS + + + +G I N L G T + L A + D Sbjct: 240 ALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFTDPT 299 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 KKYIIFLTDG + N Sbjct: 300 R------------KKYIIFLTDGMPTVLNA 317 >gi|159899681|ref|YP_001545928.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159892720|gb|ABX05800.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 550 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 47/255 (18%), Positives = 94/255 (36%), Gaps = 30/255 (11%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS---SKSDIGL 169 S ++ + + N+ + F P +S A V + + Sbjct: 317 SQNLAMQQGFRPANVDVALASPLTAQFGVDPNQPRNSLATPPADVIVAAKNAWANNRKPA 376 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++M+V+D S SM D DK+ A + L+ + S +V G++ FSS Sbjct: 377 NIMLVVDSSGSMRDD-----DKMDQAKLGVEVFLNRLPSKDNV------GMIGFSSSPAV 425 Query: 230 TFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PLA + ++Q + L+ T ++ A ++ + K+ A Sbjct: 426 LVPLATRSENMANLQMQTQGLVPDGNTSLYDAIDLARQELENLKQPDRINA--------- 476 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 I+ L+DG +++ + + L ++ I A+A L+ A R VQ Sbjct: 477 IVVLSDGADTASQLSIDQML---GNFGESSIQIFPIAYGADAETSILQQIADFSRTELVQ 533 Query: 347 -NSRKLHDAFLRIGK 360 ++ + F + + Sbjct: 534 GSTGDIDKIFENLSR 548 >gi|332809378|ref|XP_003308230.1| PREDICTED: LOW QUALITY PROTEIN: epithelial chloride channel protein-like [Pan troglodytes] Length = 901 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 44/192 (22%), Positives = 66/192 (34%), Gaps = 34/192 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SMN + + ++ II+ V GLVTF S Sbjct: 308 VCLVLDKSGSMNAEDH----LFRMNQAAELYLIQIIEKGSLV------GLVTFDSFAKIQ 357 Query: 231 FPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L T GL+ + I + + Sbjct: 358 SKLIKIIDDNTYQKITANLPQEADGGTSICRGLKAGFQAISQSNQSTFGSE--------- 408 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 II LTDGE+ + C E K+ G +++ I + A ++ L N RFY Sbjct: 409 IILLTDGEDYQ--------ISLCFGEVKQSGTVIHTIALGPSADEELETLSNMTGGHRFY 460 Query: 344 SVQNSRKLHDAF 355 + +N L DAF Sbjct: 461 AHKNINGLIDAF 472 >gi|73538307|ref|YP_298674.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121644|gb|AAZ63830.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 353 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 40/262 (15%), Positives = 87/262 (33%), Gaps = 65/262 (24%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T++V + S + + +++ +DVS SM ++ A ++ R+++ + + Sbjct: 76 TATVTLPSDT---ITLVLAMDVSRSMEATDVAP-TRISAAQQAARDLIVGLPAS------ 125 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA------ 269 VR G+V+F+ Q + + I+R T + GL A +F Sbjct: 126 VRLGIVSFAGTATVVLRPTSNRQDMLDAIDRFQLQRGTATGSGLIQALAVLFPDDGIDLE 185 Query: 270 -----------------------------KEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 +E+ A+ +I L+DG + Sbjct: 186 AILFADEPVFSTRRAVPLDEAAAADAVRKREQATQSAQPGSYRHGAVILLSDGRRTVG-- 243 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAAD----------QF----LKNCA--SPDRFYS 344 + + A +RG VY +G + Q L+ A + ++ Sbjct: 244 --PDPVDAARMAAQRGVRVYTVGFGTLGGNAPETSLSYYMQLDEPALRAVATITGAEYFQ 301 Query: 345 VQNSRKLHDAFLRIGKEMVKQR 366 ++ L + ++ +R Sbjct: 302 AGSAADLSQVYRQLSARFALER 323 >gi|320160484|ref|YP_004173708.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] gi|319994337|dbj|BAJ63108.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] Length = 802 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 44/220 (20%), Positives = 80/220 (36%), Gaps = 30/220 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L++ + + ++ I D++ VLD S SM K A +++ +L + Sbjct: 281 LMLLAPRIQAPETAIPKDVIFVLDRSGSMEGV------KFQQAKQALEYVL------SRL 328 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 N R L++FS+++ P GV+ I Q+ + L T L A + Sbjct: 329 NPQDRFNLLSFSNQVEVFAPEMEGVEAIPQAQKWVAGLSAAGGTNIHRALLDAIQFVRSQ 388 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + Y+IFLTDG + D ++ L RG ++ GV + Sbjct: 389 RPT-------------YLIFLTDGLPTVGITDREQILDDFARQAPRGLRLFVFGVGYDVD 435 Query: 330 DQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVKQRI 367 L A V+ L+ A +++ + Sbjct: 436 TFLLDELALAHHGLSLYVRPEEDLNQAVAGFFEKISTPVL 475 >gi|29788808|gb|AAP03354.1| hypothetical protein [Oryza sativa Japonica Group] gi|108708692|gb|ABF96487.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 540 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 74/207 (35%), Gaps = 37/207 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 S+ S LD++ VLDVS SM G +D++ A + L + R Sbjct: 60 TSSATSRAALDLIAVLDVSTSMA---GNKLDRMKAALLFVIRKLADVD---------RLS 107 Query: 220 LVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 +VTFS+ + PL + + ++ L T GLE + + Sbjct: 108 IVTFSNDAARLCPLRFVAGDAARADLGALVDGLAADGNTNIRAGLEIGL-AVAAGRRLTA 166 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 A ++ ++DG+ + + + G V+ G+ A+ L+ Sbjct: 167 GRAVN-------VMLMSDGQQNRGDATRLDP---------GGVPVHTFGLGADHDPAVLQ 210 Query: 335 NCASPDR---FYSVQNSRKLHDAFLRI 358 A R F+ V + L F ++ Sbjct: 211 AIAGKSREGMFHYVADGVNLTAPFSQL 237 >gi|227832539|ref|YP_002834246.1| hypothetical protein cauri_0711 [Corynebacterium aurimucosum ATCC 700975] gi|227453555|gb|ACP32308.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 693 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 70/249 (28%), Gaps = 44/249 (17%) Query: 138 IFCTFPWCANSSHAPLLITSSVK--ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 + T S P+ SS S M+V D S SM + G ++ A Sbjct: 54 VLGTTALAVVSGLLPVASAEEETNAPSSSSSTMAPTMVVFDSSGSMITNDAGGQTRIDAA 113 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---------------IVQTFPLAWGVQHI 240 + R + GLVT+ V T P A + + Sbjct: 114 KDAARTFITEAGDDAP------LGLVTYGGNTGEAPEDEAAGCQDITVVTPPEAGNSEKM 167 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 ++ L T L A ++ ++ II ++DG + Sbjct: 168 IAHMDGLQPRGFTPIGESLRKAAAELPKEG-------------QRSIILVSDGVATCT-- 212 Query: 301 DNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAF 355 E K +G ++ +G E Q C A+ + + ++ L Sbjct: 213 -PPPVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQCIADATGGTYANASDADSLAKEL 271 Query: 356 LRIGKEMVK 364 R Sbjct: 272 NRAAPRTFN 280 >gi|291452634|ref|ZP_06592024.1| von Willebrand factor type A domain-containing protein [Streptomyces albus J1074] gi|291355583|gb|EFE82485.1| von Willebrand factor type A domain-containing protein [Streptomyces albus J1074] Length = 658 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 75/224 (33%), Gaps = 37/224 (16%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-KSIPD 211 L G ++MVLD S SM D G G ++ A ++ ++D + P Sbjct: 35 LAAGPPATAEPGPGAG-GLVMVLDSSGSMADEAGGGRTRIEAARDAVGTVVDSLPDGYP- 92 Query: 212 VNNVVRSGLVTF-SSKIVQTF---------PLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +GL + + + PL +++ + + T L Sbjct: 93 ------TGLRVYGADRTSGCTDTRLARPVEPL--DRDAMKKAVAGVEPKGDTPIGLSLRK 144 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG--AIV 319 A + + + ++ ++ ++DGE++ + E+ E G + Sbjct: 145 AVADLPE--------PEPGAVGRRTVLLISDGEDNCGSPPPCEA---AEELAESGLDLRI 193 Query: 320 YAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGK 360 AIG Q E + C A +Y ++ L R + Sbjct: 194 DAIGFQVEGKAREELTCVAEAGHGAYYDAPDAEALARQLQRAAE 237 >gi|239980776|ref|ZP_04703300.1| hypothetical protein SalbJ_15127 [Streptomyces albus J1074] Length = 628 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 75/224 (33%), Gaps = 37/224 (16%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-KSIPD 211 L G ++MVLD S SM D G G ++ A ++ ++D + P Sbjct: 5 LAAGPPATAEPGPGAG-GLVMVLDSSGSMADEAGGGRTRIEAARDAVGTVVDSLPDGYP- 62 Query: 212 VNNVVRSGLVTF-SSKIVQTF---------PLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +GL + + + PL +++ + + T L Sbjct: 63 ------TGLRVYGADRTSGCTDTRLARPVEPL--DRDAMKKAVAGVEPKGDTPIGLSLRK 114 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG--AIV 319 A + + + ++ ++ ++DGE++ + E+ E G + Sbjct: 115 AVADLPE--------PEPGAVGRRTVLLISDGEDNCGSPPPCEA---AEELAESGLDLRI 163 Query: 320 YAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGK 360 AIG Q E + C A +Y ++ L R + Sbjct: 164 DAIGFQVEGKAREELTCVAEAGHGAYYDAPDAEALARQLQRAAE 207 >gi|254477542|ref|ZP_05090928.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214031785|gb|EEB72620.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 523 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 3/76 (3%) Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPD-RFYSVQNSRKL 351 ++ + N + C+ AK RG +VY IG +A + LK+CAS D ++ VQ ++ Sbjct: 447 DSWGTSTKNARTKAICDAAKARGIVVYTIGFEAPSGGVSVLKDCASSDAHYFDVQGL-EI 505 Query: 352 HDAFLRIGKEMVKQRI 367 DAF I + + R+ Sbjct: 506 SDAFASIATSIRQLRL 521 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 49/333 (14%), Positives = 107/333 (32%), Gaps = 61/333 (18%) Query: 19 ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA------------TKILNQ 66 + A LL ++ V G+ I+ + L Y LD ++L A LN+ Sbjct: 5 PMIAFLLSMV-AVGGIGIDLMRMERDRTILQYTLDRAVLAAADLDQPLPPDVVVQDYLNK 63 Query: 67 ENGNN---GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQD--------INNIERSTSLS 115 N + + Y+ +++ T F + + QD +S Sbjct: 64 ANLSEYYQPPIAETGIGYKRVESTIDTTFETQWLDFSGGQDMPLYANSRAEESIDGLEIS 123 Query: 116 IIIDDQHKDYNLSAVSR-YEMPFIFC-TFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 +++D ++++ SR Y + ++ + S V +++ + DM Sbjct: 124 LVLDVSG---SMNSNSRLYNLKNAARDFIDTMVANTADNKMSVSIVPYATQVSLPKDM-- 178 Query: 174 VLDVSLSMNDHFGPGM--------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-- 223 LD ++H G++T + + +GL+ + Sbjct: 179 -LDQYNVTDEHEYSNCVNFTGTHFTSTGLSTTASLNRTMHFTPWWSGDARPSNGLIQYPV 237 Query: 224 -SSKIVQT-FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE---------- 271 + + P +++ I L T G+++ + + + Sbjct: 238 CDERAHREVMPFQKDANRLKDFIQNLQAWGNTSIDVGMKWGTVLLDPSAQPVISALTSSS 297 Query: 272 -------KLEHIAKGHDDYKKYIIFLTDGENSS 297 A + K I+ +TDG+N+S Sbjct: 298 VNVPGVFADRPAAYNDTETVKVIVLMTDGQNTS 330 >gi|153823396|ref|ZP_01976063.1| conserved hypothetical protein [Vibrio cholerae B33] gi|126519088|gb|EAZ76311.1| conserved hypothetical protein [Vibrio cholerae B33] Length = 171 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 48/120 (40%), Gaps = 15/120 (12%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPG---MDKLGVATRSI 199 ++ P+ + S+ D+M+V+D+S SM+ + G +D+L + + Sbjct: 62 LLLTAAARPVWYGDPISTSTSHR---DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVL 118 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-TTKSTPG 258 E + R GL+ F+ PL Q + ++N+ + T++ G Sbjct: 119 SEFI-------AKREGDRIGLILFADHAYLQTPLTLDRQTVANQLNQTVLKLIGTQTAIG 171 >gi|125586596|gb|EAZ27260.1| hypothetical protein OsJ_11197 [Oryza sativa Japonica Group] Length = 540 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 74/207 (35%), Gaps = 37/207 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 S+ S LD++ VLDVS SM G +D++ A + L + R Sbjct: 60 TSSATSRAALDLIAVLDVSTSMA---GNKLDRMKAALLFVIRKLADVD---------RLS 107 Query: 220 LVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 +VTFS+ + PL + + ++ L T GLE + + Sbjct: 108 IVTFSNDAARLCPLRFVAGDAARADLGALVDGLAADGNTNIRAGLEIGL-AVAAGRRLTA 166 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 A ++ ++DG+ + + + G V+ G+ A+ L+ Sbjct: 167 GRAVN-------VMLMSDGQQNRGDATRLDP---------GGVPVHTFGLGADHDPAVLQ 210 Query: 335 NCASPDR---FYSVQNSRKLHDAFLRI 358 A R F+ V + L F ++ Sbjct: 211 AIAGKSREGMFHYVADGVNLTAPFSQL 237 >gi|297735865|emb|CBI18619.3| unnamed protein product [Vitis vinifera] Length = 656 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 37/171 (21%), Positives = 67/171 (39%), Gaps = 25/171 (14%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + T+S ++ +D++ VLDV M KL + R++R ++ + S Sbjct: 267 TINTTTSSLLNPARRAPIDLVTVLDVGGGMTGA------KLQMMKRAMRLVISSLSSTD- 319 Query: 212 VNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R +V FS+ + PL G + + I LI G T + L+ A + Sbjct: 320 -----RLSIVAFSASSKRLMPLKRMTTTGRRSARRIIESLIAGQGTSAGEALKKASKVLE 374 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 D +E+ + I+ L+DG+N + + N A+ A Sbjct: 375 DRRERNPVAS---------IMLLSDGQNERVSSKSTNPNRPSNPAEDAFAK 416 >gi|218190303|gb|EEC72730.1| hypothetical protein OsI_06342 [Oryza sativa Indica Group] Length = 585 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 83/222 (37%), Gaps = 35/222 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMND--------HFGPGMDKLGVATRSIREMLD 204 L + + K + +D++ VLDVS SM D +L V +++ ++ Sbjct: 30 LRVEAPPMADLKGHVPIDVVAVLDVSGSMGDPAMASSDFEKNKPPSRLDVLKEAMKFIIR 89 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWGVQHIQEKINRLIFGSTTKSTPG 258 + + R +V F+ + V+ + G + ++K++ L T P Sbjct: 90 KLD------DGDRLSIVAFNDRPVKEYSTGLLNISGNGRRIAEKKVDWLEARGGTALMPA 143 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 LE A + + +I+ LTDG+++S +++ + Sbjct: 144 LEEAIRVLDCRPGDSRNSVG-------FILLLTDGDDTSGFRWSRDVINGA----VGKYP 192 Query: 319 VYAIGVQAEAADQFLKNCASPDR----FYSVQNSRKLHDAFL 356 V+ G+ A + + L + A R F +N K+ A Sbjct: 193 VHTFGLGAAHSSEALLHIAQESRGTYSFVDDENMDKIAGALA 234 >gi|312139258|ref|YP_004006594.1| type ii secretion system integral membrane subunit [Rhodococcus equi 103S] gi|311888597|emb|CBH47909.1| putative type II secretion system integral membrane subunit [Rhodococcus equi 103S] Length = 622 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 76/213 (35%), Gaps = 29/213 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +++ + D+++ +DVS M+ GP +D + A D ++ P + Sbjct: 73 EIEVRQQPGSEQDIVLAIDVSGGMS---GPALDDVKRAAS------DFVRQAPAGAH--- 120 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G+V SS L + + +I+ L G + + A + Sbjct: 121 IGIVAISSTPQVLSELTTDSEDLLRRIDGLKAGGNSAIADSVVTAAEML----------- 169 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNC 336 + + ++ LTDG ++S E + ++ +YA+ + L+ Sbjct: 170 ERGEAANNILLLLTDGADTSSAHSMSELPSVLSRSRAS---LYAVQMSTPETNSALLQQV 226 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A S ++ S ++ L + + + + Sbjct: 227 ARESRGQYASAGDTAALGAIYQSAARALGNLYV 259 >gi|162424746|gb|ABX90059.1| hedgling [Amphimedon queenslandica] Length = 2416 Score = 66.0 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 39/185 (21%), Positives = 69/185 (37%), Gaps = 19/185 (10%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 I D LD++ VLD S S+ + +A + ++++ K N + GL Sbjct: 176 IPDACDTNLDVVFVLDQSGSIG--YYNH----QLALNFLSKVVEFFKI---GANKTQVGL 226 Query: 221 VTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIA 277 +T+S+ F L I +I+R+ + T + GL A + + + Sbjct: 227 ITYSTHAYVQFDLNDYHSKSTILNRISRIYYTGGWTATALGLFQAGVILNPQQMRGARPI 286 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + +I LTDG ++ ID G VY +GV + + Sbjct: 287 SQGVP--RVVILLTDGRSNRVPID-----EVAPSLHDFGIQVYTVGVGNIYLPELKFIAS 339 Query: 338 SPDRF 342 PD + Sbjct: 340 DPDPY 344 >gi|327541056|gb|EGF27607.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 497 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 59/426 (13%), Positives = 115/426 (26%), Gaps = 92/426 (21%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G I++L A +LP++ ++ I + VK +L D + ++ Sbjct: 82 RRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGRAFSEEQ--TVE 139 Query: 73 KKQKNDFSYRIIKNIW-------QTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + D NE AQ N R + D + Sbjct: 140 AAKAAARLTAAMNEVAGEPYQLNTDDSANEFEFGVSAQTDGNTGRFYFTKVPTSDVAANL 199 Query: 126 NLSAVSRYE-----------MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 + R +PFIF + + S V ++ + D+ +V Sbjct: 200 VAVSSVRINGKRTDDSLLGPVPFIF--------PNTFSIGDFSPVASATAMQVDRDISLV 251 Query: 175 LDVSLSMNDHFGP--------GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD S SM+ G D L A + L+ K +R V+++ Sbjct: 252 LDRSGSMDWKTYDWPDDADPWGEDSLISAEDAGIVDLE-WKYRNGQPQYIRR--VSYNRG 308 Query: 227 IVQTFPLAWGVQHI--------------------------QEKINRLIFGSTTKSTPGL- 259 + + + Q N + ++ S L Sbjct: 309 YDEYDLYDHAWEEVFGLGPAPNTPWEDLVLAVDAFLRVLDQTPQNEQVSIASYNSHGTLD 368 Query: 260 ----------EYAYNKIFDAKEKLEHIAKGHDD-----------YKKYIIFLTDGENSSP 298 A ++ K ++ +TDG ++ Sbjct: 369 CWLLDDFDSVRAAVAQLGPNGSTGIGNGMNSGKTAFTHENARPYASKTMVVMTDGNHNYG 428 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFL 356 N + + + A + ++ A R Y + +L AF Sbjct: 429 TQPNTVAQQL---MSSSNLNIQTVTFGGGADQETMQEVAVTGLGRHYHADSGDELVSAFE 485 Query: 357 RIGKEM 362 I + Sbjct: 486 EIANNL 491 >gi|149437045|ref|XP_001515975.1| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Ornithorhynchus anatinus] Length = 949 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 74/201 (36%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ + N +V FSS+I Sbjct: 306 VVFVLDSSASMVGA------KLKQTKEALFTILHDLRPEDNFN------IVGFSSRIKVW 353 Query: 228 -VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 Q P+ ++ + I+ + T L+ + D + H Sbjct: 354 KDQLVPVTPNSIRDGKVYIHHMSPSGGTNINGALQTGIRLLND---FVAHNDIDARSVS- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 I+FLTDG + I + L EA R ++ IG+ + + L+NC Sbjct: 410 LIVFLTDGRPTVGEIQTPKILNNTKEAARDRVCLFTIGIGDDVDFKLLEKLSLENCGMTR 469 Query: 341 RFY-SVQNSRKLHDAFLRIGK 360 RF + +L + IG Sbjct: 470 RFQVEADAAAQLKGFYDEIGT 490 >gi|326676330|ref|XP_003200547.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 1164 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 53/262 (20%), Positives = 93/262 (35%), Gaps = 30/262 (11%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 + +T SI + Q ++ AVS E T P SS AP T Sbjct: 388 LSPNTQYSISVHTQLQETEGPAVSTTERTLPVSTAPPTIASSSAP-PPTLPATKEVCRAA 446 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ ++D S S+ D D R + + I + + + FS Sbjct: 447 KADLVFLVDGSWSIGD------DNFQKIIRFLHSTAGALDQIGP--DGTQVAIAQFSDDA 498 Query: 228 VQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L+ + + I R+ + G TK+ +++ + +F Sbjct: 499 RTEFSLSSHSSKEELLTAIQRVSYKGGNTKTGRAMKHVKDSVFAPVGGARRG------VP 552 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RF 342 K ++ LTDG + + L E + G IV+AIG + L + P Sbjct: 553 KVLVVLTDGRSQD------DVLQVSQELQAEGYIVFAIGFADADYGELLSIASRPGDRHV 606 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 + V + DAF I + +++ Sbjct: 607 FFVDD----LDAFRTIVENILQ 624 >gi|294055316|ref|YP_003548974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614649|gb|ADE54804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 730 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 45/266 (16%), Positives = 93/266 (34%), Gaps = 25/266 (9%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 D IE + D + + + + PW + + ++ Sbjct: 302 DSVRIEELVNYFNYSDAAPTKSLEDGGAPFAVHLEQMSAPWQPEHRLVRVGLKG-YEMPW 360 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + +++ +LDVS SM+ +KL + ++ + + S V VV +G Sbjct: 361 EERPASNLVFLLDVSGSMSQP-----NKLPLLKEALMLLTRRLDSRDRVAIVVYAGASG- 414 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +V A I+ + +L G +T + G+E AY EH + ++ Sbjct: 415 ---LVLPSTTANNTATIEHALTQLQAGGSTNAGAGIELAYQV------AREHFIEDGNNR 465 Query: 284 KKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 +I TDG+ N + ++AK G + +G D L+ ++ + Sbjct: 466 ---VILCTDGDFNVGQTNRGDLAQIVADQAKD-GVSLTVLGFGMGNYKDNMLEELSNKGK 521 Query: 342 --FYSVQNSRKLHDAF-LRIGKEMVK 364 + V + + F + + K Sbjct: 522 GTYAYVDSEAEARKVFLQDLASNIFK 547 >gi|149920875|ref|ZP_01909337.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] gi|149818274|gb|EDM77727.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] Length = 367 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 48/287 (16%), Positives = 99/287 (34%), Gaps = 46/287 (16%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 N F+ + +T D+ D + + +++P P ++ + Sbjct: 24 NPFSANDEASSITTDGGATEDEAGDDGSTTTPMYFDLPGEES-QPTAEGATTDCANVEVD 82 Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 ++ + +++++D S SM D FG G + ++ + ++ + + + + VR Sbjct: 83 TSPTTPT-----VVLLVDQSGSMWDDFG-GQPRWVALENTLFDPVNGV--VKPLEDQVRF 134 Query: 219 GLVTFSSK---------IVQTFPLAWG-VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 GL +SS ++ F ++G + + T + ++ + Sbjct: 135 GLALYSSMNGSFGGECPLITEFAPSFGNHASLAATFASAMPLDDTPTGDSIKAVAETLAA 194 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKE----SLFYCNEAKRRGAIVYAIG 323 E K I+ TDGE ++ D +E SL A G + I Sbjct: 195 FPEDG----------PKIIVLATDGEPDTCAVPDPQEGQPLSLEATQAAFDDGIRTFVIS 244 Query: 324 VQAEAADQFLKNCASPD------------RFYSVQNSRKLHDAFLRI 358 V + D L+ A+ FY N +L AF + Sbjct: 245 VGNQVTDAHLQELANAGVGLPTQGAVENAPFYKTLNPAELVSAFEAV 291 >gi|296445280|ref|ZP_06887239.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] gi|296257235|gb|EFH04303.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] Length = 575 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 41/84 (48%), Gaps = 7/84 (8%) Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE------AADQFLKNCASP-DRFY 343 TD ++S N+ ++ +L C AK G +Y IG + +++CA+ + ++ Sbjct: 490 TDYHDTSRNMQDELTLEACTNAKTAGVEIYTIGFSVPVDPIDAQGLKMMQDCATDANHYF 549 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRI 367 + + L+ AF IG + K R+ Sbjct: 550 AATDVDSLNAAFASIGSGVGKLRL 573 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 65/198 (32%), Gaps = 28/198 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F +G+++++ + + +++G ++ K+ L D + L A I+ Sbjct: 16 FSSCDRGNVAVIFGLSFIPLVLMLGAGVDYGRAVSTKSNLQQATDSAALAVAKTIVATTT 75 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + Q + ++N + E+ + +++ + Sbjct: 76 NQQAQSQAQVYLLTNVRNAVAVVTKAEISADRLTLCLDSTAQ------------------ 117 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 I T A+ T+ + + ++ +VLD S SM+ G G Sbjct: 118 ---------IPTTIMKIAHIETITTKATTCAQTPGGMNGTYEIALVLDNSGSMSKSAG-G 167 Query: 189 MDKLGVATRSIREMLDII 206 K+ + ++ I Sbjct: 168 KSKIAALRDAATSFVNNI 185 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 29/89 (32%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +S L I KI+ L T G + + I A D Sbjct: 364 TSSSQTVLQLTATQSTITTKISGLTENGYTNLHEGFMWGWRTISPTGPFAAGRAYATKDN 423 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEA 312 K I+F+TDG N+ + + + A Sbjct: 424 HKIIVFMTDGFNNWQSATSTVTGSAYQAA 452 >gi|60477748|gb|AAH90753.1| Matn4 protein [Danio rerio] Length = 261 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 79/207 (38%), Gaps = 36/207 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + + + +++D + R GLV +SS + Sbjct: 28 IDLVLLIDGSKSVRPQ------NFELVKQFVNQVVDQLDVSA---KGTRVGLVQYSSCVR 78 Query: 229 QTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL+ + H +++I + + T + L++ F E K Sbjct: 79 TEFPLS--MYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEKN---I 133 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + TDG + + + +AK G +YA+GV D+ L+ AS Sbjct: 134 PRVGLVFTDGRSQD------DIQEWAKKAKEAGITMYAVGVGKAVEDE-LREIASDPVEK 186 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F+ + F I + ++ Sbjct: 187 HFFYSAD-------FTAISQIAENLKL 206 >gi|197099226|ref|NP_001126843.1| inter-alpha-trypsin inhibitor heavy chain H4 [Pongo abelii] gi|55732844|emb|CAH93116.1| hypothetical protein [Pongo abelii] Length = 896 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 73/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSGK------KIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPDGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|253582981|ref|ZP_04860199.1| magnesium chelatase [Fusobacterium varium ATCC 27725] gi|251835187|gb|EES63730.1| magnesium chelatase [Fusobacterium varium ATCC 27725] Length = 632 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 62/164 (37%), Gaps = 21/164 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + G ++ V+D S SM ++ ++ +L R G+V+ Sbjct: 444 REKRTGASILFVVDSSGSMG-----VKKRMEAVKGAVMSLLK-----DAYEKRDRVGMVS 493 Query: 223 FS-SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F K + P+ + Q+K+ +L G T G+ AY I + K + Sbjct: 494 FRRDKAEELLPITRSIDLAQKKLEKLATGGKTPLAEGIAKAYTIIKNEMRKDK------- 546 Query: 282 DYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRGAIVYAI 322 + I+FL+DG+ + S KESL + K G I Sbjct: 547 EVVPLIVFLSDGKGNFSASGKDPVKESLEMAEKIKNEGIRAIVI 590 >gi|229005450|ref|ZP_04163163.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] gi|228755812|gb|EEM05144.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] Length = 474 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 41/289 (14%), Positives = 97/289 (33%), Gaps = 24/289 (8%) Query: 90 TDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSS 149 + + N + D+ N S + + + + +MP + Sbjct: 100 KEAARDTESNIKSADLKNKSNSEQADLYVHMMYSLLKQEIIPFDKMPLQILEIGRVEDEE 159 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 K + + ++ ++LD S SM M K+ +A +I++ + + Sbjct: 160 KKSNGTKGEQK-NKEDRGNYNIEILLDASGSMAGKIDGKM-KMDIAKEAIQQFVSDLPEA 217 Query: 210 PDVNNVVRSGLVTFSSKIVQ----------TFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +V+ V G + + + + ++ ++ T + Sbjct: 218 VNVSLRVY-GHKGSNDEKDKTASCGAIENIYTLQKYDQTTFRQSLDGFQPVGWTPLAEAI 276 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + + AKE ++I YI+ +DG + +E+ N + + Sbjct: 277 KKSTETFQSAKENDKNII--------YIV--SDGVETCGGNPVEEAQKVSNSNIKPIMNI 326 Query: 320 YAIGVQAEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 V EA Q + S ++ ++++L D F GK++ +R+ Sbjct: 327 IGFQVDHEAEKQLKEIAEVSKGKYVLANSAKELQDQFKETGKDITSRRL 375 >gi|148655977|ref|YP_001276182.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568087|gb|ABQ90232.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 420 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 44/241 (18%), Positives = 85/241 (35%), Gaps = 31/241 (12%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 R + + ++ + IT ++ ++ +++ V+D S SM G Sbjct: 4 EVTIRASLARPYMAAAATPQVAYMLIEIT-PGQVMTQVRAPVNVCFVIDRSGSMK---GE 59 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKIN 245 +D++ AT EMLD + +V F + P + ++IN Sbjct: 60 KIDRVRRATIRAIEMLDAQDVVS---------VVIFDHRTEVLIPATPVTRPAELIDRIN 110 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 R+ T+ P +E +I KG + +I LTDG+ + + Sbjct: 111 RVRDSGGTRIAPAIEAGLREI----------EKGPPQMVRRLILLTDGQ----TENESDC 156 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 L +A +R + A+GV + + L A S + K+ + F + Sbjct: 157 LRRATDAGQRNVPITALGVGKDWNEDLLIEMANRSGGTADYIDRPEKIVEYFQSTIQRAQ 216 Query: 364 K 364 Sbjct: 217 A 217 >gi|332299343|ref|YP_004441264.1| double-transmembrane region domain protein [Porphyromonas asaccharolytica DSM 20707] gi|332176406|gb|AEE12096.1| double-transmembrane region domain protein [Porphyromonas asaccharolytica DSM 20707] Length = 342 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 62/194 (31%), Gaps = 21/194 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + P L T +SS+ IG+D+ +DVS SM D++G A Sbjct: 61 LKLIAIALLFVALARPQLYTH-APVSSQQTIGVDLAFCIDVSNSMA-ARDVKPDRIGFAK 118 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + + + + R +V F+ PL + + + + G + Sbjct: 119 QIVTHTMQELAGS-------RVAMVVFAGGAYIRLPLTPDLPTARTFLADIQPGMVSNQG 171 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L A E+ K +I LTDGE+ ++ K++ Sbjct: 172 TNLGQAL-------ERSAQALSAPSRAGKAVIILTDGEDHEGGLE-----EGIERLKKQE 219 Query: 317 AIVYAIGVQAEAAD 330 Y + + Sbjct: 220 IKAYVVTIGLPDGA 233 >gi|260797291|ref|XP_002593637.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] gi|229278863|gb|EEN49648.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] Length = 371 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 41/183 (22%), Positives = 72/183 (39%), Gaps = 26/183 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K++ + +D++ VLD S S+ +F + LD R Sbjct: 211 KVTPPCNNPVDIVFVLDGSGSVGRRNFEKVQAGVKKIVGDFNIALDST----------RV 260 Query: 219 GLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEH 275 G+V +SS + Q F L +Q ++ I + T++ +EYA F + Sbjct: 261 GVVQYSSIVRQEFALDTFSNLQGLESGIQSIPYMAGGTRTGAAMEYAIQNSFTSA-NGAR 319 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 GH I+ +TDG + + +AK+ G +V+A+G+ A + L Sbjct: 320 PDVGH-----VIVLVTDGRSYD------DVSQASQKAKQAGIVVFAVGIGDGAVESQLNQ 368 Query: 336 CAS 338 AS Sbjct: 369 IAS 371 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 69/190 (36%), Gaps = 24/190 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D++ +LD S S+ D +++ + + R ++ +SS + Sbjct: 1 PIDIIFMLDGSGSVGP------DNFNKMKEFVKK---TVGGYLIGPSNTRVAVMQYSSSV 51 Query: 228 VQTFPL-AWGV-QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F L A+ + + I + T++ A ++ + A+ + + Sbjct: 52 RQEFALDAFNTLEDLLVGIEEIRYMRGGTRTG----KALTRLRRQGFLESNGARKNVPHV 107 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 I+ TDG +S E ++ G ++YA+GV Q ++ + Sbjct: 108 AVIV--TDGRSSDSVD------QAALETRQSGIVLYAVGVGNYDLGQLTDIASTNETLGV 159 Query: 345 VQNSRKLHDA 354 V N L D Sbjct: 160 VDNFNLLDDV 169 >gi|149694147|ref|XP_001503972.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Equus caballus] Length = 495 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 271 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSS 321 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 322 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 375 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 376 --QKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 427 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 428 EHYFYTADFKTINQ----IGKKLQKKI 450 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 77/197 (39%), Gaps = 28/197 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R GLV ++S + Q Sbjct: 40 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGLVNYASAVKQ 90 Query: 230 TFPL-AWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL A G + + + + R+ + T + +++A + F E D K Sbjct: 91 EFPLRAHGSKAALLQAVRRIQPLSTGTMTGLAIQFAITRAFSEGE---GGRARSPDISKV 147 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 +I +TDG D ++ G ++AIGV L+ AS + Sbjct: 148 VIVVTDGRPQDSVRDVSA------RSRASGIELFAIGVG-RVDKATLREIASEPQDEHVD 200 Query: 344 SVQN---SRKLHDAFLR 357 V++ KL F Sbjct: 201 YVESYSVIEKLSKKFQE 217 >gi|149202124|ref|ZP_01879097.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] gi|149144222|gb|EDM32253.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] Length = 584 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 46/264 (17%), Positives = 98/264 (37%), Gaps = 42/264 (15%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F + +G++++L + + ++ G+ I+ + +A L LD ++L AT + N Sbjct: 13 RFAKDEEGTVTVLAFAIFVMFLVMGGIGIDMMRQEMARASLQATLDRAVLAGATAVNN-- 70 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + +I++ + +++ A DI+ S+ ++ Y + Sbjct: 71 ----------ATARAVIEDYFAKSGQSDYLAAQEAGDIDIRLNSSKVTARATQTLDTYLM 120 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 ++ + T+ V I L++ M LDVS SM Sbjct: 121 RLAG--------------VDTLTSAGNSTAEVTIP-----KLEIAMALDVSGSMIGA--- 158 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKIN 245 ++ + E +D I + N+ V +V FS + + + A V + + Sbjct: 159 ---RIDALKPAAIEFVDSILDSTEPNDAV-ISVVPFSWGVTPSKEIYEALTVNETHKYSS 214 Query: 246 RLIFGST--TKSTPGLEYAYNKIF 267 L + T +T AYN++ Sbjct: 215 CLELNDSHFTDTTIDPNTAYNQLI 238 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 15/59 (25%), Positives = 22/59 (37%), Gaps = 6/59 (10%) Query: 295 NSSPNIDNKESLF--YCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP-DRFYSVQN 347 N+ N K+ C EAK G ++Y I +K CAS + Y+ Sbjct: 505 NNPINRSKKDERLDDICREAKSEGIVIYTIAFEMGSQPTGADKIKKCASSVNHHYNATT 563 >gi|301781662|ref|XP_002926252.1| PREDICTED: collagen alpha-3(VI) chain-like [Ailuropoda melanoleuca] Length = 3167 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S L + K+ D++ ++D S S+ + + Sbjct: 7 LPLVAMF---CLFLSGFSLTRAQQQQADVKNGAAADVIFLVDSSWSIGKEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D+I+S+ ++ R LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVIESLAVGDSDFRFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + T + ++ D + I+ LTDG + L Sbjct: 115 SNETG---KGLEYVMQNHLTEAAGSRAGDGVPQVIVVLTDGRSDDGLALPSAGL------ 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHVFNLENFTSLHDI 209 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 66/213 (30%), Gaps = 22/213 (10%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 ++ + ++ D++ +LD S ++ P + Sbjct: 613 LQGVLPGLLAPLRTLSGTSEVHVNKR---DIIFLLDGSSNVGKTNFPYVRDFVT------ 663 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPG 258 +++ S+ ++ +R GLV FS V F L + + RL + G Sbjct: 664 ---NVVNSLDVGSDNIRVGLVQFSDTPVTEFSLDTYQTKAELLAHLRRLQPQGGSGLNTG 720 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 +Y E ++ H + ++ LT G L N R G + Sbjct: 721 SALSYVHANHFTEAGGSRSREHVP--QLLLLLTAG------PAEDAYLPAANALARAGVL 772 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 +G + + +P Y + + L Sbjct: 773 TLCVGASRANKAELEQIAFNPSLVYLMDDFSSL 805 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 79/208 (37%), Gaps = 23/208 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISS-KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 I C SS AP + + + D++ ++D S + + Sbjct: 209 IVGNLVSCVQSSVAPEGAGGTETLKDITAQDSADIIFLVDGSNNTGSVHFAVIRDF---- 264 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTK 254 ++++++ + +R G+V +S + F L + + + L+F Sbjct: 265 -----LVNLLERLSVGAQQIRVGVVQYSDEPRTVFSLDTYSTKAQVLDAVKALVFTGGEL 319 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + GL A + + + ++ + + ++ ++ G +S D +L + Sbjct: 320 ANVGL--ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRDGVVALKQAS---- 373 Query: 315 RGAIVYAIGVQAEAADQF-LKNCASPDR 341 V++ G+ A+AA + L++ A+ D Sbjct: 374 ----VFSFGLGAQAASRAELQHIATNDN 397 Score = 43.3 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 37/249 (14%), Positives = 92/249 (36%), Gaps = 20/249 (8%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 NI+R+ +I D + + + + AP + + Sbjct: 1577 NIDRTELQTITNDPRLVFTVREFRELPSIEDRVMHAFGPSGVTPAPPGVDIPSPSRPEKK 1636 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S+N + L + +I+ ++ + + ++ GLV ++S Sbjct: 1637 KA-DVVFLLD--GSINFRRDTFQEVLRFVS-------EIVDTLYEGGDSIQVGLVQYNSD 1686 Query: 227 IVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L Q I + IN++++ + + + E ++ Sbjct: 1687 PTDEFFLKDFSTKQQIIDAINKVVYKGGRHANT--KVGIEHLRQNHFVPEAGSRLDQRVP 1744 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 + +T G++ + +L ++G V+A+GV+ +++ K ++ + Sbjct: 1745 QIAFVITGGKSVEDAQEASLALT------QKGVKVFAVGVKNIDSEEVGKIASNSATAFR 1798 Query: 345 VQNSRKLHD 353 V N ++L + Sbjct: 1799 VGNVQELSE 1807 >gi|195614282|gb|ACG28971.1| retrotransposon protein [Zea mays] Length = 650 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 75/213 (35%), Gaps = 18/213 (8%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 IF P +L+ ++ I +D++ VLDVS SMND ++ T Sbjct: 47 IFPEIPQGQARKDFQVLVRVEAPARPEARIPIDVVAVLDVSGSMNDPAAAPTER--TRTT 104 Query: 198 SIREMLDIIKSI--PDVNNVVRSGLVTFSSKI--VQTFPL----AWGVQHIQEKINRLIF 249 S ++L + + R +V FS + + L A G ++ +++L Sbjct: 105 SRLDLLKTAAKFMVAKLEDGDRLSIVAFSDRPVRELSSGLLYMTADGRRNAIRSLDQLEA 164 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKESLFY 308 T P E A + + +I+ LTDG E++S + E Sbjct: 165 RGGTALVPAFEEAVKVLDGRQGDGGDRLG-------FIVLLTDGAEDASGSFTLSERRRE 217 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 V+A G+ + L A R Sbjct: 218 VIRGALGRYPVHAFGLGTAHGPEVLLYLAQESR 250 >gi|325954651|ref|YP_004238311.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437269|gb|ADX67733.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 336 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 67/182 (36%), Gaps = 28/182 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G+D++ +LDVS SM+ +L A+R I + L+ + R+ LV F+ Sbjct: 85 SREGIDIVYLLDVSTSMDAQDVAP-SRLMKASRIISQSLNSLGG-------DRAALVIFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + PL I + L I T + A + + KG Sbjct: 137 ADGYTISPLTNDYAAIDSYLGSLSTNLISNQGTDFSAAFREAVSVL-----------KGA 185 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + K ++ L+DGE+ + +S+ N V +IG+ + D Sbjct: 186 PNTSKLVVLLSDGEDHESGEN--QSIKLAN---DNQIHVVSIGIGTDKGAPIPVQSMYGD 240 Query: 341 RF 342 + Sbjct: 241 EY 242 >gi|110667707|ref|YP_657518.1| hypothetical protein HQ1753A [Haloquadratum walsbyi DSM 16790] gi|109625454|emb|CAJ51881.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790] Length = 799 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 71/215 (33%), Gaps = 37/215 (17%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + V++ + +++ +DVS S + VA ++ ++ D Sbjct: 369 SIGTMLPVQVGEGTPGSARVILAIDVSGSTGSGMQI---QKAVALNALGQLGDSTS---- 421 Query: 212 VNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 G+V F+ + + L +++I +L G T GL A + Sbjct: 422 ------VGVVGFNRQAYEVVGLEQLTENRDTTRQRIRQLRAGGGTNIANGLRGAEEMLDG 475 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + +I ++DG ++ + RRG V +G Sbjct: 476 QRGT--------------VILISDGVDAR-----SRATVVAESLGRRGVRVITVGAGQRV 516 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKE 361 + L+ A S ++ + +L F G++ Sbjct: 517 NEPLLEQIADISGGTYFQANETDRLRILFGGSGRQ 551 >gi|262050538|ref|NP_001159921.1| inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 precursor [Homo sapiens] Length = 900 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|221042206|dbj|BAH12780.1| unnamed protein product [Homo sapiens] Length = 888 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 263 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 310 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 311 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 367 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 368 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 426 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 427 RIHEDSDSALQLQDFYQEVANPLLTAVT 454 >gi|219517748|gb|AAI36393.1| ITIH4 protein [Homo sapiens] Length = 935 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|187950343|gb|AAI36394.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Homo sapiens] Length = 930 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|119585669|gb|EAW65265.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_b [Homo sapiens] Length = 914 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|119585668|gb|EAW65264.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_a [Homo sapiens] Length = 930 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|4096840|gb|AAD05198.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Homo sapiens] Length = 930 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|7770149|gb|AAF69610.1|AF119917_18 PRO1851 [Homo sapiens] Length = 644 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 5 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 52 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 53 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 109 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 110 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 168 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 169 RIHEDSDSALQLQDFYQEVANPLLTAVT 196 >gi|31542984|ref|NP_002209.2| inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor [Homo sapiens] gi|229463048|sp|Q14624|ITIH4_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Plasma kallikrein sensitive glycoprotein 120; Short=Gp120; Short=PK-120; Contains: RecName: Full=70 kDa inter-alpha-trypsin inhibitor heavy chain H4; Contains: RecName: Full=35 kDa inter-alpha-trypsin inhibitor heavy chain H4; Flags: Precursor gi|1402590|dbj|BAA07536.1| PK-120 precursor [Homo sapiens] Length = 930 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|1483187|dbj|BAA07602.1| inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) [Homo sapiens] Length = 930 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|297683708|ref|XP_002819511.1| PREDICTED: collagen alpha-1(XXII) chain-like [Pongo abelii] Length = 259 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 48/206 (23%), Positives = 80/206 (38%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 13 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 63 Query: 230 TFPLA-WGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + ++ RL + G T + L Y + F + G YK+ Sbjct: 64 AFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPR---TGGRPGDRAYKQV 120 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV EA + L+ AS + Sbjct: 121 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-EALKEELEEIASEPKSAHVF 173 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 174 HVSD-------FNAIDKIRGKLRRRL 192 >gi|228997913|ref|ZP_04157515.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] gi|228761788|gb|EEM10732.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] Length = 474 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 41/289 (14%), Positives = 97/289 (33%), Gaps = 24/289 (8%) Query: 90 TDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSS 149 + + N + D+ N S + + + + +MP + Sbjct: 100 KEAARDTESNIKSADLKNKSNSEQADLYVHMMYSLLKQEIIPFDKMPLQILEIGRVEDEE 159 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 K + + ++ ++LD S SM M K+ +A +I++ + + Sbjct: 160 KKSNGTKGEQK-NKEDRGNYNIEILLDASGSMAGKIDGKM-KMDIAKEAIQQFVSDLPEA 217 Query: 210 PDVNNVVRSGLVTFSSKIVQ----------TFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +V+ V G + + + + ++ ++ T + Sbjct: 218 VNVSLRVY-GHKGSNDEKDKTASCGAIENIYTLQKYDQTTFRQSLDGFQPVGWTPLAEAI 276 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + + AKE ++I YI+ +DG + +E+ N + + Sbjct: 277 KKSTETFQSAKENDKNII--------YIV--SDGVETCGGNPVEEAQKVSNSNIKPIMNI 326 Query: 320 YAIGVQAEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 V EA Q + S ++ ++++L D F GK++ +R+ Sbjct: 327 IGFQVDHEAEKQLKEIAEVSKGKYVLANSAKELQDQFKETGKDITSRRL 375 >gi|24375866|ref|NP_719909.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24350833|gb|AAN57353.1|AE015872_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 451 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 72/196 (36%), Gaps = 26/196 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +V+D S SM+ D++ A + ++++K V+ ++ +S Sbjct: 67 EKSPINLSLVIDRSGSMSG------DRIEKAREAAIMAINMLKDDDIVS------VIAYS 114 Query: 225 SKIVQTFPLAW--GVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 P + + IN + G +T G+ ++ +K + Sbjct: 115 DNAYLIIPATKVKNKNEMIKIINDTIKPGGSTALFAGVSKGITEVNKFIKKNQVNR---- 170 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 II L+DG+ + KE A ++G V IG+ + + A S Sbjct: 171 -----IILLSDGQANIGPSTTKELADLGQVAGKQGIAVTTIGLGNGYNEDLMTALAGFSD 225 Query: 340 DRFYSVQNSRKLHDAF 355 V+NS L AF Sbjct: 226 GNHAYVENSADLETAF 241 >gi|113970537|ref|YP_734330.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-4] gi|113885221|gb|ABI39273.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-4] Length = 759 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 86/212 (40%), Gaps = 30/212 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ S + ++ ++++V+D S SM D + A ++R L ++ N Sbjct: 367 VEASEQPNLPRELILVIDTSGSMAG------DSIIQAKNALRYALRGLRPQDSFN----- 415 Query: 219 GLVTFSSKI--VQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ F+S + + + PL A + ++ +NRL T+ L A + Sbjct: 416 -IIEFNSDVSLLSSTPLPATATNLAMARQFVNRLQADGGTEMAQALNSAL------PRQA 468 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + A G D + +IF+TDG S ++ N+ ++ +G+ + F+ Sbjct: 469 FNTASGEDKSLRQVIFMTDG---SVGNESALFELIRNQIGDN--RLFTVGIGSAPNSHFM 523 Query: 334 KNCASPDR--FYSVQNSRKLHDAFLRIGKEMV 363 + A R F + + ++ ++ ++ Sbjct: 524 QRAAELGRGTFTYIGDVDEVEQKISKLLAKIQ 555 >gi|2292988|emb|CAA72155.1| Inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] Length = 932 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 47/327 (14%), Positives = 99/327 (30%), Gaps = 51/327 (15%) Query: 47 KLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDIN 106 L + SL L E+ ++ N + K F+ L + +Q+ Sbjct: 170 HLQ-MTSTSLSPQGISTLETESTFMTQELANALTTSQNKTKAHIQFKPTLSQQRKSQNEQ 228 Query: 107 NIERSTSLSI--IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 + ++ +D L + Y F+ P + +L Sbjct: 229 DTVLDGDFTVRYDVDRSSTGGYLQIENGY---FVHHFAPEDLPTMAKNVLF--------- 276 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 V+D S SM K+ ++ ++L + + N ++ FS Sbjct: 277 ---------VIDKSGSMAGK------KIQQTREALIKILKDLSTQDQFN------IIVFS 315 Query: 225 SKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + Q L + + +++ T + A + + + +K Sbjct: 316 GEANQWEQLLVQATEENLNRAVDYASKIPAQGGTNINKAVLSAVELLDKSNQAELLPSKS 375 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 II LTDGE + + K EA ++ +G + FL+ A Sbjct: 376 VS----LIILLTDGEPTVGETNPKIIQKNTQEAINGRYSLFCLGFGFDVNYPFLEKLALD 431 Query: 340 DR------FYSVQNSRKLHDAFLRIGK 360 + + ++ +L D + + Sbjct: 432 NGGLARRIYEDSDSALQLQDFYQEVAN 458 >gi|297265499|ref|XP_002799207.1| PREDICTED: matrilin-3-like [Macaca mulatta] Length = 445 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 75/218 (34%), Gaps = 32/218 (14%) Query: 154 LITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + + S G LD++ ++D S S+ + + ++D + Sbjct: 62 PASGASEPGRASGAGVCKSRPLDLVFLIDSSRSVRPL------EFTKVKTFVSRIIDTLD 115 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYN 264 P R +V ++S + F L Q +++ + ++ + T S ++ A + Sbjct: 116 IGP---ADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGQITPLSTGTMSGLAIQTAMD 172 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 ++F A + K I +TDG + A+ G +YA+GV Sbjct: 173 EVFTA---EAGARGPSSNIPKVAIIVTDGRPQD------QVNEVAARARASGIELYAVGV 223 Query: 325 QAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 + P + + V+ KL F Sbjct: 224 DRADMQSLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 261 >gi|94498564|ref|ZP_01305119.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] gi|94422007|gb|EAT07053.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] Length = 634 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 80/223 (35%), Gaps = 29/223 (13%) Query: 9 FFYNCKGS-ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILD-HSLLYTATKILNQ 66 + +GS ++++ A L+PVI +G I+ + VK++L +D +L + Sbjct: 26 LAGDRRGSTLALMAAGLIPVI-AALGAGIDAGRLYLVKSQLQAGVDAAALAGARAFAVTD 84 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD-Y 125 + KQ + + Y FA D + ++ + K Sbjct: 85 GSPAARDKQASAYFYGN-----------------FASDYMGVS-----NLQLTPDFKTVG 122 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 ++ + + TF + + ++ + L++M+VLD + SM + Sbjct: 123 GINVTTITARAIVPMTFMRIFGFQPRTMQAVAKAELQPR---PLEVMVVLDDTGSMKANL 179 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 G ++ + + +DI+ + G + + + Sbjct: 180 SGGRTRMVALKEAANDFVDILHQGASSRRDLAMGFIGYDVTVN 222 >gi|115555|sp|P05099|MATN1_CHICK RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|833607|emb|CAA30915.1| cartilage matrix protein [Gallus gallus] Length = 493 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 82/204 (40%), Gaps = 30/204 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S LD++ ++D S S+ + + I ++++ ++ GLV +S Sbjct: 267 SGSALDLVFLIDGSKSVRPE------NFELVKKFINQIVESLEVSEKQAQ---VGLVQYS 317 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + Q FPL + I+ + ++ T + L+Y + D+ + + A+ Sbjct: 318 SSVRQEFPLGQFKNKKDIKAAVKKMAYMEKGTMTGQALKY----LVDSSFSIANGARPGV 373 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 374 P--KVGIVFTDGRSQDYITD------AAKKAKDLGFRMFAVGVGNAVEDELREIASEPVA 425 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMV 363 + ++ + R + IGK++ Sbjct: 426 EHYFYTADFRTI----SNIGKKLQ 445 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 71/196 (36%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + +++ + P N R G++ ++S + Sbjct: 39 DLVFIIDSSRSVRPQ------EFEKVKVFLSRVIEGLDVGP---NSTRVGVINYASAVKN 89 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + R+ + T + +++A ++ F E + K Sbjct: 90 EFSLKTHQTKAELLQAVQRIEPLSTGTMTGLAIQFAISRAFSDTEGARLRSPN---INKV 146 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG D A++ G ++AIGV + + P D Sbjct: 147 AIVVTDGRPQDGVQDVSA------RARQAGIEIFAIGVGRVDMHTLRQIASEPLDDHVDY 200 Query: 345 VQN---SRKLHDAFLR 357 V++ KL F Sbjct: 201 VESYSVIEKLTHKFQE 216 >gi|239827908|ref|YP_002950532.1| hypothetical protein GWCH70_2571 [Geobacillus sp. WCH70] gi|239808201|gb|ACS25266.1| Ig domain protein group 2 domain protein [Geobacillus sp. WCH70] Length = 942 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 47/236 (19%), Positives = 80/236 (33%), Gaps = 38/236 (16%) Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 + IW L + + D NI D + + + S + P Sbjct: 4 KRKIWWLFIA--LFFSFYLGDATNIV----FGESNDSNNATLDFTITSS---QLEYAKPP 54 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + + ++ + +D++ V DVS SM KL A +++ + Sbjct: 55 NGDAQGRLDVTLIPKGRVDNIVRPPIDVVFVFDVSGSMTPL------KLQSAKYALQSAV 108 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIV--QTFPLAWGVQHIQEKI-------NRLIFGSTTK 254 D K+ + N+ R L+ FSS + + P G +++ + N L T Sbjct: 109 DYFKANANPND--RFALIPFSSDVQYNKVVPFPTGAYDVKQHLERIANVANDLRAYGGTN 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L+ A + D KKYIIFLTDG + + C Sbjct: 167 YTQSLQQAQSFFNDPTR------------KKYIIFLTDGMPTVSIAKEPITYKVCE 210 >gi|55741484|ref|NP_001006980.1| cartilage matrix protein [Rattus norvegicus] gi|54035339|gb|AAH83869.1| Matrilin 1, cartilage matrix protein [Rattus norvegicus] gi|149024105|gb|EDL80602.1| matrilin 1, cartilage matrix protein [Rattus norvegicus] Length = 498 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 69/196 (35%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + +IKS+ N R GLV ++S + Sbjct: 43 DLVFVVDSSRSVRPV------EFEKVKVFLS---QVIKSLDVGPNATRVGLVNYASTVKP 93 Query: 230 TFPLAWGVQHIQ--EKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL + ++R+ + T + L++A K E D K Sbjct: 94 EFPLRAHTSKASLLQAVHRIQPLSTGTMTGLALQFAITKALSDAEGGRSR---SSDISKV 150 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-----DR 341 +I +TDG D E A+ G ++AIGV + + P D Sbjct: 151 VIVVTDGRPQDSVRDVSE------RARASGIELFAIGVGRVDKATLRQIASEPQDEHVDY 204 Query: 342 FYSVQNSRKLHDAFLR 357 S KL F Sbjct: 205 VESYNVIEKLAKKFQE 220 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 82/208 (39%), Gaps = 34/208 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D++ ++D S S+ + + I +++D + + GLV +SS Sbjct: 272 GSATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSDRLAQ---VGLVQYSS 322 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 323 SIRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPG 376 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP- 339 +K I TDG + D +AK G ++A+GV ++ + + P Sbjct: 377 A--QKVGIVFTDGRSQDYINDAAR------KAKDLGFKMFAVGVGNAVEEELREIASEPV 428 Query: 340 -DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 D ++ + + ++ IGK++ K+ Sbjct: 429 ADHYFYTADFKTINQ----IGKKLQKKI 452 >gi|47208832|emb|CAF90336.1| unnamed protein product [Tetraodon nigroviridis] Length = 443 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 42/267 (15%), Positives = 93/267 (34%), Gaps = 34/267 (12%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS--- 163 + L I + L+ P ++ + T + +S+ Sbjct: 151 SSAPVAPLPTIPSPPNPTVQLTTPPAPLATITPEVLPVETSAPATLSVSTFTTTVSAVET 210 Query: 164 KSDIG-----LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 ++D LD++ ++D S S+ + + +M+D + D R Sbjct: 211 ETDSSCLSRPLDLVFIIDSSRSVRPS------EFEKVKIFLADMVDTLDVGADA---TRV 261 Query: 219 GLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEH 275 +V ++S + F L + ++++ I+R+ + T + ++ A ++ F + Sbjct: 262 AVVNYASTVKTEFLLKDHFNKPNLKKAISRIEPLATGTMTGLAIKTAVSEAFTEQSGARP 321 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE--AADQFL 333 + K I +TDG + + A+ G +YA+GV + Q + Sbjct: 322 RPRN---IAKVAIIVTDGRPQDQVEEVSAA------ARASGVEIYAVGVDRADMRSLQLM 372 Query: 334 KNCASPDRFYSVQN---SRKLHDAFLR 357 + D + V+ KL F Sbjct: 373 ASVPLEDHVFYVETYGVIEKLTSKFRE 399 >gi|301768026|ref|XP_002919432.1| PREDICTED: calcium-activated chloride channel regulator 1-like [Ailuropoda melanoleuca] Length = 913 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 46/208 (22%), Positives = 80/208 (38%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM++ D+L ++ + L I V G+VTF S Sbjct: 307 VCLVLDKSGSMSNG-----DRLKRLNQAGKLFLLQI-----VEQGSWVGMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L A + + + + T GL A+ I + + + Sbjct: 357 SELVQINGATERDALTKSL-PTVASGGTSICSGLRSAFAVI------RKKFSTDGSE--- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GA+++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISSCFNEVKQSGAVIHTVALGPSAAKELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ ++ L DAF + G QR Sbjct: 458 YASDQAQNNGLIDAFGALSSGNGAASQR 485 >gi|293361345|ref|XP_236596.5| PREDICTED: collagen type VI alpha 4 [Rattus norvegicus] Length = 2327 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 47/299 (15%), Positives = 94/299 (31%), Gaps = 24/299 (8%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 + L +IL+H A +Q + + + R + Sbjct: 111 MGQSLQFILEHHFPEGAGSRASQGVPQVALVMSTGVAEDHFREPAEALKREGILLYAIGV 170 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 ++ D + + + P + + T + Sbjct: 171 KDAAQAELREIASSPKDNFTFFVPNFSGLPGLA--QKLRPELCTTLAKVVQHTEQGSPAC 228 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ ++D S S G+ + I+ + ++ V+ GLV + Sbjct: 229 TEAFLADIVFLVDSSTS------IGLQNFQKVKNFLHS---IVSGLDVRSDQVQVGLVQY 279 Query: 224 SSKIVQTFPLAWG--VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S I F L + E+I L + +T + LE + + E AK Sbjct: 280 SDNIYPAFQLKQSSLKSVVLEQIRNLPYNMGSTNTGSALE--FIRANYLTEMSGSRAKDG 337 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +I +TDGE++ D ++ KR G VY +G+ + + K + P Sbjct: 338 VP--QIVILVTDGESNDEVQD------AADQLKRDGVFVYVVGINIQDVQELQKIASEP 388 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 77/193 (39%), Gaps = 23/193 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S++ M + M D+IK + VR G+V +S KI+ Sbjct: 869 DIYFLIDGSGSISPKDFTEMKEF---------MKDVIKMFHIGPDGVRFGVVQYSDKIIS 919 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 F L +++ + A +K+ + +Y+I Sbjct: 920 QFLLT-QYTSMEKLGTAIGNIQQGGGGTTTGEALSKMALVFRNTAR-----TNVAQYLIV 973 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 +TDG++S P D + L + G +YAIGV+ +A L+ A+ +R + + Sbjct: 974 ITDGQSSDPVADAAQGL------RDTGINIYAIGVR-DANTTELEEIAN-NRVFFTDDFH 1025 Query: 350 KLHDAFLRIGKEM 362 L + +++ Sbjct: 1026 FLKSIHQEVVRDI 1038 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 59/344 (17%), Positives = 112/344 (32%), Gaps = 45/344 (13%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 V++ S K + L L TA + Q G + + +N +T+ Sbjct: 910 VVQYSDKIISQFLLTQYTSMEKLGTAIGNIQQGGGGTTTGEALSKMALVFRNTARTNVAQ 969 Query: 95 EL------RENGFAQDINNIERSTSLSIII----DDQHKDYNLSAVSRYEMPFIFCTFPW 144 L + + D R T ++I D + A +R F F + Sbjct: 970 YLIVITDGQSSDPVADAAQGLRDTGINIYAIGVRDANTTELEEIANNRV---FFTDDFHF 1026 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + S + D++ ++D S S++ ++ M++ Sbjct: 1027 LKSIHQEVVRDICSFENCRSQKA--DIIFLIDGSESISSE------DFEKIKDFVKRMVN 1078 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH------IQEKINRLIFGSTTKSTPG 258 D ++ GL+ FSS + F L + ++ G+ T Sbjct: 1079 QSNIGADK---IQIGLLQFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALN 1135 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + I H +Y+I +TDG++ + ++L + R I Sbjct: 1136 FTLPFFDISQGGRPGVH---------QYLIVITDGDSHDDIVSPAKAL------RDRNII 1180 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++AIGV Q L D+ Y +N L + I E+ Sbjct: 1181 IFAIGVGKIQRAQLLAITNDQDKVYHEENFESLQNLEKEILYEV 1224 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 68/168 (40%), Gaps = 18/168 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ + + G + ++ + + PD+ VR LV +S K F L Sbjct: 634 DLVFLVEEFTSAGQPNFQQVIKLLKTTVHSLNIHPDI---VRVSLVFYSEKPQLKFSLNT 690 Query: 236 --GVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + +++L F TK+ L++ ++F ++ I +I + Sbjct: 691 FQNAAQVLTSLDQLTFRARRGRTKAGAALDFLRKEVFLPEKGSRSIWGVQQIA---VIIM 747 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 SP++DN + + +R G +YA+G+Q+ + + L+ A+ Sbjct: 748 -----ESPSLDNVST--PASHLRRTGVTIYAVGIQSASESKDLEKIAT 788 >gi|293349452|ref|XP_002727145.1| PREDICTED: similar to procollagen, type VI, alpha 3 isoform 4 [Rattus norvegicus] Length = 2114 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 47/299 (15%), Positives = 94/299 (31%), Gaps = 24/299 (8%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 + L +IL+H A +Q + + + R + Sbjct: 111 MGQSLQFILEHHFPEGAGSRASQGVPQVALVMSTGVAEDHFREPAEALKREGILLYAIGV 170 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 ++ D + + + P + + T + Sbjct: 171 KDAAQAELREIASSPKDNFTFFVPNFSGLPGLA--QKLRPELCTTLAKVVQHTEQGSPAC 228 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ ++D S S G+ + I+ + ++ V+ GLV + Sbjct: 229 TEAFLADIVFLVDSSTS------IGLQNFQKVKNFLHS---IVSGLDVRSDQVQVGLVQY 279 Query: 224 SSKIVQTFPLAWG--VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S I F L + E+I L + +T + LE + + E AK Sbjct: 280 SDNIYPAFQLKQSSLKSVVLEQIRNLPYNMGSTNTGSALE--FIRANYLTEMSGSRAKDG 337 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +I +TDGE++ D ++ KR G VY +G+ + + K + P Sbjct: 338 VP--QIVILVTDGESNDEVQD------AADQLKRDGVFVYVVGINIQDVQELQKIASEP 388 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 77/193 (39%), Gaps = 23/193 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S++ M + M D+IK + VR G+V +S KI+ Sbjct: 666 DIYFLIDGSGSISPKDFTEMKEF---------MKDVIKMFHIGPDGVRFGVVQYSDKIIS 716 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 F L +++ + A +K+ + +Y+I Sbjct: 717 QFLLT-QYTSMEKLGTAIGNIQQGGGGTTTGEALSKMALVFRNTAR-----TNVAQYLIV 770 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 +TDG++S P D + L + G +YAIGV+ +A L+ A+ +R + + Sbjct: 771 ITDGQSSDPVADAAQGL------RDTGINIYAIGVR-DANTTELEEIAN-NRVFFTDDFH 822 Query: 350 KLHDAFLRIGKEM 362 L + +++ Sbjct: 823 FLKSIHQEVVRDI 835 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 59/344 (17%), Positives = 112/344 (32%), Gaps = 45/344 (13%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 V++ S K + L L TA + Q G + + +N +T+ Sbjct: 707 VVQYSDKIISQFLLTQYTSMEKLGTAIGNIQQGGGGTTTGEALSKMALVFRNTARTNVAQ 766 Query: 95 EL------RENGFAQDINNIERSTSLSIII----DDQHKDYNLSAVSRYEMPFIFCTFPW 144 L + + D R T ++I D + A +R F F + Sbjct: 767 YLIVITDGQSSDPVADAAQGLRDTGINIYAIGVRDANTTELEEIANNRV---FFTDDFHF 823 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + S + D++ ++D S S++ ++ M++ Sbjct: 824 LKSIHQEVVRDICSFENCRSQKA--DIIFLIDGSESISSE------DFEKIKDFVKRMVN 875 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH------IQEKINRLIFGSTTKSTPG 258 D ++ GL+ FSS + F L + ++ G+ T Sbjct: 876 QSNIGADK---IQIGLLQFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALN 932 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + I H +Y+I +TDG++ + ++L + R I Sbjct: 933 FTLPFFDISQGGRPGVH---------QYLIVITDGDSHDDIVSPAKAL------RDRNII 977 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++AIGV Q L D+ Y +N L + I E+ Sbjct: 978 IFAIGVGKIQRAQLLAITNDQDKVYHEENFESLQNLEKEILYEV 1021 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 68/168 (40%), Gaps = 18/168 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ + + G + ++ + + PD+ VR LV +S K F L Sbjct: 431 DLVFLVEEFTSAGQPNFQQVIKLLKTTVHSLNIHPDI---VRVSLVFYSEKPQLKFSLNT 487 Query: 236 --GVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + +++L F TK+ L++ ++F ++ I +I + Sbjct: 488 FQNAAQVLTSLDQLTFRARRGRTKAGAALDFLRKEVFLPEKGSRSIWGVQQIA---VIIM 544 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 SP++DN + + +R G +YA+G+Q+ + + L+ A+ Sbjct: 545 -----ESPSLDNVST--PASHLRRTGVTIYAVGIQSASESKDLEKIAT 585 >gi|315649632|ref|ZP_07902717.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275105|gb|EFU38480.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 421 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 77/209 (36%), Gaps = 36/209 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SM D P D+ A I M D +N R ++ F Sbjct: 114 DVVLVIDNSGSMKDT-DPNQDRYTAAKNLINRM--------DRDN--RVSVIVFDHATTL 162 Query: 230 TFPLAWGVQ-----HIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P I +I+ L T + LE + I ++++ Sbjct: 163 LQPFTRVKNQEIKDEIMAEIDGLATTDGGTDISLALEDTMSHIQESQDAGRSAM------ 216 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCA--SP 339 +I L+DG + + + K++ V IG+ + L+ A + Sbjct: 217 ---VIMLSDGFS-----ETDHDRVLADY-KQQQIAVNTIGLSLVYKDGANLLQTIAAETG 267 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 ++Y VQN+ L F +I ++ + +L Sbjct: 268 GQYYDVQNAADLSFVFQKIYDDVGDRSLL 296 >gi|281356510|ref|ZP_06243002.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281317202|gb|EFB01224.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 783 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 78/219 (35%), Gaps = 38/219 (17%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 R+ + + T + ++ P + V + G D+M++ DVS SM + Sbjct: 56 RFRIFLLLLTMLFLIAAAARPFWSSQLVPFEPR---GRDLMVIFDVSKSMLATDIAP-SR 111 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG- 250 L A +R++ ++S P+ R GLV F+ K PL + I+ L Sbjct: 112 LEHAKFLLRQL---VESAPN----DRFGLVAFAGKAYLACPLTSDSLAFTQYIDELNTDT 164 Query: 251 ---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T L A A + I+ TDG+ + N S Sbjct: 165 VPLGGTNLEAALRVAEQAFKAAAGGN-----------RGILLFTDGDELAGN-----SAA 208 Query: 308 YCNEAKRRGAIVYAIGVQ-------AEAADQFLKNCASP 339 +E ++R ++ +G+ AD LK AS Sbjct: 209 LVDELRKRQIPLFIVGLGDPEVGAPVPEADGTLKRDASG 247 >gi|164688691|ref|ZP_02212719.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] gi|164602167|gb|EDQ95632.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] Length = 1508 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 49/329 (14%), Positives = 98/329 (29%), Gaps = 50/329 (15%) Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP 136 ++ + D E + + I + D D +L+ ++ E P Sbjct: 305 EKYNQYTFNKMIINDVEQEKKTTFDVKSDTTIAYYYTDLNAPDAVEPDGDLNGPAQPEYP 364 Query: 137 FIFCTFPWCANSSHA----PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD-- 190 S L G D+++V+DVS SM+ Sbjct: 365 NQGAVKTSKTASDTDFPKTGLTRVELGVTGVPLKQGTDVVLVIDVSGSMDWDVDGKQTTD 424 Query: 191 ----KLGVATRSIREMLDII--KSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQH 239 ++ +A S ++ ++ + + + R +V FSS L Q Sbjct: 425 NTKKRITIAKDSAKQFVNQLFANNEDGSKSNNRVSVVIFSSSGYTNGILCSLKNVDNKQT 484 Query: 240 IQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + I+ + T + A + K K ++F++DG + Sbjct: 485 VIDAIDGISNNPTGGTDYDNAMTMAEQVLETVK---------DTTRNKAVLFMSDGAPEN 535 Query: 298 PNIDNKESLFYCNEAK----------RRGAIVYAIGVQAEAAD----------QFLK-NC 336 Y + K GA +Y + + + Q L+ Sbjct: 536 GYNGKTGYDIYPDAFKAHEKSSEIKNNYGATIYTVSFGLKGSQYKELTEDRCRQILRDYM 595 Query: 337 ASPDR-FYSVQNSRKLHDAFLRIGKEMVK 364 AS + + + + L +AF I + K Sbjct: 596 ASNENCYKNANSKEDLENAFTNIATAIRK 624 >gi|126722991|ref|NP_062242.2| inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] gi|59808174|gb|AAH89806.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Rattus norvegicus] gi|149034208|gb|EDL88978.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_a [Rattus norvegicus] Length = 933 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/310 (13%), Positives = 95/310 (30%), Gaps = 50/310 (16%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI--IIDDQ 121 L E+ ++ N + K F+ L + +Q+ + ++ +D Sbjct: 187 LETESTFMTQELANALTTSQNKTKAHIQFKPTLSQQRKSQNEQDTVLDGDFTVRYDVDRS 246 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 + + Y F+ P + +L V+D S SM Sbjct: 247 STGGTIQIENGY---FVHHFAPEDLPTMAKNVLF------------------VIDKSGSM 285 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWG 236 K+ ++ ++L + + N ++ FS + Q L Sbjct: 286 AGK------KIQQTREALIKILKDLSTQDQFN------IIVFSGEANQWEQLLVQATEEN 333 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + +++ T + A + + + +K II LTDGE + Sbjct: 334 LNRAVDYASKIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVS----LIILLTDGEPT 389 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR------FYSVQNSRK 350 + K EA ++ +G + FL+ A + + ++ + Sbjct: 390 VGETNPKIIQKNTQEAINGRYSLFCLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSALQ 449 Query: 351 LHDAFLRIGK 360 L D + + Sbjct: 450 LQDFYQEVAN 459 >gi|281352224|gb|EFB27808.1| hypothetical protein PANDA_008060 [Ailuropoda melanoleuca] Length = 911 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 46/208 (22%), Positives = 80/208 (38%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM++ D+L ++ + L I V G+VTF S Sbjct: 307 VCLVLDKSGSMSNG-----DRLKRLNQAGKLFLLQI-----VEQGSWVGMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L A + + + + T GL A+ I + + + Sbjct: 357 SELVQINGATERDALTKSL-PTVASGGTSICSGLRSAFAVI------RKKFSTDGSE--- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GA+++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISSCFNEVKQSGAVIHTVALGPSAAKELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ ++ L DAF + G QR Sbjct: 458 YASDQAQNNGLIDAFGALSSGNGAASQR 485 >gi|189461338|ref|ZP_03010123.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] gi|189431867|gb|EDV00852.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] Length = 341 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 68/200 (34%), Gaps = 31/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F P + K+ + G++ ++ LD+S SM +L + Sbjct: 61 LVFAALAMVIFMLARPQFGS---KMETVKRQGVETVVALDISNSMLAQDVTP-SRLEKSK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKIN-RLIFGST 252 + I +++ N + ++ F+ + P+ + E IN LI Sbjct: 117 KLISRLVETF-------NNDKVAMIVFAGEAFTQLPITSDYISAKMFLETINPSLISTQG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + A ++ + I+ +TDGEN ++ EA Sbjct: 170 TDIAGAINLAMKSFT-----------PNEGVGRAIVLITDGENHEG-----GAVEAAQEA 213 Query: 313 KRRGAIVYAIGVQAEAADQF 332 ++G V+ +GV + Sbjct: 214 AKKGVRVFVLGVGSPDGAPI 233 >gi|113476847|ref|YP_722908.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167895|gb|ABG52435.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 477 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 81/231 (35%), Gaps = 29/231 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P IF F A + + ++ + +++++D S SM +G + ++ A Sbjct: 13 PIIFGIFGGAGCLIAAAIFGEMWLSLTRRPPQPQTVVLLIDTSSSM---WGGKLPEVQAA 69 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 E ++ VNN +V FSS +++ I L T Sbjct: 70 ATGFVERQNLT-----VNN---LAIVEFSSNSQVLTNFDADKTELKQAIANLTPSGGTNL 121 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + GL+ + + ++ I+ TDG+ + D + S E + Sbjct: 122 SQGLKTVASLLRNSNTPN-------------ILLFTDGQPN----DPRASKSIAREIREA 164 Query: 316 GAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 G + +G ++ +PD + NS ++ AF K + + Sbjct: 165 GINLVTVGTGDANSNYLTSLTENPDLVFFA-NSGEIDQAFRAAEKAISQLS 214 >gi|47523388|ref|NP_999313.1| calcium-activated chloride channel regulator 1 precursor [Sus scrofa] gi|75051712|sp|Q9TUB5|CLCA1_PIG RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; AltName: Full=pCLCA1; Flags: Precursor gi|6002646|gb|AAF00077.1|AF095584_1 epithelial chloride channel protein [Sus scrofa] Length = 917 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/208 (20%), Positives = 69/208 (33%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM +L ++ + L + V G+V F S Sbjct: 307 VCLVLDKSGSM-----TVGGRLKRLNQAGKLFL-----LQTVEQGAWVGMVAFDSAAYVK 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L A + + T GL A+ I Sbjct: 357 SELVQINSAAERDALARSL-PTAASGGTSICSGLRSAFTVIKKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C E K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISACFPEVKQNGAIIHTVALGPSAAKELEELSQMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G QR Sbjct: 458 YASDQAENNGLIDAFGALSSGNRAASQR 485 >gi|84498071|ref|ZP_00996868.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] gi|84381571|gb|EAP97454.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] Length = 651 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 72/227 (31%), Gaps = 43/227 (18%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T++ S+ + ++++LD S SM G+ K+ A +++ + + ++PD Sbjct: 26 TAAATSSADEPVPGKLLLMLDASGSMKAKDPSGLTKIEAAKKALTGV---VGALPDTAQ- 81 Query: 216 VRSGLVTFSSKIVQTFPLA---------------WGVQHIQEKINRLIFGSTTKSTPGLE 260 GL + + + + I + T L Sbjct: 82 --VGLRVYGATVDGKGKPTPAACADTQLIHPIAALDKTKLTTTIAAIKALGETPIAHSLT 139 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--I 318 A + + ++ I+ ++DGE S + + G Sbjct: 140 EALKDLGTSGKRN-------------IVLVSDGEESC----VPDPCPIVKKLTAAGVDLQ 182 Query: 319 VYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEM 362 + +G A + C A +Y +++ L + ++ + Sbjct: 183 IDTVGFGVNAKARTQLQCIADAGKGTYYDAKDAGALATSLNKLSQRA 229 >gi|220910752|ref|YP_002486062.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219867524|gb|ACL47861.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 411 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 77/228 (33%), Gaps = 25/228 (10%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 +P + + I V + L++ +V+D S SM K+ Sbjct: 8 LLPLRAAVASDQPTTLDVLVKIIPPVPETRPQRPPLNLGLVIDRSGSMQGA------KME 61 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGS 251 VA ++ ++ + +P R + F ++ P L I I + Sbjct: 62 VARQAACFAVEQL--LPS----DRLSVTIFDDRVECPVPSTLVRDKATIIRTIQGIHSRG 115 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G + + +H+ H + +I L+DG + + + + Sbjct: 116 STALHDGW------VQGGIQVSQHLNPAHLNR---VILLSDGLANVGETNPDAIAQHVHG 166 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLR 357 +RG +G+ + + L+ A FY ++ + +L F Sbjct: 167 LAQRGVSTSTMGIGEDYGEDLLEAMARSGAGSFYHIERTEQLAAIFQA 214 >gi|118090156|ref|XP_420539.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 606 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 49/206 (23%), Positives = 74/206 (35%), Gaps = 27/206 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 DM VLD S S+ ++ D + S L P + R + FSS+ Sbjct: 152 GAFDMYFVLDKSGSVAQNWHEIFDFVNQLDGSGFVRLTERFVSPKM----RLSFIVFSSQ 207 Query: 227 IVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + I+E + L T GL+ A +I +G + Sbjct: 208 AHVIMPLTGDREKIKEGLKNLSEVKPAGDTYIHEGLKQANMQI---------EKQGASRF 258 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAK---RRGAIVYAIGVQAEAADQFLKNCASPD 340 II LTDG +D + L+ EAK GA VY +GV Q + + + Sbjct: 259 SSIIIALTDG-----KLDGQIPLYAEKEAKISRELGARVYCVGVLDFVQAQLERIADTKE 313 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + V A I ++KQ Sbjct: 314 QVFPVTGG---FQALKGIINSVLKQS 336 >gi|116622066|ref|YP_824222.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225228|gb|ABJ83937.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 309 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 71/220 (32%), Gaps = 29/220 (13%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + S+++ S DI + + +V+D S SM + +S + + + N Sbjct: 64 VRQSIRLFSHEDIPVTVGLVIDHSGSMRPKMASVIAAARTFIQSSSPEDQMF--VVNFNE 121 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 V GL T P + + I+ T + A + Sbjct: 122 DVTLGLST-------EIPFTNRPEDLTYAISHSPPTGKTALYDAVWKAREWVARGSRD-- 172 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 KK ++ ++DG +++ E L N + V+ IG+ + Sbjct: 173 ---------KKVLVVVSDGGDNASTHTLSEILEAAN---KSNIQVFTIGIFDPDDPDKNP 220 Query: 331 QFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 L+ A+ + ++ I K++ Q L Sbjct: 221 GVLRQLARATGGEAFVPDELSEVVAICESIAKDIRSQYTL 260 >gi|167534461|ref|XP_001748906.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772586|gb|EDQ86236.1| predicted protein [Monosiga brevicollis MX1] Length = 2847 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 69/179 (38%), Gaps = 23/179 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ +LD S S++ + +R++ + ++ +N VR G+ +SS Sbjct: 819 IDVFFLLDGSGSIDG------RDFELQRSFVRDL---VSNLMSGDNDVRVGVAEYSSTYT 869 Query: 229 QT-FPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 Q FP + I ++ + T + L A + I + Sbjct: 870 QIVFPFSSSQSAIDSSLSSMIQTAGATATGTSLGEAADDIGSTARSSA---------ARV 920 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASPDRFYS 344 +I +TDGE S D + + + G + AIGV + + + L+ S D ++ Sbjct: 921 LILMTDGETSDG--DEQNIDPSVDALRALGVSITAIGVGNSASESELLQIAGSSDHVFN 977 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 62/171 (36%), Gaps = 18/171 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++++ D S S+ + + + +R +++ + R +TF S+ Sbjct: 1244 DLILIQDNSGSIEE------RDFQTSIQFLRALVNGADIESSGS---RIAAITFCSEPTL 1294 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E ++ L S T + A D + + + ++ +I Sbjct: 1295 ---LTDYVSTTSEALDALNTASNTLTCG---TATGAALDFVRENILTDRSNSGARRVVIV 1348 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 +TDGE+ + + VYAIGV + L+ AS D Sbjct: 1349 ITDGESQEDFSVVQNAGARLQAEVDD---VYAIGVGSGTDLAELRVIASSD 1396 >gi|153010351|ref|YP_001371565.1| hypothetical protein Oant_3028 [Ochrobactrum anthropi ATCC 49188] gi|151562239|gb|ABS15736.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 605 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/258 (16%), Positives = 70/258 (27%), Gaps = 81/258 (31%) Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-----GVQH 239 + P L +R+ LD I N+ G +S PL G++ Sbjct: 352 YSPRQSDLK--KYYLRDSLDKIYRGGRSND----GGPNYSCTSSPLTPLTDVTTEQGMKT 405 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 IQ I ++ T + + + I E K +I LTDG N+ Sbjct: 406 IQTAIKAMVPSGGTNVPEAMAWGWRTIVRGAPFTEARPSTERGNDKVVIVLTDGANTYYK 465 Query: 300 ID-----------------------------------------------------NKESL 306 D N Sbjct: 466 YDGLAGSGPDRAANFSYYSAHGYTARITKHYSQARLFQESGVSVSQNNSTYTKAMNARFA 525 Query: 307 FYCNEAKRRGAIVYAIGVQAEAAD-------QFLKNCAS----------PDRFYSVQNSR 349 C+ AK IV + + + L++C+S P + + Sbjct: 526 KLCDNAKSANIIVMTVALDLSETNSTEKAQIDLLRSCSSNSRVRTESGRPAKLFWNSTGG 585 Query: 350 KLHDAFLRIGKEMVKQRI 367 +L + F +IG E+ RI Sbjct: 586 ELSETFRQIGDELSNLRI 603 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 30/226 (13%), Positives = 83/226 (36%), Gaps = 25/226 (11%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F + +G+ +++ A++L + + + +++++ V+ + LD + L + E Sbjct: 4 RFAKDERGNFAMIMALVLVPLLLAGMVAVDSANLMRVRNNVQASLDAAALAVGRRFSTGE 63 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + ++ + + D N + + D Q + Sbjct: 64 SQTV--------VQVYGAQVFTANL------TALSADAVNFDVAFPKDKTTDQQ-----I 104 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-G 186 A + + +F ++ S + +++ +VLD S SM++ G Sbjct: 105 QATAGFTYKSLFGVIASRLTGDDWDQN-QYTLSSSVRLKNTIEVALVLDNSRSMDETRSG 163 Query: 187 PGMDKLGVATRSIREMLDIIKS----IPDVNNVVRSGLVTFSSKIV 228 ++ + + ++++ + + I V N V+ LV F+ + Sbjct: 164 STKKRIDLLKEAASQLVETMAAQSTLITHVENPVQFSLVPFAGSVN 209 >gi|52425826|ref|YP_088963.1| hypothetical protein MS1771 [Mannheimia succiniciproducens MBEL55E] gi|52307878|gb|AAU38378.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 541 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 89/264 (33%), Gaps = 36/264 (13%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 F+ ++ FF N G+ +++ IL + ++ L ++ S KA+ ++ + L Sbjct: 6 YFIKLKQFFQNEDGAYAVIMGILSFFLIGLVALTVDGSGMLLDKARFSQGIEQAGLALMA 65 Query: 62 KILNQENGN----------NGKKQKNDFSYRIIKNIWQTD-------FRNELRENGFAQD 104 + + N K++ FS ++ + R+ + + +D Sbjct: 66 ENNDFRTTNQKHADVLRQTVTKEELEGFSDTFSAQKYKRNQELVSGLVRHYYYPSTYFKD 125 Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 I L +++ E+ F W + L + I++ Sbjct: 126 NLKISDKYDYQCNNLQGPNGEQLKSIAC-EISGKFERPSWLYLGKNNGLSFAETTTINAN 184 Query: 165 SDI---------GLDMMMVLDVSLSMNDHFG------PGMDKLGVATRSIREML---DII 206 +D+M+V D+S SMN +D L +I + L + Sbjct: 185 KIYIQKNLDEIIPIDLMLVADLSGSMNSSVSGTKYGTAKIDILREVVSAIAKELLEQNNT 244 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQT 230 + ++ R G +F+ Q Sbjct: 245 EEGKVISQYNRIGFTSFAFGAQQQ 268 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 49/142 (34%), Gaps = 24/142 (16%) Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-LEHIAKGHDDYKKYIIFLTDGENSSP 298 + + NR+ T S+ GL N + + +K + ++ I+ L+DGE+ Sbjct: 381 LVSRFNRVPALGATLSSSGLLIGANLLMNTNPDENAQPSKLGANTQRIILVLSDGEDQIN 440 Query: 299 NIDNKESLF-------YCNEAKRR------------GAIVYAIGVQAEAADQFL---KNC 336 N + ++ C + K + + + + + C Sbjct: 441 NASSSLNITSTLINQGMCEKIKSKLNSLQDKTYLEQPTRIGFVAFGYGPSGTQKAAWEKC 500 Query: 337 ASPDRFYSVQNSRKLHDAFLRI 358 +Y N +L ++F +I Sbjct: 501 V-GKYYYVANNKEELLESFRKI 521 >gi|304382531|ref|ZP_07365026.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] gi|304336362|gb|EFM02603.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] Length = 340 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 66/209 (31%), Gaps = 31/209 (14%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + C P + + KI+ + G++ ++ LD+S SM Sbjct: 52 KVRPHVKFGLCCCLLAVLIFMLARPQMGS---KITHEKRNGIEAIIALDISNSMLAQDVV 108 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN-- 245 +L + I M+D + + GL+ F+ P+ + + Sbjct: 109 P-SRLEKSKMLIENMVDNFTN-------DKVGLIVFAGDAFIQLPITSDFVSAKMFLQNI 160 Query: 246 --RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 LI T ++ A ++ K II +TDGE+ Sbjct: 161 DPSLIATQGTDIAKAIDMAMKSFTQ-----------QENVGKAIIVITDGEDHEG----- 204 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +L AK +G V+ +GV Sbjct: 205 GALEAAKTAKAKGYNVFILGVGTAKGAPI 233 >gi|159901411|ref|YP_001547658.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159894450|gb|ABX07530.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 337 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 66/201 (32%), Gaps = 19/201 (9%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 R + + P+ + + GL ++++LD S SM + Sbjct: 54 RIRIVLQLSAVGLIIVALARPVWGSGDETL---KRSGLQVLILLDGSRSMAAQDVRP-SR 109 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + + R + +LD ++ G++ F S FPL + + + + Sbjct: 110 IDASKRMVLALLDRLEGNQ-------VGMLMFGSSSYVQFPLTSDLAAARSLVEPINPRG 162 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 L + + + + +I +TDG +S D + Sbjct: 163 -------LSLGGTDVEEVITEGLRSFPIGQIEGRTMILITDGGDSDEQSDGEAVAAAREA 215 Query: 312 AKRRGAIVYAIGVQAEAADQF 332 AK G ++ IG+ EA Q Sbjct: 216 AK-MGLTIHTIGMATEAGGQI 235 >gi|282858825|ref|ZP_06267970.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588394|gb|EFB93554.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 340 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 69/202 (34%), Gaps = 34/202 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGV 194 + C P + + ++K G++ ++ LD+S SM D +DK + Sbjct: 61 LMQCILTLIILILARPQI--GNRISTTKHSFGIETVIALDISNSMLAQDVVPSRLDKSKL 118 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFG 250 + + D + GL+ F+ P+ + ++ LI Sbjct: 119 LIEDLFRIFDN----------DKVGLIVFAGDAFVQLPITSDFISAKMFLDNINPSLIGT 168 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + A H K I+ +TDGE++ + + Sbjct: 169 QGTDIGQAINLA-----------MHSFSPTSKSGKAIVVITDGEDNEG-----RAEEMAS 212 Query: 311 EAKRRGAIVYAIGVQAEAADQF 332 +A++ G VY +GV + + + Sbjct: 213 KAQKAGIQVYILGVGSTSGAEI 234 >gi|254293317|ref|YP_003059340.1| von Willebrand factor A [Hirschia baltica ATCC 49814] gi|254041848|gb|ACT58643.1| von Willebrand factor type A [Hirschia baltica ATCC 49814] Length = 563 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 40/243 (16%), Positives = 88/243 (36%), Gaps = 24/243 (9%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPF----IFCTFPWCANSSHAPLLITSSV 159 + N+ ++ I + DY+ + ++PF PW + I Sbjct: 121 NDGNLPPKDAVRIEELINYFDYDYPIPASKDVPFATHVNVVPAPWAEGKQLMHVGIKG-Y 179 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + L++ +++DVS SMN DKL +A ++++ ++D + ++ VV +G Sbjct: 180 DLDRTEQPPLNLTLLVDVSGSMNHE-----DKLPLAKKALKLLIDKMDEDDHISVVVYAG 234 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + I ++ L G +T GL AY + + + + Sbjct: 235 AAGTVLEPTKGSE----KSKIFAALDNLSAGGSTAGGEGLRLAY-SLAEQNYDAASVNR- 288 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCAS 338 ++ LTDG+ + ++ + + G + +G D ++ A Sbjct: 289 -------VMLLTDGDFNVGVTSDERLEDFVARKRESGVYLSVLGFGRGNYNDAMMQKIAQ 341 Query: 339 PDR 341 Sbjct: 342 AGN 344 >gi|299529294|ref|ZP_07042734.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] gi|298722738|gb|EFI63655.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] Length = 1449 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 49/307 (15%), Positives = 98/307 (31%), Gaps = 23/307 (7%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 I S + I + L + T L+ + + + + + + T + Sbjct: 606 AISVSEALASGSGTFTITASAGLKSLT--LDGPDNADATLTLDRLADLVNNPVTLTTSKG 663 Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL 154 L G+ D + S + H + + +++ + Sbjct: 664 TLTLTGY--DATTGKVSYTYQTSGQQAHTGDDTNVQDHFQITVEDKFGGKATGDLGVLIT 721 Query: 155 ITSSVKISSKSDIGLD-----MMMVLDVSLSMNDHFG-----PGMDKLGVATRSIREMLD 204 T+ L +M+ LD S SMN G + +L V S+ +LD Sbjct: 722 DTAPSLKPIAESSALSSHGTNIMLTLDTSGSMNYGSGVYNGWTQLSRLAVLKSSVNNLLD 781 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 D VR +V F++ Q + + ++ L +G T L+ A N Sbjct: 782 KYGEAGD----VRVMIVEFNTSASQKGGGWMSLAEAKALVSGLGYGGGTNYQTALDTAMN 837 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ + + F TDGE +S+ ++++ + + Y IG Sbjct: 838 AWNNSG----TGKLEGGNVQNISYFFTDGEPDSNRSVNSAQQATWEKFLADNHINSYGIG 893 Query: 324 VQAEAAD 330 + A Sbjct: 894 LGTGATG 900 >gi|238789155|ref|ZP_04632943.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] gi|238722687|gb|EEQ14339.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] Length = 522 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 47/242 (19%), Positives = 93/242 (38%), Gaps = 24/242 (9%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 +SF + F N G+I + +LP+ ++ L E SH KA+L ++ + L A Sbjct: 9 LSFNKLFIFIKNENGAILLSFIFILPIFIGLVFLSFEISHFIQKKARLSDAIEQATL--A 66 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 + N E+ ++ +K S II + L F++ + ++I Sbjct: 67 LTVDNDESPDDDNIKKEKNSKFIIN-----YAKAYLPNEKFSKPV--------INITSHS 113 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + +Y + Y + F + + ++ S D++ V D S S Sbjct: 114 DYINYQVDMTIYYPTKILNKIFQTVSPEVSISDNARALKYTTTDSK-PTDVVFVADYSGS 172 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 MN++F + ++R + I++ NNV + I+ P +WG ++ Sbjct: 173 MNEYFDESDESDEKKIVALRRIFKDIQNEIKYNNV--------NIDIIGFVPFSWGTKNF 224 Query: 241 QE 242 Sbjct: 225 YS 226 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 39/124 (31%), Gaps = 20/124 (16%) Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 KI T + G+ N + + Y K +I ++DG++S Sbjct: 387 NTSISKILGTRATGGTLISSGILTGNNIFLE----------TDNSYNKLMIIISDGDDSR 436 Query: 298 PNIDNKESLFY---------CNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDRFYSVQN 347 K C + K G + I + D + C + FY +N Sbjct: 437 QTDKEKRYYNISKNLIKDGMCEKIKDNGIKMAFIAIGYVPREDIDWRRCVGEENFYFAKN 496 Query: 348 SRKL 351 + +L Sbjct: 497 AHEL 500 >gi|160858157|emb|CAP19998.1| collagen type VI alpha 5 [Homo sapiens] Length = 609 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/160 (23%), Positives = 69/160 (43%), Gaps = 21/160 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ + M I + ++K + V+ G + +S + Sbjct: 466 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 516 Query: 229 QTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L++A N +F EH ++ + K+ Sbjct: 517 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 571 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 +I +TDGE + D+ + E + +G ++A+GV Sbjct: 572 MLIVITDGE----SHDHDQLNDTALELRNKGITIFAVGVG 607 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 25/190 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ + ++ +L I+ D G+V FS Sbjct: 276 EDMKADIMFLVDSSWSIGNE------NFRKMKIFMKNLLTKIQIGADKTQ---IGVVQFS 326 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 K + F L + Q I + I+R+ T + L + H Sbjct: 327 DKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFT-------HSKGARL 379 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDG D L + + ++++GV Q + Sbjct: 380 GAKKFLILITDGVAQDDVRDPARIL------RGKDVTIFSVGVYNANRSQLEEISGDSSL 433 Query: 342 FYSVQNSRKL 351 + V+N L Sbjct: 434 VFHVENFDHL 443 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 87/266 (32%), Gaps = 29/266 (10%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +T L I+ + + S ++ F + + T + + + Sbjct: 29 ANNTQLEEIVSYPPEQTISTLKSYADLETYSTKFLKKLQNEIWSQISTYAEQRNLDKTGC 88 Query: 169 LD-----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D + ++D S S+ + + R + E+ ++ PD VR G+V + Sbjct: 89 VDTKEADIHFLIDGSSSIQEK------QFEQIKRFMLEVTEMFSIGPDK---VRVGVVQY 139 Query: 224 SSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F + I + N T + L+Y I + + Sbjct: 140 SDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK---- 195 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 Y+I LTDG ++ + + V+A+G+ A + + + Sbjct: 196 --VPCYLIVLTDGMSTD------RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEE 247 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQR 366 R QN L + +E+ ++ Sbjct: 248 RVSFGQNFDALKSIKNEVVREICAEK 273 >gi|94969532|ref|YP_591580.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551582|gb|ABF41506.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 430 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 83/226 (36%), Gaps = 35/226 (15%) Query: 156 TSSVKISS--KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 +I+S DI + + +V+D S SM D P ++ ++++K+ + Sbjct: 174 GEPQQITSFRHEDIPVALGVVIDNSGSMRDKR-PAVNAAT---------INLVKASNPED 223 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 V +V F+ V ++E + + T + A N Sbjct: 224 EVF---VVNFNDDYYLDQDYTDSVAKLKEALEKYETRGGTALYDAV-LASN--------- 270 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--- 330 H+ K KK + +TDGE+ + ++++ + G +Y IG+ E Sbjct: 271 AHLMKAPKLEKKVLFIVTDGEDDASLNTLEQTIRKVQQ--ENGPTIYTIGILDETGGHKR 328 Query: 331 ---QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + L+ A + + Q+ ++ +I ++ Q + K Sbjct: 329 RAQRALREMAESTGGVAFFPQSLDEVSRITQQIAHDIRNQYTISYK 374 >gi|45384318|ref|NP_990352.1| collagen alpha-1(XII) chain precursor [Gallus gallus] gi|2506307|sp|P13944|COCA1_CHICK RecName: Full=Collagen alpha-1(XII) chain; AltName: Full=Fibrochimerin; Flags: Precursor gi|222811|dbj|BAA00701.1| alpha 1 chain of type XII collagen [Gallus gallus] gi|2326442|emb|CAA43358.1| collagen type XII alpha 1 chain [Gallus gallus] Length = 3124 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 61/365 (16%), Positives = 131/365 (35%), Gaps = 53/365 (14%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 N G + I I+L V V + +++ + AT+I ++ Sbjct: 299 ANFDGIVDIQNEIILQVCSGV-------DEQLGELVSGEEVVEPASNLVATQISSKSVRI 351 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 + + ++ I + G ++ ++T+L++ +Y ++ Sbjct: 352 TWDPSTSQITGYRVQFI-------PMIAGGKQHVLSVGPQTTALNVKDLSPDTEYQINV- 403 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 Y M + + P T VK+ + G+D+ DV ++ + G+ Sbjct: 404 --YAMKGLTPSEPITIMEK------TQQVKVQVECSRGVDVKA--DVVFLVDGSYSIGIA 453 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF- 249 + ++ + P V+ LV +S F L +++ I + Sbjct: 454 NFVKVRAFLEVLVKSFEISPRK---VQISLVQYSRDPHMEFSLNR-YNRVKDIIQAINTF 509 Query: 250 ---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 G +T + + Y K+F ++ + + +I +TDG++S Sbjct: 510 PYRGGSTNTGKAMTYVREKVF----VTSKGSRPNVP--RVMILITDGKSSDA------FK 557 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMV 363 + + ++A+GV+ +A L+ ASP Y+V++ DAF RI E+ Sbjct: 558 EPAIKLRDADVEIFAVGVK-DAVRTELEAIASPPAETHVYTVED----FDAFQRISFELT 612 Query: 364 KQRIL 368 + L Sbjct: 613 QSVCL 617 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/262 (14%), Positives = 95/262 (36%), Gaps = 35/262 (13%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 +++ + Y ++ ++ P + ++++ P L + ++ Sbjct: 1139 DSTVVLEELRAGTTYKVNVFGMFDGGESNPLVGQEMTTLSDTTTEPFLSRG---LECRTR 1195 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+++++D S S+ I ++++ PD V+ GL +S Sbjct: 1196 AEADIVLLVDGSWSIGRP------NFKTVRNFISRIVEVFDIGPDK---VQIGLAQYSGD 1246 Query: 227 IVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + L + + + + L + + G+ A + I K E + + Sbjct: 1247 PRTEWNLNAYRTKEALLDAVTNLPYKGG-NTLTGM--ALDFILKNNFKQEAGLRP--RAR 1301 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RF 342 K + +TDG++ + L + G +YAIG++ ++ + PD Sbjct: 1302 KIGVLITDGKSQDDVVTPSRRL------RDEGVELYAIGIKNADENELKQIATDPDDIHA 1355 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 Y+V + L IG+++ Sbjct: 1356 YNVADFSFL----ASIGEDVTT 1373 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 74/194 (38%), Gaps = 26/194 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + + M+ ++ + R G+V +SS Sbjct: 139 DLVFLVDGSWSVGRNNFRYI---------LDFMVALVSAFDIGEEKTRVGVVQYSSDTRT 189 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + I R+ + G T + ++Y + + + KG + K Sbjct: 190 EFNLNQYFRRSDLLDAIKRIPYKGGNTMTGEAIDYL---VKNTFTESAGARKG---FPKV 243 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I +TDG+ E + G V+++G++A A + + P ++ Sbjct: 244 AIVITDGKAQDEVEIPAR------ELRNIGVEVFSLGIKAADAKELKLIASQPSLKHVFN 297 Query: 345 VQNSRKLHDAFLRI 358 V N + D I Sbjct: 298 VANFDGIVDIQNEI 311 >gi|149636528|ref|XP_001511995.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ornithorhynchus anatinus] Length = 800 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 63/168 (37%), Gaps = 34/168 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM MD+L ++++ L I G+V F + + Sbjct: 307 VCLVLDKSGSMA-----AMDRLNRMNQAVKLFLLQITEKGSW-----VGIVLFDERAIIR 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL ++ ++ G T G++ A+ I + + Sbjct: 357 NPLIQIISEDDRNYLMTRLPE-AAGGGTSICSGVQAAFQAIKQKFQTTDGSE-------- 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF 332 I+ LTDGE+ ++ C E K+ GA ++ + + AA + Sbjct: 408 -IVLLTDGED--------VTVSSCFEEVKQSGATIHTVALGTSAAQEL 446 >gi|145587695|ref|NP_001038174.2| anthrax toxin receptor 2a [Danio rerio] gi|141796884|gb|AAI39637.1| Anthrax toxin receptor 2a [Danio rerio] Length = 478 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 76/208 (36%), Gaps = 30/208 (14%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + D+ VLD S S++ + D + T V+ +R + Sbjct: 28 TPSCHGAYDLYFVLDRSGSVSTDWSEIYDFVKNLTERF------------VSPNLRVSFI 75 Query: 222 TFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FSS+ PL I + ++ + T G++ A ++ +K Sbjct: 76 VFSSRAEIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEGIKLATEQMKKEPKKSSS--- 132 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 I+ LTDG+ ++ ++ + A++ GA VY +GV+ +Q S Sbjct: 133 -------IIVALTDGK--LETYIHQLTIDEADSARKYGARVYCVGVKDFDEEQLADVADS 183 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 ++ + V+ A I ++KQ Sbjct: 184 KEQVFPVKGG---FQALKGIVNSILKQS 208 >gi|89513613|gb|ABD74633.1| capillary morphogenesis protein 2A [Danio rerio] gi|122891370|emb|CAM13145.1| novel protein similar to vertebrate anthrax toxin receptor family protein [Danio rerio] Length = 478 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 76/208 (36%), Gaps = 30/208 (14%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + D+ VLD S S++ + D + T V+ +R + Sbjct: 28 TPSCHGAYDLYFVLDRSGSVSTDWSEIYDFVKNLTERF------------VSPNLRVSFI 75 Query: 222 TFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FSS+ PL I + ++ + T G++ A ++ +K Sbjct: 76 VFSSRAEIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEGIKLATEQMKKEPKKSSS--- 132 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 I+ LTDG+ ++ ++ + A++ GA VY +GV+ +Q S Sbjct: 133 -------IIVALTDGK--LETYIHQLTIDEADSARKYGARVYCVGVKDFDEEQLADVADS 183 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 ++ + V+ A I ++KQ Sbjct: 184 KEQVFPVKGG---FQALKGIVNSILKQS 208 >gi|328880283|emb|CCA53522.1| putative secreted protein [Streptomyces venezuelae ATCC 10712] Length = 424 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 73/214 (34%), Gaps = 36/214 (16%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 V S + +VLDVS SM G ++ A ++ E+LD + + Sbjct: 26 SATADEPVTKESPK-----VELVLDVSGSMRAKDIDGKSRMSAAKQAFNEVLDAV--PEE 78 Query: 212 VNNVVRSGLVTF-----------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 V +R+ + + ++ PL + + L T P L Sbjct: 79 VRLGIRTLGADYPGQDRKQGCKDTRQLYPVGPL--DRTEAKTAVASLAPTGWTPIGPALL 136 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + D + I+ +TDGE++ +D + AK ++ Sbjct: 137 GAAKDL------------EGGDATRRIVLITDGEDTCAPLDPCQVAREI-AAKGIHLVID 183 Query: 321 AIGVQAEAADQFLKNC---ASPDRFYSVQNSRKL 351 +G+ +A + C A+ + SV + L Sbjct: 184 TLGLVPDAKTRQQLTCIAEATGGTYTSVHRTEDL 217 >gi|108760371|ref|YP_628476.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464251|gb|ABF89436.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 422 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 77/210 (36%), Gaps = 28/210 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + + + +VLD S SMN KL A R+ E++ +K R + + + Sbjct: 44 RVPVSLALVLDRSGSMNGQ------KLADARRAATELVQRLKPED------RLAFIDYGT 91 Query: 226 KIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-AKEKLEHIAKGHD 281 + + + I+ L +T + L+ A N + +E A Sbjct: 92 DVRVQPSRRMTEEAREELLTLISGLQDDGSTNISGALDAAANALRPHMREYRVSRA---- 147 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I L+DG+ ++ + L + +R G V A+GV + + ++ A Sbjct: 148 ------ILLSDGQPTTGIVSEPGLLDQVRQLRRDGITVSALGVGRDYQETLMRGMAEQGG 201 Query: 342 FYS--VQNSRKLHDAFLRIGKEMVKQRILY 369 +S + +S +L + F R + Sbjct: 202 GFSGFIDDSARLAEVFSRELDQATSTVARM 231 >gi|2654431|gb|AAC01506.1| type XII collagen [Homo sapiens] Length = 517 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 101/265 (38%), Gaps = 38/265 (14%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 87 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 141 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM +V + + G+ + ++ + P+ V+ LV +S Sbjct: 142 DMFLV-------DGSYSIGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 191 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 192 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 245 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A D L+ ASP + Sbjct: 246 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVDSELEAIASPPAETHVF 298 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 299 TVED----FDAFQRISFELTQSICL 319 >gi|332216215|ref|XP_003257240.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Nomascus leucogenys] Length = 900 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ +Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEAIQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPDGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|332216213|ref|XP_003257239.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Nomascus leucogenys] Length = 930 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ +Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEAIQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPDGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|291399431|ref|XP_002716111.1| PREDICTED: matrilin 1, cartilage matrix protein [Oryctolagus cuniculus] Length = 497 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 81/207 (39%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + GLV +SS Sbjct: 273 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSDRLAQ---VGLVQYSSS 323 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 324 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 377 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 378 --QKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 429 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + + IGK++ K+ Sbjct: 430 EHYFYTADFKTITQ----IGKKLQKRI 452 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 71/172 (41%), Gaps = 21/172 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 41 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGVVNYASAVRQ 91 Query: 230 TFPL-AWGVQH-IQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL A G + + + + R+ + T + +++A K E + G K Sbjct: 92 EFPLRAHGSKAALLQAVRRIRPLATGTMTGLAIQFAITKALSDAEGGRVTSPG---ISKV 148 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 +I +TDG D E A+ G ++AIGV L+ AS Sbjct: 149 VIVVTDGRPQDSVRDVSE------RARASGVELFAIGVGGRVDKATLRQIAS 194 >gi|261251589|ref|ZP_05944163.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] gi|260938462|gb|EEX94450.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] Length = 396 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 53/131 (40%), Gaps = 4/131 (3%) Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI--AKGHDDYKKYIIF 289 PL + + ++ L T S GL + ++ +K + + D ++ ++ Sbjct: 252 PLTADLNQVTNAVDSLRTEGGTASYQGLIWGLRQLTPNWQKAWEVGPNRNFDKVERKLVL 311 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDR-FYSVQN 347 +TDG + + D + C+ AK G + +G A + CA +S N Sbjct: 312 MTDGADYGSHFDELINAGLCDRAKDYGVALNFVGFGVYGARLEQFTRCAGDANGVFSASN 371 Query: 348 SRKLHDAFLRI 358 +++L F ++ Sbjct: 372 TQELDSYFSQL 382 >gi|119628048|gb|EAX07643.1| matrilin 1, cartilage matrix protein, isoform CRA_b [Homo sapiens] Length = 519 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 272 SATDLVFLIDGSKSVRPE------NFELVKKFISQIVDTLDVSD---KLAQVGLVQYSSS 322 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 323 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 376 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 377 --QKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 428 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 429 EHYFYTADFKTINQ----IGKKLQKKI 451 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 41 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGMVNYASTVKQ 91 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V + + + R+ + T + +++A K F E D K Sbjct: 92 EFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSR---SPDISKV 148 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIGV + + + P Sbjct: 149 VIVVTDGRPQDSVQDVSA------RARASGVELFAIGVGSVDKATLRQIASEP 195 >gi|292490950|ref|YP_003526389.1| von Willebrand factor A [Nitrosococcus halophilus Nc4] gi|291579545|gb|ADE14002.1| Forkhead-associated protein [Nitrosococcus halophilus Nc4] Length = 510 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 78/224 (34%), Gaps = 35/224 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L +S +S + D++++L+ S SM ++ + + + ++ ++ Sbjct: 30 LWASSP--PASAAIEKTDIILLLENSQSMQENDPEFLT-----QKFVLNFINGLRGDSQ- 81 Query: 213 NNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIF 267 ++ F PL + +++L + S G+E A ++ Sbjct: 82 -----IAIILFDGGTSVAMPLVPVSQKTRDALVTNLDKLTYTGRFRNSAAGMERALYELK 136 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDG--ENSSPNIDNKESL----FYCNEAKRRGAIVYA 321 + +K II LT+G E D S + +EA G V+ Sbjct: 137 NFGRPEA---------EKAIILLTNGPIETGDEKRDRDFSRWMQEYLAHEAAEAGIKVFG 187 Query: 322 IGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 I A L+ A + +Y + L AF RI K + Sbjct: 188 IAFTEAADFHLLQILAHTTGGTYYRAPQAVDLQSAFNRIRKVIT 231 >gi|308462096|ref|XP_003093334.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] gi|308250345|gb|EFO94297.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] Length = 384 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 83/229 (36%), Gaps = 25/229 (10%) Query: 136 PFIFCTFP-WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 P + + +++ L + S++ LD++ V+D S M + ++ V Sbjct: 3 PLLILFLLIGVSTANYTALSYEERPCGTDISNLWLDVVAVVDNSKVMG---NGDLAQIAV 59 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA------WGVQHIQEKINRLI 248 I I SIP+ R GLVT++ L ++I +N + Sbjct: 60 LITEIFAESRIGTSIPNQPKTTRLGLVTYNWNATIQAGLDKFQSQQDVFENIFNALNSVS 119 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 S + GL A N + ++Y+K I+ +S N + Sbjct: 120 STSESYLANGLVAAENVL-------ARGPNRGNNYQKVIVLFAASYSSHSN-----PIAI 167 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF---LKNCASPDRFYSVQNSRKLHDA 354 + K+ G + +G F L ASP++ ++ ++ ++ D Sbjct: 168 ADRLKQAGITIITMGYNNVGDPNFYQNLAKIASPNKSFTEKSLSQIGDI 216 >gi|260575971|ref|ZP_05843966.1| von Willebrand factor type A [Rhodobacter sp. SW2] gi|259021897|gb|EEW25198.1| von Willebrand factor type A [Rhodobacter sp. SW2] Length = 670 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 81/249 (32%), Gaps = 34/249 (13%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + PW + + + + + L+++ ++D S SM D +KL Sbjct: 280 FRTAVTVMQTPWNPGTRLVRIGLQGRL-PALDDRPPLNLVFLIDTSGSMEDA-----NKL 333 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFG 250 + +S+R ML ++ + +V ++ + P I ++RL G Sbjct: 334 PLLKQSLRLML------AELRPEDQVAIVAYAGSAGEILPPTKAENADEILAALDRLGAG 387 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T GL AY G + + + TDG+ + D + Y Sbjct: 388 GSTAGAEGLALAYQV--------ARKMAGAGEVSRVL-LATDGDFNVGIDDPEGLTKYIA 438 Query: 311 EAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR----FYSVQNS------RKLHDAFLRIG 359 + + G + +G D ++ A + N +L A I Sbjct: 439 KQRDTGVYLSVLGFGRGNLDDATMQALAQNGNGTAAYIDTLNEARKVLVDQLTGALFPIA 498 Query: 360 KEMVKQRIL 368 ++ Q Sbjct: 499 DDVKIQVEW 507 >gi|123718334|emb|CAJ77150.1| procollagen type VI alpha 4 [Mus musculus] Length = 762 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 48/300 (16%), Positives = 97/300 (32%), Gaps = 26/300 (8%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 + L +IL+H A +Q + + I+ + R + Sbjct: 111 MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSGLTEDHIREPAEALRRAGILVYAIGV 170 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP-FIFCTFPWCANSSHAPLLITSSVKIS 162 + +S D + + +P P ++ T + Sbjct: 171 KDASQAELREISSSPKDNFTFFVPNF---PGLPGLAQKLRPELCSTLGKAAQYTERESPA 227 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ ++D S S G+ + + + + ++ V+ GLV Sbjct: 228 CSEASPADIVFLVDSSTS------IGLQNFQKVKHFLHSV---VSGLDVRSDQVQVGLVQ 278 Query: 223 FSSKIVQTFPLAWG--VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S I FPL + ++I L + T + LE + + E AK Sbjct: 279 YSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALE--FIRANSLTEMSGSRAKD 336 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + ++ +TDGE+S D ++ KR G V+ +G+ + + K P Sbjct: 337 GVP--QIVVLVTDGESSDEVQD------VADQLKRDGVFVFVVGINIQDVQELQKIANEP 388 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 56/149 (37%), Gaps = 13/149 (8%) Query: 198 SIREMLDIIK-SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTK 254 S+R L I+ S+ + +R GL +S F L+ + + + I L F Sbjct: 50 SVRNFLYILANSLQVGRDNIRVGLAQYSDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGN 109 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + A I + + ++ + + ++ G + +R Sbjct: 110 R---MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSG------LTEDHIREPAEALRR 160 Query: 315 RGAIVYAIGVQAEAADQFLKNCASP-DRF 342 G +VYAIGV+ + + + +SP D F Sbjct: 161 AGILVYAIGVKDASQAELREISSSPKDNF 189 >gi|84498072|ref|ZP_00996869.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] gi|84381572|gb|EAP97455.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] Length = 656 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 68/220 (30%), Gaps = 43/220 (19%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + ++++LD S SM G+ K+ A R++ + + ++PD GL Sbjct: 31 SDEPVPGKLLLMLDASGSMKAKDPSGLTKIEAAKRALTGV---VGALPDTAQ---VGLRV 84 Query: 223 FSSKIVQTFPL---------------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + +K+ + I + T L A + Sbjct: 85 YGAKVDGKGKPTPAACADTQLVHPIATLDKPKLTSTIAAIKALGETPIAHSLTEALKDLG 144 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQ 325 + ++ I+ ++DGE S + + G + +G Sbjct: 145 TSGKRN-------------IVLVSDGEESC----VPDPCPAITKLTAAGVDLQIDTVGFG 187 Query: 326 AEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEM 362 + C A +Y +++ L + ++ + Sbjct: 188 VNTKARAQLQCIAAAGKGTYYDAKDASALTTSLSKLSQRA 227 >gi|109000862|ref|XP_001094970.1| PREDICTED: cartilage matrix protein [Macaca mulatta] Length = 495 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 271 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSS 321 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 322 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 375 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 376 --QKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 427 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 428 EHYFYTADFKTINQ----IGKKLQKKI 450 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 65/173 (37%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 40 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGMVNYASTVKQ 90 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V + + + R+ + T + +++A K E D K Sbjct: 91 EFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKALSDAEGGRSR---SPDISKV 147 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIGV + + P Sbjct: 148 VIVVTDGRPQDSVQDVSA------RARASGVELFAIGVGRVDKATLRQIASEP 194 >gi|297154321|gb|ADI04033.1| hypothetical protein SBI_00912 [Streptomyces bingchenggensis BCW-1] Length = 423 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 78/211 (36%), Gaps = 37/211 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM GM ++ A ++ E++D + +V +R+ T+ + +T Sbjct: 41 VELVLDVSGSMRARDVDGMSRMAAAKQAFNEVIDAV--PDEVRLGIRTLGATYPGQDRKT 98 Query: 231 FPLAW---------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 L + + L T L A + Sbjct: 99 GCLDSKQLYPVGRVDRTEAKTAVATLRPTGWTPIGLALRGASKDLA------------GG 146 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-----VYAIGVQAEAADQFLKNC 336 D + I+ +TDGE+S D C+ A+ A V +G+ ++ + +C Sbjct: 147 DATRRIVLITDGEDSCGQPDP------CDVARELAAQGTHLVVDTLGLTLDSKVRDQLSC 200 Query: 337 ---ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A+ + +VQ++ +L ++ + Sbjct: 201 IAEATGGTYTAVQHTDQLSTRIKQLVRRAAD 231 >gi|302346570|ref|YP_003814868.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302151211|gb|ADK97472.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 331 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 72/215 (33%), Gaps = 28/215 (13%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 H+ +++ R + F+ S ++K G++ ++ LD+S SM Sbjct: 44 HQLSPMTSKRRGWIKFVLVELVLLLLILIIARPQVGSRIATNKEREGIETIIALDISNSM 103 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 +L + + +++ + GL+ F+ P+ + Sbjct: 104 LAEDVAP-SRLEKSKLLVENLMNKFSE-------DKIGLIVFAGDAFVQLPITSDYVSAK 155 Query: 242 EKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 ++ LI T L+ + N + K II +TDGE++ Sbjct: 156 MFLDNINPSLIGTQGTDIGKALQLSMNSFT-----------PNSKVGKAIILITDGEDNE 204 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +A+ +G V+ +GV + Sbjct: 205 GGAE-----EMAKQAQSKGIRVFILGVGSTEGATI 234 >gi|224052500|ref|XP_002194907.1| PREDICTED: similar to anthrax toxin receptor 1 [Taeniopygia guttata] Length = 537 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 42/183 (22%), Positives = 66/183 (36%), Gaps = 24/183 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ VLD S S+ +H+ + ++ ++R + FSS+ Sbjct: 24 GAFDLYFVLDKSGSVKNHWTEIYSFVESLAEKF------------ISPMLRMSFIVFSSR 71 Query: 227 IVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I+ ++ L + G T G + A E++ H G Sbjct: 72 GTTIMKLTENREAIRRGLDTLKEELPGGDTFMHEGFKRA-------NEQIYHETYGGVRT 124 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N A+ GAIVY +GV+ Q S D + Sbjct: 125 ASVIIALTDGELQDAQFYYAE--QEANRARSFGAIVYCVGVKDFNETQLSTIADSIDHVF 182 Query: 344 SVQ 346 V+ Sbjct: 183 PVK 185 >gi|260818477|ref|XP_002604409.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] gi|229289736|gb|EEN60420.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] Length = 192 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 78/210 (37%), Gaps = 29/210 (13%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + + +D+ VLD S S+ D DK+ +++ DI S Sbjct: 5 GPTTPAPICNALMDLFFVLDGSGSVTD---ANFDKMKQFAKNVVNAFDISASS------T 55 Query: 217 RSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKL 273 R G+V +S F L I+ +++ G T++ LE+A Sbjct: 56 RVGVVQYSDSNTLEFNLGDHADKPSTLAAIDSIVYQGGGTRTGSALEFA----------R 105 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + A + K +I +TDG+++ + N +G VYAIGV + Q L Sbjct: 106 VNAAWRGESVPKVMIVVTDGKSADSVTSS------ANNLASQGVDVYAIGVGNYRSTQLL 159 Query: 334 KNCASP-DRFYSVQNSRKLHDAFLRIGKEM 362 + A + + + L +I + + Sbjct: 160 EIAAGNQNNVIELTDFNALSAEIEQIAQAV 189 >gi|161529149|ref|YP_001582975.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] gi|160340450|gb|ABX13537.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] Length = 316 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 55/286 (19%), Positives = 101/286 (35%), Gaps = 53/286 (18%) Query: 96 LRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT--FPWCANSSHAPL 153 L ++ ++I + +SL I+ K R +PF+ P Sbjct: 23 LYSKYNSEKKDSIMKFSSLKIV----KKSVMGKNFLRKHLPFVLMMGILGLAIIGLANPQ 78 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + T SV+ G+++ +VLD S SM +L A +I ++ +K P N Sbjct: 79 IPTLSVE------NGINLSIVLDGSESMAATDYEP-TRLDAAKNAINNLI--LKMGPQHN 129 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEK 272 G+V F S L + I+ + G T GL + +K Sbjct: 130 ----VGVVLFESGATTVSYLTPDKEKSVNAISSIEQGLGATAIGDGLALGVDMASSIPDK 185 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------- 325 +I L+DG ++S + +E+ Y AK ++ IG+ Sbjct: 186 KG-----------VVILLSDGVHNSGLVTPEEATEY---AKINNVQIHTIGLGSIEPVFL 231 Query: 326 ----------AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 AE ++ L A + +Y + + L++ F+ + Sbjct: 232 RDDIYGEPQYAELDEETLVIIAQQTSGNYYKSLDEQTLNEIFVNLS 277 >gi|269125771|ref|YP_003299141.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310729|gb|ACY97103.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 601 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 50/304 (16%), Positives = 96/304 (31%), Gaps = 50/304 (16%) Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 N +K + +K + + ++GF R + Sbjct: 329 NWMDAEKRKVADEFLKFLHTERVQKRFTDHGFRDHQRRPGRHV-----TEANG------- 376 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-GPG 188 +P T + L++ K ++ ++++V+D S SM G G Sbjct: 377 ----LLPGEPKTTLSLPSDQILDLIL----KTWAELRKPANVLLVIDRSGSMQQTVPGTG 428 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-------KIVQTFPLAWGVQH-- 239 K +A + E L + V GL FS+ + PL + Sbjct: 429 KSKGDLAKEAAAEALAEFRGQDQV------GLWVFSAARRQGERDWQEVVPLGRMTEAHR 482 Query: 240 --IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 ++E++ L T AY K+ ++ A ++ +TDG+N Sbjct: 483 SLLRERLLGLTLSGGTGLYNTTAAAYEKMTGSRRGDAINA---------VVVMTDGKNER 533 Query: 298 P-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDA 354 P +D + + V+ IG +A L+ A+ Y + + D Sbjct: 534 PGGLDLDGLIAKLGARREESVRVFTIGYGEDADQNVLRRIAEAADGAAYDSSDPNTIGDI 593 Query: 355 FLRI 358 F + Sbjct: 594 FTEV 597 >gi|108763557|ref|YP_631764.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108467437|gb|ABF92622.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 592 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 75/220 (34%), Gaps = 23/220 (10%) Query: 143 PWCANSSHAPLLITSSVKISSKSD-IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 P ++ L + K+ S+S ++ ++D S SM+ DKL +A +I+ Sbjct: 217 PSPFDAKRHFLRVGVQGKVVSRSQRKPAHLVFLVDTSGSMHSE-----DKLPLAREAIKV 271 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 + + V V +G +++ V A + I ++ L G T G+E Sbjct: 272 AVKNLNENDTVAIVTYAG----NTRDVLPPTPATDAKSIHAALDSLTAGGGTAMGSGMEL 327 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVY 320 AY LTDG+ N N+ L ++ G + Sbjct: 328 AYRHAVKKASGSVVSRVVV---------LTDGDANIGRNVSANAMLDSIHKYTAEGVTLT 378 Query: 321 AIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 +G D ++ A + V + R+ F Sbjct: 379 TVGFGMGNYRDDLMEKLADKGNGNCFYVDSLREAKKVFET 418 >gi|56797865|emb|CAG27566.1| matrilin-4 [Danio rerio] Length = 726 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 36/207 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ ++D S S+ + + + +++D + R GLV +SS++ Sbjct: 493 IDLAPLIDGSKSVRPQ------NFELVKQFVNQVVDQLDVSA---KGTRVGLVQYSSRVR 543 Query: 229 QTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL+ + H +++I + + T + L++ F E K Sbjct: 544 TEFPLS--MYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEKN---I 598 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 + + TDG + + + +AK G +YA+GV D+ L+ AS Sbjct: 599 PRVGLVFTDGRSQD------DIQEWAKKAKEAGITMYAVGVGKAVEDE-LREIASDPVEK 651 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F+ + F I + ++ Sbjct: 652 HFFYSAD-------FTAISQIAENLKL 671 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 48/197 (24%), Positives = 78/197 (39%), Gaps = 29/197 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D++ ++D S S+ H M K M+DII + R G V +S Sbjct: 17 KSGPVDLVFIIDGSRSVRPHEFETMRKF---------MIDIIHELDIGLAATRIGAVQYS 67 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ F L + + + IN +I T + + YA N F A+E A+ + Sbjct: 68 SQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE----GARPNV 123 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + I+ TDG + + G +YA+GV A A L+ ASP Sbjct: 124 PHVAVIV--TDGRPQDRVAEVAAAARE------SGIEIYAVGV-ARADMTSLRAMASPPF 174 Query: 340 -DRFYSVQNSRKLHDAF 355 D + V++ L F Sbjct: 175 EDHVFLVESF-DLIHQF 190 >gi|118353826|ref|XP_001010178.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291945|gb|EAR89933.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 547 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 42/285 (14%), Positives = 101/285 (35%), Gaps = 37/285 (12%) Query: 92 FRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA 151 ++ G Q+ LS+ + HK + ++ + +P + Sbjct: 47 IEYNFQKMGVGQNKQKYNLDKGLSLDVKTLHKHFQFNSSASQSIPVMVSVKTLDKTEDSP 106 Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + ++ + ++ LD++ V+D S SM+ + + K + D Sbjct: 107 KTDLEAAKQDRLENRPNLDLVCVIDRSGSMSGNKIENVKKTLEYLLELLGEND------- 159 Query: 212 VNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R L+ F S + + L + ++ + IN + T G+E A+ + Sbjct: 160 -----RLCLIAFDSCVSRRCHLMKTNSSNKPNLIKIINEIHCHGGTNINSGMELAFRVL- 213 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + + + I L+DG++ ++ ++SL + +++ G ++ Sbjct: 214 --------KERKYYNPVSSIFLLSDGQDGGADLRVRQSLE--KHLSQECFTIHSFGFGSD 263 Query: 328 AADQFL-KNCA-SPDRFYSVQNSRKLHD--------AFLRIGKEM 362 + K C+ FY V+ ++ + F I +E+ Sbjct: 264 HDGPLMNKICSLKDGNFYYVEKINQVDEFFVDALGGLFSVIAQEI 308 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYSVQNSRKLHDAF 355 + ++ + C+ AK G IVY +G +A + + LK CAS D Y + ++ DAF Sbjct: 485 EASAKDQRTDHVCDAAKDEGIIVYTVGFEAPYSGRRVLKRCASSDSHYYDADGLEISDAF 544 Query: 356 LRIGKEMVKQRI 367 I + K R+ Sbjct: 545 TSIASSIRKLRL 556 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 53/367 (14%), Positives = 108/367 (29%), Gaps = 114/367 (31%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F G ++++ L + G+ ++ +A L Y +D ++L A Sbjct: 27 FAREEDGLMTVMALFLFLALVGAAGIGVDLMRYEQKRAALQYTMDRAVLAAA-------- 78 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + +Q + +T R+ L + G + ++ S+ + + +S Sbjct: 79 --DLDQQVSP----------ETVVRSYLEKAGLLEYLS--------SVTVQEGLGYRKVS 118 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGP 187 A + E+P + S + L I +++ IG +++ +VLDVS SMN + Sbjct: 119 ATATAELP-----THFMKLSGYDSLTI--PAASTAEESIGNVEISLVLDVSGSMNSNSRL 171 Query: 188 GMDKLGVATRSIREMLDIIK------SIPDVNNVVRSGL-----------------VTFS 224 + A + ML + SI V +G V F Sbjct: 172 Y-NLKNAAKEFVDHMLSATEPGTVSISIVPYATQVNAGADILSYYNVSTEHNYSHCVNFI 230 Query: 225 SKIVQTFPLA--------------------------------------WGVQHIQEKINR 246 L+ + I+ Sbjct: 231 DDEFSQPGLSRVTPLERTMHFDPFSYTKDPISTPVCPVRASTEILPFSNDQTVLNNYIDG 290 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK----------------KYIIFL 290 L T G ++ + + + + K +I + Sbjct: 291 LTGRGNTSIDIGTKWGVVMLDPGTQSVISGLISDNKVPASFQGRPSAYDSGDVLKVLIVM 350 Query: 291 TDGENSS 297 +DGEN++ Sbjct: 351 SDGENTN 357 >gi|226315298|ref|YP_002775194.1| hypothetical protein BBR47_57130 [Brevibacillus brevis NBRC 100599] gi|226098248|dbj|BAH46690.1| hypothetical membrane protein [Brevibacillus brevis NBRC 100599] Length = 424 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 93/237 (39%), Gaps = 41/237 (17%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++M + LL SS + +S ++ +++MVLD S SM D Sbjct: 80 SWKMLVPSTFLMVGVAALLLQLLYGSSFQQASGAN---NIVMVLDTSGSMQSS-----DP 131 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-WGVQHIQEKINRLIF- 249 ++ +M+ + S + +VTF + PL Q +++++ + + Sbjct: 132 DNQLFKAAADMVQRMDSDMN------IAVVTFHDQTNVLQPLTELSSQSVKDEVVKKLLQ 185 Query: 250 ----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 T+ L+ +++ + + ++ ++DG + +D + Sbjct: 186 FPRTDGGTRIDLALQAGLDQL-----------QANQMANSTVVLMSDGYSD---LDVPAA 231 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAAD--QFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L K+ IV+ +G+ AD L+ A + +++V+++ ++ F +I Sbjct: 232 LAP---YKQNQVIVHTVGMSQIDADGTALLQKIAAETGGSYFNVEHADQMTGIFGQI 285 >gi|39997259|ref|NP_953210.1| hypothetical protein GSU2161 [Geobacter sulfurreducens PCA] gi|39984149|gb|AAR35537.1| hypothetical protein GSU2161 [Geobacter sulfurreducens PCA] Length = 575 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 55/323 (17%), Positives = 112/323 (34%), Gaps = 40/323 (12%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR--------ENG 100 + +L + GN G+K ++ + R +NG Sbjct: 279 QESVRQALAEALSGKSVGSFGNVGEKLAELLCQNATESSFNGTAAAAPRLPTAQLLNQNG 338 Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 D++ + T+ + + + ++ + P + + T Sbjct: 339 GYDDLSALRVHTA---ALRARLQGLVQASKQKRSTPVSVGHRLDSRVLTRLRICDTRVFT 395 Query: 161 ISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + M+LD S SM + ++K+G+A+R+ + + SIP VR+ Sbjct: 396 RKEEKRAVNTAVCMLLDSSGSMGNT--TILNKMGIASRACFVAAEALFSIP----GVRTA 449 Query: 220 LVTFSSKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + TF FP+ +G + + N + T+ L +A+ ++ +E Sbjct: 450 IATFKGHDNHVFPMVNFGEKPDHSRFN-ITGSGGTRLGHALWWAWGELSLRRETR----- 503 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K I +DG+ D + + G V IG+Q + Q+L Sbjct: 504 ------KICIAFSDGDTG----DGPVTQAAIKRMREEGIEVIGIGIQDNSIKQYL----- 548 Query: 339 PDRFYSVQNSRKLHDAFLRIGKE 361 PD ++N + A L + +E Sbjct: 549 PDNHRIIKNLDQFTPALLELLRE 571 >gi|221042208|dbj|BAH12781.1| unnamed protein product [Homo sapiens] Length = 900 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 73/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEGRLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|8567336|ref|NP_059502.1| calcium-activated chloride channel regulator 1 precursor [Mus musculus] gi|81881572|sp|Q9D7Z6|CLCA1_MOUSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 3; Short=mCLCA3; AltName: Full=Protein gob-5; Flags: Precursor gi|3721912|dbj|BAA33743.1| gob-5 [Mus musculus] gi|15919901|dbj|BAB25815.2| unnamed protein product [Mus musculus] gi|74201990|dbj|BAE22995.1| unnamed protein product [Mus musculus] gi|109731429|gb|AAI16320.1| Chloride channel calcium activated 3 [Mus musculus] gi|109732845|gb|AAI16319.1| Chloride channel calcium activated 3 [Mus musculus] gi|148680065|gb|EDL12012.1| chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 70/199 (35%), Gaps = 39/199 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM + D+L ++ R L + V G+VTF S Sbjct: 308 VCLVLDKSGSMLND-----DRLNRMNQASRLFL-----LQTVEQGSWVGMVTFDSAAYVQ 357 Query: 231 FPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + + T GL A+ I Sbjct: 358 SELKQLNSGADRDLLIKHL-PTVSAGGTSICSGLRTAFTVIKKKYPTDGSE--------- 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C + K+ GAI++ + + AA + L + Sbjct: 408 -IVLLTDGEDN--------TISSCFDLVKQSGAIIHTVALGPAAAKELEQLSKMTGGLQT 458 Query: 343 YSVQNSRK--LHDAFLRIG 359 YS + L DAF + Sbjct: 459 YSSDQVQNNGLVDAFAALS 477 >gi|298207016|ref|YP_003715195.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] gi|83849650|gb|EAP87518.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] Length = 346 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 64/178 (35%), Gaps = 28/178 (15%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+D++ +DVS SM ++L + + + ++++ + S R Sbjct: 79 TKLETVKREGVDVVFAIDVSKSMLAEDVAP-NRLEKSQQLVTQIINSLAS-------DRV 130 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLE 274 G++ ++ P+ + + + T ++ A D ++ Sbjct: 131 GIIAYAGSAFPQLPITTDYASAKMFLQNMNTDMLSSQGTAINEAIQLAKTYYNDDEQTN- 189 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + ++DGE+ +S+ EA G ++ IGV + Sbjct: 190 ----------RVLFIISDGEDHEG-----DSVNIAEEASEEGIRIFTIGVGTTKGGRI 232 >gi|294670381|ref|ZP_06735263.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291307845|gb|EFE49088.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 553 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 40/257 (15%), Positives = 85/257 (33%), Gaps = 43/257 (16%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKI----------SSKSDIGLDMMMVLDVSLSMN 182 Y P P+ ++ K+ + K +++ ++D+S SM Sbjct: 145 YNYPLPTGGHPFAIHTQTIDSPWQHEAKLIKIGIQAQDLAKKELPPANLVFLVDISGSM- 203 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--I 240 +KL + +++R + + ++ V L+T++S P G I Sbjct: 204 ----NSPEKLPLVKKTLRILTEQLRPQDKVT------LITYASGEELVLPPTSGRNKDEI 253 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 IN+L G +T L+ AY + A K + I+ TDG+ + Sbjct: 254 LRAINKLQAGGSTAGESALKMAYEQAQKA------YVKNGINR---ILLATDGDFNVGVS 304 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------R 349 E ++ G + +G + ++ A + + N Sbjct: 305 STDALKSMVAEKRKSGISLTTLGFGTGNYNEDMMEQIADAGDGNYSYIDNEKEAKKVLQH 364 Query: 350 KLHDAFLRIGKEMVKQR 366 +L + +++ Q Sbjct: 365 QLTSTLATVAQDVKIQV 381 >gi|180654|gb|AAA63904.1| cartilage matrix protein [Homo sapiens] Length = 340 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 42/207 (20%), Positives = 84/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ L + + I +++D + + + GLV +SS Sbjct: 116 SATDLVFLIDGSKSVRPE------NLELVKKFISQIVDTLDVSD---KLAQVGLVQYSSS 166 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 167 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPG- 219 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 220 -AQKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 272 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 273 EHYFYTADFKTINQ----IGKKLQKKI 295 >gi|47230696|emb|CAF99889.1| unnamed protein product [Tetraodon nigroviridis] Length = 1031 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 83/218 (38%), Gaps = 42/218 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV----RSGLVTFS 224 +D+++VLD S S+ + + +L P + + G++ + Sbjct: 110 MDIVIVLDGSNSIYPWYE--------VQAFLINILQKFYIGPGQIQFLWFTGQVGVLQYG 161 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 K+V F L+ + ++E + R G T + G+ A ++ F + Sbjct: 162 EKVVHEFKLS-DYKSVEEVVKRARSINQRGGEETNTALGINVACSQAFKHGGRRG----- 215 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL 333 KK +I +TDGE S + D ++ + C + G YAI V + FL Sbjct: 216 ---AKKVMIVITDGE-SHDSADLQQVIKDCE---KDGITRYAIAVLGYYNRRGINPEAFL 268 Query: 334 ---KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G+ + Sbjct: 269 NEIKYIASDPDDKHFFNVTDEAALKDIVDALGERIFSL 306 >gi|301767170|ref|XP_002919035.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Ailuropoda melanoleuca] Length = 891 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 49/301 (16%), Positives = 107/301 (35%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + + + F + F Sbjct: 169 MYLKVQPKQLVKHFEIEA--DIYEPQGISTLDAEASFITNDFLGSALTKSFSGKKGRVSF 226 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + H D+ ++ E P AP + K Sbjct: 227 KPSMD--QQRSCPTCTDSLLHGDFIITYDVNRESPANVQIVNGYFVHFFAPQGLPVVPK- 283 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ V+DVS SM+ K+ ++ ++LD +K +N ++ SG V Sbjct: 284 --------SVVFVIDVSGSMHG------RKMEQTKDALLKILDDMKEEDYLNIILFSGDV 329 Query: 222 T-FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 T + +VQ P +Q + + + T GL A N + A+E+ + Sbjct: 330 TIWRDSLVQATPE--NIQEARTFVKNIHDQGMTNINDGLMRAINMLNKAREEH----RVP 383 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + +I LTDG+ + ++ A +Y +G FL++ A + Sbjct: 384 ERSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALEN 443 Query: 341 R 341 Sbjct: 444 H 444 >gi|260837282|ref|XP_002613634.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] gi|229299020|gb|EEN69643.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] Length = 1460 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 45/254 (17%), Positives = 92/254 (36%), Gaps = 29/254 (11%) Query: 88 WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP---- 143 + N+ A +N R + + H+ ++ + FP Sbjct: 127 FLQQIVNDSCATFTAGTGDNTVRPHVYDTMKSNIHEMSGVTWQYYGAKEGEYHQFPKSDR 186 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG--MDKLGVATRSIRE 201 C + H S K + +++V+DVS SM + G ++L +A ++ Sbjct: 187 SCEGNGHRFRNWYVSAASPKKKN----VVIVMDVSGSMREPHGVPEEQNRLNLAKQAALT 242 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL---------AWGVQHIQEKINRLIFGST 252 +LD + G+V+FS++ + +++ IN+ + + Sbjct: 243 VLDTLTPRDWA------GVVSFSARAKAPEGCLGDSLGEANPTNIGIMKDFINQRVPETI 296 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G + A+N F E + +D + IIFLTDG+ + + + + Sbjct: 297 TVYAEGFKKAFNMFF---ESKNKKPEQFEDCQNIIIFLTDGQPTDTYF-TLDDIVKGQDL 352 Query: 313 KRRGAIVYAIGVQA 326 R ++ G+ A Sbjct: 353 MERSVHIFTYGLGA 366 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 92/253 (36%), Gaps = 28/253 (11%) Query: 88 WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP---- 143 + N+ A + +N R I+ + H+ ++ + FP Sbjct: 1084 FLQQIINDSCATFTAGNGDNTVRPNVYDIMKSNIHEMSGVTWQYYGAKEGEYHQFPKNDR 1143 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-GMDKLGVATRSIREM 202 C + H S K + +++V+DVS SM + GP ++L +A ++ + Sbjct: 1144 SCEGNDHRFRNWYVSAASPKKKN----VVIVMDVSGSMREPPGPEEQNRLNLAKQAALTV 1199 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL---------AWGVQHIQEKINRLIFGSTT 253 LD + G+V+FS++ + +++ IN+ + + T Sbjct: 1200 LDTLTPRDWG------GVVSFSARAETPEGCLGDSLGEANPTNIGIMKDFINQRVPETIT 1253 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 G A++ E + +D IIFL+DG + + ++ E Sbjct: 1254 MYGVGFRKAFDMF---AEARNKKPEQFEDCYNIIIFLSDGSPTDKDFA-LNAITQGQELM 1309 Query: 314 RRGAIVYAIGVQA 326 R ++ G+ A Sbjct: 1310 DRSVYIFTYGLGA 1322 >gi|114047772|ref|YP_738322.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-7] gi|113889214|gb|ABI43265.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-7] Length = 755 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 76/188 (40%), Gaps = 28/188 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ S + ++ ++++V+D S SM D + A ++R L ++ N Sbjct: 363 VEASEQLNLPRELILVIDTSGSMAG------DSIIQAKNALRYALRGLRPQDSFN----- 411 Query: 219 GLVTFSSKI--VQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ F+S + + PL A + ++ +NRL T+ L A + Sbjct: 412 -IIEFNSDVSLLSPTPLPATATNLAMARQFVNRLQADGGTEMAQALNAAL------PRQA 464 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + A G D + +IF+TDG S ++ N+ ++ +G+ + F+ Sbjct: 465 FNTASGEDKSLRQVIFMTDG---SVGNESALFELIRNQIGDN--RLFTVGIGSAPNSHFM 519 Query: 334 KNCASPDR 341 + A R Sbjct: 520 QRAAELGR 527 >gi|116624267|ref|YP_826423.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227429|gb|ABJ86138.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 306 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 86/221 (38%), Gaps = 35/221 (15%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 +K+ + D+ + + +V+D S SM++ D++ A L ++K+ + V Sbjct: 74 QEIKVFRQEDVPISLGLVIDTSASMSNKR----DRVNSAA------LAMVKASNPEDEVF 123 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +++FS + T V+ ++ + +L T L + + K Sbjct: 124 ---VISFSEEAFITQDFTSDVKQLESSLRKLGSKGETAMRDALSLGLDHLRAPARKD--- 177 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQFL 333 KK ++ +TDGE++S + + A I+Y IG+ A A+ Q Sbjct: 178 -------KKVLVVITDGEDNSSIQKQENLIR---AAHLSNVIIYGIGLLAAEAPASAQRA 227 Query: 334 KNC------ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 K A+ R + +N + I E+ Q ++ Sbjct: 228 KASLDVLTLATGGRSWYPENVADIEKITPEIAHEIRNQYVI 268 >gi|73950493|ref|XP_544451.2| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Canis familiaris] Length = 562 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 81/203 (39%), Gaps = 30/203 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 227 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSS 277 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 278 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 331 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 332 --QKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 383 Query: 340 DRFYSVQNSRKLHDAFLRIGKEM 362 + ++ + + ++ R+ K++ Sbjct: 384 EHYFYTADFKTINQIGKRLQKKI 406 >gi|116623628|ref|YP_825784.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] gi|116226790|gb|ABJ85499.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] Length = 543 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 49/306 (16%), Positives = 102/306 (33%), Gaps = 56/306 (18%) Query: 13 CKGSISILTAILLPVIFI-VMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNN 71 GS+ +L +LLP I I ++GL I+ S V+ +L +D + + + Sbjct: 11 KSGSVMVLITLLLPSIMIPLVGLAIDASVARLVQLRLQAAVDGAAMGAGRLL---GTPAV 67 Query: 72 GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS 131 + +F + + A D+++ T + + Sbjct: 68 PETLAAEFLASNFRT-------DGSAGTWGAHDLHSTIVYTP--------GITKIIDIDA 112 Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM-D 190 ++P +F +++ + +M+V+D S +M+ G G+ Sbjct: 113 TAQVPLLFLRILGKTSATVRARGSG--------TRTDSRVMLVIDRSGTMDVSDGTGLPT 164 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA---------------- 234 ++ A + + I + + + + GLV F +P + Sbjct: 165 RIENAKTVAQTLF--IPAFTEGADEI--GLVAFDGSAYVAYPPSQPGWDPTTTSSSRGGP 220 Query: 235 -------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 ++ ++N + GS T + L AY ++ A K I Sbjct: 221 DMYFKDPNNPNNMINQVNAIDAGSYTGTAEALWMAYIELQKAHLKDLAQDGVDLRMNS-I 279 Query: 288 IFLTDG 293 + LTDG Sbjct: 280 LLLTDG 285 >gi|332831234|ref|XP_528236.3| PREDICTED: collagen alpha-1(XXII) chain, partial [Pan troglodytes] Length = 695 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 47/206 (22%), Positives = 79/206 (38%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 88 Query: 230 TFPLA-WGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + ++ RL + G T + L Y + F + YK+ Sbjct: 89 AFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPR---AGGRPRDRAYKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-EALKEELEEIASEPKSAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FNAIDKIRGKLRRRL 217 >gi|326435505|gb|EGD81075.1| hypothetical protein PTSG_11020 [Salpingoeca sp. ATCC 50818] Length = 552 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 73/201 (36%), Gaps = 27/201 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + +D++ V+DVS SM+ KL +A ++ + I +++ GLV + Sbjct: 60 AARGAVDIVAVIDVSGSMSGS------KLDLAKATLEFL------IKNLSQTDHMGLVVY 107 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S + FPL G + ++ L T + GL + + ++ Sbjct: 108 HSDVSVAFPLTRMDAEGKRTATAALSTLRAQRCTNLSGGLFKGIEMMQGRERSAASVSS- 166 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQFLKNCA- 337 ++ +TDG + +Y G + ++ LK+ + Sbjct: 167 -------VLLMTDGIANEGVRGPNLITATRQLMGDNPSYSLYTFGYGSNHEEELLKDLSE 219 Query: 338 -SPDRFYSVQNSRKLHDAFLR 357 +Y ++N+ + ++F Sbjct: 220 VGNGMYYYIENNDTIPESFGD 240 >gi|229490509|ref|ZP_04384348.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229322588|gb|EEN88370.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 684 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 81/237 (34%), Gaps = 46/237 (19%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 C P + + T + S + L +MV+D+S SMND+ G +KL A +S+ Sbjct: 19 CLLPVSSAAQT---PTTPPTRAVSAAPAAL--LMVMDLSGSMNDNDANGKNKLTGAKQSL 73 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----------GVQHIQEKINRLIF 249 ++ S GL T+ + P ++ + +++ L Sbjct: 74 SRIVGDTASSSTP-----LGLWTYPTAGSNCDPGSFLAGADGGVRKDTDTLMAQVSGLKA 128 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T + P L + + + ++ ++DGE++ C Sbjct: 129 DGGTPTGPALRASVDSLKANGITTAT-----------VVLISDGESNCGQA-------PC 170 Query: 310 NEAK---RRG--AIVYAIGVQAEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRI 358 + AK G V A+G Q + C + R+ + + ++ + Sbjct: 171 DTAKQIVAEGFDVTVEALGFQLSGQGRTELECIASTTGGRYSDIADVDEMQKRLKEL 227 >gi|48374067|ref|NP_001001537.1| inter-alpha-trypsin inhibitor heavy chain H4 [Sus scrofa] gi|3024051|sp|P79263|ITIH4_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Major acute phase protein; Short=MAP; Flags: Precursor gi|1836014|gb|AAB46821.1| IHRP [Sus scrofa] gi|4096979|gb|AAD00024.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Sus scrofa] gi|1588326|prf||2208343A inter-alpha-trypsin inhibitor Length = 921 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 73/205 (35%), Gaps = 25/205 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM K+ ++ ++L + S N LV+FS + + Sbjct: 273 VIFVIDTSGSMRG------RKIQQTREALIKILGDLGSRDQFN------LVSFSGEAPRR 320 Query: 231 FPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 +A V+ + + T + A + A + A+ +I Sbjct: 321 RAVAASAENVEEAKSYAAEIHAQGGTNINDAMLMAVQLLERANREELLPARSVT----FI 376 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR------ 341 I LTDG+ + + + EA ++ +G + FL+ A + Sbjct: 377 ILLTDGDPTVGETNPSKIQKNVREAIDGQHSLFCLGFGFDVPYAFLEKMALENGGLARRI 436 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ +L D + + +++ Sbjct: 437 YEDSDSALQLEDFYQEVANPLLRLV 461 >gi|94970371|ref|YP_592419.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94552421|gb|ABF42345.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 356 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 75/208 (36%), Gaps = 27/208 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++++ L + +++D S S+ D+ S E L+ I V + F Sbjct: 129 ETNLPLRVGLLIDSSNSIR-------DRFKFEQESAIEFLNQIIRPKFDKAFV----IGF 177 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + T + + + L G T + YA G+ Sbjct: 178 DTTAEVTQDFTDDTDLLGKGVRMLRPGGGTAMYDAIYYA-------CRDKLLKENGNTAM 230 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA----ADQFLKNC--A 337 +K +I L+DGE++ + +E++ A+ I+YAI D+ L+ A Sbjct: 231 RKAMILLSDGEDNQSRVTREEAVEMAQRAE---VIIYAISTNTSGLKLRGDKVLERFAEA 287 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + R + + +AF I E+ Q Sbjct: 288 TGGRAFFPFKISDVANAFSEIQDELRSQ 315 >gi|291398583|ref|XP_002715574.1| PREDICTED: chloride channel accessory 1-like [Oryctolagus cuniculus] Length = 911 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 74/201 (36%), Gaps = 39/201 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ ++L ++ + L + + G+VTF S Sbjct: 307 VCLVLDKSGSMSSD-----NRLNRLNQAGKLFL-----LQTIEQGSWVGMVTFDSAAQVR 356 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + + + T GL A++ I Sbjct: 357 SELRQIKSGTDREALTKSL-PTVPSGGTSICSGLRVAFSVIKKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C +E ++ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TIKVCFDEVRQSGAIIHTVALGPSAALELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRIGKE 361 Y+ + L DAF + E Sbjct: 458 YASDQVQNNGLIDAFGALSSE 478 >gi|198433657|ref|XP_002122417.1| PREDICTED: similar to PK-120 [Ciona intestinalis] Length = 864 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 74/210 (35%), Gaps = 28/210 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+DVS SM+ H K+ ++R +LD + I N ++TFSS Sbjct: 300 VVFVIDVSGSMSGH------KIVQTKEALRTILDDLNEIDQFN------IITFSSTTNVW 347 Query: 231 FPLA------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P +++ ++ + + T A E + + Sbjct: 348 HPNEMVDVNPTNIRNAKKHVRSMYARGGTN----FNAAALDGIQLLETISSNRTNTLEEA 403 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + N+ E ++ +G +FL AS ++ Sbjct: 404 SMMILLTDGQPTVGVTGNEAIRRNIRERVNGRYSIFCLGFGQHLDHEFLDQIASENKGLS 463 Query: 342 ---FYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + + +L D + + ++ I+ Sbjct: 464 RKIYNDADAALQLKDFYDEVASPLLAHVIM 493 >gi|332254526|ref|XP_003276380.1| PREDICTED: cartilage matrix protein [Nomascus leucogenys] Length = 496 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 272 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSS 322 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 323 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPG- 375 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 376 -AQKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 428 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 429 EHYFYTADFKTINQ----IGKKLQKKI 451 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 65/173 (37%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + +++ + P N R G+V ++S + Q Sbjct: 41 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIQSLDVGP---NATRVGMVNYASTVKQ 91 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V + + + R+ + T + +++A K F E D K Sbjct: 92 EFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSR---SPDISKV 148 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIGV + + P Sbjct: 149 VIVVTDGRPQDSVQDVSA------RARASGVELFAIGVGRVDKATLRQIASEP 195 >gi|119569133|gb|EAW48748.1| collagen, type XII, alpha 1, isoform CRA_a [Homo sapiens] Length = 821 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+LS+ +Y +S + M + + P P+ + Sbjct: 385 QTTTLSVRDLSADTEYQISVSA---MKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA-- 439 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G+ + ++ + P+ V+ LV +S Sbjct: 440 DIVFLVDGSYS------IGIANFVKVRAFLEVLVKSFEISPNR---VQISLVQYSRDPHT 490 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I E IN + G +T + + Y KIF + + K Sbjct: 491 EFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSR------SNVPKV 544 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S D + + ++A+GV+ +A L+ ASP + Sbjct: 545 MILITDGKSSDAFRDP------AIKLRNSDVEIFAVGVK-DAVRSELEAIASPPAETHVF 597 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 598 TVED----FDAFQRISFELTQSICL 618 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 73/198 (36%), Gaps = 26/198 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + A ++ + R G+V +SS Sbjct: 140 DLVFLVDGSWSVGRNNFKYILDFIAA---------LVSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I ++ + G T + ++Y F + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGAR------VGFPKV 244 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG++ E + G V+++G++A A + + ++P + ++ Sbjct: 245 AIIITDGKSQDEVEIPAR------ELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 345 VQNSRKLHDAFLRIGKEM 362 V N + D I ++ Sbjct: 299 VANFDAIVDIQNEIISQV 316 >gi|326789709|ref|YP_004307530.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540473|gb|ADZ82332.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 593 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 98/235 (41%), Gaps = 36/235 (15%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 LL+ S+ +++ SD LD ++V+D S SM + + ++ +D++ Sbjct: 14 AIMLLLFPSMLMAATSDAQLDAILVIDASGSMKETDPNKLGL-----EGVKLFVDMLGLT 68 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFG-STTKSTPGLEYAY 263 + + G+VT+ S + QT+P++ ++I+ ++ + T T GL+ A Sbjct: 69 DN-----QVGVVTYGSDVSQTYPMSLVKNQSDKENIKNFVDGITRDLEYTDITSGLKEAV 123 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES-------LFYCNEAKRRG 316 + + A G+ I+ TDG N+ + N+ ++A+ G Sbjct: 124 KML------NQRNASGNSPL---IVVFTDGNNAIGGVANRTPADIDKDLAAIISQAQSEG 174 Query: 317 AIVYAIGVQAEA--ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +Y IG+ + +L+ + + + ++ ++ +L D I ++ Sbjct: 175 YPIYTIGLNDNGKLNEAYLEKISVDTKAKAFATKDPAELPDILTEIFAAHSNLKV 229 >gi|308511201|ref|XP_003117783.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] gi|308238429|gb|EFO82381.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] Length = 566 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 72/211 (34%), Gaps = 35/211 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 I + D D+ ++ D S S+ +F + ++ +P N Sbjct: 375 IVQKTEQLPVKDCRYDIGIIFDSSGSLEKNFQKQLKFATTL----------VEQMPISPN 424 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDA 269 R ++ F+ K + + ++ +I F TT + L+ + + Sbjct: 425 ATRVAIIQFAGKTKLRVLADFAQKKSAAELKTIIGRSHFFSGTTFTNGALKTMADLFQKS 484 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 K + ++ TDG + +++ K +G +VY +G+ E + Sbjct: 485 KRADAKLK---------VVLFTDGYS------AEDTSEGAEALKSQGVVVYTVGISTEKS 529 Query: 330 -----DQFLKNCASPDRFYSVQNSRKLHDAF 355 + SP+ F++ + +L F Sbjct: 530 TGLNMKELHGMATSPNHFFNASDFVELSKNF 560 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 55/159 (34%), Gaps = 25/159 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D D++ ++D + S+ F + + ++++ + P V+ R G + +SS Sbjct: 158 DCPSDVIFIIDATSSVRTFFEQYV-------HFVEKVIEGLDIQPSVD---RVGAIVYSS 207 Query: 226 KIVQTFPLAWGVQH----IQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 Q ++ G + + L F S T + L++A N H Sbjct: 208 AHKQRVKISLGEHKDKGSLIAAVESLPFFSGITATGEALKFAAN----------HTEGRR 257 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + + LTDG + L +K + Sbjct: 258 QNLTLTFVVLTDGYSYDLIESGARLLREVPNSKVYAVTI 296 >gi|294673503|ref|YP_003574119.1| BatB/BatC protein [Prevotella ruminicola 23] gi|294471951|gb|ADE81340.1| putative BatB/BatC protein [Prevotella ruminicola 23] Length = 566 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 31/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + P T KIS++ G++ ++ +D+S SM +L + Sbjct: 61 ILQGALALLVVMLARPQFGT---KISNEQRTGIETIIAMDISNSMLAEDITP-SRLDRSK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGST 252 + ++D + + GL+ F+ P+ + ++ ++ Sbjct: 117 MMVENLVDHFTN-------DKIGLLVFAGDAFVQLPITSDYVSAKMFLSSIDPSMMATQG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T ++ A H + K II +TDGE+ +L A Sbjct: 170 TDIARAIDMAT-----------HSFTQEEGIGKAIIVITDGEDHEG-----GALESAEAA 213 Query: 313 KRRGAIVYAIGVQAEAADQF 332 K+ G VY +GV + Sbjct: 214 KKAGMRVYVLGVGSTQGAPI 233 >gi|281338026|gb|EFB13610.1| hypothetical protein PANDA_007565 [Ailuropoda melanoleuca] Length = 854 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 49/301 (16%), Positives = 107/301 (35%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + + + F + F Sbjct: 134 MYLKVQPKQLVKHFEIEA--DIYEPQGISTLDAEASFITNDFLGSALTKSFSGKKGRVSF 191 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + H D+ ++ E P AP + K Sbjct: 192 KPSMD--QQRSCPTCTDSLLHGDFIITYDVNRESPANVQIVNGYFVHFFAPQGLPVVPK- 248 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ V+DVS SM+ K+ ++ ++LD +K +N ++ SG V Sbjct: 249 --------SVVFVIDVSGSMHG------RKMEQTKDALLKILDDMKEEDYLNIILFSGDV 294 Query: 222 T-FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 T + +VQ P +Q + + + T GL A N + A+E+ + Sbjct: 295 TIWRDSLVQATPE--NIQEARTFVKNIHDQGMTNINDGLMRAINMLNKAREEH----RVP 348 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + +I LTDG+ + ++ A +Y +G FL++ A + Sbjct: 349 ERSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALEN 408 Query: 341 R 341 Sbjct: 409 H 409 >gi|326504464|dbj|BAJ91064.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 720 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 74/223 (33%), Gaps = 35/223 (15%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P N L + + +GLD++ VLDVS SM + D+LG ++ + Sbjct: 39 PLHENRQQVLLEVIDASSAGGDRRLGLDLVAVLDVSKSMRKN-----DRLGKMKTAMHFV 93 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPG 258 ++ + + R +V FS + + PL + ++ L T G Sbjct: 94 INKLGHMD------RLSIVKFSEEAERLCPLLSVTPPNKARLNHLVDGLQVIDPTNIRDG 147 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 LE + + G I L+DG+ + + + Sbjct: 148 LEAGLSVL------AGRRITGGRVAS--IFLLSDGDENRGHATTVDV---------SDVP 190 Query: 319 VYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLRI 358 VY G + + L A F V + + + F +I Sbjct: 191 VYTFGFGTDYDPKVLDEIARRSKGGTFNFVDDEENMTEPFSQI 233 >gi|332860822|ref|XP_001152090.2| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Pan troglodytes] Length = 1312 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 36/206 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+DVS SM FG M++ +A I L +++FS + Sbjct: 284 VVFVIDVSSSM---FGTKMEQTKMAMNVILSDLQANDYFN---------IISFSDTVNV- 330 Query: 231 FPLAW-----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 W V ++ ++ + T L A + + + ++ Sbjct: 331 ----WKAGGSIQATIQNVHSAKDYLHCMEADGWTDINSALLAAASVLNHSNQEPGRGPSV 386 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 IIFLTDGE ++ L +A ++++ +A L+ + Sbjct: 387 GRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFTLLRRLSLE 444 Query: 340 DR------FYSVQNSRKLHDAFLRIG 359 +R + + +L + I Sbjct: 445 NRGIARRIYEDTDAALQLKGLYEEIS 470 >gi|293348732|ref|XP_001072793.2| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] gi|293360639|ref|XP_243609.5| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] Length = 1613 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/206 (21%), Positives = 77/206 (37%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + P R G+V +S + Sbjct: 47 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGP---GHTRVGVVRYSDRPTT 97 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + ++ R+ + G T + L Y ++ F A G+ +K+ Sbjct: 98 AFELGHFSSREEVKAAARRITYHGGNTNTGDALRYITSRSFSA---QAGGRPGNRAFKQV 154 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + G ++A+GV A A + L AS + Sbjct: 155 AILLTDGRSQDLVLDAAAAAH------AAGIRIFAVGVGA-ALKEELDEIASEPKSAHVF 207 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 208 HVSD-------FNAIDKIRGKLRRRL 226 >gi|224081306|ref|XP_002190595.1| PREDICTED: matrilin 1, cartilage matrix protein [Taeniopygia guttata] Length = 493 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 81/203 (39%), Gaps = 30/203 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 LD++ ++D S S+ + + I +++D ++ GLV +SS Sbjct: 268 GSALDLVFLIDGSKSVRPE------NFELVKKFINQIVDSLEVSDKQAQ---VGLVQYSS 318 Query: 226 KIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + Q FPL + I+ + ++ T + L+Y + D+ + A+ Sbjct: 319 SVRQEFPLGQFKSKKDIKAAVKKMSYMEKGTMTGQALKY----LVDSSFSAINGARPGVP 374 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 K I TDG + D +AK G ++A+GV D+ + + P + Sbjct: 375 --KVGIVFTDGRSQDYISD------AAKKAKDSGFRMFAVGVGNAVEDELREIASEPVAE 426 Query: 341 RFYSVQNSRKLHDAFLRIGKEMV 363 ++ + R + +IGK++ Sbjct: 427 HYFYTADFRTI----SKIGKKLQ 445 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 73/196 (37%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ H + + +++ + P N R G++ ++S + Sbjct: 39 DLVFIIDSSRSVRPH------EFEKIKVFVSRVIEALDVGP---NATRVGVINYASAVRN 89 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + + + R+ + T + +++A ++ F A E A +KK Sbjct: 90 ELSLQGPHSKAALLQAVRRIQPLSTGTMTGLAIQFAISRAFSAAEGGRGSAPN---FKKV 146 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG D A+ G ++AIGV + + P + Sbjct: 147 AIVVTDGRPQDGVQDVSA------RARAAGIEIFAIGVGRVDMGTLRQMASEPLDEHVDY 200 Query: 345 VQN---SRKLHDAFLR 357 V++ KL F Sbjct: 201 VESYSVIEKLTHKFQE 216 >gi|187939945|gb|ACD39081.1| hypothetical protein PACL_0293 [Pseudomonas aeruginosa] Length = 223 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 78/211 (36%), Gaps = 20/211 (9%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ L ++++ D S SM+ K+ + +++M+ + ++ Sbjct: 6 KFQVQTARPLPIIVLADTSGSMSVD-----GKIEALNKGLKDMISSFAGESRLRAEIQVS 60 Query: 220 LVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 ++TF PL H + L+ T L A I D + Sbjct: 61 VITFGGSLAELNLPLT--PAHQLQSFTPLVAEGMTPLGGALSLASEMIED------KDSI 112 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 YK I+ ++DG PN D + N + A +A+ + A+A + L + A Sbjct: 113 PSRAYKPVIVLVSDG---YPNDDWQGPFARLVNGERSSKATRFAMAIGADADEVMLSDFA 169 Query: 338 SPDR--FYSVQNSRKLHDAFLRIGKEMVKQR 366 + + +N+R +H F + + + Sbjct: 170 NDPEAPLFHAENARDIHRFFRAVTMSVSARS 200 >gi|269126610|ref|YP_003299980.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311568|gb|ACY97942.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 315 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 67/222 (30%), Gaps = 35/222 (15%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 VK+ + +M+ +DVSLSM M K R K I D+ Sbjct: 78 PVKVPRERAT---VMVAIDVSLSM-------MAKDVAPNRFEAAKAAAKKFIQDLPARFN 127 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G+V F+ + I+ L T + + I + Sbjct: 128 VGVVAFAGSANVVATPSGDRAAAISSIDTLTLAKRTAIGEAVFTSLQAI-----RSFDAQ 182 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 G D +I+ L+DG+N++ + + A+ V I + Sbjct: 183 AGQDPPPAHIVLLSDGDNTTG----RSVPEAIDAARAADVPVSTIAFGTPYGTVEIDGET 238 Query: 338 SP----------------DRFYSVQNSRKLHDAFLRIGKEMV 363 +P + Y ++ +L + IG + Sbjct: 239 TPVEVNKVTLAGLAQGTNGKAYEAADNDQLSQVYANIGTSLG 280 >gi|332291973|ref|YP_004430582.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170059|gb|AEE19314.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 344 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 57/180 (31%), Gaps = 32/180 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 K+ + G+D++ +DVS SM D ++K I L Sbjct: 80 TKLETVKREGVDVVFAIDVSKSMLAEDIAPNRIEKSKQLVTQIINNLGS----------D 129 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEK 272 R G++ ++ P+ + ++++ T +E A D ++ Sbjct: 130 RIGIIAYAGSAYPQLPITTDYSSAKLFLSQMNTDMLSSQGTAIGEAIELAKTYYNDEEQT 189 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + ++DGE+ N +A G ++ IGV Sbjct: 190 N-----------RVLFIISDGEDHVGESSN-----IAEQANDEGIRIFTIGVGKSEGGPI 233 >gi|311253580|ref|XP_003125597.1| PREDICTED: matrilin-2-like [Sus scrofa] Length = 423 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 80/206 (38%), Gaps = 29/206 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + + ++D + P R GL+ +S Sbjct: 110 TEGPVDLVFVIDGSKSLGEE------NFEIVKQFVTGIIDSLTISP---KAARVGLLQYS 160 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +++ F L + +++ + + G + + L++ + + F E + Sbjct: 161 TQVRTEFTLRNFGSAKDMKKAVASMKYMGKGSMTGLALKHMFERSFTQIEGARPL---SA 217 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 218 RVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASEPT 270 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I ++ K Sbjct: 271 DKHLFYAED----FSTMGEISDKLQK 292 >gi|311253578|ref|XP_001926459.2| PREDICTED: matrilin-2 [Sus scrofa] Length = 707 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 80/206 (38%), Gaps = 29/206 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + + + + ++D + P R GL+ +S Sbjct: 394 TEGPVDLVFVIDGSKSLGEE------NFEIVKQFVTGIIDSLTISP---KAARVGLLQYS 444 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +++ F L + +++ + + G + + L++ + + F E + Sbjct: 445 TQVRTEFTLRNFGSAKDMKKAVASMKYMGKGSMTGLALKHMFERSFTQIEGARPL---SA 501 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + I TDG + + ++AK G +YA+GV ++ L+ AS Sbjct: 502 RVPRVAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEE-LQEIASEPT 554 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I ++ K Sbjct: 555 DKHLFYAED----FSTMGEISDKLQK 576 >gi|308501643|ref|XP_003113006.1| CRE-MUA-3 protein [Caenorhabditis remanei] gi|308265307|gb|EFP09260.1| CRE-MUA-3 protein [Caenorhabditis remanei] Length = 3860 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 82/236 (34%), Gaps = 25/236 (10%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y +N++ P + + K D++ ++D S S+ + Sbjct: 1195 GYTCQCYSGFVDVSSNANLPPGRVCTVQTTCPKQKT--DLVFLIDGSGSIGSY------- 1245 Query: 192 LGVATRSIREML-DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI 248 V + + + ++ + R GL+ +S +I F L + + I+ Sbjct: 1246 --VFKNEVLRFVSEFVELFEIGRSKTRVGLIQYSDQIRHEFDLDQYGDRSSLLKGISETQ 1303 Query: 249 -FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T++ +++ + F + D + I LTDG + Sbjct: 1304 YLTGLTRTGAAIQHMVQEGFSERR---GARPQQSDIARVAIILTDGRSQDNVTGP----- 1355 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEM 362 + A++ +AIGV L++ A SP+R++ V + L + ++ Sbjct: 1356 -ADSARKLSINTFAIGVTDHVLASELESIAGSPNRWFYVDKFKDLDTRLRSMIQKA 1410 >gi|310824614|ref|YP_003956972.1| vault protein, inter-alpha-trypsin domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309397686|gb|ADO75145.1| Vault protein, inter-alpha-trypsin domain protein [Stigmatella aurantiaca DW4/3-1] Length = 749 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 47/285 (16%), Positives = 99/285 (34%), Gaps = 34/285 (11%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID-D 120 +++ N+ S +K+I+ ++ G A+ + +E+ + D D Sbjct: 176 RLVGGSAKNDFTFSAKVSSKVPLKSIYSPTHPMDVSRRGEAEAVVGLEQVNGADLSKDLD 235 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + AV + + P + AP S+ +I++K + V+D S S Sbjct: 236 LYFSVSDKAVGLSLLTYKQADEPGYFIALIAPKTEVSASEIAAKR-----VTFVIDTSGS 290 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAW---- 235 M ++ +A +++ + + D NVVR FS+ + FP L Sbjct: 291 MQGS------RMQIAKDALKYCVTRLNP-QDTFNVVR-----FSTDVEALFPALKSAQPE 338 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 +Q + +L T L G ++F+TDG+ Sbjct: 339 NIQKAVAFVEQLEAIGGTAIDEALVRGLQ-----------DNDGKSSAPHLLMFITDGQP 387 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + D + + ++ ++ GV + + L +S Sbjct: 388 TIGETDEGAIAQHAKDGRKAKTRLFTFGVGEDLNARLLDRLSSDG 432 >gi|256821501|ref|YP_003145464.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795040|gb|ACV25696.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 582 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 82/211 (38%), Gaps = 22/211 (10%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW +N+ + I + + +++ ++DVS SMN DKLG+ +S++ + Sbjct: 184 PWNSNAYLMEIGIKG-FEPEQQELPPSNLVYLIDVSGSMNSE-----DKLGLVKKSLKLL 237 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ VV +G +V I++ ++RL G +T G+E A Sbjct: 238 AQESSDQDRISIVVYAGASG----VVLEPTKGNDRMAIEQALDRLSAGGSTNGGAGIELA 293 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y A K + +I TDG+ + I+ ++ + + G + Sbjct: 294 YKLAEQA------FIKDGINR---VILATDGDFNVGTINREQLIDLVERKRESGISFTTL 344 Query: 323 GVQAEA-ADQFLKNCA--SPDRFYSVQNSRK 350 G + + ++ A + + + ++ Sbjct: 345 GFGSGNYNEHLMEQLADKGNGNYGYIDSLQE 375 >gi|310641811|ref|YP_003946569.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246761|gb|ADO56328.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 429 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 85/224 (37%), Gaps = 38/224 (16%) Query: 160 KISSKSDIGL-DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 +++ + + D+++V+D S SM P ++L A S+ +D K R Sbjct: 104 QVNPEGRKSVQDLVLVIDNSGSMQQT-DPDNERLTAAK-SLIGQMDGDK---------RV 152 Query: 219 GLVTFSSKIVQTFPLA-WGV----QHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKE 271 +V+F S P G Q + KI+ + T+ L+ +I E Sbjct: 153 AIVSFESTAQLVQPFTPIGTDAEKQAVYSKIDSMQTIMSGGTEIGLALDETIKEI----E 208 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAA 329 + KG +I L+DG ++ R + IG++ Sbjct: 209 TQGNAEKGSL-----VIMLSDG------FSELDTQTALAPYIARQIPINTIGLKLAESEG 257 Query: 330 DQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 L+N AS + +V N++ L AF +I ++ + ++ + Sbjct: 258 IALLQNIASLTGGTYSNVANAQGLTQAFGKIYNKIGDRTLVTER 301 >gi|189524901|ref|XP_693183.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Danio rerio] Length = 963 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 67/202 (33%), Gaps = 29/202 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +++ V+D S SM K+ ++ +L +++ GLVTF Sbjct: 332 QMPKNVVFVIDRSGSMMGE------KMKQTQEALTTIL------SELHEDDYFGLVTFDD 379 Query: 226 KIVQTFP-LAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I P L+ V +E + + S T G+ YA + + K Sbjct: 380 VIESWRPSLSKATPENVTEAKEYVQTINARSMTDINKGILYAVDMLTSEKSASFPNMS-- 437 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 II LTDG+ SS D + A ++ +G + L A + Sbjct: 438 -----MIILLTDGQPSSGEQDLSKIQENVRNAINGSMSLFCLGFGYDLDYILLDTLAKQN 492 Query: 341 -----RFYSVQNSRKLHDAFLR 357 R Y ++ F Sbjct: 493 DGLARRVYEASDAALQLQGFYE 514 >gi|254456981|ref|ZP_05070409.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] gi|207085773|gb|EDZ63057.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] Length = 229 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 82/214 (38%), Gaps = 26/214 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + ++++LDVS SM + +++ ML+ K + ++ ++TF Sbjct: 11 EEPKSIPVVLLLDVSYSMQGE------NIDTLNKAVESMLNSFKKAETMETFIKLSIITF 64 Query: 224 SSK--IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ + PL + + L +T + I D + Sbjct: 65 GSENGVDLHTPLT---EVSKIDFKPLTVSGSTPMGAAFKMGKAMIEDKDIFKGRDYRPT- 120 Query: 282 DYKKYIIFLTDGENSSPNIDNKESL-FYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCA 337 I+ L+DGE PN D ++ L + + + + A+ + A + F++ C Sbjct: 121 -----IVLLSDGE---PNDDWRQPLDDFVSTGRTKKCDRMALAIGAADKTVLNMFIEGC- 171 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + +++ + D F +I + ++ NK Sbjct: 172 -ENSLFYAEDAENIIDEFKKITMSVTQRTKSVNK 204 >gi|329849361|ref|ZP_08264207.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841272|gb|EGF90842.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 505 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 77/221 (34%), Gaps = 24/221 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + T PW NS + + + S+ +++ ++DVS SM++ DKL Sbjct: 118 FSITTEVTTTPWNPNSRLLRVGLRAYDVPRSERPAA-NLVFLIDVSGSMDEK-----DKL 171 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + ++R + D+ R +V ++ ++ + +L G + Sbjct: 172 PLVQHALRLV------ADDMRPRDRVSIVVYAGAAGIVLEPTANPAQVRRALGQLKAGGS 225 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G+ AY A +I TDG+ + D + + Sbjct: 226 TAGGEGIALAYATARAAYIDGGINR---------VILATDGDFNVGISDPEAIKDLVRKN 276 Query: 313 KRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRK 350 K G + A+G + ++ A + + ++ + Sbjct: 277 KDDGITLTALGFGTGNYNEALMEGIADVGNGNYAYIDSASE 317 >gi|38505728|ref|NP_942348.1| hypothetical protein slr7060 [Synechocystis sp. PCC 6803] gi|38423752|dbj|BAD01962.1| slr7060 [Synechocystis sp. PCC 6803] Length = 588 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 75/228 (32%), Gaps = 26/228 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 +P A + + IT + L++ V+D S SM H +K+ Sbjct: 9 IPLKNAVCSERAVTLDLIIRITPPSPPAMDQPRPSLNLGFVIDRSGSMEGH-----NKIT 63 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGS 251 A +++ +D + ++ + F ++ P L + + + G Sbjct: 64 YARQAVCYAIDQLSPGDHLSVTI------FDDQVQTLIPSTLVKDKAQFKRLVQGINPGG 117 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T G ++ + II L+DG + + + Sbjct: 118 CTDLHGGWLQGGIQVSQNLSAELNR----------IILLSDGLANRGETNPDIIATDVHG 167 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 +RGA +G+ + + L+ A +Y V ++ +L F R Sbjct: 168 LAQRGASTTTLGLGDDYNEDLLEAMARSGDGNYYYVADAEQLPTIFER 215 >gi|40805823|ref|NP_690848.1| collagen, type XXII, alpha 1 [Homo sapiens] gi|296434458|sp|Q8NFW1|COMA1_HUMAN RecName: Full=Collagen alpha-1(XXII) chain; Flags: Precursor gi|225000822|gb|AAI72420.1| Collagen, type XXII, alpha 1 [synthetic construct] Length = 1626 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 47/206 (22%), Positives = 78/206 (37%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 88 Query: 230 TFPLA-WGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + ++ RL + G T + L Y + F YK+ Sbjct: 89 AFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSP---HAGGRPRDRAYKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-EALKEELEEIASEPKSAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FNAIDKIRGKLRRRL 217 >gi|22652113|gb|AAN03620.1|AF406780_1 alpha 1 type XXII collagen [Homo sapiens] Length = 1626 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 47/206 (22%), Positives = 78/206 (37%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 88 Query: 230 TFPLA-WGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + ++ RL + G T + L Y + F YK+ Sbjct: 89 AFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSP---HAGGRPRDRAYKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-EALKEELEEIASEPKSAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FNAIDKIRGKLRRRL 217 >gi|326932831|ref|XP_003212516.1| PREDICTED: cartilage matrix protein-like [Meleagris gallopavo] Length = 493 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 81/204 (39%), Gaps = 30/204 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S LD++ ++D S S+ + + I ++++ ++ GLV +S Sbjct: 267 SGSALDLVFLIDGSKSVRPE------NFELVKKFINQIVESLEVSEKQAQ---VGLVQYS 317 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + Q FPL + I+ + ++ T + L+Y + D + + A+ Sbjct: 318 SSVRQEFPLGQFKNKKDIKAAVKKMAYMEKGTMTGQALKY----LVDNSFSIANGARPGV 373 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 374 P--KVGIVFTDGRSQDYITD------AAKKAKDLGFRMFAVGVGNAVEDELREIASEPVA 425 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMV 363 + ++ + R + IGK++ Sbjct: 426 EHYFYTADFRTI----SNIGKKLQ 445 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 71/196 (36%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + +++ + P N R G++ ++S + Sbjct: 39 DLVFIIDSSRSVRPQ------EFEKVKVFLSRVIEGLDVGP---NSTRVGVINYASAVKN 89 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + R+ + T + +++A ++ F E + K Sbjct: 90 EFSLKTHQTKAGLLQAVRRIEPLSTGTMTGLAIQFAISRAFSDAEGARLR---SSNINKV 146 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG D A++ G ++AIGV + + P D Sbjct: 147 AIVVTDGRPQDGVQDVSA------RARQAGIEIFAIGVGRVDMHTLRQIASEPLDDHVDY 200 Query: 345 VQN---SRKLHDAFLR 357 V++ KL F Sbjct: 201 VESYSVIEKLTHKFQE 216 >gi|219841908|gb|AAI44536.1| COL22A1 protein [Homo sapiens] Length = 1319 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 47/206 (22%), Positives = 78/206 (37%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 88 Query: 230 TFPLA-WGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + ++ RL + G T + L Y + F YK+ Sbjct: 89 AFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSP---HAGGRPRDRAYKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-EALKEELEEIASEPKSAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FNAIDKIRGKLRRRL 217 >gi|229523713|ref|ZP_04413118.1| hypothetical protein VCA_001283 [Vibrio cholerae bv. albensis VL426] gi|229337294|gb|EEO02311.1| hypothetical protein VCA_001283 [Vibrio cholerae bv. albensis VL426] Length = 886 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 62/181 (34%), Gaps = 16/181 (8%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKI--------SSKSDIGLDMMMVLDVSLSMNDHF 185 +P + L+I I S++ G ++ ++LDVS SM Sbjct: 231 NIPLEAKNAAGAIGTGRVTLVIEDDAPIAKDIFHMTESETKQGANVQLMLDVSGSMGRDA 290 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKI 244 G G +L V S ++++ +++ V+ L + + I L W V + I Sbjct: 291 GNGKTRLQVMKESAIQLIEQYQALGQTK--VQLILFSSDASIKTASGLLWMTVAEAKNYI 348 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 N L T ++ A + Y FL+DG + + K Sbjct: 349 NALSANGGTDYDDAIKLAQESWSGTINGQPLSGATNVSY-----FLSDGVPEGYDWELKN 403 Query: 305 S 305 S Sbjct: 404 S 404 >gi|119612600|gb|EAW92194.1| collagen, type XXII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1626 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 47/206 (22%), Positives = 78/206 (37%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 88 Query: 230 TFPLA-WGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + ++ RL + G T + L Y + F YK+ Sbjct: 89 AFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSP---HAGGRPRDRAYKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-EALKEELEEIASEPKSAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FNAIDKIRGKLRRRL 217 >gi|299136327|ref|ZP_07029511.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298602451|gb|EFI58605.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 376 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 80/212 (37%), Gaps = 28/212 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++++ L + ++LD S S+ F AT + ++L + R+ + F Sbjct: 150 QTNLPLRVGIMLDTSSSIRQRFEFEQ---QAATDFLLQVL---------HPADRAFVEGF 197 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +I + + I RL G T ++ ++ + + + Sbjct: 198 DVQINIAQDFTNRIDMLDTGIRRLRPGGGTAL-------FDSLYRTCKDQMLTLQQDAEV 250 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA----EAADQFLKNC--A 337 +K I+ ++DG++ + E++ C A+ IVY I + D L+ A Sbjct: 251 RKAIVLVSDGDDDYSRVLETEAIKMCQRAET---IVYTISTNVGPSRDKGDDVLQQISDA 307 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 + + + Q + F I +E+ Q +L Sbjct: 308 TGGQSFYPQRIDDVAIGFRNIEEELRSQYLLV 339 >gi|271968449|ref|YP_003342645.1| von Willebrand factor type A domain-containing protein [Streptosporangium roseum DSM 43021] gi|270511624|gb|ACZ89902.1| von Willebrand factor type A domain protein [Streptosporangium roseum DSM 43021] Length = 490 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 75/200 (37%), Gaps = 24/200 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ ++ V+DVS SM + +L + ++ +++D + V+ Sbjct: 139 KAEPEARRPANLTFVVDVSGSMGEP-----GRLDLVREALHKLVDQLGPGDQVS------ 187 Query: 220 LVTFSSKI--VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +V FS++ V + A G + I+RL +T GL Y + A Sbjct: 188 IVAFSTQARLVLSMTPATGRDQLHAAIDRLGVEDSTNLETGLTAGYAEAARAFRPAATNR 247 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +I L+DG ++ + + L E+ R + +GV + DQ ++ A Sbjct: 248 ---------VILLSDGLANTGDTTWQGILDRVAESAGRQITLLCVGVGRDYGDQLMEQLA 298 Query: 338 --SPDRFYSVQNSRKLHDAF 355 V ++ F Sbjct: 299 DNGDGAAVYVSSADDARKVF 318 >gi|195941904|ref|ZP_03087286.1| hypothetical protein Bbur8_03396 [Borrelia burgdorferi 80a] gi|312149118|gb|ADQ29189.1| von Willebrand factor type A domain protein [Borrelia burgdorferi N40] Length = 333 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 45/239 (18%), Positives = 88/239 (36%), Gaps = 31/239 (12%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + + S G D+++VLD+S SM Sbjct: 49 KDYRLNLIYFFTYSFLYLAAMVMVFALAGPSVSKKKMIHLS---AGADIVIVLDISPSMG 105 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + ++I+S GLV F+ P+ + + Sbjct: 106 AVEFSSKNRLEFSK-------ELIRSFISQRENDNIGLVAFAKDASIVVPITTDREFFNK 158 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 159 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 207 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP----DRFYSVQNSRKLHDA 354 I + + N A+ +Y+IG+ + S F V + L + Sbjct: 208 IYKDQVI---NLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVEI 263 >gi|310115525|ref|XP_003120135.1| PREDICTED: collagen alpha-1(XXII) chain-like [Homo sapiens] Length = 788 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 47/206 (22%), Positives = 78/206 (37%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + PD R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPDR---TRVGVVRYSDRPTT 88 Query: 230 TFPLA-WGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + ++ RL + G T + L Y + F YK+ Sbjct: 89 AFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSP---HAGGRPRDRAYKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + +D + R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLVLDAAAAAH------RAGIRIFAVGVG-EALKEELEEIASEPKSAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FNAIDKIRGKLRRRL 217 >gi|254436533|ref|ZP_05050027.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] gi|198251979|gb|EDY76293.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] Length = 613 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 40/174 (22%), Positives = 68/174 (39%), Gaps = 19/174 (10%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW A++ + I + + + L+++ ++D S SM DKL + +S R Sbjct: 228 TPWNADTQLVHIGIQGEM-PAIEDRPPLNLVFLIDTSGSME-----SADKLPLLRQSFRL 281 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 MLD ++ +V V +G + + + Q I +N L G +T GLE Sbjct: 282 MLDNLRPEDEVAIVTYAGSTSIALEPTQASE----RATIIAALNALNAGGSTNGQGGLEQ 337 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 AY L K D + I TDG+ + D + Y + + Sbjct: 338 AY--------ALAETMKTAGDVSRVI-LATDGDFNVGLSDPRGLQAYIEDKRDD 382 >gi|126153367|gb|AAI31711.1| MATN1 protein [Homo sapiens] Length = 480 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 256 SATDLVFLIDGSKSVRPE------NFELVKKFISQIVDTLDVSD---KLAQVGLVQYSSS 306 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 307 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 360 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 361 --QKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 412 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 413 EHYFYTADFKTINQ----IGKKLQKKI 435 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 25 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGMVNYASTVKQ 75 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V + + + R+ + T + +++A K F E D K Sbjct: 76 EFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSR---SPDISKV 132 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIGV + + + P Sbjct: 133 VIVVTDGRPQDSVQDVSA------RARASGVELFAIGVGSVDKATLRQIASEP 179 >gi|90418244|ref|ZP_01226156.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337916|gb|EAS51567.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 489 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 66/474 (13%), Positives = 133/474 (28%), Gaps = 114/474 (24%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILD----HSLLYTATK-- 62 F GSI ++TA++L + ++ G I+ +++ L LD + T T+ Sbjct: 15 FLRAKAGSIPVMTALMLVPMIVISGGAIDLIAHERLRSVLQDGLDRGVLAAASLTQTRPP 74 Query: 63 --------------------ILNQENGNNGKKQKNDFS--YRIIKNIWQTDFRNELRENG 100 + E N + + + + + D Sbjct: 75 RETIESFLKAAVTKGSYALDVKADELSNAKRVEASATAVTDTAFLRLIGIDKLTVEAHAE 134 Query: 101 FAQDINNIE------RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL 154 + NIE S S+ + + + Y P Sbjct: 135 AEEKRKNIEISLLLDMSGSMRFDKSGSYPGPSGAMRINYLRPAAKSFMDMVLADGAEDYT 194 Query: 155 ITSSVKISSKSDIGLDMMMVL-------DVSLSMND------------------HFGPGM 189 S V + + IG + L D S Sbjct: 195 TVSIVPYAGQVSIGPVLFDALARNRRQHDRSSCFQFGRNDFTLGVPDFANLPQTQHFTQA 254 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLV-------TFSSKIVQTFPLAWGVQHIQE 242 + ++ + PD + R G + L+ ++++ Sbjct: 255 NHHDALKKAGEAQITEPWWCPDDPHDPRPGTTPDFVAGEGKDTDRTSVSFLSNDREYLKR 314 Query: 243 KINRLIFGSTTKSTPGLEYA-----------------YNKIFDAKEKLEHIAKGHDDYK- 284 +I+ T + L++ Y + + + + + Sbjct: 315 QIDNYKLYDGTGTPIALKWGLLLLDPAIQPMLREAARYRALSEELDIDARFSNRPASFTD 374 Query: 285 ----KYIIFLTDGE------------------NSSPNID-------NKESLFYCNEAKRR 315 K+++ +TDG N S N D + + C AK++ Sbjct: 375 PDTMKFLVLMTDGAISSQRIPKDASKPVQYYNNGSLNTDLYSVGDAERFAAALCTAAKQK 434 Query: 316 GAIVYAIGVQA-EAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 IV+ IG + A + + NCAS + N+ + DAF I + K +++ Sbjct: 435 NVIVFTIGFDVNDTAAKQMSNCASGAERFYRVNALDIQDAFKSIATAIQKIKLI 488 >gi|301792481|ref|XP_002931207.1| PREDICTED: epithelial chloride channel protein-like [Ailuropoda melanoleuca] Length = 904 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 46/193 (23%), Positives = 71/193 (36%), Gaps = 36/193 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ L + + ++ II+ V G+VTF S Sbjct: 310 VCLVLDKSGSMSSED----RLLRMNQAAELYLIQIIEKGSLV------GMVTFESLATIL 359 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + I + R T GL + I + + Sbjct: 360 NYLTDIIGDNAYEKITANLPR-EASGGTSICSGLRAGFQAIIHSNQSTSGSE-------- 410 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 II LTDGE+ + C E K+ GA+++ I + AA + L N RF Sbjct: 411 -IILLTDGEDDQ--------ISSCFEEVKQSGAVIHTIALGPSAARELETLSNMTGGYRF 461 Query: 343 YSVQNSRKLHDAF 355 Y+ ++ L DAF Sbjct: 462 YANKDINGLTDAF 474 >gi|222616155|gb|EEE52287.1| hypothetical protein OsJ_34277 [Oryza sativa Japonica Group] Length = 367 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 45/216 (20%), Positives = 83/216 (38%), Gaps = 23/216 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIK-SIP 210 L + + K + +D++ VLDVS SM D D S ++L D +K I Sbjct: 30 LRVEAPPMADLKGHVPIDVVEVLDVSGSMGDPAMASSDFKKNKPPSRLDVLKDAMKFIIR 89 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + + R +V F+ + V+ + G + ++K++ L T P LE A Sbjct: 90 KLEDGDRLSIVAFNDRPVKEYSTGLLDISGNGRRIAEKKVDWLEGRGGTALMPALEEAIR 149 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + + +I+ LTDG+++S +++ + V+ G+ Sbjct: 150 VLDCRPGDSRNRVG-------FILLLTDGDDTSGFRWSRDVINGA----VGKYPVHTFGL 198 Query: 325 QAEAADQFLKNCASPDR----FYSVQNSRKLHDAFL 356 A + + L A R F +N K+ A Sbjct: 199 GAAHSSEALLYIAQESRGTYSFVDDENMDKIAGALA 234 >gi|32472883|ref|NP_865877.1| signal peptide [Rhodopirellula baltica SH 1] gi|32444120|emb|CAD73562.1| hypothetical protein-signal peptide and transmembrane prediction [Rhodopirellula baltica SH 1] Length = 434 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 58/426 (13%), Positives = 115/426 (26%), Gaps = 92/426 (21%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G I++L A +LP++ ++ I + VK +L D + ++ Sbjct: 19 RRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGRAFSEEQ--TVE 76 Query: 73 KKQKNDFSYRIIKNIW-------QTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + D NE AQ N R + D + Sbjct: 77 AAKAAARLTAAMNEVAGEPYQLNTDDSANEFEFGVSAQTDGNTGRFYFTKVPTSDVAANL 136 Query: 126 NLSAVSRYE-----------MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 + R +PFIF + + S V ++ + D+ +V Sbjct: 137 VAVSSVRINGKRTDDSLLGPVPFIF--------PNTFSIGDFSPVASATAMQVDRDISLV 188 Query: 175 LDVSLSMNDHFGP--------GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD S SM+ G D L A + L+ K +R V+++ Sbjct: 189 LDRSGSMDWKTYDWPDDADPWGEDSLISAEDAGIVDLE-WKYRNGQPQYIRR--VSYNRG 245 Query: 227 IVQTFPLAWGVQHI--------------------------QEKINRLIFGSTTKSTPGL- 259 + + + Q N + ++ S L Sbjct: 246 YDEYDLYDHAWEEVFGLGPAPNTPWEDLVLAVDAFLRVLDQTPQNEQVSIASYNSHGTLD 305 Query: 260 ---------------------EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 ++ + K ++ +TDG ++ Sbjct: 306 CWLLDDFDSVRAAVAQLAPNGSTGIGNGMNSGKTAFTHENARPYASKTMVVMTDGNHNYG 365 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFL 356 N + + + A + ++ A R Y + +L AF Sbjct: 366 TQPNTVAQQL---MSSSNLNIQTVTFGGGADQETMQEVAVTGLGRHYHADSGDELVSAFE 422 Query: 357 RIGKEM 362 I + Sbjct: 423 EIANNL 428 >gi|327274978|ref|XP_003222251.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2025 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 56/311 (18%), Positives = 104/311 (33%), Gaps = 28/311 (9%) Query: 48 LHYILDHSLLYTAT----KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 + + D SL+ + T + KQ +I+ I + +E+ Sbjct: 693 IQAVEDMSLIGSTTNTGGALRFVSKYFKLAKQARPSVNKILVLITDGEASDEVTAPATEL 752 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP-WCANSSHAPLLITSSVKIS 162 + I S+ + + +K + E F F I S K Sbjct: 753 RNDGIII---YSVGVFNANKTQLEEISGKPEKVFYVENFDILEDIKGDIIFGICSPYKPE 809 Query: 163 SKSDIGL-DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 L D++ V+D S S+ + M ++DI+ NN V+ G V Sbjct: 810 DCKRANLLDVVFVIDSSGSIGPYNYAIMKDF---------VIDIVNKSDVGNNRVQFGAV 860 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +S+ F L I +KI + TT + L +A N ++K + Sbjct: 861 KYSAYPQILFNLNGNKADIIDKIKGDTLLNDTTYTAEALRHAENLFTESKGSRKRRGVP- 919 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ +TDG + D + + G +YAIG++ ++ + D Sbjct: 920 ----QLLMVITDG----TSHDKDKLDAVSTRIRNDGITIYAIGIKDAKREELEIIAENKD 971 Query: 341 RFYSVQNSRKL 351 + V L Sbjct: 972 HVFFVDTFDGL 982 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 43/198 (21%), Positives = 72/198 (36%), Gaps = 26/198 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+M ++D S S+ D D I+E+++ D V +V FS Sbjct: 623 KEMKADIMFLVDSSGSIGD------DNFKKMKTFIKEVVNRSYIGVDQVQVG---VVHFS 673 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + F L + I + GSTT + L + AK+ Sbjct: 674 DTSKEVFSLNKNTSKKSDIIQAVEDMSLIGSTTNTGGALRFVSKYFKLAKQARP------ 727 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 K ++ +TDGE S E + G I+Y++GV Q + P+ Sbjct: 728 -SVNKILVLITDGEASDEVTAP------ATELRNDGIIIYSVGVFNANKTQLEEISGKPE 780 Query: 341 RFYSVQNSRKLHDAFLRI 358 + + V+N L D I Sbjct: 781 KVFYVENFDILEDIKGDI 798 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 77/201 (38%), Gaps = 30/201 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S S + + + +++D I ++ ++ G+ FS + Sbjct: 1003 IVFLIDGSGS------IPKNDFEIVKDFLTKLVDSIS----FHDNIQFGMAQFSDIYSEE 1052 Query: 231 FPLAWGVQHIQEKIN------RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 FPL G + ++ + G T GL+ + ++ ++ Sbjct: 1053 FPL--GHYQSKSELKNKIANVSMQAGLHTYIGKGLKEVKAFFKSPRRRVARN-----VHQ 1105 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K +IF TDGE+ + +R G ++A+GV + + SP+R Y+ Sbjct: 1106 KLLIF-TDGESKD------SFTQPAEDLRREGVEIHAVGVGKIEHAKLQQITVSPERIYT 1158 Query: 345 VQNSRKLHDAFLRIGKEMVKQ 365 N L I +EM K+ Sbjct: 1159 TANYTGLPHITKGITEEMCKE 1179 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 74/196 (37%), Gaps = 25/196 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS--SKI 227 D+ ++D S S+N ++E++ + PD VR G+V +S + Sbjct: 443 DIYFLIDGSTSINSDSFH-----KDMKTFLKEVIKMFTVGPD---HVRFGVVQYSNIHRT 494 Query: 228 VQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + ++++ IN + T + LE A+++ ++ Y Sbjct: 495 EFEIDKHSTISNLEKAINNIQYLTGDTYTGAALESMLGLFESARKQRKNKVPT------Y 548 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG+ + + ++ G V AIGV + + L+ + +Y V Sbjct: 549 LIVLTDGDPHD------KVKEPADRLRKAGINVIAIGVG-DIKWKGLQEIGESNVYY-VH 600 Query: 347 NSRKLHDAFLRIGKEM 362 L I +++ Sbjct: 601 QYASLKTIKDNIVQDI 616 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 60/177 (33%), Gaps = 18/177 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 SD D++ V+D S G + ++ ++ + +R GLV +S Sbjct: 234 SDSVADVVFVVDES--------VGNANVEYIKTFLQNSINSLDVTE---ECIRIGLVKYS 282 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 ++ L+ + ++ + R+ S L A N ++ + K Sbjct: 283 TETQVVSFLSKETEK-KDILQRIQSFSPRAGKANLGAAINITRKQVFTERAGSRKNQGVK 341 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + +T P+ D+ + V+AIG++ Q + + P Sbjct: 342 QIATIIT----HRPSDDSL--TEAAEQLINADITVFAIGIEGANISQLNQVVSYPPN 392 >gi|325927536|ref|ZP_08188772.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] gi|325542075|gb|EGD13581.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] Length = 501 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 89/241 (36%), Gaps = 34/241 (14%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 T PW +S + I ++ +++ ++DVS SM+ DKL + S Sbjct: 103 LATTPWNKDSLLLRVGIAGRDIATADLPPA-NLVFLVDVSGSMD-----APDKLPLLQSS 156 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKST 256 ++ ++ +++ R LVT++ I P G Q I E I+ L G +T Sbjct: 157 LKLLVRQLRAQD------RITLVTYAGNISVVLPPTPGDQQGRIVEAIDALQSGGSTAGA 210 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+E AY + +G + I+ TDG+ + + + E +R G Sbjct: 211 SGIELAYKA------AQQGYLRGGINR---ILLATDGDFNVGVTNFDQLKGMVAEKRRSG 261 Query: 317 AIVYAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQ 365 + +G D ++ A + + + +L I +++ Q Sbjct: 262 IALSTLGFGTGNYNDNLMEQLADAGDGAYAYIDTALEARKVLTHELGATLATIARDVKIQ 321 Query: 366 R 366 Sbjct: 322 V 322 >gi|319787647|ref|YP_004147122.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317466159|gb|ADV27891.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 585 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 46/239 (19%), Positives = 85/239 (35%), Gaps = 28/239 (11%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT----FPWCANSSHAPLLITSSVKISSKS 165 + S+ + + DY A + E+PF T PW + I + + Sbjct: 154 PADSVRVEEMLNYFDYGHPAPASREVPFKVTTELAPAPWNPARQLLMVGIKG-YDVDKRE 212 Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 ++++++D S SM+D KL + R+ + +P + R +V ++ Sbjct: 213 LPPANLVLLVDTSGSMDDPA-----KLPLLKRAF------AQLVPQLRAKDRVSIVAYAG 261 Query: 226 KIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P G +H I + L +T GL AY A + H+ G + Sbjct: 262 HAGLVLPPTPGNRHGEILAALEGLHAAGSTNGGEGLRLAY-----AMARQGHVEGGVNR- 315 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 I+ TDG+ + D L + +R G + +G + D + A Sbjct: 316 ---ILLATDGDFNVGITDRNALLTLVADQRRSGIALSTLGFGSGNYNDAMAERLADAGN 371 >gi|157962337|ref|YP_001502371.1| cell wall anchor domain-containing protein [Shewanella pealeana ATCC 700345] gi|157847337|gb|ABV87836.1| LPXTG-motif cell wall anchor domain [Shewanella pealeana ATCC 700345] Length = 789 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 84/204 (41%), Gaps = 27/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D S SM+ D + A +++ L ++ N +V F+S + + Sbjct: 401 LILVIDTSGSMSG------DAIIQAKTALKYALAGLRPTDKFN------IVQFNSDVDKW 448 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG---HDD 282 +A + + Q INRL T+ + + A N ++ + Sbjct: 449 SGMAMSATPYNLAQAQNYINRLEANGGTEMSIAINAALNIETVTDKETGTELDNNDLGSN 508 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 + ++F+TDG + N+ LF EA+ + ++ IG+ + F++ A R Sbjct: 509 LLRQVLFITDG-----AVSNESMLFELIEAQLGDSRLFTIGIGSAPNAHFMQRAAQLGRG 563 Query: 342 -FYSVQNSRKLHDAFLRIGKEMVK 364 + + +++ + + K++ K Sbjct: 564 TYTYIGKLDEVNQKVVSLLKKIEK 587 >gi|209527391|ref|ZP_03275898.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492182|gb|EDZ92530.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 488 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 67/198 (33%), Gaps = 28/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM+ G + ++ A +K +V FSS+ Sbjct: 52 AVVLLIDTSGSMS---GQKLREVQTAASEFVSR-QNLKRHD-------LAVVEFSSRASV 100 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 +Q+ I RL T + G A + + ++ I+ Sbjct: 101 VADFTRNETELQQAIARLSARGGTNLSEGFNLATSVLQNSDRTPN------------ILL 148 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 TDG ++P + + + + G + A+G + PD + N Sbjct: 149 FTDGVPNNPPM----AASIAQQIRASGINLVAVGTGDAQINYLTALTGDPDLVFYA-NFG 203 Query: 350 KLHDAFLRIGKEMVKQRI 367 L AF K + Q++ Sbjct: 204 DLDRAFRGAEKAIYGQQL 221 >gi|156408065|ref|XP_001641677.1| predicted protein [Nematostella vectensis] gi|156228817|gb|EDO49614.1| predicted protein [Nematostella vectensis] Length = 1418 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 70/187 (37%), Gaps = 14/187 (7%) Query: 170 DMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ G + + +L I+ + + V L S+ I Sbjct: 3 DLIFLVDTSGSLQYWSGGGWKNGFDDEKVFVNSLLSHIR-VSYKSTYVSVVLFGTSATID 61 Query: 229 QTF---PLAWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + + + + L F T + AY+ IF + H H Sbjct: 62 INYIFNPHPNNHKCNFRRDFSNLRFRSGMTNMHDAFQAAYDIIF--GKYSGHKRPTHQ-V 118 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 K + LTDG+ + N D K RG ++ IGV L++ ASP+ ++ Sbjct: 119 KTAVFLLTDGQWN-WNGDPWP---IAKRLKDRGIEIFTIGVTNGVNVNTLRSLASPNNYF 174 Query: 344 SVQNSRK 350 + + Sbjct: 175 HYNDFTQ 181 >gi|78048779|ref|YP_364954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037209|emb|CAJ24954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 602 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 88/241 (36%), Gaps = 34/241 (14%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 T PW +S + I ++ +++ ++DVS SM+ DKL + S Sbjct: 204 LATTPWNKDSLLLRVGIAGRDIATADLPPA-NLVFLVDVSGSMD-----APDKLPLLQSS 257 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKST 256 ++ ++ +++ R LVT++ P G Q I E I+ L G +T Sbjct: 258 LKLLVRQLRAQD------RITLVTYAGNTSVVLPPTPGDQQGRIVEAIDALQSGGSTAGA 311 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+E AY + +G + I+ TDG+ + + + E +R G Sbjct: 312 SGIELAYKA------AQQGYLRGGINR---ILLATDGDFNVGVTNFDQLKGMVAEKRRSG 362 Query: 317 AIVYAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQ 365 + +G D ++ A + + + +L I +++ Q Sbjct: 363 IALSTLGFGTGNYNDNLMEQLADAGDGAYAYIDTALEARKVLTHELGATLATIARDVKIQ 422 Query: 366 R 366 Sbjct: 423 V 423 >gi|326670350|ref|XP_001332841.4| PREDICTED: anthrax toxin receptor 1-like [Danio rerio] Length = 609 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 45/208 (21%), Positives = 75/208 (36%), Gaps = 29/208 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +S G D+ VLD S S+ H+ I ++ + ++ +R Sbjct: 31 TAASSCYGGFDLYFVLDKSGSVQHHWNE-----------IYNFVEHLAQ-KFISPQLRMS 78 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + FS + L + I++ + L G T G++ A +I+ + Sbjct: 79 FIVFSDQGKILMQLTEDREQIRKGLKELQDVRPGGDTFMHEGIQRASEQIYYGNTEGYRT 138 Query: 277 AKGHDDYKKYIIFLTDGE--NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 A II LTDGE + +E N ++ GA VY +GV+ Q K Sbjct: 139 AS-------VIIALTDGELHENHFYYAERE----ANRSRSLGASVYCVGVKDFNETQLAK 187 Query: 335 NCASPDRFYSVQNS-RKLHDAFLRIGKE 361 S D + V + L I K+ Sbjct: 188 IADSKDHVFPVNDGFEALQGVIDSILKK 215 >gi|116620210|ref|YP_822366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116223372|gb|ABJ82081.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 311 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 72/212 (33%), Gaps = 37/212 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D + + +V D S SM + RS + + LV F S Sbjct: 86 DEPVAVGLVFDTSGSMGEK----------LQRSRMAAREFFHISNPEDEFF---LVEFDS 132 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL I++ + +T + A +++ +K+ KK Sbjct: 133 SPRLVVPLTSDTGTIEDHLTFSRSHGSTALLDAIFLALHEMKHSKKN-----------KK 181 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-------FLKNCA- 337 ++ ++DG ++ KE K ++Y+IGV L + Sbjct: 182 ALLIISDGGDNHSRYSEKEVSSV---VKESDVLIYSIGVFGGGGSPEEAGGPGLLSKVSE 238 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + R + ++ +L D +IG E+ + IL Sbjct: 239 QTGGRLFEA-SAVELPDIAKKIGIELRNRYIL 269 >gi|269315863|ref|NP_001161395.1| collagen alpha-5(VI) chain precursor [Mus musculus] Length = 2640 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 76/190 (40%), Gaps = 21/190 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ M L + ++K + V+ G +T+S+ Sbjct: 845 LDIVFVLDHSGSIGPREQESMMNLT---------IHLVKKADVGRDRVQIGALTYSNHPE 895 Query: 229 QTFPLAWGVQH--IQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L+++ N +F EH ++ + ++ Sbjct: 896 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVLF----TEEHGSRLTQNVRQ 950 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG + D + E + +G ++A+GV D+ + V Sbjct: 951 LMIVITDGV----SHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHV 1006 Query: 346 QNSRKLHDAF 355 N KL D + Sbjct: 1007 DNFDKLRDIY 1016 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 44/200 (22%), Positives = 76/200 (38%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ +++ + S+ +M P N VR G+V +S K Sbjct: 474 DIYFLIDGSSSIRKK---EFEQIQIFMSSVIDM------FPIGPNKVRVGVVQYSHKNEV 524 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FP++ I + N T + L++ I K + A Y Sbjct: 525 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALDFILPLIKKGKTERTDRAPC------Y 578 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG+++ L N + ++AIG+ EA L+ A D Sbjct: 579 LIVLTDGKSND------SVLEPANRLRAEQITIHAIGIG-EANKTQLRQIAGKD---ERV 628 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N + D+ I E+V + Sbjct: 629 NFGQNFDSLKSIKNEIVHRI 648 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 67/192 (34%), Gaps = 23/192 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S G ++ ++ I+ D + G+V FS Sbjct: 655 EDMKADIMFLVDSSGS------IGPTNFETMKTFMKNLVGKIQIGADRSQ---VGVVQFS 705 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + I I+R+ + T G + E + G Sbjct: 706 DYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGG------ALTFVNEYFDLSKGGRPQ 759 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I LTDG+ +L + + ++++GV Q + Sbjct: 760 VRKFLILLTDGKAQDEVGGPATAL------RSKSVTIFSVGVYGANRAQLEEISGDGSLV 813 Query: 343 YSVQNSRKLHDA 354 + V+N L Sbjct: 814 FHVENFDHLKAI 825 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 66/189 (34%), Gaps = 23/189 (12%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-N 214 T S + D D++ ++D S+ + R ++ L+ + S DV N Sbjct: 254 TVPFPTSCQKDSLADLIFLVDESVGTTQNL-----------RDLQNFLENVTSSVDVKDN 302 Query: 215 VVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +R GL++FS + L + Q++I +L + + A ++ Sbjct: 303 CMRLGLMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGA---AIEQMRKEGFS 359 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ + + +T + + + G ++A+G++ Q Sbjct: 360 ESSGSRKAQGVPQIAVLVT------HRASDDMVREAALDLRLEGVTMFAMGIEGANNTQL 413 Query: 333 LKNCASPDR 341 + P R Sbjct: 414 EDIVSYPSR 422 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 82/206 (39%), Gaps = 31/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S + P +R L+ +I S+P N R L +S + Sbjct: 30 DVVFLVDSSNYLGIKSFP----------FVRTFLNRMISSLPIEANKYRVALAQYSDALH 79 Query: 229 QTFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + + + G + K L+ A+ F A + + Sbjct: 80 NEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFP 135 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ L + ++++ + +A + G + ++GVQ +A+++ LK A+ + Sbjct: 136 PILVVL-------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHF 187 Query: 344 SVQNSRKL---HDAFLRIGKEMVKQR 366 +++ +R L RI K++ + R Sbjct: 188 NLRTARDLGMFAPNMTRIIKDVTQYR 213 >gi|189082901|sp|A6H584|CO6A5_MOUSE RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; Flags: Precursor Length = 2640 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 76/190 (40%), Gaps = 21/190 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ M L + ++K + V+ G +T+S+ Sbjct: 845 LDIVFVLDHSGSIGPREQESMMNLT---------IHLVKKADVGRDRVQIGALTYSNHPE 895 Query: 229 QTFPLAWGVQH--IQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L+++ N +F EH ++ + ++ Sbjct: 896 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVLF----TEEHGSRLTQNVRQ 950 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG + D + E + +G ++A+GV D+ + V Sbjct: 951 LMIVITDGV----SHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHV 1006 Query: 346 QNSRKLHDAF 355 N KL D + Sbjct: 1007 DNFDKLRDIY 1016 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 44/200 (22%), Positives = 76/200 (38%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ +++ + S+ +M P N VR G+V +S K Sbjct: 474 DIYFLIDGSSSIRKK---EFEQIQIFMSSVIDM------FPIGPNKVRVGVVQYSHKNEV 524 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FP++ I + N T + L++ I K + A Y Sbjct: 525 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALDFILPLIKKGKTERTDRAPC------Y 578 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG+++ L N + ++AIG+ EA L+ A D Sbjct: 579 LIVLTDGKSND------SVLEPANRLRAEQITIHAIGIG-EANKTQLRQIAGKD---ERV 628 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N + D+ I E+V + Sbjct: 629 NFGQNFDSLKSIKNEIVHRI 648 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 67/192 (34%), Gaps = 23/192 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S G ++ ++ I+ D + G+V FS Sbjct: 655 EDMKADIMFLVDSSGS------IGPTNFETMKTFMKNLVGKIQIGADRSQ---VGVVQFS 705 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + I I+R+ + T G + E + G Sbjct: 706 DYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGG------ALTFVNEYFDLSKGGRPQ 759 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I LTDG+ +L + + ++++GV Q + Sbjct: 760 VRKFLILLTDGKAQDEVGGPATAL------RSKSVTIFSVGVYGANRAQLEEISGDGSLV 813 Query: 343 YSVQNSRKLHDA 354 + V+N L Sbjct: 814 FHVENFDHLKAI 825 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 66/189 (34%), Gaps = 23/189 (12%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-N 214 T S + D D++ ++D S+ + R ++ L+ + S DV N Sbjct: 254 TVPFPTSCQKDSLADLIFLVDESVGTTQNL-----------RDLQNFLENVTSSVDVKDN 302 Query: 215 VVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +R GL++FS + L + Q++I +L + + A ++ Sbjct: 303 CMRLGLMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGA---AIEQMRKEGFS 359 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ + + +T + + + G ++A+G++ Q Sbjct: 360 ESSGSRKAQGVPQIAVLVT------HRASDDMVREAALDLRLEGVTMFAMGIEGANNTQL 413 Query: 333 LKNCASPDR 341 + P R Sbjct: 414 EDIVSYPSR 422 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 82/206 (39%), Gaps = 31/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S + P +R L+ +I S+P N R L +S + Sbjct: 30 DVVFLVDSSNYLGIKSFP----------FVRTFLNRMISSLPIEANKYRVALAQYSDALH 79 Query: 229 QTFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + + + G + K L+ A+ F A + + Sbjct: 80 NEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFP 135 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ L + ++++ + +A + G + ++GVQ +A+++ LK A+ + Sbjct: 136 PILVVL-------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHF 187 Query: 344 SVQNSRKL---HDAFLRIGKEMVKQR 366 +++ +R L RI K++ + R Sbjct: 188 NLRTARDLGMFAPNMTRIIKDVTQYR 213 >gi|260834336|ref|XP_002612167.1| hypothetical protein BRAFLDRAFT_88906 [Branchiostoma floridae] gi|229297541|gb|EEN68176.1| hypothetical protein BRAFLDRAFT_88906 [Branchiostoma floridae] Length = 954 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 77/212 (36%), Gaps = 32/212 (15%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 I S +SS ++ +D+ +LD S S+ G + + ++++ P Sbjct: 166 IRSQSNVSSTCEVPVDLFWLLDGSNSV------GTANFEKVKQFVVDVVNSFDVSP---T 216 Query: 215 VVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKE 271 R G+V +S+K F L IN + + G T + L+Y Sbjct: 217 ATRVGVVQYSNKNTLMFNLGDKVDKPSTVNAINSIQYQGGGTYTGYALKY---------- 266 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + A + K + LTDGE+ ++L G V+A+GV A + Sbjct: 267 VRQKAAWRGGNVPKVAVVLTDGESYDSVSVAAQNLL------SDGVEVFAVGV---AGFK 317 Query: 332 FLKNCASPD-RFYSVQNSRKLHDAFLRIGKEM 362 + S + + + L I K++ Sbjct: 318 LIAIANSNETNVIELNDFNDLTTKIGEIAKKV 349 >gi|116625273|ref|YP_827429.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228435|gb|ABJ87144.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 87/225 (38%), Gaps = 39/225 (17%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 SVK+ + D+ + M +++D S SM D G K+ A ++ S PD + Sbjct: 54 QSVKVFRREDVPVSMGLIIDNSGSMRDKRG----KVEAAALALVR-----DSNPDDEVFI 104 Query: 217 RSGLVTFSSKIVQTFP----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 V F+ + P ++ ++E + R+ T + A D ++ Sbjct: 105 ----VNFNDEAFLDNPHGKDFTTNIKEMEEALTRIDSRGGT----AMRDALRMSIDHVKE 156 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H K K ++ +TDG ++S + + + +++ ++Y +G+ + + Sbjct: 157 KAHKDK------KVLVVVTDGNDNSSVVSLENLVKASQQSE---VLIYGVGLLGDEERRE 207 Query: 333 LKNC---------ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + A+ + ++ ++ ++ +++ Q + Sbjct: 208 AQRAQRALKALAEATGGEVFFPKDVNEVDKVAHQVARDIRNQYTI 252 >gi|325297740|ref|YP_004257657.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317293|gb|ADY35184.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 341 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 67/200 (33%), Gaps = 31/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F P + K+ + G++ ++ LD+S SM +L + Sbjct: 61 LVFSALAMVIFMLARPQFGS---KMETVKRQGVETVVALDISNSMLAEDVTP-SRLEKSK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 + I +++ N + ++ F+ + P+ + + LI Sbjct: 117 KLISRLVETF-------NNDKVAMIVFAGEAFTQLPITSDYISAKMFLETITPSLISTQG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T ++ A ++ + I+ +TDGEN ++ EA Sbjct: 170 TDIRGAIDLAMKSFT-----------PNEGVGRAIVLITDGENHEG-----GAVEAAQEA 213 Query: 313 KRRGAIVYAIGVQAEAADQF 332 ++G V+ +GV + Sbjct: 214 AKKGVRVFVLGVGSPDGSPI 233 >gi|257389158|ref|YP_003178931.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257171465|gb|ACV49224.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 393 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 67/203 (33%), Gaps = 29/203 (14%) Query: 146 ANSSHAPLLITSSVKISS---KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 N +T+ + + ++D+ + + +D S SM R+ Sbjct: 11 PNVPADGTTVTAEIDVEPGEQETDVRRHIALCIDTSGSMEGD---------NIKRARDGA 61 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-W---GVQHIQEKINRLIFGSTTKSTPG 258 + + D + +V F ++ P W Q + + L G T G Sbjct: 62 AWVFGLLADED---YVSIVAFDTEATVILPATRWSDLDRQTAMDHVEELTAGGGTDMYNG 118 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L+ A + + D K ++ L+DG+++ D E L G Sbjct: 119 LKAAKETLSSSA--------TGPDTVKRLLLLSDGKDNERTPDEFEGL--AEAIDDAGIR 168 Query: 319 VYAIGVQAEAADQFLKNCASPDR 341 + + G+ + + ++ + R Sbjct: 169 IQSAGIGTDYNEATIRTLGTAGR 191 >gi|319952790|ref|YP_004164057.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421450|gb|ADV48559.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 348 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 63/174 (36%), Gaps = 19/174 (10%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+D++ LDVS SM ++L R I E+++ + S R Sbjct: 80 TKLETVKREGVDIVFALDVSKSMLAEDIAP-NRLEKGKRLISEIINHLGS-------DRI 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G++ ++++ P+ + + + T A N+ A E Sbjct: 132 GIIAYAAQAYPQLPITTDYSAAKMFLQSM----NTDMLSSQGTAINE---AIELASTYYD 184 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + ++DGE+ + + +A G ++ IGV E Sbjct: 185 DETQTNRVLFIISDGEDHAEGT----TEDAVEKATEEGIRIFTIGVGKEKGAPI 234 >gi|198422181|ref|XP_002120553.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 1038 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 68/195 (34%), Gaps = 30/195 (15%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + +++V+D S SM M+ A +S+ L+ R + Sbjct: 183 VQANVPKPKQIVIVIDKSGSMG---VTNMNLAKEAAKSVVNTLNPQD---------RFAV 230 Query: 221 VTFSS---------KIVQTFPLAW------GVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + FSS Q F + + +++ ++ + G T P L+ A++ Sbjct: 231 MAFSSIFVPFQSTVASDQCFATTFADASPQNKKKVEDFVDTISSGGGTNYAPALQKAFSF 290 Query: 266 IFDAKEKLEHIAK--GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + K + + I+F++DG + P + NE I+ G Sbjct: 291 FQQEPSVSDFNIKKIDPSEIDRVILFMSDGIPNDPGSTILSAQIRANEQLNNSVIILTYG 350 Query: 324 VQAEAADQFLKNCAS 338 + L+N A+ Sbjct: 351 LGNADFG-VLRNMAT 364 >gi|290995572|ref|XP_002680357.1| predicted protein [Naegleria gruberi] gi|284093977|gb|EFC47613.1| predicted protein [Naegleria gruberi] Length = 269 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 67/207 (32%), Gaps = 31/207 (14%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 K G+D++ V+D S SM K+ + ++ M D +K R LV Sbjct: 42 ERKERKGIDLICVVDKSGSMAGS------KIEMVKSTLAFMFDQLKPTD------RIALV 89 Query: 222 TFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F S I + G ++ ++ + GS T + L I E + Sbjct: 90 EFDSNISTSLQFTNMNESGRSKAKQVVSNIRAGSCTNLSGALFEGLRLIGQRTNANEVTS 149 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA----IVYAIGVQAEAADQFL 333 ++ TDG + + E + V+ G + L Sbjct: 150 ---------LLLFTDGLANEGITNTNEIVKKMTTMIHEEIRTNLTVFTFGFGTDTDANML 200 Query: 334 KNCASPDR--FYSVQNSRKLHDAFLRI 358 + + +Y +Q + + AF + Sbjct: 201 TSISQAGNGLYYFLQTTDDIPKAFGNV 227 >gi|114327015|ref|YP_744172.1| hypothetical protein GbCGDNIH1_0351 [Granulibacter bethesdensis CGDNIH1] gi|114315189|gb|ABI61249.1| hypothetical protein GbCGDNIH1_0351 [Granulibacter bethesdensis CGDNIH1] Length = 329 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 79/220 (35%), Gaps = 40/220 (18%) Query: 164 KSDIGLDMMMVLDVSLSMN---DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 K G +M+++D S+SM+ D+ P K +IR + D + P R G+ Sbjct: 78 KVGQGAQIMLLIDRSISMDQTFDNQTPNAAKESKTDAAIRLVKDFFRQRPH----DRFGV 133 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 V FS+ + PL ++ ++ + + T GL A+ ++ ++ Sbjct: 134 VAFSTSPILAMPLTEHRAAVEASLDAMRRPAIARTNIGRGLSLAFAQLQNSSPDAA---- 189 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF------ 332 + ++F++DG + ID + EA + G +Y + ++ E Sbjct: 190 ------RVVLFVSDG---AGVIDGELEPRLHAEAVKLGVHIYYLYLRTEGDPGLHAAGET 240 Query: 333 ------------LKNCASPDRFYSVQNSRKLHDAFLRIGK 360 P + + N L A I + Sbjct: 241 ADADAPAALDAWFSGLGVPYKAFEADNPNALKSAVDTINR 280 >gi|297579199|ref|ZP_06941127.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297536793|gb|EFH75626.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 1778 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 62/181 (34%), Gaps = 16/181 (8%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKI--------SSKSDIGLDMMMVLDVSLSMNDHF 185 +P + L+I I S++ G ++ ++LDVS SM Sbjct: 1123 NIPLEAKNAAGAIGTGKVTLIIEDDAPIAKDIFHMTESETKQGANVQLMLDVSGSMGRDA 1182 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKI 244 G G +L V S ++++ +++ V+ L + + I L W V + I Sbjct: 1183 GNGKTRLQVMKESAIQLIEQYQALGQTK--VQLILFSSDASIKTASGLLWMTVAEAKNYI 1240 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 N L T ++ A + Y FL+DG + + K Sbjct: 1241 NALSANGGTDYDDAIKLAQESWSGTINGQPLSGATNVSY-----FLSDGVPEGYDWELKN 1295 Query: 305 S 305 S Sbjct: 1296 S 1296 >gi|224048537|ref|XP_002190467.1| PREDICTED: similar to collagen, type XII, alpha 1 [Taeniopygia guttata] Length = 3122 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 52/265 (19%), Positives = 104/265 (39%), Gaps = 37/265 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++T+L++ +Y ++ Y M + + P T VK+ + G+ Sbjct: 384 QTTALNVKDLSPDTEYQINV---YAMKGLTPSEPITIMEK------TQQVKVQVECSRGV 434 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ DV ++ + G+ + ++ + P V+ LV +S Sbjct: 435 DVKA--DVVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPRK---VQISLVQYSRDPHM 489 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V+ I + IN + G +T + + Y K+F ++ + + Sbjct: 490 EFSLNRYNRVEDIIQAINTFPYRGGSTNTGKAMTYVREKVF----VTSKGSRPNVP--RV 543 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFY 343 +I +TDG++S + + ++A+GV+ +A L+ ASP Y Sbjct: 544 MILITDGKSSDA------FKEPAIKLRDADVEIFAVGVK-DAVRTELEAIASPPADTHVY 596 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +V++ DAF RI E+ + L Sbjct: 597 TVED----FDAFQRISFELTQSVCL 617 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 56/336 (16%), Positives = 110/336 (32%), Gaps = 46/336 (13%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIK---------NIWQTDFRN 94 +K L + + L AT + + E N N + Sbjct: 1 MKTALCSAV--AALCAATLLSSIEAEVNPPSDLNFTIIDEHNVQMSWKRPPDAIVGYRIT 58 Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE-----MPFIFCTFPWCANSS 149 + N + ST+ ++I D + +++ Y+ +P +F Sbjct: 59 VVPTNDGPTKEFTLSPSTTQTVISDLIPDVEYIVSIASYDDREESLP-VFGQLTIQTGGP 117 Query: 150 HAPLLITSSVKISSKSDIGL-DMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIK 207 P ++ S + D++ ++D S S+ + FG +D + + Sbjct: 118 GIPEEKKVEAQLQRCSISAMTDLVFLVDGSWSVGRNNFGYILDFMVALVSA--------- 168 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYN 264 R G+V +SS F L + + + I R+ + G T + ++Y Sbjct: 169 -FDIGEEKTRVGVVQYSSDTRTEFNLNQYFRRSDLIDAIKRIPYKGGNTMTGEAIDYLVQ 227 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 F KG + K I +TDG+ E + G V+++G+ Sbjct: 228 NTFTES---AGARKG---FPKVAIVITDGKAQDDVEIPAR------ELRNIGVEVFSLGI 275 Query: 325 QAEAADQFLKNCASPD--RFYSVQNSRKLHDAFLRI 358 +A A + + P ++V N + D I Sbjct: 276 KAADAKELKLIASQPSLKHVFNVANFDGIVDIQNEI 311 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/267 (14%), Positives = 90/267 (33%), Gaps = 41/267 (15%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 +++ + Y ++ +E P + ++++ P L + ++ Sbjct: 1137 DSTVVLEELRAGTTYKVNVFGMFEGGESNPLVGQEMTTLSDTTTEPFLSRG---LECRTR 1193 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+++++D S S+ I ++++ PD V+ GL +S Sbjct: 1194 AEADIVLLVDGSWSIGRP------NFKTIRNFIARIVEVFDIGPDR---VQIGLAQYSGD 1244 Query: 227 IVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 W + + K + L G T + L++ F L A+ Sbjct: 1245 PRTE----WNLNAYRTKQSLLEAVANLPYKGGNTLTGMALDFILRNNFKPVAGLRPRAR- 1299 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K + +TDG++ + + + G +YAIG++ ++ + P Sbjct: 1300 -----KIGVLITDGKSQDDVVAPSR------KLRDEGVELYAIGIKNADENELKQIATDP 1348 Query: 340 D--RFYSVQNSRKLHDAFLRIGKEMVK 364 D Y+V + L + + Sbjct: 1349 DDIHAYNVADFSFLATIVDNVTTNLCN 1375 >gi|219847650|ref|YP_002462083.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541909|gb|ACL23647.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 419 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 77/219 (35%), Gaps = 31/219 (14%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 LL ++ + +++ VLD S SM G +++L A E+LD S Sbjct: 28 LLEAQPAPQMTQVRMPVNVCFVLDRSGSMK---GEKIERLRQAVVKAIELLDQQDS---- 80 Query: 213 NNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 +V F + P I + ++R+ T+ P +E ++ Sbjct: 81 -----LAIVIFDHRTEVLVPAQPVRNRAMILDLVHRIRDAGGTRIAPAVEKGLQELQKMP 135 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + +I LTDG+ N E L ++A R G + A+G+ + + Sbjct: 136 -----------PGVRRLILLTDGQTEHEN----ECLLRADDAGRLGVPITALGIGKDWNE 180 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L A S + ++ + F + + I Sbjct: 181 DLLIEMANRSKGVADYIAQPGEIVNYFQHTVQRAQQTVI 219 >gi|319792023|ref|YP_004153663.1| von willebrand factor type a [Variovorax paradoxus EPS] gi|315594486|gb|ADU35552.1| von Willebrand factor type A [Variovorax paradoxus EPS] Length = 345 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 72/251 (28%), Gaps = 59/251 (23%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + V + S +++ +DVS SM D + A +S + D+ Sbjct: 76 MAVVMLPSNQQT---IILAMDVSGSMRAADVLPNRLVAAQEAAKSFIK---------DLP 123 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE-- 271 V+ G+V F+ + I+ T + + + +F Sbjct: 124 RTVKVGIVAFAGSAQVAQLPTTNHDDLVTAIDSFQLQRATATGNAIVVSLATLFPDAGID 183 Query: 272 -------KLEHIAKGHDDYKKY---------------IIFLTDGENSSPNIDNKESLFYC 309 + KK II LTDG+ ++ + L Sbjct: 184 VEQFSAPSRQRGTPIDQTEKKLKDFTPVAPGSFTSAAIIMLTDGQRTTGV----DPLDAA 239 Query: 310 NEAKRRGAIVYAIGVQAEAADQF---------------LKNCA--SPDRFYSVQNSRKLH 352 A RG +Y +GV + LK A + ++ + L Sbjct: 240 KAAADRGVRIYTVGVGTVDGETIGFEGWSMRVRLDEETLKAVANKTNAEYFYAGTANDLK 299 Query: 353 DAFLRIGKEMV 363 + + ++ Sbjct: 300 KVYETLSSKLT 310 >gi|288921527|ref|ZP_06415802.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347095|gb|EFC81397.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 587 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 43/249 (17%), Positives = 75/249 (30%), Gaps = 35/249 (14%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH----FG 186 E+P + H + S + + VLDVS SM Sbjct: 349 PAPELPAQLPFPATSEIADHLLAAYQDEYRRPSHA------IFVLDVSPSMRGERLELLR 402 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVR----SGLVTFSSKI--------VQTFPLA 234 + +L S ++ R L+TFS + P + Sbjct: 403 STLRELAGTGTSTGRGTGTDDTLEQRFARFRERERVTLITFSGTVHDTLEFTVNDPQPGS 462 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + I + L GS T LE AY + D+ I+ +TDGE Sbjct: 463 ADLTAISAAADGLTLGSGTAIYSALEAAYRYVADSAAAPADGVAPLTS----IVLMTDGE 518 Query: 295 NSSPNIDN---KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSR 349 N+ + L + A+ + + +A ++ A + + + + Sbjct: 519 NNQGTTADAFHSSYLALPDAARS--VRTFTVVFG-DARVDEMRTIADWTGGAMFDAR-TS 574 Query: 350 KLHDAFLRI 358 L +AF I Sbjct: 575 SLSEAFREI 583 >gi|329940639|ref|ZP_08289920.1| putative secreted protein [Streptomyces griseoaurantiacus M045] gi|329300700|gb|EGG44597.1| putative secreted protein [Streptomyces griseoaurantiacus M045] Length = 421 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 79/214 (36%), Gaps = 32/214 (14%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 ++ ++ D + +VLDVS SM G ++ A ++ E+LD + +V Sbjct: 25 PSGAAADETADQDAP-KVDLVLDVSGSMRAKDIDGGSRMAAAKQAFNEVLDA--TPEEVR 81 Query: 214 NVVRSGLVTF-----------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 +R+ + ++++ PL + + L T P L A Sbjct: 82 LGIRTLGADYPGDNRKEGCKDTAQLYPVGPL--DRTEAKTAVATLSPTGWTPIGPALLKA 139 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + + K I+ ++DGE++ +D E AK G + + Sbjct: 140 ADDLDGGNG------------SKRIVLISDGEDTCAPLDPCEVAREI-AAKGIGLTIDTL 186 Query: 323 GVQAEAADQFLKNC---ASPDRFYSVQNSRKLHD 353 G+ + +C A+ + SV++ +L D Sbjct: 187 GLVPDVKLNRQLSCIAEATGGTYTSVEHRDQLTD 220 >gi|38348336|ref|NP_940912.1| inter-alpha-trypsin inhibitor heavy chain H5-like protein precursor [Homo sapiens] gi|74762375|sp|Q6UXX5|ITH5L_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5-like protein; Short=Inter-alpha inhibitor H5-like protein; Flags: Precursor gi|37181448|gb|AAQ88537.1| ITI-like protein [Homo sapiens] gi|57208182|emb|CAI42356.1| inter-alpha (globulin) inhibitor H5-like [Homo sapiens] gi|57209935|emb|CAI42344.1| inter-alpha (globulin) inhibitor H5-like [Homo sapiens] gi|119613592|gb|EAW93186.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_a [Homo sapiens] gi|162318154|gb|AAI57043.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct] gi|162318494|gb|AAI56206.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct] Length = 1313 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 68/206 (33%), Gaps = 36/206 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+DVS SM FG M++ A I L +++FS + Sbjct: 284 VVFVIDVSSSM---FGTKMEQTKTAMNVILSDLQANDYFN---------IISFSDTVNV- 330 Query: 231 FPLAW-----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 W V ++ ++ + T L A + + + ++ Sbjct: 331 ----WKAGGSIQATIQNVHSAKDYLHCMEADGWTDVNSALLAAASVLNHSNQEPGRGPSV 386 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 IIFLTDGE ++ L +A ++++ +A L+ + Sbjct: 387 GRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFTLLRRLSLE 444 Query: 340 DR------FYSVQNSRKLHDAFLRIG 359 +R + + +L + I Sbjct: 445 NRGIARRIYEDTDAALQLKGLYEEIS 470 >gi|299533511|ref|ZP_07046889.1| isocitrate dehydrogenase [Comamonas testosteroni S44] gi|298718494|gb|EFI59473.1| isocitrate dehydrogenase [Comamonas testosteroni S44] Length = 688 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 66/200 (33%), Gaps = 28/200 (14%) Query: 146 ANSSHAPLLITSSVKISSKSDIGL-----DMMMVLDVSLSMNDHFGPGM-------DKLG 193 + V + SD L +++++LD+S SMN +L Sbjct: 245 TGKLTVTIGDGIPVAEPTASDYVLPGQDSNLLLMLDLSGSMNWGQENNKNPAPGEKSRLQ 304 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGST 252 + ++ MLD ++ DV VR VTF + Q W + + +N L Sbjct: 305 IMKEAVSLMLDSYAALGDVK--VRI--VTFQNTAAQARQTTWIDIATAKSIVNALTATGG 360 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS-SPNIDNKESLFYCNE 311 T + LE A A + + FLTDGE + S ID + Sbjct: 361 TPYSKALETAMQAFNSAGKIDGGKNIAY--------FLTDGEPTASYEIDAAREALWKTF 412 Query: 312 AKRRGAIVYAIGVQAEAADQ 331 + ++ A Sbjct: 413 VDAN--EINSMAFGIGPAAP 430 >gi|328947150|ref|YP_004364487.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328447474|gb|AEB13190.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 333 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 43/225 (19%), Positives = 68/225 (30%), Gaps = 49/225 (21%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G ++ VLDVS SM +L A SI M ++ GLV + Sbjct: 87 SSKGASIVFVLDVSPSMAAKDIGSQSRLEAAKNSIVSMAEMNSGSE-------LGLVVMA 139 Query: 225 SKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + ++ + G T GL A + +K Sbjct: 140 ENAAVLVSPTMDRKFFLNRLKTVSVGELGDGTAIGTGLSSAIYHLEKSK----------- 188 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--------- 332 KK I+ +TDGEN+S + + A + +Y +GV Sbjct: 189 SPKKSIVLITDGENNSGAVHPHTA---ARLAVNKDISLYILGVGTRGVVPIDYVDPKSNK 245 Query: 333 --------------LKNCASPDR--FYSVQNSRKLHDAFLRIGKE 361 + AS F+ +++ L A I K Sbjct: 246 IYSGYLESKFDTSSIARIASEGNGKFFEIESISALSQAISSISKS 290 >gi|319902110|ref|YP_004161838.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417141|gb|ADV44252.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 342 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 62/200 (31%), Gaps = 31/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F A P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 LVFAVIGLFAMLLARPQFGS---KLETVKRQGVEVMIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----ST 252 R + +++D + + G++ F+ P+ + + + Sbjct: 117 RLVAQLVDKM-------QNDKVGMIVFAGDAFTQLPITNDYISAKMFLESISPSLISKQG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + A + + II +TDGEN + A Sbjct: 170 TAIGAAISLATRSFT-----------PQEGIGRAIIVITDGENHEGGVA-----EAAKTA 213 Query: 313 KRRGAIVYAIGVQAEAADQF 332 +G V +GV Sbjct: 214 TEKGIQVNVLGVGMPDGAPI 233 >gi|320007358|gb|ADW02208.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 428 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 77/200 (38%), Gaps = 41/200 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------- 223 + +VLDVS SM G ++ A ++ ++LD + +V +R+ + Sbjct: 41 VELVLDVSGSMRTRDMGGESRISAAKQAFNDVLDAV--PEEVQLGIRTLGADYPGDDRKV 98 Query: 224 ----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + ++ PL + + L T P L A + + Sbjct: 99 GCKDTKQLYPVGPL--DRTEAKTAVATLAPTGWTPIGPALLGAADDL------------D 144 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR---RGA--IVYAIGVQAEAADQFLK 334 D + I+ +TDGE++ +D C A+ RG ++ +G+ +A + Sbjct: 145 GGDSTRRIVLITDGEDTCGPLDP------CEVARDIAARGIHLVIDTLGLVPDAKIRQQL 198 Query: 335 NC---ASPDRFYSVQNSRKL 351 C A+ + +VQN+ +L Sbjct: 199 TCIAEATGGTYTAVQNTDEL 218 >gi|309266594|ref|XP_003086799.1| PREDICTED: collagen alpha-5(VI) chain-like [Mus musculus] Length = 2601 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 76/190 (40%), Gaps = 21/190 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ M L + ++K + V+ G +T+S+ Sbjct: 804 LDIVFVLDHSGSIGPREQESMMNLT---------IHLVKKADIGRDRVQIGALTYSNHPE 854 Query: 229 QTFPLAWGVQH--IQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L+++ N +F EH ++ + ++ Sbjct: 855 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NILF----TEEHGSRLTQNVRQ 909 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG + D + E + +G ++A+GV D+ + V Sbjct: 910 LMIVITDGV----SHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHV 965 Query: 346 QNSRKLHDAF 355 N KL D + Sbjct: 966 DNFDKLRDIY 975 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 45/200 (22%), Positives = 76/200 (38%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ +++ V S+ +M P N VR G+V +S K Sbjct: 433 DIYFLIDGSSSIRKK---EFEQIQVFMSSVIDM------FPIGPNKVRVGVVQYSHKNEV 483 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FP++ I + N T + L++ I K + A Y Sbjct: 484 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALDFILPLIKKGKTERTDRAPC------Y 537 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG+++ L N + ++AIG+ EA L+ A D Sbjct: 538 LIVLTDGKSND------SVLEPANRLRAEQITIHAIGIG-EANKTQLRQIAGKD---ERV 587 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N + D+ I E+V + Sbjct: 588 NFGQNFDSLKSIKNEIVHRI 607 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 67/192 (34%), Gaps = 23/192 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S G ++ ++ I+ D + G+V FS Sbjct: 614 EDMKADIMFLVDSSGS------IGPTNFETMKTFMKNLVGKIQIGADRSQ---VGVVQFS 664 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + I I+R+ + T G + E + G Sbjct: 665 DYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGG------ALTFVNEYFDLSKGGRPQ 718 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I LTDG+ +L + + ++++GV Q + Sbjct: 719 VRKFLILLTDGKAQDEVGGPAMAL------RSKSVTIFSVGVYGANRAQLEEISGDGSLV 772 Query: 343 YSVQNSRKLHDA 354 + V+N L Sbjct: 773 FHVENFDHLKAI 784 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 45/301 (14%), Positives = 98/301 (32%), Gaps = 32/301 (10%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRE----NGFAQ 103 L + A+K L ++ S +K + + F LR FA Sbjct: 106 LASAESEDDVEEASKALREDGVKIISVGVQKASEENLKAMATSQFHFNLRTARDLGMFAP 165 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 ++ I + I + ++AV+ ++H T S Sbjct: 166 NMTRIIKDV---IQYREGTTVDLITAVAPATPAAPTIPAALTTAANHVDK--TVPFPTSC 220 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVT 222 + D D++ ++D S+ + R ++ L+ + S DV N +R GL++ Sbjct: 221 QKDSLADLIFLVDESVGTTQNL-----------RDLQNFLENVTSSVDVKDNCMRLGLMS 269 Query: 223 FSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS + L + Q++I +L + + A ++ ++ Sbjct: 270 FSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGA---AIEQMRKEGFSESSGSRKA 326 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + +T + + + G ++A+G++ Q + P Sbjct: 327 QGVPQIAVLVT------HRASDDVVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPS 380 Query: 341 R 341 R Sbjct: 381 R 381 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 59/147 (40%), Gaps = 17/147 (11%) Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNK 265 P N R L +S + F L + + + G + K L+ A+ Sbjct: 26 PIEANKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRT 85 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGV 324 F A + + ++ L + ++++ + ++A + G + ++GV Sbjct: 86 YFSAPRS----GRDKKQFPPILVVL-------ASAESEDDVEEASKALREDGVKIISVGV 134 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKL 351 Q +A+++ LK A+ ++++ +R L Sbjct: 135 Q-KASEENLKAMATSQFHFNLRTARDL 160 >gi|148689167|gb|EDL21114.1| mCG140659 [Mus musculus] Length = 1670 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 76/190 (40%), Gaps = 21/190 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ M L + ++K + V+ G +T+S+ Sbjct: 592 LDIVFVLDHSGSIGPREQESMMNLT---------IHLVKKADIGRDRVQIGALTYSNHPE 642 Query: 229 QTFPLAWGVQH--IQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L+++ N +F EH ++ + ++ Sbjct: 643 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NILF----TEEHGSRLTQNVRQ 697 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG + D + E + +G ++A+GV D+ + V Sbjct: 698 LMIVITDGV----SHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHV 753 Query: 346 QNSRKLHDAF 355 N KL D + Sbjct: 754 DNFDKLRDIY 763 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 45/200 (22%), Positives = 76/200 (38%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ +++ V S+ +M P N VR G+V +S K Sbjct: 221 DIYFLIDGSSSIRKK---EFEQIQVFMSSVIDM------FPIGPNKVRVGVVQYSHKNEV 271 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FP++ I + N T + L++ I K + A Y Sbjct: 272 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALDFILPLIKKGKTERTDRAPC------Y 325 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG+++ L N + ++AIG+ EA L+ A D Sbjct: 326 LIVLTDGKSND------SVLEPANRLRAEQITIHAIGIG-EANKTQLRQIAGKD---ERV 375 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N + D+ I E+V + Sbjct: 376 NFGQNFDSLKSIKNEIVHRI 395 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 67/192 (34%), Gaps = 23/192 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S G ++ ++ I+ D + G+V FS Sbjct: 402 EDMKADIMFLVDSSGS------IGPTNFETMKTFMKNLVGKIQIGADRSQ---VGVVQFS 452 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + I I+R+ + T G + E + G Sbjct: 453 DYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGG------ALTFVNEYFDLSKGGRPQ 506 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I LTDG+ +L + + ++++GV Q + Sbjct: 507 VRKFLILLTDGKAQDEVGGPAMAL------RSKSVTIFSVGVYGANRAQLEEISGDGSLV 560 Query: 343 YSVQNSRKLHDA 354 + V+N L Sbjct: 561 FHVENFDHLKAI 572 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 53/148 (35%), Gaps = 12/148 (8%) Query: 197 RSIREMLDIIKSIPDVN-NVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTT 253 R ++ L+ + S DV N +R GL++FS + L + Q++I +L + Sbjct: 31 RDLQNFLENVTSSVDVKDNCMRLGLMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGA 90 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + A ++ ++ + + +T + + + Sbjct: 91 SNVGA---AIEQMRKEGFSESSGSRKAQGVPQIAVLVT------HRASDDVVREAALDLR 141 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDR 341 G ++A+G++ Q + P R Sbjct: 142 LEGVTMFAMGIEGANNTQLEDIVSYPSR 169 >gi|260463262|ref|ZP_05811463.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030852|gb|EEW32127.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 644 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 89/238 (37%), Gaps = 38/238 (15%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 IR F + +G+ +++T + + + + + I+ + + LD + + TA +I+ Sbjct: 2 IRKFANDRRGNYALMTVLAMVPLMGALAIGIDYTEMVRERQNALNALDAAGIATAQQIVA 61 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS-TSLSIIIDDQHKD 124 +N F ++ +I+ + T+L++ + + + Sbjct: 62 GATDAEAIAYA---------------------KNFFEANLAHIDPANTTLAVTLPNNNTG 100 Query: 125 -------YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 L+ + N++ S V++ L++ +VLD Sbjct: 101 GGTLKLCGTLTYKPYFLPTAKILAGGTSGNATTMAFNTCSEVRL----KNTLEVSLVLDN 156 Query: 178 SLSMNDH-FGPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSKIVQT 230 S SM + G + + + ++++D + + + V+ V+ LV F++ + Sbjct: 157 SGSMKELGKGSNKVRFDLLKDAAKQLVDQLAGQAQLMKQVSKPVQFSLVPFAASVNVD 214 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 29/82 (35%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 G ++ I+ + T G+ + + + E + K +I LTDG N Sbjct: 425 GASAVKTAIDAMAADGATNVPEGMAWGWRTLSSTAPFTEGRPETERGNDKVLIVLTDGAN 484 Query: 296 SSPNIDNKESLFYCNEAKRRGA 317 + D+ + Y GA Sbjct: 485 TYYTPDSVIAQTYSGTNYNYGA 506 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 48/295 (16%), Positives = 83/295 (28%), Gaps = 37/295 (12%) Query: 94 NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL 153 N R + + + I + D + + Y M T C+ + PL Sbjct: 364 NNWRTDVTSNSSSAIRQRFMPKYFADP----GSTTVTPSYGMDAGPNT--SCSTTPIKPL 417 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 S+ +S +D M D + ++ + G L + + V Sbjct: 418 TDVSTTAGASAVKTAIDAMAA-DGATNVPEGMAWGWRTLSSTAPFTEGRPETERGNDKVL 476 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 V+ G ++ +A L Y + Sbjct: 477 IVLTDGA---NTYYTPDSVIAQTYSGTNYNYGANDLAGNKAIYSAL--GYVTPYSNGYSY 531 Query: 274 EHIAKGHDDYKKYIIFLTDGEN-SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-- 330 + G + +D N + N+ CN AK +V I + +A + Sbjct: 532 GRMFLGTSSS----VIKSDYSNANYTKAMNEHFTTLCNNAKAANVMVMTIALDLDATNTA 587 Query: 331 -----QFLKNCASPDRF-YSVQNSRK------------LHDAFLRIGKEMVKQRI 367 LK C+S RF + K L + F IG E+ RI Sbjct: 588 EKTQMDALKACSSDSRFSKDPTDPSKPMKLFWNSTGATLSNDFKAIGNELSNLRI 642 >gi|166366827|ref|YP_001659100.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089200|dbj|BAG03908.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 724 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 30/140 (21%), Positives = 57/140 (40%), Gaps = 22/140 (15%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T + + ++M++D S SMND +KL A + + ++ P VN Sbjct: 38 LTVTERPPIVEQNPQSVVMLIDTSGSMNDD-----NKLQEAKNAAKAFIERQD--PSVN- 89 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R +V F S++ L + + + I+ L G T+ GL A ++ + Sbjct: 90 --RFAVVGFGSQVQIGTGLTSDLATLNQAIDNLSDGGGTRMDLGLATAIEQLESSSSD-- 145 Query: 275 HIAKGHDDYKKYIIFLTDGE 294 ++I+ TDG+ Sbjct: 146 ----------RHILLFTDGQ 155 >gi|218778177|ref|YP_002429495.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759561|gb|ACL02027.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 558 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 74/190 (38%), Gaps = 25/190 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--V 228 ++ +LDVS SMN +KL + RS+ ++ + + R +VT++ V Sbjct: 197 LVFLLDVSGSMNSE-----NKLPLVKRSMEMLVKELGAGD------RVSIVTYAGSAGLV 245 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 A + I ++RL G +T G+E AY ++ + +I Sbjct: 246 LPSTSARNKRKIITALDRLEAGGSTAGGEGIELAYRVAWENLIPEGNNR---------VI 296 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNC--ASPDRFYSV 345 TDG+ + E + E +R G + G D+ ++ A FY + Sbjct: 297 LCTDGDFNVGVSSTPELVRMIEEKRRAGIYLTICGFGMGNYKDEKMEAISNAGNGNFYYI 356 Query: 346 QNSRKLHDAF 355 + R+ H F Sbjct: 357 DSRREAHKVF 366 >gi|114587340|ref|XP_516522.2| PREDICTED: similar to PK-120 precursor isoform 5 [Pan troglodytes] gi|114587342|ref|XP_001172703.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 4 [Pan troglodytes] Length = 930 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEASQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAVGIQALGGTNINDAMLMAVQ-LLDSSNQEEQLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|114587346|ref|XP_001172688.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Pan troglodytes] Length = 900 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEASQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAVGIQALGGTNINDAMLMAVQ-LLDSSNQEEQLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|114587344|ref|XP_001172675.1| PREDICTED: similar to PK-120 precursor isoform 2 [Pan troglodytes] Length = 914 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEASQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAVGIQALGGTNINDAMLMAVQ-LLDSSNQEEQLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|84386788|ref|ZP_00989813.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] gi|84378316|gb|EAP95174.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] Length = 404 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 29/144 (20%), Positives = 56/144 (38%), Gaps = 4/144 (2%) Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-YKKYIIFL 290 PL ++ IN L T+S GL + ++ + ++ I ++ ++ Sbjct: 261 PLTSVFSRVRNSINSLTANGGTRSFHGLLWGVRQLIPSWQQAWGINVSTVPETRRKLVLF 320 Query: 291 TDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAAD-QFLKNCAS-PDRFYSVQN 347 TDG + D + +C A + G + IG ++ + CA P R +S N Sbjct: 321 TDGADEGDTFDQLVNAGFCTTAINQYGIEMNFIGYGVSSSRIAQFERCAGNPSRVFSATN 380 Query: 348 SRKLHDAFLRIGKEMVKQRILYNK 371 + +L++ F I I + Sbjct: 381 TTQLNEYFSDILAVEYSATIKLTR 404 >gi|254414936|ref|ZP_05028700.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196178425|gb|EDX73425.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 576 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 75/200 (37%), Gaps = 21/200 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDK 191 + + PW NS H + I K S ++ ++ +LDVS SM+D +K Sbjct: 178 FSITTEVAEAPW--NSKHKLVHIGLQGKSISTENLPPSNLVFLLDVSGSMSDA-----NK 230 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + + R ++D ++ V+ VV +G V I I++L G Sbjct: 231 LPLLKEAFRLLVDQLRDEDKVSIVVYAGAAG----TVLPPTPGNQKDTILAAIDKLEAGG 286 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G++ AY D + + +I TDG+ + +++ + E Sbjct: 287 STAGGQGIKLAYKLAQDNFIESGNNR---------VILATDGDFNVGISSDEQLVSLIEE 337 Query: 312 AKRRGAIVYAIGVQAEAADQ 331 + + + +G Sbjct: 338 KREQDIFLTVLGFGTGNLQD 357 >gi|288802179|ref|ZP_06407619.1| BatB protein [Prevotella melaninogenica D18] gi|288335146|gb|EFC73581.1| BatB protein [Prevotella melaninogenica D18] Length = 331 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 73/215 (33%), Gaps = 28/215 (13%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 H+ +++ R + F+ S ++K G++ ++ LD+S SM Sbjct: 44 HQLSPMTSKRRGWIKFVLVELVLLLLILIIARPQVGSKIATNKEREGIETIIALDISNSM 103 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 +LG + + +++ + GL+ F+ P+ + Sbjct: 104 LAEDVAP-SRLGKSKLIVENLMNKFSE-------DKIGLIVFAGDAFVQLPITSDYVSAK 155 Query: 242 EKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 ++ LI T L+ + N + K II +TDGE++ Sbjct: 156 MFLDNINPSLIGTQGTDIGKALQLSMNSFT-----------PNSKVGKAIILITDGEDNE 204 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +A+ +G V+ +GV + Sbjct: 205 GGAE-----EMAKQAQSKGIRVFILGVGSTEGATI 234 >gi|260912479|ref|ZP_05919015.1| aerotolerance protein BatB [Prevotella sp. oral taxon 472 str. F0295] gi|260633398|gb|EEX51552.1| aerotolerance protein BatB [Prevotella sp. oral taxon 472 str. F0295] Length = 591 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 64/204 (31%), Gaps = 31/204 (15%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ F P + T KI+ G++ ++ +D+S SM +L Sbjct: 94 VKLGLAITAFALLVVMLARPQMGT---KITHDKRNGIEAVIAVDISNSMMAQDVVP-SRL 149 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLI 248 + I ++D R GLV F+ P+ + + LI Sbjct: 150 EKSKLLIENLVDHF-------THDRIGLVVFAGDAFVQLPITTDYVSAKMFLQNIDPALI 202 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T + + K D K +I +TDGE+ +L Sbjct: 203 ATQGTDIAKAINLSMRSFSQQK-----------DIGKAVIVITDGEDHEG-----GALEA 246 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF 332 A RG V+ +G+ + Sbjct: 247 AKAANERGIRVFILGIGSTKGSPI 270 >gi|119628047|gb|EAX07642.1| matrilin 1, cartilage matrix protein, isoform CRA_a [Homo sapiens] Length = 496 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 272 SATDLVFLIDGSKSVRPE------NFELVKKFISQIVDTLDVSD---KLAQVGLVQYSSS 322 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 323 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPG- 375 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 376 -AQKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 428 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 429 EHYFYTADFKTINQ----IGKKLQKKI 451 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 41 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGMVNYASTVKQ 91 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V + + + R+ + T + +++A K F E D K Sbjct: 92 EFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSR---SPDISKV 148 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIGV + + + P Sbjct: 149 VIVVTDGRPQDSVQDVSA------RARASGVELFAIGVGSVDKATLRQIASEP 195 >gi|301758388|ref|XP_002915048.1| PREDICTED: matrilin-3-like [Ailuropoda melanoleuca] Length = 466 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 79/229 (34%), Gaps = 31/229 (13%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + +S P + S+ LD++ ++D S S+ + Sbjct: 34 FTGSQHGTMQASALPFWSLQAGVCKSR---PLDLVFIIDSSRSVRPL------EFTKVKT 84 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTK 254 + +++D + R +V ++S + F L Q +++ + R+ + T Sbjct: 85 FVSQIIDTLDI---GAADTRVAVVNYASTVKTEFHLQTYSDKQSLKQAVARITPLSTGTM 141 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 S ++ A ++ F + K I +TDG + A+ Sbjct: 142 SGLAIQTAMDEAFT---VEAGARGPTSNIPKVAIIVTDGRPQD------QVNEVAARARA 192 Query: 315 RGAIVYAIGVQAEAADQFLKNCAS---PDRFYSVQN---SRKLHDAFLR 357 G +YA+GV A + LK AS + + V+ KL F Sbjct: 193 SGIELYAVGVD-RADMESLKMIASEPLDEHVFYVETYGVIEKLSSRFQE 240 >gi|4505111|ref|NP_002370.1| cartilage matrix protein precursor [Homo sapiens] gi|115556|sp|P21941|MATN1_HUMAN RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1732121|gb|AAB38702.1| cartilage matrix protein [Homo sapiens] gi|56205026|emb|CAI19322.1| matrilin 1, cartilage matrix protein [Homo sapiens] gi|182887817|gb|AAI60064.1| Matrilin 1, cartilage matrix protein [synthetic construct] gi|189066540|dbj|BAG35790.1| unnamed protein product [Homo sapiens] Length = 496 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 272 SATDLVFLIDGSKSVRPE------NFELVKKFISQIVDTLDVSD---KLAQVGLVQYSSS 322 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 323 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPG- 375 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 376 -AQKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 428 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 429 EHYFYTADFKTINQ----IGKKLQKKI 451 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 41 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGMVNYASTVKQ 91 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V + + + R+ + T + +++A K F E D K Sbjct: 92 EFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSR---SPDISKV 148 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIGV + + + P Sbjct: 149 VIVVTDGRPQDSVQDVSA------RARASGVELFAIGVGSVDKATLRQIASEP 195 >gi|260841562|ref|XP_002613981.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] gi|229299371|gb|EEN69990.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] Length = 2122 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 73/222 (32%), Gaps = 29/222 (13%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 N++ + + VK S D++ +LD S S+ + ++ + +L + Sbjct: 25 NAAQRASSLQNQVKKYQDSRA--DIVFLLDNSGSVGRYNFEEVE-----IAFVENLLSQL 77 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFP----LAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 P + R +V+F ++E G T + A Sbjct: 78 TISPQAS---RVAVVSFDDVARTHIDYIKYPKNKCSFLRELKTVKYIGEWTNTEDAFRLA 134 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYA 321 + K K+ +I LTDG + + + N K A +++ Sbjct: 135 QELLRPPS-----AFKNERPVKQVVILLTDGRPTRGG----DPVKRANNLKSVYNAEIFS 185 Query: 322 IGVQAEAADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEM 362 IG+ Q L++CA+ Y N F + K + Sbjct: 186 IGIGGNLNKQQLEDCATDAQHLYLSPNFVD----FKDLAKRI 223 >gi|299143633|ref|ZP_07036713.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] gi|298518118|gb|EFI41857.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] Length = 1217 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 52/143 (36%), Gaps = 19/143 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + + S +++V+D S SM D ++ A + +D + D + Sbjct: 186 WTVKMLVAARDSVKTSKIVLVIDTSGSMKDF-----GRMKGAKNAANAFVDNV---LDGS 237 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + G+V F+S + + IN L T + G++ A + + Sbjct: 238 QSTQIGIVRFASNVSIVSDFTSNKAKLHSAINALSAEGGTFTQAGVKQARTMLAGSGAD- 296 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 KKY++ L+DG + Sbjct: 297 ----------KKYMVVLSDGVPT 309 >gi|242034233|ref|XP_002464511.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] gi|241918365|gb|EER91509.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] Length = 584 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 42/207 (20%), Positives = 82/207 (39%), Gaps = 33/207 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ VLDVS SM KL + +++ ++D + R +V+FS+ Sbjct: 147 PLDLVTVLDVSGSMQGS------KLALLKQAMGFVIDNLGPAD------RLSIVSFSNDA 194 Query: 228 VQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L G +E + L+ +T + GL A + D + + + Sbjct: 195 SREIRLTRMSGDGKASAKEAVESLVADGSTNISRGLLVASEVLADRRYRNAVTS------ 248 Query: 284 KKYIIFLTDGENSSP----NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 +I L+DG+++ N N + + R G ++ G ++ + A Sbjct: 249 ---VILLSDGQDNQSGVGRNHQNLVPPLFRDADSRPG-SIHTFGFGSDHDAAAMHAIAEV 304 Query: 338 SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + F V+N + D+F + IG + Sbjct: 305 ARGTFSFVENLAVIQDSFAQCIGGLLS 331 >gi|297669803|ref|XP_002813078.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 1 [Pongo abelii] Length = 3182 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 53/315 (16%), Positives = 106/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQSAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ + Sbjct: 576 SIMAFAIGNKGADQAELKEIAFDSSLVFIPAEFRAAPLQGMLPGFLAPLRTLSGTPEVHA 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDVGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y + N +A H Sbjct: 684 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVHANHFTEAGGSRIH- 740 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 741 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 790 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 791 FNPSLVYLMDDFSSL 805 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 48/297 (16%), Positives = 105/297 (35%), Gaps = 27/297 (9%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1531 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1590 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1591 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1647 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1648 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1700 Query: 239 HIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 I + IN++++ + GLE+ + E ++ + +T G++ Sbjct: 1701 QIIDAINKVVYKGGRHANTRVGLEH----LRVNHFVPEAGSRLDQRVPQIAFVITGGKSV 1756 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1757 EDAQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|194291603|ref|YP_002007510.1| hypothetical protein RALTA_B0837 [Cupriavidus taiwanensis LMG 19424] gi|193225507|emb|CAQ71453.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 353 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/262 (14%), Positives = 89/262 (33%), Gaps = 65/262 (24%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 ++++ + + + + +++ +D S SM P +++ A ++ R+++ + + Sbjct: 76 SATITLPADT---VTLVLAMDTSRSMEAADVPP-NRISAAQQAARDLVVGLPAS------ 125 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE---- 271 VR G+V+F+ P Q + + I R T + GL A +F Sbjct: 126 VRLGIVSFAGTAAVVLPPTDNRQDMLDAIERFQLQRGTATGSGLFQALAVLFPEDGIDLE 185 Query: 272 ------------------------------KLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + A+ +I L+DG ++ Sbjct: 186 VILFGSRSDRAGRGTSLDEAAAADAARRREQGQQAAQPGSYRHGAVILLSDGRRTTG--- 242 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEA---------------ADQFLKNCAS--PDRFYS 344 + L A +RG VY +G ++ + L+ AS +Y Sbjct: 243 -PDPLDAARMAAQRGVRVYTVGFGSQQVTSAPESSLSYFMQLDEPALRAVASITGGEYYH 301 Query: 345 VQNSRKLHDAFLRIGKEMVKQR 366 ++ L + ++ +R Sbjct: 302 AGSAADLSQVYRQLSARFALER 323 >gi|119591516|gb|EAW71110.1| collagen, type VI, alpha 3, isoform CRA_h [Homo sapiens] Length = 2977 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1331 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1390 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1391 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1447 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1448 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1500 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1501 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1558 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1559 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1607 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|119591510|gb|EAW71104.1| collagen, type VI, alpha 3, isoform CRA_b [Homo sapiens] Length = 2210 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 317 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 374 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 375 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 434 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 435 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 482 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 483 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 539 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 540 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 589 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 590 FNPSLVYLMDDFSSL 604 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1330 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1389 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1390 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1446 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1447 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1499 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1500 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1557 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1558 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1606 >gi|119591509|gb|EAW71103.1| collagen, type VI, alpha 3, isoform CRA_a [Homo sapiens] Length = 2211 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1331 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1390 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1391 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1447 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1448 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1500 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1501 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1558 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1559 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1607 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|119591514|gb|EAW71108.1| collagen, type VI, alpha 3, isoform CRA_f [Homo sapiens] Length = 2244 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 576 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 684 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 740 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 741 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 790 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 791 FNPSLVYLMDDFSSL 805 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 97/262 (37%), Gaps = 29/262 (11%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRY--------EMPFIFCTFPWCAN-SSHAPLL 154 ++ ++E+ I + L A +RY E+P I S+ P Sbjct: 1396 ELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPVREFRELPNIEERIMNSFGPSAATPAP 1455 Query: 155 ITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 S+ D++ +LD S D R + E++D + D + Sbjct: 1456 PGVDTPPPSRPEKKKADIVFLLDGS------INFRRDSFQEVLRFVSEIVDTV--YEDGD 1507 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + ++ GLV ++S F L + I + IN++++ + + + Sbjct: 1508 S-IQVGLVQYNSDPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANT--KVGLEHLRVNHF 1564 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 E ++ + +T G++ D +L +RG V+A+GV+ +++ Sbjct: 1565 VPEAGSRLDQRVPQIAFVITGGKSVEDAQDVSLALT------QRGVKVFAVGVRNIDSEE 1618 Query: 332 FLKNCASPDRFYSVQNSRKLHD 353 K ++ + V N ++L + Sbjct: 1619 VGKIASNSATAFRVGNVQELSE 1640 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|119591512|gb|EAW71106.1| collagen, type VI, alpha 3, isoform CRA_d [Homo sapiens] Length = 2411 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 576 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 684 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 740 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 741 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 790 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 791 FNPSLVYLMDDFSSL 805 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1531 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1590 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1591 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1647 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1648 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1700 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1701 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1759 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|119591513|gb|EAW71107.1| collagen, type VI, alpha 3, isoform CRA_e [Homo sapiens] Length = 3177 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 576 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 684 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 740 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 741 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 790 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 791 FNPSLVYLMDDFSSL 805 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1531 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1590 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1591 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1647 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1648 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1700 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1701 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1759 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|114584073|ref|XP_001153410.1| PREDICTED: alpha 3 type VI collagen isoform 2 [Pan troglodytes] Length = 2977 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1331 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1390 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1391 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1447 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1448 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1500 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1501 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1558 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1559 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1607 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|114584069|ref|XP_001153544.1| PREDICTED: collagen alpha-3(VI) chain isoform 4 [Pan troglodytes] Length = 3177 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 576 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 684 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFMEAGGSRIR- 740 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 741 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 790 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 791 FNPSLVYLMDDFSSL 805 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1531 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1590 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1591 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1647 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1648 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1700 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1701 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1759 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|114584077|ref|XP_001153230.1| PREDICTED: alpha 3 type VI collagen isoform 1 [Pan troglodytes] Length = 3010 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 576 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 684 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFMEAGGSRIR- 740 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 741 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 790 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 791 FNPSLVYLMDDFSSL 805 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 97/262 (37%), Gaps = 29/262 (11%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRY--------EMPFIFCTFPWCAN-SSHAPLL 154 ++ ++E+ I + L A +RY E+P I S+ P Sbjct: 1396 ELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPVREFRELPNIEERIMNSFGPSAATPAP 1455 Query: 155 ITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 S+ D++ +LD S D R + E++D + D + Sbjct: 1456 PGVDTPPPSRPEKKKADIVFLLDGS------INFRRDSFQEVLRFVSEIVDTV--YEDGD 1507 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + ++ GLV ++S F L + I + IN++++ + + + Sbjct: 1508 S-IQVGLVQYNSDPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANT--KVGLEHLRVNHF 1564 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 E ++ + +T G++ D +L +RG V+A+GV+ +++ Sbjct: 1565 VPEAGSRLDQRVPQIAFVITGGKSVEDAQDVSLALT------QRGVKVFAVGVRNIDSEE 1618 Query: 332 FLKNCASPDRFYSVQNSRKLHD 353 K ++ + V N ++L + Sbjct: 1619 VGKIASNSATAFRVGNVQELSE 1640 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|114584075|ref|XP_516178.2| PREDICTED: alpha 3 type VI collagen isoform 5 [Pan troglodytes] Length = 2976 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 317 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 374 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 375 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 434 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 435 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 482 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 483 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFMEAGGSRIR- 539 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 540 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 589 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 590 FNPSLVYLMDDFSSL 604 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1330 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1389 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1390 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1446 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1447 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1499 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1500 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1557 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1558 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1606 >gi|55743098|ref|NP_004360.2| collagen alpha-3(VI) chain isoform 1 precursor [Homo sapiens] gi|311033499|sp|P12111|CO6A3_HUMAN RecName: Full=Collagen alpha-3(VI) chain; Flags: Precursor gi|225000446|gb|AAI72233.1| Collagen, type VI, alpha 3 [synthetic construct] gi|302313173|gb|ADL14511.1| collagen, type VI, alpha 3 [Homo sapiens] Length = 3177 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 576 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 684 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 740 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 741 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 790 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 791 FNPSLVYLMDDFSSL 805 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1531 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1590 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1591 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1647 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1648 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1700 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1701 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1759 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|62988748|gb|AAY24135.1| unknown [Homo sapiens] Length = 2588 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 576 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 684 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 740 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 741 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 790 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 791 FNPSLVYLMDDFSSL 805 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1531 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1590 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1591 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1647 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1648 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1700 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1701 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1759 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|3127926|emb|CAA36267.1| collagen type VI, alpha 3 chain [Homo sapiens] Length = 3176 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S P + K+ D++ ++D S ++ + + + Sbjct: 7 LPLVAVF---CLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D++KS+ N LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T T I + ++ D + I+ LTDG + E Sbjct: 115 TNQTG---KGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSA------EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 K V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDI 209 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 518 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 575 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 576 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 635 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 636 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 683 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 684 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 740 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 741 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 790 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 791 FNPSLVYLMDDFSSL 805 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1531 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1590 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1591 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1647 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1648 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1700 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1701 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1759 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 265 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 324 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 325 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 374 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 375 FSFGLGAQAASRAELQHIATDDN 397 >gi|313203639|ref|YP_004042296.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442955|gb|ADQ79311.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 346 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 61/177 (34%), Gaps = 28/177 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + G+++M+ LDVS SM +L A + + +++D + N + G Sbjct: 81 KQEKEKRKGIEVMIALDVSNSMLAQDVLP-SRLENAKQILSKLVDDM-------NDDKVG 132 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LV F+ P+ + ++ L+ T L+ A E Sbjct: 133 LVVFAGDAYTQLPITVDYVSAKMFLSNISPELVPRQGTAIGSALDLAIKSFGAKSEAG-- 190 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K II +TDGEN +++ A IV IG+ Sbjct: 191 ---------KAIILITDGENHE-----DDAIGAAKLAAENNIIVNVIGMGKTDGAPI 233 >gi|290957869|ref|YP_003489051.1| hypothetical protein SCAB_34031 [Streptomyces scabiei 87.22] gi|260647395|emb|CBG70500.1| putative membrane protein [Streptomyces scabiei 87.22] Length = 534 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 62/193 (32%), Gaps = 30/193 (15%) Query: 174 VLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRSGLVTFSSKIVQT 230 VLD S SM D L T R+ + + +P ++V VR+ +V+ Sbjct: 360 VLDTSGSMEGDRLDRLKTALTELTGDFRDR-EEVTLMPFGSDVKSVRT-------HVVRP 411 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 G+ I+ +L T L AY + I+ + Sbjct: 412 ADPKAGLDGIRADTRKLSAAGETAIYTSLRRAYEHLGAVDRDTFTS----------IVLM 461 Query: 291 TDGENSSP----NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYSV 345 TDGEN+ + D+ + A+ V+ I D+ + R + Sbjct: 462 TDGENTEGASPADFDDFYGRLP-DAARH--IPVFPILFGDSDRDELEHIAEVTGGRLFDA 518 Query: 346 QNSRKLHDAFLRI 358 L AF I Sbjct: 519 TR-GSLDGAFEEI 530 >gi|222616410|gb|EEE52542.1| hypothetical protein OsJ_34771 [Oryza sativa Japonica Group] Length = 654 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 77/202 (38%), Gaps = 30/202 (14%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM-------DKLGVATRSIREMLDI 205 L + + S + LD++ VLDVS SMND +L V S++ ++ Sbjct: 54 LRVEAPPAADLNSHVPLDVVAVLDVSGSMNDPVAAASPKSNLQGSRLDVLKASMKFVIRK 113 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFP------LAWGVQHIQEKINRLIFGSTTKSTPGL 259 + + R +V F+ V+ + G +KI+RL T P L Sbjct: 114 LA------DGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGTALMPAL 167 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E A + + + + +I+ LTDG++++ ++++ V Sbjct: 168 EEAVKILDERQGSSRNRVG-------FILLLTDGDDTTGFRWTRDAIHGA----VAKYPV 216 Query: 320 YAIGVQAEAADQFLKNCASPDR 341 + G+ A + L + A R Sbjct: 217 HTFGLGASHDPEALLHIAQGSR 238 >gi|71988405|ref|NP_001022675.1| MUscle Attachment abnormal family member (mua-3) [Caenorhabditis elegans] gi|51587416|emb|CAH19087.1| C. elegans protein K08E5.3b, confirmed by transcript evidence [Caenorhabditis elegans] gi|51591699|emb|CAH19103.1| C. elegans protein K08E5.3b, confirmed by transcript evidence [Caenorhabditis elegans] Length = 3183 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 86/227 (37%), Gaps = 23/227 (10%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 +N++ P + + K D++ ++D S S+ G + K V R + Sbjct: 618 GFVDVSSNANLPPGRVCTVQTTCPKQKT--DLVFLIDGSGSI----GSYVFKNEVL-RFV 670 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKST 256 RE +++ + + R GL+ +S +I F L + + I+ T++ Sbjct: 671 REFVELFEI---GRSKTRVGLIQYSDQIRHEFDLDQYGDRDSLLKGISETQYLTGLTRTG 727 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 +++ + F + D + I LTDG + + A++ Sbjct: 728 AAIQHMVQEGFSERR---GARPQQSDIARVAIILTDGRSQDNVTGP------ADSARKLS 778 Query: 317 AIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEM 362 +AIGV L++ A SP+R++ V + L + ++ Sbjct: 779 INTFAIGVTDHVLASELESIAGSPNRWFYVDKFKDLDTRLRSMIQKA 825 >gi|71988401|ref|NP_001022674.1| MUscle Attachment abnormal family member (mua-3) [Caenorhabditis elegans] gi|22096361|sp|P34576|MUA3_CAEEL RecName: Full=Transmembrane cell adhesion receptor mua-3; AltName: Full=Muscle attachment abnormal protein 3; Flags: Precursor gi|4761646|gb|AAD29428.1|AF139060_1 transmembrane cell adhesion receptor MUA-3 precursor [Caenorhabditis elegans] gi|14530487|emb|CAA83226.2| C. elegans protein K08E5.3a, confirmed by transcript evidence [Caenorhabditis elegans] gi|14530561|emb|CAC42345.1| C. elegans protein K08E5.3a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 3767 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 86/227 (37%), Gaps = 23/227 (10%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 +N++ P + + K D++ ++D S S+ G + K V R + Sbjct: 1202 GFVDVSSNANLPPGRVCTVQTTCPKQKT--DLVFLIDGSGSI----GSYVFKNEVL-RFV 1254 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKST 256 RE +++ + + R GL+ +S +I F L + + I+ T++ Sbjct: 1255 REFVELFEI---GRSKTRVGLIQYSDQIRHEFDLDQYGDRDSLLKGISETQYLTGLTRTG 1311 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 +++ + F + D + I LTDG + + A++ Sbjct: 1312 AAIQHMVQEGFSERR---GARPQQSDIARVAIILTDGRSQDNVTGP------ADSARKLS 1362 Query: 317 AIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEM 362 +AIGV L++ A SP+R++ V + L + ++ Sbjct: 1363 INTFAIGVTDHVLASELESIAGSPNRWFYVDKFKDLDTRLRSMIQKA 1409 >gi|308068884|ref|YP_003870489.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858163|gb|ADM69951.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 432 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 86/224 (38%), Gaps = 38/224 (16%) Query: 160 KISSKSDIGL-DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 +++ + + D+++V+D S SM P ++L A S+ +D K R Sbjct: 107 QVNPEGRKSVQDLVLVIDNSGSMQQT-DPDNERLTAAK-SLIGQMDGDK---------RV 155 Query: 219 GLVTFSSKIVQTFPLA-----WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKE 271 +V+F S P Q + KI+ + T+ L+ +I E Sbjct: 156 AIVSFDSTAQLVQPFTPIRTDAEKQAVYSKIDSMQTIMSGGTEIRLALDETIKEI----E 211 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD- 330 + KG +I L+DG ++ R V IG++ +D Sbjct: 212 TQGNAEKGSL-----VIMLSDG------FSELDTQTALAPYIARQIPVNTIGLKLAESDG 260 Query: 331 -QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 L+N A + + +V N++ L AF +I ++ + ++ + Sbjct: 261 IALLQNIADLTGGTYSNVANAQGLTQAFGKIYNKIGDRTLVTER 304 >gi|313680435|ref|YP_004058174.1| von willebrand factor type a [Oceanithermus profundus DSM 14977] gi|313153150|gb|ADR37001.1| von Willebrand factor type A [Oceanithermus profundus DSM 14977] Length = 747 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 80/209 (38%), Gaps = 29/209 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + G +++VLDVS SM G KL A R+++D G+V Sbjct: 323 EPEPQEGAALVLVLDVSGSMG--LGAP-SKLARAVEGARKLVDAAGPEDT------LGIV 373 Query: 222 TFSSK---IVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 TF+S+ ++ + + + + +++ L T ++ A Sbjct: 374 TFASRSRWLLAPKAMTYRAKREAETRLDALEARGGT-----------QLATAYAAAAEAL 422 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + D ++I+ L+DG+ D + +L +A RG + + A+A FL A Sbjct: 423 EPLDARTRWILVLSDGQ---LEDDPQRTLARARQAAARGVKTLTLALGADADRPFLARLA 479 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 RF + + L +G+E K Sbjct: 480 REGGGRFLDLADPAALPQVLALLGEEAFK 508 >gi|74136383|ref|NP_001028084.1| calcium-activated chloride channel regulator 1 precursor [Macaca mulatta] gi|75043731|sp|Q6PT52|CLCA1_MACMU RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Flags: Precursor gi|46371863|gb|AAS90562.1| calcium-activated chloride channel family member 1 [Macaca mulatta] Length = 913 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 72/199 (36%), Gaps = 39/199 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + + ++S G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLFLLQIIELRSW--------VGMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 SELIQINSGSDRDTLTKRL-PTAASGGTSICSGLRLAFTVIKKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAARELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRIG 359 Y+ + L DAF + Sbjct: 458 YASDQVQNNGLIDAFGALS 476 >gi|332531455|ref|ZP_08407359.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] gi|332039124|gb|EGI75546.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] Length = 346 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 68/235 (28%), Gaps = 53/235 (22%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+ +DVS SM ++ A + R + D+ V+ G+V F+ Sbjct: 88 IMLAIDVSGSMRATDVAP-SRMEAAQAAARA------FLADLPRHVKVGIVAFAGSAQIA 140 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK------ 284 + + I+R T G+ + IF + Sbjct: 141 QLPTTNREDLNAAIDRFQLQRGTAIGNGIVLSLAAIFPEVGIDLGQFSYGRPGQPRGPSL 200 Query: 285 -------------------KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 II LTDG+ ++ + L A RG VY +GV Sbjct: 201 DMPPAGPPPTPVAPGSYGSAAIILLTDGQRTTG----IDPLEAAKLASERGVRVYTVGVG 256 Query: 326 AEAADQF---------------LKNCASPDR--FYSVQNSRKLHDAFLRIGKEMV 363 LK+ A R ++ ++ L + + + Sbjct: 257 TVEGVTVGFEGWSMHARLDEESLKHIAQQTRAEYFHAASAEALTQVYETLSSRLT 311 >gi|307941490|ref|ZP_07656845.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307775098|gb|EFO34304.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 611 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 41/313 (13%), Positives = 98/313 (31%), Gaps = 45/313 (14%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII-DDQHKDYNL 127 + + +++ N + N A + + D + ++ Sbjct: 141 KTDPVSTFSVDVDTASYAFVRSELVNGGKPNPDAVRAEEMINYFQYDYKVPDSREAPFST 200 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + PW +++ + + + +++ ++D S SM+D Sbjct: 201 NV--------SVVETPWNSDTKLLHIGLKG-YTVPLDDLPPQNLVFLIDTSGSMSDE--- 248 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI---VQTFPLAWGVQHIQEKI 244 +KL + ++ R +L + +VT++ ++ L+ I E I Sbjct: 249 --NKLPLLQQAFRLLLSTL------REDDTIAIVTYAGNAGVLLEPTALS-DKSKIAEAI 299 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 L G +T GL+ AY ++ II TDG+ + + Sbjct: 300 AALTSGGSTAGHAGLKEAYRLAETMQDDDTKSR---------IILATDGDFNVGLSSADD 350 Query: 305 SLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDR----FYSVQNS------RKLHD 353 + E + G + +G D+ ++ A + + ++ Sbjct: 351 MKRFVKEKRDSGITLSVLGFGRGNYNDELMQALAQNGNGVAAYIDTLSEARKVLVDQIVS 410 Query: 354 AFLRIGKEMVKQR 366 + I ++M Q Sbjct: 411 SISTIAQDMKIQV 423 >gi|260778153|ref|ZP_05887046.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] gi|260606166|gb|EEX32451.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] Length = 397 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 50/384 (13%), Positives = 134/384 (34%), Gaps = 35/384 (9%) Query: 9 FFYNCKGSISILT-AILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 GS+S+ A+L+P++ + ++ + +D + L A + Sbjct: 1 MLKQHTGSVSLSFLALLIPLVVLSAATIMIGFQVQLSSRAMQ-AVDAASLACAFADYSDP 59 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELR-----ENGFAQDINNIERSTSLSIIIDDQH 122 + N + + +++K+ + EL F+ ++ S ++ H Sbjct: 60 SVNQAYLEYYQPNVKLVKSEIYSASGCELNMGYQLTGLFSSLKFAQASYSAQSGSVEQAH 119 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS---- 178 + + S + + + ++T +++ + ++ +D + +S Sbjct: 120 VNQSASVTPTEMTLVLDISSSMAGSIDTLKSILTRAIERIEQDNVQIDGRRAISISIVPF 179 Query: 179 ------LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-------- 224 + + G+ + T+ + + +++ +++ + V+ Sbjct: 180 SDGVSARNADWLDDKGVFCIDGLTKESGGSVLVNETVQNLDRIHSEKAVSHRAPDEFLAD 239 Query: 225 -SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S PL + ++ IN L T+S G+ + ++ + Sbjct: 240 CSASATLVPLTDNMSEVKTAINALTTTGGTRSYQGVIWGARQLIPRWRQEWGYNPYSLAP 299 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNE-AKRRGAIVYAIGVQAEAAD-QFLKNCASP-- 339 K+ +I +TDG +S +D+ C+ A + IG + + ++C + Sbjct: 300 KQKLILMTDGVDSGYVLDDLIDAGLCDRLANEFAIELNFIGFNVQDSRLAQFQSCINAAN 359 Query: 340 -----DRFYSVQNSRKLHDAFLRI 358 + +S N+ KL + F +I Sbjct: 360 TDGIKGQVFSATNTEKLDEYFSKI 383 >gi|297664534|ref|XP_002810694.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 2 [Pongo abelii] Length = 914 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 48/208 (23%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 SELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L R Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLRT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|293571190|ref|ZP_06682227.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608742|gb|EFF38027.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1219 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 51/265 (19%), Positives = 83/265 (31%), Gaps = 79/265 (29%) Query: 167 IGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++V+D+S SM G D+ G A ++ L IK + + V GLV FSS Sbjct: 336 KPVDIVLVVDMSGSMESSQSNGWNDRAGAARNGVKNFLQTIKDA-GIGDYVNVGLVGFSS 394 Query: 226 KIVQTFPLAW-----GVQHIQEKINRLI------FGSTTKSTPGLEYAYNKIFDAKEKLE 274 T P + G IN + F T + G+E + Sbjct: 395 PGYVTGPNGYLTVPIGKASDTSHINAINDALKPKFTGGTYTQIGIEQGQQMLA------- 447 Query: 275 HIAKGHDDYKKYIIFLTDGEN--------------------------------------- 295 G + K +I LTDG Sbjct: 448 ----GSSNENKMMIVLTDGVPTFSKKVTAAQTIDGTTYATKFGNTLDEPRNTSKLNSSYE 503 Query: 296 -------SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL---------KNCASP 339 ++ N +L AK G ++ +G+Q FL A+P Sbjct: 504 VGSWGNRTNINSTWPATLGAAKIAKDAGLTIHTLGIQLSKDGNFLTEQQVRDRASLIATP 563 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ + + + D K +VK Sbjct: 564 GKYKDAETTNDVSDYLNEQAKNVVK 588 >gi|305680612|ref|ZP_07403420.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] gi|305660143|gb|EFM49642.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] Length = 236 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 29/127 (22%), Positives = 54/127 (42%), Gaps = 3/127 (2%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L + ++DVS SM + G L A + + +++ + ++ +R GL+ F + Sbjct: 9 LPVFFLIDVSYSMLEEKPGGGTLLDAANQLVPGIVEACEKYSVLDQRLRLGLIEFCDEAR 68 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL+ + E I +L+ T +N++ A E L G + Sbjct: 69 VVIPLS-EIDAFSENIPQLVAKGGTNFAAAFWAVFNEMGVAVESLRKPEIGIHRPT--VF 125 Query: 289 FLTDGEN 295 F+TDGE+ Sbjct: 126 FITDGED 132 >gi|73960095|ref|XP_547299.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Canis familiaris] Length = 911 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 71/199 (35%), Gaps = 39/199 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM D+L ++ + L I V G+VTF S Sbjct: 307 VCLVLDKSGSMATG-----DRLKRLNQAGKLFLLQI-----VEQGSWVGMVTFDSAAQVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + + T GL A+ I Sbjct: 357 SELIQINSGTERDALTKSL-PTVATGGTSICSGLRSAFAVIKKKYPTDGAE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GA+++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISSCFNEVKQSGAVIHTVALGPSAAKELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRIG 359 Y+ ++ L DAF + Sbjct: 458 YASDQAQNNGLIDAFGALS 476 >gi|21224547|ref|NP_630326.1| secreted protein [Streptomyces coelicolor A3(2)] gi|3559963|emb|CAA20601.1| putative secreted protein [Streptomyces coelicolor A3(2)] Length = 421 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 75/195 (38%), Gaps = 31/195 (15%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--------- 223 +VLDVS SM G ++ A ++ E+LD + +V +R+ + Sbjct: 43 LVLDVSGSMRTRDIDGGTRMAAAKQAFNEVLDA--TPEEVRLGIRTLGADYPGDDRKTGC 100 Query: 224 --SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++++ PL + + L T P L A + + Sbjct: 101 KDTAQLYPVGPL--DRTEAKTAVATLSPTGWTPIGPALLKAADDL------------DGG 146 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---AS 338 D K I+ ++DGE++ +D E AK G + +G+ + +C A+ Sbjct: 147 DGSKRIVLISDGEDTCAPLDPCEVAREI-AAKGIGLTIDTLGLVPNTKMRQQLSCIAEAT 205 Query: 339 PDRFYSVQNSRKLHD 353 + SV+++ +L D Sbjct: 206 GGTYTSVEHTDELTD 220 >gi|15594518|ref|NP_212307.1| hypothetical protein BB0173 [Borrelia burgdorferi B31] gi|3915348|sp|O51195|Y173_BORBU RecName: Full=Uncharacterized protein BB_0173 gi|2688067|gb|AAC66565.1| predicted coding region BB0173 [Borrelia burgdorferi B31] Length = 341 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 44/239 (18%), Positives = 87/239 (36%), Gaps = 31/239 (12%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + + S G D+++VLD+S SM Sbjct: 57 KDYRLNLMYFFTYSFLYLAAMVMVFALAGPSVSKKKMIHLS---AGADIVIVLDISPSMG 113 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + ++I+ GLV F+ P+ + + Sbjct: 114 AVEFSSKNRLEFSK-------ELIRGFISQRENDNIGLVAFAKDASIVVPITTDREFFNK 166 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 167 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 215 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP----DRFYSVQNSRKLHDA 354 I + + N A+ +Y+IG+ + S F V + L + Sbjct: 216 IXKDQVI---NLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVEI 271 >gi|307154093|ref|YP_003889477.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306984321|gb|ADN16202.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 240 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 72/191 (37%), Gaps = 21/191 (10%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L++ ++LD S SM L A + +++++ + ++ VV F I Sbjct: 39 PLNLSLILDRSGSMAGS------ALKYAKMAAQKLVEYLTPEDILSVVVYD---DFVETI 89 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 V P+ I+ +INR+ T + G + + A E + + + Sbjct: 90 VAPQPVK-DQAAIKNQINRINARGCTNLSGGWLTGCDHV-KANLSAERLNR--------V 139 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSV 345 + LTDG + N D K E +G I +G + L N A+ FY + Sbjct: 140 LLLTDGLANVGNSDPKILTKTATEKAEQGIITTTLGFGTYFNEDLLINMANGGKGNFYFI 199 Query: 346 QNSRKLHDAFL 356 Q+ + F Sbjct: 200 QSPQDAAQVFE 210 >gi|325963511|ref|YP_004241417.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] gi|323469598|gb|ADX73283.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] Length = 622 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 69/198 (34%), Gaps = 30/198 (15%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT--- 230 +LDVS SM PG+ KL A ++ + LD + + GL FS Sbjct: 438 LLDVSESMVQE--PGLTKLQRAKDAVLKALDHFTAEDE------IGLAAFSQVGDGPLTP 489 Query: 231 ------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P + + K+N L +TP E D ++ + Sbjct: 490 GVVSPVAPFKTNKEDLIAKLNELKA---VDATPLFEAVSRFAGDQAKEYKDNFINA---- 542 Query: 285 KYIIFLTDGEN--SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 I+ L+DG+N + P S ++ +V+ + +A L+ A S Sbjct: 543 --IVLLSDGKNDTTHPGDLGGLSEQLGHQNHSTPVLVFTLAYGPDADVPTLREIARASGA 600 Query: 341 RFYSVQNSRKLHDAFLRI 358 +Y + +L + + Sbjct: 601 HYYDATDPNRLEEVLGEL 618 >gi|225022482|ref|ZP_03711674.1| hypothetical protein CORMATOL_02522 [Corynebacterium matruchotii ATCC 33806] gi|224944721|gb|EEG25930.1| hypothetical protein CORMATOL_02522 [Corynebacterium matruchotii ATCC 33806] Length = 236 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 29/127 (22%), Positives = 54/127 (42%), Gaps = 3/127 (2%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L + ++DVS SM + G L A + + +++ + ++ +R GL+ F + Sbjct: 9 LPVFFLIDVSYSMLEEKPGGGTLLDAANQLVPGIVEACEKYSVLDQRLRLGLIEFYDEAR 68 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL+ + E I +L+ T +N++ A E L G + Sbjct: 69 VVIPLS-EIDAFSENIPQLVAKGGTNFAAAFWAVFNEMGVAVESLRKPEIGIHRPT--VF 125 Query: 289 FLTDGEN 295 F+TDGE+ Sbjct: 126 FITDGED 132 >gi|156382097|ref|XP_001632391.1| predicted protein [Nematostella vectensis] gi|156219446|gb|EDO40328.1| predicted protein [Nematostella vectensis] Length = 286 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 42/184 (22%), Positives = 66/184 (35%), Gaps = 22/184 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K +D+ +LD S SM K+ +SI +M DI G++TF Sbjct: 91 KKSCPIDIAFLLDASASMGRRTW---GKIKNYVKSIVDMGDISD------QGTHVGIITF 141 Query: 224 SSKIVQTFPLA------WGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHI 276 S+ V P +I+ I+ L T L A +F Sbjct: 142 STDPVIDIPFDKYKGVKMNAVNIKRDIDELRRKKGYTFIDKALTLADKSLFT------QE 195 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A +D +K + ++DG + ++ N K +G VY +G+ A L Sbjct: 196 AGMREDSQKVAVLMSDGIQTKDRGPFTPTIIAANRLKMKGVQVYTVGIGASIDVLELMWI 255 Query: 337 ASPD 340 AS D Sbjct: 256 ASED 259 >gi|157819693|ref|NP_001101483.1| matrilin-3 [Rattus norvegicus] gi|149050910|gb|EDM03083.1| matrilin 3 (predicted) [Rattus norvegicus] Length = 463 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 72/223 (32%), Gaps = 26/223 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 + A ++ + LD++ ++D S S+ + + + Sbjct: 51 HFSARATSTRAPYSGVRGSGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRI 104 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGL 259 +D + R +V ++S + F L Q +++ + R+ + T S + Sbjct: 105 IDTLDI---GATDTRVAVVNYASTVKIEFQLNTYSNKQALKQAVARITPLSTGTMSGLAI 161 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + A + F + K I +TDG + A+ G + Sbjct: 162 QTAMEEAFT---VEAGARGPTSNIPKVAIIVTDGRPQD------QVNEVAARARASGIEL 212 Query: 320 YAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 YA+GV + + P D + V+ KL F Sbjct: 213 YAVGVDRADMESLKMMASKPLEDHVFYVETYGVIEKLSARFQE 255 >gi|297664532|ref|XP_002810693.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 1 [Pongo abelii] Length = 914 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 48/208 (23%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 SELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L R Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLRT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|296224421|ref|XP_002758053.1| PREDICTED: matrilin-3 isoform 2 [Callithrix jacchus] Length = 445 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 69/201 (34%), Gaps = 26/201 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ ++D S S+ + + ++D + P R +V ++ Sbjct: 79 KSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRIIDTLDIGP---ADTRVAVVNYA 129 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + F L Q +++ + R+ + T S ++ A ++ F Sbjct: 130 STVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFT---LEAGARGPSS 186 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + K I +TDG + A+ G +YA+GV + + P Sbjct: 187 NVPKVAIIVTDGRPQD------QVNEVAARARASGIELYAVGVDRADMESLKMMASEPLE 240 Query: 340 DRFYSVQN---SRKLHDAFLR 357 + + V+ KL F Sbjct: 241 EHVFYVETYGVIEKLSSRFQE 261 >gi|296224419|ref|XP_002758052.1| PREDICTED: matrilin-3 isoform 1 [Callithrix jacchus] Length = 487 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 69/201 (34%), Gaps = 26/201 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ ++D S S+ + + ++D + P R +V ++ Sbjct: 79 KSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRIIDTLDIGP---ADTRVAVVNYA 129 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + F L Q +++ + R+ + T S ++ A ++ F Sbjct: 130 STVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFT---LEAGARGPSS 186 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + K I +TDG + A+ G +YA+GV + + P Sbjct: 187 NVPKVAIIVTDGRPQD------QVNEVAARARASGIELYAVGVDRADMESLKMMASEPLE 240 Query: 340 DRFYSVQN---SRKLHDAFLR 357 + + V+ KL F Sbjct: 241 EHVFYVETYGVIEKLSSRFQE 261 >gi|312130321|ref|YP_003997661.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906867|gb|ADQ17308.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 318 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 34/170 (20%), Positives = 63/170 (37%), Gaps = 27/170 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ + +D+S SMN ++ A ++ ++D + R G++ F+S Sbjct: 76 SGKDIFLAIDLSESMNATDVVP-SRIDRAKNELQGLIDRFSA-------DRIGIILFNSN 127 Query: 227 IVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL + ++I+ I L +T +P LE K+ + Sbjct: 128 AYLLTPLTFDTENIRNTIGNLKTHMIDKGSTDFSPMLEMINEKLSVGTQNRG-------- 179 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K I +TDGE + ID K+ ++ +GV + Sbjct: 180 --KVAIVVTDGE-THYQIDE----QLAKRLKQNNIHLFWLGVGTLGGGKI 222 >gi|311254858|ref|XP_001927013.2| PREDICTED: calcium-activated chloride channel regulator 4 [Sus scrofa] Length = 910 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 77/210 (36%), Gaps = 37/210 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ ++L ++ + L I V N G+V F S Sbjct: 289 VCLVLDKSGSMSSS-----NRLNRMNQAAKYFLMQI-----VENGSWVGMVHFDSTASIR 338 Query: 231 FPL--AWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L G + + L T G+ A+ + ++ H Sbjct: 339 SDLIQITGSNERDKLLGSLPTTASGGTSICSGIRRAFEVV---RKLYSHTDGSE------ 389 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I+ LTDGE+++ +E K+ GAI++ I + A ++ + FY+ Sbjct: 390 IVLLTDGEDNTAGA-------CVDEVKQSGAIIHFIALGPSADKAVIEMSTATGGVHFYA 442 Query: 345 VQNSRK--LHDAFLRIG---KEMVKQRILY 369 + L DAF + ++ +Q + Sbjct: 443 TDEAENNGLIDAFGALASGNTDISQQSLQL 472 >gi|239831900|ref|ZP_04680229.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] gi|239824167|gb|EEQ95735.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] Length = 777 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 75/219 (34%), Gaps = 36/219 (16%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L + ++S +++ V+D S SM + A S+ L + Sbjct: 363 LAYVTPPAVASAKKAQREVVFVIDNSGSMGG------TSIEQAKASLDYAL------SHL 410 Query: 213 NNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R ++ F + + F ++ + + + L T P L A + Sbjct: 411 QPGDRFNVIRFDDTLTRFFEVSVEASQQNIASARHFVMSLEAQGGTAMLPALHAALDDSH 470 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + I+FLTDGE I N++ L A+R + ++ +G+ Sbjct: 471 QGNGLRQ------------IVFLTDGE-----ISNEQQLLDAIAARRGRSRIFMVGIGTA 513 Query: 328 AADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 + + A R F + ++ ++ + + ++ Sbjct: 514 PNSYLMNHAAELGRGTFTHIGSAAEVDERMRALFDKLEN 552 >gi|149021014|gb|EDL78621.1| rCG55860, isoform CRA_b [Rattus norvegicus] gi|149021015|gb|EDL78622.1| rCG55860, isoform CRA_b [Rattus norvegicus] Length = 919 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 72/197 (36%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD+S SM KL ++ +L+ D+ R ++ FS++I Sbjct: 262 VVFVLDISASMVGA------KLQQTREALVTILN------DLRPQDRFNIIGFSNRIKMW 309 Query: 228 -VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ +++ + + L T L+ A + + + + + Sbjct: 310 KDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQTAIKLLNNYVAQNDIEDRSVS---- 365 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 IIFLTDG+ + + L EA ++ +G+ + + L+NC Sbjct: 366 LIIFLTDGKPTFGETNTLRILSNTKEATGGQICIFTVGIGNDVDFRLLEKLSLENCGLTR 425 Query: 341 RFYSVQNSRKLHDAFLR 357 R + + + F Sbjct: 426 RVHEEEKAGAQLIGFYD 442 >gi|293342867|ref|XP_001069890.2| PREDICTED: rCG55860-like [Rattus norvegicus] gi|149021013|gb|EDL78620.1| rCG55860, isoform CRA_a [Rattus norvegicus] Length = 953 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 72/197 (36%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD+S SM KL ++ +L+ D+ R ++ FS++I Sbjct: 296 VVFVLDISASMVGA------KLQQTREALVTILN------DLRPQDRFNIIGFSNRIKMW 343 Query: 228 -VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ +++ + + L T L+ A + + + + + Sbjct: 344 KDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQTAIKLLNNYVAQNDIEDRSVS---- 399 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 IIFLTDG+ + + L EA ++ +G+ + + L+NC Sbjct: 400 LIIFLTDGKPTFGETNTLRILSNTKEATGGQICIFTVGIGNDVDFRLLEKLSLENCGLTR 459 Query: 341 RFYSVQNSRKLHDAFLR 357 R + + + F Sbjct: 460 RVHEEEKAGAQLIGFYD 476 >gi|297285600|ref|XP_001085586.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Macaca mulatta] Length = 935 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 73/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + G Sbjct: 323 RPSLVPASAENVNEARSFAAGIQALGGTNINEAMLVAVQ-LLDSSNQEERLPDGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLALENGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|168702184|ref|ZP_02734461.1| hypothetical protein GobsU_21830 [Gemmata obscuriglobus UQM 2246] Length = 638 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 76/219 (34%), Gaps = 28/219 (12%) Query: 143 PWCANSSHAPLLITSSV-KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW N+ H L + +I ++ +++ ++D S SM ++L + +S+ Sbjct: 247 PW--NAKHHLLRVGVQAHQIPAEKLPPRNLVFLVDTSGSMQQE-----NRLPLVQKSLEL 299 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGL 259 +++ + R +VT++ P G + I + + L T G+ Sbjct: 300 LVEKLT------EKDRVSVVTYAGDSRVALPPTSGADKKAILDVVTGLQANGGTNGEGGI 353 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + AY D +I TDG+ + +DN E + E ++ + Sbjct: 354 KKAYQFARDTFLDGGVNR---------VILCTDGDFNVGVVDNGELVKLIEEQRKSKVFL 404 Query: 320 YAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 +G D LK A + + F Sbjct: 405 TVLGYGMGNYKDDRLKELANHGNGHHAYIDTLDEAKKVF 443 >gi|192359695|ref|YP_001982895.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190685860|gb|ACE83538.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 660 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 51/251 (20%), Positives = 92/251 (36%), Gaps = 42/251 (16%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 V Y + F + S T+S + SK + D+ M++DVS SM + Sbjct: 4 VMGYRLFLSFSCLVLLSLFSLWTGAETTSAPLPSKV-LPADIRMIIDVSGSMKKTDPHNL 62 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKI 244 + V +++ +PD + G+ TF + P +W Q + Sbjct: 63 RRPAVDL--------MVRLLPDGSKA---GIWTFGQSVNLLVPYRLVDESWRQQAAKSAS 111 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 T LE A + + G D +++ ++ LTDG +ID + Sbjct: 112 AINSVALHTHIGAALEKA----------AQDVVAGDDGFRRNLVLLTDGV---VDIDPEA 158 Query: 305 SLFYCN----------EAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLH 352 + + K G +V+ I + +A + +K A + F Q++ +L Sbjct: 159 VVNIQERKRILTELLPQLKAAGYVVHTIALSQDADQELMKKLALTTDGVFAVAQSADELM 218 Query: 353 DAFLRIGKEMV 363 AFL I + V Sbjct: 219 QAFLTIFDQAV 229 >gi|297581617|ref|ZP_06943539.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] gi|297534024|gb|EFH72863.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] Length = 467 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 59/146 (40%), Gaps = 11/146 (7%) Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF-------DAKEKLEHIAKG 279 + Q PL + + ++ L T + G+ +A+ + D + Sbjct: 320 VSQIQPLLSTRRAFIKALDTLYPEFNTNNAEGVMWAWRLLSPHWRGYWDKGKSELPRDYQ 379 Query: 280 HDDYKKYIIFLTDGENSSPNID-NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 H + +K ++ TDG + +++ + C E K++G + +I Q +K+CAS Sbjct: 380 HPNNRKVMLLFTDGNHLVDVAKRDRKQVALCREMKKQGIEIISIDFN--NRSQVMKSCAS 437 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++Y N R + ++ + K Sbjct: 438 AGQYYIADN-RTIRSVLKQVATTLSK 462 >gi|282900568|ref|ZP_06308510.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194368|gb|EFA69323.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 426 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 77/215 (35%), Gaps = 36/215 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L++ ++LD S SM+ + + ++ +++D ++S R +V F Sbjct: 37 DTSLPLNLCLILDKSGSMHGE------SMSMVINAVEQLIDQLQSGD------RIAIVAF 84 Query: 224 --SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S +++ + + I+ ++ N+L G T GL ++ + A Sbjct: 85 AGSGEVIIPNQIIKDPKTIKSQLHNKLKAGGGTIIGEGLSLGITELLKGSKGACSHA--- 141 Query: 281 DDYKKYIIFLTDGENSS---------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 LTDG + DN+ L +A + + + G E Sbjct: 142 -------FLLTDGYGDNGFKIWRLQIGPNDNQRCLELAQKAAKLNLTINSFGFGDEWNQD 194 Query: 332 FLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 L+ A ++ + + F RI K + Sbjct: 195 LLEKIADAGGGTLAYIETPQNAIEQFNRIFKRIQS 229 >gi|130491439|ref|NP_001076115.1| inter-alpha-trypsin inhibitor heavy chain H1 [Oryctolagus cuniculus] gi|6579183|dbj|BAA88322.1| inter-alpha-trypsin inhibitor heavy chain H1 [Oryctolagus cuniculus] Length = 906 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 73/198 (36%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ ++ V+D+S SM K+ ++ ++L I+ D ++V G S Sbjct: 282 KNMSKSLVFVIDISGSMEGQ------KVKQTKEALLKILGDIRP-EDYFDLVLFGSRVQS 334 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + +Q ++ + R T GL + +A+ L ++K Sbjct: 335 WRGSLVPASEANLQAARDFVQRFSLAGATNLNGGLLRGIEILNNAQGNLPAVSKHAA--- 391 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDGE + D + L A +Y++G + FLK+ + + Sbjct: 392 -ILIMLTDGEPTEGVTDRPQILKNIRSAIGGRFPLYSLGFGHDLDFNFLKSLSMENNGWA 450 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 451 QRIYEDHDAAQQLQGFYN 468 >gi|194227185|ref|XP_001916970.1| PREDICTED: inter-alpha (globulin) inhibitor H5 [Equus caballus] Length = 905 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 72/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ N ++ FS++I Sbjct: 253 VVFVLDSSASMVG------TKLRQTKDALFTILHDLRPQDHFN------VIGFSNRIKVW 300 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V P + V+ + I+ + T L+ A + + + H Sbjct: 301 KDHLVSVTPDS--VRDGKVYIHHMSPTGGTDINGALQRAIRLLNN---YVAHNDIEDRSV 355 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 IIFLTDG+ + + + L EA R ++ IG+ + + L+NC Sbjct: 356 S-LIIFLTDGKPTVGETNTLKILNNTKEATRGQICIFTIGIGDDVDFKLLEKLSLENCGL 414 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + ++ F Sbjct: 415 TRRVHDEDDAGSQLIGFYD 433 >gi|292618048|ref|XP_699485.3| PREDICTED: integrin alpha-1-like [Danio rerio] Length = 1201 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 52/294 (17%), Positives = 112/294 (38%), Gaps = 49/294 (16%) Query: 96 LRENGFAQDINNIERSTSLSII-IDDQHKDYNLSAVSRYE-MPFIFCTFPWCANSSHAPL 153 L EN +IN ++ + ++ + + + L+ +Y M ++ + Sbjct: 91 LSENTTIPNINEVKENMTMGTTLVVNPDGNGFLACGPQYGYMCGKQQYITGICSNVSSSF 150 Query: 154 LITSSVKIS-SKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + +S+ + + +D+++VLD S S+ T + + L I+ P Sbjct: 151 KVLNSIAPTVRECSQDMDIVIVLDGSNSIYPWDH---------ITDFLVKFLQNIEIGP- 200 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN-----RLIFGSTTKSTPGLEYAYNKI 266 R G+V++ + F L+ + +E + R G T + G++ A + Sbjct: 201 ----ARVGIVSYGDDVGHVFNLS-QFSNTKELVKNAADIRQRTGHKTMTALGIDTARKEA 255 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 F +E A+ KK ++ +TDGE + D+ ++ + G +A+ V Sbjct: 256 F----TVERGARPG--VKKVMVIVTDGE----SHDHHNLKSVIDQCQEDGIERFAVAVLG 305 Query: 327 EAADQ------------FLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + Q ++ AS D F++V + R L +G ++ Sbjct: 306 DYNRQNKSIDEIKKFIEEIEYIASETKSDHFFNVSDERALVTIVDTLGSKIFAL 359 >gi|262195149|ref|YP_003266358.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078496|gb|ACY14465.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 412 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 73/220 (33%), Gaps = 32/220 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++D S SM+ +FG G+ + ++ + D + ++ + + VR G ++S Sbjct: 122 VQFLIDFSGSMDQNFG-GIKRSQAVRNALFDEDDGVVAL--LQSQVRFGASLYTSFDGNE 178 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-----LEHIAKGHDDYKK 285 P + + N L L N + E+ + K Sbjct: 179 APPCPRLTQVAPAFNNLTA-LRADIGGPLNDPPNAGDTPTGESIDAIAENFPDNGPNDKP 237 Query: 286 YIIFLTDGENS-----SPNIDNKESL------FYCNEAKRRGAIVYAIGVQAEAADQFLK 334 I+ TDGE PN D + A G +Y + V + L+ Sbjct: 238 LIVLATDGEPDSCTDPDPNTDPGRAATRRLSEEATQRAFEAGIELYVLSVGNDVGADHLQ 297 Query: 335 NCAS------------PDRFYSVQNSRKLHDAFLRIGKEM 362 A+ P Y N ++L DAF I + Sbjct: 298 RVANAGVGKALDESNDPATVYIGNNQQELVDAFSEIIRSA 337 >gi|163786709|ref|ZP_02181157.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878569|gb|EDP72625.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 345 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 67/200 (33%), Gaps = 35/200 (17%) Query: 142 FPWCANSSHAPLLITSS---VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVAT 196 F C + L + + K+ + G+D++ +DVS SM D +DK Sbjct: 60 FVLCGAFACLSLALVNPKIGTKLETVRSQGVDIVFAVDVSKSMLAEDIAPNRLDKSKQLV 119 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI----FGST 252 I L R G++ ++ K P+ + + + Sbjct: 120 TQIINSLAS----------DRVGIIAYAGKAFPQLPITTDYASAKMFLQNMNTDMLSSQG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + +E A +D +E+ + +I ++DGE+ E++ EA Sbjct: 170 TAISEAIELA-KTYYDDEEQT----------NRVLIIISDGEDHGG-----EAVDIAEEA 213 Query: 313 KRRGAIVYAIGVQAEAADQF 332 G + +GV Sbjct: 214 NEEGIRILTVGVGDVKGGPI 233 >gi|317502941|ref|ZP_07961033.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] gi|315665940|gb|EFV05515.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] Length = 340 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 72/205 (35%), Gaps = 23/205 (11%) Query: 130 VSRYEMPFIFCTFPWCANSS--HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 VS+Y FC S KIS + G+++++ LD+S SM Sbjct: 50 VSKYRPTIKFCLLLSAITILIFMIARPQVGS-KISHEKREGIEVLIALDISNSMMAQDVI 108 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +L + I +++D + + GLV F+ P+ + + + Sbjct: 109 P-SRLEKSKLLIEDLVDHFTN-------DKVGLVVFAGDAFVQLPITNDYVSAKMFLQNI 160 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T L A + + EHI + II +TDGE+ +L Sbjct: 161 NPSLITTQGTDLARAISLSQSCFTQREHIG-------RAIIVITDGEDHEG-----GALE 208 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF 332 EA ++G V+ +GV Sbjct: 209 AAREAYKKGINVFILGVGTSKGAPI 233 >gi|290956351|ref|YP_003487533.1| hypothetical protein SCAB_18411 [Streptomyces scabiei 87.22] gi|260645877|emb|CBG68968.1| putative secreted protein [Streptomyces scabiei 87.22] Length = 422 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 74/195 (37%), Gaps = 31/195 (15%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--------- 223 ++LDVS SM G ++ A ++ E+LD +V +R+ + Sbjct: 42 LLLDVSGSMRAKDIDGQSRMSAAKQAFNEVLDATPK--EVQLGIRTLGADYPGDDRKTGC 99 Query: 224 --SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++++ PL + + L T P L A + + Sbjct: 100 KDTAQLYPVGPL--DRTEAKTAVATLAPTGWTPIGPALLKAADDL------------EGG 145 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---AS 338 + K I+ ++DGE++ +D E AK G + +G+ A + +C A+ Sbjct: 146 EGTKRIVLISDGEDTCAPLDPCEVAREI-AAKGIGLTIDTLGLVPNAKLRVQLSCIAEAT 204 Query: 339 PDRFYSVQNSRKLHD 353 + S+++ +L D Sbjct: 205 GGTYTSIEHRDELTD 219 >gi|332221819|ref|XP_003260062.1| PREDICTED: calcium-activated chloride channel regulator 1 [Nomascus leucogenys] Length = 914 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 SELIQINSGSDRDTLAKRLPA-AASGGTSICRGLRSAFTVIKKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISECFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|291399641|ref|XP_002716222.1| PREDICTED: collagen, type XXIX, alpha 1 [Oryctolagus cuniculus] Length = 2738 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 75/201 (37%), Gaps = 29/201 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S+ ++ +L I+ D G+ FS Sbjct: 623 EDMKADIMFLVDGSSSIGYA------NFEKMKNFMQTLLAKIQIGADKTQ---IGVAQFS 673 Query: 225 SKIVQTFPLA--WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + FPL + + I + I+R L+ T + L++ +K Sbjct: 674 DYNKEEFPLNKYFTQKEISDAIDRMLLITGNTLTGSALKFIDTYFTQSKGARHG------ 727 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 KK++I +TDGE + +L + +G I+ ++GV Q + Sbjct: 728 -VKKFLILITDGEAQDDVREPAVAL------RDKGVIILSVGVYGANRTQLEEISGDGSL 780 Query: 342 FYSVQNSRKLHDAFLRIGKEM 362 + V+N L I +++ Sbjct: 781 VFHVENFEDL----KEIERKL 797 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 41/197 (20%), Positives = 81/197 (41%), Gaps = 21/197 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+N M I + ++K + V+ G + +S Sbjct: 813 LDVVFVLDHSGSINSEQQESM---------INLTIHLVKKSDVGRDRVQFGALRYSDDPD 863 Query: 229 QTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G +T + L + + +H ++ + K+ Sbjct: 864 ILFYLNTYSNRSAIIEHLRRRRDTGGSTFTAKALGRSATLF-----EEQHGSRIKQNVKQ 918 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE+ ++ N +L + + +G + A+GV ++ + + V Sbjct: 919 MLIIITDGESHDRHLLNDTAL----KLRNKGITIIAVGVGKANQEELEAMAGNKENTIHV 974 Query: 346 QNSRKLHDAFLRIGKEM 362 ++ KL D +L + + M Sbjct: 975 KDFDKLKDVYLPLQESM 991 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 71/198 (35%), Gaps = 28/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK--I 227 D+ ++D S S+ D + + +A D+ PD VR G+V +S+ + Sbjct: 442 DIYFLIDGSSSIRDKQFLQIKEFMLAVT------DMFNIGPDK---VRVGVVQYSNDRAV 492 Query: 228 VQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + +++ I N T + L++ + ++ Y Sbjct: 493 EFDIDVYHDNSVLRKAIYNIKQLKGGTLTGKALDFILPIMKKGRKTRA------SQVPCY 546 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 ++ LTDG++ E L + +AIG+ + L+ +R Sbjct: 547 LLVLTDGKSED------EVLGPAERIRAEQISTHAIGIGKAHKKELLQIAGEEER----V 596 Query: 347 NSRKLHDAFLRIGKEMVK 364 N + DA I KE+V Sbjct: 597 NFGQNTDALKSIKKEIVH 614 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 79/219 (36%), Gaps = 28/219 (12%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 + + +S + D D++ ++D SL + R ++ L+ I S Sbjct: 215 KAVDADVGVPLPMSCQKDSLADLVFLVDESLGSRQNL-----------RHLQTFLEDISS 263 Query: 209 IPDVN-NVVRSGLVTF--SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 DV N R GL+ + S+K V + + Q+++ +L + T A + Sbjct: 264 SMDVKENCTRLGLMRYSTSTKTVSSLKSSRSQSEFQQQVQKLSLQAGRAHTGA---AIEQ 320 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + ++ + + +T S P+ D E + + G V+A+ +Q Sbjct: 321 LRREGFSEAGGSRRAQGVPQIAVLVT----SRPSDD--EVRDAALDLRLEGVTVFAVNIQ 374 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + Q + + Y S + ++ +G K Sbjct: 375 GANSTQLEEIVS-----YPPGQSVSVMSSYADLGNYTTK 408 >gi|211546|gb|AAA48695.1| cartilage matrix protein [Gallus gallus] Length = 416 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 82/204 (40%), Gaps = 30/204 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S LD++ ++D S S+ + + I ++++ ++ GLV +S Sbjct: 190 SGSALDLVFLIDGSKSVRPE------NFELVKKFINQIVESLEVSEKQAQ---VGLVQYS 240 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + Q FPL + I+ + ++ T + L+Y + D+ + + A+ Sbjct: 241 SSVRQEFPLGQFKNKKDIKAAVKKMAYMEKGTMTGQALKY----LVDSSFSIANGARPGV 296 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 297 P--KVGIVFTDGRSQDYITD------AAKKAKDLGFRMFAVGVGNAVEDELREIASEPVA 348 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMV 363 + ++ + R + IGK++ Sbjct: 349 EHYFYTADFRTI----SNIGKKLQ 368 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 53/148 (35%), Gaps = 17/148 (11%) Query: 218 SGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLE 274 G++ ++S + F L + + + R+ + T + +++A ++ F E Sbjct: 1 VGVINYASAVKNEFSLKTHQTKAELLQAVQRIEPLSTGTMTGLAIQFAISRAFSDTEGAR 60 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + K I +TDG D A++ G ++AIGV + Sbjct: 61 LRSPN---INKVAIVVTDGRPQDGVQDVSA------RARQAGIEIFAIGVGRVDMHTLRQ 111 Query: 335 NCASP--DRFYSVQN---SRKLHDAFLR 357 + P D V++ KL F Sbjct: 112 IASEPLDDHVDYVESYSVIEKLTHKFQE 139 >gi|56421171|ref|YP_148489.1| hypothetical protein GK2636 [Geobacillus kaustophilus HTA426] gi|56381013|dbj|BAD76921.1| hypothetical protein [Geobacillus kaustophilus HTA426] Length = 960 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/166 (23%), Positives = 60/166 (36%), Gaps = 28/166 (16%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P + + +S +D++ V+DVS SM KL A +++ Sbjct: 55 PNGDAQGRLDVTLVPQGAVSGIIRPPIDVVFVMDVSGSMTAM------KLQSAKSALQAA 108 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--------IQEKINRLIFGSTTK 254 ++ KS + N R L+ FS + + + +G I N L G T Sbjct: 109 VNYFKS--NYNQNDRFALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTN 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 + L A + D KKYIIFLTDG + N Sbjct: 167 YSAALSLAKSYFTDPTR------------KKYIIFLTDGMPTVLNT 200 >gi|291436333|ref|ZP_06575723.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] gi|291339228|gb|EFE66184.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] Length = 424 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 78/229 (34%), Gaps = 32/229 (13%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 C + ++ + D + ++LDVS SM G ++ A ++ Sbjct: 11 VCVLLAALTAGLVAFPAGAAADEPTGRDAP-KVNLLLDVSGSMRARDIDGQSRMAAAKQA 69 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSS-----------KIVQTFPLAWGVQHIQEKINRL 247 E+LD +V +R+ ++ ++ PL + + L Sbjct: 70 FNEVLDATPK--EVELGIRTLGANYAGDDRKEGCKDTAQLYPVGPL--DRTEAKAAVATL 125 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T P L A + + K I+ ++DGE++ +D E Sbjct: 126 TPTGWTPIGPSLLKAADDLEGGNG------------SKRIVLISDGEDTCAPLDPCEVAR 173 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHD 353 AK G + +G+ A +C A+ + SV++ +L D Sbjct: 174 EI-AAKGIGLTIDTLGLVPNAKLSRQLSCIAEATGGTYASVEHQDELTD 221 >gi|316931543|ref|YP_004106525.1| hypothetical protein Rpdx1_0148 [Rhodopseudomonas palustris DX-1] gi|315599257|gb|ADU41792.1| Protein of unknown function DUF2134, membrane [Rhodopseudomonas palustris DX-1] Length = 443 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 61/445 (13%), Positives = 129/445 (28%), Gaps = 100/445 (22%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 I F + +G+I+++ A++L + +G ++ S ++ KL D + + ++ Sbjct: 7 IARFHRDRRGNIAVIFALVLIPLISAIGCAVDYSRANALRTKLQAAADAASVGAVSR--- 63 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + I + R F ++ N+ T S+ Sbjct: 64 --TSPAYIAAGSMSTDGAITSGADDALRI------FNGNLANLTGYTLDSVAATVSKSGE 115 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM---- 181 +++ + A S + + ++ +D ++LD S SM Sbjct: 116 AVTSKVTFSAQISTMFMKAVAVS---SMTVGGVSTATASMPKYIDFYLLLDNSPSMGVGA 172 Query: 182 -----------------NDHFGPGM-------------------DKLGVATRSIREMLDI 205 +DH ++ V + ++++D Sbjct: 173 TPTDVAAMVAATANKSSDDHCAFACHDVNNKNNYYNLAKALGITTRIDVLRSATQQLMDT 232 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQT-----FPLAWGVQHIQEKINRLIFGS--TTKSTPG 258 + N R + F + F L+ + + + + Sbjct: 233 AAATATYTNQFRMAIYDFGASAQTAGLRNLFALSASLSSAKTAAGAIDLMTVKGQNDNND 292 Query: 259 LEYAYNKIFDAKEKLEHIAKGHD--DYKKYIIFLTDGENSSPNIDNKESLF--------- 307 + Y I A +KL KY+ F++DG N + Sbjct: 293 QDTQYTAILPAIDKLIAAPGTGAAGSPLKYLFFVSDGVADEYNPACLKPKTGNRCQSPIN 352 Query: 308 --YCNEAKRRGAIV---YAIGVQAEAADQFLK-----------------------NCASP 339 C K RG V Y + + D + K CASP Sbjct: 353 PALCKTLKDRGVRVAVLYTTYLDLPSNDWYKKWIAPFNEGPYGPSPNSEIAKNMEACASP 412 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ V ++ + +A + K V Sbjct: 413 GFYFEVSPTQGIAEAMNALFKRAVA 437 >gi|239928001|ref|ZP_04684954.1| secreted protein [Streptomyces ghanaensis ATCC 14672] Length = 417 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 78/229 (34%), Gaps = 32/229 (13%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 C + ++ + D + ++LDVS SM G ++ A ++ Sbjct: 4 VCVLLAALTAGLVAFPAGAAADEPTGRDAP-KVNLLLDVSGSMRARDIDGQSRMAAAKQA 62 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSS-----------KIVQTFPLAWGVQHIQEKINRL 247 E+LD +V +R+ ++ ++ PL + + L Sbjct: 63 FNEVLDATPK--EVELGIRTLGANYAGDDRKEGCKDTAQLYPVGPL--DRTEAKAAVATL 118 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T P L A + + K I+ ++DGE++ +D E Sbjct: 119 TPTGWTPIGPSLLKAADDLEGGNG------------SKRIVLISDGEDTCAPLDPCEVAR 166 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHD 353 AK G + +G+ A +C A+ + SV++ +L D Sbjct: 167 EI-AAKGIGLTIDTLGLVPNAKLSRQLSCIAEATGGTYASVEHQDELTD 214 >gi|62088356|dbj|BAD92625.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) variant [Homo sapiens] Length = 699 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 260 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 307 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 308 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 364 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 365 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 423 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 424 RIHEDSDSALQLQDFYQEVANPLLTAVT 451 >gi|104782921|ref|YP_609419.1| RTX toxin [Pseudomonas entomophila L48] gi|95111908|emb|CAK16632.1| putative RTX toxin [Pseudomonas entomophila L48] Length = 2350 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 48/300 (16%), Positives = 92/300 (30%), Gaps = 23/300 (7%) Query: 53 DHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST 112 D + L + ++ + T + Q N + S Sbjct: 1438 DGNDLAAGSVTGSEPGSTGETASGSLVGSVSGGFGALTYSLVGNATGQYGQIQLNADGSY 1497 Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM- 171 + ++ + + F + NS + + I + + Sbjct: 1498 TYTLTSAPKTPGGTNDGANTVVEQFTYKATDALGNSVISTIAINIVDDVPKAESAVRSIT 1557 Query: 172 --------MMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++V+DVS SMN G PG+ +L +A ++I +LD + D+ + VT Sbjct: 1558 PGEVDSNILLVVDVSSSMNSGSGVPGLTRLELAKQAINTLLDKYDDMGDIKVQI----VT 1613 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 FS+ P+ + + I L G +T A A + + + Sbjct: 1614 FSTGATMQTPVWVSISEAKSLIAGLTAGGSTYYDSAATKAQEAFVSAGKLVGAQNVSY-- 1671 Query: 283 YKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 F +DGE S +I + G +A+G+ + L A Sbjct: 1672 ------FFSDGEPSGGHSITAVRETTWETFLDDNGIKSFAVGMGSGVNAGNLDPLAYDGS 1725 >gi|114562255|ref|YP_749768.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114333548|gb|ABI70930.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 612 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 101/285 (35%), Gaps = 38/285 (13%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 ++ N + + F + E + F+ D++ ST+ +I Q Sbjct: 106 VSDRNFSLAPTTNDKFESVVQNG---NMVAGETPVSTFSIDVDTGSYSTTRRLINQGQLP 162 Query: 124 DYN------LSAVSRYEMPF-IFCTFPWCANSSHAPLLITSSVK----------ISSKSD 166 N L Y+ P P+ N+ AP + + ++ Sbjct: 163 TKNTVRVEELVNYFSYDYPVPTNSEQPFSVNTELAPSPYNADTQLLRIGLKGFDVAPDKL 222 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +++++LDVS SM DKL + +++ + + + V+ VV +G Sbjct: 223 SASNLVLLLDVSGSM-----SSADKLPLLKQAMLMLSQQLSAQDKVSIVVYAGASG---- 273 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 +V I+ +++L T + G++ AY +H + + Sbjct: 274 VVLDGVAGNDFTAIKTALSQLNAQGGTNGSQGIQLAYQL------AQKHFIENGSNR--- 324 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 +I TDG+ + D+++ + + ++G + +G + Sbjct: 325 VILATDGDFNLGMTDHQQLVDFVASRSKKGIGLSTLGFGLGSGSA 369 >gi|186681468|ref|YP_001864664.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463920|gb|ACC79721.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 426 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 70/209 (33%), Gaps = 36/209 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++ L++ ++LD S SM+ + +++ ++D +K R +V F Sbjct: 37 DQNLPLNLCLILDRSGSMHGQ------PIKTVIQAVEGLIDRLKVGD------RISVVAF 84 Query: 224 SSKIVQTFP--LAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S + P + + I+ +I ++L T GLE ++ A Sbjct: 85 SGSVEVIIPNQVIEDPESIKSQIKSKLSASGGTAIAEGLELGITELMKGTRGAVSQA--- 141 Query: 281 DDYKKYIIFLTDGENSS---------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 LTDG S DNK L +A + + G Sbjct: 142 -------FLLTDGHGESSLRIWKWDIGRDDNKRCLKLAQKAAKLNLTINTFGFGNSWNQD 194 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L+ A +++ + + F R+ Sbjct: 195 LLEKIADVGGGTLAHIEHPEQAVEQFSRL 223 >gi|297202051|ref|ZP_06919448.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197714313|gb|EDY58347.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 518 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 66/192 (34%), Gaps = 29/192 (15%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREML---DIIKSIPDVNNV--VRSGLVTFSSKIV 228 VLD S SM +D+L A + + + +P + V VR+ +V Sbjct: 345 VLDTSGSMEGD---RLDRLKTALADLTGDFREREEVTLMPFGSQVKSVRT-------HVV 394 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + G+ I++ + L T LE AY+ + ++ I+ Sbjct: 395 KPSDPRAGLDAIRDDTSALSADGDTAIYTSLEKAYDHLGAGRDAFTS-----------IV 443 Query: 289 FLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYSVQ 346 +TDGEN++ FY K R V+ I + + R + + Sbjct: 444 LMTDGENTAGAKARDFDAFYARLGRKARDTPVFPILFGDSDRSELAHIADLTGGRLFDAR 503 Query: 347 NSRKLHDAFLRI 358 L AF I Sbjct: 504 Q-GSLDGAFEEI 514 >gi|221109964|ref|XP_002168937.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 221 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 83/221 (37%), Gaps = 33/221 (14%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 IFC +P+C + +D++ +LD S S+ ++ D L Sbjct: 12 IFCFYPYCRTM------------FQPTCEAVVDIVFLLDSSASLRKYYQNEKDFLKSLIS 59 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTK 254 + ++ R+ ++TFS + L + +E ++ ++ GSTT+ Sbjct: 60 AFGVSINS----------TRAAVITFSYHAQLSIKLNKYSNLNSFKEAVDNIVLMGSTTR 109 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 L A ++F+ LE+ A+ K + LTDG + ++ + NE + Sbjct: 110 IDKALRLAQKEVFE----LENGARPGVA--KILFLLTDGSQTQ-ERGSENPVVIANELRS 162 Query: 315 RGAIVYAIGV-QAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 G + IG+ A + + Y + KL D Sbjct: 163 AGVTIIVIGITNAVDVSELFDIAGGEENAYFADSFEKLKDV 203 >gi|203284094|ref|YP_002221834.1| hypothetical protein BDU_172 [Borrelia duttonii Ly] gi|201083537|gb|ACH93128.1| uncharacterized conserved protein [Borrelia duttonii Ly] Length = 341 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 77/208 (37%), Gaps = 27/208 (12%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 +DY L+ + F + + P + + S G D+++VLD+S SM Sbjct: 57 RDYVLNLLYFVTYTFFYLAIIVMILTLAGPSISRKKMTYLS---SGADIVIVLDISPSMG 113 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L A ++I+ GLV F+ + PL + Sbjct: 114 AIEFSSKNRLEFAK-------ELIEYFVYQRENDNIGLVAFAKEASLIVPLTIDRDFFSK 166 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + KK +I LTDG N Sbjct: 167 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEAPKKSVIVLTDGV---VN 212 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D N A+ +Y++G+ ++ Sbjct: 213 SDEVYKDQVINLAQGLNVKIYSVGIGSD 240 >gi|114587348|ref|XP_001172665.1| PREDICTED: similar to PK-120 precursor isoform 1 [Pan troglodytes] Length = 849 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 263 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEASQW 310 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 311 RPSLVPASAENVNKARSFAVGIQALGGTNINDAMLMAVQ-LLDSSNQEEQLPEGSVSL-- 367 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 368 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 426 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 427 RIHEDSDSALQLQDFYQEVANPLLTAVT 454 >gi|42407700|dbj|BAD08848.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408122|dbj|BAD09262.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|125602049|gb|EAZ41374.1| hypothetical protein OsJ_25891 [Oryza sativa Japonica Group] Length = 704 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 48/234 (20%), Positives = 87/234 (37%), Gaps = 38/234 (16%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 HA + + ++ + +D++ VLDVS SM + KL + R++ ++D + Sbjct: 209 VLVHAKAPSIAVAEATAAARAPVDLVTVLDVSGSMEGY------KLTLLKRAMGFVIDKL 262 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYA 262 R +V+FS + L G + + L G T GL A Sbjct: 263 GPGD------RLAVVSFSYNAQRVIRLTRMSDDGKASAKSAMESLAAGGGTNILKGLVEA 316 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-------KRR 315 K+FD + +A +I L+DG+++ + N + KR Sbjct: 317 -AKVFDGRRYRNAVAS--------VILLSDGQDTYNVNGGWGASNSKNYSVLVPPSFKRS 367 Query: 316 G---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 G V+ G + + A + F ++N + DAF + IG + Sbjct: 368 GDRRLSVHTFGFGTDHDAVAMNAIAEETGGTFSFIENQAVVQDAFAQCIGGLLS 421 >gi|255011031|ref|ZP_05283157.1| putative outer membrane protein [Bacteroides fragilis 3_1_12] gi|313148836|ref|ZP_07811029.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313137603|gb|EFR54963.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 608 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 81/211 (38%), Gaps = 22/211 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDK 191 + T PW N H + I K ++ ++ ++DVS SM +GP ++ Sbjct: 212 VRITSEIGTCPW--NEQHRLVRIGLKAKEIPTENLPASNLIFLIDVSGSM---YGP--ER 264 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V VV SG Q I+E I+ L G Sbjct: 265 LDLVKSSLKLLVNNLRDKDKVAIVVYSGAAG----EKLASTPGSDKQKIREAIDELEAGG 320 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G++ AY ++ G ++ II TDG+ + ++E + Sbjct: 321 STAGGEGIKLAYKI------ARKNFITGGNNR---IILCTDGDFNMGVSSDQELKKLIEQ 371 Query: 312 AKRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 ++ G + +G D ++ A Sbjct: 372 KRKSGVFLTVLGYGMGNYKDSKMQTLAEKGN 402 >gi|149641369|ref|XP_001505343.1| PREDICTED: similar to matrilin-3, partial [Ornithorhynchus anatinus] Length = 354 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 72/198 (36%), Gaps = 26/198 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ ++D S S+ + + +++D + R +V ++S + Sbjct: 149 PLDLVFIVDSSRSVRP------REFEKVKTFLSQVIDTLDI---GETATRVAVVNYASTV 199 Query: 228 VQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + +++ ++R+ + T S + A +++F + Sbjct: 200 KVEFHLQTHSDKESLKQAVSRIAPLATGTMSGLAIRTAMDEVFT---VEAGARAPAFNIP 256 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRF 342 K ++ +TDG + +A+ G +YA+GV + + P Sbjct: 257 KVVVIVTDGRPQD------QVQEAVAQAQASGIEIYAVGVGRADMQSLRQLASEPVETHA 310 Query: 343 YSVQN---SRKLHDAFLR 357 + V+ KL F + Sbjct: 311 FYVETYGVIEKLTSTFRK 328 >gi|73960093|ref|XP_855328.1| PREDICTED: similar to calcium activated chloride channel 4 [Canis familiaris] Length = 938 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 74/198 (37%), Gaps = 36/198 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G ++L ++ + L + + N G+V F S Sbjct: 306 VCLVLDKSGSM-----NGFNRLNRMNQAAKHFL-----LQTIENGSWVGMVHFDSTAYIK 355 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + ++ E + T G++ A+ I + +++ Sbjct: 356 SNLIQIISSKERNNLLESL-PTTANGGTSICAGIKSAFQVIGEIYPQIDGSE-------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 I+ LTDGE+++ E K+ GAI++ I + A + + F+ Sbjct: 407 -IVLLTDGEDNTAK-------NCIGEVKQSGAIIHLIALGPSADQAVIEMSTITGGNHFF 458 Query: 344 SVQNSRK--LHDAFLRIG 359 + ++ L DAF + Sbjct: 459 ASDEAQNNGLIDAFGALA 476 >gi|297285598|ref|XP_002802831.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Macaca mulatta] Length = 900 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 73/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + G Sbjct: 323 RPSLVPASAENVNEARSFAAGIQALGGTNINEAMLVAVQ-LLDSSNQEERLPDGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLALENGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|296208407|ref|XP_002751080.1| PREDICTED: calcium-activated chloride channel regulator 1 [Callithrix jacchus] Length = 914 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 74/199 (37%), Gaps = 39/199 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAYVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 SELVQINSGSDRDTLAKRL-PTAAAGGTSICTGLRSAFTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GA+++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAVIHTVALGPSAAQELEQLSKMTGGFQT 457 Query: 343 YSVQNSRK--LHDAFLRIG 359 Y+ ++ L DAF + Sbjct: 458 YASDQAQNNGLIDAFGALS 476 >gi|268573054|ref|XP_002641504.1| C. briggsae CBR-MUA-3 protein [Caenorhabditis briggsae] gi|187031289|emb|CAP29350.1| CBR-MUA-3 protein [Caenorhabditis briggsae AF16] Length = 3770 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 86/227 (37%), Gaps = 23/227 (10%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 +N++ P + + K D++ ++D S S+ G + K V R + Sbjct: 1209 GFVDVSSNANLPPGRVCTVQTTCPKQKT--DLVFLIDGSGSI----GSYVFKNEVL-RFV 1261 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKST 256 RE +++ + R GL+ +S +I F L + + I+ T++ Sbjct: 1262 REFVELFEI---GRGKTRVGLIQYSDQIRHEFDLDQYGDRSSLLKGISETQYLTGLTRTG 1318 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 +++ + F + +D + I LTDG + + A++ Sbjct: 1319 AAIQHMVQEGFSERR---GARPQQNDIARVAIILTDGRSQDNVTGP------ADAARKLS 1369 Query: 317 AIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEM 362 +AIGV L++ A SP+R++ V + L + ++ Sbjct: 1370 INTFAIGVTDHVLASELESIAGSPNRWFYVDKFKDLDTRLRSMIQKA 1416 >gi|219852403|ref|YP_002466835.1| hypothetical protein Mpal_1806 [Methanosphaerula palustris E1-9c] gi|219546662|gb|ACL17112.1| conserved hypothetical protein [Methanosphaerula palustris E1-9c] Length = 316 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 36/248 (14%), Positives = 91/248 (36%), Gaps = 41/248 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + + + + + ++ G+ +++ +D S SM + +L A S Sbjct: 58 YLFILSLVAIALVIVGLANPHLPLDQTRDGVSVVLAIDDSGSMAANDYQP-TRLEAAKES 116 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTP 257 ++ + +G+V F S L+ + EK + T Sbjct: 117 ASVLIKSLDPKD------YAGVVIFESGATTAAYLSPDKDRVMEKTAAIEQKNGQTALGD 170 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GL A + + + KK ++ L+DG ++ I +++ + + K Sbjct: 171 GLALAVDMA---------DSIPNQ--KKVVVLLSDGVGNAGVISPEDATAFAAQNK---V 216 Query: 318 IVYAIGVQAEA-----------------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 V+ +G+ +++ + L++ A + +Y+ + + LH + + Sbjct: 217 QVFTVGLGSKSPVLLGTDPTGTPQYATLDEAALQSIAEKTGGTYYTSVDEQTLHQIYAGL 276 Query: 359 GKEMVKQR 366 KE+V+++ Sbjct: 277 NKEIVREK 284 >gi|194220937|ref|XP_001501805.2| PREDICTED: matrilin 3 [Equus caballus] Length = 450 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 34/224 (15%) Query: 149 SHAPLLITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 H+ + G LD++ ++D S S+ + + ++ Sbjct: 20 LHSGAPTLGPGQPGVARGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSQI 73 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGL 259 +D + P R +V ++S + F L Q +++ + R+ + T S + Sbjct: 74 IDTLDIGP---ADTRVAVVNYASTVKIEFHLNTHSDKQSLKQAVARITPLSTGTMSGLAI 130 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + A ++ F + K I +TDG + A+ G + Sbjct: 131 QTAMDEAFT---VEAGARVPSYNIPKVAIIVTDGRPQD------QVNEVAARARASGIEL 181 Query: 320 YAIGVQAEAADQFLKNCAS---PDRFYSVQN---SRKLHDAFLR 357 YA+GV A + LK AS + + V+ KL F Sbjct: 182 YAVGVD-RADMESLKVIASEPLDEHVFYVETYGVIEKLSSRFQE 224 >gi|187918047|ref|YP_001883610.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] gi|119860895|gb|AAX16690.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] Length = 341 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 73/206 (35%), Gaps = 27/206 (13%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 +DY L+ + F + + P + + S G D+++VLD+S SM Sbjct: 57 RDYGLNLLYFLTYTFFYLAIVVMILTLAGPSISKKKMTYLSN---GADIVIVLDISPSMG 113 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++ A ++IK GLV F+ + PL Sbjct: 114 AIEFSSKNRFEFAK-------ELIKYFAYQRENDNIGLVAFAKEASLIVPLTIDRDFFSR 166 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ +I LTDG N Sbjct: 167 KLDDIYIMDLGNGSALGLGVSIALSHL-----------KHSEAPKRSVIVLTDGV---VN 212 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQ 325 D N A+ +Y+IG+ Sbjct: 213 SDEVYKDQVINLAQGLNVKIYSIGIG 238 >gi|297285602|ref|XP_002802832.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Macaca mulatta] Length = 888 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 73/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 263 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 310 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + G Sbjct: 311 RPSLVPASAENVNEARSFAAGIQALGGTNINEAMLVAVQ-LLDSSNQEERLPDGSVSL-- 367 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 368 -IILLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLALENGGLAR 426 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 427 RIHEDSDSALQLQDFYQEVANPLLTAVT 454 >gi|126306100|ref|XP_001362237.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 959 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 49/213 (23%), Positives = 83/213 (38%), Gaps = 42/213 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM G D+L ++ + L I + +G+VTF S Sbjct: 307 LILVLDKSGSMA-----GGDRLNRLNQASQLFLLQI-----IEKGSWTGMVTFDSSATIQ 356 Query: 231 FPL---AWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L Q I+RL G T GL A+ I + Sbjct: 357 SALIQIETDAQR-NSLISRLPTAAGGGTSICSGLRTAFTVIKNKFSTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRF 342 I+ LTDGE+S ++ C +E K+ GAI++ + + A+ + L + Sbjct: 407 -IVLLTDGEDS--------TISSCFDEVKQSGAIIHTVALGPSADPGLEELAKMTGGMKT 457 Query: 343 YSVQNSRK--LHDAFLRIGKE---MVKQRILYN 370 + N++ L DAF + E + ++ I + Sbjct: 458 SATDNAQNNGLIDAFSALSSENGAITQRSIQLD 490 >gi|124485081|ref|YP_001029697.1| hypothetical protein Mlab_0254 [Methanocorpusculum labreanum Z] gi|124362622|gb|ABN06430.1| von Willebrand factor, type A [Methanocorpusculum labreanum Z] Length = 313 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 79/212 (37%), Gaps = 39/212 (18%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +++++ LDVS SM+ ++ A S ++ + G+V F S Sbjct: 87 VNLVVALDVSASMSASDYSP-TRVEAAKGSSEILIRSLSESDTA------GVVIFESGAS 139 Query: 229 QTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ + ++ ++ T GL A + + A Y + Sbjct: 140 SAAYLSSDKNRVVSRLEQVSVKTGKTALGDGLALAVDMVT---------AIPAGTY--IV 188 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---------------ADQF 332 + L+DG ++S I +E+ Y AK G +VY IGV +E+ ++ Sbjct: 189 VLLSDGVSNSGMITPQEAAEY---AKNSGVVVYTIGVGSESPVEVSSDGVQQYASLDEET 245 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 L++ A + ++ + + L I + Sbjct: 246 LRSIAEITGGEYFRSVDEKTLVQIQNTIQTSI 277 >gi|32425428|gb|AAH16394.1| MATN2 protein [Homo sapiens] Length = 715 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 409 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 459 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 460 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 514 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 515 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 568 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 569 TNKHLFYAED----FSTMDEISEKLKK 591 >gi|77465024|ref|YP_354527.1| hypothetical protein RSP_3006 [Rhodobacter sphaeroides 2.4.1] gi|77389442|gb|ABA80626.1| conserved hypothetical protein containing Von Willebrand factor, type A domain [Rhodobacter sphaeroides 2.4.1] Length = 222 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 37/167 (22%), Positives = 63/167 (37%), Gaps = 15/167 (8%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVT 222 L + +VLD S SM G + +L + L DV +VT Sbjct: 15 NPTARLPVCLVLDTSASMT---GAPITELQEGVSTFFAQL----LADDVAEYSAEVAVVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + A ++ + L G T +E A + KE E+ G D Sbjct: 68 FGGNVDMAVDFA---AVTRQTVPSLTAGGMTPMGEAVETALELLHTRKE--EYKRAGVDY 122 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 Y+ +++ +TDG + +K S + + + V+AIG+ +A Sbjct: 123 YQPWLVIMTDGAPTDN--ISKASRLVDDLVREKKLAVFAIGIGKDAD 167 >gi|52545626|emb|CAB70853.2| hypothetical protein [Homo sapiens] Length = 672 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 366 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 416 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 417 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 471 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 472 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 525 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 526 TNKHLFYAED----FSTMDEISEKLKK 548 >gi|14042702|dbj|BAB55358.1| unnamed protein product [Homo sapiens] Length = 537 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 231 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 281 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 282 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 336 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 337 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 390 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 391 TNKHLFYAED----FSTMDEISEKLKK 413 >gi|11360063|pir||T46488 hypothetical protein DKFZp434J065.1 - human (fragment) Length = 741 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ V+D S S+ + V + + ++D + P R GL+ +S Sbjct: 435 TEGPIDLVFVIDGSKSLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 485 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L++ + + F E + Sbjct: 486 TQVHTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALKHMFERSFTQGEGARPL--- 540 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 541 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 594 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 595 TNKHLFYAED----FSTMDEISEKLKK 617 >gi|123718338|emb|CAJ77152.1| collagen type VI alpha 4 [Mus musculus] Length = 1451 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 56/340 (16%), Positives = 112/340 (32%), Gaps = 38/340 (11%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 V++ S K + L + L A + Q G + + +N + D Sbjct: 32 VVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQNTARIDVAR 91 Query: 95 ELRENGFAQDINNIERSTS---------LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWC 145 L Q + + + +I + D + L ++ +M FI+ Sbjct: 92 YLIVITDGQSTDPVAEAAQGLRDIGVNIYAIGVRDANTT-ELEEIASKKMFFIYEFDSLK 150 Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + I SS S+ D++ ++D S S+ + M++ Sbjct: 151 SIHQEVIRDICSSENCKSQKA---DIIFLIDGSESIAPK------DFEKMKDFMERMVNQ 201 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYA 262 D ++ GL+ FSS + F L + + ++ T + L + Sbjct: 202 SNIGADE---IQIGLLQFSSNPQEEFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALNFT 258 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ + +Y+I +TDG + ++L + R I++AI Sbjct: 259 LPFFDSSRGGRPRVH-------QYLIVITDGVSQDNVAPPAKAL------RDRNIIIFAI 305 Query: 323 GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 GV Q L+ D+ + +N L I E+ Sbjct: 306 GVGNVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEV 345 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 67/164 (40%), Gaps = 20/164 (12%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPG 258 M ++IK + VR G+V +S KI+ F L + + I+ + G T + Sbjct: 14 MKEVIKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKA 73 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L + D +Y+I +TDG+++ P + + L + G Sbjct: 74 LSKMVPVFQNTAR---------IDVARYLIVITDGQSTDPVAEAAQGL------RDIGVN 118 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 +YAIGV+ +A L+ AS F+ + L + +++ Sbjct: 119 IYAIGVR-DANTTELEEIASKKMFF-IYEFDSLKSIHQEVIRDI 160 >gi|55251329|emb|CAH69127.1| novel protein similar to vertebrate matrilin 3 (MATN3) [Danio rerio] Length = 454 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 77/216 (35%), Gaps = 27/216 (12%) Query: 151 APLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 P + S+ LD++ ++D S S+ + + +M+D + Sbjct: 44 LPHRTLNPAATDSQCRSRPLDLVFIIDSSRSVRP------GEFEKVKIFLADMVDTLDVG 97 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKI 266 PD R +V ++S + F L I++ I R+ + T + ++ A ++ Sbjct: 98 PDA---TRVAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEA 154 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 F K +K K I +TDG + + A+ G +YA+GV Sbjct: 155 FTEKSGARPKSKN---ISKVAIIVTDGRPQDQVEEVSAA------ARASGIEIYAVGVDR 205 Query: 327 EAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 ++P D + V+ KL F Sbjct: 206 ADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|332827795|gb|EGK00530.1| hypothetical protein HMPREF9455_03173 [Dysgonomonas gadei ATCC BAA-286] Length = 603 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 71/198 (35%), Gaps = 21/198 (10%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +I + + + + ++DVS SM+ KL + S++ +++ ++ I V VV +G Sbjct: 233 EIDTDNLPATNFVFLIDVSGSMDWD-----GKLDLVKSSMKLLVNNLRPIDRVAIVVYAG 287 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 V I E +N L G +T G+ AY AKE L Sbjct: 288 AAG----QVLPSTPGSEKSKILESLNGLTAGGSTAGGEGIVLAYKI---AKENLIEGGNN 340 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNC-- 336 II TDG+ + N ++ G + +G D ++ Sbjct: 341 R------IILCTDGDFNVGVSSNDGLEKLIENERKSGVFLSILGYGMGNYKDDKMQTLAQ 394 Query: 337 ASPDRFYSVQNSRKLHDA 354 A + N ++ + Sbjct: 395 AGNGNHAYIDNMQEANKV 412 >gi|332256817|ref|XP_003277513.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Nomascus leucogenys] Length = 3172 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 69/195 (35%), Gaps = 22/195 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S ++ + + + + D++KS+ N LV F Sbjct: 30 KNGAAADIIFLVDSSWTIGEEHFQLVREF---------LYDVVKSLAVGENDFHFALVQF 80 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I+ + + T T I + ++ D Sbjct: 81 NGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTG---KGLEYIMQSHLTKAAGSRAGD 137 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + I+ LTDG + E K V++IGV+ + + P Sbjct: 138 GVPQVIVVLTDGHSKDGLALPSA------ELKSADVNVFSIGVEDADEGALKEIASEPLN 191 Query: 340 DRFYSVQNSRKLHDA 354 ++++N LHD Sbjct: 192 MHVFNLENFTSLHDI 206 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 50/310 (16%), Positives = 105/310 (33%), Gaps = 32/310 (10%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 515 LDGSALYTGSALDFVRNNLFTXSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 572 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ + Sbjct: 573 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHA 632 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 633 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDVGNDNIRVGLVQF 680 Query: 224 SSKIVQTFPL-AWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S V F L + + I + +L + G +Y E + H Sbjct: 681 SDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVHANHFTEAGGSRIREHV 740 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + ++ LT G++ L N R G + + +G + + +P Sbjct: 741 P--QLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSL 792 Query: 342 FYSVQNSRKL 351 Y + + L Sbjct: 793 VYLMDDFSSL 802 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1528 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1587 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1588 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPSPSRPEKKKA-DIVFLLDGS-- 1644 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1645 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1697 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1698 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1755 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1756 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1804 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 262 LINLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 321 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 322 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 371 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 372 FSFGLGAQAASRAELQHIATDDN 394 >gi|313232459|emb|CBY24127.1| unnamed protein product [Oikopleura dioica] Length = 1632 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 42/200 (21%), Positives = 75/200 (37%), Gaps = 36/200 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+ +V+D S S+ +K V + ++ +I N V+ GL +FS Sbjct: 1398 GRLDIQIVIDTSGSLT----SAPNKDQVLMNFTNNLANMYDTI----NQVKIGLTSFSES 1449 Query: 227 IVQTFPLAW-GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 V PL + +Q+ ++ + + + T T G+E A N + D Sbjct: 1450 SVLEMPLDFYNQLELQDGVSNMTWQGSFTNITSGVETALNDM-----------DTSDAVD 1498 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-- 342 +I +TDG + + ++AK G + A+G + A SP+ + Sbjct: 1499 DVMILITDGFQ---STNTTLMFQMIDQAKADGVRLIALGFFGDFAFY------SPNLYLM 1549 Query: 343 ----YSVQNSRKLHDAFLRI 358 Y N +L I Sbjct: 1550 TNEVYHAANYAELLAIDNTI 1569 >gi|302555134|ref|ZP_07307476.1| secreted protein [Streptomyces viridochromogenes DSM 40736] gi|302472752|gb|EFL35845.1| secreted protein [Streptomyces viridochromogenes DSM 40736] Length = 415 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 74/211 (35%), Gaps = 31/211 (14%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + + +VLDVS SM G ++ A ++ E+LD + +V + Sbjct: 18 GAAAGEPTGQDAPKVDLVLDVSGSMRARDIDGQSRMAAAKQAFNEVLDA--TPEEVELGI 75 Query: 217 RSGLVTF-----------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 R+ + ++++ PL + + L T P L A Sbjct: 76 RTLGADYPGDDRKTGCKDTAQLYPVGPL--DRTEAKTAVATLTPTGWTPIGPALLKAAGD 133 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + K I+ ++DGE++ +D E AK G + +G+ Sbjct: 134 LDGGNG------------SKRIVLISDGEDTCAPLDPCEVAREI-AAKGIGLTIDTLGLV 180 Query: 326 AEAADQFLKNC---ASPDRFYSVQNSRKLHD 353 A +C A+ + SV++ +L D Sbjct: 181 PTAKLSRQLSCIAEATGGTYTSVEHQDELTD 211 >gi|261415412|ref|YP_003249095.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371868|gb|ACX74613.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326806|gb|ADL26007.1| BatB protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 342 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 55/168 (32%), Gaps = 25/168 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G D++++ D+SLSM ++L + I L+ + R GLV FS Sbjct: 85 ERRGQDLVLLQDISLSMLAEDVKP-NRLVRSRHEISAFLESLTG-------DRVGLVAFS 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + PL +Q + L G T + ++ +H Sbjct: 137 GEAQVMVPLTLDYGTVQMVLRELNPGWLMPGTNLESAIRKGMTLFKNSGGASQHS----- 191 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +I ++DGE N G +Y IG+ + Sbjct: 192 ----VMILMSDGEELEAAAVNAAKEAA-----EFGIKIYTIGIGSREG 230 >gi|307292639|ref|ZP_07572485.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] gi|306880705|gb|EFN11921.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] Length = 540 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 50/167 (29%), Gaps = 33/167 (19%) Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK--EKLEHIAKGHDDYKKYIIFLT 291 A + N I T GL +A + +Y+IF+T Sbjct: 372 ATDFNNYFTFNNGFIPNGGTWLDVGLLWAARLLSRDGLWSTENDELYHTYPVSRYVIFMT 431 Query: 292 DGENSSPNIDNK------------------------ESLFYCNEAKRRGAIVYAIGVQAE 327 DG S + + L C K +Y I A Sbjct: 432 DGYMSIGSSNYAAYAQEDYWRRVAAAGASKNDNHYARMLMTCTAIKNMDTKIYTISFGAG 491 Query: 328 AA-DQFLKNCAS------PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + D L NC+S P+ Y +S L+ F IG+ + R+ Sbjct: 492 STLDSNLINCSSSTNTTNPEFAYKADSSSDLNRVFRDIGENIGSLRL 538 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 62/180 (34%), Gaps = 30/180 (16%) Query: 6 IRNFFYNCKGS-ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 + + N G+ ++I+ A +LP+ +V G ++ S + K +L D +L + Sbjct: 12 LMRLYRNQAGNTLAIVAAAMLPLAGMVGG-ALDISRGYLAKTRLQQACDAGVLAGRKVMG 70 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 + ++ + + R + N + + + +T ++ + + Sbjct: 71 SSGVLSDS---------------VRDEVRKYVSFNYPSGYLGSTLATTDINPTLGSNDQ- 114 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD--IGLDMMMVLDVSLSMN 182 + T A S + + +D+++VLD + SM Sbjct: 115 ----------IALSLTTAIPTAVMRLFGRNNMSITASCTARNDYSNIDIVLVLDTTGSMA 164 >gi|266620637|ref|ZP_06113572.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] gi|288867752|gb|EFD00051.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] Length = 2963 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 72/227 (31%), Gaps = 47/227 (20%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-- 225 G D+++V+D S SM A I + + + R +V F S Sbjct: 559 GSDVILVIDSSGSMEGEKWS--TAKTAAKGFIDNLYQNKDGVV---SDDRIAIVDFDSSA 613 Query: 226 --------------KIVQTFPLAWGVQHIQEKINRLIF------GSTTKSTPGLEYAYNK 265 K+ + ++ + + T L+ A Sbjct: 614 KAYPGTNSGSETFLKVDDKITIKNKTYSAKDYLKSYVLDSQMKDTGGTDYNKALQTA-QS 672 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-----ESLFYCNEAKRRGAIVY 320 + + + A YI+F++DGE + D + Y E K G +Y Sbjct: 673 VINNRRDSSRPA--------YIVFMSDGEPNGY-WDWLTYRYYDGQKYATELKSDGVTIY 723 Query: 321 AIGVQAEAA--DQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEM 362 ++G+ + ++F+ AS ++ + L + I + Sbjct: 724 SLGLNIGSTNFNKFIVPLASDPTSTYAKNIVKTSDLVGIYDAIASSI 770 >gi|158891|gb|AAA29076.1| em100 gene is homologous the Eimeria tenella gene et100 (accession number M73495) encoding the microneme protein Etp100 [Eimeria maxima] Length = 724 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 79/199 (39%), Gaps = 24/199 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+M+V+D S S+ + + ++P + VR GLVTF + V Sbjct: 46 LDVMLVVDESGSIGTSNYGKVRSFIS---------NFAGTMPLSPDDVRVGLVTFGTSAV 96 Query: 229 QTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L+ + + +T + GL A +F ++ G D+ Sbjct: 97 TRWDLSDSRAQNADLLAAAAKKLPYAAGSTYTHLGLAKAEEILFSFQKG------GRDNA 150 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 K I+ +TDG +S + ++L + + RG I+ +GV ++ A D Sbjct: 151 PKMILVMTDGASSRRS----QTLSAAEKLRNRGVIIVVLGVGTGVNSAECRSIAGCDTSD 206 Query: 344 SVQNSRKLHDAFLRIGKEM 362 +V+ R L + + ++ Sbjct: 207 TVECPRYLQSNWGGVSSQI 225 >gi|77567855|gb|AAI07522.1| Matn3b protein [Danio rerio] Length = 299 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 42/215 (19%), Positives = 81/215 (37%), Gaps = 30/215 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + I + + LD++ ++D S S+ + + EM+D + Sbjct: 53 SINIGAPAEPC--KSRPLDLVFIIDSSRSVRPA------EFEKVKIFLSEMVDSLDI--- 101 Query: 212 VNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFD 268 ++ R LV ++S + F L + +++ +R+ + T + ++ A ++F Sbjct: 102 GSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFT 161 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KG K I +TDG + + A+ G +YA+GV A Sbjct: 162 ENAGARPLKKG---IGKVAIIVTDGRPQDKVEEVSAA------ARASGIEIYAVGVD-RA 211 Query: 329 ADQFLKNCAS---PDRFYSVQN---SRKLHDAFLR 357 + LK AS D + V+ KL F Sbjct: 212 EVRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 246 >gi|224078385|ref|XP_002194338.1| PREDICTED: collagen, type XX, alpha 1 [Taeniopygia guttata] Length = 1505 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 69/179 (38%), Gaps = 24/179 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + +D+++++D S S+ + + + ++ D +R GL + Sbjct: 255 DTPAMIDLVLLVDGSWSIGRN------NFKLIKEFLSNLISPFSIAEDK---IRVGLSQY 305 Query: 224 SSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 SS + L+ + + E + L + G T + L + + L+ A Sbjct: 306 SSDPRTEWELSAYSTREQVLEAVRNLRYKGGNTFTGLALTHVLE------QNLKPDAGAR 359 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +K +I LTDG++ ++ K G ++AIGV+ + + + P Sbjct: 360 LEAEKLVILLTDGKSQD------DANLAAQTLKNLGIEIFAIGVKNADEAELRQVASEP 412 >gi|149034209|gb|EDL88979.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Rattus norvegicus] Length = 706 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 43/310 (13%), Positives = 95/310 (30%), Gaps = 50/310 (16%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI--IIDDQ 121 L E+ ++ N + K F+ L + +Q+ + ++ +D Sbjct: 187 LETESTFMTQELANALTTSQNKTKAHIQFKPTLSQQRKSQNEQDTVLDGDFTVRYDVDRS 246 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 + + Y F+ P + +L V+D S SM Sbjct: 247 STGGTIQIENGY---FVHHFAPEDLPTMAKNVLF------------------VIDKSGSM 285 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWG 236 K+ ++ ++L + + N ++ FS + Q L Sbjct: 286 AGK------KIQQTREALIKILKDLSTQDQFN------IIVFSGEANQWEQLLVQATEEN 333 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + +++ T + A + + + +K II LTDGE + Sbjct: 334 LNRAVDYASKIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVS----LIILLTDGEPT 389 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR------FYSVQNSRK 350 + K EA ++ +G + FL+ A + + ++ + Sbjct: 390 VGETNPKIIQKNTQEAINGRYSLFCLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSALQ 449 Query: 351 LHDAFLRIGK 360 L D + + Sbjct: 450 LQDFYQEVAN 459 >gi|302557483|ref|ZP_07309825.1| secreted protein [Streptomyces griseoflavus Tu4000] gi|302475101|gb|EFL38194.1| secreted protein [Streptomyces griseoflavus Tu4000] Length = 417 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 76/211 (36%), Gaps = 31/211 (14%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + + ++LDVS SM G ++ A ++ E+LD + +V + Sbjct: 20 GAAAGEPTGQSAPKVNLLLDVSGSMRAKDIDGQSRMAAAKQAFNEVLDA--TPEEVELGI 77 Query: 217 RSGLVTF-----------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 R+ + ++++ PL + + L T P L A + Sbjct: 78 RTLGANYPGDDRKEGCKDTAQLYPVGPL--NRTEAKTAVATLAPTGWTPIGPALLKAADD 135 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + D K I+ ++DGE++ +D E A+ G + +G+ Sbjct: 136 L------------DGGDGSKRIVLISDGEDTCAPLDPCEVAREI-AARGIGLTIDTLGLV 182 Query: 326 AEAADQFLKNC---ASPDRFYSVQNSRKLHD 353 A +C A+ + SV++ +L D Sbjct: 183 PNAKLSRQLSCIAEATGGTYTSVEHQDELTD 213 >gi|260834334|ref|XP_002612166.1| hypothetical protein BRAFLDRAFT_88905 [Branchiostoma floridae] gi|229297540|gb|EEN68175.1| hypothetical protein BRAFLDRAFT_88905 [Branchiostoma floridae] Length = 815 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 71/207 (34%), Gaps = 29/207 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 SS + +D+ +LD S S+ + +M++ P R G Sbjct: 300 SGSSTCEAPVDLFFLLDGSGSVKAA------NFAKVKQFAVDMVNSFDVSPAA---TRVG 350 Query: 220 LVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ +S++ F L IN + + G T++ L+Y + Sbjct: 351 VLQYSNRNTLVFNLGNKVNKPTTVSAINSISYQGGGTRTGAALQY----------IRGNA 400 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A + K +I LTDG++ ++L VYAIGV ++ L+ Sbjct: 401 AWRRGNVPKVLIVLTDGKSEDSVSGPSQNLV------SDRVEVYAIGVSNFDHEELLQIV 454 Query: 337 -ASPDRFYSVQNSRKLHDAFLRIGKEM 362 + + L I +++ Sbjct: 455 NNKQSNVIELNDFNALATKIDEIAQDV 481 >gi|218249312|ref|YP_002374701.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223889245|ref|ZP_03623833.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226320920|ref|ZP_03796471.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|226321491|ref|ZP_03797017.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|218164500|gb|ACK74561.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223885278|gb|EEF56380.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226232680|gb|EEH31433.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|226233692|gb|EEH32422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|312147800|gb|ADQ30459.1| von Willebrand factor type A domain protein [Borrelia burgdorferi JD1] Length = 333 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 45/239 (18%), Positives = 88/239 (36%), Gaps = 31/239 (12%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + + S G D+++VLD+S SM Sbjct: 49 KDYRLNLMYFFTYSFLYLAAMVMVFALAGPSVSKKKMIHLS---AGADIVIVLDISPSMG 105 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + ++I+S GLV F+ P+ + + Sbjct: 106 AVEFSSKNRLEFSK-------ELIRSFISQRENDNIGLVAFAKDASIVVPITTDREFFNK 158 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 159 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 207 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP----DRFYSVQNSRKLHDA 354 I + + N A+ +Y+IG+ + S F V + L + Sbjct: 208 IYKDQVI---NLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVEI 263 >gi|296208409|ref|XP_002751081.1| PREDICTED: calcium-activated chloride channel regulator 4 [Callithrix jacchus] Length = 931 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 75/197 (38%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM H ++++ A + + V N G+V F+S Sbjct: 307 VCLVLDTSGSMLSH--NRLNRMNQAAKQFL--------MQTVENGSWVGMVRFNSTATIL 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + EK+ T G++ A+ I + +L+ Sbjct: 357 NKLIQIISSNERNTLLEKL-PTRAQGGTSICSGIKSAFQVIGELYSQLDGSE-------- 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 I+ LTDGE+++ + +E K+ GAIV+ I + +A + N Y Sbjct: 408 -IVLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALGKDADKAVIEMSNITGGSHLY 459 Query: 344 SVQNSRK--LHDAFLRI 358 + + L DAF + Sbjct: 460 ASDEAENNGLIDAFGAL 476 >gi|271964702|ref|YP_003338898.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] gi|270507877|gb|ACZ86155.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] Length = 514 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 74/235 (31%), Gaps = 33/235 (14%) Query: 136 PFIFCTFPWC-ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD-KLG 193 P + FP A + V++ +++ + VLD S SM + L Sbjct: 297 PLLELPFPNRRAAADGLITAYLDEVRVPARA------LFVLDTSGSMEGERIEALRQALV 350 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-------GVQHIQEKINR 246 T + NV+ ++ F P + I+ R Sbjct: 351 TLTGADTSASGTFSRFRSRENVI---MIPFGGSAGLPQPFILPERDPQPALAQIRAYAER 407 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKE 304 L T GL AY + DA I+ +TDGEN+ + D + Sbjct: 408 LRAAGGTAIYDGLRAAYGQAGDAGRDHYTS----------IVLMTDGENTDGSSYEDFEA 457 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRI 358 EA+R+ + + AD+ + + + + + L AF I Sbjct: 458 YYRSLPEARRQ-VRTFVVLFGESDADEMERIATLTRGAVFDAR-TGSLASAFKEI 510 >gi|126732236|ref|ZP_01748037.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] gi|126707318|gb|EBA06383.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] Length = 710 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 65/163 (39%), Gaps = 18/163 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ ++++ ++D S SM D +KL + +S R ML + V+ V +G Sbjct: 345 PEARPPMNLVFLIDTSGSMQDA-----NKLPLLKQSFRLMLGQLGEEDMVSIVTYAGS-- 397 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + ++++ + Q I + ++RL G +T GL+ AY + Sbjct: 398 -AGRVLEPTKAS-DRQTILDALDRLEAGGSTAGQAGLQQAYATATEMARDGAVSR----- 450 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 +I TDG+ + D + Y + G + +G Sbjct: 451 ----VILATDGDFNVGISDPDDMKDYIETQRGTGTYLSVLGFG 489 >gi|119494080|ref|ZP_01624623.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] gi|119452182|gb|EAW33385.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] Length = 608 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 74/208 (35%), Gaps = 25/208 (12%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L + + + L++ +V+D S SM L A ++ +++++ + + V Sbjct: 24 LNFNAQTQAETSPRRPLNLSLVIDRSGSMAGQ------SLRYAIKAAQQLVESLTADDIV 77 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + +V + + I ++I R+ G T + G + D Sbjct: 78 S------VVIYDDQPETILTPQTVEDKAAICKQIGRIRAGGCTNLSGG----WLMGCDCV 127 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + + + ++ LTDG+ + D K G I +G + + Sbjct: 128 KSRQTSDRLNR-----VLLLTDGQANMGITDPKVITKTAQNQAETGIITTTLGFGSYFNE 182 Query: 331 QFLKNC--ASPDRFYSVQNSRKLHDAFL 356 L + A+ FY +Q+ + F Sbjct: 183 DLLISMADAAGGNFYFIQSPDDVAQVFR 210 >gi|294139879|ref|YP_003555857.1| hypothetical protein SVI_1108 [Shewanella violacea DSS12] gi|293326348|dbj|BAJ01079.1| hypothetical protein [Shewanella violacea DSS12] Length = 405 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 55/401 (13%), Positives = 121/401 (30%), Gaps = 51/401 (12%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 Y +G IS++ I LP I ++ + I + + D + L ++ Sbjct: 8 YRQRGDISLMFVICLPFILTMIAVSILLAMYLLTVTRAGQASDAASLACGYSQRADQDLL 67 Query: 71 NGK--------KQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 G + + I + R N + T +S+ D Sbjct: 68 VGILDYYRPGFVVHDGEALVSIDGKNRCSIEATYRFNPTMMALLPESARTHVSLSSDTGA 127 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPL-------LITSSVKISSKSDIGLDMMMVL 175 + + + M + S+ P + + G+ +V Sbjct: 128 TSHLVINSTPLPMDLALVLDISSSMSAQLPQLKLIINGALEEIRQQDPNEVGGVRFSLV- 186 Query: 176 DVSLSMNDHFGPGMDKLGV----ATRSIREMLDIIKSIPDVNNVVRS------GLVTFSS 225 F G+ L + + +D + + R+ + Sbjct: 187 --------PFETGVGVLNAPWMPKSAAKVTCVDGLSYGQHSVDYARTVDDLAEPAANLNI 238 Query: 226 KIVQTF-------------PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA-KE 271 K V PL + ++++++ L+ TT S GL + + +E Sbjct: 239 KSVFASQWLDACSMDATILPLTQDLNLVKQRVDALVTSGTTSSYQGLIWGVRTLLPQWQE 298 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQA-EAA 329 + + + ++ TDG + ++D+ C + + + IG + Sbjct: 299 EWQIPPVESPALIQRLVLFTDGADQGFHLDDLIEQGLCRVIQDKHHIEMSFIGFGVSDRR 358 Query: 330 DQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 Q + CA + Y QN+++L F + K ++ Sbjct: 359 LQQFRECAGDKGKVYDAQNTQELEAFFREALQTDTKASLVL 399 >gi|126173282|ref|YP_001049431.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125996487|gb|ABN60562.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 642 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 75/216 (34%), Gaps = 30/216 (13%) Query: 163 SKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 KS +G ++ +LDVS SM DKL + +++ + + + V+ VV +G Sbjct: 231 PKSQLGASNLVFLLDVSGSMA-----SADKLPLLQTALKLLTAQLSAQDKVSIVVYAGAA 285 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +V Q + + +L G + G+ AY +H Sbjct: 286 G----VVLDGVSGNDTQTLTYALEQLSAGGSINGGQGITQAYQL------AKKHFIPNGI 335 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--S 338 + +I TDG+ + D + + + K G + +G DQ ++ A Sbjct: 336 NR---VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKG 392 Query: 339 PDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 + + +L I K++ Q Sbjct: 393 NGNYAYIDTLNEARKVLVDELSSTLFTIAKDVKVQV 428 >gi|78189842|ref|YP_380180.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172041|gb|ABB29137.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 334 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 66/201 (32%), Gaps = 24/201 (11%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSS---VKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 ++ +P+ F + S L + G D++ +LDVS SM Sbjct: 49 QHLLPYAFRSLMLFVASGLLLLALAEPRWCGGTKPVLRHGADVLFILDVSRSMQATDVAP 108 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN--- 245 + I +I R GL+ F++ + PL +N Sbjct: 109 NRLMRAK--------QEIAAISQNVQGGRRGLLIFAASPLLHCPLTTDRDGFATLLNMAA 160 Query: 246 -RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 LI T+ P A A E +G + I+ L+DGE+ N+ Sbjct: 161 PELIEEQGTRLQPAFALASTIFDVANESNAASTRG----VQVIVLLSDGEDHDSNVQ--- 213 Query: 305 SLFYCNEAKRRGAIVYAIGVQ 325 + ++ ++ IGV Sbjct: 214 --RAAQQLAKQSVQLFVIGVG 232 >gi|330508298|ref|YP_004384726.1| hypothetical protein MCON_2454 [Methanosaeta concilii GP-6] gi|328929106|gb|AEB68908.1| conserved hypothetical protein, extracellular or membrane bound [Methanosaeta concilii GP-6] Length = 726 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 41/237 (17%), Positives = 79/237 (33%), Gaps = 33/237 (13%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 +S ++ +F LL VK++ + + D+++VLD S SM Sbjct: 236 TISSEKIGLNLLSFKEEGQDGFFLLLAAPDVKVNEEEIVVKDIILVLDTSGSMQGE---- 291 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP---LAWGVQHIQEKIN 245 K+ A + R +LD +N + R +V+F++ P A ++ ++ Sbjct: 292 --KMDQAKEAARYVLD------HLNPLDRFAIVSFATTTRSFSPSLEPAAQADKGKDFLD 343 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 RL +T + A + + +IFLTDG + Sbjct: 344 RLEAMGSTDINRAMIEAVGLAEEVRPTT-------------LIFLTDGLPTEGVTVTGAI 390 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-----SPDRFYSVQNSRKLHDAFLR 357 L +++ GV + L + + + + AF R Sbjct: 391 LDNVAREAPDNVRIFSFGVGDDVDTDLLDQISMDNGGASTYVRPGEEIDEEVSAFYR 447 >gi|282864727|ref|ZP_06273782.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282560666|gb|EFB66213.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 424 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 76/200 (38%), Gaps = 41/200 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------- 223 + +VLDVS SM G ++ A ++ ++LD + +V +R+ + Sbjct: 41 VELVLDVSGSMRTRDIDGQSRMSAAKQAFNDVLDAV--PEEVQLGIRTLGANYPGDDRKV 98 Query: 224 ----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + ++ PL + + L T P L A + + Sbjct: 99 GCKDTKQLYPVGPL--DRTEAKTAVATLAPTGWTPIGPALLGAADDL------------D 144 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR---RGA--IVYAIGVQAEAADQFLK 334 D + I+ +TDGE++ +D C A+ RG ++ +G+ A + Sbjct: 145 GGDATRRIVLITDGEDTCGPLDP------CEVARDIAARGIHLVIDTLGLVPNAKIRQQL 198 Query: 335 NC---ASPDRFYSVQNSRKL 351 C A+ + +VQ++ +L Sbjct: 199 TCIAEATGGTYTAVQHADEL 218 >gi|225468694|ref|XP_002269894.1| PREDICTED: hypothetical protein, partial [Vitis vinifera] Length = 585 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 69/188 (36%), Gaps = 28/188 (14%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + T+S ++ +D++ VLDV M KL + R++R ++ + S Sbjct: 168 TINTTTSSLLNPARRAPIDLVTVLDVGGGMTGA------KLQMMKRAMRLVISSLSSTD- 220 Query: 212 VNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R +V FS+ + PL G + + I LI G T + L+ A + Sbjct: 221 -----RLSIVAFSASSKRLMPLKRMTTTGRRSARRIIESLIAGQGTSAGEALKKASKVLE 275 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGEN---SSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 D +E+ + I+ L+DG+N SS + + + + + Sbjct: 276 DRRERNPVAS---------IMLLSDGQNERVSSKSTNPNRPSNVVSSTRYAHLEIPVHAF 326 Query: 325 QAEAADQF 332 + Sbjct: 327 GFGENGAY 334 >gi|225438159|ref|XP_002262605.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 830 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 69/188 (36%), Gaps = 28/188 (14%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + T+S ++ +D++ VLDV M KL + R++R ++ + S Sbjct: 413 TINTTTSSLLNPARRAPIDLVTVLDVGGGMTGA------KLQMMKRAMRLVISSLSSTD- 465 Query: 212 VNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 R +V FS+ + PL G + + I LI G T + L+ A + Sbjct: 466 -----RLSIVAFSASSKRLMPLKRMTTTGRRSARRIIESLIAGQGTSAGEALKKASKVLE 520 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGEN---SSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 D +E+ + I+ L+DG+N SS + + + + + Sbjct: 521 DRRERNPVAS---------IMLLSDGQNERVSSKSTNPNRPSNVVSSTRYAHLEIPVHAF 571 Query: 325 QAEAADQF 332 + Sbjct: 572 GFGENGAY 579 >gi|332253805|ref|XP_003276022.1| PREDICTED: matrilin-3 [Nomascus leucogenys] Length = 488 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 68/212 (32%), Gaps = 20/212 (9%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + + + G+ LD+ ++ + + ++D + P Sbjct: 63 PASGASEPGRARGAGVCKSRPLDLVFIIDSSRSVRPPEFTKVKTFVSRIIDTLDIGP--- 119 Query: 214 NVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 R +V ++S + F L Q +++ + R+ + T S ++ A ++ F Sbjct: 120 ADTRVAVVNYASTVKIEFQLQAYTDKQSLRQAVGRITPLSTGTMSGLAIQTAMDEAFT-- 177 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + K I +TDG + A+ G +YA+GV Sbjct: 178 -VEAGARDPSSNIPKVAIIVTDGRPQD------QVNEVAARARASGIELYAVGVDRADLK 230 Query: 331 QFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 + P + + V+ KL F Sbjct: 231 SLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 262 >gi|229520644|ref|ZP_04410068.1| hypothetical protein VIF_001170 [Vibrio cholerae TM 11079-80] gi|229342468|gb|EEO07462.1| hypothetical protein VIF_001170 [Vibrio cholerae TM 11079-80] Length = 1495 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 50/314 (15%), Positives = 93/314 (29%), Gaps = 28/314 (8%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNI 108 +++ + N + G N S + + + L + + Sbjct: 754 QDTTNNTSADGQLNLSNVAQLSMGIPTGNYTSNGAAISWVLSADKQTLTGSAGGNKVVEF 813 Query: 109 ERSTSLSII------IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV--- 159 ++ +D +K + + + + V Sbjct: 814 TLDNQGNVSSTLHAPVDHANKSGEDTLTINIPLEAKNAAGAIGTGKVTLVIEDDAPVAKE 873 Query: 160 ---KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S+ G ++ ++LDVS SM G G +L V S + +LD S + + +V Sbjct: 874 VFHVAESELKQGANVQLILDVSGSMAWGAGNGKTRLEVMQESAKLLLDQYSSNSNGHVLV 933 Query: 217 RSGLVTFSSKIVQTFPLA--W-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + LV F+ W V+ + I+ L T +E A + Sbjct: 934 Q--LVVFNHAASIKTSDNSYWMSVKDAKAYIDDLSARWQTDYDHAIELAEKFWSGNQNVS 991 Query: 274 EHIAKGHDDYKKYIIFLTDG-----ENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAE 327 + Y FL+DG +N +PN + L + N K A G+ Sbjct: 992 PLSGATNVSY-----FLSDGRPEGDDNGNPNTIEDDELESWINHLKANQITALAYGMGNN 1046 Query: 328 AADQFLKNCASPDR 341 L A Sbjct: 1047 VPQGELDKVAYDGH 1060 >gi|239616825|ref|YP_002940147.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239505656|gb|ACR79143.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 730 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 65/173 (37%), Gaps = 27/173 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD+S SM+ K+ A ++ ++L ++ + R ++TF++++ Sbjct: 275 DIVFILDISGSMSGQ------KIEKAKLALLQVLQML------HEGDRFSIITFNNEVNN 322 Query: 230 TF----PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P + + +++ G T L + + K Sbjct: 323 LTERLLPFS-DRTEWYPAVKQIMAGGMTNIHDALLEGIEVLGTQSTDDRY---------K 372 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNE-AKRRGAIVYAIGVQAEAADQFLKNCA 337 ++FLTDG + D + + AK R ++ GV + + L A Sbjct: 373 VVLFLTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVGYDVNAELLDELA 425 >gi|56797855|emb|CAG27023.1| matrilin-3a [Danio rerio] Length = 460 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 76/216 (35%), Gaps = 27/216 (12%) Query: 151 APLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 P + S+ LD++ ++D S S+ + + +M+D + Sbjct: 44 LPHRTLNPAATDSQCRSRPLDLVFIIDSSRSVRP------GEFEKVKIFLADMVDTLDVG 97 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKI 266 PD R +V ++S + L I++ I R+ + T + ++ A ++ Sbjct: 98 PDA---TRVAVVNYASTVKIESLLKSHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEA 154 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 F K +K K I +TDG + + A+ G +YA+GV Sbjct: 155 FTEKSGARPKSKN---ISKVAIIVTDGRPQDQVEEVSAA------ARASGIEIYAVGVDR 205 Query: 327 EAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 ++P D + V+ KL F Sbjct: 206 ADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|242034241|ref|XP_002464515.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] gi|241918369|gb|EER91513.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] Length = 704 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 40/138 (28%), Positives = 60/138 (43%), Gaps = 25/138 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + LD++ VLDVS SM+ GP + L A R + E L+ R +V FS Sbjct: 230 TRAPLDLVTVLDVSRSMS---GPKLALLKRAMRFVIENLEPSD---------RLSVVAFS 277 Query: 225 SKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S + FPL A+G Q Q+ ++ L+ T GL A + D + + + Sbjct: 278 SSACRLFPLRKMTAFGQQQSQQAVDSLVADGGTNIAEGLRKAARVVEDRQARNPVCS--- 334 Query: 281 DDYKKYIIFLTDGENSSP 298 II L+DG +S Sbjct: 335 ------IILLSDGVDSHN 346 >gi|282877522|ref|ZP_06286340.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300346|gb|EFA92697.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 345 Score = 63.7 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 66/174 (37%), Gaps = 20/174 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 KIS++ G++ ++ LD+S SM +L + + ++D + + Sbjct: 80 SKISNEKRRGIETIIALDISNSMMAEDVVP-SRLAKSKLMVENLVDNFTN-------DKI 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GLV F+ + P+ + ++ +F + N I A Sbjct: 132 GLVVFAGEAFVQLPIT------SDYVSAKMFLQNADPSLITTQGTN-IAQAIRLSMSSFT 184 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 D + II +TDGE+ E+L EA+++G VY +GV Sbjct: 185 QQDKVGRAIILITDGEDHEG-----EALEAAKEARKKGINVYILGVGETKGAPI 233 >gi|281340555|gb|EFB16139.1| hypothetical protein PANDA_003424 [Ailuropoda melanoleuca] Length = 191 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/147 (26%), Positives = 59/147 (40%), Gaps = 14/147 (9%) Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLE 260 D++K + VR +T+S+ + I+E + +L T GL Sbjct: 25 DVVKKFDNPK--VRISFITYSTDGHTLMKITSDKNEIRENLAKLQNVVPSGATHMQEGLR 82 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A E++E G I+ LTDG + +E+ E++R GA VY Sbjct: 83 KA-------NEQIEQENAGEKKAPIVILALTDG--TLLPFPFEETKMEAEESRRLGATVY 133 Query: 321 AIGVQAEAADQFLKNCASPDRFYSVQN 347 IGV+ DQ L SPD + V N Sbjct: 134 CIGVKDYRKDQLLDIADSPDHMFGVDN 160 >gi|330995093|ref|ZP_08319010.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576669|gb|EGG58172.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 340 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 61/174 (35%), Gaps = 20/174 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ ++ G++ ++ +D+S SM +L + + ++D + + Sbjct: 80 TKMDTRKRQGIEAIIAMDISNSMMAEDVTP-SRLEKSKMLVSNIVDKMTD-------DKI 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+ ++ + P+ + + + T ++ A + Sbjct: 132 GLIVYAGEAYTQLPITSDYVSAKMFLETINPSMITTQGTDIKQAID-------LAMKSFT 184 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + D K I +TDGE++ + A +G VY +GV + Sbjct: 185 PNQDVSKAIFVITDGEDNEG-----GVVEMAKAAAEKGIKVYVLGVGSPQGAPI 233 >gi|146338996|ref|YP_001204044.1| hypothetical protein BRADO1945 [Bradyrhizobium sp. ORS278] gi|146191802|emb|CAL75807.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 432 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 58/422 (13%), Positives = 120/422 (28%), Gaps = 73/422 (17%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F N G+I+++ AI L I +G I+ S KAKL +D +LL Sbjct: 11 LSRFRRNDSGNIAVIFAIALLPILAFIGSAIDYSMAVRAKAKLSASIDAALLAATGYTAM 70 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + ++ K + + ++L N + DI + + +++ K Sbjct: 71 RGSSSDAKTAATNMFNGQMS-------AHKLTSNSLSIDITDSVSARTVTGSATVVVKTS 123 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD------VSL 179 + M + + ++ + S M L + Sbjct: 124 FMYMFGYPTMTVSASSSASASFPTYMDFYVLVDNSPSQGLGATTADMTTLQNATTDTCAF 183 Query: 180 SMNDHFGPGMDK-------------------LGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + +D + K + V + + + D S V+N R + Sbjct: 184 ACHDTYTSSSKKTLQTNSYYDKAKKLGVTMRIDVVRSATQSLTDTATSSQIVSNQYRMAV 243 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN----------KIFDAK 270 + + + L + YN + Sbjct: 244 YSMGADC-GSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGTMSAMSG 302 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGE-------------NSSPNIDNKESLFYCNEAKRRGA 317 + +K++ F++DG S + C K RG Sbjct: 303 VIPTQGDGSSTNPQKWLFFVSDGVADYAYPTTCSKTTQSGGRCVEPLTTTTCTALKARGI 362 Query: 318 IV---YAIGVQAEAAD--------------QFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 + Y + + +K+CASP +Y V +S + A + + Sbjct: 363 KIAVLYTTYLAITSNGYYNTWVKPWRDSIGTIMKSCASPGYYYEVDSSGSIGSALTALFQ 422 Query: 361 EM 362 + Sbjct: 423 QA 424 >gi|66805993|ref|XP_636718.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] gi|60465117|gb|EAL63216.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] Length = 549 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 43/203 (21%), Positives = 81/203 (39%), Gaps = 15/203 (7%) Query: 172 MMVLDVSLSM-----NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 M+VLDVS SM N PG+ ++ + S ++ V +GLV F K Sbjct: 124 MIVLDVSCSMTAVAYNGSTKPGLLEMNRSEVSQALFQTMLDKYVSFEVPVVAGLVLFGQK 183 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 I F ++ +++ ++ +T+ + +A N+I + + + Sbjct: 184 IDTAFEISKNFDSFSQELGEVVANQGSTRLYEAIYHAANEIEKYRNNPKEKLAPDVCCR- 242 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 I LTDG+++S NI+ Y K I+ AI + + L A+ + Sbjct: 243 -IFLLTDGQDTS-NINPYNVYQY---LKPLNIILDAIPIGRDDNSTLSTLTK-ATGGSCF 296 Query: 344 SVQNSRKLHDAFLRIGKEMVKQR 366 ++++ + F R + QR Sbjct: 297 MANSTQEGVELFEREALLIPTQR 319 >gi|311254860|ref|XP_003125977.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Sus scrofa] Length = 874 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 77/210 (36%), Gaps = 37/210 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ ++L ++ + L I V N G+V F S Sbjct: 253 VCLVLDKSGSMSSS-----NRLNRMNQAAKYFLMQI-----VENGSWVGMVHFDSTASIR 302 Query: 231 FPL--AWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L G + + L T G+ A+ + ++ H Sbjct: 303 SDLIQITGSNERDKLLGSLPTTASGGTSICSGIRRAFEVV---RKLYSHTDGSE------ 353 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I+ LTDGE+++ +E K+ GAI++ I + A ++ + FY+ Sbjct: 354 IVLLTDGEDNTAGA-------CVDEVKQSGAIIHFIALGPSADKAVIEMSTATGGVHFYA 406 Query: 345 VQNSRK--LHDAFLRIG---KEMVKQRILY 369 + L DAF + ++ +Q + Sbjct: 407 TDEAENNGLIDAFGALASGNTDISQQSLQL 436 >gi|221044732|dbj|BAH14043.1| unnamed protein product [Homo sapiens] Length = 560 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 187 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 234 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 235 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 291 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 292 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 350 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 351 RIHEDSDSALQLQDFYQEVANPLLTAVT 378 >gi|325698104|gb|EGD39985.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK160] Length = 464 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 51/345 (14%), Positives = 119/345 (34%), Gaps = 62/345 (17%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVI---------ETSHKFFVKAKLHYILDHS 55 +R F + ++I+ ++ VI ++ + E S + ++ + Y +D + Sbjct: 3 QLRKGFTLAEMVMAIMLMSMIAVIIGIIFNTMFSSRELIEREASIQAEMRTSMQY-VDRT 61 Query: 56 LLYTATKILNQENGNNGKKQK----------NDFSYRIIKNIWQTDFRNELRENGFAQDI 105 + + + ++ G K+ + +++ +W ++ + + Sbjct: 62 IGKATSVFILDDSKFKGNKEGLTKEWSYIGLSSDGKKVLNYVWDKSKQDWKVSELGTKSL 121 Query: 106 NNIERSTSLSI---IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI- 161 +I+ D++ YNL+ +Y + ++ + + + K Sbjct: 122 YDIKLDLEFKTEGAYQDNRLISYNLT--GKYPDSNNKLSIDTAISALNTKQVFSKVAKGK 179 Query: 162 ----------SSKSDIGLDMMMVLDVSLSMND-------HFGPGMDKLGVATRSIREMLD 204 + + + + V D S SM+ ++ ++ + M+ Sbjct: 180 KGIALAYRNDPIEGQMNVAISFVFDKSGSMSWDLNGNNTNYWGPKSRMSILKDKATIMMR 239 Query: 205 IIKSIPDVNNVVRSGLVTFSS----KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 +K I +V+ LV+FS L G I+ IN L G T GL Sbjct: 240 DLKDIGNVS----VNLVSFSILGSYVQKDFSELDKGTTTIEASINALQTGGVTNPGDGLR 295 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 Y + + H KY++ LTDG ++ +D ++ Sbjct: 296 YGMMSL-----------QNHSAQLKYVVLLTDGIPNAYTVDTNDT 329 >gi|297668115|ref|XP_002812300.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-3-like [Pongo abelii] Length = 493 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 75/218 (34%), Gaps = 32/218 (14%) Query: 154 LITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ + + G LD++ ++D S S+ + + ++D + Sbjct: 68 PVSGASEPGRARGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRIIDTLD 121 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYN 264 P R +V ++S + F L Q +++ + R+ + T S ++ A + Sbjct: 122 IGP---ADTRVAVVNYASTVKIEFQLQTYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMD 178 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 +IF + K I +TDG + A+ G +YA+GV Sbjct: 179 EIFT---VEAGARGPSSNIPKVAIIVTDGRPQD------QVNEVAARAQASGIELYAVGV 229 Query: 325 QAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 + + P + + V+ KL F Sbjct: 230 DRADMESLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 267 >gi|119613593|gb|EAW93187.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_b [Homo sapiens] Length = 523 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 68/206 (33%), Gaps = 36/206 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+DVS SM FG M++ A I L +++FS + Sbjct: 174 VVFVIDVSSSM---FGTKMEQTKTAMNVILSDLQANDYFN---------IISFSDTVNV- 220 Query: 231 FPLAW-----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 W V ++ ++ + T L A + + + ++ Sbjct: 221 ----WKAGGSIQATIQNVHSAKDYLHCMEADGWTDVNSALLAAASVLNHSNQEPGRGPSV 276 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 IIFLTDGE ++ L +A ++++ +A L+ + Sbjct: 277 GRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFTLLRRLSLE 334 Query: 340 DR------FYSVQNSRKLHDAFLRIG 359 +R + + +L + I Sbjct: 335 NRGIARRIYEDTDAALQLKGLYEEIS 360 >gi|86131263|ref|ZP_01049862.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] gi|85818674|gb|EAQ39834.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] Length = 344 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 59/180 (32%), Gaps = 32/180 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 K+ + G+D++ +DVS SM D ++K I L Sbjct: 80 TKLETVKREGVDVVFAIDVSKSMLAEDIAPNRIEKSKQLVTQIINSLGS----------D 129 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEK 272 R G++ ++ P+ + ++++ T +E A D ++ Sbjct: 130 RIGIIAYAGSAYPQLPITTDYSSAKLFLSQMNTDMLSSQGTAIGEAIELAKTYYNDEEQT 189 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + ++DGE+ E+ +A + G ++ IGV Sbjct: 190 N-----------RVLFIISDGEDHVG-----EASSLAEQANKEGIRIFTIGVGKTEGGPI 233 >gi|113476849|ref|YP_722910.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167897|gb|ABG52437.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 441 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 42/193 (21%), Positives = 74/193 (38%), Gaps = 25/193 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 I S S V+D S SM D G K+ + ++ ++ I++ P R Sbjct: 30 PNAIVSASRPSTTFTFVIDTSGSMYDDSEVGRPKIDIVVEALERLVTDIQADPR----DR 85 Query: 218 SGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLE 274 LV F PL +Q I++L F T+ G+E + N + D+ Sbjct: 86 IALVQFDDSASVLLPLTAATDTVTLQNAISKLRSFSGGTRMALGIEKSLNLLKDSV---- 141 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ +IF TDG+ ID + + + G + A+GV + + L Sbjct: 142 ------LSSRRTLIF-TDGQ----TIDEIDCRELAVQFAQAGIPITALGVG-DYNEDLLV 189 Query: 335 NCA--SPDRFYSV 345 + + R ++V Sbjct: 190 YLSDHTGGRVFNV 202 >gi|56797994|emb|CAG27564.2| matrilin-3b [Danio rerio] gi|220675932|emb|CAX12091.1| matrilin 3b [Danio rerio] Length = 343 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 78/214 (36%), Gaps = 28/214 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + I + + LD++ ++D S S+ + + EM++ + Sbjct: 53 SINIGAPAEPC--KSRPLDLVFIIDSSRSVRPA------EFEKVKIFLSEMVNSLDI--- 101 Query: 212 VNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFD 268 ++ R LV ++S + F L + +++ +R+ + T + ++ A ++F Sbjct: 102 GSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFT 161 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KG K I +TDG + + A+ G +YA+GV Sbjct: 162 ENAGARPLKKG---IGKVAIIVTDGRPQDKVEEVSAA------ARASGIEIYAVGVDRAE 212 Query: 329 ADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 + + P D + V+ KL F Sbjct: 213 MRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 246 >gi|156409373|ref|XP_001642144.1| predicted protein [Nematostella vectensis] gi|156229285|gb|EDO50081.1| predicted protein [Nematostella vectensis] Length = 203 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 77/205 (37%), Gaps = 34/205 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + LD+ ++D S S+ G G I+ I+ +P R G V FSS Sbjct: 2 KVRLDLGFLIDGSGSIERQ---GRGNFGRVINFIKT---IVSLLPVSPRQTRIGAVLFSS 55 Query: 226 KIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F + +++ + L TK+ L Y Y+++F ++ Sbjct: 56 RPYLMFNFQ-KYRTVRQVLAALQRIRYPRGGTKTGRALRYTYSRLFRSR---------SR 105 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K+ +I LTDG++ + K +G ++AIGV + L AS Sbjct: 106 VRKQALIVLTDGKSQDSVGQPAAFI------KNQGVELFAIGVGRNYRRRDLNQIASRGN 159 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQR 366 ++ + F +G+ + + Sbjct: 160 VFTAK--------FENLGRIIGAIK 176 >gi|169338033|ref|ZP_02621346.2| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295279|gb|EDS77412.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 1242 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 48/213 (22%) Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 ++L A S + + ++ +++ GLV+F + + + +++ Sbjct: 169 SSYNEKNRLQHAKESAIKFVQKFENDKNIS----IGLVSFDTTANSQKDITSKLNEVEDS 224 Query: 244 INRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-------- 293 IN L T GL+ A + KG+ D KY+I ++DG Sbjct: 225 INSLKVADNGATNIEAGLKSAQQLL----------KKGNKDADKYVILMSDGFPTAFDYA 274 Query: 294 --------------ENSSPNI--------DNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 +N+ N K S+ N K+ G + IG A + Sbjct: 275 GEKVEKNFNYHEIQDNTFINFGYYDYSGYAMKHSINQANSLKKDGINSFIIGFSEGANSE 334 Query: 332 FLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEM 362 L N A+ + +N+ L+ A+ ++ ++ Sbjct: 335 KLNNIAKAAGGEYEEAKNTDTLNGAYDKLETKV 367 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 50/135 (37%), Gaps = 23/135 (17%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-------GVQHIQE 242 +L + + +D K+ + +V +SSK + + I++ Sbjct: 723 SRLDSVKKVANDFVDKFKN----DENTEIAIVRYSSKANIVLDGSNKIFLNGKDNEIIKK 778 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN--I 300 +IN L T G+ +Y+ + K D +KY+I +TDG ++ Sbjct: 779 RINSLKADGGTNIGDGIRKSYSIL----------DKCDKDSEKYMILMTDGVPTAYTCYA 828 Query: 301 DNKESLFYCNEAKRR 315 + ++ C +K Sbjct: 829 NTIKASNNCKYSKDN 843 >gi|327270784|ref|XP_003220168.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 952 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 46/205 (22%), Positives = 79/205 (38%), Gaps = 39/205 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM +G + +L A ++ I D + G+VTF+S Sbjct: 303 LCLVLDVSGSM---YGVRVARLKQAAEIFL-----LQIIEDGSW---VGIVTFNSAATIK 351 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + + G T+ G+E + + Sbjct: 352 TGLQQITSDSIRRSLTGYL-PVTAGGGTRICNGVEAGFKVFKQKYASEKGCE-------- 402 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRF 342 I+ LTDGE+S ++ YC +E KR G+I++ I + A+ + L + +F Sbjct: 403 -IVLLTDGEDS--------TISYCLDEVKRSGSIIHTIALGRSADPGLEELADMTGGLKF 453 Query: 343 YSVQ--NSRKLHDAFLRIGKEMVKQ 365 + +S L DAF I Sbjct: 454 SATDSLDSNSLIDAFTGISSSDGNL 478 >gi|224531962|ref|ZP_03672594.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] gi|224511427|gb|EEF81833.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] Length = 333 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 47/232 (20%), Positives = 88/232 (37%), Gaps = 28/232 (12%) Query: 98 ENGFAQDINNIERSTSLSIIIDD-QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 + F I+ SL + + KDY L+ + + F++ + + P + Sbjct: 23 NHFFRNRGGKIKFPISLYGNFNSLKLKDYKLNLMYFFTYSFLYLSAMVMVFALAGP---S 79 Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S K G D+++VLD+S SM ++L + ++IK Sbjct: 80 VSKKKMIHLSAGADIVIVLDISPSMGAVEFSSKNRLEFSK-------ELIKRFISQREND 132 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKL 273 GLV F+ P+ +K++ + G+ + G+ A + + Sbjct: 133 NIGLVAFAKDASIVVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL------- 185 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 K + K+ II LTDG +S I + + N A+ +Y+IG+ Sbjct: 186 ----KHSEAPKRSIIVLTDGVVNSDEIYKDQVI---NLAQGLNVKIYSIGIG 230 >gi|119952971|ref|YP_945180.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] gi|119861742|gb|AAX17510.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] Length = 341 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 68/193 (35%), Gaps = 25/193 (12%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 + F L S K + G D+++VLD+S SM ++L A Sbjct: 67 ITYAFFYLAMTVMILTLAGPSISKKKTTYLSSGADIVIVLDISPSMGAIEFSSKNRLEFA 126 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR---LIFGST 252 ++IK GLV F+ + PL +K++ + G+ Sbjct: 127 K-------ELIKYFVYQRENDNIGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDLGNG 179 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + G+ A + + K + K+ +I LTDG N D N A Sbjct: 180 SALGLGISIALSHL-----------KHSEAPKRSVIVLTDGV---VNSDEVYKDQVINLA 225 Query: 313 KRRGAIVYAIGVQ 325 + +Y+IG+ Sbjct: 226 QGLNVKIYSIGIG 238 >gi|313219850|emb|CBY30766.1| unnamed protein product [Oikopleura dioica] Length = 1473 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 43/200 (21%), Positives = 75/200 (37%), Gaps = 36/200 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+ +V+D S S+ +K V + ++ +I N V+ GL +FS Sbjct: 1239 GRLDIQIVIDTSGSLT----SAPNKDQVLMNFTNNLANMYDTI----NQVKIGLTSFSES 1290 Query: 227 IVQTFPLAW-GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 V PL + +Q+ ++ + + + T T G+E A N + D Sbjct: 1291 SVLEMPLDFYNQLELQDGVSNMTWQGSFTNITSGVETALNDM-----------DTSDAVD 1339 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-- 342 +I +TDG + + ++AK G + A+G E A SP+ + Sbjct: 1340 DVMILITDGFQ---STNTTLMFQMIDQAKAEGVRLIALGFFGEFAFY------SPNLYLM 1390 Query: 343 ----YSVQNSRKLHDAFLRI 358 Y N +L I Sbjct: 1391 TNEVYHAANYAELLAIDNTI 1410 >gi|307565332|ref|ZP_07627825.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307346001|gb|EFN91345.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 566 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 66/204 (32%), Gaps = 32/204 (15%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKL 192 + F+ + S+K+ G++ ++ LD+S SM D +DK Sbjct: 57 IKFLLMQSSIALMVLIIARPQIGNRISSTKNGKGIETVIALDISNSMLAQDVIPSRLDKS 116 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLI 248 + + D + GL+ F+ P+ + +N LI Sbjct: 117 KLLIEDLLRSFDN----------DKVGLIVFAGDAFVQLPITSDFISAKMFLNDINPSLI 166 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T + A H K II +TDGE++ + Sbjct: 167 GTQGTDIGKAINLA-----------MHSFSPTSKAGKAIIIITDGEDNEGGAE-----AM 210 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF 332 +A+ G +Y +G+ + + + Sbjct: 211 AKKAQEAGFHIYILGIGSTSGAEI 234 >gi|307298147|ref|ZP_07577951.1| von Willebrand factor type A [Thermotogales bacterium mesG1.Ag.4.2] gi|306916233|gb|EFN46616.1| von Willebrand factor type A [Thermotogales bacterium mesG1.Ag.4.2] Length = 704 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 42/220 (19%), Positives = 78/220 (35%), Gaps = 27/220 (12%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 + S E+P W +++ L+T +I + I D++ VLD+S Sbjct: 218 PNSDVALVLTSSEDEIP-SSLATHWDESTNEGYFLLTLIPRIKEEIVIPKDVVFVLDISG 276 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 SM K+ A R++ ++L ++ R +VTF ++ Sbjct: 277 SMYGE------KIEQAKRALEQVLQML------RPGDRFAIVTFDGRVHNLTGSLLDASE 324 Query: 240 IQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 E I + + T L+ + + +K K ++FLTDGE + Sbjct: 325 KAEWIEKVRRIQADGMTNIYGALQTSIDMF----------SKYDTGRFKALLFLTDGEPT 374 Query: 297 SPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLKN 335 D + EA+ R +++ GV + L Sbjct: 375 EGITDIGRIISDATPEARARNVHLFSFGVGTGVVAELLDR 414 >gi|300853770|ref|YP_003778754.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] gi|300433885|gb|ADK13652.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] Length = 484 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 47/220 (21%), Positives = 77/220 (35%), Gaps = 37/220 (16%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S S +D++ VLD S SM + T +I+ LD+ + + GLV Sbjct: 30 STSSPNVDVVFVLDSSGSMKQSDPEEIR-----TEAIKMFLDMGQVQGNKA-----GLVA 79 Query: 223 FSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +S I++ L + I+E + + G T GL + + Sbjct: 80 YSDTIIKEHNLDAINSESDKERIEEMASNIPLGQKTDIGRGLLEGAKVLDSGHDSNNRP- 138 Query: 278 KGHDDYKKYIIFLTDGENSSPNI------DNKESLFYCNEAKRRGAIVYAIGVQAEA--A 329 II L+DG+N S D ++ C K +G VY IG+ + Sbjct: 139 --------LIILLSDGKNDSQRSASESLKDLNSAISTC---KSKGYPVYTIGLNYDGTVD 187 Query: 330 DQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L AS + Y + L D I + ++ Sbjct: 188 KAQLTQIASETKGKNYITNKASDLTDILKDIYGDSANVKV 227 >gi|216264497|ref|ZP_03436489.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221217546|ref|ZP_03589016.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224532807|ref|ZP_03673422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224534090|ref|ZP_03674673.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225548563|ref|ZP_03769611.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] gi|225549807|ref|ZP_03770771.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|215980970|gb|EEC21777.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221192609|gb|EEE18826.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224512196|gb|EEF82582.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224512789|gb|EEF83157.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225369615|gb|EEG99064.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|225370826|gb|EEH00261.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] Length = 333 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 45/239 (18%), Positives = 86/239 (35%), Gaps = 31/239 (12%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + + S G D+++VLD+S SM Sbjct: 49 KDYRLNLMYFFTYSFLYLAAMVMVFALAGPSVSKKKMIHLS---AGADIVIVLDISPSMG 105 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + IR + GLV F+ P+ + + Sbjct: 106 AVEFSSKNRLEFSKELIRGFI-------SQRENDNIGLVAFAKDASIVVPITTDREFFNK 158 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 159 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 207 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP----DRFYSVQNSRKLHDA 354 I + + N A+ +Y+IG+ + S F V + L + Sbjct: 208 IYKDQVI---NLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVEI 263 >gi|239908150|ref|YP_002954891.1| hypothetical protein DMR_35140 [Desulfovibrio magneticus RS-1] gi|239798016|dbj|BAH77005.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 595 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 43/192 (22%), Positives = 69/192 (35%), Gaps = 31/192 (16%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 C A + P L +SK+ L +M+VLD S SM P D+LG A R Sbjct: 71 FICAVAALALAGVGPRLGIGKPDAASKAPPRLRLMVVLDCSRSMLARDVPP-DRLGAAKR 129 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STT 253 + LD++ +P ++ GLV F+ + P + ++ L T Sbjct: 130 LV---LDVLARLPGLD----VGLVGFAGRAWLACPPTPDRAGLALFLDGLTPEAAPLGGT 182 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 GLE A + + ++ +TDGE + ++ Sbjct: 183 DPAKGLEAAGLALAGVRPAA-------------VLLVTDGEATVKPAGQTRTVLP----- 224 Query: 314 RRGAIVYAIGVQ 325 G VYA+ V Sbjct: 225 -PGVPVYAVAVG 235 >gi|118617118|ref|YP_905450.1| hypothetical protein MUL_1447 [Mycobacterium ulcerans Agy99] gi|118569228|gb|ABL03979.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length = 733 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 74/226 (32%), Gaps = 40/226 (17%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 L + + SS D+++VLD S SM K+ A R+ ++D++ + Sbjct: 252 GTWSLTLVPPAEPSS---APRDVVVVLDRSGSMGGW------KMVAARRAAGRIVDMLDA 302 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLE 260 R ++ F +I + + L T L Sbjct: 303 GD------RFCVLAFDDRIETPPAMPDGLVPASDRNRFAASSWLGSLRSRGGTVMAQPLT 356 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + D+ E + ++ + DG+ S + + + +Y Sbjct: 357 NAVEMLADSGEDRQAS----------VVLVADGQISGEDHLLRSLAPAVGRTR-----IY 401 Query: 321 AIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +GV FL+ A R V++ +L + R+ + + + Sbjct: 402 CVGVDRAVNAGFLERLAGLGSGRAELVESEDRLDEVMARLARTIGR 447 >gi|296206125|ref|XP_002750075.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Callithrix jacchus] Length = 940 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 73/197 (37%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 295 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 342 Query: 229 QTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + ++ +++ I+ + T L+ A + + + H G Sbjct: 343 KDHLISVTPDSVRDGKVYIHHMSPTGGTDINEALQTAIRLL---NKYVAHSDSGDRSVS- 398 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 IIFLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 LIIFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTR 458 Query: 341 RFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 459 RVHEEEDAGSQLIGFYD 475 >gi|291231970|ref|XP_002735935.1| PREDICTED: chloride channel accessory 2-like, partial [Saccoglossus kowalevskii] Length = 849 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 46/203 (22%), Positives = 81/203 (39%), Gaps = 34/203 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +++LD+S SM+ + ++KLG SI +L ++ G+V F+ + Sbjct: 302 VLLLDISGSMSSN--NRIEKLGQ-VASIYILLTA-------DDDDELGMVVFNDQPSTRS 351 Query: 232 PLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + + + + LI G T GL A + + + Sbjct: 352 QMVTISESTRLDLLELIPTRDDIGDATGIGSGLSEAIDVLENGGNDAAGGC--------- 402 Query: 287 IIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 II ++DG EN SP ID+ +S +G V+ I + +A+ L FY Sbjct: 403 IILVSDGEENRSPYIDDVQSTIV-----DKGVCVHTIALGVDASHNMEQLPLATDGKSFY 457 Query: 344 SVQN--SRKLHDAFLRIGKEMVK 364 +N S L++AF+ I K+ Sbjct: 458 YSENPYSNALNEAFITIAKQDTN 480 >gi|329888194|ref|ZP_08266792.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] gi|328846750|gb|EGF96312.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] Length = 655 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 41/245 (16%), Positives = 83/245 (33%), Gaps = 28/245 (11%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF----PWCANSSHAPLLITSSV 159 D ++ + + DY + + PF PW + + Sbjct: 222 DNGRAPPRDAVRVEEMINYFDYGYTRPTSAARPFAVTATTTASPWSEGRRIVHVGLQG-Y 280 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++ L++ ++DVS SM DKL +A +S+ ++D ++ R Sbjct: 281 ELPENQRRPLNLTFLVDVSGSM-----NSPDKLDLAKQSMNLIIDRLRPQD------RVA 329 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEK--INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + ++ T G Q ++ + + L T G+ AY + Sbjct: 330 VAYYAEGAGTTLAPTAGTQKLKLRCAVASLRASGGTAGATGMTNAY--------DQAQAS 381 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNC 336 G + + ++F TDG+ + D+K Y + +R G + G D ++ Sbjct: 382 FGRNKVNRILMF-TDGDFNVGVTDDKRLEDYVADKRRTGIYLSVYGFGRGNYQDARMQAI 440 Query: 337 ASPDR 341 A Sbjct: 441 AQAGN 445 >gi|203287632|ref|YP_002222647.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] gi|201084852|gb|ACH94426.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] Length = 341 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 77/208 (37%), Gaps = 27/208 (12%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 +DY L+ + F + + P + + S G D+++VLD+S SM Sbjct: 57 RDYVLNLLYFVTYTFFYLAIIVMILTLAGPSISRKKMTYLS---SGADIVIVLDISPSMG 113 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L A ++I+ GLV F+ + PL + Sbjct: 114 AIEFSSKNRLDFAK-------ELIEYFVYQRENDNIGLVAFAKEASLIVPLTIDRDFFSK 166 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + KK +I LTDG N Sbjct: 167 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEAPKKSVIVLTDGV---VN 212 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D N A+ +Y++G+ ++ Sbjct: 213 SDEVYKDQVINLAQGLNVKIYSVGIGSD 240 >gi|256784255|ref|ZP_05522686.1| secreted protein [Streptomyces lividans TK24] gi|289768140|ref|ZP_06527518.1| secreted protein [Streptomyces lividans TK24] gi|289698339|gb|EFD65768.1| secreted protein [Streptomyces lividans TK24] Length = 421 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 75/195 (38%), Gaps = 31/195 (15%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--------- 223 +VLDVS SM G ++ A ++ E+LD + +V +R+ + Sbjct: 43 LVLDVSGSMRTRDIDGGTRMAAAKQAFNEVLDA--TPEEVQLGIRTLGADYPGDDRKTGC 100 Query: 224 --SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++++ PL + + L T P L A + + Sbjct: 101 KDTAQLYPVGPL--DRTEAKTAVATLSPTGWTPIGPALLKAADDL------------DGG 146 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---AS 338 D K I+ ++DGE++ +D E AK G + +G+ + +C A+ Sbjct: 147 DGSKRIVLISDGEDTCAPLDPCEVAREI-AAKGIGLTIDTLGLVPNTKMRRQLSCIAEAT 205 Query: 339 PDRFYSVQNSRKLHD 353 + SV+++ +L D Sbjct: 206 GGTYTSVEHTDELTD 220 >gi|88601603|ref|YP_501781.1| magnesium chelatase, ChlI subunit [Methanospirillum hungatei JF-1] gi|88187065|gb|ABD40062.1| protoporphyrin IX magnesium-chelatase [Methanospirillum hungatei JF-1] Length = 680 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 71/220 (32%), Gaps = 30/220 (13%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 S + ++ + V+D S SM ++ A +I +L+ Sbjct: 468 SLAIVIRSDEVLQKKRIGKTATATLFVVDASGSMGVE-----QRMEAAKGAIFSLLE--- 519 Query: 208 SIPDVNNVVRSGLVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 N R GLV F + PL + ++++ L G T GL+ + + Sbjct: 520 --DSYQNRDRVGLVAFRGEGADVVLPLTSSIDLAYQRLSELPTGGKTPLAAGLQKSLTIL 577 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID--NKESLFYCNEAKRRGAIVYAIGV 324 E ++ +TDG + N E ++ + G I Sbjct: 578 -----MREKQKYPSLLP--LLVLITDGRANVGNGGKLKDEIGLITDDLVKAGIETVIIDT 630 Query: 325 QAEAADQF---LKNCA-----SPDRFYSVQN--SRKLHDA 354 +++ + L C + R++ + + + L A Sbjct: 631 ESKQKGRLSIQLGFCPFIAQRTQGRYFQISDLTASDLSSA 670 >gi|326675078|ref|XP_692457.5| PREDICTED: collagen alpha-6(VI) chain [Danio rerio] Length = 1605 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 50/328 (15%), Positives = 110/328 (33%), Gaps = 49/328 (14%) Query: 47 KLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDIN 106 L + + A + ++Q G + K + + + R + + Sbjct: 65 SLTEHKNRASFEAAVRGISQPVGGSEKGKALKYVASLFNQ------AKASRPAKVQEILI 118 Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL------------- 153 I TS + D + + VS Y + + + Sbjct: 119 VITDKTSQDDVGDPAEE-LRIQGVSVYAIGVKDASQDELLKMTADETKDFYVTNYDALNV 177 Query: 154 ----LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 ++T + + D+M ++D S S+ +GP I ++++ Sbjct: 178 LKREIVTDICSQEACKNKVADIMFLIDGSSSI---YGP---DFTSMKTFITKVVNGTIIG 231 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKI 266 D V G+V FS+ + FPL + ++E I+ + T + L + Sbjct: 232 EDS---VHVGVVQFSNNPQEQFPLNRYFDQNELEEAIDGIEQLTGDTYTGKALSFISKYF 288 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + G D ++++ +TDGE +++ + +G +++IGV + Sbjct: 289 DASNG-------GRPDVPQFLVVITDGEAHDAVAVPAKAI------RDKGVTIFSIGVAS 335 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDA 354 Q + + D+ Y ++ LH Sbjct: 336 VNTTQLWEISGTQDKVYVQRDFDALHSI 363 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 70/205 (34%), Gaps = 25/205 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +LD S S F + I L + P+ VR G+V Sbjct: 13 DIYFLLDNSGSTRADFED-------VKKFILGSLQLFNIGPNR---VRVGVVKVDRNPTL 62 Query: 230 TFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + ++ L+Y + AK ++ Sbjct: 63 QFSLTEHKNRASFEAAVRGISQPVGGSEKGKALKYVASLFNQAKASR------PAKVQEI 116 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I +TD + D E L +G VYAIGV+ + D+ LK A + + V Sbjct: 117 LIVITDKTSQDDVGDPAEELRI------QGVSVYAIGVKDASQDELLKMTADETKDFYVT 170 Query: 347 NSRKLHDAFLRIGKEMVKQRILYNK 371 N L+ I ++ Q NK Sbjct: 171 NYDALNVLKREIVTDICSQEACKNK 195 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 85/244 (34%), Gaps = 58/244 (23%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + + L II ++K ++++ + + ++ P+ Sbjct: 527 AQDSELEIITKSKNKIFHVNNYDN--LQDLQMNVSGVLCNATKPVCQNEVA--------- 575 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+++ + ++D ++ P + R G+ FSS Sbjct: 576 -DLVFLIDGSESISEESWI------TVIAFLLNVVDKLRIGP---ELFRVGIAQFSSVYQ 625 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + F + N+ DA + + K + ++ Sbjct: 626 KEFYM------------------------------NEYKDADGEDKGSRKQSGVPQN-LV 654 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 +TDG +S ++ + G V+AIG+ A + Q SPDR + VQN Sbjct: 655 LITDGVSSD------RVNEAADQLRNLGINVFAIGIGAVSLQQLSYIAGSPDRLFKVQNF 708 Query: 349 RKLH 352 L+ Sbjct: 709 NYLN 712 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 59/179 (32%), Gaps = 12/179 (6%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV + + + +++ + +N++R G++ +S Q + Sbjct: 386 DVIFLVQCTRQIRLQDFEKIKSFLISVVNSTQI---GDNLIRFGVIVYSDTPSQFSLNQY 442 Query: 236 GVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + I E I L + + AY + E K ++I TDG+ Sbjct: 443 NTRRQIAEAITSLKSPAVS-GYTARALAY-SLTYFTEANGGRQKRGVPQMLFMI--TDGD 498 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D + +E + VY IGV + S ++ + V N L D Sbjct: 499 ----ARDRENLRARADEFAAKQINVYGIGVARAQDSELEIITKSKNKIFHVNNYDNLQD 553 >gi|167763115|ref|ZP_02435242.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] gi|167699455|gb|EDS16034.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] Length = 342 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 69/196 (35%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F + P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 MVFVAIGLFSVLLARPQFGS---KLETVKRQGVEVMIALDISNSMLAQDVQP-SRLQKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R + +++D +++ + G++ F+ P+ + + + +K Sbjct: 117 RLVAQLVDKMEN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + +I +TDGEN +++ +A +G Sbjct: 170 TAIGAAIN-------LASRSFTPQEGVGRAVIVITDGENHEG-----DAVEAAKDAAEKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVNVLGVGMPEGAPI 233 >gi|114557513|ref|XP_001143250.1| PREDICTED: calcium-activated chloride channel regulator 1 [Pan troglodytes] Length = 914 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 SELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRLAFTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|301777181|ref|XP_002924014.1| PREDICTED: cartilage matrix protein-like [Ailuropoda melanoleuca] Length = 495 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 81/203 (39%), Gaps = 30/203 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 271 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSS 321 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 322 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 375 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + + +AK G ++A+GV D+ + + P Sbjct: 376 --QKVGIVFTDGRSQDYINN------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 427 Query: 340 DRFYSVQNSRKLHDAFLRIGKEM 362 + ++ + + ++ R+ K++ Sbjct: 428 EHYFYTADFKTINQIGKRLQKKI 450 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 76/197 (38%), Gaps = 28/197 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 40 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGVVNYASAVKQ 90 Query: 230 TFPL-AWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL A G + + + R+ + T + +++A K F E D K Sbjct: 91 EFPLRAHGSKASLLRAVRRIQPLSTGTMTGLAIQFAITKAFSNTE---GGRARSPDISKV 147 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 +I +TDG D A+ G ++AIGV L+ AS + Sbjct: 148 VIVVTDGRPQDSVRDVSA------RARANGIELFAIGVG-RVDKATLRQIASEPQDEHVD 200 Query: 344 SVQN---SRKLHDAFLR 357 V++ KL F Sbjct: 201 YVESYSVIEKLSKKFQE 217 >gi|126730249|ref|ZP_01746060.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] gi|126708982|gb|EBA08037.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] Length = 666 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 58/182 (31%), Gaps = 47/182 (25%) Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG-------------- 279 + + I+ + T GL+YA + A Sbjct: 484 SDDAATLSAFIDNMRMHDGTGIQYGLKYALALLDPATGSAVTELISAGLVDSRFLGRPIA 543 Query: 280 --HDDYKKYIIFLTDGE--------------NSSPNIDNKESLFY--------------- 308 ++ +K+I+ ++DG N + + S Y Sbjct: 544 WEDEETEKFIVVMSDGAVTDQYRPVDPFAPLNGETELQTQGSGSYTTFSTRGNNLDNLHT 603 Query: 309 -CNEAKRRGAIVYAIGV-QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 C A+ G V+A+ +A L+ CAS D + ++ DAF I +++ R Sbjct: 604 QCQLARDLGVTVFAVAFETTDADADELRLCASSDSHFFHVQGTEIIDAFDTIARQINNLR 663 Query: 367 IL 368 ++ Sbjct: 664 LI 665 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 88/223 (39%), Gaps = 40/223 (17%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 I + + +G+++I + +L +I ++ G ++ ++ ++A+L LD ++L A Sbjct: 49 IADLAGDEEGTMTIFSTFMLVLILVITGASVDIMYQEAIRARLQATLDRAVLAAA----- 103 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + +Q++ + ++ + G + + ++ Y Sbjct: 104 -----DLDQQQDPVA--VVNDYVTK--------AGLVEHLTDVI----------ATPGLY 138 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + + + + + P++ S+ + + +++ +V+D+S SM + Sbjct: 139 DRTVAADAGLTLDTYFLRM-SGWQTLPVIAASTAE---ERIANVEISLVMDISGSMRWN- 193 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +++ A + ++ + + D V L+ F+ ++ Sbjct: 194 ----NRITNARNAAKDFVTKV-LTEDSAGVTTLNLIPFAGQVN 231 >gi|159900699|ref|YP_001546946.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893738|gb|ABX06818.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 828 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 65/198 (32%), Gaps = 30/198 (15%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + ++++LD S SM G DK +A + D + + G+ Sbjct: 377 PPRRERPTVTLLLILDRSASMLGE--SGKDKFSLAKAAAIAATDSLGADDT------IGV 428 Query: 221 VTF---SSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + F + V + GVQ IQ I L G T LE + K+ H Sbjct: 429 LAFDDTNDWTVTFTKVGQGVQLSEIQNNIAGLSAGGGTDIYAALEVGMGGLAQQTGKVRH 488 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + LTDG + + + +G + I + +A L++ Sbjct: 489 A-----------VLLTDGRSGGESSYES----LIAPLRAQGITLSTIAIGGDADTVLLES 533 Query: 336 CA--SPDRFYSVQNSRKL 351 A R++ L Sbjct: 534 LAKLGAGRYHFASRPDDL 551 >gi|255262383|ref|ZP_05341725.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] gi|255104718|gb|EET47392.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] Length = 634 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 49/239 (20%), Positives = 91/239 (38%), Gaps = 24/239 (10%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT----FPWCANSSHAPLLITSSVKISS 163 + S ++ I + Y+ A ++PF T PW A++ + I ++ S Sbjct: 216 LPPSGAVRIEEMINYFSYDYLATDAGDVPFRATTSVFETPWNADTQLLHIGIQGTLPDVS 275 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L+++ ++D S SMN DKL + + R ML ++ +V+ + +G Sbjct: 276 EH-PPLNLVFLIDTSGSMNQP-----DKLPLLISAFRLMLSELRPEDEVSIITYAGS--- 326 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + +++ P + I +NRL G +T GL AY E E Sbjct: 327 AGQVLAPTPAS-DRATILAALNRLSAGGSTAGQAGLRQAYAIAAAMSEDGEIAR------ 379 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 +I TDG+ + D Y + G + +G D +++ A Sbjct: 380 ---VILATDGDFNVGLNDPDTLKDYITTRRDSGTYLSVLGFGRGNLNDAVMQSLAQNGN 435 >gi|327471789|gb|EGF17230.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK408] Length = 464 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 51/345 (14%), Positives = 119/345 (34%), Gaps = 62/345 (17%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVI---------ETSHKFFVKAKLHYILDHS 55 +R F + ++I+ ++ VI ++ + E S + ++ + Y +D + Sbjct: 3 QLRKGFTLAEMVMAIMLMSMIAVIIGIIFNTMFSSRELIEREASIQAEMRTSMQY-VDRT 61 Query: 56 LLYTATKILNQENGNNGKKQK----------NDFSYRIIKNIWQTDFRNELRENGFAQDI 105 + + + ++ G K+ + +++ +W ++ + + Sbjct: 62 IGKATSVFILDDSKFKGNKEGLTKEWSYIGLSSDGKKVLNYVWDKSKQDWKVSELGTKSL 121 Query: 106 NNIERSTSLSI---IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI- 161 +I+ D++ YNL+ +Y + ++ + + + K Sbjct: 122 YDIKLDLEFKTEGAYQDNRLISYNLT--GKYPDTNNKLSIDTAISALNTKQVFSKVAKGK 179 Query: 162 ----------SSKSDIGLDMMMVLDVSLSMND-------HFGPGMDKLGVATRSIREMLD 204 + + + + V D S SM+ ++ ++ + M+ Sbjct: 180 KGIALAYRNDPIEGQMNVAISFVFDKSGSMSWDLNGNNTNYWGPKSRMSILQDKATIMMR 239 Query: 205 IIKSIPDVNNVVRSGLVTFSS----KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 +K I +V+ LV+FS L G I+ IN L G T GL Sbjct: 240 DLKDIGNVS----VNLVSFSILGSYVQKDFSELDKGTTTIEASINALQTGGVTNPGDGLR 295 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 Y + + H KY++ LTDG ++ +D ++ Sbjct: 296 YGMMSL-----------QNHSAQLKYVVLLTDGIPNAYTVDTNDT 329 >gi|326428615|gb|EGD74185.1| hypothetical protein PTSG_12412 [Salpingoeca sp. ATCC 50818] Length = 1720 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 69/198 (34%), Gaps = 21/198 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D + VLD S S+ D + + +++I G V F Sbjct: 212 AQQFPFDFVYVLDASGSVGR------DNWNRVLNFTADSISTLRTIDPQAQF---GAVVF 262 Query: 224 SSKIVQTFPL-AWGV-QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S+ PL A+ Q +Q+ + L G +T + L +IF +G Sbjct: 263 STTAEIAVPLQAFDSQQAVQDTVRTLPYAGESTATGNALNLVRREIFSDDAAAISGFRGG 322 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK--NCAS 338 ++ +TDGE + + + + G V+ +GV A ++ Q L+ A+ Sbjct: 323 RA---VVVLVTDGETLELDGVLENAADRLHATGPLGVDVFVLGVGAASSPQLLQDVYTAA 379 Query: 339 PD----RFYSVQNSRKLH 352 ++ +L Sbjct: 380 SGPPETHVFTPDVFDQLL 397 >gi|148655604|ref|YP_001275809.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567714|gb|ABQ89859.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 425 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 66/196 (33%), Gaps = 28/196 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + L++ +VLD S SM +L + ++D + + LV F+ Sbjct: 42 KLPLNLCLVLDRSSSMRGE------RLMQVKEAAARIVDQLGPDDYFS------LVVFND 89 Query: 226 KIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P ++ ++ I ++ T+ GL A ++ Sbjct: 90 RADVVIPAQRAIKKSDLKAAIAQIEAAGGTEMAQGLALALQEVQRPFLTRGISR------ 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-- 341 +I LTDG D + + RG + A+G+ E + L+ + + Sbjct: 144 ---LILLTDGRTYG---DESRCVEIARRGQSRGIGLTALGIGTEWNEDLLETMTASENSR 197 Query: 342 FYSVQNSRKLHDAFLR 357 + ++ + F Sbjct: 198 AQYIATAQDVVKVFAD 213 >gi|4585469|gb|AAD25487.1|AF127036_1 calcium-activated chloride channel protein 1 [Homo sapiens] gi|119593592|gb|EAW73186.1| chloride channel, calcium activated, family member 1 [Homo sapiens] gi|189067292|dbj|BAG37002.1| unnamed protein product [Homo sapiens] Length = 914 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 SELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|281345873|gb|EFB21457.1| hypothetical protein PANDA_003362 [Ailuropoda melanoleuca] Length = 399 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 70/190 (36%), Gaps = 19/190 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ M+ + ++ M++ D ++ GL+ FSS + F L Sbjct: 222 DIIFLMDGSESISPEDFEKMKGFVKRMVNQADISTDE---IQIGLLQFSSTPQEEFRLDQ 278 Query: 236 GVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + N T++ L + FD+ G + ++Y+I +TD Sbjct: 279 YSSKVDIHRAITNVQQMNDGTRTGKALNF-TRPFFDSSRG------GRPNVQQYLIVITD 331 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLH 352 G + + + + R +++AIGV Q L+ P + Y +N L Sbjct: 332 G------VAQDDVVMPAKALRDRNIVIFAIGVGEAKNAQLLQITDDPQKVYYEENFESLQ 385 Query: 353 DAFLRIGKEM 362 + +I ++ Sbjct: 386 NLEKKILLKV 395 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 72/195 (36%), Gaps = 19/195 (9%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 D+ + + ++ + + PD VR GLV +S + F L Sbjct: 3 DLVFLIEEFSWDRQSNFQQVVNFLKSTVSSLNVHPDG---VRIGLVFYSEEPRLEFSLDA 59 Query: 235 -WGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I E ++RL + TK+ L++ N++F E ++ ++ + + Sbjct: 60 FQNPASILEYLDRLTYRRRSGRTKTGAALDFLRNEVF----IEERGSRSKHGVQQMAVVI 115 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 T+G + + +R G +YA+G + + L+N AS + V + Sbjct: 116 TEG------FSQDQLSKSASLLRRAGVTIYAVGTHLASESKDLENIASYPPWKHVISLES 169 Query: 351 LHDAFLRIGKEMVKQ 365 +G ++ Q Sbjct: 170 FLQ-LSVVGNKIKNQ 183 >gi|302382107|ref|YP_003817930.1| hypothetical protein Bresu_0994 [Brevundimonas subvibrioides ATCC 15264] gi|302192735|gb|ADL00307.1| Protein of unknown function DUF3520 [Brevundimonas subvibrioides ATCC 15264] Length = 625 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 51/322 (15%), Positives = 107/322 (33%), Gaps = 40/322 (12%) Query: 49 HYILDHSLLYT--ATKILNQENGNNGKKQKNDFSYRIIKNI-WQTDFRNELRENGFAQDI 105 +D ++ T++ + + + + D F + Sbjct: 127 QTTVDGVVVPGRPGTRVDTERYPDATPNPVRRVADEPVSTFSIDVDTAAYANVRRFISEG 186 Query: 106 NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF----PWCANSSHAPLLITSSV-- 159 R ++ + + DY + R + PF T PW AN+ I Sbjct: 187 QTPPRD-AVRVEEMINYFDYGYARPGRADEPFAVSTAVAASPWSANAGAGGRQIVHIGLQ 245 Query: 160 --KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ + L++ ++DVS SM DKLG+A +++ ++D ++ R Sbjct: 246 GYELPAGERRPLNLTFMVDVSGSMQ-----SPDKLGLAQQTMNLIIDRLRPED------R 294 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEK--INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + ++S + G + ++ + + L G +T G+ AY + A Sbjct: 295 VAVTYYASDVGTAVGPTPGSEKLKLRCAVAALNAGGSTAGAQGMVNAYEQAEAA------ 348 Query: 276 IAKGHDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQF 332 K I+ TDG+ + D++ Y + + G + G D Sbjct: 349 -----FSPDKVNRILMFTDGDFNVGVTDDRRLEDYVADKRGTGIYLSVYGFGRGNYQDAR 403 Query: 333 LKNCASPDRFYSVQNSRKLHDA 354 ++ A + L +A Sbjct: 404 MQTIAQAGNGVAAY-VDDLDEA 424 >gi|125535226|gb|EAY81774.1| hypothetical protein OsI_36948 [Oryza sativa Indica Group] Length = 633 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 80/205 (39%), Gaps = 33/205 (16%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMND----------HFGPGMDKLGVATRSIREM 202 L + + + K + +D++ VLDVS SMND +L V S++ + Sbjct: 54 LHVEAPPAANLKGHVPIDVVAVLDVSGSMNDPVAAAAAASPESNLQASRLDVLKASMKFI 113 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFP------LAWGVQHIQEKINRLIFGSTTKST 256 + + + R +V F+ V+ + G +KI+RL T Sbjct: 114 IRKLD------DGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGTALM 167 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 P LE A + + + G ++ +I+ LTDG++++ ++++ Sbjct: 168 PALEEAVKILDERQG-------GSRNHVGFILLLTDGDDTTGFRWTRDAIHGA----VAK 216 Query: 317 AIVYAIGVQAEAADQFLKNCASPDR 341 V+ G+ A + L + A R Sbjct: 217 YPVHTFGLGASHDPEALLHIAQGSR 241 >gi|4009458|gb|AAC95428.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 SELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|332221823|ref|XP_003260064.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Nomascus leucogenys] Length = 921 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 53/277 (19%), Positives = 98/277 (35%), Gaps = 57/277 (20%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 +F NE N A + NI+ + + + +D+ + +P + P + Sbjct: 248 EFCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNT------IPMVTPPPPPVFSLLK 301 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 I + +VLD S SM G D+L ++ + L + Sbjct: 302 ISQRI---------------VCLVLDKSGSMG-----GYDRLNRMNQAAKHFL-----LQ 336 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 V N G+V F + L + + + T G++YA+ Sbjct: 337 TVENGSWVGMVHFDTTATIVNKLIQIKGSDERNTLMAGL-PTYALGGTSICSGIKYAFQV 395 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 I H + ++ LTDGE+++ + +E K+ GAIV+ I + Sbjct: 396 IG-----ELHSQLDGSE----VVLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALG 439 Query: 326 AEAADQF--LKNCASPDRFYSVQNSRK--LHDAFLRI 358 + A + + N FY+ ++ L DAF + Sbjct: 440 SAADEAVIEMSNITGGSHFYASDEAQNNGLIDAFGAL 476 >gi|269105138|ref|ZP_06157832.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] gi|268160588|gb|EEZ39087.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] Length = 436 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 64/441 (14%), Positives = 128/441 (29%), Gaps = 88/441 (19%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 +G SIL AI++PV+F + L + + KA++ + + L A N Sbjct: 2 KLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH--NDP 59 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 N N+ R I + + ++ + I R I + Sbjct: 60 NVNSDGLGSGSKVNRRIATDYLKAYITDI----DSISSLKIYRRNCEDIPECSSGLNKGK 115 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAP-LLITSSVKISSKSDI----GLDMMMVLDVSLSMN 182 S YE+ + W ++ T S + S + +D++ D S SM Sbjct: 116 SRFFEYEVEALTTQNSWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAADFSKSME 175 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR-----SGLVTFSSKIVQTF------ 231 + + G K R I ++ ++ ++N + G+ ++S F Sbjct: 176 EPWTGGRQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKFDNYNSC 235 Query: 232 --------PLAWGVQH-----IQEKINRLIFGST--------------------TKSTPG 258 + + I+ +N + T Sbjct: 236 FMKQDYFEKNSRDHRKKKYVDIKRTLNNIFIEKGNDSCGFKSDDPDAVFHDIYLTNDFDT 295 Query: 259 LEYAYNKIFDAKE--------KLEHIAKGHDDYKKYIIFLTDGENSSPNIDN--KESLFY 308 K + + + + ++ +I ++DG + + Sbjct: 296 FNKEIRKFRPGNGTASCQGIIRSAQMLRKGTNSRRLLIIISDGNDWYYPYSGYKETDKEI 355 Query: 309 CNEAKRRGA---------------------IVYAIGVQAEAAD-QFLKNCASPDRFYSVQ 346 N+ G + IG +A + L NCA D + Q Sbjct: 356 ANKLVNAGMCNKIRETLNLDKTPSGQEIKTRIAVIGFDYDANKNKALLNCAGEDNVFKAQ 415 Query: 347 NSRKLHD-AFLRIGKEMVKQR 366 +L D I +E+ + Sbjct: 416 YRDELLDQILSLITEEIGHLK 436 >gi|168700938|ref|ZP_02733215.1| BatA [Gemmata obscuriglobus UQM 2246] Length = 317 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 66/208 (31%), Gaps = 34/208 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + ++ +DVS SM FG G ++ + ++I LD GL Sbjct: 89 PPQQKRSLTNIQFAVDVSGSMLAPFGDG-NRYDASMKAIDTFLDF-------RKGDAFGL 140 Query: 221 VTFSSKIVQTFPLAWGVQHIQ--------EKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 F V PL V I+ E + T L ++ Sbjct: 141 TFFGDAFVHWVPLTTDVTAIRCSPPFMRPETVP--PPFGGTAIAKALNGCKTEL------ 192 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQ 331 + D+ K I+ +TDG + +++E G V+ I V E + Sbjct: 193 -----RRRDEGDKMIVLITDGFSYDLTGNDEE---IARTLSAEGVAVFCIIVGGFEPQAE 244 Query: 332 FLKNC-ASPDRFYSVQNSRKLHDAFLRI 358 + C + + + L F +I Sbjct: 245 IVNICRLTGGEAFRADDPDALPAVFKKI 272 >gi|3024063|sp|P97280|ITIH3_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|1694692|dbj|BAA13940.1| inter-alpha-trypsin inhibitor heavy chain 3 [Mesocricetus auratus] Length = 886 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 105/301 (34%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 166 MYLKVQPKQLVRHFEIDA--HIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSF 223 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + + D+ + E P AP + K Sbjct: 224 KPSLD--QQRSCPTCTDSLLNGDFTIVYDVNRESPGNVQVVNGYFVHFFAPQGLPVVPK- 280 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGL 220 +++ V+D+S SM K+ ++ ++LD +K +N + +G+ Sbjct: 281 --------NIVFVIDISGSMAG------RKIQQTRVALLKILDDMKQDDYLNFILFSTGV 326 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 T+ +VQ P ++ + + + T GL + DA+E+ + Sbjct: 327 TTWKDSLVQATPA--NLEEARTFVRSISDQGMTNINDGLLRGIRMLTDAREQHTVPERST 384 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 II LTDG+ ++ ++ +A +Y +G FL+ A + Sbjct: 385 S----IIIMLTDGDANTGESRPEKIQENVRKAIEGRFPLYNLGFGNNLNYNFLETMALEN 440 Query: 341 R 341 Sbjct: 441 H 441 >gi|217966673|ref|YP_002352179.1| von Willebrand factor A [Dictyoglomus turgidum DSM 6724] gi|217335772|gb|ACK41565.1| von Willebrand factor type A [Dictyoglomus turgidum DSM 6724] Length = 888 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 48/191 (25%), Positives = 73/191 (38%), Gaps = 27/191 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +++VLD S SM + G M K+ +A S + +LD+++ GL+ F Sbjct: 387 KKSNVAIIIVLDASGSMGSYSGGDM-KMELAKESAQLVLDLLEDKD------YFGLIAFD 439 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL + I+R+ G T P L+ A + Sbjct: 440 HSYQWIVPLQPLTNKEEAASLISRISPGGGTALYPPLKSAGESLLKVP-----------I 488 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 K+II +TDG+ + N N AK V IG+ +A LK+ A Sbjct: 489 KSKHIIAITDGQTEGGDFYN----LVRNLAK-YKITVSTIGIGEDANIPLLKDIANWGNG 543 Query: 341 RFYSVQNSRKL 351 RFY N R L Sbjct: 544 RFYHTWNIRNL 554 >gi|47217757|emb|CAG05979.1| unnamed protein product [Tetraodon nigroviridis] Length = 1380 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 72/192 (37%), Gaps = 27/192 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +LD S S+ G + + + ++D PD R +V +S + Sbjct: 9 DLAFILDTSSSV------GKENFEKIRQWVANLVDSFDVAPDK---TRVAVVRYSDRPTT 59 Query: 230 TFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F LA V+ I L G T + + Y + IF + IA+G + Sbjct: 60 EFNLARYRTLEDVKRAARNIRYL--GGNTMTGDAISYTTSNIFTERNGARPIARG---IQ 114 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRF 342 + I LTDG + ++ ++ A + G ++A+G+ + + A P Sbjct: 115 RVAILLTDGRSQDYVLEPSKA------AAKAGIRMFAVGIGEALKVELDEIAAEPKNAHV 168 Query: 343 YSVQNSRKLHDA 354 + V + + Sbjct: 169 FHVTDFNAIDKI 180 >gi|281353337|gb|EFB28921.1| hypothetical protein PANDA_013248 [Ailuropoda melanoleuca] Length = 471 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 81/203 (39%), Gaps = 30/203 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 256 SATDLVFLIDGSKSVRPE------NFELVKKFINQIVDTLDVSD---KLAQVGLVQYSSS 306 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + T + L+Y + D + A+ Sbjct: 307 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAALKY----LIDNSFTVSSGARPGA 360 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + + +AK G ++A+GV D+ + + P Sbjct: 361 --QKVGIVFTDGRSQDYINN------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 412 Query: 340 DRFYSVQNSRKLHDAFLRIGKEM 362 + ++ + + ++ R+ K++ Sbjct: 413 EHYFYTADFKTINQIGKRLQKKI 435 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 76/197 (38%), Gaps = 28/197 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 25 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGVVNYASAVKQ 75 Query: 230 TFPL-AWGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL A G + + + R+ + T + +++A K F E D K Sbjct: 76 EFPLRAHGSKASLLRAVRRIQPLSTGTMTGLAIQFAITKAFSNTE---GGRARSPDISKV 132 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 +I +TDG D A+ G ++AIGV L+ AS + Sbjct: 133 VIVVTDGRPQDSVRDVSA------RARANGIELFAIGVG-RVDKATLRQIASEPQDEHVD 185 Query: 344 SVQN---SRKLHDAFLR 357 V++ KL F Sbjct: 186 YVESYSVIEKLSKKFQE 202 >gi|4009460|gb|AAC95429.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 SELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|86137906|ref|ZP_01056482.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] gi|85825498|gb|EAQ45697.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] Length = 543 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 21/73 (28%), Positives = 36/73 (49%), Gaps = 3/73 (4%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPD-RFYSVQNSRKLHDA 354 + N + C AK +G +VY IG +A + L++CAS D ++ V ++ DA Sbjct: 470 GNSTKNARTRSVCEAAKAKGIVVYTIGFEAPSNGVAVLRDCASSDAHYFDVDGL-EIKDA 528 Query: 355 FLRIGKEMVKQRI 367 F I + + R+ Sbjct: 529 FASIATSIRQLRL 541 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 47/350 (13%), Positives = 103/350 (29%), Gaps = 62/350 (17%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA----- 60 +R F + G ++ T + V G+ ++ + L Y LD ++L A Sbjct: 23 LRAFRRDDSGVMAYPTVAFFLAMLAVGGIGVDLMRMERDRTVLQYTLDRAVLAAADLDQT 82 Query: 61 --TKILNQENGNNG--------KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI----- 105 ++ Q+ N ++ Y+ ++ F L D+ Sbjct: 83 QPPAVVVQDYLNKAGLGEYYQEPIVESGLGYKRVQATIDATFEAHLLRFSNGNDLPVFAT 142 Query: 106 ---NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS 162 +S+++D + S +S ++ ++ + S V + Sbjct: 143 SKAEESIDGLEISLVLDVSGSMNSNSRLSNLKVAAK-DFIDTMVANTTDGKMSISVVPYA 201 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR-----SIREMLDIIKSIPDVNNVVR 217 ++ + D+ +D ++ ++ A S + L+ R Sbjct: 202 TQVSLPDDL---IDQYTTVGENPYSNCINFEAAEYNSASLSTLDTLERSMHFTPWGYSNR 258 Query: 218 SGLVTFSSKIVQTFP------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 +SS + P L ++ I L G T G+++ Sbjct: 259 DMRTYYSSPRLVRSPVCDERASREVLPLQKDATTLKNFIQNLSAGGNTSIDVGMKWGTAL 318 Query: 266 IFDAKEKLEHIAKGH------------------DDYKKYIIFLTDGENSS 297 + + D K I+ +TDG+N+S Sbjct: 319 LDPSARPAISAISTGIGASVPGDFSDRPAEYSDSDTIKIIVLMTDGQNTS 368 >gi|332879551|ref|ZP_08447246.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682517|gb|EGJ55419.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 352 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 62/174 (35%), Gaps = 20/174 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ ++ G++ ++ +D+S SM +L + + ++D + + Sbjct: 80 TKMDTRKRQGIEAIIAMDISNSMMAEDVTP-SRLEKSKMLVSNIVDKMTD-------DKI 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+ ++ + P+ + + + T ++ A + Sbjct: 132 GLIVYAGEAYTQLPITSDYVSAKIFLETINPSMITTQGTDIKQAID-------LAMKSFT 184 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + D K I +TDGE++ ++ A +G VY +GV + Sbjct: 185 SNQDVSKAIFVITDGEDNEG-----GAVEMAKAAAEKGIKVYVLGVGSPQGAPI 233 >gi|325919326|ref|ZP_08181363.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] gi|325550197|gb|EGD21014.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] Length = 520 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 86/238 (36%), Gaps = 34/238 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW +S + + ++ +++ ++DVS SM DKL + S++ Sbjct: 125 TPWNNDSLLLRVGVAGRAIATADLPPA-NLVFLVDVSGSME-----SPDKLPLLQSSLKL 178 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGL 259 ++ +++ R LVT++ P G Q I E I+ + G +T G+ Sbjct: 179 LVRQLRAKD------RITLVTYAGNTAVVLPPTPGDQQGRIIEAIDTVQSGGSTAGASGI 232 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E AY + +G + I+ TDG+ + D + E +R G + Sbjct: 233 ELAYKA------AQQGYLRGGINR---ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVAL 283 Query: 320 YAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 +G D ++ A + + + +L I +++ Q Sbjct: 284 STLGFGTGNYNDTLMEQLADAGDGAYAYIDSPLEARKVLTHELGATLATIARDVKIQV 341 >gi|221042220|dbj|BAH12787.1| unnamed protein product [Homo sapiens] Length = 648 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 74/208 (35%), Gaps = 27/208 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+ FS++ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LIVFSTEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + D+ + E + +G Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQ-LLDSSNQEERLPEGSVSL-- 379 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + + EA ++ +G + + FL+ A + Sbjct: 380 -IILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++ +L D + + ++ Sbjct: 439 RIHEDSDSALQLQDFYQEVANPLLTAVT 466 >gi|149916952|ref|ZP_01905453.1| hypothetical protein PPSIR1_21929 [Plesiocystis pacifica SIR-1] gi|149822230|gb|EDM81621.1| hypothetical protein PPSIR1_21929 [Plesiocystis pacifica SIR-1] Length = 416 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 61/200 (30%), Gaps = 29/200 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M++V+D S SM D + A + E++D + LV F S+ Sbjct: 1 MVLVVDTSASMKG------DAIEGAKAAAMELVDGLAEGDSF------ALVVFHSRAEVL 48 Query: 231 FPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAY-------NKIFDAKEKLEHIAKG 279 P + KI + TT GL+ A N + Sbjct: 49 MPSTVINEDSRAAARSKIETMQAWGTTDLAGGLQQALAQLQVAQNIVGAGGSTGAQSGAP 108 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + ++ L DG + + RG + A+G E + L + A Sbjct: 109 DPTVLERVVLLGDGVPNDASTIPSTVGQLA----ARGTQITALGYGIEYDETLLASLAEQ 164 Query: 338 SPDRFYSVQNSRKLHDAFLR 357 + F V + + F Sbjct: 165 THGSFRFVDDPEAVASLFRD 184 >gi|260434111|ref|ZP_05788082.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417939|gb|EEX11198.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 600 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 44/257 (17%), Positives = 88/257 (34%), Gaps = 38/257 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + G+++ILT L+ ++F+ G ++ +AKL Y LD ++L A + Sbjct: 25 FATSESGAMTILTLFLIMIVFVASGFAVDVMRYDRERAKLQYALDRAVLAAA-------D 77 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + K+ + K G + ++ ST+ + K Y Sbjct: 78 LDQELCPKDVVIDYLKKEGLDKYL------TGDPKVEPDVCGSTAAVL------KGYR-R 124 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + +M W + +++ ++ +S +++ +VLDVS SM Sbjct: 125 VEANADMDIEMHFMKW----RGIETIASAATSVAEESIGNVEISLVLDVSGSMRGS---- 176 Query: 189 MDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQE 242 KL ++ +D D V +V +S ++ L G I Sbjct: 177 --KLENLKKAANLFIDDMFAKTEDGK--VSISIVPYSEQVSIPDYLMNKLNTQGTNSIAN 232 Query: 243 KINRLIFGSTTKSTPGL 259 ++ T Sbjct: 233 CVDFASADFATTRFTAF 249 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 30/75 (40%), Gaps = 4/75 (5%) Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPD-RFYSV-QNSRKLH 352 + D + + C +AK I+++I A LK C S D +Y N + + Sbjct: 525 NQVQKDPRLT-SICQKAKDEKIIIFSIAFDAPDGVKPLLKGCVSDDGAYYEAKDNDKDII 583 Query: 353 DAFLRIGKEMVKQRI 367 F IG + R+ Sbjct: 584 SVFSSIGSTIQNLRL 598 >gi|194671641|ref|XP_591137.3| PREDICTED: matrilin 3 [Bos taurus] gi|297480578|ref|XP_002691564.1| PREDICTED: matrilin 3 [Bos taurus] gi|296482389|gb|DAA24504.1| matrilin 3 [Bos taurus] Length = 574 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 77/221 (34%), Gaps = 34/221 (15%) Query: 152 PLLITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + + + + G LD++ ++D S S+ + + +++D Sbjct: 147 SVPYSGTGQPGGARGAGVCKSRPLDLVFIIDSSRSVRPL------QFTKVKTFVSKIIDT 200 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYA 262 + P R +V ++S + F L Q ++ + R+ + T S ++ A Sbjct: 201 LDIGP---MDTRVAVVNYASTVKIEFHLQTHSDKQSLKRAVARITPLSTGTMSGLAIQTA 257 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ F + K I +TDG + A+ G +YA+ Sbjct: 258 MDEAFT---VEAGARGPSSNIPKVAIIVTDGRPQD------QVNEVAARARASGIELYAV 308 Query: 323 GVQAEAADQFLKNCAS---PDRFYSVQN---SRKLHDAFLR 357 GV A + LK AS + + V+ KL F Sbjct: 309 GVD-RADMESLKMMASEPLDEHVFYVETYGVIEKLSSRFQE 348 >gi|323320820|gb|ADX36428.1| complement factor B [Apostichopus japonicus] Length = 913 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 88/217 (40%), Gaps = 26/217 (11%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 I D GLD+ + D S S+ + D +A + +E++ I + D N +R G Sbjct: 450 IPLSHDTGLDLYFMFDGSSSVGE------DNFNMAKKFAKELVKEI-GVTDRPNSLRVGA 502 Query: 221 VTFSSKIVQTF-PLAWG-VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F+S+ F +A+ + + I+ + + T E N + + Sbjct: 503 LVFNSEAEIGFHTVAFDSTADVLDAIDSMEYKEGGTNIAKAFEVLSNVMIP-----QTAK 557 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + K + +TDG+ + +++ + + ++ IG+ A + L A Sbjct: 558 LNREKSFKTVFLITDGDATEGGDAQEDAR----AVRDQDVTIHCIGISENATRRTLSGMA 613 Query: 338 S---PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 S + + +++ L + F++I + Q I Y++ Sbjct: 614 SEPLSEHLFFLKDYSTLEE-FIQI---VTNQTIDYSE 646 >gi|170672288|gb|ACB29772.1| matrilin-3 alternative transcript [Homo sapiens] Length = 444 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 73/218 (33%), Gaps = 32/218 (14%) Query: 154 LITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + + G LD++ ++D S S+ + + ++D + Sbjct: 61 PASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRIIDTLD 114 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYN 264 P R +V ++S + F L Q +++ + R+ + T S ++ A + Sbjct: 115 IGP---ADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMD 171 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + F + + K I +TDG + A+ G +YA+GV Sbjct: 172 EAFT---VEAGAREPSSNIPKVAIIVTDGRPQD------QVNEVAARAQASGIELYAVGV 222 Query: 325 QAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 + P + + V+ KL F Sbjct: 223 DRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 260 >gi|158255148|dbj|BAF83545.1| unnamed protein product [Homo sapiens] Length = 486 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 73/218 (33%), Gaps = 32/218 (14%) Query: 154 LITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + + G LD++ ++D S S+ + + ++D + Sbjct: 61 PASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRIIDTLD 114 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYN 264 P R +V ++S + F L Q +++ + R+ + T S ++ A + Sbjct: 115 IGP---ADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMD 171 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + F + + K I +TDG + A+ G +YA+GV Sbjct: 172 EAFT---VEAGAREPSSNIPKVAIIVTDGRPQD------QVNEVAARAQASGIELYAVGV 222 Query: 325 QAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 + P + + V+ KL F Sbjct: 223 DRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 260 >gi|42524203|ref|NP_969583.1| hypothetical protein Bd2793 [Bdellovibrio bacteriovorus HD100] gi|39576411|emb|CAE80576.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 354 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 64/185 (34%), Gaps = 16/185 (8%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + P + S ++ S+ G++++ +DVS SM +L A + ++D Sbjct: 72 ALARPQMGESQQEVKSE---GVEIIFAVDVSESMMAEDVKP-SRLAQAKAELSRLVD--- 124 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 +P G+V F+ PL I+ + L S + A Sbjct: 125 LMPGNK----VGIVAFAGSAALLSPLTNDPGAIKMYLESLEPSSVSSQGTNFTEALKISK 180 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 +A E+ + I+ +DGE+ + +L + G ++++ E Sbjct: 181 EAFERGGVSTDETVKVTRVILIASDGEDHE-----QGALDEAKKMAGEGVRIFSLAYGTE 235 Query: 328 AADQF 332 Sbjct: 236 KGGAI 240 >gi|11321565|ref|NP_002372.1| matrilin-3 precursor [Homo sapiens] gi|14548113|sp|O15232|MATN3_HUMAN RecName: Full=Matrilin-3; Flags: Precursor gi|3647275|emb|CAA12110.1| matrilin-3 [Homo sapiens] gi|62630192|gb|AAX88937.1| unknown [Homo sapiens] gi|119621242|gb|EAX00837.1| matrilin 3 [Homo sapiens] gi|146218451|gb|AAI39908.1| Matrilin 3 [Homo sapiens] Length = 486 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 73/218 (33%), Gaps = 32/218 (14%) Query: 154 LITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + + G LD++ ++D S S+ + + ++D + Sbjct: 61 PASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRIIDTLD 114 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYN 264 P R +V ++S + F L Q +++ + R+ + T S ++ A + Sbjct: 115 IGP---ADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMD 171 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + F + + K I +TDG + A+ G +YA+GV Sbjct: 172 EAFT---VEAGAREPSSNIPKVAIIVTDGRPQD------QVNEVAARAQASGIELYAVGV 222 Query: 325 QAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 + P + + V+ KL F Sbjct: 223 DRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 260 >gi|317491692|ref|ZP_07950127.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316920126|gb|EFV41450.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 416 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 77/232 (33%), Gaps = 34/232 (14%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 IR F + G+ +I ++ + + +E S +A+L ++ + L Sbjct: 11 IRRFKQDRSGAFAISFVMMSGFLLSMAAFGLEGSRYITERARLSDAMEQAALA------- 63 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + N + + FR +R D++ + + + I +++ Sbjct: 64 ------LTAEDNGDGAQRNYTLSSDYFRAYMRH-----DVDVFKPTVIVKSGISPNNQNL 112 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + F S ++ + K +D++ V D S SMN+ F Sbjct: 113 SYVEYRVSGQTLQDSWFSSTFFPSFDKQVVIGDNGAARKFRSNMDVIFVTDFSGSMNEGF 172 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQTFPLAWG 236 G G KL R + ++ D L +++ V P WG Sbjct: 173 G-GSTKLAELKRIVLKLSDE--------------LFSYNIDNKVGFVPFGWG 209 >gi|288800164|ref|ZP_06405623.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333412|gb|EFC71891.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] Length = 339 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 69/205 (33%), Gaps = 29/205 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 R + F+ + S KIS + G++ ++ LD+S SM + Sbjct: 54 RPWVKFLLLITALSSLILALARPQFGS-KISHEKRNGIEAIIALDISNSMLAQDVQP-SR 111 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG- 250 L + I +++ + + GLV F+ + P+ + ++ + Sbjct: 112 LDKSKLMIENLINSFIN-------DKIGLVVFAGEAYVQLPITSDYVSAKMFLSDITPNL 164 Query: 251 ---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T + + + K K II +TDGE++ +L Sbjct: 165 ISAQGTDIARAIRVSLSSFTQQKG-----------VGKAIILITDGEDNEG-----GALE 208 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF 332 EAK +G V+ +GV Sbjct: 209 AVKEAKEKGVNVFILGVGDSKGAPI 233 >gi|7441761|pir||JC5576 inter-alpha-trypsin inhibitor heavy chain 3 - golden hamster Length = 889 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 105/301 (34%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 169 MYLKVQPKQLVRHFEIDA--HIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSF 226 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + + D+ + E P AP + K Sbjct: 227 KPSLD--QQRSCPTCTDSLLNGDFTIVYDVNRESPGNVQVVNGYFVHFFAPQGLPVVPK- 283 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGL 220 +++ V+D+S SM K+ ++ ++LD +K +N + +G+ Sbjct: 284 --------NIVFVIDISGSMAG------RKIQQTRVALLKILDDMKQDDYLNFILFSTGV 329 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 T+ +VQ P ++ + + + T GL + DA+E+ + Sbjct: 330 TTWKDSLVQATPA--NLEEARTFVRSISDQGMTNINDGLLRGIRMLTDAREQHTVPERST 387 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 II LTDG+ ++ ++ +A +Y +G FL+ A + Sbjct: 388 S----IIIMLTDGDANTGESRPEKIQENVRKAIEGRFPLYNLGFGNNLNYNFLETMALEN 443 Query: 341 R 341 Sbjct: 444 H 444 >gi|149773093|emb|CAO01896.1| collagen type VI alpha 6 [Mus musculus] Length = 1120 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 17/191 (8%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 +D+ M+ D + ++ + VR G+ FS F L Sbjct: 10 VDLVFLMDGSNSIHPDDFQKMKGFLVSVVQDFDVSLNR---VRIGVAQFSDSYRSEFLLG 66 Query: 235 W--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 G + I +I + T L + + ++ LT Sbjct: 67 TFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQPDMGSRINAGTP-----QVLLVLT 121 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DG + E E + +G +Y++G+ + ++ + ++ +V N +L Sbjct: 122 DGRSQD------EVAQAAEELRHKGVDIYSVGIGDVDDQELVQITGTAEKKLTVHNFDEL 175 Query: 352 HDAFLRIGKEM 362 RI + + Sbjct: 176 KKVKKRIVRNI 186 >gi|167549689|ref|ZP_02343448.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205325282|gb|EDZ13121.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] Length = 593 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 71/190 (37%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 234 LVFLIDTSGSMQ-----PAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALA 284 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+F Sbjct: 285 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILFT 335 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 336 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 395 Query: 348 SRKLHDAFLR 357 + Sbjct: 396 LSEAQKVVKD 405 >gi|115963085|ref|XP_001182555.1| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1245 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 61/177 (34%), Gaps = 30/177 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + + +++VLD S SM +DK+ A + ++D + + G+ Sbjct: 515 VQASTGDECRVVLVLDTSGSMGTS--NRIDKVNSAATAFVNLVD---------DGISIGI 563 Query: 221 VTFSSKIVQTFPLAW-GVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 VTF+ L Q + I +L T GLE + Sbjct: 564 VTFTGSPTTRHALTQINTQADRDSLRDIFQLTASGGTCIGCGLEQGLEVLMAHPSGSADG 623 Query: 277 AKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 I+ +TDG++S N +++L G V + + +A + Sbjct: 624 G--------IIVLMTDGQDSGIQNHIIRQTLQ------DMGVRVNTVAIGEDAYGEL 666 >gi|290970562|ref|XP_002668176.1| predicted protein [Naegleria gruberi] gi|284081406|gb|EFC35432.1| predicted protein [Naegleria gruberi] Length = 518 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 66/196 (33%), Gaps = 30/196 (15%) Query: 176 DVSLSMNDHFGPGM------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D S SM D+LG S + L++ D+ + + Sbjct: 337 DKSGSMGGSDARPTSSKYSNDRLGALFESCEKFLEVRDGSSDL-----VSCIMYDHSAYN 391 Query: 230 TF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F PL+ + ++ + G T T ++ + I H YK Sbjct: 392 CFTTNPLS---TSLVSTMSSYVAGGGTSFTNAMQSVSSLISS-------TYPNHQSYKIV 441 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYS 344 ++F++DGE+S+ + + I++ I + + + L+ A+ R F Sbjct: 442 VLFMSDGEDSADEAVSITGQLVSSH----DIILHTIQLGGSSDNTGLRQMAATGRGQFKR 497 Query: 345 VQNSRKLHDAFLRIGK 360 +S L + I Sbjct: 498 ANDSASLAGIYQEIAN 513 >gi|148253748|ref|YP_001238333.1| hypothetical protein BBta_2249 [Bradyrhizobium sp. BTAi1] gi|146405921|gb|ABQ34427.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 432 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 61/423 (14%), Positives = 119/423 (28%), Gaps = 75/423 (17%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F N G+I+I+ A+ L I +G I+ S KAKL LD ++L Sbjct: 11 LSRFRRNESGNIAIIFALALLPILTFVGSAIDYSMAVRAKAKLSASLDAAMLAATGYTAM 70 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNE-LRENGFAQDINNIERSTSLSIIIDDQHKD 124 + + K N++ + L N + + + +++ Sbjct: 71 RGTAADAKTSAT--------NMYNGQMSSHKLTSNSLNITVTDSVTARTVTGTASVVVNT 122 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD------VS 178 + M + + ++ + S M L + Sbjct: 123 AFMYMFGFPTMTVTASSSASASFPTYMDFYVLVDNSPSQGLGATTADMTTLQNATSDKCA 182 Query: 179 LSMNDHFGPGMDK-------------------LGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +D + K + V + + + D S V+N R Sbjct: 183 FACHDTYTSSTKKTLQTNSYYQIAKNKGVTMRIDVVRSATQSLTDTATSSQVVSNQYRMA 242 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN----------KIFDA 269 + + S + L + YN Sbjct: 243 VYSLGSDC-GSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGAMSGMN 301 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDG--ENSSPNIDNKESLF-----------YCNEAKRRG 316 +K++ F++DG + S P +K L C+ K RG Sbjct: 302 GVIPAQGDGSSTSPQKWLFFVSDGVADYSYPTTCSKTVLSGGRCQEPLNTTTCDTLKARG 361 Query: 317 AIV---YA--------------IGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 + Y I ++ +K+CASP +Y V +S + A + Sbjct: 362 IKIAVLYTTYLAITNNSWYTTYIAPWRDSISGIMKSCASPGYYYEVDSSGSIGAALTALF 421 Query: 360 KEM 362 ++ Sbjct: 422 QQA 424 >gi|328865949|gb|EGG14335.1| Ubiquitin-conjugating enzyme E2 4 [Dictyostelium fasciculatum] Length = 563 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 72/202 (35%), Gaps = 13/202 (6%) Query: 172 MMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 ++VLD+S SM PG ++ + I V GLV F K Sbjct: 148 VIVLDLSGSMQQPAYIGSRVPGELEMTRIEAAQATFQTFIDRFVSYRYPVAVGLVCFGQK 207 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 I TFP++ ++ + + T+ ++ A I + K+ + + Sbjct: 208 IEATFPISSNFDSFSNELGEVEAHQSQTRLWEAIKRAAEVIVEFKKSPTLKLAPNVRSR- 266 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYS 344 I LTDGE++ + + + G I+ +I + + A+ + Sbjct: 267 -IFCLTDGEDNG----STPVFTVFDYLRTHGIILDSIPIGQQGRATLSAFSKATGGTCFV 321 Query: 345 VQNSRKLHDAFLRIGKEMVKQR 366 +S + F R M++ R Sbjct: 322 ANSSIECVQLFEREALLMLEHR 343 >gi|310657503|ref|YP_003935224.1| hypothetical protein CLOST_0189 [Clostridium sticklandii DSM 519] gi|308824281|emb|CBH20319.1| exported protein of unknown function [Clostridium sticklandii] Length = 466 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 66/164 (40%), Gaps = 19/164 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGM------DKLGVATRSIREMLDIIKSIPDVN 213 + + + + +VLD S SM ++ G ++ + + ++D + + Sbjct: 188 QTKAGKKVVAAITLVLDESGSMANNMSGGRVTSSNPSRISILRNRAKALIDQFSGLGN-- 245 Query: 214 NVVRSGLVTFSSKIV----QTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGLE---YAYN 264 + G++ +S ++F LA G I+ KI+ L T + + YA Sbjct: 246 --IYVGIIPYSDDAYISGTKSFVLANGTNVNTIKNKIDSLTAQGMTNTGDAMRVSYYATK 303 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 + D+ +++ Y+I L+DG+ + + +E + + Sbjct: 304 QFKDSPNSIDNTLPTDTKVIPYMILLSDGDPTVFSATTREWVGW 347 >gi|320103074|ref|YP_004178665.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319750356|gb|ADV62116.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 602 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 62/199 (31%), Gaps = 11/199 (5%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + W + + S G D ++V+DVS SM P +L Sbjct: 3 LLALCCVAIGWVWEGTAWFDRLGSQPPPQHPLGSGRDWVLVIDVSRSMAARDAPP-SRLE 61 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 A ++D+ P R L+ FS + V PL ++ + RL G Sbjct: 62 TAQTVANALIDLAMRSPH----DRVALIVFSDRAVIRTPLTRRADLVRRALARLRPGELR 117 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 L A + + + + I L+DGE+ P SL + A Sbjct: 118 PGGSHLAEAVTVAQNLLIRSKRLDDPAFPRSAAIWLLSDGESPRP---PGISLLSASPAP 174 Query: 314 RRGAIVYAIGVQAEAADQF 332 V A+G Sbjct: 175 ---IHVVAVGRAEPPGAPV 190 >gi|157412073|ref|YP_001481413.1| TerY1 [Escherichia coli APEC O1] gi|99867098|gb|ABF67743.1| TerY1 [Escherichia coli APEC O1] Length = 239 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 46/225 (20%), Positives = 77/225 (34%), Gaps = 19/225 (8%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 P+ PL T+ + S K ++ L + ++LD S SM+ + Sbjct: 1 MPFYQIYKLWPLPETTLLFRSLKKELHLRRLPVYLLLDTSGSMHGE------PIEAVKNG 54 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ +L +K P ++TF S Q PL + ++ L TT Sbjct: 55 VQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLT---DLLSFQMPALTASGTTSLGEA 111 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L + I +K KG + +TDG SPN D ++ L A+ G Sbjct: 112 LSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDG---SPNDDWRKGLNDFKAART-GV- 164 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 V A +A LK +S + F + + Sbjct: 165 VVACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFKWVSASIS 209 >gi|20306196|gb|AAH28343.1| Chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 60/168 (35%), Gaps = 35/168 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM + D+L ++ R L + V G+VTF S Sbjct: 308 VCLVLDKSGSMLND-----DRLNRMNQASRLFL-----LQTVEQGSWVGMVTFDSAAYVQ 357 Query: 231 FPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + + T GL A+ I Sbjct: 358 SELKQLNSGADRDLLIKHL-PTVSAGGTSICSGLRTAFTVIKKKYPTDGSE--------- 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF 332 I+ LTDGE++ ++ C + K+ GAI++ + + AA + Sbjct: 408 -IVLLTDGEDN--------TISSCFDLVKQSGAIIHTVALGPAAAKEL 446 >gi|87310828|ref|ZP_01092954.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] gi|87286343|gb|EAQ78251.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] Length = 788 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 73/223 (32%), Gaps = 35/223 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 LL + V+ ++ V+D S SM+ G +++ A + + L+ Sbjct: 278 LLASPPVEEVGDVKTKKTVIFVVDRSGSMS---GEKIEQAKEAAKFVLNNLNEGDLFN-- 332 Query: 213 NNVVRSGLVTFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 ++ + S + P + ++ L G +T L A + Sbjct: 333 -------IIAYDSDVESFEPELQKLDDKTREKALGFVDNLYAGGSTNIDGALAKAMGMLK 385 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D K Y++FLTDG + + + + + A V + GV + Sbjct: 386 DDKRPS------------YMLFLTDGLPTHGEQNEAKIVDNAKQKNDVRARVISFGVGYD 433 Query: 328 AADQFL----KNCASPDRFYSVQNSRKLHDA--FLRIGKEMVK 364 + L + C + + H A + +I ++ Sbjct: 434 VNSRLLDRLSRECFGQSEYVRPNEDIETHVAKLYNKISAPVMT 476 >gi|291527684|emb|CBK93270.1| Mg-chelatase subunit ChlD [Eubacterium rectale M104/1] Length = 1237 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 77/204 (37%), Gaps = 33/204 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G++ ++V+D S SM + +L VA I + + K G+V F+S Sbjct: 607 SGVEAILVIDDSGSMVSN-DRYNQRLTVAQNLIDNLPENSK----------VGVVKFTSS 655 Query: 227 IVQTFPLAW-GVQHIQEKI--NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + + + + T + A N F E D+ Sbjct: 656 TTKLTTSLTSDKETAKSYLTTSYFRSSGGT----SMYTAINSSFSMFEAT------DDNI 705 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---QFLKNCA--S 338 K +I L+DG S + + A G +Y +G+ + ++ Q+LK A + Sbjct: 706 LKMMIVLSDGATSYTYLHSS----VVTTANNNGVKIYTVGLGSSSSSYFTQYLKPLANNT 761 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEM 362 FY ++ +L D + I K++ Sbjct: 762 GGAFYLASDASQLEDIYKDINKKI 785 >gi|193786838|dbj|BAG52161.1| unnamed protein product [Homo sapiens] Length = 496 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 84/207 (40%), Gaps = 34/207 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + + I +++D + + + GLV +SS Sbjct: 272 SATDLVFLIDGSKSVRPE------NFELVKKFISQIVDTLDVSD---KLAQVGLVQYSSS 322 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q FPL G H ++ I + +T + L+Y + D + A+ Sbjct: 323 VRQEFPL--GRFHTKKDIKAAVRNMSYMEKSTMTGAALKY----LIDNSFTVSSGARPG- 375 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 +K I TDG + D +AK G ++A+GV D+ + + P Sbjct: 376 -AQKVGIVFTDGRSQDYIND------AAKKAKDLGFKMFAVGVGNAVEDELREIASEPVA 428 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + ++ + + ++ IGK++ K+ Sbjct: 429 EHYFYTADFKTINQ----IGKKLQKKI 451 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 41 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGMVNYASTVKQ 91 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L V + + + R+ + T + +++A K F E D K Sbjct: 92 EFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSR---SPDISKV 148 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIGV + + + P Sbjct: 149 VIVVTDGRPQDSVQDVSA------RARASGVELFAIGVGSVDKATLRQIASEP 195 >gi|289679370|ref|ZP_06500260.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae FF5] Length = 120 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 50/130 (38%), Gaps = 11/130 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 M +DVS SM+ P M + + ++ + R GL+ F ++ P Sbjct: 1 MAVDVSGSMD---YPDMQWKSDEVSRLVLVQQLLGDFLEGRKGDRVGLILFGTQAFVQAP 57 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L + + ++ ++ G K+T + DA + + ++ +TD Sbjct: 58 LTYDRRTVRVWLDEARIGIAGKNT--------ALGDAIGLALKRLRMRPATSRALVLVTD 109 Query: 293 GENSSPNIDN 302 G N++ ID Sbjct: 110 GANNAGQIDP 119 >gi|91203253|emb|CAJ72892.1| hypothetical protein kustd2147 [Candidatus Kuenenia stuttgartiensis] Length = 701 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 95/267 (35%), Gaps = 31/267 (11%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPF------IFCTFPWCANSSHAPLLITSS--------VK 160 +I I +Q D+ + ++ P + + + Sbjct: 256 NIKILNQTTDHQIPNTIQFIFSLRDQSNHAILFDPNKVVRKNIRIWENGTEIDYLESHAL 315 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + ++ D L +M+VLD S SM + S +++++ + N+ + G+ Sbjct: 316 LYTQDDFQLQVMIVLDYSASM----YEKNGDITRMALSAKDLIESL------NDTHQVGV 365 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V F L + I + S+ K ++ + K + K Sbjct: 366 VEFHRPDEPPAILQDFTTYKNAAIEAVSQFSSGKIYRDFSSCWDAVL--KGLKQFPEKPD 423 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--S 338 D K ++FL+DG ++S + AK R +Y +G+ + ++ LKN A + Sbjct: 424 PDIFKTLVFLSDGFDNSSFSTPGNVISL---AKERDVHIYILGIGRGSEEEVLKNIALET 480 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +N + F + K++ Q Sbjct: 481 GGTYVHAENIAVFRERFKQTIKDIKGQ 507 >gi|86148746|ref|ZP_01067019.1| VCBS [Vibrio sp. MED222] gi|85833461|gb|EAQ51646.1| VCBS [Vibrio sp. MED222] Length = 2142 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 42/176 (23%), Positives = 73/176 (41%), Gaps = 11/176 (6%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIP 210 P+ V ++ ++ ++ +V+DVS SM D G +L + S+ +ML ++ Sbjct: 1500 PVATAIDVPVTPETKSDTNIQLVIDVSGSMGYDSGVAGKTRLAILKESLAKMLQQYDTLG 1559 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 DV V+ T ++K++ +W V +IN+L + T L A +D Sbjct: 1560 DVK--VQIVTFTGNAKLIHDGSKSWFSVSEAITEINKLKPKNNTDYDDALRKA-RTSWDH 1616 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 E + + Y F+TDG N ID E+ + G AIG+ Sbjct: 1617 DEDSKLPDANNVSY-----FITDGIPNQDDRIDYWEAKTWTKHLDDNGITSQAIGI 1667 >gi|156523281|ref|NP_001096038.1| anthrax toxin receptor 2b [Danio rerio] gi|151335854|gb|ABS00409.1| capillary morphogenesis protein 2B [Danio rerio] Length = 487 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 78/203 (38%), Gaps = 30/203 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ VLD S S++D++ + T V+ +R + FSS Sbjct: 36 GAFDLYFVLDRSGSVSDNWLEIYGFVEQLTNRF------------VSPKMRVSFIVFSSS 83 Query: 227 IVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL I ++++++ T GL+ A ++ + Sbjct: 84 AEIILPLTGDRVDIDSGLQQLSKIRPAGDTYMHEGLKKAIEQMTSQGARASS-------- 135 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDG+ N+ ++ + A++ GA VY +GV+ A+Q + + D+ + Sbjct: 136 --IIIALTDGK--LEVFMNELAIKEADLARQYGARVYCVGVKDFDANQLTEIADNKDQVF 191 Query: 344 SVQNSRKLHDAFLRIGKEMVKQR 366 V + A I ++++ Sbjct: 192 PVVDG---FQALKNIVNSILQKS 211 >gi|148234427|ref|NP_001080470.1| matrilin 2 [Xenopus laevis] gi|28175657|gb|AAH45220.1| Matn2-prov protein [Xenopus laevis] Length = 589 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 74/205 (36%), Gaps = 28/205 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ + LD++ ++D S S+ + ML + PD R GL+ Sbjct: 44 SACHNKPLDLVFIIDSSRSVRPA------DFEKVKEFLITMLKFLDIGPD---TTRVGLL 94 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + S + F L + I+ + R+ + T + ++YA N F E + Sbjct: 95 QYGSTVKNEFSLKMYKRKSDIERAVKRMMHLATGTMTGLAIQYAMNIAFSEAEGARPL-- 152 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + + +TDG P +A+ G +++AIGV LK S Sbjct: 153 -NQYVPRIAMIVTDGRPQDPVE------EISAKARMSGILIFAIGVGRVDMST-LKTIGS 204 Query: 339 ---PDRFYSVQNSRK---LHDAFLR 357 + + V N + L F Sbjct: 205 EPHSEHVFLVANFSQIETLTSVFQN 229 >gi|292627386|ref|XP_001332035.3| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 1056 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 77/202 (38%), Gaps = 22/202 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ M+ + G A R + + + G+V +S PL Sbjct: 102 DLVYIMDGSWSVGDVDFETAKRWLINVTSGFDVSSHYSQ---VGVVQYSDTPRLEIPLGQ 158 Query: 236 --GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 Q + E I ++ G T++ +++A + +F + + +D + + +TD Sbjct: 159 HKTTQQLIEAIEKISYLGGNTQTGRAIKFAVDHVFPSSRR-------NDVKNRIAVVVTD 211 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSR 349 G++ D +A+ +G V+A+GV +E + L A + D ++ Sbjct: 212 GKSQDDVTDASL------DARTQGITVFAVGVGSEITNSELVTIANTPAGDYVLFAEDYT 265 Query: 350 KLHDAFLRIGKEMVKQRILYNK 371 + + +++ ++ + + Sbjct: 266 NIERIRDAMEQKLCEESVCPTR 287 >gi|291242482|ref|XP_002741137.1| PREDICTED: chloride channel calcium activated 3-like [Saccoglossus kowalevskii] Length = 975 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 64/188 (34%), Gaps = 26/188 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM+ D++ +S ++ I D + V G +++ + Sbjct: 342 VVLVLDTSGSMDG------DRIQRLYQSATYFIEN--RIEDGSFVGIVGFSSYAVILASM 393 Query: 231 FPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L +G + K+ T G+ A + D E + ++ Sbjct: 394 TELKYGYQRSEVSSKV-PQEADGATSIGGGVRLALQVLQDGNVTSEGAS---------LL 443 Query: 289 FLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG EN+ P + + E G V I A + ++ Y+ Sbjct: 444 LITDGVENTYPFL-----MNVMQEVYDSGVRVDTIAFTEAAQSTLQQLSSNTGGLYNYVP 498 Query: 348 SRKLHDAF 355 AF Sbjct: 499 EDDNSTAF 506 >gi|242042271|ref|XP_002468530.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] gi|241922384|gb|EER95528.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] Length = 686 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 73/211 (34%), Gaps = 41/211 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 LD++ VLDVS SM DKL + +++ ++ + R +V+FSS Sbjct: 163 RAPLDLVTVLDVSGSMRW------DKLALVKQAMGFVIGSLGPHD------RLSVVSFSS 210 Query: 226 KIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + L G E + L G T GL A + + + + + Sbjct: 211 GARRVTRLLRMSHTGKSLATEAVESLRAGGGTNIAEGLRTAAKVLGERRHRNAVSS---- 266 Query: 282 DYKKYIIFLTDGENSS-----------PNIDNKESLFYCNEAKRRG---AIVYAIGVQAE 327 +I L+DG ++ PN + + G A ++ G + Sbjct: 267 -----VILLSDGHDNYSMPRRARGGVPPNYEVLVPPSFVPGTASTGEGSAPIHTFGFGND 321 Query: 328 AADQFLKNC--ASPDRFYSVQNSRKLHDAFL 356 + A+ F ++N + DAF Sbjct: 322 HDAAAMHVVAEATGGTFSFIENEAVIQDAFA 352 >gi|317057468|ref|YP_004105935.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315449737|gb|ADU23301.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 782 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 70/213 (32%), Gaps = 19/213 (8%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P S + ++D S SM + + I Sbjct: 267 PGTYVVGADASVNEAATTRVAFLIDNSGSMYPKELCPTSSENDVDFKRLDFTQSL--IDK 324 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 ++ R G+ F+ + + + I R+ T + L+ + Sbjct: 325 FDSDFRIGISKFTGTYTRMCGFTDDRTALSDVIKRIRTEDEIFDGTHNQTALKRCIEEFT 384 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 A G Y I+ L+DGE+ N ++ ++L A + IV +G+ E Sbjct: 385 ---------ATGDGKYVNIIVMLSDGESDETNAESIKNL--ARLANEKSVIVLTVGLGRE 433 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 +L+ A + ++YS + L D + +I Sbjct: 434 IDRAWLQEMAYSTGGKYYSASEANALDDVYKQI 466 >gi|291401974|ref|XP_002717657.1| PREDICTED: inter-alpha trypsin inhibitor heavy chain precursor 5 [Oryctolagus cuniculus] Length = 940 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 72/197 (36%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVGA------KLRQTKDALFTILH------DLRPQDRFNIIGFSNRIKVW 343 Query: 229 QTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + ++ I++ I+ + T L+ A + + + H Sbjct: 344 KDNLISVTPNSIRDGKIYIHHMSPTGGTDINGALQTAIRLLNN---YVAHNDIEDRSVS- 399 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 I+FLTDG+ + + L EA + ++ IG+ + + L+NC Sbjct: 400 LIVFLTDGKPTVGETHTLKILNNTKEAAQGRVCIFTIGIGNDVDFKLLEKLSLENCGLTR 459 Query: 341 RFYSVQNSRKLHDAFLR 357 R + +N+ F Sbjct: 460 RVHEEENAGAQLIGFYD 476 >gi|207857733|ref|YP_002244384.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|206709536|emb|CAR33881.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] Length = 596 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 237 LVFLIDTSGSMQ-----PAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALA 287 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 288 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 338 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 339 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 398 Query: 348 SRKLHDAFLR 357 + Sbjct: 399 LSEAQKVLKD 408 >gi|115372062|ref|ZP_01459374.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] gi|115371027|gb|EAU69950.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] Length = 540 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 72/212 (33%), Gaps = 33/212 (15%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + + P + AP S+ +I++K + V+D S SM ++ Sbjct: 40 LLTYKQADEPGYFIALIAPKTEVSASEIAAKR-----VTFVIDTSGSMQGS------RMQ 88 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAW----GVQHIQEKINRLI 248 +A +++ + + D NVVR FS+ + FP L +Q + +L Sbjct: 89 IAKDALKYCVTRLNP-QDTFNVVR-----FSTDVEALFPALKSAQPENIQKAVAFVEQLE 142 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T L G ++F+TDG+ + D + Sbjct: 143 AIGGTAIDEALVRGLQ-----------DNDGKSSAPHLLMFITDGQPTIGETDEGAIAQH 191 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ ++ GV + + L +S Sbjct: 192 AKDGRKAKTRLFTFGVGEDLNARLLDRLSSDG 223 >gi|332254494|ref|XP_003276364.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Nomascus leucogenys] Length = 1313 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 70/206 (33%), Gaps = 36/206 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+DVS SM K+ +++ +L +K+ N +++FS + Sbjct: 284 VVFVIDVSGSMFG------TKIEQTKKAMNVILSDLKANDYFN------IISFSDTVNV- 330 Query: 231 FPLAW-----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 W V ++ ++ + T L A + + + ++ Sbjct: 331 ----WKAGGSIQATIQNVHSAKDYLHCMEADGWTDINSALLAAASVLNHSNQEPGRGPSV 386 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 IIFL DGE ++ L +A ++++ +A L+ + Sbjct: 387 GRIP--LIIFLMDGEPTAGVTTPSVILSNVCQALGHRVSLFSLAFGDDADFTLLRRLSLE 444 Query: 340 DR------FYSVQNSRKLHDAFLRIG 359 +R + + +L + I Sbjct: 445 NRGIAQRIYEDTDAALQLEGLYEEIS 470 >gi|125532270|gb|EAY78835.1| hypothetical protein OsI_33940 [Oryza sativa Indica Group] Length = 606 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 81/212 (38%), Gaps = 37/212 (17%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ VLDVS SM KL + +++ ++D + R +V+FS++ Sbjct: 142 PLDLVTVLDVSGSMAG------RKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEA 189 Query: 228 VQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L G + + L+ S T GL A + D + K + Sbjct: 190 SRRTRLLRMSEVGKATAKRAVESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSS------ 243 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEA---------KRRGAIVYAIGVQAEAADQFLK 334 +I L+DG++S + Y + + R A ++ G A+ + Sbjct: 244 ---VILLSDGKDSYVVPRRGNGMSYMDLVPPSFASSGGRGRLAPIHTFGFGADHDAAAMN 300 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 A + F V+N + D+F + IG + Sbjct: 301 TIAESTGGTFSFVENEAAIQDSFAQCIGGLLS 332 >gi|316983245|pdb|3N2N|F Chain F, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983246|pdb|3N2N|A Chain A, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983247|pdb|3N2N|B Chain B, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983248|pdb|3N2N|C Chain C, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983249|pdb|3N2N|D Chain D, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983250|pdb|3N2N|E Chain E, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain Length = 185 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 46/199 (23%), Positives = 74/199 (37%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 6 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 53 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 54 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 108 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVYA+GV+ Q + S D + Sbjct: 109 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYAVGVKDFNETQLARIADSKDHVF 164 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 165 PVNDGFQALQGIIHSILKK 183 >gi|153012136|ref|YP_001373346.1| cell wall anchor domain-containing protein [Ochrobactrum anthropi ATCC 49188] gi|151564024|gb|ABS17517.1| LPXTG-motif cell wall anchor domain protein [Ochrobactrum anthropi ATCC 49188] Length = 750 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 41/277 (14%), Positives = 91/277 (32%), Gaps = 39/277 (14%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQ---HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL 154 E+ F + + + I +D +D+ L + F L Sbjct: 278 ESLFHKVRIDTTNDATREITLDGAAAADRDFVLEWSAVASDAPQVGLFREHIGKDDYVLA 337 Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + ++S + +++ V+D S SM + A S+ L + Sbjct: 338 YVTPPALASPKKVQREVIFVIDNSGSMGG------TSIEQAKASLDYAL------SQLQP 385 Query: 215 VVRSGLVTFSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R ++ F + + F + + + + L T+ P L A + Sbjct: 386 GDRFNVIRFDDTLTKFFEDSVDANQENIASARRFVTSLEAQGGTEMLPALHAALDDSNQG 445 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + I+FLTDGE I N++ L A+R + ++ +G+ + Sbjct: 446 NGLRQ------------IVFLTDGE-----ISNEQQLLDAVAARRGRSRIFMVGIGSAPN 488 Query: 330 DQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 + A R F + ++ ++ + + ++ Sbjct: 489 SYLMNRAAELGRGTFTHIGSAAEVDERMRALFDKLEN 525 >gi|260786070|ref|XP_002588082.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] gi|229273239|gb|EEN44093.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] Length = 528 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 48/146 (32%), Gaps = 22/146 (15%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF 249 + +++ P R G+V +S + F L IN + + Sbjct: 11 FAKVKQFAVNVVNTFDISP---TATRVGVVQYSDRNSLVFNLGNKVNKPSTVSAINGISY 67 Query: 250 -GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T + L+Y ++ A + K II LTDG++S L Sbjct: 68 QSGGTNTGAALKY----------VRQYAAWREGNVPKVIIVLTDGKSSDSVSGPSRDLV- 116 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLK 334 G VYAIGV Q L+ Sbjct: 117 -----AAGVEVYAIGVGKFDHGQLLQ 137 >gi|194221225|ref|XP_001492399.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Equus caballus] Length = 883 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 47/303 (15%), Positives = 110/303 (36%), Gaps = 30/303 (9%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 169 MYLKVQPKQLVKHFEIEA--DIFEPQGISTLDAEASFVTNDLLGSALTKSFSGKKGHVSF 226 Query: 102 --AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 + D + + S++ D Y+++ S + + F P+ Sbjct: 227 KPSLDQQRSCPTCTDSLLNGDFTITYDVNRESPANVQIVNGYFVHFFAPQGLPV------ 280 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +++ V+DVS SM K+ ++ ++L+ +K +N ++ SG Sbjct: 281 -------VPKNVVFVIDVSGSM------YGRKIQQTKDALLKILEDVKEDDYLNFILFSG 327 Query: 220 -LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + T+ +VQ P +Q +E + + T L + + A+E+ + Sbjct: 328 DVTTWKDNLVQATPE--NIQQAREFVMNIHSQGMTNINDALLRGISMLNKAREENAVPER 385 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 II LTDG+ + ++ + A R +Y +G FL++ A Sbjct: 386 STS----IIIMLTDGDANVGESKPEKIQENVHNAIRGKFPLYNLGFGNNLNYNFLESMAL 441 Query: 339 PDR 341 + Sbjct: 442 ENH 444 >gi|190410234|ref|YP_001965735.1| terY1 [Klebsiella pneumoniae] gi|146151027|gb|ABQ02793.1| terY1 [Klebsiella pneumoniae] Length = 239 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 46/225 (20%), Positives = 78/225 (34%), Gaps = 19/225 (8%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 P+ + PL T+ + S K ++ L + ++LD S SM+ + Sbjct: 1 MPFYQINKPWPLPETTLLFRSLKKELHLRRLPVYLLLDTSGSMHGE------PIEAVKNG 54 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ +L +K P ++TF S Q PL + ++ L TT Sbjct: 55 VQTLLTTLKQDPYALETAHVSVITFDSSARQAVPLT---DLLSFQMPALTASGTTSLGEA 111 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L + I +K KG + +TDG SPN D ++ L A+ G Sbjct: 112 LSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDG---SPNDDWRKGLNDFKAART-GV- 164 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 V A +A LK +S + F + + Sbjct: 165 VVACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFKWVSASIS 209 >gi|329744607|ref|NP_001193278.1| inter-alpha-trypsin inhibitor heavy chain H3 [Sus scrofa] Length = 889 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 51/303 (16%), Positives = 110/303 (36%), Gaps = 30/303 (9%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 169 MYLKVQPKQLVKHFEITA--DIFEPQGISTLDAEASFITNDLLGSALTKSFSGKKGHVSF 226 Query: 102 --AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 + D + + S++ D Y+++ S + + F P+ Sbjct: 227 KPSLDQQRSCPTCTDSLLKGDFIITYDVNRESPANVQIVNGYFVHFFAPQGLPV------ 280 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +++ V+DVS SM K+ ++ ++LD IK +N V+ SG Sbjct: 281 -------VPKNVVFVIDVSGSM------YGRKMEQTRDALLKILDDIKEDDYLNFVLFSG 327 Query: 220 -LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + T+ +VQ P +Q +E + + T GL + + A+E+ K Sbjct: 328 DVTTWKDSLVQATPE--NIQKAREFVRNIRDQGMTNINDGLLTGISMLNKAREEH----K 381 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + II LTDG+ + ++ A +Y +G FL++ A Sbjct: 382 VPERSTSIIIMLTDGDANMGVSKPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMAL 441 Query: 339 PDR 341 + Sbjct: 442 ENH 444 >gi|326681146|ref|XP_002665520.2| PREDICTED: collagen alpha-1(XXII) chain [Danio rerio] Length = 1623 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 81/229 (35%), Gaps = 29/229 (12%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 M F F + A L V+ ++ D++ +LD S S+ + + Sbjct: 1 MEFRFGLRISLVLTVLALTLYGGGVEGQRAGCKNVHYDLVFILDTSSSVGKENFEKIRQW 60 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---- 248 ++++S + R +V +S + F LA E++ R Sbjct: 61 VA---------NLVESFDVGVDKTRVAVVRYSDRPTVEFNLA--RYKTLEEVKRAAGNIR 109 Query: 249 -FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G TK+ + + IF + AKG +K I LTDG++ ++ + Sbjct: 110 YLGGNTKTGDAISFTTTNIFTERAGARPAAKG---IQKVAILLTDGQSQDFVLEPSVA-- 164 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDA 354 A G ++A+G+ ++ + A P + V + + Sbjct: 165 ----AAAAGIRLFAVGIGEALKEELEEIAAEPKSAHVFHVTDFDAIDKI 209 >gi|284036687|ref|YP_003386617.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283815980|gb|ADB37818.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 316 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 86/235 (36%), Gaps = 20/235 (8%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ LI S G M++LD + S++ P ++ + Sbjct: 58 LKPSNFKIVNTVTNFYELIDVSTSNLISKPGGYSAMLLLDQTGSIS-TTDPYNLRIEASK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + + +GL +F+S L G + E++ + + + Sbjct: 117 IFLNNLGTD----------DYTGLTSFTSSYTSVVKLHSGFTNKTEQMKKSLDTLALNVS 166 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G + I + + + A+ K +I TDGEN+ +++ +A ++ Sbjct: 167 GGTPLYTSTI----QSVTYTAQKGPTANKAVIVFTDGENNVTTNTLEDATA---KAIQQK 219 Query: 317 AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 ++ +G+ + L A + F+ +++ +L F +G + Q ++Y Sbjct: 220 IPLFTVGLSTDVNVNVLAQMANETGGAFFYAKDAGQLISTFGTLGNLLHGQGLVY 274 >gi|56675026|gb|AAW19655.1| matrilin-1 [Cervus elaphus] Length = 230 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 41/197 (20%), Positives = 72/197 (36%), Gaps = 28/197 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R GLV ++S + Q Sbjct: 3 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGLVNYASSVKQ 53 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL + + + R+ + T + + +A K E D K Sbjct: 54 EFPLRAHSSKAELLQAVRRIQPLSTGTMTGLAIHFAITKALSDAEGGRPR---SPDISKV 110 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 +I +TDG D A+ G ++AIGV L+ AS + Sbjct: 111 VIVVTDGRPQDSVRDVSA------RARASGIELFAIGVG-RVDKATLRQIASEPQDEHVD 163 Query: 344 SVQN---SRKLHDAFLR 357 V++ KL F Sbjct: 164 YVESYRVIEKLSKKFQE 180 >gi|90418447|ref|ZP_01226359.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90338119|gb|EAS51770.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 636 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 56/211 (26%), Gaps = 74/211 (35%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL ++ + + I+ + T GL + + + E + D K ++ + Sbjct: 424 TPLTKTLKTVTDAIDVMGAQGATNIPHGLAWGWRLLTARPPFTEGRSHDEPDNLKVLVLM 483 Query: 291 T-----------------------------------------------DGENSSPN---- 299 T DG + Sbjct: 484 TDGNNTYNLNSGGRPLEIRDYNRSTYGSYGYGAAYSHGSSSRKPGRIYDGTTGNAKDYSV 543 Query: 300 -----IDNKESLFYCNEAKRRG--------AIVYAIGVQAEAA---DQFLKNCASPDR-- 341 ++ C K G +++ I + +++CAS Sbjct: 544 DSYVAAMDQNVAKVCENVKADGRKPGGTDGILIFTIAFDLRDGEPVKKLMEDCASNGLID 603 Query: 342 -----FYSVQNSRKLHDAFLRIGKEMVKQRI 367 +Y Q+ +L AF I +++ RI Sbjct: 604 ASEKLYYDAQSQEELAAAFQSITEQISSLRI 634 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 76/237 (32%), Gaps = 34/237 (14%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F I F G+++++ + LPV+ + ++ S + L D + L + Sbjct: 20 FTRIMRFRREKAGNVAVVFGLTLPVLALCFATAVDLSGIYGANRSLQQAADVAALAAGRE 79 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 ++ + F F + +T S + Sbjct: 80 YGRTQDADYLSSVSEAF--------------------FFHNAGDETRGTTQFSYDGVFRE 119 Query: 123 KDYNL-SAVSRYEMPFIFC-TFPWCANSSHA----PLLITSSVKISSKSDIGLDMMMVLD 176 + +R ++P F W PL S + + + +++ +VLD Sbjct: 120 DGLTILKVTARRQLPTFFGDALMWVTGGKLDWRQFPLYAKSEIVVQ---NRSIELALVLD 176 Query: 177 VSLSMND--HFGPGMDKLGVATRSIREMLDIIKSIPDVNN---VVRSGLVTFSSKIV 228 S SM D G K+ + + ++ S + V+ +V FSS + Sbjct: 177 NSGSMQDRPRSGGSKSKIDIIKDAAEDLAKQFLSSDKGSTEEFPVQFAVVPFSSSVN 233 >gi|117920853|ref|YP_870045.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. ANA-3] gi|117613185|gb|ABK48639.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. ANA-3] Length = 751 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 75/188 (39%), Gaps = 28/188 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ S + ++ ++++V+D S SM D + A ++R L ++ N Sbjct: 363 VEASEQLNLPRELILVIDTSGSMAG------DSIIQAKNALRYALRGLRPQDSFN----- 411 Query: 219 GLVTFSSKI--VQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ F+S + + PL A + ++ +NRL T+ L A + Sbjct: 412 -IIEFNSDVSLLSPTPLPATASNLAMARQFVNRLQADGGTEMAQALNAAL------PRQA 464 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + A D + +IF+TDG S ++ N+ ++ +G+ + F+ Sbjct: 465 FNAASAEDKSLRQVIFMTDG---SVGNESALFELIRNQIGDN--RLFTVGIGSAPNSHFM 519 Query: 334 KNCASPDR 341 + A R Sbjct: 520 QRAAELGR 527 >gi|326332116|ref|ZP_08198400.1| putative von Willebrand factor type A domain protein [Nocardioidaceae bacterium Broad-1] gi|325950087|gb|EGD42143.1| putative von Willebrand factor type A domain protein [Nocardioidaceae bacterium Broad-1] Length = 338 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 75/185 (40%), Gaps = 20/185 (10%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD-KLGVATRSIREMLDIIKSI 209 P + ++ + LD+++V+D + SM GPG + ++ + +K++ Sbjct: 65 RPSWGAAPSEMRT---ADLDVLVVVDRTRSMVAEDGPGGEARMTQLKK-------DLKAL 114 Query: 210 PDVNNVVRSGLVTFSSKIV-QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA-YNKIF 267 VR G +TF ++V P + ++ L + + A +++ Sbjct: 115 SAALPSVRFGAITFGGEVVRTEMPFTYDTTAFNAWVDGLYAERAFDGSGSMVDAPRDEVI 174 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 A E+ + + ++ ++F +DGEN+ +D ++S ++ G +G E Sbjct: 175 SALERDQERF---PERRRIVVFASDGENTREGVD-QQSFSEIDDLSAGGV---VLGYGTE 227 Query: 328 AADQF 332 + Sbjct: 228 EGGRM 232 >gi|189485267|ref|YP_001956208.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287226|dbj|BAG13747.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 330 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 77/229 (33%), Gaps = 30/229 (13%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSR-YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 +R +L +I + +S R Y++ +I + + Sbjct: 27 KRKAALDRLISRVNISTLVSVNLRAYKIKYILLLAGLFFVIIAMACPQYGDGMRTVIKES 86 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 ++++ LD+S SM +L A I+ I + N + G+V FS Sbjct: 87 S-EIIIALDISKSMLAE-DSKPSRLEKAKM-------IVSKIVEENPGEKMGIVVFSGTA 137 Query: 228 VQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + PL + + ++ + + T+ + + A A Sbjct: 138 MWQCPLTFDLHALKMFLQSVETTNLPLGGTRISSAIMLA-----------SKAASCESAG 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +I ++DGEN I N AK+ G + +IG+ + Sbjct: 187 SRVMILISDGENHDSKI-----KEAVNAAKKAGLRIISIGIGKKEGAPI 230 >gi|149921110|ref|ZP_01909568.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] gi|149817997|gb|EDM77456.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] Length = 532 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 68/205 (33%), Gaps = 34/205 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS-IPDVNNVVRSG 219 ++ S L++ +V+D S SM R LD I + + Sbjct: 127 VARTSAEPLNLAIVIDHSGSMKGQRE-------------RNALDAAAGMISRLRDGDTVS 173 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +V++++K P+ + + T + G+E + + Sbjct: 174 VVSYNTKAHTIVPVTTLDARNRDRVISDLRVGVASRPSGNTCVSCGVEAGLQTLQGRRPG 233 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ ++ L+DGE + D EA+ RG + +IGV + + Sbjct: 234 IDR-----------MLLLSDGEANRGVRDEPGIRRLAREARNRGVSISSIGVDVDYNEVL 282 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAF 355 + A + R Y + L F Sbjct: 283 MSAIAREANGRHYFSETGSNLDAIF 307 >gi|125575071|gb|EAZ16355.1| hypothetical protein OsJ_31817 [Oryza sativa Japonica Group] Length = 579 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 45/212 (21%), Positives = 81/212 (38%), Gaps = 37/212 (17%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ VLDVS SM KL + +++ ++D + R +V+FS++ Sbjct: 142 PLDLVTVLDVSGSMAG------RKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEA 189 Query: 228 VQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L G + + L+ S T GL A + D + K + Sbjct: 190 SRRTRLLRMSEVGKATAKRAVESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSS------ 243 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNE-----AKRRG----AIVYAIGVQAEAADQFLK 334 +I L+DG++S + Y + A G A ++ G A+ + Sbjct: 244 ---VILLSDGKDSYVVPRRGNGMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAMN 300 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 A + F V+N + D+F + IG + Sbjct: 301 TIAESTGGTFSFVENEAAIQDSFAQCIGGLLS 332 >gi|22758317|gb|AAN05521.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432564|gb|AAP54179.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 606 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 45/212 (21%), Positives = 81/212 (38%), Gaps = 37/212 (17%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ VLDVS SM KL + +++ ++D + R +V+FS++ Sbjct: 142 PLDLVTVLDVSGSMAG------RKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEA 189 Query: 228 VQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L G + + L+ S T GL A + D + K + Sbjct: 190 SRRTRLLRMSEVGKATAKRAVESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSS------ 243 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNE-----AKRRG----AIVYAIGVQAEAADQFLK 334 +I L+DG++S + Y + A G A ++ G A+ + Sbjct: 244 ---VILLSDGKDSYVVPRRGNGMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAMN 300 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 A + F V+N + D+F + IG + Sbjct: 301 TIAESTGGTFSFVENEAAIQDSFAQCIGGLLS 332 >gi|115678877|ref|XP_794839.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1031 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 61/177 (34%), Gaps = 30/177 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + + +++VLD S SM +DK+ A + ++D + + G+ Sbjct: 301 VQASTGDECRVVLVLDTSGSMGTS--NRIDKVNSAATAFVNLVD---------DGISIGI 349 Query: 221 VTFSSKIVQTFPLAW-GVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 VTF+ L Q + I +L T GLE + Sbjct: 350 VTFTGSPTTRHALTQINTQADRDSLRDIFQLTASGGTCIGCGLEQGLEVLMAHPSGSADG 409 Query: 277 AKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 I+ +TDG++S N +++L G V + + +A + Sbjct: 410 G--------IIVLMTDGQDSGIQNHIIRQTLQ------DMGVRVNTVAIGEDAYGEL 452 >gi|302796876|ref|XP_002980199.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] gi|300151815|gb|EFJ18459.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] Length = 550 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 79/210 (37%), Gaps = 30/210 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + + +D++ VLDVS SM KL + ++ + I ++ R +V+ Sbjct: 77 ADARAPVDLVTVLDVSGSMRGQ------KLELVKTAMEFV------IRNLRQQDRLAIVS 124 Query: 223 FSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FS + L G + +L T+ PGL+ ++ + + Sbjct: 125 FSDEPKVHLGLKRMTHDGRAAALSAVEKLRSLGGTEIRPGLKAGFDLLS---------RR 175 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNC 336 + + I+ L+DG +++ + L + + V+ G ++ + + + Sbjct: 176 KNRNPVSSIMLLSDGMDNAITFKRCKVLPVDSYLEDCSERVPVHTFGFGSDHDPEAMLSI 235 Query: 337 --ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 A+ F VQ + AF + IG + Sbjct: 236 AEATGGSFCYVQEESTVQHAFAQCIGGLLS 265 >gi|145491137|ref|XP_001431568.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398673|emb|CAK64170.1| unnamed protein product [Paramecium tetraurelia] Length = 591 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 48/277 (17%), Positives = 101/277 (36%), Gaps = 34/277 (12%) Query: 90 TDFRNELRENGFAQD--INNIERSTSLSIIIDDQHKDYNLSAVSRYEM-PFIFCTFPWCA 146 +N+ F D I + + + ++ + + Y+M + + Sbjct: 86 NTVKNQDIHTNFNDDEKIEPKKEEAKQNANKYNLNEKLSFEVKALYKMGKLLNSRTQYLP 145 Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + ++ +G+D++ ++D S SMN K+ + +++ +LD + Sbjct: 146 GIVSIKAQDQAVIQNQENQRVGVDLICLIDKSGSMNGQ------KIEMVKQTLALLLDFL 199 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYA 262 N R L+TF S+ + PL Q+ ++ I ++ G T E A Sbjct: 200 ------NENDRYQLITFESQAQRLTPLKRVTDGNKQYFKQVIQQINSGGGTTIGTATEIA 253 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + ++ E + + I L+DG++ N +E + NE ++ Sbjct: 254 FKQL------QERKYRNNVTS---IFLLSDGQDGQANQRIQEQIKTVNEV----FTLHTF 300 Query: 323 GVQAEAADQFL-KNC-ASPDRFYSVQNSRKLHDAFLR 357 G + Q + + C FY VQ+ L + F Sbjct: 301 GFGEDHDAQMMTQLCNLKSGSFYFVQDVTLLDEFFAD 337 >gi|89068992|ref|ZP_01156373.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] gi|89045361|gb|EAR51426.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] Length = 669 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 72/193 (37%), Gaps = 19/193 (9%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + PW + + ++ ++ L+++ ++D S SM+ DKL Sbjct: 279 FRATLGVMDSPWAEGRQLVHIGLQGAL-PPVEARPPLNLVFLVDTSGSMD-----APDKL 332 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 G+ +S R ML ++ ++ V +G + +++ P A I + RL G + Sbjct: 333 GLLKQSFRLMLSELRPQDEIAIVAYAGS---AGEVLAPTP-AGERATILAALERLAAGGS 388 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GLE AY E E I+ TDG+ + D + + Sbjct: 389 TNGAGGLEQAYATAEAMTEDGEVSR---------ILLATDGDFNVGLSDPSALEDFIADK 439 Query: 313 KRRGAIVYAIGVQ 325 + G + +G Sbjct: 440 RDSGTYLSVLGFG 452 >gi|198436156|ref|XP_002124087.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1702 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 74/203 (36%), Gaps = 29/203 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ ++D S S+ ++ G+ + + I S P + GL T+S Sbjct: 372 LDLIFLIDESTSVLENDFDGIK---------VWLRNTISSFPIGEEYTQIGLATYSDNPR 422 Query: 229 QTFPLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L H + I + + T + + Y N +F H + Sbjct: 423 IIFHL--NKYHKLDDIRKAVLEVEHTSGGTATGKAILYLTNNMFT------HENGVRPNA 474 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF- 342 K+ ++ LTDG++ I AK G +++AIGV + + PDR+ Sbjct: 475 KRLVVVLTDGKSQDDVIVPSR------IAKESGIVMFAIGVGKVVMGELRAIASDPDRYV 528 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQ 365 Y + + L + + Sbjct: 529 YKINDFSALESIRRELSHSIASL 551 >gi|148676058|gb|EDL08005.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Mus musculus] Length = 918 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 73/197 (37%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD+S SM KL ++ +L+ D+ R ++ FS++I Sbjct: 262 VVFVLDISASMVGA------KLQQTREALVTILN------DLRPQDRFNIIGFSNRIKMW 309 Query: 228 -VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ +++ + + L T L+ A + + + + + Sbjct: 310 KDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLNNYVAQNDIEDRSVS---- 365 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 IIFLTDG+ + + + L EA R ++ +G+ + + L+NC Sbjct: 366 LIIFLTDGKPTFGETNTLKILSNTKEATRGQICIFTVGIGDDVDFKLLEKLSLENCGLTR 425 Query: 341 RFYSVQNSRKLHDAFLR 357 R + + F Sbjct: 426 RVHEEDKAGAQLIGFYD 442 >gi|27369644|ref|NP_766059.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Mus musculus] gi|81873944|sp|Q8BJD1|ITIH5_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|26352482|dbj|BAC39871.1| unnamed protein product [Mus musculus] gi|37589944|gb|AAH43314.2| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|38328214|gb|AAH62196.1| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|74145221|dbj|BAE22250.1| unnamed protein product [Mus musculus] gi|122889674|emb|CAM13913.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|123858038|emb|CAM26660.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|148676059|gb|EDL08006.1| inter-alpha (globulin) inhibitor H5, isoform CRA_b [Mus musculus] Length = 952 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 73/197 (37%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD+S SM KL ++ +L+ D+ R ++ FS++I Sbjct: 296 VVFVLDISASMVGA------KLQQTREALVTILN------DLRPQDRFNIIGFSNRIKMW 343 Query: 228 -VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ +++ + + L T L+ A + + + + + Sbjct: 344 KDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLNNYVAQNDIEDRSVS---- 399 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 IIFLTDG+ + + + L EA R ++ +G+ + + L+NC Sbjct: 400 LIIFLTDGKPTFGETNTLKILSNTKEATRGQICIFTVGIGDDVDFKLLEKLSLENCGLTR 459 Query: 341 RFYSVQNSRKLHDAFLR 357 R + + F Sbjct: 460 RVHEEDKAGAQLIGFYD 476 >gi|74183702|dbj|BAE24467.1| unnamed protein product [Mus musculus] Length = 952 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 73/197 (37%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD+S SM KL ++ +L+ D+ R ++ FS++I Sbjct: 296 VVFVLDISASMVGA------KLQQTREALVTILN------DLRPQDRFNIIGFSNRIKMW 343 Query: 228 -VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ +++ + + L T L+ A + + + + + Sbjct: 344 KDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLNNYVAQNDIEDRSVS---- 399 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 IIFLTDG+ + + + L EA R ++ +G+ + + L+NC Sbjct: 400 LIIFLTDGKPTFGETNTLKILSNTKEATRGQICIFTVGIGDDVDFKLLEKLSLENCGLTR 459 Query: 341 RFYSVQNSRKLHDAFLR 357 R + + F Sbjct: 460 RVHEEDKAGAQLIGFYD 476 >gi|307102430|gb|EFN50705.1| hypothetical protein CHLNCDRAFT_28788 [Chlorella variabilis] Length = 344 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 52/295 (17%), Positives = 104/295 (35%), Gaps = 49/295 (16%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP------------WCANS 148 F +++ + + KDY S P P + + Sbjct: 6 FRENVKQGYLPLPTDVTFEGIAKDYYFDTTSNTSQPCTELFCPIYSLAAAPDPLRNASAA 65 Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-----------------MDK 191 LL +S++ S L+++++LDVS SM + F K Sbjct: 66 RRRSLLQAASLQASDFRRPRLNLVLLLDVSGSMGESFMSYYYDAAGTQQNLTAEELNTTK 125 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINR- 246 + VA + +LD++ V +V FS++ PL+ + +Q +I++ Sbjct: 126 IDVAKEVLSGVLDLLAPNDSV------AIVLFSTRACTPQPLSRVSCLDIPALQAQIDKD 179 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + S+T + GL+ A ++ E + ++ I+ +TD + +S + Sbjct: 180 MHATSSTSLSAGLDLAIAELKKCSEGMSASLTDTENR---IMVITDQQPNSGDYTTGGLA 236 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRK----LHDAF 355 + G IGV + + ++ + +YSV + L D F Sbjct: 237 ARLRKDADDGIFTTIIGVGLDLNSELAESISKVRGANYYSVHRPGEFRRRLTDEF 291 >gi|315122852|ref|YP_004063341.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496254|gb|ADR52853.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 494 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 70/164 (42%), Gaps = 16/164 (9%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 + F N + F++ KG+ ++++AI++P + +++G+V+ TS+ K + + +L + Sbjct: 23 IHFFN-KLLFFSKKGNFAMISAIMIPSLALLLGIVLVTSNYLLHKYSVESASEEALSHGM 81 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFA-QDINNIERSTSLSIII- 118 + I +ND + I D L++N F Q+ + + +++ + I Sbjct: 82 SLI----------CYQNDIERDNLAKIILNDLIVSLKKNNFTKQEADLVAKNSKIDITTL 131 Query: 119 ---DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 K Y+ S Y+MP T + + V Sbjct: 132 INDSTNVKSYHFYIKSVYKMPLNKITKIFYPKDLTIVTNVNKIV 175 >gi|198435896|ref|XP_002123489.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1595 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 53/343 (15%), Positives = 116/343 (33%), Gaps = 45/343 (13%) Query: 34 LVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFR 93 L I+ S F K + S L++ + N ++ +I ++ Sbjct: 275 LEIKGSRTGFYKLFFKATVYKSASKETCVPLSKRPRKDCPDHCNALGDVVLADINVSNLP 334 Query: 94 NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR----------YEMPFIFCTFP 143 + D++ +E S+ + + + N+ +++ Y + Sbjct: 335 CDEEN----MDVDVVENEQSIGLNLAARQASDNVMSLTTCAPLWQHNCGYAREANGACYE 390 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 AN +H + + +D+++V+D S S+N R +R+++ Sbjct: 391 VEANVTHVRAIHKHNPISQECIRGKIDIVLVVDQSGSVN------QCNFQKVKRWLRDIV 444 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG-----------VQHIQEKINRLI-FGS 251 + G+V +S K + + G Q + + + +L G Sbjct: 445 RSFNLGVTEQD---VGVVVYSKKATTSTVVDLGFSDYDSDGHTKKQEMTKILKKLAYEGG 501 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT + + A + K D KK II LTDG ++ N + + Sbjct: 502 TTYTGYAFKLANEMLTGN--------KSRPDAKKMIILLTDGATTAAN--TLQLKEELDV 551 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 ++ ++ A+GV + ++ F++V +L Sbjct: 552 SRAANVMILAVGVGKFNQTELIQIAGDRKNFFAVTKFSELEKV 594 >gi|7463254|pir||E70121 hypothetical protein BB0173 - Lyme disease spirochete Length = 340 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 44/239 (18%), Positives = 86/239 (35%), Gaps = 32/239 (13%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + + S G D+++VLD+S SM Sbjct: 57 KDYRLNLMYFFTYSFLYLAAMVMVFALAGPSVSKKKMIHLS---AGADIVIVLDISPSMG 113 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + ++I+ GLV F+ P+ + + Sbjct: 114 AVEFSSKNRLEFSK-------ELIRGFISQRENDNIGLVAFAKDASIVVPITTDREFFNK 166 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 167 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 215 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP----DRFYSVQNSRKLHDA 354 I + N A+ +Y+IG+ + S F V + L + Sbjct: 216 IKD----QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVEI 270 >gi|126463435|ref|YP_001044549.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126105099|gb|ABN77777.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 651 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 81/237 (34%), Gaps = 21/237 (8%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + + PW + + + + + + L+++ ++D S SM D Sbjct: 255 NGTPPFRPTLSITRTPWNPETRLVHVALQGRM-PAIEDRPPLNLVFLIDTSGSMQDPA-- 311 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 KL + +S ML ++ V V +G S+ V A I ++RL Sbjct: 312 ---KLPLLKQSFGLMLGRLRPEDQVAIVTYAG----SAGEVLAPTAANQRSTILSALDRL 364 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G +T GL AY + G + + ++ TDG+ + D +E Sbjct: 365 DAGGSTAGDEGLALAY--------RTASEMAGAGEVTR-VVLATDGDFNLGISDPEELAR 415 Query: 308 YCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + G + +G D ++ A + L++A + ++ Sbjct: 416 LVAHERDTGVYLSVLGFGRGNLDDATMQALAQNGNGQAAY-IDSLNEAQKVLVDQLS 471 >gi|326918656|ref|XP_003205604.1| PREDICTED: anthrax toxin receptor 2-like, partial [Meleagris gallopavo] Length = 480 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 48/215 (22%), Positives = 74/215 (34%), Gaps = 35/215 (16%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 DM VLD S S+ ++ D + T V+ +R Sbjct: 37 PADEQPSCHGAFDMYFVLDKSGSVAQNWHEIFDFVNQLTERF------------VSPKMR 84 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLE 274 + FSS+ PL + I+E + L T GL+ A L+ Sbjct: 85 LSFIVFSSQAQVIMPLTGDREKIKEGLKNLSEVKPAGDTYIHEGLKQA---------NLQ 135 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR---GAIVYAIGVQAEAADQ 331 +G + II LTDG +D + L+ EAK GA VY +GV Q Sbjct: 136 IEKQGASRFSSIIIALTDG-----KLDGQIPLYAEKEAKTSRQLGARVYCVGVLDFVQAQ 190 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + ++ + V A I ++KQ Sbjct: 191 LERIADTKEQVFPVTGG---FQALKGIINSVLKQS 222 >gi|262165797|ref|ZP_06033534.1| RTX toxin related protein [Vibrio mimicus VM223] gi|262025513|gb|EEY44181.1| RTX toxin related protein [Vibrio mimicus VM223] Length = 1505 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 64/194 (32%), Gaps = 22/194 (11%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S++ ++ ++LDVS SM++ G G +L V S ++L+ ++I L Sbjct: 888 TESETKQDANVQLILDVSGSMDNSAGNGKSRLQVMKESAIQLLEQYQAIGQTK----VQL 943 Query: 221 VTFSSKIVQT---FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F+ L V + I+ L G T L+ A + Sbjct: 944 IIFAGTASVQLHEKALWMTVDEAKSYIDALKAGGQTDYDHALQLADEYWSGNRNGTPLTE 1003 Query: 278 KGHDDYKKYIIFLTDGE----------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + Y FL+DG+ + I+ E + + + A G+ Sbjct: 1004 ATNVSY-----FLSDGKPEGFDKQNGVENDNTIEPNELASWISHLESNEITALAYGMGNS 1058 Query: 328 AADQFLKNCASPDR 341 L A Sbjct: 1059 VPQSELDKIAFDGH 1072 >gi|156382099|ref|XP_001632392.1| predicted protein [Nematostella vectensis] gi|156219447|gb|EDO40329.1| predicted protein [Nematostella vectensis] Length = 298 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 67/187 (35%), Gaps = 22/187 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K K +D+ +LD S S+ +++ +SI +M DI G Sbjct: 99 KTIKKKSCPIDIAFLLDASGSIGRR---SWEEIKNFVKSIVDMCDISD------QGTHVG 149 Query: 220 LVTFSSKI------VQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEK 272 ++TFS+ + + +I+ I+ L T L A +F Sbjct: 150 IITFSTDPVIDIAFDKYKGVEMNAVNIKRDIDELRRKKGYTFIDKALTLADKSLFT---- 205 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 A +D +K + ++DG + + N K +G VY +G+ A+ Sbjct: 206 --QEAGMREDSQKVAVLMSDGIQTKDRGPFTPTDIAANPLKMKGVQVYTVGIGADVDVFE 263 Query: 333 LKNCASP 339 L AS Sbjct: 264 LMAVASG 270 >gi|224049043|ref|XP_002191793.1| PREDICTED: similar to capillary morphogenesis protein 2 [Taeniopygia guttata] Length = 554 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 47/217 (21%), Positives = 78/217 (35%), Gaps = 35/217 (16%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 ++ DM VLD S S+ ++ D + T V+ Sbjct: 105 SAPAAGEPSCHGAFDMYFVLDKSGSVATNWREIFDFVNQLTERF------------VSPK 152 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEK 272 +R + FS++ PL + I++ + L T GL+ A +I Sbjct: 153 MRLSFIVFSTQAHVIMPLTGDREKIKKGLKDLEEVKPAGETYIHEGLKQANEQI------ 206 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR---GAIVYAIGVQAEAA 329 +G + II LTDG +D + L+ EAK+ GA VY +GVQ Sbjct: 207 ---AKQGASRFSSIIIALTDG-----KLDGQIPLYAEKEAKKSRELGARVYCVGVQDFEP 258 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 +Q + ++ + V A I ++KQ Sbjct: 259 EQLERIADVKEQVFPVTGG---FQALKGIINSVLKQS 292 >gi|329923737|ref|ZP_08279132.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328941103|gb|EGG37405.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 654 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 60/167 (35%), Gaps = 25/167 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + ++ +G ++ V+D S SM ++ +I +L R G Sbjct: 458 QKKRENRVGATLLFVVDASGSMA-----ARKRMTAVKGAILSLLQ-----DAYEKRDRIG 507 Query: 220 LVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 ++ F +++ P+ ++ +++ + G T GL AY + K + Sbjct: 508 MIAFRNNEAELILPVTRSIEAASKQLRSIPTGGKTPLADGLAQAYKVLHSEKRR------ 561 Query: 279 GHDDYKKYIIFLTDGENSSPNI-------DNKESLFYCNEAKRRGAI 318 ++D +I +TDG + +I E L + G Sbjct: 562 -NNDTLPVMIIVTDGRANESSIGLTVYADIWNECLEAAKLIRAAGIR 607 >gi|162449478|ref|YP_001611845.1| hypothetical protein sce1208 [Sorangium cellulosum 'So ce 56'] gi|161160060|emb|CAN91365.1| unnamed protein product [Sorangium cellulosum 'So ce 56'] Length = 607 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 70/216 (32%), Gaps = 27/216 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 P + + IT ++ + + +V+D S SM+ KL +A + R Sbjct: 11 LPAEPSERLLRVEITVPRPEGGQARKPVHLSLVIDRSGSMSGE------KLRLALEAAR- 63 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ----EKINRLIFGSTTKSTP 257 I+++ + R +VTF ++ P + ++ +I T Sbjct: 64 --QAIRTLQPGD---RFSVVTFDHQVEVPIPSTDATPGARLRAEAALDTVIARGNTDLGG 118 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 G + E H+ + ++ LTDG+ + E + R Sbjct: 119 GW------LRGCAEVGAHLPEDAIGR---VLLLTDGQANHGITSPDELTSRARSQRLRRV 169 Query: 318 IVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKL 351 IG+ + L + FY + +L Sbjct: 170 TTSTIGLGEGFNEFLLGRLSEEGGGNFYFAARADEL 205 >gi|255537858|ref|XP_002509994.1| protein binding protein, putative [Ricinus communis] gi|223550695|gb|EEF52181.1| protein binding protein, putative [Ricinus communis] Length = 767 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 37/175 (21%), Positives = 71/175 (40%), Gaps = 30/175 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D++ VLDVS SM KL + R++R ++ + S R +V F Sbjct: 353 AHRAPIDLVTVLDVSGSMTGA------KLQMLKRAMRLVISSLGSAD------RLSIVAF 400 Query: 224 SSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 SS + PL A G + + I+RL+ G T L A + D +E+ + Sbjct: 401 SSVPKRLLPLRRMTAHGQRAARRIIDRLVCGQGTSVGDALRKATKVLEDRRERNPVAS-- 458 Query: 280 HDDYKKYIIFLTDGEN---SSPNIDNKESLFYCNEAKRRGAIV--YAIGVQAEAA 329 I+ L+DG++ + +++ + + + N + + ++ G Sbjct: 459 -------IMLLSDGQDERVQTSSVNQRHTSGHINSTRFAHIEIPVHSFGFGQSGG 506 >gi|323342275|ref|ZP_08082507.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322463387|gb|EFY08581.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 1466 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 78/230 (33%), Gaps = 30/230 (13%) Query: 72 GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS--IIIDDQHKDYNLSA 129 KK K + + T + + + + N+ +++ I ID+ ++ Sbjct: 6 RKKMKTRIIKSGLTVLLITMVLLSINTSFVSAEGNSSSSEKTITNSIQIDNMNEGEVRVF 65 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-G 188 + +P + S D+++VLD S SM+ P G Sbjct: 66 KTAKPIP------------NSINRWEISIDVFGRLKREPSDIVLVLDTSGSMDPQKNPQG 113 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-IQEKINRL 247 +D++ A R ++ I + + R LV++ +K+ + + +I L Sbjct: 114 IDRISKAKREAIHFVNEI---FERDASARVALVSYGTKVSSNSFHTKQESNLLINEIKSL 170 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 T + L A + + K I+ L+DG+ + Sbjct: 171 KAEGGTFTQGALYEAKMLLNQS-----------SAPNKTIVLLSDGQPTY 209 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 15/84 (17%), Positives = 32/84 (38%), Gaps = 10/84 (11%) Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + Y ++++ ++ N+ + +YAIG +A + S Sbjct: 290 SQGNQNYFVYMS---------SADAAIIESNQIHQEQIHLYAIGFDTDARGTDILKRISN 340 Query: 340 DRFYSVQNSRK-LHDAFLRIGKEM 362 + +Y +SR L D F +I + Sbjct: 341 NNYYDASSSRDNLDDIFKKISNNI 364 >gi|268579981|ref|XP_002644973.1| Hypothetical protein CBG10938 [Caenorhabditis briggsae] Length = 548 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 80/206 (38%), Gaps = 26/206 (12%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T +D D+ ++ D S S+ +F +L +A + ++ +P N Sbjct: 358 VTEPTDKLPVNDCQYDVGIIFDSSGSLEKNF---QTQLQIA--------NKLQQMPIRPN 406 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + R +V F+ K + ++++ ++I S S N+ L Sbjct: 407 LTRVAIVQFAGKSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTF---TNQALKRMALLF 463 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-- 332 +K + K ++F TDG ++ + E+L KR+G VY +G+ + Sbjct: 464 EASKRDNCKMKLLVF-TDGYSAEDTAEGIEAL------KRQGITVYTVGISTDKNAGLNV 516 Query: 333 --LKNCA-SPDRFYSVQNSRKLHDAF 355 LK A SP ++ + L F Sbjct: 517 SELKGMATSPSHYFDSSDFDNLLKHF 542 >gi|224534421|ref|ZP_03674999.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] gi|224514523|gb|EEF84839.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] Length = 333 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 79/206 (38%), Gaps = 27/206 (13%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + + S G D+++VLD+S SM Sbjct: 49 KDYILNLLYFFTYSFLYLAAMVMVFALAGPSVSKKKMIHLS---AGADIVIVLDISPSMG 105 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + ++IK GLV F+ P+ + Sbjct: 106 AVEFSSKNRLEFSK-------ELIKRFISQRENDNIGLVAFAKDASIVVPITTDRDFFNK 158 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 159 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 207 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQ 325 I + + N A+ +Y+IG+ Sbjct: 208 IYKDQVI---NLAQGLNVKIYSIGIG 230 >gi|262183593|ref|ZP_06043014.1| hypothetical protein CaurA7_06346 [Corynebacterium aurimucosum ATCC 700975] Length = 604 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 62/213 (29%), Gaps = 42/213 (19%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK----- 226 M+V D S SM + G ++ A + R + GLVT+ Sbjct: 1 MVVFDSSGSMITNDAGGQTRIDAAKDAARTFITEAGDDAP------LGLVTYGGNTGEAP 54 Query: 227 ----------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V T P A + + ++ L T L A ++ Sbjct: 55 EDEAAGCQDITVVTPPEAGNSEKMIAHMDGLQPRGFTPIGESLRKAAAELPKEG------ 108 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLK 334 ++ II ++DG + E K +G ++ +G E Q Sbjct: 109 -------QRSIILVSDGVATCT---PPPVCDVAKELKEQGIDLVINTVGFNVEPEAQQEL 158 Query: 335 NC---ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 C A+ + + ++ L R Sbjct: 159 QCIADATGGTYANASDADSLAKELNRAAPRTFN 191 >gi|116625802|ref|YP_827958.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228964|gb|ABJ87673.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 326 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 75/228 (32%), Gaps = 41/228 (17%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + + + D + +V D S SM P M+K A ++ + + Sbjct: 84 VQQVITHFATDDAPASVGLVFDTSDSMQ----PRMNKAHEAVEALLKNANPADEFF---- 135 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 LV FS + + + I + + G +T + A ++ A Sbjct: 136 -----LVQFSDRARLVAGMTKDSEEISRRAASMRIGGSTALLDAVAMAMEEMKSAH---- 186 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------ 328 +K ++ ++DG+++S + + +YAIG+ + Sbjct: 187 -------YLRKVMVIISDGDDNSSRCPVNDLKRI---VREGDVTIYAIGITDDNVPLAYP 236 Query: 329 ------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 L A + R + V ++L + +I KQ +L Sbjct: 237 QRDRLTGAALLNEIATQTGGRLFEVHKLKQLPEIAAKISGWTRKQYVL 284 >gi|310818002|ref|YP_003950360.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309391074|gb|ADO68533.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 568 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 65/201 (32%), Gaps = 25/201 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+S+ + ++ +DVS SMN ++L + RS+ +++ + S Sbjct: 185 KVSAAERLPAHLVFTIDVSGSMNME-----NRLELVKRSLAMLVEKLDSRDT------LA 233 Query: 220 LVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +V + I E IN L +T GL+ AY + Sbjct: 234 IVVYGDTARTVLEPTRIMDRSRILEAINALHPEGSTNVQAGLQVAYAIAASQVREGATSR 293 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNC 336 +I +DG ++ ++G + +G D+ ++ Sbjct: 294 ---------VILCSDGVANNGITQADSIFQSVKAYAQQGVRLTTVGFGMGNYNDELMERL 344 Query: 337 A--SPDRFYSVQNSRKLHDAF 355 + ++ V + F Sbjct: 345 SHVGDGQYAYVDALPEARRIF 365 >gi|161612960|ref|YP_001586925.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161362324|gb|ABX66092.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 593 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 234 LVFLIDTSGSMQ-----PAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALA 284 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 285 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 335 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 336 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 395 Query: 348 SRKLHDAFLR 357 + Sbjct: 396 LSEAQKVLKD 405 >gi|115377250|ref|ZP_01464460.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365726|gb|EAU64751.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 520 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 65/201 (32%), Gaps = 25/201 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+S+ + ++ +DVS SMN ++L + RS+ +++ + S Sbjct: 137 KVSAAERLPAHLVFTIDVSGSMNME-----NRLELVKRSLAMLVEKLDSRDT------LA 185 Query: 220 LVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +V + I E IN L +T GL+ AY + Sbjct: 186 IVVYGDTARTVLEPTRIMDRSRILEAINALHPEGSTNVQAGLQVAYAIAASQVREGATSR 245 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNC 336 +I +DG ++ ++G + +G D+ ++ Sbjct: 246 ---------VILCSDGVANNGITQADSIFQSVKAYAQQGVRLTTVGFGMGNYNDELMERL 296 Query: 337 A--SPDRFYSVQNSRKLHDAF 355 + ++ V + F Sbjct: 297 SHVGDGQYAYVDALPEARRIF 317 >gi|288928459|ref|ZP_06422306.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331293|gb|EFC69877.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] Length = 554 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 64/204 (31%), Gaps = 31/204 (15%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ F P + T KI+ G++ ++ +D+S SM +L Sbjct: 57 VKLGLATTAFALLVVMLARPQMGT---KITHDKRNGIETIIAVDISNSMMAQDVVP-SRL 112 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLI 248 + I ++D R GLV F+ P+ + + LI Sbjct: 113 EKSKLLIENLVDNF-------THDRIGLVVFAGDAFVQLPITTDYVSAKMFLQNIDPALI 165 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T + + K D K II +TDGE+ +L Sbjct: 166 ATQGTDIAKAINLSMRSFSQQK-----------DIGKAIIVITDGEDHEG-----GALEA 209 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF 332 A RG V+ +G+ + Sbjct: 210 AKAANERGIHVFILGIGSTKGSPI 233 >gi|168465984|ref|ZP_02699854.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195631159|gb|EDX49719.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 593 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 234 LVFLIDTSGSMQ-----PAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALA 284 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 285 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 335 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 336 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 395 Query: 348 SRKLHDAFLR 357 + Sbjct: 396 LSEAQKVLKD 405 >gi|319954909|ref|YP_004166176.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319423569|gb|ADV50678.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 703 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 83/229 (36%), Gaps = 24/229 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDK 191 + + PW N++ + I K D+ + ++DVS SMN+ +K Sbjct: 310 FSINLEAAKTPW--NAATKIVRIGLQGKEYLNEDLPASNLTFLIDVSGSMNN-----QNK 362 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + + + ++ ++ V+ VV +G +V + I + ++ L G Sbjct: 363 LPLLKSAFKLLVHQLREKDRVSIVVYAGAAG----VVLEPTGGNDKEKIIKALDNLSAGG 418 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+E AY EK K + +I TDG+ + +K+ E Sbjct: 419 STAGGEGIELAYAL----AEKNFKPNKNNR-----VIMATDGDFNVGASSDKDMETLIEE 469 Query: 312 AKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 ++ G + +G D L+ A + ++ F Sbjct: 470 KRKSGIFLSVLGFGMGNYKDSKLEKLADKGNGNHAYIDTMQEAQKIFGE 518 >gi|283852082|ref|ZP_06369356.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283572472|gb|EFC20458.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 442 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 56/439 (12%), Positives = 124/439 (28%), Gaps = 111/439 (25%) Query: 29 FIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIW 88 G+ ++ S + +L +D + L + ++ + + NGK + + + + Sbjct: 1 MAAAGVAVDLSRVYVAHNQLQNAVDAAALAGSLQLPDDPDVTNGKVKAAVTANLALNDPD 60 Query: 89 QTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS 148 TD + +G A ++ ++ + + + + + Sbjct: 61 ATDIQVT---SGGATRSVCVDAKANVDMTLTKVIGIGDTTVTA----EACAGYNDIELVL 113 Query: 149 SHAPLLITSSVKISSKSDIGLDMM-MVLDVSLSMN------------------------- 182 I S D D++ +++ S S Sbjct: 114 VLDSTGSMKGSPIDSAKDAARDLVNLIMPASTSSTRSKIGLVPFQGKVRIDGSDPVTAER 173 Query: 183 --DHFGPGMDKLGVATRSIREMLDIIKSIPDVN---NVVRSGLVTFSSKIVQTFP----L 233 D GPG + + ++ ++ N SG+ TF+ K L Sbjct: 174 NPDGVGPGCRNADGTLNTGKLKVEYSRTATSTNIFYGYTLSGVSTFTDKTCSGMSPIRAL 233 Query: 234 AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + I I + G S T + G+++ + + E +K +I L Sbjct: 234 SSDKNTILNNIEAINAGAVTSGTLISEGIKWGRKVL--SPEAPYVEGSTDKKVRKIMIVL 291 Query: 291 T-----DGENSSPNIDNKES---------------------------------------- 305 T DG ++ Sbjct: 292 TDGDTEDGRCGGNFASASKTVNTYWTNAYFGQGLKPDTATSPYATLSTATATLAQIPDCK 351 Query: 306 ---------LFYCNEAKRR---GAIVYAIGVQAEAA--DQFLKNCASP-----DRFYSVQ 346 L + AK ++++ A A ++ AS D +Y Sbjct: 352 DGGKLNQFVLDEADAAKNDLNYPVEIFSVRFGASDATDKSLMQKIASSKPGTTDHYYDAP 411 Query: 347 NSRKLHDAFLRIGKEMVKQ 365 +S + D F +IG+++ ++ Sbjct: 412 SSTGIQDMFKKIGQQLGQR 430 >gi|56797861|emb|CAG27403.1| matrilin-3b [Danio rerio] gi|220675930|emb|CAX12089.1| matrilin 3b [Danio rerio] Length = 299 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 78/214 (36%), Gaps = 28/214 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + I + + LD++ ++D S S+ + + EM++ + Sbjct: 53 SINIGAPAEPC--KSRPLDLVFIIDSSRSVRPA------EFEKVKIFLSEMVNSLDI--- 101 Query: 212 VNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFD 268 ++ R LV ++S + F L + +++ +R+ + T + ++ A ++F Sbjct: 102 GSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFT 161 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KG K I +TDG + + A+ G +YA+GV Sbjct: 162 ENAGARPLKKG---IGKVAIIVTDGRPQDKVEEVSAA------ARASGIEIYAVGVDRAE 212 Query: 329 ADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 + + P D + V+ KL F Sbjct: 213 MRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 246 >gi|62180885|ref|YP_217302.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62128518|gb|AAX66221.1| putative von Willebrand factor, vWF type A domain [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322715363|gb|EFZ06934.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 593 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 234 LVFLIDTSGSMQ-----PAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALA 284 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 285 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGANR---ILLT 335 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 336 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 395 Query: 348 SRKLHDAFLR 357 + Sbjct: 396 LSEAQKVLKD 405 >gi|118590977|ref|ZP_01548377.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] gi|118436499|gb|EAV43140.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] Length = 608 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 44/201 (21%), Positives = 76/201 (37%), Gaps = 20/201 (9%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW ++ + I K+ +++ ++D S SM D +KL + +S R Sbjct: 221 TPWNEHTKLMQVGIQG-YKVPLDDLPSQNLVFLIDTSGSMADA-----NKLPLLQQSFRL 274 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +L ++ +V V +G S +++ +A I EKIN L G +T GL+ Sbjct: 275 LLSSLRDEDEVAIVTYAGS---SGVLLEPTKVA-DKTRILEKINALTSGGSTAGHEGLKG 330 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 AY E II TDG+ + D Y E + G + Sbjct: 331 AYALAETMTGDGEQTR---------IILATDGDFNVGLSDPDSLKRYVAEQRENGTALSV 381 Query: 322 IGVQ-AEAADQFLKNCASPDR 341 +G D+ ++ A + Sbjct: 382 LGFGRGNYNDELMQTLAQNGQ 402 >gi|16765642|ref|NP_461257.1| hypothetical protein STM2315 [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|167992650|ref|ZP_02573747.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|197262795|ref|ZP_03162869.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|16420855|gb|AAL21216.1| putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|197241050|gb|EDY23670.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205329241|gb|EDZ16005.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|261247522|emb|CBG25349.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267994407|gb|ACY89292.1| hypothetical protein STM14_2853 [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301158873|emb|CBW18386.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|312913305|dbj|BAJ37279.1| hypothetical protein STMDT12_C23360 [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] gi|321222984|gb|EFX48055.1| hypothetical protein SEE_04302 [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] gi|323130645|gb|ADX18075.1| Putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] gi|332989248|gb|AEF08231.1| hypothetical protein STMUK_2345 [Salmonella enterica subsp. enterica serovar Typhimurium str. UK-1] Length = 593 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 234 LVFLIDTSGSMQ-----PAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALA 284 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 285 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 335 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 336 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 395 Query: 348 SRKLHDAFLR 357 + Sbjct: 396 LSEAQKVLKD 405 >gi|224065911|ref|XP_002191398.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Taeniopygia guttata] Length = 809 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 74/207 (35%), Gaps = 30/207 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM K+ ++ ++L ++ + +TF++K+V+ Sbjct: 247 VIFVIDRSGSMTG------RKIEQTRDALLKILQDLRQEDHFS------FITFNNKVVEW 294 Query: 230 TFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L V + L T + L A + A+ E Sbjct: 295 KSSLLPATEENVASAAALVQTLTARGGTDISGALLAAVGVLEKAEGLPERSIS------- 347 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ +S + + EA ++ +G + + +FL+ A + Sbjct: 348 MIILLTDGQPTSGEKNVEVIQEKVQEAINGKYALFCLGFGFDVSYKFLEKMALSNGGIAR 407 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQR 366 + + + +L + + ++ Q Sbjct: 408 RIYENADAALQLQGFYQEVATPILMQI 434 >gi|260592519|ref|ZP_05857977.1| BatB protein [Prevotella veroralis F0319] gi|260535565|gb|EEX18182.1| BatB protein [Prevotella veroralis F0319] Length = 331 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 59/171 (34%), Gaps = 32/171 (18%) Query: 165 SDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S G++++M LD+S SM D +DK + + K GL+ Sbjct: 87 SREGIEVIMALDISNSMLATDVVPSRLDKSKLMVEGLMNKFTKNK----------LGLIV 136 Query: 223 FSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 F+ P+ + ++ LI T + A H Sbjct: 137 FAGDAFVQLPITSDYVSAKMFLDNINPSLIGTQGTDIGKAINLA-----------MHSFT 185 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + K I+ +TDGE++ + +A+ +G V+ +G+ + Sbjct: 186 PNTQTGKAIVVITDGEDNEGGAE-----AMAKQAQEKGIKVFILGIGSTQG 231 >gi|239993926|ref|ZP_04714450.1| inter-alpha-trypsin inhibitor domain-containing protein [Alteromonas macleodii ATCC 27126] Length = 586 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 71/211 (33%), Gaps = 36/211 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 VK D D+ V+D S SM P +D + I ++ R Sbjct: 304 VKSQDLQDFDRDITFVIDTSGSMGGR--PIVDAKESLQLA----------IDRLSEKDRF 351 Query: 219 GLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +V F++ + F Q+ ++ + L G T+ P L A Sbjct: 352 NVVAFNNDTTRLFETSVEGTTRNKQYARDFVKHLNAGGGTEMAPALNAAL---------- 401 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 D+ K ++F+TDG + + +A+ ++ +G+ + F+ Sbjct: 402 --KRTTTKDFIKQVVFITDGAVGNEAALFSQIKNELGDAR-----LFTVGIGSAPNSYFM 454 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 A + V+N+ + + ++ Sbjct: 455 TRAAQFGLGSYVFVRNTADIKQQMDSLLYKL 485 >gi|229491170|ref|ZP_04384998.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229321908|gb|EEN87701.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 614 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 73/211 (34%), Gaps = 41/211 (19%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S + + ++ V+D S SM G + + A R+ L ++ Sbjct: 39 STPTAQAEETTSSVLFVVDTSGSMA---GSPLAQAKDALRAGIGALSSGQAA-------- 87 Query: 218 SGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 GL +F+ L + N+L G TT + L A + Sbjct: 88 -GLRSFAGDCGNGGQLLVPVATDNRDQLNNATNQLTAGGTTPTPDALRAAAGDL------ 140 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-GA--IVYAIGVQAEAA 329 + II ++DG+++ + E K + G V+A+G A Sbjct: 141 -------PSTGDRTIILISDGQSTCG-----DPCAVATELKTQLGIDFRVHAVGFNAPDV 188 Query: 330 DQFLKNC---ASPDRFYSVQNSRKLHDAFLR 357 + +C A+ R+++ N+ +L DA Sbjct: 189 AESELSCIANATGGRYFTATNTTELSDAISA 219 >gi|148655541|ref|YP_001275746.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567651|gb|ABQ89796.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 504 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 41/271 (15%), Positives = 78/271 (28%), Gaps = 62/271 (22%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISS--------------KSDIGLDMMMVLDVSLSM 181 P + P ++ A T S + + ++ +V D S SM Sbjct: 81 PTVAPFTPATPTATGADAPTTVPESSSPPTDTTTIFRPAEGEAAQVTTNIQLVFDASGSM 140 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G G K+ A R++ ++D + PD+N R +G + Sbjct: 141 AQRIG-GETKIQAARRAMERIIDTLPDNPDLNVGFRV----------------FGHEGDS 183 Query: 242 EKINRLIFGSTTKSTPGLE----------------YAYNKIFDAKEKLEHIAKGHDDYKK 285 + + +T ++ + I A ++ + ++ + Sbjct: 184 SEAQKARSCQSTALLVPMQGVNKALLRQQAQAWQPTGWTPISLALQRAGEDFQAGENVRN 243 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKR-----RGAIVYAIGVQAEAADQFLKNCA--- 337 II +TDGE + C AK + +G C Sbjct: 244 VIIMVTDGEETCGGD-------PCAVAKALAESQAEVRIDVVGFGTTPDVAKTLRCIAEN 296 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 S + QN L + +K+ L Sbjct: 297 SGGVYTDAQNGDALVQTLEELIAATLKRSTL 327 >gi|28374313|gb|AAH45465.1| Matn1 protein [Danio rerio] Length = 507 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 82/238 (34%), Gaps = 29/238 (12%) Query: 130 VSRYEM--PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 RY M P + L +++ + D++ ++D S S+ Sbjct: 14 TPRYSMTLPGFVMLLCIMGAQATVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPS--- 70 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKIN 245 + + +++D + PD R G+V ++S++ L + + ++ Sbjct: 71 ---EFEQVKVFLAKVIDGLSVGPDA---TRVGVVNYASRVKNEVSLKSHKTKAALVKAVS 124 Query: 246 RLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 ++ + T + +++A N F E + D K I +TDG D Sbjct: 125 KIEPLSTGTMTGLAIQFAMNVAFSEAE----GGRKSPDISKVAIIVTDGRPQDNIRD--- 177 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 A+ G ++AIGV + + P D V++ KL F Sbjct: 178 ---IAARAREAGIEIFAIGVGRVDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 232 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 82/206 (39%), Gaps = 28/206 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ S+ D++ ++D S S+ + + I ++D + + + N V GLV Sbjct: 278 SACSNAATDVVFLIDGSKSVRPE------NFELVKKWINLIIDKLD-VSETNTHV--GLV 328 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +SS + Q FPL + ++E + R+ + T +A + + D A+ Sbjct: 329 QYSSTVKQEFPLGRHNSKRSLKEAVKRMNYMERGTMTG---HALSFLVDNSFGPNQGARP 385 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K I TDG + D +AK G +YA+GV D+ + + P Sbjct: 386 GVP--KVGIVFTDGRSQDYIGD------AAKKAKALGFKMYAVGVGNAVEDELREIASEP 437 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMV 363 D ++ + +I K++ Sbjct: 438 IADHYFYTAD----FKTMNQIAKKLQ 459 >gi|159899109|ref|YP_001545356.1| FHA domain-containing protein [Herpetosiphon aurantiacus ATCC 23779] gi|159892148|gb|ABX05228.1| FHA domain containing protein [Herpetosiphon aurantiacus ATCC 23779] Length = 785 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 40/281 (14%), Positives = 86/281 (30%), Gaps = 33/281 (11%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK- 160 + + ++ + + ++ A +S+ L Sbjct: 24 QSQSQDDKPKVVITHQVVTEKNALSVEAYFSVRYNDGRAVPVNEISSTVLDLKNGEPPVQ 83 Query: 161 -ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + +V+D S SM P ++++ A + I V R Sbjct: 84 AVPQDPTTPIKIALVMDQSGSM----NPFIEEVKRAANQAIDQAPANAKIA-VFTFTRMN 138 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V + F + +++ IN R G T A + + + + E Sbjct: 139 SV---DVYLPAFDFSDDRNAVKDYINQNYRSEPGGETCLYTAAHQATDFLLNTLKPEERR 195 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI------VYAIGVQAEAAD 330 A II TDG++ N + + + G +Y IG+ + Sbjct: 196 A---------IILFTDGKDEDINGNQCSDKTVIDVTTKAGPTQGTKTPIYTIGLCSADCA 246 Query: 331 QF----LKNCASPDRFYS-VQNSRKLHDAFLRIGKEMVKQR 366 + L+ + + S V ++ AF+ I + Q+ Sbjct: 247 RIQPESLRQISENTQAISLVGPRDQMSAAFITIMDSIKNQK 287 >gi|332664650|ref|YP_004447438.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333464|gb|AEE50565.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 345 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 68/201 (33%), Gaps = 33/201 (16%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + F+ F + A + P T K + +D++ LD+S SM +L Sbjct: 59 NILFLIGLF-FIAIALANPQWGT---KTQAVRRQSIDIIFALDISQSMLCQDIAP-SRLI 113 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIF 249 R +++++ + R G++ F+ + PL + + + +I Sbjct: 114 QGQRLCQQLIEKLSGN-------RLGVILFAGEAYMQVPLTTDYEAVSLLLQSANPDMIS 166 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T L A + + ++ +TDGE+ + + Sbjct: 167 SQGTSIGEALAIA------------QTNTSKSNGNRVVLVITDGEDHEARAEAQARQAA- 213 Query: 310 NEAKRRGAIVYAIGVQAEAAD 330 R G ++ IG+ +E Sbjct: 214 ----RAGMKIFTIGIGSEEGG 230 >gi|307591436|ref|YP_003900235.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986290|gb|ADN18169.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 441 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 63/179 (35%), Gaps = 22/179 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K S S V+D S SM++ G K + S+ +++ + D + Sbjct: 30 PTKEVSNSRPSTSFAFVIDTSGSMDEVVTGGKSKKSIVIESLYQLVRSGRLTQDDH---- 85 Query: 218 SGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ F + L Q ++ I RL F T G+ A + + Sbjct: 86 IAIIEFHDQASTLIGLTPATQVFQLENAIARLNDFSGGTCMGKGMNEALVLLTNQ----- 140 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ +IF TDGE D ++ + +G + A+GV E + L Sbjct: 141 -----SMTSRRVLIF-TDGE----TFDEEDCEIIAQQFSNQGISITAMGVGDEFNEDLL 189 >gi|327193756|gb|EGE60633.1| hypothetical protein RHECNPAF_136001 [Rhizobium etli CNPAF512] Length = 433 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 78/212 (36%), Gaps = 24/212 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ PW ++ + I + + +++ ++DVS SM++ DKL Sbjct: 30 FKATVTVMPTPWNHDTKLMHVAIKGYDIAPATAPHA-NLVFLIDVSGSMDEP-----DKL 83 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFG 250 + + R +++ +K+ V+ +VT++ I I++L G Sbjct: 84 PLLKSAFRLLVNRLKADDTVS------IVTYAGNAGTVLEPTRVAEKSKILSAIDKLEAG 137 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+E AY L A D + + TDG+ + +++ Sbjct: 138 GSTGGAEGIEAAY--------DLAKKAFVKDGVNRVM-LATDGDFNVGPSSDEDLKRIIE 188 Query: 311 EAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 E ++ G + +G D ++ A Sbjct: 189 EKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGN 220 >gi|313892786|ref|ZP_07826367.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] gi|313442717|gb|EFR61128.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] Length = 230 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 75/201 (37%), Gaps = 17/201 (8%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L ++++LD+S SM+ K+ + +M++ + V+ ++TF + + Sbjct: 16 LPVVLLLDISGSMSGA------KIDSLYDATIDMIETFSAAQAKEQVIDVAIITFGTHVE 69 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + I + T L A + I D Y+ ++ Sbjct: 70 LHTKYTPVKDLQAKGICKFSASGLTPMGTALRMAKDMIED------KDVTPSRIYRPAVV 123 Query: 289 FLTDGENSSPNIDNKESL-FYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQ 346 ++DG +PN D K + + N+ + +A+ + +A L+ P+ + Sbjct: 124 LVSDG---APNDDWKSPMDKFINDGRSAKCQRFAVAIGNDADRSILERFTQDPNAVLFAE 180 Query: 347 NSRKLHDAFLRIGKEMVKQRI 367 +++ + + F I + + Sbjct: 181 DAKDISEQFKTISMSISTMAV 201 >gi|126738776|ref|ZP_01754472.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] gi|126719957|gb|EBA16664.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] Length = 530 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDRFYSVQNSRKLHDAF 355 + N + C+ AK +G IVY IG +A + L++CAS D Y + ++ DAF Sbjct: 457 GNSTKNTRTSNVCSAAKAQGIIVYTIGFEAPSNGVAVLQDCASSDSHYFDVDGLEIRDAF 516 Query: 356 LRIGKEMVKQRI 367 I + K R+ Sbjct: 517 ESIATSIRKLRL 528 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 49/343 (14%), Positives = 106/343 (30%), Gaps = 53/343 (15%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 N+R+F + G+++ T + V G+ ++ + L Y LD ++L A Sbjct: 22 NLRSFRSDESGALAYPTIAFFLAMLAVGGVGVDLMRLERDRTVLQYTLDRAVLAAADLDQ 81 Query: 65 NQENGNN---------------GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIE 109 QE + + Y+ +K F L + D+ Sbjct: 82 TQEPAVVVQDYLNKAGLGEYYEAPEVETGLGYKKVKATIDATFDAHLLQFAGGSDLPVYA 141 Query: 110 RST--------SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ST +S+++D + S +S ++ ++ S + Sbjct: 142 SSTAEESIDGLEISLVLDVSGSMNSNSRLSNLKVAAR-DFIDTMVENTTDGR--MSISIV 198 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGM-----DKLGVATRSIREMLDIIKSIPDVNNVV 216 + + + + + + S ++F + D A + E + P + Sbjct: 199 PYATQVSVSDELFDEYTTSGTNNFANCINFETSDYSTTALSTTSERERTMHFSPWYTSNT 258 Query: 217 R-SGL-VTFSSKIVQTF----PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 R SG + + ++ PL ++ I L T G+++ + + Sbjct: 259 RASGSPIDYEICDDRSSREILPLQKDATTLKSFITNLTAWGNTSIDIGMKWGVALLDPSA 318 Query: 271 EKLEHIAKGHDDYK----------------KYIIFLTDGENSS 297 K I+ +TDG+N+S Sbjct: 319 RPAISSLASGASVPSEFSVRPVDYSDPDTLKIIVLMTDGQNTS 361 >gi|50949741|emb|CAH10363.1| hypothetical protein [Homo sapiens] Length = 460 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 72/197 (36%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 79 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVR 126 Query: 231 FP--LAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ I++ I+ + T L+ A + + + H G Sbjct: 127 KDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS- 182 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 183 LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTR 242 Query: 341 RFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 243 RVHEEEDAGSQLIGFYD 259 >gi|227822378|ref|YP_002826350.1| hypothetical protein NGR_c18330 [Sinorhizobium fredii NGR234] gi|227341379|gb|ACP25597.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 602 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 78/216 (36%), Gaps = 20/216 (9%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 +A ++ PW A + + I + ++ +++ ++DVS SM++ Sbjct: 194 TTAAEPFKATVTVTPTPWNAGTRLMHVAIKGYEVVQKEAPRA-NLVFLIDVSGSMDEP-- 250 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 DKL + + R ++D ++ V+ V +G ++ V I I+ Sbjct: 251 ---DKLPLLKNAFRLLVDRLRPDDTVSIVTYAG----NAGTVLEPTAVKDKTKILSAIDT 303 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 L G +T G++ AY A + I+ TDG+ + ++E Sbjct: 304 LQPGGSTAGAAGIDAAYQLAEKAFVRDGVNR---------ILLATDGDFNVGPSSDEELK 354 Query: 307 FYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDR 341 +R G + +G D ++ A Sbjct: 355 RMVETKRRSGIFLSVLGFGRGNYNDALMQTIAQNGN 390 >gi|223939755|ref|ZP_03631626.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] gi|223891531|gb|EEF58021.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] Length = 806 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 73/211 (34%), Gaps = 35/211 (16%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V +K + D++ VLD S SM+ K+ A ++++ ++ + N+ R Sbjct: 302 VDAKAKQIVSKDVVFVLDTSGSMSGK------KMEQAKKALQFCVESL------NDGDRF 349 Query: 219 GLVTFSSKIV-----QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ FS++ + + I L T L+ A L Sbjct: 350 EIIRFSTESEPLFDKLAAVSKENREKAGDFIKNLKAMGGTAIDEALKKA----------L 399 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 +K + ++FLTDG + D + L E + ++ G+ + L Sbjct: 400 SLESKEGRPF--VVVFLTDGLPTVGTTDEDQILKGMQERNKEKRRIFCFGIGTDVNTHLL 457 Query: 334 KNCASPDRFYS--VQNSRKLH----DAFLRI 358 A R +S V L F +I Sbjct: 458 DRIAEETRAFSQYVLPEEDLEVKVSSFFSKI 488 >gi|332308254|ref|YP_004436105.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332175583|gb|AEE24837.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 777 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 73/209 (34%), Gaps = 25/209 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + + P + K + +++ +LD S SM + A R Sbjct: 357 TIGNYRYAMVMLTPPRQDDADDKSTKTPVSAREVVFLLDTSGSMAGE------SIVQAKR 410 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQEKINRLIFGST 252 ++ L + VN ++ F+ + LA +Q + + L Sbjct: 411 AVDFALTQLHPEDSVN------VIEFNDAPQALWNLAMPATANNIQRARNWVASLSANGG 464 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T+ P L A + + ++I +G + ++F+TDG S N D SL A Sbjct: 465 TEMAPALSMA---LHKTNLEQQNINEGSPVQLRQVVFITDG--SVSNEDALMSLIENQLA 519 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASPDR 341 R ++ IG+ + F+ A R Sbjct: 520 DSR---LFTIGIGSAPNSYFMTQAAQAGR 545 >gi|302796872|ref|XP_002980197.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] gi|300151813|gb|EFJ18457.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] Length = 542 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 80/212 (37%), Gaps = 30/212 (14%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S + +D++ VLDVS SM KL + ++ + I ++ R + Sbjct: 73 SSDDARAPVDLVTVLDVSGSMRGQ------KLELVKTAMEFV------IRNLRQQDRLAI 120 Query: 221 VTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V+FS + L G + + +L T+ PGL+ ++ + Sbjct: 121 VSFSDEPKVHLGLKRMTYDGREAALSAVEKLRTLGGTEIRPGLKAGFDLLS--------- 171 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLK 334 + + + I+ L+DG +++ + L + + V+ G ++ + + Sbjct: 172 RRRNRNPVSSIMLLSDGMDNAITFKRCKVLPVDSYLEDCSERVPVHTFGFGSDHDPEAML 231 Query: 335 NC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + A+ F VQ + AF + IG + Sbjct: 232 SIAEATGGSFCYVQEESTVQHAFAQCIGGLLS 263 >gi|99081991|ref|YP_614145.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] gi|99038271|gb|ABF64883.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] Length = 582 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 33/72 (45%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYSVQNSRKLHDAF 355 + +L C AK +G +V+ IG +A + L+ CAS Y + ++ DAF Sbjct: 509 GSTTKDARTLDICEAAKAKGVVVFTIGFEAPSRGQEVLQACASSASHYYDVDGLEISDAF 568 Query: 356 LRIGKEMVKQRI 367 I + + R+ Sbjct: 569 ASIASAIRQLRL 580 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 49/361 (13%), Positives = 109/361 (30%), Gaps = 68/361 (18%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA---- 60 +R F + G ++ +++ ++F++ GL ++ + KL Y LD ++L A Sbjct: 23 RMRQFRRDESGVLAKPMIMIVVLMFMIGGLGMDMVRLERDRTKLQYTLDRAVLAAADLDQ 82 Query: 61 -----------TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIE 109 + + + + +K T+F N F D Sbjct: 83 PLDPEAVVLDYMSKSGLGDYTTVVVPEVSPTAKRVKASVDTNFTASWMNNVFYDDYIRNP 142 Query: 110 RSTSLS-IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS---SHAPLLITSSVKISSK- 164 + L I + + ++ E+ + ++ + V+ Sbjct: 143 DTYQLEPITLPLLASSTAVESIGNVEISLVLDVSGSMRSNDRLVNLKRAAKEFVQTMDDN 202 Query: 165 -SDIGLDMMMV---------------LDVSL----------SMNDHFGPGMDKLGVATRS 198 D + + +V L+VS S +D G+ R+ Sbjct: 203 TEDGKMSISIVPYSTQVSMPEAFLDELNVSSEHDYSHCINFSGSDFNNAGISTTQAYERT 262 Query: 199 IREMLDIIKSIPDVNNVVRSGL-VTFSSKIVQTFPL-AWGVQHIQEKINRLIFGSTTKST 256 + + +VR S +T L + V +Q I+ + T Sbjct: 263 MHFTVWNSGDYRSRTRLVRQPTCAAHSDNPERTALLLSDNVTQLQNYIDAFVPSENTSID 322 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYK--------------------KYIIFLTDGENS 296 G+++ + + + + + K I+ +TDG+N+ Sbjct: 323 LGMKWGSALLDPSVQPVIASLADDANPNQSIASRFANRPVPYTDTETLKVIVMMTDGQNT 382 Query: 297 S 297 S Sbjct: 383 S 383 >gi|149632103|ref|XP_001514474.1| PREDICTED: similar to collagen type VI alpha 4 [Ornithorhynchus anatinus] Length = 1844 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 45/281 (16%), Positives = 99/281 (35%), Gaps = 34/281 (12%) Query: 87 IWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCA 146 I +F++E+ E+ I ++ + D+Q K +++ + C + Sbjct: 544 ITAGNFQDEVAESAQELRQRGIVI-YAIGVKTDNQLKLISIAGTE----ENVLCVNDFDT 598 Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 ++ + ++ D++ ++D S S+ + ++ M+++ Sbjct: 599 LKHIKDEVVQDICFPGACKNMKADIIFLIDGSESIKES------NFEKMKEFMKLMVNMS 652 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEY 261 P VR G++ FSS + F L + I ++ G+ T Sbjct: 653 NIGP---ENVRIGVLQFSSSPREEFMLNKYTTKEDLSRAISDIKQIKAGTQTGQALTFTL 709 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 Y E +Y+I +TDGE ++L + +G ++A Sbjct: 710 PYFDTSRWGRPTE---------PQYLIVITDGEAQDSVKGPAKAL------RDKGISIFA 754 Query: 322 IGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 IGV Q L+ + D+ + + L +I ++ Sbjct: 755 IGVLEANKTQLLEITGTEDQVFYENDFDSLIFLKKKISFKL 795 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 71/197 (36%), Gaps = 30/197 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ ++D S S++ G +D M +++ NN VR G+V +S Sbjct: 441 VDIYFLIDGSSSID--HGDFLDMK-------MFMSEVLSVFQMGNNRVRFGVVQYSDSPH 491 Query: 229 QTFPLAWGVQHI--QEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F + + +E I + + L Y + D K Sbjct: 492 LEFEVGQYHSTVKLKEAIRGIKQLRGRDRIGEALNYMNQRFMDNDR------------VK 539 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +T G ++ E ++RG ++YAIGV+ + + + + + V Sbjct: 540 ILILITAGNFQDEVAES------AQELRQRGIVIYAIGVKTDNQLKLISIAGTEENVLCV 593 Query: 346 QNSRKLHDAFLRIGKEM 362 + L + +++ Sbjct: 594 NDFDTLKHIKDEVVQDI 610 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 51/137 (37%), Gaps = 11/137 (8%) Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEY 261 + +KS+ + VR GLV +S + F L + I + +L + T Sbjct: 260 ETVKSLVIGPDNVRIGLVLYSDEPRLVFSLETFQTKESIMSHLAKLPYRGGKPKTGA--- 316 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A + + + + ++ + +T+G ++ +R G +YA Sbjct: 317 ALKFLRENIFTQDGNRRYEKRVQRMAVVITEG------FSEDRVSKPASQLRRAGVTIYA 370 Query: 322 IGVQAEAADQFLKNCAS 338 +G+Q L+ AS Sbjct: 371 LGIQRGLERGNLEKMAS 387 >gi|187919333|ref|YP_001888364.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] gi|187717771|gb|ACD18994.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] Length = 451 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 55/308 (17%), Positives = 107/308 (34%), Gaps = 45/308 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F +GS+SI A+ L + ++GL +++ + +KA+L D +++ + Sbjct: 11 FRRRQRGSVSIFVAVSLIALLGILGLAVDSGFGYMIKARLDAATDGAVIAAGEAVT---R 67 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 GNN +Q N+ L + A + + + +++I + Sbjct: 68 GNNQTEQTNNAQQAATAFFAANYPAGFLGSSVSAGTPSIVFNAGTVTIGM---------- 117 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 ++ +P F S + + S LDM+ V+D + S+N P Sbjct: 118 -TAQASVPVTF--------SKVLGFNVLNVSSSSQAIRKTLDMVFVIDNTKSLNTSGVP- 167 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQEK 243 + L+ D R L+ F+ V P + + Sbjct: 168 ----AAVRSNAVAFLNNFDVTND-----RVALMHFAYGTVVDVPFKGNTRGFDRATMTTD 218 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 IN+ F +T S L A N++ + + I+F +DG +S + Sbjct: 219 INKYTFDGSTNSPEALWNARNQL--------NTVITQPSSLRVIVFFSDGAPNSFSSFFT 270 Query: 304 ESLFYCNE 311 + CN Sbjct: 271 TNQSKCNN 278 >gi|115486675|ref|NP_001068481.1| Os11g0687100 [Oryza sativa Japonica Group] gi|77552567|gb|ABA95364.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113645703|dbj|BAF28844.1| Os11g0687100 [Oryza sativa Japonica Group] Length = 633 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 45/202 (22%), Positives = 79/202 (39%), Gaps = 30/202 (14%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM-------DKLGVATRSIREMLDI 205 L + + S + LD++ VLDVS SMND +L V S++ ++ Sbjct: 54 LRVEAPPAADLNSHVPLDVVAVLDVSGSMNDPVAAASPKSNLQGSRLDVLKASMKFVIRK 113 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFP------LAWGVQHIQEKINRLIFGSTTKSTPGL 259 + + R +V F+ V+ + G +KI+RL T P L Sbjct: 114 LA------DGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGTALMPAL 167 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E A KI D ++ G +I+ LTDG++++ ++++ V Sbjct: 168 EEAV-KILDERQGSSRNHVG------FILLLTDGDDTTGFRWTRDAIHGA----VFKYPV 216 Query: 320 YAIGVQAEAADQFLKNCASPDR 341 + G+ A + L + A R Sbjct: 217 HTFGLGASHDPEALLHIAQGSR 238 >gi|258619934|ref|ZP_05714975.1| RTX protein, putative [Vibrio mimicus VM573] gi|258587804|gb|EEW12512.1| RTX protein, putative [Vibrio mimicus VM573] Length = 1108 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 68/229 (29%), Gaps = 30/229 (13%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSV--------KISSKSDIGLDMMMVLDVSLSMNDHF 185 +P L+I S++ G ++ ++LDVS SM++ Sbjct: 456 NIPLEAKNAAGAIGIGKVTLVIEDDAPVAKEVFHITESEAKQGANVQLILDVSGSMDNSA 515 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT---FPLAWGVQHIQE 242 G G +L V S ++L+ ++I L+ F+ L V+ + Sbjct: 516 GNGKSRLQVMKESAIQLLEQYQAIGQTK----VQLIIFAGTASVQLHEKALWMTVEQAKS 571 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-------- 294 I L G T L+ A D H FL+DG Sbjct: 572 YIETLKAGGQTDYDHALQLA-----DENWSGNHNGIPLIGATNVSYFLSDGRPEGFDKQN 626 Query: 295 --NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + I+ E + + + A G+ L A Sbjct: 627 GVENQNTIEPSELASWISHLQDNEITALAYGMGNSVPQSELDKIAFDGH 675 >gi|168817956|ref|ZP_02829956.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205345018|gb|EDZ31782.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320086747|emb|CBY96519.1| Inter-alpha-trypsin inhibitor heavy chain H3 Inter-alpha-inhibitor heavy chain 3; ITI heavy chain H3; ITI-HC3; Flags: Precursor [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 604 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 245 LVFLIDTSGSMQ-----PAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALA 295 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 296 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 346 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 347 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 406 Query: 348 SRKLHDAFLR 357 + Sbjct: 407 LSEAQKVLKD 416 >gi|186683831|ref|YP_001867027.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186466283|gb|ACC82084.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 615 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 64/194 (32%), Gaps = 29/194 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L++ +V+D S SM G + A S+ + L+ + +V + ++ Sbjct: 39 LNLSLVIDRSGSMA---GAALHHALKAAESVVDQLEPDDILS---------VVVYDDEVD 86 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P +++ I ++ G T + G + K D +K Sbjct: 87 SVVPPQAVTNKATLKDSIRKVRAGGITNLSGGWLKGCEHV-----------KTRLDPQKI 135 Query: 287 --IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRF 342 ++ LTDG + D K + G +G + L A F Sbjct: 136 NRVLLLTDGHANMGIQDPKVLTATSGQKAEEGITTTTLGFAQGFNEDLLIGMARAARGNF 195 Query: 343 YSVQNSRKLHDAFL 356 Y +Q+ + + F Sbjct: 196 YFIQSIDEATEVFS 209 >gi|225551939|ref|ZP_03772879.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] gi|225370937|gb|EEH00367.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] Length = 332 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 80/233 (34%), Gaps = 31/233 (13%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS---DIGLDMMMVLDVSLSMNDHFGPG 188 R + + F F + + K G D+++VLD+S SM Sbjct: 51 RLNLMYFFTYFFLYLAAMVMVFALAGPSVSKKKMIHLSAGADIVIVLDISPSMGAVEFSS 110 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-- 246 ++L + ++I+ GLV F+ P+ + +K++ Sbjct: 111 KNRLEFSK-------ELIRGFISQRENDNIGLVAFAKDASIVVPITTDREFFNKKLDDIY 163 Query: 247 -LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + G+ + G+ A + + K + K+ I+ LTDG +S I + Sbjct: 164 IMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDEIYKDQV 212 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP----DRFYSVQNSRKLHDA 354 + N A+ +Y++G+ + S F V + L + Sbjct: 213 I---NLAQGLNVKIYSVGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVEI 262 >gi|188590759|ref|YP_001922423.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] gi|188501040|gb|ACD54176.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] Length = 984 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/133 (21%), Positives = 56/133 (42%), Gaps = 19/133 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM D K+ + E ++ IK IP+++ + VT+S+ Sbjct: 92 IVLVLDTSGSMKDS------KIKKMKNAAMEFVNKIKKIPNLDIDI----VTYSTSGYTY 141 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + + IN + T + GL A + + + + K I+F+ Sbjct: 142 LNNGNTEEDLLKIINSIKADGGTNTGEGLRKANYIL---------DLEKNKNADKSIVFM 192 Query: 291 TDGENSSPNIDNK 303 +DG + +I K Sbjct: 193 SDGMPTYYSIIAK 205 >gi|258516146|ref|YP_003192368.1| von Willebrand factor type A [Desulfotomaculum acetoxidans DSM 771] gi|257779851|gb|ACV63745.1| von Willebrand factor type A [Desulfotomaculum acetoxidans DSM 771] Length = 219 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 71/204 (34%), Gaps = 14/204 (6%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM FG + + ++ M+ +K P ++TF S Sbjct: 11 RLPVYLLLDRSGSM---FGEPI---EAVKQGVKYMISELKKEPQAIETAYISVITFGSDA 64 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q L + K ++ TT L N + K KG DYK + Sbjct: 65 RQDVQLT---ELAAFKEPQIEANGTTSLGAALHILNNCFDNEVRKSTPTQKG--DYKPLV 119 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDGE P D + + + + A + A+G + LK S Sbjct: 120 FIMTDGE---PTDDWENAAREIKQKSGKVANIVAVGCGPDVNTDTLKKITDIVLLMSSYQ 176 Query: 348 SRKLHDAFLRIGKEMVKQRILYNK 371 F + + + + I + K Sbjct: 177 PEDFKQFFRWVSQSVKQASIKFTK 200 >gi|294678572|ref|YP_003579187.1| hypothetical protein RCAP_rcc03056 [Rhodobacter capsulatus SB 1003] gi|294477392|gb|ADE86780.1| conserved hypothetical protein [Rhodobacter capsulatus SB 1003] Length = 647 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 29/72 (40%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYSVQNSRKLHDAF 355 ++ + + C+ AK +G ++++ A LK C+S +Y L AF Sbjct: 574 DTSVKDARTKKLCDLAKSKGIYIFSVAADAPSGGKTLLKYCSSGTSYYYEVQGSNLSTAF 633 Query: 356 LRIGKEMVKQRI 367 I + R+ Sbjct: 634 ASIAASISSLRL 645 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 76/241 (31%), Gaps = 41/241 (17%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + N G++ IL+ + V+ I G+ I+ + + +D ++L A+ Sbjct: 21 LHRLLRNEDGALIILSLQVFLVMLITTGIAIDLVRVEERRTLIQNTIDRAVLAAAS---- 76 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 QK D + + + + ++ F + + Sbjct: 77 -------LTQKRDPTLVVKDYLTKAGLGYIASDSSF-----------TPKVEGSIALGWR 118 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-- 183 +S +MP IF + L T ++ +++ +VLD+S SM + Sbjct: 119 RVSVEVDDDMPTIFGPLLGVS-----SLAATGDTTAM-QAVGNVEISLVLDLSGSMTEYV 172 Query: 184 HFGPGMDK-----------LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 P K L VA +S + +V +S+ + Sbjct: 173 KDNPSCTKNCTSSKTRFQYLQVAAKSFINTVFASSGSGVAAGRTSVSVVPYSTNVYLGSE 232 Query: 233 L 233 + Sbjct: 233 M 233 >gi|163848230|ref|YP_001636274.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526140|ref|YP_002570611.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669519|gb|ABY35885.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450019|gb|ACM54285.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 419 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 76/219 (34%), Gaps = 31/219 (14%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 LL + ++ + + + VLD S SM G +++L A E L+ I Sbjct: 28 LLEAQPGAVLTQVRVPIHVCFVLDRSGSMK---GEKIERLRQAVVRAIEQLNEQDIIS-- 82 Query: 213 NNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 +V F + P I + ++R+ T+ P LE ++ Sbjct: 83 -------IVIFDHRTEVLVPAQPVRQRTAILDLVHRIRDAGGTRIAPALEKGIQELQKMP 135 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++ LTDG+ KE L ++A R G + A+G+ + + Sbjct: 136 QGVRR-----------LVLLTDGQTEH----EKECLLRADDAGRLGIPITALGIGKDWNE 180 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L A S + ++ F + + I Sbjct: 181 DLLIEMANRSRGVADYIAQPGEIVQYFQHTVQRAQQTTI 219 >gi|330465656|ref|YP_004403399.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328808627|gb|AEB42799.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 410 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 67/197 (34%), Gaps = 33/197 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQ 229 + +VLDVS SM G ++ VA ++ +++ ++PD + +R T+ + Q Sbjct: 18 VQLVLDVSGSMRATDIDGRSRISVAQQAFG---EVVDALPDETQLGIRVLGATYRGEDKQ 74 Query: 230 TFPLAW---------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 L + + + L T L A + G Sbjct: 75 QGCLDTQQIVPVGPVNRERAKAAVATLRPTGFTPVGLALREAAKDL------------GG 122 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN----- 335 + I+ +TDGE++ D + E +G + + ++ + Sbjct: 123 GTTARRIVLITDGEDTCAPPDPCQ---VARELAAQGTTLVVDTLGLAPDEKVRRQLLCIA 179 Query: 336 CASPDRFYSVQNSRKLH 352 A+ + + ++ L Sbjct: 180 AATGGTYTAATSAEDLT 196 >gi|221111394|ref|XP_002160866.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 419 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 73/208 (35%), Gaps = 21/208 (10%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 ++ + + + +D+ +LD S S+ + + L Sbjct: 214 ATTQASSTDGHVQPRCEAVVDVAFILDSSHSLEASYQKEKNFLKKLAAVFGI-------- 265 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKI 266 +N R G++TFS + + L + E ++++ TT+ L A + Sbjct: 266 --SSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDM 323 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 F + G K II LTDG + P D ++ +E + G ++ +G+ + Sbjct: 324 FTSA------NGGRVGVSKLIILLTDGSQT-PGGDAEDPERIADELRNDGVVILGVGIGS 376 Query: 327 EAADQFLKNCASP-DRFYSVQNSRKLHD 353 + L + Y+ L D Sbjct: 377 AVNETELSHITGGKKNAYTAATFDSLTD 404 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 71/197 (36%), Gaps = 21/197 (10%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + + +D+ +LD S S+ + + L +N R G+ Sbjct: 9 VQPRCEAVVDVAFILDSSHSLEASYQKEKNFLKKLAAVFGI----------SSNGSRVGV 58 Query: 221 VTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +TFS + + L + E ++++ TT+ L A +F + Sbjct: 59 ITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTSA------N 112 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 G K II LTDG + P D ++ +E + G ++ +G+ + + L + Sbjct: 113 GGRVGVSKLIILLTDGSQT-PGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHIT 171 Query: 338 SP-DRFYSVQNSRKLHD 353 Y+ L D Sbjct: 172 GGKKNAYTAATFDSLTD 188 >gi|77464595|ref|YP_354099.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389013|gb|ABA80198.1| Putative membrane protein with von Willebrand (VWA) domain [Rhodobacter sphaeroides 2.4.1] Length = 651 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 81/237 (34%), Gaps = 21/237 (8%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + + PW + + + + + + L+++ ++D S SM D Sbjct: 255 NGTPPFRPTLSVTRTPWNPETQLVHVALQGRM-PAIEDRPPLNLVFLIDTSGSMQDPA-- 311 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 KL + +S ML ++ V V +G S+ V A I ++RL Sbjct: 312 ---KLPLLKQSFGLMLGRLRPEDQVAIVTYAG----SAGEVLAPTAANQRSTILSALDRL 364 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G +T GL AY + G + + ++ TDG+ + D +E Sbjct: 365 DAGGSTAGEEGLALAY--------RTASEMAGAGEVTR-VVLATDGDFNLGISDPEELAR 415 Query: 308 YCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + G + +G D ++ A + L++A + ++ Sbjct: 416 LVAHERDTGIYLSVLGFGRGNLDDATMQALAQNGNGQAAY-IDSLNEAQKVLVDQLS 471 >gi|261867447|ref|YP_003255369.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412779|gb|ACX82150.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] Length = 545 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 65/366 (17%), Positives = 129/366 (35%), Gaps = 67/366 (18%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL---YT 59 F ++ F N G +I+TA+L + + + ++ + KA+L D + L Sbjct: 12 FSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLLIAE 71 Query: 60 ATKILNQENGNNGKKQ----------KNDFSYRIIKNIWQTD----FRNELRENGFAQDI 105 + ++ ++ K+Q DFS ++ W+ + ++ + D Sbjct: 72 DNQYRKNKDHSDVKRQNVSQQEIEREGRDFSSAKVQAQWKKRNQELVQGLVKLYLRSDDS 131 Query: 106 NNIERSTSLSI-----------IIDDQHKDYNLSAVSRYEMPFIFCTF--PWCANSSHAP 152 + S+ ++I ++K+ +++ + F PW + Sbjct: 132 KGQKNSSPVTIKEPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSSS 191 Query: 153 LLITSSVKISSKSDIG---------LDMMMVLDVSLSMNDHFGP-------------GMD 190 L V I+S +D+MMV D+S SM +D Sbjct: 192 QLYDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMMWAINATGNNPPEVNYPNRRID 251 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV--QHIQEKINRLI 248 L A I ++L ++ DV+ R G V+F++ Q L V +++ + + Sbjct: 252 ALREAVEGIEKILLPAQNKGDVSPYNRMGFVSFAAGTRQRDELTNCVLPYYVKSEDKKR- 310 Query: 249 FGSTTKSTPGLEYAY---NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE--NSSPNIDNK 303 + + + Y N I E L+ + F DG+ +D+K Sbjct: 311 -----EISAKFKKGYNGGNHIAKGFELLDRDLDIPKTIDQISQF--DGQKRTYDFTLDSK 363 Query: 304 ESLFYC 309 S YC Sbjct: 364 TSRNYC 369 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 57/156 (36%), Gaps = 23/156 (14%) Query: 225 SKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-LEHIAKGH 280 + + AW + + + ++I T T G+ N + + + K Sbjct: 372 DNVNKKTTQAWFDKNNRAVASALKQIIPRGGTAVTSGIFIGTNLMMEKNKDFEAMPNKIG 431 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFY---CNEAKRR--------------GAIVYAIG 323 + ++ ++ L+DGE++ P+ D L C K + A G Sbjct: 432 TNTRRILMILSDGEDNIPSKDTLVKLMEAGLCTRVKEKIDGLQDSNYPKVETRIAFVAFG 491 Query: 324 VQAEAADQ-FLKNCASPDRFYSVQNSRKLHDAFLRI 358 Q K C D++YSV + + L DAF +I Sbjct: 492 FNPPQKQQEVWKKCV-GDQYYSVSSKQALFDAFKQI 526 >gi|229819442|ref|YP_002880968.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229565355|gb|ACQ79206.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 647 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 74/220 (33%), Gaps = 33/220 (15%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + + + + ++ ++D+S SM G +L A +I LD + Sbjct: 441 VAAVAEAFPEVRKDAQVLFLVDLSASMGYDNTAGQTRLEGAQHAITAALDHFTAGD---- 496 Query: 215 VVRSGLVTFSSKIVQTFP--------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R GL F++ P +A + E +N L A+ + Sbjct: 497 --RVGLAGFTTTDGTITPGLVAPVADIADNRGALVEGVNSLEP-----------VAHTPL 543 Query: 267 FDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRGAI----VY 320 + A D + I+ L+DG N++ +I+ + A V+ Sbjct: 544 YQAVADFAQQQAAMWDPDRINAIVLLSDGVNATGDIETIGQDDMIHVLHGLHAETPVLVF 603 Query: 321 AIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRI 358 +G +A + L+ A+ +Y + ++ + Sbjct: 604 TLGYSPDADVETLQAISSATGAHYYDATDPTEVEAVLGDL 643 >gi|158257430|dbj|BAF84688.1| unnamed protein product [Homo sapiens] Length = 914 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 76/208 (36%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A I Sbjct: 357 SELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSASTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|8393899|ref|NP_059047.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Rattus norvegicus] gi|3024056|sp|Q63416|ITIH3_RAT RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|602886|emb|CAA58233.1| pre-alpha-inhibitor, heavy chain 3 [Rattus norvegicus] Length = 887 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 70/175 (40%), Gaps = 13/175 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVTFSSKIVQ 229 + V+DVS SM+ K+ ++ ++LD +K +N + +G+ T+ +V+ Sbjct: 285 IAFVIDVSGSMSG------RKIQQTREALLKILDDMKEEDYLNFILFSTGVTTWKDHLVK 338 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P ++ + + + S T GL + A+E + ++ Sbjct: 339 ATPA--NLEEARAFVKNIRDRSMTNINDGLLRGIEMLNKAREDHLVPERSTS----ILVM 392 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 LTDG+ ++ ++ A R +Y +G FL++ A + ++ Sbjct: 393 LTDGDANTGESRPEKIQENVRNAIRGKFPLYNLGFGNNLNYNFLESLALENHGFA 447 >gi|327270782|ref|XP_003220167.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 904 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 79/217 (36%), Gaps = 51/217 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD+S SM G D++ ++ + L I + +G+V F+S+ + Sbjct: 303 LCLVLDISGSM-----NGFDRIYRLRQAGEQFLLQI-----LETGSWAGIVVFNSQALTK 352 Query: 231 FPLA--WG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L G Q + + G T G+ + E Sbjct: 353 TYLKQITGDSVRQTLSAYL-PTAAGGGTNICSGIREGFQVFLKKYPSTEGCE-------- 403 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF---------LKN 335 I+ LTDGE++ + C E +R G+I++ I + AA + LK Sbjct: 404 -IVLLTDGEDAGVSS--------CFAEVQRSGSIIHTIALGPSAAKELEMLADMTGGLKF 454 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGK---EMVKQRILY 369 A+ +S L DAF I ++ +Q I Sbjct: 455 SATDS-----LDSNGLIDAFSGISSGSGDISQQSIQL 486 >gi|297665730|ref|XP_002811194.1| PREDICTED: cartilage matrix protein-like [Pongo abelii] Length = 495 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 71/170 (41%), Gaps = 25/170 (14%) Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPG 258 I+ ++ + + + GLV +SS + Q FPL G H ++ I + T + Sbjct: 299 QIVDTLDVSDKLAQVGLVQYSSSVRQEFPL--GRFHTKKDIKAAVRNMSYMEKGTMTGAA 356 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L+Y + D + A+ +K I TDG + D +AK G Sbjct: 357 LKY----LIDNSFTVSSGARP--RAQKVGIVFTDGRSQDYIND------AAKKAKDLGFK 404 Query: 319 VYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 ++A+GV D+ + + P + ++ + + ++ IGK++ K+ Sbjct: 405 MFAVGVGNAVEDELREIASEPVAEHYFYTADFKTINQ----IGKKLQKKI 450 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 66/173 (38%), Gaps = 21/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + ++++ + P N R G+V ++S + Q Sbjct: 41 DLVFVVDSSRSVRPV------EFEKVKVFLSQVIESLDVGP---NATRVGMVNYASTVKQ 91 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + + R+ + T + +++A K F E D K Sbjct: 92 EFSLRAHISKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSR---SPDISKV 148 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I +TDG D A+ G ++AIGV + + P Sbjct: 149 VIVVTDGRPQDNVQDVSA------RARASGVELFAIGVGRVDKATLRQIASEP 195 >gi|167525755|ref|XP_001747212.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774507|gb|EDQ88136.1| predicted protein [Monosiga brevicollis MX1] Length = 471 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 73/204 (35%), Gaps = 34/204 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D++ V+DVS SM KL + ++ ++ ++ + R LVTF Sbjct: 54 ERPAIDLVAVIDVSGSMAGQ------KLKMVQSTLEFLM------RNLKDTDRFALVTFD 101 Query: 225 SKIVQTF---PLAWGVQHI-QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S + F P+ + + +L GS T + GL + ++ Sbjct: 102 SDVKTVFDLRPMTTAHKEACLADVQKLRAGSCTNLSGGLFRGVELMQQRGATKGAVSS-- 159 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-----RRGAIVYAIGVQAEAADQFLKN 335 I+ +TDG + D + C + +Y G + + L+ Sbjct: 160 ------ILLMTDGIANEGVRDKDD---MCRALRGLMGPAPDYTIYTFGYGKDHNENMLRQ 210 Query: 336 CASPDR--FYSVQNSRKLHDAFLR 357 + +Y ++++ + ++F Sbjct: 211 LSETGNGMYYFIESNDIIPESFGD 234 >gi|126303712|ref|XP_001380869.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 628 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 47/310 (15%), Positives = 101/310 (32%), Gaps = 45/310 (14%) Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 ++ + ++ + + NI+ E + N+ + + Sbjct: 347 VAIVMVDGWPTDKVEEASRLARESGINIFFITI-----EGAVENEKQNVIEPNFVDKAVC 401 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 ++ Y+L+ +S + + P L S ++S D+ V+D S Sbjct: 402 RRNGFYSLNVLSWFSL--HKIVQPLVKRVCDTNRLACSKTCLNSA-----DIGFVIDGSS 454 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 S+ G + + + K + R G V ++ Q + Sbjct: 455 SV------GTGNFRTLLQFVANL---SKEFEISDTDTRIGAVQYT--YEQRLEFGFDKYS 503 Query: 240 IQEKI-NRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 ++ I N + + T + + YA +F K + +K +I +TDG Sbjct: 504 TKQDILNAIKRVNYWSGGTSTGAAINYALEHLF---------KKSKPNKRKLMILITDGR 554 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLH 352 + + A + G I Y+IG+ A D+ P D + V L+ Sbjct: 555 SYD------DVRIPAMAAHQNGVITYSIGIAWAAQDELEVIATHPTKDHSFFVDEFDDLY 608 Query: 353 DAFLRIGKEM 362 + +I + + Sbjct: 609 KSVPKIIQNI 618 >gi|149034207|gb|EDL88977.1| inter-alpha trypsin inhibitor, heavy chain 3 [Rattus norvegicus] Length = 886 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 70/175 (40%), Gaps = 13/175 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVTFSSKIVQ 229 + V+DVS SM+ K+ ++ ++LD +K +N + +G+ T+ +V+ Sbjct: 285 IAFVIDVSGSMSG------RKIQQTREALLKILDDMKEEDYLNFILFSTGVTTWKDHLVK 338 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P ++ + + + S T GL + A+E + ++ Sbjct: 339 ATPA--NLEEARAFVKNIRDRSMTNINDGLLRGIEMLNKAREDHLVPERSTS----ILVM 392 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 LTDG+ ++ ++ A R +Y +G FL++ A + ++ Sbjct: 393 LTDGDANTGESRPEKIQENVRNAIRGKFPLYNLGFGNNLNYNFLESLALENHGFA 447 >gi|120556589|ref|YP_960940.1| vault protein inter-alpha-trypsin subunit [Marinobacter aquaeolei VT8] gi|120326438|gb|ABM20753.1| Vault protein inter-alpha-trypsin domain protein [Marinobacter aquaeolei VT8] Length = 712 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 70/201 (34%), Gaps = 38/201 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + +++ V+D S SM + A ++ LD + R ++ F+S+ Sbjct: 352 LRRELLFVIDTSGSMAGE------SIRQARSALLRGLDTL------RPGDRFNVIQFNSQ 399 Query: 227 I---VQTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 A G ++ + L T+ L A D E H+ + Sbjct: 400 AHALYTQPVPANGHYLARARDYVQDLTADGGTEMAGALSLA--MGMDGSESSGHVQQ--- 454 Query: 282 DYKKYIIFLTDGE--NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 ++F+TDG N S D + ++ + + + FL+ A Sbjct: 455 -----MVFMTDGAVGNESALFDQIRTGL-------GNRRLFTVAIGSAPNMHFLREAARW 502 Query: 338 SPDRFYSVQNSRKLHDAFLRI 358 ++ +V ++ ++ A ++ Sbjct: 503 GRGQYTAVHSAAEVDKALGKL 523 >gi|51598434|ref|YP_072622.1| hypothetical protein BG0172 [Borrelia garinii PBi] gi|51573005|gb|AAU07030.1| hypothetical protein BG0172 [Borrelia garinii PBi] Length = 333 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 79/206 (38%), Gaps = 27/206 (13%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + S K G D+++VLD+S SM Sbjct: 49 KDYRLNLIYFFTYSFLYLAAMVMVFALAGP---SVSKKKMIHLSAGADIVIVLDISPSMG 105 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + ++I+ GLV F+ P+ + Sbjct: 106 AVEFSSKNRLEFSK-------ELIRRFISQRENDNIGLVAFAKDASIVVPITTDRDFFNK 158 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 159 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 207 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQ 325 I + + N A+ +Y+IG+ Sbjct: 208 IYKDQVI---NLAQGLNVKIYSIGIG 230 >gi|46445753|ref|YP_007118.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] gi|46399394|emb|CAF22843.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] Length = 362 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 43/219 (19%), Positives = 75/219 (34%), Gaps = 31/219 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR----SGLVTF 223 G+ + +++D S SMN+ + RS + +D++K + + R GLV F Sbjct: 95 GIAIYLIVDQSSSMNEKISS--NSFFERGRSFTK-IDLLKVMAGQFILHRPSDLIGLVAF 151 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFD--------AKEKLE 274 + PL + + ++N L S + + YA K A+E + Sbjct: 152 ARVPKILSPLTLDHELLINQLNDLKAINSMEEDGTAMGYAIYKTAHLIVATKHFAQELQK 211 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN------EAKRRGAIVYAIGVQ--- 325 ++ ++ LTDG +D L AK G +Y I V Sbjct: 212 KGKPAYEIKNAIMVVLTDGFQDPNRLDYGNRLRTIELDEAIAYAKNAGIHLYIINVDPKF 271 Query: 326 -----AEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLRI 358 A Q ++ + Y + L F I Sbjct: 272 SSPQFAPHRRQIETLAESTGGQLYLANQEKDLKRVFDTI 310 >gi|301616677|ref|XP_002937788.1| PREDICTED: collagen alpha-6(VI) chain-like, partial [Xenopus (Silurana) tropicalis] Length = 1529 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 78/199 (39%), Gaps = 21/199 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S++ + + +++ PD V+ G + +S + Sbjct: 194 DIVFVIDSSGSIDY------TEYKEMQNFMVSLVNKSAVGPDN---VQFGALKYSDYNTE 244 Query: 230 TFPLAWGVQHI--QEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + IN+ G T + + ++ +H ++ + Sbjct: 245 LFYLNRYTNKVDIINHINKDTTQGGNTYTAGAVRFSKEFFT-----EKHGSRKARGVPQI 299 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 ++ +TDG+ + D + + ++ G I+YAIG+ ++ + +++ V Sbjct: 300 VMVITDGD----SHDKDKLNETARQLEQEGIIIYAIGIDQANTNELETLAGTEGKWFMVA 355 Query: 347 NSRKLHDAFLRIGKEMVKQ 365 N L D +++ + M + Sbjct: 356 NFSGLQDILVQVSEAMCNK 374 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 69/193 (35%), Gaps = 27/193 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D++ ++D S M DK ++++++ + V+ V FS+ Sbjct: 3 DLMADILFLVDSSSGMGS------DKFSKMKTFMKDLVNKTEV---GLTGVQFAAVQFSN 53 Query: 226 KIVQTFPLAWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + F + + I N + G+ K+ L + +AK A+ Sbjct: 54 SIKEEFQ--FNKHATKNAIWDSIDNMNLMGNVAKTGNALANVADYFTEAKG-----ARPS 106 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 K ++ +TD N E + + G I+Y+IG + + + Sbjct: 107 SKVSKILLLITD------NPSQDEVKVPADSLRSNGLIIYSIGGFSANKKELQEISGKIT 160 Query: 341 RFYSVQNSRKLHD 353 +Y + + + Sbjct: 161 PYYQSFDKLQTIE 173 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 61/189 (32%), Gaps = 15/189 (7%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ ++ + + ++D P V + +S ++ + Sbjct: 383 DLLFLIDGSSNTSEETFREIKNFVVSVMDDFNVGPVN---VHIAVSQYSESCIREINFDY 439 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + K N +I TK + A + + + K+ ++ +T G Sbjct: 440 STERGTLK-NEIINIRKTKGRRHIGAALDFTKSTVYSPSSDNRLNQGVKQLLVVITAGNA 498 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAF 355 S ++L + RG +YA+G+ Q + SP++ Y Sbjct: 499 SDQVARPAKAL------RDRGVDIYAVGIGNICKTQLTQITGSPEKIY-----TDDVSGL 547 Query: 356 LRIGKEMVK 364 I K +V+ Sbjct: 548 KAIKKRLVR 556 >gi|156371145|ref|XP_001628626.1| predicted protein [Nematostella vectensis] gi|156215607|gb|EDO36563.1| predicted protein [Nematostella vectensis] Length = 484 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 76/206 (36%), Gaps = 21/206 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +D S SM D A R ++ ++ K G++ FS++ Sbjct: 271 DLAFAIDASGSMGDQ------GFLRAKRFVKALIGSFKVSQ---KGTHVGIIRFSTRAKV 321 Query: 230 TFPLA--WGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + + I+ + + TK+ L A ++F + ++ K Sbjct: 322 MFTFTEHFTHEDVNYAIDDIEYTEGGTKTELALRLARTELFS----KQGGSRTSPLIFKL 377 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYSV 345 + +TDG + + +++ KR G V A+G+ + L+ AS V Sbjct: 378 FVLMTDGRSEYFHAVARQAKML----KRSGVHVMAVGIGKYTNQRELEVIASSKSDVIGV 433 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + R L I ++ + +L K Sbjct: 434 VSFRDLMIRMNEIKDKLCEIALLNEK 459 >gi|324508820|gb|ADY43721.1| C-type lectin protein 160 [Ascaris suum] Length = 534 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 72/192 (37%), Gaps = 26/192 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + LD+++V+D SLSM G+ ++ ++ + ++ + S DV VR GLVTFS Sbjct: 183 AKLYLDIVVVVDSSLSMTKD---GLIEVAADLATVFQWMN-VSSGTDVGQFVRVGLVTFS 238 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI----FGSTT-KSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ L + RL G + L+ A + + ++ Sbjct: 239 NQAFVNGNLD-DFTSYNSLVKRLFQMPYLGGSELNIESALQSASDILQSSRYYARTA--- 294 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNC 336 I+ T D K N+ K G + + + + L + Sbjct: 295 -------ILLYTSAYGEGGFTDPK---AIANQIKESGTKIITVAFRQQPEGSLVEKLSHL 344 Query: 337 ASPDRFYSVQNS 348 ASP ++ + S Sbjct: 345 ASPGFSFASRQS 356 >gi|320106178|ref|YP_004181768.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924699|gb|ADV81774.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 365 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 30/223 (13%), Positives = 81/223 (36%), Gaps = 36/223 (16%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++ D+ + + +++D S SM D V S LD ++ + Sbjct: 106 QTIASFRHEDLPVSLGILIDSSGSMYDKRE------AVGKAS----LDFVRLSNPKDEAF 155 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 LV FS + V+ +++ + + T + + + + Sbjct: 156 ---LVDFSDEAFIDQDFTSDVKKLEDGLGYVKASGGTAIYDAVVASADYL---------- 202 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG--VQAEAAD---- 330 AK K+ ++ +TDG++++ ++++ E G ++Y +G ++ Sbjct: 203 AKNAKLPKQVLLIVTDGDDNASGSTLEDAIRRVQEL--DGPVIYCVGLLFGPDSNKSESR 260 Query: 331 ---QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + L+ A + Y + ++ + +++ +Q + Sbjct: 261 HARRVLETLAAQTGGLAYFPRKLSEVDSIATEVAQDIRQQYTI 303 >gi|156409367|ref|XP_001642141.1| predicted protein [Nematostella vectensis] gi|156229282|gb|EDO50078.1| predicted protein [Nematostella vectensis] Length = 193 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 68/185 (36%), Gaps = 21/185 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S S+ H G ++ ++ P R GL+ +SS+ Sbjct: 1 DLGILIDSSNSIEKH---GRGNFRRVLEFVKRLVSTFHVSPRRA---RIGLIVYSSRSYL 54 Query: 230 TFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIA-----KGHD 281 ++ + + I R+ T + ++YA K+F + H Sbjct: 55 VGGFRRYRNLRSVLQAIKRIRYIRGGTYTGKAMKYALRKLFSRRAGYHHARVRLFRSSRK 114 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPD 340 K ++ +TDG + L K+ G +++++GV Q ++ + P Sbjct: 115 GAAKILVMITDGISQDRVTTPALRL------KKMGVVIFSVGVGKRYRLKQLMQIASRPR 168 Query: 341 RFYSV 345 ++ Sbjct: 169 LVFTA 173 >gi|110611231|ref|NP_001276.2| calcium-activated chloride channel regulator 1 precursor [Homo sapiens] gi|146327635|gb|AAI41452.1| Chloride channel accessory 1 [synthetic construct] gi|162318850|gb|AAI56806.1| Chloride channel accessory 1 [synthetic construct] Length = 914 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 NELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|73980586|ref|XP_540098.2| PREDICTED: similar to matrilin 3 precursor [Canis familiaris] Length = 481 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 72/202 (35%), Gaps = 28/202 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ ++D S S+ + + +++D + R +V ++ Sbjct: 77 KSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSQIIDTLDI---GAADTRVAVVNYA 127 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + F L Q +++ + R+ + T S ++ A ++ F Sbjct: 128 STVKIEFHLQTYSDKQSLKQAVARITPLSTGTMSGLAIQTAMDEAFTE---EAGARGPTS 184 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + K I +TDG + A+ G +YA+GV A + LK AS Sbjct: 185 NIPKVAIIVTDGRPQD------QVNEVAARARASGIELYAVGVD-RADMESLKIIASEPL 237 Query: 339 PDRFYSVQN---SRKLHDAFLR 357 + + V+ KL F Sbjct: 238 DEHVFYVETYGVIEKLSSRFQE 259 >gi|303248312|ref|ZP_07334574.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302490337|gb|EFL50249.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 452 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 59/454 (12%), Positives = 129/454 (28%), Gaps = 121/454 (26%) Query: 19 ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKND 78 ++ A L + +G+ ++ + KL +D + L + ++ + + +NGK + Sbjct: 1 MVVAATLVGLMAAVGVAVDLGRVYVAHNKLQNAVDAAALAGSLQLPDDPDVDNGKVSQAV 60 Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFI 138 + N+ RS ++ D + + + Sbjct: 61 TT---------NLAANDPEAKATDISSGGATRSVCVTAEADVDMTLSKVVGLDATTVTAE 111 Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 C + L T S++ + +++ ++D+ M D K+G+ Sbjct: 112 ACA-GYNDIELVMVLDATGSMRGTPIANVKEAAANLVDLI--MPDSGANTRSKIGLVPFQ 168 Query: 199 IREMLDIIKSIPDVNNVVRSGL------------------------------------VT 222 + +D + + G T Sbjct: 169 GKVRIDGNDPVTAERDPDGVGAGCRNADGTLNDGKLKTEYSDTRSRNSIFYGYTISGVST 228 Query: 223 FSSKIVQTFP----LAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEH 275 + + L+ + I + I + G S T + G+++ + + E Sbjct: 229 YYDRTCSGMSPIRALSSDKEAILDNIGAINAGAVTSGTLISEGIKWGHKVLSPKAPYTEG 288 Query: 276 IAKGHDDYKKYIIFLTD----------------------------GENSSPNID------ 301 +K +I LTD G+ PN Sbjct: 289 N--TDKKVRKIMIVLTDGDTEDGRCGGRYASASRTVNAYWTNAYFGQGLRPNSASSPYDT 346 Query: 302 --------------------NKESLFYCNEAKRRG---AIVYAIGVQAEAADQ--FLKNC 336 N+ L ++AK ++AI A +K Sbjct: 347 LSTASATLAQIPDCTDGGKLNQYVLDEADDAKNDADYPVEIFAIRFGDSDATDISLMKRI 406 Query: 337 ASP-----DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 AS D +Y +S + D F +IG+++ ++ Sbjct: 407 ASSKSGTDDHYYDAPDSSDIKDMFKKIGQQLGQR 440 >gi|284037552|ref|YP_003387482.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283816845|gb|ADB38683.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 320 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 71/205 (34%), Gaps = 28/205 (13%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLG 193 + F + + + + ++ D+ D +++DVS SM D +L Sbjct: 43 VVPKFFLRGSYLTLLIIALLGPSFGEAEGDLITTSHDTFLLVDVSRSM-DAGDIVPTRLE 101 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 I+++ D + + R GL+ + + + PL +++ I + Sbjct: 102 RVKYDIQQLCDTLPA-------DRFGLILAAPQSILLSPLTADHDALKQFIREVHTSISP 154 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T + A K+ D + + + I+ +DGEN S + + Sbjct: 155 TGETDLCNAIAMARQKLIDDSSTHQSV--------RAIVLFSDGENFSSCEQTELARL-- 204 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLK 334 + G + +GV EA K Sbjct: 205 ---RSFGLPLVTVGVGTEAGASIRK 226 >gi|311033467|sp|A8K7I4|CLCA1_HUMAN RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=hCLCA1; AltName: Full=Calcium-activated chloride channel protein 1; Short=CaCC-1; Short=hCaCC-1; Flags: Precursor gi|56203695|emb|CAI22169.1| CLCA family member 1, chloride channel regulator [Homo sapiens] Length = 914 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 307 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 357 NELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 407 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 458 YASDQVQNNGLIDAFGALSSGNGAVSQR 485 >gi|15929704|gb|AAH15276.1| Inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus] Length = 886 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 103/301 (34%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 166 MYLKVQPKQLVRHFEIDA--HIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSF 223 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + + D+ + E P AP + K Sbjct: 224 KPSLD--QQRSCPTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK- 280 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGL 220 +++ V+DVS SM+ K+ ++ ++LD +K +N + + + Sbjct: 281 --------NIVFVIDVSGSMSG------RKIQQTREALLKILDDVKEDDYLNFILFSTDV 326 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 T+ +VQ P ++ + + + S T GL + A+E + Sbjct: 327 TTWKDHLVQATPA--NLKEAKTFVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERST 384 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 II LTDG+ ++ ++ A +Y +G FL+ A + Sbjct: 385 S----IIIMLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALEN 440 Query: 341 R 341 Sbjct: 441 H 441 >gi|168229682|ref|ZP_02654740.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|194468558|ref|ZP_03074542.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194454922|gb|EDX43761.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|205335570|gb|EDZ22334.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] Length = 596 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 237 LVFLIDTSGSMQ-----PAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALA 287 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 288 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 338 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 339 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 398 Query: 348 SRKLHDAFLR 357 + Sbjct: 399 LSEAQKVLKD 408 >gi|317133828|ref|YP_004089739.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315450290|gb|ADU23853.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1061 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 73/192 (38%), Gaps = 32/192 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATR----SIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + +D S SM+ + D GV T +++ ++ + + + ++TF S Sbjct: 568 AICVDCSGSMSTNDKSFKDDNGVLTCYRNIAVQNYVESMFVFDNAS------IITFESSA 621 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + + + + K + T + ++ A +++ H KK I Sbjct: 622 SEECEMTNNKRTLSGKASFYN-RGGTNANSAIDIAIDEL------------NHVYGKKNI 668 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV 345 I L+DG+ + + +++ YC K G ++ + + + A Q LK A + + Sbjct: 669 ILLSDGDVNV----SDDNIKYC---KNNGIRIHTVALGSGANSQLLKQYANDTGGTPLTA 721 Query: 346 QNSRKLHDAFLR 357 + L + Sbjct: 722 TTAEGLTKIYES 733 >gi|149773083|emb|CAO01891.1| collagen typeVI alpha 5 [Mus musculus] Length = 1212 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 76/190 (40%), Gaps = 21/190 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLD S S+ M L + ++K + V+ G +T+S+ Sbjct: 845 LDIVFVLDHSGSIGPREQESMMNLT---------IHLVKKADVGRDRVQIGALTYSNHPE 895 Query: 229 QTFPLAWGVQH--IQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I E + R G T + L+++ N +F EH ++ + ++ Sbjct: 896 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVLF----TEEHGSRLTQNVRQ 950 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDG + D + E + +G ++A+GV D+ + V Sbjct: 951 LMIVITDGV----SHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHV 1006 Query: 346 QNSRKLHDAF 355 N KL D + Sbjct: 1007 DNFDKLRDIY 1016 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 44/200 (22%), Positives = 76/200 (38%), Gaps = 28/200 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ +++ + S+ +M P N VR G+V +S K Sbjct: 474 DIYFLIDGSSSIRKK---EFEQIQIFMSSVIDM------FPIGPNKVRVGVVQYSHKNEV 524 Query: 230 TFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FP++ I + N T + L++ I K + A Y Sbjct: 525 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALDFILPLIKKGKTERTDRAPC------Y 578 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I LTDG+++ L N + ++AIG+ EA L+ A D Sbjct: 579 LIVLTDGKSND------SVLEPANRLRAEQITIHAIGIG-EANKTQLRQIAGKD---ERV 628 Query: 347 NSRKLHDAFLRIGKEMVKQR 366 N + D+ I E+V + Sbjct: 629 NFGQNFDSLKSIKNEIVHRI 648 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 67/192 (34%), Gaps = 23/192 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ D+M ++D S S G ++ ++ I+ D + G+V FS Sbjct: 655 EDMKADIMFLVDSSGS------IGPTNFETMKTFMKNLVGKIQIGADRSQ---VGVVQFS 705 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + I I+R+ + T G + E + G Sbjct: 706 DYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGG------ALTFVNEYFDLSKGGRPQ 759 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +K++I LTDG+ +L + + ++++GV Q + Sbjct: 760 VRKFLILLTDGKAQDEVGGPATAL------RSKSVTIFSVGVYGANRAQLEEISGDGSLV 813 Query: 343 YSVQNSRKLHDA 354 + V+N L Sbjct: 814 FHVENFDHLKAI 825 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 66/189 (34%), Gaps = 23/189 (12%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-N 214 T S + D D++ ++D S+ + R ++ L+ + S DV N Sbjct: 254 TVPFPTSCQKDSLADLIFLVDESVGTTQNL-----------RDLQNFLENVTSSVDVKDN 302 Query: 215 VVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +R GL++FS + L + Q++I +L + + A ++ Sbjct: 303 CMRLGLMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGA---AIEQMRKEGFS 359 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ + + +T + + + G ++A+G++ Q Sbjct: 360 ESSGSRKAQGVPQIAVLVT------HRASDDMVREAALDLRLEGVTMFAMGIEGANNTQL 413 Query: 333 LKNCASPDR 341 + P R Sbjct: 414 EDIVSYPSR 422 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 82/206 (39%), Gaps = 31/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S + P +R L+ +I S+P N R L +S + Sbjct: 30 DVVFLVDSSNYLGIKSFP----------FVRTFLNRMISSLPIEANKYRVALAQYSDALH 79 Query: 229 QTFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + + + G + K L+ A+ F A + + Sbjct: 80 NEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFP 135 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ L + ++++ + +A + G + ++GVQ +A+++ LK A+ + Sbjct: 136 PILVVL-------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHF 187 Query: 344 SVQNSRKL---HDAFLRIGKEMVKQR 366 +++ +R L RI K++ + R Sbjct: 188 NLRTARDLGMFAPNMTRIIKDVTQYR 213 >gi|300933821|ref|ZP_07149077.1| hypothetical protein CresD4_07088 [Corynebacterium resistens DSM 45100] Length = 676 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 77/216 (35%), Gaps = 49/216 (22%) Query: 172 MMVLDVSLS-MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------- 223 ++V+D+S S M G G +L A ++ ++D + PD N G+V + Sbjct: 53 VLVMDLSDSMMTKDAGGGGTRLDAAKKAATGLIDAL---PDSAN---MGMVVYGQQESNA 106 Query: 224 -SSKIVQTFPLAW-------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +++ + ++++I+ T L A ++ E+ Sbjct: 107 PNNRAAGCKDVETISPVGPINKGELKDRISNFKAKGYTPIGNSLLKAAEELGKEGERS-- 164 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-----RGAIVYAIGVQAEAAD 330 I+ ++DG ++ C AK+ ++ +G A+ Sbjct: 165 -----------IVLVSDGHDTCA------PPPVCEVAKKLAGEGYNLTIHTVGFHADRKA 207 Query: 331 QFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + C S ++ S +N+ +L ++ + + Sbjct: 208 RKELECIAKTSGGQYLSAENASELSNSMKFLATRSM 243 >gi|326927638|ref|XP_003209998.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Meleagris gallopavo] Length = 881 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 76/206 (36%), Gaps = 23/206 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +++ ++D+S SM+ ++ ++ ++LD IK D N + G + Sbjct: 279 KLPKNVIFIIDISGSMSG------REIQQTREALLKILDDIKE-DDHFNFILFGSDVHTW 331 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 K + ++ + + T G+ + + A E G+ K+ Sbjct: 332 KETLIKATPENLDEARKFVRGIDTKGLTNLYGGMMKGIDMLNAAHE-------GNLVPKR 384 Query: 286 ---YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 II LTDG+ + + ++ + +A +Y +G FL+ A ++ Sbjct: 385 SASIIIMLTDGQPNVGISNTQDIQTHVKKAIEGKYTLYNLGFGYGVDYNFLEKMALENKG 444 Query: 342 -----FYSVQNSRKLHDAFLRIGKEM 362 + ++ +L + + M Sbjct: 445 LARRIYPDSDSALQLQGFYDEVSNPM 470 >gi|114682173|ref|XP_001153832.1| PREDICTED: matrilin 4 isoform 1 [Pan troglodytes] Length = 488 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 35/148 (23%), Positives = 61/148 (41%), Gaps = 17/148 (11%) Query: 214 NVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 N R G++ +SS++ FPL A+ + ++ I L+ T + ++YA N F Sbjct: 17 NATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVA 76 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + + + +TDG +A+ RG +YA+GVQ Sbjct: 77 E---GARPPEERVPRVAVIVTDGRPQD------RVAEVAAQARARGIEIYAVGVQRADVG 127 Query: 331 QFLKNCASP---DRFYSVQNSRKLHDAF 355 L+ ASP + + V++ L F Sbjct: 128 S-LRAMASPPLDEHVFLVESF-DLIQEF 153 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 251 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 301 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 302 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 356 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G +VYA+GV + L+ AS Sbjct: 357 PRVGLVFTDGRSQD------DISVWAARAKEEGIVVYAVGVGKAVEAE-LREIAS 404 >gi|326628521|gb|EGE34864.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 9] Length = 596 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 237 LVFLIDTSGSMQ-----PAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALA 287 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 288 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 338 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 339 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIYS 398 Query: 348 SRKLHDAFLR 357 + Sbjct: 399 LSEAQKVLKD 408 >gi|126174069|ref|YP_001050218.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS155] gi|125997274|gb|ABN61349.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS155] Length = 771 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 73/188 (38%), Gaps = 28/188 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ S++ + ++++V+D S SM D + A ++ L +K N Sbjct: 381 VEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLKPEDSFN----- 429 Query: 219 GLVTFSSKIVQ--TFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ F+S + Q PL + + ++ ++RL T+ L+ A + L Sbjct: 430 -IIEFNSSLSQFSATPLPATSSNLSRARQFVSRLQADGGTEMALALDAAL------PKSL 482 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + +IF+TDG + + E++ ++ +G+ + F+ Sbjct: 483 GSAPSDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGESR-----LFTVGIGSAPNSHFM 537 Query: 334 KNCASPDR 341 + A R Sbjct: 538 QRAAELGR 545 >gi|317419330|emb|CBN81367.1| Integrin alpha-M [Dicentrarchus labrax] Length = 1058 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 78/229 (34%), Gaps = 41/229 (17%) Query: 151 APLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + S + S D+ +LD S S+ ++ +++ + Sbjct: 42 RSNRVGSPIPSSLDECRSEADIAFLLDGSGSVASQ------DFTKMKDFVKNLVNSFQGK 95 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPL------AWGVQHIQEKINRL-IFGSTTKSTPGLEYA 262 F+ PL + + + + +I+R+ T + +E+ Sbjct: 96 D----------TKFAIAQFSNAPLVHYYFDTFDINNWRTQIDRIRQLTGGTYTAAAIEHV 145 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 N +FD + KK +I +TDGE + D ++ + + A+ + + +AI Sbjct: 146 VNNVFDPSRGSR------LNVKKVLIVITDGE----SHDRRDLPYAASLAQGKNIVRFAI 195 Query: 323 GVQAE----AADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 GV AA Q L AS + V N L + ++ Sbjct: 196 GVGGAFSNVAAKQELDTIASDPPASHVFRVDNFGALEQIRQNLQDKIFS 244 >gi|114576315|ref|XP_515709.2| PREDICTED: matrilin-3 [Pan troglodytes] Length = 486 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 74/218 (33%), Gaps = 32/218 (14%) Query: 154 LITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + + G LD++ ++D S S+ + + ++D + Sbjct: 61 PASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRIIDTLD 114 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYN 264 P R +V ++S + F L Q +++ + R+ + T S ++ A + Sbjct: 115 IGP---ADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMD 171 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + F + + K I +TDG + A+ G +YA+GV Sbjct: 172 EAFT---VEAGAREPSSNIPKVAIIVTDGRPQD------QVNEVVARAQASGIELYAVGV 222 Query: 325 QAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 + + P + + V+ KL F Sbjct: 223 DRADMESLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 260 >gi|205353430|ref|YP_002227231.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205273211|emb|CAR38174.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] Length = 499 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 140 LVFLIDTSGSMQ-----PAERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALA 190 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 191 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 241 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 242 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIYS 301 Query: 348 SRKLHDAFLR 357 + Sbjct: 302 LSEAQKVLKD 311 >gi|327260894|ref|XP_003215268.1| PREDICTED: collagen alpha-3(VI) chain-like [Anolis carolinensis] Length = 3053 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 78/204 (38%), Gaps = 24/204 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ D++ ++D S S+ + + + D++K + N R GLV F Sbjct: 28 QNNGAADIIFLVDSSWSIGKEHFQLVREF---------LYDVVKQLDVGGNDFRFGLVQF 78 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F L +Q + I+ + G TK+ GLE+ + ++ Sbjct: 79 SGNPHTEFQLNTYHTLQDVLSHISHMPYMGGGTKTGQGLEF----LIRNHLTKVSGSRVS 134 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP- 339 D + ++ LTDG + L K ++AIGVQ + + + P Sbjct: 135 DGTPQVVVVLTDGRSQDDVTLPSSVL------KSADVNMFAIGVQDAVEGELKEIASEPL 188 Query: 340 -DRFYSVQNSRKLHDAFLRIGKEM 362 ++++N LH + + + Sbjct: 189 EIHLFNLENFTALHGIVGDLVESI 212 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 70/199 (35%), Gaps = 19/199 (9%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 +I S D++ +LD S G D + ++D I + + + Sbjct: 1620 EETEIPLDSKKQADIVFLLDSS------INFGRDNFQEVVDFVYGIIDAI---YEEGDSI 1670 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + GLV ++S + F L + I E + R+ + + G A I E Sbjct: 1671 KVGLVQYNSDVSDEFFLKDFTDKEQILEAVKRIAYKGGRTANTG--TAIKHIKAKHFVKE 1728 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ + +T G P D + + + +G V+A+GV+ K Sbjct: 1729 AGSRVDQKVPQIAFIITGG---RPEDDGQTAALALAQ---QGVKVFAVGVRNIDLGDIAK 1782 Query: 335 NCASPDRFYSVQNSRKLHD 353 + + +++L + Sbjct: 1783 LSSDSTTGFRAATAQELSE 1801 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 44/281 (15%), Positives = 87/281 (30%), Gaps = 31/281 (11%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 N L++ G A + + + + + L+A S + + Sbjct: 959 ANALKQAGVATFVIKSRTADPVELERIVFAPQFILNADSLSRI----GEIQPEIVNLLKT 1014 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + I SV + D++ ++D S + F ++ ++D + P Sbjct: 1015 IEIRESVCDEIQRK---DVVFLIDGSDATRSSFPE-------LKSFVQRVVDSLDVGPGK 1064 Query: 213 NNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 VR +V +S+ F L + + R+ G A N + Sbjct: 1065 ---VRVAVVQYSNDANTEFNLNEYSDKASVITAVQRMTAMGGYAVNTG--AALNYLISNV 1119 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E ++ + +++I LT + E K RGA+ IG Sbjct: 1120 FTREAGSRVQEGVPQFVILLT------AERSRDDVRRPALELKTRGAVPLGIGFGNADIT 1173 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 Q P+ V +L RI + + ++ K Sbjct: 1174 QLQTISFVPEFAVFVSGVSEL----GRIQQLIAERVTRLTK 1210 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 78/192 (40%), Gaps = 21/192 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D++ ++D S ++ GP + + +++ + + +R G+VT+S Sbjct: 232 AQESADIIFLIDGS----NNIGPVI--FATVRDFVANVIERLSV---GSESIRVGVVTYS 282 Query: 225 SKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + + E + L F ++ G A + ++ + Sbjct: 283 DQSRTAFFLNSHTRKADVLEAVKALSFPGGEEANIG--EALEFVVQNHFNRSGGSRIEEH 340 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 + ++ + +S + D +E + K+ G ++IGV+ + L+ A+ F Sbjct: 341 VPQVLVLI---SSSESSDDIREGVL---AMKQAGVFSFSIGVKNADNVE-LQQIATDGSF 393 Query: 343 -YSVQNSRKLHD 353 +++ ++R L D Sbjct: 394 VFTILDTRNLGD 405 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 68/217 (31%), Gaps = 25/217 (11%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 V I+S D+ + D S ++ F + +L + PD Sbjct: 823 VEEVVITSTESKS-DIFFLFDGSSNLAGQFP-------AVREFLLRVLSDLNLGPDA--- 871 Query: 216 VRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTT--KSTPGLEYAYNKIFDAKE 271 R + FS + F Q I +++ ++ LE A +F Sbjct: 872 TRVAVAQFSDNVQVEFNFQDIPSKQEILQRVKKMRIKGGRSLNIGAALETAMRDVF---- 927 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + ++ + ++++ L G +S + N K+ G + I + + Sbjct: 928 VRQAGSRIEEGVPQFLVLLAAGRSSD------DVDQPANALKQAGVATFVIKSRTADPVE 981 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + +P + + ++ + I + I Sbjct: 982 LERIVFAPQFILNADSLSRIGEIQPEIVNLLKTIEIR 1018 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 69/214 (32%), Gaps = 25/214 (11%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI---GLDMMMVLDVSL 179 Y + ++P + + + SS +DI D++ ++D S Sbjct: 1385 PQYVFQVSTYQDLPTLEQQLISPVTTLTTQQIQGLIADTSSPTDIDSEAKDIVFLIDSSD 1444 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GV 237 ++ F D + + II+ + + VR G+V FS+ + F L Sbjct: 1445 NVGADFAHIRDFI----------IRIIQQLDVRSRKVRIGVVQFSNNVFPEFFLKTHPTK 1494 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + I R+ T G Y + + ++++ L G + Sbjct: 1495 NAVLQAIRRMRPRGGTPLNVGKALDYVVKNHFIKSAGSRREDGVP--QHLVLLLGGRSQD 1552 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + G ++GV A+ AD Sbjct: 1553 DVGRPSNVIL------SSGIK--SLGVGAKNADS 1578 >gi|238913524|ref|ZP_04657361.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] Length = 596 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 237 LVFLIDTSGSMQ-----PAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALA 287 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 288 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 338 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 339 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 398 Query: 348 SRKLHDAFLR 357 + Sbjct: 399 LSEAQKVLKD 408 >gi|159110717|ref|NP_032433.2| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Mus musculus] Length = 889 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 103/301 (34%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 169 MYLKVQPKQLVRHFEIDA--HIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSF 226 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + + D+ + E P AP + K Sbjct: 227 KPSLD--QQRSCPTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK- 283 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGL 220 +++ V+DVS SM+ K+ ++ ++LD +K +N + + + Sbjct: 284 --------NIVFVIDVSGSMSG------RKIQQTREALLKILDDVKEDDYLNFILFSTDV 329 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 T+ +VQ P ++ + + + S T GL + A+E + Sbjct: 330 TTWKDHLVQATPA--NLKEAKTFVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERST 387 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 II LTDG+ ++ ++ A +Y +G FL+ A + Sbjct: 388 S----IIIMLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALEN 443 Query: 341 R 341 Sbjct: 444 H 444 >gi|695636|emb|CAA49843.1| inter-alpha-inhibitor H3 chain [Mus musculus] Length = 886 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 103/301 (34%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 166 MYLKVQPKQLVRHFEIDA--HIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSF 223 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + + D+ + E P AP + K Sbjct: 224 KPSLD--QQRSCPTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK- 280 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGL 220 +++ V+DVS SM+ K+ ++ ++LD +K +N + + + Sbjct: 281 --------NIVFVIDVSGSMSG------RKIQQTREALLKILDDVKEDDYLNFILFSTDV 326 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 T+ +VQ P ++ + + + S T GL + A+E + Sbjct: 327 TTWKDHLVQATPA--NLKEAKTFVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERST 384 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 II LTDG+ ++ ++ A +Y +G FL+ A + Sbjct: 385 S----IIIMLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALEN 440 Query: 341 R 341 Sbjct: 441 H 441 >gi|94501046|ref|ZP_01307570.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] gi|94426793|gb|EAT11777.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] Length = 867 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 50/234 (21%), Positives = 82/234 (35%), Gaps = 42/234 (17%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 AN + + + K S D+ +++D+S SM D+ + I + Sbjct: 22 AISANENLTDIPEGAEAKFLPTSKSS-DVRVLIDMSGSMKDNDPENLR--------IPAL 72 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTP 257 I++ +PD + G+ TF + P A W ++ FG T Sbjct: 73 NLIVQLLPDGSQA---GVWTFGQWVNMLIPPAEVNSEWRTNAKEKAKMINSFGLRTNIGE 129 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-----ENSSPNIDNKESLF----- 307 +E A K+ + +H I LTDG + P DNK Sbjct: 130 AMERATWKLAADSDFEQHA-----------ILLTDGIVDIAADDDPQKDNKNEAERQRIL 178 Query: 308 --YCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 +E K G ++ I + A L+ A + V+NS +L AFL Sbjct: 179 TDVLSEYKNLGVKIHTIALSNAADKVLLEKLALETGGMAEVVENSEQLVKAFLN 232 >gi|148252253|ref|YP_001236838.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1] gi|146404426|gb|ABQ32932.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1] Length = 755 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 44/265 (16%), Positives = 94/265 (35%), Gaps = 37/265 (13%) Query: 112 TSLSIIIDDQHKDYNLSAVS-RYEMPFIFCTFPWCANSSHAPLLITSS-VKISSKSDIGL 169 +L+ + +D+ L+ MP + ++ + IT V S++ Sbjct: 298 VTLAEGVAPADRDFELTWKPASVAMPSVGLFHEQVGDADYLLAFITPPAVAASAQRPQPR 357 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S SM + A S+ L ++ R ++ F + Sbjct: 358 DVIFVIDNSGSMGG------TSIRQAKASLLYALGRLQPND------RFNVIRFDDTMTV 405 Query: 230 TFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 FP A V ++ L T+ P + A + D + + Sbjct: 406 LFPSSVPADAEHVGSATSFVSALEARGGTEMVPAMRAA---LTDDGSDSDRV-------- 454 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--F 342 + ++FLTDG I N++ LF A R + ++ +G+ + + A R F Sbjct: 455 RQVVFLTDG-----AIGNEQQLFETITAMRGRSRIFMVGIGSAPNTYLMTRAAELGRGAF 509 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRI 367 + + ++ + + ++ + Sbjct: 510 TPIGSVEQVEERMRDLFAKLENPVV 534 >gi|254443293|ref|ZP_05056769.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198257601|gb|EDY81909.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 632 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/301 (13%), Positives = 98/301 (32%), Gaps = 30/301 (9%) Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII 118 +AT+I + G Q + +I + E F++ I + Sbjct: 41 SATRIRTKLGATVGGAQDIRYLRNLIDEGIIPSPASFTAEGLFSEHDLPIGGDAKEGWLF 100 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 D + + + ++ ++ + A L S + ++ L+++ V+D S Sbjct: 101 DIASQATSFESAAQPKVDIL------------AQLGFVSGIDATTFKPAPLNLVAVVDKS 148 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 SM+ D L + +S+R+++ + S ++ V+ + +T Sbjct: 149 GSMSG------DPLELVRKSLRQVVSQLGSDDQLSIVLYGSSTHIHLEPTKTS--TENRD 200 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I I+R+ +T GLE Y + + + ++ TD + Sbjct: 201 QIIASIDRIQSHGSTAMEAGLELGYQVARQSADAFVGKTR--------VMLFTDERPNVG 252 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFL 356 D + + + IGV + + +S + + + F Sbjct: 253 RTDATGFMAMAESGSKSDIGLTTIGVGVHFGAELAEKISSVRGGNLFFFDDDESMETTFR 312 Query: 357 R 357 + Sbjct: 313 K 313 >gi|197250621|ref|YP_002147271.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|197214324|gb|ACH51721.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] Length = 598 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 239 LVFLIDTSGSMQ-----PAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALA 289 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 290 STAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 340 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 341 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 400 Query: 348 SRKLHDAFLR 357 + Sbjct: 401 LSEAQKVLKD 410 >gi|313212817|emb|CBY36735.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 66/178 (37%), Gaps = 25/178 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + LD++ V+D S S+ D + + + D R + T+S Sbjct: 177 TTKALDIVFVVDESGSVGP------DNFELVKLFLIDYAQDSNIAADA---TRIAIRTYS 227 Query: 225 SKIVQTFPL-AWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + ++I +IN L+ T + + N + + + Sbjct: 228 TNSDLDFSLNDFKTRNIISEINNLVYASGGTNTADAITKGLNDFGNDR----------SE 277 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 K ++ +TDG++S ++ L + R +AIG+ + L+ A+ D Sbjct: 278 SVKIMVTITDGQSSYDHVKAAADLLKADP---RNIQSFAIGIDGANMAE-LQAIATTD 331 >gi|297279075|ref|XP_001109489.2| PREDICTED: calcium-activated chloride channel regulator 4-like [Macaca mulatta] Length = 931 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 70/197 (35%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G D+L ++ + L I V N G+V F S Sbjct: 307 VCLVLDKSGSMA-----GYDRLNQMNKAAKYFLLQI-----VENGSWVGMVHFDSTATII 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + + T G++ A+ I E H+ Sbjct: 357 NKPIQIISSDERNTLLAWL-PTYASGGTSICSGIKSAFQVIG---ELSSHLDGSE----- 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 ++ LTDGE+ + + +E KR GAIV+ I + A + FY Sbjct: 408 -VVLLTDGEDYTAS-------SCIDEVKRSGAIVHFIALGTAADKAVIEMSKITGGRHFY 459 Query: 344 SVQNSRK--LHDAFLRI 358 + ++ L DAF + Sbjct: 460 ASDKAQNNGLIDAFGDL 476 >gi|194374787|dbj|BAG62508.1| unnamed protein product [Homo sapiens] Length = 677 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 70 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 119 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 120 SELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTDGSE--------- 169 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 170 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 220 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 221 YASDQVQNNGLIDAFGALSSGNGAVSQR 248 >gi|126306106|ref|XP_001362488.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 911 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 51/209 (24%), Positives = 81/209 (38%), Gaps = 43/209 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM D+L ++ + L I + +G+VTF S Sbjct: 307 LILVLDKSGSMAGE-----DRLNRLNQASQLFLLQI-----IEKGSWTGMVTFDSSATIQ 356 Query: 231 FPL---AWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L Q I+RL G T GL A+ I Sbjct: 357 SALIQIETDAQR-NSLISRLPTAAGGGTSICSGLRTAFTVIKKKFSTHGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 I+ LTDGE+S ++ C +E K+ GAI++ + + +AD L+ A + Sbjct: 407 -IVLLTDGEDS--------TISSCFDEVKQSGAIIHTVALG-PSADPGLEKLAEMTGGMK 456 Query: 343 YSVQNSRK---LHDAFLRI--GKEMVKQR 366 S ++ + L DAF + G + QR Sbjct: 457 TSATDNAQNNGLIDAFSALSSGNGAITQR 485 >gi|161784288|sp|Q61704|ITIH3_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor Length = 889 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 103/301 (34%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 169 MYLKVQPKQLVRHFEIDA--HIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSF 226 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + + D+ + E P AP + K Sbjct: 227 KPSLD--QQRSCPTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK- 283 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGL 220 +++ V+DVS SM+ K+ ++ ++LD +K +N + + + Sbjct: 284 --------NIVFVIDVSGSMSG------RKIQQTREALLKILDDVKEDDYLNFILFSTDV 329 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 T+ +VQ P ++ + + + S T GL + A+E + Sbjct: 330 TTWKDHLVQATPA--NLKEAKTFVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERST 387 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 II LTDG+ ++ ++ A +Y +G FL+ A + Sbjct: 388 S----IIIMLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALEN 443 Query: 341 R 341 Sbjct: 444 H 444 >gi|148692826|gb|EDL24773.1| inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus] Length = 886 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 103/301 (34%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 167 MYLKVQPKQLVRHFEIDA--HIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSF 224 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + + D+ + E P AP + K Sbjct: 225 KPSLD--QQRSCPTCTDSLLNGDFTIVYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK- 281 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGL 220 +++ V+DVS SM+ K+ ++ ++LD +K +N + + + Sbjct: 282 --------NIVFVIDVSGSMSG------RKIQQTREALLKILDDVKEDDYLNFILFSTDV 327 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 T+ +VQ P ++ + + + S T GL + A+E + Sbjct: 328 TTWKDHLVQATPA--NLKEAKTFVKNIHDQSMTNINDGLLKGIEMLNKAREDHTVPERST 385 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 II LTDG+ ++ ++ A +Y +G FL+ A + Sbjct: 386 S----IIIMLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALEN 441 Query: 341 R 341 Sbjct: 442 H 442 >gi|57524519|ref|NP_001004007.1| matrilin 3a [Danio rerio] gi|51330145|gb|AAH80220.1| Matrilin 3a [Danio rerio] gi|123233072|emb|CAM15633.1| novel protein similar to vertebrate matrilin 3 (MATN3) (zgc:101120) [Danio rerio] gi|158254363|gb|AAI54374.1| Matn3a protein [Danio rerio] Length = 337 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 77/216 (35%), Gaps = 27/216 (12%) Query: 151 APLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 P + S+ LD++ ++D S S+ + + +M+D + Sbjct: 44 LPHRTLNPAATDSQCRSRPLDLVFIIDSSRSVRP------GEFEKVKIFLADMVDTLDVG 97 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKI 266 PD R +V ++S + F L I++ I R+ + T + ++ A ++ Sbjct: 98 PDA---TRVAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEA 154 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 F K +K K I +TDG + + A+ G +YA+GV Sbjct: 155 FTEKSGARPKSKN---ISKVAIIVTDGRPQDQVEEVSAA------ARASGIEIYAVGVDR 205 Query: 327 EAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 ++P D + V+ KL F Sbjct: 206 ADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|309792255|ref|ZP_07686727.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225796|gb|EFO79552.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 391 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 80/226 (35%), Gaps = 33/226 (14%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 LL+ ++ + + L+ +VLD S SM G + + ATR + E L + Sbjct: 4 LLVEATPVAAPPVPVPLNFCLVLDRSGSMQ---GAKLQSMKAATRKVIETLTDQDVVS-- 58 Query: 213 NNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 +V F + P LA + I+ + T + G++ ++ Sbjct: 59 -------IVIFDDTVQTLVPATLATDRTALLAAIDTISEAGGTAMSLGMQAGQVEL---- 107 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 D +++ LTDG+ D + + + A+G+ AE + Sbjct: 108 -----QKHSGPDRLSHMLLLTDGQTWG---DEETCRNIARALGQADVRITALGLGAEWNE 159 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLRI-----GKEMVKQRILY 369 Q L + A S + ++ ++ F R G R+L Sbjct: 160 QLLDDLAEFSDGTSDYIADANQIGTFFQRAIRSAQGTAATDARLLL 205 >gi|332217050|ref|XP_003257666.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like [Nomascus leucogenys] Length = 941 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 74/197 (37%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 343 Query: 229 QTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + ++ I++ I+ + T L+ A + + + H G Sbjct: 344 KDYLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSDIGDRSVS- 399 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 ++FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 400 LVVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTR 459 Query: 341 RFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 460 RVHEEEDAGSQLIGFYD 476 >gi|330862285|emb|CBX72446.1| hypothetical protein YEW_HH31780 [Yersinia enterocolitica W22703] Length = 457 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 46/242 (19%), Positives = 80/242 (33%), Gaps = 33/242 (13%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F + F N +G+I I I+ P ++ + E SH KAKL ++ + L A Sbjct: 11 FNHFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALA-- 68 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I N E + ++ KN + + L F I NI +T Sbjct: 69 IENNEIPDEPQQIKN-------NALVLSYVNAYLPSKKFLVPIININDNTHYLEYNAAVT 121 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Y +S+ N + + + D++ V D S SM Sbjct: 122 MAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEASEPT-------DVIFVADYSGSML 174 Query: 183 DHFGPGM----DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 +F +++ + R++ DII + + P +WG + Sbjct: 175 YNFNENKPRDHERIDALRSAFRKLHDIIMDNS-------------NINAIGYIPFSWGTK 221 Query: 239 HI 240 I Sbjct: 222 RI 223 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 45/274 (16%), Positives = 91/274 (33%), Gaps = 43/274 (15%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS-SHAPLLITS 157 + F + + I +++++ I V + +C FP+ ++ Sbjct: 193 SAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTYCHFPFSPKIHKPKGNYLSD 252 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +K SS + +++LD + D + +D + ++I + ++ Sbjct: 253 EIKRSSNT------LLLLDYIGDIID-YDKTIDSITGNAQTIDIPMSDVRFGDVCLQ--- 302 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G +S + Q + + I + T + G+ A N IF K K H Sbjct: 303 -GSNAYSLEQEQYI-------NNIDNIIEMEPHGWTLISSGILSA-NNIFKNKAKNGH-- 351 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLF--------YCNEAKRRGAIVYAIGVQAEAA 329 KK +I L+DG ++ +K + C E K + I + Sbjct: 352 ------KKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSPD 405 Query: 330 DQF-------LKNCASPDRFYSVQNSRKLHDAFL 356 + K C D +Y N+ +L Sbjct: 406 NNKNEPYHINWKKCVGEDNYYEAHNAHELEHKLQ 439 >gi|148665999|gb|EDK98415.1| matrilin 3 [Mus musculus] Length = 482 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 71/223 (31%), Gaps = 26/223 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 A ++ + LD++ ++D S S+ + + + Sbjct: 51 HLSALATSTRAPYSGGRGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRI 104 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGL 259 +D + R +V ++S + F L Q +++ + R+ + T S + Sbjct: 105 IDTLDI---GATDTRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPLSTGTMSGLAI 161 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + A + F + K I +TDG + A+ G + Sbjct: 162 QTAMEEAFT---VEAGARGPMSNIPKVAIIVTDGRPQD------QVNEVAARARASGIEL 212 Query: 320 YAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 YA+GV + + P + + V+ KL F Sbjct: 213 YAVGVDRADMESLKMMASKPLEEHVFYVETYGVIEKLSARFQE 255 >gi|306518578|ref|NP_034900.4| matrilin-3 precursor [Mus musculus] gi|6918887|emb|CAB72265.1| matrilin-3 [Mus musculus] gi|26339344|dbj|BAC33343.1| unnamed protein product [Mus musculus] Length = 481 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 71/223 (31%), Gaps = 26/223 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 A ++ + LD++ ++D S S+ + + + Sbjct: 51 HLSALATSTRAPYSGGRGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRI 104 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGL 259 +D + R +V ++S + F L Q +++ + R+ + T S + Sbjct: 105 IDTLDI---GATDTRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPLSTGTMSGLAI 161 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + A + F + K I +TDG + A+ G + Sbjct: 162 QTAMEEAFT---VEAGARGPMSNIPKVAIIVTDGRPQD------QVNEVAARARASGIEL 212 Query: 320 YAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 YA+GV + + P + + V+ KL F Sbjct: 213 YAVGVDRADMESLKMMASKPLEEHVFYVETYGVIEKLSARFQE 255 >gi|14548114|sp|O35701|MATN3_MOUSE RecName: Full=Matrilin-3; Flags: Precursor gi|2342635|emb|CAA71532.1| matrilin-3 [Mus musculus] Length = 481 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 71/223 (31%), Gaps = 26/223 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 A ++ + LD++ ++D S S+ + + + Sbjct: 51 HLSALATSTRAPYSGGRGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRI 104 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGL 259 +D + R +V ++S + F L Q +++ + R+ + T S + Sbjct: 105 IDTLDI---GATDTRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPLSTGTMSGLAI 161 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + A + F + K I +TDG + A+ G + Sbjct: 162 QTAMEEAFT---VEAGARGPMSNIPKVAIIVTDGRPQD------QVNEVAARARASGIEL 212 Query: 320 YAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 YA+GV + + P + + V+ KL F Sbjct: 213 YAVGVDRADMESLKMMASKPLEEHVFYVETYGVIEKLSARFQE 255 >gi|291232650|ref|XP_002736273.1| PREDICTED: mind bomb 2-like [Saccoglossus kowalevskii] Length = 847 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/195 (21%), Positives = 68/195 (34%), Gaps = 12/195 (6%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD ++ LD S SM G G+ +L A + + + V +V F SK Sbjct: 441 PLDTVLCLDTSGSMA---GRGLRELKKACTEFLLGIQQTATQTGLRENV--AVVEFGSKT 495 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L + + I+ L G TT GL A ++ L + I Sbjct: 496 RIVRNLTDNYRLTKNAIDSLQAGGTTPMFEGLMEAMKEVIQNGGVLTLPGGKKMTPR--I 553 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRG----AIVYAIGVQAEAADQFLKNCASPDR-F 342 I +TDG + K ++ + K G + +G + + L A Sbjct: 554 ILMTDGRPDDKDQVLKAAMSFGPLWKAVGLPFPIPIACVGCGPDVDKELLAVIAKVTNGM 613 Query: 343 YSVQNSRKLHDAFLR 357 + V + +L D F R Sbjct: 614 FVVGDISQLGDFFRR 628 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 57/177 (32%), Gaps = 9/177 (5%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 G G+ +L A + + + + V +V F L ++ I+ Sbjct: 3 GIGIAELKRAGNEFLIGVQLTANQTGLRENV--AVVEFGRNTRIVQSLTDNYVSVKRAID 60 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 L+ G TT GL A +I L KG +I +TDG + K + Sbjct: 61 SLVPGGTTPMFEGLMEAMKEIISNGGVL--TLKGGKKMTPRVILMTDGYGDDKDKVAKSA 118 Query: 306 LFYCNEAKRRG----AIVYAIGVQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLR 357 + + K G + +G + L A + Y +L F R Sbjct: 119 MSFGPLWKAVGLPHPIPIACVGCGPNVDKELLAIIAEITNGMYVTGEMGQLSGFFRR 175 >gi|239814248|ref|YP_002943158.1| von Willebrand factor type A [Variovorax paradoxus S110] gi|239800825|gb|ACS17892.1| von Willebrand factor type A [Variovorax paradoxus S110] Length = 345 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 40/268 (14%), Positives = 75/268 (27%), Gaps = 59/268 (22%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVAT 196 F A A + V + S +++ +DVS SM D + A Sbjct: 59 FLFLLAMAAMLVAAARPMAVVMLPSNQQT---IILAMDVSGSMRAADVLPNRLVAAQEAA 115 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 +S + D+ V+ G+V F+ + I+ T + Sbjct: 116 KSFIK---------DLPRHVKVGIVAFAGSAQVAQLPTTNHDDLITAIDSFQLQRATATG 166 Query: 257 PGLEYAYNKIFDAKE------------------KLEHIAK------GHDDYKKYIIFLTD 292 + + +F + E AK +I LTD Sbjct: 167 NAIVVSLATLFPDAGIDVSQFSAPSRQRGTPIDQAEKQAKEFTPVAPGSYTSAAVIMLTD 226 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---------------LKNCA 337 G+ ++ + L A RG +Y +GV + LK A Sbjct: 227 GQRTTGV----DPLDAAKAAADRGVRIYTVGVGTVDGETIGFEGWSMRVRLDEETLKAVA 282 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMV 363 + ++ + L + + + Sbjct: 283 NKTQAEYFYAGTAADLKKVYETLSSRLT 310 >gi|73538303|ref|YP_298670.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121640|gb|AAZ63826.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 358 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 68/231 (29%), Gaps = 54/231 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D+S SM ++ A ++ R +LD V G+V + Sbjct: 98 VILVIDLSGSMRAQDVRP-SRIRAAQQAARVLLDA------QPAGVSVGVVAMAGTAALA 150 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK----- 285 + + I L T GL A + D + Sbjct: 151 QAPSHSKDDVATAIEGLKPQGGTALGNGLLIALTTLLPQTTNDAERLMNGGDVAQPGKPG 210 Query: 286 ---------------------YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 I+ +DGE++S ++ A G VY +GV Sbjct: 211 KAAPGELDNGEPVRPGSYASGAIVLFSDGESNSG----PGAVQAAQLAATYGVRVYTVGV 266 Query: 325 QAEAA---------------DQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 ++ LK A + ++ ++++ L + + Sbjct: 267 GTTEGVVLSADGWSARVRLDEKVLKQVADTTGAEYFRLEDTAALKKVYRAL 317 >gi|331006836|ref|ZP_08330094.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] gi|330419347|gb|EGG93755.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] Length = 693 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 80/217 (36%), Gaps = 36/217 (16%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ D+ +V+D+S SM + R +ML ++ +P + G+ T Sbjct: 21 PEAAKPSDVRLVIDISGSMKKNDP------QNLRRPALDML--VQLLPKGSKA---GIWT 69 Query: 223 FSSKIVQTFPLA-----WGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 F + P WG +I + T LE A + Sbjct: 70 FGQYVNMLVPHKPVDAQWGRTASAASSEIKSI--AQFTNIGAALEKA-------AYDHKQ 120 Query: 276 IAKGHDDYKKYIIFLTDG---ENSSPNIDNKESLFYCNEA----KRRGAIVYAIGVQAEA 328 K DY+ ++I LTDG + ++ KE N ++ G ++ I + A Sbjct: 121 QMKADQDYQTHVILLTDGMVDIDRDNRLNKKERQRILNNVLPMYQQSGITLHTIALSDNA 180 Query: 329 ADQFLKN--CASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + L A+ + +N+ +L + FLR+ + V Sbjct: 181 DKKLLNKLALATDGKVSVAKNAEELMNVFLRVFNQAV 217 >gi|312621090|ref|YP_003993818.1| protein tadg, associated with flp pilus assembly [Photobacterium damselae subsp. damselae] gi|311872811|emb|CBX86902.1| Protein TadG, associated with Flp pilus assembly [Photobacterium damselae subsp. damselae] Length = 436 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 64/441 (14%), Positives = 128/441 (29%), Gaps = 88/441 (19%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 +G SIL AI++PV+F + L + + KA++ + + L A N Sbjct: 2 KLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH--NDP 59 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 N N+ R I + + ++ + I R I + Sbjct: 60 NVNSDGLGSGSKVNRRIATDYLKAYITDI----DSISSLKIYRRNCEDIPECSSGLNKGK 115 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAP-LLITSSVKISSKSDI----GLDMMMVLDVSLSMN 182 S YE+ + W ++ T S + S + +D++ D S SM Sbjct: 116 SRFFEYEVEALTTQNSWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAADFSKSME 175 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR-----SGLVTFSSKIVQTF------ 231 + + G K R I ++ ++ ++N + G+ ++S F Sbjct: 176 EPWTGGRQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKFDNYNSC 235 Query: 232 --------PLAWGVQH-----IQEKINRLIFGST--------------------TKSTPG 258 + + I+ +N + T Sbjct: 236 FMKQDYFEKNSRDHRKKKYVDIKRTLNNIFIEKGNDSCGFKSDDPDAVFHDIYLTNDFDT 295 Query: 259 LEYAYNKIFDAKE--------KLEHIAKGHDDYKKYIIFLTDGENSSPNIDN--KESLFY 308 K + + + + ++ +I ++DG + + Sbjct: 296 FNKEIMKFRPGNGTASYQGIIRSAQMLRKGTNSRRLLIIISDGNDWYYPYSGYKETDKEI 355 Query: 309 CNEAKRRGA---------------------IVYAIGVQAEAAD-QFLKNCASPDRFYSVQ 346 N+ G + IG +A + L NCA D + Q Sbjct: 356 ANKLVNAGMCNKIRETLNLDKTPSGQEIKTRIAVIGFDYDANKNKALLNCAGEDNVFKAQ 415 Query: 347 NSRKLHD-AFLRIGKEMVKQR 366 +L D I +E+ + Sbjct: 416 YRDELLDQILSLITEEIGHLK 436 >gi|15488640|gb|AAH13465.1| inter-alpha trypsin inhibitor, heavy chain 1 [Mus musculus] Length = 909 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 78/203 (38%), Gaps = 26/203 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L+ ++ + + + LV F Sbjct: 286 TNMSKNLVFVIDISGSMEGQ------KVRQTKEALLKILEDMRPVDNFD------LVLFG 333 Query: 225 SKIVQ----TFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 SK+ P++ +Q Q+ + R T GL + A+ ++ Sbjct: 334 SKVQSWKGSLVPVSNANLQAAQDFVRRFSLAGATNLNGGLLRGIEILNKAQGSHPELSSP 393 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I LTDGE + D + L A R +Y +G + FL+ ++ Sbjct: 394 AS----ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTE 449 Query: 340 DR-----FYSVQNSRKLHDAFLR 357 + Y ++ + F Sbjct: 450 NNGWAQRIYEDHDATQQLQGFYN 472 >gi|145299821|ref|YP_001142662.1| flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|88866595|gb|ABD57363.1| FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|142852593|gb|ABO90914.1| putative flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] Length = 460 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 46/121 (38%), Gaps = 12/121 (9%) Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA-------KEKLEHIAKGHDDYKKYIIF 289 ++ ++ L T + G+ + + + G D +K ++ Sbjct: 321 RAEYRQALDTLYAAFNTNTAEGVMWGWRLLSPQWQGRWGQGAAELPRPYGQADNRKIMVL 380 Query: 290 LTDGENSSPNID--NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYSVQ 346 +DGE+ P +++ L C E KR+G VY + E +F+ CAS Y Sbjct: 381 FSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAF--EGDARFVAQCASDRSHAYKAT 438 Query: 347 N 347 + Sbjct: 439 S 439 >gi|20381126|gb|AAH28814.1| Itih1 protein [Mus musculus] Length = 911 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 78/203 (38%), Gaps = 26/203 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L+ ++ + + + LV F Sbjct: 288 TNMSKNLVFVIDISGSMEGQ------KVRQTKEALLKILEDMRPVDNFD------LVLFG 335 Query: 225 SKIVQ----TFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 SK+ P++ +Q Q+ + R T GL + A+ ++ Sbjct: 336 SKVQSWKGSLVPVSNANLQAAQDFVRRFSLAGATNLNGGLLRGIEILNKAQGSHPELSSP 395 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I LTDGE + D + L A R +Y +G + FL+ ++ Sbjct: 396 AS----ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTE 451 Query: 340 DR-----FYSVQNSRKLHDAFLR 357 + Y ++ + F Sbjct: 452 NNGWAQRIYEDHDATQQLQGFYN 474 >gi|3024048|sp|Q61702|ITIH1_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|695632|emb|CAA49841.1| inter-alpha-inhibitor H1 chain [Mus musculus] Length = 907 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 78/203 (38%), Gaps = 26/203 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L+ ++ + + + LV F Sbjct: 284 TNMSKNLVFVIDISGSMEGQ------KVRQTKEALLKILEDMRPVDNFD------LVLFG 331 Query: 225 SKIVQ----TFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 SK+ P++ +Q Q+ + R T GL + A+ ++ Sbjct: 332 SKVQSWKGSLVPVSNANLQAAQDFVRRFSLAGATNLNGGLLRGIEILNKAQGSHPELSSP 391 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +I LTDGE + D + L A R +Y +G + FL+ ++ Sbjct: 392 AS----ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTE 447 Query: 340 DR-----FYSVQNSRKLHDAFLR 357 + Y ++ + F Sbjct: 448 NNGWAQRIYEDHDATQQLQGFYN 470 >gi|281344227|gb|EFB19811.1| hypothetical protein PANDA_002988 [Ailuropoda melanoleuca] Length = 395 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 72/202 (35%), Gaps = 28/202 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ ++D S S+ + + +++D + R +V ++ Sbjct: 5 KSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSQIIDTLDI---GAADTRVAVVNYA 55 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + F L Q +++ + R+ + T S ++ A ++ F Sbjct: 56 STVKTEFHLQTYSDKQSLKQAVARITPLSTGTMSGLAIQTAMDEAFT---VEAGARGPTS 112 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + K I +TDG + A+ G +YA+GV A + LK AS Sbjct: 113 NIPKVAIIVTDGRPQD------QVNEVAARARASGIELYAVGVD-RADMESLKMIASEPL 165 Query: 339 PDRFYSVQN---SRKLHDAFLR 357 + + V+ KL F Sbjct: 166 DEHVFYVETYGVIEKLSSRFQE 187 >gi|148692830|gb|EDL24777.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_d [Mus musculus] Length = 651 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L+ ++ + + ++V G S Sbjct: 284 TNMSKNLVFVIDISGSMEGQ------KVRQTKEALLKILEDMRPVDNF-DLVLFGSKVQS 336 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + R T GL + A+ ++ Sbjct: 337 WKGSLVPASNANLQAAQDFVRRFSLAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 393 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDGE + D + L A R +Y +G + FL+ ++ + Sbjct: 394 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWA 452 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 453 QRIYEDHDATQQLQGFYN 470 >gi|148692829|gb|EDL24776.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_c [Mus musculus] Length = 658 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L+ ++ + + ++V G S Sbjct: 291 TNMSKNLVFVIDISGSMEGQ------KVRQTKEALLKILEDMRPVDNF-DLVLFGSKVQS 343 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + R T GL + A+ ++ Sbjct: 344 WKGSLVPASNANLQAAQDFVRRFSLAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 400 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDGE + D + L A R +Y +G + FL+ ++ + Sbjct: 401 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWA 459 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 460 QRIYEDHDATQQLQGFYN 477 >gi|218261918|ref|ZP_03476586.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] gi|218223694|gb|EEC96344.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] Length = 339 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 61/174 (35%), Gaps = 20/174 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+++M+ LDVS SM D+L A + + ++ D + + Sbjct: 80 SKLETVKRQGVEIMVCLDVSNSMLAEDVSP-DRLSKAKQMLSKLTDGFSN-------DKV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+ F+ P+ + ++ + + + A N Sbjct: 132 GLIVFAGDAFTQLPITSDYVSAKMFLSSINPSMVSTQGTAIGAAIN-------LAVRSFT 184 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K II +TDGEN +++ A +G V +G+ Sbjct: 185 PSETSDKAIILITDGENHE-----DDAVKAAAAAAEKGIHVNIVGMGDPKGSPI 233 >gi|221111396|ref|XP_002160892.1| PREDICTED: similar to proximal thread matrix protein 1 [Hydra magnipapillata] Length = 315 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 73/208 (35%), Gaps = 21/208 (10%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 ++ + + + +D+ +LD S S+ + + L Sbjct: 108 ATTQASSTDGHVQPRCEAVVDVAFILDSSHSLEASYQKEKNFLKKLAAVFGI-------- 159 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKI 266 +N R G++TFS + + L + E ++++ TT+ L A + Sbjct: 160 --SSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDM 217 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 F + G K II LTDG + P D ++ +E + G ++ +G+ + Sbjct: 218 FTSA------NGGRVGVSKLIILLTDGSQT-PGGDAEDPERIADELRNDGVVILGVGIGS 270 Query: 327 EAADQFLKNCASP-DRFYSVQNSRKLHD 353 + L + Y+ L D Sbjct: 271 AVNETELSHITGGKKNAYTAATFDSLTD 298 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 2/72 (2%) Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DR 341 K II LTDG + P D ++ +E + G ++ +G+ + + L + Sbjct: 12 VSKLIILLTDGSQT-PGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHITGGKKN 70 Query: 342 FYSVQNSRKLHD 353 Y+ L D Sbjct: 71 AYTAATFDSLTD 82 >gi|291388581|ref|XP_002710673.1| PREDICTED: collagen, type XXII, alpha 1 [Oryctolagus cuniculus] Length = 1571 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 72/208 (34%), Gaps = 36/208 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + P R G+V +S + Sbjct: 38 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGP---EHTRVGVVRYSDRPST 88 Query: 230 TFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L G +E++ G T + L + + F G ++ Sbjct: 89 AFEL--GHFSSREEVKAAARRIAYHGGNTNTGDALRFITARSFSP---QAGGRPGDRAFQ 143 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDR 341 + I LTDG + D + G ++A+GV EA + L AS Sbjct: 144 QVAILLTDGRSQDLVRDAAAAAH------AAGIRIFAVGVG-EALREELHEIASEPTSAH 196 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRILY 369 + V + F I K K R Sbjct: 197 VFHVSD-------FDAIDKIRGKLRRRL 217 >gi|124249351|ref|NP_032432.2| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Mus musculus] gi|74146297|dbj|BAE28922.1| unnamed protein product [Mus musculus] gi|148692827|gb|EDL24774.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_a [Mus musculus] Length = 907 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L+ ++ + + ++V G S Sbjct: 284 TNMSKNLVFVIDISGSMEGQ------KVRQTKEALLKILEDMRPVDNF-DLVLFGSKVQS 336 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + R T GL + A+ ++ Sbjct: 337 WKGSLVPASNANLQAAQDFVRRFSLAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 393 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDGE + D + L A R +Y +G + FL+ ++ + Sbjct: 394 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWA 452 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 453 QRIYEDHDATQQLQGFYN 470 >gi|332833576|ref|XP_003312497.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Pan troglodytes] Length = 728 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 82 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 129 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 130 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 184 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 185 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 243 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 244 TRRVHEEEDAGSQLIGFYD 262 >gi|312434033|ref|NP_116206.4| inter-alpha-trypsin inhibitor heavy chain H5 isoform 2 [Homo sapiens] Length = 728 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 82 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 129 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 130 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 184 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 185 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 243 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 244 TRRVHEEEDAGSQLIGFYD 262 >gi|310703621|ref|NP_085046.5| inter-alpha-trypsin inhibitor heavy chain H5 isoform 1 precursor [Homo sapiens] Length = 942 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|187609608|sp|Q86UX2|ITIH5_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor Length = 942 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|119606788|gb|EAW86382.1| inter-alpha (globulin) inhibitor H5, isoform CRA_d [Homo sapiens] Length = 735 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|119606787|gb|EAW86381.1| inter-alpha (globulin) inhibitor H5, isoform CRA_c [Homo sapiens] Length = 748 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 102 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 149 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 150 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 204 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 205 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 263 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 264 TRRVHEEEDAGSQLIGFYD 282 >gi|119606785|gb|EAW86379.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Homo sapiens] gi|168275576|dbj|BAG10508.1| inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [synthetic construct] Length = 942 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|55958059|emb|CAI12954.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958529|emb|CAI16361.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 577 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 171 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 218 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 219 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 273 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 274 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 332 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 333 TRRVHEEEDAGSQLIGFYD 351 >gi|55958058|emb|CAI12953.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958528|emb|CAI16360.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 742 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 82 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 129 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 130 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 184 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 185 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 243 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 244 TRRVHEEEDAGSQLIGFYD 262 >gi|37181977|gb|AAQ88792.1| LLLL311 [Homo sapiens] Length = 694 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|30314037|gb|AAO49812.1| inter-alpha trypsin inhibitor heavy chain precursor 5 [Homo sapiens] Length = 942 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|55958060|emb|CAI12955.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958531|emb|CAI16363.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|189442558|gb|AAI67770.1| Inter-alpha (globulin) inhibitor H5 [synthetic construct] Length = 956 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|49355778|ref|NP_001001851.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 3 precursor [Homo sapiens] gi|119606789|gb|EAW86383.1| inter-alpha (globulin) inhibitor H5, isoform CRA_e [Homo sapiens] Length = 702 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|18916771|dbj|BAB85539.1| KIAA1953 protein [Homo sapiens] Length = 824 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ FS++I Sbjct: 178 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVW 225 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 226 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 280 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 281 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 339 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 340 TRRVHEEEDAGSQLIGFYD 358 >gi|293391324|ref|ZP_06635658.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951858|gb|EFE01977.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] Length = 525 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 50/273 (18%), Positives = 99/273 (36%), Gaps = 44/273 (16%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL----- 57 F +I+ FF N G +I+TA+L + +++ ++ + KA+L D + L Sbjct: 12 FNSIKQFFQNEHGVYAIITALLAFQLLLLVAFTVDGTGILLDKARLAQATDQAALLLIAE 71 Query: 58 ----------------YTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + + + +N+E + + Q + +++ + + R++ +NG Sbjct: 72 DNKYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSD-DKNGQ 130 Query: 102 AQDINNIER----STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF--PWCANSSHAPLLI 155 I + + L ++K+ +++ + F PW + L Sbjct: 131 KNSSPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSSSRLH 190 Query: 156 TSSVKISSKSDIG---------LDMMMVLDVSLSMND-------HFGPGMDKLGVATRSI 199 V I+S +D+MMV D+S SM +D L + I Sbjct: 191 DGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPIDKRIPSSSIRIDALRDVVKDI 250 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +L S D + R G V F+ Q Sbjct: 251 EGILLPKDSRDDTSPYNRMGFVAFAGGARQKTE 283 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 54/139 (38%), Gaps = 20/139 (14%) Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-LEHIAKGHDDYKKYIIFLTDGENSS 297 + +N + T T G+ N + D + +K + + ++ ++ L+DGE++ Sbjct: 369 GVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTRRILLVLSDGEDNR 428 Query: 298 PNIDNKESLF---YCNEAKRR------------GAIVYAIGVQAEAADQ---FLKNCASP 339 P L CN+ KR+ A V + + K C Sbjct: 429 PTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVEARVAFVALGYNPPQDQVNVWKQCV-G 487 Query: 340 DRFYSVQNSRKLHDAFLRI 358 ++Y+V + + L DAF +I Sbjct: 488 KQYYTVFSKQGLLDAFRQI 506 >gi|219684622|ref|ZP_03539565.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219685812|ref|ZP_03540621.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] gi|219671984|gb|EED29038.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219672645|gb|EED29675.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] Length = 333 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 79/206 (38%), Gaps = 27/206 (13%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + + S G D+++VLD+S SM Sbjct: 49 KDYRLNLMYFFTYSFLYLAAMVMVFALAGPSVSKKKMIHLS---AGADIVIVLDISPSMG 105 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + ++I+ GLV F+ P+ + Sbjct: 106 AVEFSSKNRLEFSK-------ELIRRFISQRENDNIGLVAFAKDASIVVPITTDRDFFNK 158 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 159 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 207 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQ 325 I + + N A+ +Y+IG+ Sbjct: 208 IYKDQVI---NLAQGLNVKIYSIGIG 230 >gi|182413803|ref|YP_001818869.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841017|gb|ACB75269.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 792 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 79/204 (38%), Gaps = 28/204 (13%) Query: 143 PWCANSSHAPLLITSSVKISSKSD-IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW + H + I K ++ S +++ +LDVS SM+ +KL + S+R Sbjct: 393 PWA--AQHRLVRIGLKAKDAAVSGRAAANLVFLLDVSGSMDQP-----NKLRLVQESMRL 445 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSK---IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 +L ++ R +VT++ + + P+A + I + I+ L G +T G Sbjct: 446 LLGRLQPED------RVAIVTYAGNSGLALPSTPVARQRE-ILDAIDELRAGGSTNGAMG 498 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L+ AY+ + + +I TDG+ + E + E + G Sbjct: 499 LQLAYDI------AKANFVANGVNR---VILCTDGDFNVGVTSEGELVRLIEEKAKSGVF 549 Query: 319 VYAIGVQAEA-ADQFLKNCASPDR 341 + +G D L+ A Sbjct: 550 LTVLGFGMGNLKDAMLQQIADRGN 573 >gi|32452632|gb|AAP43994.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 525 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 50/273 (18%), Positives = 99/273 (36%), Gaps = 44/273 (16%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL----- 57 F +I+ FF N G +I+TA+L + +++ ++ + KA+L D + L Sbjct: 12 FNSIKQFFQNEHGVYAIITALLAFPLLLLVAFTVDGTGILLDKARLAQATDQAALLLIAE 71 Query: 58 ----------------YTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + + + +N+E + + Q + +++ + + R++ +NG Sbjct: 72 DNKYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSD-DKNGQ 130 Query: 102 AQDINNIER----STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF--PWCANSSHAPLLI 155 I + + L ++K+ +++ + F PW + L Sbjct: 131 KNSSPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSSSRLH 190 Query: 156 TSSVKISSKSDIG---------LDMMMVLDVSLSMND-------HFGPGMDKLGVATRSI 199 V I+S +D+MMV D+S SM +D L + I Sbjct: 191 DGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPIDKRIPSSSIRIDALRDVVKDI 250 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +L S D + R G V F+ Q Sbjct: 251 EGILLPKDSRDDTSPYNRMGFVAFAGGARQKTE 283 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 54/139 (38%), Gaps = 20/139 (14%) Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-LEHIAKGHDDYKKYIIFLTDGENSS 297 + +N + T T G+ N + D + +K + + ++ ++ L+DGE++ Sbjct: 369 GVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTRRILLVLSDGEDNR 428 Query: 298 PNIDNKESLF---YCNEAKRR------------GAIVYAIGVQAEAADQ---FLKNCASP 339 P L CN+ KR+ A V + + K C Sbjct: 429 PTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVEARVAFVALGYNPPQDQVNVWKQCV-G 487 Query: 340 DRFYSVQNSRKLHDAFLRI 358 ++Y+V + + L DAF +I Sbjct: 488 KQYYTVFSKQGLLDAFRQI 506 >gi|22760136|dbj|BAC11081.1| unnamed protein product [Homo sapiens] Length = 488 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/148 (22%), Positives = 60/148 (40%), Gaps = 17/148 (11%) Query: 214 NVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 N R G++ +SS++ FPL A+ + ++ I L+ T + ++YA N F Sbjct: 17 NATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVA 76 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + + + +TDG +A+ G +YA+GVQ Sbjct: 77 E---GARPPEERVPRVAVIVTDGRPQD------RVAEVAAQARASGIEIYAVGVQRADVG 127 Query: 331 QFLKNCASP---DRFYSVQNSRKLHDAF 355 L+ ASP + + V++ L F Sbjct: 128 S-LRAMASPPLDEHVFLVESF-DLIQEF 153 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 251 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 301 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 302 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 356 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 357 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 404 >gi|198274643|ref|ZP_03207175.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] gi|198272090|gb|EDY96359.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] Length = 339 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 63/178 (35%), Gaps = 28/178 (15%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G++ ++ LD+S SM +L + + I +++ N + Sbjct: 80 SKMETVKRQGVETVVALDISNSMLAQDVTP-SRLEKSKKLISRLVETF-------NNDKV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ F+ + P+ + + LI T ++ A Sbjct: 132 AMIVFAGEAFTQLPITSDYVSAKMFLETISPSLITTQGTDIRGAIDLAMKSFT------- 184 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ + I+ +TDGEN ++ EA ++G V+ +GV + Sbjct: 185 ----PNEGVGRAIVLITDGENHEG-----GAIEAAQEAAKKGMRVFVLGVGSPDGSPI 233 >gi|326923641|ref|XP_003208043.1| PREDICTED: anthrax toxin receptor 1-like [Meleagris gallopavo] Length = 597 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 41/183 (22%), Positives = 63/183 (34%), Gaps = 24/183 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ +LD S S+ +H+ + ++ ++R + FSS+ Sbjct: 85 GAFDLYFILDKSGSVRNHWTEIYSFVESLAEKF------------ISPMLRMSFIVFSSR 132 Query: 227 IVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I+ + L G T G + A E++ H G Sbjct: 133 GTTIMKLTENREAIRRGLEILQHEVPGGDTFMHEGFKRA-------NEQIYHETYGGVRT 185 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N A+ GAIVY +GV+ Q S D + Sbjct: 186 ASVIIALTDGELQDVQFYYAE--QEANRARSFGAIVYCVGVKDFNETQLSTIADSIDHVF 243 Query: 344 SVQ 346 V Sbjct: 244 PVT 246 >gi|118092778|ref|XP_421647.2| PREDICTED: similar to tumor endothelial marker 8 [Gallus gallus] Length = 555 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 41/183 (22%), Positives = 63/183 (34%), Gaps = 24/183 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ +LD S S+ +H+ + ++ ++R + FSS+ Sbjct: 44 GAFDLYFILDKSGSVRNHWTEIYSFVESLAEKF------------ISPMLRMSFIVFSSR 91 Query: 227 IVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I+ + L G T G + A E++ H G Sbjct: 92 GTTIMKLTENREAIRRGLEILQYEVPGGDTFMHEGFKRA-------NEQIYHETYGGVRT 144 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N A+ GAIVY +GV+ Q S D + Sbjct: 145 ASVIIALTDGELQDVQFYYAE--QEANRARSFGAIVYCVGVKDFNETQLSTIADSIDHVF 202 Query: 344 SVQ 346 V Sbjct: 203 PVT 205 >gi|333030668|ref|ZP_08458729.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741265|gb|EGJ71747.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 342 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 69/204 (33%), Gaps = 23/204 (11%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 ++ +F + + P T K+ + G+++M+ LD+S SM Sbjct: 53 FRPSFKFWVLFACIGFFSILLARPQFGT---KLETVEKQGIEVMIALDISNSMLAQDVSP 109 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 +L + I +++D + + + GL+ F+ P+ + ++ + Sbjct: 110 -SRLAKSKLLISKLVDELHN-------DKVGLILFAGDAFTQLPITNDFVSAKMFLSSIS 161 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 + + A + + + II +TDGE+ P +L Sbjct: 162 PNLIERQGTSIGKAVD-------LATRSFTSQEGVGRTIILITDGEDHEP-----GALEA 209 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF 332 A G V +GV + Sbjct: 210 VKRAVDAGIQVNVMGVGSPDGAPI 233 >gi|300113557|ref|YP_003760132.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] gi|299539494|gb|ADJ27811.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] Length = 345 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 63/191 (32%), Gaps = 28/191 (14%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 G +D+L + E L R GLV F P + +Q + ++ Sbjct: 114 GVQVDRLSAVKEVLGEFLQ-------RREGDRVGLVVFGDAAYLQAPFSTDLQLSRRLLD 166 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 G + DA ++ + K II LTDG ++ + S Sbjct: 167 ECEVG--------MAGPRTAFGDAIGLGVNLFSESEAPAKTIIALTDGNDTKSQV---PS 215 Query: 306 LFYCNEAKRRGAIVYAIGVQAEA-------ADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 + A RR ++ + + Q L+ A + ++ + L + Sbjct: 216 VEAARVAARREIRIHTVAIGDPTTAGEDKLDQQALREVAAETGGSYFFAADRASLAGIYD 275 Query: 357 RIGKEMVKQRI 367 ++ E+ ++I Sbjct: 276 QL-DEIETRKI 285 >gi|283778313|ref|YP_003369068.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436766|gb|ADB15208.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 591 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 66/179 (36%), Gaps = 14/179 (7%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G+I + TA+L+ V+ ++ ++ + + ++ +L +D + L A ++ + Sbjct: 22 RRGNIIVFTAVLMVVMLGMIAFAVDVGYMYTMQTQLQRSVDAAALAGAGSLVEGTDIAQA 81 Query: 73 KKQKNDFSYRIIKNI-------WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 K + + ++ + E+G ++ E + S Sbjct: 82 KATEYLVRNPVGSSMTFVNEEEVPAKIAQFVAEHGDDFEVEAGEWNASTRSFETTNTLPS 141 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 LS Y P + F + +S + D+M+VLD S SMND Sbjct: 142 TLSVSMEY--PTMPTFFGKILGKDSFSIRASSVAMYQPR-----DIMVVLDFSGSMNDD 193 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 64/193 (33%), Gaps = 37/193 (19%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS---SKIVQTFPLAWGVQHIQEKINRLI 248 + S+ +D I + + R GL ++ + + PL V+ + N+ Sbjct: 404 VRALKDSLAIFMDFITEVEVQD---RVGLAVYNAPNGEGMVEVPLTLEVEQVATIANQRQ 460 Query: 249 FG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE----NSSPNID 301 G T GL A + K I+ +TDG+ N S +I Sbjct: 461 AGHYHEYTNIGGGLNAARLHLDQHGRPNAF---------KMIVLITDGQANWRNGSYSIA 511 Query: 302 NKESLFY-----CNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV-------QN 347 N E+ C R V + + A ++ A + ++V Q Sbjct: 512 NAENYLISEANLCAH-DSRKYPVVTLSLGTNADTDIMEQVATITNSTHFNVPGGSTIEQY 570 Query: 348 SRKLHDAFLRIGK 360 +L + F +I K Sbjct: 571 HDQLSETFRKIAK 583 >gi|56797867|emb|CAG27567.1| matrilin-4 [Danio rerio] Length = 548 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 48/197 (24%), Positives = 79/197 (40%), Gaps = 29/197 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D++ ++D S S+ H M K M+DII + R G+V +S Sbjct: 17 KSGPVDLVFIIDGSRSVRPHEFETMRKF---------MIDIIHELDIGLAATRIGVVQYS 67 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ F L + + + IN +I T + + YA N F A+E A+ + Sbjct: 68 SQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE----GARPNV 123 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + I+ TDG + + G +YA+GV A A L+ ASP Sbjct: 124 PHVAVIV--TDGRPQDRVAEVAAAARE------SGIEIYAVGV-ARADMTSLRAMASPPF 174 Query: 340 -DRFYSVQNSRKLHDAF 355 D + V++ L F Sbjct: 175 EDHVFLVESF-DLIHQF 190 >gi|120436991|ref|YP_862677.1| von Willebrand factor type A domain-containing protein [Gramella forsetii KT0803] gi|117579141|emb|CAL67610.1| secreted protein containing von Willebrand factor type A domain [Gramella forsetii KT0803] Length = 592 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 82/210 (39%), Gaps = 20/210 (9%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + + PW +++ + + I ++ +++ +LDVS SM +KL Sbjct: 193 FSIQTEYAQTPWNSDTKLVKIGLQGK-TIPLENVPASNLVFLLDVSGSMGQ-----QNKL 246 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + + + + + ++ ++ VV +G SS +V I+E +++L G + Sbjct: 247 PLLKSAFKLLTNNLREQDKISIVVYAG----SSGVVLEPTSGDQKTKIEEALDKLSAGGS 302 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G+E AY D K + +I TDG+ + +K E Sbjct: 303 TAGGEGIELAYKIAKDNFIKNGNNR---------VILATDGDFNVGLSSDKAMEDLIKEK 353 Query: 313 KRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 + G + A+G D L++ A Sbjct: 354 RESGIFLTALGFGMGNYKDSKLESLAQTGN 383 >gi|218515577|ref|ZP_03512417.1| hypothetical protein Retl8_18742 [Rhizobium etli 8C-3] Length = 54 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 2/48 (4%) Query: 2 SFLN--IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAK 47 SFL+ +R + G+ I+TAIL PV+ G+ I+ K Sbjct: 5 SFLHPCLRRMLSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSKIH 52 >gi|291386646|ref|XP_002709870.1| PREDICTED: anthrax toxin receptor 1 [Oryctolagus cuniculus] Length = 564 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYFFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|219127465|ref|XP_002183955.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217404678|gb|EEC44624.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 582 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 25/162 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + S ++++VLD S SM D G + L A + + E L + R +V Sbjct: 193 AASSGPKNVVLVLDTSGSMTD--GNRLSLLKQAAKQVIETLTV---------GDRVAIVE 241 Query: 223 FSSKIV--------QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 FSS+ + + I+ T A+ + D+ ++ Sbjct: 242 FSSQAKLFAQDNKFLFTATQKNKELLATHIDSFTAAGATNFLDAFTAAFAVLNDSIDQEY 301 Query: 275 HIAKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRR 315 H+ I+FLTDGE + P N+ + L N Sbjct: 302 HVGCTTA-----ILFLTDGEMTQPENVQEADVLDLVNTGISN 338 >gi|156358436|ref|XP_001624525.1| predicted protein [Nematostella vectensis] gi|156211311|gb|EDO32425.1| predicted protein [Nematostella vectensis] Length = 1323 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/266 (16%), Positives = 89/266 (33%), Gaps = 35/266 (13%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 G + + S I + L+ S + F + P P Sbjct: 1076 FTGNRDRGSVVLHSFDPPITNVRMLRLLPLTWHSHISLRLEFYSCPGEIPHLAKPCP--- 1132 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 LD+ + LD S S+ +A ++ +++ +K + ++ Sbjct: 1133 ---------KSLDIGIALDRSTSVGP------TNFNIAKTFLKILVERMKISTNGSHF-- 1175 Query: 218 SGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLE 274 GL+ +SS + + + I +I+ + F T++ L+ A +F Sbjct: 1176 -GLIAYSSSASRVISFRFSQKAADINRQIDAIEFTGGKTRTDFALQVAITDLFTNS---- 1230 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 A ++ +I +T+G S ++ K+ + K + V A+G+ + + L Sbjct: 1231 --AGDRENVTDVLIVMTNGRTSQGSLPYKDVMKP---LKEKKVDVIAVGIGPDVNEAELL 1285 Query: 335 NCASP--DRFYSVQNSRKLHDAFLRI 358 A D V + L I Sbjct: 1286 EIAEGGLDHVIRVDDYEALATKLNSI 1311 >gi|224285863|gb|ACN40645.1| unknown [Picea sitchensis] Length = 829 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/215 (20%), Positives = 81/215 (37%), Gaps = 38/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D++ VLDVS SM+ KL + R++ + I +++ R +V FSS Sbjct: 355 RAPIDLVTVLDVSGSMSG------TKLALLKRAMAFV------ISNLSPEDRLSVVVFSS 402 Query: 226 KIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L G + + RL+ T GL + D +++ + Sbjct: 403 TAKRVFSLKRMTPDGQRAANRVVERLLCTGGTNIAEGLRKGAKVLEDRRQRNPVAS---- 458 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG----------AIVYAIGVQAEAADQ 331 I+ L+DG+++ LF +E +R V+A G + Sbjct: 459 -----IMLLSDGQDTYSLSSRGVVLFPSDEQRRSARQSTRYGHVQIPVHAFGFGVDHDAA 513 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + + S F +Q + DAF + IG + Sbjct: 514 TMHAISEVSGGTFSFIQAESLVQDAFAQCIGGLLS 548 >gi|124008506|ref|ZP_01693199.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] gi|123986014|gb|EAY25864.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] Length = 425 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 76/201 (37%), Gaps = 26/201 (12%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + I L++ +V+D S SM+ DKL +++ ++D +KS ++ + Sbjct: 37 PEKQERIPLNISLVVDRSGSMSG------DKLNYVKKAVDFVIDNLKSDDVLS------I 84 Query: 221 VTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 V + +I A + + EK+ + + T + G+ Y ++ + Sbjct: 85 VQYDDEIDVVASSAKVTNKKALHEKVKGIQARNMTNLSGGMMEGYAQVKSTQSNGYVNR- 143 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCA 337 ++ L+DG ++ ++ + + G + GV ++ + + N + Sbjct: 144 --------VLLLSDGLANAGITAPEQLQQIAQKKFREAGIALSTFGVGSDFNEVLMTNLS 195 Query: 338 --SPDRFYSVQNSRKLHDAFL 356 +Y + K+ F Sbjct: 196 EYGGANYYFIDMPDKIPQIFA 216 >gi|260775644|ref|ZP_05884540.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] gi|260608060|gb|EEX34229.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] Length = 407 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 59/430 (13%), Positives = 131/430 (30%), Gaps = 91/430 (21%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK----IL 64 + +G SIL A+L+P++F V L + + KA++ + + L + + + Sbjct: 1 MHHKQQGHASILFAMLIPLLFGVFALGSDGARAIQSKARIEDASEAAALALSARDDEHAM 60 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 + EN + ++ ++ + +++ S + ++ Sbjct: 61 SDENKTIVQAYIEEYLPVEDSDVTILGIE--------RLECDDMPECRQGSGRGEARYTQ 112 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--- 181 Y++ + + P+ P K +D++ D S SM Sbjct: 113 YSVRVSAD-QTPWFGGGSPEVEVPEVW--RSQGGAKARKYQSNAVDIVFAADFSGSMASP 169 Query: 182 --NDHFGPGMDKLGVATR---------------------SIREMLDIIKSIPDVNNVVR- 217 D + + + S L + VNN+ + Sbjct: 170 WTGGSQPKYRDLIDILEKVTVELAPYNFDSQRYNSSVGVSGFNALTYRNELCAVNNLEKQ 229 Query: 218 --SGLVTFSSKI-------------------VQTFPLAWGVQHIQEKINRLIFGSTTKST 256 G+V +S + PL ++R T S Sbjct: 230 GLLGVVDYSRTVARMWETKSCRPPSISNSAGFHDVPLTDDYSTFNRTVDRFTARGGTAS- 288 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN-------------- 302 Y + L+H + ++ +I ++DG++++ N N Sbjct: 289 ------YQAVMSGARLLDH----GSNNRQILIVISDGQDNNLNHTNGLVNAGMCRDIISR 338 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPDRFYSVQNSRKLHD-AFLRIGK 360 E N + A + IG E + + + C D + +N+ +L + I + Sbjct: 339 LEGRPSAN-GRDVSARLAFIGFDFEPSMNPAMVRCVGEDNVFKAENTDELFEQIMFLIRE 397 Query: 361 EMVKQRILYN 370 E+ + Sbjct: 398 EVGHLATRRD 407 >gi|86144309|ref|ZP_01062641.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218676258|ref|YP_002395077.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] gi|85837208|gb|EAQ55320.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218324526|emb|CAV26007.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] Length = 438 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 78/219 (35%), Gaps = 15/219 (6%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +++ N G IL LLP + I M ++ S + ++L + + L Sbjct: 1 MKSIKKNR-GVAGILFVGLLPAMVIFMAFSMQMSQQMLAHSRLLEAAEVASLALIASPKE 59 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 E+ N + Y I+ N D R+ + + D+ Sbjct: 60 DEDKNVKYARYLVDRY-ILDNSEDVDVAVFTRKCEYKDGCVQASGELAP-------FSDF 111 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 +SA ++Y + + + +T S +D+ + D S SM + + Sbjct: 112 VVSATAKYTSWISYEDVDLEPEFTVSGRAVTRKYLPQS-----VDVYFIGDFSGSMGNSW 166 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G KL V +I+ ++D I+ + R L+ ++ Sbjct: 167 KNGKMKLDVVKETIKRVVDDIEKF-NTEEKSRVALLGYN 204 >gi|332226724|ref|XP_003262542.1| PREDICTED: anthrax toxin receptor 1 isoform 2 [Nomascus leucogenys] Length = 333 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|297266186|ref|XP_002799329.1| PREDICTED: anthrax toxin receptor 1-like [Macaca mulatta] Length = 484 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|158257658|dbj|BAF84802.1| unnamed protein product [Homo sapiens] Length = 333 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|114577872|ref|XP_001136716.1| PREDICTED: anthrax toxin receptor 1 isoform 5 [Pan troglodytes] Length = 321 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|114577870|ref|XP_001136388.1| PREDICTED: tumor endothelial marker 8 isoform 1 [Pan troglodytes] Length = 326 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|114577874|ref|XP_001136473.1| PREDICTED: tumor endothelial marker 8 isoform 2 [Pan troglodytes] Length = 297 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|114577864|ref|XP_525774.2| PREDICTED: anthrax toxin receptor 1 isoform 6 [Pan troglodytes] Length = 564 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|62898107|dbj|BAD96993.1| tumor endothelial marker 8 isoform 3 precursor variant [Homo sapiens] Length = 317 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|16933551|ref|NP_444262.1| anthrax toxin receptor 1 isoform 2 precursor [Homo sapiens] gi|114577866|ref|XP_001136564.1| PREDICTED: anthrax toxin receptor 1 isoform 3 [Pan troglodytes] gi|332226726|ref|XP_003262543.1| PREDICTED: anthrax toxin receptor 1 isoform 3 [Nomascus leucogenys] gi|16566413|gb|AAL26496.1|AF421380_1 anthrax toxin receptor [Homo sapiens] Length = 368 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|16933553|ref|NP_060623.2| anthrax toxin receptor 1 isoform 3 precursor [Homo sapiens] gi|114577868|ref|XP_001136638.1| PREDICTED: anthrax toxin receptor 1 isoform 4 [Pan troglodytes] gi|15082333|gb|AAH12074.1| Anthrax toxin receptor 1 [Homo sapiens] gi|190690489|gb|ACE87019.1| anthrax toxin receptor 1 protein [synthetic construct] gi|190691863|gb|ACE87706.1| anthrax toxin receptor 1 protein [synthetic construct] gi|312150350|gb|ADQ31687.1| anthrax toxin receptor 1 [synthetic construct] Length = 333 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|62988680|gb|AAY24067.1| unknown [Homo sapiens] Length = 234 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|62870691|gb|AAY18344.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 4 [Homo sapiens] Length = 411 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|62870685|gb|AAY18341.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 1 [Homo sapiens] Length = 387 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|62870689|gb|AAY18343.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 3 [Homo sapiens] Length = 410 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|62870687|gb|AAY18342.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 2 [Homo sapiens] Length = 419 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|14149904|ref|NP_115584.1| anthrax toxin receptor 1 isoform 1 precursor [Homo sapiens] gi|332226722|ref|XP_003262541.1| PREDICTED: anthrax toxin receptor 1 isoform 1 [Nomascus leucogenys] gi|17366074|sp|Q9H6X2|ANTR1_HUMAN RecName: Full=Anthrax toxin receptor 1; AltName: Full=Tumor endothelial marker 8; Flags: Precursor gi|14017381|gb|AAK52094.1| tumor endothelial marker 8 precursor [Homo sapiens] Length = 564 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|318604213|emb|CBY25711.1| protein TadG, associated with Flp pilus assembly [Yersinia enterocolitica subsp. palearctica Y11] Length = 457 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 46/242 (19%), Positives = 80/242 (33%), Gaps = 33/242 (13%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F + F N +G+I I I+ P ++ + E SH KAKL ++ + L A Sbjct: 11 FNHFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATL--ALT 68 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I N E + ++ KN + + L F I NI +T Sbjct: 69 IENNEIPDEPQQIKN-------NALVLSYVNAYLPSKKFLVPIININDNTHYLEYNAAVT 121 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Y +S+ N + + + D++ V D S SM Sbjct: 122 MAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEASEPT-------DVIFVADYSGSML 174 Query: 183 DHFGPGM----DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 +F +++ + R++ DII + + P +WG + Sbjct: 175 YNFNENKPRDHERIDALRSAFRKLHDIIMDNS-------------NINAIGYIPFSWGTK 221 Query: 239 HI 240 I Sbjct: 222 RI 223 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 44/274 (16%), Positives = 91/274 (33%), Gaps = 43/274 (15%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS-SHAPLLITS 157 + F + + I +++++ I V + +C FP+ ++ Sbjct: 193 SAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTYCHFPFSPKIHKPKGNYLSD 252 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +K SS + +++LD + D + +D + ++I + ++ Sbjct: 253 EIKRSSNT------LLLLDYIGDIID-YDKTIDSITGNAQTIDIPMSDVRFGDVCLQ--- 302 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G +S + Q + + I + T + G+ A N +F K K H Sbjct: 303 -GSNAYSLEQEQYI-------NNIDNIIEMEPHGWTLISSGILSA-NNLFKNKAKNGH-- 351 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLF--------YCNEAKRRGAIVYAIGVQAEAA 329 KK +I L+DG ++ +K + C E K + I + Sbjct: 352 ------KKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSPD 405 Query: 330 DQF-------LKNCASPDRFYSVQNSRKLHDAFL 356 + K C D +Y N+ +L Sbjct: 406 NNKNEPYHINWKKCVGEDNYYEAHNAHELEHKLQ 439 >gi|311265878|ref|XP_003130868.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Sus scrofa] Length = 944 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 71/199 (35%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ N ++ FS++I Sbjct: 296 VVFVLDSSASMVGA------KLRQTKDALFTILHDLRPQDHFN------IIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V P V+ + I+ + T L+ + D+ + + + Sbjct: 344 KDHLVSVTP--NNVRDGKVYIHHMSPSGGTDINGALQTGIALLHDSVARHDLEDRSVS-- 399 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ +G+ + + L+NC Sbjct: 400 --LIVFLTDGKPTVGETHTPKILNNTREAARGRVCIFTVGIGDDVDFRLLEKLSLENCGF 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 + +++ F Sbjct: 458 TRHVHEDEDAGAQLIGFYD 476 >gi|254512360|ref|ZP_05124427.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] gi|221536071|gb|EEE39059.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] Length = 668 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 75/231 (32%), Gaps = 37/231 (16%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA---- 60 ++R F + GS++ILT L+ ++F V G ++ + +L Y LD ++L A Sbjct: 22 HLRRFAVSTDGSMTILTLFLIMIVFTVAGFAVDLMRYDRERVRLQYALDRAVLAAADLDQ 81 Query: 61 ------------------------TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 K+ + N + ++ Sbjct: 82 ELCPRVVVNDYISKEGFDPGIIDEIKVDPETCLNTDSSDSDGDGTDSSDASGSDSDPSDT 141 Query: 97 RENGFAQDINNIERSTSL---SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL 153 +G + S + + ++ W S + Sbjct: 142 ASSGTESGSDGTSSGGDTAGTSTTTNAVELQGKRKVEASAQLNIETHFMKW---SGVDTI 198 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 T+ V + +S +++ +VLDVS SM + K A ++EML+ Sbjct: 199 NSTA-VSAAEESIGNVEISLVLDVSGSMEGAKLTNLQK--AAKDFVKEMLE 246 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/113 (23%), Positives = 51/113 (45%), Gaps = 14/113 (12%) Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID----NKESLFYCNEAKRRG 316 +A+ +I A E L KG +I NS D N++ + C +A+ + Sbjct: 562 WAHTEI-KAIESLFRRTKGDTYADDFI------RNSIVTADISKKNEQVVSLCGKAEEKE 614 Query: 317 AIVYAIGVQAEAA-DQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++++I +A ++ Q LK+CA P R+Y + ++ F I + R+ Sbjct: 615 VLIFSIAFEAPSSVKQMLKDCAVKPARYYEATGT-QIERVFDSISTSIQNLRL 666 Score = 39.4 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 44/121 (36%), Gaps = 28/121 (23%) Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK--------- 285 + + ++IN L+ +T GL++ + ++ + L + K Sbjct: 372 NDLDTLNKQINNLVAAGSTSINIGLKWGLALLDESIQPLIKTVANDTNVPKIFEDRPRPT 431 Query: 286 -------YIIFLTDGENS------SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ +TDG+N P D +F+ GA Y++ V E QF Sbjct: 432 NTTDTLKVVVLMTDGKNDLQRAVVPPYNDGPSDVFW-----NAGAQEYSVLVD-EDNSQF 485 Query: 333 L 333 L Sbjct: 486 L 486 >gi|213961827|ref|ZP_03390093.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] gi|213955616|gb|EEB66932.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] Length = 607 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 70/185 (37%), Gaps = 25/185 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK--IV 228 ++ ++D S SM++ +K+ + S + +LD ++ R +V ++S+ I Sbjct: 252 IVFLIDTSGSMDEP-----NKMPLLKASFKLLLD------NLRPEDRIAIVVYASQTGIA 300 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 A + I + I+ L+ +T GL+ AY + KG++ II Sbjct: 301 LPSTPAKEKEKISKVIDDLVASGSTAGGAGLQTAYEVAEKN-----FLPKGNNR----II 351 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSV 345 TDG+ + E E + G + +G D + A + + Sbjct: 352 LATDGDFNVGISSRDELQRLVEEKRNNGIYISVLGYGMGNYRDDMAETIANKGNGNYAYI 411 Query: 346 QNSRK 350 N + Sbjct: 412 DNFTE 416 >gi|17231852|ref|NP_488400.1| hypothetical protein alr4360 [Nostoc sp. PCC 7120] gi|17133496|dbj|BAB76059.1| alr4360 [Nostoc sp. PCC 7120] Length = 427 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 67/199 (33%), Gaps = 34/199 (17%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + S+V + ++ L++ ++LD S SM+ L + ++ ++LD ++ Sbjct: 27 ISISAVAEQFEQNLPLNLCLILDQSGSMHGQ------PLKMVVEAVEKLLDRLQPGD--- 77 Query: 214 NVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAK 270 R +V F+ P + + I+ +I +L T GL+ ++ Sbjct: 78 ---RISVVAFAGSATVIIPNQIVENPESIKTQIRKKLQASGGTVIAEGLQQGITELMKGT 134 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSS---------PNIDNKESLFYCNEAKRRGAIVYA 321 A LTDG D++ L + +A + + Sbjct: 135 RGAVSQA----------FLLTDGHGEDSLKIWKWEIGPDDSRRCLEFAKKAAKINLTINT 184 Query: 322 IGVQAEAADQFLKNCASPD 340 +G L+ A Sbjct: 185 LGFGNNWNQDLLETIADAG 203 >gi|17229274|ref|NP_485822.1| hypothetical protein all1782 [Nostoc sp. PCC 7120] gi|17130872|dbj|BAB73481.1| all1782 [Nostoc sp. PCC 7120] Length = 615 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 74/236 (31%), Gaps = 34/236 (14%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + ++ P + F AN +I L++ +V+D S SM G Sbjct: 2 VKTSYEFDQPILPAGFSLKANILLRFR-----AEIPESPRRNLNLSLVIDRSGSMA---G 53 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKI 244 + A S+ + L+ + +V + + P +++ I Sbjct: 54 AALHHALKAAESVVDQLEPKDILS---------VVVYDDAVDTVVPPQPVTDKPALKKSI 104 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPNIDN 302 ++ G T + G + K D +K ++ LTDG + D Sbjct: 105 RQVRAGGITNLSGGWLKGCEYV-----------KHQLDPQKINRVLLLTDGHANMGIQDP 153 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFL 356 K + G +G + L A+ FY +Q+ + + F Sbjct: 154 KILTATSTQKAEEGITTTTLGFAQGFNEDLLIGMARAANGNFYFIQSIDEAAEVFS 209 >gi|84623314|ref|YP_450686.1| hypothetical protein XOO_1657 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|122879130|ref|YP_200396.6| hypothetical protein XOO1757 [Xanthomonas oryzae pv. oryzae KACC10331] gi|188577378|ref|YP_001914307.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|84367254|dbj|BAE68412.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188521830|gb|ACD59775.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 350 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/237 (18%), Positives = 83/237 (35%), Gaps = 34/237 (14%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 T PW +S + I ++ +++ +DVS SM+ DKL + S Sbjct: 120 LATTPWNKDSLLLRVGIAGRDIATADLPPA-NLVFQVDVSGSMD-----APDKLPLLRSS 173 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKST 256 ++ + + + R LVT++ P G Q I E I+ L G +T Sbjct: 174 LKLL------VRQLRKQDRITLVTYAGNTAVVLPPTPGDQQGRIVEAIDSLQSGGSTAGA 227 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+E AY + +G + I+ TDG+ + D +E +R G Sbjct: 228 SGIELAYKA------AQQGYLRGGINR---ILLATDGDFNVGVTDFDALKGMVSEKRRSG 278 Query: 317 AIVYAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEM 362 + +G D ++ A + + +L I +++ Sbjct: 279 VALSTLGFGTGNYNDNLMEQSADAGDGAYAYIDTPLEARKVLTHELGATLATIARDV 335 >gi|326916308|ref|XP_003204450.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 902 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 65/364 (17%), Positives = 131/364 (35%), Gaps = 51/364 (14%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 N G + I I+L V V + +++ + AT+I ++ Sbjct: 299 ANFDGIVDIQNEIILQVCSGV-------DEQLGELVSGEEVVEPASNLVATQISSKSVRI 351 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 + + I+ I + G ++ ++T+L++ +Y ++ Sbjct: 352 TWDPSTSQITGYRIQFI-------PMITGGKQHVLSVGPQTTALNVKDLSPDTEYQINV- 403 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 Y M + + P T VK+ + G+D+ DV ++ + G+ Sbjct: 404 --YAMKGLTPSEPITIMEK------TQQVKVQVECSRGVDVKA--DVVFLVDGSYSIGIA 453 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI 248 + ++ + P V+ LV +S F L V+ I + IN Sbjct: 454 NFVKVRAFLEVLVKSFEISPRK---VQISLVQYSRDPHMEFSLNRYNRVEDIIQAINTFP 510 Query: 249 F-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + G +T + + Y K+F + + +I +TDG++S Sbjct: 511 YRGGSTNTGKAMTYVREKVF------VTSKGSRPNVPRVMILITDGKSSDA------FKE 558 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 + + ++A+GV+ +A L+ ASP Y+V++ DAF RI E+ + Sbjct: 559 PAIKLRDADVEIFAVGVK-DAVRTELEAIASPPAETHVYTVED----FDAFQRISFELTQ 613 Query: 365 QRIL 368 L Sbjct: 614 SVCL 617 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 70/194 (36%), Gaps = 26/194 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + + M+ ++ + R G+V +SS Sbjct: 139 DLVFLVDGSWSVGRNNFRYI---------LDFMVALVSAFDIGEEKTRVGVVQYSSDTRT 189 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + I R+ + G T + ++Y F + K Sbjct: 190 EFNLNQYFRRSDLLDAIKRIPYKGGNTMTGEAIDYLVKNTFTESAGARKG------FPKV 243 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I +TDG+ E + G V+++G++A A + + P ++ Sbjct: 244 AIVITDGKAQDEVEIPAR------ELRNIGVEVFSLGIKAADAKELKLIASQPSLKHVFN 297 Query: 345 VQNSRKLHDAFLRI 358 V N + D I Sbjct: 298 VANFDGIVDIQNEI 311 >gi|311234271|gb|ADP87125.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 440 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 63/431 (14%), Positives = 124/431 (28%), Gaps = 142/431 (32%) Query: 36 IETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNE 95 I++ + ++L +D + L + ++ + K + + Sbjct: 54 IDSGMLYLSHSRLQAAVDAAALAGSLQLPYDPQLD--------------KGLVRGAVTQY 99 Query: 96 LRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLI 155 + N +N + T + ++ + Sbjct: 100 MDANYPEASLNGVTPGTEERSVTVTATATVPTIFMNALGI-------------------- 139 Query: 156 TSSVKISSKSDIG---LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 S ++ +K+ G L+++ V+D S SM G + + A + E++ + Sbjct: 140 -GSSEVHAKATAGYNKLEVVFVIDNSGSMK---GTPIQQTNSAASQLVELIMPEGMM--- 192 Query: 213 NNVVRSGLVTFSSKIVQTF----------------------------------------- 231 V+ GLV F K+ Sbjct: 193 -TSVKVGLVPFRGKVHLPAGVDGLPDGCRNADGTLNPSWLHEEYFKTSYRYPSGSSLNVP 251 Query: 232 -----------PLAWGVQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 L + I I N L S T + GL++ + + E Sbjct: 252 KNTCTSIPRVQGLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPEAPFTEGS- 310 Query: 278 KGHDDYKKYIIFLTDGE------------NSSPNIDNKES-------------------- 305 D +K II LTDG+ N +PN + Sbjct: 311 -SAKDIRKVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKLNAA 369 Query: 306 -LFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCASP-----DRFYSVQNSRKLHDAFLR 357 L +AK G V+AI + +K+ AS D +Y ++ + D F + Sbjct: 370 MLEEARKAKEAGIEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDIDDVFKK 429 Query: 358 IGKEMVKQRIL 368 IG+++ + + Sbjct: 430 IGRQLGWRLLR 440 >gi|322434934|ref|YP_004217146.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162661|gb|ADW68366.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 373 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 70/171 (40%), Gaps = 27/171 (15%) Query: 156 TSSVKISS--KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I+S D+ + + +++D S SM D +DK +D++K ++ Sbjct: 125 GVPQTINSFKHQDLPVSIGLLIDSSGSMYDKRN-AVDKAS---------IDLVKLSNPMD 174 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 LV FS++ V + + ++ + T + L + + + Sbjct: 175 EEF---LVDFSTEAFIDTDFTTSVDKLSQGLSYIKSSGGTAAYDALVASADYLT------ 225 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K + K+ +I +TDGE+++ + ++S+ + G ++Y +G+ Sbjct: 226 ----KNAKNTKQVLIIITDGEDNASSATLEQSIRRIQDL--DGPVIYCVGL 270 >gi|291398581|ref|XP_002715573.1| PREDICTED: chloride channel accessory 4 [Oryctolagus cuniculus] Length = 874 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 50/259 (19%), Positives = 89/259 (34%), Gaps = 48/259 (18%) Query: 121 QHKDYNLSAVSRYEMPFIFCTF--PWCANSSHAPLLITSSVKISSKSDI-------GLDM 171 K++N A + C F W S T+ ++ + + Sbjct: 200 NEKNHNREA---PNLQNTKCNFRSTWEVISISEDFQSTTPMEGPPPTPAFSLLRPRQRTV 256 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +VLD S SM D+L ++ + L + V N G+V F S Sbjct: 257 CLVLDKSGSMASS-----DRLNRMNQAAKYFL-----LQVVENGSWVGMVHFDSTASVKS 306 Query: 232 PL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + + + + G T+ G+ A+ K ++ + Sbjct: 307 QLIQIKSNSERNQLLQSL-PTVAGGGTQICLGIRAAFQVF-----KQQNSEIDGSE---- 356 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK--NCASPDRFYS 344 I+ LTDGE+S+ + NE G I++ I + A ++ N +Y+ Sbjct: 357 IVLLTDGEDSTAS-------SCVNEVIESGTIIHFIALGPSADASVIQMSNLTGGSHYYA 409 Query: 345 VQNSRK--LHDAFLRIGKE 361 N++ L DAF + E Sbjct: 410 SDNAQNNGLIDAFGALTSE 428 >gi|221109526|ref|XP_002169886.1| PREDICTED: similar to coagulation factor C homolog, cochlin [Hydra magnipapillata] Length = 336 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 71/197 (36%), Gaps = 21/197 (10%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + + +D+ +LD S S+ + + L +N R G+ Sbjct: 9 VQPRCEAVVDVAFILDSSHSLEASYQKEKNFLKKLAAVFGI----------SSNGSRVGV 58 Query: 221 VTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +TFS + + L + E ++++ TT+ L A +F + Sbjct: 59 ITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTSA------N 112 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 G K II LTDG + P D ++ +E + G ++ +G+ + + L + Sbjct: 113 GGRVGVSKLIILLTDGSQT-PGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHIT 171 Query: 338 SP-DRFYSVQNSRKLHD 353 Y+ L D Sbjct: 172 GGKKNAYTAATFDSLTD 188 >gi|326500988|dbj|BAJ98725.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326527981|dbj|BAJ89042.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 83/224 (37%), Gaps = 43/224 (19%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + LD++ VLDVS SM+ H KL + +++R ++D + PD R +V Sbjct: 203 KPQPRAPLDLVTVLDVSGSMSGH------KLALLKQAMRFVIDNLG--PD----DRLSVV 250 Query: 222 TFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +FSS+ + LA G + L+ T GL A + ++ +H Sbjct: 251 SFSSEARRLTRLARMSDAGKALSVNAVESLVARGGTNIAEGLRTAAKVL----DERQHRN 306 Query: 278 KGHDDYKKYIIFLTDGENSS-------------PNIDNKESLFYCNEAKRR--GAIVYAI 322 ++ L+DG+++ N + + A ++ Sbjct: 307 AVSS-----VVLLSDGQDTYTMMRRRGPSGVHAGNYEELVPPSFARTGADGDWSAPIHTF 361 Query: 323 GVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 G + + A+ F ++N + DAF + IG + Sbjct: 362 GFGNDHDAAAMHVIAEATGGTFSFIENEAVIQDAFAQCIGGLLS 405 >gi|326489689|dbj|BAK01825.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 691 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 83/224 (37%), Gaps = 43/224 (19%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + LD++ VLDVS SM+ H KL + +++R ++D + PD R +V Sbjct: 187 KPQPRAPLDLVTVLDVSGSMSGH------KLALLKQAMRFVIDNLG--PD----DRLSVV 234 Query: 222 TFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +FSS+ + LA G + L+ T GL A + ++ +H Sbjct: 235 SFSSEARRLTRLARMSDAGKALSVNAVESLVARGGTNIAEGLRTAAKVL----DERQHRN 290 Query: 278 KGHDDYKKYIIFLTDGENSS-------------PNIDNKESLFYCNEAKRR--GAIVYAI 322 ++ L+DG+++ N + + A ++ Sbjct: 291 AVSS-----VVLLSDGQDTYTMMRRRGPSGVHAGNYEELVPPSFARTGADGDWSAPIHTF 345 Query: 323 GVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 G + + A+ F ++N + DAF + IG + Sbjct: 346 GFGNDHDAAAMHVIAEATGGTFSFIENEAVIQDAFAQCIGGLLS 389 >gi|324499530|gb|ADY39800.1| Transmembrane cell adhesion receptor mua-3 [Ascaris suum] Length = 3675 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 91/235 (38%), Gaps = 23/235 (9%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y ++++ P + + K D+M ++D S S+ G + K Sbjct: 1156 SYTCQCYAGFVDVSSSANLQPGRVCTVQTTCPKQKT--DLMFLVDGSGSI----GSYVFK 1209 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI- 248 V R ++E +++ D R GL+ +S +I F L+ + + I+++ Sbjct: 1210 HEVL-RFVKEFVELFDIGLDN---TRVGLIQYSDQIRHEFDLSQYRDKASVIQAISQVHY 1265 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T++ +++ + F + DD + I +TDG + + + Sbjct: 1266 LTGLTRTGAAIQHMVMEGFSERR---GARAESDDVARVAIVITDGRSQDNVTEPAIA--- 1319 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEM 362 A+R ++AIGV L++ A SP R++ V + L + ++ Sbjct: 1320 ---ARRLHVNMFAIGVTDHVLASELESIAGSPSRWFYVDRFKDLDTRLRSLIQKA 1371 >gi|298372685|ref|ZP_06982675.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275589|gb|EFI17140.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] Length = 345 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 61/208 (29%), Gaps = 34/208 (16%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPG 188 + + + P ++ K + G++ M+VLD+S SM D Sbjct: 55 PHVKFVLLMLAVALMIIALARPQMLK---KQENVQKRGIEAMLVLDISNSMMAQDIAPSR 111 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 +D + + + L + GL+ F+ P+ + + + Sbjct: 112 LDYAKMLLSQLIDRLTD----------DKMGLIVFAGDAFIQMPITSDKVSAKMFLKTIQ 161 Query: 249 FG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 T ++ A D K+ I LTD EN N Sbjct: 162 PDLIQRQGTAIGSAIDLAVKSFNDTKQSGGRA----------IFLLTDAENHEDN----- 206 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ A+ + V +G+ Sbjct: 207 AVEAAKMARDKNITVNVVGIGTPEGSPI 234 >gi|149410251|ref|XP_001508722.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 950 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 75/211 (35%), Gaps = 27/211 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + ++ V+D S SM+ K+ ++ ++L + N LV F+S Sbjct: 249 LPKSVVFVIDKSGSMDG------RKIVQTREALLKVLGDLNPEDQFN------LVVFNSM 296 Query: 227 IVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 I Q P V ++ + + T + A + + ++ ++ + Sbjct: 297 ISQWQPSLLKATQENVGSAKKFVLDIRASGGTNINEAVLAAVHLLDESNQREL----LPE 352 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + II LTDGE + + + + ++ +G + + FL+ A + Sbjct: 353 NSVSMIILLTDGEPTVGETNPENIQQNIQRSLDGKYALFCLGFGFDVSYSFLEKMALDNS 412 Query: 342 ------FYSVQNSRKLHDAFLRIGKEMVKQR 366 + + +L D + + ++K Sbjct: 413 GLARRIYEDSDAALQLQDFYQEVATPLLKHV 443 >gi|114557515|ref|XP_524757.2| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Pan troglodytes] Length = 919 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 56/278 (20%), Positives = 99/278 (35%), Gaps = 59/278 (21%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 +F NE N A + NI+ + + + +D+ + +P + P + Sbjct: 248 EFCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNT------IPMVTPPPPPVFSLLK 301 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 I + +VLD S SM G D+L ++ + L + Sbjct: 302 ISQRI---------------VCLVLDKSGSMG-----GKDRLNRMNQAAKHFL-----LQ 336 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG------STTKSTPGLEYAYN 264 V N G+V F S L ++ N L+ G T G++YA+ Sbjct: 337 TVENGSWVGMVHFDSTATVVNKLI--QIKSSDERNTLMAGLPTYPRGGTSICSGIKYAFQ 394 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 I H + ++ LTDGE+++ + +E K+ GAIV+ I + Sbjct: 395 VIG-----ELHSQLDGSE----VLLLTDGEDNTAS-------SCIDEVKQSGAIVHFIAL 438 Query: 325 QAEAADQF--LKNCASPDRFYSVQNSRK--LHDAFLRI 358 +A + + FY ++ L DAF + Sbjct: 439 GRDADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGAL 476 >gi|291229807|ref|XP_002734862.1| PREDICTED: polydom-like [Saccoglossus kowalevskii] Length = 1730 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 67/198 (33%), Gaps = 32/198 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ + +L P R +V++S +V+ Sbjct: 35 DLVFLLDRSASVGSA------NFEAEKGFVESLLGQFSISPAS---TRVDVVSYSEDVVR 85 Query: 230 TFPLAW---GVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 H + I + + T + L+ A N ++ D Sbjct: 86 HIDYIREPKNKCHFSQDIRHVTYRNSGKTNTNGALQEARNIFVGSR----------QDVH 135 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K ++ L+DG++++ + E ++ G ++ I + D+ S + Sbjct: 136 KVVVLLSDGQSNTGG----DPTTTAEELRQDGVEIFTIAIGLFNKDELNSIATSDHHTFE 191 Query: 345 VQNSRKLHDAFLRIGKEM 362 + R+ F ++ + Sbjct: 192 YSSFRE----FKKLASRI 205 >gi|315501784|ref|YP_004080671.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408403|gb|ADU06520.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 430 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 44/236 (18%), Positives = 79/236 (33%), Gaps = 52/236 (22%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 T V K + +VLDVS SM G ++ VA ++ +++ ++PD Sbjct: 32 TTEPVAEPPK------VELVLDVSGSMRARDIDGRSRISVAQQAFN---EVVDALPDETQ 82 Query: 215 V-VRSGLVTFSSKIVQTFPLAW---------GVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + +R T+ K + L + + L T L A Sbjct: 83 LGIRVLGATYRGKDKKQGCLDTQQIVPVGPVDRTQAKAAVAGLRPTGFTPVGLALRSAAQ 142 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-----V 319 + I+ +TDGE++ D C A+ A V Sbjct: 143 DLGTGSTARR------------IVLITDGEDTCAPPDP------CEVARELAAQGTRLVV 184 Query: 320 YAIGVQAEAA--DQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 +G+ + Q L C A+ + + Q++ +L RI + + + R Y Sbjct: 185 DTLGLAPDEKVRKQLL--CIAGATGGTYTAAQSADELT---GRIKQLVDRARDTYT 235 >gi|302865239|ref|YP_003833876.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568098|gb|ADL44300.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 429 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 44/236 (18%), Positives = 79/236 (33%), Gaps = 52/236 (22%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 T V K + +VLDVS SM G ++ VA ++ +++ ++PD Sbjct: 32 TTEPVAEPPK------VELVLDVSGSMRARDIDGRSRISVAQQAFN---EVVDALPDETQ 82 Query: 215 V-VRSGLVTFSSKIVQTFPLAW---------GVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + +R T+ K + L + + L T L A Sbjct: 83 LGIRVLGATYRGKDKKQGCLDTQQIVPVGPVDRTQAKAAVAGLRPTGFTPVGLALRSAAQ 142 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-----V 319 + I+ +TDGE++ D C A+ A V Sbjct: 143 DLGTGSTARR------------IVLITDGEDTCAPPDP------CEVARELAAQGTRLVV 184 Query: 320 YAIGVQAEAA--DQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 +G+ + Q L C A+ + + Q++ +L RI + + + R Y Sbjct: 185 DTLGLAPDEKVRKQLL--CIAGATGGTYTAAQSADELT---GRIKQLVDRARDTYT 235 >gi|58425974|gb|AAW75011.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 365 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/237 (18%), Positives = 83/237 (35%), Gaps = 34/237 (14%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 T PW +S + I ++ +++ +DVS SM+ DKL + S Sbjct: 135 LATTPWNKDSLLLRVGIAGRDIATADLPPA-NLVFQVDVSGSMD-----APDKLPLLRSS 188 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKST 256 ++ + + + R LVT++ P G Q I E I+ L G +T Sbjct: 189 LKLL------VRQLRKQDRITLVTYAGNTAVVLPPTPGDQQGRIVEAIDSLQSGGSTAGA 242 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+E AY + +G + I+ TDG+ + D +E +R G Sbjct: 243 SGIELAYKA------AQQGYLRGGINR---ILLATDGDFNVGVTDFDALKGMVSEKRRSG 293 Query: 317 AIVYAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEM 362 + +G D ++ A + + +L I +++ Sbjct: 294 VALSTLGFGTGNYNDNLMEQSADAGDGAYAYIDTPLEARKVLTHELGATLATIARDV 350 >gi|94264779|ref|ZP_01288557.1| von Willebrand factor, type A [delta proteobacterium MLMS-1] gi|93454768|gb|EAT05023.1| von Willebrand factor, type A [delta proteobacterium MLMS-1] Length = 771 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 79/232 (34%), Gaps = 36/232 (15%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 N + L S S+ I + ++LD S SM D + A ++I +ML++ Sbjct: 242 PNGGYVALASFSPRLPPSEQPIPTSLAILLDCSGSMAG------DSIAQAKQAISDMLNL 295 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFP-----LAWGVQHIQEKINRLIFG-STTKSTPGL 259 ++ N L+ F S++ FP + ++ I + T+ + Sbjct: 296 LRPEDYCN------LIMFGSEVKSVFPCQVAADKTNITTLRRAIRAIDADMGGTE----M 345 Query: 260 EYAYNKIFDAKE--KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 + A + K + + I+ +TDG+ + Sbjct: 346 QKALVETLKMSPIYKPPEVEVVPARISRNILLITDGQ-------VWGDKQILRRMAKSDH 398 Query: 318 IVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 V+ +GV + FL AS + +L +G+++ +Q Sbjct: 399 RVFTVGVGGAVCEAFLHGLAS-----QSGGACELVAPNEEMGEKIARQSKRV 445 >gi|332162963|ref|YP_004299540.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325667193|gb|ADZ43837.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] Length = 457 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 46/242 (19%), Positives = 80/242 (33%), Gaps = 33/242 (13%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F + F N +G+I I I+ P ++ + E SH KAKL ++ + L A Sbjct: 11 FNHFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATL--ALT 68 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I N E + ++ KN + + L F I NI +T Sbjct: 69 IENNEIPDEPQQIKN-------NALVLSYVNAYLPSKKFLVPIININDNTHYLEYNAAVT 121 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Y +S+ N + + + D++ V D S SM Sbjct: 122 MAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEASEPT-------DVIFVADYSGSML 174 Query: 183 DHFGPGM----DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 +F +++ + R++ DII + + P +WG + Sbjct: 175 YNFNENKPRDHERIDALRSAFRKLHDIIMDNS-------------NINAIGYIPFSWGTK 221 Query: 239 HI 240 I Sbjct: 222 RI 223 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 45/274 (16%), Positives = 91/274 (33%), Gaps = 43/274 (15%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS-SHAPLLITS 157 + F + + I +++++ I V + +C FP+ ++ Sbjct: 193 SAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTYCHFPFSPKIHKPKGNYLSD 252 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +K SS + +++LD + D + +D + ++I + ++ Sbjct: 253 EIKRSSNT------LLLLDYIGDIID-YDKTIDSITGNAQTIDIPMSDVRFGDVCLQ--- 302 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G +S + Q + + I + T + G+ A N IF K K H Sbjct: 303 -GSNAYSLEQEQYI-------NNIDNIIEMEPHGWTLISSGILSA-NNIFKNKAKNGH-- 351 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLF--------YCNEAKRRGAIVYAIGVQAEAA 329 KK +I L+DG ++ +K + C E K + I + Sbjct: 352 ------KKLMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSPD 405 Query: 330 DQF-------LKNCASPDRFYSVQNSRKLHDAFL 356 + K C D +Y N+ +L Sbjct: 406 NNKNEPYHINWKKCVGEDNYYEAHNAHELEHKLQ 439 >gi|170079457|ref|YP_001736094.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887126|gb|ACB00840.1| Protein containing von Willebrand factor (vWF) type A domain [Synechococcus sp. PCC 7002] Length = 545 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 70/191 (36%), Gaps = 23/191 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + I +I +++ +++ + DVS SMND DKL + + R + Sbjct: 156 PWQPQHQLLRIGIKGQ-EIENEALPPSNLVFLFDVSGSMNDP-----DKLPLLKSAFRLL 209 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLE 260 ++ ++ R +V ++ P G + I ++ L G +T G+E Sbjct: 210 VN------ELRPEDRVSIVVYAGAAGLVLPSTSGAEKETILAALDNLEAGGSTAGGEGIE 263 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + D + II TDG+ + + E + + + + + Sbjct: 264 LAYQEAADNFLDNGNNR---------IILATDGDFNVGMSSDAELIRLIEQKREQDIFLT 314 Query: 321 AIGVQAEAADQ 331 +G Sbjct: 315 VLGFGTGNLKD 325 >gi|150251392|gb|ABR68008.1| matrilin-like 85 kDa protein [Lehmannia valentiana] Length = 716 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 75/201 (37%), Gaps = 22/201 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++M+ D S S+ L + +IK+ +N VR G V FS K Sbjct: 529 DIIMLFDASNSI---------LLENFDKQFIFAKRLIKNFKIGSNDVRFGGVVFSQKTQL 579 Query: 230 TFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + N S+TK+ N + EK ++ Sbjct: 580 LFNLKDHDDFDGLSKGLTNVKYLDSSTKTDEAFNLVVNDKMFSVEKGGRVSAPD-----I 634 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSV 345 ++ TDG +SP +++ + K+ G + ++ + + L+ +S P+ Sbjct: 635 VLLFTDGNPTSPT----KTITSADTVKKNGISIISLAIGKDLDMDILRTISSKPEFAIEA 690 Query: 346 QNSRKLHDAFLRIGKEMVKQR 366 N L ++ + + K+ Sbjct: 691 TNYDMLDYVEKKLAQLLCKES 711 >gi|163850829|ref|YP_001638872.1| vault protein inter-alpha-trypsin subunit [Methylobacterium extorquens PA1] gi|163662434|gb|ABY29801.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium extorquens PA1] Length = 732 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/269 (15%), Positives = 92/269 (34%), Gaps = 37/269 (13%) Query: 103 QDINNIERSTSLSIIIDDQHKDYNLSAVSRY-EMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ + ER +L+ +D+ L+ + E P I A + ++T Sbjct: 276 EERSASERRITLADGATAADRDFELTWNAAPGEAPSIGLFRERVAGAEAVLAVVTPPESA 335 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS + + D++ V+D S SM + A S+ LD + + R ++ Sbjct: 336 SSAASVPRDVVFVIDNSGSMGGA------SMRQAKASLLIGLDRLGAHD------RFNVI 383 Query: 222 TFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F FP A + + + L T+ L+ A + Sbjct: 384 RFDHSFDTLFPDLVPADAGHLMRAKSFVAGLQASGGTEMLAPLQAALRGATPEETGRLRQ 443 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKN 335 ++FLTDG E+ + A RG + ++ +G+ + +++ Sbjct: 444 ----------VVFLTDG------AIGNEAQIFSAIATERGRSRLFMVGIGSAPNGYLMRH 487 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 A F + ++ + + ++ Sbjct: 488 AAELGRGSFTQIDTPDQVTERMRALLVKL 516 >gi|322435250|ref|YP_004217462.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321162977|gb|ADW68682.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 515 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 79/242 (32%), Gaps = 19/242 (7%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + +G + + +++ VI ++GLV++ H + + +L D + L A I Sbjct: 11 FLRDQRGQVLPIAGLMMFVITAMIGLVVDVGHIYLCQRELQASSDAAALAGAEIIPTATT 70 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + +S + N +G+ T I ++ Sbjct: 71 AAAVYAKATAYSSTTGAANVYKNMTNITMVSGYPILKCLSTMQTQ-GISCVGPLSYNSIQ 129 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + + +P F S + TS+ S ++ +VLD + S + Sbjct: 130 VMQQAVVPLYFARI---IGRSSMTISATSTAAKGGASSRPYNVALVLDTTYS-EISYDSD 185 Query: 189 MDKLGVA-----TRSIREMLD---------IIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 + + + LD + S N+VVR G+ TF + T Sbjct: 186 CGNSQMLCTLQGVQILLNQLDPCGTSVTTCSVTSGQATNSVVRVGIFTFPQMVTSTVSSD 245 Query: 235 WG 236 + Sbjct: 246 YD 247 >gi|260576512|ref|ZP_05844501.1| conserved hypothetical protein [Rhodobacter sp. SW2] gi|259021235|gb|EEW24542.1| conserved hypothetical protein [Rhodobacter sp. SW2] Length = 529 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 59/154 (38%), Gaps = 14/154 (9%) Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY----AYNKIFDAKEKLEH 275 L +S + T W + T + + + A + + + Sbjct: 382 LTLYSDRASTTSDYYW----FSDSKWHTTIDGGTTGSVQMTWPEVWAKWSVRYVAKDIYT 437 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLK 334 A G + + F TD E S D + C+ AK G ++++IG +A + L+ Sbjct: 438 KALGGSENSWFETF-TD-EISYGQKDVR-LQQICDAAKDSGIVIFSIGFEAPENGRNQLR 494 Query: 335 NCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +CAS P +++ ++ AF I ++ R+ Sbjct: 495 DCASQPSNYFNATGV-QITTAFRAIATQLSHLRL 527 >gi|32475885|ref|NP_868879.1| hypothetical protein RB9427 [Rhodopirellula baltica SH 1] gi|32446428|emb|CAD76256.1| hypothetical protein containing vWFA domain [Rhodopirellula baltica SH 1] Length = 484 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 79/244 (32%), Gaps = 26/244 (10%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + H+ + ++H + +T S++ +++ +VL Sbjct: 33 MTATPLHQASAEQVKLDVRL-VHPVMKAGEKQTNHLRIALTGFELKSAEERPPVNVCLVL 91 Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 D S SM+ KL A + +D + V+ +V + S + P Sbjct: 92 DHSGSMSGQ------KLARAKEAAEAAIDRLSDDDIVS------VVLYDSNVTVLVPATK 139 Query: 235 -WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 I++KI + GS+T G+ ++ + +I L+DG Sbjct: 140 ATDRSSIKQKIRGIQAGSSTALFAGVSKGAAEVRKFLADEQVNR---------VILLSDG 190 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRKL 351 + +E + V +G+ + + + AS ++++ L Sbjct: 191 LANVGPKSPQELEGLGRSLMKEAISVSTLGLGSGYNEDLMVALASVGGGNHAFIEDADSL 250 Query: 352 HDAF 355 F Sbjct: 251 VSVF 254 >gi|153812017|ref|ZP_01964685.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] gi|149831916|gb|EDM87002.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] Length = 2099 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 61/351 (17%), Positives = 123/351 (35%), Gaps = 56/351 (15%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 + + + +K+ ++ I D + + N A ++ +R+ + + + Sbjct: 1213 VQSDGKSTKIGEKDSATFTITNTYTPIDINSVIEYNKTATLLDWNQRTYKIDLTASSKTT 1272 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK-------ISSKSDIGLDMMMVLD 176 S + Y++ + + I S I +++ I + D Sbjct: 1273 Q---SMKTPYDIVLVLDQSGSMSQKFVEYNKINGSSMFWRKTYYIKTQNGIYQQLSWSWD 1329 Query: 177 VSLSMNDHF-------GPGMDKLGVATRSIREMLDIIKS--------IPDVNNVVRSGLV 221 + S D + P + VA +S + +D +KS + + N+ R G+V Sbjct: 1330 NTWSYTDSYSGKTVTVDPNTTDVYVAQKSNQTKIDALKSAATTFVNNVANKNSDCRVGIV 1389 Query: 222 TFSSKIVQTFPLAWGVQHIQ----------EKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 TFS+ P+ + I+ L G T GL+ A + Sbjct: 1390 TFSNDGYIK-PITNNSYTLAKVGTSKGDIINTIDGLKTGGDTYPAKGLDKANEIFSENSS 1448 Query: 272 KLEHIAKGHDDYKKYIIFLTDGEN---SSPNIDNKESLFYCNEAK---RRGAIVYAIGVQ 325 + D KK ++FLTDG ++ N D + N AK +G YA+G+ Sbjct: 1449 NSWETVEQTDGRKKMVVFLTDGVPAPANTNNFDENLAGAGTNSAKILHDQGVATYALGIF 1508 Query: 326 AEAA-------------DQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEM 362 A D+++++ AS +++ + + L F I + Sbjct: 1509 GAANSDGTMDNASVQRIDKYMQSIASSHEKYMTADSVDNLSSLFESITNNI 1559 >gi|118365082|ref|XP_001015762.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89297529|gb|EAR95517.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 755 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 84/202 (41%), Gaps = 32/202 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +D++ ++D S SM K + +S++ +L I++ ++ LV+F Sbjct: 43 DARLPVDIICLIDNSGSMAGK------KAQLVRKSLKYLLKILEKGDQIS------LVSF 90 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 SS PL Q I+ I ++ T PG + + KE+ E Sbjct: 91 SSTAKTLCPLTQVNDENKQQIKSAIKQINGQGGTFVIPGFKEVTKILNSRKEQRE----- 145 Query: 280 HDDYKKYIIFLTDGE----NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + +I+ LTDGE +S I N LF +E ++ +Y G + + L+ Sbjct: 146 ----QTFILLLTDGEFGDIDSGKVIQNINRLFTQSEIQKTPY-IYTYGYGDDVNPEILQE 200 Query: 336 CA--SPDRFYSVQNSRKLHDAF 355 A ++ + N +++ D F Sbjct: 201 IAQKFQGKYCLISNVQQVTDWF 222 >gi|84688081|ref|ZP_01015939.1| hypothetical protein 1099457000215_RB2654_05415 [Maritimibacter alkaliphilus HTCC2654] gi|84663909|gb|EAQ10415.1| hypothetical protein RB2654_05415 [Rhodobacterales bacterium HTCC2654] Length = 595 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 5/80 (6%) Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCAS-PDRFYSVQNSR 349 D N S DN + C AK G +V+ IG + + +++CAS P ++ V Sbjct: 519 DYFNYSEKNDNLD--EICTAAKNAGMVVFTIGFEVSGSQHDIMRSCASAPAYYFDVDGL- 575 Query: 350 KLHDAFLRIGKEMVKQRILY 369 + AF I +E+ K R+++ Sbjct: 576 DISAAFAAIAREISKLRLVF 595 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 72/199 (36%), Gaps = 23/199 (11%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + +GS I + ++ + G+ ++T + + LD ++L A+ ++ Sbjct: 21 FLRDERGSFIIFGIAVFMLMCLAGGIAVDTMRYETHRVHVQGTLDRAILAAAS---LDQD 77 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + + + F+ + L DI+ E T+ + D + Sbjct: 78 LDPEEVVLDYFT------------KAGLGHVISQDDIDVFENQTNGEVADDVAVTTRRVE 125 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGP 187 A MP F + + ++ + L + +VLDVS SM + Sbjct: 126 ASVSALMPTTFLRLAHMYDLGLY-------TEGGAEEALSLSEISLVLDVSGSMGNSSSS 178 Query: 188 GMDKLGVATRSIREMLDII 206 G K+ R+ + ++++ Sbjct: 179 GYSKIYELRRAAKRFVNVM 197 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 39/90 (43%), Gaps = 16/90 (17%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY-----------AYNKIFDAKEK 272 + + +P+ + + ++++I+ L T G+++ A + + A E Sbjct: 318 NDYWREIYPMGFSAEALRDEIDDLGASGNTSIDLGMKWGAALLDPAAQPAISDLVAANEV 377 Query: 273 LEHIAKGHDDY-----KKYIIFLTDGENSS 297 E +Y +K I+ +TDGEN+S Sbjct: 378 NEAFDGRPFEYTQRGIEKVIVLMTDGENTS 407 >gi|116623920|ref|YP_826076.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227082|gb|ABJ85791.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 300 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 74/218 (33%), Gaps = 42/218 (19%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S D+ + + +VLDVS SM KL A + +L D + + L+T Sbjct: 70 STEDLPVSIGLVLDVSGSMRQ-------KLATARAFLSALL---GGAEDRDESL---LLT 116 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + L ++ + + G +T + D + + Sbjct: 117 CADRPDLQTGLTPDLERLSSLVRATRSGGST-----------ALIDTIYLSIERMRSARN 165 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG-------------VQAEAA 329 +K +I ++DG+++ E + A A +Y+I V A Sbjct: 166 SRKVLIVVSDGQDNFSRHTRSELIS---RAIESEAQIYSIATPEPPHFQKAIELVDANRG 222 Query: 330 DQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 L++ A+ + + + R+ + + +Q Sbjct: 223 LILLQDLAHATGGIYLPLDAASSTTAVAERLARTLHEQ 260 >gi|159473306|ref|XP_001694780.1| flagellar associated protein [Chlamydomonas reinhardtii] gi|158276592|gb|EDP02364.1| flagellar associated protein [Chlamydomonas reinhardtii] Length = 4349 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 71/218 (32%), Gaps = 27/218 (12%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 I+ K + + VLD S SM ++ + + ++D + + Sbjct: 960 ISIKASAEVKQRAHVALTCVLDRSGSMGGE------RIELVRETCHFLIDQLTADD---- 1009 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAY--NKIFD 268 G+V++S+ + + PL + I+ L T GLE Sbjct: 1010 --YLGIVSYSNTVREDVPLLRMTPEARRLAHTMISSLTLHGGTALYAGLEAGVKQQMAAA 1067 Query: 269 AKEKLEHIAKGHDDYKKYII----FLTDGENSSPNIDNKESLFYCNEAK---RRGAIVYA 321 ++ K A G I+ TDG+ ++ E + + + V+ Sbjct: 1068 SELKALAAAAGGGSDSSRIVHSCFLFTDGQATTGPCTVNEIMGQMTSLQSPADQNITVHT 1127 Query: 322 IGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 G + + + L+ A +Y + + + F Sbjct: 1128 FGFGDDHSVELLQGVAEAQSGVYYYISCADDIPSGFGD 1165 >gi|126336625|ref|XP_001380258.1| PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Monodelphis domestica] Length = 819 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 75/202 (37%), Gaps = 17/202 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM K+ ++ ++LD +K N + SG VT K Sbjct: 263 IVFLIDKSGSMAG------RKIKKTKAALIKILDDLKPEDHFNMITFSGHVT-RWKPELV 315 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L ++ + ++ T + A + + ++ +K E II L Sbjct: 316 LALDEHLKEAKTFLSNTPALGVTNVNGAVLAAVSMLDESNKKKELPEGSVSM----IILL 371 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-----RFYSV 345 TDG+++ ++ A R ++ +G + FL+ A + + Sbjct: 372 TDGDSTEGETKLQKIHENVKAAIRGQYHLFCLGFGFDINYVFLERLALDNGGMARHIFEG 431 Query: 346 QNSR-KLHDAFLRIGKEMVKQR 366 ++ +L D + + ++ Q Sbjct: 432 LDAELQLQDFYQEVANPLLTQV 453 >gi|311252475|ref|XP_003125114.1| PREDICTED: anthrax toxin receptor 1-like [Sus scrofa] Length = 564 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTQ 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEELFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|301768024|ref|XP_002919431.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Ailuropoda melanoleuca] Length = 922 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 75/198 (37%), Gaps = 36/198 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G ++L ++ + L + + N G+V F S Sbjct: 306 VCLVLDKSGSM-----SGFNRLNRMNQAAKHFL-----LQTIENGSWVGMVHFDSTANIK 355 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + ++ E + + T G+ A+ I + + + Sbjct: 356 SNLIQIISSKERNNLLESLPK-AANGGTSICAGMRSAFQVI-----REVYPQIDGSE--- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFY 343 I+ LTDGE++S +E + GAI++ I + A ++ A + F+ Sbjct: 407 -IVLLTDGEDNSAKD-------CIDEVTQSGAIIHLIALGPSADQAVIEMSAMTGGNHFF 458 Query: 344 SVQNSRK--LHDAFLRIG 359 + ++ L DAF + Sbjct: 459 ASDEAQNNGLIDAFGALA 476 >gi|87308834|ref|ZP_01090973.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] gi|87288545|gb|EAQ80440.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] Length = 616 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 73/185 (39%), Gaps = 21/185 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW N+ H + I K + ++ ++ +LDVS SMN+ KL + + ++ Sbjct: 228 PW--NAEHRLVRIGIKGKEIANAERPASNLVFLLDVSGSMNNAR-----KLPLLKQGMKL 280 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 ++D + V VV +G +V I E ++RL G +T G+E Sbjct: 281 LVDQLGENDKVAIVVYAGAAG----MVLNSTNGDDKSTIMEALDRLQAGGSTNGGQGIEL 336 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 AY E+ KG + +I TDG+ + + + + + G + Sbjct: 337 AYQAAT------ENFIKGGVNR---VILCTDGDFNVGVTSTSDLVTMAADKAKSGVFLSV 387 Query: 322 IGVQA 326 +G Sbjct: 388 MGFGT 392 >gi|149200158|ref|ZP_01877182.1| batB protein [Lentisphaera araneosa HTCC2155] gi|149136799|gb|EDM25228.1| batB protein [Lentisphaera araneosa HTCC2155] Length = 621 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 59/176 (33%), Gaps = 27/176 (15%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 K ++ ++D+S SMN +L + +++++ I R GL Sbjct: 84 SQEKESSSRSILFLVDISKSMNVRDMNEQSRLEYSKWWAKKLMNDIPG-------DRFGL 136 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +TFS PL + ++ L+ G T L++A + Sbjct: 137 ITFSRIANIECPLTSEPDMVLLYLSDLNSSLLPGGGTNIAAALDHA-----------QKQ 185 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K ++ + ++ L+DGE +++ V I + + Sbjct: 186 FKENERDSRVVVLLSDGETDGNKW-----RESLEALQKKKIPVNVISLGDPKREGL 236 >gi|304411390|ref|ZP_07393004.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|307305288|ref|ZP_07585036.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] gi|304350245|gb|EFM14649.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|306911591|gb|EFN42016.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] Length = 771 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 75/188 (39%), Gaps = 28/188 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ S++ + ++++V+D S SM D + A ++ L +K N Sbjct: 381 VEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLKPEDSFN----- 429 Query: 219 GLVTFSSKIVQ--TFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ F+S + Q PL + + ++ ++RL T+ L+ A + L Sbjct: 430 -IIEFNSSLSQFSATPLPATSSNLSRARQFVSRLQADGGTEMALALDAAL------PKSL 482 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ + +IF+TDG + + E++ ++ +G+ + F+ Sbjct: 483 GSVSPDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGESR-----LFTVGIGSAPNSHFM 537 Query: 334 KNCASPDR 341 + A R Sbjct: 538 QRAAELGR 545 >gi|297190882|ref|ZP_06908280.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197722677|gb|EDY66585.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 518 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 71/212 (33%), Gaps = 36/212 (16%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV--NNVVR--- 217 + D ++ VLD S SM ++RE +D + D VR Sbjct: 322 PERDRAAQVIFVLDFSSSMRGER----------ITALRETIDGLAGGDDSPSGKFVRFYR 371 Query: 218 ---SGLVTFSSKIVQTFPLAWG----VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ F ++++ + + + ++ + F +T L++AY + Sbjct: 372 GETLTVMRFGGRVLEERNITYDGPRDLDRLRGVVASDDFAGSTAIWSSLDHAYRAVARDL 431 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC---NEAKRRGAIVYAIGVQAE 327 I+ +TDGEN++ + + +A+R Y I Sbjct: 432 VDRPERRVS-------IVLMTDGENNAGMDVDAFVRAHARLPEDARR--VRTYTIRYGEA 482 Query: 328 AADQFLKNC-ASPDRFYSVQNSRKLHDAFLRI 358 + + A+ R + R L AF I Sbjct: 483 DTRELDRGARATGGRMVDATD-RSLLSAFKEI 513 >gi|120554526|ref|YP_958877.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324375|gb|ABM18690.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 715 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 48/238 (20%), Positives = 87/238 (36%), Gaps = 39/238 (16%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 W A + + + D+ +V+D+S SM + + + V + Sbjct: 8 WVAAALLLISWQAGAQDPALSLPGNADVRIVVDISGSMKETDPQNLRRPAVRLLA----- 62 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHI--QEKINRLIFGSTTKST 256 +++P+ + GL TF + P +W I E+IN + T Sbjct: 63 ---RTLPEGASA---GLWTFGQYVNMLVPYGVVDQSWRDTAIERSEQINSVALH--TNLG 114 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCN--- 310 +E A N G ++I L+DG+ P D+ E + Sbjct: 115 LAMEKAANDWLS----------GGTLENTHLIVLSDGKVDVPGGDDASQAEEKRIVDSLL 164 Query: 311 -EAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K +GA ++ +G+ +A +FL+N A + F Q++ L+ AF V Q Sbjct: 165 PALKDKGATIHTVGLSEKADIRFLRNLARETGGSFQLAQSAEALNLAFANALNTAVPQ 222 >gi|326672458|ref|XP_695815.4| PREDICTED: von Willebrand factor A domain-containing protein 2 [Danio rerio] Length = 795 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 77/219 (35%), Gaps = 23/219 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F + + + +KI++ ++ + +D+ M+ + G + Sbjct: 13 LLFGQVWSIFSVQEIQTDLDTIMKINAAGEM-MQCSAAMDILFVMDSSYSVGKGGFERSR 71 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTT 253 + ++ + + D VR G+V F S L + K G +T Sbjct: 72 HYMLKLCEALDVRQDK---VRVGVVQFGSTPKLEVSLDSYKTKEELKKKLKKIHYRGGST 128 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ L++ K F + + ++ L+DG++ +E K Sbjct: 129 QTGLALKFVLRKGFSGGR--------NSSVPRVVVLLSDGKSQGAVQ------LPASELK 174 Query: 314 RRGAIVYAIGVQAEAADQF--LKNCASPDRFYSVQNSRK 350 G +++A+G++ D+ L + S + ++ Sbjct: 175 LSGVLLFAVGIRYPRWDELRELASGPSDSHVFFAEHFSD 213 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 59/188 (31%), Gaps = 22/188 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 LD++ VLD S + +T +V + GLV + Sbjct: 528 GQALDLVFVLDASSGVGKENFIHFQDFVRSTSV---------QFDINRDVAQVGLVVYGR 578 Query: 226 KIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + V F L + + F S + A + + A+ Sbjct: 579 RPVTVFDLDKYNSGSAVLRAVGDAAFLGGKAS---VGSALLHVLSQSLTVGKGARPG--V 633 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 K ++ LTDG +++ + + G ++ IGV + L+ S D Sbjct: 634 NKAVVVLTDG------TGVEDAAVPAQKIRDSGVSIFLIGVGDIQQELLLRISGSEDHMI 687 Query: 344 SVQNSRKL 351 +V + L Sbjct: 688 TVPSYDDL 695 >gi|320105085|ref|YP_004180676.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319752367|gb|ADV64127.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 323 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 82/246 (33%), Gaps = 37/246 (15%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 +P + P+ + ++ + + +DVS SM FG G Sbjct: 68 VNLAETLPALIL--AVVILMLAGPVRFSEPRTKRVMTN----IELCVDVSGSMMSPFGDG 121 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI-------Q 241 + ++ ++I + LD GL F + + PL V I + Sbjct: 122 -TRYDMSMKAIDKFLDS-------RRGDAFGLTFFGNNYLHWVPLTSDVSAIKCAPPFMK 173 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++ L T+ L + + +E + II ++DG + Sbjct: 174 PEVAPLWMSG-TEIGKALLGCRRTLVERQEG-----------DRAIILISDGASFDLGGG 221 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIG 359 N E KR G +VYAI + E D + C+ + ++ + L F RI Sbjct: 222 NDE--EVARLLKRDGIVVYAIHIDETEIPDPIVTICSITGGDAFAPDDPSALEAIFKRID 279 Query: 360 KEMVKQ 365 + + Sbjct: 280 QMTPTR 285 >gi|311253076|ref|XP_003125384.1| PREDICTED: matrilin-3-like [Sus scrofa] Length = 488 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 71/202 (35%), Gaps = 28/202 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD++ ++D S S+ + + ++D + R +V ++ Sbjct: 80 KSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRIIDNLDI---GAEDTRVAVVNYA 130 Query: 225 SKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S + F L Q ++ + R+ + T S ++ A ++ F + Sbjct: 131 STVKIEFHLQTHSDKQALKRAVARIAPLSTGTMSGLAIQTAMDEAFT---VEAGARGPNS 187 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + K I +TDG + A+ G +YA+GV A + L+ AS Sbjct: 188 NIPKVAIIVTDGRPQD------QVNEVAARARASGIELYAVGVD-RADMESLRLMASEPL 240 Query: 339 PDRFYSVQN---SRKLHDAFLR 357 + + V+ KL F Sbjct: 241 DEHVFYVETYGVIEKLSSRFQE 262 >gi|292627943|ref|XP_695559.4| PREDICTED: integrin alpha-11 [Danio rerio] Length = 1104 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 74/208 (35%), Gaps = 30/208 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ + + +L P ++ G+V + ++V Sbjct: 150 MDIVIVLDGSNSIYPWYE--------VQDFLINVLQKFYIGPGQ---IQVGVVQYGERVV 198 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 F L + + E + G E + G D KK +I Sbjct: 199 NEFRLD-DFRTVDEVV---AAAKNIDQRGGEETRTALGINVARTQAFKHGGRPDAKKVMI 254 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FLKNCAS---- 338 +TDGE + D+ + E+++ +Y I V + FL+ Sbjct: 255 VITDGE----SHDSPDLKAAVEESEKDNITLYGIAVLGYYNRRGINPEAFLREIKFIATD 310 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + F+SV + L D +G+++ Sbjct: 311 PDEHFFSVTDESALKDIVDALGEKIFSL 338 >gi|261251272|ref|ZP_05943846.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] gi|260938145|gb|EEX94133.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] Length = 436 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 77/214 (35%), Gaps = 20/214 (9%) Query: 14 KGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGK 73 +G IL LLP++ I M ++ S + A++ + + L E+ N Sbjct: 8 RGVAGILFIGLLPIMVIFMAFSMQMSQQMLAHARVLEAAEVASLALIASPKESEDDNVKY 67 Query: 74 KQK--NDFSYRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 ++ + + I ++ + R ++G Q + + + +HK + A Sbjct: 68 ARQLVDRYVVDNINDVDVEVYTRKCEYKDGCVQASGEVAPFSDFVVSAKAEHKSWI--AY 125 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 E+ F + P +D+ + D S SM H+ G Sbjct: 126 EEAELKPEFEVAGKSVTRKYLPQP--------------VDVYFIGDFSGSMTGHWKGGKT 171 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 KL V ++I +++ I + R L+ ++ Sbjct: 172 KLDVVKQTIERVVEDIADF-NTEEKSRVALLGYN 204 >gi|325845103|ref|ZP_08168414.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] gi|325488845|gb|EGC91243.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] Length = 315 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 47/328 (14%), Positives = 100/328 (30%), Gaps = 75/328 (22%) Query: 28 IFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIK-- 85 + I +GL I T+ + ++ + + + + + D ++ Sbjct: 15 LIIFLGLGIVTTITSYATDDVNQEQNIEVGPIVNEESTEGDEGQKDDSYEDLLDESVRTS 74 Query: 86 --NIWQTDFRNELRENGFAQDINNIERSTSLS-IIIDDQHKDYNLSAVSRYEMPFIFCTF 142 N +T + L ++ F +N++ + + I + RY++ Sbjct: 75 ESNKLETVIQTNLNQSLFVTSASNVKPQIDFTYLGITPINPMQGQEFTVRYKLT------ 128 Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P ++ + +++VLD S SM+ KL + ++ Sbjct: 129 ---------PNPFQHNISKPKE------IVLVLDGSGSMSG------TKLTNLKNAAKDF 167 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK------------------- 243 +D +K + ++ +V FSS G I+ Sbjct: 168 IDRLKGVDNLK----VAIVVFSSNATINPISVSGTTKIKSTDKSSESSIPNYKTLQNEYF 223 Query: 244 -----------INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 IN + T + GL A + KG K II ++D Sbjct: 224 LDINDSRLITMINNIDAQGGTNTGDGLRKAEYLLSQ---------KGDSVANKTIILMSD 274 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVY 320 G + + + + Y E K ++ Sbjct: 275 GLPTYYSGSTESGVNYYKEIKDDVVGIF 302 >gi|218529650|ref|YP_002420466.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium chloromethanicum CM4] gi|218521953|gb|ACK82538.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium chloromethanicum CM4] Length = 738 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/269 (15%), Positives = 91/269 (33%), Gaps = 37/269 (13%) Query: 103 QDINNIERSTSLSIIIDDQHKDYNLSAVSRY-EMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ + ER +L+ +D+ L+ + E P I A + ++T Sbjct: 282 EERSASERRITLADGATAADRDFELTWTAAPGEAPSIGLFREQVAGAEAVLAVVTPPESA 341 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S S + D++ V+D S SM + A S+ LD + + R ++ Sbjct: 342 SPASPVPRDVVFVIDNSGSMGGA------SMRQAKASLLIGLDRLGAGD------RFNVI 389 Query: 222 TFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F FP A + + + L T+ L+ A + Sbjct: 390 RFDHSFDTLFPDLVPADAGHLMRAKSFVAGLQASGGTEMLAPLQAALRDATPEETGRLRQ 449 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKN 335 ++FLTDG E+ + A RG + ++ +G+ + +++ Sbjct: 450 ----------VVFLTDG------AIGNEAQIFSAIATERGRSRLFMVGIGSAPNGYLMRH 493 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 A F + ++ + + ++ Sbjct: 494 AAEVGQGSFTQIDTPDQVTERMRALLVKL 522 >gi|159896782|ref|YP_001543029.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889821|gb|ABX02901.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 950 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 66/202 (32%), Gaps = 25/202 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV----- 216 + + + ++ ++D S SM+ G D + +DI K V Sbjct: 401 NRQQRPDIALVFIIDKSGSMDACHCNGGD-MAAREGGGTRKIDIAKEAVAQAAAVLGKDD 459 Query: 217 RSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + G+VTF T L + + + T G+ AY ++ + K++ Sbjct: 460 KLGVVTFDDSAHWTIELDKVPSQDDVVAALAPVPPSGQTNVVSGMNAAYEQLRQSDAKIK 519 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H I LTDG + +I + + G + + + + + Sbjct: 520 HA-----------ILLTDGWGHATDIGS-----IAENMNKDGITLSVVAAGNGSDNALQR 563 Query: 335 NCA-SPDRFYSVQNSRKLHDAF 355 R+Y + ++ F Sbjct: 564 YAELGGGRYYPARVMEEVPQIF 585 >gi|73949160|ref|XP_544264.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [Canis familiaris] Length = 893 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 70/199 (35%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ + ++ FS++I Sbjct: 253 VVFVLDSSASMVG------TKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 300 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V P V+ + I+ + T L+ A + D + H Sbjct: 301 KDHLVSVTP--DNVRDGKIYIHHMSPTGGTDINGALQRAIKLLND---YVAHNDIEDRSV 355 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 356 S-LIVFLTDGKPTVGETHTLKILNNTKEAARGQVCIFTIGIGDDVDFKLLEKLSLENCGL 414 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R +++ F Sbjct: 415 TRRVLEEEDAGSQLIGFYD 433 >gi|221640506|ref|YP_002526768.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] gi|221161287|gb|ACM02267.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] Length = 651 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 81/237 (34%), Gaps = 21/237 (8%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + + PW + + + + + + L+++ ++D S SM D Sbjct: 255 NGTPPFRPTLSVTRTPWNPETRLVHVALQGRM-PAIEDRPPLNLVFLIDTSGSMQDPA-- 311 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 KL + +S ML ++ V V +G S+ V A I ++RL Sbjct: 312 ---KLPLLKQSFGLMLGRLRPEDQVAIVTYAG----SAGEVLAPTAANQRSTILSALDRL 364 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G +T GL AY + G + + ++ TDG+ + D ++ Sbjct: 365 DAGGSTAGEEGLALAY--------RTASEMAGAGEVTR-VVLATDGDFNLGISDPEDLAR 415 Query: 308 YCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + G + +G D ++ A + L++A + ++ Sbjct: 416 LVAHERDTGVYLSVLGFGRGNLDDATMQALAQNGNGQAAY-IDSLNEAQKVLVDQLS 471 >gi|241672093|ref|XP_002411438.1| neurogenic locus notch, putative [Ixodes scapularis] gi|215504089|gb|EEC13583.1| neurogenic locus notch, putative [Ixodes scapularis] Length = 1597 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 81/201 (40%), Gaps = 31/201 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L S+++ S + D++ VLD S S+ P A + EM+ + + Sbjct: 29 QLFVSTIERYSTTKN--DIVFVLDESGSIGADVFP-------AELAFTEMVARLLVVSP- 78 Query: 213 NNVVRSGLVTFSSKI-----VQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKI 266 R ++TFS+ + ++N++ + T++ L YA + Sbjct: 79 -EYSRLTVMTFSNDNLVHIDQVGSSGDTNMCKFVHELNQIPYRSGGTRTREALGYAGEIL 137 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++A+ + + ++ ++DG+ +S + E + +G +++ +GV Sbjct: 138 WNAR----------QEANRIVVLISDGQANSGS----EPSEIARLLRVKGIVIFGVGVAH 183 Query: 327 EAADQFLKNCASPDRFYSVQN 347 D+ L +SP Y ++N Sbjct: 184 INKDELLDVASSPAHTYMLRN 204 >gi|47214246|emb|CAG12465.1| unnamed protein product [Tetraodon nigroviridis] Length = 467 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 72/212 (33%), Gaps = 46/212 (21%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P S SS + +D++ ++D S S+ + A +++++D ++ D Sbjct: 36 PSSTVSPASGSSCRNGPIDLVFIVDSSRSVRP------TEFEKAKEFLQDLVDSLEVGLD 89 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R GLV ++S PLA S T + + A K F A Sbjct: 90 S---TRVGLVNYAS-----TPLA----------------SGTMTGLAIRTAVEKAFAA-- 123 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + +TDG E A+ G +YA+GV A Sbjct: 124 -EAGARLNSTKVARVAVVVTDGRPQD------EVERVSAAARESGIEIYAVGVD-RADRT 175 Query: 332 FLKNCAS---PDRFYSVQN---SRKLHDAFLR 357 L+ AS D + V+ KL F Sbjct: 176 SLRLMASQPHEDHVFYVETYGVIEKLTSRFRE 207 >gi|281352223|gb|EFB27807.1| hypothetical protein PANDA_008059 [Ailuropoda melanoleuca] Length = 907 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 75/198 (37%), Gaps = 36/198 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G ++L ++ + L + + N G+V F S Sbjct: 307 VCLVLDKSGSM-----SGFNRLNRMNQAAKHFL-----LQTIENGSWVGMVHFDSTANIK 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + ++ E + + T G+ A+ I + + + Sbjct: 357 SNLIQIISSKERNNLLESLPK-AANGGTSICAGMRSAFQVI-----REVYPQIDGSE--- 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFY 343 I+ LTDGE++S +E + GAI++ I + A ++ A + F+ Sbjct: 408 -IVLLTDGEDNSAKD-------CIDEVTQSGAIIHLIALGPSADQAVIEMSAMTGGNHFF 459 Query: 344 SVQNSRK--LHDAFLRIG 359 + ++ L DAF + Sbjct: 460 ASDEAQNNGLIDAFGALA 477 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/73 (30%), Positives = 39/73 (53%), Gaps = 3/73 (4%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDA 354 + + + + CN AK +G +VY IG +A ++ LK+CAS D + V+ ++ DA Sbjct: 485 NTSTKDARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGL-EIRDA 543 Query: 355 FLRIGKEMVKQRI 367 F I + + R+ Sbjct: 544 FASIATSIRQLRL 556 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 40/260 (15%), Positives = 86/260 (33%), Gaps = 44/260 (16%) Query: 5 NIRNFFYNCKGSIS-ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +R F + G + + LL + V G+ ++ + L Y LD ++L A Sbjct: 24 QLRAFRQDDSGVLLKPMVGFLLS-MLAVGGIGVDLMRMERDRTILQYTLDRAVLAAA--- 79 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 + + + K + + E G + ID + Sbjct: 80 ----DLDQPLPPAAVVQDYLSKAGLNKYYTPPVAETGLGFK--------KVQSTIDTTFE 127 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + L S +MP + + +S GL++ +VLDVS SM Sbjct: 128 THMLKFSSGQDMPLY-------------------ATSRAEESIDGLEISLVLDVSGSMGS 168 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQ 241 + +L + ++ +D + ++N + ++ +++++ L + Sbjct: 169 N-----SRLANLKVAAKDFVDTM-IANTIDNKMSISIIPYATQVSLPTELMDQYNTTDEH 222 Query: 242 EKINRLIFGSTTKSTPGLEY 261 N + F + T L Sbjct: 223 AYSNCVNFVGSHFQTTALST 242 >gi|326435125|gb|EGD80695.1| hypothetical protein PTSG_01285 [Salpingoeca sp. ATCC 50818] Length = 1006 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 66/189 (34%), Gaps = 21/189 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--SS 225 +D+ ++DVS S+ + + + +P N VR L+T+ S+ Sbjct: 308 AVDVGFIVDVSGSVGAANFALVRDFIAS---------TVDMLPVGENTVRVALMTYHSSN 358 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F ++ + I+ L++ + + A N D + +G Sbjct: 359 MPQFDFDDSFDRATVVSAISSLVYDDSRQYGTATGSALNFFADNMLQASAGYRGGPA--- 415 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYS 344 + TDG + +L GA V IGV A + L+ A SP + Sbjct: 416 IVYVFTDGASQDDVTPGAAALQ------ATGAQVVTIGVTAAVNEAELQEIASSPSDVFI 469 Query: 345 VQNSRKLHD 353 V + L D Sbjct: 470 VADFDSLTD 478 >gi|238759128|ref|ZP_04620297.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] gi|238702676|gb|EEP95224.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] Length = 448 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 88/232 (37%), Gaps = 26/232 (11%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + G+I I LP+ ++ L E SH AKL ++ + L + + N Sbjct: 17 FIKDEIGAILWPFIIFLPLFIGLLYLSFEISHYLQKAAKLSDAIEQATLA----LTIENN 72 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 NN + + + + ++ R L F+ + +I + +Y Sbjct: 73 TNNPDETQTEKNISLVNAYA----RAYLPSESFSAPVIDIISHPNYIEYRAATTLNYTPK 128 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 +++ + I N TS +I+ D++ V+D S+SM+ +FG Sbjct: 129 FLTKELITNIDRRIIVSDNGVAIKNKFTSPGEIT-------DVVFVVDYSVSMDGNFGD- 180 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 +K + +R + + + + NN ++ + P +WG + I Sbjct: 181 -EKKTTKIQELRRIFEDLNNTILKNN---------NTHTIGFVPFSWGTKKI 222 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 15/84 (17%), Positives = 30/84 (35%), Gaps = 8/84 (9%) Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFY--------CNEAKRRGAIVYAIGVQAEAA 329 K + K +I ++DG + + D + C K + IG+ Sbjct: 344 KDSKNKDKLMIIISDGNDQEISSDLTQEKITKTLIEKGMCERIKENNIRMVFIGIAYTVK 403 Query: 330 DQFLKNCASPDRFYSVQNSRKLHD 353 + ++C +Y QN+ +L Sbjct: 404 EIKWEDCVGKRNYYEAQNAHELEA 427 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/73 (30%), Positives = 39/73 (53%), Gaps = 3/73 (4%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASPD-RFYSVQNSRKLHDA 354 + + + + CN AK +G +VY IG +A ++ LK+CAS D + V+ ++ DA Sbjct: 487 NTSTKDARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGL-EIRDA 545 Query: 355 FLRIGKEMVKQRI 367 F I + + R+ Sbjct: 546 FASIATSIRQLRL 558 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 40/260 (15%), Positives = 86/260 (33%), Gaps = 44/260 (16%) Query: 5 NIRNFFYNCKGSIS-ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +R F + G + + LL + V G+ ++ + L Y LD ++L A Sbjct: 26 QLRAFRQDDSGVLLKPMVGFLLS-MLAVGGIGVDLMRMERDRTILQYTLDRAVLAAA--- 81 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 + + + K + + E G + ID + Sbjct: 82 ----DLDQPLPPAAVVQDYLSKAGLNKYYTPPVAETGLGFK--------KVQSTIDTTFE 129 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + L S +MP + + +S GL++ +VLDVS SM Sbjct: 130 THMLKFSSGQDMPLY-------------------ATSRAEESIDGLEISLVLDVSGSMGS 170 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQ 241 + +L + ++ +D + ++N + ++ +++++ L + Sbjct: 171 N-----SRLANLKVAAKDFVDTM-IANTIDNKMSISIIPYATQVSLPTELMDQYNTTDEH 224 Query: 242 EKINRLIFGSTTKSTPGLEY 261 N + F + T L Sbjct: 225 AYSNCVNFVGSHFQTTALST 244 >gi|299139640|ref|ZP_07032813.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] gi|298598264|gb|EFI54429.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] Length = 488 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 81/253 (32%), Gaps = 32/253 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + F + S A S+ ++M++D+S SMN G + Sbjct: 73 LAGAFTMQSASGEAKPFYASAGSGPDAGQHTNVVLMIVDISGSMNQPMAGGSTRFQSLKS 132 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQTFP---LAWGVQHIQEKINRLIFGS-- 251 +I + L ++ D R +V F S +V T ++N L Sbjct: 133 AIAQFLAGMQEGSD-----RVAIVPFESHNVVSTIRSAVFTTHRADALAQLNALPAPGPK 187 Query: 252 -TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL---- 306 T + + + L+H + + ++I +TDG+N D+ + L Sbjct: 188 NNTALYQAVFSGVDSMKGELASLQHEGATLAELQPHLIVMTDGKNEVAPGDDPQLLNGDL 247 Query: 307 ---FYCNEAKRRGAIVYAIGVQ------AEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + IG A A + K RF+ ++ +L A Sbjct: 248 GLQQAVAQVQTSNLDTIGIGFGDKNDIDAGALQKLTK------RFFYASDANQLLAAL-H 300 Query: 358 IGKEMVKQRILYN 370 + + V I Sbjct: 301 VSRSAVSHSIQIT 313 >gi|294140611|ref|YP_003556589.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella violacea DSS12] gi|293327080|dbj|BAJ01811.1| inter-alpha-trypsin inhibitor domain protein [Shewanella violacea DSS12] Length = 747 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 75/192 (39%), Gaps = 25/192 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 LL+ +I + S I D+++V+D S SM+ + A +++ L + + Sbjct: 322 LLMLIPPEIGASSVIARDLILVIDTSGSMSGE------AIVQAKKAMGYALAGLGARDSF 375 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI- 266 N ++ F+S + + + + I L T+ P L A + Sbjct: 376 N------VIAFNSDVHALSAQSLAATAKNIGRANQFIRTLKADGGTEMGPALTRALDNGN 429 Query: 267 -FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + E K+ + F+TDG + N+ SLF E K + ++ IG+ Sbjct: 430 HSTSHQDEEDFDSDGVRLKQVL-FMTDG-----AVANERSLFNLIEDKIGHSRLFTIGIG 483 Query: 326 AEAADQFLKNCA 337 A F++ A Sbjct: 484 AAPNSHFMERAA 495 >gi|194289206|ref|YP_002005113.1| lipoprotein, von willebrand factor type a domain [Cupriavidus taiwanensis LMG 19424] gi|193223041|emb|CAQ69046.1| putative lipoprotein, Von Willebrand factor type A domain [Cupriavidus taiwanensis LMG 19424] Length = 570 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 81/237 (34%), Gaps = 34/237 (14%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW ++ + I ++S + +++ ++DVS SM + Sbjct: 169 PWHPSNVLLRIGIKGK-DMASGALPAANLVFLVDVSGSM-----------NTPDKLPLLK 216 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLE 260 + + + R LVT++S P G I I++L+ G +T G+ Sbjct: 217 SSLKLLVNQLRAQDRITLVTYASGTRVALPPTPGSDKGAIVAAIDQLVAGGSTAGASGIA 276 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + G + ++ TDG+ + D ++ + ++ G + Sbjct: 277 LAYQA------AQQSYIAGGINR---VLLATDGDFNVGVTDFRQLKSMVEDKRKSGVSLS 327 Query: 321 AIGVQAEA-ADQFLKNC--ASPDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 +G +Q ++ A + + N ++ I +++ Q Sbjct: 328 TLGFGTGNYNEQLMEQLADAGDGAYSYIDNLMEGNKVLVSEISSTLATIARDVKIQV 384 >gi|281423275|ref|ZP_06254188.1| BatB protein [Prevotella oris F0302] gi|281402611|gb|EFB33442.1| BatB protein [Prevotella oris F0302] Length = 342 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 68/205 (33%), Gaps = 29/205 (14%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + R + F KIS G+++++ LD+S SM Sbjct: 51 SKCRPTIKFWLLLSAMAILILMIARPQAG-TKISHDKRNGIEVIIALDISNSMLAEDVTP 109 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN--- 245 +L + I ++D + + GLV F+ P+ + + Sbjct: 110 -SRLEKSKLLIENLVDHFTN-------DKVGLVVFAGDAFVQLPITSDYVSAKMFLQNIK 161 Query: 246 -RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 LI T + A E ++ D+ K II +TDGE+ Sbjct: 162 PSLIATQGTD-----------LARAIELSQNSFMQRDNIGKAIIVITDGEDHEG-----G 205 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAA 329 +L A ++G+ V+ +G+ Sbjct: 206 ALEAAKAAHKKGSNVFILGIGDPKG 230 >gi|73745523|emb|CAI61969.2| putative TerY1 protein [Escherichia coli] Length = 239 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 45/225 (20%), Positives = 76/225 (33%), Gaps = 19/225 (8%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 P+ PL + + S K ++ L + ++LD S SM+ + Sbjct: 1 MPFYQIYKLWPLPENTLLFRSLKKELHLRRLPVYLLLDTSGSMHGE------PIEAVKNG 54 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ +L +K P ++TF S Q PL + ++ L TT Sbjct: 55 VQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLT---DLLSFQMPALTASGTTSLGEA 111 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L + I +K KG + +TDG SPN D ++ L A+ G Sbjct: 112 LTLTASSIAKEVQKTTADTKGDWRP--LVFLMTDG---SPNDDWRKGLNDFKAART-GV- 164 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 V A +A LK +S + F + + Sbjct: 165 VVACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFKWVSASIS 209 >gi|283778201|ref|YP_003368956.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436654|gb|ADB15096.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 786 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 72/205 (35%), Gaps = 33/205 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM K+ A ++R +L+ + N +V + S + Sbjct: 309 VIFVVDRSGSMQGK------KIEQAREAMRYVLNNLHEGDTFN------IVAYDSTVESF 356 Query: 231 FP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P + ++ L G +T + L+ A+ + D Sbjct: 357 KPELQKFDDATRKSALAYVDGLYAGGSTNISGALDSAFAMLT------------GSDRPN 404 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF--- 342 YI+FLTDG ++ + + + + A + GV + + L S + F Sbjct: 405 YILFLTDGLPTAGETNEGKIVELAKQKNVHRARMINFGVGYDVNSRLLDR-MSRENFGQS 463 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRI 367 V+ L + R+ +M + Sbjct: 464 QYVRPDENLEASVSRLYSKMSSPVL 488 >gi|284052693|ref|ZP_06382903.1| von Willebrand factor type A domain-containing protein [Arthrospira platensis str. Paraca] gi|291571888|dbj|BAI94160.1| von Willebrand factor type A [Arthrospira platensis NIES-39] Length = 541 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 71/207 (34%), Gaps = 29/207 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LDVS SMN ++L + + ++D + V V +G +V Sbjct: 180 LVFLLDVSGSMNQP-----NRLPLLKEGFKLLVDQLSEQDTVAIAVYAGAAG----VVLP 230 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 Q I I+ L +T G++ AY + ++ +I Sbjct: 231 PTPGNEKQKIIAAIDGLQAQGSTAGGEGIKLAYELATRMLSEGKNNR---------VILA 281 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQN 347 TDG+ + + E + + RG + +G D ++ + + + N Sbjct: 282 TDGDFNVGVSSDAELVRLIESYRDRGIYLTVLGFGMGNYKDSKMEKLSNHGNGNYAYIDN 341 Query: 348 --------SRKLHDAFLRIGKEMVKQR 366 S +L I +++ Q Sbjct: 342 LMEAKKVMSTELTGTLFTIAQDVKIQV 368 >gi|166031603|ref|ZP_02234432.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] gi|166028580|gb|EDR47337.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] Length = 685 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 82/214 (38%), Gaps = 28/214 (13%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 AN+ + ++ S+ +++ K + +DM V DVS SM+ L A + + + + Sbjct: 322 ANAKYVKQVVKSANQLNEKEALKVDM--VADVSGSMDGS------PLNEAKQVMSDFVGS 373 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 ++ D ++V L +FS+ + + + IN L+ G T L A + Sbjct: 374 VQF--DAGDLVE--LTSFSTGVCLEQEFSDDAATLTNDINNLVTGDMTSLYDALYTAVER 429 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +I TDG ++ N ++ + N R V+ IG+ Sbjct: 430 VAAQNGA------------RCVIAFTDGNDNYSNCTKEDVVNVAN---RYHVPVFIIGIG 474 Query: 326 AEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRI 358 + + +Y+V + + + I Sbjct: 475 SIDYADVNDIATQTGGMYYNVSDVTSMDKIYEEI 508 >gi|312070072|ref|XP_003137977.1| hypothetical protein LOAG_02391 [Loa loa] gi|307766862|gb|EFO26096.1| hypothetical protein LOAG_02391 [Loa loa] Length = 647 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 66/185 (35%), Gaps = 24/185 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + +K G+D++ +LD S S+ + ++ T ++ D V Sbjct: 60 TGTYPAKEGCGVDLLFLLDTSGSLEQIYTKHINWTTQLTEALLTDKDQ----------VH 109 Query: 218 SGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ ++ F L + I I + F T++ L A ++F K+ Sbjct: 110 IAMIQYAETPTIEFSLDTYRNPRDITNHIMTINFHSGGTRTGKALLAAKVELFSEKKGAR 169 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 A K I+ TDG + + + + L + K +Y + V ++ + + Sbjct: 170 KNA------SKIIVLFTDGLSVDDPVKHAQQLREVEKVK-----IYVVYVGSDGFEYEMD 218 Query: 335 NCASP 339 A Sbjct: 219 RIAGG 223 >gi|307720604|ref|YP_003891744.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978697|gb|ADN08732.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 631 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 74/206 (35%), Gaps = 28/206 (13%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFG 186 + + + P++ ++I +KS D+M+ LD+S SM + Sbjct: 48 TLTLKARNALFMLIAVLMTIALAGPVIKDGKIEIKAKSA---DIMIALDISDSMLAEDVY 104 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 P + ++ L++++ P+ R G++ F+ PL++ + + + + Sbjct: 105 P-----NRLKLAKQKALELLRLAPNE----RIGVIAFAKNSYLVSPLSFDHEAVAFLLKK 155 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 L S T+ L + + +K Y++ L+DG D K Sbjct: 156 LDTNSITEQGTDLMSMLQVVDKSIKKDSKK---------YLLILSDG------GDKKNFS 200 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 AK + V+ +GV Sbjct: 201 KEITFAKEKDIAVFVLGVGTPQGAPI 226 >gi|238790016|ref|ZP_04633794.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] gi|238721829|gb|EEQ13491.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] Length = 448 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 73/202 (36%), Gaps = 30/202 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 SK +++ +V+D S SM D++ A + ++++ + ++ +V Sbjct: 66 SKRRSPINLALVIDRSTSMTG------DRIEKAREAAILAVNMLDASDTLS------VVA 113 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P + KI + + T G+ ++ Sbjct: 114 YDNNAEVIIPATKVNNKPALIAKIQQHIHPMGMTALFAGVSKGIGQVDK----------- 162 Query: 280 HDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + + ++ II L+DG+ ++ E A ++G + IG+ + + + A Sbjct: 163 NLNPEQVNRIILLSDGQANTGPTSISELSDLARMAAKKGIAITTIGLGEDYNEDLMTAIA 222 Query: 338 --SPDRFYSVQNSRKLHDAFLR 357 S V NS L AF + Sbjct: 223 GYSDGNHSFVANSADLESAFTK 244 >gi|154492260|ref|ZP_02031886.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] gi|154087485|gb|EDN86530.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] Length = 339 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 61/174 (35%), Gaps = 20/174 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+++M+ LDVS SM D+L A + + ++ D + + Sbjct: 80 SKLETVKRQGVEIMVCLDVSNSMLAEDVSP-DRLSKAKQMLSKLTDGFSN-------DKV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+ F+ P+ + ++ + + + A N Sbjct: 132 GLIVFAGDAFTQLPITSDYVSAKMFLSSINPSMVSTQGTAIGAAIN-------LAMRSFT 184 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K II +TDGEN +++ A +G V +G+ Sbjct: 185 PSETSDKAIILITDGENHE-----DDAVKAAAAAAEKGIHVNIVGMGDPKGSPI 233 >gi|331089974|ref|ZP_08338865.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] gi|330403112|gb|EGG82675.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] Length = 4107 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 76/247 (30%), Gaps = 75/247 (30%) Query: 182 NDHFGPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSK----------- 226 G + KL ++ ++ SI D+ R LV F+S Sbjct: 218 RRDLGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASDESDNIGNDFIN 277 Query: 227 ---------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + +N LI T++ GL A + Sbjct: 278 NNYNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQRAFQLGGTREG--- 334 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQAEAA--- 329 +K ++F TDG+ +S N D S+ E K A++Y+IGV +A Sbjct: 335 -----AQKVVVFFTDGQPTS-NSDWSNSVAAAAITNAKELKDANALIYSIGVFRDANPND 388 Query: 330 --------DQFLKNCAS--------------------------PDRFYSVQNSRKLHDAF 355 + ++ +S D + + ++ +L++ F Sbjct: 389 TNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELNNIF 448 Query: 356 LRIGKEM 362 I ++ Sbjct: 449 NEISSDL 455 >gi|317403331|gb|EFV83845.1| hemolysin-type calcium-binding region [Achromobacter xylosoxidans C54] Length = 1141 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 71/195 (36%), Gaps = 22/195 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-GMDKLGVATRSIREMLDIIKSIPDVNNV 215 + + ++ +V+D S SM++ G G+ ++ + +++ + + +K + NV Sbjct: 619 GGTVTTVEPGKNYNIAIVVDTSGSMSEASGTKGLTRMQLTIDALKNLANTLKGHDGIVNV 678 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G + +S L A V + + I +L T + A + Sbjct: 679 ALIGFESTASTKYTINGLNASNVGDLIKAIEKLSASGGTNYEGAFDEAVKWF-----NKQ 733 Query: 275 HIAKGHDDYKKYIIFLTDGE----NSSPNIDNKESLFYCN--EAKRR---------GAIV 319 + ++ FLTDG+ N N D N + K + V Sbjct: 734 PTSSNGQAFENVTYFLTDGDPTFSNRGSNGDWWSGGSTTNYYDMKDAVDKFKGLSGKSTV 793 Query: 320 YAIGVQAEAADQFLK 334 +AIG+ + +LK Sbjct: 794 HAIGIGTGVNEAYLK 808 >gi|317502373|ref|ZP_07960539.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] gi|316896246|gb|EFV18351.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] Length = 4107 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 76/247 (30%), Gaps = 75/247 (30%) Query: 182 NDHFGPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSK----------- 226 G + KL ++ ++ SI D+ R LV F+S Sbjct: 218 RRDLGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASDESDNIGNDFIN 277 Query: 227 ---------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + +N LI T++ GL A + Sbjct: 278 NNYNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQRAFQLGGTREG--- 334 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQAEAA--- 329 +K ++F TDG+ +S N D S+ E K A++Y+IGV +A Sbjct: 335 -----AQKVVVFFTDGQPTS-NSDWSNSVAAAAITNAKELKDANALIYSIGVFRDANPND 388 Query: 330 --------DQFLKNCAS--------------------------PDRFYSVQNSRKLHDAF 355 + ++ +S D + + ++ +L++ F Sbjct: 389 TNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELNNIF 448 Query: 356 LRIGKEM 362 I ++ Sbjct: 449 NEISSDL 455 >gi|15602708|ref|NP_245780.1| hypothetical protein PM0843 [Pasteurella multocida subsp. multocida str. Pm70] gi|12721152|gb|AAK02927.1| TadG [Pasteurella multocida subsp. multocida str. Pm70] Length = 588 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 45/272 (16%), Positives = 93/272 (34%), Gaps = 47/272 (17%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT- 61 +IR F+ N G +++TA+L + +++G ++ + KA+L +D + L Sbjct: 11 CDSIRKFYENELGVYTVMTALLAFPLLVLIGFTVDGTGVVLDKARLAQGMDQAALALVAE 70 Query: 62 ---------------KILNQEN-----GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++++ ++ GN ++ + +I+ I + R+E Sbjct: 71 NNDYRENKKHGDVNRQVVSPQDKAKFGGNEFMAKQEKRNQELIQGIAKLYLRSENANASS 130 Query: 102 AQDINNIER-STSLSIIIDDQHKDYNLS----------AVSRYEMPFIFCTFPWCANSSH 150 I + S + +Y ++ +P A+ Sbjct: 131 DAPITIDKPFHYSCEELDLPTGNEYARRKPIVCEVQGGVNRKFWLPVSESLV--SADKLK 188 Query: 151 APLLITSSVKISSKSDIGL----DMMMVLDVSLSMNDHFGPGMDK---------LGVATR 197 + S + + G+ ++M+V D S SMN H + L Sbjct: 189 QDRIRMESDTSYAIKEKGIVIPVELMLVSDFSGSMNSHLQDKNGRSLGKTKITILREVVS 248 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 I ++L V+ R G TFS + Q Sbjct: 249 EISKILLPEDVSEGVSPFNRIGFTTFSGGVRQ 280 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 58/141 (41%), Gaps = 22/141 (15%) Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-LEHIAKGHDDYKKYIIFLTDGENS 296 + I ++N + T ++ GL N + D + +K + ++ I+ L+DGE++ Sbjct: 433 KDITRELNIVRPSGWTSASSGLLVGANIMMDENKSPDAKPSKLGTNIQRVILVLSDGEDN 492 Query: 297 SPNIDNKESLF---YCNEAKRR-------------GAIVYAIGVQA-EAADQFL--KNCA 337 P +L C++ + + G + + A+Q K C Sbjct: 493 WPTYSTLTTLLNNGMCDKIREQLGKLQDPNLRELPG-RIAFVAFGYSPPANQVAAWKKCV 551 Query: 338 SPDRFYSVQNSRKLHDAFLRI 358 D++Y+ + +L ++F +I Sbjct: 552 -GDQYYTAYSKEELLESFKQI 571 >gi|198245970|ref|YP_002216383.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940486|gb|ACH77819.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|326624134|gb|EGE30479.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 593 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 21/190 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM ++L + +++ +++ +++ ++ V +G + + Sbjct: 234 LVFLIDTSGSMQ-----PAERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALA 284 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I+ I+ L +T GL AY + + KG + I+ Sbjct: 285 STAGNNTTAIKAAIDNLDTYGSTGGEAGLRLAYE------QAEKGFIKGGVNR---ILLT 335 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQN 347 TDG+ + D K+ + + +G + +GV + + + A + + + Sbjct: 336 TDGDFNLGITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS 395 Query: 348 SRKLHDAFLR 357 + Sbjct: 396 LSEAQKVLKD 405 >gi|111114995|ref|YP_709613.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|216263812|ref|ZP_03435806.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] gi|110890269|gb|ABH01437.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|215979856|gb|EEC20678.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] Length = 333 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 79/206 (38%), Gaps = 27/206 (13%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 KDY L+ + + F++ + P + S K G D+++VLD+S SM Sbjct: 49 KDYRLNLMYFFTYSFLYLAAMVMVFALAGP---SVSKKKMIHLSAGADIVIVLDISPSMG 105 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++L + ++IK GLV F+ P+ + Sbjct: 106 AVEFSSKNRLEFSK-------ELIKRFISQRENDNIGLVAFAKDASIVVPITTDRDFFNK 158 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 K++ + G+ + G+ A + + K + K+ I+ LTDG +S Sbjct: 159 KLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRSIVVLTDGVVNSDE 207 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQ 325 I + + N A+ +Y+IG+ Sbjct: 208 IYKDQVI---NLAQGLNVRIYSIGIG 230 >gi|296225412|ref|XP_002758286.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Callithrix jacchus] Length = 860 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 48/303 (15%), Positives = 100/303 (33%), Gaps = 29/303 (9%) Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 I + + + + + + + F + F ++ ++ + + Sbjct: 186 ADIFEPQGISMLDAEASFITNDFLGSALTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSL 243 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + D+ ++ E P AP + K + V+DVS S Sbjct: 244 LNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------SVAFVIDVSGS 294 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQH 239 M KL ++ +L+ +K +N ++ SG V+ + +VQ P +Q Sbjct: 295 MAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQATPE--NLQE 346 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 347 AKMFVKSIDDRGMTNINDGLLRGISMLNKAREEH----RVPERSTSIVIMLTDGDANVGE 402 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-----SPDRFYSVQNSRKLHDA 354 ++ A R +Y +G FL+N A R Y ++ Sbjct: 403 SRPEKIQENVRNAIRGKFPLYNLGFGNNLNYNFLENMALENQGFARRIYEDSDADLQLQG 462 Query: 355 FLR 357 F Sbjct: 463 FYE 465 >gi|240137956|ref|YP_002962428.1| hypothetical protein MexAM1_META1p1287 [Methylobacterium extorquens AM1] gi|240007925|gb|ACS39151.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 735 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 91/269 (33%), Gaps = 37/269 (13%) Query: 103 QDINNIERSTSLSIIIDDQHKDYNLSAVSRY-EMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ + ER +L+ +D+ L+ + E P I A + ++T Sbjct: 281 EERSASERRITLADGATAADRDFELTWNAAPGEAPSIGLFRERVAGAEAVLAVVTPPETA 340 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + + D++ V+D S SM + A S+ LD + + R ++ Sbjct: 341 SPAASVPRDVVFVIDNSGSMGGA------SMRQAKASLLIGLDRLGAGD------RFNVI 388 Query: 222 TFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F FP A + + + L T+ L+ A + Sbjct: 389 RFDHSFDTLFPDLVPADAGHLMRAKSFVAGLQASGGTEMLAPLQAALRDATPEETGRLRQ 448 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKN 335 ++FLTDG E+ + A RG + ++ +G+ + +++ Sbjct: 449 ----------VVFLTDG------AIGNEAQIFSAIATERGRSRLFMVGIGSAPNGYLMRH 492 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 A F + ++ + + ++ Sbjct: 493 AAELGRGSFTQIDTPDQVTERMRALLVKL 521 >gi|254492197|ref|ZP_05105371.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxidans DMS010] gi|224462522|gb|EEF78797.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxydans DMS010] Length = 2740 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 38/157 (24%), Positives = 63/157 (40%), Gaps = 13/157 (8%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSI 209 P+ + ++ ++MM+LDVS SMND GM +L V +S E+LD + Sbjct: 2062 VPVASPEENSGEATLEVNTNLMMILDVSGSMNDSANFQGMTRLQVMIKSSLELLDQYDAY 2121 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 D V ++TF++ V + I L G T L A N Sbjct: 2122 GD----VMVNIITFATSASNPSGGWVTVDQAKAIILGLTAGGNTNYDDALNDAINAFALG 2177 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + + + F++DGE +S N+ N ++ Sbjct: 2178 GKLGDGQNISY--------FMSDGEPNSNNVSNSATV 2206 >gi|153815168|ref|ZP_01967836.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] gi|145847427|gb|EDK24345.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] Length = 4109 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 76/247 (30%), Gaps = 75/247 (30%) Query: 182 NDHFGPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTFSSK----------- 226 G + KL ++ ++ SI D+ R LV F+S Sbjct: 220 RRDLGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASDESDNIGNDFIN 279 Query: 227 ---------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + +N LI T++ GL A + Sbjct: 280 NNYNRSQIVTELKSYTTKNISDLTSTVNSLIAAGATRADFGLNQAQRAFQLGGTREG--- 336 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQAEAA--- 329 +K ++F TDG+ +S N D S+ E K A++Y+IGV +A Sbjct: 337 -----AQKVVVFFTDGQPTS-NSDWSNSVAAAAITNAKELKDANALIYSIGVFRDANPND 390 Query: 330 --------DQFLKNCAS--------------------------PDRFYSVQNSRKLHDAF 355 + ++ +S D + + ++ +L++ F Sbjct: 391 TNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELNNIF 450 Query: 356 LRIGKEM 362 I ++ Sbjct: 451 NEISSDL 457 >gi|319784280|ref|YP_004143756.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170168|gb|ADV13706.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 643 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 82/219 (37%), Gaps = 31/219 (14%) Query: 20 LTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDF 79 +T + + + + + ++ + K + LD + TA ++ + + DF Sbjct: 1 MTVVAMVPLMGALAMAVDFTEMSREKQAVSNALDAANFATARRLTEGATDDQLRAYALDF 60 Query: 80 SYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL---SAVSRYEMP 136 + I + T+L++ + L +A Y+ P Sbjct: 61 FNANLNKINPAN--------------------TTLTVTLPSNTTGGGLLKMTARLDYK-P 99 Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDI--GLDMMMVLDVSLSMNDH-FGPGMDKLG 193 + + F S S I+S+ + L++ +VLD S SM G G ++ Sbjct: 100 YFYPVFGQLVGKSETDANQRISFNITSEVRLKNTLEVALVLDNSGSMTKTGTGSGQTRID 159 Query: 194 VATRSIREMLDIIKS----IPDVNNVVRSGLVTFSSKIV 228 + + ++++D + I V+ V+ GLV F++ + Sbjct: 160 LLKTAAKQLVDTLAQQAAMIKQVDRPVQFGLVPFAASVN 198 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 60/224 (26%), Gaps = 79/224 (35%) Query: 223 FSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +S PL G+ I+ I+ + T G+ + + + + + Sbjct: 418 YSCSTNPITPLTDVSVTTGLTAIKAAIDLMKPDGGTNVPEGMAWGWRVVSSGEPFTQGRP 477 Query: 278 KGHDDYKKYIIFLTDGENSSPN------IDNKESLFY----------------------- 308 + K +I LTDG N+ D +S Sbjct: 478 ETERGNDKVVIVLTDGANTYYTPSSLSHSDPADSKSTYASFGYLNPGYNGTSVGRLFMGT 537 Query: 309 -------------------------CNEAKRRGAIVYAIGVQAEAAD-------QFLKNC 336 CN AK +V + + + LK+C Sbjct: 538 SSAIGQFDYSNGNYTNALNEQMATLCNNAKAANIMVMTVALDLSTTKTADQQAIEALKSC 597 Query: 337 ASPDRF-YSVQNSR------------KLHDAFLRIGKEMVKQRI 367 +S RF ++ L + F IG E+ RI Sbjct: 598 SSNSRFRKDPTDASKPAKLFWNATGASLSNDFKEIGNELSNLRI 641 >gi|327403933|ref|YP_004344771.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319441|gb|AEA43933.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 375 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 69/191 (36%), Gaps = 31/191 (16%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREML 203 + S K++ + LD+++ LD+S SM D G + +L A ++I E+L Sbjct: 106 FIILAMAQPVAGSRKVNGSKRV-LDLVICLDISNSMNTQDMGGNDVSRLTAAKQAIGELL 164 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIFGSTTKSTPGL 259 + +K R +V F++ PL + I +I T L Sbjct: 165 NQLKGE-------RIAVVIFANDAYTQLPLTMDYGAAKLFIPDIETSMISDQGTNVGRAL 217 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E A + D + K I+ +TDGE+ E K++ + Sbjct: 218 EIAQEQFKD------------TESGKAILVITDGEDHEALW-----KEQIAELKKKNVEL 260 Query: 320 YAIGVQAEAAD 330 +G+ + Sbjct: 261 TYLGLGSSKGG 271 >gi|221039656|dbj|BAH11591.1| unnamed protein product [Homo sapiens] Length = 237 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 45/215 (20%), Positives = 80/215 (37%), Gaps = 43/215 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G D+L ++ + L + V N G+V F S Sbjct: 20 VCLVLDKSGSMG-----GKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIV 69 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + G T G++YA+ I H + Sbjct: 70 NKLIQIKSSDERNTLMAGLPTYPLGG-TSICSGIKYAFQVIG-----ELHSQLDGSE--- 120 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 ++ LTDGE+++ + +E K+ GAIV+ I + A + + FY Sbjct: 121 -VLLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGGSHFY 172 Query: 344 SVQNSRK--LHDAF-------LRIGKEMVKQRILY 369 ++ L DAF + ++ ++ R+L Sbjct: 173 VSDEAQNNGLIDAFGALTSGNTDLSQKSLQVRVLI 207 >gi|114587338|ref|XP_516521.2| PREDICTED: inter-alpha (globulin) inhibitor H3 isoform 2 [Pan troglodytes] Length = 865 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 99/283 (34%), Gaps = 24/283 (8%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + + + + + ++ + F + F ++ ++ + + + Sbjct: 163 IFEPQGISMLDAEASFITNDLLGSTLTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSLLN 220 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 D+ ++ E P AP + K ++ V+D+S SM Sbjct: 221 GDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------NVAFVIDISGSMA 271 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQ 241 KL ++ +L+ +K +N ++ SG V+ + +VQ P +Q + Sbjct: 272 G------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQATPE--NLQEAR 323 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 324 TFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RIPERSTSIVIMLTDGDANVGESR 379 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ A +Y +G FL+N A + ++ Sbjct: 380 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 422 >gi|301784617|ref|XP_002927724.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Ailuropoda melanoleuca] Length = 898 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 68/199 (34%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ + ++ FS++I Sbjct: 253 VVFVLDSSASMVG------TKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 300 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V P V+ + I+ + T L+ A + D + H Sbjct: 301 KDHLVSVTP--DNVRDGKVYIHHMSPTGGTDINGALQRAIKLLND---YVAHNDIEDRSV 355 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 IIFLTDG+ + + L EA R ++ IG+ + L+NC Sbjct: 356 S-LIIFLTDGKPTVGETHTLKILNNTKEAARGQICIFTIGIGNDVDFMLLEKLSLENCGL 414 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R ++ F Sbjct: 415 TRRVLEEDDAGAQLIGFYD 433 >gi|281346829|gb|EFB22413.1| hypothetical protein PANDA_017530 [Ailuropoda melanoleuca] Length = 895 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 68/199 (34%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ + ++ FS++I Sbjct: 251 VVFVLDSSASMVG------TKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 298 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V P V+ + I+ + T L+ A + D + H Sbjct: 299 KDHLVSVTP--DNVRDGKVYIHHMSPTGGTDINGALQRAIKLLND---YVAHNDIEDRSV 353 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 IIFLTDG+ + + L EA R ++ IG+ + L+NC Sbjct: 354 S-LIIFLTDGKPTVGETHTLKILNNTKEAARGQICIFTIGIGNDVDFMLLEKLSLENCGL 412 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R ++ F Sbjct: 413 TRRVLEEDDAGAQLIGFYD 431 >gi|149624864|ref|XP_001517479.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1), partial [Ornithorhynchus anatinus] Length = 249 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 73/197 (37%), Gaps = 28/197 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ H + + ++++ + P N R G+V ++S + Sbjct: 27 DLVFIVDSSRSVRPH------EFEKVKVFLTQVIESLDVGP---NATRVGVVNYASAVKH 77 Query: 230 TFPLAWGVQHIQ--EKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FPL + + RL + T + +++A ++ F E + K Sbjct: 78 EFPLKAHRSKASLLQAVRRLEPLSTGTMTGLAIQFAISRAFSEVEGARPL---SPALSKV 134 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I +TDG D A+ G ++AIGV L+ AS + Sbjct: 135 AIVVTDGRPQDDVKDVSA------RAREAGIELFAIGVG-RVDKTTLRRIASEPLAEHVD 187 Query: 344 SVQN---SRKLHDAFLR 357 V++ KL F Sbjct: 188 YVESYSVIEKLAKKFQE 204 >gi|325695689|gb|EGD37588.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK150] Length = 460 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 50/334 (14%), Positives = 107/334 (32%), Gaps = 67/334 (20%) Query: 19 ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKND 78 I+ +L+ +I +++G++ + F +L + S+ + + GK Sbjct: 15 IMAIMLMSMIAVIIGIIF---NTMFSGRELVER-EASIQAEMRTSMQYVDRTIGKATSIF 70 Query: 79 FSYRIIKNIWQTDFRNELRENGFAQD-----------------INNIERSTSLSIIIDDQ 121 + E G ++D I ++ + ++ +D + Sbjct: 71 ILDDSKYQGSEKGLTKEWSYIGLSEDGKKVRNYVWNKSKQNWDITDLGTKSLYNMKLDLE 130 Query: 122 HKD---YNLSAVSRYEMPFIF------CTFPWCANSSHAPLLITSSVKI----------- 161 K Y + + Y + + + ++ ++ + + K Sbjct: 131 FKADDLYKDNRLISYNLTGKYPDTSNKLSIDTAISALNSKQVFSKVAKGKKGIAIAYRND 190 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHF-------GPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + + + + V D S SM ++ + M+ +KSI +V+ Sbjct: 191 PIEGQMNVSISFVFDKSGSMAWDMLGKEVEETNRPSRMKILKEKSIAMMKDLKSIGNVS- 249 Query: 215 VVRSGLVTFSS----KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 LV FS+ L G I+ IN+L G T GL Y + Sbjct: 250 ---VNLVAFSTLGSYVQEDFSELDKGTTTIETSINKLDEGGYTNPGDGLRYGMVSLQKNP 306 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 +L KY++ LTDG ++ + Sbjct: 307 AQL-----------KYVVLLTDGVPNAFTAKTND 329 >gi|312200955|ref|YP_004021016.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311232291|gb|ADP85146.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 618 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 78/249 (31%), Gaps = 44/249 (17%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM------MMVLDVSLSMNDHFGP-- 187 P + + P + I++ + + + + V D S SM+ Sbjct: 378 PKLTESLGLVPTLRTEPRAVLDGKAIAAARNTFIGIHRRGNTLAVYDTSGSMDLPVANSG 437 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-------------LA 234 G +L +A + +IP R GL FS+ + T P + Sbjct: 438 GKTRLQIAVGAAD------AAIPLFAKDSRLGLWQFSTNLDGTKPYRELVPVGLMNDEVG 491 Query: 235 WGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 G + + +N L T A+ + K + ++ LTD Sbjct: 492 TGTREEALVAAVNGLKAKGGTGLYATALAAFESLS----AQYQPDKPNQ-----VVLLTD 542 Query: 293 GENSSPNIDNKESLFYCN-EAK---RRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQ 346 G+N P + +A+ + + IG A+A L+ A+ + Y Q Sbjct: 543 GQNDDPTSSMTLTQLIATLKAEYNPKAPVHIITIGYGADADMDALRQISAATGSKTYPAQ 602 Query: 347 NSRKLHDAF 355 + + Sbjct: 603 DPNSIFQVM 611 >gi|46127789|ref|XP_388448.1| hypothetical protein FG08272.1 [Gibberella zeae PH-1] Length = 774 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 69/197 (35%), Gaps = 24/197 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD--IIKSIPDVNNVVRSGLVTFSSKI 227 D+++V+DVS SM + A S+ ++ I +N R +VTF+SK Sbjct: 88 DIVLVIDVSGSMGQPAPVPGEDQESAGLSVLDLTKHAARTIIESMNENDRLSIVTFASKA 147 Query: 228 VQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + + + T G+ A + + + + Sbjct: 148 KVLQPLLPMNQDNKTRAIKNVKSMEPRDATNLWQGMLEAIKQFNTDE------SSPNVPA 201 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG---AIVYAIGVQAEAADQFLKNCA--S 338 I+ LTDG + N ++ + + + G A ++ G LK+ A Sbjct: 202 ---IMILTDGMPNHMN----PAVGFVPKIRNMGPLPASIHTFGFGYSLKSDLLKSIAEIG 254 Query: 339 PDRFYSVQNSRKLHDAF 355 + + ++ + F Sbjct: 255 NGNYAFIPDAGMIGTVF 271 >gi|11498366|ref|NP_069594.1| hypothetical protein AF0760 [Archaeoglobus fulgidus DSM 4304] gi|2649856|gb|AAB90485.1| predicted coding region AF_0760 [Archaeoglobus fulgidus DSM 4304] Length = 959 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 67/215 (31%), Gaps = 53/215 (24%) Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ----TFPLAWGVQ---HIQ 241 M ++ A + +++ GL TF++ + PL + + + Sbjct: 608 MKRMDAAKLAAITFNNMLG------EGDFVGLATFTTYAERISVNQTPLKYMTKDKLRVN 661 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIF---DAKEKLEHIAKGHDDYKKYIIFLTDGENS-- 296 +I L T L + D + + +I LTDGE + Sbjct: 662 NEIEGLYAKLATDHADALYWGVKVFPIWPDETQNNCTECINNTRP--LMILLTDGETTTC 719 Query: 297 ------------------SPNIDNKESLFYCNEAKRR------GAIVYAIGVQAE---AA 329 + + +++L + KR + IG + Sbjct: 720 DKNEDYFNNTCKNKCVRDNGHYGAQQALCVADYIKRNIKVNGFNIPICTIGFGTDIGSDG 779 Query: 330 DQFLKNCASP--DR----FYSVQNSRKLHDAFLRI 358 FL++ ASP D ++ S +L +A+ I Sbjct: 780 QAFLRDIASPRPDNGEACYFFATTSEELIEAYKTI 814 >gi|299138555|ref|ZP_07031734.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599801|gb|EFI55960.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 381 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 85/225 (37%), Gaps = 36/225 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + ++ D+ + + +++D S SM + + LD++K ++ Sbjct: 128 VAQEIRGFKHEDLPVSLGILIDSSGSMYEKSAAVNEA----------SLDLVKLSNPLDE 177 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 LV FSS+ + +Q+ + + T + + + + Sbjct: 178 AF---LVDFSSEAYIDQDFTNSIAKLQQGLAYIHTSGGTALYDAVVASADYL-------- 226 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--------- 325 +K K+ ++ +TDGE+++ + + ++ + G +Y IG+ Sbjct: 227 --SKNAKHPKQVLLIVTDGEDNASSASLESAIRRVQDL--DGPAIYCIGLLFGDDVSRSE 282 Query: 326 AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A+ A + L+ A + + Y ++ + + + +++ Q + Sbjct: 283 AKHAREVLQELAQQTGGQAYFPKSLKDVDGLTREVAQDIRTQYTI 327 >gi|149181776|ref|ZP_01860267.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] gi|148850517|gb|EDL64676.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] Length = 949 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 61/272 (22%), Positives = 92/272 (33%), Gaps = 74/272 (27%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + +T + +++ +D++ V D S SM D FG G+ K A +++E L Sbjct: 55 LDIEVTPKGQATNEERKPIDVVFVHDTSGSMKDSFG-GVKKATSAENALKESLRFFNQNQ 113 Query: 211 DVNN---VVRSGL-VTFSSKIVQTFPLAWGVQHI---QEKINRLIFG------------- 250 + V V++ + + A G+ I E ++ Sbjct: 114 QSKDKYFFVPFDSDVSYKNYGDKRIQPAEGLSDILPMAEHLDFSEAYWVKKYSWYYGYYW 173 Query: 251 ---------STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 T T LEYA +K G D K+YIIFLTDGE +S N D Sbjct: 174 SQEIFDFSVGGTNYTQSLEYALSKFS-----------GMRDSKRYIIFLTDGEPTSLNHD 222 Query: 302 NKESLFYCNEAKRRG----------------------------AIVYAIGVQAEA--ADQ 331 NK+ Y N R G +Y+I Q Sbjct: 223 NKQYTLYTNGTARAGNVNANYNDVQKFIHEKAVASAEKLGVNDVKMYSIAFAEPGEVNYQ 282 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRIGKE 361 L+N + + R N L + F I KE Sbjct: 283 LLENMSNKTGGRAIQA-NPNSLSNVFTDISKE 313 >gi|218438801|ref|YP_002377130.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218171529|gb|ACK70262.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 573 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 72/185 (38%), Gaps = 32/185 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+ + + +M V+D S SM+ G ++ + R + ++ N V G Sbjct: 389 KLQKDAGKTVYLMTVIDTSGSMD---GAPLEAVKKGLRIASKEINP-------GNYV--G 436 Query: 220 LVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LVT+ + + PL + I+ L T G+ +K+ + K Sbjct: 437 LVTYGDRAAEVVPLGLFDELQHKRFLAAIDNLRADGATAMYDGMMIGLSKLMEQK----- 491 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 K + D + Y++ LTDG+ + + KE + Y G VY I + + L Sbjct: 492 --KNNPDGRFYLLLLTDGQANMGVTFDEVKEVIEY------SGVRVYPIAYG-DVNQEEL 542 Query: 334 KNCAS 338 + AS Sbjct: 543 EAIAS 547 >gi|149727851|ref|XP_001493364.1| PREDICTED: anthrax toxin receptor 1 [Equus caballus] Length = 603 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 44/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G + A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFQRASEQIYYENSQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|118350692|ref|XP_001008625.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89290392|gb|EAR88380.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 648 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 75/196 (38%), Gaps = 29/196 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V+D S SM K+ + ++ ++++++ S+ R +V F+ Sbjct: 222 VDLVVVIDKSGSMEGE------KIQLVKETLVKIINLMSSMD------RICIVCFNESGD 269 Query: 229 QTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + Q + I ++ G T + G+ +A I + K + Sbjct: 270 RPLTFTRVTDENKQTLLNLIQQIYAGGGTNISEGINHALKAIQN------RKFKNNVTS- 322 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRF 342 I+ L+DG+++ K + + + IG + + L+ + F Sbjct: 323 --ILLLSDGQDTKAYTRVKAYID--KYQIKDAFNIETIGFGEDHDPKLLRTLSDLRNGTF 378 Query: 343 YSVQNSRKLHDAFLRI 358 +Q+ L AF+ I Sbjct: 379 NFMQDVNYLDTAFINI 394 >gi|281202341|gb|EFA76546.1| Ubiquitin-conjugating enzyme [Polysphondylium pallidum PN500] Length = 561 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 62/345 (17%), Positives = 114/345 (33%), Gaps = 23/345 (6%) Query: 22 AILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSY 81 AI + +IF + + V L I+ + + + + Q + S Sbjct: 2 AITINIIFQNKKIALPFKKTQKVHELLEEIIKRAAITS---------IPADQLQLLNDSS 52 Query: 82 RIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT 141 + K D + I + +T+ + D+ S + T Sbjct: 53 ELFKEDTLEDLGINDNVELILKQITSAAAATTTTTPTDNNTSSITNIVNSTRRLSIGDIT 112 Query: 142 FPWCANSSHAP-LLITSSVKISSKSDIGLDMMMVLDVSLSM-----NDHFGPGMDKLGVA 195 N++ P + I +V + S+ +D+ +VLDVS SM PG ++ Sbjct: 113 DSNNNNNNTEPFIPIKPTVSVFSEKIKQIDI-IVLDVSGSMKAAAYAGSKVPGELEMTRI 171 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST-TK 254 + I GLV F +I TF ++ + T+ Sbjct: 172 EVAQALFQTFIDKYVQQEIPACVGLVCFGERIDLTFQPTRNFDSFSTELGDVDANQAKTR 231 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 ++ A I K K DD + LTDG+++S + D + Y K Sbjct: 232 LYEAIKLAAETIVSYKNKHPADILLSDDLNCRVFALTDGQDNSGS-DPYKVFTY---LKE 287 Query: 315 RGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 ++ AI A + L A+ + + +S+ + F R Sbjct: 288 HNIVLDAIPCGNGADKEALGTFTKATGGSCFIIDSSQAGVELFER 332 >gi|308472959|ref|XP_003098706.1| hypothetical protein CRE_04221 [Caenorhabditis remanei] gi|308268306|gb|EFP12259.1| hypothetical protein CRE_04221 [Caenorhabditis remanei] Length = 399 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 44/241 (18%), Positives = 92/241 (38%), Gaps = 22/241 (9%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG---LDMMMVLDVSLSMNDHFGP 187 + + F +++P S V +D+ LD+++V+D S M Sbjct: 4 PTFLVFFFLLAIHRVYTDTYSPYSPLSYVDRPCGTDLSNLWLDVVLVVDNSHGMT---NE 60 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKIN 245 G+ + + SI I + + R GL+T++++ Q L ++ +N Sbjct: 61 GLANVSSSILSIFGNGTRIGTNLTEHRTTRVGLITYNAEATQIADLNVLQSFFNLTNHVN 120 Query: 246 RLIFGSTTKST-----PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 + + ST GL+ AY+ + + Y+K +I ++ + N Sbjct: 121 SSLA-EVSNSTWSFDKVGLKAAYDLL----QNQSFPPNSRSHYQKVVILFA-SDSQAQNS 174 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN---SRKLHDAFLR 357 + + + K G + +G E + L N +SP+ + + + K+ A L+ Sbjct: 175 EELDPYPMDYQLKDAGVKIVTVGYGNETLLERLSNISSPEYAFDGYDKGVTAKVQAALLK 234 Query: 358 I 358 I Sbjct: 235 I 235 >gi|86360183|ref|YP_472072.1| hypothetical protein RHE_PC00139 [Rhizobium etli CFN 42] gi|86284285|gb|ABC93345.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 671 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 77/212 (36%), Gaps = 24/212 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ PW ++ + I + + +++ ++DVS SM++ DKL Sbjct: 268 FKATVTVMPTPWNHDTQLMHVAIKGYDIAPATAPHA-NLVFLIDVSGSMDEP-----DKL 321 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFG 250 + + R +++ +K+ V+ +VT++ I I+RL G Sbjct: 322 PLLKSAFRLLVNKLKADDTVS------IVTYAGNAGTVLEPTRVAEKSKILSAIDRLEAG 375 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+ AY L A D + + TDG+ + +++ Sbjct: 376 GSTGGAEGIAAAY--------DLAKKAFVKDGVNRVM-LATDGDFNVGPSIDEDLKRIIE 426 Query: 311 EAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 E ++ G + +G D ++ A Sbjct: 427 EKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGN 458 >gi|295840348|ref|ZP_06827281.1| secreted protein [Streptomyces sp. SPB74] gi|295827934|gb|EFG65721.1| secreted protein [Streptomyces sp. SPB74] Length = 418 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 82/227 (36%), Gaps = 31/227 (13%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGL---DMMMVLDVSLSMNDHFGPGMDKLGVAT 196 T + LL + + + + D G + +VLDVS SM G ++ A Sbjct: 1 MTAAGGVLAVALALLPGGTARAADEPDTGKAPPQVELVLDVSGSMRAKDIDGASRMSAAK 60 Query: 197 RSIREMLDIIKSIPDVNNVVRS-GLVTFSSKIVQTFPLAW--------GVQHIQEKINRL 247 +S E+LD + DV +R+ G + + L Sbjct: 61 QSFNEVLDAV--PEDVELGIRTLGADYPGEDRETGCKDTRQLYPVGHPDRTEAKAAVATL 118 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T P L A + +G + K+ I+ +TDGE++ + + Sbjct: 119 SPTGWTPIGPALLGAAEDL-----------RGGEAAKR-IVLITDGEDTCRRDPCEVARE 166 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKL 351 AK +V +G+ +A + +C A+ + +V++++ L Sbjct: 167 IA--AKGVHLVVDTLGLVPDAKTRDQLSCIAEATGGTYTTVRHTKDL 211 >gi|119585666|gb|EAW65262.1| inter-alpha (globulin) inhibitor H3, isoform CRA_a [Homo sapiens] Length = 670 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 99/283 (34%), Gaps = 24/283 (8%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + + + + + ++ + F + F ++ ++ + + + Sbjct: 188 IFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSLLN 245 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 D+ ++ E P AP + K ++ V+D+S SM Sbjct: 246 GDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------NVAFVIDISGSMA 296 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQ 241 KL ++ +L+ +K +N ++ SG V+ + +VQ P +Q + Sbjct: 297 G------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQATPE--NLQEAR 348 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 349 TFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RIPERSTSIVIMLTDGDANVGESR 404 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ A +Y +G FL+N A + ++ Sbjct: 405 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|109039132|ref|XP_001085463.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Macaca mulatta] Length = 891 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 109/306 (35%), Gaps = 30/306 (9%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 169 MYLKVQPKQLIKHFEID--VDIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSF 226 Query: 102 --AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 + D + + S++ D Y+++ S + + F P+ Sbjct: 227 KPSLDQQRSCPTCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPKGLPV------ 280 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + ++ V+D+S SM KL ++ +L+ +K +N ++ SG Sbjct: 281 -------VPKNVAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSG 327 Query: 220 LVT-FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 V+ + +VQ P +Q + + + T GL + + A+E+ + Sbjct: 328 DVSTWKEHLVQATPE--NLQEAKTFVKSMEDKGMTNINDGLLRGISMLNKAREEH----R 381 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + +I LTDG+ + ++ A +Y +G FL+N A Sbjct: 382 VPERSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMAL 441 Query: 339 PDRFYS 344 + ++ Sbjct: 442 ENHGFA 447 >gi|330506652|ref|YP_004383080.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] gi|328927460|gb|AEB67262.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] Length = 551 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 61/168 (36%), Gaps = 27/168 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K + + + V DVS SM+ L +S+ E + +N++ GLV++ Sbjct: 371 KKNANICAVFVADVSGSMDGE------PLNNLKKSLLEG----QKYIGKDNLI--GLVSY 418 Query: 224 SSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S + P++ + + L G T + G+ A + + + Sbjct: 419 SDDVYINLPISRFDLNNRSYFVGAVGGLQAGGATATFDGIAVAMKML-----EEQLALDP 473 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K I L+DGE + + N + G +Y IG A+ Sbjct: 474 KLKPK--IFVLSDGETNRGHSLNDIRKLV----EESGIPIYTIGYNAD 515 >gi|207029558|ref|NP_001125590.1| inter-alpha-trypsin inhibitor heavy chain H3 [Pongo abelii] Length = 879 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 69/175 (39%), Gaps = 13/175 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQ 229 + V+D+S SM KL ++ +L+ +K +N ++ SG V+ + +VQ Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKKEDYLNFILFSGDVSTWKEHLVQ 338 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P +Q + + + T GL + + A+E+ + + +I Sbjct: 339 ATPE--NLQEARTFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RVPERSTSIVIM 392 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 LTDG+ + ++ A +Y +G FL+N A + ++ Sbjct: 393 LTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|47227632|emb|CAG09629.1| unnamed protein product [Tetraodon nigroviridis] Length = 457 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 75/203 (36%), Gaps = 30/203 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ VLD S S+ +G ++ + D V+ +R + FS++ Sbjct: 47 GAYDLYFVLDKSGSVAGDWGEIYS-------FVKNLTDRF-----VSPRMRVSFIVFSAQ 94 Query: 227 IVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL I+E + +L T G++ A +I Sbjct: 95 AKVLLPLTGDSYKIKEGLRKLYDVKPAGETFMHVGIKEASVQI----------RAQPSPT 144 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 I+ LTDG+ D ++ NEA++ GA VY +G++ Q + D+ + Sbjct: 145 SSIILALTDGKLEVYVHDL--TVKEANEARKYGARVYCVGIKDFDEQQLANIADTKDQVF 202 Query: 344 SVQNSRKLHDAFLRIGKEMVKQR 366 V++ A I ++K+ Sbjct: 203 PVKDG---FHALKGIVNSILKRS 222 >gi|332519332|ref|ZP_08395799.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045180|gb|EGI81373.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 345 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 69/201 (34%), Gaps = 21/201 (10%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 + + + + + + S K+ + G+D++ +DVS SM ++ Sbjct: 54 KPILKILVLCLAFASLAIALVNPKVGS-KLETIKREGVDIVFAIDVSKSMLAEDIAP-NR 111 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + + + ++++ + S R G++ ++ K P+ + + + Sbjct: 112 LEKSKQLVTQIINNLAS-------DRVGIIAYAGKAFPQLPITTDYASAKMFLQNM---- 160 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T A N+ A E + + ++ ++DGE+ S N Sbjct: 161 NTDMLSSQGTAINE---AIELAKTYYDDDQQTNRVLVIISDGEDHSEAAANVAEEAS--- 214 Query: 312 AKRRGAIVYAIGVQAEAADQF 332 G ++ IGV Sbjct: 215 --NEGIRIFTIGVGDAKGGPI 233 >gi|198429401|ref|XP_002121222.1| PREDICTED: similar to integrin alpha 9 [Ciona intestinalis] Length = 1242 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 46/323 (14%), Positives = 98/323 (30%), Gaps = 42/323 (13%) Query: 53 DHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST 112 D S+++ ++ ++ +N + L + F D S Sbjct: 43 DLSVIHFGASVVVKKTASNSQAFVGAPLDERKNG--------SLYKCTF--DGRTSGSSL 92 Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 + D+ +S + I+ + + + + D++ Sbjct: 93 CIRETAFDKTGAMGISISMNTDDENIYVSVAYAQRLLEGDINTLYLSIACPTEGLTADII 152 Query: 173 MVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS--SKIVQ 229 V+D S S++ D + ++ + S R +D V G++ FS + I Sbjct: 153 FVVDESGSVDVDEYRDSLNWMKQVISSFRSYID--------KGDVHVGVIGFSRLNNIDT 204 Query: 230 TFPL---AWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + AW + +IN ++ T + + D + Sbjct: 205 KVRIRLQAWSYTSLTSQINNMVNVRSLNGLTYIGYAINLTITEFDDHGR---------ES 255 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDR 341 K +I LTDG + P + A+ G + ++GV + Q L + R Sbjct: 256 VPKEMILLTDGAATKP----ENVKPAAERARANGIVTVSVGVGSRVDETQLLTIAGNASR 311 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 + N L + + Sbjct: 312 VFKATNYDNLDSVVEGVKSTIQD 334 >gi|124002443|ref|ZP_01687296.1| OmpA family protein [Microscilla marina ATCC 23134] gi|123992272|gb|EAY31640.1| OmpA family protein [Microscilla marina ATCC 23134] Length = 756 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 72/200 (36%), Gaps = 33/200 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ +V+D S SM + + KL ATR + P+ V V F + Sbjct: 401 KPYDISLVMDYSGSMAGN----IKKLEEATRKFI-----LTKHPNDKISV----VKFDER 447 Query: 227 IVQTFPLA-WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + L G + K + L +G +T G + + +A+ Sbjct: 448 LETELRLTAQGSKTDCVKFDGLTRYGGSTALYAGADEGLESLKNAQNN------------ 495 Query: 285 KYIIFLTDGENSSP----NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--S 338 K ++ TDGE +S + +A+ +G V+ I ++ L + + Sbjct: 496 KVMLLFTDGEENSSLQYFGKRAFRASEVVKKAREKGIRVFTIAYGTGVNNKTLNALSMLT 555 Query: 339 PDRFYSVQNSRKLHDAFLRI 358 + Y ++N ++ + + Sbjct: 556 DGKTYFIENPDEIKQVYEEL 575 >gi|110598613|ref|ZP_01386880.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339782|gb|EAT58290.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 343 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 45/193 (23%), Positives = 70/193 (36%), Gaps = 23/193 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 IF + P L I K G DM+ ++DVS SM D+L A Sbjct: 60 MIFLGIASLLAALTGPRLSGGGRPILRK---GADMVFLIDVSRSMRAADVQP-DRLTQAK 115 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 I I R ++ F++K + PL + + + LI Sbjct: 116 F-------EISRISRAVTGGRRAIILFAAKPLVQCPLTADMDAFEALLGMASPDLIEAQG 168 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T LE A+N + + E A + K ++ L+DGE+ + ++ N Sbjct: 169 TDFRSALELAHNVLEPSSESRLASAAKGE---KIMVLLSDGEDHAGDL-----PAAANRI 220 Query: 313 KRRGAIVYAIGVQ 325 K V+AIGV Sbjct: 221 KNGRIHVFAIGVG 233 >gi|308063814|gb|ADO05701.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Sat464] Length = 217 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 72/207 (34%), Gaps = 32/207 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-I 227 + + +++D S SMN+ G ++ I++M++ +K + ++TF Sbjct: 15 IPVFLLVDTSGSMNESLGN-CTRIEALNLCIQKMIETLKQEAKKELFSKMAIITFGENGA 73 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 V P V++I L T A N I D YK Y Sbjct: 74 VLHTPFD-DVKNIN--FKPLSASGGTPLDQAFRLAKNLIED------KDTFPTKFYKPYS 124 Query: 288 IFLTDGENSSPNI---------DNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA 337 I ++DGE + D + + C ++I + EA Q K+ Sbjct: 125 ILVSDGELNDGKWQKALSDFHHDGRSTKSVC----------WSIFIGDREANPQVNKDFG 174 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 FY + KL F + + + K Sbjct: 175 KDGVFY-ADDVEKLVGLFEIMTQTISK 200 >gi|254466920|ref|ZP_05080331.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687828|gb|EDZ48310.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 550 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 55/371 (14%), Positives = 107/371 (28%), Gaps = 113/371 (30%) Query: 5 NIRNFFYNCKGSIS-ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +R+F G ++ + L + V G+ ++ + +L Y LD ++L A Sbjct: 22 KVRSFLREEDGVLAKPMIGTFL-AMLAVGGIGVDLMRMERDRTELQYTLDRAVLAAA--- 77 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 ++ + ++ + + L + D S+ ID + Sbjct: 78 ---------DLDQSLDADAVVLDYLT---KAGLEQYYSDPDDQKGLGYKSVEATIDTDFE 125 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 Y L M ++ I SV+IS MVLD+S SMN Sbjct: 126 AYLLKFAGGDNMSLY---------ANSRAEEIIGSVEIS----------MVLDISGSMNS 166 Query: 184 -------------------------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--- 215 + + + ++L + ++ Sbjct: 167 GNRLVNLQAAAKSFVTQITSNTDVSNLSISIIPYATQVNAGEKLLSKYTKVSQEHDYSYC 226 Query: 216 -------------------VRSG---LVTFSSKIVQTF-----------PLAWGVQHIQE 242 +R+ T+S ++ P + Sbjct: 227 VNFIKDQFSKHTLNQNEDLIRTAHFDTFTYSMNMIDRPVCPTRPGSAILPFTNDAAKLHA 286 Query: 243 KINRLIFGSTTKSTPGLEYAY-----------NKIFDAKEKLEH-----IAKGHDDYKKY 286 I+ L T G+++ N + D K E+ A G D K Sbjct: 287 YIDSLTASGNTSIDIGMKWGSALLDPTAQPVVNALVDDKVISENFRGRPKAYGSGDTLKI 346 Query: 287 IIFLTDGENSS 297 II ++DG+N++ Sbjct: 347 IILMSDGQNTN 357 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 3/74 (4%) Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCA-SPDRFYSVQNSRKLHD 353 + ++ + C+ K +G IVY++G +A +A + L++CA SP F+ V+ ++ D Sbjct: 476 NYRVAKDQHTKTICDITKDQGVIVYSVGFEAPSAGIKVLEDCASSPAHFFDVEGL-EISD 534 Query: 354 AFLRIGKEMVKQRI 367 AF I + + R+ Sbjct: 535 AFSSIATSIRQLRL 548 >gi|156409365|ref|XP_001642140.1| predicted protein [Nematostella vectensis] gi|156229281|gb|EDO50077.1| predicted protein [Nematostella vectensis] Length = 156 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 62/180 (34%), Gaps = 37/180 (20%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 LD+ ++D S S+ + R I+ ++ P R GLV +S+ Sbjct: 7 KARLDLGFLVDGSGSIKAA------RFKGVKRFIQNVISRFHISP---KHTRVGLVLYSN 57 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGS-------TTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + +G K + TK+ L Y +F + ++ Sbjct: 58 NPYKI----FGFNKYTNKNAAMKATGRIPYPRRGTKTGRALAYTGRYLFRSSKRR----- 108 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + +I LTDG + + ++ G +YA+GV + L++ AS Sbjct: 109 ------RVLILLTDGRSYDRVSAPAR------KLRQAGIHIYAVGVGRNYNIKQLRSIAS 156 >gi|150005796|ref|YP_001300540.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294776175|ref|ZP_06741664.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|149934220|gb|ABR40918.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294449998|gb|EFG18509.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] Length = 340 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 63/205 (30%), Gaps = 29/205 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 R ++ F S K+ + G++ ++ LD+S SM + Sbjct: 54 RPDIKFWLTFAALTLVILMLARPQFGS-KMETVKRSGVEAVIALDISNSMLAEDVTP-SR 111 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKIN-RL 247 L + + I ++D N + GL+ F+ P+ + E IN L Sbjct: 112 LDKSKKLISRLVDTF-------NNDKVGLIVFAGDAFTQLPITSDYVSAKMFLETINPSL 164 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 I T + A + + II +TDGEN Sbjct: 165 ITTQGTDIGAAIRLAMKSFT-----------PQEGVGRAIIVITDGENHEGGAVEAAQEA 213 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF 332 +G V+ +GV + Sbjct: 214 A-----EKGMQVFVLGVGSPDGSPI 233 >gi|307941757|ref|ZP_07657112.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307775365|gb|EFO34571.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 358 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/293 (13%), Positives = 97/293 (33%), Gaps = 40/293 (13%) Query: 19 ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKND 78 I+TA + V+ + +G+ ++ S +K ++ LD + L A + + + +K Sbjct: 2 IITAFVFFVLIVAIGVGVDYSRALTLKTRVLGSLDTAALAAAVEFSKLGSEQDARKAAKK 61 Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFI 138 + ++L +G + L+I+ DD + +S + +E+P Sbjct: 62 AFDAQV---------SQLNLHG--------AKLKKLNIVTDD--ETMKVSVDAVFELPTT 102 Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + T S + ++ LD++M +D + SM A Sbjct: 103 LMQIAGFKTLE----VATRSDAVGGGQEVILDIVMCIDATGSMGATLRSVQ---RNALSF 155 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH------------IQEKINR 246 + + +K + +++R + + + ++G++ + ++ Sbjct: 156 EANLKNRLKELGRQVDIIRVRPIYYWDYDYDGWSRSYGLKKSTFLKLPDQRTQFKNFVDS 215 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 +N+ + + + TD SPN Sbjct: 216 ESAYGGGDWPEAGLECFNEGLRSSWFKTTNTRQSVFP--VVALWTDAPADSPN 266 >gi|149624862|ref|XP_001517471.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Ornithorhynchus anatinus] Length = 238 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 80/202 (39%), Gaps = 30/202 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + I +++D + GLV +SS + Q Sbjct: 17 DLVFLIDGSKSVRPE------NFELVKKFINQIVDSLDVSEQNAQ---VGLVQYSSSVRQ 67 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 FPL I+ + ++ + T A N + D + A+ +K Sbjct: 68 EFPLGRFTSKRDIKAAVKKMTYMEKGTMTG---TALNYLIDNTFAISSGARPG--AQKVG 122 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFYS 344 I TDG + D +AK G ++A+GV D+ L+ AS + ++ Sbjct: 123 IVFTDGRSQDYISD------AAKKAKDLGFKMFAVGVGNAVEDE-LREIASDPVAEHYFY 175 Query: 345 VQNSRKLHDAFLRIGKEMVKQR 366 + + ++ IGK++ K+ Sbjct: 176 TADFKTINQ----IGKKLQKKI 193 >gi|319426151|gb|ADV54225.1| Vault protein inter-alpha-trypsin domain protein [Shewanella putrefaciens 200] Length = 757 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 80/207 (38%), Gaps = 34/207 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S + ++++V+D S SM D + A ++ L+ +K+ N ++ F+ Sbjct: 372 STLPRELILVIDTSGSMAG------DSIVQAKSALLYALNGLKAEDSFN------IIEFN 419 Query: 225 SKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S++ Q P + + ++ I+RL T+ L A + + Sbjct: 420 SELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMALALNAAL-------PRGINRLSE 472 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +IF+TDG + + E++ ++ +G+ + F++ A Sbjct: 473 SSQSLRQVIFMTDGSVGNEQALFDLIRYQIGESR-----LFTVGIGSAPNSHFMQRAAEL 527 Query: 340 DR--FYSVQNSRKLHDAFLRIGKEMVK 364 R F + N ++ +I + + K Sbjct: 528 GRGTFTYIGNVDEVE---QKISQLLSK 551 >gi|297460736|ref|XP_599315.5| PREDICTED: collagen, type XXII, alpha 1 [Bos taurus] Length = 1605 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 46/206 (22%), Positives = 76/206 (36%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ VLD S S+ G + + + ++D + P+ R G+V +S + Sbjct: 38 DLVFVLDSSSSV------GKENFEKVRQWVANLVDTFEVGPER---TRVGVVRYSDRPAT 88 Query: 230 TFPL-AWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + R + G T + L + F + G +K+ Sbjct: 89 AFELGRFGSRAAVRAAARQLAYHGGHTHTGDALRFITRHSFTPR---AGGRPGDRAFKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + L A+R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLV------LPAATAARRAGIRIFAVGVG-EALREELEEIASEPTAAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FDAIDKIRGKLRRRL 217 >gi|297482250|ref|XP_002692646.1| PREDICTED: collagen, type XXII, alpha 1-like [Bos taurus] gi|296480820|gb|DAA22935.1| collagen, type XXII, alpha 1-like [Bos taurus] Length = 1605 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 46/206 (22%), Positives = 76/206 (36%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ VLD S S+ G + + + ++D + P+ R G+V +S + Sbjct: 38 DLVFVLDSSSSV------GKENFEKVRQWVANLVDTFEVGPER---TRVGVVRYSDRPAT 88 Query: 230 TFPL-AWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + R + G T + L + F + G +K+ Sbjct: 89 AFELGRFGSRAAVRAAARQLAYHGGHTHTGDALRFITRHSFTPR---AGGRPGDRAFKQV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I LTDG + L A+R G ++A+GV EA + L+ AS + Sbjct: 146 AILLTDGRSQDLV------LPAATAARRAGIRIFAVGVG-EALREELEEIASEPTAAHVF 198 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 199 HVSD-------FDAIDKIRGKLRRRL 217 >gi|242078053|ref|XP_002443795.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] gi|241940145|gb|EES13290.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] Length = 423 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 76/226 (33%), Gaps = 51/226 (22%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ VLDVS SM K+ R++ ++D + S R +V FS+ Sbjct: 124 PLDLVTVLDVSGSMAGK------KMERVKRAMGFLIDNLGSDD------RLSVVAFSTDA 171 Query: 228 VQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L G + + L +T GL+ A + + ++ Sbjct: 172 RRIIRLTRMSDDGKAAAKRAVESLAASGSTNIRGGLDVA--AMVLDGRRHKNAVAS---- 225 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYC---NEAKRRG--------------------AIVY 320 +I L+DG+++ + + A +G V+ Sbjct: 226 ---VILLSDGQDNQSMHHEYLPTSWVPKHSPAFSKGGYDVLVPPSFQRTAGGDHRCVTVH 282 Query: 321 AIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 G + + + + F ++N + DAF R IG + Sbjct: 283 TFGFGIDHDAAAMHYISEVTGSTFSFIENHAVIQDAFARCIGGLLS 328 >gi|323493925|ref|ZP_08099042.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] gi|323311866|gb|EGA65013.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] Length = 427 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 87/229 (37%), Gaps = 24/229 (10%) Query: 18 SILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLY--TATKILNQENGNNGKKQ 75 IL LLP++ I+M ++ S + A++ + + L + K EN ++ Sbjct: 3 GILFIGLLPIMVILMAFSMQMSQQMLAHARVLEAAEVASLALIASPKESEDENVKYARQL 62 Query: 76 KNDFSYRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 + + I ++ + R ++G Q+ + + + +HK + A + + Sbjct: 63 VDRYVVDNINDVDVEVYTRKCEYKDGCVQESGEVAPFSDFVVSAKAEHKSWI--AYEKVD 120 Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + F + P +D+ + D S SMN H+ G KL V Sbjct: 121 LKPEFEVAGKSVTRKYLPQP--------------VDVYFIGDFSGSMNGHWKGGKTKLDV 166 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 ++I +++ I++ + R L+ ++ V+ G ++ Sbjct: 167 VKQTIERVVEDIENF-NTEEKSRVALLGYNPLHVKQT----GTVYLNSY 210 >gi|171057436|ref|YP_001789785.1| outer membrane adhesin-like protein [Leptothrix cholodnii SP-6] gi|170774881|gb|ACB33020.1| outer membrane adhesin like proteiin [Leptothrix cholodnii SP-6] Length = 1598 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 71/196 (36%), Gaps = 17/196 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + ++++ LDVS SM+ G G +L A +I ++LD +++ DV VR LV Sbjct: 812 SATAARTNLLISLDVSGSMDTADGVAGATRLASAIEAIGQLLDRHEAMGDVA--VR--LV 867 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 TFSS + V ++ + L T L A + A + Sbjct: 868 TFSSTAQAIGEVWTDVATARQLLASLQADGGTHYDSALAAAQSAFLSAGRLAGAQNVSY- 926 Query: 282 DYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--- 337 FL+DGE + ID+ E + + + AIG+ + A Sbjct: 927 -------FLSDGEPNRGHGIDSAEQRQWESFVTSQAIDSRAIGLGDSTTQASMDGIAYDG 979 Query: 338 SPDRFYSVQNSRKLHD 353 S + Sbjct: 980 SSGHDTDALRVSDFNQ 995 >gi|116750907|ref|YP_847594.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116699971|gb|ABK19159.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 268 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 74/203 (36%), Gaps = 21/203 (10%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + + + +++D S SMN+ G A ++ + + P N GL Sbjct: 73 SDNLLARNYYVIMDSSGSMNEVRCSGNRTKSEAAKTA--LAQFARITPKDAN---MGLAV 127 Query: 223 FSSK-IVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 F + I + PL +N G+ T L Y ++ E+ G+ Sbjct: 128 FDAYGIAERVPLGLENRDKFIAAVNATAPGNGTPLHDALLLGYRRL----EETARRQAGY 183 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 +Y +++ +TDG+ N D + Y ++ +++ IG + P Sbjct: 184 GEY--HLVVITDGQAYPQNQDPTPVVAYI--LRQSPVVIHTIGFCIGTDHSLNQ----PG 235 Query: 341 R--FYSVQNSRKLHDAFLRIGKE 361 R + + N R+L + E Sbjct: 236 RTVYRAADNPRELQQGLEEVLAE 258 >gi|315052466|ref|XP_003175607.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] gi|311340922|gb|EFR00125.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] Length = 740 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/256 (15%), Positives = 86/256 (33%), Gaps = 30/256 (11%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 + D I + + + +++ + T N + I Sbjct: 2 ATSSSDDGFEIIEDHQIPLRPANAGTTAVTENLTQEPI----LTIHSIPNKDSMIVSIQP 57 Query: 158 SVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-----SIPD 211 +K + + D+++V+D+S SMN + + +LD+ K I Sbjct: 58 PLKPENDVPHVPCDIVLVIDISGSMNSSAPIPTGE-RGGEDTGLSILDLTKHAARTIIET 116 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 +N R +VTF +++ F L + + IN+L S+T G++ + Sbjct: 117 LNENDRLAVVTFCTEVKVAFELDFMNKENKSMVLRAINKLYGTSSTNLWHGIKEGLKVLT 176 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSS------PNIDNKESLFYCNEAKRRGAIVYA 321 A + + ++ LTDG + +++L + +++ Sbjct: 177 ---------ATPVRENVQSLLVLTDGAPNHMCPAQGYVPKLRQTLLDHRNSTGSLPLIHT 227 Query: 322 IGVQAEAADQFLKNCA 337 G L++ A Sbjct: 228 FGFGYYLRSPLLQSIA 243 >gi|262275460|ref|ZP_06053270.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] gi|262220705|gb|EEY72020.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] Length = 453 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 56/441 (12%), Positives = 124/441 (28%), Gaps = 100/441 (22%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 G ++ I P++F V L +E++ A++ ++ + L A I + + K Sbjct: 12 GVAVVIFVIAYPLLFGVFVLAVESTRYLQTHARIGDGVEVASLAVAANISS--DITENKT 69 Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 ++ + D L + + + + + + ++Y+ Sbjct: 70 LAKNYVDGFV-----PDGTISLADINIERKSCDEIYGSQCGVAGVYDEEGL---VFTQYK 121 Query: 135 MPFIFCTFPWCANSSHAP-----LLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHFGP 187 + W AP + + + +D+ V D S SM + Sbjct: 122 VTLSSEFESWYPEDDFAPGFEEIVELGGTAVARKYQGFTIDVAFVADFSGSMQQTWNREI 181 Query: 188 GMDKLGVATRSIREMLDIIKSIPD----------------VNNVVRSGLVTFSSKIVQ-- 229 + I L+ + N +G +S+ + Sbjct: 182 KYKGVVNVISDITRKLETFNDHTEQELNGKKVANKVAFIGYNFYPHNGSTFYSNVDYKAN 241 Query: 230 TFPLAWGVQH----------------------IQEKINRLIFGST----------TKSTP 257 L++ Q I +N I + T + Sbjct: 242 YSRLSYKWQENIPEINYRRTARDPINNKRTPIIGRYVNNTIPLYSDDSYFYTLDLTDNFT 301 Query: 258 GLEYAYNKIFDAKEK--------LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY- 308 + + I ++ +K II L+DGE+S + ++ + Sbjct: 302 QFRNTISTFYPDYGTASYEGIIEAAKIVNNGENIRKLIIVLSDGEDSINENNPYDNRYPG 361 Query: 309 -----------CNEA-----------KRRGAIVYAIGVQAE-AADQFLKNCASPDRFYSV 345 C + A ++ IG + + LK CA + S Sbjct: 362 FIAPLIYQSGLCQNIINDLESKEINGRNVEAKIFVIGFGYDLEKNPGLKICAGEENVQSA 421 Query: 346 QNSRKLHD-AFLRIGKEMVKQ 365 + +++ D I +E+ Sbjct: 422 DSYQEIFDTVLQLISEEVGHL 442 >gi|113867117|ref|YP_725606.1| von Willebrand factor type A (vWA) domain-containing protein [Ralstonia eutropha H16] gi|113525893|emb|CAJ92238.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ralstonia eutropha H16] Length = 566 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/237 (17%), Positives = 88/237 (37%), Gaps = 34/237 (14%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + I ++S + +++ ++DVS SM DKL + S++ + Sbjct: 165 PWHPANVLLRIGIKGK-DMASAALPPANLVFLVDVSGSM-----NSPDKLLLLKSSLKLL 218 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLE 260 ++ ++ R LVT++S P G I I++L+ G +T G+ Sbjct: 219 VNKLRPQD------RITLVTYASGTRVALPPTPGSDKTAISAAIDQLVAGGSTAGASGIA 272 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + G + ++ TDG+ + D ++ E ++ G + Sbjct: 273 LAYQA------AQQSFIAGGINR---VLLATDGDFNVGVTDFRQLKSMVEEKRKSGVSLS 323 Query: 321 AIGVQAEA-ADQFLKNC--ASPDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 +G +Q ++ A + + N ++ I +++ Q Sbjct: 324 TLGFGTGNYNEQLMEQLADAGDGAYSYIDNLMEGNKVLVSEISSTLATIARDVKIQV 380 >gi|326775386|ref|ZP_08234651.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|326655719|gb|EGE40565.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 424 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 74/196 (37%), Gaps = 31/196 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------ 223 + +VLDVS SM G ++ A ++ ++LD + +V+ +R+ + Sbjct: 40 QVELVLDVSGSMRTRDIDGQSRMAAAKQAFNDVLDA--APEEVHLGIRTLGADYPGEDRK 97 Query: 224 -----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + ++ PL + + L T P L A + + Sbjct: 98 VGCKDTKQLYPVGPL--DRTEAKAAVATLAPTGFTPIGPALLGAADDL------------ 143 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-- 336 + + I+ +TDGE++ +D E A+ +V +G+ A + C Sbjct: 144 EGGEGSRRIVLITDGEDTCGPLDPCEVAREI-AARGTHLVVDTLGLVPNAKIRQQLTCIA 202 Query: 337 -ASPDRFYSVQNSRKL 351 A+ + +VQ+ +L Sbjct: 203 EATGGTYTAVQHKEEL 218 >gi|182434868|ref|YP_001822587.1| hypothetical protein SGR_1075 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178463384|dbj|BAG17904.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 424 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 74/196 (37%), Gaps = 31/196 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------ 223 + +VLDVS SM G ++ A ++ ++LD + +V+ +R+ + Sbjct: 40 QVELVLDVSGSMRTRDIDGQSRMAAAKQAFNDVLDA--APEEVHLGIRTLGADYPGEDRK 97 Query: 224 -----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + ++ PL + + L T P L A + + Sbjct: 98 VGCKDTKQLYPVGPL--DRTEAKAAVATLAPTGFTPIGPALLGAADDL------------ 143 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-- 336 + + I+ +TDGE++ +D E A+ +V +G+ A + C Sbjct: 144 EGGEGSRRIVLITDGEDTCGPLDPCEVAREI-AARGTHLVVDTLGLVPNAKIRQQLTCIA 202 Query: 337 -ASPDRFYSVQNSRKL 351 A+ + +VQ+ +L Sbjct: 203 EATGGTYTAVQHKEEL 218 >gi|16126967|ref|NP_421531.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|221235756|ref|YP_002518193.1| hypothetical protein CCNA_02820 [Caulobacter crescentus NA1000] gi|13424325|gb|AAK24699.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|220964929|gb|ACL96285.1| conserved hypothetical protein [Caulobacter crescentus NA1000] Length = 629 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 65/197 (32%), Gaps = 44/197 (22%) Query: 214 NVVRSGLVTFSSKI---VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA- 269 N + ++S + PL+ ++ +IN G +T GL + + + Sbjct: 432 NYPSTSADSYSPNPCPSAKITPLSSDKTALKAQINNYSVGGSTAGQIGLAWGWYMVAPNF 491 Query: 270 -----KEKLEHIAKGHDDYKKYIIFLTDGENSSP-------------------------- 298 A D K +I +TDG ++P Sbjct: 492 GYIWPSASQRPAAYKSKDLMKVVIMMTDGAFNTPYCNGVIAANAGIGSGSDEDHINCNAT 551 Query: 299 NIDN-KESLFYCNEAKR--RGAIVYAIGVQAEA---ADQFLKNCASPDR--FYSVQNSRK 350 N D ++ C K +Y +G + A FL +CAS F+ S + Sbjct: 552 NGDPFAQARALCTVIKNSANDITLYTVGFAVGSDYTAKTFLTDCASDSSKAFFPATGS-E 610 Query: 351 LHDAFLRIGKEMVKQRI 367 L +F I +E+ RI Sbjct: 611 LKASFTAIAREISSLRI 627 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 36/266 (13%), Positives = 89/266 (33%), Gaps = 37/266 (13%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + +G+I+I A+L + I++ V++ S + ++ LD + L A Sbjct: 20 RRLRRDDRGAIAIQFALLALPLSILLFGVLDVSRLSLQRRQMQDALDAATLMAARSAATA 79 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + ++ S++ S+ ++ Sbjct: 80 SADLDTTGDAAFLAEIA------------------GMNLGLTASSSTFSVGTGNR----V 117 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + + P + + TS V SSK+ L++ +VLD++ SM+ Sbjct: 118 IGTATATLKPI----IANLWQAGDFTVTATSEVVRSSKN---LEVALVLDITGSMSG--- 167 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKI 244 ++ + +++DI+ + LV +++ + +A G ++ Sbjct: 168 ---TRIADLKVAASDLVDIVIRDTQTPFYSKVALVPYAAGVNVDTYADMARGPIPVRNIS 224 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAK 270 N + T + ++ Sbjct: 225 NVAWLATGTSIRGVTKALPALLWSDN 250 >gi|254882022|ref|ZP_05254732.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|319640970|ref|ZP_07995678.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] gi|254834815|gb|EET15124.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|317387415|gb|EFV68286.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] Length = 340 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 63/205 (30%), Gaps = 29/205 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 R ++ F S K+ + G++ ++ LD+S SM + Sbjct: 54 RPDIKFWLTFAALTLVILMLARPQFGS-KMETVKRSGVEAVIALDISNSMLAEDVTP-SR 111 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKIN-RL 247 L + + I ++D N + GL+ F+ P+ + E IN L Sbjct: 112 LDKSKKLISRLVDTF-------NNDKVGLIVFAGDAFTQLPITSDYVSAKMFLETINPSL 164 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 I T + A + + II +TDGEN Sbjct: 165 ITTQGTDIGAAIRLAMKSFT-----------PQEGVGRAIIVITDGENHEGGAVEAAQEA 213 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF 332 +G V+ +GV + Sbjct: 214 A-----EKGMQVFVLGVGSPDGSPI 233 >gi|194391096|dbj|BAG60666.1| unnamed protein product [Homo sapiens] Length = 698 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 98/283 (34%), Gaps = 24/283 (8%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + + + + + ++ + F + F ++ ++ + + + Sbjct: 188 IFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSLLN 245 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 D+ ++ E P AP + K ++ V+D+S SM Sbjct: 246 GDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------NVAFVIDISGSMA 296 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQ 241 KL ++ +L+ +K +N + SG V+ + +VQ P +Q + Sbjct: 297 G------RKLEQTKEALLRILEDMKEEDYLNFTLFSGDVSTWKEHLVQATPE--NLQEAR 348 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 349 TFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RIPERSTSIVIMLTDGDANVGESR 404 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ A +Y +G FL+N A + ++ Sbjct: 405 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|324231484|emb|CBZ42120.1| C. elegans protein C29A12.6b, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|324231581|emb|CBZ42125.1| C. elegans protein C29A12.6b, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 980 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 81/224 (36%), Gaps = 31/224 (13%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA-PLLITSSVKISSKSD 166 ++ + +D + ++ + PF P ++ P+ + S + Sbjct: 741 LKPEVPVIKPMDFMVRSRSVQFAMTEKPPFTTVMNPMKFFTTTRTPITKPKPLIPYSCTA 800 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ ++DVS D +D A S+ P + VR GL+++S Sbjct: 801 ---DVFFLVDVSQGTGDKSQQYLDIAASAISSL----------PISQDTVRVGLISYSGP 847 Query: 227 IVQTFPLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + +EK+ +F G TT++ + YA E H A+ + Sbjct: 848 GRTHVRVFLDKHNEKEKLIEEMFLMERHGGTTRTADAIRYATKIF----EGKAHPARKN- 902 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ TDG + + A+ +G + A+ V+ Sbjct: 903 -VKKVLVVFTDGYSQDNPKEASRM------ARAKGIQLIAVAVK 939 >gi|317122731|ref|YP_004102734.1| von Willebrand factor A [Thermaerobacter marianensis DSM 12885] gi|315592711|gb|ADU52007.1| von Willebrand factor type A [Thermaerobacter marianensis DSM 12885] Length = 1122 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/225 (18%), Positives = 78/225 (34%), Gaps = 40/225 (17%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKL 192 +P F + L + + S+ ++ + + +V+D S SM KL Sbjct: 421 GLPDTFGPGGYTGTPVERALPVD--TDLRSRKNLPTVALALVIDRSGSMEGV------KL 472 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-WGVQHIQEKINRLIFGS 251 +A + R + ++ R G++ F ++ T P+ E + G Sbjct: 473 EMAVEAARRVAQLLTPAD------RLGVILFDTQAYVTRPIEPVDSAGQVEAAFPSVAGG 526 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GLE A + + D + + H +I LTDG E Sbjct: 527 GTSLGVGLEAALHLMKDVRADVRH-----------VIALTDG--------VSEPFDVAGT 567 Query: 312 AK---RRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKL 351 A+ +G V A+ + A+A L A + Y + ++ Sbjct: 568 ARAFHEQGITVSAVAIGADADTTTLGWLAQEGGGQLYVAADPGQI 612 >gi|125532271|gb|EAY78836.1| hypothetical protein OsI_33941 [Oryza sativa Indica Group] Length = 645 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 74/218 (33%), Gaps = 43/218 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ VLDVS SM + KL + +++ ++D + R +++FSS Sbjct: 173 PLDLVTVLDVSGSMVGN------KLALLKQAMGFVIDNLGPAD------RLCVISFSSGA 220 Query: 228 VQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L+ G H + + L T L A + D + + Sbjct: 221 SRLMRLSRMTDAGKAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVES------ 274 Query: 284 KKYIIFLTDGENSS-----------PNID----NKESLFYCNEAKRRGAIVYAIGVQAEA 328 +I L+DG+++ N D R V+ G + Sbjct: 275 ---VILLSDGQDTYTVPPRGGYDRDANYDALVPPSLVRSDAGGGGGRAPPVHTFGFGKDH 331 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + A + F ++N + D F + IG + Sbjct: 332 DAAAMHTIAEVTGGTFSFIENEAAIQDGFAQCIGGLLS 369 >gi|114587336|ref|XP_001172570.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 1 [Pan troglodytes] Length = 890 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 99/283 (34%), Gaps = 24/283 (8%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + + + + + ++ + F + F ++ ++ + + + Sbjct: 188 IFEPQGISMLDAEASFITNDLLGSTLTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSLLN 245 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 D+ ++ E P AP + K ++ V+D+S SM Sbjct: 246 GDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------NVAFVIDISGSMA 296 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQ 241 KL ++ +L+ +K +N ++ SG V+ + +VQ P +Q + Sbjct: 297 G------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQATPE--NLQEAR 348 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 349 TFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RIPERSTSIVIMLTDGDANVGESR 404 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ A +Y +G FL+N A + ++ Sbjct: 405 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|289547502|ref|NP_001166098.1| integrin, alpha 11a [Danio rerio] Length = 1190 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 80/214 (37%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S + + +L P ++ G++ + K+V Sbjct: 167 MDIVIVLDGSNS--------IYPWNEVQDFLINILRKFYVGPGQ---IQVGVLQYGEKVV 215 Query: 229 QTFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L + +++ + G T + G+ A ++ F + Sbjct: 216 SEFQL-NDFRSVEDVVKAARKIGQRGGEETNTALGINVARSEAFKQGGRRG--------A 266 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL---K 334 KK +I +TDGE + D+ + E+++ G YAI V + FL K Sbjct: 267 KKVMIVITDGE----SHDSADLQQVIEESEKDGITRYAIAVLGYYNRRGINPEAFLNEIK 322 Query: 335 NCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 AS F++V + L D +G+ + Sbjct: 323 YIASDPDDKHFFNVTDEAALKDIVDALGERIFSL 356 >gi|55793317|gb|AAV65699.1| integrin alpha 11 subunit [Danio rerio] Length = 1168 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 80/214 (37%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S + + +L P ++ G++ + K+V Sbjct: 145 MDIVIVLDGSNS--------IYPWNEVQDFLINILRKFYVGPGQ---IQVGVLQYGEKVV 193 Query: 229 QTFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L + +++ + G T + G+ A ++ F + Sbjct: 194 SEFQL-NDFRSVEDVVKAARKIGQRGGEETNTALGINVARSEAFKQGGRRG--------A 244 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL---K 334 KK +I +TDGE + D+ + E+++ G YAI V + FL K Sbjct: 245 KKVMIVITDGE----SHDSADLQQVIEESEKDGITRYAIAVLGYYNRRGINPEAFLNEIK 300 Query: 335 NCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 AS F++V + L D +G+ + Sbjct: 301 YIASDPDDKHFFNVTDEAALKDIVDALGERIFSL 334 >gi|188527810|ref|YP_001910497.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Shi470] gi|188144050|gb|ACD48467.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Shi470] Length = 217 Score = 60.2 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 72/197 (36%), Gaps = 12/197 (6%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-I 227 + + ++LD S SMN+ G ++ I++M++ +K + ++TF Sbjct: 15 IPVFLLLDTSGSMNESLGN-CTRIEALNLCIQKMIETLKQEAKKELFSKMAIITFGENGA 73 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 V P V++I L T A + I D YK Y Sbjct: 74 VLHTPFD-DVKNIN--FKPLSASGGTPLDQAFRLAKDLIED------KDTFPTKFYKLYS 124 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I ++DGE + S F+ + + ++ ++I + + + D + + Sbjct: 125 ILVSDGEPNDDKWQKALSNFHHD-GRSAKSVCWSIFIGDRNTNPQVNKDFGKDGVFYADD 183 Query: 348 SRKLHDAFLRIGKEMVK 364 KL F + + + K Sbjct: 184 VEKLVGLFEIMTQTISK 200 >gi|52138687|ref|NP_001004392.1| collagen alpha-1(XX) chain [Gallus gallus] gi|14280020|gb|AAK58847.1| collagen type XX alpha 1 precursor [Gallus gallus] Length = 1472 Score = 60.2 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 68/184 (36%), Gaps = 24/184 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 ++ + D+++++D S S+ + + ++ D +R Sbjct: 230 SQLQCDTSAMTDIVLLVDGSWSIGRS------NFKLIKEFLSALISPFNIAQDK---IRV 280 Query: 219 GLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEH 275 GL +SS + L+ + E + L + G T + L + + L+ Sbjct: 281 GLSQYSSDPRTEWDLSAYATRDQVLEAVRNLRYKGGNTFTGLALTHVLE------QNLKP 334 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 A + +K +I LTDG++ ++ K G ++AIGV+ + + Sbjct: 335 DAGARLEAEKLVILLTDGKSQD------DANLAAQTLKNMGIEIFAIGVKNADEAELKQV 388 Query: 336 CASP 339 + P Sbjct: 389 ASEP 392 >gi|75070765|sp|Q5RB37|ITIH3_PONAB RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|55728562|emb|CAH91023.1| hypothetical protein [Pongo abelii] Length = 876 Score = 60.2 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 69/175 (39%), Gaps = 13/175 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQ 229 + V+D+S SM KL ++ +L+ +K +N ++ SG V+ + +VQ Sbjct: 282 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKKEDYLNFILFSGDVSTWKEHLVQ 335 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P +Q + + + T GL + + A+E+ + + +I Sbjct: 336 ATPE--NLQEARTFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RVPERSTSIVIM 389 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 LTDG+ + ++ A +Y +G FL+N A + ++ Sbjct: 390 LTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 444 >gi|48734898|gb|AAH71224.1| Matrilin 3 [Mus musculus] Length = 481 Score = 60.2 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 71/223 (31%), Gaps = 26/223 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 A ++ + LD++ ++D S S+ + + + Sbjct: 51 HLSALATSTRAPYSGGRGAGVCKSRPLDLVFIIDSSRSVRPL------EFTKVKTFVSRI 104 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGL 259 +D + R +V ++S + F L Q +++ + R+ + T S + Sbjct: 105 IDTLGI---GATDTRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPLSTGTMSGLAI 161 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + A + F + K I +TDG + A+ G + Sbjct: 162 QTAMEEAFT---VEAGARGPMSNIPKVAIIVTDGRPQD------QVNEVAARARASGIEL 212 Query: 320 YAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 YA+GV + + P + + V+ KL F Sbjct: 213 YAVGVDRADMESLKMMASKPLEEHVFYVETYGVIEKLSARFQE 255 >gi|57114202|ref|NP_001009169.1| complement factor B precursor [Pan troglodytes] gi|38502961|sp|Q864W0|CFAB_PANTR RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690185|gb|AAM10004.1| complement factor B precursor [Pan troglodytes] Length = 764 Score = 60.2 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 81/223 (36%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATH 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + P + + +++N + S T + L+ Y+ + + Sbjct: 318 PKIWVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDIP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|53804868|ref|YP_113288.1| MxaC protein [Methylococcus capsulatus str. Bath] gi|53758629|gb|AAU92920.1| MxaC protein [Methylococcus capsulatus str. Bath] Length = 327 Score = 60.2 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 76/223 (34%), Gaps = 41/223 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGM-DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G M+++LD S SM+D F + + E L N++V G+ F Sbjct: 79 TGYGAHMVLLLDRSRSMDDSFAGRTPTGGEESKSAAAERLLSGFVSSGRNDLV--GVAAF 136 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ + PL + ++ + T + GL A + D Sbjct: 137 STSPLFVLPLTDNKAAVLAAVHAMKLPGLAQTHVSKGLAMALSYFGDD----------ST 186 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN------ 335 + ++ ++DG + +D L K +G +Y I ++ + + Sbjct: 187 AGSRIVLLVSDG---AAEVDPDSELKLRRWFKEKGVRLYWIFLRTAGSHGIFETPDNPEE 243 Query: 336 ---CASPDRF--------------YSVQNSRKLHDAFLRIGKE 361 A P+R+ Y +++ L A + +E Sbjct: 244 DNAQARPERYLHLFFNSLGIPYRAYEAEDADALKRAIADVDRE 286 >gi|82913466|ref|XP_728655.1| methanol oxidation protein [Plasmodium yoelii yoelii str. 17XNL] gi|23485116|gb|EAA20220.1| methanol oxidation protein [Plasmodium yoelii yoelii] Length = 720 Score = 60.2 bits (144), Expect = 4e-07, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 76/223 (34%), Gaps = 41/223 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGM-DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G M+++LD S SM+D F + + E L N++V G+ F Sbjct: 154 TGYGAHMVLLLDRSRSMDDSFAGRTPTGGEESKSAAAERLLSGFVSSGRNDLV--GVAAF 211 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ + PL + ++ + T + GL A + D Sbjct: 212 STSPLFVLPLTDNKAAVLAAVHAMKLPGLAQTHVSKGLAMALSYFGDD----------ST 261 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN------ 335 + ++ ++DG + +D L K +G +Y I ++ + + Sbjct: 262 AGSRIVLLVSDG---AAEVDPDSELKLRRWFKEKGVRLYWIFLRTAGSHGIFETPDNPEE 318 Query: 336 ---CASPDRF--------------YSVQNSRKLHDAFLRIGKE 361 A P+R+ Y +++ L A + +E Sbjct: 319 DNAQARPERYLHLFFNSLGIPYRAYEAEDADALKRAIADVDRE 361 >gi|311742726|ref|ZP_07716535.1| possible von Willebrand factor, type A [Aeromicrobium marinum DSM 15272] gi|311314354|gb|EFQ84262.1| possible von Willebrand factor, type A [Aeromicrobium marinum DSM 15272] Length = 435 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 72/221 (32%), Gaps = 33/221 (14%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + + + + + +++VLD S SM+ +L A R++R++++ + Sbjct: 26 YVLVDVEAPALPTREDALPTRLVVVLDRSGSMSGA------RLDHAKRALRQVVEALSPS 79 Query: 210 PDVNNVVRSGLVTFSSKIVQT--FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 GLVTF + + ++ + G +T GL + Sbjct: 80 DSF------GLVTFDQHVEVAVNAGPVTDPAAVLRAVDAVRPGGSTDLAGGLIEGLRQAT 133 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 ++ ++DG + +D + RG +G+ Sbjct: 134 LLDGDAR------------VLLISDGHANQGVVDPDALQTFTARHLDRGVTTSTLGMGLG 181 Query: 328 AADQFLKNCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + L + + ++ + + + A I E Sbjct: 182 YDETLLGAVSRGGTGEQHF----AEEADTAVGAITAECGDL 218 >gi|222612967|gb|EEE51099.1| hypothetical protein OsJ_31818 [Oryza sativa Japonica Group] Length = 600 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 74/222 (33%), Gaps = 43/222 (19%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 LD++ VLDVS SM + KL + +++ ++D + R +++F Sbjct: 153 AQRAPLDLVTVLDVSGSMVGN------KLALLKQAMGFVIDNLGPGD------RLCVISF 200 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 SS + L+ G H + + L T L A + D + + Sbjct: 201 SSGASRLMRLSRMTDAGKAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVES-- 258 Query: 280 HDDYKKYIIFLTDGENSS-----------PNID----NKESLFYCNEAKRRGAIVYAIGV 324 +I L+DG+++ N D R V+ G Sbjct: 259 -------VILLSDGQDTYTVPPRGGYDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGF 311 Query: 325 QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + + A + F ++N + D F + IG + Sbjct: 312 GKDHDAAAMHTIAEVTGGTFSFIENEAAIQDGFAQCIGGLLS 353 >gi|156324784|ref|XP_001618484.1| hypothetical protein NEMVEDRAFT_v1g2784 [Nematostella vectensis] gi|156199073|gb|EDO26384.1| predicted protein [Nematostella vectensis] Length = 410 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 44/207 (21%), Positives = 77/207 (37%), Gaps = 28/207 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S + +D+ VLD S S+N + M++I+KS + GLV Sbjct: 225 TENSRLAVDLAFVLDGSTSIN---NADPGNFQLLKNF---MINIVKSFKISSERTHVGLV 278 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +S F I + IN T++ L+ A +++F A + Sbjct: 279 LYSFFTQLMFNFDKYSDSASIVKAINTTDYPKGGTRTGEALKMAKSQLFGASMR------ 332 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K +I LTDG +S ++L K G +++A+GV + L AS Sbjct: 333 ---SVPKVLIVLTDGRSSDKVEAPSKAL------KDEGVVIFAVGVGDQIDPSELNVMAS 383 Query: 339 ---PDRFYSVQNSRKLHDAFLRIGKEM 362 D + ++L I ++ Sbjct: 384 DSKSDHVFKA-KFKELDRLVDLIKRKA 409 >gi|148680074|gb|EDL12021.1| mCG3350, isoform CRA_a [Mus musculus] Length = 513 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 39/168 (23%), Positives = 60/168 (35%), Gaps = 34/168 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF 332 I+ LTDGE++ + C EA R GAI++ I + AA + Sbjct: 410 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAAREL 448 >gi|148680075|gb|EDL12022.1| mCG3350, isoform CRA_b [Mus musculus] Length = 527 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 39/168 (23%), Positives = 60/168 (35%), Gaps = 34/168 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 323 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 372 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 373 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 423 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF 332 I+ LTDGE++ + C EA R GAI++ I + AA + Sbjct: 424 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAAREL 462 >gi|16358975|gb|AAH10260.1| Clca1 protein [Mus musculus] Length = 513 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 39/168 (23%), Positives = 60/168 (35%), Gaps = 34/168 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + Q I + T GL+ + I + + Sbjct: 359 NYLIKITSSSDYQKITANL-PQQASGGTSICHGLQAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF 332 I+ LTDGE++ + C EA R GAI++ I + AA + Sbjct: 410 -IVLLTDGEDN--------GIRSCFEAVSRSGAIIHTIALGPSAAREL 448 >gi|163758555|ref|ZP_02165642.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283845|gb|EDQ34129.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 549 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 72/202 (35%), Gaps = 42/202 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---- 226 +M+VLD S SM G K+ +A + +++ + V GL + + Sbjct: 16 VMIVLDGSNSM-WGQVDGEAKITIAKEVMTDLITNWD------DSVDLGLTVYGHRRKGD 68 Query: 227 ----IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 V P Q + +K+ + T + L A + K Sbjct: 69 CADIEVVAMPGKVDRQALIDKVQSITPRGKTPISKTLSLAALSVGFFSGKSS-------- 120 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-----VYAIGVQA-EAADQFLKNC 336 ++ ++DG + N D C +AK G I V+ IG E + L+ Sbjct: 121 ----VVLVSDGLETC-NADP------CAQAKSLGIINPGFDVHVIGFDVTEEEFKSLQCI 169 Query: 337 AS--PDRFYSVQNSRKLHDAFL 356 A+ +F+ N+ +L DA Sbjct: 170 ATETGGKFFRANNAEELKDALR 191 >gi|327262385|ref|XP_003216005.1| PREDICTED: vitrin-like [Anolis carolinensis] Length = 748 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 49/319 (15%), Positives = 105/319 (32%), Gaps = 49/319 (15%) Query: 53 DHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST 112 + ++ + ++ + + NI+ E + N+ + Sbjct: 460 NRGAAPNVAIVMVDGWPTDKVEEASRLARESGINIFFVTI-----EGPDENEKQNVVEAN 514 Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 + + + Y+++ S + + P + L S ++S D+ Sbjct: 515 FVDKAVCRTNGYYSINVPSWFSL--HKVVQPLVKRICDSDHLACSKTCLNSA-----DIG 567 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 V+D S S+ G + + I K + R G V ++ + Sbjct: 568 FVIDGSSSV------GTGNFRTVLQFVAN---ISKEFEISDTDTRIGAVQYTYEQR---- 614 Query: 233 LAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L +G + K + L T + + YA+ ++F K + +K Sbjct: 615 LEFGFEKQSTKQDVLNAIKRINYWSGGTSTGAAINYAFEQLF---------IKSKPNKRK 665 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 +I +TDG + D+ + A + G I Y++G+ A D+ PD + Sbjct: 666 IMILITDGR----SYDDVQG--PATAAHQNGVITYSVGIAWAAQDELEAIATDPDKEHSF 719 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V L+ RI + + Sbjct: 720 FVDEFDSLYRFVPRIIQNI 738 >gi|209524446|ref|ZP_03272995.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209495237|gb|EDZ95543.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 541 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 71/207 (34%), Gaps = 29/207 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LDVS SMN ++L + + ++D + V V +G +V Sbjct: 180 LVFLLDVSGSMNQP-----NRLPLLKEGFKLLVDQLTEQDTVAIAVYAGAAG----VVLP 230 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 Q I I+ L +T G++ AY + ++ +I Sbjct: 231 PTPGNEKQKIIAAIDGLQAQGSTAGGEGIKLAYELATRMLSEGKNNR---------VILA 281 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQN 347 TDG+ + + E + + RG + +G D ++ + + + N Sbjct: 282 TDGDFNVGVSSDAELVRLIESYRDRGIYLTVLGFGMGNYKDSKMEKLSNHGNGNYAYIDN 341 Query: 348 --------SRKLHDAFLRIGKEMVKQR 366 S +L I K++ Q Sbjct: 342 LMEAKKVMSTELTGTLFTIAKDVKIQV 368 >gi|146292839|ref|YP_001183263.1| cell wall anchor domain-containing protein [Shewanella putrefaciens CN-32] gi|145564529|gb|ABP75464.1| LPXTG-motif cell wall anchor domain [Shewanella putrefaciens CN-32] Length = 757 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 80/206 (38%), Gaps = 31/206 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S + ++++V+D S SM D + A ++ L+ +K+ N ++ F+ Sbjct: 372 STLPRELILVIDTSGSMAG------DSIVQAKSALLYALNGLKAEDSFN------IIEFN 419 Query: 225 SKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S++ Q P + + ++ I+RL T+ + L A + + Sbjct: 420 SELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMSLALNAAL-------PRGINRLSE 472 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +IF+TDG + + E++ ++ +G+ + F++ A Sbjct: 473 SSQSLRQVIFMTDGSVGNEQALFDLIRYQIGESR-----LFTVGIGSAPNSHFMQRAAEL 527 Query: 340 DR--FYSVQNSRKLHDAFLRIGKEMV 363 R F + N ++ ++ ++ Sbjct: 528 GRGTFTYIGNVDEVEQKISKLLSKIQ 553 >gi|156347845|ref|XP_001621774.1| hypothetical protein NEMVEDRAFT_v1g221583 [Nematostella vectensis] gi|156208029|gb|EDO29674.1| predicted protein [Nematostella vectensis] Length = 357 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 26/203 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + ++++ ++D S S+ND + +++ + V F++ Sbjct: 143 LKMNLVFLIDNSGSIND------TEFDNFKEFAKKLAESFTISA---TYTHVAAVYFNTL 193 Query: 227 IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + + I+ I+ L G T L Y + +F + K Sbjct: 194 ANFGFNLKYDINVIKTAIDNLPNIGGGTHIGKALTYTLDNVF--------KVAPRQNVKN 245 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASP---DR 341 ++ LTDG++ ++ G V+A+GV A ++ L AS D Sbjct: 246 VLVVLTDGKSHDSVTLPAAAVRNYGP----GVEVFAVGVGAGDSFVAQLNVIASDPDEDH 301 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 + V++ ++ + E+ K Sbjct: 302 VFHVEHFSQIESTTGAVEDEICK 324 >gi|308081588|ref|NP_001183957.1| vitrin isoform 2 [Mus musculus] gi|26342052|dbj|BAC34688.1| unnamed protein product [Mus musculus] Length = 628 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 71/202 (35%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S SM G + + + K + R G V ++ + Sbjct: 445 DIGFVIDGSSSM------GTSNFRTVLQFVANL---SKEFEISDTDTRVGAVQYTYEQR- 494 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 495 ---LEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 542 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A ++G I YAIG+ A D+ P D Sbjct: 543 KRKVMIIITDGRSYD------DVRIPAMAAYQKGVITYAIGIAWAAQDELEVMATHPAKD 596 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V + L+ RI + + Sbjct: 597 HSFFVDDFDNLYKIAPRIIQNI 618 >gi|268561224|ref|XP_002646394.1| Hypothetical protein CBG15363 [Caenorhabditis briggsae] gi|187027190|emb|CAP33690.1| hypothetical protein CBG_15363 [Caenorhabditis briggsae AF16] Length = 400 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 62/194 (31%), Gaps = 18/194 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ LD++ V+D S M + S + + P R GLVT++ Sbjct: 34 GNLWLDVIAVVDNSQGMTTDGLSSVAANIATVFSSGTRIGTNATEP---RTTRVGLVTYN 90 Query: 225 SKIVQTFPLAW--GVQHIQEKI-NRLIF-GSTTKSTPGLE-YAYNKIFDAKEKLEHIAKG 279 S L + + + N L T S A N +F ++ Sbjct: 91 SVAKVNADLNTFQSINDVYNGVFNYLSAVTDATDSYLATGLQAANALFASQSFNSTRN-- 148 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK---NC 336 YKK +I S +D + +E K G + + Q LK Sbjct: 149 --HYKKVVIVYASEYKSYGELDP---VKVADEMKGSGVYIVTVAYDQGGNGQLLKDLAGI 203 Query: 337 ASPDRFYSVQNSRK 350 A+P +S + Sbjct: 204 ATPGYSFSNTDDSD 217 >gi|327270780|ref|XP_003220166.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 906 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 78/211 (36%), Gaps = 39/211 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM D +++ ++ L I + +G+V F+SK Sbjct: 304 VCLVLDVSGSMRDF-----NRIHRLKQAAELFLLQI-----IETGSWAGIVVFNSKASTK 353 Query: 231 FPLAWGVQ-HIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L +++ ++ + T G+ + E Sbjct: 354 AFLQQITSDSVRQTLSDHLPTVADGGTSICSGIREGFQVFLQKYSSTEGCE--------- 404 Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF-LKNCASPDRFYS 344 I+ LTDGE+SS + C E +R G+ ++ I + AA + + + +S Sbjct: 405 IVLLTDGEDSSVSS--------CFAEVERSGSTIHTIALGPSAAKELEMLANMTGGLTFS 456 Query: 345 VQNSRK---LHDAFLRIGK---EMVKQRILY 369 +S L DAF I ++ +Q I Sbjct: 457 ATDSLDSNGLIDAFSGISSGSGDISQQSIQL 487 >gi|308062304|gb|ADO04192.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Cuz20] Length = 217 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 73/197 (37%), Gaps = 12/197 (6%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-I 227 + + ++LD S SMN+ G ++ I++M++I+K + ++TF Sbjct: 15 IPVFLLLDTSGSMNESLGN-CTRIEALNLCIQKMIEILKQEAKKELFSKMAIITFGENGA 73 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 V P V++I L T A + I D YK Y Sbjct: 74 VLHTPFD-DVKNIN--FKPLSASGGTPLDQAFRLAKDLIED------KDTFPTKFYKLYS 124 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I ++DGE + S F+ + + ++ ++I + + + D + N Sbjct: 125 ILVSDGEPNDDKWQKALSNFHHD-GRSAKSVCWSIFIGDRNTNPQVNKDFGKDGVFYADN 183 Query: 348 SRKLHDAFLRIGKEMVK 364 KL F + + + K Sbjct: 184 VEKLVGLFEIMTQTISK 200 >gi|221133174|ref|XP_002171310.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 221 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 71/197 (36%), Gaps = 21/197 (10%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + + +D+ +LD S S+ + + L +N R G+ Sbjct: 9 VQPRCEAVVDVAFILDSSHSLEASYQKEKNFLKKLAAVFGI----------SSNGSRVGV 58 Query: 221 VTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +TFS + + L + E ++++ TT+ L A +F + Sbjct: 59 ITFSYRAKLSVKLNSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTSA------N 112 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 G K II LTDG + P D ++ +E + G ++ +G+ + + L + Sbjct: 113 GGRVGVSKLIILLTDGSQT-PGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHIT 171 Query: 338 SP-DRFYSVQNSRKLHD 353 Y+ L D Sbjct: 172 GGKKNAYTAATFDSLTD 188 >gi|126336627|ref|XP_001380264.1| PREDICTED: similar to Inter-alpha trypsin inhibitor, heavy chain 3 [Monodelphis domestica] Length = 894 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 66/172 (38%), Gaps = 21/172 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRSGLVTFSS 225 ++ V+DVS SM+ KL ++ ++L +K +N + VR+ + Sbjct: 284 VVFVIDVSGSMSG------RKLVQTKEALLKILSDVKKDDFLNFILFSSDVRT----WKE 333 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 +V P ++ +E ++++ T GL + A+E + + Sbjct: 334 NLVPATPE--NLKAAEEFVHQIQATGGTNINDGLLRGIEMVNKAREMGTVLDRSTS---- 387 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 II LTDGE + ++ A +Y +G + FL+ A Sbjct: 388 IIIMLTDGEANVGESRVEKIQENVRNAIGGKYPLYNLGFGYDVNYNFLERMA 439 >gi|116695554|ref|YP_841130.1| hypothetical protein H16_B1615 [Ralstonia eutropha H16] gi|113530053|emb|CAJ96400.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 352 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/230 (14%), Positives = 72/230 (31%), Gaps = 48/230 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D+S SM ++ A ++ + +LD + + V +G+V + Sbjct: 98 VVVVIDLSGSMRAQ-DIQPSRIRAAQQAAKVLLDTLPA------GVSAGVVAMAGTAAVA 150 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-----------------AKEKL 273 + I+RL T GL A + K Sbjct: 151 QAPSRSKDAAATAIDRLKPQGGTALGNGLLIALTTLLPQTAGDAERLMNGDTTPLQKPDA 210 Query: 274 EH---IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA- 329 H I+ +DGE+++ + + G VY +GV Sbjct: 211 SHSGEAVTPGSYPSGAIVLFSDGESNTGPAATQAAQLAAAH----GVRVYTVGVGTTDGV 266 Query: 330 --------------DQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 ++ LK A+ ++ + ++ +L + + + Sbjct: 267 VLSVDGWSARVRLDEKVLKEVANATGAEYFPLADAAQLKRVYRALNMRLT 316 >gi|333028697|ref|ZP_08456761.1| hypothetical protein STTU_6202 [Streptomyces sp. Tu6071] gi|332748549|gb|EGJ78990.1| hypothetical protein STTU_6202 [Streptomyces sp. Tu6071] Length = 418 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 70/202 (34%), Gaps = 28/202 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-GL 220 + +VLDVS SM G ++ A +S E+LD + DV +R+ G Sbjct: 26 PDAGKAPPQVELVLDVSGSMRAKDIDGASRMSAAKQSFNEVLDAV--PEDVELGIRTLGA 83 Query: 221 VTFSSKIVQTFPLAW--------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + + L T P L A + + Sbjct: 84 DYPGEDRETGCKDTRQLYPVGHPDRTEAKAAVATLSPTGWTPIGPALLGAADDLR----- 138 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K I+ +TDGE++ + + AK +V +G+ +A + Sbjct: 139 -------GGEASKRIVLITDGEDTCHRDPCEVAREIA--AKGVHLVVDTLGLVPDAKTRD 189 Query: 333 LKNC---ASPDRFYSVQNSRKL 351 +C A+ + +V++++ L Sbjct: 190 QLSCIAEATGGTYTTVRHTKDL 211 >gi|318058741|ref|ZP_07977464.1| hypothetical protein SSA3_12405 [Streptomyces sp. SA3_actG] Length = 418 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 70/202 (34%), Gaps = 28/202 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-GL 220 + +VLDVS SM G ++ A +S E+LD + DV +R+ G Sbjct: 26 PDAGKAPPQVELVLDVSGSMRAKDIDGASRMSAAKQSFNEVLDAV--PEDVELGIRTLGA 83 Query: 221 VTFSSKIVQTFPLAW--------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + + L T P L A + + Sbjct: 84 DYPGEDRETGCKDTRQLYPVGHPDRTEAKAAVATLSPTGWTPIGPALLGAADDLR----- 138 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K I+ +TDGE++ + + AK +V +G+ +A + Sbjct: 139 -------GGEASKRIVLITDGEDTCHRDPCEVAREIA--AKGVHLVVDTLGLVPDAKTRD 189 Query: 333 LKNC---ASPDRFYSVQNSRKL 351 +C A+ + +V++++ L Sbjct: 190 QLSCIAEATGGTYTTVRHTKDL 211 >gi|291448990|ref|ZP_06588380.1| secreted protein [Streptomyces roseosporus NRRL 15998] gi|291351937|gb|EFE78841.1| secreted protein [Streptomyces roseosporus NRRL 15998] Length = 420 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 72/195 (36%), Gaps = 31/195 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------- 223 + +VLDVS SM G ++ A ++ ++LD + V +R+ + Sbjct: 35 VELVLDVSGSMRTRDIDGQSRMAAAKQAFNDVLDAV--PEQVQLGIRTLGADYPGEDRKV 92 Query: 224 ----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + ++ PL + + L T P L A + + Sbjct: 93 GCKDTRQLYPVGPL--DRTEAKTAVATLAPTGFTPIGPALLGAADDL------------E 138 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--- 336 + + I+ +TDGE++ +D E A+ +V +G+ A + C Sbjct: 139 GGEGSRRIVLITDGEDTCGPLDPCEVAREI-AARGTHLVVDTLGLVPNAKIRQQLTCIAE 197 Query: 337 ASPDRFYSVQNSRKL 351 A+ + +VQ+ +L Sbjct: 198 ATGGTYTAVQHKEEL 212 >gi|302517652|ref|ZP_07269994.1| von Willebrand factor [Streptomyces sp. SPB78] gi|302426547|gb|EFK98362.1| von Willebrand factor [Streptomyces sp. SPB78] Length = 418 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 70/202 (34%), Gaps = 28/202 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-GL 220 + +VLDVS SM G ++ A +S E+LD + DV +R+ G Sbjct: 26 PDAGKAPPQVELVLDVSGSMRAKDIDGASRMSAAKQSFNEVLDAV--PEDVELGIRTLGA 83 Query: 221 VTFSSKIVQTFPLAW--------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + + L T P L A + + Sbjct: 84 DYPGEDRETGCKDTRQLYPVGHPDRTEAKAAVATLSPTGWTPIGPALLGAADDLR----- 138 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K I+ +TDGE++ + + AK +V +G+ +A + Sbjct: 139 -------GGEASKRIVLITDGEDTCHRDPCEVAREIA--AKGVHLVVDTLGLVPDAKTRD 189 Query: 333 LKNC---ASPDRFYSVQNSRKL 351 +C A+ + +V++++ L Sbjct: 190 QLSCIAEATGGTYTTVRHTKDL 211 >gi|239945536|ref|ZP_04697473.1| hypothetical protein SrosN15_31402 [Streptomyces roseosporus NRRL 15998] gi|239992004|ref|ZP_04712668.1| hypothetical protein SrosN1_32194 [Streptomyces roseosporus NRRL 11379] Length = 416 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 72/195 (36%), Gaps = 31/195 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------- 223 + +VLDVS SM G ++ A ++ ++LD + V +R+ + Sbjct: 31 VELVLDVSGSMRTRDIDGQSRMAAAKQAFNDVLDAV--PEQVQLGIRTLGADYPGEDRKV 88 Query: 224 ----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + ++ PL + + L T P L A + + Sbjct: 89 GCKDTRQLYPVGPL--DRTEAKTAVATLAPTGFTPIGPALLGAADDL------------E 134 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--- 336 + + I+ +TDGE++ +D E A+ +V +G+ A + C Sbjct: 135 GGEGSRRIVLITDGEDTCGPLDPCEVAREI-AARGTHLVVDTLGLVPNAKIRQQLTCIAE 193 Query: 337 ASPDRFYSVQNSRKL 351 A+ + +VQ+ +L Sbjct: 194 ATGGTYTAVQHKEEL 208 >gi|302036308|ref|YP_003796630.1| hypothetical protein NIDE0941 [Candidatus Nitrospira defluvii] gi|300604372|emb|CBK40704.1| conserved exported protein of unknown function, contains von Willebrand factor, type A and vault protein inter-alpha-trypsin domain [Candidatus Nitrospira defluvii] Length = 712 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 77/236 (32%), Gaps = 36/236 (15%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEM-PFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLD 170 SL +D+ L P +S+A L++ + + + + D Sbjct: 297 SLREDTVPADRDFQLIWHPAPRTEPMATVFTEQKDGTSYAMLMLVPPTQHRETTARVPRD 356 Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++D S SM G +++ + + L R ++ F+ + Sbjct: 357 ITFIIDRSGSMA---GASIEQAKGSLTAALSRLTTQD---------RFNIIQFNHTVRSL 404 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 FP+ +Q L T+ P L A D+ + Sbjct: 405 FPIPQPVTTKSMQQAIRYTEHLAADGGTEILPALRQALKSPQDSARLQQ----------- 453 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 II +TDG+ + N+E LF + ++ IG+ + ++ A R Sbjct: 454 -IILITDGQ-----VGNEEELFELLHQRVGSRRLFTIGIGSTPNSHLMRKAAETGR 503 >gi|254440702|ref|ZP_05054195.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] gi|198250780|gb|EDY75095.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] Length = 590 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 77/205 (37%), Gaps = 37/205 (18%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F + G++ I + +++ +I G+ ++ +AKL LD + L A Sbjct: 19 FQRFRKDEDGALIIFSLMMMVMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA----- 73 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + + K + + I DQ +Y Sbjct: 74 --DLDQVMAPADVVRDYMDKAGMLHFLQGDP---------------------IVDQGINY 110 Query: 126 NL-SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSM 181 + +A + MP F P +S P + + +V SS ++ + + +VLDVS SM Sbjct: 111 RIVTANASAPMPLFFYDLPKVFSSPFTPGMSSLTVSGSSTAEERVSDVEISLVLDVSSSM 170 Query: 182 NDHFGPGMDKLGVATRSIREMLDII 206 N + +++ + RE + + Sbjct: 171 NSN-----NRMTNLRPAAREFVTTV 190 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 32/72 (44%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDRFYSVQNSRKLHDAF 355 + + N C A+ +G ++Y + +A + Q L++CAS Y + + AF Sbjct: 517 NGSEANTRLSNICAAARAQGIVIYTVAFEAPSGGQTALQDCASSSSHYFDVDGTDISGAF 576 Query: 356 LRIGKEMVKQRI 367 I ++ ++ Sbjct: 577 SAIASDIRNLKL 588 >gi|332559488|ref|ZP_08413810.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] gi|332277200|gb|EGJ22515.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] Length = 651 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 80/235 (34%), Gaps = 21/235 (8%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 + PW + + + + + + L+++ ++D S SM D Sbjct: 257 TPPFRPSLSVTRTPWNPETRLVHVALQGRM-PAIEDRPPLNLVFLIDTSGSMQDPA---- 311 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 KL + +S ML ++ V V +G S+ V A I ++RL Sbjct: 312 -KLPLLKQSFGLMLGRLRPEDQVAIVTYAG----SAGEVLAPTAANQRSTILSALDRLDA 366 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T GL AY + G + + ++ TDG+ + D +E Sbjct: 367 GGSTAGEEGLALAY--------RTASEMAGAGEVTR-VVLATDGDFNLGISDPEELARLV 417 Query: 310 NEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + G + +G D ++ A + L++A + ++ Sbjct: 418 AHERDTGVYLSVLGFGRGNLDDATMQALAQNGNGQAAY-IDSLNEAQKVLVDQLS 471 >gi|301788518|ref|XP_002929675.1| PREDICTED: complement factor B-like [Ailuropoda melanoleuca] gi|281345621|gb|EFB21205.1| hypothetical protein PANDA_019913 [Ailuropoda melanoleuca] Length = 768 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 40/221 (18%), Positives = 78/221 (35%), Gaps = 34/221 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G+ A +R+ ++ + S GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGIGNFTRAKNCLRDFIEKVASYGVKPKY---GLVTYATI 317 Query: 227 IVQTFPLAWGVQHIQEKINRLI---------FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 L + + R++ + T + L+ YN + L Sbjct: 318 PKVWVKLRDDNSSDADWVTRILNQISYEDHKLKAGTNTKKALQEVYNMMSWPGNAL---L 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN------IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 +G + + II +TDG ++ + ++ L+ + K VY GV Sbjct: 375 EGWNHTRHVIILMTDGLHNMGGDPVSVIHEIRDFLYIGRDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + AS + V++ L D F+++ E Sbjct: 435 NQENINALASKKDKEQHVFRVKDMENLEDVFIQMLDETRTL 475 >gi|119585667|gb|EAW65263.1| inter-alpha (globulin) inhibitor H3, isoform CRA_b [Homo sapiens] Length = 890 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 99/283 (34%), Gaps = 24/283 (8%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + + + + + ++ + F + F ++ ++ + + + Sbjct: 188 IFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSLLN 245 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 D+ ++ E P AP + K ++ V+D+S SM Sbjct: 246 GDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------NVAFVIDISGSMA 296 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQ 241 KL ++ +L+ +K +N ++ SG V+ + +VQ P +Q + Sbjct: 297 G------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQATPE--NLQEAR 348 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 349 TFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RIPERSTSIVIMLTDGDANVGESR 404 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ A +Y +G FL+N A + ++ Sbjct: 405 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|296223654|ref|XP_002757711.1| PREDICTED: anthrax toxin receptor 1 [Callithrix jacchus] Length = 568 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 43 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTR 90 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 91 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS----- 145 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 146 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 201 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 202 PVNDGFQALQGIIHSILKK 220 >gi|58037355|ref|NP_083089.1| vitrin isoform 1 precursor [Mus musculus] gi|114154829|sp|Q8VHI5|VITRN_MOUSE RecName: Full=Vitrin; Flags: Precursor gi|21707639|gb|AAH34120.1| Vitrin [Mus musculus] Length = 650 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 71/202 (35%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S SM G + + + K + R G V ++ + Sbjct: 467 DIGFVIDGSSSM------GTSNFRTVLQFVANL---SKEFEISDTDTRVGAVQYTYEQR- 516 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 517 ---LEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 564 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A ++G I YAIG+ A D+ P D Sbjct: 565 KRKVMIIITDGRSYD------DVRIPAMAAYQKGVITYAIGIAWAAQDELEVMATHPAKD 618 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V + L+ RI + + Sbjct: 619 HSFFVDDFDNLYKIAPRIIQNI 640 >gi|260781663|ref|XP_002585923.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] gi|229270991|gb|EEN41934.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] Length = 2692 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 67/190 (35%), Gaps = 24/190 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ +LD S S G D + M + P N R G+V +S+ Sbjct: 297 FDLIFLLDESGS------IGTDNFKLVKSFTERMANNFDISP---NSTRVGVVQYSNFPG 347 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + I+++ + + T A + + + + + + DD Sbjct: 348 TEFSLNAFTDKAAVLDAISKIDYNGGSTFTGA---AIDFVRNNEFTSVNGDR--DDVPNI 402 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYS 344 +I +TDG +PN D N A G YA+G+ + L + P R Sbjct: 403 LIVITDG---NPNDDVSGPAISANNA---GITTYAVGIGSNVDQANLVQMTAGRPGRVLQ 456 Query: 345 VQNSRKLHDA 354 + L Sbjct: 457 AADFTDLTTV 466 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/316 (13%), Positives = 103/316 (32%), Gaps = 40/316 (12%) Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 ++ + + + NDFS + + + + F D + Sbjct: 984 AELTSTAGSFSRVLRANDFSDL---SAIRQPLHETICQAAFCGDPGTPANGFQQGTYFEG 1040 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI----------GLD 170 + + + + T S PL + ++ + + G D Sbjct: 1041 NTVTFGCNF--GFLLSGTDNTQCQADGSWSNPLPVCIAITTPAPTQAPACNDFPLFNGTD 1098 Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LD S S+ + + + ++ R +V +S + Sbjct: 1099 LVFLLDGSGSVGSN------NFDLVKTFTKNVVQNFDISE---TATRVAVVQYSDQFSTE 1149 Query: 231 FPL-AWGVQH-IQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L A+ + + I+ + T + +++ +F + I+ D Y + Sbjct: 1150 FSLNAFSTKTEVYNAIDNISYLTGGTFTGFAIDFVMQSVFTS------ISGERDGYPDLL 1203 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQ 346 + +TDG ++ + A+ +G +YA+GV ++ L+ A R V Sbjct: 1204 VVVTDGLSTDDVSGP------ADTARAQGVTIYAVGVGSDIDFNTLEQIAGLTSRVSQVS 1257 Query: 347 NSRKLHDAFLRIGKEM 362 + L + +++ Sbjct: 1258 DFSSLVTLSQTLSQDI 1273 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/261 (13%), Positives = 80/261 (30%), Gaps = 40/261 (15%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAP--LLITSSVKISSKSDI---------GLD 170 + + ++ + C + + ++ S + GLD Sbjct: 1577 FEGHVVTFGCNAGFILNGASSTICQGDGSWSNAVPVCVAITTPSPTRAPVCTDLSFGGLD 1636 Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LD S S+ + + R G+V FS + Sbjct: 1637 LVFLLDGSGSVTAVNFDLVKDFASGV---------VSEFQISTTETRVGVVQFSDTLRTE 1687 Query: 231 FPLA-WGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F ++ + + I+ + + T + A + A ++ Sbjct: 1688 FFMSSFSTKQQVLQAISDIDYIQGNTLTGAAITFATASSFSTPAG--------NRANFPD 1739 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPDRFYS 344 ++I +TDG + + A+ +G ++A+GV E L+ P+ Sbjct: 1740 FMIVVTDGLSQD------SVVQPAQSARDQGITIFAVGVGNEVDFATLLQITGVPEYILQ 1793 Query: 345 VQNSRKLHDAFLRIGKEMVKQ 365 V + L A L++ + Sbjct: 1794 VTDFSDLLAAQLQVAEIACNL 1814 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 66/195 (33%), Gaps = 25/195 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +I LD++ +LD S S+ + R + + R G+V + Sbjct: 1374 NRNIELDLVFLLDGSGSV------TTANFDIVKEFTRRLANNFDISL---ADTRVGVVQY 1424 Query: 224 SSKIVQTFPL-AWGVQHIQEKI--NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F L ++ + + N T + +++ F A Sbjct: 1425 SDSPTLEFNLNSFNTNELVDLAIRNIQYQQGGTNTGQAIDFVRVNSFSAN------NGDR 1478 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASP 339 D +I +TDG++S + ++ A+ G +YA+G+ ++ L+ Sbjct: 1479 SDVPNVMIVVTDGQSSDDVVGPAQT------ARNAGISMYAVGIGNGVDTNELLQIAGQV 1532 Query: 340 DRFYSVQNSRKLHDA 354 DR + L Sbjct: 1533 DRVVQSADFSTLFSV 1547 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 73/192 (38%), Gaps = 25/192 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D++ +LD S S+ + D++++ + GLV +S Sbjct: 2247 TPVDLVFLLDGSSSITSPNFQIVKDFTA---------DVVRTFNVSSAATNVGLVQYSDT 2297 Query: 227 IVQTFPL-AWGVQ-HIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L ++ + + I + T++ A + + + + +G+ Sbjct: 2298 IRTEFFLNSFDTKSGVLNAIGNIGYLQGNTRTG----AAIDFVRISSFSVPAGNRGNQP- 2352 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPDRF 342 Y+I +TDG + + + A+ G ++A+G+ +E L SP+R Sbjct: 2353 -DYLIVVTDGLSQD------DVVVPAQTARNDGISIFAVGIGSEIDFATLLNIAGSPNRI 2405 Query: 343 YSVQNSRKLHDA 354 + + L +A Sbjct: 2406 LQINDFAGLANA 2417 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 25/192 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D++ +LD S S+ + D++++ + GLV +S Sbjct: 2506 TPVDLVFLLDGSSSITSPNFQIVKDFTA---------DVVRTFNVSSAATNVGLVQYSDT 2556 Query: 227 IVQTFPL-AWGVQ-HIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L ++ + + I + T++ A + + + + +G+ Sbjct: 2557 IRTEFFLNSFDTKSEVLNAIGNIGYLQGNTRTG----AAIDFVRISSFSVPAGNRGNQP- 2611 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPDRF 342 Y+I +TDG + E L A+ G ++A+G+ E L SP+R Sbjct: 2612 -DYLIVVTDGLSQD------EVLGPAQTARFEGINIFAVGIGNEIDFTTLLHIAGSPNRV 2664 Query: 343 YSVQNSRKLHDA 354 + + L A Sbjct: 2665 LQINDFAGLASA 2676 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 55/191 (28%), Gaps = 38/191 (19%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +DM+ +LD S S+ + + +++ + + R GLV +S Sbjct: 1891 TPVDMVFLLDGSGSVTQPNFELVKQFTQ---------NVVVNFNISSATTRVGLVQYSDT 1941 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI--AKGHDDYK 284 I F L S P D A Sbjct: 1942 IRTEFFL--------------------NSHPSRNTLTGAAIDFVRTSSFSIPAGNRLTLP 1981 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPDRFY 343 ++ +TDG + A+ G +YA+G+ +E L R Sbjct: 1982 DVLVVVTDGLSQDDVAGP------AQIARDNGIAIYAVGIGSEVDFATLLDIAGLQSRVL 2035 Query: 344 SVQNSRKLHDA 354 + + L DA Sbjct: 2036 QINDFSSLLDA 2046 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 67/195 (34%), Gaps = 22/195 (11%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D++ +LD S S+ + + R + +SS Sbjct: 12 GGVDLVFLLDGSASVGASNFELVKDFTQ---------QTTAKFDISDGSTRVAVAQYSST 62 Query: 227 IVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L V + I ++ + + + G A + + + A+ Sbjct: 63 PQVEFNLNTNSDVDTLSNAIEQITYMNGDSTFTGF--AIEFVRQSAFSSFNGARDDKP-- 118 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFY 343 ++ +TDG+++ + A+ +G ++A+GV L++ A DR Sbjct: 119 DIMVVVTDGQSADSVTSS------AATAREQGVTMFAVGVGTGVGLSELQDIAGYTDRVL 172 Query: 344 SVQNSRKLHDAFLRI 358 + + +L + I Sbjct: 173 QLNDFVQLAQSADTI 187 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 59/194 (30%), Gaps = 20/194 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + D++ ++D S S+ + + M D + + R G V F Sbjct: 832 NRSVEFDLVFLVDKSSSVGPANFELVKEF---------MYDFTNTFSVGLSDTRIGAVQF 882 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + F + + I +++ A + + + + Sbjct: 883 ADAQTKDFDMDTFATKEQTLAGIQNIVYTDNQVGGVATGAAIDFVRQNSYTRGNGDRTSV 942 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASPD 340 ++ +T ++ +E+ A++ G +Y +GV + + + S Sbjct: 943 P--DLLVVVTSSASTDDVASAQET------AEKEGITIYTVGVTNSVSFAELTSTAGSFS 994 Query: 341 RFYSVQNSRKLHDA 354 R + L Sbjct: 995 RVLRANDFSDLSAI 1008 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/161 (20%), Positives = 62/161 (38%), Gaps = 24/161 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G D++ +LD S S G D + +E++ P R GL+ +S I Sbjct: 553 GADLVFLLDGSGS------IGTDNFQLVKAFTKEVIRNFAISP---TATRVGLLQYSDTI 603 Query: 228 VQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F + + ++ +++ T + +E+ F A D+Y Sbjct: 604 DNEFFMNEFNTRDELYTAVDNVVYKTGGTFTGFAVEFTRQIAFRTS------AGTRDNYP 657 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 +I +TDG N ++ + A +G ++YA+GV Sbjct: 658 DILIVVTDG-----NSEDVVTSAVA-SAIDQGILIYAVGVG 692 Score = 42.5 bits (98), Expect = 0.094, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 33/70 (47%), Gaps = 7/70 (10%) Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYS 344 ++I +TDG + ++ A+ G ++A+G+ +E AD L+ +P R Sbjct: 2153 FLIVVTDGLSQDNVAVPAQT------ARNNGISIFAVGIGSEVDADTLLQIAGTPSRTLQ 2206 Query: 345 VQNSRKLHDA 354 + + L +A Sbjct: 2207 INDFAGLVNA 2216 >gi|56797849|emb|CAF33009.1| matrilin-1 [Danio rerio] Length = 320 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 80/231 (34%), Gaps = 27/231 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + L +++ + D++ ++D S S+ + Sbjct: 3 LPGFVMLLCIMGAQATVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPS------EFEQ 56 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GS 251 + +++D + PD R G+V ++S++ L + + ++++ + Sbjct: 57 VKVFLAKVIDGLSVGPDA---TRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLST 113 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + +++A N F E + D K I +TDG D Sbjct: 114 GTMTGLAIQFAMNVAFSEAE----GGRKSPDISKVAIIVTDGRPQDNIRD------IAAR 163 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 A+ G ++AIGV + + P D V++ KL F Sbjct: 164 AREAGIEIFAIGVGRVDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|254560515|ref|YP_003067610.1| hypothetical protein METDI2058 [Methylobacterium extorquens DM4] gi|254267793|emb|CAX23640.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 733 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 91/269 (33%), Gaps = 37/269 (13%) Query: 103 QDINNIERSTSLSIIIDDQHKDYNLSAVSRY-EMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ + ER +L+ +D+ L+ + E P I A + ++T Sbjct: 277 EERSASERRITLADGATAADRDFELTWNAAPGEAPSIGLFRERVAGAEAVLAVVTPPETA 336 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + + D++ V+D S SM + A S+ LD + + R ++ Sbjct: 337 SPAASVPRDVVFVIDNSGSMGGA------SMRQAKASLLIGLDRLGAHD------RFNVI 384 Query: 222 TFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F FP A + + + L T+ L+ A + Sbjct: 385 RFDHSFDTLFPDLVPADAHHLMRAKSFVAGLQASGGTEMLAPLQAALRDATPEETGRLRQ 444 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKN 335 ++FLTDG E+ + A RG + ++ +G+ + +++ Sbjct: 445 ----------VVFLTDG------AIGNEAQIFSAIATERGRSRLFMVGIGSAPNGYLMRH 488 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 A F + ++ + + ++ Sbjct: 489 TAELGRGSFTQIDTPDQVTERMRALLVKL 517 >gi|56797873|emb|CAG27570.1| matrilin-4 [Danio rerio] Length = 428 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 48/197 (24%), Positives = 79/197 (40%), Gaps = 29/197 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D++ ++D S S+ H M K M+DII + R G+V +S Sbjct: 17 KSGPVDLVFIIDGSRSVRPHEFETMRKF---------MIDIIHELNIGLAATRIGVVQYS 67 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ F L + + + IN +I T + + YA N F A+E A+ + Sbjct: 68 SQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE----GARPNV 123 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + I+ TDG + + G +YA+GV A A L+ ASP Sbjct: 124 PHVAVIV--TDGRPQDRVAEVAAAARE------SGIEIYAVGV-ARADMTSLRAMASPPF 174 Query: 340 -DRFYSVQNSRKLHDAF 355 D + V++ L F Sbjct: 175 EDHVFLVESF-DLIHQF 190 >gi|268561232|ref|XP_002646396.1| Hypothetical protein CBG15365 [Caenorhabditis briggsae] gi|187027192|emb|CAP33692.1| hypothetical protein CBG_15365 [Caenorhabditis briggsae AF16] Length = 381 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 67/197 (34%), Gaps = 20/197 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ LD+++V+D S M G+ + SI I P R G+VT++ Sbjct: 30 ENLWLDVVLVVDNSKGMTKD---GLTAISANLASIFSDAQ-IGINPSNPKTTRIGMVTYN 85 Query: 225 SKIVQTFPLAWGVQH------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 S L + ++ + ST+ GL+ A N ++ E Sbjct: 86 SNATVDAHLNYDWPKNNDVLNFYSTMSEISEDSTSYVAHGLQAAQNLLYS-----ESFGS 140 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--QFLKNC 336 YKK II ++ + + N K RG + I + L Sbjct: 141 NRSHYKKVIIVC---ASTFKGTGKNDPIPVANRLKGRGVKILTIAYDQGDEKVVEELAKI 197 Query: 337 ASPDRFYSVQNSRKLHD 353 +SP + + L + Sbjct: 198 SSPGLSFKQDANMTLIE 214 >gi|296473276|gb|DAA15391.1| integrin, alpha D [Bos taurus] Length = 1165 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 47/227 (20%), Positives = 88/227 (38%), Gaps = 30/227 (13%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 SSH ++ T + ++ +D+ ++D S S++ +R ++D K Sbjct: 133 SSHLQIIRTVPAALPECTNQEIDIAFLIDGSGSID------QTDFKRMKNFVRAVMDRSK 186 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYN 264 + L+ +S+ + F W + Q ++ + T + G+ Sbjct: 187 G-----TNTQFSLMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLNGLTFTATGIRTVVR 241 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG- 323 ++F +K A+ K II +TDGE D E EA++ I YAIG Sbjct: 242 ELFHSKNGARKSAR------KIIIVITDGE---KYKDPLEYKDVIPEAEKANIIRYAIGV 292 Query: 324 ---VQAEAADQFLK---NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 QA AA + LK + S D + V + L ++ +++ Sbjct: 293 GDAFQAHAAREELKIIGSVPSEDHVFKVDSFAALSSIQKQLQEKIFA 339 >gi|156523104|ref|NP_001095966.1| integrin alpha-D [Bos taurus] gi|151556938|gb|AAI49717.1| ITGAD protein [Bos taurus] Length = 1165 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 47/227 (20%), Positives = 88/227 (38%), Gaps = 30/227 (13%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 SSH ++ T + ++ +D+ ++D S S++ +R ++D K Sbjct: 133 SSHLQIIRTVPAALPECTNQEIDIAFLIDGSGSID------QTDFKRMKNFVRAVMDRSK 186 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYN 264 + L+ +S+ + F W + Q ++ + T + G+ Sbjct: 187 G-----TNTQFSLMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLNGLTFTATGIRTVVR 241 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG- 323 ++F +K A+ K II +TDGE D E EA++ I YAIG Sbjct: 242 ELFHSKNGARKSAR------KIIIVITDGE---KYKDPLEYKDVIPEAEKANIIRYAIGV 292 Query: 324 ---VQAEAADQFLK---NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 QA AA + LK + S D + V + L ++ +++ Sbjct: 293 GDAFQAHAAREELKIIGSVPSEDHVFKVDSFAALSSIQKQLQEKIFA 339 >gi|53713709|ref|YP_099701.1| hypothetical protein BF2418 [Bacteroides fragilis YCH46] gi|253565657|ref|ZP_04843112.1| BatB [Bacteroides sp. 3_2_5] gi|265764033|ref|ZP_06092601.1| BatB [Bacteroides sp. 2_1_16] gi|4838139|gb|AAD30859.1|AF116251_2 BatB [Bacteroides fragilis] gi|52216574|dbj|BAD49167.1| conserved hypothetical protein BatB [Bacteroides fragilis YCH46] gi|251945936|gb|EES86343.1| BatB [Bacteroides sp. 3_2_5] gi|263256641|gb|EEZ27987.1| BatB [Bacteroides sp. 2_1_16] gi|301163418|emb|CBW22969.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 341 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 65/196 (33%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F A P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 LLFTAIGLFAVLLARPQFGS---KLETVKRKGVEVMIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R I +++D +++ + G++ F+ P+ + + + +K Sbjct: 117 RLISKLVDGMEN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + I+ +TDGEN ++G Sbjct: 170 TAIGAAIN-------LAARSFTPQEGVGRAIVVITDGENHEGGAVEAAKEAA-----KKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVNVLGVGLPDGAPI 233 >gi|307107982|gb|EFN56223.1| hypothetical protein CHLNCDRAFT_35166 [Chlorella variabilis] Length = 329 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 54/302 (17%), Positives = 94/302 (31%), Gaps = 47/302 (15%) Query: 98 ENGFAQDINNIERST-------SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 G AQD+ N + + + KDY S P P + Sbjct: 26 ATGGAQDVENFRTNLEAGYLPLPTDLTYEGLAKDYYFDTSSNATKPCTKLFCPLYSVGLS 85 Query: 151 APLLITSSVKISSKSDIGLD--------------MMMVLDVSLSMNDHFGPGMDKLGVAT 196 L+ + +GLD +++VLDVS SM F V Sbjct: 86 PDPLLGTPASSEFYMAVGLDSGMKAADFARKQLNLVVVLDVSGSMGSPFDSYYYDQTVQP 145 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVT------------FSSKIVQTFPLA----WGVQHI 240 + K DV V +G+V FS PL V + Sbjct: 146 TAGVPDEGETKKKIDVAKEVLAGIVGLLRPDDSLSVVLFSDAACVPKPLGPVRCADVDKL 205 Query: 241 QEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 +E+I+ ++ T G++ ++ + + ++FLTD + ++ + Sbjct: 206 KEQISADVVEMGGTNFQAGIDAGGAQLT---GCAACMEANASLVENRVVFLTDAQPNAGD 262 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRK----LHD 353 + L G IGV + Q +++ ++SV + L D Sbjct: 263 DSEQGLLARIKALSADGIYTTIIGVGLDFNTQLVESIGKVRGSNYFSVHTPGEFRRRLVD 322 Query: 354 AF 355 F Sbjct: 323 EF 324 >gi|146304257|ref|YP_001191573.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145702507|gb|ABP95649.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 383 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 49/230 (21%), Positives = 80/230 (34%), Gaps = 48/230 (20%) Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + S S +M F P +++ G +++LD S SM+ Sbjct: 10 SHKYSFNSDLKMAFKILLVPEKISTAT-----------------GFHYIVLLDTSGSMDG 52 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 K+ A + E+L K IP N VTFSS++ + + + Sbjct: 53 L------KIESAKKGAIELL---KRIPQGNK---VSFVTFSSRVNIVREFV-DPEDLTAE 99 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 I+ L G T L A+N Y+I LTDG + D+ Sbjct: 100 ISSLSAGGQTAFFTALLTAFN------------LHNKHGIPSYVILLTDGNPT----DDT 143 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKL 351 Y A G + G+ + + LK+ A S FY V ++ ++ Sbjct: 144 NVETYKRIAIPNGVQTISFGLGDDYNETILKSLADRSGGVFYHVNDAMEI 193 >gi|269126104|ref|YP_003299474.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311062|gb|ACY97436.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 583 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 73/202 (36%), Gaps = 29/202 (14%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-- 227 ++ ++DVS SM + G G ++ A R++R+ +D ++ R GL FS+ + Sbjct: 393 VLFLVDVSGSMAEPLPGTGRTRMQQAQRALRQAVDDFVAVD------RVGLWEFSTDLGG 446 Query: 228 -------VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 V P++ + ++E+I L T A+ + ++ + Sbjct: 447 GRDYRPLVPIRPMSTPGHRERLREQIAALRPRGDTGLYDSTLAAFRHVRAVRQDGAINS- 505 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-- 336 ++ LTDG N P G ++ I A A LK Sbjct: 506 --------VVVLTDGRNDDPGGGLSLEELLKELDGADGVRIFTIAYGAGADGGALKKISE 557 Query: 337 ASPDRFYSVQNSRKLHDAFLRI 358 A+ Y ++ L ++ Sbjct: 558 ATDAAAYDSRDPATLDKVLTQV 579 >gi|126306129|ref|XP_001365364.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 895 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 44/208 (21%), Positives = 78/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM D+L ++ + L I + +G+VTF S Sbjct: 307 LVLVLDKSGSMASG-----DRLNRLNQASKLFLLQI-----IEKGSWAGMVTFDSSATIQ 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + ++ + G T GL A+ I Sbjct: 357 SELIQIETDAQRNSLITRL-PTVAGGGTSICSGLRTAFTVIKKKFSTDGSE--------- 406 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRF 342 I+ LTDGE++ ++ C +E K+ GAI++ + + A+ + L + Sbjct: 407 -IVLLTDGEDN--------TISTCFDEVKQSGAIIHTVALGPSADPGLEKLAEMTGGMKT 457 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 + N++ L DAF + G + QR Sbjct: 458 TATDNAQNNGLIDAFSALSSGNGAITQR 485 >gi|118594676|ref|ZP_01552023.1| MxaC protein, putative [Methylophilales bacterium HTCC2181] gi|118440454|gb|EAV47081.1| MxaC protein, putative [Methylophilales bacterium HTCC2181] Length = 324 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 76/223 (34%), Gaps = 33/223 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+ + G + +VLD S SM+D F G DK G + L I + G Sbjct: 71 KVVERVGEGAQIGLVLDRSASMDDPFSGGGDKAGETKSAAASRL--IIDFFEARTNDMVG 128 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++TFS+ + PL + I+ + + T GL + +F+ A Sbjct: 129 VITFSNSAMFVLPLTENKEAIKAAVTATAGNALFQTNIGAGL-TSSAALFNEVADSGSRA 187 Query: 278 KGHDDYKKYIIFLTDG-----ENSSPNIDNK-----ESLFYCNEAKRRGAIVYAIGVQAE 327 +I L+DG N+ I + L++ + G ++ Sbjct: 188 ---------VILLSDGAGRIDANTQQKIRDWFDRFDIGLYWIVLKQEGGISIFDEDYVPR 238 Query: 328 AADQF---------LKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 DQ SP + Y ++ + L A I + Sbjct: 239 DEDQLPPQIELYEYFSTFRSPFKAYEAEDPKSLETAIKDISLK 281 >gi|12583699|dbj|BAB21479.1| integrin alpha Hr1 precursor [Halocynthia roretzi] Length = 1332 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 83/218 (38%), Gaps = 41/218 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G+D++ VLD S S+ +F D + +I + +VR G+V +S Sbjct: 203 SGVDVLFVLDGSGSVGKNFDKVKDWVK----------NITAKLDIGKEIVRVGVVQYSHY 252 Query: 227 I---------VQTFPLAWG----VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + T ++ G + + + ++R+ T T A K+ + Sbjct: 253 VEGKSINKQKYITTEISIGEFKLLDNFENAVDRIQLQGYTTYTG---RALQKVIRDFDDA 309 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 K + ++ LTDG+ DNK L N + +G +A+GV E L Sbjct: 310 YIGNK------QVLLLLTDGQ----AKDNKLILPNANRLRNKGIATFAVGVG-EYDISEL 358 Query: 334 KNCASP----DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 K AS DR ++V + +L + E+ + Sbjct: 359 KLIASGTDSTDRVFTVTDFGELDSIVKSLQTEIQSFVL 396 >gi|221108025|ref|XP_002157834.1| PREDICTED: similar to proximal thread matrix protein 1, partial [Hydra magnipapillata] Length = 299 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 69/189 (36%), Gaps = 20/189 (10%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 ++ + + + +D+ +LD S S+ + + L Sbjct: 125 ATTQASSTDGHVQPRCEAVVDVAFILDSSHSLEASYQKEKNFLKKLAAVFGI-------- 176 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKI 266 +N R G++TFS + + L + E ++++ TT+ L A + Sbjct: 177 --SSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDM 234 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 F + G K II LTDG + P D ++ +E + G ++ +G+ + Sbjct: 235 FTSA------NGGRVGVSKLIILLTDGSQT-PGGDAEDPERIADELRNDGVVILGVGIGS 287 Query: 327 EAADQFLKN 335 + L + Sbjct: 288 AVNETELSH 296 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 38/103 (36%), Gaps = 8/103 (7%) Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT+ L A +F + G K II LTDG + P D ++ +E Sbjct: 4 TTRIDRALRLAQKDMFTSA------NGGRVGVSKLIILLTDGSQT-PGGDAEDPERIADE 56 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYSVQNSRKLHD 353 + G ++ +G+ + + L + Y+ L D Sbjct: 57 LRNDGVVILGVGIGSAVNETELSHITGGKKNAYTAATFDSLTD 99 >gi|194211145|ref|XP_001494734.2| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 1022 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 68/197 (34%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M +VLD S SM ++++ A + + + N G+V F S Sbjct: 307 MCLVLDKSGSMAGS--NRLNRMNQAAKHFL--------MQTIENGSWVGMVHFDSTAYIK 356 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + E + T G++ A+ + +++ Sbjct: 357 SKLIQITSSNERNKLLESL-PTAASGGTSICRGIKSAFQVLTGTYPQIDGSE-------- 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 I+ LTDGE+++ +E ++ GAI++ I + A + Y Sbjct: 408 -IVLLTDGEDNTAG-------SCVDEVRQSGAIIHFIALGPSADQAVIEMSTITGGKHKY 459 Query: 344 SVQNSRK--LHDAFLRI 358 + + L DAF + Sbjct: 460 ASDEAANNGLIDAFAAL 476 >gi|156404155|ref|XP_001640273.1| predicted protein [Nematostella vectensis] gi|156227406|gb|EDO48210.1| predicted protein [Nematostella vectensis] Length = 1128 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 74/224 (33%), Gaps = 23/224 (10%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + S D+++V+D S SM P + D + + + V R + Sbjct: 203 VEAASPQPKDVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFLAFESGVRRVKV 262 Query: 221 VT--------FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + F S + + P+ + +++ ++ T A++ + Sbjct: 263 TSGDAKDEKCFESSLAKASPV--NIDILKKFLDGEYASGGTMYAIAFNAAFDIL------ 314 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA---EAA 329 ++ + + + I+F+TDG + ++ N+ A + G+ A Sbjct: 315 DKYYKEKNTTRRPVILFMTDGAPNDDPGTILNTVKTRNQGLSTKADILTFGMGGGISPAG 374 Query: 330 DQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 L++ A + V + L D + R L Sbjct: 375 VDLLQSLAEQTLDGGARFEVSLTTALRDVSRHLLAVARSARKLV 418 >gi|153000354|ref|YP_001366035.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS185] gi|151364972|gb|ABS07972.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS185] Length = 772 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 75/188 (39%), Gaps = 28/188 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ S++ + ++++V+D S SM D + A ++ L +K N Sbjct: 382 VEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLKPEDSFN----- 430 Query: 219 GLVTFSS--KIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ F+S ++ PL + + ++ ++RL T+ L+ A + L Sbjct: 431 -IIEFNSSLSLLSATPLPATSSNLSRARQFVSRLQADGGTEMALALDAAL------PKSL 483 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ + +IF+TDG + + E++ ++ +G+ + F+ Sbjct: 484 GSVSPDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGESR-----LFTVGIGSAPNSHFM 538 Query: 334 KNCASPDR 341 + A R Sbjct: 539 QRAAELGR 546 >gi|147901111|ref|NP_001079801.1| matrilin 1, cartilage matrix protein [Xenopus laevis] gi|32450626|gb|AAH54272.1| MGC64509 protein [Xenopus laevis] Length = 490 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 72/198 (36%), Gaps = 22/198 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + A + +I+S+ N R GLV ++S + Sbjct: 38 DILFIIDSSRSVRPS------EFEQAKVFLS---QVIESLDVGANATRVGLVNYASTVKN 88 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + ++ + T + ++YA N F E + G K Sbjct: 89 EFSLKTHKAKPALLQAVKKVQPLSTGTMTGLAIQYAMNNAFTESEGARIKSPG---INKV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYSV 345 I +TDG D A+ G +YAIGV + + + P D Sbjct: 146 AIVVTDGRPQDTVKD------ISARARESGLEIYAIGVGRVDKNTLRQIASEPLDEHVDY 199 Query: 346 QNSRKLHDAFLRIGKEMV 363 S L + + +E Sbjct: 200 VESYSLIEKLSKKFQEAF 217 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 74/203 (36%), Gaps = 32/203 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D++ ++D S S+ + + I +++D + + GLV +SS Sbjct: 265 SAVDLVFLIDGSKSVRPE------NFELVKQFINQIVDSMDVGERRAH---VGLVQYSSS 315 Query: 227 IVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + Q FPL + I+ + ++ T + L+Y + D + + Sbjct: 316 VRQEFPLGRYSSKKDIKSAVKKMSYMEKGTMTGQALQY----LVDNSFAISSGGRPAVP- 370 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--- 340 K I TDG + D AK G ++A+GV ++ + P Sbjct: 371 -KVGIVFTDGRSQDYIND------AAARAKELGYKMFAVGVGNAVEEELRMIASEPQAEH 423 Query: 341 RFYSVQNSRKLHDAFLRIGKEMV 363 FY+ A IGK++ Sbjct: 424 SFYTAD-----FKAMKEIGKKLQ 441 >gi|170589747|ref|XP_001899635.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158593848|gb|EDP32443.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 634 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 69/199 (34%), Gaps = 27/199 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + K + G+D++ +LD S S+ + ++ T ++ + VR Sbjct: 59 TGTYPMKEECGVDLLFLLDTSGSLEQIYTQHINWTTQLTEALLT----------DKDDVR 108 Query: 218 SGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ ++ + F + I I + T++ L A ++F ++ Sbjct: 109 IAMIQYAETPIVEFSFGTYRDLPDITNHIMTINLHSGGTRTGKALLAAKGELFSEEKGAR 168 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAADQFL 333 A K I+ TDG + + + + + + +Y + V ++ +Q + Sbjct: 169 KNA------SKIIVLFTDG------LSVDDPIKHAQQLREIEKIKIYVVYVGSDGFEQEM 216 Query: 334 KNCASP-DRFYSVQNSRKL 351 A + +L Sbjct: 217 NRIAGGRSNVFGSNEFTRL 235 >gi|153792263|ref|NP_001093210.1| matrilin 1 [Danio rerio] gi|148726249|emb|CAN88321.1| matrilin 1 [Danio rerio] gi|148726497|emb|CAN88267.1| matrilin 1 [Danio rerio] Length = 489 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 80/231 (34%), Gaps = 27/231 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + L +++ + D++ ++D S S+ + Sbjct: 3 LPGFVMLLCILGAQATVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPS------EFEQ 56 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GS 251 + +++D + PD R G+V ++S++ L + + ++++ + Sbjct: 57 VKVFLAKVIDGLSVGPDA---TRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLST 113 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + +++A N F E + D K I +TDG D Sbjct: 114 GTMTGLAIQFAMNVAFSEAE----GGRKSPDISKVAIIVTDGRPQDNIRD------IAAR 163 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 A+ G ++AIGV + + P D V++ KL F Sbjct: 164 AREAGIEIFAIGVGRVDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 82/206 (39%), Gaps = 28/206 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ S+ D++ ++D S S+ + + I ++D + + + N V GLV Sbjct: 260 SACSNAATDVVFLIDGSKSVRPE------NFELVKKWINLIIDKLD-VSETNTHV--GLV 310 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +SS + Q FPL + ++E + R+ + T +A + + D A+ Sbjct: 311 QYSSTVKQEFPLGRHNSKRSLKEAVKRMDYMERGTMTG---HALSFLVDNSFGPNQGARP 367 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K I TDG + D +AK G +YA+GV D+ + + P Sbjct: 368 GVP--KVGIVFTDGRSQDYIGD------AAKKAKALGFKMYAVGVGNAVEDELREIASEP 419 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMV 363 D ++ + +I K++ Sbjct: 420 IADHYFYTAD----FKTMNQIAKKLQ 441 >gi|120599090|ref|YP_963664.1| cell wall anchor domain-containing protein [Shewanella sp. W3-18-1] gi|120559183|gb|ABM25110.1| LPXTG-motif cell wall anchor domain [Shewanella sp. W3-18-1] Length = 757 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 79/206 (38%), Gaps = 31/206 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S + ++++V+D S SM D + A ++ L+ +K+ N ++ F+ Sbjct: 372 STLPRELILVIDTSGSMAG------DSIVQAKSALLYALNGLKAEDSFN------IIEFN 419 Query: 225 SKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S++ Q P + + ++ I+RL T+ L A + + Sbjct: 420 SELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMALALNAAL-------PRGINRLSE 472 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +IF+TDG + + E++ ++ +G+ + F++ A Sbjct: 473 SSQSLRQVIFMTDGSVGNEQALFDLIRYQIGESR-----LFTVGIGSAPNSHFMQRAAEL 527 Query: 340 DR--FYSVQNSRKLHDAFLRIGKEMV 363 R F + N ++ ++ ++ Sbjct: 528 GRGTFTYIGNVDEVEQKISKLLSKIQ 553 >gi|159044810|ref|YP_001533604.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] gi|157912570|gb|ABV94003.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] Length = 553 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 35/74 (47%), Gaps = 3/74 (4%) Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCA-SPDRFYSVQNSRKLHD 353 + + + + C++ K + +++ IG +A ++ CA S ++ V+ ++ + Sbjct: 479 TGNSTADGYTEQICDQLKAQDVVIFTIGFEAPQRGQDLMRYCASSSGHYFDVEGV-EISE 537 Query: 354 AFLRIGKEMVKQRI 367 AF I + + R+ Sbjct: 538 AFSSIANTIQQLRL 551 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 44/346 (12%), Positives = 107/346 (30%), Gaps = 60/346 (17%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + G ++ + + ++ ++ GL I+ V+ +L D ++L A + Sbjct: 24 FCRDEAGVLTGFSLYIFILMMMIAGLTIDLMRYEAVRTRLQATSDRAVLAAAD---LDQT 80 Query: 69 GNNGKKQKNDFSYRIIKNIWQT-DFRNELRENGFAQDINNIERSTSLSII---------- 117 N ++ F+ + L ++ + +++ Sbjct: 81 TNAKAVVEDYFAKAGMSQYLDGVQVSKGLNFKEVEAQVSATIPTWFMNMSGIETLDAFAR 140 Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG-------LD 170 + + N+ ++ AN A ++ + + + Sbjct: 141 SKAEERIQNIEVSLVLDISGSMGWDGKLANMRTAADQFVRTMMAGNDNVAADGTGLTSVS 200 Query: 171 MM---MVLDVSLSMNDHFGPGMDKLG------VATRSIREMLDIIKSIPDVNNVVR--SG 219 ++ V++V + D + + AT +D K++ + + R S Sbjct: 201 IIPYHAVVNVPDELLDEYAVSTQQTVSNCVRFTATDFQSISIDRTKTLDRLAHFDRNNSN 260 Query: 220 LVTFSSKIVQTFP----------LAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 L TF+ + P L W + K+ L T + G+++A + Sbjct: 261 LHTFNGDRLIGRPWCQVGTYGAILPWSTSVTDLTNKVAELGASGNTATDIGMKWAAALLD 320 Query: 268 DAKEKLEHIAKGHDDYK----------------KYIIFLTDGENSS 297 + + + K ++ +TDGEN+S Sbjct: 321 PGTQNIVDDMIDGGHLEADLAGRPVLYSDPETIKVVVLMTDGENTS 366 >gi|115767164|ref|XP_001193368.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] gi|115976266|ref|XP_001179968.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] Length = 797 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 72/215 (33%), Gaps = 36/215 (16%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ T+ V + +++VLD+S SM + M S+ ++ +I + Sbjct: 245 PVSYTAPVFEVLQLSSVRSVVLVLDISGSMEGNRFDRM-----IQSSVVYIMSVIPTGS- 298 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK--INRLIFG--STTKSTPGLEYAYNKIF 267 + G+V F S + L + + +N L T G+ + Sbjct: 299 -----KLGIVVFDSTSQISGNLTDITETASRQRLVNALPPSPVGGTCIGCGILSGIEVLG 353 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + YII L+DGE + +E K G I+ I + Sbjct: 354 SYAQGG------------YIILLSDGEETDAPFIMDTY----DEIKNSGVIIDTITISDS 397 Query: 328 AADQF--LKNCASPDRFYSVQNSR---KLHDAFLR 357 A Q L S + ++R +L AF Sbjct: 398 ADQQMEDLSTNTSGIANFCSDDARTGIRLIQAFQS 432 >gi|332216203|ref|XP_003257234.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Nomascus leucogenys] Length = 890 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 69/175 (39%), Gaps = 13/175 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQ 229 + V+D+S SM KL ++ +L+ +K +N ++ SG V+ + +VQ Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQ 338 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P +Q + + + T GL + + A+E+ + + +I Sbjct: 339 ATPE--NLQEARTFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RVPERSTSIVIM 392 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 LTDG+ + ++ A +Y +G FL+N A + ++ Sbjct: 393 LTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|207028763|ref|NP_001124799.1| inter-alpha-trypsin inhibitor heavy chain H5 [Pongo abelii] Length = 942 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ + ++ FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|149919601|ref|ZP_01908080.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] gi|149819544|gb|EDM78972.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] Length = 349 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 73/209 (34%), Gaps = 34/209 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 + +++D S SM++ FG G+ + ++ + + + + +R GL + + Sbjct: 79 ITLLVDRSGSMDEDFG-GISRWQAVGDTLLD--PDVGVVAPLQGDIRFGLSLYDNPGDMC 135 Query: 229 ---QTFPLAWG-VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 ++ PLA + + T + L + + D + Sbjct: 136 PRVESTPLALNALADMTALYQSAAPEGDTPTGSALTSVADVVAQDP----------DPGE 185 Query: 285 KYIIFLTDGENS---SPNIDNKES--LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 K I+ TDGE PN D + + A +G + V + + L++ A Sbjct: 186 KVIVLATDGEPDTCAQPNPDEGQPEAVAAAQAAYAQGVRTVIVSVGSGISADHLQDMANA 245 Query: 338 --------SPDRFYSVQNSRKLHDAFLRI 358 S +Y + L DAF I Sbjct: 246 GAGVQPGGSDAVYYQALDQASLIDAFSEI 274 >gi|56403909|emb|CAI29739.1| hypothetical protein [Pongo abelii] Length = 694 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 73/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L ++ + ++ FS++I Sbjct: 48 VVFVLDSSASMVG------TKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 95 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 96 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 150 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 151 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 209 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 210 TRRVHEEEDAGSQLIGFYD 228 >gi|47217883|emb|CAG05005.1| unnamed protein product [Tetraodon nigroviridis] Length = 647 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 69/204 (33%), Gaps = 32/204 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + LD + V+D S S+ + I ++ ++ P R GL+ + Sbjct: 49 KAVPLDFVFVIDSSRSIRPRDYEKV------KTFIVNLVQFLEVGP---EATRVGLLQYG 99 Query: 225 SKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFD--AKEKLEHIAKG 279 S + F L+ +++ + N + T + ++YA F + H+ Sbjct: 100 SVVQPEFSLSTFSTKAEVEQAVRNMKHLATGTMTGLAIQYAAETSFTEADGARPAHLHIP 159 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + + +TDG +A++ G ++AIGV LK S Sbjct: 160 -----RIAVVVTDGRPQD------RVEEVAAQARQAGIQIFAIGVGRVDMKT-LKTIGSE 207 Query: 339 --PDRFYSVQN---SRKLHDAFLR 357 + + V + L F Sbjct: 208 PHSEHVHLVASFSQMETLVSVFQS 231 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 83/239 (34%), Gaps = 57/239 (23%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S+ +D +D++ ++D S SM + + +++ + P + G Sbjct: 397 KRSACADGAMDLVFMIDGSKSMGPA------NFERVKQFVISIVESLDVSPTGAH---VG 447 Query: 220 LVTFSSKIVQTFPLAWGV--QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L+ +S+ + F L+ Q I++ ++R+ G + + L + F A+E Sbjct: 448 LLQYSTNVRTEFTLSQHTSAQGIRQAVSRMQYMGRGSMTGSALRRMFQSSFSAEEGAR-- 505 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA------------------- 317 + + + TDG + ++ + +AK G Sbjct: 506 ----PNVPRVSVVFTDGRSQD------DASEWAKKAKNSGIPGSFSYFGGTGRFLSCSFL 555 Query: 318 -----IVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 +YA+GV + + + P+ Y Q F +G+ K + Sbjct: 556 LVLGVTIYAVGVGKAIEQELREIASEPEEKHLYYAQE-------FKDVGEITEKLKSRM 607 >gi|288563|emb|CAA47439.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens] Length = 885 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 99/283 (34%), Gaps = 24/283 (8%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + + + + + ++ + F + F ++ ++ + + + Sbjct: 184 IFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSLLN 241 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 D+ ++ E P AP + K ++ V+D+S SM Sbjct: 242 GDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------NVAFVIDISGSMA 292 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQ 241 KL ++ +L+ +K +N ++ SG V+ + +VQ P +Q + Sbjct: 293 G------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQATPE--NLQEAR 344 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 345 TFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RIPERSTSIVIMLTDGDANVGESR 400 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ A +Y +G FL+N A + ++ Sbjct: 401 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 443 >gi|325982790|ref|YP_004295192.1| PEP motif putative anchor domain-containing protein [Nitrosomonas sp. AL212] gi|325532309|gb|ADZ27030.1| PEP motif putative anchor domain protein [Nitrosomonas sp. AL212] Length = 333 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 71/197 (36%), Gaps = 27/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 GLD+ +VLD S SM D L A+ ++ L + V + S + Sbjct: 61 GGSGLDLALVLDSSGSMGAVDSGKTLNQWLQEASTALVNALPAASTSVSVIDFDSSAAI- 119 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL+ G + IN + T G++ A ++ A Sbjct: 120 ----LQGLTPLSSGSAAVISAINAIDASGGTNIGAGIDSAAAELTGANHTAGS------- 168 Query: 283 YKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPD 340 + ++ ++DG + P +L +A ++ +G+ D F ++N A+ Sbjct: 169 -TQMMVVVSDGFSSGDPASSALAALGAGVDA------IHTVGL--PGHDAFTMQNIATSG 219 Query: 341 R--FYSVQNSRKLHDAF 355 + + + L D F Sbjct: 220 NGIYTNASSLTSLIDLF 236 >gi|225352478|ref|ZP_03743501.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156985|gb|EEG70354.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 810 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 52/143 (36%), Gaps = 13/143 (9%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 S + + + D+++VLD S SM + + AT ++L + Sbjct: 81 ASGDSSSSTVTTAVPADIVLVLDKSGSMK-NSNRDTNAKNAATALASKLLTAANAALPAE 139 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 V+ +VTFS + T I ++ T L+ A N + + Sbjct: 140 QQVQMAVVTFSDRARTTSQFTTSPGAIGTAVSA-WPNGGTNWEDALKTA-NDLSSGR--- 194 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 +K+I+FL+DG + Sbjct: 195 -------SGVQKHIVFLSDGNPT 210 >gi|71896057|ref|NP_001025613.1| matrilin 1, cartilage matrix protein [Xenopus (Silurana) tropicalis] gi|60552391|gb|AAH91071.1| MGC108367 protein [Xenopus (Silurana) tropicalis] Length = 490 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 71/196 (36%), Gaps = 26/196 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + +++S+ N R GLV ++S + Sbjct: 38 DILFIIDSSRSVRPS------EFEQVKVFLS---QVVESLDVGANATRVGLVNYASTVKN 88 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + ++ + T + ++YA N F E + G K Sbjct: 89 EFSLKTHKAKPALLQAVKKVQPLSTGTMTGLAIQYAINIAFTEPEGARLKSPG---INKV 145 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I +TDG D A+ G +YAIGV + + + P + Sbjct: 146 AIIVTDGRPQDAVKD------ISARARESGLEIYAIGVGRVDKNTLRQIASEPLDEHVDY 199 Query: 345 VQN---SRKLHDAFLR 357 V++ KL F Sbjct: 200 VESYSLIEKLSKKFQE 215 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 73/203 (35%), Gaps = 32/203 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D++ ++D S S+ + + I ++++ + + GLV +SS Sbjct: 265 SAVDLVFLIDGSKSVRPE------NFELVKQFINQIVESMDVGDHRAH---VGLVQYSSS 315 Query: 227 IVQTFPLAWGVQH--IQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + Q FPL I+ + ++ T + L+Y + D + + Sbjct: 316 VRQEFPLGRYTSKKDIKSAVKKMSYMEKGTMTGQALQY----LIDNSFAISSGGRPAVP- 370 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---D 340 K I TDG + D AK G ++A+GV ++ + P Sbjct: 371 -KVGIVFTDGRSQDYIND------AALRAKELGYKMFAVGVGNAVEEELRMIASEPVVEH 423 Query: 341 RFYSVQNSRKLHDAFLRIGKEMV 363 FY+ A IGK++ Sbjct: 424 SFYTAD-----FKAMKEIGKKLQ 441 >gi|60681980|ref|YP_212124.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|60493414|emb|CAH08200.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] Length = 341 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 65/196 (33%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F A P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 LLFTAIGLFAVLLARPQFGS---KLETVKRKGVEVMIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R I +++D +++ + G++ F+ P+ + + + +K Sbjct: 117 RLISKLVDGMEN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + I+ +TDGEN ++G Sbjct: 170 TAIGAAIN-------LAARSFTPQEGVGRAIVVITDGENHEGGAVEAAKEAA-----KKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVNVLGVGLPDGAPI 233 >gi|212693196|ref|ZP_03301324.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|237709938|ref|ZP_04540419.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725395|ref|ZP_04555876.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753590|ref|ZP_06088945.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664301|gb|EEB24873.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|229436082|gb|EEO46159.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456031|gb|EEO61752.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235304|gb|EEZ20828.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 340 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 63/205 (30%), Gaps = 29/205 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 R ++ F S K+ + G++ ++ LD+S SM + Sbjct: 54 RPDIKFWLTFAALTLVILMLARPQFGS-KMETVKRSGVEAVIALDISNSMLAEDVTP-SR 111 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKIN-RL 247 L + + I ++D N + GL+ F+ P+ + E IN L Sbjct: 112 LDKSKKLISRLVDTF-------NNDKVGLIVFAGDAFTQLPITSDYVSAKMFLETINPSL 164 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 I T + A + + II +TDGEN Sbjct: 165 ITTQGTDIGTAIRLAMKSFT-----------PQEGVGRAIIVITDGENHEGGAVEAAQEA 213 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF 332 +G V+ +GV + Sbjct: 214 A-----EKGMQVFVLGVGSPDGSPI 233 >gi|56797851|emb|CAF33338.1| matrilin-3a [Danio rerio] Length = 295 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 77/215 (35%), Gaps = 27/215 (12%) Query: 152 PLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P + S+ LD++ ++D S S+ + + +M+D + P Sbjct: 45 PHRTLNPAATDSQCRSRPLDLVFIIDSSRSVRP------GEFEKVKIFLADMVDTLDVGP 98 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIF 267 D R +V ++S + F L I++ I R+ + T + ++ A ++ F Sbjct: 99 DA---TRVAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEAF 155 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K +K K I +TDG + + A+ G +YA+GV Sbjct: 156 TEKSGARPKSKN---ISKVAIIVTDGRPQDQVEEVSAA------ARASGIEIYAVGVDRA 206 Query: 328 AADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 ++P D + V+ KL F Sbjct: 207 DMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|38639545|ref|NP_943314.1| TerY [Klebsiella pneumoniae] gi|38016643|gb|AAR07664.1| TerY [Klebsiella pneumoniae] Length = 212 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 64/196 (32%), Gaps = 16/196 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM+ + ++ +L +K P ++TF S Sbjct: 3 RLPVYLLLDTSGSMHGE------PIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + + TT L N+I +K KG + Sbjct: 57 RQAVPLT---DLLSFNLPSFSASGTTALGEALSLTANRIDAEVQKTTAETKGDWRP--LV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG P D ++ L AK+ G V A +A LK + Sbjct: 112 FLMTDG---GPTDDWRKGLNEFKAAKK-GV-VVACAAGHDADTAVLKEITEIVLQLDTAD 166 Query: 348 SRKLHDAFLRIGKEMV 363 S + F + + Sbjct: 167 SSSIKAFFKWVSASVS 182 >gi|322697455|gb|EFY89235.1| U-box domain-containing protein [Metarhizium acridum CQMa 102] Length = 757 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 74/216 (34%), Gaps = 43/216 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTFS 224 D+++VLDVS SM D G A + +LD+ K I +N R G+V+F+ Sbjct: 55 DIVLVLDVSTSMEDDAPVP----GEAEETGLTVLDLTKHAALTIIETLNEKDRLGIVSFA 110 Query: 225 SKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L + + KI L +T G+ + E A Sbjct: 111 TNSTIVQTLTHMDISNKEEARRKIKALDPNGSTNLWHGIRDGIQAFEQSSENGNIRA--- 167 Query: 281 DDYKKYIIFLTDG----------------ENSSPNIDNKESLFYCNEAKRRG---AIVYA 321 ++ LTDG E+SS ++D Y + K A ++ Sbjct: 168 ------MMVLTDGMPNHMCVSPLLLWAHREHSSNSVDRCPQQGYIPKLKTLSRLPATIHT 221 Query: 322 IGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 G LK+ A + + ++ + F Sbjct: 222 FGFGYGLRSGLLKSLAEYGHGNYAFIPDAGMIGTVF 257 >gi|302832626|ref|XP_002947877.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] gi|300266679|gb|EFJ50865.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] Length = 733 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 37/252 (14%), Positives = 76/252 (30%), Gaps = 40/252 (15%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 ++S V Y+ + A + A + I + + ++ + L VLD S SM+ Sbjct: 191 SISVVPEYD------QYGLEAEAVRAVVSIKAIADVPERARVALTC--VLDRSGSMS--- 239 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK-- 243 G + + + + L GL++++ + + PL + Sbjct: 240 GGPIRLVRETCHFLIDQLTSDDF---------LGLISYAHDVREDLPLLRMTPASRTLAH 290 Query: 244 --INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI-----------IFL 290 + L+ G +T GL + A+ L Sbjct: 291 AVVEELVAGGSTALYDGLVAGLRQQMAAERDLGGGNGASGGASDSSSPSSLSLVHSCFLF 350 Query: 291 TDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSV 345 TDG+ + + E L + V+ G + + L+ A +Y + Sbjct: 351 TDGQATDGPSNPASIIEGLQAAQAPSGQHVTVHTFGFGNGHSVELLQQVAEAQSGVYYYI 410 Query: 346 QNSRKLHDAFLR 357 + F Sbjct: 411 SCEEDIACGFGD 422 >gi|332221825|ref|XP_003260065.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Nomascus leucogenys] Length = 684 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 75/197 (38%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G D+L ++ + L + V N G+V F + Sbjct: 70 VCLVLDKSGSMG-----GYDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDTTATIV 119 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + T G++YA+ I H + Sbjct: 120 NKLIQIKGSDERNTLMAGL-PTYALGGTSICSGIKYAFQVIG-----ELHSQLDGSE--- 170 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 ++ LTDGE+++ + +E K+ GAIV+ I + + A + + N FY Sbjct: 171 -VVLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALGSAADEAVIEMSNITGGSHFY 222 Query: 344 SVQNSRK--LHDAFLRI 358 + ++ L DAF + Sbjct: 223 ASDEAQNNGLIDAFGAL 239 >gi|299140485|ref|ZP_07033623.1| BatB protein [Prevotella oris C735] gi|298577451|gb|EFI49319.1| BatB protein [Prevotella oris C735] Length = 342 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 67/202 (33%), Gaps = 29/202 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 R + F KIS G+++++ LD+S SM + Sbjct: 54 RPTIKFWLLLSAMTILILMIARPQAG-TKISHDKRNGIEVIIALDISNSMLAEDVTP-SR 111 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RL 247 L + I ++D + + GLV F+ P+ + + L Sbjct: 112 LEKSKLLIENLVDHFTN-------DKVGLVVFAGDAFVQLPITSDYVSAKMFLQNIKPSL 164 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 I T + A E ++ D+ K II +TDGE+ +L Sbjct: 165 IATQGTD-----------LARAIELSQNSFMQRDNIGKAIIVITDGEDHEG-----GALE 208 Query: 308 YCNEAKRRGAIVYAIGVQAEAA 329 A ++G+ V+ +G+ Sbjct: 209 AAKAAHKKGSNVFILGIGDPKG 230 >gi|296282333|ref|ZP_06860331.1| von Willebrand factor type A domain-containing protein [Citromicrobium bathyomarinum JL354] Length = 571 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 84/236 (35%), Gaps = 33/236 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW A++ + + + +++ +LDVS SM DKL + +++ Sbjct: 190 TPWDADTRLVRIGLAG-YEAPKAERPAANLVFLLDVSGSM-----SSADKLPLVKTAMKT 243 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 ++ + R +V ++ + I +++L G +T GLE Sbjct: 244 LVGQLTPKD------RVSIVVYAGAAGLVLEPTSDSREIMAALDQLQAGGSTAGGAGLEL 297 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 AY +K + +I TDG+ + DN + L Y + ++ G + Sbjct: 298 AYKVAEASKVDGINR----------VILATDGDFNVGLSDNDKLLEYVEDKRKNGIAMSV 347 Query: 322 IGVQAEA-ADQFLKNCA--SPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 +G + ++ A + + ++ +L I K++ Q Sbjct: 348 LGFGRGNINEALMEQIADKGNGNYGYIDSAIEARKVLGEQLGATLYTIAKDVKIQV 403 >gi|218672104|ref|ZP_03521773.1| von Willebrand factor type A [Rhizobium etli GR56] Length = 366 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 76/212 (35%), Gaps = 24/212 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ PW ++ + I + + +++ ++DVS SM++ DKL Sbjct: 41 FKATVTVMPTPWNHDTELMHVAIKGYDIAPATAPHA-NLVFLIDVSGSMDEP-----DKL 94 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFG 250 + + R +++ +K V+ +VT++ I I+RL G Sbjct: 95 PLLKSAFRLLVNRLKPDDTVS------IVTYAGNAGTVLEPTRVAEKSKILSAIDRLEAG 148 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+E AY L A D + + TDG+ + +++ Sbjct: 149 GSTGGAEGIEAAY--------DLAKKAFVKDGVNRVM-LATDGDFNVGPSSDEDLKRIIE 199 Query: 311 EAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 + + G + +G D ++ A Sbjct: 200 DKREEGIFLTVLGFGRGNLNDSLMQTLAQNGN 231 >gi|118387578|ref|XP_001026893.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89308660|gb|EAS06648.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 1074 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 45/324 (13%), Positives = 103/324 (31%), Gaps = 54/324 (16%) Query: 45 KAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQD 104 K+ + A ++ N+ GN + + + + + N + D Sbjct: 261 HKKMQASTYDPIQQQANRLSNKSQGNLNQSP----VLSPMNSQILDN--SSFNPNFISTD 314 Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 +E ++I ++Y ++P + L + + Sbjct: 315 HQMLENQLEINIQST---QNYIQLFEQSQQIPVM------------ISLNTKGNFDAKAY 359 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D++ V+D S SM+ K+ + ++ ++D + R GLV F+ Sbjct: 360 QRPPIDLICVMDNSGSMHGE------KINMLKETLLYLIDQLDEKD------RLGLVLFN 407 Query: 225 SKIVQ---TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ +++ I+ + T G+ A+ I + + Sbjct: 408 SEVTFRPMKSMDTTNKLKLKQYISDIRAQGGTDINLGMTEAFKFI---------KTRKYC 458 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL---KNCAS 338 + + L+DG +S +L N + + G + + K Sbjct: 459 NPVTSVFLLSDGLDSKAQDRVAVTLK--NMSINEQFSINCFGFGRDHDPILMNQIKKIDQ 516 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEM 362 D F+ L F IG+++ Sbjct: 517 VDMFF----VDALGGLFSVIGQDV 536 >gi|12850399|dbj|BAB28702.1| unnamed protein product [Mus musculus] Length = 650 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 71/202 (35%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S SM G + + + K + R G V ++ + Sbjct: 467 DIGFVIDGSSSM------GTSNFRTVLQFVANL---SKEFEISDTDTRVGAVQYTYEQR- 516 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 517 ---LEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 564 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A ++G I YAIG+ A D+ P D Sbjct: 565 KRKVMIIITDGRSYD------DVRIPAMAACQKGVITYAIGIAWAAQDELEVMATHPAKD 618 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V + L+ RI + + Sbjct: 619 HSFFVDDFDNLYKIAPRIIQNI 640 >gi|89055932|ref|YP_511383.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] gi|88865481|gb|ABD56358.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] Length = 612 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 4/79 (5%) Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCAS-PDRFYSVQNSR 349 DG D + C+ A G IVYAIG +A + +++CAS ++ V+ R Sbjct: 535 DGVVGQSQADT-NLMAICDVANAAGIIVYAIGFEAPDRGQRVMEHCASVDANYFDVEG-R 592 Query: 350 KLHDAFLRIGKEMVKQRIL 368 ++ +AF I + + + R++ Sbjct: 593 EISEAFASIARSINQLRLI 611 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 74/222 (33%), Gaps = 36/222 (16%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F +G+I+ +L ++ G+ I+ +++L Y LD ++L A+ Sbjct: 42 RQFMRREEGTITAFATMLFILMVGASGIAIDVMRYETQRSQLQYTLDRAVLAAAS----- 96 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + ++++ + L + +++ Sbjct: 97 -------LTQPYDPEGVVRDYF----------------AIAGIDGYRLDVRVEEGLNFRR 133 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + A + E + F + + + +++ MVLD+S SM ++ Sbjct: 134 VHAYAELE---VRSIFMQMFGVRAMTSPAIGAAE---ERVRRIEVSMVLDISGSMGEN-- 185 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 M + A R + + +V +V ++ ++ Sbjct: 186 NRMTNMRPAAREFVTEVLSANENVNNELLVSVSIVPYNGRVN 227 >gi|72093926|ref|XP_787298.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] gi|115954441|ref|XP_001184332.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] Length = 504 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 63/172 (36%), Gaps = 28/172 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ R+ +LD ++ I R +V F S + Sbjct: 338 VVFVIDVSGSMRG------RKIDQTKRAFTTILDDVRPID------RINIVLFESDVRVW 385 Query: 228 ---VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + +NR+ G T GL A + + + G+ + Sbjct: 386 RSNQMVEATSDNIAAAKRHVNRIRAGGGTNLYDGLRNAVDLLME---------HGNGEAM 436 Query: 285 KYIIFLTDGENSSPNI-DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 II LTDG+ +S ++ E + ++++ FL+ Sbjct: 437 PLIIMLTDGQPTSGSVKSTSEIIQRITNLIDGRLSLFSVSFGNGVDFSFLEK 488 >gi|160874992|ref|YP_001554308.1| cell wall anchor domain-containing protein [Shewanella baltica OS195] gi|160860514|gb|ABX49048.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS195] gi|315267224|gb|ADT94077.1| Vault protein inter-alpha-trypsin domain-containing protein [Shewanella baltica OS678] Length = 771 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 72/188 (38%), Gaps = 28/188 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ S++ + ++++V+D S SM D + A ++ L +K N Sbjct: 381 VEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLKPEDSFN----- 429 Query: 219 GLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ F+S + Q + + ++ ++RL T+ L+ A + L Sbjct: 430 -IIEFNSSLSQFSATSLPATSSNLSRARQFVSRLQADGGTEMALALDAAL------PKSL 482 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + + +IF+TDG + + E++ ++ +G+ + F+ Sbjct: 483 GSASPDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGESR-----LFTVGIGSAPNSHFM 537 Query: 334 KNCASPDR 341 + A R Sbjct: 538 QRAAELGR 545 >gi|218516298|ref|ZP_03513138.1| hypothetical protein Retl8_22869 [Rhizobium etli 8C-3] Length = 432 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 79/212 (37%), Gaps = 24/212 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ PW ++ + I + + +++ ++DVS SM++ DKL Sbjct: 29 FKATVTVMPTPWNHDTELMHVAIKGYDIAPTTAPHA-NLVFLIDVSGSMDEP-----DKL 82 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFG 250 + + R +++ +K+ V+ +VT++ + I I+RL G Sbjct: 83 PLLKSAFRLLVNRLKADDTVS------IVTYAGNAGTVLEPTRVAEKPKILSAIDRLEAG 136 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+E AY L A D + + TDG+ + +++ Sbjct: 137 GSTGGAEGIEAAY--------DLAKKAFVQDGVNRVM-LATDGDFNVGPSSDEDLKRIIE 187 Query: 311 EAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 E ++ G + +G D ++ A Sbjct: 188 EKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGN 219 >gi|282601472|ref|ZP_05981787.2| von Willebrand factor type A domain protein [Subdoligranulum variabile DSM 15176] gi|282569002|gb|EFB74537.1| von Willebrand factor type A domain protein [Subdoligranulum variabile DSM 15176] Length = 246 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 64/188 (34%), Gaps = 27/188 (14%) Query: 168 GLDMMMVLDVSLSMN-------------------DHFGPG-MDKLGVATRSIREMLDIIK 207 + + + LD S SM + G +L + I+ + ++ Sbjct: 11 RVPICLCLDTSGSMGAVQGDCVDTGKTLFEDGRQWNLVTGGTSRLDELQKGIKLFYNSVR 70 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 +VTF S+ A Q + L T G+ A + + Sbjct: 71 EDEVARYAAEICIVTFDSEAKCRMDFANLDRQ---SDLPELTATGDTAMGEGVNLALDLL 127 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQ 325 K E+ KG D ++ +++ +TDG + + + ++ C + + + V+ I + Sbjct: 128 --ESRKREYQDKGVDYFQPWLVLMTDGVPNGNEGEFERAVQRCRDMEAQKKLTVFPIAIG 185 Query: 326 AEAADQFL 333 E L Sbjct: 186 DEGDQTAL 193 >gi|189524674|ref|XP_684145.3| PREDICTED: anthrax toxin receptor 1-like [Danio rerio] Length = 607 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 45/206 (21%), Positives = 70/206 (33%), Gaps = 25/206 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S G D+ VLD S S+ H+ + L P + R Sbjct: 29 EAGSSCYGGFDLYFVLDKSGSVQHHWNE--------IYYFVDHLAHKFISPQL----RMS 76 Query: 220 LVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + FS++ L I+ E++ R++ G T G + A +I+ Sbjct: 77 FIVFSTEGRILMELTEDRDQIRAGLEELQRVLPGGDTFMHKGFQKASQQIYYGTGDGYRT 136 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A II LTDGE D + +++ GA VY +GV+ Q Sbjct: 137 AS-------VIIALTDGELRENEFDL--AAREAGRSRQLGASVYCVGVKDFNETQLATIA 187 Query: 337 ASPDRFYSVQNS-RKLHDAFLRIGKE 361 S D + V + L I K Sbjct: 188 DSKDHVFPVNDGFEALQGVIDSILKR 213 >gi|317122082|ref|YP_004102085.1| von Willebrand factor A [Thermaerobacter marianensis DSM 12885] gi|315592062|gb|ADU51358.1| von Willebrand factor type A [Thermaerobacter marianensis DSM 12885] Length = 791 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 71/189 (37%), Gaps = 32/189 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+ +VLD S SM ++ A +++L V+ R ++TF ++ Sbjct: 607 PVDVCLVLDASASMAGS------RIRAAKDLAQQLL--------VSTRDRVAVITFQERV 652 Query: 228 VQ-TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 VQ PL ++ ++++ T GLE A + ++ + Sbjct: 653 VQVQVPLTRNTTRVERGLSQIQPYGLTPLAQGLEAALAYLAQSRARNP-----------L 701 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGV---QAEAADQFLKNC--ASPD 340 ++ +TDG + P A++ G + IG + +++L+ A+ Sbjct: 702 LVLITDGIPTVPYRSANPLDDAIQVARQLGEGRFGRIGFTCIGLQPNERYLRALVRAAGG 761 Query: 341 RFYSVQNSR 349 R Y V Sbjct: 762 RLYVVDELE 770 >gi|329664416|ref|NP_001193157.1| anthrax toxin receptor 1 [Bos taurus] gi|297480483|ref|XP_002691486.1| PREDICTED: anthrax toxin receptor 1 [Bos taurus] gi|296482440|gb|DAA24555.1| anthrax toxin receptor 1 [Bos taurus] Length = 564 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHKFISPQL----RMSFIVFSTQ 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIHSILKK 218 >gi|99078203|ref|YP_611461.1| von Willebrand factor, type A [Ruegeria sp. TM1040] gi|99035341|gb|ABF62199.1| von Willebrand factor type A [Ruegeria sp. TM1040] Length = 477 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 76/205 (37%), Gaps = 33/205 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-------- 223 +VLD S SM G+ K+ +A ++ +L K++P+ GL+ + Sbjct: 27 TLVLDASGSM-WGQIDGVAKITIAQDVMQHLL---KTLPENQE---LGLMAYGHRRKGDC 79 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P A Q I + + ++ T + + A + + ++EK Sbjct: 80 NDIEQLIAPAAGSRQAISQAVTQISPKGKTPLSAAVMQAADALRSSEEKAT--------- 130 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKNCA--SP 339 +I ++DGE + EA+ ++AIG +AA L+ A + Sbjct: 131 ---VILISDGEETCGLDPCAVGAEL--EARGVDFTLHAIGFGIADDAARAQLQCLAENTG 185 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ +L A ++ Sbjct: 186 GFYRDASSASELTAALAQVAVTTST 210 >gi|253998579|ref|YP_003050642.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] gi|253985258|gb|ACT50115.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] Length = 326 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 42/242 (17%), Positives = 80/242 (33%), Gaps = 48/242 (19%) Query: 151 APLLITSSVKISS---KSDIGLDMMMVLDVSLSMNDHFGPGM--DKLGVATRSIREMLDI 205 L + S ++ +G +++++D S SM+D F + + G + + E L Sbjct: 59 IILGVAGPATPSQPIERTGVGAQLVLIIDRSASMDDPFSGAIASGRAGESKAAAAERL-- 116 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLEY 261 I + G+VTFS+ + PL + I I G + T GL Sbjct: 117 ITRFVNERKNDMFGMVTFSNSAMHVLPLTESKEAILAAIRA--AGGSALFQTNIGSGLTT 174 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 L K D + II L+DG ++ + + +Y Sbjct: 175 G----------LAQFDKTPDSGSRAIILLSDGGGRIGAATQEKIRDWLDRMH---VTLYW 221 Query: 322 IGVQAEAA----------------------DQFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 I ++ + + + K S + Y ++ + L A I Sbjct: 222 IVLRQPGSISIFDETYKTPDDRPPPPAIELNDYFKTLRSGYQPYEAEDPQSLAAAIQDIN 281 Query: 360 KE 361 ++ Sbjct: 282 RK 283 >gi|241267206|ref|XP_002406336.1| calcium activated chlorine channel, putative [Ixodes scapularis] gi|215496880|gb|EEC06520.1| calcium activated chlorine channel, putative [Ixodes scapularis] Length = 519 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 42/220 (19%), Positives = 74/220 (33%), Gaps = 34/220 (15%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS---------SKSDIGLDMMMVLDV 177 + R+ +P I P A S + T + + +++VLDV Sbjct: 104 VQISDRFTLPGILEPVPLRAVSPNPVQTGTFPTNRTVFTTFRLFQRSDEKSQRVVLVLDV 163 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL---- 233 S SM G D+L + M+ + +++ G+VTFS + PL Sbjct: 164 SHSMRPRVGE--DRLAFLQCATNHMIRHM-----LHDYQALGIVTFSGRCQVAHPLVVLN 216 Query: 234 -AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 I + I+ L+ G+ T GL A + E A+G + +TD Sbjct: 217 TTDARDGIAKVIDGLVLGAGTSIGCGLSKATEML----EGNGTSARGGLVF-----LVTD 267 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 G+ + ++ G V + A + Sbjct: 268 GDENYKPWIVEQLPILV----SSGVKVSTFALGTLAEKKL 303 >gi|328712316|ref|XP_001943179.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Acyrthosiphon pisum] Length = 830 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 45/221 (20%), Positives = 82/221 (37%), Gaps = 41/221 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQ 229 ++ +LDVS SMN K+ ++ ++L I S ++ S L ++ Q Sbjct: 281 VIFILDVSGSMNGQ------KITQVKGAMSQILSEIDSEDFFTLILFSSLAQIWTINATQ 334 Query: 230 TFPLAW-------------------------GVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 W +Q+ ++ I L STT L A + Sbjct: 335 NTSNYWDDRGRNLNNFETMGENHFIFSANEQNIQYAKKFIQALEPDSTTNMEDALNKALS 394 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K + + AK K I+FLTDGE + + + + Y ++ +Y++G Sbjct: 395 IAKLGKMRFKDSAK---TPKPIIVFLTDGEMNEGITNPQALMKYVSDINVDNYPIYSLGF 451 Query: 325 QAEAADQFLKNCASPDR-----FYSVQNSR-KLHDAFLRIG 359 A +FLK + + Y ++ +LH+ + I Sbjct: 452 GKGADIEFLKKLSLNNTGFARVIYEASDASLQLHNFYKEIS 492 >gi|198435588|ref|XP_002122129.1| PREDICTED: similar to von Willebrand factor precursor (vWF) [Ciona intestinalis] Length = 3684 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/273 (13%), Positives = 85/273 (31%), Gaps = 24/273 (8%) Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC----TFPWCANSSHAPLLI 155 I ++ +T+ + S +P + T P+ P L+ Sbjct: 210 STTTTIPSVPGATTTFSSVPGVTTTTIPSVPGATTIPSVPGVTTTTVPFLPGLITTPPLL 269 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +D+++V+D S S G+D + + IK N+ Sbjct: 270 PQPHTTLGLTGCTVDLVLVVDSSYS------IGIDGFITLKDILGNL---IKKFDVPNDE 320 Query: 216 VRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKE 271 R LV +S + + L+ + + I + T + L+ + D + Sbjct: 321 TRVSLVQYSKRSQVEWLLSTYPGNLDGMLHTIAGMQMLQGVTYTYHALKLVLQTVIDGSD 380 Query: 272 KLEHIAKGHDDYKKYIIFLTDG--ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 I+ +TDG ++ + + ++ + IGV + Sbjct: 381 SGRRPDVPF-----VIVLITDGRAKDEDIREEVLNKIQVVHDKWNETFHLITIGVGSVDP 435 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 Q + D Y+V++ + + + Sbjct: 436 HQLEGIASHDDYVYTVEDMNEASSLVDEVAASI 468 >gi|194223903|ref|XP_001494710.2| PREDICTED: integrin, alpha 1 [Equus caballus] Length = 1208 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 43/291 (14%), Positives = 101/291 (34%), Gaps = 49/291 (16%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA------------PLLIT 156 ++ ++ ++ + + V+ + F+ C + H + Sbjct: 128 VNTSIPNVTEVKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVV 187 Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 188 NSIAPVRECSSQLDIVIVLDGSNS--------IYPWKSVTDFLNDLLERMDIGPKQTQ-- 237 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEK 272 G+V + + F L + + N+++ T + G++ A + F Sbjct: 238 -VGIVQYGENVTHEFNLNKYSSTEEVLVAANKIVQRGGRQTMTALGIDTARKEAFTEARG 296 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-------- 324 H K K ++ +TDGE S N + + C ++ ++I + Sbjct: 297 ARHGVK------KVMVIVTDGE-SHDNHQLNQVIQDCE---KQNIQRFSIAILGHYNRGN 346 Query: 325 -QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 E + +K+ AS F++V + L +G+ + ++ Sbjct: 347 LSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFALEATVDQ 397 >gi|324504675|gb|ADY42017.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 898 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 75/200 (37%), Gaps = 27/200 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ +LD S S+ + ++ + ++ ++P + VR + ++ + Sbjct: 1 MDVIFLLDTSGSIEQIYQEH----------VKWTVSLVDALPVDRDGVRIAAIQYAGFPL 50 Query: 229 QTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L I++ ++++ F T++ L A +++F + A K Sbjct: 51 TEFALGTYLNADDIRQHLSQIKFQSGVTRTGYALRKADSELFRQERGARSDAI------K 104 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAADQFLKNCASPD-RFY 343 I+ TDG + + L +E + + +Y + V ++ + + A + Sbjct: 105 IIVLFTDG------LSIDDPLKPAHELRDIKRVKIYVVSVGSDGFEPEMNRIAGDKRNVF 158 Query: 344 SVQNSRKLHDAFLRIGKEMV 363 +L D L + Sbjct: 159 GPNELSRLRDTLLSDAERAS 178 >gi|300869833|ref|YP_003784704.1| hypothetical protein BP951000_0196 [Brachyspira pilosicoli 95/1000] gi|300687532|gb|ADK30203.1| putative membrane protein containing von Willebrand factor (vWA) type A domain, BatB [Brachyspira pilosicoli 95/1000] Length = 338 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 73/227 (32%), Gaps = 32/227 (14%) Query: 120 DQHKDYNLSAVSR-YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 K+Y + R + F+ + P KI + + + + ++LD+S Sbjct: 43 KNDKNYKRISKLRIFSASFLILALSLSIFALMQPKWGIIEQKIKTNNYM---ITILLDLS 99 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 SM +L A I + + ++ LV F+ P ++ Sbjct: 100 RSMEADDVWP-SRLERAKLEIEDFVKNTDNLS-------VALVGFAGTSFVASPFTQDME 151 Query: 239 HIQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 +N L S T+ L A N + KK II +TDGE Sbjct: 152 TFSYILNELNTKSVTLQGTRIADALVTAKNTFN-----------VNIPGKKSIILITDGE 200 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + + DN E K VY +GV +E + + Sbjct: 201 DHAGYFDN-----ILKELKDNDISVYTVGVGSELGATIRSDIGYSEN 242 >gi|226531069|ref|NP_001152771.1| inter alpha-trypsin inhibitor, heavy chain 4 isoform 2 [Mus musculus] Length = 941 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 275 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 379 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 439 RIYEDSDSALQLQDFYHEVAN 459 >gi|151357765|emb|CAO78005.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 903 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 275 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 379 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 439 RIYEDSDSALQLQDFYHEVAN 459 >gi|148692824|gb|EDL24771.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_d [Mus musculus] Length = 943 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 277 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 324 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 325 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 380 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 381 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 440 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 441 RIYEDSDSALQLQDFYHEVAN 461 >gi|148692823|gb|EDL24770.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_c [Mus musculus] Length = 927 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 277 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 324 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 325 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 380 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 381 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 440 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 441 RIYEDSDSALQLQDFYHEVAN 461 >gi|148692822|gb|EDL24769.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Mus musculus] Length = 904 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 277 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 324 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 325 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 380 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 381 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 440 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 441 RIYEDSDSALQLQDFYHEVAN 461 >gi|12836422|dbj|BAB23649.1| unnamed protein product [Mus musculus] Length = 902 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 275 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 379 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 439 RIYEDSDSALQLQDFYHEVAN 459 >gi|26340740|dbj|BAC34032.1| unnamed protein product [Mus musculus] Length = 941 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 275 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 379 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 439 RIYEDSDSALQLQDFYHEVAN 459 >gi|226531047|ref|NP_061216.2| inter alpha-trypsin inhibitor, heavy chain 4 isoform 1 [Mus musculus] gi|26340986|dbj|BAC34155.1| unnamed protein product [Mus musculus] gi|148692825|gb|EDL24772.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_e [Mus musculus] gi|151357764|emb|CAO78004.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 275 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 379 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 439 RIYEDSDSALQLQDFYHEVAN 459 >gi|16741341|gb|AAH16500.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] gi|62204734|gb|AAH92258.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 941 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 275 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 379 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 439 RIYEDSDSALQLQDFYHEVAN 459 >gi|2739028|gb|AAC25786.1| PK-120 precursor [Mus musculus] Length = 942 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 275 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 379 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 439 RIYEDSDSALQLQDFYHEVAN 459 >gi|63100270|gb|AAH94457.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM+ K+ ++ ++L + N L+ FS + Q Sbjct: 275 VIFVIDKSGSMSGK------KIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +R+ T + A + + + +K Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 379 LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLFCLGFGFDVNYPFLEKMALDNGGLAR 438 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 439 RIYEDSDSALQLQDFYHEVAN 459 >gi|326435586|gb|EGD81156.1| hypothetical protein PTSG_11196 [Salpingoeca sp. ATCC 50818] Length = 1445 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 73/201 (36%), Gaps = 28/201 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D++ +LD S S+ M + S ++ + R L+ +S Sbjct: 178 GRSAVDLLFILDGSGSIGSSNFETMRQFTATVTSFFDV---------SPDTTRVALMVYS 228 Query: 225 SKIVQTFPLAWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S + + F ++ + + +++I N G T++ L+YA +F + G Sbjct: 229 SSVTEIFDFSYVLSNTRDEIITTIRNTNYPGGGTRTGSALDYARTNMFLTSRGARPSSAG 288 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I + DG++ + ++AIG+ ++ L ASP Sbjct: 289 ---VPRVAIVIIDGQSGDSVAQP------AENLRNENVNIFAIGISGADVNE-LNMIASP 338 Query: 340 DRFYSVQNSRKLHDAFLRIGK 360 + N+ K D F + Sbjct: 339 ----PITNNVKFIDTFQAFSQ 355 >gi|291299307|ref|YP_003510585.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290568527|gb|ADD41492.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 626 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 81/217 (37%), Gaps = 24/217 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + + +S +D +++M LD S SM + G G K+ A ++ +D+ +++P Sbjct: 23 AVVGTAAPASAADNDGELLMALDASGSMEESDGAGNTKMETARDAV---IDVAEAMPGHA 79 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEK--INRLIFGSTTKSTPGLE-YAYNKIFDAK 270 GL + G + +E I+++ + T + L+ I + Sbjct: 80 K---VGL-----RAYGPASTGSGCKASKELVPIDKIDADAITTAATELKPEGDTPIAYSL 131 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEA 328 EK K I+ ++DGE + + + + +G V+ IG Q + Sbjct: 132 EKAAGDFTEAKGP-KTILLVSDGEETCGG----DPVKVAEKIASQGVDLRVHVIGFQVDD 186 Query: 329 A--DQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEM 362 A Q + A +Y Q+ L R + Sbjct: 187 ATRKQLTEIAKAGKGSYYDAQDGPALASRLKRASESA 223 >gi|265766730|ref|ZP_06094559.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] gi|263253107|gb|EEZ24583.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] Length = 608 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 75/211 (35%), Gaps = 22/211 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDK 191 + T PW N H + I K ++ ++ ++DVS SM +GP ++ Sbjct: 212 VRITTEVGTCPW--NEQHRLVRIGLKAKEIPTENLPASNLVFLIDVSGSM---YGP--ER 264 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V+ SG Q I+E I+ L Sbjct: 265 LDLVKSSLKLLVNNLREKDKVAIVIYSGAAG----EKLASTPGSDKQKIREAIDELEASG 320 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II TDG+ + +KE + Sbjct: 321 STAGGEGIMLAYKIAQKNFISGGNNR---------IILCTDGDFNVGVSSDKELEKLIEQ 371 Query: 312 AKRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 ++ G + +G D ++ A Sbjct: 372 KRKSGIFLTVLGYGMGNYKDNKMQTLAEKGN 402 >gi|327543487|gb|EGF29906.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 464 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 79/244 (32%), Gaps = 26/244 (10%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 + H+ + ++H + +T S++ +++ +VL Sbjct: 13 MTATPLHQASAEQVKLDVRL-VHPVMKAGEKQTNHLRIALTGFELKSTEERPPVNVCLVL 71 Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 D S SM+ KL A + +D + V+ +V + S + P Sbjct: 72 DHSGSMSGQ------KLARAKEAAEAAIDRLSDDDIVS------VVLYDSNVTVLVPATK 119 Query: 235 -WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 I++KI + GS+T G+ ++ + +I L+DG Sbjct: 120 ATDRSSIKQKIRGIQAGSSTALFAGVSKGAAEVRKFLADEQVNR---------VILLSDG 170 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRKL 351 + +E + V +G+ + + + AS ++++ L Sbjct: 171 LANVGPKSPQELEGLGRSLMKEAISVSTLGLGSGYNEDLMVALASVGGGNHAFIEDADSL 230 Query: 352 HDAF 355 F Sbjct: 231 VAVF 234 >gi|109731121|gb|AAI13690.1| Chloride channel accessory 4 [Homo sapiens] gi|109731369|gb|AAI13688.1| Chloride channel accessory 4 [Homo sapiens] gi|313883598|gb|ADR83285.1| chloride channel accessory 4 [synthetic construct] Length = 917 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 54/277 (19%), Positives = 96/277 (34%), Gaps = 57/277 (20%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 +F NE N A + NI+ + + + +D+ + +P + P + Sbjct: 248 EFCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNT------IPMVTPPPPPVFSLLK 301 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 I + +VLD S SM G D+L ++ + L + Sbjct: 302 ISQRI---------------VCLVLDKSGSMG-----GKDRLNRMNQAAKHFL-----LQ 336 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 V N G+V F S L + + + G T G++YA+ Sbjct: 337 TVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGG-TSICSGIKYAFQV 395 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 I H + ++ LTDGE+++ + +E K+ GAIV+ I + Sbjct: 396 IG-----ELHSQLDGSE----VLLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALG 439 Query: 326 AEAADQF--LKNCASPDRFYSVQNSRK--LHDAFLRI 358 A + + FY ++ L DAF + Sbjct: 440 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGAL 476 >gi|150024242|ref|YP_001295068.1| BatB protein [Flavobacterium psychrophilum JIP02/86] gi|149770783|emb|CAL42248.1| BatB protein [Flavobacterium psychrophilum JIP02/86] Length = 346 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 64/174 (36%), Gaps = 20/174 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ G+D++ +DVS SM ++L + + + ++++ + S R Sbjct: 79 TKVEKVKREGIDIVFAIDVSKSMLCEDIAP-NRLEKSKQVVSQIINNLGS-------DRI 130 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G++ ++ P+ + + + + L+ A + + Sbjct: 131 GIIAYAGSAFPVLPITTDYNVAKMFLQSMTSDMVSSQGSNLDEAI-------KLSAKYFE 183 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 G + K +I LTDGE+ S ++ K+ G + IG+ A Sbjct: 184 GSPNTSKLMIMLTDGEDHSEGAESAAEEA-----KKIGMKIITIGIGTTAGGPI 232 >gi|119593591|gb|EAW73185.1| chloride channel, calcium activated, family member 4, isoform CRA_b [Homo sapiens] Length = 918 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 54/277 (19%), Positives = 96/277 (34%), Gaps = 57/277 (20%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 +F NE N A + NI+ + + + +D+ + +P + P + Sbjct: 249 EFCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNT------IPMVTPPPPPVFSLLK 302 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 I + +VLD S SM G D+L ++ + L + Sbjct: 303 ISQRI---------------VCLVLDKSGSMG-----GKDRLNRMNQAAKHFL-----LQ 337 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 V N G+V F S L + + + G T G++YA+ Sbjct: 338 TVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGG-TSICSGIKYAFQV 396 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 I H + ++ LTDGE+++ + +E K+ GAIV+ I + Sbjct: 397 IG-----ELHSQLDGSE----VLLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALG 440 Query: 326 AEAADQF--LKNCASPDRFYSVQNSRK--LHDAFLRI 358 A + + FY ++ L DAF + Sbjct: 441 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGAL 477 >gi|150397140|ref|YP_001327607.1| putative signal peptide protein [Sinorhizobium medicae WSM419] gi|150028655|gb|ABR60772.1| conserved hypothetical signal peptide protein [Sinorhizobium medicae WSM419] Length = 126 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 14/61 (22%), Positives = 27/61 (44%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 L++R + +G+ ++L AI I L I+ + +F K+ LD + L + Sbjct: 6 FCLHMRRLMRDREGNFAVLGAIAFIPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSVR 65 Query: 62 K 62 Sbjct: 66 A 66 >gi|28871878|ref|NP_794497.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|28855131|gb|AAO58192.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] Length = 224 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 63/171 (36%), Gaps = 15/171 (8%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +VLDVS SM G + +L + I+ +VTF Sbjct: 15 NPTARVPICLVLDVSGSMA---GEPIRELQA---GVNMFYQAIREDEVAQYAAEISIVTF 68 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S+ +T + ++ + LI TT G+ A + + K + G D Y Sbjct: 69 GSEAKRTVDF---MAIERQDVPALIAEGTTSMGQGVNLALDLLEVRKGDYQRA--GVDYY 123 Query: 284 KKYIIFLTDGENS-SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + +++ +TDGE + ++ C K V+ I + A L Sbjct: 124 QPWMVVMTDGEPTDDITRASERIREMCESKK---LTVFPIAIGTAANLDIL 171 >gi|116625274|ref|YP_827430.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228436|gb|ABJ87145.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 320 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 77/217 (35%), Gaps = 40/217 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ + + +V+D S SM +KL + ++ + ++ V V F+ Sbjct: 85 EDVPVSLGLVIDNSGSMR-------NKLQKVEAAALALV----KASNRDDEVFI--VNFN 131 Query: 225 SKIVQTFP----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P + +++ + R+ T ++ + + + Sbjct: 132 DTAYLDNPKDKDFTNDIGELEQALKRIDARGGTAMRDAIQMSIDHLKKGHRD-------- 183 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---- 336 KK ++ +TDG ++S I+ + + A + ++Y +G+ E + Sbjct: 184 ---KKVLVVITDGNDNSSVINMERIMK---NAHQSDVLIYGVGLLTEEEHREAARAKRAL 237 Query: 337 -----ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A+ + + ++ ++ ++ ++ Q + Sbjct: 238 NDLAEATGGKTFFPKDLEEVDAIASQVAHDIRSQYTI 274 >gi|326430897|gb|EGD76467.1| hypothetical protein PTSG_07584 [Salpingoeca sp. ATCC 50818] Length = 985 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 49/228 (21%), Positives = 77/228 (33%), Gaps = 25/228 (10%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--- 211 I S + K + + +V+DVS SM D +K + + LDI+K Sbjct: 207 ILSIMPRLDKIKTPVHICLVIDVSGSM-DRHATQRNKYNQLEKFEQTYLDIVKHAARSIA 265 Query: 212 ---VNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 N +V FS T P+ GV + + L S T G+ Sbjct: 266 NFLQNEDNYLSVVAFSENASVTLPMERMTEAGVSRATDAVGALQPCSCTNLGDGVLRGMQ 325 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIG 323 + + + + ++ LTDGE + N + N K R IV G Sbjct: 326 LLLKGTDTTK--------AQPVLMVLTDGEPNEGNDARDVLRSFRNTYKHARRFIVNTFG 377 Query: 324 VQAEA-ADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 E L A+ + V +S + AF I + Q +L Sbjct: 378 FGFEQIDSPLLSELAALGGGTYAFVPDSSFVGTAF--INATVAAQLVL 423 >gi|154089852|emb|CAO81741.1| collagen type VI alpha 6 [Homo sapiens] Length = 840 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 76/200 (38%), Gaps = 25/200 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ + ++ D+M ++D S S+ M ++ ++ + P Sbjct: 602 RNQVVQEICTEEACKEMKADIMFLVDSSGSIGPENFSKM------KTFMKNLVSKSQIGP 655 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIF 267 D V+ G+V FS + F L I I+++ G TT + L + Sbjct: 656 DR---VQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFS 712 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K +I +K++I +TDGE + L ++ G I+Y++GV Sbjct: 713 PTKGARPNI-------RKFLILITDGEAQDIVKEPAVVL------RQEGVIIYSVGVFGS 759 Query: 328 AADQFLKNCASPDRFYSVQN 347 Q + P+ + V+N Sbjct: 760 NVTQLEEISGRPEMVFYVEN 779 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 56/315 (17%), Positives = 99/315 (31%), Gaps = 39/315 (12%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 N N G Q T LR G IE ++ + Sbjct: 320 SARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIAS 379 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DM 171 H + + F IT +V + S+ L D+ Sbjct: 380 HPAEQYVSKLK---TFADLAAHNQTFLKKLRNQITHTVSVFSERTETLKSGCVDTEEADI 436 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +++D S S + E++ + P VR G V ++ F Sbjct: 437 YLLIDGSGS------TQATDFHEMKTFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEF 487 Query: 232 PLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + Q + + I + G T + L + + + AK++ + H ++ Sbjct: 488 EINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQRGNKVPCH------LV 541 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSVQN 347 LT+G + L N + VYAIG++ EA L+ A + R Y V + Sbjct: 542 VLTNG------MSKDSILEPANRLREEHIRVYAIGIK-EANQTQLREIAGEEKRVYYVHD 594 Query: 348 SRKLHDAFLRIGKEM 362 L D ++ +E+ Sbjct: 595 FDALKDIRNQVVQEI 609 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 67/206 (32%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD+S + + + + ++ N +R GLV +S++ Sbjct: 228 DVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKV 278 Query: 230 TFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ G+ + + I L + T A K+ + ++ + + Sbjct: 279 INSLSMGINKSEVLQHIQNLSPRTGKAYTGA---AIKKLRKEVFSARNGSRKNQGVPQIA 335 Query: 288 IFLT--DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +T D E++ +R G ++ +G++ + Q K + P Sbjct: 336 VLVTHRDSEDNVTKAAV--------NLRREGVTIFTLGIEGASDTQLEKIASHP------ 381 Query: 346 QNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + K Sbjct: 382 --AEQYVSKLKTFADLAAHNQTFLKK 405 >gi|332185455|ref|ZP_08387203.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] gi|332014433|gb|EGI56490.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] Length = 530 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 34/166 (20%), Positives = 56/166 (33%), Gaps = 34/166 (20%) Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK--GHDDYKKYIIFLTD 292 W +++ +N L T G+ + + + KKYIIF+TD Sbjct: 364 WSRDDLKKYLNTLTPDGGTYHDNGMMWGARWASSGGIFGGNNPEKYNMMPVKKYIIFMTD 423 Query: 293 G------------------------------ENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 G ++ + C++AK G ++ + Sbjct: 424 GLFETGYSRLYSSYGVEQLDARATPGGAYSNQDDQLARHKQRFNLLCSKAKSMGYSIWVL 483 Query: 323 GVQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 G A D L NCAS P + + N L F+ IGK + R+ Sbjct: 484 GF-ATTLDASLTNCASTPSQASTSSNQAALMARFVEIGKNIGALRL 528 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 72/216 (33%), Gaps = 31/216 (14%) Query: 19 ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKND 78 +L A+ L + ++G I+ ++ + ++ D ++L + N + + + Sbjct: 1 MLWALFLIPLVALVGSGIDLGTRYVTRKQMQIACDAAVLAGRRAMTNGIVDDGVRAEATK 60 Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFI 138 F N Q F ++ + + T++ I + E+P Sbjct: 61 F---FNFNFQQGMFGSKPFTPSISSATTS---KTTVVINAATTVPTSLMRIFGSDELPVS 114 Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP--GMDKLGVAT 196 S+ + D++ VLD + SM D K+ Sbjct: 115 VSCNA-------------------SQDFVNTDIVFVLDTTGSMRDKATSSDSQTKIEALR 155 Query: 197 RSIREMLDIIKSIPDVNNV----VRSGLVTFSSKIV 228 ++ + D + + + +R G+V ++S + Sbjct: 156 SAVLALYDQLAPVQNQLAASGMRLRYGVVPYASAVN 191 >gi|194211467|ref|XP_001916467.1| PREDICTED: similar to collagen, type VI, alpha 3 [Equus caballus] Length = 3165 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 67/195 (34%), Gaps = 22/195 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S S+ + + + D+IKS+ N LV F Sbjct: 32 KNGAAADIIFLVDSSWSIGKEHFQLVREF---------LYDVIKSLAVGENDFHFALVQF 82 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I+ + + + T + ++ D Sbjct: 83 NGNPHTEFLLNTYRTKQEVLSHISNMSYIGGSNQTG---KGLEYVMQTHLTQAAGSRASD 139 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + I+ LTDG + E K V+AIGV+ + + P Sbjct: 140 GVPQVIVVLTDGHSEDGLALPTA------ELKSADVNVFAIGVEDADEGALKEIASEPLN 193 Query: 340 DRFYSVQNSRKLHDA 354 ++++N LHD Sbjct: 194 MHVFNLENFTSLHDI 208 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 73/213 (34%), Gaps = 22/213 (10%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 + ++ + ++ D++ +LD S ++ + P + Sbjct: 612 LQGMLPSLLAPLRTLSGTTEVHVNKR---DIIFLLDGSSNVGETNFPYVRDF-------- 660 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPG 258 +++I+ S+ ++ +R GLV FS V F L + + +L S Sbjct: 661 -VMNIVNSLDVGSDNIRVGLVQFSDTPVTEFSLNTYQTKAELLAHLRQLQLQGG--SGLN 717 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 A + + ++ + + ++ LT G++ L N R G + Sbjct: 718 TGSALSYVHANHFTEAGGSRIRERVPQLLLLLTAGQSED------SYLQAANALARAGIL 771 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 + +G + + + +P Y + + L Sbjct: 772 TFCVGARQANKAELEQIAFNPSLVYLMDDFSSL 804 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 93/249 (37%), Gaps = 21/249 (8%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 NI+R+ +I D + + T + + AP + + + Sbjct: 1576 NIDRTELQTITNDPRLVFTVREFRELPNIEEKIITSFGPSGVTPAPPGVDTPSPSRPEKK 1635 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S+N + L + +I+ ++ + + ++ GLV ++S Sbjct: 1636 KA-DIVFLLD--GSINFRRDSFQEVLRFVS-------EIVDTLYEGGDSIQVGLVQYNSD 1685 Query: 227 IVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L Q I + IN++++ + + + E ++ Sbjct: 1686 PTDEFFLKDFSTKQEIIDAINKVVYKGGRHANT--KVGIEHLRLNHFVPEAGSRLDQRVP 1743 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 + +T G++ E + +RG V+A+GV+ +++ K ++ + Sbjct: 1744 QIAFVITGGKS-------VEDAQEASALTQRGVKVFAVGVRNIDSEEVGKIASNSATAFR 1796 Query: 345 VQNSRKLHD 353 V N ++L + Sbjct: 1797 VGNVQELSE 1805 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 67/180 (37%), Gaps = 22/180 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D++ ++D S G V + +L+ + +R G+V +S Sbjct: 236 AQDSADIIFLIDGSN------NTGSVNFAVIRDFLVNLLERLSI---GTQQIRVGVVQYS 286 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + + + L F + GL A + + + ++ + Sbjct: 287 DEPRTMFSLDTYSTKAQVLDAVKALGFTGGELANVGL--ALDFVVENHFTRAGGSRVEEG 344 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDR 341 + ++ ++ G +S D +L + V++ G+ A+AA + L+ A+ D Sbjct: 345 VPQVLVLISAGPSSDEIRDGVVALKQAS--------VFSFGLGAQAASKAELQQIATNDN 396 >gi|119593590|gb|EAW73184.1| chloride channel, calcium activated, family member 4, isoform CRA_a [Homo sapiens] Length = 917 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 54/277 (19%), Positives = 96/277 (34%), Gaps = 57/277 (20%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 +F NE N A + NI+ + + + +D+ + +P + P + Sbjct: 248 EFCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNT------IPMVTPPPPPVFSLLK 301 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 I + +VLD S SM G D+L ++ + L + Sbjct: 302 ISQRI---------------VCLVLDKSGSMG-----GKDRLNRMNQAAKHFL-----LQ 336 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 V N G+V F S L + + + G T G++YA+ Sbjct: 337 TVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGG-TSICSGIKYAFQV 395 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 I H + ++ LTDGE+++ + +E K+ GAIV+ I + Sbjct: 396 IG-----ELHSQLDGSE----VLLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALG 439 Query: 326 AEAADQF--LKNCASPDRFYSVQNSRK--LHDAFLRI 358 A + + FY ++ L DAF + Sbjct: 440 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGAL 476 >gi|150036262|ref|NP_036260.2| calcium-activated chloride channel regulator 4 [Homo sapiens] gi|205831469|sp|Q14CN2|CLCA4_HUMAN RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel family member 4; Short=hCLCA4; AltName: Full=Calcium-activated chloride channel protein 2; Short=CaCC-2; Short=hCaCC-2; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|37182063|gb|AAQ88834.1| CLCA4 [Homo sapiens] gi|56203696|emb|CAI22170.1| chloride channel, calcium activated, family member 4 [Homo sapiens] Length = 919 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 54/277 (19%), Positives = 96/277 (34%), Gaps = 57/277 (20%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 +F NE N A + NI+ + + + +D+ + +P + P + Sbjct: 248 EFCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNT------IPMVTPPPPPVFSLLK 301 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 I + +VLD S SM G D+L ++ + L + Sbjct: 302 ISQRI---------------VCLVLDKSGSMG-----GKDRLNRMNQAAKHFL-----LQ 336 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 V N G+V F S L + + + G T G++YA+ Sbjct: 337 TVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGG-TSICSGIKYAFQV 395 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 I H + ++ LTDGE+++ + +E K+ GAIV+ I + Sbjct: 396 IG-----ELHSQLDGSE----VLLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALG 439 Query: 326 AEAADQF--LKNCASPDRFYSVQNSRK--LHDAFLRI 358 A + + FY ++ L DAF + Sbjct: 440 RAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGAL 476 >gi|316956996|gb|EFV46978.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 328 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 43/198 (21%), Positives = 74/198 (37%), Gaps = 23/198 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIKSIPDVNNVVRSGLVTFSSKI- 227 D+M VLD S S+ V I L + I +N R ++ +S +I Sbjct: 37 DLMFVLDGSGSIGS---------AVFKNEILRFLREFINLFTIGSNHTRLAIIQYSDQIR 87 Query: 228 -VQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F A + E +NR+ TK+ L + F I G + Sbjct: 88 HELDFKEANSKAEVDEALNRVEYLTGLTKTGDALTDMFKIGFSKSRGARPIETG---VHR 144 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASPDRFYS 344 I +TDG + F NEAK+ +++A+GV + + ++ S DR + Sbjct: 145 VAIVITDGRSQDIVS------FSANEAKKSNVLMFAVGVTDHVSEAELVEIAGSKDRVFL 198 Query: 345 VQNSRKLHDAFLRIGKEM 362 V+ L+ + ++ Sbjct: 199 VKEFTDLNVRLRSLIQKA 216 >gi|5726289|gb|AAD48398.1|AF127035_1 calcium-activated chloride channel protein 2 [Homo sapiens] Length = 917 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 75/198 (37%), Gaps = 38/198 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G D+L ++ + L + V N G+V F S Sbjct: 307 VCLVLDKSGSMG-----GKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIV 356 Query: 231 FPLAWGVQHIQEKINRLIFG------STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L ++ N L+ G T G++YA+ I H + Sbjct: 357 NKLI--QIKSSDERNTLMAGLPTYPLGGTSICSGIKYAFQVIG-----ELHSQLDGSE-- 407 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 ++ LTDGE+++ + +E K+ GAIV+ I + A + + F Sbjct: 408 --VLLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGGSHF 458 Query: 343 YSVQNSRK--LHDAFLRI 358 Y ++ L DAF + Sbjct: 459 YVSDEAQNNGLIDAFGAL 476 >gi|307352559|ref|YP_003893610.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307155792|gb|ADN35172.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 1022 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 33/268 (12%), Positives = 70/268 (26%), Gaps = 88/268 (32%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT----- 222 +D+M+ D S SM + D++ ++ + I R GL + Sbjct: 526 PIDVMLTADRSGSMLRDYP---DRMVSLMDALEDF-----GIEMKEGWDRLGLASFGTYG 577 Query: 223 ------------------------------------FSSKIVQTFPLAWGVQHIQEKINR 246 ++ L ++ Sbjct: 578 NADIIDYGNRYWAGYDNSYYDDWEYISEHYAGNDKNYNDYATIDLNLTEDFSDYNTEVKA 637 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP-------- 298 L+ T GL Y+ + D K ++ L+DG+ + Sbjct: 638 LVPDGGTPMRKGLYYSIKHLRDNGRDDA---------VKAVVVLSDGDYNYYGDPLARGS 688 Query: 299 ---NIDNKE------SLFYCNE--------AKRRGAIVYAIGVQ---AEAADQFLKNCA- 337 D + + N AK +++I + L+ A Sbjct: 689 GGTKWDWSDMQEKYYTFSDLNSSEQDMRIFAKDNDIKIFSIAYADGISSEGKAVLQALAE 748 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++Y + L + + I E+ + Sbjct: 749 GTGGKYYYAPSGEDLEEIYEDIAGELKE 776 >gi|260810653|ref|XP_002600071.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] gi|229285356|gb|EEN56083.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] Length = 1096 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 66/188 (35%), Gaps = 27/188 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLDVS SM G ++ RS+ + + + + G+VTF + Sbjct: 167 VVLVLDVSGSM-----TGQGRMERLRRSVSTYI-----LSTIEDGAWLGIVTFRGTSHKI 216 Query: 231 FPLA-WGVQHIQEKINRLIFGS--------TTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 L ++E+I + T T + A + A + G Sbjct: 217 CDLEQLNGDSVREEILNMTLDGLTNRTGKVGTNITRAVTLAVQILGPAVQDR---KLGDS 273 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAEAADQFLKNCASP 339 + +I +TDG + N L + AK G ++ I + AE L Sbjct: 274 TGPRQMILITDGRDRRLNNSVIFMLQN-DTAK--GVVIDTIALGDGAEEGLPLLSEVTGG 330 Query: 340 DRFYSVQN 347 F+S + Sbjct: 331 QFFFSPDS 338 >gi|149920662|ref|ZP_01909127.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] gi|149818449|gb|EDM77898.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] Length = 540 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 78/225 (34%), Gaps = 28/225 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + N + + + + + + L++ + +D+S SM G +D++ Sbjct: 127 VMGNMINGGNCTVVVVGMNTPIDPAELDRPPLNLTIAVDLSKSME---GEPIDRVRQGLL 183 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGSTTKST 256 +RE L+ R LV F + A + I L+ +T Sbjct: 184 QMREQLEPED---------RVTLVGFGDEAQVIVENADKDSVELATAIAALVPWGSTNLY 234 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP--NIDNKESLFYCNEAKR 314 GL A+ + + ++ ++ ++DG ++ N D E L Sbjct: 235 AGLRTAFEQ---------TDLYAQEGWQNRVLLVSDGVPTTGIVNSDKIEGLAEAWSGMG 285 Query: 315 RGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 G +G+ + + ++N + FY V++ + + F Sbjct: 286 YGLT--TVGIGNDFDIELMRNLSELGSGSFYYVEDPDAVIEVFSE 328 >gi|86147474|ref|ZP_01065786.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] gi|85834767|gb|EAQ52913.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] Length = 460 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 53/294 (18%), Positives = 102/294 (34%), Gaps = 56/294 (19%) Query: 14 KGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGK 73 +G +++++ I LP I +V+GL ++ + VK+KL +D + + A + N E+ Sbjct: 13 RGLVALMSIIALPFILLVVGLSVDAGRAYIVKSKLFAAVDAASIAAARAVANGEDAGRAA 72 Query: 74 KQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY 133 QK + + + + N + +I ID + Sbjct: 73 AQK----------YFSANIPADFYSATPSLGAVNFAYDSFGNISID---------ISATA 113 Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD--VSLSMNDHFGPGMDK 191 ++P IF L + + +D+++V+D SL + D Sbjct: 114 QVPTIFLPL--------IGLDTFNPGVSAQSIRRPVDLVLVIDNTTSLRLGSIGDVTQDV 165 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWG--VQHIQEKINRL 247 + + + + R LV F+ G I+ +I+ Sbjct: 166 IDRSKSFVENFHEGFD---------RISLVKFAFGAEVPVGFNATRGHSRSSIKSEIDSF 216 Query: 248 IFGST-----TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 FGST T ++ G+ A N++ K I+F TDG + Sbjct: 217 NFGSTSNAQYTNASEGMYRALNEL---------RTVTDPANLKVIVFFTDGAPN 261 >gi|156400924|ref|XP_001639042.1| predicted protein [Nematostella vectensis] gi|156226167|gb|EDO46979.1| predicted protein [Nematostella vectensis] Length = 1450 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 72/183 (39%), Gaps = 27/183 (14%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + ++++V+D S SM + G L +A + + +LD + N + G Sbjct: 197 VDAAAPKPKNVVLVVDSSGSMAEKHTANGKTWLQMAIDAAKAVLDTL------NPRDKVG 250 Query: 220 LVTFSSKIVQTFP--LAW------------GVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 +V+ ++ W + +++ ++ + T P L A+ Sbjct: 251 VVSLATDANTPGSNDTTWCYANTLAEANSVNINNMKIFLDGMRSAGFTMYIPALTKAFAL 310 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + ++K + DD + IIFLTD + + ++ N+ ++ A G+ Sbjct: 311 LLNSKPE------SPDDCDQVIIFLTDAKPTELKESVMRTIVESNKLLDNRVVILAYGIG 364 Query: 326 AEA 328 AE Sbjct: 365 AED 367 >gi|146343645|ref|YP_001208693.1| hypothetical protein BRADO6887 [Bradyrhizobium sp. ORS278] gi|146196451|emb|CAL80478.1| conserved hypothetical protein; protein containing a von Willebrand factor type A (VWA) domain; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 755 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 45/308 (14%), Positives = 104/308 (33%), Gaps = 38/308 (12%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N + R+ + ++ + + R +L+ + +D+ L+ Sbjct: 256 AKNDPVNPTKITVRLQAGFALGEVKSH-HHQVTVESTDAETRVITLADGVVPADRDFELT 314 Query: 129 AVSRYE-MPFIFCTFPWCANSSHAPLLITSSVKISS-KSDIGLDMMMVLDVSLSMNDHFG 186 E MP + ++ + +T ++ + D++ V+D S SM Sbjct: 315 WKPASENMPSVGLFHEQVGDADYLLAFVTPPAVATATQRPQPRDVIFVIDNSGSMGG--- 371 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-----LAWGVQHIQ 241 + A S+ L ++ R ++ F + FP A V + Sbjct: 372 ---TSIRQAKASLLYALGRLQPND------RFNVIRFDDTMTVLFPSSVPADAEHVGNAT 422 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++ L T+ P + A + D + + + ++FLTDG I Sbjct: 423 RFVSSLDARGGTEMVPAMRAA---LTDDGSDSDRM--------RQVVFLTDG-----AIG 466 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIG 359 N + LF A R + ++ +G+ + + A R F + + ++ + + Sbjct: 467 NDQQLFETITAMRGRSRIFMVGIGSAPNTYLMSRAAELGRGAFTHIGSVEQVEERMRDLF 526 Query: 360 KEMVKQRI 367 ++ + Sbjct: 527 AKLENPVV 534 >gi|326668779|ref|XP_002662551.2| PREDICTED: collagen alpha-1(VII) chain-like [Danio rerio] Length = 2698 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 71/204 (34%), Gaps = 26/204 (12%) Query: 140 CTFPW-CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 C W + L S + + D++ ++D S S+ + Sbjct: 3 CLLQWTLRLAVLLVLPSLSKAQGQCNDVVAADVVFLVDGSSSIGRA------NFMLVKSF 56 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKS 255 + ++ N +R G V +S F G + I N G T++ Sbjct: 57 MAGIVKPFAKAV-GPNGIRFGAVQYSDTARVEFTFTAYLNGTELITAVENINYKGGNTRT 115 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 GL+Y + F D K I +TDG++ ++ + + + Sbjct: 116 GAGLKYIADNFFSPASIR--------DVPKISILITDGKSQDNVLEPSQ------KLRGL 161 Query: 316 GAIVYAIGVQAEAADQFLKNCASP 339 G ++A+G+++ + LK ASP Sbjct: 162 GVKIFAVGIKSADPAE-LKLIASP 184 >gi|254802534|sp|Q0PMD2|ANTR1_RAT RecName: Full=Anthrax toxin receptor 1; Flags: Precursor Length = 562 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 39 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHRFISPQL----RMSFIVFSTR 86 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 87 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS----- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 142 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 197 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 198 PVNDGFQALQGIIHSILKK 216 >gi|242034231|ref|XP_002464510.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] gi|241918364|gb|EER91508.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] Length = 647 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 76/216 (35%), Gaps = 42/216 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ VLDVS SM KL + +++ ++D + R +V+FSS Sbjct: 176 LDLVTVLDVSGSMVG------TKLELLKQAMGFVIDNLGPRD------RLCVVSFSSGAN 223 Query: 229 QTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + LA G + + L G T L A I + + + Sbjct: 224 RLMRLARMSDAGKSLARRAVQSLAAGGGTNIGEALRRAAKVIDERMHRNAVAS------- 276 Query: 285 KYIIFLTDGENSS------------PNIDNKESLFYC--NEAKRRGAIVYAIGVQAEAAD 330 ++ L+DG+++ N D + R A V+ G + Sbjct: 277 --VVLLSDGQDTYTVPRRGGYGGRDANYDALVPPSFAFTGAGGRPAAPVHTFGFGTDHDA 334 Query: 331 QFLKNC--ASPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + A+ F +++ + DAF + IG + Sbjct: 335 AAMHTIAEATGGTFSFIEDEAAIQDAFAQCIGGLLS 370 >gi|218196035|gb|EEC78462.1| hypothetical protein OsI_18329 [Oryza sativa Indica Group] Length = 614 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 76/208 (36%), Gaps = 33/208 (15%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+ + + G+D++ VLDVS SM +L ++ + K PD Sbjct: 16 VTAPPVLEGTARAGVDVVAVLDVSGSMEGE------RLEHVKEAMEIFIG--KLGPD--- 64 Query: 215 VVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 R +V+F++ + + L + G +E ++ L+ +T L + D Sbjct: 65 -DRLSVVSFATSVRRLTELTYMSEQGRAVAKEIVDGLVADGSTNMGAALLEGAMILRD-- 121 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K ++FL+DG N ++ F + G+ ++ Sbjct: 122 RKGARDESNGRVGC--MMFLSDGTNDEIYKEDISGEFPA----------HTFGLGSDHNP 169 Query: 331 QFLKNCA---SPDRFYSVQNSRKLHDAF 355 +++ A S + +N + AF Sbjct: 170 NVMRHIADETSATYSFVNRNIADIKGAF 197 >gi|217973614|ref|YP_002358365.1| LPXTG-motif cell wall anchor domain-containing protein [Shewanella baltica OS223] gi|217498749|gb|ACK46942.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS223] Length = 772 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 73/188 (38%), Gaps = 28/188 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ S++ + ++++V+D S SM D + A ++ L +K N Sbjct: 382 VEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLKPEDSFN----- 430 Query: 219 GLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ F+S + Q + + ++ ++RL T+ L+ A + L Sbjct: 431 -IIEFNSSLSQFSATSLPATSSNLSRARQFVSRLQADGGTEMALALDAAL------PKSL 483 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ + +IF+TDG + + E++ ++ +G+ + F+ Sbjct: 484 GSVSPDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGESR-----LFTVGIGSAPNSHFM 538 Query: 334 KNCASPDR 341 + A R Sbjct: 539 QRAAELGR 546 >gi|156403935|ref|XP_001640163.1| predicted protein [Nematostella vectensis] gi|156227296|gb|EDO48100.1| predicted protein [Nematostella vectensis] Length = 247 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 76/209 (36%), Gaps = 18/209 (8%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS +D+ V+D S SM D + + + + V+ G+V Sbjct: 25 SSPCRRRMDVAFVIDRSASMGDENFGYIKQFIKKVSHEFLL---------SKDSVQVGVV 75 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAK 278 FS F L + + I+++ F + + + L+ K+F + Sbjct: 76 PFSHHYALEFGLTNYTNHKALDAAIDKIQFEGSFSMLSGALKVVQQKLFMPQLVEVKSKA 135 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQFLKN 335 D + ++ + DG N S + ES K G ++ +G+ EA+ + L + Sbjct: 136 KKDKPLQAVVIVGDGGNLSGSDALYESSL---ALKDTGKRLFVVGLGRLEYEASMRMLAS 192 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 F++ + L + R+ + K Sbjct: 193 EPPKTHFFNAGTGKNLKNFVKRLANSICK 221 >gi|113205504|ref|NP_001037714.1| anthrax toxin receptor 1 [Rattus norvegicus] gi|111052876|gb|ABH03702.1| anthrax toxin receptor [Rattus norvegicus] gi|124297147|gb|AAI31854.1| Anthrax toxin receptor 1 [Rattus norvegicus] Length = 576 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 53 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHRFISPQL----RMSFIVFSTR 100 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 101 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS----- 155 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 156 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 211 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 212 PVNDGFQALQGIIHSILKK 230 >gi|32189436|ref|NP_473382.1| anthrax toxin receptor 1 precursor [Mus musculus] gi|17366052|sp|Q9CZ52|ANTR1_MOUSE RecName: Full=Anthrax toxin receptor 1; AltName: Full=Tumor endothelial marker 8; Flags: Precursor gi|15987505|gb|AAL11999.1|AF378762_1 tumor endothelial marker 8 precursor [Mus musculus] gi|63100398|gb|AAH94544.1| Anthrax toxin receptor 1 [Mus musculus] Length = 562 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 39 GGFDLYFILDKSGSVLHHWNE--------IYYFVEQLAHRFISPQL----RMSFIVFSTR 86 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ + A Sbjct: 87 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS----- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GAIVY +GV+ Q + S D + Sbjct: 142 --VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVF 197 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 198 PVNDGFQALQGIIHSILKK 216 >gi|255033973|ref|YP_003084594.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254946729|gb|ACT91429.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 625 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 88/218 (40%), Gaps = 26/218 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + + +S+++ +++ ++DVS SMN+ +KL + ++ + + Sbjct: 224 PWNPGLKLVHIGL-QAKTVSAENLSASNLVFLIDVSGSMNEA-----NKLPLLKQAFKLL 277 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLE 260 D ++ ++ +V ++ G + I++ +++L G +T G+E Sbjct: 278 ADQLRVEDKIS------IVAYAGSAGMVLAPTSGSEKKTIKDALDKLEAGGSTAGGEGIE 331 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY K + KG++ +I TDG+ + + E E ++ G + Sbjct: 332 LAY-----DLAKKHFLPKGNNR----VILATDGDFNVGISNESELQKLIEEKRKAGIFLS 382 Query: 321 AIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 +G D ++ A + + N ++ F Sbjct: 383 VMGFGMGNYKDSHVETLADKGNGNYAYIDNIQEARKVF 420 >gi|162312016|gb|ABX84114.1| hedgling [Nematostella vectensis] Length = 3480 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 70/194 (36%), Gaps = 24/194 (12%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D++ +LD S S+ + ++ ++D P ++T+S+ Sbjct: 175 QTSVDLVFILDTSGSVGSY------NFEKMKTFVKNVVDFFNIGP---KGTHVAVITYST 225 Query: 226 KIVQTFPLAWGVQH---IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F L ++ +N + + T + L+ A IF ++ + + Sbjct: 226 WAQVEFNLKAHHSSKAALKNAVNAIYYRSGWTYTADALDLAGRNIF----QVANGMRPDK 281 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 K + LTDG ++ N L N+ + G V+ +G+ + P Sbjct: 282 GIPKIAVLLTDGYSNGNN-----PLGPANDLRAAGVNVFCVGIGNYYERELNDIATDPDK 336 Query: 340 DRFYSVQNSRKLHD 353 D + ++N L+ Sbjct: 337 DHVFKLENFNDLNS 350 >gi|62531155|gb|AAH92555.1| LOC594926 protein [Xenopus (Silurana) tropicalis] Length = 895 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 66/172 (38%), Gaps = 13/172 (7%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-GLVTF 223 ++ +++ ++D S+SM K+ ++ ++LD +K N V+ G+ + Sbjct: 271 KEVPKNIIFIIDRSISMIGL------KMQQTKEALLKILDDVKEHDHFNFVIFDWGVEIW 324 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +V+ P + + + L T L A + + A + + Sbjct: 325 EQSLVKATPE--NLNRAKAYVRNLYPKGWTNINDALLSAISLLDQAHDARSVPKRSAS-- 380 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 IIF+TDG+ S+ + + A R +Y++G FL+ Sbjct: 381 --LIIFMTDGQPSTGERNLDKIQENARNAIRGKYSLYSLGFGVGVDYPFLEK 430 >gi|55729600|emb|CAH91529.1| hypothetical protein [Pongo abelii] Length = 955 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 75/207 (36%), Gaps = 31/207 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D+ V+D S + + V + + ++D + P R GL+ +S Sbjct: 650 TEGPIDLAFVIDGSKGLGEE------NFEVVKQFVTGIIDSLTISP---KAARVGLLQYS 700 Query: 225 SKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ F L ++ + + G + + L + + + F E + Sbjct: 701 TQVRTEFTLRNFNSAKDMKKAVAHMKYM--GKGSMTGLALRHMFERSFTQGEGARPL--- 755 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I TDG + + ++AK G +YA+GV ++ + + P Sbjct: 756 STRVPRAAIVFTDGRAQD------DVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 340 --DRFYSVQNSRKLHDAFLRIGKEMVK 364 + ++ I +++ K Sbjct: 810 TTKHLFYAED----FSTMDEISEKLKK 832 >gi|330830423|ref|YP_004393375.1| FlpL [Aeromonas veronii B565] gi|328805559|gb|AEB50758.1| FlpL [Aeromonas veronii B565] Length = 460 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 48/107 (44%), Gaps = 10/107 (9%) Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDA-----KEKLEHIAKGH--DDYKKYIIFLTDG 293 ++ ++ L T + G+ + + + ++ + + + D +K ++ +DG Sbjct: 326 RQALDTLYAAFNTNTAEGVMWGWRLLSPEWQGRWRQGAAALPRPYELQDNRKIMVLFSDG 385 Query: 294 E-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 E + + +++ L C E KR+G +Y + E +F+ CAS Sbjct: 386 EHMTEAALRDRKQLLLCREMKRKGIQIYTVAF--EGDTRFVAQCASD 430 >gi|192359934|ref|YP_001981670.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686099|gb|ACE83777.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 674 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 86/243 (35%), Gaps = 24/243 (9%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFI----FCTFPWCANSSHAPLLITSSV 159 + + ++ I + DY+ S + PF PW + + I Sbjct: 245 NQGQLPPKDAVRIEEMVNYFDYSYPLPSSAQTPFTTNITVLDSPWKPGNKLLHIGIQG-Y 303 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++ + +++ +LDVS SM++ KL + +S+ +L +K V VV +G Sbjct: 304 QLPAGHIPQSNLVFLLDVSGSMDEP-----SKLPLVKQSMELLLSTLKPEDTVAIVVYAG 358 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 V I ++ L G +T GL AY + +A + + + Sbjct: 359 AAG----TVLEPTKVREKSKILAALHNLQAGGSTAGGEGLALAYQ-LAEANFNPKGVNR- 412 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCAS 338 II TDG+ + ++ + + +G + +G D ++ A Sbjct: 413 -------IILATDGDFNVGQTGDEPLQDFVERKRAKGIYLSVLGFGQGNYQDALMQTLAQ 465 Query: 339 PDR 341 Sbjct: 466 NGN 468 >gi|47218988|emb|CAG02026.1| unnamed protein product [Tetraodon nigroviridis] Length = 1039 Score = 59.4 bits (142), Expect = 7e-07, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 59/179 (32%), Gaps = 31/179 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +++ V+D+S SM+ K+ ++ ++L+ + G++ F Sbjct: 430 RLPKNVVFVIDMSGSMSG------TKMQQTREAMLKILEDLDPEDHF------GIILFDH 477 Query: 226 KIVQTFPLAWGVQHIQE----------KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +I W + + + T + A + + + ++ Sbjct: 478 RIQF-----WNTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKRL 532 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 K D II LTDG+ +S A ++++G + FL Sbjct: 533 PEKSIDM----IILLTDGDPNSGESRIPVIQENVKAAIGGQMSLFSLGFGNDVKYPFLD 587 >gi|313200651|ref|YP_004039309.1| von willebrand factor type a [Methylovorus sp. MP688] gi|312439967|gb|ADQ84073.1| von Willebrand factor type A [Methylovorus sp. MP688] Length = 316 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 80/242 (33%), Gaps = 48/242 (19%) Query: 151 APLLITSSVKISS---KSDIGLDMMMVLDVSLSMNDHFGPGM--DKLGVATRSIREMLDI 205 L + S ++ IG +++++D S SM+D F + + G + + E L Sbjct: 49 IILGVAGPATPSQPIERTGIGAQLVLIIDRSASMDDPFSGAIASGRAGESKAAAAERL-- 106 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLEY 261 I + G+VTFS+ + PL + I I G + T GL Sbjct: 107 ITRFVNERKNDMFGMVTFSNSAMHVLPLTESKEAILAAIRA--AGGSALFQTNIGSGLTT 164 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 L K D + II L+DG ++ + + +Y Sbjct: 165 G----------LAQFDKTPDSGSRAIILLSDGGGRIGAATQEKIRDWLDRMH---VTLYW 211 Query: 322 IGVQAEAA----------------------DQFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 I ++ + + + K S + Y ++ + L A I Sbjct: 212 IVLRQPGSISIFDETYKTPDDRPPPPAIELNDYFKTLRSGYQPYEAEDPQSLAAAIQDIN 271 Query: 360 KE 361 ++ Sbjct: 272 RK 273 >gi|116622522|ref|YP_824678.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225684|gb|ABJ84393.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 324 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 80/227 (35%), Gaps = 41/227 (18%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + ++ S D + + +V D+S SM +S + K+ + Sbjct: 82 VAQTITHFSMDDEPVAVGLVFDISGSMGPK----------LQKSRMAAAEFFKTANPDDE 131 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 LV F+ + PL V+ IQ ++ T + A +++ + + Sbjct: 132 FF---LVEFNDQPKMVVPLTRDVEQIQNQLTWAQSKGRTALLDAIFLAMSELKKSTKNR- 187 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--- 331 K ++ ++DG ++S E N + ++YAIGV A + Sbjct: 188 ----------KALLIISDGGDNSSRYTESEVR---NLVRENDVLIYAIGVYEFAGGRMRT 234 Query: 332 --------FLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 L + + R ++ +L D +IG E+ + +L Sbjct: 235 PEEAGGPGLLTELSEQTGGRHLPA-DANELPDIAAKIGVELRNRYVL 280 >gi|17558302|ref|NP_505768.1| hypothetical protein C29A12.6 [Caenorhabditis elegans] gi|3874606|emb|CAA98248.1| C. elegans protein C29A12.6a, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|3877573|emb|CAB01217.1| C. elegans protein C29A12.6a, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 643 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 81/224 (36%), Gaps = 31/224 (13%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA-PLLITSSVKISSKSD 166 ++ + +D + ++ + PF P ++ P+ + S + Sbjct: 404 LKPEVPVIKPMDFMVRSRSVQFAMTEKPPFTTVMNPMKFFTTTRTPITKPKPLIPYSCTA 463 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ ++DVS D +D A S+ P + VR GL+++S Sbjct: 464 ---DVFFLVDVSQGTGDKSQQYLDIAASAISSL----------PISQDTVRVGLISYSGP 510 Query: 227 IVQTFPLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + +EK+ +F G TT++ + YA E H A+ + Sbjct: 511 GRTHVRVFLDKHNEKEKLIEEMFLMERHGGTTRTADAIRYATKIF----EGKAHPARKN- 565 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ TDG + + A+ +G + A+ V+ Sbjct: 566 -VKKVLVVFTDGYSQDNPKEASRM------ARAKGIQLIAVAVK 602 >gi|157151714|ref|NP_001096685.1| collagen alpha-3(VI) chain [Canis lupus familiaris] gi|70571939|tpe|CAI77244.1| TPA: collagen, type VI, alpha 3 [Canis lupus familiaris] Length = 3169 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 81/224 (36%), Gaps = 25/224 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + C S L + K+ D++ ++D S S+ + + Sbjct: 7 LPLVAIF---CLFLSGFSLTRAQQQQADVKNGAAADIIFLVDSSWSIGKEHFQLVREF-- 61 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST 252 + D+IKS+ ++ R LV F+ F L Q + I+ + + Sbjct: 62 -------LYDVIKSLAVGDSDFRFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGG 114 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + T + ++ D + I+ LT G + D++ +L E Sbjct: 115 SNETG---KGLEYVMQHHLTEAAGSRAGDGVPQVIVVLTHGHS-----DDRLALPSA-EL 165 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHDA 354 K A V+AIGV+ + + P ++++N LHD Sbjct: 166 KSADANVFAIGVEDADEGALKEIASEPPNMHVFNLENFTSLHDI 209 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 72/201 (35%), Gaps = 22/201 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 ++ + ++ + D++ +LD S ++ P + +++++ ++ Sbjct: 625 RTLSGTSEVHANRR---DIIFLLDGSFNVGKANFPYVRDF---------VMNVVNNLDVG 672 Query: 213 NNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ +R GLV FS V F L + + + + +L + G +Y Sbjct: 673 SDNIRVGLVQFSDTPVTEFSLNTYQTKSDLLAHLRQLQLKGGSGLNTGSALSYVHANHFT 732 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + H + ++ LT G + L N R G + + +G Sbjct: 733 EAGGSRIREHVP--QLLLLLTAGRSEDA------YLPAANALARAGVLTFCVGANQANRA 784 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + +P Y + + L Sbjct: 785 ELEQIAFNPSLVYLMDDFSSL 805 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 76/208 (36%), Gaps = 23/208 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISS-KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 I C SS AP + + + D++ ++D S G V Sbjct: 209 IVGNLVSCVQSSVAPEGAGGTETLKDITAQDSADIIFLIDGSN------NTGSAHFAVIR 262 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTK 254 + +L+ + +R G+V +S + F L + + + L F Sbjct: 263 DFLVNLLERLSV---GAQQIRVGVVQYSDEPRTVFSLDTYSTKAQVLDAVKALAFTGGEL 319 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + GL A + + + ++ + + ++ ++ G +S D +L + Sbjct: 320 ANVGL--ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRDGVVALKQAS---- 373 Query: 315 RGAIVYAIGVQAEAADQF-LKNCASPDR 341 V++ G+ A+AA + L++ A+ D Sbjct: 374 ----VFSFGLGAQAASRAELQHIATNDN 397 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 43/358 (12%), Positives = 122/358 (34%), Gaps = 54/358 (15%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V + E F++K S++ ++ ++ + +F +N Sbjct: 1473 VRIGAVQFSNEVFPVFYLKTHKSQA---SVINALRQLRHRGGSPLNTGKALEFVA---RN 1526 Query: 87 IWQTDFRNELR---------------ENGFAQDINNIERSTSLSIIIDDQH--KDYNLSA 129 ++ + + ++ ++ I S +S+ + +++ + + Sbjct: 1527 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDTSRFSQVISSSGIVSLGVGNRNIDRAELQTI 1586 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK------------SDIGLDMMMVLDV 177 + + F F N + ++ D++ +LD Sbjct: 1587 TNNPRLVFTVREFRELPNIEDRVMHAFGPSGVTPAPPGVDTPSPSRPEKKKADVVFLLD- 1645 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L + +I+ ++ + + ++ GLV ++S F L Sbjct: 1646 -GSINFRRDSFQEVLRFVS-------EIVDTLYEGGDSIQVGLVQYNSDPTDEFFLKDFS 1697 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 Q I + IN++++ + + + E ++ + +T G++ Sbjct: 1698 TKQQIIDAINKVVYKGGRHANT--KVGIEHLRLNHFVPEAGSRLDQRVPQIAFVITGGKS 1755 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + +L ++G V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1756 VEDAQEASLALT------QKGVKVFAVGVKNIDSEEVGKIASNSATAFRVGNVQELSE 1807 >gi|42407699|dbj|BAD08847.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408121|dbj|BAD09261.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] Length = 703 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 49/254 (19%), Positives = 89/254 (35%), Gaps = 42/254 (16%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + + E P I + P + + S ++ + +D++ VLDVS SM + Sbjct: 227 VVIKTHCEFPAIARSTPRDNFAVLLHVKAPSIAAEAAPARASVDLVTVLDVSGSMEGY-- 284 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQE 242 KL + R++ + R +V+FS + L G + Sbjct: 285 ----KLALLKRAMGLL----------GPGDRLAVVSFSYSARRVIRLTRMSEGGKASAKS 330 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 + L T GL A K+FD + +A +I L+DG+++ Sbjct: 331 AVESLHADGCTNILEGLVEA-AKVFDGRRYRNAVAS--------VILLSDGQDNYNVNGG 381 Query: 303 KESLFYCNEA-------KRRG---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRK 350 + N + KR G V+ G + + A + F ++N Sbjct: 382 WGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGTDHDASAMHTIAEETGGTFSFIENQAV 441 Query: 351 LHDAFLR-IGKEMV 363 + DAF + IG + Sbjct: 442 VQDAFAQCIGGLLS 455 >gi|88811039|ref|ZP_01126295.1| von Willebrand factor, type A [Nitrococcus mobilis Nb-231] gi|88791578|gb|EAR22689.1| von Willebrand factor, type A [Nitrococcus mobilis Nb-231] Length = 930 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 68/232 (29%), Gaps = 30/232 (12%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DI 167 + S+++ D Y T + ++ I++ + Sbjct: 343 QGASVTVTADSNLPTAAARIWLSYTSTNAGDTASGSVTVRCVQTGQSWTININANTIARP 402 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + +V+D S SMND G G+ K+ + ++I+ GLV F+ Sbjct: 403 RSAVSLVIDRSGSMNDDAGDGITKVQKLREAANVFINIMLPGDG------IGLVRFNDTA 456 Query: 228 VQTFPLAW--------GVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + G I + T G+ N + DA+ Sbjct: 457 QRLMEITDVGASPGGAGRTDALNHIAGSDIDPSGATSIGDGIVNGRNMLNDAQAAPMPDY 516 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 ++ LTDG + A A YA+G+ + Sbjct: 517 DVTA-----MVVLTDGM-------WNRPPSLADVAGSINANTYAVGLGIPSN 556 >gi|75906479|ref|YP_320775.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75700204|gb|ABA19880.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 615 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 77/234 (32%), Gaps = 30/234 (12%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + ++ P + F AN +I L++ +V+D S SM G Sbjct: 2 VKTSYEFDQPILPAGFSLKANILLRFR-----AEIPESPRRNLNLSLVIDRSGSMA---G 53 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + A S+ + L+ + + VV V +V P+ +++ I + Sbjct: 54 AALHHALKAAESVVDQLEPKDIL---SVVVYDDAV---DTVVSPQPVT-DKPALKKSIRQ 106 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPNIDNKE 304 + G T + G + K D +K ++ LTDG + D K Sbjct: 107 VRAGGITNLSGGWLKGCEYV-----------KHQLDPQKINRVLLLTDGHANMGIQDPKI 155 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFL 356 + G +G + L A+ FY +Q+ + + F Sbjct: 156 LTATSAQKAEEGITTTTLGFAQGFNEDLLIGMARAANGNFYFIQSIDEAAEVFS 209 >gi|269128710|ref|YP_003302080.1| Vault protein inter-alpha-trypsin domain-containing protein [Thermomonospora curvata DSM 43183] gi|268313668|gb|ACZ00043.1| Vault protein inter-alpha-trypsin domain protein [Thermomonospora curvata DSM 43183] Length = 795 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 77/217 (35%), Gaps = 41/217 (18%) Query: 154 LITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 T +V S+ + D++++LD S SM+ K+ A R+ ++D + Sbjct: 283 TFTLTVLPPSERCAPRPRDVVILLDRSGSMHGW------KMVAARRAAARIVDTLTGRD- 335 Query: 212 VNNVVRSGLVTFSSKIVQTFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 R +++F + + L E + L T+ L Sbjct: 336 -----RFAVLSFDDMVERPAGLDGGLSPATDRNRFRAVEHLAGLQARGGTELAAPLREGA 390 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + DA + ++ +TDG+ + N++ L + G ++A+G Sbjct: 391 ALLDDAGRD------------RVLVLITDGQ-----VGNEDQLLALIDPFLNGLRIHAVG 433 Query: 324 VQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRI 358 + FL A+ R V++ +L +A I Sbjct: 434 IDQAVNAGFLGRLATAGQGRLELVESEDRLDEAMEHI 470 >gi|294011439|ref|YP_003544899.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] gi|292674769|dbj|BAI96287.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] Length = 771 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 55/167 (32%), Gaps = 37/167 (22%) Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK--GHDDYKKYIIFLTDG-- 293 Q + + L+ + T GL + + A G ++++IF+TDG Sbjct: 603 QDLSNYVGTLVPHNNTYHDIGLLWGARLMSPTGIFASENATTGGGAQIQRHLIFMTDGAT 662 Query: 294 ------------------------------ENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 +++ ++N S C K + ++ I Sbjct: 663 ATTVNNYASYGLEWWDRRQIAPAGPNDANYDDNLNAVNNARSNALCTAIKNKNITLWVIY 722 Query: 324 VQAE--AADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + A L NCA SP FY +N+ L F I + R+ Sbjct: 723 YGSSDTATKTRLTNCATSPSYFYEARNTTLLIGKFREIADRISNLRL 769 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 67/208 (32%), Gaps = 34/208 (16%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + ++N G+I +TA + ++G + + + VK +L D L + + Sbjct: 12 LMRLYHNQAGNILAITAAAIIPTIGLVGGAFDMARIYAVKTRLQSACDAGALAGRRIMGS 71 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 +N F +N F + + D Sbjct: 72 GRWTDN---------NGRPNTTALATFDLNFAQNSFGAE----------NRTRSYSESDG 112 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL---DMMMVLDVSLSMN 182 +S + ++P V+++ + + + D+M VLD S SMN Sbjct: 113 TVSGTASADVPMTLMRVLNVPTKR---------VEVTCEGQMRIPNTDVMFVLDNSGSMN 163 Query: 183 DHFG---PGMDKLGVATRSIREMLDIIK 207 + G+ K+ +IR + + Sbjct: 164 EVIPGDTTGLKKMAGLQLAIRCFYEALA 191 >gi|149187053|ref|ZP_01865360.1| hypothetical protein ED21_31369 [Erythrobacter sp. SD-21] gi|148829342|gb|EDL47786.1| hypothetical protein ED21_31369 [Erythrobacter sp. SD-21] Length = 697 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 86/269 (31%), Gaps = 47/269 (17%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD 170 S L+ ++D+ L + P + + +T + + Sbjct: 258 SVQLAKGEVPANRDFVLRWGAADAAPSVGLFRQAYEGKEYVMATVTPPAAAKVEKLPPRE 317 Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ + A++S+ L ++ R ++ F + Sbjct: 318 LIFVIDNSGSMSGE------SMRAASKSLVYALSTLRPED------RFNIIRFDHSMTML 365 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A + + L T P L A ++L + Sbjct: 366 HPDAVAADRTNLAKARRYAESLRGQGGTDMLPALRAALRDRDPDGKRL-----------R 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 IIFLTDG + +E + + A R V+ +G+ + ++ A R Sbjct: 415 QIIFLTDG----NLSNEREMMSEISIALGRS-RVFMVGIGSAPNSHLMRRMAEAGR---- 465 Query: 346 QNSRKLHDAFLRIGKEM---VKQRILYNK 371 F +G++ + R + N+ Sbjct: 466 -------GTFTHVGQDAEAVSEMRRMLNR 487 >gi|254416823|ref|ZP_05030572.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196176369|gb|EDX71384.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 538 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S + S L++ +VLD S SM L A ++ + ++D + + V+ Sbjct: 28 GSESSQQTSSRRPLNLSLVLDRSGSMAGA------PLRYAIQAAQNLIDYLTADDFVS-- 79 Query: 216 VRSGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +V + P L ++ KI ++ T + G +++ Sbjct: 80 ----VVIYDDTAEVIIPPQLVGDQAALKAKIGKIRARGCTNLSGGWLLGCSQV------- 128 Query: 274 EHIAKGHDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + ++ ++ LTDG + D + E + +G + Sbjct: 129 ----QANQSPERINRVLLLTDGLANYGIKDPQVLTKTALEKAEADIVTTTLGFGNYFNED 184 Query: 332 FLKNCASP--DRFYSVQNSRKLHDAFL 356 L N A+ FY +Q+ F Sbjct: 185 LLINMANAARGNFYFIQSPDDASQVFE 211 >gi|75907531|ref|YP_321827.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701256|gb|ABA20932.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 427 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 65/199 (32%), Gaps = 34/199 (17%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + S+V + ++ L++ ++LD S SM+ L + ++ +LD ++ Sbjct: 27 ISISAVAEQFEQNLPLNLCLILDQSGSMHGK------PLKMVIAAVERLLDRLQPGD--- 77 Query: 214 NVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAK 270 R +V FS P + + I+ +I +L T GL+ ++ Sbjct: 78 ---RISVVAFSGSATVIIPNQIVEDPESIKTQIRKKLQASGGTVIAEGLQQGITELMKGT 134 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSS---------PNIDNKESLFYCNEAKRRGAIVYA 321 A LTDG D++ + +A + + Sbjct: 135 RGAVSQA----------FLLTDGHGEDSLKIWKWEIGPDDSRRCQEFAKKAAKINLTINT 184 Query: 322 IGVQAEAADQFLKNCASPD 340 +G L+ A Sbjct: 185 LGFGNNWNQDLLETIADAG 203 >gi|14042009|dbj|BAB55070.1| unnamed protein product [Homo sapiens] Length = 942 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 72/199 (36%), Gaps = 30/199 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ VLD S SM KL ++ +L D+ R ++ F ++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILH------DLRPQDRFSIIGFPNRIKVW 343 Query: 228 ----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + ++ + I+ + T L+ A + + + H G Sbjct: 344 KDHLISVTPDS--IRDGKVYIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSV 398 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCAS 338 I+FLTDG+ + + L EA R ++ IG+ + + L+NC Sbjct: 399 S-LIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGL 457 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R + +++ F Sbjct: 458 TRRVHEEEDAGSQLIGFYD 476 >gi|90413889|ref|ZP_01221875.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] gi|90325073|gb|EAS41583.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] Length = 714 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 79/216 (36%), Gaps = 31/216 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +S + + VLD+S SM + A +++R L ++ N Sbjct: 324 STTSSALFHQSVTFVLDISGSMYGE------SIEQAKQALRYGLQQLQPEDSFN------ 371 Query: 220 LVTFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 +VTF+ + + + + ++ L T+ L+ A++ Sbjct: 372 IVTFNHEAMLYSEQLLPVTSSTITRALRFVDGLDADGGTEMAAALKAAFSI-------KT 424 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + I+F+TDG ++ N+ +LF E + ++ +G+ + F+ Sbjct: 425 HDQLNSTRWLNQIVFITDG-----SVGNESALFDLIEQQLVDRRLFTVGIGSAPNSYFMT 479 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A + + + ++++ + ++ + + Sbjct: 480 RAAMKGKGTYTYIGDVKEVNTKMRLLFSKISQPVMR 515 >gi|170726477|ref|YP_001760503.1| cell wall anchor domain-containing protein [Shewanella woodyi ATCC 51908] gi|169811824|gb|ACA86408.1| LPXTG-motif cell wall anchor domain protein [Shewanella woodyi ATCC 51908] Length = 739 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 97/254 (38%), Gaps = 33/254 (12%) Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 + F + + ++ + Q K ++LS+ ++ E + +N + Sbjct: 287 ADRDFVLNWRPQLDTKPVAAVFSQQGKTHSLSSKAQVEPTDSNASTKADSNKAVEDDYAL 346 Query: 157 SSVKISSKSDIGLDMM----MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + S + + +V+D S SM+ + A R++ L +K+ Sbjct: 347 LMLLPPSDQKQDVSISRELILVIDTSGSMSGA------SIAQAKRALNYALAGLKAKDTF 400 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 N ++ F+S + P + + + + L T+ ++ A N Sbjct: 401 N------VIEFNSNVGSLSPYSLPATAKNIGLANQYVRSLKANGGTE----MQLALNAAL 450 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D + E + + + ++F+TDG ++ +++SLF+ + K + ++ +G+ + Sbjct: 451 DKGTETEAL---GSERLRQVLFMTDG-----SVGDEQSLFHLIKQKIGESRLFTLGIGSA 502 Query: 328 AADQFLKNCASPDR 341 F++ A R Sbjct: 503 PNSHFMRRAAEFGR 516 >gi|47218989|emb|CAG02027.1| unnamed protein product [Tetraodon nigroviridis] Length = 849 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 59/179 (32%), Gaps = 31/179 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +++ V+D+S SM+ K+ ++ ++L+ + G++ F Sbjct: 266 RLPKNVVFVIDMSGSMSG------TKMQQTREAMLKILEDLDPEDHF------GIILFDH 313 Query: 226 KIVQTFPLAWGVQHIQE----------KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +I W + + + T + A + + + ++ Sbjct: 314 RIQF-----WNTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKRL 368 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 K D II LTDG+ +S A ++++G + FL Sbjct: 369 PEKSIDM----IILLTDGDPNSGESRIPVIQENVKAAIGGQMSLFSLGFGNDVKYPFLD 423 >gi|256028718|ref|ZP_05442552.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289766627|ref|ZP_06526005.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289718182|gb|EFD82194.1| von Willebrand factor type A [Fusobacterium sp. D11] Length = 218 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 77/211 (36%), Gaps = 20/211 (9%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 ++ +S+ L ++++ D S SM + +IR+ML +K + + Sbjct: 1 MEFTSQPKKVLPLILLADTSSSMREWM-------RELNTAIRDMLGTLKEQESLKAEIHI 53 Query: 219 GLVTF-SSKIVQTFPLAWGVQHIQE-KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +TF + L + + N G T L A + + Sbjct: 54 SFITFGNGGANLHTALT----PVSNIEFNDFTEGGMTPLGGALRIAKEMVEN------RE 103 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 Y I+ L+DG + +N+ F N+ + + + ++G+ + LK Sbjct: 104 IIPSKSYAPIILLLSDGAPNDNGWENEMYRFI-NDGRSKKCMRMSLGIGRDYDYDVLKGF 162 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +S Y ++S + D F + + ++ + Sbjct: 163 SSNGEVYEAKDSMNIIDFFKFMTMTIKEKTL 193 >gi|126723120|ref|NP_001075478.1| inter-alpha-trypsin inhibitor heavy chain4 [Oryctolagus cuniculus] gi|11041722|dbj|BAB17303.1| inter-alpha-trypsin inhibitor heavy chain4 [Oryctolagus cuniculus] Length = 951 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 67/201 (33%), Gaps = 31/201 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ ++D S SM K+ ++ ++LD D+N R L+ FSS Sbjct: 276 VIFIVDQSGSMLG------RKIQQTREALLKILD------DLNPRDRFNLILFSSSATPW 323 Query: 229 ----QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L V + + T L A + + +E+L + Sbjct: 324 KTSLVQASLET-VSEARSYAGAIQAAGGTDINEALLLAVSLLDHEQEELRAGSVS----- 377 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDGE + + E EA ++ +G FL+ A + Sbjct: 378 -LLILLTDGEPTQGKTNPTEIQRNVREAIGGRYSLFCLGFGFNVNYPFLEKLALDNGGLA 436 Query: 342 ---FYSVQNSRKLHDAFLRIG 359 + + +L D + + Sbjct: 437 RRVYEDSDAALQLQDFYQEVA 457 >gi|326671055|ref|XP_003199351.1| PREDICTED: integrin alpha-2-like [Danio rerio] Length = 1492 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 41/283 (14%), Positives = 98/283 (34%), Gaps = 30/283 (10%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 + L+++ + +I +TSL + + K + + +P Sbjct: 86 KLNLQDSVIIDGVQSINTNTSLGLTLIPVKKRFMTCGPLWAQRCGSQYFYPGVCAEVTQR 145 Query: 153 LLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + S+ + + +D+ +VLD S S+ P D + + +L+ + PD Sbjct: 146 FTLKSAFSPAIQICGGPMDVAIVLDGSNSI----YPWSD----VKKFLLNLLENLDIGPD 197 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R ++ +S + + ++ + + T G E D Sbjct: 198 Q---TRVSIMQYSEDLSFLYHFSFDQNKQKVLLAASDIDQQT----GQETNTFAALDKTR 250 Query: 272 KLEHIAKGHDDY--KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + + + K ++ +TDGE + D + R G I + I + E+A Sbjct: 251 EQAFLPENGGRPGATKVLVVVTDGE----SADGYRGQEVIQKLDRDGIIRFGIAILKESA 306 Query: 330 D--------QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + + + + + ++V + L D +G+ + Sbjct: 307 NIQKFVEEIELIASTPTENYMFNVSSEGALVDITATLGERIFN 349 >gi|210135186|ref|YP_002301625.1| phage/colicin/tellurite resistance cluster protein TerY [Helicobacter pylori P12] gi|210133154|gb|ACJ08145.1| phage/colicin/tellurite resistance cluster protein TerY [Helicobacter pylori P12] Length = 214 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 75/213 (35%), Gaps = 30/213 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-I 227 + + ++LD S SM+ G G ++GV I++M++ +K + +VTF + Sbjct: 15 IPVFLLLDTSGSMSHSLGNG-TRIGVLNLCIQKMIETLKQEAKKELFSKMAIVTFGENGV 73 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 P ++++ + L T + A + I D YK Y Sbjct: 74 NLHTPFD-DIKNVNFE--PLSASGGTPLDQAFKLAKDLIED------KDTFPTKFYKPYS 124 Query: 288 IFLTDGENSSPNI---------DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 I ++DGE ++ D + + C ++I + + + Sbjct: 125 ILVSDGEPNNDKWQEPLFNFHHDGRSAKSVC----------WSIFIGDREVNPQVNKDFG 174 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 D + + KL F + + + K K Sbjct: 175 KDGVFYADDVEKLVGLFEIMTQTISKGSASIKK 207 >gi|47522678|ref|NP_999068.1| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Sus scrofa] gi|3024050|sp|O02668|ITIH2_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1915954|emb|CAA72308.1| inter-alpha-inhibitor heavy-chain H2 [Sus scrofa] Length = 935 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 71/201 (35%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + LV F+ I Sbjct: 300 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAEDQFS------LVDFNHNIRTW 347 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V + I ++ T L A + +A Sbjct: 348 RNDLVSATKTQVADAKTYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS---- 403 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + + + ++++G+ + FLK ++ +R Sbjct: 404 LIILVSDGDPTVGELQLSKIQKNVKQNIQDNVSLFSLGIGFDVDYDFLKRLSNDNRGMAQ 463 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + + +L + ++ Sbjct: 464 RIYGNQDTASQLKKFYNQVST 484 >gi|313212349|emb|CBY36340.1| unnamed protein product [Oikopleura dioica] Length = 2306 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 79/217 (36%), Gaps = 25/217 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 PL T VK D+ M D+ ++ F D + +D + Sbjct: 554 PLKFTLPVKQKLIEKRSCDLEMEADIIFVLDGSFSTRQDGFDRILSFVSATVDALSG--- 610 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQH-IQEKINRLIFGST--TKSTPGLEYAYNKIFD 268 ++ + +S I + F + ++ + E+I + + S T + +E A++ +FD Sbjct: 611 ---NIQYAAIQYSDVITEEFNFKYRLKKDLIEEIKAMKYDSGWSTYTGLAMEKAWSMLFD 667 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAE 327 + + K ++ LTDG + K G V A+G+ + Sbjct: 668 QQFGARNA------VTKIMVILTDGRTKD------DIEKISENIKNAGDTTVLAVGLNSA 715 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 D+ L S D + + +L D F + + + K Sbjct: 716 PLDELLTLATSKDLAFY---THELADIFHLLAQLIDK 749 >gi|301164324|emb|CBW23882.1| conserved exported hypothetical protein [Bacteroides fragilis 638R] Length = 610 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 79/211 (37%), Gaps = 22/211 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ T PW N+ H + I K I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMEAGTCPW--NADHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V +G ++ + Q I+E I+ L Sbjct: 267 LDLVKSSLKLLVNNLRDKDKVAIVTYAG----NAGVKLEATPGSNKQKIREAIDELEASG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II TDG+ + +KE + Sbjct: 323 STAGGEGIMLAYKIAQKNFISGGNNR---------IILCTDGDFNVGVSSDKELEKLIEQ 373 Query: 312 AKRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 ++ G + +G D ++ A Sbjct: 374 KRKSGIFLTVLGYGMGNYKDSKMQTLAEKGN 404 >gi|198436415|ref|XP_002121394.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 904 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 75/180 (41%), Gaps = 22/180 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++MVLD S S+ + PG K+ ++ ++ + V+ + + ++KI Sbjct: 713 IDIVMVLDSSSSVTE---PGWRKMINFVKTALGFYEMGPNSTSVSVFRYNAEIDEANKIS 769 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + +G + + +I RL + T++ L YA + + + D + Sbjct: 770 FQYTQTYGKEQLLRRIGRLPYNGQGTRTGQALSYALHILTNE--------INRPDAVDVV 821 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----DQFLKNCASPDRFY 343 + LTDG++ E+L +R G + YAI +Q E +Q +P + Sbjct: 822 LVLTDGKSQDAVKAPAEAL------RRNGVLTYAIAIQPERGVLNMNQLNDIAGTPHNLF 875 >gi|126733489|ref|ZP_01749236.1| von Willebrand factor, type A [Roseobacter sp. CCS2] gi|126716355|gb|EBA13219.1| von Willebrand factor, type A [Roseobacter sp. CCS2] Length = 699 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 72/184 (39%), Gaps = 19/184 (10%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW A++ + + + + L+++ ++D S SM+D KL + +S R Sbjct: 319 TPWNADTQLVHIALQGQM-PEVAARPPLNLVFLIDTSGSMDDP-----TKLPLLKQSFRL 372 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 MLD ++ V V +G + +++ + I + I L G +T GLE Sbjct: 373 MLDQLRPEDQVAIVEYAGS---AGQVLVPTSAS-ERTTILQAIQSLGAGGSTNGQGGLEQ 428 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 AY+ +E E +I TDG+ + + + + + G + Sbjct: 429 AYSVAEAMREDGEVNR---------VILATDGDFNVGLSNPDALKDFIADKRETGTYLSV 479 Query: 322 IGVQ 325 +G Sbjct: 480 LGFG 483 >gi|77735553|ref|NP_001029472.1| calcium-activated chloride channel regulator 4 [Bos taurus] gi|74268230|gb|AAI03388.1| Chloride channel accessory 4 [Bos taurus] gi|296489202|gb|DAA31315.1| chloride channel accessory 4 [Bos taurus] Length = 933 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 41/200 (20%), Positives = 77/200 (38%), Gaps = 36/200 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM ++L ++ + L + V N G+V F S Sbjct: 308 VCLVLDKSGSMAAS-----NRLNRMNQAAQHFL-----LQTVENGSWVGMVHFDSSASIK 357 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + E + T G+E + I +A +++ Sbjct: 358 SNLIQIISSSERRKLLESL-PTAASGGTSICSGIESGFQAIRNADFQIDGSE-------- 408 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK-NCASPDRFYS 344 I+ LTDGE+S+ + K+ GAI++ I + +AA + + A+ ++ Sbjct: 409 -IVLLTDGEDSTAK-------SCIEKVKQSGAIIHFIALGPDAAQAVKEMSIATGGKYIY 460 Query: 345 VQNSRK---LHDAFLRIGKE 361 + + L DAF + E Sbjct: 461 ASDEGQNNGLIDAFAALASE 480 >gi|48428050|sp|Q864V9|CFAB_GORGO RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690187|gb|AAM10005.1| complement factor B precursor [Gorilla gorilla] Length = 764 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + P + + +++N + S T + L+ Y+ + + Sbjct: 318 PKIWVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNE----AKRRGAIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + + VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDHKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|125975554|ref|YP_001039464.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715779|gb|ABN54271.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 536 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 70/195 (35%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDK 191 + + PW + +L+ K S + ++ ++DVS SM++ +K Sbjct: 146 FSITTEIGQCPWNPENKL--MLVGLQTKKLSTEQLPPSNLVFLIDVSGSMDEP-----NK 198 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + + + ++D + V+ VV +G +V I + + L G Sbjct: 199 LPLLKSAFKLLVDELDEDDRVSIVVYAGAAG----LVLDSTPGNEKDKILDALMNLEAGG 254 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G++ AY+ K + +I TDG+ + E + + Sbjct: 255 STAGAEGIKLAYDVAKKNFIKSGNNR---------VILATDGDFNVGISSEAELVRLIEK 305 Query: 312 AKRRGAIVYAIGVQA 326 + G + +G Sbjct: 306 KRDEGIFLTVLGFGT 320 >gi|322711218|gb|EFZ02792.1| U-box domain-containing protein [Metarhizium anisopliae ARSEF 23] Length = 734 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 66/193 (34%), Gaps = 17/193 (8%) Query: 170 DMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D+++VLDVS SM D+ PG + T + I +N+ R G+V+F++ Sbjct: 52 DIVLVLDVSTSMEDNAPVPGETERTGLTVLDLTKHAALTIIETLNDRDRLGIVSFATNST 111 Query: 229 QTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + KI L +T G+ + E A Sbjct: 112 IVQTLTHMDISNKDEARRKIKALDPNGSTNLWHGIRDGIQIFEQSAENGNIRA------- 164 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 ++ LTDG + + + R A ++ G LK+ A + Sbjct: 165 --MMVLTDGMPNHM-CPQQGYIPKLKTLPRLPAAIHTFGFGYGLRSGLLKSLAEYGHGNY 221 Query: 343 YSVQNSRKLHDAF 355 + ++ + F Sbjct: 222 AFIPDAGMIGTVF 234 >gi|302405156|ref|XP_003000415.1| U-box domain containing protein [Verticillium albo-atrum VaMs.102] gi|261361072|gb|EEY23500.1| U-box domain containing protein [Verticillium albo-atrum VaMs.102] Length = 662 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 69/204 (33%), Gaps = 26/204 (12%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGL 220 D+++V+DVS SM+D + + +LD+ K + ++ R G+ Sbjct: 88 RAPCDIVLVIDVSGSMDDAAPAPVIPGQKDENTGLSILDLTKHAARTILETLDERDRLGI 147 Query: 221 VTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V F++ L ++KI L + T G+ D + Sbjct: 148 VAFTTNAKVILSLVEMNPDNKVSAKDKIENLQPLNGTNMWHGITEGIKLFSDCDSSSGRV 207 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG---AIVYAIGVQAEAADQFL 333 ++ LTDG +S L Y + + G A ++ G L Sbjct: 208 PA--------MMVLTDGLPNSG----CPRLGYIPKLRDMGQLPATIHTFGFGYHIRSGLL 255 Query: 334 KNCA--SPDRFYSVQNSRKLHDAF 355 K+ A + + ++ + F Sbjct: 256 KSIAEIGGGNYAFIPDAGMIGTVF 279 >gi|168998772|ref|YP_001688040.1| TerY1 [Klebsiella pneumoniae NTUH-K2044] gi|238549793|dbj|BAH66144.1| tellurite resistance protein [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] Length = 212 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 64/196 (32%), Gaps = 16/196 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM+ + ++ +L +K P ++TF S Sbjct: 3 RLPVYLLLDTSGSMHGE------PIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + + TT L N+I +K KG + Sbjct: 57 RQAVPLT---DLLSFNLPSFSASGTTALGEALSLTANRIDAEVQKTTAETKGDWRP--LV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG P D ++ + AK+ G V A +A LK + Sbjct: 112 FLMTDG---GPTDDWRKGVNEFKAAKK-GV-VVACAAGHDADTAVLKEITEIVLQLDTAD 166 Query: 348 SRKLHDAFLRIGKEMV 363 S + F + + Sbjct: 167 SSSIKAFFKWVSASVS 182 >gi|317123666|ref|YP_004097778.1| type II secretion system F domain [Intrasporangium calvum DSM 43043] gi|315587754|gb|ADU47051.1| Type II secretion system F domain [Intrasporangium calvum DSM 43043] Length = 652 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 46/255 (18%), Positives = 85/255 (33%), Gaps = 33/255 (12%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 ++ D + ++ V P +S +++ S V + + M+V+ Sbjct: 33 VMSDVKSSSGTVTGVLTVRSANPVQVDPGSVKASVDGVIVKSFVSEMTHTK--RTAMLVI 90 Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D S SM GM + ATR+ + + V G+VTF++ Sbjct: 91 DTSGSMGTD---GMATVRAATRAYLK---------EAPEDVLIGVVTFANTAGVDLKPTV 138 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 Q +N L T + A + ++ ++ L+DG + Sbjct: 139 DRAAAQRVVNGLDARGDTSLYAAVRSAARAMPGDGDRS-------------MVLLSDGAD 185 Query: 296 SSPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLK----NCASPDRFYSVQNSRK 350 + + D + L N E KRRG V + + D + AS N+ Sbjct: 186 T-VSDDRQGDLAEANRELKRRGVRVDVVRFNTDDPDAVVALRSFASASGGSVIPATNASD 244 Query: 351 LHDAFLRIGKEMVKQ 365 + AF + + Q Sbjct: 245 VGAAFKSAARALRSQ 259 >gi|48428051|sp|Q864W1|CFAB_PONPY RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690183|gb|AAM10003.1| complement factor B precursor [Pongo pygmaeus] Length = 764 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 81/223 (36%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGAGNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + P + + +++N + S T + L+ Y+ + + Sbjct: 318 PKIWVKVSEPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDIP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFFQMIDESQSLSL 477 >gi|148976671|ref|ZP_01813358.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] gi|145964022|gb|EDK29280.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] Length = 303 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 58/299 (19%), Positives = 104/299 (34%), Gaps = 52/299 (17%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + +G + +++ I LP I +V+GL I+ + VK+KL +D + + A + N Sbjct: 6 RTKYRYSRGLVVLMSVIALPFILLVVGLSIDAGRAYIVKSKLFAAVDAASIAAARAVANG 65 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 E+ QK DF + G + + S+ I Q Sbjct: 66 EDAGRAAAQK------YFAANIPADFYSATPNLGDVNFAYDSFGNISIDISATAQVPT-- 117 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD--VSLSMNDH 184 P + P + S++ +D+++V+D SL + Sbjct: 118 -------------VFLPLIGLDTFNPGVSAQSIRRP------VDLVLVIDNTTSLRLGSI 158 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQE 242 D + + I + I V F S++ F G I+ Sbjct: 159 GDVTQDVIDRSKSFIENFHEGFDRISLVK-------FAFGSEVPVGFNATRGHSRSTIKS 211 Query: 243 KINRLIFGST-----TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 +I+ FGST T ++ G+ A+N++ K I+F TDG + Sbjct: 212 EIDSFNFGSTSNAQYTNASEGMYRAFNEL---------RTVTDPANLKVIVFFTDGAPN 261 >gi|115488386|ref|NP_001066680.1| Os12g0431700 [Oryza sativa Japonica Group] gi|77554879|gb|ABA97675.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113649187|dbj|BAF29699.1| Os12g0431700 [Oryza sativa Japonica Group] gi|125536450|gb|EAY82938.1| hypothetical protein OsI_38156 [Oryza sativa Indica Group] gi|125579179|gb|EAZ20325.1| hypothetical protein OsJ_35934 [Oryza sativa Japonica Group] Length = 524 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 74/248 (29%), Gaps = 34/248 (13%) Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 + + +P + + ++ S+ GLD++ V+DV Sbjct: 11 TTPIPNGGSKQGLVTMNIP-TYSKKDVALTADSVTAVVEIKATSSTAVREGLDLVAVVDV 69 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 S SM H ++ + A + + L + R +VTF S + L Sbjct: 70 SGSMRGH---KIESVKKALQFVIMKLTPVD---------RLSIVTFESSAKRLTKLRAMT 117 Query: 238 QHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 Q + + + LI T GL+ + D A I ++DG Sbjct: 118 QDFRGELDGIVKSLIANGGTDIKAGLDLGLAVLADRVFTESRTAN--------IFLMSDG 169 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFYSVQNSRK 350 + + + VY G Q L + A + +V + Sbjct: 170 KLEGKTSGDPTQVNPGE------VSVYTFGFGHGTDHQLLTDIAKNSPGGTYSTVPDGTN 223 Query: 351 LHDAFLRI 358 L F + Sbjct: 224 LSAPFATL 231 >gi|71280467|ref|YP_270055.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71146207|gb|AAZ26680.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 786 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 45/288 (15%), Positives = 95/288 (32%), Gaps = 32/288 (11%) Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 I +I + + R+ + +L +K ++L+ +F Sbjct: 309 ITSIVSDSHKIQSRDLSSKLNSEQNAYFITLDKTQVISNKTFDLTWQLIASNQPQVSSFT 368 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + H LL + + I D++ ++D S SM + A S++ L Sbjct: 369 QEISGEHYTLLTFFPPEKAVAQVIARDIIFIIDTSGSMQ------AGSMEQAKSSLQLAL 422 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPG 258 + + N ++ F + FP+ + Q+ I+ L T+ Sbjct: 423 LQLNNKDSFN------IIAFDNDTELLFPVTHMASAHNISKAQQFIDGLSANGGTEMYRP 476 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L A ++ + I+F+TDG + E + N A+ Sbjct: 477 LSNAL--------MMKKDKTQSSKAIRQIVFITDG----AVANEFELMQLLNTAQGDF-R 523 Query: 319 VYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +Y +G+ A F+K A + +QN ++ ++ + Sbjct: 524 LYTVGIGAAPNGYFMKKAAQFGRGSYVFIQNKSEVQRKMSHFMTKISQ 571 >gi|326530406|dbj|BAJ97629.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 657 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 49/273 (17%), Positives = 90/273 (32%), Gaps = 44/273 (16%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 D + S ++ H Y+ A + F + + T + Sbjct: 113 DSRRQAGAASNEAVVVKTHGHYSAVARDSPDDNFAVLVHLKAPGITGSG---TEAAGDDP 169 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D++ VLDVS SM+ KL + +++R ++DI+ PD R +V+F Sbjct: 170 AQRAPVDLVTVLDVSSSMHGS------KLALLKQAMRFVIDILG--PD----DRLSVVSF 217 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 SS+ + L G + L + T GL A + E + Sbjct: 218 SSRARRVTRLTRMSDAGKALCVRAVESLTARTGTNIAEGLRTAAKVL------DERRHRN 271 Query: 280 HDDYKKYIIFLTDGENSS-----------PNIDNKESLFYCNEAKRRGAI---VYAIGVQ 325 ++ L+DG+++ PN + G V+ G Sbjct: 272 GVSC---VVLLSDGQDNYTPMRQAFGRGLPNYAALLPPSFARTGTGAGDRATPVHTFGFG 328 Query: 326 AEAADQFLKNC--ASPDRFYSVQNSRKLHDAFL 356 + + A+ F ++N + DAF Sbjct: 329 NDHDATAMHAVSEATGGTFSFIENEAVIQDAFA 361 >gi|326504154|dbj|BAK02863.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 651 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 49/273 (17%), Positives = 90/273 (32%), Gaps = 44/273 (16%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 D + S ++ H Y+ A + F + + T + Sbjct: 107 DSRRQAGAASNEAVVVKTHGHYSAVARDSPDDNFAVLVHLKAPGITGSG---TEAAGDDP 163 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D++ VLDVS SM+ KL + +++R ++DI+ PD R +V+F Sbjct: 164 AQRAPVDLVTVLDVSSSMHGS------KLALLKQAMRFVIDILG--PD----DRLSVVSF 211 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 SS+ + L G + L + T GL A + E + Sbjct: 212 SSRARRVTRLTRMSDAGKALCVRAVESLTARTGTNIAEGLRTAAKVL------DERRHRN 265 Query: 280 HDDYKKYIIFLTDGENSS-----------PNIDNKESLFYCNEAKRRGAI---VYAIGVQ 325 ++ L+DG+++ PN + G V+ G Sbjct: 266 GVSC---VVLLSDGQDNYTPMRQAFGRGLPNYAALLPPSFARTGTGAGDRATPVHTFGFG 322 Query: 326 AEAADQFLKNC--ASPDRFYSVQNSRKLHDAFL 356 + + A+ F ++N + DAF Sbjct: 323 NDHDATAMHAVSEATGGTFSFIENEAVIQDAFA 355 >gi|257063307|ref|YP_003142979.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] gi|256790960|gb|ACV21630.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] Length = 1514 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 35/167 (20%), Positives = 64/167 (38%), Gaps = 23/167 (13%) Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEK 272 +G VT + PLA G + ++ L +T + G+E + + Sbjct: 796 TGEVTAAMNQEYGNPLAEG-GRANQTLDGLRVYNYGITGSTHTYRGIESYIENMTNGASG 854 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------- 325 + +Y+I TDG+++S N+ ++S+ + K G + + +Q Sbjct: 855 GYVPNAPQGNNSRYLIIFTDGKDNSGNL--QKSMDDTDALKNNGYTIITVLMQSAGMTSE 912 Query: 326 -AEAADQFLKNCASPD-----RFYSV--QNSRKLHDAFLRIGKEMVK 364 E + FLK AS + FY+ + L F I E+ K Sbjct: 913 DVEHSTTFLKRLASSNASGEKYFYTAMYNDPEGLVKVFQDIAHEIAK 959 >gi|78186535|ref|YP_374578.1| hypothetical protein Plut_0657 [Chlorobium luteolum DSM 273] gi|78166437|gb|ABB23535.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 356 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 54/173 (31%), Gaps = 8/173 (4%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 + + +G +IL AI+LPV+ L ++ + VK +L D + L A + Sbjct: 6 HSSRRLQSQRGGTAILFAIVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGARSLS 65 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQT--DFRNELRENGFAQDIN---NIERSTSLSIIID 119 + + + + ++ ++ E G+ +N + + + + Sbjct: 66 DPGGQPYNWSAASIKALDVARSNVANGGQIQDAAIETGYWNILNPALGMRPAGTPGVPAT 125 Query: 120 DQHKDYNLSAVSRY---EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++ P P + + +V G+ Sbjct: 126 GDVPAVRVTTAISATQNNGPLQLLFAPILGITERSIQASAIAVIAPPSGGTGM 178 >gi|156523144|ref|NP_001095986.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] gi|187609595|sp|A2VE29|ITIH5_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|126010782|gb|AAI33545.1| ITIH5 protein [Bos taurus] Length = 940 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 72/197 (36%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ VLD S SM KL ++ +L ++ N +V FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILHDLRPQDHFN------IVGFSNRIKVW 343 Query: 229 QTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + ++ I++ I+ + T L+ + D + H Sbjct: 344 KDHLVSVTPNSIRDGKVYIHHMSPSGGTDINGALQRGIQLLND---YVAHNDIEDRSVS- 399 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 ++FLTDG+ + + L EA R ++ +G+ A+ + L+NC Sbjct: 400 LVVFLTDGKPTVGETHTFKILNNTREATRGRVCIFTVGIGADVDFKLLEKLSLENCGLTR 459 Query: 341 RFYSVQNSRKLHDAFLR 357 R + ++R F Sbjct: 460 RVHEDHDARAQLIGFYD 476 >gi|126723497|ref|NP_001075477.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Oryctolagus cuniculus] gi|75056157|sp|Q9GLY5|ITIH3_RABIT RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|11041718|dbj|BAB17302.1| inter-alpha-trypsin inhibitor heavy chain3 [Oryctolagus cuniculus] Length = 903 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 48/324 (14%), Positives = 113/324 (34%), Gaps = 35/324 (10%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 169 MYLKVQPKQLVKHFEIDA--HIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSF 226 Query: 102 --AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 + D + + S++ D Y+++ S + + F P+ Sbjct: 227 KPSLDQQRSCPTCTDSLLNGDFTITYDVNRESPANIQIVNGYFVHFFAPQGLPV------ 280 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRS 218 + +++ V+DVS SM KL ++ ++L+ ++ +N + S Sbjct: 281 -------VPKNVVFVIDVSGSM------YGRKLEQTKDALLKILEDMREEDHLNFILFSS 327 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + T+ +VQ P +Q + + + +T GL + + A+E+ + Sbjct: 328 DVTTWKEHLVQATPE--NLQEARAFVKSIQDQGSTNLNDGLLRGISMLNTAREEH----R 381 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 + +I LTDG+ +S ++ A +Y +G FL++ A Sbjct: 382 VPERSTSIVIMLTDGDANSGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESLAL 441 Query: 338 ----SPDRFYSVQNSRKLHDAFLR 357 R Y ++ F Sbjct: 442 ENDGFARRIYEDSDANLQLHGFYE 465 >gi|320103513|ref|YP_004179104.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319750795|gb|ADV62555.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 342 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 66/204 (32%), Gaps = 12/204 (5%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G +M+VLD S SMN GPG + + + I P+ GLV F++ Sbjct: 91 GRALMVVLDRSSSMNAPVGPGDRGPSRFEAARIALAEFIAGRPN----DVIGLVGFAALP 146 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + ++ + + LE N + DA + D K + Sbjct: 147 DLAAVPSLDRDFLRAALLAQ------EIARPLEDGTN-LGDALALAADALRDQDALSKVV 199 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV-Q 346 + +TDG NS + + G ++ + + E + L+ + Sbjct: 200 VLVTDGRNSPALPNPLDPQVAAELLDDLGITLHILALGTEDPTELLETETTSRPATEPRT 259 Query: 347 NSRKLHDAFLRIGKEMVKQRILYN 370 N L R+ + Q Sbjct: 260 NLASLGSELERLAQRAGGQVFEIT 283 >gi|332970976|gb|EGK09950.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 441 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 75/211 (35%), Gaps = 25/211 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 +++ + ++ ++LD S SM G +K+ VA ++R + +V+ +V Sbjct: 122 TEVNGPEEKQHNVTILLDASGSMA-ARVSGGEKMQVAKEAVRSFTSQMPEGTNVSLIVYG 180 Query: 219 --GLVTFSSKIVQTFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 G + + + + + +Q K++ + T + A ++ + Sbjct: 181 HKGSNSKADQAESCKGIEEIVELGPYNESTLQSKLDPIRATGWTPLAGAMNQAGQRLKET 240 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EA 328 + + E+ I ++DG + KE+ + + IG Sbjct: 241 EGQAEN----------VIYVVSDGLETCGGDPVKEAKSLNQSNIKATVNI--IGFDVGNK 288 Query: 329 ADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 Q LK A +++S + +L F Sbjct: 289 EHQALKKVAEAGGGKYFSATSKTELDLYFRN 319 >gi|282879638|ref|ZP_06288369.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306586|gb|EFA98615.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 346 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 72/211 (34%), Gaps = 23/211 (10%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 I + Q + + P + + KIS++ G++ ++ Sbjct: 44 DIELVQQQMADISKYRPTVKFWLLQSALALLIVMLARPQMGS---KISNEKRNGIETIIA 100 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD+S SM +L + + ++D + + GLV F+ P+ Sbjct: 101 LDISNSMLAEDVVP-SRLAKSKLLVENLVDNFTN-------DKIGLVIFAGDAFIQLPIT 152 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + ++ +F T + + I A D + II +TDGE Sbjct: 153 ------SDYVSAKMFLQNTDPSLITTQGTD-IARAIRLSMSSFTQQDKVGRAIILITDGE 205 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + +L +A ++G V+ +GV Sbjct: 206 DHEG-----GALEAAADANKKGINVFILGVG 231 >gi|149915102|ref|ZP_01903630.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] gi|149810823|gb|EDM70662.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] Length = 444 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/234 (14%), Positives = 70/234 (29%), Gaps = 26/234 (11%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + ++P N+ + ++ V ++ L++ +VLD S SM Sbjct: 2 IELKITPQVPARLEGHANTLNALIRIVAPSAPVT-ETEPRPPLNLALVLDRSSSMRGQ-- 58 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--SSKIVQTFPLAWGVQHIQEKI 244 L A R+ + + + R +V F +++++ + Q + + Sbjct: 59 ----PLHEAKRAAD------QIVAGLRPSDRLAIVAFDNATEVMFSGGPRGDGQAARAAL 108 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 +R+ T G + +E + L+DG + D Sbjct: 109 SRIHARGMTALHDGWLLGVEQSIAMREAGTPAR---------VFLLSDGVANVGLTDASA 159 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFL 356 C G G+ + + A Y + + L D F Sbjct: 160 IAADCTRMAEHGITTSTCGLGMGFNEDLMAEMARAGRGNAYYGETAEDLQDPFE 213 >gi|117618125|ref|YP_856000.1| hypothetical protein AHA_1462 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559532|gb|ABK36480.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 460 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 44/111 (39%), Gaps = 11/111 (9%) Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIF-------DAKEKLEHIAKGHDDYKKYIIF 289 ++ ++ L T + G+ + + + G D +K ++ Sbjct: 321 RAAYRQALDTLHAAFNTNTAEGVMWGWRLLSPQWQGRWQQGAAELPRPYGQADNRKILVL 380 Query: 290 LTDGENSSPNID--NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 +DGE+ P +++ L C E KR+G VY + E +F+ CAS Sbjct: 381 FSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAF--EGDARFVAQCAS 429 >gi|133925809|ref|NP_002208.3| inter-alpha-trypsin inhibitor heavy chain H3 preproprotein [Homo sapiens] gi|166203665|sp|Q06033|ITIH3_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|77748471|gb|AAI07605.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] gi|77748473|gb|AAI07606.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] gi|78070482|gb|AAI07815.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] Length = 890 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/283 (14%), Positives = 99/283 (34%), Gaps = 24/283 (8%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + + + + + ++ + F + F ++ ++ + + + Sbjct: 188 IFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSLLN 245 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 D+ ++ E P AP + K ++ V+D+S SM Sbjct: 246 GDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------NVAFVIDISGSMA 296 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQ 241 KL ++ +L+ ++ +N ++ SG V+ + +VQ P +Q + Sbjct: 297 G------RKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHLVQATPE--NLQEAR 348 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 349 TFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RIPERSTSIVIMLTDGDANVGESR 404 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ A +Y +G FL+N A + ++ Sbjct: 405 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|307548796|dbj|BAJ19118.1| TadG [Aggregatibacter actinomycetemcomitans] gi|307548811|dbj|BAJ19132.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 49/278 (17%), Positives = 101/278 (36%), Gaps = 50/278 (17%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL---YT 59 F ++ F N G +I+TA+L + + + ++ + KA+L D + L Sbjct: 12 FSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLLIAE 71 Query: 60 ATKILNQENGNNGKKQ----------KNDFSYRIIKNIWQTD----FRNELRENGFAQDI 105 + ++ ++ +Q DFS ++ W+ + ++ + D Sbjct: 72 DNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLYLRSDDS 131 Query: 106 NNIERSTSLSI-----------IIDDQHKDYNLSAVSRYEMPFIFCTF--PWCANSSHAP 152 N + S+ ++I ++K+ +++ + F PW + Sbjct: 132 NGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSSS 191 Query: 153 LLITSSVKISSKSDIG---------LDMMMVLDVSLSMNDHFGPG-----------MDKL 192 L V I+S +D+MMV D+S SMN +D L Sbjct: 192 QLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPNRRIDAL 251 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +I+++L D++ R G V+F++ Q Sbjct: 252 REVVSNIQDILLPKAIRDDISPYNRIGFVSFAAGARQK 289 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 54/140 (38%), Gaps = 22/140 (15%) Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-LEHIAKGHDDYKKYIIFLTDGENSS 297 + + + + T T G+ N + D + K + + ++ ++ L+DGE++ Sbjct: 382 GVADALKEIEPLGGTAVTSGIFIGTNLMTDTNKDPEAAPNKLNTNTRRVLLILSDGEDNR 441 Query: 298 PNIDNKESLF---YCNEAKRR--------------GAIVYAIGVQAEAADQFL--KNCAS 338 P+ + + C + K + A+G DQ + K C Sbjct: 442 PSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIAFVALGFN-PPQDQLIAWKKCV- 499 Query: 339 PDRFYSVQNSRKLHDAFLRI 358 ++Y V + + L DAF +I Sbjct: 500 GKQYYPVNSKQGLLDAFKQI 519 >gi|281416565|ref|ZP_06247585.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281407967|gb|EFB38225.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939671|gb|ADU73705.1| Protein of unknown function DUF3520 [Clostridium thermocellum DSM 1313] Length = 538 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 70/195 (35%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDK 191 + + PW + +L+ K S + ++ ++DVS SM++ +K Sbjct: 148 FSITTEIGQCPWNPENKL--MLVGLQTKKLSTEQLPPSNLVFLIDVSGSMDEP-----NK 200 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + + + ++D + V+ VV +G +V I + + L G Sbjct: 201 LPLLKSAFKLLVDELDEDDRVSIVVYAGAAG----LVLDSTPGNEKDKILDALMNLEAGG 256 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G++ AY+ K + +I TDG+ + E + + Sbjct: 257 STAGAEGIKLAYDVAKKNFIKSGNNR---------VILATDGDFNVGISSEAELVRLIEK 307 Query: 312 AKRRGAIVYAIGVQA 326 + G + +G Sbjct: 308 KRDEGIFLTVLGFGT 322 >gi|88857796|ref|ZP_01132439.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] gi|88820993|gb|EAR30805.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] Length = 608 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 73/235 (31%), Gaps = 30/235 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + I ++ +++ +LDVS SM DKL + S+ + Sbjct: 217 PWNNQRQLLKIGIKGFDIEKAELKAA-NLVFLLDVSGSM-----NAPDKLPLLKSSLTML 270 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + V VV +G +V Q I +N L G +T G+E A Sbjct: 271 TKQLDENDSVAIVVYAGAAG----LVLPATKGNEYQVISNALNNLSAGGSTNGAQGIELA 326 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y +K +I TDG+ + ++ G + + Sbjct: 327 YQIASQNFKKEGINR---------VILATDGDFNVGMSSVDALKKLIANKRKTGIALTTL 377 Query: 323 GVQ-AEAADQFLKNCASPDR----FYSVQNS------RKLHDAFLRIGKEMVKQR 366 G D ++ A+ + N +L I K++ Q Sbjct: 378 GFGQGNYNDGLMEQLANIGNGQHAYIDTINEARKVLVDELSSTMQIIAKDVKIQV 432 >gi|88801114|ref|ZP_01116660.1| hypothetical protein MED297_05449 [Reinekea sp. MED297] gi|88776143|gb|EAR07372.1| hypothetical protein MED297_05449 [Reinekea sp. MED297] Length = 553 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 54/280 (19%), Positives = 99/280 (35%), Gaps = 44/280 (15%) Query: 94 NELRENGFAQDINNIERSTSLSIIIDDQH--KDYNLSAVSRYEMPFIFCTFPWCANSSHA 151 + L G A + + I D++ + Y + Y+ + + Sbjct: 309 SPLYATGEADENERLVLEAFAQFIADNRQVARQYGFNQNPSYQ-------PAYELDDGSV 361 Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 L K + M V DVS SM+ D++ ++ E + + S Sbjct: 362 ILSAQRIWKDKKSGGRPIAAMFVADVSGSMDG------DRIRALKIALDESANFVSSRNS 415 Query: 212 VNNVVRSGLVTFSSKIVQTFPL-AWGVQHIQEK---INRLIFGSTTKSTPGLEYAYNKIF 267 GLVTF+ ++ P+ + +Q + + R+ G T + + A +++ Sbjct: 416 ------IGLVTFNDRVNVDLPIREFDLQQKSQFLGAVERMSAGGGTATNDAILVAAHELL 469 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGE--NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + AK H ++K I L+DGE N P D ++ + N V++I Sbjct: 470 N-------FAKTHPEHKLTIFVLSDGETRNGLPLGDVEKVIQMLN------IPVHSIAYG 516 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 E+AD LK + A +IG + Q Sbjct: 517 FESAD--LKKV--SGLVEASYTESSTGSAAYQIGNLLNAQ 552 >gi|329902233|ref|ZP_08273073.1| Putative MxaC-like protein [Oxalobacteraceae bacterium IMCC9480] gi|327548825|gb|EGF33456.1| Putative MxaC-like protein [Oxalobacteraceae bacterium IMCC9480] Length = 338 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 45/258 (17%), Positives = 76/258 (29%), Gaps = 43/258 (16%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG----- 186 R + A P + S K+ G +++++LD S SM+ Sbjct: 52 RARLEIALGMLAIVATVLG-PAGLASPATTIEKTGSGAEILVLLDRSASMDSALQEKGAK 110 Query: 187 -PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 P DK + + GL+ FS Q P IQ I Sbjct: 111 TPLTDKYAEPKKRKIAR-TALAGFAAGRPHDAIGLMMFSENQFQVMPFNMRPDMIQAAIQ 169 Query: 246 RLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 G T + A FD + + I+ ++DG I Sbjct: 170 AGGVGSGLGNTDVGSAMLAALRT-FDDRPDSG---------SRIIMLVSDG---GAQIAP 216 Query: 303 KESLFYCNEAKRRGAIVYAI-------------------GVQAEAADQFLKNCASPDRFY 343 L + KR +Y I GV A ++ ++ ++P R + Sbjct: 217 AVRLQIADGLKRNRIALYWIYLRSYNQPALADSDSAEFDGVVEVAMHRYFRSLSTPYRAF 276 Query: 344 SVQNSRKLHDAFLRIGKE 361 N + A +G++ Sbjct: 277 EADNQASMQRAIEAVGRQ 294 >gi|291000628|ref|XP_002682881.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284096509|gb|EFC50137.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 207 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 66/161 (40%), Gaps = 25/161 (15%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 I + + + ++ LD++ VLD S SM+ DK+ + +S+ M+D +++ Sbjct: 32 IKAPIYVEKENRSSLDIIAVLDKSGSMS-------DKIELVKKSLLFMIDQMQARD---- 80 Query: 215 VVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 R G+V F + + T L G + +N + G+TT + + A++ + + Sbjct: 81 --RLGIVEFDANVSTTLKLTSMDNGGKKQAMNCVNNIKLGTTTNISGAIIEAFDILANRG 138 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 G+ I+ TDG + + + + Sbjct: 139 --------GNISPTTSILLFTDGLPTVGVQQQDKIVNIVEK 171 >gi|126730251|ref|ZP_01746062.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] gi|126708984|gb|EBA08039.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] Length = 614 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 44/337 (13%), Positives = 94/337 (27%), Gaps = 51/337 (15%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F+ + GS+S + V+ + G+ I+ H ++++ LD ++L A N Sbjct: 25 RFWADTSGSMSYVALAGSLVMMVFGGIGIDMMHAELKRSQVQNTLDRAVLAAA----NLS 80 Query: 68 NGNNGKKQKNDF--SYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 N + + D+ + ++ + + L + N S L +I DQ Y Sbjct: 81 NTRDPQTVVEDYFRAMKLEDTLGDVQTGDSLGAKRVRAEGNGSINSHFLGLIGVDQLDVY 140 Query: 126 NLSAVSRYEMPFIFCTFPWCANS--------------------------SHAPLLITSSV 159 + P + S + + Sbjct: 141 GAATAENATAPLEISLVLDVSGSMQGQKIRDLKEAAKAFVDAVLGEGGDNSRVTVSLIPY 200 Query: 160 KISSKSDIGLDMMMVLDV---SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-DVNNV 215 + L LD S + L + + E L N+ Sbjct: 201 NATVNLGDDLSERFNLDRWQNYSSCAIFESSDYNSLSIDPNAGLEQLAHFDPYDYSGNSP 260 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE---- 271 + + P + ++ + I+ T G+++ + A Sbjct: 261 DLTAPWCAEGNNLAIVPHSSDADYLSDVIDSFEAQGNTAIDLGMKWGLALLDPAARPVIG 320 Query: 272 -----------KLEHIAKGHDDYKKYIIFLTDGENSS 297 + K+++ +TDGEN+ Sbjct: 321 DMQADGLVPSSARYRPSDYGTQTMKFVVVMTDGENTQ 357 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 1/72 (1%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYSVQNSRKLHDAF 355 + N C +AK++ ++ IGV+A A ++NCAS Y +S +L D F Sbjct: 541 DASQANTNLATICAKAKQQDVTIFTIGVEAPQAGLNAMRNCASSASHYYNVSSNQLVDTF 600 Query: 356 LRIGKEMVKQRI 367 I +V+ R+ Sbjct: 601 RSISDVVVELRL 612 >gi|62897073|dbj|BAD96477.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens] gi|62898698|dbj|BAD97203.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens] Length = 890 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/283 (14%), Positives = 99/283 (34%), Gaps = 24/283 (8%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + + + + + ++ + F + F ++ ++ + + + Sbjct: 188 IFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSLLN 245 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 D+ ++ E P AP + K ++ V+D+S SM Sbjct: 246 GDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------NVAFVIDISGSMA 296 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQ 241 KL ++ +L+ ++ +N ++ SG V+ + +VQ P +Q + Sbjct: 297 G------RKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHLVQATPE--NLQEAR 348 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 349 TFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RIPERSTSIVIMLTDGDANVGESR 404 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ A +Y +G FL+N A + ++ Sbjct: 405 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|298246130|ref|ZP_06969936.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297553611|gb|EFH87476.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 412 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 62/179 (34%), Gaps = 26/179 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 I ++ + L+ +V+D S SM G + + A + + + L+ I Sbjct: 33 PSDIMAQVRMPLNFSLVIDHSGSMK---GAKLRNVKEAVKMVIDRLEPSDYIS------- 82 Query: 218 SGLVTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +V F P A ++ I+R+ T + G+ + +L Sbjct: 83 --VVIFDDSAQVIIPSMPANDPVGMKAAIDRIQDAGGTTMSLGM-------IQSLGELRR 133 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + + +I LTDG D +A G +Y +G+ A+ + L Sbjct: 134 WNIPNAVSR--MILLTDGVTYG---DTDRCRQLARDAAAAGISIYPLGIGADWDENLLD 187 >gi|328951281|ref|YP_004368616.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451605|gb|AEB12506.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 328 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 58/167 (34%), Gaps = 20/167 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M+V+D S SM +L A +R LD + R GLV+FS+ Sbjct: 88 VMVVVDTSKSMIAVDQSP-SRLEAARAIVRTFLDRVP------RGARVGLVSFSAYASVL 140 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-------- 282 +++ + L T + A + +E+ G D Sbjct: 141 VLPTARHVEVRKALEALEPQEATSLGAAILAAVRAL-PGRERAGEELLGRDPVPPELQEL 199 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 ++ ++DG ++S + L A+ +Y +GV + Sbjct: 200 PPATVLLISDGVSTSG----LDPLEAARVARAHQVRIYTVGVGSPRG 242 >gi|193788521|dbj|BAG53415.1| unnamed protein product [Homo sapiens] Length = 328 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 47/208 (22%), Positives = 77/208 (37%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G +++L A + +L ++ V G+VTF S Sbjct: 20 VCLVLDKSGSMA--TGNRLNRLNQAGQLF--LLQTVELGSWV------GMVTFDSAAHVQ 69 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +++ T GL A+ I Sbjct: 70 SELIQINSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTDGSE--------- 119 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ LTDGE++ ++ C NE K+ GAI++ + + AA + L + Sbjct: 120 -IVLLTDGEDN--------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQT 170 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 Y+ + L DAF + G V QR Sbjct: 171 YASDQVQNNGLIDAFGALSSGNGAVSQR 198 >gi|218661390|ref|ZP_03517320.1| von Willebrand factor type A [Rhizobium etli IE4771] Length = 370 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 67/174 (38%), Gaps = 23/174 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++DVS SM++ +DKL + S R +++ +K+ V +VT++ Sbjct: 4 LVFLIDVSGSMDE-----LDKLPLLKSSFRLLVNRLKADDTV------AIVTYAGNAGTV 52 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 I I+RL G +T G+E AY L A D + + Sbjct: 53 LEPTRVSEKSKILSAIDRLEAGGSTGGAEGIEAAY--------DLAQKAFVKDGVNRVM- 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 TDG+ + +++ E ++ G + +G D ++ A Sbjct: 104 LATDGDFNVGPSSDEDLKRIIEEKRKEGIFLTVLGFGRGNLNDSLMQTLAQNGN 157 >gi|120602151|ref|YP_966551.1| von Willebrand factor type A [Desulfovibrio vulgaris DP4] gi|120562380|gb|ABM28124.1| von Willebrand factor, type A [Desulfovibrio vulgaris DP4] Length = 420 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 62/431 (14%), Positives = 123/431 (28%), Gaps = 142/431 (32%) Query: 36 IETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNE 95 I++ + ++L +D + L + ++ + K + + Sbjct: 34 IDSGMLYLSHSRLQAAVDAAALAGSLQLPYDPQLD--------------KGLVRGAVTQY 79 Query: 96 LRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLI 155 + N +N + T + ++ + Sbjct: 80 MDANYPEASLNGVTPGTEERSVTVTATATVPTIFMNALGI-------------------- 119 Query: 156 TSSVKISSKSDIG---LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 S ++ +K+ G L+++ V+D S SM G + + A + E++ + Sbjct: 120 -GSSEVHAKATAGYNKLEVVFVIDNSGSMK---GTPIQQTNSAASQLVELIMPEGMM--- 172 Query: 213 NNVVRSGLVTFSSKIVQTF----------------------------------------- 231 V+ GLV F K+ Sbjct: 173 -TSVKVGLVPFRGKVHLPAGVDGLPDGCRNADGTLNPSWLHEEYFKTSYRYPSGSSLNVP 231 Query: 232 -----------PLAWGVQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 L + I I N L S T + GL++ + + E Sbjct: 232 KNTCTSIPRVQGLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPEAPFTEGS- 290 Query: 278 KGHDDYKKYIIFLTDGE------------NSSPNIDNKES-------------------- 305 D +K II LTDG+ N +PN + Sbjct: 291 -SAKDIRKVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKLNAA 349 Query: 306 -LFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCASP-----DRFYSVQNSRKLHDAFLR 357 L + K G V+AI + +K+ AS D +Y ++ + D F + Sbjct: 350 MLEEARKVKEAGIEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDIDDVFKK 409 Query: 358 IGKEMVKQRIL 368 IG+++ + + Sbjct: 410 IGRQLGWRLLR 420 >gi|31789427|gb|AAP58542.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 329 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 62/162 (38%), Gaps = 25/162 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++++LD S SM H + AT IR+ L N+ R GLV F+ + Sbjct: 90 DLVIMLDRSASMRAHDVSP-SRFARATAEIRDFLQH-----KPENIDRVGLVGFAGTSLI 143 Query: 230 TFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + ++ + T L A + K +K Sbjct: 144 LSYLTRDLDTVAFYLDWIESDPRTLLGTNIGAALRNALDV----------AKKDDRRARK 193 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + L+DGE+ + + +++ + +G + +IG+ ++ Sbjct: 194 IFVLLSDGEDYGDEVARQLAVY-----RGQGYRINSIGIGSD 230 >gi|47207527|emb|CAF87062.1| unnamed protein product [Tetraodon nigroviridis] Length = 409 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 81/220 (36%), Gaps = 34/220 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S D++ VLD S S+ G+ A + + + + G+V +S Sbjct: 6 STAANDLVYVLDGSWSV------GVSDFDTAKQWLINITSQFDISSH---YTQVGVVQYS 56 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKST-------PGLEYAYNKIFDAKEKLE 274 PL G + I + G T++ P +++A + +F + ++ Sbjct: 57 DAPRLEIPLGKHQGQDELIRAIQSISYLGGNTQARRRSERHLPAIKFAVDHVFSSSQRA- 115 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + + +TDG++ +D EA+ +G V+A+GV +E L Sbjct: 116 -----SQVKNRIAVVVTDGKSQDDVVDASM------EARTQGVTVFAVGVGSEITTSELI 164 Query: 335 NCA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 A S ++ +H + +++ ++ + + Sbjct: 165 AIANKPSSTYVLYAEDYTTIHHIRDAMEQKLCEESVCPTR 204 >gi|268558414|ref|XP_002637197.1| Hypothetical protein CBG09720 [Caenorhabditis briggsae] Length = 630 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 46/261 (17%), Positives = 92/261 (35%), Gaps = 40/261 (15%) Query: 115 SIIIDDQHKDYNLSAVS-RYEM----PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 + I + K++ + + S + M P P ++ + T+ I Sbjct: 393 NEIEELNGKNFKVRSRSVHFAMTEKPPVTTAMNPMKFFTTSRTPITTAKSLIPYSCTA-- 450 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---- 225 D+ ++D+S D +D A S+ P VR GL+++S Sbjct: 451 DVFFLVDLSQGTGDKSQQYLDIAASAISSL----------PISQEAVRVGLISYSGPGRT 500 Query: 226 KIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + E++ + G TT++ + YA E + H A+ + K Sbjct: 501 HVRVYLDKHNDKEKLIEEMFLMERHGGTTRTADAIRYATKIF----EGMAHPARKN--VK 554 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ----AEAADQFLKNCASPD 340 K ++ TDG + D A+ +G + A+ V+ +Q + Sbjct: 555 KVLVVFTDGYSQDHPRDAARG------ARAKGLQLIAVAVKDRLAPPDEEQLAEIGGHAK 608 Query: 341 RFYSVQNSRKLHDAFLRIGKE 361 + N R+L + IG + Sbjct: 609 NVFISPNGRELRE--KIIGTQ 627 >gi|282900951|ref|ZP_06308884.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] gi|281194042|gb|EFA69006.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] Length = 575 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/177 (20%), Positives = 66/177 (37%), Gaps = 27/177 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + +M V+D S SM +G ++ ++++ L I + N V G Sbjct: 391 KTQKDTGKTVYLMAVIDTSGSM---YGGPLN-------AVKDGLRIASQQINPGNYV--G 438 Query: 220 LVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LVT+ + V LA + I+ L T G+ A +++ K Sbjct: 439 LVTYGDQPVNLVKLAPFDDLQHKRFLAAIDNLQADGATAMYDGMMVALSELVQQK----- 493 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K + + K Y++ LTDG+ + + +E G VY I + Sbjct: 494 --KTNPNGKFYLLLLTDGQTNQG-FNFEEVKEIIQY---SGVRVYPIAYGEVNEAEL 544 >gi|224090449|ref|XP_002195035.1| PREDICTED: integrin, alpha 1 [Taeniopygia guttata] Length = 1184 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 46/290 (15%), Positives = 100/290 (34%), Gaps = 35/290 (12%) Query: 92 FRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS--RYEMPFIFCTFPWCANSS 149 + L + ++ ++ + +L + K L+ Y+ + T C+N S Sbjct: 95 IKLNLPASTSVPNVVEVKENMTLGTTLVTNPKGGFLACGPLYAYKCGRLHYTTGVCSNVS 154 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + LD+++VLD S S + T + +L + Sbjct: 155 STFETVEAIAPSVQECKTQLDIVIVLDGSNS--------IYPWESVTDFLNSLLRNMDIG 206 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 P G+V + +V F L + + +R+ T++ L + Sbjct: 207 PQQTQ---VGIVQYGQTVVHEFYLNTYSTTEDVMAAASRIRQRGGTQTMTAL--GIDTAR 261 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--- 324 + H A+ +K ++ +TDGE + DN ++ + +AI + Sbjct: 262 EEAFTEAHGARRG--VQKVMVIVTDGE----SHDNYRLQEVIDDCEDENIQRFAIAILGS 315 Query: 325 ------QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 E + +K+ AS F++V + L +G+ + Sbjct: 316 YSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALLTIVEALGERIFAL 365 >gi|110331845|gb|ABG67028.1| inter-alpha (globulin) inhibitor H3 [Bos taurus] Length = 889 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 98/282 (34%), Gaps = 28/282 (9%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF--AQDINNIERSTSLSIIIDD 120 I + + + + + ++ + F + F + D + + S++ D Sbjct: 186 IFEPQGISTLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRSCPTCTDSLLKGD 245 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 Y+++ S + + F P+ + ++ V+DVS S Sbjct: 246 FIITYDVNRESPANVQIVNGYFVHFFAPQGLPV-------------VPKSVVFVIDVSGS 292 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG-LVTFSSKIVQTFPLAWGVQH 239 M+ K+ ++ ++L+ +K +N ++ SG + T+ +V P +Q Sbjct: 293 MHG------RKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLVPATPE--NIQE 344 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + + + T L + + A+E+ + II LTDG+ + Sbjct: 345 ASKFVMDIQDRGMTNINDALLRGISMLNKAREEHTVPERSTS----IIIMLTDGDANVGE 400 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ A +Y +G FL+N A + Sbjct: 401 SRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENH 442 >gi|189220466|ref|YP_001941106.1| hypothetical protein Minf_2455 [Methylacidiphilum infernorum V4] gi|189187324|gb|ACD84509.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 340 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 44/247 (17%), Positives = 79/247 (31%), Gaps = 45/247 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKI-SSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLG 193 + + P + V++ S +D ++ LDVS SM D +++ Sbjct: 61 LFLASTTLFFVALSRPQWGKAEVELLESNADY----LIALDVSKSMLAEDTVPSRLERAK 116 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI----F 249 + + L + R GLV F+ PL+ + ++E ++ L Sbjct: 117 LLATNFISKL----------HGERVGLVAFTKNAFIEAPLSTDYELLEEILSELSPDDFP 166 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T L+ A + KK +I L+DGE+ + F Sbjct: 167 NGGTNFAAMLDEALQFFSSSGRS-----------KKMLILLSDGEDHGGGWQQRLVDF-- 213 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAF-----LRIGKEMVK 364 K+ V +IG+ + ++N S Y N + F I Sbjct: 214 ---KKESIPVLSIGIG-SSNGAVIRN--SNGSLYKDYNGEPIVSIFNPAALELIAHSTGG 267 Query: 365 QRILYNK 371 I +K Sbjct: 268 LYIQADK 274 >gi|209546584|ref|YP_002278502.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537828|gb|ACI57762.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 698 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 78/212 (36%), Gaps = 24/212 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ PW ++ + I + + +++ ++DVS SM++ DKL Sbjct: 296 FKATVTVMPTPWNHDTELMHVAIKGYDIAPATAPHA-NLVFLIDVSGSMDEP-----DKL 349 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFG 250 + + R ++ +K+ V+ +VT++ I I+RL G Sbjct: 350 PLLKSAFRLLVSKLKADDTVS------IVTYAGNAGTVLEPTRVAEKSKILSAIDRLEAG 403 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+E AYN A K + ++ TDG+ + +++ Sbjct: 404 GSTGGAEGIEAAYNLAKQA------FVKDGVNR---VMLATDGDFNVGPSSDEDLKRIIE 454 Query: 311 EAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 E ++ G + +G D ++ A Sbjct: 455 EKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGN 486 >gi|159900724|ref|YP_001546971.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893763|gb|ABX06843.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 415 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 75/211 (35%), Gaps = 28/211 (13%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + ++ L+ +VLD S SM DK+ ++RE++ ++ I V+ + Sbjct: 36 PTVQAAPPLNFCLVLDRSGSMAG------DKIQHLREAVREIVANLRPIDAVS------I 83 Query: 221 VTFSSKIVQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 V F + P LA + +Q I + T + GL+ ++ + Sbjct: 84 VLFDDTLEVLVPARLADDLPALQNAIESIGEQGGTAMSLGLQAGLAEL--------QKFQ 135 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK--NC 336 D + + LTDG+ D + G + A+G+ E + L Sbjct: 136 AADRVGRVL-LLTDGQTWG---DEDTCRDLAKQIGDLGVSITALGLGTEWNEALLDDLAT 191 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 AS + + ++ F + + + Sbjct: 192 ASNGESDYIADPSQISKYFQQTLQSAQTTTV 222 >gi|258624772|ref|ZP_05719703.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] gi|258582934|gb|EEW07752.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] Length = 419 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 74/218 (33%), Gaps = 20/218 (9%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G I+ ++ P + +++ ++ S +F A+L + + L + N Sbjct: 5 KKQQGVAGIIYIMMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKEDDENN 64 Query: 71 NGKKQK--NDFSYRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 +K + + I +I + ++G Q T ++ +HK Sbjct: 65 VSYARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKS--- 121 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 +S + ++ + L +D+ ++D+S SM + Sbjct: 122 -WISYENISLKPEFTVNGSSVTRKFLP------------QPVDVYFIVDMSASMRATWQN 168 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ I +++ +K R L+ + + Sbjct: 169 GKSQIDEVKNVITRVVNDLKDFDTEVKS-RVALLGYHN 205 >gi|156120445|ref|NP_001095368.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] gi|160332333|sp|P56652|ITIH3_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|154425846|gb|AAI51420.1| ITIH3 protein [Bos taurus] gi|296474789|gb|DAA16904.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] Length = 891 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 98/282 (34%), Gaps = 28/282 (9%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF--AQDINNIERSTSLSIIIDD 120 I + + + + + ++ + F + F + D + + S++ D Sbjct: 188 IFEPQGISTLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRSCPTCTDSLLKGD 247 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 Y+++ S + + F P+ + ++ V+DVS S Sbjct: 248 FIITYDVNRESPANVQIVNGYFVHFFAPQGLPV-------------VPKSVVFVIDVSGS 294 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG-LVTFSSKIVQTFPLAWGVQH 239 M+ K+ ++ ++L+ +K +N ++ SG + T+ +V P +Q Sbjct: 295 MHG------RKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLVPATPE--NIQE 346 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + + + T L + + A+E+ + II LTDG+ + Sbjct: 347 ASKFVMDIQDRGMTNINDALLRGISMLNKAREEHTVPERSTS----IIIMLTDGDANVGE 402 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ A +Y +G FL+N A + Sbjct: 403 SRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENH 444 >gi|291227856|ref|XP_002733898.1| PREDICTED: inter-alpha trypsin inhibitor, heavy chain 3-like, partial [Saccoglossus kowalevskii] Length = 627 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 63/167 (37%), Gaps = 10/167 (5%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+DVS SM+ K+G ++R +LD ++S N + S V+F + + Sbjct: 302 VLFVIDVSGSMDGA------KMGQTKEALRVILDDMRSFDRFNILTFSYEVSFWKENMMI 355 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + +N L T GL + + E+ + +I L Sbjct: 356 LATQENILEAKNFVNNLRASGGTNFNGGLVEGVEMLRRVTDDAENTERSAF----LVIML 411 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 TDG+ +S + ++ +G + +FL+ + Sbjct: 412 TDGQPTSGETQLTKIQENAKTYIDGQYSLFCLGFGGDVNFKFLQKIS 458 >gi|258620794|ref|ZP_05715829.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258586992|gb|EEW11706.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 419 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 74/218 (33%), Gaps = 20/218 (9%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G I+ ++ P + +++ ++ S +F A+L + + L + N Sbjct: 5 KKQQGVAGIIYIMMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKEDDENN 64 Query: 71 NGKKQK--NDFSYRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 +K + + I +I + ++G Q T ++ +HK Sbjct: 65 VSYARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKS--- 121 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 +S + ++ + L +D+ ++D+S SM + Sbjct: 122 -WISYENISLKPEFTVNGSSVTRKFLP------------QPVDVYFIVDMSASMRATWQN 168 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ I +++ +K R L+ + + Sbjct: 169 GKSQIDEVKNVITRVVNDLKDFDTEVKS-RVALLGYHN 205 >gi|126658524|ref|ZP_01729672.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] gi|126620266|gb|EAZ90987.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] Length = 610 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 25/192 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L++ +VLD S SM + L A ++ +++D + ++ ++ + Sbjct: 39 PLNLSVVLDRSGSMAGY------ALSNAIQATEKLVDFLSPDDLLS------VIIYDDVA 86 Query: 228 VQTFP--LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P Q I+ KI ++ T + G + + H++ + Sbjct: 87 EVIVPHQAVTNKQEIKAKIKKIRARGCTNLSGGW------LLGCSQVKSHLSTDKLNR-- 138 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFY 343 ++ LTDG + + L E ++G + +G + + L A+ FY Sbjct: 139 -VLLLTDGLANIGERKPEILLKTAAEKAQQGIVTTTLGFGSNFNEDLLIGMADAAGGNFY 197 Query: 344 SVQNSRKLHDAF 355 +Q+ D F Sbjct: 198 FIQSPDDSADVF 209 >gi|329849363|ref|ZP_08264209.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841274|gb|EGF90844.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 590 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 81/221 (36%), Gaps = 24/221 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + PW A + + + + S+ +++ ++DVS SMND DKL Sbjct: 199 FSITTDVAQTPWNAQTRLMRVGLRAYDVPRSERPAA-NLVFLVDVSGSMNDP-----DKL 252 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + ++ + D ++ V+ VV +G +++++ + L G + Sbjct: 253 PLVKTALSMLSDNLRPDDKVSIVVYAGAAG------MVLAPTHEGKYVKQALECLSAGGS 306 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G+ AY + KG + +I TDG+ + E + Sbjct: 307 TAGGQGMALAYAT------AEANFIKGGINR---VILATDGDFNVGISSIGEVEALVKQN 357 Query: 313 KRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRK 350 + G + A+G + ++ A + + ++ + Sbjct: 358 RESGVTLTALGFGTGNYNEALMEKMADVGNGNYAYIDSAME 398 >gi|325688744|gb|EGD30753.1| von Willebrand factor type A [Streptococcus sanguinis SK115] Length = 551 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 45/278 (16%), Positives = 79/278 (28%), Gaps = 43/278 (15%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 ++ I +I + N + + I + + L +D K L Sbjct: 263 TTTQLREAAKNGSADILSISYQTYINTPEFSDYEYVPFGIRQDSPLYATTNDATKQEVLK 322 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM--------------V 174 S Y + + + + + + V Sbjct: 323 KFSSYVLEGNNQSKATSYGFNKLDDYSFEEQTTDGNLLMSMQNLWKKNKNNSQPIVGVFV 382 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL- 233 DVS SM+ + +S+ L I N + GLV++S + P+ Sbjct: 383 TDVSGSMDGE------PMNNLKKSLLNSLQYI------NEENQIGLVSYSDDVTINVPID 430 Query: 234 ---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + I L T + G A I D K + + I L Sbjct: 431 TMNSTQKSYFTSAIKGLTPSGGTATYDGTLVAVKMILD-------KMKENPGARPVIFVL 483 Query: 291 TDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 +DG+ N + E + K G V IG A+ Sbjct: 484 SDGQTNGGYEFERVEPII-----KALGITVNTIGYNAD 516 >gi|212715236|ref|ZP_03323364.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] gi|212661917|gb|EEB22492.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] Length = 1192 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/127 (23%), Positives = 49/127 (38%), Gaps = 15/127 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SM + A +++L S V+ +VTFS+K Sbjct: 492 DIVLVMDKSGSMKGELDN--NAKEAANALAKKLLTDKNSTLPSEQQVQMAVVTFSTKATI 549 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 V I + T L+ A +I G + KK+IIF Sbjct: 550 EQNFTTDVLKINNAVEG-DPDGGTNWEAALKQA------------NILSGRSNVKKHIIF 596 Query: 290 LTDGENS 296 L+DG+ + Sbjct: 597 LSDGDPT 603 >gi|241205700|ref|YP_002976796.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240859590|gb|ACS57257.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 791 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 49/292 (16%), Positives = 100/292 (34%), Gaps = 31/292 (10%) Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY-EMPF 137 + + D ++ QD + R+ SL KD+ L+ + +MP Sbjct: 263 LTVDLRAGFPLGDVKSSFHAVDINQDGDQ-ARTISLKADTVPADKDFELTWKAAAGKMPS 321 Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 ++ +T + + +++ V+D S SM+ GP +++ + Sbjct: 322 AGLFREVIDGKTYLLAFVTPPAAPDTAAPPAKREVVFVIDNSGSMS---GPSIEQARQSL 378 Query: 197 RSIREMLDIIKSIPDVNNVVR--SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 L+ D NV+R + + +V P + + L T+ Sbjct: 379 ALAISKLNP----DDRFNVIRFDDTMTDYFKGLVTATP--DNREKAIGYVRGLTADGGTE 432 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 P L+ A ++FLTDG I N++ LF A R Sbjct: 433 MLPALQAALRNQGPVASGALRQ----------VVFLTDG-----AIGNEQQLFQEITANR 477 Query: 315 RGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 A V+ +G+ + F+ A R F ++ ++ ++ + ++ Sbjct: 478 GDARVFTVGIGSAPNTYFMTKAAEMGRGTFTAIGSTDQVASRMGELFAKLQN 529 >gi|123443829|ref|YP_001007800.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122090790|emb|CAL13672.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 459 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 52/246 (21%), Positives = 86/246 (34%), Gaps = 41/246 (16%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F + F N +G+I I I+LP ++ + E SH KAKL ++ + L Sbjct: 11 FNHFTLFKKNEQGTILISFMIILPFFIALIFITFEISHYLQRKAKLSDAIEQATLA---- 66 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN-ELRENGFAQDINNIERSTSLSIIIDDQ 121 D +I N + N L F+ I NI +T Sbjct: 67 ------LTIENNAIPDEPQQIKNNALVLSYANAYLPSKEFSVPIININDNTYYLE----- 115 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL---DVS 178 YN + Y P F T N+ + + V I +K+ D+ V+ D S Sbjct: 116 ---YNAAVTMAY--PAKFLTQTSLTNAITDINITDNGVAIKNKAIEASDLTDVIFVADYS 170 Query: 179 LSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 SM +F +++ + R++ DII + + + P + Sbjct: 171 GSMLYNFDVNEPNDHERINALRSAFRKLHDIIMNNS-------------NINAIGYIPFS 217 Query: 235 WGVQHI 240 WG + I Sbjct: 218 WGTKRI 223 Score = 42.9 bits (99), Expect = 0.082, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 45/138 (32%), Gaps = 29/138 (21%) Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP-NIDN 302 I ++ T + G+ A N +F + H K +I L+DG ++ N Sbjct: 321 IIKMEPYGWTLISSGILSA-NNLFKKEANNRHR--------KLMIILSDGVDTYQDNFLP 371 Query: 303 KESLFY---------CNEAKRRGAIVYAIGVQAEAADQF-------LKNCASPDRFYSVQ 346 + LF C G + I + D + C D +Y Sbjct: 372 NKGLFISKTLVEKGMCERVISSGIQMAFIAIAYSPDDDVNEPEYINWRQCVGKDNYYEAH 431 Query: 347 NSRKLHDAFLRIGKEMVK 364 N+ +L I + + K Sbjct: 432 NADEL---MRDIQQAISK 446 >gi|118591412|ref|ZP_01548810.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] gi|118436084|gb|EAV42727.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] Length = 657 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 72/209 (34%), Gaps = 37/209 (17%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++D D++ VLD S SM G K +A + + +PD R+G++ Sbjct: 18 AQTDTSPDLLFVLDSSNSM-WGQIDGTAKAEIARSAFEGF---VAGLPDG---TRAGVMA 70 Query: 223 FSSK-------IVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + + + P++ + E + L T T L A E L Sbjct: 71 YGHRRKADCGDVETLVPVSDLDRAKLVESVKALTPRGKTPITETLRQA-------AELLA 123 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQ-AEAADQ 331 + +I ++DG + + G + IG A ADQ Sbjct: 124 QNDRPGR-----LILISDGIETCGG----DPCALAEALASSGVDFKAHVIGFDIASKADQ 174 Query: 332 FLKNCA---SPDRFYSVQNSRKLHDAFLR 357 C + +++ +++ L++A Sbjct: 175 AKIACIAHLTGGTYWNARDADGLNEALKE 203 >gi|109900221|ref|YP_663476.1| vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] gi|109702502|gb|ABG42422.1| Vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] Length = 701 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 69/184 (37%), Gaps = 25/184 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ +++ +LD S SM + A R++ L ++ +VN ++ Sbjct: 297 AQQMPSREVVFLLDTSGSMAGE------SIVQAKRAVDFALTQLRPEDNVN------IIQ 344 Query: 223 FSSKIV----QTFPLAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F+ + P Q + + L T+ P L A NK + Sbjct: 345 FNDAPQALWKRAMPATAKHIQRARNWVASLHADGGTEMAPALTLALNK-PSLHRDDSDLL 403 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H + ++F+TDG ++ N+++L E+K ++ IG+ + F+ A Sbjct: 404 GSHKLRQ--VVFITDG-----SVSNEDALMSLIESKLADNRLFTIGIGSAPNSYFMTQAA 456 Query: 338 SPDR 341 R Sbjct: 457 QAGR 460 >gi|86358602|ref|YP_470494.1| hypothetical protein RHE_CH03000 [Rhizobium etli CFN 42] gi|86282704|gb|ABC91767.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 780 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 46/289 (15%), Positives = 98/289 (33%), Gaps = 26/289 (8%) Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY-EMPF 137 + + + ++ + QD + R+ SL KD+ L+ + + P Sbjct: 250 LTVNLKAGFPLGEVKSSFHDVDIGQDGDQ-ARTISLKGDAVPADKDFELTWKAAPGKTPS 308 Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 ++ +T + + +++ V+D S SM+ G +++ + Sbjct: 309 AGLFREAIDGKTYLLAFVTPPTAPDAAAPSKREVVFVIDNSGSMS---GQSIEQARQSLA 365 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 L+ + + +V P + + L T+ P Sbjct: 366 LAISRLNPNDRFNVIRFDDTM--TDYFKGLVAATP--DNREKAVAYVRSLTADGGTEMLP 421 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 LE DA +A G + ++FLTDG I N++ LF A R A Sbjct: 422 ALE-------DALRNQGPVASGAL---RQVVFLTDG-----AIGNEQQLFQEITANRGDA 466 Query: 318 IVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 V+ +G+ + F+ A R F + ++ ++ + ++ Sbjct: 467 RVFTVGIGSAPNTYFMTKAAEVGRGTFTQIGSTDQVASRMSELFAKLQN 515 >gi|326678379|ref|XP_002666248.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Danio rerio] Length = 1089 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 71/189 (37%), Gaps = 35/189 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + D NV R F+ K Sbjct: 250 DMVILVDVSGSVSGL------TLKLIKASVTEMLDTLSD-DDYVNVAR-----FNEKAEA 297 Query: 230 TFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 P + +E + ++ TT G +A+N++ + + Sbjct: 298 VVPCFDHLVQANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNTNVPRANCN---- 353 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCAS 338 K I+ TDG D + +F + V+ V D L+ C++ Sbjct: 354 ---KIIMLFTDG-----GEDRAQDIFEQYNWPNKTVRVFTFSVGQHNYDVTPLQWIACSN 405 Query: 339 PDRFYSVQN 347 ++ +++ Sbjct: 406 KGYYFEIRS 414 >gi|297727663|ref|NP_001176195.1| Os10g0464900 [Oryza sativa Japonica Group] gi|22758314|gb|AAN05518.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432565|gb|AAP54180.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|255679473|dbj|BAH94923.1| Os10g0464900 [Oryza sativa Japonica Group] Length = 646 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 74/218 (33%), Gaps = 43/218 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ VLDVS SM + KL + +++ ++D + R +++FSS Sbjct: 173 PLDLVTVLDVSGSMVGN------KLALLKQAMGFVIDNLGPGD------RLCVISFSSGA 220 Query: 228 VQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L+ G H + + L T L A + D + + Sbjct: 221 SRLMRLSRMTDAGKAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVES------ 274 Query: 284 KKYIIFLTDGENSS-----------PNID----NKESLFYCNEAKRRGAIVYAIGVQAEA 328 +I L+DG+++ N D R V+ G + Sbjct: 275 ---VILLSDGQDTYTVPPRGGYDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGKDH 331 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 + A + F ++N + D F + IG + Sbjct: 332 DAAAMHTIAEVTGGTFSFIENEAAIQDGFAQCIGGLLS 369 >gi|74315933|ref|NP_001028276.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|72679321|gb|AAI00122.1| Inter-alpha (globulin) inhibitor H3 [Danio rerio] Length = 892 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 62/170 (36%), Gaps = 26/170 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM + K+ + + +++ + D+ GL+TFSS I Sbjct: 269 VVFIIDQSGSMQGN------KIE------QTRMAMLRILSDLAKDDYFGLITFSSHIQAW 316 Query: 231 FP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V+ + + ++ G T + A N I ++ Sbjct: 317 KPELLKATAENVEEAKTFVKQIRSGGATDINGAVLNAVNMINQYTQEGSAS--------- 367 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +I LTDG+ +S + A +Y +G +FL+ Sbjct: 368 ILILLTDGDPTSGVTNPVTIQQNVKTAIGGKYPLYCLGFGFNVRFEFLEK 417 >gi|46395320|dbj|BAD16597.1| DEC-1 [Lymnaea stagnalis] Length = 919 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 22/197 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ + + +L R L + VR G + F S + + Sbjct: 434 DIIFVMDSSSSI--TYPNYVKQLSFVANVTRNFL-------IGKDDVRYGALIFGSNVEK 484 Query: 230 TFPLA--WGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +++ I S+T + L+Y + G D K Sbjct: 485 LFDLKKYDSPVDVEQHIMEATYLASSTDTAAALQY-----ILDQRMFADEQGGRPDAVKI 539 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSV 345 II LTDGE++ P +++ + + G + +IG+ E + L AS + Sbjct: 540 IIVLTDGESTYP----EKTRAEATKLQSLGYHMMSIGIGNEINELELNALASNTSNIFKA 595 Query: 346 QNSRKLHDAFLRIGKEM 362 + + L + Sbjct: 596 ASYQVLDQLHKEVVTRA 612 >gi|328884707|emb|CCA57946.1| hypothetical protein SVEN_4660 [Streptomyces venezuelae ATCC 10712] Length = 535 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 60/197 (30%), Gaps = 36/197 (18%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV----------NNVVRSGLVTF 223 VLD S SMN D+L ++ E+ + +V + VR+ Sbjct: 359 VLDTSGSMNG------DRLERLKTALVELTGDFRDREEVTLMPFGSAVKRDEVRT----- 407 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V + I+ +L T L+ AY + + Sbjct: 408 --HTVDPASPRQALDAIRADARKLTASGGTAIYSSLQEAYRSLGKSSGDTFTS------- 458 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLKNC-ASPDR 341 I+ +TDGEN+ FY + A ++ V+ I + + + Sbjct: 459 ---IVLMTDGENTDGAPAAAFDSFYGSLPAGQQRTPVFPILFGDSDRAELGHLADLTGGK 515 Query: 342 FYSVQNSRKLHDAFLRI 358 + L AF I Sbjct: 516 LFDAHQ-GSLDGAFEEI 531 >gi|253565978|ref|ZP_04843432.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] gi|251945082|gb|EES85520.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 610 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 79/211 (37%), Gaps = 22/211 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ T PW N+ H + I K I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMEAGTCPW--NADHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V +G ++ + Q I+E I+ L Sbjct: 267 LDLVKSSLKLLVNNLRDKDKVAIVTYAG----NAGVKLEATPGSDKQKIREAIDELEASG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II TDG+ + +KE + Sbjct: 323 STAGGEGIMLAYKIAQKNFISGGNNR---------IILCTDGDFNVGVSSDKELEKLIEQ 373 Query: 312 AKRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 ++ G + +G D ++ A Sbjct: 374 KRKSGIFLTVLGYGMGNYKDSKMQTLAEKGN 404 >gi|51597679|ref|YP_071870.1| membrane protein. [Yersinia pseudotuberculosis IP 32953] gi|51590961|emb|CAH22619.1| Putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 518 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 53/300 (17%), Positives = 99/300 (33%), Gaps = 42/300 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N +G+I + L+PV ++ L E SH +AKL L+ + L +T+ Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE------ 70 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFR-NELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 +ND + N T + + L F+Q + + +YN Sbjct: 71 ----NNYRNDRASNNRNNYLVTSYAQSYLPSERFSQ------PRVVNTYNESLGYTEYNA 120 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG- 186 S Y++ + + + ++ K S +D++ V D S SM+ FG Sbjct: 121 SLQMNYQLALLNSYLKQTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGD 180 Query: 187 ----PGMDKLGVATRSIREM---------LDIIKSIPDVNNVVRSGL-----VTFSSKIV 228 + KL ++ ++ I +P R T+ Sbjct: 181 IERNNRITKLDELKAIFVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPY 240 Query: 229 QTFPLAWGVQHIQEKI--NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + ++Q N + AY ++ + K H A + KK+ Sbjct: 241 SPKKIDRNGHYLQRYTASNLKNIPGLDNLSGIDNLAYGQLDEDK----HHAILSEIEKKH 296 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 28/133 (21%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + +I + T ++ G+ + ++ + K +I L+DG++ Sbjct: 376 NSKGDINEILNMKAEGGTLASSGILVGNKMLTES-----------QNNNKLMIILSDGDD 424 Query: 296 S----SPNIDNKESLF----------YCNEAKRRGAIVYAIGVQAEAADQFL---KNCAS 338 + S D K + C + K G + IG+ + + K+C Sbjct: 425 NTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVG 484 Query: 339 PDRFYSVQNSRKL 351 FY +N+ +L Sbjct: 485 TGNFYLAKNAHEL 497 >gi|39652254|emb|CAC79611.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens] Length = 886 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/283 (14%), Positives = 99/283 (34%), Gaps = 24/283 (8%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I + + + + + ++ + F + F ++ ++ + + + Sbjct: 184 IFEPQGISMLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLD--QQRSCPTCTDSLLN 241 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 D+ ++ E P AP + K ++ V+D+S SM Sbjct: 242 GDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPK---------NVAFVIDISGSMA 292 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQ 241 KL ++ +L+ ++ +N ++ SG V+ + +VQ P +Q + Sbjct: 293 G------RKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHLVQATPE--NLQEAR 344 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + T GL + + A+E+ + + +I LTDG+ + Sbjct: 345 TFVKSMEDKGMTNINDGLLRGISMLNKAREEH----RIPERSTSIVIMLTDGDANVGESR 400 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 ++ A +Y +G FL+N A + ++ Sbjct: 401 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 443 >gi|260825786|ref|XP_002607847.1| hypothetical protein BRAFLDRAFT_199461 [Branchiostoma floridae] gi|229293196|gb|EEN63857.1| hypothetical protein BRAFLDRAFT_199461 [Branchiostoma floridae] Length = 187 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 45/201 (22%), Positives = 75/201 (37%), Gaps = 29/201 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + LD+ VLD S S+ DK+ T ++ DI S R G+V +S+ Sbjct: 9 NAPLDLFFVLDGSGSVT---YANFDKVKEFTENVVNAFDISASS------TRVGVVQYST 59 Query: 226 KIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L I+ + + G T++ LE+A + A Sbjct: 60 SNTLEFNLGDHADKPSTLAAIDSISYQGGGTRTGSALEFA----------RLNAAWRGGS 109 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DR 341 K +I +TDG++ + N+ +G VYAIGV A Q L+ A + Sbjct: 110 VPKVMIVVTDGKSGDSVASS------ANDLASQGVDVYAIGVGNYDATQLLEIAAGNQNN 163 Query: 342 FYSVQNSRKLHDAFLRIGKEM 362 + + L +I + + Sbjct: 164 VIELTDFNALSAEINQIAQTV 184 >gi|256005895|ref|ZP_05430841.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|255990131|gb|EEU00267.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] Length = 524 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 70/195 (35%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDK 191 + + PW + +L+ K S + ++ ++DVS SM++ +K Sbjct: 134 FSITTEIGQCPWNPENKL--MLVGLQTKKLSTEQLPPSNLVFLIDVSGSMDEP-----NK 186 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + + + ++D + V+ VV +G +V I + + L G Sbjct: 187 LPLLKSAFKLLVDELDEDDRVSIVVYAGAAG----LVLDSTPGNEKDKILDALMNLEAGG 242 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G++ AY+ K + +I TDG+ + E + + Sbjct: 243 STAGAEGIKLAYDVAKKNFIKSGNNR---------VILATDGDFNVGISSEAELVRLIEK 293 Query: 312 AKRRGAIVYAIGVQA 326 + G + +G Sbjct: 294 KRDEGIFLTVLGFGT 308 >gi|187735658|ref|YP_001877770.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187425710|gb|ACD04989.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 859 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 67/198 (33%), Gaps = 22/198 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K + M +VLD S SM+ G K+ +A I + D + + + + Sbjct: 384 KMKLMTAMSIVLDRSGSMSCSVPGGKTKMDLANA---GTCQTISLLSDQDLISVHAVDSE 440 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 IV L + + ++R+ G GL+ + ++ + H Sbjct: 441 PHPIVTLSSLGPNRKKMISSVSRIASMGGGIFIGAGLKAGWQELQRSVAGTRH------- 493 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCA--S 338 ++ D ++S D +E+L E + G V I + E + L+ A Sbjct: 494 ----LLLFADADDSEEPADYRETLK---EMVKEGVTVSVIALGTEKSADAGLLREIAELG 546 Query: 339 PDRFYSVQNSRKLHDAFL 356 R + + F Sbjct: 547 RGRIFFCDRPGDIPSIFA 564 >gi|116623283|ref|YP_825439.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226445|gb|ABJ85154.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 299 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 83/210 (39%), Gaps = 24/210 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S +SD+ L + +++D S+S L + ++ + P+ + V L+ Sbjct: 68 SQESDLPLTLGLMVDTSMSQRRV-------LDAERGASYRFIETV-LRPNKDQVF---LM 116 Query: 222 TFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 F +I PL ++ + + ++ F + G Y+ + A +++ Sbjct: 117 QFDFRIFMRQPLTNSLRQLSDSLPYVDTPTFNQLRAQSGGGTLLYDAVVTASQEVMLNRT 176 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA 337 G +K +I LTDGE+ + +++ A +Y+I + + L+ + Sbjct: 177 G----RKALILLTDGEDYGSDASVGDAIEAAQRADTL---IYSILFADQGDGRRPLQRMS 229 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + F+ V + + F I +E+ Q Sbjct: 230 KETGGSFFEVSKKQDIDQIFTAIQEELRSQ 259 >gi|60682855|ref|YP_212999.1| hypothetical protein BF3393 [Bacteroides fragilis NCTC 9343] gi|60494289|emb|CAH09084.1| conserved exported hypothetical protein [Bacteroides fragilis NCTC 9343] Length = 610 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 79/211 (37%), Gaps = 22/211 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ T PW N+ H + I K I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMEAGTCPW--NADHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V +G ++ + Q I+E I+ L Sbjct: 267 LDLVKSSLKLLVNNLRDKDKVAIVTYAG----NAGVKLEATPGSDKQKIREAIDELEASG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II TDG+ + +KE + Sbjct: 323 STAGGEGIMLAYKIAQKNFISGGNNR---------IILCTDGDFNVGVSSDKELEKLIEQ 373 Query: 312 AKRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 ++ G + +G D ++ A Sbjct: 374 KRKSGIFLTVLGYGMGNYKDSKMQTLAEKGN 404 >gi|257064432|ref|YP_003144104.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Slackia heliotrinireducens DSM 20476] gi|256792085|gb|ACV22755.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Slackia heliotrinireducens DSM 20476] Length = 629 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 70/185 (37%), Gaps = 23/185 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + T+S K + S +++ ++D+S SM++ DKL + S + Sbjct: 249 PWNDQTQLLVMTFTASDKAQTASKGS-NLVFLIDISGSMDEP-----DKLDLLKDSFGTL 302 Query: 203 LDIIKSIPDVNNVVRSGLVTFS--SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 L+ + R +VT++ ++ + I +NRL +T GLE Sbjct: 303 LENLGPND------RVSIVTYAAGEDVLLEGASGDDTRKIMRALNRLEADGSTNGEAGLE 356 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY E E + I+ +DG+ + + + E + G + Sbjct: 357 MAY-------EVAERNYIEGGVNR--IVMASDGDLNVGITSESDLYDFVEEKRETGVYLS 407 Query: 321 AIGVQ 325 +G Sbjct: 408 VLGFG 412 >gi|268316012|ref|YP_003289731.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333546|gb|ACY47343.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 339 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 58/177 (32%), Gaps = 24/177 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + GLD+++ LDVS SM +L A + +L+ + R G Sbjct: 81 QPRQAERRGLDLLIALDVSNSMLAEDVAP-SRLARARYELYRLLEHL-------EGDRVG 132 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 L+ F+ P ++ ++ LI T ++ A + Sbjct: 133 LILFAGDAFLQCPFTTDYGAVRLFLDVADPSLIPTPGTDYVRMIQVALQAFEAPQPDE-- 190 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ ++DGEN + + + + G A+GV A Sbjct: 191 -----VPRSRVLLVVSDGENHAEGFE-----QALRQLQEAGIERLAVGVGETAGAPI 237 >gi|116753762|ref|YP_842880.1| magnesium chelatase [Methanosaeta thermophila PT] gi|116665213|gb|ABK14240.1| protoporphyrin IX magnesium-chelatase [Methanosaeta thermophila PT] Length = 669 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 69/193 (35%), Gaps = 28/193 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 ++ V+D S SM M ++ A ++ +L + R G+V F Sbjct: 473 KTSATVLFVVDASGSMG-----AMRRMESAKGAVLSLL-----MDSYQKRDRIGMVAFRG 522 Query: 226 K-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P V +++ L G T + GL A I + + + + Sbjct: 523 NDADLLLPPCSSVDLAMKRLAELPTGGRTPLSAGLSKALRVI-------QGELIKNKETR 575 Query: 285 KYIIFLTDG-ENSSPNIDNKESL-FYCNEAKRRGAIVYAIGVQAEAADQF---LKNC--- 336 I+ ++DG N S + D K+ + EA+R G I + + L C Sbjct: 576 PMIVLVSDGRANVSISSDPKKEIVQIAEEARRLGVHTVVIDTEVVGSSFMEMRLGYCRDI 635 Query: 337 --ASPDRFYSVQN 347 A+ R+Y + + Sbjct: 636 AEAAGGRYYPISD 648 >gi|126737457|ref|ZP_01753192.1| von Willebrand factor type A domain protein [Roseobacter sp. SK209-2-6] gi|126722042|gb|EBA18745.1| von Willebrand factor type A domain protein [Roseobacter sp. SK209-2-6] Length = 479 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 81/238 (34%), Gaps = 47/238 (19%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 APL + + + VLD S SM G+ K+ +A Sbjct: 2 LKKLMIATLLAGVSAPLAANETTRST----------FVLDASGSM-WGQIEGVAKITIAQ 50 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--------VQTFPLAWGVQHIQEKINRLI 248 + ++++L + P+ GL+ + + P A + I + ++ + Sbjct: 51 QVLQKLL--VDLSPNQE----VGLMAYGHRQKGDCSDIEQLIAPAAGTREAISKAVDAIT 104 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T + + A + ++EK +I ++DGE + ++ Sbjct: 105 PKGKTPLSAAVIQAAEGLHLSEEKAT------------VILISDGEETCG----RDPCAI 148 Query: 309 CNEAKRRGA--IVYAIGVQA--EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGK 360 E + G ++AIG +AA L+ A + + + + L A + K Sbjct: 149 GAELEAAGVDFTLHAIGFGIADDAARAQLQCLAENTGGVYLDAKGAEGLSAALSHVTK 206 >gi|115623666|ref|XP_789748.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960627|ref|XP_001186460.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 846 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 62/172 (36%), Gaps = 28/172 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ R+ +LD ++ I R +V F S + Sbjct: 339 VVFVIDVSGSMRG------RKMDQTKRAFTTILDDVRPID------RINIVLFESNVRVW 386 Query: 228 --VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q + + +N + G T GL A + + + G+ + Sbjct: 387 RSNQMVEATGDNIAAAKNHVNDISAGGGTNLYDGLTNAVDLLME---------HGNGEAM 437 Query: 285 KYIIFLTDGENSSPN-IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 II LTDG+ +S + E + ++++G FL+ Sbjct: 438 PLIIMLTDGQPTSGSVTSTSEIIKRITNLIDGRLSLFSVGFGNGVDFSFLEK 489 >gi|296481522|gb|DAA23637.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] Length = 940 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 72/197 (36%), Gaps = 26/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ VLD S SM KL ++ +L ++ N +V FS++I Sbjct: 296 VVFVLDSSASMVG------TKLRQTKDALFTILHDLRPQDHFN------IVGFSNRIKVW 343 Query: 229 QTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + ++ I++ I+ + T L+ + D + H Sbjct: 344 KDHLVSVTPNSIRDGKVYIHHMSPSGGTDINGALQRGIQLLND---YVAHNDIEDRSVS- 399 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 ++FLTDG+ + + L EA R ++ +G+ A+ + L+NC Sbjct: 400 LVVFLTDGKPTVGETHTLKILNNTREAARGRVCIFTVGIGADVDFKLLEKLSLENCGLTR 459 Query: 341 RFYSVQNSRKLHDAFLR 357 R + ++R F Sbjct: 460 RVHEDHDARAQLIGFYD 476 >gi|47216962|emb|CAG04904.1| unnamed protein product [Tetraodon nigroviridis] Length = 519 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/200 (21%), Positives = 68/200 (34%), Gaps = 28/200 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D D+ VLD S S++ H+ + T V+ +R + FS+ Sbjct: 30 DGAFDIYFVLDRSGSVSGHWPEIFGFVEQLTGRF------------VSPRMRVSYIVFSA 77 Query: 226 KIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + V PL I E + RL T GL+ + Sbjct: 78 RAVVILPLTGHRAEIDEGLERLRQIKPAGETFMHEGLKA----------VSGQMKAQTSP 127 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +I LTDG+ + S+ + A+ GA V+ +GV Q + P+R Sbjct: 128 SSSIVIVLTDGKLEVYPYEL--SVQEADRARGLGARVFCVGVMDFDHKQLAEIADGPERV 185 Query: 343 YSVQNS-RKLHDAFLRIGKE 361 + V + L D I + Sbjct: 186 FPVLSGFHALKDVVATILTQ 205 >gi|53714874|ref|YP_100866.1| putative outer membrane protein [Bacteroides fragilis YCH46] gi|52217739|dbj|BAD50332.1| putative outer membrane protein [Bacteroides fragilis YCH46] Length = 610 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 75/197 (38%), Gaps = 25/197 (12%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ T PW N+ H + I K I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMEAGTCPW--NADHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPLAWGVQHIQEKINRLIF 249 L + S++ +++ ++ + + +VT++ Q I+E I+ L Sbjct: 267 LDLVKSSLKLLVN------NLRDKDKVAIVTYAGNAGEKLASTPGSDKQKIREAIDELEA 320 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 +T G+ AY + II TDG+ + +KE Sbjct: 321 SGSTAGGEGIMLAYKIAQKNFISGGNNR---------IILCTDGDFNVGVSSDKELEKLI 371 Query: 310 NEAKRRGAIVYAIGVQA 326 + ++ G + +G Sbjct: 372 EQKRKSGIFLTVLGYGM 388 >gi|281420094|ref|ZP_06251093.1| BatB protein [Prevotella copri DSM 18205] gi|281405894|gb|EFB36574.1| BatB protein [Prevotella copri DSM 18205] Length = 345 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 60/180 (33%), Gaps = 32/180 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 KIS G++ ++ LD+S SM D +DK + ++ + + Sbjct: 80 SKISHDKRHGIETIICLDISNSMLCQDVVPSRLDKSKMLIENLVDNFNN----------D 129 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEK 272 + GL+ F+ P+ + + + G T ++ A Sbjct: 130 KIGLIVFAGDAFVQLPITTDYVSAKMFLQNITPGLIQTQGTNIGAAIDLASKSFTQ---- 185 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ + II +TDGEN P + ++G V+ +G+ Sbjct: 186 -------QENVGRAIIVITDGENHEPGAQEAAAAA-----NKKGINVFILGIGNTKGAPI 233 >gi|297380203|gb|ADI35090.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori v225d] Length = 217 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 73/197 (37%), Gaps = 12/197 (6%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-I 227 + + ++LD S SM+ G G ++ V I++M++ +K + ++TF Sbjct: 15 IPVFLLLDTSGSMSHSLGNG-TRIEVLNLCIQKMIETLKQEAKKELFSKMAIITFGENGA 73 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 V P +++I L T A + I D YK Y Sbjct: 74 VLHTPFD-DIKNIN--FKPLSASGGTPLDQAFRLAKDLIED------KDTFPTKFYKPYS 124 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I ++DGE + S F+ + + ++ ++I + + + D + + Sbjct: 125 ILVSDGEPNDDKWQKALSDFH-HYGRSAKSVCWSIFIGNRNDNPQVNKEFGKDGVFYADD 183 Query: 348 SRKLHDAFLRIGKEMVK 364 KL F + + + K Sbjct: 184 VEKLVGLFEIMTQTISK 200 >gi|198434986|ref|XP_002126110.1| PREDICTED: similar to RIKEN cDNA E330026B02 [Ciona intestinalis] Length = 1715 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 76/199 (38%), Gaps = 24/199 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S++ + ++ M DI + + R G++ + + Sbjct: 248 DLLFLLDGSSSISPNDFS---------TTLTWMRDIAEQFTVGSQFTRVGMMQYGDEPHT 298 Query: 230 TFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-YKKY 286 F L + E I+ + +S P Y I + G + + Sbjct: 299 EFDLNTFQNGSQVFEAISNVTQIGG-ESGP-----YAAILQVLRRSLTAQYGSRENVSQI 352 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 IIF+TDG D++ES NE + GA+VY IGV + L+ AS Sbjct: 353 IIFVTDG---GVVDDSEESQTILNELRFSGALVYTIGVGRMVSRPQLRMIASRPASHHVT 409 Query: 344 SVQNSRKLHDAFLRIGKEM 362 ++ + +L +I + Sbjct: 410 TIASYSELSATKSQIIDRI 428 >gi|327262912|ref|XP_003216267.1| PREDICTED: integrin alpha-1-like [Anolis carolinensis] Length = 1166 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 41/269 (15%), Positives = 91/269 (33%), Gaps = 41/269 (15%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 +++ + + + Y+ + T C+N S ++ S + + LD+++V Sbjct: 164 TLVTNPKGGFLACGPLYAYKCGRLHYTTGICSNVSSNFEVVNSIAPSVQECNTQLDIVIV 223 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S S + T + +L + P G+V + + F L Sbjct: 224 LDGSNS--------IYPWESVTEFLNSLLQNMNIGPQQTQ---VGIVQYGENVTHEFNL- 271 Query: 235 WGVQHIQEKI----NRLIFGSTT--KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 E++ ++ T + G++ A + F +K ++ Sbjct: 272 -NTYTTVEEVLVAAKKIGQRGGTRTNTALGIDTARKEAFTEARGARRG------VQKVMV 324 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---------QAEAADQFLKNCAS- 338 +TDGE + DN + + ++I + E + +K+ AS Sbjct: 325 VVTDGE----SHDNYRLGEVIQDCEDENIQRFSIAILGHYNRGNLSTEKLVEEIKSIASE 380 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 F++V + L +G+ + Sbjct: 381 PTEKHFFNVSDELALLTIVEALGERIFAL 409 >gi|119596272|gb|EAW75866.1| matrilin 4, isoform CRA_a [Homo sapiens] Length = 391 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 154 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 204 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 205 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 259 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 260 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 307 >gi|56417742|emb|CAI21077.1| matrilin 4 [Homo sapiens] Length = 432 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 195 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 245 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 246 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 300 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 301 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 348 >gi|221109528|ref|XP_002169888.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 366 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 39/175 (22%), Positives = 77/175 (44%), Gaps = 22/175 (12%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN- 213 T S SD +D+ +LD S S+ + ++++E L I S D+ Sbjct: 9 TTGKPSKESCSDAIVDVGFILDSSGSLRRDY-----------KNLKEFLKTIASFFDIKI 57 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 N ++G++TFS + + L ++ ++++ GSTT+ L ++ N +F + Sbjct: 58 NGSQAGVITFSHRSEHSIKLNDFSDGDSFEKAVDKIPLMGSTTRIDKALRHSKNVMFTNQ 117 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 G + K +I LTDG + + ++ +E + G ++ AIG+ Sbjct: 118 ------NGGRLEATKLLILLTDGSQT-FSAKQEDPSIIADEIRNDGVLIIAIGIG 165 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/201 (20%), Positives = 85/201 (42%), Gaps = 23/201 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 ++ + S SD +D+ +LD S S+ + ++++E L I S D Sbjct: 171 DIIQSGKPSKESCSDAIVDVGFILDSSGSLRRDY-----------KNVKEFLKTIASFFD 219 Query: 212 VN-NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIF 267 + N ++G++TFS + + L V ++ ++++ GSTT+ L ++ N +F Sbjct: 220 IKINGSQAGVITFSHRSEHSIKLNDFSDVDSFEKAVDKIPLMGSTTRIDKALRHSKNVMF 279 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + G + K +I LTDG + + ++ +E + G ++ AIG+ Sbjct: 280 TNQ------NGGRLEATKLLILLTDGSQT-FSAKQEDPSIIADEIRNDGVLIIAIGIGEG 332 Query: 328 AADQFLKNCA-SPDRFYSVQN 347 L A + Y+ Sbjct: 333 INKTELNRIAGKDENTYNADT 353 >gi|218186188|gb|EEC68615.1| hypothetical protein OsI_36984 [Oryza sativa Indica Group] Length = 585 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 71/175 (40%), Gaps = 20/175 (11%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSI 199 P P+L+ ++ +D++ VLDVS SM G +L + ++ Sbjct: 19 AIPSNEERKEWPVLVHVVAPAKTE-RFPIDLVAVLDVSGSMTKATSMHGWTRLDLVKGAM 77 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQT-----FPLAW-GVQHIQEKINRLIFGSTT 253 + + + + + R +V F+ K+V + G K+N+L G T Sbjct: 78 KMVTNKLGAGD------RLAIVPFNGKVVAAGATRLMEMTTKGRADANAKVNQLKAGGDT 131 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 K P L++A + + D++ + + +I L+DG+++ D + Y Sbjct: 132 KFLPALKHA-SGLLDSRPAGDKQYRPG-----FIFLLSDGQDNGVLDDKLGGVRY 180 >gi|163815506|ref|ZP_02206879.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] gi|158449143|gb|EDP26138.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] Length = 550 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 42/175 (24%), Positives = 68/175 (38%), Gaps = 29/175 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + + + V D S SM+ D + S+ I + NN V G Sbjct: 365 KKTKDNGKDIIAVFVADCSGSMDG------DPMNQLKNSLTNGAQYI----NDNNYV--G 412 Query: 220 LVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LV++S+ + P+A + Q +N LI T S + A I +AK + Sbjct: 413 LVSYSNSVTIEVPIAQFDLNQRSYFQGAVNNLIASGGTASYDAVVVAVKMITEAKAQ--- 469 Query: 276 IAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 H D K + L+DG N+ ++D + + G VY IG +A Sbjct: 470 ----HPDAKCMLFLLSDGYANNGYSMD-----EITSALRTSGIPVYTIGYGDDAD 515 >gi|327265885|ref|XP_003217738.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Anolis carolinensis] Length = 636 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 75/204 (36%), Gaps = 21/204 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVT 222 S + +++ ++DVS+SM+ KL ++ ++L+ IK D N V G + Sbjct: 289 SHLPKNIVFIIDVSISMSG------RKLQQTREALLKILEDIKE-DDYLNFVLFGDDVHK 341 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ P + + ++ T GL + +A + + Sbjct: 342 WKDTLIKATPE--NLDEASRYVQQIDIAGWTNLNGGLMAGIEMLNEAHKNRSLPERSAS- 398 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 II LTDG + D + L A + +Y +G + L+ A+ + Sbjct: 399 ---LIIMLTDGRPTKGERDTQVILSNVRNAIQGKYPLYNLGFGYDLDYGSLEKMAAENNG 455 Query: 342 -----FYSVQNSRKLHDAFLRIGK 360 + ++ +L + + Sbjct: 456 LARRIYEDSDSALQLQGFYDEVAN 479 >gi|297538282|ref|YP_003674051.1| von Willebrand factor type A [Methylotenera sp. 301] gi|297257629|gb|ADI29474.1| von Willebrand factor type A [Methylotenera sp. 301] Length = 328 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 42/219 (19%), Positives = 72/219 (32%), Gaps = 36/219 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 IG + MV+D S SM+D F G + V ++ + G++TFS+ Sbjct: 77 GIGAQIGMVIDRSASMDDPFSGGTAEGRVGETKSVAAARLMTEFVNSRQNDMIGVITFSN 136 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + PL + IQ I S T GL A + + + Sbjct: 137 SAMYVLPLTESREAIQAAIKATAGNSLFQTNIGGGLTSAVSLFENVPDSGSRA------- 189 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------------- 329 II L+DG ++ + +R +Y I ++ Sbjct: 190 ---IILLSDGGGRLGGDVQQKLREW---LQRYNITLYWIVLRQPGGISIFNEYKEIDGEP 243 Query: 330 -------DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 Q+ K SP Y ++ + L +A I ++ Sbjct: 244 LPQEVELYQYFKTLRSPFSAYEAEDPKSLANAIADINEK 282 >gi|255535988|ref|YP_003096359.1| BatB [Flavobacteriaceae bacterium 3519-10] gi|255342184|gb|ACU08297.1| BatB [Flavobacteriaceae bacterium 3519-10] Length = 335 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 80/229 (34%), Gaps = 32/229 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYE--MPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 + I D + +D + SR+ PF++ S +L S + + Sbjct: 31 KKKKKEIFADKRFRDELFDSRSRFSRFFPFLYLMASLFLIISIVDVLSGSEEVETKQKMN 90 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + + +LDVS SMN ++L A I + + + + G++ F+ + Sbjct: 91 NV--IFLLDVSNSMNAQ-DVEQNRLQQAKNLIINAMGKMTN-------DKVGIIVFAGEA 140 Query: 228 VQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL ++ + + T ++ A +K + + Sbjct: 141 SSIMPLTTDFTAVETYVGGVETSIVKMQGTDFLKAMQTAADKFRNVAKG----------- 189 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ L+DGE++ N A R G V ++G+ +E Sbjct: 190 SRKVVLLSDGEDNEGN-----EKAAAKLANREGIRVISVGIGSEEGAPI 233 >gi|145597778|ref|YP_001161854.1| hypothetical protein YPDSF_0468 [Yersinia pestis Pestoides F] gi|145209474|gb|ABP38881.1| membrane protein [Yersinia pestis Pestoides F] Length = 513 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 47/240 (19%), Positives = 85/240 (35%), Gaps = 35/240 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N +G+I + L+PV ++ L E SH +AKL L+ + L +T+ Sbjct: 12 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE------ 65 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFR-NELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 +ND + N T + + L F+Q + + +YN Sbjct: 66 ----NNYRNDRASNNRNNYLVTSYAQSYLPSERFSQ------PRVVNTYNESLGYTEYNA 115 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG- 186 S Y++ + + + ++ K S +D++ V D S SM+ FG Sbjct: 116 SLQMNYQLALLNSYLKQTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGD 175 Query: 187 ----PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 + KL ++ + I S +N + P +WG + I Sbjct: 176 IERNNRITKLDELKAIFVKLNNRIFSNDGIN-------------TIGFVPFSWGTKRISA 222 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 28/133 (21%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + +I + T ++ G+ + ++ + K +I L+DG++ Sbjct: 371 NSKGDINEILNMKAEGGTLASSGILVGNKMLTES-----------QNNNKLMIILSDGDD 419 Query: 296 S----SPNIDNKESLF----------YCNEAKRRGAIVYAIGVQAEAADQFL---KNCAS 338 + S D K + C + K G + IG+ + + K+C Sbjct: 420 NTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVG 479 Query: 339 PDRFYSVQNSRKL 351 FY +N+ +L Sbjct: 480 TGNFYLAKNAHEL 492 >gi|296269618|ref|YP_003652250.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092405|gb|ADG88357.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 223 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 44/207 (21%), Positives = 76/207 (36%), Gaps = 22/207 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S+ L +V D S SM GP +D + + + E+ I S P V + R G++ FS Sbjct: 2 SEQVLPFYLVCDESYSME---GPPLDAIN---QELPEIYREIASNPVVADRARLGIIGFS 55 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + PL+ + + I +L T + I + L+ GH Y+ Sbjct: 56 DRAEVLLPLS-DLNDVHS-IPQLAPRGGTNYGAAFALLKSTIEQDVQALKQA--GHRPYR 111 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA----IVYAIGVQAEAADQFLKNCASPD 340 + FLTDG+ + SL G + A G + L+ A+ Sbjct: 112 PCVFFLTDGQPTYEWHQEYRSLT------DSGFPPHPTILAFGFG-DVDATTLQQVAT-F 163 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 R + + A K+++ + Sbjct: 164 RAFIANDDISPAQALREFAKQLLNSVV 190 >gi|295841331|dbj|BAJ07080.1| von Willebrand factor, type A [uncultured bacterium] Length = 334 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 66/172 (38%), Gaps = 23/172 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---I 227 + +V+D S SM+ KL A ++ L+ IK + R GL+ F + + Sbjct: 159 VYLVVDTSGSMSGE------KLAQAQEALSAFLEQIKGDRE-----RVGLIEFETSVKPV 207 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 VQ L + + R+ T + A+ ++ G ++ + Sbjct: 208 VQLDELGNNRAALDLAVQRMEAAGDTALLDAVYEAHQRL---------RKLGDEERINAV 258 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +TDG+ ++ I ++ + E +V+AI +A L+ P Sbjct: 259 VVMTDGQENNSWISLRKLVPQLAEDWPVPVVVFAIAYGDDADIATLRAITEP 310 >gi|238750905|ref|ZP_04612402.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] gi|238710819|gb|EEQ03040.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] Length = 520 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 53/239 (22%), Positives = 89/239 (37%), Gaps = 26/239 (10%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N G+I + LP+ ++ L E S KAKL ++ + L L EN Sbjct: 17 FIKNENGTILMSFIFFLPIFIGLIFLSFEISCFIQKKAKLSDAMEQATLA-----LTVEN 71 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRN-ELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 N ++ NI + F + L E F++ + ++I H DY+ Sbjct: 72 NNIPSSEQEV-----KNNILISSFAHAYLPEETFSEPV--------ITINSSASHMDYHA 118 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-G 186 Y F+ F + S + K +S + I D++ V D S SMN F G Sbjct: 119 DITMSYPAKFLNKAFNLISISDIKLDESAIAKKNTSITAIPTDVVFVTDYSGSMNRDFDG 178 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 +D ++ I + I K++ + + +V P WG + + N Sbjct: 179 TDIDSTDISKVRIVALRRIFKNLHNEIQQNE------NINLVGFVPFTWGTKRTIDNNN 231 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 35/127 (27%), Gaps = 27/127 (21%) Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 KI T + G+ N + +K ++ L+DG+++ Sbjct: 382 SKILNSRANGGTLISSGILSGNNLFKETNNNN----------RKIMVILSDGDDNDNTHA 431 Query: 302 NKESLF----------------YCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDRFYS 344 + C K + I + + K C FY Sbjct: 432 GDNRINKDAPYLNITKKLIDNGMCERIKDNDIRMVFIAIGYTPDENIDWKKCVGEGNFYL 491 Query: 345 VQNSRKL 351 N+++L Sbjct: 492 ASNAQEL 498 >gi|221111402|ref|XP_002161005.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 1100 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 76/190 (40%), Gaps = 21/190 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ ++D S S+ ++ D L + P+ + G++TFS Sbjct: 66 VDIGFIMDSSGSLGKNYKNEKDLLKTLAS-------LFSIKPNGSQA---GVITFSFYTE 115 Query: 229 QTFPLAW--GVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + L + ++R+ TT+ GL A ++F K+E+ + K Sbjct: 116 HSIKLNQFSDQDSFNDAVDRIPLMGHTTRIDKGLRLAQKEMF----KVENGGRPG--VSK 169 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYS 344 ++ LTDG + + +E +++G + AIG+ E ++ +K YS Sbjct: 170 LLVLLTDGSQTQGK-GVIDPAIIADEIRKQGVPIIAIGIGKEINKNELIKIGGGEANTYS 228 Query: 345 VQNSRKLHDA 354 + KL ++ Sbjct: 229 ADDFEKLKES 238 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 68/187 (36%), Gaps = 24/187 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +DM+ +D+S S + L ++IR ++D +P +N GL+T+S Sbjct: 627 VDMVFAMDLSSSSEEI-------LQKQKKAIRSLIDY--HLPSKSNE--LGLITYSDIAN 675 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + + + + +N G + A IF + E + K ++ Sbjct: 676 VNSELTSKFNSEILDKIVNN---GRRQNVASAITVASENIFKIRNNDEKLKKK----QRV 728 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 ++ G+ SS N Y + IG+ + D F KN ++ Sbjct: 729 LVLFVVGKPSS-NYPPVVPERYGKLLEENNVKTIIIGLD-DVGDDFNKNIPGSKLSKYSE 786 Query: 345 VQNSRKL 351 + +L Sbjct: 787 NDTANEL 793 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 68/187 (36%), Gaps = 24/187 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +DM+ +D+S S + L ++IR ++D +P +N GL+T+S Sbjct: 811 VDMVFAMDLSSSSEEI-------LQKQKKAIRSLIDY--HLPSKSNE--LGLITYSDIAN 859 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + + + + +N G + A IF + E + K ++ Sbjct: 860 VNSELTSKFNSEILDKIVNN---GRRQNVASAITVASENIFKIRNNDEKLKKK----QRV 912 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 ++ G+ SS N Y + IG+ + D F KN ++ Sbjct: 913 LVLFVVGKPSS-NYPPVVPERYGKLLEENNVKTIIIGLD-DVGDDFNKNIPGSKLSKYSE 970 Query: 345 VQNSRKL 351 + +L Sbjct: 971 NDTANEL 977 >gi|125602048|gb|EAZ41373.1| hypothetical protein OsJ_25890 [Oryza sativa Japonica Group] Length = 757 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 49/254 (19%), Positives = 89/254 (35%), Gaps = 42/254 (16%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + + E P I + P + + S ++ + +D++ VLDVS SM + Sbjct: 281 VVIKTHCEFPAIARSTPRDNFAVLLHVKAPSIAAEAAPARASVDLVTVLDVSGSMEGY-- 338 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQE 242 KL + R++ + R +V+FS + L G + Sbjct: 339 ----KLALLKRAMGLL----------GPGDRLAVVSFSYSARRVIRLTRMSEGGKASAKS 384 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 + L T GL A K+FD + +A +I L+DG+++ Sbjct: 385 AVESLHADGCTNILEGLVEA-AKVFDGRRYRNAVAS--------VILLSDGQDNYNVNGG 435 Query: 303 KESLFYCNEA-------KRRG---AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRK 350 + N + KR G V+ G + + A + F ++N Sbjct: 436 WGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGTDHDASAMHTIAEETGGTFSFIENQAV 495 Query: 351 LHDAFLR-IGKEMV 363 + DAF + IG + Sbjct: 496 VQDAFAQCIGGLLS 509 >gi|224067090|ref|XP_002302350.1| predicted protein [Populus trichocarpa] gi|222844076|gb|EEE81623.1| predicted protein [Populus trichocarpa] Length = 705 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 81/219 (36%), Gaps = 37/219 (16%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS------------RYEMPFIFCTFPWC 145 + F ++ +S +SI D ++ + + Y + P Sbjct: 248 QGFFPTHSTSVVKSDEVSINDRDFSRNVQVRLLPEVAVISVGRGYETYAVALRVKAPPPL 307 Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + + +++ + +D++ VLDVS SM KL + R++R ++ Sbjct: 308 PSLTTRNSSNSTASLLDPSRRAPIDLITVLDVSASMTGA------KLQMLKRAMRLVISS 361 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEY 261 + S R +V FSS + PL G + + I+RL+ G + L Sbjct: 362 LGSAD------RLSIVAFSSSPKRLLPLKRMTPNGQRSARRIIDRLVCGQGSSVGEALRK 415 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 A + D +E+ + I+ L+DG++ + Sbjct: 416 ATKVLEDRRERNPVAS---------IMLLSDGQDERSST 445 >gi|109088171|ref|XP_001107718.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Macaca mulatta] Length = 946 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 71/201 (35%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N VR+ Sbjct: 311 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFSVIDFNQNVRT------W 358 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + I ++ T L A + +A Sbjct: 359 RNDLISATKTQVSDAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + E + ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENRGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|10334988|gb|AAD46685.2| TadG [Aggregatibacter actinomycetemcomitans] gi|26000721|gb|AAN75217.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 49/278 (17%), Positives = 101/278 (36%), Gaps = 50/278 (17%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL---YT 59 F ++ F N G +I+TA+L + + + ++ + KA+L D + L Sbjct: 12 FSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLLIAE 71 Query: 60 ATKILNQENGNNGKKQ----------KNDFSYRIIKNIWQTD----FRNELRENGFAQDI 105 + ++ ++ +Q DFS ++ W+ + ++ + D Sbjct: 72 DNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLYLRSDDS 131 Query: 106 NNIERSTSLSI-----------IIDDQHKDYNLSAVSRYEMPFIFCTF--PWCANSSHAP 152 N + S+ ++I ++K+ +++ + F PW + Sbjct: 132 NGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSSS 191 Query: 153 LLITSSVKISSKSDIG---------LDMMMVLDVSLSMNDHFGPG-----------MDKL 192 L V I+S +D+MMV D+S SMN +D L Sbjct: 192 QLHDGRVGINSGETYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPNRRIDAL 251 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +I+++L D++ R G V+F++ Q Sbjct: 252 REVVSNIQDILLPKAIRDDISPYNRIGFVSFAAGARQK 289 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 54/140 (38%), Gaps = 22/140 (15%) Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-LEHIAKGHDDYKKYIIFLTDGENSS 297 + + + + T T G+ N + D + K + + ++ ++ L+DGE++ Sbjct: 382 GVADALKEIEPLGGTAVTSGIFIGTNLMTDTNKDPEAAPNKLNTNTRRVLLILSDGEDNR 441 Query: 298 PNIDNKESLF---YCNEAKRR--------------GAIVYAIGVQAEAADQFL--KNCAS 338 P+ + + C + K + A+G DQ + K C Sbjct: 442 PSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIAFVALGFN-PPQDQLIAWKKCV- 499 Query: 339 PDRFYSVQNSRKLHDAFLRI 358 ++Y V + + L DAF +I Sbjct: 500 GKQYYPVNSKQGLLDAFKQI 519 >gi|309790222|ref|ZP_07684794.1| hypothetical protein OSCT_0745 [Oscillochloris trichoides DG6] gi|308227807|gb|EFO81463.1| hypothetical protein OSCT_0745 [Oscillochloris trichoides DG6] Length = 472 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 S + LD++ +LD + SM D ++ SI + +D P +R GLV + Sbjct: 249 DSTLRLDLLFMLDTTGSMGDELY----RIQETIDSIAQRIDAFNPRPQ----IRYGLVAY 300 Query: 224 ----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + + + ++N L + ++ A E Sbjct: 301 KDEGDDYVTRPVAFTTDLAAFRAELNALSAQGGGDTPEAVDAALENSILKME------WS 354 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + + D L EA RG +Y I Sbjct: 355 DTPAVRLVFLVADAGPHIFPQIQFTYLDGAREAVARGVKIYPIA 398 >gi|254787962|ref|YP_003075391.1| PKD domain-containing protein [Teredinibacter turnerae T7901] gi|237687416|gb|ACR14680.1| PKD domain protein [Teredinibacter turnerae T7901] Length = 1083 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 67/231 (29%), Gaps = 58/231 (25%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD ++++D S SM P +L A + I + D G++ F Sbjct: 413 LDSLLIIDSSGSMR-TTDPQSRRLEAANTYVN-----ISAAEDK-----IGIIDFDGSAR 461 Query: 229 QTFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI--------------- 266 P ++ IN + T+ L A + + Sbjct: 462 VVKPFTLLGQQGSTERIQLENAINGIDAVGGTEIAGSLGLACSTLMASWQDDIIEQQHMV 521 Query: 267 -----------------FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 +A+ + KK I LTDG+ S + Sbjct: 522 NDIKALVEEFPNNTISHSNARRAAIPGIAHAINSKKIAILLTDGDTPSSYSQANQCFI-- 579 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR--KLHDAFLRI 358 G ++Y IG ++ + +Y +S L+ A+ +I Sbjct: 580 ----ENGWMLYTIGFGGANGEKLAPLAEASGGYYIAADSSLLDLNCAYQQI 626 >gi|312879450|ref|ZP_07739250.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] gi|310782741|gb|EFQ23139.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] Length = 813 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 47/278 (16%), Positives = 85/278 (30%), Gaps = 41/278 (14%) Query: 110 RSTSLSIIIDDQHKDYN----LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS 165 S + Y+ ++ Y T A ++ L + + + Sbjct: 269 DGVSGDTTVKPSRVQYSFSKPVTTSGDYLNAVTPGTHLAAAQANLKILWLAEATMVY--- 325 Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +VLD S SM + D + + +++ S+P V G+V F Sbjct: 326 ------QIVLDRSGSMGTNPDKP-DDPTPLSYAKTAACNLVDSLP---KNVYVGIVQFDD 375 Query: 226 KIVQTFPLAW----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 Q +P+ + IN L G +T YA ++ K L Sbjct: 376 STSQVYPITLIASNDAAAAATRAAAKAAINGLTSGGSTAIYDAASYALSQFVAQKTALSA 435 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFL 333 G LTDGE++S + E + K + +G A +A L Sbjct: 436 DLLG------VTYLLTDGEDNSSSKSVGEVIGEYQAQK---VPLITVGYGAGGQAGSFAL 486 Query: 334 KNCA--SPDRFYS-VQNSRKLHDAFLRIGKEMVKQRIL 368 A + ++++ + L F + L Sbjct: 487 TQLADGTGGQYFASPVDQAALQQVFFAALGKTSDAVSL 524 >gi|157273368|gb|ABV27267.1| von Willebrand factor type A [Candidatus Chloracidobacterium thermophilum] Length = 324 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 79/226 (34%), Gaps = 41/226 (18%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + ++ S+ + + + V+D S SM ++ + R+ + + Sbjct: 82 VEQQIEYFSRDEAPVSLGFVVDTSGSMRPRRAKVIEAVKFLARAAK----------PGDE 131 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 LV F +K I+E ++ +++G T ++ E Sbjct: 132 FF---LVDFKNKAELAEEFTPRPADIEEAVDNIVWGGGTALLDAIQL----------SAE 178 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE------- 327 + K + +K I+ +DG++ D ++ + + VY +G + Sbjct: 179 YADKEGKNRRKAIVVFSDGDDRDSYYDRRQLIKL---LQEYQVQVYIVGFPDDDDDGGLF 235 Query: 328 ------AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A Q +K+ A + R + ++ +L + I ++ Q Sbjct: 236 GRSTRKRAVQLIKDIANETGGRAFFPKSVDELPEIVRTINADLRTQ 281 >gi|255009406|ref|ZP_05281532.1| hypothetical protein Bfra3_09717 [Bacteroides fragilis 3_1_12] gi|313147165|ref|ZP_07809358.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313135932|gb|EFR53292.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 341 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 65/196 (33%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F A P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 LVFTAIGLFAVLLARPQFGS---KLETVKRKGVEVMIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R I +++D +++ + G++ F+ P+ + + + +K Sbjct: 117 RLISKLVDGMEN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A + + + I+ +TDGEN ++G Sbjct: 170 TAIGAAIS-------LAARSFTPQEGVGRAIVVITDGENHEGGAVEAAKEAA-----KKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVNVLGVGLPDGAPI 233 >gi|91082533|ref|XP_973629.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] gi|270007557|gb|EFA04005.1| hypothetical protein TcasGA2_TC014154 [Tribolium castaneum] Length = 824 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 43/222 (19%), Positives = 81/222 (36%), Gaps = 42/222 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVTFSSKIV 228 ++ VLD S SM+ K+ +++ +L +K D+ N+VR G + + Sbjct: 304 IVFVLDHSGSMSG------RKIDQLIEAMQNILTDLKET-DLFNIVRFGDLAMVWDVSQN 356 Query: 229 QTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA------ 277 Q L ++ +IN + T+ +E A I D Sbjct: 357 QFTQLPNFNEYGNLEPHLREINLPRAVNGTE--ENIEAAKKIIEDKSRLGMTNMMYGLEV 414 Query: 278 -------KGHDDYKKY---IIFLTDGENS---SPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + KY I+FLTDG + S + ++ N K++ A ++++ Sbjct: 415 GLFLIKRTQEETPDKYQPMIVFLTDGHPNAGMSGRDEITNTVTSLNSGKKK-ASIFSLSF 473 Query: 325 QAEAADQFLKNCAS-----PDRFYSVQNSR-KLHDAFLRIGK 360 A +FL+ +S Y ++ +L D + I Sbjct: 474 GDFADKRFLRKISSKNSGFSRHIYESSDASLQLQDFYRAISA 515 >gi|241620324|ref|XP_002408644.1| calcium activated chlorine channel, putative [Ixodes scapularis] gi|215503005|gb|EEC12499.1| calcium activated chlorine channel, putative [Ixodes scapularis] Length = 704 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 81/215 (37%), Gaps = 36/215 (16%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 P+ + K + ++ VLDVS SM +K+ + ++ L+ ++ Sbjct: 44 TDPVPRMPTKFRIVKGQGHIRIVFVLDVSGSMGLE-----NKINMLRQAASRSLE--DNV 96 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN---RLIFGSTTKSTPGL-----EY 261 PD ++ G++TFS + ++ I I +T L ++ Sbjct: 97 PDGSD---VGIITFSDNATVVAGMRTLSAATRQAIKNAVPSIARGSTAIGKALMTSVQKH 153 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIV 319 ++ E E+ A ++ +TDGE + P D +L ++ V Sbjct: 154 GPQELERNGETAENAA---------LLLMTDGEENEPPYINDVLPTLL------QKRLRV 198 Query: 320 YAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHD 353 +++ V EA D + + Y + N+ KL D Sbjct: 199 FSVPVGKEADDGLRVLSERTGENVYPITNTTKLAD 233 >gi|194221223|ref|XP_001915876.1| PREDICTED: inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Equus caballus] Length = 834 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 68/201 (33%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM K+ ++ +++D + N LV F+ + Q Sbjct: 273 VIFVIDQSGSMAG------RKIQQTREALIKIVDDLGPKDQFN------LVCFNEEATQW 320 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A ++ + ++ T + A + A ++ A Sbjct: 321 KPSLVPASAENMKEARNFAAGIMARGGTNINDAVLLAVQLLERANKQELLPAGSVS---- 376 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II LTDG+ + + EA ++ +G + FL+ A + Sbjct: 377 LIILLTDGDPTVGETNRANIQKNVQEAISGQCSLFCLGFGFHVSYAFLEKLALDNGGLAR 436 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + ++ +L D + + Sbjct: 437 RIYEDSDSALQLQDFYQEVAN 457 >gi|257453795|ref|ZP_05619073.1| von Willebrand factor type A domain protein [Enhydrobacter aerosaccus SK60] gi|257448722|gb|EEV23687.1| von Willebrand factor type A domain protein [Enhydrobacter aerosaccus SK60] Length = 260 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 30/159 (18%), Positives = 56/159 (35%), Gaps = 6/159 (3%) Query: 171 MMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++VLD+S SM G G ++ + I + N VR + + Sbjct: 44 CVLVLDLSGSMAIRSGNGDKRRIDMLNEGIEAFYHDLMKDETARNRVRL-AIVIVGGVND 102 Query: 230 TFPLAWGVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 T L + + T G+ A N I + L + ++I Sbjct: 103 TAELMMDWTDAIDFFPIKFRENGMTPLGQGMLLALNLIEQERINLRDNGINYTRP--WVI 160 Query: 289 FLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQA 326 +TDG + + ++ C++A + I+Y I + A Sbjct: 161 AMTDGLPTDSQDVWQAAINQCHQAEQNNQCIIYPIAIDA 199 >gi|94732992|emb|CAK03801.1| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1056 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 71/189 (37%), Gaps = 34/189 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + D NV R F+ K Sbjct: 214 DMVILVDVSGSVSGL------TLKLIKASVTEMLDTLSD-DDYVNVAR-----FNEKAEA 261 Query: 230 TFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 P + +E + ++ TT G +A+N++ + Sbjct: 262 VVPCFDHLVQANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLN------KTNVPRA 315 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCAS 338 + K I+ TDG D + +F + V+ V D L+ C++ Sbjct: 316 NCNKIIMLFTDG-----GEDRAQDIFEQYNWPNKTVRVFTFSVGQHNYDVTPLQWIACSN 370 Query: 339 PDRFYSVQN 347 ++ +++ Sbjct: 371 KGYYFEIRS 379 >gi|147902754|ref|NP_001082889.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Danio rerio] gi|94732178|emb|CAK04720.1| novel protein similar to vertebrate calcium channel voltage-dependent alpha 2 delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1052 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 71/189 (37%), Gaps = 34/189 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + D NV R F+ K Sbjct: 200 DMVILVDVSGSVSGL------TLKLIKASVTEMLDTLSD-DDYVNVAR-----FNEKAEA 247 Query: 230 TFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 P + +E + ++ TT G +A+N++ + Sbjct: 248 VVPCFDHLVQANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLN------KTNVPRA 301 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCAS 338 + K I+ TDG D + +F + V+ V D L+ C++ Sbjct: 302 NCNKIIMLFTDG-----GEDRAQDIFEQYNWPNKTVRVFTFSVGQHNYDVTPLQWIACSN 356 Query: 339 PDRFYSVQN 347 ++ +++ Sbjct: 357 KGYYFEIRS 365 >gi|42524419|ref|NP_969799.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] gi|39576628|emb|CAE80792.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] Length = 469 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 65/223 (29%), Gaps = 28/223 (12%) Query: 149 SHAPLLITSSVKISSKSDIGLDM-MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 S P + VK + + V+D S SM GP K+ V + + L Sbjct: 47 SKIPANASGEVKAEDLPSTAIPLVEYVIDSSGSMGQLMGPKKTKIYVLKKLLARYL---- 102 Query: 208 SIPDVNNVVRSGLVTF--------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 + SGL + P + I+ + T L Sbjct: 103 -MSQWTEKTSSGLRVIGSRRKKDCKDNYLAIEPAQSKLGAIEGIVKGFEPVGMTPIGQAL 161 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + AY + +EH K ++ TDGE + K + K Sbjct: 162 KDAY-------KDVEHYKGP-----KRVVLFTDGEETCGQDPCKIAAEL--SGKDVDLKF 207 Query: 320 YAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + + + L A + KL + F + K++ Sbjct: 208 FVVAFGLQNQPDVLDKLACIGDMSQADDEEKLEELFQDLDKQL 250 >gi|291523143|emb|CBK81436.1| fibro-slime domain [Coprococcus catus GD/7] Length = 1745 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 60/195 (30%), Gaps = 36/195 (18%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 L ++S + + +++VLD S SM G D + + +D +K+ Sbjct: 868 ITLGASTSGREAGTEAKAASVVLVLDRSGSM------GADGMTALVNAADTFIDTLKTAS 921 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ----EKINRLI--------FGSTTKSTPG 258 + +V F+ + + EK+ + T Sbjct: 922 PDSQ---VAVVYFNGTQDEDDNTTTSKNFTKLNTDEKVKSIKDFLSNNGYSYGGTPMGDA 978 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID-----NKESLFYCNEAK 313 LE A + + +KY++F TDG + D S C Sbjct: 979 LEKAKGLLDADQTGN----------QKYVLFFTDGLPGHSSDDAFNCMVANSAVNCATDI 1028 Query: 314 RRGAIVYAIGVQAEA 328 + A +Y +G Sbjct: 1029 KANATIYTVGYHLSG 1043 >gi|301788516|ref|XP_002929674.1| PREDICTED: complement C2-like isoform 3 [Ailuropoda melanoleuca] Length = 617 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 81/217 (37%), Gaps = 28/217 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + D + S M+D I S V Sbjct: 112 KIHIQRSGHLNLYLLLDASQSVAE------DDFQIFKESAILMVDRIFSFEIN---VSVA 162 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRL--------IFGSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK L + + E IN L G+ T + L + + + Sbjct: 163 IITFASKPQVVMSVLYDNSRDVTEVINSLNNINYKDHENGTGTNTYAALNSVHIMMNNQM 222 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ DN + N+ + +YAIGV Sbjct: 223 QRLGMKTAAWQEIRHAIILLTDGKSNMGGSPKLAVDNIREILNINQQRSDYLDIYAIGVG 282 Query: 326 AEAAD-----QFLKNCASPDRFYSVQNSRKLHDAFLR 357 D + + +Q++ L+ F Sbjct: 283 KLDVDWRELNELGSKKDGERHAFILQDTEALYQVFEH 319 >gi|301788514|ref|XP_002929673.1| PREDICTED: complement C2-like isoform 2 [Ailuropoda melanoleuca] Length = 749 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 81/217 (37%), Gaps = 28/217 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + D + S M+D I S V Sbjct: 244 KIHIQRSGHLNLYLLLDASQSVAE------DDFQIFKESAILMVDRIFSFEIN---VSVA 294 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRL--------IFGSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK L + + E IN L G+ T + L + + + Sbjct: 295 IITFASKPQVVMSVLYDNSRDVTEVINSLNNINYKDHENGTGTNTYAALNSVHIMMNNQM 354 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ DN + N+ + +YAIGV Sbjct: 355 QRLGMKTAAWQEIRHAIILLTDGKSNMGGSPKLAVDNIREILNINQQRSDYLDIYAIGVG 414 Query: 326 AEAAD-----QFLKNCASPDRFYSVQNSRKLHDAFLR 357 D + + +Q++ L+ F Sbjct: 415 KLDVDWRELNELGSKKDGERHAFILQDTEALYQVFEH 451 >gi|301788512|ref|XP_002929672.1| PREDICTED: complement C2-like isoform 1 [Ailuropoda melanoleuca] gi|281345620|gb|EFB21204.1| hypothetical protein PANDA_019912 [Ailuropoda melanoleuca] Length = 748 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 81/217 (37%), Gaps = 28/217 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + D + S M+D I S V Sbjct: 243 KIHIQRSGHLNLYLLLDASQSVAE------DDFQIFKESAILMVDRIFSFEIN---VSVA 293 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRL--------IFGSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK L + + E IN L G+ T + L + + + Sbjct: 294 IITFASKPQVVMSVLYDNSRDVTEVINSLNNINYKDHENGTGTNTYAALNSVHIMMNNQM 353 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ DN + N+ + +YAIGV Sbjct: 354 QRLGMKTAAWQEIRHAIILLTDGKSNMGGSPKLAVDNIREILNINQQRSDYLDIYAIGVG 413 Query: 326 AEAAD-----QFLKNCASPDRFYSVQNSRKLHDAFLR 357 D + + +Q++ L+ F Sbjct: 414 KLDVDWRELNELGSKKDGERHAFILQDTEALYQVFEH 450 >gi|256376610|ref|YP_003100270.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255920913|gb|ACU36424.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 559 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 40/194 (20%), Positives = 65/194 (33%), Gaps = 28/194 (14%) Query: 174 VLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 VLDVS SM D L T S + VV L+ F+ + Sbjct: 381 VLDVSGSMEGDRMAQLKRALSRLTGSDESLTGQYCRFRSREEVV---LLPFNQAPLAPQE 437 Query: 233 LAWGV-------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + I+ + L+ G T LE AY + + E+ Sbjct: 438 FSVDVGAPRETLERIRGAVEGLVAGGDTAVYDSLERAYGVVGSSPERFTS---------- 487 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYS 344 ++ +TDGEN + Y A+ + V+ I + + + + + Sbjct: 488 -VVLMTDGENRVG----RTFAEYREFARGKAVPVFPIVFGEASRAKMGEIAEITGGAVWD 542 Query: 345 VQNSRKLHDAFLRI 358 NS L AF +I Sbjct: 543 A-NSESLERAFCQI 555 >gi|309361725|emb|CAP28912.2| hypothetical protein CBG_09720 [Caenorhabditis briggsae AF16] Length = 675 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 46/261 (17%), Positives = 92/261 (35%), Gaps = 40/261 (15%) Query: 115 SIIIDDQHKDYNLSAVS-RYEM----PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 + I + K++ + + S + M P P ++ + T+ I Sbjct: 438 NEIEELNGKNFKVRSRSVHFAMTEKPPVTTAMNPMKFFTTSRTPITTAKSLIPYSCTA-- 495 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---- 225 D+ ++D+S D +D A S+ P VR GL+++S Sbjct: 496 DVFFLVDLSQGTGDKSQQYLDIAASAISSL----------PISQEAVRVGLISYSGPGRT 545 Query: 226 KIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + E++ + G TT++ + YA E + H A+ + K Sbjct: 546 HVRVYLDKHNDKEKLIEEMFLMERHGGTTRTADAIRYATKIF----EGMAHPARKN--VK 599 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ----AEAADQFLKNCASPD 340 K ++ TDG + D A+ +G + A+ V+ +Q + Sbjct: 600 KVLVVFTDGYSQDHPRDAARG------ARAKGLQLIAVAVKDRLAPPDEEQLAEIGGHAK 653 Query: 341 RFYSVQNSRKLHDAFLRIGKE 361 + N R+L + IG + Sbjct: 654 NVFISPNGRELRE--KIIGTQ 672 >gi|308472935|ref|XP_003098694.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] gi|308268294|gb|EFP12247.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] Length = 405 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 43/214 (20%), Positives = 71/214 (33%), Gaps = 28/214 (13%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 ++ + P + S++ LD++ V+D S M + G+ + + S+ I Sbjct: 22 SADYDPASYVDRSCGTDLSNLWLDVIAVVDNSRGMTNK---GLSYVASSIISVFGKNTRI 78 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA---------WGVQH-IQEKINRLIFGSTTKST 256 S R GLVT++S Q L +G+ + +N TT Sbjct: 79 GSSSAEPRTTRLGLVTYNSVATQNADLNQYQSIEDAYYGIYGALSTTVNTTESYLTT--- 135 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 GL A + Y+K II N D + N K G Sbjct: 136 -GLNAAVELF-----SRQSFRSNRQHYRKVIIVYASEYNGRGEFDP---VPIANRLKASG 186 Query: 317 AIVYAIGVQAEAADQFLK---NCASPDRFYSVQN 347 + I + + L+ ASP +S N Sbjct: 187 VNIITIAYEQPGSAGLLQGLSQIASPGFSFSGDN 220 >gi|260297|gb|AAB24261.1| type VI collagen alpha 3 chain [Homo sapiens] Length = 205 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 67/195 (34%), Gaps = 22/195 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S ++ + + + + D++KS+ N LV F Sbjct: 2 KNGAAADIIFLVDSSWTIGEEHFQLVREF---------LYDVVKSLAVGENDFHFALVQF 52 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I+ + + T T I + D Sbjct: 53 NGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTG---KGLEYIMAKHLTKAAGSLAGD 109 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + I+ LTDG + E K V+AIGV+ + + P Sbjct: 110 GVPQVIVVLTDGHSKDGLALPSA------ELKSADVNVFAIGVEDADEGALKEIASEPLN 163 Query: 340 DRFYSVQNSRKLHDA 354 ++++N LHD Sbjct: 164 MHMFNLENFTSLHDI 178 >gi|3024046|sp|P97278|ITIH1_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|1694688|dbj|BAA13938.1| inter-alpha-trypsin inhibitor heavy chain 1 [Mesocricetus auratus] Length = 914 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L +K ++V G S Sbjct: 290 TNMSKNLVFVIDISGSMEGQ------KVKQTKEALLKILGDVKPGDSF-DLVLFGSRVQS 342 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + R T GL + A+ ++ Sbjct: 343 WKGSLVPATQANLQAAQDFVRRFSLAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 399 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-----KNCASP 339 +I LTDGE + D + L A R +Y +G + FL +N Sbjct: 400 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFNFLEVMSMENSGWA 458 Query: 340 DRFYSVQNSRKLHDAFLR 357 R Y ++ + F Sbjct: 459 QRIYEDHDATQQLQGFYN 476 >gi|51597046|ref|YP_071237.1| hypothetical protein YPTB2727 [Yersinia pseudotuberculosis IP 32953] gi|51590328|emb|CAH21965.1| putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 472 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 71/200 (35%), Gaps = 26/200 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S +++ +V+D S SM+ G ++K A ML+ ++ +V Sbjct: 90 STRRSPINLALVIDRSTSMS---GERIEKAREAAILAVNMLNTTDTLS---------VVA 137 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P + I + + T G+ ++ +H+ + Sbjct: 138 YDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQV------DKHLNRE 191 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + II ++DG+ ++ E A ++G + IG+ + + + A Sbjct: 192 QVNR---IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGY 248 Query: 338 SPDRFYSVQNSRKLHDAFLR 357 S V NS L AF + Sbjct: 249 SDGNHTFVANSADLEKAFTK 268 >gi|134093164|gb|ABO53024.1| matrilin 4 isoform 1 precursor, 3 prime [Chlorocebus aethiops] Length = 243 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 70/173 (40%), Gaps = 22/173 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 6 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 56 Query: 229 QTFPL-AWGVQ-HIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 FPL +G +++ + + T + L + F + A + Sbjct: 57 TEFPLGRYGTAVEVKQAVLAMEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---VPR 113 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + TDG + + + AK G ++YA+GV + L+ AS Sbjct: 114 VGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEAE-LREIAS 159 >gi|109094740|ref|XP_001104627.1| PREDICTED: matrilin-4-like, partial [Macaca mulatta] Length = 222 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 67/175 (38%), Gaps = 26/175 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 6 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 56 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + A Sbjct: 57 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALN---V 111 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + TDG + + + AK G +YA+GV + L+ AS Sbjct: 112 PRVGLVFTDGRSQD------DISVWAARAKEEGIAMYAVGVGKAVEAE-LREIAS 159 >gi|34540039|ref|NP_904518.1| von Willebrand factor type A domain-containing protein [Porphyromonas gingivalis W83] gi|34396350|gb|AAQ65417.1| von Willebrand factor type A domain protein [Porphyromonas gingivalis W83] Length = 1226 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 76/231 (32%), Gaps = 33/231 (14%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 ++ E + +D + I + R++ FA+++ + Sbjct: 56 LIQAEIVYQSVSEHSDLVISPVNEIRPANRFPSHRKSFFAENLRASPPVVPV-------- 107 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 AV +Y +P P N+ L IT+ + + +D S SM Sbjct: 108 ------AVDKYAVPVANPMDPENPNAWDVTLKITTKAVTVPVDVVMV-----IDQSSSMG 156 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 G + +L A S + + + VR LV++ + + + + Sbjct: 157 ---GQNIARLKSAIASGQRFVKKMLPKGTATEGVRIALVSYDHEPHRLSDFTKDTAFLCQ 213 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 KI L T + GL+ A N + K+II ++DG Sbjct: 214 KIRALTPIWGTHTQGGLKMARNIMA-----------TSTAVDKHIILMSDG 253 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 13/61 (21%), Positives = 23/61 (37%), Gaps = 3/61 (4%) Query: 305 SLFYCNEAKRRGAIVYAIGVQAEA---ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 ++ AK G ++ IG A+ LK A+ + + L AF I + Sbjct: 361 AINEAQFAKNSGYTIHTIGYDLGDFALANNSLKLTATDENHFFTATPANLAAAFDNIAQT 420 Query: 362 M 362 + Sbjct: 421 I 421 >gi|227833165|ref|YP_002834872.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] gi|227454181|gb|ACP32934.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 521 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 35/203 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVV------RSGLVTFS 224 +VLD S SM ++ + S+ ++D +PD V + L+ +S Sbjct: 334 ALVLDTSGSMEGE------RMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYS 387 Query: 225 SKIVQTFPLAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 S+ Q + + +++ RL+ T + + A++++ + + Sbjct: 388 SEPQQPTRARVDKDKPATTKELADRVERLVADGDTATFEAVLNAFDEVDTSGGDIGT--- 444 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKNCA 337 ++ +TDGE + + Y + + V+ I EA Q ++ A Sbjct: 445 --------VVLMTDGEVTRGRTFAQFKDAYAQLPEDKKEIPVFVILYG-EANIQEMEELA 495 Query: 338 --SPDRFYSVQNSRKLHDAFLRI 358 + + + N L AF I Sbjct: 496 QLTGGKTFDALN-GDLAAAFEEI 517 >gi|22127367|ref|NP_670790.1| hypothetical protein y3493 [Yersinia pestis KIM 10] gi|45442761|ref|NP_994300.1| hypothetical protein YP_2999 [Yersinia pestis biovar Microtus str. 91001] gi|108809099|ref|YP_653015.1| hypothetical protein YPA_3108 [Yersinia pestis Antiqua] gi|108810706|ref|YP_646473.1| hypothetical protein YPN_0541 [Yersinia pestis Nepal516] gi|150260286|ref|ZP_01917014.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162419964|ref|YP_001604884.1| hypothetical protein YpAngola_A0266 [Yersinia pestis Angola] gi|165939877|ref|ZP_02228416.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166009017|ref|ZP_02229915.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211928|ref|ZP_02237963.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167466384|ref|ZP_02331088.1| hypothetical protein YpesF_00480 [Yersinia pestis FV-1] gi|218927875|ref|YP_002345750.1| hypothetical protein YPO0684 [Yersinia pestis CO92] gi|229837366|ref|ZP_04457529.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|229840578|ref|ZP_04460737.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842872|ref|ZP_04463024.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229900904|ref|ZP_04516028.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|294502750|ref|YP_003566812.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|21960452|gb|AAM87041.1|AE013952_8 hypothetical [Yersinia pestis KIM 10] gi|45437627|gb|AAS63177.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|108774354|gb|ABG16873.1| membrane protein [Yersinia pestis Nepal516] gi|108781012|gb|ABG15070.1| putative membrane protein [Yersinia pestis Antiqua] gi|115346486|emb|CAL19360.1| putative membrane protein [Yersinia pestis CO92] gi|149289694|gb|EDM39771.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162352779|gb|ABX86727.1| conserved hypothetical protein [Yersinia pestis Angola] gi|165912188|gb|EDR30826.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|165992356|gb|EDR44657.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206674|gb|EDR51154.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|229682243|gb|EEO78335.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|229690139|gb|EEO82196.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229696944|gb|EEO86991.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705489|gb|EEO91499.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|262364727|gb|ACY61284.1| hypothetical protein YPD8_0594 [Yersinia pestis D182038] gi|294353209|gb|ADE63550.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|320016753|gb|ADW00325.1| putative fimbrial anchor [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 518 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 47/240 (19%), Positives = 85/240 (35%), Gaps = 35/240 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N +G+I + L+PV ++ L E SH +AKL L+ + L +T+ Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE------ 70 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFR-NELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 +ND + N T + + L F+Q + + +YN Sbjct: 71 ----NNYRNDRASNNRNNYLVTSYAQSYLPSERFSQ------PRVVNTYNESLGYTEYNA 120 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG- 186 S Y++ + + + ++ K S +D++ V D S SM+ FG Sbjct: 121 SLQMNYQLALLNSYLKQTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGD 180 Query: 187 ----PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 + KL ++ + I S +N + P +WG + I Sbjct: 181 IERNNRITKLDELKAIFVKLNNRIFSNDGIN-------------TIGFVPFSWGTKRISA 227 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 28/133 (21%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + +I + T ++ G+ + ++ + K +I L+DG++ Sbjct: 376 NSKGDINEILNMKAEGGTLASSGILVGNKMLTES-----------QNNNKLMIILSDGDD 424 Query: 296 S----SPNIDNKESLF----------YCNEAKRRGAIVYAIGVQAEAADQFL---KNCAS 338 + S D K + C + K G + IG+ + + K+C Sbjct: 425 NTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVG 484 Query: 339 PDRFYSVQNSRKL 351 FY +N+ +L Sbjct: 485 TGNFYLAKNAHEL 497 >gi|297624820|ref|YP_003706254.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297166000|gb|ADI15711.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 802 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 77/240 (32%), Gaps = 24/240 (10%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P F W A L +T+ ++ + + + +++V+DVS SM +L +A Sbjct: 336 PDAFGLGGWYRTPVEAVLPVTTDLRTEVEVPL-VALVIVMDVSQSMTAGNPS---RLELA 391 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 ++D+ G +TFS + F + ++ I + Sbjct: 392 KEGAVGVVDL------AYERDMLGFITFSDRPEWVFRPRQATLQGKREMTAAILNVAPQG 445 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-----CN 310 E AY + D + + K++I LTDG+ + Sbjct: 446 GTIFEPAYREALD-------VLMAQEAAVKHVIVLTDGKFADGTGPFSRGPAPDFGRLAA 498 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 +R G I + A Q L A R+Y + L F + + Sbjct: 499 LGRRSGITTSTIAIGDGADPQQLTTIARAGGGRYYEALDVSTLPRIFTTEALSATRSLLR 558 >gi|225575062|ref|ZP_03783672.1| hypothetical protein RUMHYD_03151 [Blautia hydrogenotrophica DSM 10507] gi|225037732|gb|EEG47978.1| hypothetical protein RUMHYD_03151 [Blautia hydrogenotrophica DSM 10507] Length = 393 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 74/213 (34%), Gaps = 46/213 (21%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 ++ +DM+++LD S SM P + + ++ R+ Sbjct: 101 PPMENHSAVDMVLLLDGSGSMQGKKEPCVQATEAL-------------LEQMDEQSRAQA 147 Query: 221 VTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V F+S ++ L G + + + + T+ L +A N + + KE Sbjct: 148 VAFASCVLGNTELLPLDEEGRETLIKFVEGTDIIGGTEFGQPLTFALNSLEEKKETGRIQ 207 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-----EAADQ 331 A +I L+DGE P +E K + ++Y I + A E A Q Sbjct: 208 A---------VILLSDGEGPFPETLEEEY-------KEKDVVLYTIRMDAGEQETETARQ 251 Query: 332 FLKNCASPDRF---YSVQ-----NSRKLHDAFL 356 ++ F V ++ +L AF Sbjct: 252 LVQFAQKTGGFDTKIPVDEKGQISTDELTKAFR 284 >gi|121582838|ref|YP_973280.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596100|gb|ABM39538.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 240 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 74/203 (36%), Gaps = 18/203 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK- 226 L ++++ DVS SM+++ K+ +++EM+ + ++ GL+TF + Sbjct: 28 PLPVIVLADVSGSMSEN-----GKIDALNVALKEMILSFGKESGLRAEIQVGLITFGGRE 82 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + PL + + T A K+ + KE+ Y+ Sbjct: 83 AHEHLPLV--AAKVIGGVEAFKANGGTPMGSAFALA-RKLLEDKEQ-----IPSRAYRPV 134 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYS 344 +I ++DG + L + + A +A+ + A+A L + + Sbjct: 135 LILVSDGAPTDAWEAPLADLKASE--RGQKATRFAMAIGADADLDMLAQFPNDREAPVFK 192 Query: 345 VQNSRKLHDAFLRIGKEMVKQRI 367 +R + F + +V + Sbjct: 193 THEARDIGRFFRAVTMSVVSRST 215 >gi|56696061|ref|YP_166415.1| hypothetical protein SPO1165 [Ruegeria pomeroyi DSS-3] gi|56677798|gb|AAV94464.1| conserved domain protein [Ruegeria pomeroyi DSS-3] Length = 257 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 61/185 (32%), Gaps = 33/185 (17%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 V ++D D M+V D S SM + G+D+ + RE L ++P + + R Sbjct: 29 PVPALGEADCTQDAMIVFDASGSMAEMGYNGLDRPRIL--DAREALH--DALPRIAALRR 84 Query: 218 SGLVTFSSKIVQT-------------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 GLVT+ + + P I I+ + T T + A Sbjct: 85 LGLVTYGAAMDGDADGDLCKRVSMPFTPSPNAAGQILNLIDAIEPDGNTALTDAVNLAAR 144 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIG 323 I+ +TDG+ + + A+ G V+ IG Sbjct: 145 VFDQPPRPG------------VIVLVTDGDETCGGAPCA---LAADLARDTPGLTVHVIG 189 Query: 324 VQAEA 328 + + Sbjct: 190 FRVRS 194 >gi|205374347|ref|ZP_03227145.1| hypothetical protein Bcoam_14574 [Bacillus coahuilensis m4-4] Length = 1083 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 66/206 (32%), Gaps = 34/206 (16%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 S ++ + NLS + P +N+ + L+ + +D++ Sbjct: 22 STNLASASNNVTVNLSVTPSQSV----VILPTTSNAKASLNLMLTPTGNPQTERDPIDLV 77 Query: 173 MVLDVSLSMNDH--FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 V D S SM+ + ++ A ++ L N R G V FSS Sbjct: 78 FVFDKSGSMDFKVASNSSVKRIDSAKSAMTNALMFFDG---QNTSDRFGFVPFSSNANTD 134 Query: 231 -FPLA----WG--------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 L WG +Q I K L T T L+ A + + Sbjct: 135 VVSLTDSSGWGSSSYTNSKLQTIHNKTMGLSASGGTNYTEALDVASKLFDSSSKD----- 189 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK 303 K IIFLTDG + D K Sbjct: 190 -------KNIIFLTDGTPTFSFSDEK 208 >gi|153950207|ref|YP_001400285.1| von Willebrand factor type A domain-containing protein [Yersinia pseudotuberculosis IP 31758] gi|152961702|gb|ABS49163.1| von Willebrand factor type A domain protein [Yersinia pseudotuberculosis IP 31758] Length = 460 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 71/200 (35%), Gaps = 26/200 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S +++ +V+D S SM+ G ++K A ML+ ++ +V Sbjct: 78 STRRSPINLALVIDRSTSMS---GERIEKAREAAILAVNMLNTTDTLS---------VVA 125 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P + I + + T G+ ++ +H+ + Sbjct: 126 YDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQV------DKHLNRE 179 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + II ++DG+ ++ E A ++G + IG+ + + + A Sbjct: 180 QVNR---IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGY 236 Query: 338 SPDRFYSVQNSRKLHDAFLR 357 S V NS L AF + Sbjct: 237 SDGNHTFVANSADLEKAFTK 256 >gi|297286920|ref|XP_001082067.2| PREDICTED: collagen alpha-4(VI) chain-like, partial [Macaca mulatta] Length = 1624 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 47/197 (23%), Positives = 80/197 (40%), Gaps = 31/197 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ D L + I+E++ + + P+ V+ G++ +S KI Sbjct: 843 DIYFLIDGSGSI-----NPQDFLEM-KAFIKEVIKMFQIGPNR---VQFGVIQYSDKIQS 893 Query: 230 TFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L+ V ++ I+ G G + +Y+ Sbjct: 894 QFILSQYPSVAELKVAID--------NIQQGGGGTATGEALNNMTQVFADTGRINVARYL 945 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I +TDG++S P + E L + G I+YAIGV+ EA LK A F+ + Sbjct: 946 IVITDGKSSDPVAEAAEGL------RENGVIIYAIGVR-EANIDELKEIAKDKIFFVYE- 997 Query: 348 SRKLHDAFLRIGKEMVK 364 D I KE+V+ Sbjct: 998 ----FDLLKDIQKEVVQ 1010 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 71/196 (36%), Gaps = 25/196 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S++ R + M++ D ++ GL+ FSS + Sbjct: 1024 DIIFLIDGSESISPE------DFEKMKRFVASMVNQSNIGTDG---IQIGLLQFSSIPQE 1074 Query: 230 TFPLAWGVQHIQEK---INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + T++ L + +K G ++Y Sbjct: 1075 EFRLNQYSSKVDIYSAIFDVQQMRDGTRTGKALNFTLPFFDSSKG-------GRPSVQQY 1127 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I +TDG I ++L + + I++AIGV Q L+ D+ Y Sbjct: 1128 LIVITDGVAQDNVIIPAKAL------RDKNIIIFAIGVGEAKKSQLLEITNDEDKVYHDV 1181 Query: 347 NSRKLHDAFLRIGKEM 362 N L + I ++ Sbjct: 1182 NFEALQNLEKEILSKV 1197 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 50/143 (34%), Gaps = 18/143 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + ++ + D VR GL ++ I Sbjct: 235 DIVFLVDSSTSIGPQ------NFQKVKNFLYSVVLGLDISSD---HVRVGLAQYNDNIYP 285 Query: 230 TFPLAWG--VQHIQEKINRLIF-GSTTKSTPGLEYA-YNKIFDAKEKLEHIAKGHDDYKK 285 F L + E+I L + T + LE+ N + + + Sbjct: 286 AFQLNQHPLKSTVLEQIQNLPYRTGGTNTGSALEFIRTNYLTEESGSRAKDRVP-----Q 340 Query: 286 YIIFLTDGENSSPNIDNKESLFY 308 +I +TDGE++ + + L Sbjct: 341 IVILVTDGESNDEVQEVADRLKE 363 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 66/183 (36%), Gaps = 18/183 (9%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ + + ++ ++ + PD VR GLV +S + F L Sbjct: 633 DLVFLIEEFSRVRQPNFQQVVNFLKTIVSSLSIHPD---TVRFGLVFYSEEPRLEFSLDT 689 Query: 236 --GVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I E +++L + TK+ L++ N++F E ++ + ++ + + Sbjct: 690 FQNPAKILEHLDKLTYRERRGRTKTGAALDFLRNEVF----IQEKGSRSNHGVQQIAVVI 745 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 +G + + +R G +YA+G Q + + L+ AS + Sbjct: 746 MEGFSQDSVSRP------ASHLRRAGITIYAVGTQNVSESKELEKIASYPHWKYSVPLES 799 Query: 351 LHD 353 Sbjct: 800 FLQ 802 >gi|262184150|ref|ZP_06043571.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 500 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 35/203 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVV------RSGLVTFS 224 +VLD S SM ++ + S+ ++D +PD V + L+ +S Sbjct: 313 ALVLDTSGSMEGE------RMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYS 366 Query: 225 SKIVQTFPLAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 S+ Q + + +++ RL+ T + + A++++ + + Sbjct: 367 SEPQQPTRARVDKDKPATTKELADRVERLVADGDTATFEAVLNAFDEVDTSGGDIGT--- 423 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKNCA 337 ++ +TDGE + + Y + + V+ I EA Q ++ A Sbjct: 424 --------VVLMTDGEVTRGRTFAQFKDAYAQLPEDKKEIPVFVILYG-EANIQEMEELA 474 Query: 338 --SPDRFYSVQNSRKLHDAFLRI 358 + + + N L AF I Sbjct: 475 QLTGGKTFDALN-GDLAAAFEEI 496 >gi|126667415|ref|ZP_01738387.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] gi|126628171|gb|EAZ98796.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] Length = 774 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 45/210 (21%), Positives = 82/210 (39%), Gaps = 35/210 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + +D+ +++D+S SM + + + V + + +PD G+ Sbjct: 49 SPQLPGAVDVRIIVDISGSMKQNDPQNLRRPAVRLLA--------RLLPDGATA---GVW 97 Query: 222 TFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 TF + P AW I++ T +E A + F Sbjct: 98 TFGQYVNMLVPHREVSDAWRDMAIEQSDAINSVAMRTNLGAAIETASDGYF--------- 148 Query: 277 AKGHDDYKKYIIFLTDGE---NSSPNIDNKESLFYCNEA----KRRGAIVYAIGVQAEAA 329 G + I LTDG+ + +P+ + E+ + K++GA +A+ + AEA Sbjct: 149 -TGGVLSNTHFIVLTDGKVDISRNPSANKAEANRILDTLVPPLKQQGARFHAVALSAEAD 207 Query: 330 DQFLKNCASPDR--FYSVQNSRKLHDAFLR 357 +FL+ AS F+ +N+ L AFL Sbjct: 208 TEFLRKLASDSNGSFHVAENANDLSRAFLD 237 >gi|325287596|ref|YP_004263386.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323050|gb|ADY30515.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 696 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 76/231 (32%), Gaps = 28/231 (12%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDK 191 + + PW N+ + I K ++ + ++DVS SM DH +K Sbjct: 303 FSISTDVAKTPW--NTQTQLVRIGLQGKEYLNEELPASNLTFLIDVSGSMEDH-----NK 355 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIF 249 L + + + ++ + V+ VV +G P G + I + +L Sbjct: 356 LPLLISAFKLLVHQLIEKDKVSIVVYAGAAG------VVLPPTNGDQKEKIINALQKLEA 409 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T G++ AY +K + +I TDG+ + + Sbjct: 410 GGSTAGGQGIKLAYKLAEKNFKKNGNNR---------VILATDGDFNVGASSDTAMEKLI 460 Query: 310 NEAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + + G + +G D L+ A + ++ F Sbjct: 461 EKKRASGVFLSVLGFGMGNYKDSKLETLADKGNGNHAYIDTMQEAQKVFGD 511 >gi|257062762|ref|YP_003142434.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] gi|256790415|gb|ACV21085.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] Length = 2281 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 43/279 (15%), Positives = 87/279 (31%), Gaps = 62/279 (22%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP--GMDKLG 193 P + L + + ++++++LD S SM+ G ++ Sbjct: 44 PHTKNLTDNHDGTYTISLDVVGESERKPN---PVNVIVILDNSGSMDTRTGGYGSQTRMA 100 Query: 194 VATRSIREMLDIIKSIP--DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 A ++ + + + + ++V+ LV FS+ +NRL Sbjct: 101 AAQNAVNNLARSLYAYNTTEFPDLVQMALVGFSTTGSVVQGPTNSYNTFSGAVNRLDADG 160 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE----NSSPNIDNKESLF 307 T L+ I DD Y+IF++DG N+ N + ++ + Sbjct: 161 GTNWEDALQ-----------DAAGINFNDDDPT-YVIFVSDGNPTFRNTRGNYNPMDNYY 208 Query: 308 Y-------------------------------CNEAKRRGAIV-----YAIGV--QAEAA 329 Y ++A+ V Y IG + Sbjct: 209 YNTWGVYGNGSDSQTVAGIAAATTIARCYEHAVDDAESLATSVGADHFYTIGAYGNVDRM 268 Query: 330 DQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + +P ++S N+ L +A I ++ K I Sbjct: 269 RSLTTDAGAPAGNYFSAANTTDLQNALAAILAQIEKAGI 307 >gi|221104447|ref|XP_002170122.1| PREDICTED: similar to tyrosine kinase receptor, partial [Hydra magnipapillata] Length = 898 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 80/197 (40%), Gaps = 22/197 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S LD+ ++D S S+ + + ++ L R+ + NN +G+VT Sbjct: 85 SDCAGVLDVGFIIDSSGSLRNQYRQEVEFLKSLARTFKI----------SNNGAHAGVVT 134 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 FSS + L + + + ++ + G T+ L + + + I Sbjct: 135 FSSIAELSIKLNQYYDQEQFERAVDDIPYMGYVTRIDLALRKSLEMFDEINGARKSIP-- 192 Query: 280 HDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 + + LTDGE + + ++ + + +G +++AIG+ + L + A Sbjct: 193 -----QILFLLTDGEQYAGKGVVDENPVSIAKLLRDKGIVIFAIGIGSAVRQSQLNDIAG 247 Query: 338 SPDRFYSVQNSRKLHDA 354 S ++ + +N +L ++ Sbjct: 248 SSEKAFLAKNFNELVNS 264 Score = 39.8 bits (91), Expect = 0.66, Method: Composition-based stats. Identities = 41/260 (15%), Positives = 93/260 (35%), Gaps = 40/260 (15%) Query: 111 STSLSIIIDDQHKDYNLSAV------------SRYEMPFIFCTFPWCANSSHAPLLITSS 158 ++ + + Y + + ++ I+ + PW + + + Sbjct: 647 HVEINQQLVNNAYVYTIKLNGKVVFFEENMQATSFDNVMIYASDPWHP-AQDGSIKDLTI 705 Query: 159 VKISSKSDIG--LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + S+SD +D+ ++D S S+ +H+ ++ L +++ + Sbjct: 706 INGKSESDCAVIVDVGFIIDSSGSLENHYQQEVEFL----------INLASTFNISKYGA 755 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKL 273 +G+VTFS + L + + + T+ LE + + Sbjct: 756 HAGVVTFSYDAFLSIKLNDYFNQAQFNNAVKDISYLNGGTRIDLALEKSLEMFDELNGAR 815 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ + + LTDGE S +K + + RG I++AIG+ + L Sbjct: 816 KNTP-------QILFLLTDGEQSG----DKNPVDIAKRLRDRGIIIFAIGIGSYVNKTEL 864 Query: 334 KNCA-SPDRFYSVQNSRKLH 352 N S D+ + +N +L Sbjct: 865 NNIVGSNDKAFLAENFNELV 884 >gi|260834995|ref|XP_002612495.1| hypothetical protein BRAFLDRAFT_120990 [Branchiostoma floridae] gi|229297872|gb|EEN68504.1| hypothetical protein BRAFLDRAFT_120990 [Branchiostoma floridae] Length = 443 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/200 (20%), Positives = 67/200 (33%), Gaps = 21/200 (10%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD ++ LD S SMN G GM +L R + + + V +V F Sbjct: 2 PLDTVLCLDTSGSMN---GRGMAELKKGVRHFLLGVQETANKMSLRENV--AVVEFGGGA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL+ + + ++ L G TT GL A +I L + + Sbjct: 57 RIIQPLSGNYGTVMQSVDNLKAGGTTPMFEGLMEAMKEILQRGGVLTLPGGRKMTPR--V 114 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRG---------AIVYAIGVQAEAADQFLKNCAS 338 I +TDG D + L G + +G + L+ A Sbjct: 115 ILMTDGYPD----DKENVLKAALSFGPAGWQAVGLPHPIPIACVGCGDDVDKDLLQAIAK 170 Query: 339 -PDRFYSVQNSRKLHDAFLR 357 + Y + + +L + F R Sbjct: 171 LTNGMYILGDVSQLSEFFRR 190 >gi|198415896|ref|XP_002125135.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ciona intestinalis] Length = 1580 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 50/257 (19%), Positives = 97/257 (37%), Gaps = 43/257 (16%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 D ++ + S +++ F +N + L + + ++VLD Sbjct: 283 TEADNEQNAKCNLRSTWDVITSTSDFSGGSNPPNPTLTNLAPTFRVVRVAASRRFVLVLD 342 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 VS SM+ + +L + +S + + S+PD + G+V F S + Sbjct: 343 VSGSMSGN------RLLMMRQSAGDFIST--SLPDGDK---VGIVQFHSSANLMMEI--- 388 Query: 237 VQHIQEKINRL--------IFGSTTKSTPGLEYAYNKI--FDAKEKLEHIAKGHDDYKKY 286 + I +++R+ I G +T G+ A N++ DA E + Sbjct: 389 -RQISSQLDRVAIAAGIPGIAGGSTCIGCGIYAAMNEMERHDANETCGN----------- 436 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYSV 345 II LTDG+ + P N S A ++ +V AI + A+ +++ Sbjct: 437 IIVLTDGKENQPPYVNDVSQL----AIQKNCVVNAILFTTTENSALVDLVTATGGQWFFA 492 Query: 346 Q--NSRKLHDAFLRIGK 360 Q + ++L +F I Sbjct: 493 QDRDLKRLMGSFAVIAA 509 >gi|170767616|ref|ZP_02902069.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] gi|170123950|gb|EDS92881.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] Length = 586 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 46/316 (14%), Positives = 100/316 (31%), Gaps = 40/316 (12%) Query: 56 LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ------------ 103 + K N G + F +K + Q + Sbjct: 96 AYESVAKAKATRISNLGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGQ 155 Query: 104 ----DINNIER--STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 D +E + S + + + S + M + PW + + I + Sbjct: 156 LPPPDAVRVEEMVNYFPSDWVINDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILA 215 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + S++ +++ ++D S SM ++L + S++ ++ ++ +++ V Sbjct: 216 Q-DLKSEALPASNLVFLIDTSGSMYSD-----ERLPLIQSSLKLLVKELREQDNISIVTY 269 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +G S+I I I+ L +T GLE AY + Sbjct: 270 AG----DSRIALPSTSGNHKDEINAAIDSLNARGSTNGGAGLEMAYQQAAKG------FI 319 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNC 336 KG + I+ TDG+ + D K + + G + +GV + + + Sbjct: 320 KGGVNR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGRDNYNEAMMVRI 376 Query: 337 A--SPDRFYSVQNSRK 350 A + + + Sbjct: 377 ADVGNGNYSYIDTLSE 392 >gi|22125371|ref|NP_668794.1| hypothetical protein y1474 [Yersinia pestis KIM 10] gi|45442407|ref|NP_993946.1| hypothetical protein YP_2631 [Yersinia pestis biovar Microtus str. 91001] gi|149365130|ref|ZP_01887165.1| putative membrane protein [Yersinia pestis CA88-4125] gi|218930054|ref|YP_002347929.1| hypothetical protein YPO3007 [Yersinia pestis CO92] gi|21958254|gb|AAM85045.1|AE013750_5 hypothetical [Yersinia pestis KIM 10] gi|45437272|gb|AAS62823.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|115348665|emb|CAL21610.1| putative membrane protein [Yersinia pestis CO92] gi|149291543|gb|EDM41617.1| putative membrane protein [Yersinia pestis CA88-4125] Length = 509 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 72/200 (36%), Gaps = 26/200 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S +++ +V+D S SM+ G ++K A ML+I ++ +V Sbjct: 127 STRRSPINLALVIDRSTSMS---GERIEKAREAAILAVNMLNITDTLS---------VVA 174 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P + I + + T G+ ++ +H+ + Sbjct: 175 YDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQV------DKHLNRE 228 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + II ++DG+ ++ E A ++G + IG+ + + + A Sbjct: 229 QVNR---IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGY 285 Query: 338 SPDRFYSVQNSRKLHDAFLR 357 S V NS L AF + Sbjct: 286 SDGNHTFVANSADLEKAFTK 305 >gi|118085865|ref|XP_418677.2| PREDICTED: similar to collagen, type XXVIII [Gallus gallus] Length = 1144 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 76/199 (38%), Gaps = 33/199 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 L+++ V+D S S+ D ++ ++D + + + R G++ FS K Sbjct: 772 TPLELIFVIDSSESVGP------DNFNSTKTFMKTVIDEVSA---NHATTRIGIINFSHK 822 Query: 227 IVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L + ++ +++++ G T + ++ A + A+ Sbjct: 823 VELVSSLETYTTKESLKSAVDKMLYLGEGTYTASAIKKAISLFQAARPA----------V 872 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA---EAADQFLKN---CA 337 +K + +TDG+ + N D EA ++ IG+ FLK A Sbjct: 873 RKVALVVTDGQ--ADNRDKVHLDLVVKEAHAANIEIFVIGIVQKTDPHYHNFLKEMHLIA 930 Query: 338 SP---DRFYSVQNSRKLHD 353 + + FY +++ + L Sbjct: 931 TDPDEEHFYQIEDFKTLSA 949 >gi|308476046|ref|XP_003100240.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] gi|308265764|gb|EFP09717.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] Length = 879 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 72/190 (37%), Gaps = 27/190 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++++ D S + F + + I++ +P + VR GL+ +S Sbjct: 33 LDIIILFDTSGGNDTVFE----------QQKNWTIKIVRDLPIHEDAVRVGLIQYSDAAK 82 Query: 229 QTFPLAW--GVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L+ I + L F G T++ L+ A ++F+ A Sbjct: 83 TEFNLSRYSERNDIITHLETLTFMPGEDTRTGVALDKADEEMFNYIGGARLKAT------ 136 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDRFY 343 + II TDG + + ++L +R+G +Y I V + + L D + Sbjct: 137 RLIILFTDGLSMDKPTKSAKTL------RRKGVKIYTISVNSIGFVPEMLGIVGDADNVF 190 Query: 344 SVQNSRKLHD 353 + ++ + Sbjct: 191 GPTDEDRIEE 200 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 68/199 (34%), Gaps = 26/199 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 SS +D++ V+D S S+ + D T + +IK + + R GL+ Sbjct: 690 SSSVQCPMDILFVVDSSGSIARTYDTQKD-----THFQDYLTQLIKKVEPSRSH-RVGLI 743 Query: 222 TFSSKIVQTFPLAWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F+ +Q ++ ++ + TT LE + + ++ E Sbjct: 744 QFAGPHIQKMEWSFDTHSKNSQLLSAIRSVRHLTGTTYIGAALELSLILLDSRRKHTETT 803 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 +I ++DG + + + L K +YAI + ++L + Sbjct: 804 ----------VILISDGFSQDDSTQQAKLLRQLPNVK-----MYAISLNKLTNTKYLTDI 848 Query: 337 ASPDRFYSVQNSRKLHDAF 355 + + + + F Sbjct: 849 VGDRKNLFINDESHWFEEF 867 >gi|327284423|ref|XP_003226937.1| PREDICTED: anthrax toxin receptor 2-like [Anolis carolinensis] Length = 441 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 44/203 (21%), Positives = 74/203 (36%), Gaps = 29/203 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ VLD S S+ D++ +D + T V+ +R + FS + Sbjct: 40 GAFDLYFVLDKSGSVTDNWFEIVDFVKQLTDRF------------VSPRMRLSFIVFSMQ 87 Query: 227 IVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L I+ + L G T G++ A +I A G Sbjct: 88 AKVILQLTENRAQIERGLEELRNVKPGGETYMHEGIKEANRQIETA---------GGQRT 138 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDG+ + + + + ++R GA VY +GV +Q S D+ + Sbjct: 139 NSIIIALTDGKLEG--LIPQYAEKQADISRRLGARVYCVGVLNFNQEQLESIADSRDQVF 196 Query: 344 SVQNSRKLHDAFLRIGKEMVKQR 366 V ++ A I ++KQ Sbjct: 197 PV---KEGFQALRGIINSILKQS 216 >gi|260786375|ref|XP_002588233.1| hypothetical protein BRAFLDRAFT_124700 [Branchiostoma floridae] gi|229273393|gb|EEN44244.1| hypothetical protein BRAFLDRAFT_124700 [Branchiostoma floridae] Length = 1313 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 71/207 (34%), Gaps = 37/207 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +VLDVS S++ G + +L + K I + GL+TFS Sbjct: 303 QYAERVSIVLDVSGSID--MGTLLPRLNQ---------EASKYIRSFADGSMVGLITFSD 351 Query: 226 KIVQTFPLA--WGVQHIQEKINRLIFG--STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 L H Q I L +T G++ + + + Sbjct: 352 TAAVDHALTELTADSHRQSLITALPSSTYGSTSIGAGIQAGLSMLKPTGQGGT------- 404 Query: 282 DYKKYIIFLTDG-ENSSPNI-DNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCA 337 I+ +TDG EN++P I D S+ ++ + I + A L + Sbjct: 405 -----IVLMTDGQENTAPMIQDVWPSVL------QQKVTLVTIAIGEYADMSLEDLASQT 453 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 S FY +++ L + F I + Sbjct: 454 SGLSFYDTEDASHLSEIFTAISSQDSD 480 >gi|3766289|emb|CAA06890.1| matrilin-4 precursor, alternate splice product [Mus musculus] Length = 434 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 67/177 (37%), Gaps = 25/177 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ + R + +++D + P+ R GLV FSS++ Sbjct: 197 VDLVLLVDGSKSVRPQ------NFELVKRFVNQIVDFLDVSPEG---TRVGLVQFSSRVR 247 Query: 229 QTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL G ++ + + T + L + F + Sbjct: 248 TEFPL--GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRDLN---V 302 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + TDG + + + AK G ++YA+GV ++ + + P Sbjct: 303 PRVGLVFTDGRSQD------DISVWAARAKEEGIVMYAVGVGKAVEEELREIASEPS 353 >gi|218506166|ref|ZP_03504044.1| hypothetical protein RetlB5_00485 [Rhizobium etli Brasil 5] Length = 205 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 75/180 (41%), Gaps = 30/180 (16%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 +F +R + G++ I+ A+ L + + +G + + V+ K+ LD +L+ Sbjct: 6 AFAALRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVK 65 Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 +I N + + K + D+ + ++N + ID Sbjct: 66 QINNTGDTDALKLKVTDWFHAQVENSYTLG-------------------------EIDID 100 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 ++N++A + +P F AN P+ + S+VK + S L++ +V+D S SM Sbjct: 101 TTNHNITATASGTVPTTFMKI---ANIDTVPVSVASAVKGPATS--YLNVYIVIDTSPSM 155 >gi|157105665|ref|XP_001648969.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] gi|108868963|gb|EAT33188.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] Length = 541 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 83/211 (39%), Gaps = 41/211 (19%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 SV+ +D++ ++D S S+ +F + + ++++L N Sbjct: 124 SVEKIKIKHKRVDIVFLIDASSSVGKANFYSEI-------KFVKKLLSDFNV---SYNYT 173 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQE---------KINRLIF-GSTTKSTPGLEYAYNKI 266 R ++TFSS++ + +++ +I ++ F G T + L+ A Sbjct: 174 RVAVITFSSQMKIFRHIDQISTSVEDNDKCLLLNYQIPKIEFSGGGTYTYGALKEAEEIF 233 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +A+ KK I +TDG ++ ++ + KR+ ++Y+IG+Q+ Sbjct: 234 QNARADS----------KKIIFLITDGFSNG-----RDPIPLAESLKRKNVVIYSIGIQS 278 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + +SP + L D+F Sbjct: 279 GNYAELYNMSSSPG-----DSHSFLLDSFDH 304 >gi|332217052|ref|XP_003257667.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Nomascus leucogenys] Length = 946 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 70/201 (34%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N VR+ Sbjct: 311 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDRFSVIDFNQNVRT------W 358 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + I ++ T L A + +A Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEASNLGLLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + E + ++++G+ + FLK ++ + Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|145219382|ref|YP_001130091.1| hypothetical protein Cvib_0567 [Prosthecochloris vibrioformis DSM 265] gi|145205546|gb|ABP36589.1| conserved hypothetical protein [Chlorobium phaeovibrioides DSM 265] Length = 356 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 37/89 (41%), Gaps = 5/89 (5%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R G+ +IL A++LPV+ L ++ + VK +L D + L A + + Sbjct: 9 RRLHRQRGGT-AILFALVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGAHSLSDA 67 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNE 95 G+ + +N+ Q++ N Sbjct: 68 G----GQPYNWSAAVNAAQNVVQSNVANG 92 >gi|326506938|dbj|BAJ91510.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 378 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 62/167 (37%), Gaps = 18/167 (10%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA-PLLITSSVKISSKS 165 ++ S ++ I + + S+ + + CAN + ++ S Sbjct: 205 DVRSSRTVEIKTYSEF-SAIPQSSSQDDFAVLIHLKAPCANPEQITSRPVNATSVGYPTS 263 Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D++ +LDVS SM KL + R++ ++ + R ++ FSS Sbjct: 264 RAPVDLVTLLDVSGSMAG------TKLALLKRAMGFVIQHLGPSD------RLSVIAFSS 311 Query: 226 KIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + + F L G Q + +N L G T L+ A I D Sbjct: 312 TVRRLFHLRRMSHSGRQQALQAVNSLGAGGGTNIADALKKAAKVIED 358 >gi|146302265|ref|YP_001196856.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146156683|gb|ABQ07537.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] Length = 2588 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/151 (21%), Positives = 57/151 (37%), Gaps = 19/151 (12%) Query: 170 DMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D+++ +D+S SM + G + A + L+ K+ P R +V +S+ Sbjct: 69 DVVLAIDISGSMGNTISGDFKTSMDYAKDAALAFLNQAKANPQN----RIAIVAYSTTAS 124 Query: 229 QTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + GV I +IN L ++T G+ + ++ A Sbjct: 125 LKIGLTYLNATGVTQITNQINALQATNSTNIYAGIVRSETELETNGRFDCSTA------- 177 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + II LTDG N+ CN +K Sbjct: 178 RAIILLTDGVT---NVTGTSGNTNCNVSKTS 205 >gi|313235273|emb|CBY10837.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 65/178 (36%), Gaps = 25/178 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + LD++ V+D S S+ D + + + + N R + T+S Sbjct: 177 TSKALDIVFVVDESGSVGP------DNFDLVKQFLIDYAQDSNIAA---NATRIAIRTYS 227 Query: 225 SKIVQTFPL-AWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + +I +IN L+ T + + N + + + Sbjct: 228 TYSDLDFSLNDFKTSNIIFEINNLVHESGGTNTADAITNGLNDFGNDR----------SE 277 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 K ++ +TDG+++ + L + R +AIG+ + L+ A+ D Sbjct: 278 SVKIMVTITDGQSNYDRVKAAADLLKADP---RNIQSFAIGIDGANMAE-LQAIATTD 331 >gi|83594486|ref|YP_428238.1| von Willebrand factor, type A [Rhodospirillum rubrum ATCC 11170] gi|83577400|gb|ABC23951.1| von Willebrand factor, type A [Rhodospirillum rubrum ATCC 11170] Length = 575 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 71/189 (37%), Gaps = 20/189 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M+ V+D S SM++ ++ A RSIR+ ++ + + +R G+V+FS + Sbjct: 394 MIFVVDGSGSMSEGIAGAPSRISAAKRSIRDTVNAL------HKDIRVGMVSFSDCMATQ 447 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ TP E A + + + + ++ + Sbjct: 448 -----NSKYYSAAERPAFLAGVDAITP--ERATSLAASIRRGGALATRRSET---VMMVV 497 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNS 348 +DGE++ + +A++ I++ I + + AS R ++ ++ Sbjct: 498 SDGEDTCGGDPCAAARAV--KAEKSNVIIHVIDLSGGGNSGVARCIASAGGGRVFTPGSA 555 Query: 349 RKLHDAFLR 357 ++ + Sbjct: 556 AQVTSSLRT 564 >gi|89069885|ref|ZP_01157219.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] gi|89044561|gb|EAR50680.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] Length = 536 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 2/73 (2%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASP-DRFYSVQNSRKLHDA 354 S + N C AK +G V+ +G + E + +++CAS F+ V L A Sbjct: 462 STSAKNARLEAICTAAKNQGVQVFTVGFEVEDDEAIIMEDCASSRAHFFRVSGGGDLTTA 521 Query: 355 FLRIGKEMVKQRI 367 F I +++ + R+ Sbjct: 522 FESIARQITELRL 534 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 49/365 (13%), Positives = 104/365 (28%), Gaps = 111/365 (30%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 + +G + I + ++ + G+ ++ + +L LD ++L A Sbjct: 11 LRRDERGGMIIFGLFVFLLLLLAGGMAVDFMRTETARGRLQATLDGAVLAAA-------- 62 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 ++ ++++ + L D+ I ++ Sbjct: 63 ----DLDQDKDPVEVVRDYVA---KAGLDPFLIDVDVTEIA-------------GQRIVT 102 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN------ 182 A ++ ++ F P S+ S++ LD+ +VLD+S SM Sbjct: 103 ASAKSDVT---MHFMKMVGIDFLPAPARSTA---SEAVSNLDVSLVLDMSGSMEGDKLDQ 156 Query: 183 -------------DHFGPGMDKLGVATRS--------IREMLDIIKSIPDVNNVVRSGLV 221 D G L V + + +ML +N V Sbjct: 157 LQAAAKNFVGIVYDTMGAEKILLNVVPYATQVAAPAGLLDMLGAFLREHSYSNCVSFSAA 216 Query: 222 TFS-SKIVQTFPL----------AWGV-------------------------QHIQEKIN 245 F+ + I++ L WG + I++ I+ Sbjct: 217 DFTETSILEAAALPQGGHFDPFYTWGPLRYDDVTFVCNPDPSTEVLTLASTQREIEDYID 276 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH--------------DDYKKYIIFLT 291 L+ T G+++ I + K I+ +T Sbjct: 277 GLVAEGNTSIDVGMKWGAALIDPDLGSTLNEFANGPSAAGINPVALWGDRSTDKVIVLMT 336 Query: 292 DGENS 296 DG+N+ Sbjct: 337 DGKNT 341 >gi|306820467|ref|ZP_07454103.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304551542|gb|EFM39497.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 538 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 49/292 (16%), Positives = 86/292 (29%), Gaps = 37/292 (12%) Query: 83 IIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF 142 I N W+++ ++E+ + I + ID S Y F Sbjct: 114 NITNEWRSNMQSEVAK------IEKHLTDDATDEEIDHFFNQLLYIVGSDYSAVEDINRF 167 Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 + T + +++ +VLD S SM G + +A SI E+ Sbjct: 168 GYVIFKKDMKDPFTGE---KVNENKQVNVEIVLDASGSMAKQIN-GQSMMNIAKNSITEV 223 Query: 203 LDI--------IKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQHIQEKINRLIFGSTT 253 L ++ N SG S P+ I + ++ + T Sbjct: 224 LKHLPKNAKVGLRVFGHKGNNTDSGKTESCSANELIHPIETLNTSAISKALSSVEATGWT 283 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ + E L + YI+ TDG + + + K Sbjct: 284 SIADSIK-------NGGEDLSKFKEEGAVNILYIV--TDGIETCGG----DPIEAAQTLK 330 Query: 314 RRGAIVY--AIGVQAEAA-DQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGK 360 G V IG A D LK A + ++ L +I + Sbjct: 331 NSGTNVVLGIIGFNVNATQDAVLKKIAEAGGGHYAIANDAGTLTSELYKITE 382 >gi|126728411|ref|ZP_01744227.1| hypothetical protein SSE37_20512 [Sagittula stellata E-37] gi|126711376|gb|EBA10426.1| hypothetical protein SSE37_20512 [Sagittula stellata E-37] Length = 219 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 75/208 (36%), Gaps = 43/208 (20%) Query: 172 MMVLDVSLSMNDHFGPGMD--KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS----- 224 M+V D S SM + + ++ A +++R++L PD+ + R GLV + Sbjct: 1 MIVFDGSGSMAEMGFNAIGEPRIVQARQAMRQVL------PDIAVLRRLGLVIYGPGGDR 54 Query: 225 --SKIVQTFPLAWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + W I +I L T T G+ A E L++ Sbjct: 55 TCRNVDLRLTPQWQADAPIISEIEGLRPAGGTALTDGVRLA-------AETLDYRNVPGA 107 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA----------EAADQ 331 ++ +TDG+ + + + + EA G V+ IG + +A + Sbjct: 108 -----VVLVTDGKETCGGTPCQLAAEFAREA--PGLTVHVIGFKVRGDHWDWSTPDAPGE 160 Query: 332 FLKNCA---SPDRFYSVQNSRKLHDAFL 356 + C + ++ S + +L A Sbjct: 161 SVARCLADDTGGQYLSAETVDELVGALR 188 >gi|126433420|ref|YP_001069111.1| von Willebrand factor, type A [Mycobacterium sp. JLS] gi|126233220|gb|ABN96620.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 233 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/163 (22%), Positives = 63/163 (38%), Gaps = 16/163 (9%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + D + +++ DVS SM G + L + L + V +VT Sbjct: 13 ANPDPRVACVVLADVSGSMQ---GEPIAALERGFAAFTRYLQNEVLA---SKRVEVAVVT 66 Query: 223 FSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 F + P+ A +Q + TT G+ A + + D K + A G Sbjct: 67 FGTVATVLVPMQEARTLQPV-----AFTASGTTNMAAGIHLALDILED--RKHAYKAAGL 119 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 Y+ +I+ LTDG+ + D + E+ RG V+A+G Sbjct: 120 QYYRPWILLLTDGKPNLDGFDEAVARLNAVES-ARGVTVFAVG 161 >gi|309790583|ref|ZP_07685138.1| von Willebrand factor, type A [Oscillochloris trichoides DG6] gi|308227385|gb|EFO81058.1| von Willebrand factor, type A [Oscillochloris trichoides DG6] Length = 430 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 69/212 (32%), Gaps = 39/212 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + ++V+D S SM++ G +D A ++ M V + + L+ F Sbjct: 88 AGGSAIRSVLVIDRSGSMDE--GNKIDGARDAAQAFVGM---------VRSDDQVALIGF 136 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + ++V P ++ I RL T + + + D + Sbjct: 137 NDQVVVLEPFTDDQAILEAAIRRLRADGGTALYDSIVEGVDLLRDQPGRRA--------- 187 Query: 284 KKYIIFLTDGE---------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--- 331 ++ LTDG+ + + + A + + +G + + D Sbjct: 188 ---LLVLTDGQDCRDLDSCPDDAGSSHTLAEAIAYANAANQPVTLIGLGQRGSSGDDGID 244 Query: 332 --FLKNCASP--DRFYSVQNSRKLHDAFLRIG 359 L+ A+ + ++ L D + I Sbjct: 245 ERVLQRIATETRGSYAYSPDAAALTDLYREIA 276 >gi|206575582|ref|YP_002235851.1| tellurite resistance protein TerY [Klebsiella pneumoniae 342] gi|206570426|gb|ACI12072.1| tellurite resistance protein TerY [Klebsiella pneumoniae 342] Length = 212 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 65/196 (33%), Gaps = 16/196 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM+ + ++ +L +K P ++TF S Sbjct: 3 RLPVYLLLDTSGSMHGE------PIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 QT PL + + TT L N+I +K KG + Sbjct: 57 RQTVPLT---DLLNFNLPSFSASGTTALGEALSLTANRIDAEVQKTTAETKGDWRP--LV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG P D ++ L AK+ G V A +A LK + Sbjct: 112 FLMTDG---GPTDDWRKGLNEFKAAKK-GV-VVACAAGHDADTGVLKEITEIVLQLDTAD 166 Query: 348 SRKLHDAFLRIGKEMV 363 S + F + + Sbjct: 167 SSSIKAFFKWVSASVS 182 >gi|194227183|ref|XP_001916967.1| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Equus caballus] Length = 946 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 72/201 (35%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTEDQFS------VVDFNHNVRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V ++ I ++ T L A + +A Sbjct: 359 RNDLVSATTTQVADAKKYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + + + ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKQNIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAH 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|94969533|ref|YP_591581.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551583|gb|ABF41507.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 362 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 79/216 (36%), Gaps = 37/216 (17%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S D + + ++ D+S SM+ +DK A ++ K+ + +V Sbjct: 132 SSEDAPVSIGVIFDMSGSMS----NKIDKSREAI------VEFFKTANPDDEFF---VVA 178 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F+ K ++ IQEK+ L T + NK+ AK + Sbjct: 179 FNDKPEVLQDFTNRIEDIQEKLTILQPKDRTSLLDAIYLGMNKMRQAKYER--------- 229 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAE------AADQFLK 334 K ++ ++DG ++ E + +YAIG+ A A L Sbjct: 230 --KALLIISDGGDNHSRYTENEIKSM---VREADVQIYAIGIYDLAPTTTEEMAGPALLG 284 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + + R + + N +L D +IG E+ Q +L Sbjct: 285 EISDWTGGRMFPIDNVNELADVATKIGVELRNQYVL 320 >gi|72180809|ref|XP_798930.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] gi|115975272|ref|XP_001180569.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] Length = 964 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 59/165 (35%), Gaps = 10/165 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D+S SM+ KL ++ +LD + N + S V F Sbjct: 351 IIFVIDISGSMSG------TKLAQVKDALSTILDDMSETDKFNILPFSDDVHFLESTGML 404 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + V+ + + L T + N + E+ ++ +I L Sbjct: 405 YSTKENVRRAKRFVMGLQEMDNTNLHKAIISGVNMLRAESEQDPQ----EEEIVSMLIVL 460 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 TDG + ID +EA ++ IG A+A FL+ Sbjct: 461 TDGNPNHGEIDKTIIERNVHEAINGDFSLFCIGFGADADYPFLRR 505 >gi|317012798|gb|ADU83406.1| hypothetical protein HPLT_05065 [Helicobacter pylori Lithuania75] Length = 219 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 73/215 (33%), Gaps = 35/215 (16%) Query: 172 MMVLDVSLSMNDHFGPGM----DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK- 226 ++LD S SMN+ G ++GV I++M++ +K + +VTF Sbjct: 18 FLLLDTSGSMNESLGDRTRDDRTRIGVLNLCIQKMIETLKQEAKKELFNKMAIVTFGENG 77 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 V P +++I L T + A + I YK Y Sbjct: 78 AVLHTPFD-DIKNIN--FKPLSASGGTPLDQAFKLAKDLI------EYKDTFPTKFYKPY 128 Query: 287 IIFLTDGENSSPNI---------DNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNC 336 I ++DGE + D + + C ++I + EA Q K+ Sbjct: 129 SILVSDGEPNDDKWQEPLFNFHHDGRSAKSVC----------WSIFIGDREANPQVNKDF 178 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 FY + KL F + + + K K Sbjct: 179 GKDGVFY-ADDVEKLVKLFEIMTQTISKGSASIKK 212 >gi|315650876|ref|ZP_07903919.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] gi|315486855|gb|EFU77194.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] Length = 526 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 88/244 (36%), Gaps = 31/244 (12%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + + PW ++ + I K + + +++ ++DVS SM++ DKL Sbjct: 133 FSVTTEISSCPWNPDTKL--MQIGLQAKNTDTTTKPSNLVFLIDVSASMDEP-----DKL 185 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + + + D +K ++ V +G + +V + I I L G + Sbjct: 186 PLVKNAFLLLCDELKENDTISIVTYAG----TDSVVLEGAKGSDKKSIMSAIEDLTAGGS 241 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + G++ AY + + ++ TDG+ + E + + Sbjct: 242 TAGSDGIKTAYKIAEKYFKTEGNNR---------VVLATDGDLNVGITSEGELIKLIKKE 292 Query: 313 KRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRF------YSVQN--SRKLHDAFLRIGKE 361 K + +G + D +++ A + + + S +L F + K+ Sbjct: 293 KESNIFLSVLGFGTDNIKDNKMQSLADNGDGNYSYIDSRFEAKKVLSDELGANFFTVAKD 352 Query: 362 MVKQ 365 + Q Sbjct: 353 VKLQ 356 >gi|56797853|emb|CAG26904.1| matrilin-1 [Danio rerio] Length = 277 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 80/231 (34%), Gaps = 27/231 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + L +++ + D++ ++D S S+ + Sbjct: 3 LPGFVMLLCIMGAQATVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPS------EFEQ 56 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GS 251 + +++D + PD R G+V ++S++ L + + ++++ + Sbjct: 57 VKVFLAKVIDGLSVGPDA---TRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLST 113 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + +++A N F E + D K I +TDG D Sbjct: 114 GTMTGLAIQFAMNVAFSEAE----GGRKSPDISKVAIIVTDGRPQDNIRD------IAAR 163 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 A+ G ++AIGV + + P D V++ KL F Sbjct: 164 AREAGIEIFAIGVGRVDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|118384116|ref|XP_001025211.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89306978|gb|EAS04966.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 631 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 51/297 (17%), Positives = 112/297 (37%), Gaps = 37/297 (12%) Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 N+ N + Q+++F ++ + + + +++ NN + +S + Q Sbjct: 49 NKLNDKSLHDQRSNFCLERENSVNKEFLSKTVHQTQQSENENNNKNLSSQQVEEIIQFNL 108 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + + + F C + H S + +D++ V+D S SM+ Sbjct: 109 ISETNEVSIKKSASF----VCGVNLHVKQPKEQS------ERVPMDLICVIDDSGSMSGK 158 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHI 240 K + +S++ +L I+ N R L++F S P + Sbjct: 159 ------KAQLVRKSLKYLLKIM------NENDRICLISFDSVEKILTPFLRNNLENKSEL 206 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 ++ I ++ +T G+E I + KEK + + L+DG++ SP + Sbjct: 207 KKAIKNIVGRGSTNIEAGMEAGLWMIKNRKEK---------NPITCMFLLSDGQDDSPQV 257 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAF 355 D + + + IV G A+ ++N A +Y +++ +K+ + F Sbjct: 258 DLRVQKLIQSYDIQDTFIVNTYGYGADHDATQMRNIAETHKGGYYYIEDVKKVSEWF 314 >gi|94482904|gb|ABF22495.1| anthrax toxin receptor 2 [Rattus norvegicus] Length = 487 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 74/226 (32%), Gaps = 34/226 (15%) Query: 149 SHAPLLITSSVKI-----SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 L + + D+ VLD S S+ +++ + + T Sbjct: 18 GLWLLTVGGPGSLLQAQEQPSCKKAFDLYFVLDKSGSVANNWIEIYNFVHQLTERF---- 73 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLE 260 V+ +R + FSS+ PL I E + + T GL+ Sbjct: 74 --------VSPEMRLSFIVFSSQATIILPLTGDRYKISKGLEDLKAVQPVGETYIHEGLK 125 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A +I +A G II LTDG+ + + +++ GA VY Sbjct: 126 LANEQIQNA---------GGLKTSSIIIALTDGKLDG--LVPSYAEKEAKKSRSLGASVY 174 Query: 321 AIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 +GV Q + S D+ + V+ A I ++ Q Sbjct: 175 CVGVLDFEQAQLERIADSKDQVFPVKGG---FQALKGIINSILAQS 217 >gi|307352799|ref|YP_003893850.1| Magnesium chelatase [Methanoplanus petrolearius DSM 11571] gi|307156032|gb|ADN35412.1| Magnesium chelatase [Methanoplanus petrolearius DSM 11571] Length = 651 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 47/270 (17%), Positives = 87/270 (32%), Gaps = 33/270 (12%) Query: 58 YTATKILNQEN-GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 T T+ E+ N K + +K R+ + + I R+ I Sbjct: 367 STTTQFAEGESFKLNQKPLSDYLRTDSLKREGNGR-RSATESHDGRYVGSRIPRNMGPDI 425 Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 +D A R PF + I+ + + IG ++ V+D Sbjct: 426 ALD---------ATIRAAAPF-QLERGAEGRDLAIKIDISDIREKVRERKIGNTILFVVD 475 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQTFPLAW 235 S SM ++ +I +L + R GLV F K P Sbjct: 476 ASGSMG-----AQQRMTAVKGAILSLL-----VDAYQKRDRVGLVVFRGKTAELLLPPTS 525 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 V+ ++ + L G T GL A+ + ++ I K +I ++DG+ Sbjct: 526 SVELARKCMQELPVGGKTPLAHGLSKAFEVL----QRELMINKNTMPR---LILISDGKA 578 Query: 296 SSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 + + +++ N + + Y I Sbjct: 579 NVGMTSDSPLNDAIGIANHIREKEIASYVI 608 >gi|242051338|ref|XP_002463413.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] gi|241926790|gb|EER99934.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] Length = 491 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 50/237 (21%), Positives = 81/237 (34%), Gaps = 38/237 (16%) Query: 138 IFCTFP-WCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 C FP +S + K S + LD++ VLDVSLSM KL + Sbjct: 41 TQCEFPALARGASRDRFAVLVHAKAPSDVARAPLDLVTVLDVSLSMKGQ------KLELL 94 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGS 251 +++ ++ + R +VTFS + LA G + + L Sbjct: 95 KQAMCFVIHQLGPAD------RLSIVTFSRHATRQIRLARMSDVGKASAKFAVGALCAVR 148 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID---NKESLFY 308 T GL + +E+ +I L+DG+++S + Y Sbjct: 149 GTNIGQGLRVGAQVLAGRRERNAVAG---------MILLSDGQDTSGCWTTVRPDGTKTY 199 Query: 309 CNEA------KRRGAIVYAIGVQA--EAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 N R A ++ G +AA A+ F V N + D+F R Sbjct: 200 ANLVPPSTSFSSRPAPIHTFGFGTDHDAAAMHAIAEATGGTFSFVGNEAAIQDSFAR 256 >gi|163816539|ref|ZP_02207903.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] gi|158448239|gb|EDP25234.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] Length = 465 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 68/200 (34%), Gaps = 33/200 (16%) Query: 165 SDIGLDM----MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 +D +D+ M+V+D S SM DK + E+L+ I + GL Sbjct: 137 TDYDVDIAQETMLVIDDSSSMKTS-----DKNDRRLTAANELLEHIDGNR------KVGL 185 Query: 221 VTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + FS I P+ + + ++ T L N Sbjct: 186 IRFSKDIHCYIPMDYLKVNKSTLNHELENKAKEGGTDINDALYAVLNAF---------DK 236 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 G + +I LTDG+++ N+D + + N A + I + F+K Sbjct: 237 VGTATGSRSVILLTDGKST-TNVDEE---YLINRANSMNIQINVISLGNHTDKAFIKRIT 292 Query: 338 SPDRFYSVQNSRK--LHDAF 355 S + + S L A+ Sbjct: 293 SSTGGKAAKTSSDFYLDAAY 312 >gi|156409371|ref|XP_001642143.1| predicted protein [Nematostella vectensis] gi|156229284|gb|EDO50080.1| predicted protein [Nematostella vectensis] Length = 332 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 66/200 (33%), Gaps = 27/200 (13%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 SA Y+ L + K + + +D+ +++D S S+ H Sbjct: 155 SASLVYKTSLKSVRSLVKRLQRSICLPTRPTKKPTRLCNRPIDLGLLVDGSGSIVLH--- 211 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI--- 244 G D G ++ ++ + R G++ +S++ F L + I Sbjct: 212 GKDNFGRLIEFVQSLVSFFRI---SRRHTRVGMILYSTRSYPIFRL--NQYTSKRAIMGK 266 Query: 245 --NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 N T++ L YA F ++ K+ +I LTDG + Sbjct: 267 IRNVRYPAGGTRTGQALRYARRYFFSGRKPKGR--------KRVLILLTDGISQDSVKGP 318 Query: 303 KESLFYCNEAKRRGAIVYAI 322 L + GA V+ I Sbjct: 319 ALQL------RNAGAEVFTI 332 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 56/159 (35%), Gaps = 24/159 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ ++ +S + R +V +S++ + Sbjct: 1 DIGFLVDGSASIEKRGKGNFGRMLNLIKSTLN------AFSLRQRRTRVSVVLYSNRPFK 54 Query: 230 TFPLAWGVQHIQ--EKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F ++ IN + + TK L Y +F +++ K+ Sbjct: 55 VFGFNRYSSKLRVIRAINYMRYPRGGTKLRRALYYVKRYLFTRRQRG---------RKQV 105 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 ++ LTDG + SL R G V++IG+ Sbjct: 106 LVVLTDGISRRGVKAPAISLH------RAGVEVHSIGIG 138 >gi|114567231|ref|YP_754385.1| chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338166|gb|ABI69014.1| conserved putative chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 951 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/150 (22%), Positives = 50/150 (33%), Gaps = 19/150 (12%) Query: 219 GLVTFSSKIV--QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 G+V F F IQ+ I + T P L AY + DA K +H Sbjct: 450 GVVAFDDTAQWVVEFQAVKDKDAIQDDIATIRADGGTSIYPALALAYTALKDAHTKFKH- 508 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 II LTDG+ + + F R G + + V A L+ Sbjct: 509 ----------IILLTDGQ----SATTGDYYFLSRRMARAGITMSTVAVGEGADTLLLEQL 554 Query: 337 AS--PDRFYSVQNSRKLHDAFLRIGKEMVK 364 A+ R+Y + F + + +K Sbjct: 555 AAWGQGRYYFSDEISNIPRIFTKETMKAIK 584 >gi|296481520|gb|DAA23635.1| inter-alpha globulin inhibitor H2 polypeptide [Bos taurus] Length = 946 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 63/176 (35%), Gaps = 21/176 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTEDHFS------VVDFNHNVRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V + I ++ T L A + +A Sbjct: 359 RNDLVSATKTQVADAKNYIEKIQPSGGTNINEALLRAIFILNEANNLGMLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 II ++DG+ + + + + R ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDFLKRLSNDNR 470 >gi|148238273|ref|NP_001091485.1| inter-alpha-trypsin inhibitor heavy chain H2 [Bos taurus] gi|146186952|gb|AAI40657.1| ITIH2 protein [Bos taurus] Length = 946 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 63/176 (35%), Gaps = 21/176 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTEDHFS------VVDFNHNVRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V + I ++ T L A + +A Sbjct: 359 RNDLVSATKTQVADAKNYIEKIQPSGGTNINEALLRAIFILNEANNLGMLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 II ++DG+ + + + + R ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKQNIRDNISLFSLGIGFDVDYDFLKRLSNDNR 470 >gi|1915956|emb|CAA72309.1| inter-alpha-inhibitor heavy-chain 1 [Sus scrofa] Length = 779 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +++ V+D+S SM K+ ++ ++L +K D ++V G S Sbjct: 156 TKLNKNVVFVIDISSSMEGQ------KVKQTKEALLKILSDLKP-GDYFDLVLFGSAVQS 208 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + + + +T GL + A+ L + Sbjct: 209 WRGSLVQASTANLDAARSYVRQFSLAGSTNLNGGLLRGIEILNKAQGSLPEFSNRAS--- 265 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-----SP 339 +I LTDGE + D + L +A R +Y +G + FL+ A Sbjct: 266 -ILIMLTDGEPTEGVTDRSQILKNVRDAIRGRFPLYNLGFGHDVDWNFLEVMALENNGRA 324 Query: 340 DRFYSVQNSRKLHDAFLR 357 R Y ++ + F Sbjct: 325 QRIYEDHDAAQQLQGFYD 342 >gi|219125320|ref|XP_002182931.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217405725|gb|EEC45667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 523 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 61/167 (36%), Gaps = 30/167 (17%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 T +CA+ + K +D+++VLDVS SM + KL + ++ Sbjct: 43 VSTNHFCASIHARTMP-----KEDEDCRTPIDLIVVLDVSGSMTGN------KLKLCKKT 91 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTK 254 + +L ++++ R GL++F S FP +KI L T Sbjct: 92 LTMLLRVLQTQD------RFGLISFGSDARVEFPAQAMSKQNKASALQKIQSLTTRGCTN 145 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + L A ++ + + + FLTDG + D Sbjct: 146 MSAALGLAVQEL---------KIIEKSNPVRSLFFLTDGLANEGISD 183 >gi|332798630|ref|YP_004460129.1| pilin isopeptide linkage domain-containing protein [Tepidanaerobacter sp. Re1] gi|332696365|gb|AEE90822.1| pilin isopeptide linkage domain protein [Tepidanaerobacter sp. Re1] Length = 925 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/240 (16%), Positives = 85/240 (35%), Gaps = 31/240 (12%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 AN + + +S+++ D++ VLDVS SM+ G + V +++ ++ Sbjct: 205 SANGLNDYRIYLDVTTEASETETDRDIIFVLDVSNSMDTALGN-TTRFNVLKNTVKSAVN 263 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + P R ++TF ++ + +N L T G Y Y Sbjct: 264 SLVQNPSN----RISIITFGTRAQIVTTRETDRTKLINCVNSLSLPGGT--AGGTNY-YE 316 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA------- 317 + A + + G + K I F++DGE + ++ ++ Y A+ Sbjct: 317 SMLHAAQIVNGSINGSHE--KVIFFVSDGEPT-ASLPAANAMGYAAYAEVATIYAYHAAQ 373 Query: 318 ------IVYAIGVQAEAA-DQFLKNCA------SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 Y++ + ++ L+ + ++ +L AF R ++ Sbjct: 374 EFQNVDRFYSVFIGDDSGSASTLQTITQMVEVNNEKYMVQASSAEQLTSAFNRFVSKVGN 433 >gi|148537043|dbj|BAF63430.1| Ca(2+)-activated chloride channel splicing variant [Rattus norvegicus] Length = 514 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 58/168 (34%), Gaps = 34/168 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM+ D+L ++ L I + GLVTF S Sbjct: 309 ICLVLDKSGSMDTE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSTAQIQ 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + I + T GLE + I + + Sbjct: 359 NYLIKITNTGDYKKITGNL-PQQAVGGTSICRGLEAGFQAITSSDQSTSGSE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF 332 I+ LTDGE+ + + C K GA+++ I + +AA + Sbjct: 410 -IVLLTDGED--------DLISSCFEVVKHSGAVIHTIALGPKAAREL 448 >gi|327265755|ref|XP_003217673.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Anolis carolinensis] Length = 1078 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ +MLD + VN + +F+ K Sbjct: 225 DMVIIVDVSGSVSGL------TLKLMKTSVYDMLDTLSDDDYVN------VASFNQKAQA 272 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +EK+ + TT G E+A+ ++ ++ + Sbjct: 273 VSCFTHLVQANIRNKKVFKEKVEVMEARGTTDYKAGFEFAFEQLQNSNISRANCN----- 327 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F + V+ V D L+ CA+ Sbjct: 328 --KMIMMFTDG-----GEDRVQDVFEKYNWPNKTVRVFTFSVGQHNYDVTPLQWMACANK 380 Query: 340 DRFYSV 345 ++ + Sbjct: 381 GYYFEI 386 >gi|313902369|ref|ZP_07835772.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] gi|313467300|gb|EFR62811.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] Length = 895 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 72/189 (38%), Gaps = 32/189 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+ +VLD S SM + ++ A +++L V+ R ++TF ++ Sbjct: 711 PVDVCLVLDASASMAGN------RIRAAKDLAQQLL--------VSTRDRVAVITFQERV 756 Query: 228 VQ-TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 VQ PL ++ ++++ T GLE A + ++ + Sbjct: 757 VQVQVPLTRNTSRVERGLSQIQPYGLTPLAQGLEVALLYLAQSRARNP-----------L 805 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGV---QAEAADQFLKNC--ASPD 340 ++ +TDG + P A++ G + +G + +++L+ A+ Sbjct: 806 LVLVTDGIPTVPYRTANPLEDAVQVARQLGTGRFGRVGFTCIGLQPNERYLRELVRAAGG 865 Query: 341 RFYSVQNSR 349 R Y V Sbjct: 866 RLYVVDELE 874 >gi|256420242|ref|YP_003120895.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035150|gb|ACU58694.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 639 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/243 (17%), Positives = 93/243 (38%), Gaps = 32/243 (13%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSL 179 + N R +M PW N++H + I K +K ++ ++ ++DVS Sbjct: 230 SNPTGNTPVAVRTDMAI----CPW--NTAHQLVRIALKGKDVAKDNLPPSNLVFLIDVSG 283 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--V 237 SM+D KL + ++ + +++ ++ + R +V ++ P G Sbjct: 284 SMSDA-----KKLPLVKQAFKLLVNQLRPVD------RVAIVVYAGAAGLVLPSTSGDHK 332 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 I + +++L G +T G++ AY E++ K ++ +I TDG+ + Sbjct: 333 TAILDALDKLEAGGSTAGGEGVQLAYKTAT------EYLLKSGNNR---VIIATDGDFNV 383 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHDA 354 + E + + +G + +G D L+ A + + N + Sbjct: 384 GPSSDGELQRIIEKKREKGIFLSVLGFGMGNYKDNKLELLADKGNGNYAYIDNFEEARRT 443 Query: 355 FLR 357 F Sbjct: 444 FAT 446 >gi|332809376|ref|XP_003308229.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Pan troglodytes] Length = 682 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 38/198 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM G D+L ++ + L + V N G+V F S Sbjct: 70 VCLVLDKSGSMG-----GKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATVV 119 Query: 231 FPLAWGVQHIQEKINRLIFG------STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L ++ N L+ G T G++YA+ I H + Sbjct: 120 NKLI--QIKSSDERNTLMAGLPTYPRGGTSICSGIKYAFQVIG-----ELHSQLDGSE-- 170 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 ++ LTDGE+++ + +E K+ GAIV+ I + +A + + F Sbjct: 171 --VLLLTDGEDNTAS-------SCIDEVKQSGAIVHFIALGRDADEAVIEMSKITGGSHF 221 Query: 343 YSVQNSRK--LHDAFLRI 358 Y ++ L DAF + Sbjct: 222 YVSDEAQNNGLIDAFGAL 239 >gi|85707636|ref|ZP_01038702.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] gi|85689170|gb|EAQ29173.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] Length = 740 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 45/296 (15%), Positives = 94/296 (31%), Gaps = 36/296 (12%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRE-----NGFAQDINNI 108 SLL + ++ N + + D + + + Sbjct: 225 ASLLAGSADLIAPTADPNMVARAGG-GLNPVSITVNLDPGFAPEAISSPYHAVSVRGSGS 283 Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 R+ +L+ ++D+ L + + P + + IT Sbjct: 284 TRTVTLADGAVPANRDFELRWSASGDAPMLGLFKQRHGELEYVMATITPPALERVGEAPP 343 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR---SGLVTFSS 225 +M+ V+D S SM P A RS+ L+ ++ D NV+R + F+S Sbjct: 344 REMIFVIDNSGSMAGESMP------AARRSLLYALETLRP-QDRFNVIRFDDTMTELFAS 396 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + + + + L+ T+ P L A + + Sbjct: 397 AVQASDS---NIAAAKTFTHNLMANGGTEMLPALRAALRDRAPDERVRQ----------- 442 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +IFLTDG + + + N ++ V+ +G+ + ++ A R Sbjct: 443 -VIFLTDG----ALSNEADMMEEINRNRKDS-RVFMVGIGSAPNTYLMRRMAEAGR 492 >gi|119386037|ref|YP_917092.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119376632|gb|ABL71396.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 855 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 80/223 (35%), Gaps = 38/223 (17%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + +TS++ ++ + ++VLD S SM G++K+ +A + DI+ P Sbjct: 10 IALTSALAPAALAQERPSTILVLDASGSM-WGQIDGINKITIARDVVG---DIVSDFPAD 65 Query: 213 NNVVRSGLVTFSSK-------IVQTFPLAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYN 264 N G VT+ + I A G I + L T T + A Sbjct: 66 QN---LGFVTYGHRERGQCADIETLVEPAPGTAAEIAGIVEGLNPRGMTPMTDAVVTAAQ 122 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAI 322 + ++ +I ++DG + N + + G + I Sbjct: 123 ALRHTEQAAT------------VILVSDGIETC----NPDPCAAARALEEAGVDFTAHVI 166 Query: 323 GVQAEA-ADQFLK-NCA---SPDRFYSVQNSRKLHDAFLRIGK 360 G AD L+ C + RF + N+++L++A + Sbjct: 167 GFDVRGEADALLQMQCIAEETGGRFLTADNAQELNEALREVTA 209 >gi|119591515|gb|EAW71109.1| collagen, type VI, alpha 3, isoform CRA_g [Homo sapiens] Length = 2205 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 312 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 369 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 370 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 429 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 430 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 477 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 478 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 534 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 535 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 584 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 585 FNPSLVYLMDDFSSL 599 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1325 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1384 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1385 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1441 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1442 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1494 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1495 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1553 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 59 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 118 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 119 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 168 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 169 FSFGLGAQAASRAELQHIATDDN 191 >gi|119591511|gb|EAW71105.1| collagen, type VI, alpha 3, isoform CRA_c [Homo sapiens] Length = 2971 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 312 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 369 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 370 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 429 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 430 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 477 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 478 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 534 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 535 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 584 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 585 FNPSLVYLMDDFSSL 599 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1325 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1384 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1385 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1441 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1442 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1494 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1495 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1553 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 59 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 118 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 119 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 168 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 169 FSFGLGAQAASRAELQHIATDDN 191 >gi|55743106|ref|NP_476508.2| collagen alpha-3(VI) chain isoform 5 precursor [Homo sapiens] Length = 2971 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 312 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 369 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 370 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 429 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 430 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 477 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 478 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 534 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 535 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 584 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 585 FNPSLVYLMDDFSSL 599 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1325 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1384 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1385 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1441 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1442 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1494 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1495 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1553 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 59 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 118 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 119 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 168 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 169 FSFGLGAQAASRAELQHIATDDN 191 >gi|224012789|ref|XP_002295047.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220969486|gb|EED87827.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 818 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 80/227 (35%), Gaps = 33/227 (14%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGP---GMDKLGVATRSIREMLDI 205 P + + + + ++ D+ + +D+S S+ + FG G Sbjct: 589 VPETLDPTARPTCPTEEDFDLCIAVDMSGSVCNSGFFGNNCVGCSPFVFCQSLFVSQETC 648 Query: 206 IKSIPDVNNVVRS---------------GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 + DV R +V+F+S L+ + I + G Sbjct: 649 CANFGDVQQFARLMVYNLSQFGDKNTSFSVVSFASDAEILSGLSSADKTINVLDQLIYSG 708 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-ESLFYC 309 +T + +F + + + + KK+I+ +TDG +++ +++ + +++ Sbjct: 709 GSTNHGQAINACQESLFTSDQ--------NINRKKFIMLITDGVSATDDLNPEADAIDAA 760 Query: 310 NEAKRRGAIVYAIGV---QAEAADQFLKNCASPDRFYSVQNSRKLHD 353 AK G + I + A F+ + ++ Y V N L Sbjct: 761 ETAKSSGITIIPIFISPYNDIDAVSFMSSLSNDGEVY-VTNFDSLDS 806 >gi|108808190|ref|YP_652106.1| hypothetical protein YPA_2196 [Yersinia pestis Antiqua] gi|108811539|ref|YP_647306.1| hypothetical protein YPN_1376 [Yersinia pestis Nepal516] gi|145599390|ref|YP_001163466.1| hypothetical protein YPDSF_2114 [Yersinia pestis Pestoides F] gi|165926883|ref|ZP_02222715.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165935923|ref|ZP_02224493.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|166011266|ref|ZP_02232164.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166212622|ref|ZP_02238657.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167398885|ref|ZP_02304409.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422701|ref|ZP_02314454.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167424397|ref|ZP_02316150.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167467552|ref|ZP_02332256.1| hypothetical protein YpesF_06584 [Yersinia pestis FV-1] gi|170023658|ref|YP_001720163.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896129|ref|YP_001873241.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|270489996|ref|ZP_06207070.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294504758|ref|YP_003568820.1| membrane protein [Yersinia pestis Z176003] gi|108775187|gb|ABG17706.1| membrane protein [Yersinia pestis Nepal516] gi|108780103|gb|ABG14161.1| putative membrane protein [Yersinia pestis Antiqua] gi|145211086|gb|ABP40493.1| membrane protein [Yersinia pestis Pestoides F] gi|165916068|gb|EDR34675.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|165921234|gb|EDR38458.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165989944|gb|EDR42245.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206553|gb|EDR51033.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166958408|gb|EDR55429.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167051389|gb|EDR62797.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167056279|gb|EDR66048.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169750192|gb|ACA67710.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699155|gb|ACC89784.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|262362820|gb|ACY59541.1| membrane protein [Yersinia pestis D106004] gi|262366744|gb|ACY63301.1| membrane protein [Yersinia pestis D182038] gi|270338500|gb|EFA49277.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294355217|gb|ADE65558.1| membrane protein [Yersinia pestis Z176003] Length = 472 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 72/200 (36%), Gaps = 26/200 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S +++ +V+D S SM+ G ++K A ML+I ++ +V Sbjct: 90 STRRSPINLALVIDRSTSMS---GERIEKAREAAILAVNMLNITDTLS---------VVA 137 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P + I + + T G+ ++ +H+ + Sbjct: 138 YDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQV------DKHLNRE 191 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + II ++DG+ ++ E A ++G + IG+ + + + A Sbjct: 192 QVNR---IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGY 248 Query: 338 SPDRFYSVQNSRKLHDAFLR 357 S V NS L AF + Sbjct: 249 SDGNHTFVANSADLEKAFTK 268 >gi|326932668|ref|XP_003212436.1| PREDICTED: anthrax toxin receptor 1-like [Meleagris gallopavo] Length = 499 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 71/199 (35%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ + R ++ +R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNEIYHFVEHLARKF------------ISPQLRMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ A Sbjct: 89 GTILMRLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENVHGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GA VY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRELGATVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + L I K+ Sbjct: 200 PVNDGFEALQGIIDSILKK 218 >gi|261880540|ref|ZP_06006967.1| BatB protein [Prevotella bergensis DSM 17361] gi|270332763|gb|EFA43549.1| BatB protein [Prevotella bergensis DSM 17361] Length = 342 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 61/180 (33%), Gaps = 32/180 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 KIS + G++ ++ +DVS SM D +DK + S+ E Sbjct: 82 TKISHEKRDGIEAIICMDVSNSMKAEDVAPSRLDKSKMLVESMTEHFTN----------D 131 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + GLV F+ P+ + + L+ T + Sbjct: 132 KLGLVVFAGDAFVQLPITSDYVSAKMFLQNIDPSLVSTQGTDIARAITVGMRSFTQ---- 187 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + II +TDGE+ ++ EA++RG V+ +GV + Sbjct: 188 -------QEKVGRAIIVITDGEDHEG-----GAMEAAKEARKRGINVFILGVGSTKGAPI 235 >gi|226874935|ref|NP_034712.2| inter-alpha-trypsin inhibitor heavy chain H2 [Mus musculus] gi|148676057|gb|EDL08004.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 950 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 76/201 (37%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ + Sbjct: 315 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDDQFS------VVDFNHNVRTW 362 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + I ++ T L A + +A ++ + D Sbjct: 363 RNDLVSATKTQIADAKRYIEKIQPSGGTNINEALLRAIFILNEAS----NMGLLNPDSVS 418 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + ++ + ++++G+ + FLK ++ +R Sbjct: 419 LIILVSDGDPTVGELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQ 478 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 479 RIYGNQDTSSQLKKFYNQVST 499 >gi|126724455|ref|ZP_01740298.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] gi|126705619|gb|EBA04709.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] Length = 354 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 73/206 (35%), Gaps = 33/206 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFSS 225 + + ++++D S SM + + ++ + + + S + G Sbjct: 164 LPMSFVLLVDRSGSMAEIMPEVREAAKEFVAALPDTAECSVSSFAGDWDFSHRG----PE 219 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + P + + + G TT L AY + +++ +K Sbjct: 220 GALTCKPENFAF-------DNIQPGGTTNIYGPLREAYGWLSESERTDH---------QK 263 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGVQAEAADQFLKNCASPDRFYS 344 +I LTDG + ++L ++A Y + ++ D++L++ A D ++S Sbjct: 264 AVILLTDGRANDDAASESQTLAMKDDA-------YTFVYYMGDSDDRWLRSLA--DNYFS 314 Query: 345 VQN--SRKLHDAFLRIGKEMVKQRIL 368 S +L F + Q +L Sbjct: 315 GGGHVSAQLERYFNVVSDAYSAQTVL 340 >gi|118101296|ref|XP_425758.2| PREDICTED: similar to tumor endothelial marker 8 [Gallus gallus] Length = 552 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 71/199 (35%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ + R ++ +R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNEIYHFVEHLARKF------------ISPQLRMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ A Sbjct: 89 GTILMRLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENVHGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GA VY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRELGATVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + L I K+ Sbjct: 200 PVNDGFEALQGIIDSILKK 218 >gi|21707832|gb|AAH34341.1| Inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 76/201 (37%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDDQFS------VVDFNHNVRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + I ++ T L A + +A ++ + D Sbjct: 359 RNDLVSATKTQIADAKRYIEKIQPSGGTNINEALLRAIFILNEAS----NMGLLNPDSVS 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + ++ + ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|74227570|dbj|BAE21837.1| unnamed protein product [Mus musculus] Length = 950 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 76/201 (37%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ + Sbjct: 315 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDDQFS------VVDFNHNVRTW 362 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + I ++ T L A + +A ++ + D Sbjct: 363 RNDLVSATKTQIADAKRYIEKIQPSGGTNINEALLRAIFILNEAS----NMGLLNPDSVS 418 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + ++ + ++++G+ + FLK ++ +R Sbjct: 419 LIILVSDGDPTVGELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQ 478 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 479 RIYGNQDTSSQLKKFYNQVST 499 >gi|3024068|sp|Q61703|ITIH2_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|695634|emb|CAA49842.1| inter-alpha-inhibitor H2 chain [Mus musculus] gi|122889675|emb|CAM13914.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 76/201 (37%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDDQFS------VVDFNHNVRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + I ++ T L A + +A ++ + D Sbjct: 359 RNDLVSATKTQIADAKRYIEKIQPSGGTNINEALLRAIFILNEAS----NMGLLNPDSVS 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + ++ + ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|188994155|ref|YP_001928407.1| hypothetical protein PGN_0291 [Porphyromonas gingivalis ATCC 33277] gi|188593835|dbj|BAG32810.1| conserved hypothetical protein [Porphyromonas gingivalis ATCC 33277] Length = 1228 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 76/231 (32%), Gaps = 33/231 (14%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 ++ E + +D + I + R++ FA+++ + Sbjct: 56 LIQAEIVYQSVSEHSDLVISPVNEIRPANRFPSHRKSFFAENLRASPPVVPV-------- 107 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 AV +Y +P P N+ L IT+ + + +D S SM Sbjct: 108 ------AVDKYAVPVANPMDPENPNAWDVTLKITTKAVTVPVDVVMV-----IDQSSSMG 156 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 G + +L A S + + + VR LV++ + + + + Sbjct: 157 ---GQNIARLKSAIASGQRFVKKMLPKGMATEGVRIALVSYDHEPHRLSDFTKDTAFLCQ 213 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 KI L T + GL+ A N + K+II ++DG Sbjct: 214 KIRALTPIWGTHTQGGLKMARNIMA-----------TSTAVDKHIILMSDG 253 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 13/61 (21%), Positives = 23/61 (37%), Gaps = 3/61 (4%) Query: 305 SLFYCNEAKRRGAIVYAIGVQAEA---ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 ++ AK G ++ IG A+ LK A+ + + L AF I + Sbjct: 361 AINEAQFAKNSGYTIHTIGYDLGDFALANNSLKLTATDENHFFTATPANLAAAFDNIAQT 420 Query: 362 M 362 + Sbjct: 421 I 421 >gi|148726250|emb|CAN88322.1| matrilin 1 [Danio rerio] gi|148726498|emb|CAN88268.1| matrilin 1 [Danio rerio] Length = 277 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 80/231 (34%), Gaps = 27/231 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + L +++ + D++ ++D S S+ + Sbjct: 3 LPGFVMLLCILGAQATVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSVRPS------EFEQ 56 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GS 251 + +++D + PD R G+V ++S++ L + + ++++ + Sbjct: 57 VKVFLAKVIDGLSVGPDA---TRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLST 113 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + +++A N F E + D K I +TDG D Sbjct: 114 GTMTGLAIQFAMNVAFSEAE----GGRKSPDISKVAIIVTDGRPQDNIRD------IAAR 163 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN---SRKLHDAFLR 357 A+ G ++AIGV + + P D V++ KL F Sbjct: 164 AREAGIEIFAIGVGRVDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|308493174|ref|XP_003108777.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] gi|308248517|gb|EFO92469.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] Length = 425 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 71/202 (35%), Gaps = 34/202 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI-----REMLDIIKSIPDVNNVVRSG 219 + LD+++VLD S + + + D + + + VR Sbjct: 234 TGCELDLVLVLDFSTTTDPVYNSYKDLSKRLVSQLKIGPHYTQVAAVTFATVGRTRVRFN 293 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L ++++ + + I++L G TT G+E A +I +++ IA Sbjct: 294 LKKYTTQ-----------EEVLRGIDKLQSKGGTTAIGAGIEKALTQIDESEGARPGIAT 342 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------ADQF 332 K +I TDG ++ K +A G +Y + A A ++ Sbjct: 343 ------KVMIVFTDGWSNKGPDPEKR----AKDAVNAGFEMYTVAYTARAPNSVTLNNET 392 Query: 333 LKNCA-SPDRFYSVQNSRKLHD 353 L + S ++ + L D Sbjct: 393 LSAISGSSGHAFTDVTFQSLVD 414 >gi|320537260|ref|ZP_08037220.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145888|gb|EFW37544.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 319 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 70/217 (32%), Gaps = 21/217 (9%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 I + S + + I F W A + S KI S G+ ++ + Sbjct: 30 INAAYSFAQHKKSILPFLIIRNILFAFAWIFIVLAAAGPLWGS-KIKSVRRQGVSVVFAV 88 Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+S SM +L +A + + + L+ V + PL + Sbjct: 89 DISKSMTLKDVKP-SRLNLAKGFCEFLTVKLSNAS-------CALLAVKGDSVLSVPLTF 140 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + I L S T S L+ A K + + K II TDGE Sbjct: 141 EHEVLLKAIESLSPSSYTASGTNLQKALLK-------AAAVFPKNRATAKTIILCTDGEQ 193 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 S NI L E +R G + +G Sbjct: 194 SEGNI-----LEAAKEIQRHGIQLIIVGFGTIEGGDV 225 >gi|223670960|dbj|BAH22727.1| complement factor B precursor [Nematostella vectensis] Length = 708 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 71/212 (33%), Gaps = 31/212 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S GLD++ V D S S+ + D + R ++D I + R ++ Sbjct: 246 SSGAAGLDLVFVFDSSASVGE------DNFRKGIQFARTIIDEF-GISATPSGTRVAVIV 298 Query: 223 FSSKIVQTFPLAWGV-----QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 FS F L + ++ N G T + L+ + + Sbjct: 299 FSDAAQVIFNLKSNRIVDKEEAVRRLENLQFQGGGTATKLALQAVIDTVTPELRNNS--- 355 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG--AIVYAIGVQAEAADQFLKN 335 KK + +TDG+++ ++ + R G ++AIGV LK+ Sbjct: 356 ------KKALFLITDGKSNKGGSPDRPAKVL-----RAGFNFEIFAIGVSDSVDKDELKS 404 Query: 336 CASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 AS Y +++ L I + Sbjct: 405 IASEPFRTHVYQIKDYATLVKLKELITTKGTD 436 >gi|240255542|ref|NP_476505.3| collagen alpha-3(VI) chain isoform 2 precursor [Homo sapiens] gi|193787261|dbj|BAG52467.1| unnamed protein product [Homo sapiens] Length = 1036 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 111 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 168 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 169 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 228 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 229 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 276 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 277 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 333 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 334 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 383 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 384 FNPSLVYLMDDFSSL 398 >gi|254414094|ref|ZP_05027862.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] gi|196179230|gb|EDX74226.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] Length = 601 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/233 (13%), Positives = 76/233 (32%), Gaps = 28/233 (12%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG---LDMMMVLDVSLSMNDHFGPG 188 ++ F+ + ++ V ++++ L++ +V+D S SM Sbjct: 3 APQIEFVPLRDAVSTEALTTLDVLVKIVPPQPEANLKRPELNLGLVIDRSGSMAGK---- 58 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKINR 246 K+ A ++ + + R + + + P LA +I +I R Sbjct: 59 --KIAYARQAACYAVQQLLGSD------RVSVTIYDDIVETLIPSTLATEKNYITRQIER 110 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + + T L + + K + + I L+DG + + Sbjct: 111 IHPRNMT----ALHDGWVE----GGKQVSQYLNPEGLNRVI-LLSDGLANKGQTNADAIA 161 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLR 357 ++G +GV + + L+ A+ +Y + +L + F Sbjct: 162 SDVYGLAQQGVSTTTMGVGDDYNEDLLEVMANSGDGNYYYIDTPEQLPEIFQT 214 >gi|85374662|ref|YP_458724.1| von Willebrand factor type A domain-containing protein [Erythrobacter litoralis HTCC2594] gi|84787745|gb|ABC63927.1| von Willebrand factor type A domain protein [Erythrobacter litoralis HTCC2594] Length = 580 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 81/233 (34%), Gaps = 27/233 (11%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW ++ + + I +++ ++DVS SM DKL + ++ Sbjct: 189 TPWNEDTRLIRIGLAG-YDIERSERPPANLVFLMDVSGSMGRP-----DKLPLVKTALAG 242 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 + ++ V+ VV +G + I+ +N+L G +T G++ Sbjct: 243 LAGELQPQDKVSIVVYAGAAG------LVLEPTNDTRKIRAALNQLQAGGSTAGGAGIQL 296 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 AY D + +G + +I TDG+ + + + + G + Sbjct: 297 AYQIAED------NFIEGGVNR---VILATDGDFNVGVSSRDALIEMIEKKRDSGITLTT 347 Query: 322 IGVQAEA-ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 +G + ++ A + + ++ +A +G EM K Sbjct: 348 LGFGTGNYNEAMMEQIANHGNGNYAYIDSA---LEAKKVLGDEMSSTLFTIAK 397 >gi|166363914|ref|YP_001656187.1| hypothetical protein MAE_11730 [Microcystis aeruginosa NIES-843] gi|166086287|dbj|BAG00995.1| hypothetical protein MAE_11730 [Microcystis aeruginosa NIES-843] Length = 581 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 46/326 (14%), Positives = 95/326 (29%), Gaps = 43/326 (13%) Query: 55 SLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL 114 + + + + L + + + K + ++ R L + D + Sbjct: 273 ASVASVYESLVIAANSQAGSNQTRYQAVYPKATFSSNMRAILAHAPWISDREKEAAEKVI 332 Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK---------- 164 I+ + + R +P + + A P S + Sbjct: 333 EFILLPETQQIATDLGLRPGVPGVALGSKFSAEFGVNPQPTYDSYRSPQPEVVEAMLKSW 392 Query: 165 ---SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + +V+D S SM KL ++ + + R L+ Sbjct: 393 QNYAKKPSQVAVVIDTSGSMEGQ------KLTSVKNTLLNYVQNLGPKE------RIALI 440 Query: 222 TFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +F+S I + + G E I +L T+ YA N + A Sbjct: 441 SFNSVINEPVIIEGTPQGRDRGIEFIGQLRSSGGTRLYDSALYARNWLSQNLRTDTINA- 499 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE---AKRRGAIVYAIGVQAEA--ADQFL 333 ++ LTDGE+S I+ + + + + + IG E Q L Sbjct: 500 --------VLILTDGEDSGSQINLDQLEQELQKSGFSSDQRIAFFTIGYGKEGEFNPQAL 551 Query: 334 KNCAS-PDRFYSVQNSRKLHDAFLRI 358 + A +Y + + + Sbjct: 552 QKIAEVNGGYYRQGDPATISTVMGDL 577 >gi|126314401|ref|XP_001377042.1| PREDICTED: similar to Procollagen, type VI, alpha 2, partial [Monodelphis domestica] Length = 762 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 61/165 (36%), Gaps = 20/165 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P+ + R G+V +S Sbjct: 353 GALDIVFVIDSSESIG---YTNFSLEKNFVINVVNRLGSITKDPNSDTGTRIGVVQYSHD 409 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L G +E++ +L T + L Y YN++ + + Sbjct: 410 GTFEAIKLDDGRIGSLAQFKEEVKKLEWIAGGTWTPSALNYTYNELIKGSRRKKTRV--- 466 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P + + C+ K +V AIG+ Sbjct: 467 -----FAVVITDGRH-DPRDNEQSLKALCD--KVENVVVTAIGIG 503 >gi|239781743|pdb|2WIN|I Chain I, C3 Convertase (C3bbb) Stabilized By Scin gi|239781744|pdb|2WIN|J Chain J, C3 Convertase (C3bbb) Stabilized By Scin gi|239781745|pdb|2WIN|K Chain K, C3 Convertase (C3bbb) Stabilized By Scin gi|239781746|pdb|2WIN|L Chain L, C3 Convertase (C3bbb) Stabilized By Scin Length = 507 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 2 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 58 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 59 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 115 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 116 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 175 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 176 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 218 >gi|194384366|dbj|BAG64956.1| unnamed protein product [Homo sapiens] Length = 1266 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 763 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 819 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 820 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 876 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 877 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 936 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 937 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 979 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 13/110 (11%) Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +FS + T P Q ++ N G+ T + L Y + + L Sbjct: 160 TSFSHMLGATNP----TQKTKDHEN----GTGTNTYAALNSVYLMMNNQMRLLGMETMAW 211 Query: 281 DDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 + + II LTDG+++ D+ + N+ + +YAIGV Sbjct: 212 QEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVG 261 >gi|168983786|emb|CAQ06837.1| complement factor B [Homo sapiens] gi|168984885|emb|CAQ08426.1| complement factor B [Homo sapiens] Length = 589 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|134105218|pdb|2OK5|A Chain A, Human Complement Factor B Length = 752 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 246 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 302 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 303 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 359 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 360 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 419 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 420 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 462 >gi|57209925|emb|CAI41860.1| complement factor B [Homo sapiens] Length = 764 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|13278732|gb|AAH04143.1| Complement factor B [Homo sapiens] gi|14124934|gb|AAH07990.1| Complement factor B [Homo sapiens] gi|62898361|dbj|BAD97120.1| complement factor B preproprotein variant [Homo sapiens] gi|119623955|gb|EAX03550.1| complement factor B [Homo sapiens] gi|123982996|gb|ABM83239.1| complement factor B [synthetic construct] gi|123997681|gb|ABM86442.1| complement factor B [synthetic construct] gi|307685187|dbj|BAJ20524.1| complement factor B [synthetic construct] Length = 764 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|229838599|ref|ZP_04458758.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229895667|ref|ZP_04510838.1| putative membrane protein [Yersinia pestis Pestoides A] gi|229899165|ref|ZP_04514308.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229901807|ref|ZP_04516929.1| putative membrane protein [Yersinia pestis Nepal516] gi|229681736|gb|EEO77830.1| putative membrane protein [Yersinia pestis Nepal516] gi|229687567|gb|EEO79640.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229694965|gb|EEO85012.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229701473|gb|EEO89501.1| putative membrane protein [Yersinia pestis Pestoides A] Length = 437 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 72/200 (36%), Gaps = 26/200 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S +++ +V+D S SM+ G ++K A ML+I ++ +V Sbjct: 55 STRRSPINLALVIDRSTSMS---GERIEKAREAAILAVNMLNITDTLS---------VVA 102 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P + I + + T G+ ++ +H+ + Sbjct: 103 YDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQV------DKHLNRE 156 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + II ++DG+ ++ E A ++G + IG+ + + + A Sbjct: 157 QVNR---IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGY 213 Query: 338 SPDRFYSVQNSRKLHDAFLR 357 S V NS L AF + Sbjct: 214 SDGNHTFVANSADLEKAFTK 233 >gi|13560705|gb|AAK30167.1|AF349679_1 factor B [Homo sapiens] Length = 621 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|291922|gb|AAA16820.1| complement factor B [Homo sapiens] gi|2347133|gb|AAB67977.1| complement factor B [Homo sapiens] Length = 764 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|67782358|ref|NP_001701.2| complement factor B preproprotein [Homo sapiens] gi|584908|sp|P00751|CFAB_HUMAN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=Glycine-rich beta glycoprotein; Short=GBG; AltName: Full=PBF2; AltName: Full=Properdin factor B; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|4261689|gb|AAD13989.1|S67310_1 complement factor B [Homo sapiens] gi|297569|emb|CAA51389.1| complement factor B [Homo sapiens] gi|25070931|gb|AAN71991.1| B-factor, properdin [Homo sapiens] gi|55961819|emb|CAI17456.1| complement factor B [Homo sapiens] gi|123857994|emb|CAM25864.1| complement factor B [Homo sapiens] gi|168984418|emb|CAQ09274.1| complement factor B [Homo sapiens] gi|168985079|emb|CAQ07483.1| complement factor B [Homo sapiens] gi|168985957|emb|CAQ07113.1| complement factor B [Homo sapiens] Length = 764 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 318 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|326922323|ref|XP_003207399.1| PREDICTED: collagen alpha-2(VI) chain-like [Meleagris gallopavo] Length = 1022 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 16/213 (7%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ F+ HA + K+D + + V+D S S+ P + Sbjct: 9 FQQAFLSTLLCVALVPLHAQFDDEPVTSCTEKTDCPISVYFVIDTSESIALQTVPIQSLV 68 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRS---GLVTFSSKIVQTFPLAWGVQHIQEKINRLI- 248 + I ++ +++ N V + G + +S + PL K+ + Sbjct: 69 DQIKQFIPRFIEKLENEVYQNQVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAINY 128 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + + + + K+ + +TDG + + Sbjct: 129 LGRGTFTDCAISNMTQQFQSQTARDV----------KFAVVITDGHVTGSPCGGMK--MQ 176 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 A+ G ++A+ + +Q L+ ASP Sbjct: 177 AERARDMGIKLFAVAPSEDVYEQGLREIASPPH 209 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 37/165 (22%), Positives = 59/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+M V+D S S+ V S L I P R G+V +S + Sbjct: 612 GALDIMFVIDSSESIGYTNFTLEKNFVVNVVS---RLGSIAKDPKSETGARVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + RL T + L++AYNK+ + + Sbjct: 669 GTFEAIKLDDERINSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREK------ 722 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + + +TDG P D+K C R +V IG+ Sbjct: 723 --AQVFAVVITDGR-YDPRDDDKNLGALC----GRDVLVNTIGIG 760 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 70/172 (40%), Gaps = 14/172 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ +LD S + + + + + L + + D N R L+ + Sbjct: 828 TQRPVDIVFLLDGSERIGEQ---NFHRAHHFVEQVAQQLTLARRNDDNMNA-RIALLQYG 883 Query: 225 SKIVQT--FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ Q FPL + + I + ++ S++ + +A N I + + +A+ + Sbjct: 884 SESEQNVVFPLTYNLTEISNALAQIKYLDSSSNIGSAIIHAINNIVLSPGNGQRVARRNA 943 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + +F+TDG S N++ N K++ + + + ++ L Sbjct: 944 ELS--FVFITDGITGSKNLE-----EAINSMKKQDVMPTVVALGSDVDMDVL 988 >gi|223694808|gb|ACN18090.1| von Willebrand factor type A [uncultured bacterium BLR5] Length = 347 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 84/232 (36%), Gaps = 37/232 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++D+ L + +++DVS S D+ A I ++ K ++ Sbjct: 83 TFQRETDLPLSIALLIDVSASEERTLP---DEKAAARSFIETIIRSSK--------DQAA 131 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLE 274 ++ F+ L V I + + RL + + G+ I + +E Sbjct: 132 IIPFTDYAYLEQGLTPNVLAIYQALQRLEVALPSYVGSGRKISGISSGPGTIANPREGST 191 Query: 275 HI------------AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 I + ++ II LTDG+++S + ++ ++A ++YAI Sbjct: 192 AIWDAVTVSAGEILTRSPGRRRRAIILLTDGQDTSSRVTRGTAI---DKALEAETVIYAI 248 Query: 323 GVQAEA----ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 G+ L N A + R + + L F I KE+ Q +L Sbjct: 249 GIGDSKYEGIDKGALNNVAERTGGRAFFPKRGADLTSVFTEIEKELRSQYLL 300 >gi|196231437|ref|ZP_03130295.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224290|gb|EDY18802.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 725 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 29/169 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G D+++ +D S SM +LG A + ++++ + R GL+ F+ Sbjct: 85 KRQGRDILIAIDCSRSMLSTDLAP-SRLGRAKLATQDLISQLTG-------DRVGLIAFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 PL + + ++ L T I A + + Sbjct: 137 GTAFLQAPLTIDYGAVLDSVSELDTNIIPRGGTN-----------IAAAITEADAAFGKG 185 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + + +I TDGE + + + ++ +G+ + Sbjct: 186 ESDNRCLIIFTDGEELESDAVAAAAAE------KDHMRIFTVGLGSADG 228 >gi|149034205|gb|EDL88975.1| inter-alpha trypsin inhibitor, heavy chain 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 899 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 72/198 (36%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L +K + + ++V G S Sbjct: 281 TNMSKNLVFVIDISGSMEGQ------KVKQTKEALLKILGDMKPVDNF-DLVLFGSQVQS 333 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + R T GL + A+ ++ Sbjct: 334 WKGSLVPASHANLQAAQDFVRRFSLAGATNLNGGLLRGIEILNRAQGSHPELSSPAS--- 390 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDGE + D + L A R +Y +G + FL+ + + Sbjct: 391 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSIENNGWA 449 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 450 QRIYEDHDATQQLQGFYN 467 >gi|313151177|ref|NP_001100761.2| inter-alpha-trypsin inhibitor heavy chain H1 [Rattus norvegicus] gi|149034204|gb|EDL88974.1| inter-alpha trypsin inhibitor, heavy chain 1 (predicted), isoform CRA_a [Rattus norvegicus] gi|187469475|gb|AAI66831.1| Itih1 protein [Rattus norvegicus] Length = 904 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 72/198 (36%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L +K + + ++V G S Sbjct: 281 TNMSKNLVFVIDISGSMEGQ------KVKQTKEALLKILGDMKPVDNF-DLVLFGSQVQS 333 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + R T GL + A+ ++ Sbjct: 334 WKGSLVPASHANLQAAQDFVRRFSLAGATNLNGGLLRGIEILNRAQGSHPELSSPAS--- 390 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDGE + D + L A R +Y +G + FL+ + + Sbjct: 391 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSIENNGWA 449 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 450 QRIYEDHDATQQLQGFYN 467 >gi|301767378|ref|XP_002919104.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Ailuropoda melanoleuca] Length = 1011 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 59/180 (32%), Gaps = 14/180 (7%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D +++ VLD S S M + + R L + V R G Sbjct: 37 PEKADCPINVYFVLDTSESVTMQSPIDSLLYHMKQFVRQFISQLQDETYLEQVALSWRYG 96 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS + P + + ++ T + L +I + +K Sbjct: 97 GLHFSDVVRVFSPPDSDRASFTKSLESIVSIRKGTFTDCALANMTQEI------RQLKSK 150 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G + + +TDG + + A+ G ++ + +Q L++ AS Sbjct: 151 GGVH---FAVVITDGYVTGSPCGGIK--LQAERAREEGIRIFTVAPDQVPNEQGLRDMAS 205 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 57/165 (34%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 610 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 666 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AYN++ + + Sbjct: 667 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRRQKTRV--- 723 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ CN V AIG+ Sbjct: 724 -----FAVVITDGRH-DPRDDDLNLRALCNH----DVTVTAIGIG 758 >gi|281349285|gb|EFB24869.1| hypothetical protein PANDA_021744 [Ailuropoda melanoleuca] Length = 493 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 35/147 (23%), Positives = 60/147 (40%), Gaps = 14/147 (9%) Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLE 260 D++K P++ +R +T+S++ L I ++RL T G + Sbjct: 79 DVVKKFPNLK--MRVSFITYSTQGHTLMELTSDRNKIHNSLSRLKNIKPTGATNMHEGFK 136 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A E++E G ++ II LT G + + +E+ +A+ GA VY Sbjct: 137 KA-------NEQIEQENAGGNNAASLIIALTTGPLTPKAL--QETKSEAEKAREMGAKVY 187 Query: 321 AIGVQAEAADQFLKNCASPDRFYSVQN 347 +GV+ DQ D+ Y V N Sbjct: 188 CVGVKDYRKDQLDAIVGRKDQMYGVGN 214 >gi|224065787|ref|XP_002190547.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Taeniopygia guttata] Length = 1068 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 71/186 (38%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 221 DMVIIVDVSGSVSGL------TLKLMKTSVYEMLDTLSDDDYVN------VASFNEKAKP 268 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 269 VSCFKHLVQANIRNKKVFKEDVQGMVAKGTTDYKAGFEYAFDQLQNSNITRANCN----- 323 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F + + V+ V D L+ CA+ Sbjct: 324 --KMIMMFTDG-----GEDRVQDVFEKYKWPNKTVRVFTFSVGQHNYDVTPLQWMACANK 376 Query: 340 DRFYSV 345 ++ + Sbjct: 377 GYYFEI 382 >gi|156409369|ref|XP_001642142.1| predicted protein [Nematostella vectensis] gi|156229283|gb|EDO50079.1| predicted protein [Nematostella vectensis] Length = 182 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 75/205 (36%), Gaps = 29/205 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +D++ +D S S+ G IR +++ R + +SS Sbjct: 3 NTNIDLVFAIDASSSV------GKVNFERVKGFIRRLVESFHI---SRTSTRVAAIVYSS 53 Query: 226 KIVQTFPLAWGVQ--HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F + RL F T + L A +++F + Sbjct: 54 RPRVAFDFNRYTSARRAAHAVKRLRFLRGGTSTGRALRLASSRLF---------RRYGRK 104 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDR 341 +K ++ +TDG++S + ++L KR+G ++A+GV + L AS P + Sbjct: 105 RRKVLMLITDGKSSDDVLKPSKAL------KRKGVQIFAVGVGMSVSRNELILIASHPSQ 158 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQR 366 Y + L + ++ + + Sbjct: 159 VYQA-SFTSLSAIVKSLARKTCESK 182 >gi|291455286|ref|ZP_06594676.1| von Willebrand factor [Streptomyces albus J1074] gi|291358235|gb|EFE85137.1| von Willebrand factor [Streptomyces albus J1074] Length = 422 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 77/211 (36%), Gaps = 30/211 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------- 223 + +VLDVS SM G ++ A ++ +++D + +V +R+ + Sbjct: 35 VELVLDVSGSMKTRDIDGQSRMSAAKQAFNDVIDAV--PEEVELGIRTLGADYPGEDKAR 92 Query: 224 ----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + ++ P+ + + L T P L A + + + Sbjct: 93 GCKDTRQLYPVGPI--DRTEAKTAVATLSPTGWTPIGPALLGAADDLDGDEGG------- 143 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--- 336 + I+ ++DGE++ +D E AK ++ +G+ A + +C Sbjct: 144 ----SRRIVLISDGEDTCGPLDPCEVAREI-AAKGVDLVIDTLGLVPNAKIRQQLSCIAG 198 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A+ + +VQ+ L D ++ + Sbjct: 199 ATGGTYTAVQHKEDLSDKVKQLVDRAADPVV 229 >gi|254779584|ref|YP_003057690.1| hypothetical protein HELPY_0994 [Helicobacter pylori B38] gi|254001496|emb|CAX29512.1| Conserved hypothetical protein [Helicobacter pylori B38] Length = 214 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 79/205 (38%), Gaps = 14/205 (6%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-I 227 + + ++LD S SM+ G ++GV I++M++ +K + +VTF Sbjct: 15 IPVFLLLDTSGSMSHPLGNS-TRIGVLNLCIQKMIETLKQEAKKELFSKMAIVTFGENGA 73 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 V P ++++ + L T A + I D YK Y Sbjct: 74 VLHTPFD-DIKNVNFE--PLSTSGGTPLDQAFRLAKDLIED------KDTFPTKFYKPYS 124 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDRFYSVQ 346 I ++DGE ++ S F+ + + ++ ++I + EA Q K+ FY Sbjct: 125 ILVSDGEPNNDKWQEPLSSFHHD-GRSAKSVCWSIFIGDREANPQVNKDFGKDGVFY-AD 182 Query: 347 NSRKLHDAFLRIGKEMVKQRILYNK 371 + KL F + + + K K Sbjct: 183 DVEKLVKLFEIMTQTISKGSASIKK 207 >gi|239983463|ref|ZP_04705987.1| hypothetical protein SalbJ_28780 [Streptomyces albus J1074] Length = 423 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 77/211 (36%), Gaps = 30/211 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------- 223 + +VLDVS SM G ++ A ++ +++D + +V +R+ + Sbjct: 36 VELVLDVSGSMKTRDIDGQSRMSAAKQAFNDVIDAV--PEEVELGIRTLGADYPGEDKAR 93 Query: 224 ----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + ++ P+ + + L T P L A + + + Sbjct: 94 GCKDTRQLYPVGPI--DRTEAKTAVATLSPTGWTPIGPALLGAADDLDGDEGG------- 144 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--- 336 + I+ ++DGE++ +D E AK ++ +G+ A + +C Sbjct: 145 ----SRRIVLISDGEDTCGPLDPCEVAREI-AAKGVDLVIDTLGLVPNAKIRQQLSCIAG 199 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A+ + +VQ+ L D ++ + Sbjct: 200 ATGGTYTAVQHKEDLSDKVKQLVDRAADPVV 230 >gi|229816811|ref|ZP_04447093.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] gi|229785827|gb|EEP21941.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] Length = 1185 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/139 (21%), Positives = 50/139 (35%), Gaps = 13/139 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SMN++ + A +++L S V+ +VTFS++ Sbjct: 492 DIVLVMDKSGSMNEN-NRDANAQKAAKDLAKKLLTGTNSKLPPEQQVQMAVVTFSTEASL 550 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 V I + T L+ A + G KK+IIF Sbjct: 551 KQKFTTNVSEINNAVRG-NPDGGTNWEAALKQANDMQG-----------GRRGVKKHIIF 598 Query: 290 LTDGENSSPNIDNKESLFY 308 L+DG + Y Sbjct: 599 LSDGNPTYRTTSYSGCYSY 617 >gi|84498148|ref|ZP_00996945.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381648|gb|EAP97531.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 533 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 71/212 (33%), Gaps = 33/212 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + + MV+D S SM+ ++LG+ S+ + ++ +VT+ + Sbjct: 184 PVALTMVVDTSGSMD-----IRERLGLVKSSLALL------AENLRPDDTIAIVTYQTDA 232 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 I I+RL G +T GL Y++ +A ++ Sbjct: 233 TPLLEPTPVRDTDTILAAIDRLEAGGSTNLEAGLLLGYDQAREAYKQGATN--------- 283 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA-SPDRFY 343 ++ +DG + D + RRG + +G +D ++ A D FY Sbjct: 284 VVLLASDGVANVGVTDGGRLATAIRDNGRRGIHLVTVGYGMGNYSDHLMEQLADQGDGFY 343 Query: 344 SVQNS---------RKLHDAFLRIGKEMVKQR 366 ++ L + K+ Q Sbjct: 344 EYIDTFEEARKLFVEDLRATLTPVAKDAKIQV 375 >gi|320014437|gb|ADV98008.1| putative membrane protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 437 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 72/200 (36%), Gaps = 26/200 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S +++ +V+D S SM+ G ++K A ML+I ++ +V Sbjct: 55 STRRSPINLALVIDRSTSMS---GERIEKAREAAILAVNMLNITDTLS---------VVA 102 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P + I + + T G+ ++ +H+ + Sbjct: 103 YDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQV------DKHLNRE 156 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + II ++DG+ ++ E A ++G + IG+ + + + A Sbjct: 157 QVNR---IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGY 213 Query: 338 SPDRFYSVQNSRKLHDAFLR 357 S V NS L AF + Sbjct: 214 SDGNHTFVANSADLEKAFTK 233 >gi|224080732|ref|XP_002192406.1| PREDICTED: anthrax toxin receptor 1 [Taeniopygia guttata] Length = 516 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 72/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ + R ++ +R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNEIYHFVEHLARKF------------ISPQLRMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G E A +I+ A Sbjct: 89 GTILMRLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENVHGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GA VY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYSE--REANRSRDLGATVYCVGVKDFNETQLARIADSRDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIDSILKK 218 >gi|162147499|ref|YP_001601960.1| hypothetical protein GDI_1715 [Gluconacetobacter diazotrophicus PAl 5] gi|161786076|emb|CAP55658.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 571 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 18/67 (26%), Positives = 27/67 (40%), Gaps = 7/67 (10%) Query: 308 YCNEAKRRGAIVYAIGVQAEAA------DQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGK 360 C+ K G +Y I E L+NCAS P +Y + + AF +G Sbjct: 503 VCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNYYDAPTAASMKQAFSDLGG 562 Query: 361 EMVKQRI 367 ++ RI Sbjct: 563 QLSALRI 569 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 70/170 (41%), Gaps = 16/170 (9%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 KGS+SI+ A+ + + + +E + + V+ +L LD + + A ++ N Sbjct: 6 RKGSVSIVMAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSAVNNVGTC 65 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + I+ +F + + NG ++ S +I Q+ ++ + Sbjct: 66 TGSCASDTTA----IFWANFSSAHQANGLGP-----FQAVSTGPVITPQNAS-TITIQAN 115 Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 ++P +F + S + +S++G+++ +VLD + S+ Sbjct: 116 VQLPLLF------TKILGVSQIALSEHAQAVRSNMGMELALVLDNTDSLE 159 >gi|50355939|ref|NP_598499.1| anthrax toxin receptor 2 precursor [Mus musculus] gi|68052321|sp|Q6DFX2|ANTR2_MOUSE RecName: Full=Anthrax toxin receptor 2; Flags: Precursor gi|49901393|gb|AAH76595.1| Anthrax toxin receptor 2 [Mus musculus] gi|74141652|dbj|BAE38584.1| unnamed protein product [Mus musculus] gi|74217872|dbj|BAE41940.1| unnamed protein product [Mus musculus] Length = 487 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 51/242 (21%), Positives = 82/242 (33%), Gaps = 36/242 (14%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 SR P + + P + + + S D+ VLD S S+ +++ + Sbjct: 6 SRARSPGSWLFPGLWLLAVGGPGSLLQAQEQPSCKK-AFDLYFVLDKSGSVANNWIEIYN 64 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRL 247 + T V+ +R + FSS+ PL I E + + Sbjct: 65 FVHQLTERF------------VSPEMRLSFIVFSSQATIILPLTGDRYKIGKGLEDLKAV 112 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T GL+ A +I +A II LTDG +D + Sbjct: 113 KPVGETYIHEGLKLANEQIQNAGGLKASS---------IIIALTDG-----KLDGLVPSY 158 Query: 308 YCNEAKRR---GAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 NEAK+ GA VY +GV Q + S D+ + V+ A I ++ Sbjct: 159 AENEAKKSRSLGASVYCVGVLDFEQAQLERIADSKDQVFPVKGG---FQALKGIINSILA 215 Query: 365 QR 366 Q Sbjct: 216 QS 217 >gi|332283431|ref|YP_004415342.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] gi|330427384|gb|AEC18718.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] Length = 585 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 80/213 (37%), Gaps = 36/213 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D++++ D S SM G++K+ A + + E+ +KS P+ N GL+ + Sbjct: 22 QADDDVLIIYDASGSM-WGQVDGVNKIVTARKVMGEL---VKSWPENTN---LGLIAYGH 74 Query: 226 KIVQT--------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + P + +N + T + L+ A + + + +H A Sbjct: 75 RSAGSCSDIETMIEPQRVDRDAFIKTVNAITPKGKTPISASLKQAADVL----QYRDHNA 130 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLKN 335 ++ ++DG S + E K +G + +G + + Sbjct: 131 T--------VVLISDGLESCHG----DPCAVAAELKEKGVDFKAHVVGFDLDQEGNEALS 178 Query: 336 CA---SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 C + F N+ +L DA ++ ++V++ Sbjct: 179 CIAKNTGGIFVPASNADELQDALQQVQAKVVQK 211 >gi|315186713|gb|EFU20471.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 331 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 77/243 (31%), Gaps = 57/243 (23%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+ + P + D++++ D+S SM P +L VA Sbjct: 59 FLLLGIAGLLVAYAEPFWGMEPETV---KRRNADIVVLFDISRSMLVRDVPP-SRLDVAK 114 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GST 252 ++ I R G+V F K PL + +++ I L Sbjct: 115 EIALMLVSRISGA-------RWGVVAFKGKGELLLPLTPDLLGLEDAIGLLSPVLLRSPG 167 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GL A + + ++ +I L+DGE + I L A Sbjct: 168 TDVASGLSRALEAFPE-----------QSNRQRLVILLSDGEALTGEIGPVLEL-----A 211 Query: 313 KRRGAIVYAIGVQAEAADQ------------------------FLKNCA--SPDRFYSVQ 346 + G V+ +G+ E+ LK A + RF+SV+ Sbjct: 212 RNLGVAVHTVGIGTESGGPVPLEGEEVLKKPSGEPVISRLDASLLKRIAEITGGRFFSVE 271 Query: 347 NSR 349 N+ Sbjct: 272 NAE 274 >gi|300776752|ref|ZP_07086610.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] gi|300502262|gb|EFK33402.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] Length = 335 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 71/202 (35%), Gaps = 32/202 (15%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGV 194 P ++ S LL S S++ + M LDVS SM + P Sbjct: 59 PALYLLATLFLIFSIIDLLNGSEEVKSTQKLNNVIFM--LDVSNSMNAEDIDPS-----R 111 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--- 251 T + M+ +K + N + G+V F+ + PL + I+ + S Sbjct: 112 LTEAKNLMMATMKKM----NNDKIGIVIFAGNAMSIMPLTTDYNSAETYISGIETSSMQI 167 Query: 252 -TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T G++ A K K + ++ L+DGE++ N + L Sbjct: 168 QGTDFLKGMQAAVEKF-----------KNVSKGSRKVVLLSDGEDNEGNDNAAIRL---- 212 Query: 311 EAKRRGAIVYAIGVQAEAADQF 332 A + G + ++G+ + Sbjct: 213 -ANKEGVSITSVGIGTDEGAPV 233 >gi|298713908|emb|CBJ33776.1| conserved unknown protein [Ectocarpus siliculosus] Length = 977 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 71/218 (32%), Gaps = 28/218 (12%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 C S I + + +++ +++D S S+ D G + + A + Sbjct: 46 CGAGSELTFQIEGETSVEANK---VNVAVIIDSSGSIFDIDGAFLLEKEFAK-------N 95 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTF-PLAWGVQ-HIQEKI-NRLIFGSTTKSTPGLEY 261 + S N G +++S +G + E + N T GL Sbjct: 96 VAASFAAKNLFTNGGTASYASFSDAASDGGTFGSEAEFNEFVDNASWIEGDTNIEAGLSK 155 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + + G ++I +TDG+ + + + A+ G IVYA Sbjct: 156 GRELLAN----------GTSTRTSFLILITDGDWNRGG----DPQIEADAARDEGTIVYA 201 Query: 322 IGVQAE-AADQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 +GV + + L + N +L + I Sbjct: 202 VGVGPDVSEATLLSIGGDLTNVFDASNFTELDNTLDEI 239 >gi|239943867|ref|ZP_04695804.1| hypothetical protein SrosN15_22911 [Streptomyces roseosporus NRRL 15998] gi|291447330|ref|ZP_06586720.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] gi|291350277|gb|EFE77181.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] Length = 396 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 58/193 (30%), Gaps = 26/193 (13%) Query: 174 VLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRSGLVTFSSKIVQT 230 VLD S SM L T RE + + +P + V VR+ V Sbjct: 218 VLDTSGSMKGRRLAQLKSALNGLTGDFRER-EQVTLLPFGSTVKQVRT-------HTVDP 269 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 G I+ L T L AY+ + E I+ + Sbjct: 270 ADPKAGPAAIRADAAALSAEGDTAIYSSLAAAYDHLGPDTESAFTS----------IVLM 319 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAI-VYAIGVQAEAADQFLKNCA-SPDRFYSVQNS 348 TDGEN++ + FY + R V+ + + A + R + Sbjct: 320 TDGENTAGRSAAEFGAFYRALPEARRVTPVFPVVFGDSDRSELEAIAALTGGRLFDGTKE 379 Query: 349 R---KLHDAFLRI 358 L AF I Sbjct: 380 EGPGSLDGAFEEI 392 >gi|238755460|ref|ZP_04616800.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] gi|238706301|gb|EEP98678.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] Length = 465 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 53/297 (17%), Positives = 91/297 (30%), Gaps = 34/297 (11%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N KG I I I LP ++ L+ + + K KL L+ L Sbjct: 25 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALA---------- 74 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + N + + L NNI + + + Sbjct: 75 ---LTAENNAKNDTRNNELISAYINFYLGHRHQLTQYNNITVNYQQNPDRLYHTQLSQYH 131 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + E P +F + H +I S +D++ V D S SM F Sbjct: 132 IDANIEQPTLFPFTSLLID--HDNFIIGGSAAAIKDV-PAMDVVFVTDFSGSMEGDFHNP 188 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 D ++ + + K D+ + ++FS P +WG + K L Sbjct: 189 DDPEVLSKLDELKRI-FFKIADDIYTANKDSTISFS-------PFSWGTKSADNKKCSLH 240 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 F + NKI+ +KY+I +T+ + I+N + Sbjct: 241 F---------MPKEKNKIYPIPSNEIERNTEAHQ-EKYMIAITENIDYLATIENIGT 287 >gi|224065915|ref|XP_002191423.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain H3, partial [Taeniopygia guttata] Length = 869 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 42/275 (15%), Positives = 95/275 (34%), Gaps = 28/275 (10%) Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF--AQDINNIERSTSLSIIIDDQHK 123 + G + + F ++N + F + F D + S S++ D Sbjct: 178 EPQGIAELEAEGTFITNELQNTIKKTFSGKKGHISFKPTLDQQRTCANCSESVLDGDFTV 237 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 Y++ + + + F ++ P + +++ VLD S SM+ Sbjct: 238 RYDVKRTTPDNLQIVNGYFVHFFAPTNLP-------------KLSKNIIFVLDTSGSMSG 284 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQE 242 ++ ++ ++LD IK N ++ ++ + +++ P + ++ Sbjct: 285 ------REIEQTKEALLKILDDIKEDDFFNIILFDSEISTWKETLIKATPE--NLDEARK 336 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 + + T GL + + A E+ + II LTDG+ + + Sbjct: 337 FVQHISAQGLTNLHGGLMRGIDILNAAHEENLVPKRSAS----IIIMLTDGQPNVGLSNT 392 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 E +A +Y +G + FL+ A Sbjct: 393 HEIENAVKKAIDGRYTLYNLGFGSGVDYGFLERMA 427 >gi|317455060|pdb|2XWB|F Chain F, Crystal Structure Of Complement C3b In Complex With Factors B And D gi|317455061|pdb|2XWB|H Chain H, Crystal Structure Of Complement C3b In Complex With Factors B And D Length = 732 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 83/230 (36%), Gaps = 34/230 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI +++ +VLD S S G A + + +++ + S R G Sbjct: 226 KIVLDPSGSMNIYLVLDGSGS------IGASDFTGAKKCLVNLIEKVASYGVKP---RYG 276 Query: 220 LVTFSSK----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAK 270 LVT+++ + + + + +++N + S T + L+ Y+ + Sbjct: 277 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPD 336 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYA 321 + +G + + II +TDG ++ ID L Y + ++ VY Sbjct: 337 DVP---PEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYV 393 Query: 322 IGVQAEAADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 GV + AS + V++ L D F ++ E + Sbjct: 394 FGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 443 >gi|251837060|pdb|3HRZ|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837064|pdb|3HS0|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837068|pdb|3HS0|I Chain I, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|317455073|pdb|2XWJ|I Chain I, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455074|pdb|2XWJ|J Chain J, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455075|pdb|2XWJ|K Chain K, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455076|pdb|2XWJ|L Chain L, Crystal Structure Of Complement C3b In Complex With Factor B Length = 741 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 83/230 (36%), Gaps = 34/230 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI +++ +VLD S S G A + + +++ + S R G Sbjct: 235 KIVLDPSGSMNIYLVLDGSGS------IGASDFTGAKKCLVNLIEKVASYGVKP---RYG 285 Query: 220 LVTFSSK----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAK 270 LVT+++ + + + + +++N + S T + L+ Y+ + Sbjct: 286 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPD 345 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYA 321 + +G + + II +TDG ++ ID L Y + ++ VY Sbjct: 346 DVP---PEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYV 402 Query: 322 IGVQAEAADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 GV + AS + V++ L D F ++ E + Sbjct: 403 FGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 452 >gi|52545928|emb|CAH56139.1| hypothetical protein [Homo sapiens] Length = 1222 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 312 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 369 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 370 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 429 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 430 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 477 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 478 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 534 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 535 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 584 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 585 FNPSLVYLMDDFSSL 599 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 59 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 118 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 119 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 168 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 169 FSFGLGAQAASRAELQHIATDDN 191 >gi|13471293|ref|NP_102862.1| hypothetical protein mll1222 [Mesorhizobium loti MAFF303099] gi|14022037|dbj|BAB48648.1| mll1222 [Mesorhizobium loti MAFF303099] Length = 638 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 80/217 (36%), Gaps = 24/217 (11%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 S + + PW A + + I I +++ ++DVS SM++ Sbjct: 232 SVSTPFNSTVSVMPTPWNAQTKLMHVAIKG-FDIKPTEQPKANLVFLIDVSGSMDEP--- 287 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKIN 245 DKL + + R ++ +++ ++ +VT++ + + I I+ Sbjct: 288 --DKLPLLKSAFRLLVSKLRADDTIS------IVTYAGEAGTVLMPTRAAEKDKILNAID 339 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 L G +T G++ AY KL + D + + TDG+ + D+ + Sbjct: 340 NLTPGGSTAGEAGIKEAY--------KLAQQSFVKDGVNRVM-LATDGDFNVGQSDDDDL 390 Query: 306 LFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 + ++ G + G + DQ ++ A Sbjct: 391 KRLIEQERKSGVFLSVFGFGHDNLNDQMMQTIAQNGN 427 >gi|310799477|gb|EFQ34370.1| von Willebrand factor type A domain-containing protein [Glomerella graminicola M1.001] Length = 698 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 74/224 (33%), Gaps = 19/224 (8%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P + + + + D+++V+DVS SM + + A + Sbjct: 61 PLASKDGLIAKITPPTQPTEPTDHVPCDIVLVIDVSGSMGCNAPVPANPGEKAENYGLSV 120 Query: 203 LDIIK-----SIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTT 253 LD++K + +N+ R G+VTF+SK L ++ IN + T Sbjct: 121 LDLVKHAARTVLETLNDGDRLGIVTFASKAKVLQKLTPMDAKNKALAEKIINGMRPDDAT 180 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 GL E ++ LTDG + N+ + + Sbjct: 181 NLWHGLLEGIKLFNTCGEMNMGRVPA-------MMVLTDGMPNHM-CPNQGYVPKLRGME 232 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 + A ++ G LK+ A + + ++ + F Sbjct: 233 QLSASIHTFGFGYSLRSGLLKSIAEIGGGNYSFIPDAGMIGTVF 276 >gi|209545606|ref|YP_002277835.1| hypothetical protein Gdia_3496 [Gluconacetobacter diazotrophicus PAl 5] gi|209533283|gb|ACI53220.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 568 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 18/67 (26%), Positives = 27/67 (40%), Gaps = 7/67 (10%) Query: 308 YCNEAKRRGAIVYAIGVQAEAA------DQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGK 360 C+ K G +Y I E L+NCAS P +Y + + AF +G Sbjct: 500 VCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNYYDAPTAASMKQAFSDLGG 559 Query: 361 EMVKQRI 367 ++ RI Sbjct: 560 QLSALRI 566 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 70/170 (41%), Gaps = 16/170 (9%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 KGS+SI+ A+ + + + +E + + V+ +L LD + + A ++ N Sbjct: 3 RKGSVSIVMAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSAVNNVGTC 62 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + I+ +F + + NG ++ S +I Q+ ++ + Sbjct: 63 TGSCASDTTA----IFWANFSSAHQANGLGP-----FQAVSTGPVITPQNAS-TITIQAN 112 Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 ++P +F + S + +S++G+++ +VLD + S+ Sbjct: 113 VQLPLLF------TKILGVSQIALSEHAQAVRSNMGMELALVLDNTDSLE 156 >gi|189526999|ref|XP_691588.2| PREDICTED: anthrax toxin receptor 1 [Danio rerio] Length = 554 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 43/183 (23%), Positives = 64/183 (34%), Gaps = 27/183 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ VLD S S+ +H+ S E L + P ++R + FS+ Sbjct: 38 QGAFDLYFVLDKSGSIQNHWIE--------IYSFVEHLAEKFTSP----MLRMSFIVFST 85 Query: 226 KIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L I +N L I G T GLE A +I+ Sbjct: 86 RGTTIMRLTENRDDITRGLNTLKREIPGGDTYMNLGLEEANVQIYHGNYGAAS------- 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDGE + + A+ GAIVY +GV+ Q + + Sbjct: 139 ---VIIALTDGELNDHQFVTAQ--QEAQRARSMGAIVYCVGVKDFNETQLATIADTIEHV 193 Query: 343 YSV 345 + V Sbjct: 194 FPV 196 >gi|220925364|ref|YP_002500666.1| LPXTG-motif cell wall anchor domain-containing protein [Methylobacterium nodulans ORS 2060] gi|219949971|gb|ACL60363.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium nodulans ORS 2060] Length = 725 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 47/267 (17%), Positives = 86/267 (32%), Gaps = 40/267 (14%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 R +L+ +D L+ + F + L + + + + + Sbjct: 277 RRVTLADGPVPADRDIELTWTAAPARAPAIGLFRERVGTDEYLLAVVTPPEGQNLARRPR 336 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S SM + A S+ LD + R ++ F + + Q Sbjct: 337 DVTFVIDNSGSMAGA------SMRQAKASLLMALDRLAPAD------RFNVIRFDNTMDQ 384 Query: 230 TFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 FP A + + + L T+ L A + Sbjct: 385 LFPEAVPADERHLAVARSFVAALEARGGTEMLAPLTAALADPTPERTDRVRQ-------- 436 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 I+FLTDG I N+E +F A R + ++ IG+ + + Y+ Sbjct: 437 --IVFLTDG-----AIGNEEQIFSAIAAGRGRSRLFMIGIGSAPNAHLMT--------YA 481 Query: 345 VQNSRKLHDAFLRIGKEMVKQRILYNK 371 + R + A I + + R L K Sbjct: 482 AELGRGSYTAIGTIDQVAERMRELLTK 508 >gi|240255540|ref|NP_476506.3| collagen alpha-3(VI) chain isoform 3 precursor [Homo sapiens] Length = 1237 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 312 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 369 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 370 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 429 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 430 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 477 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 478 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 534 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 535 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 584 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 585 FNPSLVYLMDDFSSL 599 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 59 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 118 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 119 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 168 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 169 FSFGLGAQAASRAELQHIATDDN 191 >gi|194389238|dbj|BAG65607.1| unnamed protein product [Homo sapiens] Length = 1237 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 312 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 369 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 370 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAESRAAPLQGMLPGLLAPLRTLSGTPEVHS 429 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 430 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 477 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 478 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR- 534 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 535 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 584 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 585 FNPSLVYLMDDFSSL 599 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 57/142 (40%), Gaps = 11/142 (7%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 59 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 118 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G P+ D K+ Sbjct: 119 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAG----PSSDEIRYGVVA--LKQASVFS 170 Query: 320 YAIGVQAEAADQFLKNCASPDR 341 + +GVQA + + L++ A+ D Sbjct: 171 FGLGVQAASRAE-LQHIATDDN 191 >gi|268572467|ref|XP_002648969.1| Hypothetical protein CBG21291 [Caenorhabditis briggsae] Length = 427 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 73/202 (36%), Gaps = 34/202 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI-----REMLDIIKSIPDVNNVVRSG 219 + LD+++VLD S + + + D + + + + VR Sbjct: 236 TGCELDLVLVLDFSTTTDPVYNSYKDLSKRLVQQLKIGPHYTQVAAVTFATVGRTRVRFN 295 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L +S++ + + I++L G TT G+E A +I +++ IA Sbjct: 296 LKKYSTQ-----------EEVLRGIDKLQSKGGTTAIGAGIEKALTQIDESEGARPGIAT 344 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------ADQF 332 K +I TDG ++ K + +A G +Y + A A ++ Sbjct: 345 ------KVMIVFTDGWSNKGPDPEKRAR----DAVNAGFEMYTVAYTARAPGSVTLNNET 394 Query: 333 LKNCA-SPDRFYSVQNSRKLHD 353 L + S ++ + L D Sbjct: 395 LSAISGSSGHAFTDVTFQTLVD 416 >gi|126336622|ref|XP_001380249.1| PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Monodelphis domestica] Length = 923 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 72/202 (35%), Gaps = 17/202 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM K+ ++ ++L ++ N V+ G V F Sbjct: 274 VVFVIDKSGSMAG------RKMRQTREAMVQILGDLRPEDQFNLVIFDGHV-FQWMPALL 326 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + V+ ++ + + T + A + D+ K K +I L Sbjct: 327 QASSQNVEQAKKFTSLISAMGATNINDAVLLAVKMLDDSNRKE----KLPPGSVSMVILL 382 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-----RFYSV 345 TDG+ + + K+ A +Y +G + FL+ A + R Y Sbjct: 383 TDGDATDGETNPKKIQENVKAAIGGSYHLYCLGFGFDVNYAFLEKLALENGGVARRIYED 442 Query: 346 QNSR-KLHDAFLRIGKEMVKQR 366 +S +L D + + ++ + Sbjct: 443 SDSDLQLQDFYQEVANPLLTKV 464 >gi|145485516|ref|XP_001428766.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124395854|emb|CAK61368.1| unnamed protein product [Paramecium tetraurelia] Length = 947 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 60/351 (17%), Positives = 123/351 (35%), Gaps = 45/351 (12%) Query: 38 TSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIW-----QTDF 92 SH K L+ +S L +I N E ++ + D S + + + Sbjct: 622 ISHILQCKVLLNLEDSNSALA---EISNAEQLSDKYENSYDRSDAQVNDSFPIPPGILKQ 678 Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 R + F + ++I+++ + + K Y+ N + Sbjct: 679 RILYEKGLFIKRYDSIKKAAFIFTECLETSKFYDPEIRINCLKQLKEIFQ--SQNLLYKV 736 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 I ++++ + D++ V+D S SM + K +A I ++ D D Sbjct: 737 PKIEQLLELN-EIKKNNDIVFVIDHSGSMENI------KKELAINGILKIFDNYLQDQDR 789 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINR---LIFGSTTKSTPGLEYAYNKI 266 + +R F+ I F L +++ I R + T + +AY Sbjct: 790 ISYMR-----FNQNIEVIFDLTSKSENTAYLRSAIERSKNIRAEGMTAMLSAVLHAY--- 841 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 H D +++I+ L DGE++ NI + + +KR + IG+ Sbjct: 842 ------SIHEKAVKKDNQQWIVVLCDGEDNLSNITYERMKKF--TSKRPQISLIVIGIGL 893 Query: 327 ----EAADQFLKNCASPDRFYSVQN--SRKLHDAFLRIGKEMVKQRILYNK 371 + D+ C + + +++ S L AF I + +Y++ Sbjct: 894 SLKPDCLDELYDLCRLSQKGFLIESVYSEDLDIAFQSISNLIFGTSSIYDE 944 >gi|148680070|gb|EDL12017.1| mCG120740 [Mus musculus] Length = 752 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/200 (20%), Positives = 73/200 (36%), Gaps = 36/200 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM D+L + R+ + L I + N G+V F+ Sbjct: 309 VCLVLDVSGSMA-----SYDRLDLMNRAAKHFLSQI-----IENRSWVGMVHFNHLANIK 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + T G++ A+ + + + Sbjct: 359 SELIQMNSNIERNQLLQTL-PTSADGGTSICSGIKAAFQVFKNGGYETDGTE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRFY 343 I+ L+DGE+S+ +E K GAIV+ I + A+ A + Y Sbjct: 410 -ILLLSDGEDSTAKD-------CIDEVKDSGAIVHFIALGPSADLAVTNMSILTGGKHMY 461 Query: 344 SVQNSRK--LHDAFLRIGKE 361 + ++ L DAF+ + E Sbjct: 462 ASDEAQNNGLIDAFVALASE 481 >gi|124007374|ref|ZP_01692081.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987207|gb|EAY26947.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 1088 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 44/197 (22%), Positives = 75/197 (38%), Gaps = 23/197 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M++LDVS SM DKL + S + ++ I++ DV+ V+ +G + IV Sbjct: 914 LMLLLDVSGSM-----SSKDKLPLLKESFKYLISIMRPQDDVSIVIYAG----DAAIVLK 964 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 A + I I++L T G + AY + ++ + II Sbjct: 965 PTSASNQEQINAVIDKLRSRGKTNVKAGFKLAYKWMSKNFKEGGNNR---------IILA 1015 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 TDGE K K V++ G + + K A Y N+R Sbjct: 1016 TDGEFPISKYIYKLVEKRAT--KGINLSVFSFGSMTKKFETLEKLVAKGKGNYEQVNARN 1073 Query: 351 LHDAFLRIGKEMVKQRI 367 + ++ KE +R+ Sbjct: 1074 VK---YKLVKEAQSKRV 1087 Score = 42.9 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 51/126 (40%), Gaps = 24/126 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M++LDVS SM ++L + +++ +++I++ V+ +V F S+ Sbjct: 687 LMLLLDVSGSMK-------NELPMLKSALKYLVNIMRPEDKVS------VVVFGSEAKLM 733 Query: 231 FPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 I + I+ L T GL+ AY I + + + II Sbjct: 734 LRPTSAKYKAQIMQAIDTLKSSGRTNGEAGLKLAYQWIQNNYKNNNNNR---------II 784 Query: 289 FLTDGE 294 +DGE Sbjct: 785 LASDGE 790 >gi|90021002|ref|YP_526829.1| hypothetical protein Sde_1355 [Saccharophagus degradans 2-40] gi|89950602|gb|ABD80617.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 787 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 49/241 (20%), Positives = 87/241 (36%), Gaps = 47/241 (19%) Query: 138 IFCTFPWCANSSHAPLL--ITSSVKISSK----SDIGLDMMMVLDVSLSMNDHFGPGMDK 191 + A++ P L +S +I SK S D+ +V+DVS SM + D Sbjct: 11 LVLMLGLAASAGAQPELNLAQASQEIESKLADRSGKPADVRLVIDVSGSMKRN-----DP 65 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--------LAWGVQHIQEK 243 + ++ ++ +P+ + G+ TF + P + G + Sbjct: 66 ANLRQPAVDLLMQ---LLPEGSKA---GVWTFGKWVNMLVPHQVVDEQWRSLGRAK-ASE 118 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG---ENSSPNI 300 IN + G T LE A + A D+Y K+II LTDG + P+ Sbjct: 119 INSV--GLYTNIGEALEKAAYDLDAA----------SDEYAKHIILLTDGMVDIDKQPDK 166 Query: 301 DNKESLFYCNEA----KRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDA 354 + +E +E K G ++ + + A + LK + + + L Sbjct: 167 NTQEWRRIVDEVLPKLKAAGYTIHTVALSDNADNNLLKKLSLQTDGIASVAHTADDLMKI 226 Query: 355 F 355 F Sbjct: 227 F 227 >gi|313243983|emb|CBY14858.1| unnamed protein product [Oikopleura dioica] gi|313245509|emb|CBY40220.1| unnamed protein product [Oikopleura dioica] Length = 1393 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 72/209 (34%), Gaps = 19/209 (9%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P I S S +++ V+D S SM G +D+ A RS+ LD Sbjct: 769 LPPKIMLKSNAQSSSSAPYNVVFVMDKSGSM---IGTKLDQTKDAFRSMISSLDRNAKFS 825 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 V + +K+V+ + V+ + I+R+ G T L A Sbjct: 826 IV--GFNYATTAWRNKLVRAT--NYNVEEARSFISRISAGGGTNMHAALLDAIELCNSES 881 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG---AIVYAIGVQAE 327 I+F+TDG + + L +++++G + IG A Sbjct: 882 ---------SSTVPCMIMFMTDGTATVGVTEESRILADVTKSRQQGKANIALNVIGFGAG 932 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFL 356 + FL + + + Q + AF Sbjct: 933 ISYSFLSRLSVLNSGIARQIFEDTNAAFQ 961 >gi|239990323|ref|ZP_04710987.1| hypothetical protein SrosN1_23653 [Streptomyces roseosporus NRRL 11379] Length = 527 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 58/193 (30%), Gaps = 26/193 (13%) Query: 174 VLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRSGLVTFSSKIVQT 230 VLD S SM L T RE + + +P + V VR+ V Sbjct: 349 VLDTSGSMKGRRLAQLKSALNGLTGDFRER-EQVTLLPFGSTVKQVRT-------HTVDP 400 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 G I+ L T L AY+ + E I+ + Sbjct: 401 ADPKAGPAAIRADAAALSAEGDTAIYSSLAAAYDHLGPDTESAFTS----------IVLM 450 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAI-VYAIGVQAEAADQFLKNCA-SPDRFYSVQNS 348 TDGEN++ + FY + R V+ + + A + R + Sbjct: 451 TDGENTAGRSAAEFGAFYRALPEARRVTPVFPVVFGDSDRSELEAIAALTGGRLFDGTKE 510 Query: 349 R---KLHDAFLRI 358 L AF I Sbjct: 511 EGPGSLDGAFEEI 523 >gi|62881|emb|CAA39981.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 918 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 16/213 (7%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ F+ HA + K+D + + V+D S S+ P + Sbjct: 9 FQQAFLSTLLCVALVPLHAQFDDEPVTSCTEKTDCPISVYFVIDTSESIALQTVPIQSLV 68 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRS---GLVTFSSKIVQTFPLAWGVQHIQEKINRLI- 248 + I ++ +++ N V + G + +S + PL K+ + Sbjct: 69 DQIKQFIPRFIEKLENEVYQNQVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRY 128 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + + + + K+ + +TDG + + Sbjct: 129 LGRGTFTDCAISNMTQQFQSQTARDV----------KFAVVITDGHVTGSPCGGMK--MQ 176 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 A+ G ++A+ + +Q L+ ASP Sbjct: 177 AERARDMGIKLFAVAPSEDVYEQGLREIASPPH 209 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 37/165 (22%), Positives = 59/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+M V+D S S+ V S L I P R G+V +S + Sbjct: 612 GALDIMFVIDSSESIGYTNFTLEKNFVVNVVS---RLGSIAKDPKSETGARVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + RL T + L++AYNK+ + + Sbjct: 669 GTFEAIKLDDERINSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREK------ 722 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + + +TDG P D+K C R +V IG+ Sbjct: 723 --AQVFAVVITDGR-YDPRDDDKNLGALC----GRDVLVNTIGIG 760 >gi|45384382|ref|NP_990679.1| collagen alpha-2(VI) chain precursor [Gallus gallus] gi|115352|sp|P15988|CO6A2_CHICK RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|62877|emb|CAA33144.1| type VI collagen alpha-2 subunit preprotein [Gallus gallus] gi|62882|emb|CAA39982.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 1022 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 16/213 (7%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ F+ HA + K+D + + V+D S S+ P + Sbjct: 9 FQQAFLSTLLCVALVPLHAQFDDEPVTSCTEKTDCPISVYFVIDTSESIALQTVPIQSLV 68 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRS---GLVTFSSKIVQTFPLAWGVQHIQEKINRLI- 248 + I ++ +++ N V + G + +S + PL K+ + Sbjct: 69 DQIKQFIPRFIEKLENEVYQNQVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRY 128 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T + + + + K+ + +TDG + + Sbjct: 129 LGRGTFTDCAISNMTQQFQSQTARDV----------KFAVVITDGHVTGSPCGGMK--MQ 176 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 A+ G ++A+ + +Q L+ ASP Sbjct: 177 AERARDMGIKLFAVAPSEDVYEQGLREIASPPH 209 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 37/165 (22%), Positives = 59/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+M V+D S S+ V S L I P R G+V +S + Sbjct: 612 GALDIMFVIDSSESIGYTNFTLEKNFVVNVVS---RLGSIAKDPKSETGARVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + RL T + L++AYNK+ + + Sbjct: 669 GTFEAIKLDDERINSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREK------ 722 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + + +TDG P D+K C R +V IG+ Sbjct: 723 --AQVFAVVITDGR-YDPRDDDKNLGALC----GRDVLVNTIGIG 760 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 70/172 (40%), Gaps = 14/172 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ +LD S + + + + + L + + D N R L+ + Sbjct: 828 TQRPVDIVFLLDGSERIGEQ---NFHRAHHFVEQVAQQLTLARRNDDNMNA-RIALLQYG 883 Query: 225 SKIVQT--FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ Q FPL + + I + ++ S++ + +A N I + + +A+ + Sbjct: 884 SEREQNVVFPLTYNLTEISNALAQIKYLDSSSNIGSAIIHAINNIVLSPGNGQRVARRNA 943 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + +F+TDG S N++ N K++ + + + ++ L Sbjct: 944 ELS--FVFITDGITGSKNLE-----EAINSMKKQDVMPTVVALGSDVDMDVL 988 >gi|332246079|ref|XP_003272177.1| PREDICTED: complement factor B-like [Nomascus leucogenys] Length = 764 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 81/223 (36%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKP---RYGLVTYATY 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N++ S T + L+ Y+ + + Sbjct: 318 PRIWVKVSEQDSSNADWVTKQLNKINYEDHKLKSGTNTKKALQAVYSMMSWPDDIP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 435 NQVNINALASKKDNEQHVFKVKDMENLEDVFFQMIDESQSLSL 477 >gi|66793453|ref|NP_001019751.1| vitrin [Gallus gallus] gi|56744182|dbj|BAD81032.1| Akhirin [Gallus gallus] Length = 748 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 45/299 (15%), Positives = 94/299 (31%), Gaps = 41/299 (13%) Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 ++ + ++ + + NI+ E + N+ + + Sbjct: 467 VVVVMVDGWPTDRVEEASRLARESGINIFFVTI-----EAAAQNEKQNVIEPNFVDKAVC 521 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 + Y+++ S + + P L+ S ++S D+ V+D S Sbjct: 522 RTNGFYSITVPSWFSL--HKVVQPLVKRVCDIDRLVCSKTCLNSA-----DIGFVIDGSS 574 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGV 237 S+ + + K + R G + ++ + F Sbjct: 575 SVGTSNFRTVLQFVANIS---------KEFEISDTDTRIGAIQYTYEQRLEFSFDKYSTK 625 Query: 238 QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 Q + I R+ + T + + YA ++F K + +K +I +TDG + Sbjct: 626 QDVLSAIKRINYWSGGTSTGAAISYASEQLF---------TKSKPNKRKIMILITDGRSY 676 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + A + G I Y+IGV A D+ PD + V L+ Sbjct: 677 D------DVRMPALTAHQNGVIAYSIGVAWAAQDELEAIATDPDKEHSFFVDEFDNLYQ 729 >gi|152990153|ref|YP_001355875.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422014|dbj|BAF69518.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 549 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 68/203 (33%), Gaps = 34/203 (16%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + +F + P+ V++ S + ++ + LD+S SM D+L Sbjct: 53 RLTLLFMALFAMILAMARPVYQKGVVQVES---LSANVGIALDISNSMKATDYYP-DRLQ 108 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IF 249 A + I E + K++ L+ F+ + P + + + + L + Sbjct: 109 FAKKKIEEFIKASKNLN-------IALLAFADEAYIVSPPSSDKEALLYMLKHLDTESLA 161 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T L A + +K ++ TDG N ++ Sbjct: 162 LQGTNFLAALMSADMLLGKEG-------------QKSVVLFTDGGN------KEDFSKEI 202 Query: 310 NEAKRRGAIVYAIGVQAEAADQF 332 AK+RG V+ IG+ + Sbjct: 203 AFAKKRGIQVHIIGIGTQKGAPI 225 >gi|301611663|ref|XP_002935353.1| PREDICTED: LOW QUALITY PROTEIN: anthrax toxin receptor 1-like [Xenopus (Silurana) tropicalis] Length = 565 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 48/199 (24%), Positives = 71/199 (35%), Gaps = 27/199 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLG-VATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G D+ VLD S S+ H+ + +A R I L R + FS+ Sbjct: 39 GGFDLYFVLDKSGSVLHHWSEIFYFVEHLAQRFIGPQL-------------RMSFIVFST 85 Query: 226 KIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + I++ + L + G T G+E A +I+ H + Sbjct: 86 RGSTLMRLTEDREQIRQGLEELRKVLPGGDTYMHEGIERASEQIY-------HESIKGYR 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDGE E N ++ GA VY +GV+ Q + S D Sbjct: 139 TASVIIALTDGELHEDLFYYAE--REANRSRELGAQVYCVGVKDFNETQLARIADSKDHV 196 Query: 343 YSVQNS-RKLHDAFLRIGK 360 + V L D I K Sbjct: 197 FPVNGGFEALQDIIGSILK 215 >gi|126344397|ref|XP_001365113.1| PREDICTED: similar to calcium-dependent chloride channel-1, partial [Monodelphis domestica] Length = 660 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 50/208 (24%), Positives = 80/208 (38%), Gaps = 41/208 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM G D+L ++ + L I + +G+VTF S Sbjct: 63 LILVLDKSGSMA-----GGDRLNRLNQASQLFLLQI-----IEKGSWTGMVTFDSSATIQ 112 Query: 231 FPL---AWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L Q I+RL G T GL A+ I + Sbjct: 113 SALIQIETDAQR-NSLISRLPTAAGGGTSICSGLRTAFTVIKNKFSTDGSE--------- 162 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRF 342 I+ LTDGE+S ++ C +E K+ GAI++ + + A+ + L + Sbjct: 163 -IVLLTDGEDS--------TISSCFDEVKQSGAIIHTVALGPSADPGLEELAKMTGGMKT 213 Query: 343 YSVQNSRK--LHDAFLRI--GKEMVKQR 366 N++ L DAF + G + QR Sbjct: 214 SPTDNAQNNGLIDAFSALSSGNGAITQR 241 >gi|15965798|ref|NP_386151.1| putative signal peptide protein [Sinorhizobium meliloti 1021] gi|307311332|ref|ZP_07590975.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] gi|307318865|ref|ZP_07598297.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15075067|emb|CAC46624.1| Hypothetical signal peptide protein [Sinorhizobium meliloti 1021] gi|306895586|gb|EFN26340.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306899633|gb|EFN30261.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] Length = 444 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 15/76 (19%), Positives = 29/76 (38%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 L++R + G+ ++L AI I L I+ + +F K+ LD + L + Sbjct: 8 LHLRRLVRDRDGNFAVLGAIAFVPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSVRAY 67 Query: 64 LNQENGNNGKKQKNDF 79 ++ F Sbjct: 68 GEGATEDDAYDAAQKF 83 >gi|27376088|ref|NP_767617.1| hypothetical protein bll0977 [Bradyrhizobium japonicum USDA 110] gi|27349227|dbj|BAC46242.1| bll0977 [Bradyrhizobium japonicum USDA 110] Length = 754 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 47/296 (15%), Positives = 91/296 (30%), Gaps = 47/296 (15%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N + R+ + ++ + + +N R +L+ +D+ L+ Sbjct: 256 NAPANPTSITVRLKAGFALGEVKSH-HHSVKIESPDNATRIVTLADGAVPADRDFELTWK 314 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL--DMMMVLDVSLSMNDHFGPG 188 + F + L + + L +++ V+D S SM Sbjct: 315 PAAQKAPSVGLFREHVGDADYLLAFVTPPSAEQATQKPLPREVVFVIDNSGSMGG----- 369 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-----LAWGVQHIQEK 243 + A S+ L ++ R ++ F + FP A V Sbjct: 370 -TSIVQAKASLLYALGRLQPAD------RFNVIRFDDTMDVLFPASVPADAAHVGEATSF 422 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 ++ L T+ P + A G + ++FLTDG I N+ Sbjct: 423 VSALQARGGTEMVPAMRAALT-----------DKIGDTGMVRQVVFLTDG-----AIGNE 466 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 + LF A R + V+ +G+ + + + R AF IG Sbjct: 467 QQLFETITAMRGRSRVFMVGIGSAPNTYLMTRASELGR-----------GAFTHIG 511 >gi|225873423|ref|YP_002754882.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] gi|225793805|gb|ACO33895.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] Length = 339 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 78/216 (36%), Gaps = 26/216 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + ++ + L + +++D S S+ + + +++ L + V Sbjct: 108 TQQTQLPLRLGILVDTSTSIRE-------RFQFEQQAVTNFLLQVLRPKTDEAFVE---- 156 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + + + I L G T + A ++KL + A G Sbjct: 157 GFDEAPNFILNWSNNLDTLSSAIQDLHPGGGTALYDAVYSA------CRDKLLNAASGPI 210 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA----EAADQFLKNCA 337 ++ II ++DG+++ + +++ C A+ +YA+ + D L+ A Sbjct: 211 YVRRAIILVSDGDDNQSHAYLTDAIKECQRAQTA---IYAVSTDTDPTPDPGDDILRKMA 267 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + R + + L +F + E+ Q L K Sbjct: 268 EETGGRAFFPRVITNLPASFNSVEDELRSQYALVYK 303 >gi|78484444|ref|YP_390369.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362730|gb|ABB40695.1| Conserved hypothetical protein; predicted membrane protein with vWA domains and a TPR motif [Thiomicrospira crunogena XCL-2] Length = 651 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 55/183 (30%), Gaps = 20/183 (10%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLG 193 P T W T + G+D+++ LD S SM D Sbjct: 91 PKWLLTVAWVCLIVALAGPRTLVPAPQESTRAGVDILVALDTSRSMLVQDVSPNRFLLAK 150 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 S+ L+ R GL+ ++ + PL++ Q ++ + G Sbjct: 151 SLVESLANRLEPSD---------RLGLMVYAGRPHLVSPLSFDRTLFQHYLDLMRPGILP 201 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG--ENSSPNIDNKESLFYCNE 311 LE A D ++ + ++ LT+G E + N Sbjct: 202 TLGSQLESAIVFGADHLQQTAGK-------SQVLLVLTNGTPEPDQIVAVPEALKAVANT 254 Query: 312 AKR 314 A + Sbjct: 255 ATK 257 >gi|311897983|dbj|BAJ30391.1| hypothetical protein KSE_46100 [Kitasatospora setae KM-6054] Length = 455 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/237 (14%), Positives = 73/237 (30%), Gaps = 40/237 (16%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 P + ++ + ++LD S SM G ++ VA +SI ++ Sbjct: 29 TVLGTLPAHADEPGAPPAATEAP-KVDLILDGSGSMRTIDIQGKSRMEVAQQSIA---EV 84 Query: 206 IKSIPDVNNV-VRSGLVTF---------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 I ++P+ +R+ T+ + + + T Sbjct: 85 IDALPNETEFGIRTLGATYPGSDQKEGCKDTQQLYRVGKTNKVEAKTAVATVRPTGWTPI 144 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-- 313 L A + + I+ +TDGE++ D C+ A+ Sbjct: 145 GIALRAAAQDLGTG------------PTTRRIVLITDGEDTCAPPDP------CDVAREL 186 Query: 314 -RRGA--IVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +G +V +G+ + + C A+ F V+ +L ++ Sbjct: 187 ASQGIHLVVDTLGLAHDDKTRQQLICIANATGGTFTDVRTQEQLTKRVKQLVNRAQD 243 >gi|291395333|ref|XP_002714013.1| PREDICTED: integrin, alpha 1 [Oryctolagus cuniculus] Length = 1200 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 48/288 (16%), Positives = 102/288 (35%), Gaps = 50/288 (17%) Query: 107 NIERSTSL-SIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKIS- 162 ++ +TS+ ++ ++ + + VS F+ C + + H I S+V + Sbjct: 118 DLPVNTSIANVTEVKENMTFGSTLVSNPRGGFLACGPLYAYRCGHVHYTTGICSNVSPTF 177 Query: 163 ---------SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + LD+++VLD S S + T + ++L+ + P Sbjct: 178 QVVNSFAPVQECSTQLDIVIVLDGSNS--------IYPWESVTHFLNDLLERMDIGPKQT 229 Query: 214 NVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDA 269 G+V + + F L + + N++I T + G++ A + F Sbjct: 230 Q---VGIVQYGENVTHEFNLNKYSSTEEVLVAANKIIQRGGRQTMTALGIDTARKEAFTE 286 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV----- 324 K K ++ +TDGE + DN + + ++I + Sbjct: 287 ARGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYN 336 Query: 325 ----QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 AE + +K+ AS F++V + L +G+ + Sbjct: 337 RGNLSAEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFAL 384 >gi|260800527|ref|XP_002595180.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] gi|229280424|gb|EEN51192.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] Length = 419 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 62/174 (35%), Gaps = 24/174 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ VLD S S G D + + + P G++ +S++ Sbjct: 1 DIIFVLDGSGS------IGTDNFERIKTFVSKAVTRFNIGPTQTQ---IGVIQYSNQPQS 51 Query: 230 TFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L +Q+ I+ + T + L Y N F K K Sbjct: 52 EILLNDHQDAASLQQAISSINYLQGGTNTGKALRYLANNAFSGKNGAR------AGVSKV 105 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 I +TDG +S + + A + G ++YA+G+ Q L++ AS D Sbjct: 106 AIVVTDGRSSD------DVVRPALNAGKEGIVLYAVGIGGSVDYQELRDIASSD 153 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 63/202 (31%), Gaps = 29/202 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+ ++LD S G D + L+ D + R + + ++ Sbjct: 230 TPLDIAILLDGSD------GVSSDDFEAEKSFAKLFLNEFDIGQDNS---RVTVFQYGTE 280 Query: 227 IVQTFPLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 Q F L + + I G + Y F + + Sbjct: 281 PRQEFAL--DTYETDQDVQDAIADTEYMGGDRNLGQAIRYMATYGF-SGRNGARRSIPS- 336 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I +T GE+ E ++A+R G I+YAIGV L A+ Sbjct: 337 ----VAIIITGGESLD------EVASAASKARRSGIILYAIGVGNATVPAELAAIATTAN 386 Query: 342 F-YSVQNSRKLHDAFLRIGKEM 362 Y+ + L D + E+ Sbjct: 387 TSYAAASFAALKDLRGALADEI 408 >gi|198436264|ref|XP_002122997.1| PREDICTED: similar to HyTSR1 protein [Ciona intestinalis] Length = 1993 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 73/194 (37%), Gaps = 25/194 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++++D S S+ D M +L++++S + R G V +++ + Sbjct: 1807 MDIVLIVDSSSSIGDDNFELMRNF---------ILELVRSFNVSRDTTRIGYVRYNNAVD 1857 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + F L + + +QE I + + GS T + L YA A G Sbjct: 1858 ERFQLNTFNTSEEVQEAIRAVPYRGSGTLTGQALSYASRTSVRAPAGRRPGVPG------ 1911 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 I +TDG L R V AIG++ +Q + +++ Sbjct: 1912 VAIVITDGRAQDAVDAPARELQ-------RLMQVVAIGIRGAVPEQLNAIASQQGYVFNI 1964 Query: 346 QNSRKLHDAFLRIG 359 ++ +L + I Sbjct: 1965 EDFNRLDEVLGSIS 1978 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 68/200 (34%), Gaps = 28/200 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ V+D S S+ + ++ P+ R + ++ I Sbjct: 1612 VDVAFVIDSSSSIGPA------NFRTIRNFLIALVQRFSIGPEGA---RFAAIRYNRDIE 1662 Query: 229 QTFPLA--WGVQHIQEKINRLIFGS-TTKSTPGLEY-AYNKIFDAKEKLEHIAKGHDDYK 284 + L + E IN + + T + + Y A N + + + Sbjct: 1663 HLWNLDQYTTRAALIEGINNIPYNGVGTLTGAAINYTAENIFLPELGRRKGVP------- 1715 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K ++ LTDG + + L K I+ A+G+ DQ + + PD+ ++ Sbjct: 1716 KIVVVLTDGVSYDDVSIPSQRL------KSDNTIIVAVGIGRYDQDQINEIASDPDQDFA 1769 Query: 345 VQNS--RKLHDAFLRIGKEM 362 + L+ I ++ Sbjct: 1770 TTVAGFDGLNRVVTTISSQI 1789 >gi|170091408|ref|XP_001876926.1| predicted protein [Laccaria bicolor S238N-H82] gi|164648419|gb|EDR12662.1| predicted protein [Laccaria bicolor S238N-H82] Length = 886 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/281 (13%), Positives = 92/281 (32%), Gaps = 57/281 (20%) Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 +++ N K + + + ++ + GF Q +N+ + + Sbjct: 476 VTVVDDAIALNKKPESGQVTLA----VVDSNTSEKYLVTGFPQQVNDTVDKYTPDFFVTS 531 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 +Y + +P I W K + + LD+M ++D + S Sbjct: 532 HGGEY-------FFVPSIPTLTAWS--------------KAPTTTKSKLDIMFLIDATGS 570 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV--------QTFP 232 M+ P + + + +I + +++++ + +R GLV F Q + Sbjct: 571 MD----PYIKQASASIGTIYD--NVLRNGSWSKDDIRVGLVAFRDHPQKKATTFLTQKYD 624 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + + ++ L Y + + + A +DD + +TD Sbjct: 625 FTSDMSKVSDNLDALEAKDGED--------YPEASEDALEDALEADWNDDAVMVTVLITD 676 Query: 293 ------GEN----SSPNIDNKESLFYCNEAKRRGAIVYAIG 323 GE+ D + + + G ++Y +G Sbjct: 677 STPHATGESHDYFKDGCPDQNDPVDIADRMADLGIVLYVLG 717 >gi|291242943|ref|XP_002741339.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 958 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 66/200 (33%), Gaps = 34/200 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLDVS SM + KL A + + +++ + G VTFS Sbjct: 315 IVLVLDVSGSM--SLKSRITKLRQAVYTFI--------MDEISLGIDVGCVTFSDTATII 364 Query: 231 FPLA-----WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + + L T GL + + + + Sbjct: 365 SWLTPINSDEDREEFLALVMPTLNADGNTAIGSGLLTGLQVLSQNQTESVEGS------- 417 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS----PD 340 + +TDG+ + P + + + G +V + A ++ L+ AS Sbjct: 418 -IMFLVTDGQENVPPYIDDVT----DNIIESGVVVDTLAWGVFAEEK-LETIASGTKGSS 471 Query: 341 RFY-SVQNSRKLHDAFLRIG 359 +Y S +AF+ + Sbjct: 472 YYYSEQTQSNAHVEAFMEVA 491 >gi|145491133|ref|XP_001431566.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398671|emb|CAK64168.1| unnamed protein product [Paramecium tetraurelia] Length = 636 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 87/265 (32%), Gaps = 33/265 (12%) Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM-PFIFCTFPWCANSSHAPLLITSS 158 + I + + D ++ S Y+M + + L + Sbjct: 90 DDDEKIEPKKEDAKQNTNKYDLNEKLYFEVRSLYKMGKLLNSRTQYLPGIVSIKALDQAV 149 Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + +G+D++ ++D+S SM K+ + S+ +L + + R Sbjct: 150 TQNQKNQRVGVDLICLIDISGSMIGV------KIEMVKASLIVLLQFLG------DNDRL 197 Query: 219 GLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 L+TF + + PL + + I ++ + + + A+ ++ Sbjct: 198 QLITFDNDAHRLTPLKTVTNQNKSYFTQIIKQIKANGGNRISEATKMAFYQL------KS 251 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL- 333 + + L+DG + + K + NE ++ G + Q + Sbjct: 252 RKYINNVTS---VFLLSDGVD-YTYPEVKNQIQTVNEV----FTLHTFGFGEDHDAQMMT 303 Query: 334 KNC-ASPDRFYSVQNSRKLHDAFLR 357 + C FY VQ+ L + F Sbjct: 304 QLCNLKSGSFYFVQDVTLLDEFFAD 328 >gi|300783401|ref|YP_003763692.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299792915|gb|ADJ43290.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 535 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 69/195 (35%), Gaps = 21/195 (10%) Query: 174 VLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIV 228 VLD S SM D + L A S+ +S +V + +G + V Sbjct: 348 VLDTSGSMAGARIDSLRSALVGLTGADTSLTGRFRRFRSREEVTMLPFNTGPGAPRTFTV 407 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY-- 286 A + I+ L+ T L AY + D ++ Sbjct: 408 PEENPAAELAQIKTFAEGLVARGGTAIYDSLSRAYQVL---------EPLMAADPDRFTS 458 Query: 287 IIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFY 343 I+ +TDGEN++ + D SL A ++ V+ + ++D+ + + + + Sbjct: 459 IVLMTDGENANGSSLPDFLTSLASLPPAMKQ-VPVFTVLFGEGSSDELTQVATRTGGKVF 517 Query: 344 SVQNSRKLHDAFLRI 358 +N +L F I Sbjct: 518 DARNV-QLSRVFQEI 531 >gi|254430946|ref|ZP_05044649.1| structural toxin protein RtxA [Cyanobium sp. PCC 7001] gi|197625399|gb|EDY37958.1| structural toxin protein RtxA [Cyanobium sp. PCC 7001] Length = 2003 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 96/279 (34%), Gaps = 35/279 (12%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 ++ +G VI + H +D+++ + N +++ +I N Sbjct: 794 ILAGAVGQVIYLAQD-------HSAVDNAVTASLASGANDRATLQVNAD-GTYTFTLIDN 845 Query: 87 IWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCA 146 +D + + + ++I+++D D +A + Sbjct: 846 FLLSDPGDTTEQTESISSLVG-----GINILVEDGDGD---AANGTTGIALSLLVKDDIP 897 Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDI 205 + + IT S + G ++ +V+DVS SM + G GM ++ + S E++D Sbjct: 898 TA----VPITESGESFP---TGTNLFLVIDVSGSMANASGVDGMTRMQLQINSALELIDQ 950 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 +++ + V VTF++ F W ++ I L+ S T L N Sbjct: 951 YEALGPLKVNV----VTFATDASAPFSTTWQDADAVKTFIQTLVPTSRTNYDAALNLTIN 1006 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 A + F +DG + +I Sbjct: 1007 TFNGGTASDIDGATN------VLYFFSDGVPNENDISGT 1039 >gi|167525226|ref|XP_001746948.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774728|gb|EDQ88355.1| predicted protein [Monosiga brevicollis MX1] Length = 2718 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 76/233 (32%), Gaps = 21/233 (9%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 F + L ++ D++MV+DVS SM D+ + R Sbjct: 2319 LFAGRQGNPERALRYVQQYSSHAEYAEAADVLMVVDVSGSMTDYMEQARAFVRTIAREGF 2378 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQ--TFPL-AWGVQHIQEKINRLIFGSTTKSTP 257 + S + R L TF + + PL +Q+++ ++ T P Sbjct: 2379 HL----DSAASQH---RMALFTFGTTATALGSEPLFTSDWAQLQQRVAQIAVNGATNYLP 2431 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 LE + D ++ ++ ++F TDG NS N + A Sbjct: 2432 ALELVEQSLRD---LKASDPARYNASRRIVLFQTDGSNSDRNQTRAITATARRIVDELDA 2488 Query: 318 IVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 + A+ A +S + Y V SR+ G + K + + Sbjct: 2489 TLMAVLTGAG--------VSSSNVSYYVGRSRQFDSQDRTDGAALSKLILTVD 2533 >gi|332882611|ref|ZP_08450223.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332679411|gb|EGJ52396.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 547 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 76/213 (35%), Gaps = 26/213 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW N + + + KI +++ ++DVS SM++ +KL + S + + Sbjct: 162 PWNPNHLLLRIGL-QAKKIDLAKAPPSNIVFLIDVSGSMDEE-----NKLPLLQSSFKML 215 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLE 260 L + + +VT+++ P + I + ++ L T G++ Sbjct: 216 L------GQLRPDDKVAIVTYANGTKVALPSTSVKDKEKIIKVLDNLYASGGTSGGKGIQ 269 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + + K + II TDG+ + + + + + + G + Sbjct: 270 LAYEQAQKSFIKNGNNR---------IILATDGDFNIGINNTTDLEKFIEKQRESGIYMS 320 Query: 321 AIGV-QAEAADQFLKNCA--SPDRFYSVQNSRK 350 +G D + A + + N + Sbjct: 321 VLGFGMGNYRDDMAETIADKGNGNYAYIDNITE 353 >gi|11414924|dbj|BAB18554.1| voltage dependent calcium channel alpha2a/delta subunit [Rana catesbeiana] Length = 1102 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 78/221 (35%), Gaps = 45/221 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + ++ V F+S Sbjct: 254 DMLILVDVSGSVSGL------TLKLIRTSVTEMLETLSD----DDFVNVAA--FNSNAHD 301 Query: 230 TFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + ++E +N + TT G ++A++++ + + Sbjct: 302 VSCFHHLVQANVRNKKVLKEAVNNITAKGTTDYKQGFKFAFDQLRNTNVSRANCN----- 356 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG E+ N K + V+ V + C + Sbjct: 357 --KIIMLFTDG----GEDKATETFKLYN--KNKTVRVFTFSVGQHNYDKGPIQWMACENK 408 Query: 340 DRFYSV-------QNSRKLHDAFLR---IGKEMVKQRILYN 370 +Y + N+++ D R + +E KQ N Sbjct: 409 GYYYEIPSIGAIRINTQEYLDVLGRPMVLAREKAKQVQWTN 449 >gi|149701450|ref|XP_001492635.1| PREDICTED: anthrax toxin receptor 2 [Equus caballus] Length = 488 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 71/213 (33%), Gaps = 29/213 (13%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S + D+ VLD S S+ +++ D + T V+ + Sbjct: 31 VSAQEQPSCRGAFDLYFVLDKSGSVANNWIEIYDFVKQLTERF------------VSPQM 78 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FSS+ PL I + + L T GL+ A ++I A Sbjct: 79 RLSFIVFSSQATIILPLTGDRGKISQGLEDLKHVRPVGETYIHEGLKLANDQIQKA---- 134 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 G II LTDG+ + + ++ GA VY +GV Q Sbjct: 135 -----GGLKTSSIIIALTDGKLDG--LVPSYAEKEAKLSRSLGARVYCVGVLDFEQAQLE 187 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + S ++ + V A I ++ Q Sbjct: 188 RIADSKEQVFPVTGG---FQALKGIINSILAQS 217 >gi|198417199|ref|XP_002122571.1| PREDICTED: similar to MGC81791 protein, partial [Ciona intestinalis] Length = 847 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 67/187 (35%), Gaps = 23/187 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ VLD S S+ D G + + +++ PD R G+V ++ + Sbjct: 35 DLVFVLDASSSVGDQ------DFGRVRKWVSDLVATFDIGPD---YTRVGVVVYAEEPEM 85 Query: 230 TFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + + + + T++ + + + F I G Y + Sbjct: 86 AIALNQYTDRDSLIQAVGNITYLNGNTRTGKAIRFMNEESFSIANGARDIEFG---YNRL 142 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I LTDG + EA+ G +YA+GV ++ + + PD Sbjct: 143 AIVLTDGRAQDNVFNPSL------EAQNNGIQLYAVGVSTAVVEELNEIASDPDSRHVMQ 196 Query: 345 VQNSRKL 351 V + + + Sbjct: 197 VDDFQAI 203 >gi|91082539|ref|XP_973726.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Tribolium castaneum] Length = 842 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 79/235 (33%), Gaps = 56/235 (23%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + ++ VLD S SM+ + ++ ++ +L +K DV N+V + Sbjct: 305 LPKQVIFVLDTSGSMDGN------RIKQLKEAMNSILSELKK-EDVFNIVEFSSIVKVWN 357 Query: 227 IV----------------------------QTFPLAW-----GVQHIQEKINRLIFGSTT 253 + Q P A+ + ++ + +L T Sbjct: 358 VDKVQVDYEVGEDPWPLYDSPEAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLNAYGGT 417 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 LE + KE E + I+FLTDGE + + ++ +E Sbjct: 418 DIKSALEVGLKLVKKNKENKEDAHQP------IIVFLTDGEPTMGETNTEKITSAISEM- 470 Query: 314 RRG---AIVYAIGVQAEAADQFLKNCA-----SPDRFYSVQNSR-KLHDAFLRIG 359 G A ++++ A +FL+ + Y ++ +L + + +I Sbjct: 471 NSGETRAPIFSLSFGDGADREFLQKISLKNLGFARHIYEAADASLQLQEFYKQIS 525 >gi|55958063|emb|CAI12958.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 935 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 70/201 (34%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N +R+ Sbjct: 300 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRT------W 347 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + I ++ T L A + +A Sbjct: 348 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS---- 403 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + E + ++++G+ + FLK ++ + Sbjct: 404 LIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQ 463 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 464 RIYGNQDTSSQLKKFYNQVST 484 >gi|70778918|ref|NP_002207.2| inter-alpha-trypsin inhibitor heavy chain H2 [Homo sapiens] gi|229462889|sp|P19823|ITIH2_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; AltName: Full=Inter-alpha-trypsin inhibitor complex component II; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|55958062|emb|CAI12957.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 946 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 70/201 (34%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N +R+ Sbjct: 311 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRT------W 358 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + I ++ T L A + +A Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + E + ++++G+ + FLK ++ + Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|47522638|ref|NP_999089.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Sus scrofa] gi|3024032|sp|Q29052|ITIH1_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|565283|dbj|BAA07632.1| inter-alpha-trypsin inhibitor heavy-chain H1 [Sus scrofa] Length = 902 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 77/206 (37%), Gaps = 17/206 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +++ V+D+S SM K+ ++ ++L +K D ++V G S Sbjct: 279 TKLNKNVVFVIDISSSMEGQ------KVKQTKEALLKILSDLKP-GDYFDLVLFGSAVQS 331 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + + + +T GL + A+ L + Sbjct: 332 WRGSLVQASTANLDAARSYVRQFSLAGSTNLNGGLLRGIEILNKAQGSLPEFSNRAS--- 388 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDGE + D + L +A R +Y +G + FL+ A + Sbjct: 389 -ILIMLTDGEPTEGVTDRSQILKNVRDAIRGRFPLYNLGFGHDVEWNFLEVRALENNGRA 447 Query: 342 ---FYSVQNSRKLHDAFLRIGKEMVK 364 + ++++L + ++ ++K Sbjct: 448 QRIYEDHDSAQQLQGFYDQVANPLLK 473 >gi|119606784|gb|EAW86378.1| inter-alpha (globulin) inhibitor H2, isoform CRA_b [Homo sapiens] gi|124376332|gb|AAI32686.1| Inter-alpha (globulin) inhibitor H2 [Homo sapiens] gi|158256194|dbj|BAF84068.1| unnamed protein product [Homo sapiens] Length = 946 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 70/201 (34%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N +R+ Sbjct: 311 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRT------W 358 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + I ++ T L A + +A Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + E + ++++G+ + FLK ++ + Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|270007560|gb|EFA04008.1| hypothetical protein TcasGA2_TC014157 [Tribolium castaneum] Length = 805 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 79/235 (33%), Gaps = 56/235 (23%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + ++ VLD S SM+ + ++ ++ +L +K DV N+V + Sbjct: 246 LPKQVIFVLDTSGSMDGN------RIKQLKEAMNSILSELKK-EDVFNIVEFSSIVKVWN 298 Query: 227 IV----------------------------QTFPLAW-----GVQHIQEKINRLIFGSTT 253 + Q P A+ + ++ + +L T Sbjct: 299 VDKVQVDYEVGEDPWPLYDSPEAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLNAYGGT 358 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 LE + KE E + I+FLTDGE + + ++ +E Sbjct: 359 DIKSALEVGLKLVKKNKENKEDAHQP------IIVFLTDGEPTMGETNTEKITSAISEM- 411 Query: 314 RRG---AIVYAIGVQAEAADQFLKNCA-----SPDRFYSVQNSR-KLHDAFLRIG 359 G A ++++ A +FL+ + Y ++ +L + + +I Sbjct: 412 NSGETRAPIFSLSFGDGADREFLQKISLKNLGFARHIYEAADASLQLQEFYKQIS 466 >gi|89098949|ref|ZP_01171829.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] gi|89086353|gb|EAR65474.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] Length = 940 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 57/164 (34%), Gaps = 22/164 (13%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 +T K ++ + +D++ V D S SMND K A ++ ++ K Sbjct: 58 GSIDFHLTPKGKATNANRDPIDVVFVFDKSGSMNDSGKNPQ-KFQSAKDAMTAAVNFFKE 116 Query: 209 IPDVNNVVRSGLVTFSSKIVQ----TFPLAWGVQH---IQEKINRLIFGSTTKSTPGLEY 261 N+ R G V F + F + I N L T T L+ Sbjct: 117 NAGPND--RFGFVPFDDDVETGKVVNFAPENNMASLNLINSNSNSLSALGGTNYTQSLDA 174 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 A G+ KY++F+TDGE + + + + Sbjct: 175 ALGMF------------GNSTNNKYVLFMTDGEPTFSKVIERTT 206 >gi|239817564|ref|YP_002946474.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] gi|239804141|gb|ACS21208.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] Length = 1867 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 34/150 (22%), Positives = 54/150 (36%), Gaps = 19/150 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M++LD+S SM +L A +I+ ++D D VR LVTFS+ Sbjct: 1424 LMVILDLSGSMGQETPT---RLSRAKEAIQNLIDGYDLYGD----VRVQLVTFST--TGA 1474 Query: 231 FPLAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 AW + + L +T L A N + + F Sbjct: 1475 SQQAWMTAAEAKALVQNLQAAGSTNYDAALAAAMNGFSATGKLDGAQNVSY--------F 1526 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAI 318 LTDGE + + + + N + RG Sbjct: 1527 LTDGEPTLGDGNTAQLANSSNSSTADRGIQ 1556 >gi|115535038|ref|NP_509469.2| hypothetical protein K09E2.1 [Caenorhabditis elegans] gi|90568060|gb|AAC46572.2| Hypothetical protein K09E2.1 [Caenorhabditis elegans] Length = 915 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 79/222 (35%), Gaps = 27/222 (12%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F F +C +S ++T ++ LD++++ D S + F Sbjct: 1 MLFRVFSFCVFASFWRFVVTIDLQKEGICPPVLDIIILFDTSGGNDTVFE---------- 50 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST-- 252 + + I++ +P + VR GLV +S F L+ I + L F Sbjct: 51 QQKNWTIKIVRDLPIHEDAVRVGLVQYSESAKTEFNLSKYSERNDIIAHMETLTFMQVED 110 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T++ L A +IFD A + II TDG + + Sbjct: 111 TRTGVALNKADEEIFDFNGGARLKAT------RLIIIFTDGLSMD------KPSKAAKAL 158 Query: 313 KRRGAIVYAIGVQAEA-ADQFLKNCASPDRFYSVQNSRKLHD 353 +R+G +Y I V + + L D + + ++ + Sbjct: 159 RRKGVKIYTISVNSIGFIPEMLGIVGDADNVFGPNDEERIEE 200 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 69/209 (33%), Gaps = 39/209 (18%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +SS +D++ V+D S S+ + D L +IK + + R G Sbjct: 721 TLSSAVQCPMDILFVVDSSGSITHTYDTQKDYLT----------QLIKKVEPSRSH-RVG 769 Query: 220 LVTFSSKIVQTFPLAWGV-QHIQEKINRLIF------------GSTTKSTPGLEYAYNKI 266 L+ F+ +Q ++ + ++ + TT LE + + Sbjct: 770 LIQFAGPHIQKMEWSFDTHSKNSQLLSAIRSVRHLTGVVLHFQSGTTYIGAALELSLILL 829 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ E +I ++DG + + + L K +YAI + Sbjct: 830 DSRRKHTETT----------VILISDGFSQDDSTQQAKLLRQLPNVK-----MYAISLNK 874 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDAF 355 ++L + + + N + F Sbjct: 875 LTNTKYLTDIVGDRKNLFINNESTWFEEF 903 >gi|302536534|ref|ZP_07288876.1| VWA domain-containing protein [Streptomyces sp. C] gi|302445429|gb|EFL17245.1| VWA domain-containing protein [Streptomyces sp. C] Length = 532 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 62/196 (31%), Gaps = 27/196 (13%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR--SGLVTFSS------ 225 VLD S SM + D++G ++ ++ S R L+ F Sbjct: 349 VLDTSGSMEEG-----DRIGRLRSALTDLTGTGSSGTGQRFRDREEVTLLPFGDKVKKVL 403 Query: 226 -KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 +V+ + I+ + L T L+ AY + + Sbjct: 404 THVVEPGNPGPALDAIRGDVKSLRPEGGTAVYASLKAAYQHLGEGNADAFTS-------- 455 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFY 343 I+ +TDG++ D EA++R V+A+ + + R + Sbjct: 456 --IVLMTDGQSGDKVKDFDSFYAGLPEAQKR-TPVFAVLFGDSDRKELTHITELTGGRLF 512 Query: 344 SVQNS-RKLHDAFLRI 358 + L AF I Sbjct: 513 DATDGNSSLAGAFEEI 528 >gi|118096699|ref|XP_414253.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Gallus gallus] Length = 886 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 44/308 (14%), Positives = 107/308 (34%), Gaps = 38/308 (12%) Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF--AQDINNIERSTSLSIIIDDQHK 123 + G + + + F ++N+ + F ++ F D + S S++ D Sbjct: 190 EPQGISELEAEGTFITNDLQNVIKKSFSHKKGHISFKPTLDQQRTCENCSQSLLDGDFIV 249 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 Y++ + + + F ++ P + +++ ++D+S SM+ Sbjct: 250 KYDVKRTTPDNLQIVNGYFVHFFAPTNLP-------------KLPKNVIFIIDISGSMSG 296 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 ++ ++ ++LD IK D N + G K + ++ Sbjct: 297 ------REIEQTREALLKILDDIKE-DDHFNFILFGSDVHIWKETLIKATPENLDEARKF 349 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK---YIIFLTDGENSSPNI 300 + + T G+ + + A E G+ K+ II LTDG+ + Sbjct: 350 VRSIDTEGMTNLYGGIMKGIDMLNAAHE-------GNLVPKRSASIIIMLTDGQPNVGIS 402 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR------FYSVQNSRKLHDA 354 + ++ + +A +Y +G FL+ A ++ + ++ +L Sbjct: 403 NTQDIQTHVKKAIEGKYTLYNLGFGYGVDYNFLEKMALENKGLARRIYPDSDSALQLQGF 462 Query: 355 FLRIGKEM 362 + + M Sbjct: 463 YDEVSNPM 470 >gi|2707733|gb|AAD03350.1| microneme protein precursor Etmic-1 [Eimeria tenella] Length = 712 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 68/193 (35%), Gaps = 35/193 (18%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSD---IGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + A S L + S + LD+M+V+D S S G Sbjct: 15 LSLLVGLLAASFAFSSLQPGATTSSGQDQVCTSLLDVMLVVDESGS------IGTSNFRK 68 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---- 250 + I + ++ + P VR GL+TF+++ + W + + L Sbjct: 69 VRQFIEDFVNSMPISP---EDVRVGLITFATR----SKVRWNLSDPKATNPSLAISAARS 121 Query: 251 -----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 T + GL+ A ++D + + K ++ +TDG ++ P+ ++ Sbjct: 122 LSYSTGVTYTHYGLQDAKKLLYDTNAGARN------NVPKLVLVMTDGASNLPS----QT 171 Query: 306 LFYCNEAKRRGAI 318 + GAI Sbjct: 172 RSSAAALRDAGAI 184 >gi|281338501|gb|EFB14085.1| hypothetical protein PANDA_006133 [Ailuropoda melanoleuca] Length = 984 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 123 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 170 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 171 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 225 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 226 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 277 Query: 340 DRFYSV 345 +Y + Sbjct: 278 GYYYEI 283 >gi|73981989|ref|XP_852918.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Canis familiaris] Length = 1147 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 309 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 356 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 357 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 411 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 412 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 463 Query: 340 DRFYSV 345 +Y + Sbjct: 464 GYYYEI 469 >gi|161086896|ref|NP_001104313.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform a [Mus musculus] gi|46576352|sp|O08532|CA2D1_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|1905817|gb|AAB50138.1| voltage-gated calcium channel alpha2/delta subunit, alpha2a isoform [Mus musculus] gi|148671294|gb|EDL03241.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Mus musculus] Length = 1103 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|159037814|ref|YP_001537067.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157916649|gb|ABV98076.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 427 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 74/213 (34%), Gaps = 32/213 (15%) Query: 156 TSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 T + + ++ + +VLDVS SM G ++ VA ++ +++ ++PD Sbjct: 21 TGPAPALADGEAPVEPPKVELVLDVSGSMRATDIDGRSRISVAQQAFN---EVVDALPDE 77 Query: 213 NNV-VRSGLVTFSSK--------IVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + +R T+ + Q P+ + + L T L A Sbjct: 78 TQLGIRVLGATYPGENKERGCQDTQQIVPVGPVDRVQAKAAVATLRPTGFTPVGLALRSA 137 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + I+ +TDGE++ D E A+ +V + Sbjct: 138 AQDLGTGSTARR------------IVLITDGEDTCAPPDPCEVAREL-AAQGTKLVVDTL 184 Query: 323 GVQAEAA--DQFL-KNCASPDRFYSVQNSRKLH 352 G+ + Q L A+ + + Q++ +L Sbjct: 185 GLAPDEKVRRQLLCIAAATGGTYTAAQSADELT 217 >gi|145594605|ref|YP_001158902.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303942|gb|ABP54524.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 436 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 72/215 (33%), Gaps = 36/215 (16%) Query: 156 TSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 T V + + ++ + +VLDVS SM G ++ VA ++ +++ ++PD Sbjct: 30 TGPVPALADWETPVEPPKVELVLDVSGSMRATDIDGRSRISVAQQAFN---EVVDALPDE 86 Query: 213 NNV-VR-SGLVTFSSKIVQTFPLAW--------GVQHIQEKINRLIFGSTTKSTPGLEYA 262 + +R G Q + + L T L A Sbjct: 87 TELGIRVLGATYPGDDKEQGCQDTQQIVPVGPVDRVQAKAAVATLRPTGYTPVGLALRSA 146 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + I+ +TDGE++ D E A+ +V + Sbjct: 147 AEDLGTGSTARR------------IVLITDGEDTCAPPDPCEVAREL-AAQGTKLVVDTL 193 Query: 323 GVQAEAA--DQFLKNC---ASPDRFYSVQNSRKLH 352 G+ + Q L C A+ + + Q++ +L Sbjct: 194 GLAPDEKVRQQLL--CIAGATGGTYTAAQSADELT 226 >gi|261338458|ref|ZP_05966342.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] gi|270276443|gb|EFA22297.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] Length = 493 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 91/263 (34%), Gaps = 51/263 (19%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 S + ++S+ +++ D+++++DVS SM + VA Sbjct: 42 SSALMGNGDGSYSLTVSVSSTDMDTAQQQTESDVVVLMDVSGSM------TTTDMKVAKN 95 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKI---VQTFPLAWGVQH--IQEKINRLIFGST 252 ++ + + + + D N+ VR +V FSS+ + W + + +N L Sbjct: 96 AVNGLANQL--LNDENDTVRMSIVRFSSEAKTLEFSNGSEWTHSPALVAQALNTLTSRGN 153 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI------------ 300 T L+ A + +G K Y++ ++DG ++ N Sbjct: 154 TNWDGALQNASALV-----------QGDSARKSYVVLMSDGYPNTINSCYPAVANCTDTS 202 Query: 301 -----DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL--------KNCASPDRFYSVQN 347 +++ N +YA+ + A++ K P + + Sbjct: 203 WSEPNAVPKAIEAANTMPNT--QIYAVSTRTSASESMKELVDGINAKAPKYPAQIMYGTD 260 Query: 348 SRKLHDAFLRIGKEMVKQRILYN 370 + L++AF I + K+ Sbjct: 261 QQSLNNAFDTIADAIRKRFTDVT 283 >gi|126722583|ref|NP_001075745.1| voltage-dependent calcium channel subunit alpha-2/delta-1 preproprotein [Oryctolagus cuniculus] gi|116409|sp|P13806|CA2D1_RABIT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|164763|gb|AAA81562.1| dihydropryridine-sensitive calcium channel alpha-2 subunit [Oryctolagus cuniculus] Length = 1106 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 255 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 302 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 303 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 357 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 358 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 409 Query: 340 DRFYSV 345 +Y + Sbjct: 410 GYYYEI 415 >gi|241554201|ref|YP_002979414.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240863507|gb|ACS61169.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 706 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 77/212 (36%), Gaps = 24/212 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ PW ++ + I I+ + +++ ++DVS SM++ DKL Sbjct: 303 FKATVTVMPTPWNRDTELMHVAIKG-YDIAPATTPRANLVFLIDVSGSMDEP-----DKL 356 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFG 250 + + R M++ +K+ V+ +VT++ I I+RL G Sbjct: 357 PLLKSAFRLMVNRLKADDTVS------IVTYAGNAGTVLAPTRVAEKSKILSAIDRLEPG 410 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+E AY L D + + TDG+ + + + Sbjct: 411 GSTGGAEGIEAAY--------DLAKQGFVKDGVNRVM-LATDGDFNVGPSSDGDLKRIIE 461 Query: 311 EAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 E ++ G + +G D ++ A Sbjct: 462 EKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGN 493 >gi|197124353|ref|YP_002136304.1| von Willebrand factor A [Anaeromyxobacter sp. K] gi|196174202|gb|ACG75175.1| von Willebrand factor type A [Anaeromyxobacter sp. K] Length = 480 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 80/218 (36%), Gaps = 26/218 (11%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 A +H + + + + + ++ VLDVS SM+ KL AT+SI +++D Sbjct: 17 AKDAHLVVSLVAPHGNARAERSPVCVIPVLDVSGSMHGE------KLHFATQSIMKLVDH 70 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQ---TFPLAWGVQ-HIQEKINRLIFGSTTKSTPGLEY 261 + G+V FS+++ + + ++ + RL T GL Sbjct: 71 LAPGDFC------GVVVFSTEVETLAAPTEMTQDRKDALKVALGRLRPRHNTNLAGGLLA 124 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + K D +I TDG + + E L EA A V A Sbjct: 125 GL--------DHAKVTKVPDGMPVRVILFTDGLANEGPATSPEGLCALLEANLGTASVSA 176 Query: 322 IGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLR 357 G +A + L+ ++ + V++ AF R Sbjct: 177 FGYGDDADQELLRELSTLGRGNYAYVRSPEDALTAFAR 214 >gi|188580652|ref|YP_001924097.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium populi BJ001] gi|179344150|gb|ACB79562.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium populi BJ001] Length = 723 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 75/234 (32%), Gaps = 34/234 (14%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P + + L+T + + D++ V+D S SM + A Sbjct: 303 PAVGLFRERVVGAETVLALVTPPEGAAPAVALPRDVVFVIDNSGSMGGA------SIRQA 356 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-----LAWGVQHIQEKINRLIFG 250 S+ LD + R ++ F FP + + ++ L Sbjct: 357 KASLLIGLDRL------RPGDRFNVIRFDHSFDTLFPDVVPADESHLARAKRFVSGLEAS 410 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T+ L A E + + I+FLTDG I N+ +F Sbjct: 411 GGTEMLAPLRAALADATP--EDTARL--------RQIVFLTDG-----AIGNEAQIFSAI 455 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 A+R + ++ +G+ + + + A F + ++ + + ++ Sbjct: 456 AAERGRSRLFMVGIGSAPNGYLMSHAAELGRGSFTQIDTPDQVSERMRALLTKL 509 >gi|153806292|ref|ZP_01958960.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] gi|149130969|gb|EDM22175.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] Length = 342 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 62/196 (31%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 IF + P + + K G+++++ LD+S SM +L A Sbjct: 61 IIFVVIGLFSVLLARPQFGSKQETVKRK---GVEVIIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R I ++D + + + G++ F+ P+ + + + +K Sbjct: 117 RLISRLVDELDN-------DKIGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + I+ +TDGEN + +G Sbjct: 170 TAIGEAIN-------LAVRSFTPQEGVGRAIVVITDGENHEGGAVEAAKVAA-----EKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVSVLGVGMPDGAPI 233 >gi|110626529|gb|ABG79013.1| TadG [Yersinia ruckeri] Length = 478 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 53/297 (17%), Positives = 91/297 (30%), Gaps = 34/297 (11%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N KG I I I LP ++ L+ + + K KL L+ L Sbjct: 38 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALA---------- 87 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + N + + L NNI + + + Sbjct: 88 ---LTAENNAKNDTRNNELISAYINFYLGHRHQLTQYNNITVNYQQNPDRLYHTQLSQYH 144 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + E P +F + H +I S +D++ V D S SM F Sbjct: 145 IDANIEQPTLFPFTSLLID--HDNFIIGGSAAAIKDV-PAMDVVFVTDFSGSMEGDFHNP 201 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 D ++ + + K D+ + ++FS P +WG + K L Sbjct: 202 DDPEVLSKLDELKRI-FFKIADDIYTANKDSTISFS-------PFSWGTKSADNKKCSLH 253 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 F + NKI+ +KY+I +T+ + I+N + Sbjct: 254 F---------MPKEKNKIYPIPSNEIERNTEAHQ-EKYMIAITENIDYLATIENIGT 300 >gi|260856317|ref|YP_003230208.1| hypothetical protein ECO26_3261 [Escherichia coli O26:H11 str. 11368] gi|260868996|ref|YP_003235398.1| hypothetical protein ECO111_3021 [Escherichia coli O111:H- str. 11128] gi|300903656|ref|ZP_07121573.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|301303269|ref|ZP_07209394.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|257754966|dbj|BAI26468.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257765352|dbj|BAI36847.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|300404332|gb|EFJ87870.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|300841443|gb|EFK69203.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|315255206|gb|EFU35174.1| von Willebrand factor type A domain protein [Escherichia coli MS 85-1] gi|323156423|gb|EFZ42578.1| von Willebrand factor type A domain protein [Escherichia coli EPECa14] gi|323176802|gb|EFZ62392.1| von Willebrand factor type A domain protein [Escherichia coli 1180] Length = 584 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 47/337 (13%), Positives = 104/337 (30%), Gaps = 46/337 (13%) Query: 39 SHKFFVKAKLHYILDHS--LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 + ++ K L L + + A + + N G + F +K + Q Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYKQFDDNPVKQVAQNPLATFS 134 Query: 97 RENGFAQDINN--------IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS 148 + N + ++ + + Y+ + +P P+ Sbjct: 135 LDVDTGSYANVRRFLNHGLLPPPDAVRVEEIVNYFPYDWDIKDKQSIPATK-PIPFAMRY 193 Query: 149 SHAPLLITSSVKI----------SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 AP + + S+ +++ ++D S SM ++L + S Sbjct: 194 ELAPAPWNEQLTLLKIDILAKDHKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSS 248 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKST 256 ++ ++ ++ + +VT++ P G I I+ L +T Sbjct: 249 LKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGG 302 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 GLE AY + KG + I+ TDG+ + D K + + G Sbjct: 303 AGLELAYQQAAKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESG 353 Query: 317 AIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRK 350 + GV + + + A + + + Sbjct: 354 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 390 >gi|156257452|gb|ABU63134.1| microneme 1 precursor [Eimeria tenella] Length = 675 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 61/159 (38%), Gaps = 32/159 (20%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+M+V+D S S G + I + ++ + P VR GL+TF+++ Sbjct: 13 LDVMLVVDESGS------IGTSNFRKVRQFIEDFVNSMPISP---EDVRVGLITFATR-- 61 Query: 229 QTFPLAWGVQHIQEKINRLIFG---------STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + W + + L T + GL+ A ++D + Sbjct: 62 --SKVRWNLSDPKATNPSLAISAARSLSYSTGVTYTHYGLQDAKKLLYDTNAGARN---- 115 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + K ++ +TDG ++ P+ ++ + GAI Sbjct: 116 --NVPKLVLVMTDGASNLPS----QTRSSAAALRDAGAI 148 >gi|114684811|ref|XP_531503.2| PREDICTED: collagen, type VI, alpha 1 isoform 3 [Pan troglodytes] Length = 997 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 66/201 (32%), Gaps = 36/201 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D +D+ VLD S S+ P + + +D ++ Sbjct: 33 DCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLR-----------------D 75 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + + + + FG T + ++ ++ H K Sbjct: 76 RYYRC-----DRNLVVDAVKY--FGKGTYTDCAIKKGLEQLLVGG--------SHLKENK 120 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-- 342 Y+I +TDG + L NEAK G V+++ + + + L A+ + Sbjct: 121 YLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRR 180 Query: 343 -YSVQNSRKLHDAFLRIGKEM 362 ++ + + DA I + + Sbjct: 181 NFTAADWGQSRDAEEAISQTI 201 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 21/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H + A R L ++ P + VR +V +S Q Sbjct: 798 DITILLDGSASVGSHNFDTTKRF--AKRLAERFLTAGRTDPAHD--VRVAVVQYSGTGQQ 853 Query: 230 TFP---LAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + ++ + F T L Y +A Sbjct: 854 RPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYVTRFYREASSGAA--------- 904 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG +S EA+R G ++ + V Sbjct: 905 KKRLLLFSDG--NSQGATPAAIEKAVQEAQRAGIEIFVVVVG 944 >gi|309355882|emb|CAP38139.2| hypothetical protein CBG_21291 [Caenorhabditis briggsae AF16] Length = 430 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 73/202 (36%), Gaps = 34/202 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI-----REMLDIIKSIPDVNNVVRSG 219 + LD+++VLD S + + + D + + + + VR Sbjct: 239 TGCELDLVLVLDFSTTTDPVYNSYKDLSKRLVQQLKIGPHYTQVAAVTFATVGRTRVRFN 298 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L +S++ + + I++L G TT G+E A +I +++ IA Sbjct: 299 LKKYSTQ-----------EEVLRGIDKLQSKGGTTAIGAGIEKALTQIDESEGARPGIAT 347 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA------ADQF 332 K +I TDG ++ K + +A G +Y + A A ++ Sbjct: 348 ------KVMIVFTDGWSNKGPDPEKRAR----DAVNAGFEMYTVAYTARAPGSVTLNNET 397 Query: 333 LKNCA-SPDRFYSVQNSRKLHD 353 L + S ++ + L D Sbjct: 398 LSAISGSSGHAFTDVTFQTLVD 419 >gi|162454787|ref|YP_001617154.1| hypothetical protein sce6505 [Sorangium cellulosum 'So ce 56'] gi|161165369|emb|CAN96674.1| putative membrane protein [Sorangium cellulosum 'So ce 56'] Length = 384 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 60/162 (37%), Gaps = 28/162 (17%) Query: 169 LDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+++VLD S SM P R R +++ + I D+ R G V F+ + Sbjct: 89 VDVVVVLDYSKSMYARDVEPS--------RIFRAKVEVARLIKDL-EGARFGAVAFAGEP 139 Query: 228 VQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + FPL I + +L T L+ A + + EH Sbjct: 140 MG-FPLTADGAAIAQFFRQLDPNDMPIGGTAIARALDQANELLKRDPKSAEH-------- 190 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 K+ I+ +TDGE+ L +G ++ + + Sbjct: 191 KRIILLVTDGEDLEG-----YPLSVAQAIGAQGTTIHVVQIG 227 >gi|126340390|ref|XP_001364302.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Monodelphis domestica] Length = 1092 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 65/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 254 DMLILVDVSGSVSGL------TLRLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 301 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 302 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 356 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + K C + Sbjct: 357 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRDPVKWKACRNC 408 Query: 340 DRFYSV 345 +Y + Sbjct: 409 GYYYEI 414 >gi|3929911|dbj|BAA34707.1| complement factor B/C2B [Cyprinus carpio] Length = 833 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 80/224 (35%), Gaps = 35/224 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ L++ ++LD S S++ +A + +++ + S +R G+++++ Sbjct: 336 AEGRLNVFILLDTSASISPE------SFHLAKNATIQLVQKLDSYE---VTMRFGIISYA 386 Query: 225 SKIVQTFPLAWGVQHIQEKI-NRLIFGS--------TTKSTPGLEYAYNKIFDAKEKLEH 275 S+ + + + + +L S T L Y ++ +E Sbjct: 387 SEAKEIVSITNDLSQDVHYVMRKLHEFSDKSHGNKRGTNLHDALNKVYEELALLRENKRS 446 Query: 276 IAKGHDDYKKYIIFLTDGE-NSSPNIDNK--------ESLFYCNEAKRRGAIVYAIGVQA 326 ++ + II TDG N P+ N + K VY V Sbjct: 447 HF---NETQNVIIIATDGYSNMGPSPINILPKIRNLFGYKSSVDHTKEELLDVYVFAVGQ 503 Query: 327 EAADQFLKNCAS----PDRFYSVQNSRKLHDAF-LRIGKEMVKQ 365 + Q L++ AS + +++ R+L F I V + Sbjct: 504 QVNKQELQSIASIKKDERHVFVLKDYRQLGLVFNQMISDSAVTK 547 >gi|307323133|ref|ZP_07602343.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890622|gb|EFN21598.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 422 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 70/198 (35%), Gaps = 37/198 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS----- 225 + +VLDVS SM G ++ A ++ E+LD + +V +R+ + Sbjct: 40 VELVLDVSGSMRARDVDGDTRMAAAKQAFNEVLDA--TPEEVRLGIRTLGANYPGKDRVA 97 Query: 226 ---KIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 Q +P+ + + L T L A + + Sbjct: 98 GCRDSEQLYPVGQVDRTEAKAAVATLRPTGWTPIGLALRGASKDLSSGEGTRR------- 150 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-----VYAIGVQAEAADQFLKNC 336 I+ +TDGE+S D C+ A+ A V +G+ + + +C Sbjct: 151 -----IVLITDGEDSCGQPDP------CDVARELAAQGTHLVVDTLGLTLDRKVREQLSC 199 Query: 337 ---ASPDRFYSVQNSRKL 351 A+ + ++Q+ +L Sbjct: 200 IAEATGGTYTAIQHRDQL 217 >gi|183219595|ref|YP_001837591.1| hypothetical protein LEPBI_I0170 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167778017|gb|ABZ96315.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 368 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 81/236 (34%), Gaps = 35/236 (14%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 ++ + IF P H LL S + + +LD S SM++ + G Sbjct: 4 GKFPLKIIFLLIPVVLFFLHLSLL------PQSNHNKRY--VFILDASGSMSEKWD-GKT 54 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-------GVQHIQEK 243 ++ VA +++ ++ +P + GLV + ++I G + +K Sbjct: 55 RMAVAKE---KLIQVLGGLPKDAS---VGLVAYGNRIAGCQSARLYHPIQKGGASIVSQK 108 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + ++ +T L+ + + + E IIF++DG S Sbjct: 109 LTTIVPAGSTPIAQTLQVVGEYLLSDQLETE------------IIFISDGVESCEGDPKS 156 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQFLK-NCASPDRFYSVQNSRKLHDAFLRI 358 K+ + I + + + + + ++ ++ +F RI Sbjct: 157 VLYNLRQSGKKFRLQILGIDIDPKGEEDLKRLSILGDGNYFPLKTPEDYDRSFQRI 212 >gi|189909734|ref|YP_001961289.1| hypothetical protein LBF_0165 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167774410|gb|ABZ92711.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] Length = 373 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 81/236 (34%), Gaps = 35/236 (14%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 ++ + IF P H LL S + + +LD S SM++ + G Sbjct: 9 GKFPLKIIFLLIPVVLFFLHLSLL------PQSNHNKRY--VFILDASGSMSEKWD-GKT 59 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-------GVQHIQEK 243 ++ VA +++ ++ +P + GLV + ++I G + +K Sbjct: 60 RMAVAKE---KLIQVLGGLPKDAS---VGLVAYGNRIAGCQSARLYHPIQKGGASIVSQK 113 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + ++ +T L+ + + + E IIF++DG S Sbjct: 114 LTTIVPAGSTPIAQTLQVVGEYLLSDQLETE------------IIFISDGVESCEGDPKS 161 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQFLK-NCASPDRFYSVQNSRKLHDAFLRI 358 K+ + I + + + + + ++ ++ +F RI Sbjct: 162 VLYNLRQSGKKFRLQILGIDIDPKGEEDLKRLSILGDGNYFPLKTPEDYDRSFQRI 217 >gi|161086902|ref|NP_001104316.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform d [Mus musculus] gi|1905823|gb|AAB50141.1| voltage-gated calcium channel alpha2/delta subunit, alpha2d isoform [Mus musculus] gi|148671298|gb|EDL03245.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_e [Mus musculus] Length = 1079 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|85710455|ref|ZP_01041519.1| putative secreted protein [Erythrobacter sp. NAP1] gi|85687633|gb|EAQ27638.1| putative secreted protein [Erythrobacter sp. NAP1] Length = 576 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 74/217 (34%), Gaps = 31/217 (14%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 I +++ ++DVS SM DKL + ++ + ++ R + Sbjct: 211 IERDERPPANLVFLMDVSGSMG-----SPDKLPLVQTALSGL------AGELGEQDRVSI 259 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V ++ I+ + L G +T G++ AYN D + +G Sbjct: 260 VVYAGAAGLVLEPTNDTAKIRAALMSLSAGGSTAGGAGIQLAYNIAED------NFIEGG 313 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA-- 337 + +I TDG+ + D + + + RG + +G + ++ A Sbjct: 314 VNR---VILATDGDFNVGVSDRDALVEMVEKNRDRGITLTTLGFGTGNFNEAMMEQIANK 370 Query: 338 SPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 + + ++ ++ I K++ Q Sbjct: 371 GNGNYAYIDSALEAKKVLSDEMSSTLFTIAKDVKIQV 407 >gi|163758683|ref|ZP_02165770.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283973|gb|EDQ34257.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 587 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 71/203 (34%), Gaps = 42/203 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------- 223 +M+VLD S SM G K+ +A + +++ + V GL+ + Sbjct: 36 VMIVLDGSNSM-WGQVDGEAKITIAKDVMTDLISNWD------DAVDLGLMVYGHRRKGD 88 Query: 224 -SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 S V P + +K+ + T + L A + K Sbjct: 89 CSDIEVVALPGKVNRPALIDKVQSISPRGKTPISKTLLLAATSVGYFSGKSS-------- 140 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-----VYAIGVQA-EAADQFLKNC 336 ++ ++DG + C +AK G I V+ IG E + L+ Sbjct: 141 ----VVLVSDGLETCDAD-------PCAQAKALGIINPGFDVHVIGFDVTEEEFKSLQCI 189 Query: 337 AS--PDRFYSVQNSRKLHDAFLR 357 A+ +F+ N+ +L DA R Sbjct: 190 ATETGGKFFRANNAEELKDALRR 212 >gi|126304011|ref|XP_001381695.1| PREDICTED: similar to tumor endothelial marker 8 [Monodelphis domestica] Length = 564 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 43/199 (21%), Positives = 72/199 (36%), Gaps = 25/199 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+ +LD S S+ H+ E L P + R + FS++ Sbjct: 41 GGFDLYFILDKSGSVLHHWNE--------IYYFVEHLAHKFISPQL----RMSFIVFSTR 88 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + I++ ++ +++ G T G + A +I+ + A Sbjct: 89 GSILMRLTEDREQIRQGLEELQKVLPGGDTYMHEGFQRASEQIYYENMQGYRTAS----- 143 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 II LTDGE E N ++ GA VY +GV+ Q + S D + Sbjct: 144 --VIIALTDGELHEDLFFYAE--REANRSRELGATVYCVGVKDFNETQLARIADSKDHVF 199 Query: 344 SVQNS-RKLHDAFLRIGKE 361 V + + L I K+ Sbjct: 200 PVNDGFQALQGIIDSILKK 218 >gi|326922361|ref|XP_003207417.1| PREDICTED: collagen alpha-3(VI) chain-like [Meleagris gallopavo] Length = 3135 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 77/206 (37%), Gaps = 18/206 (8%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 LL++ + ++ + + V D+ ++ + G + + + D++K++ Sbjct: 13 LGLLLSGFCSVGAQQQAAVRNVAVADIIFLVDSSWNIGKEHFQLVREFLY---DVVKALD 69 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIF 267 N R LV FS F L Q + I + G +K+ GLEY + Sbjct: 70 VGGNDFRFALVQFSGNPHTEFQLNTYPSNQDVLSHIANMPYMGGGSKTGKGLEY----LI 125 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + ++ + + II LTDG++ L K + A+GVQ Sbjct: 126 ENHLTKAAGSRASEGVPQVIIVLTDGQSQDDVALPSSVL------KSAHVNMIAVGVQDA 179 Query: 328 AADQFLKNCASP--DRFYSVQNSRKL 351 + + + P ++++N L Sbjct: 180 VEGELKEIASRPFDTHLFNIENFTAL 205 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 86/234 (36%), Gaps = 19/234 (8%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + F+ P + + S D++ +LD SL++ + P + Sbjct: 607 FRLQFMQAILPEVLSPIRTLSGGMIIHETPSVQVTKRDIIFLLDGSLNVGNANFPFVRD- 665 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIFG 250 + +++ + D +R GLV FS F L ++ + I +++ +L Sbjct: 666 -----FVVTLVNNLDVGTDK---IRVGLVQFSDTPKTEFSLYSYQTKSDIIQRLGQLRPK 717 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 + G A N + ++ ++ + ++ +T G ++ P L N Sbjct: 718 GGSVLNTG--SALNFVLSNHFTEAGGSRINEQVPQVLVLVTAGRSADP------FLQVSN 769 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + R G + +A+GV+ + + +P Y V + L + K + Sbjct: 770 DLARAGVLTFAVGVRNADKAELEQIAFNPRMVYFVDDFSGLTALPQELNKPITT 823 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 45/329 (13%), Positives = 108/329 (32%), Gaps = 50/329 (15%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN-------------- 99 +++L ++ + + D+ +KN + + + + Sbjct: 1501 NAVLQAIRRLRLRGGYPVNAGKALDYV---VKNYFIKSAGSRIEDGVPQHLVVILGDQSQ 1557 Query: 100 GFAQDINNIERSTSLS-------------IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCA 146 N+ STS+ + + L +P + Sbjct: 1558 DDVNRPANVISSTSIQPLGVGARNVDRNQLQVITNDPGRVLVVQDFTGLPTLERRVQNIL 1617 Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 P S D++ +LD S+++ D + + ++D I Sbjct: 1618 EELPVP-TTESPGYPGPGGKKQADIVFLLDGSINLGR------DNFQEVLQFVYSIVDAI 1670 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 D ++ ++ GL ++S + F L I + IN++I+ + G A Sbjct: 1671 --YEDGDS-IQVGLAQYNSDVTDEFFLKDYSTKPQILDAINKVIYKGGRVANTG--AAIR 1725 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + E ++ + +T G++S D + + E ++G V+A+GV Sbjct: 1726 HLQAKHFVKEAGSRIDQRVPQIAFIITGGKSSD---DGQGASM---EVAQKGVKVFAVGV 1779 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + ++ K + + V +++L + Sbjct: 1780 RNIDLEEVSKLASESATSFRVSTAQELSE 1808 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 62/194 (31%), Gaps = 21/194 (10%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 S + D++ ++D S F D + ++ D R Sbjct: 1229 TSPSPAGAKRDIVFLVDGSRYAAQEFYLIRDLIERIVNNLDVGFDTT----------RVS 1278 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +V FS F L +Q + RL + A + + Sbjct: 1279 VVQFSEHPHVEFLLNAHSTKDEVQGAVRRLRPRGG--QQVNMGEALEFVAKTIFTRPSGS 1336 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + + ++++ L+ S + D+ E + K+ G I + + ++ Sbjct: 1337 RIEEGVPQFLVILS----SRKSDDDLEFPSV--QVKQVGVAPLVIAKNMDPE-EMVQISL 1389 Query: 338 SPDRFYSVQNSRKL 351 SPD + V + ++L Sbjct: 1390 SPDYVFQVSSFQEL 1403 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 43/337 (12%), Positives = 99/337 (29%), Gaps = 38/337 (11%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGN---NGKKQKNDFSYRIIKNIWQTDFRNELRE 98 +K + +L ++ +E G+ G Q N L+E Sbjct: 911 MRLKTGKQLNIGVALDEAVRRLFVKEAGSRIEEGIPQFLVLLAAGKSNDEVERPSGALKE 970 Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 G + + + + L+ S + + + + Sbjct: 971 AGVVTFAIKAKNADLSELERIAYAPQFILNVESLPRISELQANIVNLLKTIQFQPTVVER 1030 Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + D++ ++D S + F + + +++ + + VR Sbjct: 1031 GEKK-------DVVFLIDGSDGVRRGFP-------LLKTFVERVVESLDI---GRDKVRV 1073 Query: 219 GLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTT--KSTPGLEYAYNKIFDAKEKLE 274 +V +S+ I F L + I L + + L+Y +F + Sbjct: 1074 AIVQYSNVIQPEFLLDAYEDKADLVSAIQALTIMGGSPLNTGAALDYLIKNVF----TVS 1129 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ + +++I LT D + K G + + IG+ + Sbjct: 1130 SGSRIAEGVPQFLILLT---ADRSQDDVRRPSVV---LKTSGTVPFGIGIGNADLTELQT 1183 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 PD SV + +L + + + + I K Sbjct: 1184 ISFLPDFAISVPDFSQLD----SVQQAVSNRVIRLTK 1216 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 58/181 (32%), Gaps = 14/181 (7%) Query: 123 KDYNLSAVSRYEMPFI--FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 DY S E+P + P S+ + V + S D++ ++D S S Sbjct: 1391 PDYVFQVSSFQELPSLEQKLLAPIETLSADQIRQLLGDVTVPDVSGEEKDVVFLIDSSDS 1450 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 + D L I ++ + P+ VR G+V FS+ + F L Sbjct: 1451 V------RTDGLAHIRDFISRIVQQLDVGPNK---VRIGVVQFSNNVFPEFFLKTHKSKN 1501 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + + I RL G Y + + K + ++ L D Sbjct: 1502 AVLQAIRRLRLRGGYPVNAGKALDY-VVKNYFIKSAGSRIEDGVPQHLVVILGDQSQDDV 1560 Query: 299 N 299 N Sbjct: 1561 N 1561 >gi|325116955|emb|CBZ52508.1| unnamed protein product [Neospora caninum Liverpool] Length = 765 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 87/229 (37%), Gaps = 30/229 (13%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 ++ + ++ + LD+ ++D S S+ + + + A + Sbjct: 54 SAITDLMKSGGTIGAAEGCTSQLDICFLVDSSGSIGEAHYEEVKQFLHA---------FL 104 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIF-GSTTKSTPGLEY 261 +P N+ V + LV FS+ + + L A + + + + + G TT + GL+ Sbjct: 105 SKLPIGNDEVNTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGTTNTAAGLQT 164 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG--AIV 319 +FD + K +I +TDGE + + ++ + RG V Sbjct: 165 CNQMLFDYPREER------QTVPKLVIAMTDGE----SDSDFHTVNEAKVIRERGGIITV 214 Query: 320 YAIGVQAEAAD-QFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++G+ + + + C +SP Y +L + I KE+ K Sbjct: 215 LSVGMYVNHNECRSMCGCRNDSSPCPLYLQTEWSQLLPSISPILKEVCK 263 >gi|6017001|gb|AAF01565.1|AF061273_1 thrombospondin-related adhesive protein homolog [Neospora caninum] Length = 765 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 87/229 (37%), Gaps = 30/229 (13%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 ++ + ++ + LD+ ++D S S+ + + + A + Sbjct: 54 SAITDLMKSGGTIGAAEGCTSQLDICFLVDSSGSIGEAHYEEVKQFLHA---------FL 104 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIF-GSTTKSTPGLEY 261 +P N+ V + LV FS+ + + L A + + + + + G TT + GL+ Sbjct: 105 SKLPIGNDEVNTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGTTNTAAGLQT 164 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG--AIV 319 +FD + K +I +TDGE + + ++ + RG V Sbjct: 165 CNQMLFDYPREER------QTVPKLVIAMTDGE----SDSDFHTVNEAKVIRERGGIITV 214 Query: 320 YAIGVQAEAAD-QFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++G+ + + + C +SP Y +L + I KE+ K Sbjct: 215 LSVGMYVNHNECRSMCGCRNDSSPCPLYLQTEWSQLLPSISPILKEVCK 263 >gi|301753369|ref|XP_002912539.1| PREDICTED: anthrax toxin receptor 2-like [Ailuropoda melanoleuca] Length = 611 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 71/213 (33%), Gaps = 29/213 (13%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S + D+ VLD S S+ +++ + + T V+ + Sbjct: 154 VSAQEQPSCRGAFDLYFVLDKSGSVANNWIEIYNFVQQLTERF------------VSPQM 201 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FSS+ PL I E + R+ T GL+ A +I A Sbjct: 202 RLSFIVFSSQATIILPLTGDRSKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGFK 261 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 II LTDG+ + + ++ GA VY +GV Q Sbjct: 262 ASS---------IIIALTDGKLDG--LVPSYAEKEAKISRSFGARVYCVGVLDFEQAQLE 310 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + S D+ + V+ A I ++ + Sbjct: 311 RIADSKDQVFPVKGG---FQALKGIINSILDRS 340 >gi|281346820|gb|EFB22404.1| hypothetical protein PANDA_000280 [Ailuropoda melanoleuca] Length = 482 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 71/213 (33%), Gaps = 29/213 (13%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S + D+ VLD S S+ +++ + + T V+ + Sbjct: 24 VSAQEQPSCRGAFDLYFVLDKSGSVANNWIEIYNFVQQLTERF------------VSPQM 71 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FSS+ PL I E + R+ T GL+ A +I A Sbjct: 72 RLSFIVFSSQATIILPLTGDRSKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGFK 131 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 II LTDG+ + + ++ GA VY +GV Q Sbjct: 132 ASS---------IIIALTDGKLDG--LVPSYAEKEAKISRSFGARVYCVGVLDFEQAQLE 180 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + S D+ + V+ A I ++ + Sbjct: 181 RIADSKDQVFPVKGG---FQALKGIINSILDRS 210 >gi|161086906|ref|NP_001104318.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Rattus norvegicus] gi|27450704|gb|AAO14652.1|AF486276_1 calcium channel alpha-2 delta-1 subunit isoform e [Rattus norvegicus] Length = 1084 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|74011920|ref|XP_548489.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Canis familiaris] Length = 897 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 108/301 (35%), Gaps = 26/301 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 ++K + ++ H + I + + + + + ++ + F + F Sbjct: 175 MYLKVQPKQLVKHFEIEA--DIYEPQGISTLDAEASFITNDLLGSALTKSFSGKKGRVSF 232 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ ++ + + + D+ ++ E P AP + K Sbjct: 233 KPSLD--QQRSCPTCTDSLLNGDFIITYDVNRESPANVQIVNGYFVHFFAPQGLPVVPK- 289 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG-L 220 +++ V+DVS SM+ K+ ++ ++L +K +N ++ SG + Sbjct: 290 --------NVVFVIDVSGSMHG------RKMEQTKDALLKILGDMKGEDYLNFILFSGDV 335 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +T+ +VQ P ++ + + + T GL + + A+E+ + Sbjct: 336 ITWKDDLVQATPE--NIEEARIFVKNIHDRGLTNINDGLLRGISMLNRAREEH----RVP 389 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + II LTDG+ + ++ A +Y +G FL++ A + Sbjct: 390 ERSTSIIIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALEN 449 Query: 341 R 341 Sbjct: 450 H 450 >gi|167951278|ref|ZP_02538352.1| von Willebrand factor, type A [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 269 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 77/195 (39%), Gaps = 24/195 (12%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 ++DVS SM+ DKL + RS+R + + + V+ VV +G +V Sbjct: 88 LVDVSGSMH-----SPDKLPLLKRSLRLLSRSLDADDRVSLVVYAGASG----VVLEPTP 138 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 I++ + +L G +T G+ AY K A+E +I TDG Sbjct: 139 GNKRATIEQALQQLSAGGSTNGGAGIRLAYAK---AREAFIEGGINR------VILATDG 189 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRK 350 + + ++++ + + ++ G + +G D ++ A + + + Sbjct: 190 DFNVGTVNHQALIDLIKQQRQAGIALTTLGFGGGNYNDHLMEQLADQGDGNYAYIDS--- 246 Query: 351 LHDAFLRIGKEMVKQ 365 L +A +G+ Q Sbjct: 247 LMEARQGVGERAGCQ 261 >gi|161086904|ref|NP_001104317.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 3 [Rattus norvegicus] gi|17864880|gb|AAL47093.1|AF400662_1 L-type calcium channel alpha2/delta subunit [Rattus norvegicus] Length = 1079 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|6753234|ref|NP_033914.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform e [Mus musculus] gi|1905825|gb|AAB50142.1| voltage-gated calcium channel alpha2/delta subunit, alpha2e isoform [Mus musculus] gi|148671296|gb|EDL03243.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Mus musculus] Length = 1084 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|161086900|ref|NP_001104315.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform c [Mus musculus] gi|1905821|gb|AAB50140.1| voltage-gated calcium channel alpha2/delta subunit, alpha2c isoform [Mus musculus] gi|109732367|gb|AAI15872.1| Cacna2d1 protein [Mus musculus] gi|148671297|gb|EDL03244.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Mus musculus] Length = 1086 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|260837103|ref|XP_002613545.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] gi|229298930|gb|EEN69554.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] Length = 184 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 63/175 (36%), Gaps = 24/175 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 L+ ++ + +D++ +LD S S+ D V + + PD Sbjct: 31 LVNGTICTEVICRMPVDLVFLLDGSGSIGDS------NFQVTKNFVATTTSDFQIGPDNA 84 Query: 214 NVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 +V + S + FPL ++ + IN + + G T++ ++Y Sbjct: 85 Q---VSVVQYESSPTEEFPLDRYATLEDLLSAINLIPYRGGGTRTGRAIDYVVTTTLTVS 141 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 K II +TDG++ + AK+ G I+ AIGV Sbjct: 142 RGAR------QGVPKVIIVVTDGQSGD------DVREPARRAKQSGIIMVAIGVG 184 >gi|301764709|ref|XP_002917776.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Ailuropoda melanoleuca] Length = 1091 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|223670958|dbj|BAH22726.1| complement factor B precursor [Nematostella vectensis] Length = 708 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 71/212 (33%), Gaps = 31/212 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S GLD++ V D S S+ + D + R ++D I + R ++ Sbjct: 246 SSGAAGLDLVFVFDSSASVGE------DNFRKGIQFARTIIDEF-GISATPSGTRVAVIV 298 Query: 223 FSSKIVQTFPLAWGV-----QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 FS F L + ++ N G T + L+ + + Sbjct: 299 FSDAAQVIFNLKSNRIVDKEEAVRRLENLQFQGGGTATKLALQAVIDTVNPELRNNS--- 355 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG--AIVYAIGVQAEAADQFLKN 335 KK + +TDG+++ ++ + R G ++AIGV LK+ Sbjct: 356 ------KKALFLITDGKSNKGGSPDRPAKVL-----RAGFNFEIFAIGVSDSVDKDELKS 404 Query: 336 CASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 AS Y +++ L I + Sbjct: 405 IASEPFRTHVYQIKDYATLVKLKELITTKGTD 436 >gi|255550407|ref|XP_002516254.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] gi|223544740|gb|EEF46256.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] Length = 755 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 74/218 (33%), Gaps = 48/218 (22%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D+S SM G ++ + A L+ S ++ F+ + Sbjct: 327 IVFIVDISGSME---GKPLEGMKNAMSGALAKLNPKDSFN---------IIAFNGETYLF 374 Query: 231 FPL-----AWGVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L V+ E +N I G T + L A + + + L Sbjct: 375 SSLMELATEKTVERAVEWMNLNFIAGGGTNISVPLNQAMEMVSNTQGSLP---------- 424 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKR----RGA---IVYAIGVQAEAADQFLKNCA 337 I +TDG ++ C+ K+ +GA +Y G+ FL+ A Sbjct: 425 -VIFLVTDG-------AVEDERHICDSMKKYVRGKGAICPRIYTFGIGTYCNHYFLRMLA 476 Query: 338 S--PDRF---YSVQNSRKLHDAFLRIGKEMVKQRILYN 370 + ++ Y V + + + F G V ++ + Sbjct: 477 TVCRGQYDAAYDVDSVQARMEIFFSRGLSAVLANVMID 514 >gi|148689378|gb|EDL21325.1| procollagen, type VII, alpha 1 [Mus musculus] Length = 2944 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 69/202 (34%), Gaps = 27/202 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 36 YAADIVFLLDGSSSIGRS------NFREVRGFLEGLVLPFSGAASA-QGVRFATVQYSDD 88 Query: 227 IVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L G I+ G T++ L + +++F L + + Sbjct: 89 PQTEFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSDRVF-----LPRLTRPGVP- 142 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + K +G ++A+G++ A + LK AS D Sbjct: 143 -KVCILITDGKSQDLVD------TAAQKLKGQGVKLFAVGIK-NADPEELKRVASQPTSD 194 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 F+ V + L I + + Sbjct: 195 FFFFVNDFSILRTLLPLISRRV 216 >gi|115647999|ref|NP_031764.2| collagen alpha-1(VII) chain precursor [Mus musculus] gi|143955303|sp|Q63870|CO7A1_MOUSE RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor Length = 2944 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 69/202 (34%), Gaps = 27/202 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 36 YAADIVFLLDGSSSIGRS------NFREVRGFLEGLVLPFSGAASA-QGVRFATVQYSDD 88 Query: 227 IVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L G I+ G T++ L + +++F L + + Sbjct: 89 PQTEFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSDRVF-----LPRLTRPGVP- 142 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + K +G ++A+G++ A + LK AS D Sbjct: 143 -KVCILITDGKSQDLVD------TAAQKLKGQGVKLFAVGIK-NADPEELKRVASQPTSD 194 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 F+ V + L I + + Sbjct: 195 FFFFVNDFSILRTLLPLISRRV 216 >gi|2326168|gb|AAB66593.1| type VII collagen [Mus musculus] Length = 2944 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 69/202 (34%), Gaps = 27/202 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 36 YAADIVFLLDGSSSIGRS------NFREVRGFLEGLVLPFSGAASA-QGVRFATVQYSDD 88 Query: 227 IVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L G I+ G T++ L + +++F L + + Sbjct: 89 PQTEFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSDRVF-----LPRLTRPGVP- 142 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + K +G ++A+G++ A + LK AS D Sbjct: 143 -KVCILITDGKSQDLVD------TAAQKLKGQGVKLFAVGIK-NADPEELKRVASQPTSD 194 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 F+ V + L I + + Sbjct: 195 FFFFVNDFSILRTLLPLISRRV 216 >gi|126727880|ref|ZP_01743708.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] gi|126702821|gb|EBA01926.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] Length = 576 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 4/83 (4%) Query: 291 TD-GENSSPNIDNKESLFY--CNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDRFYSVQ 346 TD G S+ ++L C AK RG ++ I +A + + L NCA+ D Y Sbjct: 494 TDWGSTSARTGSQSDTLMSANCTAAKDRGITIFTIAFEAPSNAETQLNNCATSDNHYYDA 553 Query: 347 NSRKLHDAFLRIGKEMVKQRILY 369 + F I + K ++ Sbjct: 554 QGTSITSVFSSIATTIQKLKLTL 576 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 74/207 (35%), Gaps = 42/207 (20%) Query: 4 LNIRNFFYN----CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYT 59 ++R F N KGS++ ++ ++ GL I+ + + ++ LD ++L Sbjct: 22 FSMRKVFSNWRKSEKGSMTAFGIFIVAIMVTSAGLSIDFMRQERTRVQMQQNLDTAVLSA 81 Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 A+ + + Y NI +++ + S+++ Sbjct: 82 ASLLQTLG------AEAVVTDYMSKANI-------------------DVDYNLSVNVSEG 116 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 + + +A + + F N + +TS + + L++ +VLDVS Sbjct: 117 INFRAVDATATAT-----LETLFLGLLNIDSLGITVTSGAE---ERIPNLEISLVLDVSG 168 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDII 206 SM + +L + + + I Sbjct: 169 SMGSN-----SRLTNLKTAATQFVSTI 190 >gi|83644399|ref|YP_432834.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83632442|gb|ABC28409.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 687 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/277 (13%), Positives = 82/277 (29%), Gaps = 46/277 (16%) Query: 88 WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCAN 147 W ++ + Q+ + ++S S D KD + + +P + Sbjct: 227 WTVSLQSNAQSMEAVQEGDANAPASSPSAYRLD--KDIVVYWRQQQNLPGSVDLITYKEP 284 Query: 148 SSHAP--LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 +L + G D +VLD S SM+ F ++ L Sbjct: 285 GKDKGTFMLTVTPGDDLPAITEGRDWTLVLDRSGSMSGKFSTLLEGLR------------ 332 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV-----QHIQEKINRLI---FGSTTKSTP 257 K N R ++ F+ + G +++Q+ + + T Sbjct: 333 -KGFAKFNRNDRVRVIMFNDNA---TEVTNGWVQATPENLQQVVGAVENAGPSGGTNLMS 388 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 ++ A + + I +TDGE + K + +++ Sbjct: 389 AIQSALTGLDADRTNA-------------IWLVTDGEANVGETKQKAFIEL---LEKKDI 432 Query: 318 IVYAIGVQAEAADQFLKNCASPDRFYS--VQNSRKLH 352 ++ + A L+ ++ V NS + Sbjct: 433 RLFTFIMGNSANRPLLEAITKHSNGFAISVSNSDDII 469 >gi|238793630|ref|ZP_04637253.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] gi|238727045|gb|EEQ18576.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] Length = 480 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 80/233 (34%), Gaps = 29/233 (12%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 N +G+I I LP+I ++ E +H K KL ++ + L + Sbjct: 17 LIKNEQGAILFPFIIFLPLIIGLIFFSFELAHFLQKKTKLSDAMEQATLALTVE------ 70 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 NN + I + + L F+ NI + + + Y+ Sbjct: 71 NNNSTPSAAQITKNA--EIVSSYAQAYLPAETFSTPTINIIYNNGRIEYGAEINMSYSAK 128 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 +S ++ + + +I++ ++ + D++ V D S SM+++F Sbjct: 129 FLSNIQVTNLSTIINATDRGAARKNIISAPIEKT-------DVVFVADYSNSMDEYFYHD 181 Query: 189 MD--KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 + K VA R I L+ + + P +WG ++ Sbjct: 182 ENEPKKIVALREIFNRLNDNVLKNK------------NIHTIGFIPFSWGTKN 222 >gi|296206127|ref|XP_002750076.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Callithrix jacchus] Length = 946 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 71/201 (35%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ I Sbjct: 311 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFS------VVDFNHNIRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V + I ++ T L A + +A Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNMGLLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + E R ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKENIRDNISLFSLGMGFDVDYDFLKRLSNENRGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLRKFYNQVST 495 >gi|296131396|ref|YP_003638646.1| Monophenol monooxygenase [Cellulomonas flavigena DSM 20109] gi|296023211|gb|ADG76447.1| Monophenol monooxygenase [Cellulomonas flavigena DSM 20109] Length = 971 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 67/200 (33%), Gaps = 31/200 (15%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P+ + H L+ ++ V S + +VLD S SM D G K + R++ + Sbjct: 435 PFVVGTFHVELVASNIVTTDS------SLALVLDRSGSMADVAAGGATKSTLLKRAVGVV 488 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI--NRLIFGSTTKSTPGLE 260 +++ + G+ F + P+ + + L T GL+ Sbjct: 489 HSLMQPTDE------IGIARFGTTADVVLPMTAASAGLGTVLTGTALDPAGATALGRGLQ 542 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 I G K +I +TDG + P + ++ + Sbjct: 543 EGSGLINGP---------GATKPNKAVIVMTDGNENIPPFVDDLPAGTVSQ------TTF 587 Query: 321 AIGVQAEA--ADQFLKNCAS 338 AIG+ +D L A+ Sbjct: 588 AIGLGLPGQVSDPVLDAVAA 607 >gi|161086898|ref|NP_001104314.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform b [Mus musculus] gi|1905819|gb|AAB50139.1| voltage-gated calcium channel alpha2/delta subunit, alpha2b isoform [Mus musculus] Length = 1091 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|194209576|ref|XP_001915257.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Equus caballus] Length = 1016 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 227 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 274 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 275 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 329 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 330 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 381 Query: 340 DRFYSV 345 +Y + Sbjct: 382 GYYYEI 387 >gi|31542335|ref|NP_037051.2| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 1 [Rattus norvegicus] gi|11055592|gb|AAG28164.1|AF286488_1 voltage-gated calcium channel alpha2/delta-1 subunit [Rattus norvegicus] Length = 1091 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|297714304|ref|XP_002833596.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like, partial [Pongo abelii] Length = 384 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 71/201 (35%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N VR+ Sbjct: 22 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFSVIDFNQNVRT------W 69 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + I ++ T L A + +A Sbjct: 70 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEASNLGLLDPNSVS---- 125 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + E + ++++G+ + FLK +S +R Sbjct: 126 LIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSSENRGIAQ 185 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 186 RIYGNQDTSSQLKKFYNQVST 206 >gi|271963054|ref|YP_003337250.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] gi|270506229|gb|ACZ84507.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] Length = 594 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 72/212 (33%), Gaps = 38/212 (17%) Query: 163 SKSDIGLDMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ ++++V+D S SM + G G +L +A ++ L P + GL Sbjct: 401 SELRKRANVLIVVDKSGSMEEEAAGTGESRLELAKKAAINAL------PQFRGDDKVGLW 454 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE------------YAYNKIFDA 269 FS++ ++ + S +K P L Y+ A Sbjct: 455 AFSTRQDGD----------RDYRELVPIDSVSKIGPALRDELDGLTAGGGTGLYDTTLAA 504 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEA 328 E++ ++FLTDG+N + ++L K ++ IG A Sbjct: 505 VERMRGARDAGAINA--VVFLTDGKNEKTGGSDLDNLL----GKLNPDVRLFTIGYGEGA 558 Query: 329 ADQFLKNC--ASPDRFYSVQNSRKLHDAFLRI 358 LK A+ Y + + F + Sbjct: 559 DQGVLKRIAEATDGAAYDSSRADTIDQVFTSV 590 >gi|11414932|dbj|BAB18558.1| voltage dependent calcium channel alpha2e/delta subunit [Rana catesbeiana] Length = 1083 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 78/221 (35%), Gaps = 45/221 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + ++ V F+S Sbjct: 254 DMLILVDVSGSVSGL------TLKLIRTSVTEMLETLSD----DDFVNVAA--FNSNAHD 301 Query: 230 TFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + ++E +N + TT G ++A++++ + + Sbjct: 302 VSCFHHLVQANVRNKKVLKEAVNNITAKGTTDYKQGFKFAFDQLRNTNVSRANCN----- 356 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG E+ N K + V+ V + C + Sbjct: 357 --KIIMLFTDG----GEDKATETFKLYN--KNKTVRVFTFSVGQHNYDKGPIQWMACENK 408 Query: 340 DRFYSV-------QNSRKLHDAFLR---IGKEMVKQRILYN 370 +Y + N+++ D R + +E KQ N Sbjct: 409 GYYYEIPSIGAIRINTQEYLDVLGRPMVLAREKAKQVQWTN 449 >gi|11414930|dbj|BAB18557.1| voltage dependent calcium channel alpha2d/delta subunit [Rana catesbeiana] Length = 1078 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 78/221 (35%), Gaps = 45/221 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + ++ V F+S Sbjct: 254 DMLILVDVSGSVSGL------TLKLIRTSVTEMLETLSD----DDFVNVAA--FNSNAHD 301 Query: 230 TFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + ++E +N + TT G ++A++++ + + Sbjct: 302 VSCFHHLVQANVRNKKVLKEAVNNITAKGTTDYKQGFKFAFDQLRNTNVSRANCN----- 356 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG E+ N K + V+ V + C + Sbjct: 357 --KIIMLFTDG----GEDKATETFKLYN--KNKTVRVFTFSVGQHNYDKGPIQWMACENK 408 Query: 340 DRFYSV-------QNSRKLHDAFLR---IGKEMVKQRILYN 370 +Y + N+++ D R + +E KQ N Sbjct: 409 GYYYEIPSIGAIRINTQEYLDVLGRPMVLAREKAKQVQWTN 449 >gi|325959993|ref|YP_004291459.1| Magnesium chelatase [Methanobacterium sp. AL-21] gi|325331425|gb|ADZ10487.1| Magnesium chelatase [Methanobacterium sp. AL-21] Length = 711 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 72/222 (32%), Gaps = 37/222 (16%) Query: 142 FPWCANSSHAPLLITSSVKIS--------SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 A A L + + K + +V+D+S SM K Sbjct: 490 IAIDATLRAAALRSEGEITVKTEDLRQKVRKHGAKASIAVVVDISGSMYGE-----KKAV 544 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQTFPLAWGVQHIQEKINRLIFGST 252 + +++ N + +V F + P +E+I L G T Sbjct: 545 RVKDILNNLIE-----DAARNGDKVSVVGFKGKDALIIIPTTRRAVSFKEQIENLKIGGT 599 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS--SPNIDNKESLFYCN 310 T G++ + + K + E++ ++ LTDG + +++ Sbjct: 600 TPLASGMKRGFEILKKEKFRDEYVP--------MMLILTDGMPNVAISKSPVDDAIDIAG 651 Query: 311 EAKRRGAIVYAIGVQAEAADQFLK------NCASPDRFYSVQ 346 K ++ I + E A ++ + AS R+Y ++ Sbjct: 652 SLKEN--EIHTIIINFEQAVKYGRDMNMELAVASGGRYYDLE 691 >gi|157819015|ref|NP_001100328.1| procollagen, type VII, alpha 1 [Rattus norvegicus] gi|149018489|gb|EDL77130.1| procollagen, type VII, alpha 1 (predicted) [Rattus norvegicus] Length = 2588 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 72/204 (35%), Gaps = 31/204 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ ++D S S+ + ++ + VR V +S Sbjct: 36 YAADIVFLIDGSSSIGRS------NFREVRGFLEGLVLPFSGAANA-QGVRFATVQYSDD 88 Query: 227 IVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F L +++ G T++ L + +++F L H+ + Sbjct: 89 PQTEFGLDTLGSGGDTIRAIRELSYK--GGNTRTGAALLHVSDRVF-----LPHLTRPGI 141 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 K I +TDG++ + KR+G ++A+G++ A + LK AS Sbjct: 142 P--KVCILITDGKSQDLVD------TAAQKLKRQGVKLFAVGIK-NADPEELKRIASQPT 192 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEM 362 D F+ V + L I + + Sbjct: 193 SDFFFFVNDFSILRTLLPLISRRV 216 >gi|114584071|ref|XP_001153479.1| PREDICTED: collagen alpha-3(VI) chain isoform 3 [Pan troglodytes] Length = 2971 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 54/315 (17%), Positives = 105/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 312 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 369 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ S Sbjct: 370 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHS 429 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 430 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDIGNDNIRVGLVQF 477 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y Y N +A Sbjct: 478 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFMEAGGSRIR- 534 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 535 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 584 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 585 FNPSLVYLMDDFSSL 599 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1325 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1384 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1385 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1441 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1442 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1494 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1495 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1553 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 59 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 118 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 119 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 168 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 169 FSFGLGAQAASRAELQHIATDDN 191 >gi|95930867|ref|ZP_01313598.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95133109|gb|EAT14777.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 698 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 48/260 (18%), Positives = 88/260 (33%), Gaps = 46/260 (17%) Query: 102 AQDINNIERSTSLSIIID-DQHKDYNLSAVSRYEM----PFIFCTFPWCA--NSSHAPLL 154 ++ + + + ID N V Y + P P+ A N++ +L Sbjct: 234 SEQGGEQGNNQIVDVTIDRPAGASLNKDIVFLYRLDDTTPARIELIPYKADRNATGTMML 293 Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + + G D VLDVS SM+ H K+ + + L + N+ Sbjct: 294 VVTPAADLQPITEGTDWTFVLDVSGSMDGH------KIATLADGVSQTLGKL------NS 341 Query: 215 VVRSGLVTFSSKIVQTFPLAWGV-----QHIQEKINR---LIFGSTTKSTPGLEYAYNKI 266 R ++TF+ L G + + + INR + G +T GLE A ++ Sbjct: 342 NDRFRIITFN---QSAADLTRGFVTATPEAVGQWINRVKTIAAGGSTNLFAGLETACRRL 398 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 D + I+ +TDG + + +E L ++ + Sbjct: 399 DDDRTTS-------------IVLVTDGVANVGRTEQREFLQL---LTEYDVRLFTFVIGN 442 Query: 327 EAADQFLKNCASPDRFYSVQ 346 A L A +++Q Sbjct: 443 SANRPLLDRLAKDSGGFAMQ 462 >gi|149018695|gb|EDL77336.1| similar to procollagen, type VI, alpha 3 isoform 4 (predicted) [Rattus norvegicus] Length = 719 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 59/344 (17%), Positives = 112/344 (32%), Gaps = 45/344 (13%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 V++ S K + L L TA + Q G + + +N +T+ Sbjct: 23 VVQYSDKIISQFLLTQYTSMEKLGTAIGNIQQGGGGTTTGEALSKMALVFRNTARTNVAQ 82 Query: 95 EL------RENGFAQDINNIERSTSLSIII----DDQHKDYNLSAVSRYEMPFIFCTFPW 144 L + + D R T ++I D + A +R F F + Sbjct: 83 YLIVITDGQSSDPVADAAQGLRDTGINIYAIGVRDANTTELEEIANNRV---FFTDDFHF 139 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + S + D++ ++D S S++ ++ M++ Sbjct: 140 LKSIHQEVVRDICSFENCRSQKA--DIIFLIDGSESISSE------DFEKIKDFVKRMVN 191 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH------IQEKINRLIFGSTTKSTPG 258 D ++ GL+ FSS + F L + ++ G+ T Sbjct: 192 QSNIGADK---IQIGLLQFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALN 248 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + I H +Y+I +TDG++ + ++L + R I Sbjct: 249 FTLPFFDISQGGRPGVH---------QYLIVITDGDSHDDIVSPAKAL------RDRNII 293 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++AIGV Q L D+ Y +N L + I E+ Sbjct: 294 IFAIGVGKIQRAQLLAITNDQDKVYHEENFESLQNLEKEILYEV 337 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 35/161 (21%), Positives = 66/161 (40%), Gaps = 14/161 (8%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 M D+IK + VR G+V +S KI+ F L +++ + Sbjct: 5 MKDVIKMFHIGPDGVRFGVVQYSDKIISQFLLT-QYTSMEKLGTAIGNIQQGGGGTTTGE 63 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A +K+ + +Y+I +TDG++S P D + L + G +YA Sbjct: 64 ALSKMALVFRNTAR-----TNVAQYLIVITDGQSSDPVADAAQGL------RDTGINIYA 112 Query: 322 IGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 IGV+ +A L+ A+ +R + + L + +++ Sbjct: 113 IGVR-DANTTELEEIAN-NRVFFTDDFHFLKSIHQEVVRDI 151 >gi|148688391|gb|EDL20338.1| mCG126758 [Mus musculus] Length = 223 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 48/222 (21%), Positives = 77/222 (34%), Gaps = 33/222 (14%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 SR P + + P + + + S D+ VLD S S+ +++ + Sbjct: 6 SRARSPGSWLFPGLWLLAVGGPGSLLQAQEQPSCKK-AFDLYFVLDKSGSVANNWIEIYN 64 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRL 247 + T V+ +R + FSS+ PL I E + + Sbjct: 65 FVHQLTERF------------VSPEMRLSFIVFSSQATIILPLTGDRYKIGKGLEDLKAV 112 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T GL+ A +I +A II LTDG +D + Sbjct: 113 KPVGETYIHEGLKLANEQIQNAGGLKASS---------IIIALTDG-----KLDGLVPSY 158 Query: 308 YCNEAKRR---GAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 NEAK+ GA VY +GV Q + S D+ + V+ Sbjct: 159 AENEAKKSRSLGASVYCVGVLDFEQAQLERIADSKDQVFPVK 200 >gi|291401551|ref|XP_002717040.1| PREDICTED: anthrax toxin receptor 2 [Oryctolagus cuniculus] Length = 486 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 70/211 (33%), Gaps = 29/211 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + D+ VLD S S+ +++ D + T V+ +R Sbjct: 33 AQEQPSCRTAFDLYFVLDKSGSVANNWIEIYDFVQKLTERF------------VSPEMRL 80 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + FSS+ PL I E + + T GL+ A +I A Sbjct: 81 SFIVFSSQATIILPLTGDRGKITKGLEDLKSVSPVGETYIHEGLKLANEQIQKA------ 134 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 G II LTDG+ + + ++ GA VY +GV Q K Sbjct: 135 ---GGLKTSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLEKI 189 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 S ++ + V+ A I ++ Q Sbjct: 190 ADSKEQVFPVKGG---FQALKGIINSILAQS 217 >gi|11414928|dbj|BAB18556.1| voltage dependent calcium channel alpha2c/delta subunit [Rana catesbeiana] Length = 1085 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 78/221 (35%), Gaps = 45/221 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + ++ V F+S Sbjct: 254 DMLILVDVSGSVSGL------TLKLIRTSVTEMLETLSD----DDFVNVAA--FNSNAHD 301 Query: 230 TFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + ++E +N + TT G ++A++++ + + Sbjct: 302 VSCFHHLVQANVRNKKVLKEAVNNITAKGTTDYKQGFKFAFDQLRNTNVSRANCN----- 356 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG E+ N K + V+ V + C + Sbjct: 357 --KIIMLFTDG----GEDKATETFKLYN--KNKTVRVFTFSVGQHNYDKGPIQWMACENK 408 Query: 340 DRFYSV-------QNSRKLHDAFLR---IGKEMVKQRILYN 370 +Y + N+++ D R + +E KQ N Sbjct: 409 GYYYEIPSIGAIRINTQEYLDVLGRPMVLAREKAKQVQWTN 449 >gi|89100226|ref|ZP_01173093.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085076|gb|EAR64210.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 463 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 55/374 (14%), Positives = 123/374 (32%), Gaps = 36/374 (9%) Query: 16 SISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQ 75 + IL IFI+ E+ + K D + + + + EN + + Sbjct: 3 NFIKKLTILFLSIFIITACSKESGNNTEQKQSKGN--DAAAIGQVDESEDTENAEDAEDA 60 Query: 76 KNDFSYRIIKNI--WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY 133 K + K I + + + A+D + ++ + + Sbjct: 61 KEQINLAENKKIPASLEEIISYPKGPFTAEDTQIKDPEVQQALSKVPELPEEASEEELND 120 Query: 134 EMPFIFCTFPWC---ANSSHAPLLITSS-VKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 +++ F + L ++ + SS+ ++ ++LD S SM + G Sbjct: 121 LFAYLYSLFRKEYRDPREAIVSLTVSGPESEGSSEEKGSFNVEIILDSSGSMANKMGS-Q 179 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVR----SGL-----VTFSSKIVQTFPLAWGVQHI 240 ++ +A SI++ + +V V +G ++ +S + P + + Sbjct: 180 TRMELAKASIKKFASALPEEANVGLRVYGHKGTGSDADKKMSCASNELVYAPQPYIEAEL 239 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 +N+ T L A + A + K + ++DG + Sbjct: 240 NTALNKFKPAGWTPLAQSLMEAQKDL---------EAYKGEKNKNIVYVVSDGIETC--- 287 Query: 301 DNKESLFYCNEAKRRGA--IVYAIGVQAEAAD-QFLKNC--ASPDRFYSVQNSRKLHDAF 355 + + K G +V IG D Q L+ A+ + +V++ +L + Sbjct: 288 -DGNPVEAAASLKDSGVAPVVNIIGFDVNGKDQQQLEEVAQAAGGTYQNVKSQEQLDNEL 346 Query: 356 LRIGKEMVKQRILY 369 + +E K Y Sbjct: 347 EKAIEESGKWTKWY 360 >gi|226307531|ref|YP_002767491.1| hypothetical protein RER_40440 [Rhodococcus erythropolis PR4] gi|226186648|dbj|BAH34752.1| hypothetical protein RER_40440 [Rhodococcus erythropolis PR4] Length = 233 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 73/212 (34%), Gaps = 22/212 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +V DVS SM G + + + I P + ++ R +++FS + Sbjct: 7 LPFYLVFDVSYSMEPVIGEVNNAMRALK-------NEILKDPILGDIARVCVLSFSDEAR 59 Query: 229 QTFP---LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P LA + +E + L T P + +I L+ +G ++ Sbjct: 60 IDVPMCDLADDTRITRE--DFLQVRGGTSFAPIFDLIGERIAADIADLKGHGEGK-VFRP 116 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQFLKNCASP----- 339 + F+TDG + + + K + + +G +A ++ L+ P Sbjct: 117 TVFFVTDGVPTDAVHEWNSAFTRLTSVKAYPNLVPFGLG---DADEEVLRAITFPPYRQD 173 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 F+ A I + + + + + Sbjct: 174 GYFFMANAGTSAEQAMQAITRIVTQSVVSCTQ 205 >gi|186896818|ref|YP_001873930.1| hypothetical protein YPTS_3520 [Yersinia pseudotuberculosis PB1/+] gi|186699844|gb|ACC90473.1| conserved hypothetical protein [Yersinia pseudotuberculosis PB1/+] Length = 518 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 83/240 (34%), Gaps = 35/240 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N +G+I + L+PV ++ L E SH +AKL L+ + L +T+ Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE------ 70 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFR-NELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 +ND + N T + + L F+Q + + +YN Sbjct: 71 ----NNYRNDRASNNRNNYLVTSYAQSYLPSERFSQ------PRVVNTYNEILGYTEYNA 120 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-----N 182 S Y++ + + + ++ K S +D++ V D S SM + Sbjct: 121 SLQMNYQLALLNSYLKQTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMNLPFGD 180 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 + KL ++ + I S +N + P +WG + I Sbjct: 181 IELNNRITKLDELKAIFVKLNNRIFSNDGIN-------------TIGFVPFSWGTKRISA 227 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 28/133 (21%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + +I + T ++ G+ + ++ + K +I L+DG++ Sbjct: 376 NSKGDINEILNMKAEGGTLASSGILVGNKMLTES-----------QNNNKLMIILSDGDD 424 Query: 296 S----SPNIDNKESLF----------YCNEAKRRGAIVYAIGVQAEAADQFL---KNCAS 338 + S D K + C + K G + IG+ + + K+C Sbjct: 425 NTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVG 484 Query: 339 PDRFYSVQNSRKL 351 FY +N+ +L Sbjct: 485 TGNFYLAKNAHEL 497 >gi|11414926|dbj|BAB18555.1| voltage dependent calcium channel alpha2b/delta subunit [Rana catesbeiana] Length = 1090 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 78/221 (35%), Gaps = 45/221 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + ++ V F+S Sbjct: 254 DMLILVDVSGSVSGL------TLKLIRTSVTEMLETLSD----DDFVNVAA--FNSNAHD 301 Query: 230 TFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + ++E +N + TT G ++A++++ + + Sbjct: 302 VSCFHHLVQANVRNKKVLKEAVNNITAKGTTDYKQGFKFAFDQLRNTNVSRANCN----- 356 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG E+ N K + V+ V + C + Sbjct: 357 --KIIMLFTDG----GEDKATETFKLYN--KNKTVRVFTFSVGQHNYDKGPIQWMACENK 408 Query: 340 DRFYSV-------QNSRKLHDAFLR---IGKEMVKQRILYN 370 +Y + N+++ D R + +E KQ N Sbjct: 409 GYYYEIPSIGAIRINTQEYLDVLGRPMVLAREKAKQVQWTN 449 >gi|194666191|ref|XP_604080.4| PREDICTED: collagen, type XXVIII-like [Bos taurus] Length = 1147 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 66/174 (37%), Gaps = 18/174 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ ++D S S DK S+ + + + + + ++ + FSS + Sbjct: 46 IDLVFIVDSSESSKIFL---FDKQKDFVGSLSDKIFQLTPVGSLKYDIKLAALQFSSSVQ 102 Query: 229 QTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P + W +Q ++++ + F G T S + A + K K Sbjct: 103 IDPPFSSWKDLQTFKQRVKSMNFIGQGTFSYYAIANATRLLKREGRKDGM---------K 153 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +TDG + N D + +A+ G + IG+ A + L+ + Sbjct: 154 VAVLMTDGIDHPKNPDVQS---ISEDARTAGILFITIGLSAVVNETKLRLISGD 204 >gi|260827134|ref|XP_002608520.1| hypothetical protein BRAFLDRAFT_92398 [Branchiostoma floridae] gi|229293871|gb|EEN64530.1| hypothetical protein BRAFLDRAFT_92398 [Branchiostoma floridae] Length = 1236 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 65/180 (36%), Gaps = 31/180 (17%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI--PDVNNV 215 S++ S L + V+D S SM+ D++G ++ + + I + D + Sbjct: 619 SLEASGHQRTPLRFVAVIDESYSMD-------DRIGRDKLTLIQRMQIFAELMAKDFKDE 671 Query: 216 VRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + G+VTF++ P+ G EKI + T + GL A + + Sbjct: 672 DQMGIVTFANDAKVVLPMTRMDSSGRDSALEKIQNISTRGQTNLSDGLLSAISMFKGSSG 731 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG--------AIVYAIG 323 H II TDG+ + ID E + N K G + IG Sbjct: 732 SDFHNG---------IILFTDGQANQGIIDAAELVQEYNS-KMAGLGEGVCLPISTFTIG 781 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 33/160 (20%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S+K+ + L + V+D S SM G + R M+ +K+ Sbjct: 41 EQETVSSNKTRLPLRFVAVIDESGSMASTIGNE-TLIYKMKIFARVMVRKMKAEDM---- 95 Query: 216 VRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 G+V F S P+ G + + I L + T G+ K+FD E Sbjct: 96 --LGIVGFDSDARVLLPITQMDKDGKKAAMDSIESLSAKTFTNLCEGILTG-AKLFDTTE 152 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 + H G ++ TDG + D + N Sbjct: 153 ECAHCRNG-------MVVFTDGIANQGITDADGIVSAFNS 185 >gi|331003698|ref|ZP_08327192.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] gi|330412081|gb|EGG91476.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] Length = 528 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 82/235 (34%), Gaps = 30/235 (12%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 PW ++ ++ + K+ +++ ++DVS SM++ DKL + + Sbjct: 139 ISACPWNPDTKLM-MIGMQAKKVEESEKKPSNLVFLIDVSGSMDEP-----DKLPLVKNA 192 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + + +K ++ V +G ++V + I I L +T + G Sbjct: 193 FLLLCEELKENDTISIVTYAG----YDQVVLEGASGSDSKEIMSAIEDLEAAGSTAGSDG 248 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++ AY + + +I TDG+ + + + K G Sbjct: 249 IKTAYKIAKKYFKSDGNNR---------VILATDGDLNVGITSEGKLTRLIKKEKESGVF 299 Query: 319 VYAIGVQAEA-ADQFLKNCA--SPDRF------YSVQN--SRKLHDAFLRIGKEM 362 + +G E D ++ A + + + S +L F + K++ Sbjct: 300 LSVLGFGTENIKDNKMEALADNGNGNYSYIDSRFEAKKVLSEELGANFFTVAKDV 354 >gi|301770299|ref|XP_002920606.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Ailuropoda melanoleuca] Length = 1081 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 226 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 273 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 274 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 328 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 329 --KVIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 381 Query: 340 DRFYSV 345 ++ + Sbjct: 382 GYYFEI 387 >gi|281338317|gb|EFB13901.1| hypothetical protein PANDA_009310 [Ailuropoda melanoleuca] Length = 1046 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 224 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 271 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 272 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 326 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 327 --KVIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 379 Query: 340 DRFYSV 345 ++ + Sbjct: 380 GYYFEI 385 >gi|153946957|ref|YP_001399586.1| hypothetical protein YpsIP31758_0593 [Yersinia pseudotuberculosis IP 31758] gi|152958452|gb|ABS45913.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP 31758] Length = 518 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 83/240 (34%), Gaps = 35/240 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F N +G+I + L+PV ++ L E SH +AKL ++ + L +T+ Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDAIEQASLALSTE------ 70 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFR-NELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 +ND + N T + + L F+Q + + +YN Sbjct: 71 ----NNYRNDRASNNRNNYLVTSYAQSYLPSERFSQ------PRVVNTYNEILGYTEYNA 120 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-----N 182 S Y++ + + + ++ K S +D++ V D S SM + Sbjct: 121 SLQMNYQLALLNSYLKQTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMNLPFGD 180 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 + KL ++ + I S +N + P +WG + I Sbjct: 181 IELNNRITKLDELKAIFVKLNNRIFSNDGIN-------------TIGFVPFSWGTKRISA 227 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 28/133 (21%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + +I + T ++ G+ + ++ + K +I L+DG++ Sbjct: 376 NSKGDINEILNMKAEGGTLASSGILVGNKMLTES-----------QNNNKLMIILSDGDD 424 Query: 296 S----SPNIDNKESLF----------YCNEAKRRGAIVYAIGVQAEAADQFL---KNCAS 338 + S D K + C + K G + IG+ + + K+C Sbjct: 425 NTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVG 484 Query: 339 PDRFYSVQNSRKL 351 FY +N+ +L Sbjct: 485 TGNFYLAKNAHEL 497 >gi|149915863|ref|ZP_01904387.1| hypothetical protein RAZWK3B_07284 [Roseobacter sp. AzwK-3b] gi|149810186|gb|EDM70032.1| hypothetical protein RAZWK3B_07284 [Roseobacter sp. AzwK-3b] Length = 235 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 78/239 (32%), Gaps = 49/239 (20%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND--HFGPGMDKLGVATRSIREMLD 204 + L +TS + + D M+VLD S SM++ ++ A ++ Sbjct: 2 GALALTLALTSPA--GAVTGCARDAMLVLDGSASMSEIGFDPTAPTRIDEARAAL----- 54 Query: 205 IIKSIPDVNNVVRSGLVTF--------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 +++P + V R GL+T+ S ++ P+ I ++ L G T Sbjct: 55 -AQAMPRIAVVRRVGLLTYGPGGTDACSGIDLRFGPIDDAAGPIIAAVDALRPGGLTPLA 113 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 ++ A + + ++ +TDG + ++A G Sbjct: 114 ASVQAAAEAL------------NYRTTPGIVVLVTDGNETCGGRPCALGTALADQA--EG 159 Query: 317 AIVYAIGVQAE--------------AADQFLKNCASP---DRFYSVQNSRKLHDAFLRI 358 V+ IG + + + C S + Q +L +A + Sbjct: 160 LTVHVIGFRVQYDPFAWDNPEAQTYDGGAVVAKCLSDRTGGLYVDTQTVEELTEALEAV 218 >gi|45382993|ref|NP_990865.1| collagen alpha-3(VI) chain precursor [Gallus gallus] gi|1345652|sp|P15989|CO6A3_CHICK RecName: Full=Collagen alpha-3(VI) chain; Flags: Precursor gi|211622|gb|AAA03201.1| alpha-3 collagen type VI [Gallus gallus] Length = 3137 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 78/212 (36%), Gaps = 30/212 (14%) Query: 151 APLLITSSVKISSKSDIGL------DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 LL++ + ++ + D++ ++D S S+ + + + D Sbjct: 13 LGLLLSGFCSVGAQQQAAVRNVAVADIIFLVDSSWSIGKEHFQLVREF---------LYD 63 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEY 261 ++K++ N R LV FS F L Q + I + G +K+ GLEY Sbjct: 64 VVKALDVGGNDFRFALVQFSGNPHTEFQLNTYPSNQDVLSHIANMPYMGGGSKTGKGLEY 123 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + + ++ + + II LTDG++ L K + A Sbjct: 124 ----LIENHLTKAAGSRASEGVPQVIIVLTDGQSQDDVALPSSVL------KSAHVNMIA 173 Query: 322 IGVQAEAADQFLKNCASP--DRFYSVQNSRKL 351 +GVQ + + + P ++++N L Sbjct: 174 VGVQDAVEGELKEIASRPFDTHLFNLENFTAL 205 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 87/234 (37%), Gaps = 19/234 (8%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + F+ P + + S D++ +LD SL++ + P + Sbjct: 607 FRLQFMQAILPEVLSPIRTLSGGMVIHETPSVQVTKRDIIFLLDGSLNVGNANFPFVRDF 666 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIFG 250 V ++ LD+ + +R GLV FS F L ++ + I +++ +L Sbjct: 667 VV---TLVNYLDV------GTDKIRVGLVQFSDTPKTEFSLYSYQTKSDIIQRLGQLRPK 717 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 + G A N + ++ ++ + ++ +T G ++ P L N Sbjct: 718 GGSVLNTG--SALNFVLSNHFTEAGGSRINEQVPQVLVLVTAGRSAVP------FLQVSN 769 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + R G + +A+GV+ + + +P Y + + L + K + Sbjct: 770 DLARAGVLTFAVGVRNADKAELEQIAFNPKMVYFMDDFSDLTTLPQELKKPITT 823 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/329 (13%), Positives = 109/329 (33%), Gaps = 50/329 (15%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN-------------- 99 +++L ++ + + D+ +KN + + + + Sbjct: 1502 NAVLQAIRRLRLRGGYPVNAGKALDYV---VKNYFIKSAGSRIEDGVPQHLVVILGDQSQ 1558 Query: 100 GFAQDINNIERSTSLS-------------IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCA 146 N+ STS+ + + L +P + Sbjct: 1559 DDVNRPANVISSTSIQPLGVGARNVDRNQLQVITNDPGRVLVVQDFTGLPTLERKVQNIL 1618 Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 P V + D++ +LD S+++ D + + ++D I Sbjct: 1619 EELTVP-TTEGPVYPGPEGKKQADIVFLLDGSINLGR------DNFQEVLQFVYSIVDAI 1671 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYN 264 D ++ ++ GL ++S + F L I + IN++I+ + G A Sbjct: 1672 --YEDGDS-IQVGLAQYNSDVTDEFFLKDYSSKPEILDAINKVIYKGGRVANTG--AAIK 1726 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + E ++ + +T G++S D + + E ++G V+A+GV Sbjct: 1727 HLQAKHFVKEAGSRIDQRVPQIAFIITGGKSSD---DGQGASM---EVAQKGVKVFAVGV 1780 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + ++ K + + V +++L + Sbjct: 1781 RNIDLEEVSKLASESATSFRVSTAQELSE 1809 Score = 40.2 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 58/170 (34%), Gaps = 18/170 (10%) Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTT--KSTPGLEY 261 ++S+ + VR +V +S+ I F L + I L + + L+Y Sbjct: 1061 VESLDIGRDKVRVAIVQYSNAIQPEFLLDAYEDKADLVSAIQALTIMGGSPLNTGAALDY 1120 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 +F + ++ + +++I LT D + K G + + Sbjct: 1121 LIKNVF----TVSSGSRIAEGVPQFLILLT---ADRSQDDVRRPSVV---LKTSGTVPFG 1170 Query: 322 IGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 IG+ + PD SV + +L + + + + I K Sbjct: 1171 IGIGNADLTELQTISFLPDFAISVPDFSQLD----SVQQAVSNRVIRLTK 1216 >gi|258647262|ref|ZP_05734731.1| BatB protein [Prevotella tannerae ATCC 51259] gi|260852911|gb|EEX72780.1| BatB protein [Prevotella tannerae ATCC 51259] Length = 339 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 60/179 (33%), Gaps = 32/179 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +S++ G+++ ++LDVS SM D +++ + ++ + + + Sbjct: 81 TTTSENKKGIEVAVMLDVSNSMLAQDVSPNRLERAKLLVSTLIDRMQN----------DK 130 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKL 273 L F+ + P+ + +N + G T + A D K Sbjct: 131 IALGVFAGEAYPQLPITGDYGAAKLFLNSITPGMVTLQGTNLAAAINLADKSFTDKKR-- 188 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K II +TDGE+ + + K VY +G+ Q Sbjct: 189 ---------VGKAIIIITDGEDHQGGAEEAAAAAAKEGRK-----VYILGIGNPGGAQI 233 >gi|225028889|ref|ZP_03718081.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] gi|224953773|gb|EEG34982.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] Length = 1070 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 46/255 (18%), Positives = 90/255 (35%), Gaps = 54/255 (21%) Query: 149 SHAPLLITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 L +T + K ++++ LD++ +LD S SM + FG G K A+ +I + +K Sbjct: 366 GTYDLTLTVAGKKGTETNKAKLDVIYILDKSGSMKEDFG-GTSKRIAASNAITALTKSLK 424 Query: 208 SIPDVNNVVRSGLVTFSSKI--------------VQTFPLAWGV-QHIQEKINRLIFGST 252 +++ R +VTFS ++W E+ ++ Sbjct: 425 QNANID--ARFSMVTFSGNKTTGMWGQGDTKTWDDAEVAVSWTTDAGTIERGSKPTSNGG 482 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS-SPNIDN--------- 302 T G+ A + + +IF++DG+ + N D Sbjct: 483 TNYQAGIRTAKELLTSKRAGAMTA----------VIFISDGDPTFYYNPDGYTRGDGNND 532 Query: 303 ----KESLFYCNEAKRRGAI------VYAIGVQ-AEAADQFLKNCASPD----RFYSVQN 347 ++L C +A + Y +GV A C++ + + N Sbjct: 533 GNGGADNLKVCLDAAKNEIANLGVNYFYTVGVGKANDYVNLSDLCSASGVSGAKNFDGTN 592 Query: 348 SRKLHDAFLRIGKEM 362 + +L AF I ++ Sbjct: 593 TDELTKAFSTIESDI 607 >gi|259490072|ref|NP_001159273.1| hypothetical protein LOC100304363 [Zea mays] gi|223943141|gb|ACN25654.1| unknown [Zea mays] Length = 459 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 76/216 (35%), Gaps = 37/216 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++VLD+S SM KL ++ + I R ++TF SK Sbjct: 42 APLDIVVVLDISGSMRG------TKLEHMKHAMTRFIIEKLGI----RGDRLAIITFESK 91 Query: 227 IVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L+ V+ + L G T GLE + + + H A Sbjct: 92 AHKVFDLSSMLPDQVKKAVAVVEGLKAGGDTNIKAGLEAGLDVLKT-RRGHSHNASC--- 147 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 I ++DG N+D +L + V G ++ +Q L + A Sbjct: 148 ----IFLMSDG---HENVDKARTLL--DRVGEH--SVVTFGFGEKSDEQLLYDIAYHSHA 196 Query: 342 --FYSV---QNSRKLHDAFL--RIGKEMVKQRILYN 370 ++ V ++ +L AF I + + + Sbjct: 197 GTYHHVREKEDENQLMKAFAFLAIYRSISMLDLKVT 232 >gi|311264542|ref|XP_003130217.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Sus scrofa] Length = 998 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 69/194 (35%), Gaps = 20/194 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ ++D S S DK S+ + L + + + ++ + FSS + Sbjct: 48 IDLIFIVDSSESSKIFL---FDKQKDFVDSLSDKLFQLTPVGSLKYDIKLAALQFSSSVQ 104 Query: 229 QTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P + W + ++++ + F G T S + A + K K Sbjct: 105 IDPPFSSWKDLHTFKQRVKSMNFIGQGTFSYYAIANATRLLKREGRKDS---------VK 155 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFY 343 + +TDG + N D + +A+ G I IG+ + L + SP Sbjct: 156 VALLMTDGIDHPKNPDVQS---ISEDARNAGIIFITIGLSTVVNETKLHLISGNSPGEPI 212 Query: 344 SVQNSRKLHDAFLR 357 + N L D Sbjct: 213 LLLNDSTLVDKIQN 226 >gi|171921010|gb|ACB59193.1| TadG [Actinobacillus suis ATCC 33415] Length = 554 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 49/321 (15%), Positives = 103/321 (32%), Gaps = 36/321 (11%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 SF I+ F + G +++ +L I +M + +E++ KA+L L+ ++L + Sbjct: 5 SFNQIKRFIQDESGVYAVIGGLLALPIVALMFVSLESAGIIQDKARLSDSLEQAVLSLSA 64 Query: 62 KILNQENGNNGKKQKNDFSYRIIK----------NIWQTDFRNELRENGFAQDINNIERS 111 + + N+ K D I ++ L + D N I+ Sbjct: 65 ENNSGRKSNDYKLSNTDAENGHFNPNSKIGERDLEISKSFVTTYLPQT----DPNKIKLQ 120 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI---- 167 + + + S + + W + +I + V I+S S Sbjct: 121 PVCTTTDKKNRQGHTASTETICTVAGTIEHKSWFPLKVGSTEVIPTEVNIASNSKAIKKN 180 Query: 168 ----GLDMMMVLDVSLSMNDHFGP------GMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +D+M+ D+S SM G K+ + + E+ + N+ R Sbjct: 181 TISIPIDLMVAADLSGSMRYDLENRYEPKDGTSKIDILKAVLTELSSNSLFSQESNDNNR 240 Query: 218 SGLVTF-------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + F +++ F L + I L +T ++ K + Sbjct: 241 IAVSPFALGAEYSTTECTLPFALKNNNRTI-NYTKSLGIPTTENVQDIIKNYLTKSGSSN 299 Query: 271 EKLEHIAKGHDDYKKYIIFLT 291 +L + + T Sbjct: 300 SQLSRAIFTQSLVTQIDVTNT 320 >gi|145552898|ref|XP_001462124.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429962|emb|CAK94751.1| unnamed protein product [Paramecium tetraurelia] Length = 533 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 72/196 (36%), Gaps = 29/196 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D++ ++D S SM K+ + +++++ML ++ R L+ F K+ Sbjct: 121 GVDLVCLIDHSGSMQGE------KIKLVRKTLKQMLTFLQPCD------RLCLIMFDCKV 168 Query: 228 VQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L VQ + I+ L T G++ A + + + + + Sbjct: 169 YRLTRLMRVTQENVQKFRVAISSLQARGGTDIGNGMKMALSILKH---------RKYKNP 219 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 I L+DG + ++ L N R + G + + + A + Sbjct: 220 VSAIFLLSDGVDEGAEERVRDDLIQYN--IRDSFTIKTFGFGRDCCPKIMSEIAHYKEGQ 277 Query: 342 FYSVQNSRKLHDAFLR 357 FY V N + + F Sbjct: 278 FYFVPNLTNIDECFAE 293 >gi|317493250|ref|ZP_07951672.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] gi|316918643|gb|EFV39980.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] Length = 544 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 89/226 (39%), Gaps = 26/226 (11%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 S + + PW A+S + I + I++ + +++ ++DVS SM+D Sbjct: 128 NSPFSVATEIAPTPWNAHSKLLRIAIKA-TDINATALPPANLVFLIDVSGSMSDE----- 181 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRL 247 DKL + S++ +++ ++ ++ +V +S + P G I IN+L Sbjct: 182 DKLPLVKNSLKLLVNKMRDQDKIS------IVIYSGETKTVLPPTSGKDKSDILSAINQL 235 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G +T G++ AY + K + II TDG+ + D ++ Sbjct: 236 SAGGSTAGGSGIDLAYQM------AEKGFIKNGINR---IILATDGDFNVGITDTQQLEE 286 Query: 308 YCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--SPDRFYSVQNSRK 350 + + G + +G D + + A + + + ++ Sbjct: 287 KIKKKSKNGINLTTLGFGQGNYNDSLMMHIADVGNGNYAYIDSMQE 332 >gi|218666515|ref|YP_002427074.1| hypothetical protein AFE_2697 [Acidithiobacillus ferrooxidans ATCC 23270] gi|218518728|gb|ACK79314.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 590 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 84/259 (32%), Gaps = 12/259 (4%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G I+I+ AI++P++ + + I+ H +V+ L I D + + A + N ++ Sbjct: 14 RGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQSLA 73 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 G KN + N E G + + + + Sbjct: 74 TGNAVKNGLQTSSTQITVTPGNWNPQIETGPSYFSAAVPYGHQAN----------AVQVQ 123 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 +P+ F P + A + + S S L++ L+ G+ Sbjct: 124 LSESVPYFFFFGPAKTVQAQAIAWVPNPAAGFSLSSTLLNVSEQQSALLNSLLGGLLGIH 183 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 L + + +L+ S+ + + G V ++ + L + + G Sbjct: 184 NLNLGVAAFNGLLNTSVSLGQLAQSIGVGTV--NNLLDANLTLPGLFTGALKAVGNQAAG 241 Query: 251 STTKSTPGLEYAYNKIFDA 269 ST A + + Sbjct: 242 GGLLSTQSATGALQTLVGS 260 >gi|326921803|ref|XP_003207144.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1054 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 81/208 (38%), Gaps = 33/208 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 L+++ V+D S S+ D ++ ++D + + + R G++ FS K Sbjct: 686 TPLELIFVIDSSESVGP------DNFISTKTFMKTVIDEVLA---NHAKTRIGVINFSHK 736 Query: 227 IVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + L + ++ +++++ G T + ++ A N A+ Sbjct: 737 VELVSSLEKYTTKESLKSAVDKMLYLGEGTYTASAIKKAINLFQAARPA----------V 786 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKN---CA 337 +K + +TDG+ + D EA ++ IG+ E D FLK A Sbjct: 787 RKVAVVVTDGQADAR--DEVHLDMVVREAHAANIEIFVIGIVQETDPHYDNFLKEMHLIA 844 Query: 338 SP---DRFYSVQNSRKLHDAFLRIGKEM 362 + + FY +++ + L ++ ++ Sbjct: 845 TDPDEEHFYRIEDFKTLSALTDKLITKI 872 >gi|300022610|ref|YP_003755221.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299524431|gb|ADJ22900.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 638 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 67/202 (33%), Gaps = 33/202 (16%) Query: 172 MMVLDVSLSMNDHFGPGMD-KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---- 226 M+++D S SM P K+ V + +L S R GLV+F + Sbjct: 32 MLIVDGSGSMWGRLAPDNKPKIDVVREKLATILQTPSS-------TRVGLVSFGHRRRGD 84 Query: 227 ---IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + + +L T LE A + I ++ Sbjct: 85 CNDVELIASPDSERAALLGPLAKLNPRGPGPVTAALEIAADAIGTSRPAQ---------- 134 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---ASPD 340 II + DG ++ + + A G V IG+ A ++ C A+ Sbjct: 135 ---IIIVGDGADNCQQDSCAAANDFAKSA--PGVAVQVIGIGVPATERPRIACVAQATGG 189 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 R+Y V ++ L+ A + Sbjct: 190 RYYDVTDAAGLNAALDEATQLA 211 >gi|241672104|ref|XP_002411442.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] gi|215504093|gb|EEC13587.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] Length = 1021 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 59/177 (33%), Gaps = 28/177 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G V T + L P N R +++FS V Sbjct: 50 DLVFLLDRSGSV------GQAGFEVETGFVHAFLKGFDVAP---NTTRVAVISFSEDAVV 100 Query: 230 TFPL---AWGVQHIQEKINRLIFG--STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 H+ K+ + T + GL+ A+ ++ K Sbjct: 101 HADFLKDPGNKCHLSRKMQGVHSANQGATNTGAGLQAAWEVFQRSRP----------TAK 150 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K +I +TDG + + + + K G ++ G+ + ++P Sbjct: 151 KLLILVTDGMATMG----PDPVKKAEKLKNMGVDIFVFGIGRMLKQHLEQLASTPAN 203 >gi|108756796|ref|YP_635538.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108460676|gb|ABF85861.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 700 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 72/201 (35%), Gaps = 25/201 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++S ++ V+DVS SMN ++LG+ R++ +++ + V+ Sbjct: 325 EVSRPQRKPSHLVFVIDVSGSMNLE-----NRLGLVKRALHLLVNELDERDQVS------ 373 Query: 220 LVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +V + S V I+ I+ L +T + GLE Y+ H+ Sbjct: 374 IVVYGSTARLVLEPTSAVHAHIIRAAIDSLHTEGSTNAQAGLEMGYSL------AASHLV 427 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNC 336 +G + +I +DG ++ D +G + +G D ++ Sbjct: 428 EGGINR---VILCSDGVANTGLTDANSIWERIRARAAKGITLSTVGFGMGNYNDVLMERL 484 Query: 337 A--SPDRFYSVQNSRKLHDAF 355 + + V + H F Sbjct: 485 SQVGEGNYAYVDRIEEAHRIF 505 >gi|298375542|ref|ZP_06985499.1| BatB protein [Bacteroides sp. 3_1_19] gi|298268042|gb|EFI09698.1| BatB protein [Bacteroides sp. 3_1_19] Length = 574 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 49/137 (35%), Gaps = 15/137 (10%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+++M+ LDVS SM ++L A ++ + D + Sbjct: 80 SKLETVKRQGVEIMVCLDVSNSMLAEDVSP-NRLDKAK-------QMLSRLTDGFTNDKV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+ F+ P+ + ++ + + + A N Sbjct: 132 GLIVFAGDAFTQLPITSDYISAKMFLSSINPSMVSTQGTAIGAAIN-------LAARSFT 184 Query: 279 GHDDYKKYIIFLTDGEN 295 + K II +TDGEN Sbjct: 185 PDETTDKAIILITDGEN 201 >gi|264678234|ref|YP_003278141.1| hypothetical protein CtCNB1_2099 [Comamonas testosteroni CNB-2] gi|262208747|gb|ACY32845.1| putative membrane protein [Comamonas testosteroni CNB-2] Length = 408 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 62/168 (36%), Gaps = 13/168 (7%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 G+ I A+ + + MG+ ++ F VK +L +D L A ++ N ++ + Sbjct: 12 GAFLITFALFMLFLLGFMGIALDLGRLFIVKTELQTAMDSCALAAAREL-NGQSDAITRA 70 Query: 75 QKNDFSYRIIKNI----WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 Q + N + + +L G + + TS + Y++S++ Sbjct: 71 QNAGMAAGNSNNANLQSANWNGQGKLPATGISFRKQDYVTPTSDGKLARYAECQYSMSSI 130 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSS-----VKISSKSDIGLDMMM 173 ++ + + +S+ P T + S+S + + + Sbjct: 131 ---KLWLLQAMGAFTGDSATWPNTGTVEARAVATRAPSQSACPIPVQL 175 >gi|116254826|ref|YP_770662.1| hypothetical protein pRL100386 [Rhizobium leguminosarum bv. viciae 3841] gi|115259474|emb|CAK10612.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 644 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 76/212 (35%), Gaps = 24/212 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ PW ++ + I + + +++ ++DVS SM++ DKL Sbjct: 241 FKATVTVMPTPWNHDTELMHVAIKGYDIAPATTPHA-NLVFLIDVSGSMDEP-----DKL 294 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFG 250 + + R +++ +K V+ +VT++ I I+RL G Sbjct: 295 PLLKSAFRLLVNRLKPDDTVS------IVTYAGNAGTVLTPTRVAEKSKILSAIDRLEAG 348 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+E AY L D + + TDG+ + +++ Sbjct: 349 GSTGGAEGIEAAY--------DLAKQGFVKDGVNRVM-LATDGDFNVGPSSDEDLKRIIE 399 Query: 311 EAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 E ++ G + +G D ++ A Sbjct: 400 ERRKDGIFLTVLGFGRGNLNDSLMQTLAQNGN 431 >gi|47219514|emb|CAG09868.1| unnamed protein product [Tetraodon nigroviridis] Length = 1450 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 73/199 (36%), Gaps = 29/199 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ ++D S S+ D + + M+ + I N ++ LV +S Sbjct: 690 KGAKADLVFLIDGSWSIGDE------SFNKVIQFVTSMIGAFEVISP--NGMQVSLVQYS 741 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F L + + + + + G TK+ L++ Y K+F + + Sbjct: 742 DDAKTEFKLNTYYNKGIVISALKSVRYRGGNTKTGIALKHVYEKVFTS------DSGMRR 795 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD- 340 + K ++ LTDG + + + + G V+ +GV + + P Sbjct: 796 NVPKVLVVLTDGRSQD------DVKKSAEKLQHSGYSVFVVGVADVDMTELRIIGSKPSE 849 Query: 341 -RFYSVQNSRKLHDAFLRI 358 + V + +DAF +I Sbjct: 850 RHVFVVDD----YDAFAKI 864 >gi|314923048|gb|EFS86879.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966820|gb|EFT10919.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093260|gb|EFT65236.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103482|gb|EFT75458.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327646|gb|EGE69422.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL103PA1] Length = 322 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 64/204 (31%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D SLSM + + D I S+P N +V+ S Sbjct: 96 IVVAIDSSLSMKADDVSP----TRLAAAKAKAKDFINSLPTGFN---VAVVSISEHPEIR 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ + T ++ + + A ++ A I+ L Sbjct: 149 MPPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVML 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG+N+ L N A VY I E + L Sbjct: 203 SDGDNTQGG----SPLVAANRAAAAKVPVYTIAFGTETGYVDLDGQRERVAPDTKLLSTV 258 Query: 337 ASPDRFYS--VQNSRKLHDAFLRI 358 A S ++ KL + + ++ Sbjct: 259 ADRTHAQSWTADSADKLQEVYQQV 282 >gi|291299883|ref|YP_003511161.1| Vault protein inter-alpha-trypsin domain-containing protein [Stackebrandtia nassauensis DSM 44728] gi|290569103|gb|ADD42068.1| Vault protein inter-alpha-trypsin domain protein [Stackebrandtia nassauensis DSM 44728] Length = 831 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 70/210 (33%), Gaps = 36/210 (17%) Query: 168 GLDMMMVLDVSLSMND--------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 D++++LD S SM +D L A R D + P+ + +G Sbjct: 303 PRDVVVLLDRSGSMGGWKMVAARRAAARIVDTLSSADRFAVRCFDTAMTSPEGLDP--NG 360 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L + + E + T L A + + ++ Sbjct: 361 LSAGTDR---------NRFRAVEHLAGTETRGGTDILKPLSTAVDLLTAGEKGR------ 405 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + II +TDG+ + N++ + + G V+ +G+ FL A Sbjct: 406 ----DRVIILVTDGQ-----VGNEDQILRELTGRLSGMRVHVVGIDKAVNAGFLHRLALV 456 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 R V++ +L +A I + +V + Sbjct: 457 GRGRCELVESEDRLDEATAHIHRRIVAPVV 486 >gi|198421589|ref|XP_002123523.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 79/191 (41%), Gaps = 21/191 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D M V+D S S+ + ++ + + S D+++ V+ G+V +S+ Sbjct: 205 IDFMFVVDGSRSVGNESFEV----------VKHWIQQVTSGFDISSSVQVGVVQYSTYQY 254 Query: 229 QT--FPLAWGVQHIQEKINRLIFGSTTKS----TPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + E + ++F + AY E + + D Sbjct: 255 RKVVQPFIKTEIRLGEYKDHILFDAAVDKIKYHDRSTFTAYAIRKTVNEDFKGNMSRYPD 314 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDR 341 ++ ++ LTDG+++ D ++ EAK+ G +A+GV ++ + + SPD+ Sbjct: 315 SRRVMVLLTDGQST----DKEDLSSAAAEAKQEGVETFAVGVGSKIILSELVLIAGSPDK 370 Query: 342 FYSVQNSRKLH 352 +V + +L Sbjct: 371 VITVNDFNELL 381 >gi|332358821|gb|EGJ36643.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1056] Length = 434 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 49/344 (14%), Positives = 111/344 (32%), Gaps = 64/344 (18%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETS-HKFF---------------VKAKLHYIL 52 + ++ I+ ++ ++GL+I + F ++ + Y + Sbjct: 1 MMKKIQKGFTLTEIIIAIILTSMVGLLIGLVFNTMFSGRNIIEREASIQSEMRTSMQY-V 59 Query: 53 DHSLLYTATKILNQENGNNGKKQK---------NDFSYRIIKNIWQTDFRNELRENGFAQ 103 D ++ + + E+ +K + ++I IW ++ Sbjct: 60 DRTIGKATSVFVLDESKYGKDVRKTEGWNYIGLSPDGKKVINYIWNKSTKSWDESVLGTN 119 Query: 104 DINNIERSTSLSI---IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 + +++ D++ +YNL+ +Y+ + ++ + +I+ K Sbjct: 120 SLYDMQLDLEFKADESYQDNRLINYNLT--GQYKNSKNKLSIDTAISALNTKQVISKVAK 177 Query: 161 I-----------SSKSDIGLDMMMVLDVSLSMNDHFGP-------GMDKLGVATRSIREM 202 + + + V D S SM ++ + + Sbjct: 178 GKKGVALAYRNDPIEGQVNTAVTFVFDTSGSMGYGLWNQKLEPTDSRTRMNILKTKANLL 237 Query: 203 LDIIKSIPDVN-NVVR-SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 +D +K I +V+ N+VR SG ++ L I+ KI L T GL Sbjct: 238 VDDLKEIGNVSVNLVRFSGDASY--IQEDFVELDKDTDTIKTKIKALPTSWITNPGDGLR 295 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 Y + +L KY++ LTDG ++ Sbjct: 296 YGLVSLQRNPAQL-----------KYVVLLTDGIPNAYTASPDG 328 >gi|296225414|ref|XP_002758468.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Callithrix jacchus] Length = 904 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 49/145 (33%), Gaps = 21/145 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM+ K+ ++ ++LD + N L+TFSS+ Q Sbjct: 275 VVFVIDKSGSMSG------RKIQQTREALIKILDDLSPRDQFN------LITFSSEATQW 322 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P A V + + T + A + + + + Sbjct: 323 SPSLVPASAENVNKARSFAAAIHALGGTNINDAVLMAVQLLDRSNREERLPTRSVS---- 378 Query: 286 YIIFLTDGENSSPNIDNKESLFYCN 310 II LTDG+ + S C Sbjct: 379 LIILLTDGDPTVGEGPASNSKTRCT 403 >gi|124003889|ref|ZP_01688737.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123990944|gb|EAY30411.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 704 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 42/264 (15%), Positives = 93/264 (35%), Gaps = 35/264 (13%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 + + + + + + S + Y T PW + + + + +K+ +++ Sbjct: 294 TPTKDKEGKLQTHPFSVNTEY------GTCPWNPHHKLLQIGLQGE-NLQTKNASPANLV 346 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 ++D S SM+ DKL + RS + +L K + D + +S +V Sbjct: 347 FLVDASGSMDSE-----DKLPLLKRSFKVLL---KQLTDSRTKIAIVAYAGASGLVLPAT 398 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + I + + G +T G+E AY A + +I TD Sbjct: 399 SVSHREKILTALENIESGGSTAGGEGIELAYKIAQQAFIAGGNNR---------VILATD 449 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNC--ASPDRFYSVQN-- 347 G+ + ++E + + ++ G + +G D ++ A +Y + Sbjct: 450 GDFNVGLSSDEELMQLISNKRKSGVYLTCLGFGTGNLNDSMMEKLTNAGNGNYYYIDGIN 509 Query: 348 ------SRKLHDAFLRIGKEMVKQ 365 ++ L I K++ Q Sbjct: 510 EAKKVLAKNLTGTLYAIAKDVKIQ 533 >gi|47523446|ref|NP_999348.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Sus scrofa] gi|3341749|gb|AAC36289.1| voltage-dependent calcium channel alpha-2 delta subunit precursor [Sus scrofa] Length = 1091 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|262200403|ref|YP_003271611.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262083750|gb|ACY19718.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 423 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 53/179 (29%), Gaps = 23/179 (12%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +VLD S SM+ GP + A + LD G+VTF Sbjct: 36 RAPAALQVVLDRSGSMS---GPPLAGAQRALAGVIGQLDPRDVF---------GVVTFDD 83 Query: 226 KIVQ---TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PLA + + ++ G T + G ++ A Sbjct: 84 DAQVVLPAAPLA-DKARAVDAVGSIVPGGCTDLSSGYLRGLQELRRATASAGIRGGT--- 139 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ ++DG + D E +A G I +G + L A Sbjct: 140 ----VLVISDGHVNRGIRDLDEFASITAKAAADGIITSTLGYGRGYDETLLSAIARSGN 194 >gi|239617365|ref|YP_002940687.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239506196|gb|ACR79683.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 676 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 78/224 (34%), Gaps = 33/224 (14%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P+ T +S K +D++ VLD + SM + +++ + Sbjct: 338 IPVGTTEPRLVSEKLVNFVDVVFVLDTTGSMTQELN----------GMVENLIEFSNILE 387 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK----STPGLEYAYNKI 266 + + R GLVTF +I T L + I+ + S L A N Sbjct: 388 NYGVLARVGLVTFGDEIRLTADLTPSFEKIRRLLQSQTADGGGDVPEISLDALNEALNMN 447 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR----GAIVYAI 322 F +K +I +TD K S E +++ GA + I Sbjct: 448 FLDN------------SQKILILITDASPHIEGDGTKFSSTTIEETRKKILASGATL--I 493 Query: 323 GVQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 V ++F++ P + + +++ + + K++ +Q Sbjct: 494 LVVPSNKEEFVRLSEDIPGQLLDIHSAKSFGELIKFVAKQITRQ 537 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 62/174 (35%), Gaps = 13/174 (7%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 I S + + +D++ V+D S +M D ++KLG D+++ + + Sbjct: 59 IISIKTEAQEKKKPVDIVFVVDNSGTMYDKVQIVIEKLG----------DLVRLLHENGY 108 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R ++ F +++ + F + G + + T PG ++ + Sbjct: 109 DARFAVLGFGTEVNKEFVVTGGSRFTSSPEKTIERLKETIQYPG---GKDECQIHALWIA 165 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 D K +I LTD + + ++ +R V+ + + Sbjct: 166 SNYDFRKDASKILILLTDEDTTQNKLNEVAKPKLVENIIKRNLTVFTLRYDPDP 219 >gi|156402479|ref|XP_001639618.1| predicted protein [Nematostella vectensis] gi|156226747|gb|EDO47555.1| predicted protein [Nematostella vectensis] Length = 412 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 70/205 (34%), Gaps = 27/205 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+ +LD S S+ + I + + D ++ GLV FSS Sbjct: 223 KAKVDVGFLLDGSGSVEFY---AKGNFQRCKNFINKFVKSFMVSKDDSHF---GLVLFSS 276 Query: 226 KIVQTFPLA--WGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + I +N + T + GL A + ++ A + Sbjct: 277 DSNVEFKFDDHYDAASITAAVNATNYPGMGTYAGKGLTLAKDDLYSAPVRSG-------- 328 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---P 339 + +I +TDG +S ++L + G ++A+G+ L + S P Sbjct: 329 VPRILIVMTDGISSDDVAGPAKAL------RDMGVEIFALGIGKNYDQGQLDSMGSDPKP 382 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 D + + KL I + K Sbjct: 383 DHVVTA-DFDKLDPVIQTIKDKACK 406 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 69/205 (33%), Gaps = 27/205 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+ +LD S S+ + I + + D ++ GLV FSS Sbjct: 3 KAKVDVGFLLDGSGSVEFY---AKGNFQRCKNFINKFVKSFMVSKDDSHF---GLVLFSS 56 Query: 226 KIVQTFPLA--WGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + I +N + T + GL A + ++ A + Sbjct: 57 DSNVEFKFDDHYDAASITAAVNATKYPGMGTYAGKGLTLAKDDLYSAPVRSG-------- 108 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---P 339 + +I +TDG +S ++L + G ++A+G+ L + S P Sbjct: 109 VPRILIVMTDGISSDDVAGPAKAL------RDMGVEIFALGIGKNYDQGQLDSMGSDPKP 162 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 D + + KL I + Sbjct: 163 DHVVTA-DFDKLDPVIQTIKDKACN 186 >gi|147905660|ref|NP_001090738.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Xenopus (Silurana) tropicalis] gi|120537294|gb|AAI29013.1| LOC100036724 protein [Xenopus (Silurana) tropicalis] Length = 1076 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 67/187 (35%), Gaps = 37/187 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL--------V 221 DM++++DVS S++ L + S+ EML+ + ++ V Sbjct: 254 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSD----DDFVNVAAFNNNAHDVS 303 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F+ + + ++E +N + TT G ++A++++ + + Sbjct: 304 CFNHLVQANV---RNKKKLKEAVNNITAKGTTDYKTGFKFAFDQLLNHNVSRANCN---- 356 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCAS 338 K I+ TDG KE+ N K + V+ V + C + Sbjct: 357 ---KIIMLFTDG----GEDKAKETFEAYN--KDKTVRVFTFSVGQHNYDKGPIQWMACQN 407 Query: 339 PDRFYSV 345 +Y + Sbjct: 408 KGFYYEI 414 >gi|237728581|ref|ZP_04559062.1| TerY1 [Citrobacter sp. 30_2] gi|226910059|gb|EEH95977.1| TerY1 [Citrobacter sp. 30_2] Length = 212 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 66/196 (33%), Gaps = 16/196 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM+ + ++ +L +K P ++TF S Sbjct: 3 RLPVYLLLDTSGSMHGE------PIEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + ++ L TT L + I +K KG + Sbjct: 57 RQAVPLT---DLLSFQMPALTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG SPN D ++ L A+ G V A +A LK + Sbjct: 112 FLMTDG---SPNDDWRKGLNDFKAART-GV-VVACAAGHDADTSVLKEITEIVVQLDTAD 166 Query: 348 SRKLHDAFLRIGKEMV 363 S + F + + Sbjct: 167 SSTIKAFFKWVSASIS 182 >gi|62896633|dbj|BAD96257.1| complement component 2 precursor variant [Homo sapiens] Length = 752 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNLTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVG 414 >gi|125830338|ref|XP_692362.2| PREDICTED: anthrax toxin receptor 1 [Danio rerio] Length = 552 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 44/197 (22%), Positives = 68/197 (34%), Gaps = 27/197 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 V+ D+ VLD S S+ H+ S E+L+ P Sbjct: 24 SARAEEDVEEERSCQGAFDLYFVLDKSGSVKHHWQE--------IYSFVELLEQKFISP- 74 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFD 268 ++R + FS++ L + I++ +N L I G T GLE A +I+ Sbjct: 75 ---MLRMSFIVFSTRGNTIMRLTENRETIRKGLNVLRREIPGGDTFMHLGLEKANEQIYQ 131 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 II LTDGE + + A+ GAIVY +GV+ Sbjct: 132 ENYGTAS----------VIIALTDGELQEHQLIAAQQEAA--RARTLGAIVYCVGVKDFN 179 Query: 329 ADQFLKNCASPDRFYSV 345 Q + + V Sbjct: 180 ETQLATIADTSKHVFPV 196 >gi|77747911|ref|NP_638263.2| hypothetical protein XCC2915 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|77761138|ref|YP_242283.2| hypothetical protein XC_1194 [Xanthomonas campestris pv. campestris str. 8004] Length = 597 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 44/238 (18%), Positives = 86/238 (36%), Gaps = 34/238 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW ++ + + ++ + + +++ ++DVS SM DKL + S++ Sbjct: 202 TPWNTDTLLLRIGVAGR-EVPTAALPAANLVFLVDVSGSMG-----APDKLPLLQSSLKL 255 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGL 259 + + + R LVT++ P G Q I E I+ L G T G+ Sbjct: 256 L------VRQLRKQDRITLVTYAGSTAVVLPPTSGAQQTRIVEAIDSLQSGGGTAGASGI 309 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E AY A +G + I+ TDG+ + D + E +R G + Sbjct: 310 ELAYKAAQQA------YLRGGINR---ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVAL 360 Query: 320 YAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 +G D ++ A + + ++ +L I +++ Q Sbjct: 361 STLGFGTGNYNDTLMEQLADAGDGAYAYIDSALEARKVLTHELGSTLATIARDVKIQV 418 >gi|320106177|ref|YP_004181767.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924698|gb|ADV81773.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 370 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 81/222 (36%), Gaps = 37/222 (16%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++K S D + + +V D+S SM FG R+ + + + +++ + Sbjct: 134 QTIKTFSTDDAPVSIGIVFDLSGSMMSKFG----------RARKALSEFMRTSNPQDEFF 183 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +V F+ + V + ++ L T NK+ DAK + Sbjct: 184 ---VVGFNDRPAVIVDYTSNVDDVDARMVMLRPERRTALIDAAYLGLNKLKDAKYER--- 237 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV----QAEAADQ- 331 K ++ ++DG ++ E + +Y+IG+ A ++ Sbjct: 238 --------KALLIISDGGDNRSRYVESELRR---AVRESDTQIYSIGIFDVYAATPEEKS 286 Query: 332 ----FLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + C + R + V ++ +L D RI E+ + +L Sbjct: 287 GPTLLMDICEMTGGRMFRVTDADELGDIAARISAELRNEYVL 328 >gi|126334040|ref|XP_001370580.1| PREDICTED: similar to Integrin, alpha M (complement component 3 receptor 3 subunit) [Monodelphis domestica] Length = 1156 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 75/207 (36%), Gaps = 34/207 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + ++D K + L+ +S Sbjct: 156 DIVFLIDGSGSIRPL------QFVQMKNFVMTVMDQFKGTD-----TQFSLMQYSDDFKT 204 Query: 230 TFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + + T + G+ ++F AK K Sbjct: 205 HFTFNNFKNDPTSKNLVGPIEQLNGKTHTASGIRKVVRELFQEWNGARKDAK------KI 258 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFLKNCAS---P 339 +I +TDG+ +++ ++ + EA++ G I YAIG +A Q L+ AS Sbjct: 259 LIVITDGQIQGDSLNYRDVIP---EAEKEGVIRYAIGVGYAFNTPSARQELRTIASQPAQ 315 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + V N DA I ++ ++ Sbjct: 316 EHVFQVNN----FDALKNIQNQLQEKI 338 >gi|118593261|ref|ZP_01550646.1| von Willebrand factor type A like domain [Stappia aggregata IAM 12614] gi|118434152|gb|EAV40808.1| von Willebrand factor type A like domain [Stappia aggregata IAM 12614] Length = 772 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 64/199 (32%), Gaps = 26/199 (13%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDI 205 + LLI + +++ VLD S SM+ + A +++R D Sbjct: 342 GGGYFSLLIEPPKLPAEDMIGQRELVFVLDTSGSMSGQPIEASKTFMTAAIKALRP--DD 399 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 I +N F+ + V Q + + L G T+ + A+++ Sbjct: 400 YFRILHFSNDT----SQFAGQAVLATE--RNKQKALKFVADLSAGGGTEINQAVNAAFDQ 453 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + ++FLTDG K +A+ +YA GV Sbjct: 454 AQPDN------------TTRIVVFLTDGYIGDEATVIKSIANRIGKAR-----IYAFGVG 496 Query: 326 AEAADQFLKNCASPDRFYS 344 L A+ R Y+ Sbjct: 497 NSVNRFLLDAMATEGRGYA 515 >gi|253582503|ref|ZP_04859725.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] gi|251835648|gb|EES64187.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] Length = 376 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/209 (13%), Positives = 72/209 (34%), Gaps = 19/209 (9%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F S ++V+ + + ++++ VLD + SM + A I Sbjct: 13 FLGTIVFSTESKPENTAVEQTKAKEKDVEIVFVLDTTGSMG-------GLIQGAKTKIWS 65 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSK----IVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 +++ + ++ V+ GLV + + + + L+ + I + Sbjct: 66 IVNEVMQT-HKDSKVKIGLVAYRDRGDVYVTKVTQLSENLDEIYSVLMGYKAQGGGDDPE 124 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 + A ++ + + + ++ + I + D D+ ++ +AK RG Sbjct: 125 DVRKALHESLE----VIQWSTPRENLSQIIFLVGDAPPHDDYNDSPDTSDTAKKAKSRGI 180 Query: 318 IVYAIGVQ-AEAADQFLKNCA--SPDRFY 343 I+ I D + K A ++ Sbjct: 181 IINTIQCGDMPKTDYYWKAIAQFGGGEYF 209 >gi|332669282|ref|YP_004452290.1| von Willebrand factor type A [Cellulomonas fimi ATCC 484] gi|332338320|gb|AEE44903.1| von Willebrand factor type A [Cellulomonas fimi ATCC 484] Length = 538 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 83/223 (37%), Gaps = 29/223 (13%) Query: 143 PWCANSSHAPLLITSS-VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW A + + ++ VK +++ + +++ +LDVS SM++ +KL + S Sbjct: 159 PWAPGHQLAMIGVQATDVKPTTRGN---NVVFLLDVSGSMDEP-----NKLPLLADSFAL 210 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +++ + V+ V +G S +++ I + + L G +T GLE Sbjct: 211 LVEQLDEDDTVSIVTYAG----SDQVLADSVPGDRRGEIVDILRELRAGGSTGGARGLET 266 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 AY E + + +I TDG+ + ++ E R G + Sbjct: 267 AY-------ELAAKNFVEGGNNR--VILATDGDFNVGPSTPEQLTELIEEHARTGVYISV 317 Query: 322 IGVQAEA-ADQFLKNCA--SPDRFYSVQNSRK----LHDAFLR 357 +G D ++ A + + + L D F Sbjct: 318 LGFGMGNLKDSTMEAIADHGNGNYAYIDTLDEARKVLVDEFDS 360 >gi|319783082|ref|YP_004142558.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168970|gb|ADV12508.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 704 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 80/217 (36%), Gaps = 24/217 (11%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 SA + + PW ++ + I ++ +++ ++DVS SM++ Sbjct: 298 SASTPFNSTVSVMPTPWNTHTRLMHVAIKGFDVKPTEQPKA-NLVFLIDVSGSMDEP--- 353 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKIN 245 DKL + + R ++ +K+ ++ +VT++ Q I I+ Sbjct: 354 --DKLPLLKSAFRLLVSKLKADDTIS------IVTYAGDAGTVLEPTKASQKDKILSAID 405 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 L G +T G++ AY +L + D + + TDG+ + D+ + Sbjct: 406 NLTPGGSTAGEAGIKEAY--------RLAQKSFVKDGVNRVM-LATDGDFNVGQSDDDDL 456 Query: 306 LFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 + ++ G + G DQ ++ A Sbjct: 457 KRLIEKERKTGVFLSVFGFGRGNLNDQMMQTIAQNGN 493 >gi|301626998|ref|XP_002942667.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Xenopus (Silurana) tropicalis] Length = 4207 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 64/172 (37%), Gaps = 34/172 (19%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + + LD++ ++D S S+ +++L R ++++L +P R ++T Sbjct: 592 REKSLSLDLVFLVDESSSVG--HSNFVNEL----RFVKKLLSDFPVVPSA---TRVAIIT 642 Query: 223 FSSKIVQTFPLAWGVQH---------IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEK 272 FSSK + + + +I + + G T + + A + ++ Sbjct: 643 FSSKTNVQTRVDYISSSEPHQHKCSLLNREIPAITYKGGGTFTKGAFQQAAQILRYSR-- 700 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + K I +TDG ++ + + G ++ +G+ Sbjct: 701 --------SNSTKVIFLITDGYSNGG-----DPRPIAANLRDLGVEIFTVGI 739 >gi|326927692|ref|XP_003210025.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Meleagris gallopavo] Length = 1108 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 258 DMVIIVDVSGSVSGL------TLKLMKTSVHEMLDTLSDDDYVN------VASFNEKAKP 305 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 306 VSCFKHLVQANIRNKKVFKEDVQGMVAKGTTDYKAGFEYAFDQLQNSNITRANCN----- 360 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F + V+ V D L+ CA+ Sbjct: 361 --KMIMMFTDG-----GEDRVQDVFEKYNWPNKTVRVFTFSVGQHNYDVTPLQWMACANK 413 Query: 340 DRFYSV 345 ++ + Sbjct: 414 GYYFEI 419 >gi|281209350|gb|EFA83518.1| hypothetical protein PPL_02583 [Polysphondylium pallidum PN500] Length = 461 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 43/199 (21%), Positives = 73/199 (36%), Gaps = 19/199 (9%) Query: 172 MMVLDVSLSM-----NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 M+VLD+S SM PG ++ + +I + GLV F + Sbjct: 152 MIVLDLSGSMRSAAFKGSLTPGELEMKRIEIAQALFQTMIDKYVQLEIAAIVGLVCFGER 211 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGH--DDY 283 I TFP ++ ++ + T+ ++ A I +E +A G Sbjct: 212 IEVTFPPTRNFDSFSTELGEVVANQSKTRLYEAIKLAGETIVKYRENPTSLADGFVLAPS 271 Query: 284 KKYI--IF-LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNCAS 338 K I +F LTDG+++S N K I+ AI + F K A+ Sbjct: 272 DKLICRVFALTDGQDNS----NACPYEVYKYLKSANIILDAIPIGEGGNTLGSFTK--AT 325 Query: 339 PDRFYSVQNSRKLHDAFLR 357 ++ +S+ + F R Sbjct: 326 GGSCFTFNSSKAGVELFER 344 >gi|1705853|sp|P54290|CA2D1_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|203955|gb|AAA41088.1| dihydropyridine-sesitive L-type calcium channel alpha-2 subunit [Rattus norvegicus] Length = 1091 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 252 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 299 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 300 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFTFAFEQLLNYNVSRANCN----- 354 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 355 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 406 Query: 340 DRFYSV 345 +Y + Sbjct: 407 GYYYEI 412 >gi|198284406|ref|YP_002220727.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198248927|gb|ACH84520.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] Length = 596 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 84/259 (32%), Gaps = 12/259 (4%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G I+I+ AI++P++ + + I+ H +V+ L I D + + A + N ++ Sbjct: 20 RGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQSLA 79 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 G KN + N E G + + + + Sbjct: 80 TGNAVKNGLQTSSTQITVTPGNWNPQIETGPSYFSAAVPYGHQAN----------AVQVQ 129 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 +P+ F P + A + + S S L++ L+ G+ Sbjct: 130 LSESVPYFFFFGPAKTVQAQAIAWVPNPAAGFSLSSTLLNVSEQQSALLNSLLGGLLGIH 189 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 L + + +L+ S+ + + G V ++ + L + + G Sbjct: 190 NLNLGVAAFNGLLNTSVSLGQLAQSIGVGTV--NNLLDANLTLPGLFTGALKAVGNQAAG 247 Query: 251 STTKSTPGLEYAYNKIFDA 269 ST A + + Sbjct: 248 GGLLSTQSATGALQTLVGS 266 >gi|21114118|gb|AAM42187.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572853|gb|AAY48263.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 618 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 44/238 (18%), Positives = 86/238 (36%), Gaps = 34/238 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW ++ + + ++ + + +++ ++DVS SM DKL + S++ Sbjct: 223 TPWNTDTLLLRIGVAGR-EVPTAALPAANLVFLVDVSGSMG-----APDKLPLLQSSLKL 276 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGL 259 + + + R LVT++ P G Q I E I+ L G T G+ Sbjct: 277 L------VRQLRKQDRITLVTYAGSTAVVLPPTSGAQQTRIVEAIDSLQSGGGTAGASGI 330 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E AY A +G + I+ TDG+ + D + E +R G + Sbjct: 331 ELAYKAAQQA------YLRGGINR---ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVAL 381 Query: 320 YAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 +G D ++ A + + ++ +L I +++ Q Sbjct: 382 STLGFGTGNYNDTLMEQLADAGDGAYAYIDSALEARKVLTHELGSTLATIARDVKIQV 439 >gi|326426687|gb|EGD72257.1| hypothetical protein PTSG_00277 [Salpingoeca sp. ATCC 50818] Length = 2847 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 65/196 (33%), Gaps = 22/196 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+ ++D S S+ + ++ D + + P N VR + +S+ Sbjct: 822 GRAIDVFYLIDGSGSIVSS------DFERERTFLSDLTDALFAFPG--NDVRISIAEYST 873 Query: 226 -KIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P Q I N + G T + L A + I Sbjct: 874 TYTQVLVPYTADETTAQNTISNVVQSGGATATGTALGLAADDIGANARPDA--------- 924 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDRF 342 + ++ LTDG S D + ++ G + AIG+ A + + L+ P R Sbjct: 925 ARVLVLLTDGATSDG--DQQNIDPSVSDLNSIGVSITAIGIGDNADETELLQIAGDPTRV 982 Query: 343 YSVQNSRKLHDAFLRI 358 ++ L D I Sbjct: 983 FNNIAFVDLGDFIDEI 998 >gi|301618735|ref|XP_002938765.1| PREDICTED: hypothetical protein LOC100488728 [Xenopus (Silurana) tropicalis] Length = 672 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/166 (20%), Positives = 68/166 (40%), Gaps = 18/166 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--K 226 LD+M ++D S S H + + + +K + ++ FSS + Sbjct: 55 LDVMFIVDGSESTKGHLFKQQKDFVLN---FTDQISHLKLAKPWKTKTKMAIIQFSSSVR 111 Query: 227 IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 I Q+F GV++ + +N + G T + + A N H + G+ K Sbjct: 112 IEQSFNEWTGVENFKRIVNSMTYIGQGTYTYYAIMNATNIF------KAHKSAGNV---K 162 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 I +TDG + + D +++ + A+ G +IG+ + A++ Sbjct: 163 VAILMTDGIDHPKSPDARQAS---DFARAAGINFISIGLSTQKANK 205 >gi|301609920|ref|XP_002934508.1| PREDICTED: vitrin-like [Xenopus (Silurana) tropicalis] Length = 779 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 70/209 (33%), Gaps = 37/209 (17%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D+ V+D S S+ G + I + + + R G V ++ Sbjct: 594 AADIGFVIDGSSSV------GTGNFRTVLQFIANITNEFEISD---TDTRIGAVQYT--Y 642 Query: 228 VQTFPLAWGVQHIQEKI-NRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 Q + ++ + N ++ T + + YA ++F K + Sbjct: 643 EQRLEFGFDKYSTKQDVMNAIMRIGYWSGGTSTGAAITYASEQLFS---------KSKPN 693 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + R G I YA+G+ A D+ P D Sbjct: 694 KRKILIVITDGRSYD------DVRAPAAAVHRNGVIAYAVGIAWAAQDELESIATDPDKD 747 Query: 341 RFYSVQNSRKLH----DAFLRIGKEMVKQ 365 + V++ L+ F I E Q Sbjct: 748 HSFFVEDFDSLYKFVGKIFQNICTEYNSQ 776 >gi|188586636|ref|YP_001918181.1| von Willebrand factor type A [Natranaerobius thermophilus JW/NM-WN-LF] gi|179351323|gb|ACB85593.1| von Willebrand factor type A [Natranaerobius thermophilus JW/NM-WN-LF] Length = 599 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 68/193 (35%), Gaps = 31/193 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K +++ ++D S SM ++ +L + + ++TF Sbjct: 417 KKQTSMNVCFLVDASGSMGG------RRMQEVKFFAEHVL--------LKGRDKIAILTF 462 Query: 224 -SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P ++ +N++ T + G+E A + + ++ Sbjct: 463 REDNVNVEIPFTRNWDKLRSGLNKIKAFGLTPMSKGIEMARKYLESEVGQQKNT------ 516 Query: 283 YKKYIIFLTDGENS--SPNIDN-KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--A 337 +++ +TDG + D KE+L + + IG+ E +FLK A Sbjct: 517 ---FLVLITDGLPTISDGGEDPFKETLKAAQKLSQTSIKFVCIGL--EPNVKFLKKLAQA 571 Query: 338 SPDRFYSVQNSRK 350 S Y V+ +K Sbjct: 572 SQASLYIVEELQK 584 >gi|116201805|ref|XP_001226714.1| hypothetical protein CHGG_08787 [Chaetomium globosum CBS 148.51] gi|88177305|gb|EAQ84773.1| hypothetical protein CHGG_08787 [Chaetomium globosum CBS 148.51] Length = 777 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 68/194 (35%), Gaps = 16/194 (8%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM-LDIIK-----SIPDVNNVVRSGLVTF 223 D+++ +D+S SM D G A +D++K + +++ R G+VTF Sbjct: 74 DLVLSIDISGSMADEAPAPSKPGGEAGEDTGLRVIDLVKHAARTIVATLDSRDRLGIVTF 133 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +++ P E I + S+T G+ +F E Sbjct: 134 TNRSKVGIPPYENKAKTLENIESMEPFSSTNMWHGIRDGL-SLFSEAEGGSTGRVPA--- 189 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 ++ LTDG + K + + A ++ G E LK+ A Sbjct: 190 ---LLVLTDGMPNYM-CPPKGYVPMLRSMEPLPATIHTFGFGYELRSGLLKSIAEVGGGN 245 Query: 342 FYSVQNSRKLHDAF 355 + + ++ L F Sbjct: 246 YSFIPDAGMLGTVF 259 >gi|315615538|gb|EFU96170.1| von Willebrand factor type A domain protein [Escherichia coli 3431] Length = 575 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 50/342 (14%), Positives = 103/342 (30%), Gaps = 45/342 (13%) Query: 33 GLVIETSHKFFVKAKLHYILDHS-LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTD 91 L + ++ K L L + A K N G + F +K + Q Sbjct: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 Query: 92 FRNELRENGFAQDINNIE----------RSTSLSIIID--------DQHKDYNLSAVSRY 133 + N + + I++ + S + Sbjct: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 M + PW + + I + S+ +++ ++D S SM ++L Sbjct: 181 AMRYELAPAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLP 234 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGS 251 + S++ ++ ++ + +VT++ P G I I+ L Sbjct: 235 LIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEG 288 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T GLE AY + KG + I+ TDG+ + D K + Sbjct: 289 STNGGAGLELAYQQATKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKK 339 Query: 312 AKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRK 350 + G + GV + + + A + + + Sbjct: 340 QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 >gi|288573236|ref|ZP_06391593.1| von Willebrand factor type A [Dethiosulfovibrio peptidovorans DSM 11002] gi|288568977|gb|EFC90534.1| von Willebrand factor type A [Dethiosulfovibrio peptidovorans DSM 11002] Length = 225 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 68/168 (40%), Gaps = 15/168 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + + +VLDVS SM G +++L R + +K +++FS+++ Sbjct: 20 RVPVSLVLDVSGSM---LGAPIEELN---RGVELFFKSLKDDDVARYSAEVSVISFSNEV 73 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q + +G + I L T+ + A + KE + G D Y+ ++ Sbjct: 74 TQ--EVDFGPLE-KCDIPELKAIGKTRMGGAVSLALESLEKRKEL--YRTLGVDYYQPWM 128 Query: 288 IFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + +TDG+ N + ++ ++ K V+ I + A LK Sbjct: 129 VIMTDGKPNDDWQLAAAKTSALVDKGK---LTVFPIAIGDNACTDTLK 173 >gi|301609308|ref|XP_002934204.1| PREDICTED: epithelial chloride channel protein [Xenopus (Silurana) tropicalis] Length = 906 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 74/204 (36%), Gaps = 38/204 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD+S SM + + +L A+ + + V G+VTFS+ Sbjct: 309 VSLVLDISGSMTN--ANRITRLYQASE--------VYIMQIVEQGAYVGIVTFSNVAEIK 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + ++ K+ + T G++ + + Sbjct: 359 SQLVKITDTFQRESLKLKL-PTVATGGTNICAGVQQGLQVNRNLDQSTHGTE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGV--QAEAADQFLKNCASPDRF 342 I+ LTDGE+S + C + + GAI++ I + A+ + L + + Sbjct: 410 -IVLLTDGEDS--------GISSCFPDITKSGAIIHTIALGNNADPGLEKLADLTGGLKL 460 Query: 343 YSVQ--NSRKLHDAFLRIGKEMVK 364 Y+ ++ L D+F I Sbjct: 461 YASDKVDANGLIDSFSGIVSNTGN 484 >gi|296118874|ref|ZP_06837447.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] gi|295967972|gb|EFG81224.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] Length = 674 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 67/195 (34%), Gaps = 32/195 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS------- 224 +++D S SM G +L A ++ ++++ + + + G + Sbjct: 59 ALIMDASDSMLAEDVDGGTRLDAAKQAANQLVNSLPETAVMGMLAY-GASGSNAPDNRER 117 Query: 225 --SKIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 I P+ + ++ +I L T L A +++ ++ Sbjct: 118 GCQDIDVLAPVERIDNEELKSEIGALEAQGYTPMGNALRAAADELGSEGDRS-------- 169 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG--AIVYAIGVQAEAADQFLKNC--- 336 II ++DG ++ + E G ++ +G + + A Q C Sbjct: 170 -----IILVSDGIDTCA---PPPACEVAEELAGDGFDLAIHTVGFKPDEAAQAELECISE 221 Query: 337 ASPDRFYSVQNSRKL 351 AS + +N+ +L Sbjct: 222 ASGGTYVEAENAEEL 236 >gi|297669805|ref|XP_002813079.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 2 [Pongo abelii] Length = 2976 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 53/315 (16%), Positives = 106/315 (33%), Gaps = 42/315 (13%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 312 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQSAQELKRS 369 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F ++ + ++ + Sbjct: 370 SIMAFAIGNKGADQAELKEIAFDSSLVFIPAEFRAAPLQGMLPGFLAPLRTLSGTPEVHA 429 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 430 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDVGNDNIRVGLVQF 477 Query: 224 SSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHI 276 S V F L + I L GS + L Y + N +A H Sbjct: 478 SDTPVTEFSL--NTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVHANHFTEAGGSRIH- 534 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + ++ LT G++ L N R G + + +G + + Sbjct: 535 ----EHVPQLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIA 584 Query: 337 ASPDRFYSVQNSRKL 351 +P Y + + L Sbjct: 585 FNPSLVYLMDDFSSL 599 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 48/297 (16%), Positives = 105/297 (35%), Gaps = 27/297 (9%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1325 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 1384 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 1385 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1441 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1442 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1494 Query: 239 HIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 I + IN++++ + GLE+ + E ++ + +T G++ Sbjct: 1495 QIIDAINKVVYKGGRHANTRVGLEH----LRVNHFVPEAGSRLDQRVPQIAFVITGGKSV 1550 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1551 EDAQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 59 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 118 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 119 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 168 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 169 FSFGLGAQAASRAELQHIATDDN 191 >gi|134300085|ref|YP_001113581.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] gi|134052785|gb|ABO50756.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] Length = 416 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 72/218 (33%), Gaps = 22/218 (10%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 P ++ + +T+ ++ K ++ V+D S SM G +D A Sbjct: 15 LLPGNKQVAYLMVKLTAPKQVE-KERPVQNLSFVIDRSGSMA---GEKLDYTKKAVAFAV 70 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 L V +V S ++ ++ + + G +T + G+ Sbjct: 71 GHLSPQDYCSVVAFDDMVTMVASSHQVA-------NKDALKMAVESIYPGGSTNLSGGML 123 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 ++ A ++ + ++ LTDG + D+ + E G + Sbjct: 124 LGVREVKLAHKENQINR---------VLLLTDGMANVGVTDHSALVEKSREMAAGGVNLS 174 Query: 321 AIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFL 356 G+ + + L+ A FY ++ ++ F Sbjct: 175 TFGLGEDFEEDLLQAMVEAGGGNFYYIEKPDQIPGIFE 212 >gi|226290246|gb|EEH45730.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18] Length = 757 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 76/201 (37%), Gaps = 23/201 (11%) Query: 170 DMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTF 223 D+++ +DVS SM P ++ G + +LD+ K I +N R G+VTF Sbjct: 75 DIVLCIDVSGSMQLSAPLPTTNESGKREETGLSVLDLTKHAARTIIETLNENDRLGVVTF 134 Query: 224 SSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S+ + ++ + E + L ++T GL+ N + + + ++ Sbjct: 135 SNDAEVAYKISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGLNVLGEVDLRPRNV--- 191 Query: 280 HDDYKKYIIFLTDGENSS---PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + LTDG+ + + K R +++ G + L++ Sbjct: 192 -----QALYVLTDGQPNHMCPTQGYVPKLRPILERQKDRLPLIHTFGFGYDIRSGLLQSI 246 Query: 337 A--SPDRFYSVQNSRKLHDAF 355 A + + ++ + F Sbjct: 247 AEVGGGTYSFIPDAGMIGTVF 267 >gi|149414665|ref|XP_001516049.1| PREDICTED: similar to integrin alpha 11 subunit [Ornithorhynchus anatinus] Length = 1194 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 80/215 (37%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 166 QTYMDIIIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 214 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ AY F E + + Sbjct: 215 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTET----RTAYGIEFARSEAFQKGGRKGA- 268 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + + +++ YA+ V + FL Sbjct: 269 -KKVMIVITDGE----SHDSPDLEQVIDASEKDNITRYAVAVLGYYNRRGINPEAFLSEI 323 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 324 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 358 >gi|149046637|gb|EDL99462.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Rattus norvegicus] Length = 939 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|284052943|ref|ZP_06383153.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569121|dbj|BAI91393.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 463 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 66/210 (31%), Gaps = 39/210 (18%) Query: 163 SKSDIGLDMMMVLDVSLSM------------------NDHFGPGMDKLGVATRSIREMLD 204 S S V+D S SM + + + E L+ Sbjct: 35 SASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLE 94 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEY 261 + S ++ R LV F PL ++ I +L F T+ G+E Sbjct: 95 RLVSSGQADSRDRIALVRFDDSASVLLPLTASTDTASLKNAIGQLRNFSGGTRMALGMEE 154 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A N + K D + + TDG+ D + + G + A Sbjct: 155 ALNIL-----------KNCDLSSRRTLIFTDGQ----TFDESDCRDLATQFAEAGIPITA 199 Query: 322 IGVQAEAADQFLKNCA--SPDRFYSVQNSR 349 +GV E + L + + R ++V ++ Sbjct: 200 LGVG-EYNEDLLLYLSDRTGGRVFNVVETQ 228 >gi|268325023|emb|CBH38611.1| hypothetical protein, containing PKD domain [uncultured archaeon] Length = 1152 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 76/235 (32%), Gaps = 26/235 (11%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 N P +I + + LD++ V+D + SM D + + + +I ++ I Sbjct: 287 NVVADPWSDGKCTQIFYGTGLSLDLIFVIDTTGSMGDD----IANVKASASTIVNEIEAI 342 Query: 207 KSIPDVNNVV---RS---------GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 IPD V R G F+ + P + I I L G Sbjct: 343 --IPDYQVAVVDYRDFPVDPYGGDGDYPFNDVL----PFSTDKAAIISAIQGLTLGWGGD 396 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + A DA +G D K II + D P L A Sbjct: 397 WEESVYSALMHSIDAGSLGG--WRGEDQALKAIILMGDAPPHDPEPFTGYILTSVAIAAE 454 Query: 315 RG--AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +Y I + + ++ +N+ ++ DA L +E+ K+ I Sbjct: 455 LADPVHIYTIQIGGPVGKFAELASQTGGEVFTAENAEEVVDAILEAIEEITKRPI 509 >gi|237667682|ref|ZP_04527666.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] gi|237656030|gb|EEP53586.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] Length = 1336 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 58/153 (37%), Gaps = 20/153 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGP---GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 ++ D++++LD S SMN +F ++ S+++ + + + +R G+ Sbjct: 501 NEEVKKDIVLILDTSGSMNFNFYNDSIPYNEKDKRIYSLKQSAKQFINKFNNKDNIRIGI 560 Query: 221 V---TFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + +S L + + I+ + T G+ A + + Sbjct: 561 IPYSYYSGYANNIKQLTEINDNNKKSYENYIDNIKVEGATNQGDGIREAGKMLLNTDGNS 620 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 KKY+I +TDGE ++ I+ + Sbjct: 621 ----------KKYVILITDGEATAITIEKPNLI 643 >gi|118349478|ref|XP_001008020.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289787|gb|EAR87775.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 642 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 75/201 (37%), Gaps = 27/201 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 S +D++ V++ S SM+ K+ ++ +L+++ N+ R LV Sbjct: 193 NSRPSIDLVCVINNSESMHGE------KILNVKNTLLYLLEML------NSNDRLSLVLS 240 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ F L + Q ++ IN + T T + A+N + + Sbjct: 241 NNNPTTLFDLKYLDEKNKQDLKRIINNISITQNTNITKSMIKAFNIL---------QFRQ 291 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 + I L+DG +SS + + + + +++ G + + + S Sbjct: 292 SQNKVSSIFLLSDGVDSSAEKQIQNYISSQQSLQNKNFAIHSFGYGFDQDAEMINKICSL 351 Query: 339 -PDRFYSVQNSRKLHDAFLRI 358 FY +QN ++ F + Sbjct: 352 KNGNFYYIQNMNQVDQYFADV 372 >gi|256374530|ref|YP_003098190.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918833|gb|ACU34344.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 550 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 60/201 (29%), Gaps = 31/201 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D++ LD + SM + + R GLVTF Sbjct: 49 GPMDVVFALDDTGSMGGALNNIKTSINAVVG---------DVVSASGGDYRLGLVTFKDS 99 Query: 227 IVQTFPLAWGVQ-HIQEKI-NRLIFGSTT----KSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I LA G + + N L S L A + A ++ Sbjct: 100 INVVTGLAAGNAGTVTGYVTNVLAASGGGGEPEASDEALRTAV-SLRPAAGIPQNADFTG 158 Query: 281 ---DDYKKYIIFLTDG-----ENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-----QAE 327 + +K+++ +TD +++ D + N A G + A+ V Sbjct: 159 PWRSNARKFVVLVTDARPGGFDDAFTAADQASATAVANSALAAGVKLSAVYVPTSPSMTP 218 Query: 328 AADQFLKNCA--SPDRFYSVQ 346 ++N A + + Q Sbjct: 219 TIAPIMQNYATTTSGVYVQAQ 239 >gi|304314705|ref|YP_003849852.1| cobaltochelatase subunit-like protein [Methanothermobacter marburgensis str. Marburg] gi|302588164|gb|ADL58539.1| predicted cobaltochelatase subunit-like protein [Methanothermobacter marburgensis str. Marburg] Length = 663 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 44/316 (13%), Positives = 94/316 (29%), Gaps = 37/316 (11%) Query: 50 YILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIE 109 D +A+ + G++ + IK + + + + R G + I+ Sbjct: 370 SSGDSGASPSASSLGALAADVEGREPETQDMDVDIKRLLRIRGKKKERLYGSRVESKTIK 429 Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 + + + T A+ + + K Sbjct: 430 GRY-----VKSRFPRGSGDVAVD-------ATLRAAASRGELKIEPGDIREKIRKHGARA 477 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IV 228 +++V+D+S SM K I ++ + D R +V F + Sbjct: 478 SIVLVVDISGSMFSE-----KKAARVKGLIERFIEDAQRHKD-----RISVVGFRGRDAK 527 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P ++ ++ + G TT G++ + K H +Y +++ Sbjct: 528 VIIPSTARASSFRDTVDSIRVGGTTPMAQGIKRGLEIL--------REEKRHSEYVPFMV 579 Query: 289 FLTDGENSSPNIDN--KESLFYCNEAKRRGAIVYAIGV-QAEAADQFLK---NCASPDRF 342 L+DG + N +E++ + I Q + L AS + Sbjct: 580 ILSDGMPNVGVERNPKREAVEAAARLREEDIPSAVINFEQGSRGGRDLNMEIALASGGSY 639 Query: 343 YSVQNSRKLHDAFLRI 358 Y + + A RI Sbjct: 640 YDLHDLEDPSMAVPRI 655 >gi|149728587|ref|XP_001492576.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H1 [Equus caballus] Length = 908 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 50/321 (15%), Positives = 108/321 (33%), Gaps = 29/321 (9%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 +K K ++DH + I + + Q + ++ I + F + F Sbjct: 176 LVIKVKPKQLVDHFEID--VDIFEPQGISKLDAQASFLPKQLATQIIKKSFSGKKGHVLF 233 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 + ++ + + + D+ ++ + P AP +T Sbjct: 234 RPTVG--QQQSCPTCSTSLLNGDFRVTYDVNRDKPCDLLVTNNHFAHFFAPQNLT----- 286 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ +++ V+D+S SM K+ ++ ++L ++ D ++V G Sbjct: 287 ----NLNKNLVFVIDISTSMQGQ------KVQQTKEALLKILGDMRP-GDYFDLVLFGSG 335 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S K A +Q ++ + R +T GL + A+ L ++ Sbjct: 336 VQSWKGSLVPASAANLQAARDFVQRFTLEGSTNLNGGLLQGIEILNKAQGSLPEVSNHAS 395 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +I LTDGE + D + L A R +Y +G FL+ + + Sbjct: 396 ----ILIMLTDGEPTEGVTDRSQILKNVRNAIRGKFPLYNLGFGQNVDFNFLEVMSMENN 451 Query: 342 -----FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 452 GRAQRIYEDHDATQQLQGFYD 472 >gi|149375210|ref|ZP_01892982.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] gi|149360574|gb|EDM49026.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] Length = 658 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 46/238 (19%), Positives = 85/238 (35%), Gaps = 38/238 (15%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 P + + + D+ +++D+S SM + + + A R + ML Sbjct: 7 LFVVLLALPTTALAQQAPTLQLPDSADVRIIVDISGSMKTNDPNNLRR--PAVRLLARML 64 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFP---LAWGVQHI----QEKINRLIFGSTTKST 256 P N G+ TF + P + + + ++IN + T Sbjct: 65 ------PGQANA---GVWTFGQYVNMLVPHGKVTDDWRGLAVERSDEINSVAL--RTNLG 113 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-----ENSSPNIDNKESLFYC-- 309 ++ A + + L++ I LTDG +N S N +E + Sbjct: 114 EAIQVASDDYLLGSDSLDNTD---------FILLTDGKVDISDNESANDRERERILGALL 164 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +E RGA ++ + + EA LK+ A + R+ ++ L AFL V Q Sbjct: 165 DELSSRGATLHTVALSEEADLALLKSLAERTGGRYALASSADALTLAFLEALNTAVPQ 222 >gi|170741048|ref|YP_001769703.1| hypothetical protein M446_2844 [Methylobacterium sp. 4-46] gi|168195322|gb|ACA17269.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 58/414 (14%), Positives = 127/414 (30%), Gaps = 71/414 (17%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 GS+ + +LL + + MG+ ++ + L A + Sbjct: 24 RSGSVGFVFGLLLLPMMVAMGVSVDYARV--------SAARSDLAAAADAAVLSVTNKAA 75 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNI----ERSTSLSIIIDDQHKDYNL- 127 + +++ + + + +G + D I R+ +LS + + Sbjct: 76 MSLDMLSAQARVRDAFLKNIQTMPDISGVSADAVVIDLLGVRAATLSYTASYRTAFSGIL 135 Query: 128 ---------SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL-DV 177 +A S+ +P + NS + TS+ + S D+ Sbjct: 136 GMRTLSVSGNAASKSAVPIYMDFYLLLDNSPSMGVGATSADISTMVSRTPDKCAFACHDL 195 Query: 178 SLSMNDHFGPGMD-----KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-----SSKI 227 S +D++ ++ V ++ + ++D + V R+ L T S + Sbjct: 196 SAGNSDYYHLAKSLGVTMRIDVVRQATQRLMDTAANTALVPGQFRTALYTMGADCASVGL 255 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY-------AYNKIFDAKEKLEHIAKGH 280 PL+ + + N T PG + + K + Sbjct: 256 TTVSPLSSDLAAAKT--NAQAIDLMTIQKPGYNNDQCTDFDGVFQSLNGKIDVAGDGSTA 313 Query: 281 DDYKKYIIFLTDG------------ENSSPNIDNKESLFYCNEAKRRGAIV---YAIGVQ 325 +K + ++DG + + +L C K RG + Y + Sbjct: 314 LTPQKVVFLVSDGVADAYYPSTCTRKTTGGRCQEPLTLANCTTLKNRGIKIAVLYTTYLP 373 Query: 326 AEAADQF--------------LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 D + ++ CASP ++ V ++ + DA + + V Q Sbjct: 374 LPTNDWYNTWIAPFQATLPSAMQGCASPGLYFEVSPTQGIADAMTTLFQRTVSQ 427 >gi|45384200|ref|NP_990400.1| integrin alpha-1 [Gallus gallus] gi|2582830|dbj|BAA23160.1| alpha1 integrin [Gallus gallus] Length = 1171 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 47/287 (16%), Positives = 101/287 (35%), Gaps = 47/287 (16%) Query: 107 NIERSTSL-SIIIDDQHKDYNLSAVSRYEMPFIFC-------------TFPWCANSSHAP 152 N+ +TS+ +++ ++ + V+ + F+ C T C+N S Sbjct: 85 NLPDATSVPNVMEVKENMTLGTTLVTNPKGGFLACGPLYAYKCGRLHYTTGVCSNVSSTF 144 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + + LD+++VLD S S + T + +L + P Sbjct: 145 ETVKAVAPSVQECKTQLDIVIVLDGSNS--------IYPWESVTAFLNSLLRNMDIGPQQ 196 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 G+V + +V F L + + + R+ T++ L + + Sbjct: 197 TQ---VGIVQYGQTVVHEFYLNTYSTTEEVMDAALRIRQRGGTQTMTAL--GIDTAREEA 251 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------ 324 H A+ +K ++ +TDGE + DN ++ + +AI + Sbjct: 252 FTEAHGARRG--VQKVMVIVTDGE----SHDNYRLQEVIDKCEDENIQRFAIAILGSYSR 305 Query: 325 ---QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 E + +K+ AS F++V + L +G+ + Sbjct: 306 GNLSTEKFVEEIKSIASKPTEKHFFNVSDELALVTIVEALGERIFAL 352 >gi|225012027|ref|ZP_03702464.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003582|gb|EEG41555.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 346 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 23/205 (11%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 +M + P + T + + G+D++ +DVS SM Sbjct: 52 KFKPGLKMIVLSIAIALLVLGLMNPKIGTQLETVKRE---GVDIVFAIDVSKSMLAEDIA 108 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 ++L + R + +L+ + S R G++ ++++ V P+ + + L Sbjct: 109 P-NRLEKSKRLVSAILNQLAS-------DRVGIIAYAAQAVPQLPITTDYSAAKMFLQAL 160 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + L+ A + D + I ++DGE+ S + N S Sbjct: 161 NTEMLSSQGTALDSAID-------LSGTFFDDEDQTNRVIFLISDGEDHSEDASNAASRA 213 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQF 332 G ++ GV EA Sbjct: 214 A-----AMGIKIFTFGVGTEAGAPI 233 >gi|332665830|ref|YP_004448618.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332334644|gb|AEE51745.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 630 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 79/211 (37%), Gaps = 22/211 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDK 191 + + PW H + I K + ++ +++ ++DVS SM +K Sbjct: 225 FSITTEVSDCPWQP--KHRLVHIGLQGKHTPVENLPAANLVFLVDVSGSM-----SAANK 277 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S + + + ++ V VV +G +V I+E I++L G Sbjct: 278 LPLVQASYKLLAEQLRPQDRVAIVVYAGAAG----LVLESTTGNNKTKIKEAIDKLQAGG 333 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY E+ KG ++ +I +DG+ + + E + E Sbjct: 334 STAGGEGILLAYKT------AKENFIKGGNNR---VILASDGDFNVGVSSDGELVRIIEE 384 Query: 312 AKRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 ++ G + +G D ++ A Sbjct: 385 ERKSGVYLTILGYGMGNYKDNKMQKLADSGN 415 >gi|198435715|ref|XP_002125840.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 3908 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 76/209 (36%), Gaps = 32/209 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + A ++ +++ + D + R +V +S Sbjct: 3425 DVIFIVDGSWSVGEI------NFRKAKDFLKALVEPFEVGWDNS---RFAVVQYSDDPRT 3475 Query: 230 TFPLAWGVQHIQEKINRLIF----GSTTKSTPGLE-YAYNKIFDAKEKLEHIAKGHDDYK 284 F L + + +N + G T + L Y + A ++ Sbjct: 3476 EF-LMNEHFTVTDVLNAIDAIPYKGGNTNTGKALAFSLYTALSPANGARPYVN------- 3527 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DR 341 K + LTDG + + E ++ G V +GV +A LK+ ASP Sbjct: 3528 KVALVLTDGRSQDEVGNPAR------ELRQAGVKVLTVGVG-DADKNELKSIASPPYDSS 3580 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 Y V + + + + ++ + +L + Sbjct: 3581 VYHVSDYDSISEIKAHLAAKLCEGEVLRD 3609 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K D++++ D S GP + ++D N V G + Sbjct: 394 KKAQKTDLVVLTDGS----WSVGPQN--FKKIQAFLVSLVDAFSI---GFNNVLMGYAQY 444 Query: 224 SSKIVQTFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F L V + IN++ + G T + L+Y +F ++ Sbjct: 445 SDDARTEFNLNEHVTKDDLIRAINQVQYKGGNTATGGALDYIRTNLFTSEGGTRRGVL-- 502 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP- 339 K I +TDGE+ + K G V++IGV A + +SP Sbjct: 503 ----KTAIVITDGES-----ILDDVTEPARMLKEIGVEVFSIGVAAALRSELEDIASSPA 553 Query: 340 -DRFYSVQNSRKL 351 D +SV N + Sbjct: 554 SDHVFSVDNFDDI 566 >gi|163857470|ref|YP_001631768.1| putative lipoprotein [Bordetella petrii DSM 12804] gi|163261198|emb|CAP43500.1| putative lipoprotein [Bordetella petrii] Length = 582 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 84/237 (35%), Gaps = 34/237 (14%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + I +++ + +++ ++D S SM + DKL + +++++ Sbjct: 195 PWNPQRQLLKIGIQG-YRVAPQDIPAANLVFLVDTSGSMAER-----DKLPLIKGALKQL 248 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 + + R +VT++ + I I+ L +T GL+ Sbjct: 249 ------VAQLRPQDRVAIVTYAGQASMTLDSTPGDQKARINAAIDELRAAGSTNGGAGLD 302 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + + KG + I+ +DG+ + D ++ ++ G + Sbjct: 303 LAY------AQAAKGFVKGGVNR---ILLASDGDFNVGATDLEDLKDKIARQRQGGIALT 353 Query: 321 AIGVQAEA-ADQFLKNCASPDR--FYSVQN--------SRKLHDAFLRIGKEMVKQR 366 +GV D A ++ + + + ++ L I +++ Q Sbjct: 354 TLGVGGGNFNDALAMQLADAGNGSYHYLDSLREARKVLAAQMSSTLLTIARDVKIQV 410 >gi|156383825|ref|XP_001633033.1| predicted protein [Nematostella vectensis] gi|156220097|gb|EDO40970.1| predicted protein [Nematostella vectensis] Length = 204 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 73/203 (35%), Gaps = 26/203 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + ++++ ++D S S+ND + +++ + V F++ Sbjct: 20 LKMNLVFLIDNSGSIND------TEFDNFKEFAKKLAESFTISA---TYTHVAAVYFNTL 70 Query: 227 IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + + I+ I+ L G T L Y + +F + K Sbjct: 71 ANFGFNLKYDINVIKTAIDNLPNIGGGTHIGKALTYTLDNVF--------KVAPRQNVKN 122 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASP---DR 341 ++ LTDG++ ++ G V+A+GV A + L AS D Sbjct: 123 VLVVLTDGKSHDSVTLPAAAVRNYGP----GVEVFAVGVGAGDSFVAQLNVIASDPDEDH 178 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 + V++ ++ + E+ K Sbjct: 179 VFHVEHFSQIESTTGAVEDEICK 201 >gi|50950211|ref|NP_001002980.1| collagen alpha-1(VII) chain [Canis lupus familiaris] gi|33149359|gb|AAO64414.1| type VII collagen [Canis lupus familiaris] Length = 2936 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 78/224 (34%), Gaps = 27/224 (12%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 +P + + + +++ D++ +LD S S+ Sbjct: 2 RLPLLVAALCAGILAGALRVRAQQRERVTCTRLYAADIVFLLDGSSSIGR------GNFR 55 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-G 250 + ++ VR V +S F L A G + I L + G Sbjct: 56 EVRGFLEGLVWPFSGAASA-QGVRFAAVQYSDDPRTEFGLGALGSGGDVIRAIRELSYKG 114 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T++ + + + +F L +A+ K I +TDG++ + L Sbjct: 115 GNTRTGAAILHVADHVF-----LPQLARPGVP--KVCILITDGKSQDLVDTAAQRL---- 163 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFYSVQNSRKL 351 K +G ++A+G++ A + LK AS D F+ V + L Sbjct: 164 --KGQGVKLFAVGIK-NADPEELKRVASQPTSDFFFFVNDFSIL 204 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 68/181 (37%), Gaps = 24/181 (13%) Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLI 248 + A R++ ++ + P V+ GL+++S + FPL ++ I +KI+ + Sbjct: 1068 RAEAAKRALERLVSALG--PLGPQAVQVGLLSYSHRPSPLFPLNSSYNPDVILQKIHSIP 1125 Query: 249 F--GSTTKSTPGLEYAYNKIF--DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 + S + A+ + DA + +HI ++ L D E +I Sbjct: 1126 YVDPSGNNLGTAVVTAHRHLLAPDAPGRRQHIPG-------IMVLLVD-EPLRGDI---- 1173 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD---RFYSVQNSRKLHDAFLRIGKE 361 EA+ G V +G +Q + D F++V + L A + Sbjct: 1174 -FNPIREAQAAGLKVMILGQAGADPEQLRRLVPGMDPVQTFFAVDDGSSLDRAVSGLATS 1232 Query: 362 M 362 + Sbjct: 1233 L 1233 >gi|332970883|gb|EGK09860.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 448 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 76/219 (34%), Gaps = 26/219 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P + S + L + ++LD S SM GM K+ +A ++ + Sbjct: 121 PGGVKGPDGQESAEEKPLHVSILLDASGSMAGQVDGGM-KMNLAKAAVERFASSLPENAK 179 Query: 212 VNNVVRS--GLVTFSSKIVQT------FPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 V+ V G + K V +PL + + + +++ T ++ A Sbjct: 180 VSLWVYGHKGSNSKKDKPVSCKSTEEVYPLGTYQEEKFSQSLDQFRATGWTPIAASMKAA 239 Query: 263 YNKI-FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 ++ ++ E H+ YI+ +DG + E+ + + Sbjct: 240 REELQKNSGEDATHML--------YIV--SDGVETCGGDPVAEAKKLNQSKIKAVVNI-- 287 Query: 322 IGVQA-EAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 IG +A Q L+ A + +V++ + L F Sbjct: 288 IGFDVDDAGQQALQKVAEAGGGEYETVESEQDLRSYFDE 326 >gi|313895388|ref|ZP_07828945.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312976283|gb|EFR41741.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 647 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 73/210 (34%), Gaps = 32/210 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + G++++ V+D S SM ++ ++ +L D R GLV Sbjct: 454 RERKRGVNILFVVDASGSMA-----ARARMRAVKGAMLALLREAYVRRD-----RVGLVA 503 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + PL V+ Q + L G T GL A + A + + Sbjct: 504 FRRDRAETLLPLTRSVELAQRLLRELPTGGRTPLAAGLSEALLHLAGAARR-------GE 556 Query: 282 DYKKYIIFLTDGENSSP--NIDNKE-SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + ++ LTDG ++ D + +L G + + ++ Sbjct: 557 LAETLLVLLTDGRATAAPEGEDPAQAALTAAETIGNTGVR----ALVLDTEQDLVRL--- 609 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + Q + ++H I +E+ QRIL Sbjct: 610 ---HLAAQIAARMHAPCYTI-EELSVQRIL 635 >gi|225030986|gb|ACN79500.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Nilaparvata lugens] Length = 315 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 42/245 (17%), Positives = 80/245 (32%), Gaps = 63/245 (25%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ VLD+S SM FG + +L A ++K + D+N +V FS Sbjct: 60 QVVFVLDISGSM---FGEKIKQLKDA---------MLKILSDLNPQDHFSIVLFSDNAYV 107 Query: 230 TFPLAW-----------------------------------GVQHIQEKINRLIFGSTTK 254 V+ +E + + ++T Sbjct: 108 WSKAKTAVMKKILDEGFYNLDNETLAILDDHRNEILQATPDNVKTAKEFVELIKPTTSTN 167 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 GL + + KE L+ + + FLTDGE PN+D + + NE Sbjct: 168 IIDGLRKGLKLVKEGKETLDTTKEPSQP---IMFFLTDGE---PNVDLTDPVEIVNETSS 221 Query: 315 RGAI----VYAIGVQAEAADQFLKNCASPDRFYS------VQNSRKLHDAFLRIGKEMVK 364 +Y++ A FLK + + ++ + +L++ + I ++ Sbjct: 222 LNEQLKTPIYSLAFGQGADITFLKKLSKANHGFARNIYEGSDATLQLNNFYKEISSPLLA 281 Query: 365 QRILY 369 Sbjct: 282 NVTFI 286 >gi|209550318|ref|YP_002282235.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536074|gb|ACI56009.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 794 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 50/298 (16%), Positives = 101/298 (33%), Gaps = 27/298 (9%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N K + + + ++ E QD + ER+ SL KD+ L+ Sbjct: 255 NAKINPVSLTVDLKAGFPLGEVKSSFHEVDIRQDGDQ-ERTISLKGDAVPADKDFELTWQ 313 Query: 131 SRY-EMPFIFCTFPWCANSS-HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + ++P + + + ++ +++ V+D S SM+ GP Sbjct: 314 AAPGKLPSAGLFREVKDGKTCLLAFVTPPTAPDAAAPPAKREVVFVIDNSGSMS---GPS 370 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 +++ + L + + +V P + + L Sbjct: 371 IEQAKQSLALAISRLTPNDRFNVIRFDDTM--TDYFKGLVAATP--DNREKAIAYVRGLP 426 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T+ P LE DA +A G + ++FLTDG I N++ LF Sbjct: 427 ADGGTEMLPALE-------DALRNQGPVATGAL---RQVVFLTDG-----AIGNEQQLFQ 471 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 A R A V+ +G+ + F+ A R F + ++ ++ + ++ Sbjct: 472 EITANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTQIGSTDQVASRMGELFAKLQN 529 >gi|225387166|ref|ZP_03756930.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] gi|225046714|gb|EEG56960.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] Length = 556 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 88/258 (34%), Gaps = 28/258 (10%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 + F+ + + R L N D IE + + + +V Sbjct: 87 VANAPLSTFAADVDTASYANLRRKILEGNEVPADAVRIEEMLNYFTYDYPEPTEDEPFSV 146 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 + Y PW N + + + K ++ +++ ++DVS SM D Sbjct: 147 TTY-----IGDCPWNENHKLLQIGLQAE-KPDLENQKPSNLVFLIDVSGSME-----SAD 195 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--SSKIVQTFPLAWGVQHIQEKINRLI 248 KLG+ R+ + + ++ V+ +VT+ S +V I I L Sbjct: 196 KLGLVKRAFLLLTENLRPEDTVS------IVTYASSDTVVLDGVSGEEKAAIMTAIENLT 249 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G +T + G+E AY EH K ++ +I TDG+ + + Sbjct: 250 AGGSTDGSKGIETAYRL------AEEHFQKDGNNR---VILATDGDLNLGLTSEGDLTRL 300 Query: 309 CNEAKRRGAIVYAIGVQA 326 + K G + +G Sbjct: 301 IQKKKESGVFLSVMGFGT 318 >gi|226951529|ref|ZP_03821993.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] gi|226837721|gb|EEH70104.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] Length = 536 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 76/209 (36%), Gaps = 33/209 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++DVS SM DKL + +++R + + +++ V ++T++S Sbjct: 176 LVFLVDVSGSM-----SAADKLPLVKQTLRILTEQLRAQDKVT------IITYASGEKLV 224 Query: 231 FPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 G + I IN L G T ++ AY + A K + I+ Sbjct: 225 LEPTSGEQKEKILAVINGLRAGGATAGEQAIQLAYKQAEKA------FVKNGINR---IL 275 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNC--ASPDRFYSV 345 TDG+ + D E ++ G + +G +Q ++ A + + Sbjct: 276 LATDGDFNVGITDFSTLKGMVAEKRKSGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYI 335 Query: 346 QN--------SRKLHDAFLRIGKEMVKQR 366 N R+L + +++ Q Sbjct: 336 DNKNEAKKVVQRQLSSTLATVAQDVKIQV 364 >gi|217976666|ref|YP_002360813.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217502042|gb|ACK49451.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 346 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 57/163 (34%), Gaps = 20/163 (12%) Query: 138 IFCTFPWCANSSHAPLLITSS---VKISSKSDIGLDMMMVLDVSLSMNDHFG--PGMDKL 192 + TF A L +T ++ K G + +++D S SMN+ F P Sbjct: 48 LALTFAAVAAIGSTALALTGPRRGGELVEKIGDGAETALLIDRSGSMNETFAGRPPEGGE 107 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + R +L R G+V FS+ ++ PL + + + + Sbjct: 108 ESKANAARRVLQD---YVRRRQHDRFGVVGFSTSPIRMLPLTDNMDAVLGAVAAIDRPGL 164 Query: 253 --TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 T GL A + + + + + ++ ++DG Sbjct: 165 DYTDIGRGLSTALSLLSEDAGQS----------PRVLLLVSDG 197 >gi|251791982|ref|YP_003006702.1| TadG [Aggregatibacter aphrophilus NJ8700] gi|247533369|gb|ACS96615.1| TadG [Aggregatibacter aphrophilus NJ8700] Length = 592 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 41/269 (15%), Positives = 98/269 (36%), Gaps = 42/269 (15%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT- 61 F ++ F+++ KG +++TA+L + ++ ++ S +A+L + + L T Sbjct: 11 FYMLKRFYHDEKGVYAVMTALLAFPLLFLIAFAVDGSGILLDRARLAQATEQAALLLTTE 70 Query: 62 --------------KILNQENGN-------NGKKQKNDFSYRIIKNIWQTDFRNELRENG 100 ++ ++E N ++K + + + + Q + LR Sbjct: 71 NNQYRADKSNLSNVQVTDEEIKNAKGSFKTAQDRKKGAQALKRNQELVQGMVKLYLRSYD 130 Query: 101 FAQDI-------NNIERSTSLSIIIDDQHKDYNL------SAVSRYEMPFIFCTFPWCAN 147 Q + + ++ ++ +P+ + N Sbjct: 131 KEQKSSSPITIPKDFVAECRTQTSTRTNGESSSVACLVEGDVKRKFWLPWSYTLTSNNRN 190 Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-----NDHFGP-GMDKLGVATRSIRE 201 + + +VK I +D+M+V D+S SM +D GP +D L +++ Sbjct: 191 TVDINSGKSYAVK-EKDILIPIDLMLVNDISTSMFKPPKDDPQGPKKIDSLKTVVKAVAN 249 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +L + +++ R G+ +F Q Sbjct: 250 ILIPDEPPKNISKYNRIGITSFGLGAQQA 278 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 54/138 (39%), Gaps = 18/138 (13%) Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + ++ + G T ++ G+ N + + + K + ++ ++ L+DG +++ Sbjct: 437 SELNKVMSGIHAGGWTLASAGVFVGTNLLMNINKD-ATPDKIKTNTQRILLVLSDGVDTA 495 Query: 298 PNIDNKESLF--YCNEAKRR------------GAIVYAIGVQAEAADQFLK---NCASPD 340 +E L CN+ + + + + E + K NC P Sbjct: 496 LPTLTQELLKGGMCNKVRNKLDELQDKNYRILPTKIAFVAFGYEQDSELRKEWENCVGPG 555 Query: 341 RFYSVQNSRKLHDAFLRI 358 ++ +N + L + F +I Sbjct: 556 NYHQAKNEKALLEVFKQI 573 >gi|88800880|ref|ZP_01116434.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] gi|88776393|gb|EAR07614.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] Length = 555 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 48/274 (17%), Positives = 92/274 (33%), Gaps = 38/274 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT----FPWCANSSHAPLLITSSVKISSKS 165 + S+ + + DY L A P T PW + + S + K+ Sbjct: 134 PADSIRVEEFINYFDYALPAPDTTNTPIQISTERTQTPWNPQTELVRV-SLQSYRSDFKT 192 Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 L+++ +LDVS SM DKL + RS + + + R + ++ Sbjct: 193 LPPLNLVFLLDVSGSM-----NSPDKLPLMQRSFNLL------VSQLRPQDRVAIAVYAG 241 Query: 226 KIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + G Q I + IN+L G T + G+ AY+ + A + I + Sbjct: 242 QSGVVLEPTSGDQKAQINQAINQLRAGGGTHGSAGIHLAYD-LAQANYLPDGINR----- 295 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR- 341 I TDG+ + E + G + +G D ++ ++ Sbjct: 296 ---IFIGTDGDFNVGTTSLTELKALIERKREAGVFLSVLGFGTGNYNDALMEELSNHGNG 352 Query: 342 -------FYSVQN--SRKLHDAFLRIGKEMVKQR 366 + + + +L + K++ Q Sbjct: 353 TAYYLDSYQEARKLFATQLAATLQTVAKDVKIQI 386 >gi|327540682|gb|EGF27254.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 887 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 72/199 (36%), Gaps = 20/199 (10%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW N+ + I + I K +++ ++D S SM +KL + ++ + Sbjct: 480 PWNENNRLVRVGI-QAKDIDRKKRPRCNLVFLIDTSGSMKRP-----NKLPLVIEGMKVL 533 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 LD + V VV +G SS +V + I ++ L G +T GL+ A Sbjct: 534 LDQLNKKDRVAIVVYAG----SSGLVLDSTPVKQKKKIIRALSALSAGGSTNGGAGLQLA 589 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y A+E +I +DG+ + + + + G + + Sbjct: 590 YQT---ARENFIEDGVNR------VILCSDGDFNVGMTGTDQLVAEATRQSKSGTELTVL 640 Query: 323 GV-QAEAADQFLKNCASPD 340 G D ++ ++ Sbjct: 641 GFGMGNHNDAMMERISNSG 659 >gi|315923825|ref|ZP_07920054.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] gi|315622858|gb|EFV02810.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] Length = 969 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 64/222 (28%), Gaps = 52/222 (23%) Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQTFPLAWGVQHIQ 241 +L VA + M+D + + N VR LV+F + + +++ Sbjct: 155 GKRYSQKTRLDVAKSATNTMIDQLLANNATNPGSVRISLVSFDTFASDATAWSTSSENLH 214 Query: 242 EKINRLIFG---------STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 +N T L+ A + D +K++IF++D Sbjct: 215 SIVNGYKTPQSSHLGGHRGGTNWEDALQKA------------DGTQPRADAQKHVIFVSD 262 Query: 293 GENS----------------------------SPNIDNKESLFYCNEAKRRGAIVYAIGV 324 G + PN + + + GA Y +G Sbjct: 263 GNPTFRISSINGNPDDQYNDVHGHGDDDYYHSHPNYNYDAAKDDAKKIVDGGAAFYTVGT 322 Query: 325 QAEAA--DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +AA + D +Y + L AF I + Sbjct: 323 FGDAARMQNLATEAGASDNYYKADDEAALKAAFKNIVASITH 364 >gi|268610218|ref|ZP_06143945.1| von Willebrand factor, type A [Ruminococcus flavefaciens FD-1] Length = 565 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 63/173 (36%), Gaps = 31/173 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + S + + VLD S SM+ G ++ L + R+ + ++ G Sbjct: 382 KTNKNSGKPIAAVFVLDTSGSMS---GAPLNSLKASLRNSIKYINSSN---------YIG 429 Query: 220 LVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +V++SS + LA + ++ L T + L A + D Sbjct: 430 VVSYSSNVNVDLELAKFDLNQQAYFMGAVDSLTASGNTATFSALSQAMIMLRD------- 482 Query: 276 IAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQA 326 K + + + L+DG+++S + D ++ +Y IG A Sbjct: 483 FTKDNPNVSPMVFLLSDGQSNSGSEFSDIDGAIATAQ------IPIYTIGYNA 529 >gi|301627723|ref|XP_002943019.1| PREDICTED: complement C2-like [Xenopus (Silurana) tropicalis] Length = 678 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 48/329 (14%), Positives = 109/329 (33%), Gaps = 39/329 (11%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTS 113 ++ T + + G+ +K + D N ++ + Sbjct: 63 NARWSGVTAVCDDGAGHCPNP---GIPPGAMKTGVRYDMDNSIKYACSRGMSLVGSPHRT 119 Query: 114 LSIIIDDQHKDYNLSAVSRYEMP-----FIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + + +++P + N ++KI K D Sbjct: 120 CLESRRWSGTEISCQYPYSFDLPEDVQEQFKASLSGILNIKERSASFGRTIKI--KRDGI 177 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L++ +LD S S+ + + ++D + S G++++++ Sbjct: 178 LNVYFLLDASRSVGEA------NFDIYKECSVYLVDELASFDMTIQF---GIISYATVPK 228 Query: 229 QTFPL----AWGVQHIQEKI-NRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAK 278 P+ + H+ E I N L + + T LE YN + KE ++ + Sbjct: 229 VIIPIYDENSDNDAHVFEVIENDLKYSDHKDKTGTNIKTALEEVYNMMSFQKETYKNESV 288 Query: 279 GHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRG---AIVYAIGVQAEAADQFL 333 + + II LTDG+ + D + + + K++ VY G+ + L Sbjct: 289 WNSIHH-IIILLTDGKANIGGRPADTIKHIEEFLDIKKKREDYLDVYTFGIGPDVDMADL 347 Query: 334 KNCAS----PDRFYSVQNSRKLHDAFLRI 358 AS + ++++ ++ F +I Sbjct: 348 SEIASKKDGESHVFRMESANEMKTVFQKI 376 >gi|160878421|ref|YP_001557389.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] gi|160427087|gb|ABX40650.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] Length = 551 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 72/196 (36%), Gaps = 19/196 (9%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S + + PW ++ L + KI +++ ++DVS SM D D Sbjct: 158 SPFGITTELSDCPWNPDTKLF-LAGIQTEKIDFSKSAPSNLVFLIDVSGSMMDE-----D 211 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 KL + R+ + + + ++ V +G + +V + + IQ I L G Sbjct: 212 KLPLVQRAFLLLTENLTEKDRISIVTYAG----NDTVVLSGAKGNQKEKIQNAITELEAG 267 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T + G+E AY + + + +I TDG+ + E Sbjct: 268 GSTFGSKGIETAYQLAMENYIEGGNNR---------VILATDGDLNVGVTSESELTNLIE 318 Query: 311 EAKRRGAIVYAIGVQA 326 E ++ G + +G Sbjct: 319 EKRKSGVALSVLGFGT 334 >gi|47223676|emb|CAF99285.1| unnamed protein product [Tetraodon nigroviridis] Length = 628 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 46/229 (20%), Positives = 71/229 (31%), Gaps = 31/229 (13%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + A +H SS G D+ VLD S S+ ++ Sbjct: 18 VVMALVAGGAGQTHDSRAAGSSCY------GGFDLYFVLDKSGSVQHYWNEIF------- 64 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTT 253 + + ++ +R + FS+ L I+ + L G T Sbjct: 65 YFVHHLAHKF-----ISPQMRMSFIVFSTDGRTLMALTEDRDKIRAGLEELRMVQPGGDT 119 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 GL A +I+ A A II LTDGE D + A+ Sbjct: 120 YMDRGLHRASEQIYYAAGDGYRAAS-------VIIALTDGELREDQFDTAQ--REAGRAR 170 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS-RKLHDAFLRIGKE 361 + GA VY +G++ Q S D + V + L I K Sbjct: 171 QLGASVYCVGLKDFNETQLSTIADSKDHVFPVHDGFEALQSVIDSILKR 219 >gi|323941033|gb|EGB37220.1| von Willebrand protein type A [Escherichia coli E482] Length = 565 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 49/336 (14%), Positives = 101/336 (30%), Gaps = 45/336 (13%) Query: 39 SHKFFVKAKLHYILDHS-LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR 97 ++ K L L + A K N G + F +K + Q Sbjct: 57 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSL 116 Query: 98 ENGFAQDINNIE----------RSTSLSIIID--------DQHKDYNLSAVSRYEMPFIF 139 + N + + I++ + S + M + Sbjct: 117 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 176 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 PW + + I + S+ +++ ++D S SM ++L + S+ Sbjct: 177 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 230 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTP 257 + ++ ++ + +VT++ P G I I+ L +T Sbjct: 231 KLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 284 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GLE AY + KG + I+ TDG+ + D K + + G Sbjct: 285 GLELAYQQATKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGV 335 Query: 318 IVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRK 350 + GV + + + A + + + Sbjct: 336 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 371 >gi|297157667|gb|ADI07379.1| hypothetical protein SBI_04258 [Streptomyces bingchenggensis BCW-1] Length = 528 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 46/264 (17%), Positives = 79/264 (29%), Gaps = 22/264 (8%) Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 + RST + ++ + R P + ++ Sbjct: 273 LTEYLRSTEAQRTLTERTFRRPVVTSVRPAAPLSAAKRRELPFPGTRSVADGLLAAYENE 332 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + VLD S SM +D+L A + D + + L+ F Sbjct: 333 LRRPSRTVYVLDTSGSMEGD---RLDRLKAALTQLAGA-DGAATGERFRDREEVTLMPFG 388 Query: 225 SKIVQ-------TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 S++ + I+ L T L AY + + L Sbjct: 389 SEVKAVRTHTVPEDDPGKALAAIRADAKALTADGETAIFSSLRAAYRHLAERASALGDDR 448 Query: 278 KGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 I+ +TDGEN++ + D+ ES + +R V+ I + L+N Sbjct: 449 FTS------IVLMTDGENTAGDSADDFESFYRRLPGAQRTTPVFPILFGDSDRGE-LENI 501 Query: 337 AS--PDRFYSVQNSRKLHDAFLRI 358 AS R + L AF I Sbjct: 502 ASLTGGRLFDATK-GSLDQAFEEI 524 >gi|154687789|ref|YP_001422950.1| YwmC [Bacillus amyloliquefaciens FZB42] gi|154353640|gb|ABS75719.1| YwmC [Bacillus amyloliquefaciens FZB42] Length = 228 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 65/205 (31%), Gaps = 27/205 (13%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV--RSGLVTFSSKIVQ 229 ++LD S SM G+ K +A I D IKS V V G S K+ Sbjct: 39 AILLDASGSMAKRID-GVSKYNMAKEEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKVQS 97 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + Q +N + T LE A + Sbjct: 98 CESIRGVYGFQRFDRQSFLNSLNGIGPTGWTPIAKALEDAKASFTGLHKLG--------- 148 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNC-ASP 339 K + LTDGE + + + E +++ V IG + Q + A Sbjct: 149 -SKSVFLLTDGEETCGG----DPVKTAKELRKQHIKVNVIGFDFKEGFNGQLHEIAKAGG 203 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 ++Y + + ++ F + + Sbjct: 204 GKYYEAHSQKDMNRIFTMAASSLAE 228 >gi|109101588|ref|XP_001084624.1| PREDICTED: collagen alpha-3(VI) chain isoform 5 [Macaca mulatta] Length = 2969 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 51/310 (16%), Positives = 105/310 (33%), Gaps = 32/310 (10%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S LYT + + N + I K + L E +Q ++RS Sbjct: 312 LDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDE--ISQPAQELKRS 369 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFC--------TFPWCANSSHAPLLITSSVKISS 163 + ++ I ++ D ++ +F +T + ++ + Sbjct: 370 SIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLTGTPEVHA 429 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S ++ P + +++++ S+ N+ +R GLV F Sbjct: 430 NKR---DIIFLLDGSANVGKTNFPYVRDF---------VMNLVNSLDVGNDNIRVGLVQF 477 Query: 224 SSKIVQTFPL-AWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S V F L + + I + +L + G +Y E + H Sbjct: 478 SDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVHANHFTEAGGSRIREHV 537 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + ++ LT G++ L N R G + + +G + + +P Sbjct: 538 P--QLLLLLTAGQSED------SYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSL 589 Query: 342 FYSVQNSRKL 351 Y + + L Sbjct: 590 VYLMDDFSSL 599 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 46/295 (15%), Positives = 103/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 1325 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRAELQTITNDP 1384 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + T + ++ AP + + + D++ +LD S Sbjct: 1385 RLVFTVREFRELPNIEERIMTSFGTSAATPAPPGVATPSPSRPEKKKA-DIVFLLDGS-- 1441 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1442 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1494 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1495 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1553 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 59/143 (41%), Gaps = 13/143 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 ++++++ +P +R G+V FS + F L + + L F + GL Sbjct: 59 LVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGL 118 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ + + ++ ++ G +S +L + V Sbjct: 119 --ALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQAS--------V 168 Query: 320 YAIGVQAEAADQF-LKNCASPDR 341 ++ G+ A+AA + L++ A+ D Sbjct: 169 FSFGLGAQAASRAELQHIATDDN 191 >gi|197098872|ref|NP_001126081.1| complement C2 [Pongo abelii] gi|55730287|emb|CAH91866.1| hypothetical protein [Pongo abelii] Length = 752 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 44/206 (21%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + I E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDITEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ L N+ + +YAIGV Sbjct: 389 VDHIRELLNINQKRNDYLDIYAIGVG 414 >gi|73949158|ref|XP_535195.2| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Canis familiaris] Length = 946 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 73/201 (36%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + ++ F+ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAEDQFS------VIDFNHNVRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V ++ I ++ T L A + +A + Sbjct: 359 RNDLVSATRTQVTDAKKYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPESVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + + R ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKQHIRDNISLFSLGIGFDVDYDFLKRLSNENRGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|308175402|ref|YP_003922107.1| hypothetical protein BAMF_3511 [Bacillus amyloliquefaciens DSM 7] gi|307608266|emb|CBI44637.1| conserved hypothetical protein YwmC [Bacillus amyloliquefaciens DSM 7] gi|328555380|gb|AEB25872.1| hypothetical protein BAMTA208_18610 [Bacillus amyloliquefaciens TA208] gi|328913751|gb|AEB65347.1| hypothetical protein LL3_03821 [Bacillus amyloliquefaciens LL3] Length = 229 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 61/197 (30%), Gaps = 27/197 (13%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV--RSGLVTFSSKIVQ 229 ++LD S SM G+ K +A I D IKS V V G S K+ Sbjct: 40 AILLDASGSMAKRID-GVSKYNMAKDEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKVQS 98 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + Q +N + T LE A + Sbjct: 99 CESIRGVYGFQRFDKQSFLNSLNGIGPTGWTPIAKALEDAKASFNGVHKLG--------- 149 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNC-ASP 339 K + LTDGE + + + E +++ V IG Q A Sbjct: 150 -SKSVYLLTDGEETCGG----DPIKTAKELRKQHIKVNVIGFDFNEGFNGQLHAIAGAGG 204 Query: 340 DRFYSVQNSRKLHDAFL 356 ++Y + + ++ F Sbjct: 205 GKYYEAHSQKDMNRIFK 221 >gi|194211147|ref|XP_001917810.1| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 909 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 68/197 (34%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM + ++ A + + + N G+V F S + Sbjct: 307 VCLVLDKSGSMAGS--NRLSRMNQAAKHFL--------LQTIENGSWVGMVHFDSIALVI 356 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + E + T G++ A+ I D +++ Sbjct: 357 SNLTQIISSNERNKLFESL-PTEAVGGTSICAGIKSAFEVITDMYSQIDGSE-------- 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 I+ LTDGE+++ +E K+ GAI++ I + A + Y Sbjct: 408 -IVLLTDGEDNTAG-------SCVDEVKQSGAIIHFIALGPSADQAVIEMSTITGGKHKY 459 Query: 344 SVQNSRK--LHDAFLRI 358 + + L DAF + Sbjct: 460 ASDEAANNGLIDAFAAL 476 >gi|148657120|ref|YP_001277325.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569230|gb|ABQ91375.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 774 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 52/158 (32%), Gaps = 20/158 (12%) Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 VR G + + Q I+ L T GL+ + + A Sbjct: 352 VRDG--WYIDDVTIGPEWDDVRARAQAAIDTLNSRGATSIGGGLQSSQRMLDTANPD--- 406 Query: 276 IAKGHDDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + II L+DG EN+ P + + + + V+ IG+ +A Q + Sbjct: 407 -------LPRVIILLSDGQENTRPFVADVLP-----QIRAAQTTVHTIGLGRDADQQLML 454 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 + A + + +L + I + ++ L Sbjct: 455 SIAAQTGGTYNYAPTPEQLSGIYNTISGAVSNRQTLVT 492 >gi|22203747|ref|NP_666119.1| collagen alpha-2(VI) chain precursor [Mus musculus] gi|125987813|sp|Q02788|CO6A2_MOUSE RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|21706759|gb|AAH34414.1| Collagen, type VI, alpha 2 [Mus musculus] gi|148699895|gb|EDL31842.1| procollagen, type VI, alpha 2 [Mus musculus] Length = 1034 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 71/213 (33%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D +++ VLD S S M + + L + V R G Sbjct: 53 PEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLDQVALSWRYG 112 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I +H+ K Sbjct: 113 GLHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGK 166 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G + + + +TDG + + A+ G ++A+ +Q L++ A+ Sbjct: 167 GVVN---FAVVITDGHVTGSPCGGIK--MQAERAREEGIRLFAVAPNRNLNEQGLRDIAN 221 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 N + I ++ + + I K Sbjct: 222 SPHELYRNNYATMRPDSTEIDQDTINRIIKVMK 254 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 627 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 683 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AYN++ + + Sbjct: 684 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRRQKTRV--- 740 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 741 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 775 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 65/180 (36%), Gaps = 17/180 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ +LD S + + + + + L + + D N R L+ + Sbjct: 843 TQRPVDIVFLLDGSERLGEQNFHKVRRF---VEDVSRRLTLARRDDDPLNA-RMALLQYG 898 Query: 225 SKIVQT--FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ Q FPL + V I E + R S + G+ +A N + Sbjct: 899 SQNQQQVAFPLTYNVTTIHEALERATYLNSFSHVGTGIVHAINNVVRGARGGARRHAELS 958 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +FLTDG + +++ + +++ + + V + L + DR Sbjct: 959 -----FVFLTDGVTGNDSLEES-----VHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDR 1008 >gi|74217197|dbj|BAC31374.2| unnamed protein product [Mus musculus] Length = 338 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 71/213 (33%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D +++ VLD S S M + + L + V R G Sbjct: 48 PEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLDQVALSWRYG 107 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I +H+ K Sbjct: 108 GLHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGK 161 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G + + + +TDG + + A+ G ++A+ +Q L++ A+ Sbjct: 162 GVVN---FAVVITDGHVTGSPCGGIK--MQAERAREEGIRLFAVAPNRNLNEQGLRDIAN 216 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 N + I ++ + + I K Sbjct: 217 SPHELYRNNYATMRPDSTEIDQDTINRIIKVMK 249 >gi|326434685|gb|EGD80255.1| hypothetical protein PTSG_10931 [Salpingoeca sp. ATCC 50818] Length = 706 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 70/205 (34%), Gaps = 28/205 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D D++ +LD S S+ +E++ PD R + + Sbjct: 175 GDAVADLLFILDGSGSVGS------GNFQTMLNFAQEVVSFFDVAPDK---TRVAAMVYD 225 Query: 225 SKIVQTFPLAW----GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S + F + Q + + T++ L +A + +F +++G Sbjct: 226 SSNYRKFDFDYIQSVSKQQLINYFDTFAYPDGGTETGSALSFALSSMFVTSRGARDLSEG 285 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I +TDG++ ++L + G +YA+G+ + L ASP Sbjct: 286 ---VPRVAIVITDGKSGDDVSAPAQAL------RDAGVTLYAVGISGADVSE-LNQIASP 335 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 V+++ D F K Sbjct: 336 ----PVEDNVVFIDTFSEFSALASK 356 >gi|261822920|ref|YP_003261026.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261606933|gb|ACX89419.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 346 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 55/168 (32%), Gaps = 14/168 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM D L ++ ++ +K P ++ F+ Sbjct: 3 RLPVFFVLDCSESMIG------DNLKKMNDGLQAIVSDLKKDPHALETAWISVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL V+ + RL G T L +I + K KG + Sbjct: 57 QTIVPL---VEVVSFYPPRLPLGGGTSLGAALRELTKQIDEQVRKTTQERKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 LTDG P D + + + AIG+ A A L+ Sbjct: 112 YLLTDG---RPTDDTASEITRWKQHYANKVNLIAIGLGASADLNTLRQ 156 >gi|260814492|ref|XP_002601949.1| hypothetical protein BRAFLDRAFT_86433 [Branchiostoma floridae] gi|229287252|gb|EEN57961.1| hypothetical protein BRAFLDRAFT_86433 [Branchiostoma floridae] Length = 1774 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 68/193 (35%), Gaps = 29/193 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +VLD S S++ + + + ++ R G++ +S + Sbjct: 1140 DLFLVLDGSGSVS------VSDFDTVKQFVVAVVSAFTI---GLADTRVGVLQYSDRNTL 1190 Query: 230 TFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L IN + G T + +E+A ++ A K Sbjct: 1191 GCNLGDHPDEASFVSSINTMTRQGGGTSTGAAMEFA----------RQNAAWRPAPVPKI 1240 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYSV 345 +I LTDG++S + +L V+AIGV + + L+ P + + + Sbjct: 1241 MIVLTDGKSSDSVVAAAHALAA------DQVTVFAIGVGSFDHSELLEITNNKPSQVFEL 1294 Query: 346 QNSRKLHDAFLRI 358 + L + RI Sbjct: 1295 ADFNVLAQSINRI 1307 >gi|182412149|ref|YP_001817215.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177839363|gb|ACB73615.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 859 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 68/198 (34%), Gaps = 25/198 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + L++ ++LD S SM + R+ +L + + D R L+ F+ Sbjct: 510 AGQPLNLTVLLDTSGSMER------TDRATSVRAALGVLASLLTPDD-----RVTLIGFA 558 Query: 225 SKI-VQTFPLAWG-VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + LA + + + + F T L A + + + A + Sbjct: 559 RQPRLLAESLAGDQARQLVDLASTTPFTGGTNLEAALSLA----GELARRHHNAAAQNR- 613 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SP 339 I+ +TDG + N D + +++G A GV + D L+ Sbjct: 614 ----IVLITDGAANLGNADPAQLATRIETLRQQGIAFDACGVGTDGLDDAVLEALTRKGD 669 Query: 340 DRFYSVQNSRKLHDAFLR 357 R+Y + F R Sbjct: 670 GRYYVLDAPENADAGFAR 687 >gi|149046636|gb|EDL99461.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Rattus norvegicus] Length = 920 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|153214688|ref|ZP_01949548.1| RTX protein [Vibrio cholerae 1587] gi|124115210|gb|EAY34030.1| RTX protein [Vibrio cholerae 1587] Length = 2093 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 81/243 (33%), Gaps = 22/243 (9%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV------KISSK 164 +++L ID +K S + T + + V S+ Sbjct: 1427 NSTLHAPIDHPNKSGEDSLAINIPLEAKNATGAIGTGKVTLVIEDDAPVAKEVFHVAESE 1486 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV-TF 223 G ++ ++LD S SM + G G +L V + ++L ++ + R L+ + Sbjct: 1487 LKQGANVQLILDTSGSMGEPAGNGQTRLKVMQTAALQLLSEYSALGE----TRVQLIEFY 1542 Query: 224 SSKIVQTFPL---AW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S + W V E I+RL G T + A I+D + Sbjct: 1543 SDSRYYVSEINGSKWMTVDEASEHIDRLYAGGGTDYDDATKMA-ADIWDDNDGDMIAGGS 1601 Query: 280 HDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + Y FL+DG+ + + N + L + + A G+ + L A Sbjct: 1602 NISY-----FLSDGQPNQGEELSNNDRLDWEKHLRDHNVTALAYGMGNDVPQGELNKVAY 1656 Query: 339 PDR 341 Sbjct: 1657 DGH 1659 >gi|153873859|ref|ZP_02002297.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152069676|gb|EDN67702.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 367 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 61/174 (35%), Gaps = 23/174 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++DVS SM + + A + + L + LV ++ Sbjct: 6 LVFLVDVSGSMRSNHKLALL--KSALKLLSNQLTEKDKVS---------LVVYAGAAGVV 54 Query: 231 FPLAWGVQHIQ--EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 G Q ++ + RL G +T + G+ AYN A K + I+ Sbjct: 55 LEPTPGHQSVKINGALERLTAGGSTHGSAGIHLAYNLAEQA------FIKNGINR---IL 105 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDR 341 TDG+ + +D + E ++ G + +G DQ ++ A Sbjct: 106 LATDGDFNVGTVDFEALKNLVEEKRKSGISLTTLGFGRGNYNDQLMEQLADAGN 159 >gi|145500364|ref|XP_001436165.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124403304|emb|CAK68768.1| unnamed protein product [Paramecium tetraurelia] Length = 604 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 73/199 (36%), Gaps = 31/199 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S +G+D++ V+D S SM+ K+ + +++ +L+ + R L+ F Sbjct: 181 SKVGVDLLCVIDRSGSMSGE------KIEMVKQTLNILLNFLGPKD------RLCLIQFD 228 Query: 225 SKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L + I+++ T G + A +I + + Sbjct: 229 DTCQRLTNLRRVTDENKTYYSDIISKIYANGGTVIGLGTQMALKQI------KYRKSVNN 282 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--S 338 I L+DG++ + ++ L Y + +++ G ++ + + + Sbjct: 283 VTA---IFVLSDGQDEAAISSLQKQLAYYKQT----LTIHSFGFGSDHDAKLMTKISNLG 335 Query: 339 PDRFYSVQNSRKLHDAFLR 357 FY V N L + F+ Sbjct: 336 KGSFYFVNNISLLDEFFVD 354 >gi|32474636|ref|NP_867630.1| hypothetical protein RB7099 [Rhodopirellula baltica SH 1] gi|32445175|emb|CAD75177.1| conserved hypothetical protein-containing vWFA domain [Rhodopirellula baltica SH 1] Length = 885 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 74/199 (37%), Gaps = 20/199 (10%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW N+ + I + I K +++ ++D S SM +KL + ++ + Sbjct: 478 PWNENNRLVRVGI-QAKDIDRKERPRCNLVFLIDTSGSMKRP-----NKLPLVIEGMKVL 531 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 LD +K+ V VV +G SS +V + I ++ L G +T GL+ A Sbjct: 532 LDQLKNRDRVAIVVYAG----SSGLVLDSTPVKQKKKIIRALSALSAGGSTNGGAGLQLA 587 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y A+E +I +DG+ + + + + G + + Sbjct: 588 YQT---ARENFIEDGVNR------VILCSDGDFNVGMTGTDQLVAEATRQSKSGTELTVL 638 Query: 323 GV-QAEAADQFLKNCASPD 340 G D ++ ++ Sbjct: 639 GFGMGNHNDAMMERISNSG 657 >gi|116622495|ref|YP_824651.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225657|gb|ABJ84366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 77/219 (35%), Gaps = 29/219 (13%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + + + D + +V+D S SM + R+ + + N Sbjct: 63 EQPITVFNGEDGPVTAGIVIDNSASMEPKRAEVIAAAMAFARASNTRDQMF--VVHFNER 120 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 R GL + P ++ ++ I+ G +T + A + I Sbjct: 121 ARLGL-------PERTPFTGKIKELETAISSFDVGGSTALYDAILLAQSHI--------- 164 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV----QAEAADQ 331 +G ++ ++ +TDG ++S +E++ + + G ++YAIG+ + + Sbjct: 165 --RGGVYGRRILLVITDGGDNSSKATLEEAV---DAVAKAGVVIYAIGIYDPNDKDQNPK 219 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 L + A + + + I ++ +Q + Sbjct: 220 VLAHLAEVTGGEAFFPTALSDITRICEEIAADVRRQYTI 258 >gi|323936560|gb|EGB32847.1| von Willebrand type A protein [Escherichia coli E1520] Length = 565 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 49/336 (14%), Positives = 101/336 (30%), Gaps = 45/336 (13%) Query: 39 SHKFFVKAKLHYILDHS-LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR 97 ++ K L L + A K N G + F +K + Q Sbjct: 57 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSL 116 Query: 98 ENGFAQDINNIE----------RSTSLSIIID--------DQHKDYNLSAVSRYEMPFIF 139 + N + + I++ + S + M + Sbjct: 117 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 176 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 PW + + I + S+ +++ ++D S SM ++L + S+ Sbjct: 177 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 230 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTP 257 + ++ ++ + +VT++ P G I I+ L +T Sbjct: 231 KLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 284 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GLE AY + KG + I+ TDG+ + D K + + G Sbjct: 285 GLELAYQQATKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGV 335 Query: 318 IVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRK 350 + GV + + + A + + + Sbjct: 336 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 371 >gi|260824533|ref|XP_002607222.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] gi|229292568|gb|EEN63232.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] Length = 1897 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 65/176 (36%), Gaps = 27/176 (15%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 V I + + D+ +++D S S+ P + + +L + Sbjct: 1568 WSQGPVDIPAPTCRSKADIHVLVDGSKSVKTRNFPAVRQF---------ILKLAAGFEIG 1618 Query: 213 NNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFD 268 N R G+ F+ + F + + + + I ++ + + TK+ L+ Y + Sbjct: 1619 PNKARFGVYQFAKDMQTEFKMNQYNNREALLDAIKKIEYMNQYQTKTGQSLKAVYEEFTK 1678 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 A D +K II +TDG+ + + K +GA V+ +GV Sbjct: 1679 ANGAR-------DGVEKIIILITDGKATD------QVRQPAQYVKNKGAHVFTVGV 1721 >gi|321460551|gb|EFX71592.1| hypothetical protein DAPPUDRAFT_326968 [Daphnia pulex] Length = 950 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 42/194 (21%), Positives = 69/194 (35%), Gaps = 38/194 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++V+DVS SM + + KLG + R+ + P N G+V FSS Sbjct: 295 FVVVMDVSGSMKEF--NRIGKLGESVRAWIK-----TDFPSGNQ---LGMVQFSSNAEIL 344 Query: 231 FPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + K+ + +F T GL+ A + D Sbjct: 345 SDLRMIADEKSREEMMAKVPKEVF-VATCIGCGLQLAMQMLKDGG--------------- 388 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRFY 343 I+ +TDG+NS D + +AK V I A+ + L + ++ Sbjct: 389 IIVLVTDGKNSPGYHDISDVKKDIVDAK---IRVITIAYGSEADKNVEHLADVTGGKSYF 445 Query: 344 --SVQNSRKLHDAF 355 +S L AF Sbjct: 446 IKDDDSSEALQQAF 459 >gi|292624276|ref|XP_002665574.1| PREDICTED: collagen alpha-1(XXVIII) chain [Danio rerio] gi|225310547|emb|CAQ19234.1| collagen type XXVIII alpha 1 c precursor [Danio rerio] Length = 1170 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 58/169 (34%), Gaps = 24/169 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 L+++ V+D S S+ D V + ++D + R G+V +S Sbjct: 799 SPLELVFVIDSSESVGP------DNYEVVKDFVNSLIDHVSV---SREATRVGVVLYSHV 849 Query: 227 IVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V L + ++ + R+ G T + + A A+ Sbjct: 850 EVVVASLQQLYDQAAVKTAVRRMPYLGEGTFTGSAIRRATQLFQAARPG----------V 899 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +K + LTDG + N D A G ++ +G+ + Q+ Sbjct: 900 RKVAVVLTDG--LADNRDAVSLKDAAEGAHSAGIEIFVVGIVNNSDSQY 946 >gi|222082657|ref|YP_002542022.1| hypothetical protein Arad_9368 [Agrobacterium radiobacter K84] gi|221727336|gb|ACM30425.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 405 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 34/313 (10%), Positives = 99/313 (31%), Gaps = 26/313 (8%) Query: 28 IFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNI 87 + + +G + + V+ + LD +L+ + + +K + F + + Sbjct: 3 MLLAVGASFDYIRAYNVRQSMQSDLDAALIAAVKNVDAGDTDALKQKVSDWFHAQTESSY 62 Query: 88 WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCAN 147 D + + + +T + + +S S + P + Sbjct: 63 SLGDIEIDTTNHRITATASGTVPTTLMKL---ANINTVPVSVASAVKGPASSYLNVYIVI 119 Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS-------MNDHFGPGMDKL-----GVA 195 +L+ ++ G+ S ++++ +K VA Sbjct: 120 DKSPSMLLAATTAGQQAMYNGIGCQFACHTGDSHTIGTATYSNNYAYSTEKKIKLRADVA 179 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 ++ E++D+I + + ++ GL + I + ++++ + +T+ + Sbjct: 180 VDAVHEVIDMISASDTNHERIKVGLYSLGDTITEVLAPTLDTTAAGKRVDSDLTSATSTT 239 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-------- 307 + + + + K ++ LTDG S + + Sbjct: 240 YTYFDVSLAALKNKVG-TGGDGSSSATPLKLVLLLTDGVQSQREWVTSGAKYQPKVAPLN 298 Query: 308 --YCNEAKRRGAI 318 +C+ K++ A Sbjct: 299 PAWCDYIKKQSAT 311 >gi|221110023|ref|XP_002170779.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 671 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 70/196 (35%), Gaps = 27/196 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + D+ +LD S S+ + D L S N +G+V Sbjct: 33 TPDCEGFFDVGFILDSSGSLKSQYWKEKDFLKKLANSFGI----------SNKGSHAGVV 82 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 TFS + L + + ++R+ S T+ L A + Sbjct: 83 TFSHYAELSIRLDAFYSSIDFNDAVDRISHMDSFTRIDLALAKALELF--DIKNGARNDV 140 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAEAAD-QFLKNC 336 + + LTDG+ + + L +E K++G ++A+G+ A A + K Sbjct: 141 PNLLF-----LLTDGKQ-----EPEMPLTHISDEIKQKGIQLFAVGIGAGANKTELEKIV 190 Query: 337 ASPDRFYSVQNSRKLH 352 +P+ + V + KL Sbjct: 191 GNPENVFMVDDFDKLL 206 >gi|209527393|ref|ZP_03275900.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492184|gb|EDZ92532.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 463 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 66/210 (31%), Gaps = 39/210 (18%) Query: 163 SKSDIGLDMMMVLDVSLSM------------------NDHFGPGMDKLGVATRSIREMLD 204 S S V+D S SM + + + E L+ Sbjct: 35 SASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLE 94 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEY 261 + S ++ R LV F PL ++ I +L F T+ G+E Sbjct: 95 GLVSSGQADSRDRIALVRFDDSASVLLPLTASTDTASLKNAIGQLRNFSGGTRMALGMEE 154 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A N + K D + + TDG+ D + + G + A Sbjct: 155 ALNIL-----------KNCDLSSRRTLIFTDGQ----TFDESDCRDLATQFAEAGIPITA 199 Query: 322 IGVQAEAADQFLKNCA--SPDRFYSVQNSR 349 +GV E + L + + R ++V ++ Sbjct: 200 LGVG-EYNEDLLLYLSDRTGGRVFNVVETQ 228 >gi|291295700|ref|YP_003507098.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470659|gb|ADD28078.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 354 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 73/258 (28%), Gaps = 48/258 (18%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPG 188 + P C A + S ++K+ +++ +D S SM P Sbjct: 51 KAHVRWPLALQLLALCLLLLAAARPVASPPLPTNKA----AIVLAVDTSRSMLATDLNP- 105 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 ++L A + R+ ++ P + GLV+FS Q + E I RL Sbjct: 106 -NRLEAAKATARKFIE---LAPP---TTQIGLVSFSDSASALVMPTTDRQKLLEAIERLK 158 Query: 249 FGSTTKSTPGLEYAYNKI-------------------------FDAKEKLEHIAKGHDDY 283 T + + + + Sbjct: 159 PAQNTSIENAIITGVRMLPGRNTLRPPAELQPPGLSQPDPLQGIPDLPLPQQAQPPANLP 218 Query: 284 KKYIIFLTDGENS---SPNIDNKESLFYCNE-AKRRGAIVYAIGVQAEAADQFLKNCASP 339 ++ L+DG ++ +P + + +L AK +Y + Sbjct: 219 PGSLVILSDGASNVSSNPTLPTRTTLEVAARFAKNANVRLYTFPMGQPGGA---VTQIEG 275 Query: 340 DRFY---SVQNSRKLHDA 354 +Y +N +L A Sbjct: 276 RHYYIPFEPRNLEQLAQA 293 >gi|3024062|sp|P97279|ITIH2_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1694690|dbj|BAA13939.1| inter-alpha-trypsin inhibitor heavy chain 2 [Mesocricetus auratus] Length = 946 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 75/201 (37%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTEDQFS------VVDFNHNVRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + I ++ T L A + +A ++ + D Sbjct: 359 RNDLVSATKTQITDAKRYIEKIQPSGGTNINEALLRAIFILNEAS----NLGMLNPDSVS 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 I+ ++DG+ + + + + + ++++G+ + FLK ++ +R Sbjct: 415 LIVLVSDGDPTVGELKLSKIQKNVKQNIQDNISLFSLGIGFDVDYDFLKRLSNENRGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNRDTSSQLKKFYNQVST 495 >gi|327330197|gb|EGE71946.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL097PA1] Length = 322 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 67/204 (32%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D SLSM + + D I S+P N +V+ S Sbjct: 96 IVVAIDSSLSMKADDVSP----TRLAAAKAKAKDFINSLPTGFN---VAVVSISEHPEIR 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ + T ++ + + A ++ A I+ L Sbjct: 149 MPPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVML 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG+N+ L N A VY I E + L Sbjct: 203 SDGDNTQGG----SPLVAANRAAAAKVPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTV 258 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + + ++ ++ KL + + ++ Sbjct: 259 ADRTHAKSWTADSADKLQEVYQQV 282 >gi|50842461|ref|YP_055688.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427106|ref|ZP_06428822.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130538|ref|YP_003581201.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840063|gb|AAT82730.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159575|gb|EFD07763.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375881|gb|ADD99735.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|313764513|gb|EFS35877.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772104|gb|EFS38070.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313792200|gb|EFS40301.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|313801849|gb|EFS43083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807458|gb|EFS45945.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809968|gb|EFS47689.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313813000|gb|EFS50714.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816054|gb|EFS53768.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818503|gb|EFS56217.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820269|gb|EFS57983.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822922|gb|EFS60636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825146|gb|EFS62860.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827717|gb|EFS65431.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830297|gb|EFS68011.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833671|gb|EFS71385.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|313838673|gb|EFS76387.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] gi|314915507|gb|EFS79338.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314918208|gb|EFS82039.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] gi|314920023|gb|EFS83854.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925156|gb|EFS88987.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932037|gb|EFS95868.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955907|gb|EFT00307.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958219|gb|EFT02322.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960060|gb|EFT04162.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962859|gb|EFT06959.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967773|gb|EFT11872.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973302|gb|EFT17398.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975980|gb|EFT20075.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978483|gb|EFT22577.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314984001|gb|EFT28093.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] gi|314989988|gb|EFT34079.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078074|gb|EFT50125.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315080702|gb|EFT52678.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] gi|315084374|gb|EFT56350.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085715|gb|EFT57691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088865|gb|EFT60841.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096217|gb|EFT68193.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098475|gb|EFT70451.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101165|gb|EFT73141.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315108386|gb|EFT80362.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326129|gb|EGE67919.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA2] gi|327331996|gb|EGE73733.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA3] gi|327443198|gb|EGE89852.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445983|gb|EGE92637.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448037|gb|EGE94691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450841|gb|EGE97495.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453082|gb|EGE99736.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453813|gb|EGF00468.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753529|gb|EGF67145.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754260|gb|EGF67876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754489|gb|EGF68105.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760649|gb|EGF74216.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL099PA1] gi|332675378|gb|AEE72194.1| hypothetical protein PAZ_c10190 [Propionibacterium acnes 266] Length = 322 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 67/204 (32%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D SLSM + + D I S+P N +V+ S Sbjct: 96 IVVAIDSSLSMKADDVSP----TRLAAAKAKAKDFINSLPTGFN---VAVVSISEHPEIR 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ + T ++ + + A ++ A I+ L Sbjct: 149 MPPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVML 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG+N+ L N A VY I E + L Sbjct: 203 SDGDNTQGG----SPLVAANRAAAAKVPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTV 258 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + + ++ ++ KL + + ++ Sbjct: 259 ADRTHAKSWTADSADKLQEVYQQV 282 >gi|291393629|ref|XP_002713393.1| PREDICTED: alpha 1 type VII collagen [Oryctolagus cuniculus] Length = 2937 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 75/224 (33%), Gaps = 27/224 (12%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + + T + + S +++ D++ +LD S S+ Sbjct: 2 RLRLLVATLCVGILAGAPRVRAQSREQVTCTRLYAADIVFLLDGSSSIGRS------NFR 55 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIF-G 250 + ++ VR V +S F L + I L + G Sbjct: 56 EVRGFLEGLVLPFSGAASA-QGVRFAAVQYSDDPRTEFGLDTLASGGEVIRAIRELSYKG 114 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T++ + + + IF L + + K I +TDG++ + Sbjct: 115 GNTRTGAAIRHVADHIF-----LPQLTRPGIP--KVCILITDGKSQD------QVDAAAQ 161 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFYSVQNSRKL 351 K +G ++A+G++ A + LK AS D F+ V + L Sbjct: 162 RLKGQGVKLFAVGIK-NADPEELKRVASQPTGDFFFFVNDFSIL 204 >gi|260813588|ref|XP_002601499.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] gi|229286796|gb|EEN57511.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] Length = 375 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 72/193 (37%), Gaps = 25/193 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ G + ++ + P G++ +S++ Sbjct: 6 PLDIIFLLDGSGSV------GASNFDKVKQFTKKAISGFDISPSGTQ---VGVIQYSTRT 56 Query: 228 VQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F L + + I+ + T + + Y F + A+ Sbjct: 57 RQEFSLNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKSD----GARPGVP-- 110 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K +I +TDG + EA+++G VYAIGV ADQ L+ AS + + Sbjct: 111 KVVIVVTDGVSYDAVAAP------ALEAQQKGITVYAIGVSGYDADQ-LEQIASNNNTLA 163 Query: 345 VQNSRKLHDAFLR 357 ++ L D Sbjct: 164 FVDNFNLLDNLRN 176 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 61/168 (36%), Gaps = 24/168 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ G + ++ + P G++ +S++ Sbjct: 224 PLDIIFLLDGSGSV------GASNFEKVKQFTKKTISGFDISPSGTQ---VGVIQYSTRT 274 Query: 228 VQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F + + + I+ + T + + Y F + A+ Sbjct: 275 RQEFSMNSFLTKETLSAAIDEVQYMRGGTLTGKAIRYVTKYGFGKSD----GARPGVP-- 328 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K +I +TDG + EA+++G VYAIGV DQ Sbjct: 329 KVVIVVTDGVSYDAVAAP------ALEAQQKGITVYAIGVSGYDVDQL 370 >gi|74002027|ref|XP_544943.2| PREDICTED: similar to anthrax toxin receptor 2 [Canis familiaris] Length = 646 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 71/213 (33%), Gaps = 29/213 (13%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S + D+ VLD S S+ +++ + + T V+ + Sbjct: 189 VSAQEQPSCRGAFDLYFVLDKSGSVANNWIEIYNFVQQLTERF------------VSPQM 236 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FSS+ PL I E + + T GL+ A +I A Sbjct: 237 RLSFIVFSSQATIILPLTGDRSKISKGLEDLKNVSPVGETYIHEGLKLANEQIQKA---- 292 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 G II LTDG+ + + ++ GA VY +GV Q Sbjct: 293 -----GGLKTSSIIIALTDGKLDG--LVPSYAEKEAKISRSFGARVYCVGVLDFEQAQLE 345 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + S ++ + V+ A I ++ Q Sbjct: 346 RIADSKEQVFPVKGG---FQALKGIINSILAQS 375 >gi|130498817|ref|NP_001076116.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] gi|11041696|dbj|BAB17301.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] Length = 946 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 71/201 (35%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ I Sbjct: 311 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFS------VVDFNHNIRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + I ++ T L A + +A Sbjct: 359 RNDLVSATKTQIADAKRYIEKIQPNGGTNINEALLRAIFILNEANNMGLLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + + + ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKQNIQDNVSLFSLGIGFDVDYDFLKRLSNENRGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|241983030|emb|CAZ65768.1| antrax toxin receptor 2 [Cricetulus griseus] Length = 463 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 48/229 (20%), Positives = 75/229 (32%), Gaps = 40/229 (17%) Query: 149 SHAPLLITSSVKI-----SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 L + + D+ VLD S S+ +++ + + T Sbjct: 12 GLWLLAVGGPGSLLHAQEQPSCKKAFDLYFVLDKSGSVANNWIEIYNFVHQLTERF---- 67 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLE 260 V+ +R + FSS+ PL I E + + T GL+ Sbjct: 68 --------VSPEMRLSFIVFSSQATIILPLTGDRYKIGKGLEDLKAVKPVGETYIHEGLK 119 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR---GA 317 A +I A G II LTDG +D + NEAK+ GA Sbjct: 120 LANEQIQSA---------GGLKTSSIIIALTDG-----KLDGLVPSYAENEAKKSRTLGA 165 Query: 318 IVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 VY +GV Q + S D+ + V+ A I ++ Q Sbjct: 166 SVYCVGVLDFEQAQLERIADSKDQVFPVKGG---FQALKGIINSILAQS 211 >gi|242078369|ref|XP_002443953.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] gi|241940303|gb|EES13448.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] Length = 567 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 47/240 (19%), Positives = 79/240 (32%), Gaps = 41/240 (17%) Query: 138 IFCTFPWCANSSHAPL-LITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 C FP + + K S S LD++ VLDVS SM KL + Sbjct: 47 TQCEFPAVGRFTSRDRFAVLVHAKAPSDVSRAPLDLVTVLDVSDSMKGE------KLALL 100 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGS 251 +++ ++D + R +VTFS+ + LA G + + L Sbjct: 101 KQAMCFVIDQLGPAD------RLSVVTFSNDASRLTRLARMSDAGKASAKIAVESLAVQG 154 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-------- 303 T G+ A + +EK +I L+DG ++ + Sbjct: 155 FTNIKQGIHVAAEVLAGRREKNVVAG---------MILLSDGHDNCGGTSVRPDGTKSYV 205 Query: 304 ----ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 SL + R A ++ G + A+ F V + + D+F R Sbjct: 206 NLVPPSLTVAAGSSRPAAPIHTFGFGTSHDAGAMHAVAEATGGTFSFVGDEAAIQDSFAR 265 >gi|162451432|ref|YP_001613799.1| hypothetical protein sce3160 [Sorangium cellulosum 'So ce 56'] gi|161162014|emb|CAN93319.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 404 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 44/193 (22%), Positives = 70/193 (36%), Gaps = 21/193 (10%) Query: 171 MMMVLDVSLSMNDHFGPGM-DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +M+V+D S SM +G G D L A S+ + L + VR G +TF+ Q Sbjct: 110 VMIVVDRSGSM---YGSGFWDPLKTAVLSVVDRLQ---------DRVRFGFLTFTGTANQ 157 Query: 230 TFPLAWGVQHIQ--EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY-KKY 286 PL G I + P + E + + KY Sbjct: 158 QCPLLAGADGIALNHHAAIAAAYDEASTVPPGKLETPTAMTFNETVVPELLAFPEPGPKY 217 Query: 287 IIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I+F+TDGE + + +A +G + G+ + A Q L++ A+ Sbjct: 218 ILFVTDGEPDRCDDVRAECARDDVVGAVQDAYEQGIGTFVFGLGSGALAQHLQDVANAGA 277 Query: 342 FYSVQNSRKLHDA 354 V+ R DA Sbjct: 278 GQPVERPRSGTDA 290 >gi|149046634|gb|EDL99459.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Rattus norvegicus] Length = 927 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|145482427|ref|XP_001427236.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394316|emb|CAK59838.1| unnamed protein product [Paramecium tetraurelia] Length = 1189 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 72/189 (38%), Gaps = 14/189 (7%) Query: 170 DMMMVLDVSLSMNDHFGPGMD--KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 ++++LD+S SM++ + D ++GV D + + LV F ++I Sbjct: 741 AIVVLLDISGSMDELYYDSEDLTRMGVVKAFFSTFADRTMAYDLKHV---ISLVYFDNRI 797 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 ++ ++ +N+ T L+YA N++ K+ I Sbjct: 798 IEKCSFTELFILFKDLVNKAQPTGRTNLYRALKYAENQLLKFKQTYPKCLLR-------I 850 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I LTDG+++ + + + + ++ + V + D A+ + +S Q Sbjct: 851 IALTDGQDNDNH--PLDPIKVAESILKNEILLDSFVVSDDCTDLKKITKATGGQCFSPQT 908 Query: 348 SRKLHDAFL 356 ++ F Sbjct: 909 IQEGLKLFE 917 >gi|47168593|pdb|1Q0P|A Chain A, A Domain Of Factor B Length = 223 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 80/222 (36%), Gaps = 34/222 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +S+ +++ + S R GLVT+++ Sbjct: 8 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKSLVNLIEKVASYGVKP---RYGLVTYATY 64 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + + +++N + S T + L+ Y+ + + Sbjct: 65 PKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---P 121 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 122 EGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLV 181 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + AS + V++ L D F ++ E Sbjct: 182 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLS 223 >gi|159900441|ref|YP_001546688.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893480|gb|ABX06560.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 978 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 81/195 (41%), Gaps = 19/195 (9%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + + + ++ D+S SM++ G K+ +A+ ++ +++ ++ + L Sbjct: 401 NREKYPPVSVAVIFDISGSMSEVVGGRQ-KVTLASEGAARVVQLLRDFDEI-----TVLP 454 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ Q P+A + + + +I T G+ + D+ ++ KG + Sbjct: 455 FDSAVQNQYGPVAGSEREVAQ--GEIIARGVT-GGGGIN-----VHDSLVAAGNVLKGRN 506 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN--CASP 339 ++II L DG +S + ++ +E +R G I + FL N A Sbjct: 507 APIRHIILLADGSDSQQQ---ENAVRLTDEHRRLGITTSTIAIGNGGDVGFLNNVAVAGG 563 Query: 340 DRFYSVQNSRKLHDA 354 R + V+++ L D Sbjct: 564 GRHFLVEDALSLPDI 578 >gi|118356063|ref|XP_001011290.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293057|gb|EAR91045.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 520 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 31/135 (22%), Positives = 57/135 (42%), Gaps = 25/135 (18%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ V+D S SM K+ + +SI ++L II+ R LV F+S+ Sbjct: 95 PLDLIFVIDTSGSMQGK------KIELVKKSILQVLHIIQGDD------RISLVGFNSQA 142 Query: 228 VQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + IQ+ ++ L G T+ G++ A++ I + + Sbjct: 143 KVLLELTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDIIKERTNSKNLAS------ 196 Query: 284 KKYIIFLTDGENSSP 298 I L+DG+++ Sbjct: 197 ---IFLLSDGQDNCG 208 >gi|125975609|ref|YP_001039519.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|256003656|ref|ZP_05428645.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281416621|ref|ZP_06247641.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|125715834|gb|ABN54326.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|255992447|gb|EEU02540.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281408023|gb|EFB38281.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939730|gb|ADU73764.1| von Willebrand factor type A [Clostridium thermocellum DSM 1313] Length = 565 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 62/192 (32%), Gaps = 32/192 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 F A L K + + + V DVS SM L +S Sbjct: 366 ISNFDGEAIMKAQKLW-----KEKKDVNNDIVAVFVADVSGSMAGE------PLNRLKQS 414 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTK 254 + +++ V GLV++S+ + P+A + L G T Sbjct: 415 LIN------GSKYISSDVSIGLVSYSTDVNINLPIAKFDLNQRSLFVGAVESLAAGGNTA 468 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + + A + + K K + + K + L+DG + + N + K Sbjct: 469 TFDAIIVATKMLKEEKAK-------NPNAKLMLFVLSDGVTNYGHSLND----IKDMMKT 517 Query: 315 RGAIVYAIGVQA 326 G +Y IG A Sbjct: 518 FGIPIYTIGYNA 529 >gi|328712314|ref|XP_001943110.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Acyrthosiphon pisum] Length = 884 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 52/335 (15%), Positives = 102/335 (30%), Gaps = 43/335 (12%) Query: 57 LYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 L I N + + I I + E+G + I S ++ Sbjct: 250 LQVIVDIEESSNITTLEVPDIKTANEIETTISKNKLAKISYESG---NKATITWSPTVKE 306 Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 + + Y++ + + + + ++ VLD Sbjct: 307 QLTFTEHGVKGQFIVHYDVDHKSAPNQVLIDDGYFVHFFA----PTDLKPLRTHVIFVLD 362 Query: 177 VSLSM--------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG-----LVTF 223 VS SM + G + ++ + + +N + V+ Sbjct: 363 VSGSMVGQKLPQVKEAMGQILSEIHSEDFFTLILFSDFAQVWTINATQETSNHWDEKVS- 421 Query: 224 SSKIVQTFPLAW-------------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + K L VQ+ ++ I L S+T L A+ AK Sbjct: 422 NWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQDLQSESSTNMEDALNKAHLI---AK 478 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 G + K I+FLTDGE ++ + +E + Y + +Y++G A Sbjct: 479 LGETRFKDGANTPKPIIVFLTDGEPTTGITEPQELIKYVSNTNEEKYPIYSLGFGEGADI 538 Query: 331 QFLKNCASPDR-----FYSVQNSR-KLHDAFLRIG 359 FLK + + Y ++ +L + + I Sbjct: 539 DFLKKLSLNNTGFARVIYEASDASLQLRNFYKEIS 573 >gi|311262926|ref|XP_003129419.1| PREDICTED: anthrax toxin receptor 2-like [Sus scrofa] Length = 241 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 72/214 (33%), Gaps = 27/214 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P + + P + S+ + S S D+ VLD S S+ +++ Sbjct: 11 PGSWLVPGLWLLALSGPGALVSAQEQPSCSG-AFDLYFVLDKSGSVANNWIE-------- 61 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGST 252 + L P + R + FSS+ PL I E ++ L Sbjct: 62 IYNFVHQLTERFVSPQM----RLSFIVFSSQATIILPLTGDRGKISEGLDNLKRVSPVGE 117 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GL+ A +I A G II LTDG+ + + + Sbjct: 118 TYIHEGLKLANEQIEKA---------GGLKTSSIIIALTDGKLDG--LVPSYAEKEAKIS 166 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 + GA VY +GV Q + S ++ + V Sbjct: 167 RSLGARVYCVGVLDFEQAQLERIADSKEQVFPVT 200 >gi|170744040|ref|YP_001772695.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168198314|gb|ACA20261.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 654 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 71/214 (33%), Gaps = 28/214 (13%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + PW + I ++ +++ ++D S SM ++L Sbjct: 238 FRVTASVFPSPWAEGRKLLHIGIRGYAVAPAE-RPPANLVFLVDTSGSMA-----APNRL 291 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPLAWGVQHIQEKINRLIFG 250 + +S+ +L + + R LV ++ ++ V A I I L Sbjct: 292 PLVKQSLAMLLTTLDARD------RVALVAYAGEVGTVLEPTPAGEAGRILAAIETLQAH 345 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPNIDNKESLFY 308 +T G+ AY +A H D K +I TDG+ + E + Sbjct: 346 GSTAGGEGIRQAYA-----------LAARHFDPKAVNRVILATDGDFNVGITGRDELTGF 394 Query: 309 CNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 +R+G + +G D ++ A Sbjct: 395 VARERRKGIFLSVLGFGMGNLNDALMQALAKDGN 428 >gi|297473448|ref|XP_002686617.1| PREDICTED: collagen, type VI, alpha 3-like isoform 2 [Bos taurus] gi|296488812|gb|DAA30925.1| collagen, type VI, alpha 3-like isoform 2 [Bos taurus] Length = 2956 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 71/201 (35%), Gaps = 22/201 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +T + ++ D++ +LD S ++ + P + +++++ S+ Sbjct: 418 RTLTGTTEVRVNKR---DIIFLLDGSSNVGETNFPYVRDF---------VMNLVNSLDVG 465 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ +R GLV FS V F L + + ++ + G +Y Sbjct: 466 SDHIRVGLVQFSDTPVTEFSLNTYPTKSELLAHLRQMQLQGGSVLNTGAALSYVHANHFT 525 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + H + ++ LT G++ L N R G + + +G Sbjct: 526 EAGGSRIQDHVP--QLLLLLTAGQSED------SYLQAANALARAGILTFCVGTSQADRA 577 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + +P Y + + L Sbjct: 578 ELEEIAFNPGLVYLMDDFSSL 598 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 91/249 (36%), Gaps = 20/249 (8%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 NI+R+ +I D + + + + AP + + + Sbjct: 1370 NIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAPPGVDTPSPSRPEKK 1429 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S D R + E++D + + + ++ GLV ++S Sbjct: 1430 KA-DIVFLLDGS------INFRRDSFQEVLRFVSEIVDTV---YEGGDSIQVGLVQYNSD 1479 Query: 227 IVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L Q I + IN++++ + + + E ++ Sbjct: 1480 PTDEFFLKDFPTKQQIIDAINKVVYKGGRHANT--KVGLEHLRRNHFVPEAGSRLDQRVP 1537 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 + +T G++ + +L +RG V+A+GV+ +++ K ++ + Sbjct: 1538 QIAFVITGGKSVEDAQEASMALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFR 1591 Query: 345 VQNSRKLHD 353 V N ++L + Sbjct: 1592 VGNVQELSE 1600 >gi|297473446|ref|XP_002686616.1| PREDICTED: collagen, type VI, alpha 3-like isoform 1 [Bos taurus] gi|296488811|gb|DAA30924.1| collagen, type VI, alpha 3-like isoform 1 [Bos taurus] Length = 3162 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 71/201 (35%), Gaps = 22/201 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +T + ++ D++ +LD S ++ + P + +++++ S+ Sbjct: 624 RTLTGTTEVRVNKR---DIIFLLDGSSNVGETNFPYVRDF---------VMNLVNSLDVG 671 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ +R GLV FS V F L + + ++ + G +Y Sbjct: 672 SDHIRVGLVQFSDTPVTEFSLNTYPTKSELLAHLRQMQLQGGSVLNTGAALSYVHANHFT 731 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + H + ++ LT G++ L N R G + + +G Sbjct: 732 EAGGSRIQDHVP--QLLLLLTAGQSED------SYLQAANALARAGILTFCVGTSQADRA 783 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + +P Y + + L Sbjct: 784 ELEEIAFNPGLVYLMDDFSSL 804 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 91/249 (36%), Gaps = 20/249 (8%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 NI+R+ +I D + + + + AP + + + Sbjct: 1576 NIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAPPGVDTPSPSRPEKK 1635 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S D R + E++D + + + ++ GLV ++S Sbjct: 1636 KA-DIVFLLDGS------INFRRDSFQEVLRFVSEIVDTV---YEGGDSIQVGLVQYNSD 1685 Query: 227 IVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L Q I + IN++++ + + + E ++ Sbjct: 1686 PTDEFFLKDFPTKQQIIDAINKVVYKGGRHANT--KVGLEHLRRNHFVPEAGSRLDQRVP 1743 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 + +T G++ + +L +RG V+A+GV+ +++ K ++ + Sbjct: 1744 QIAFVITGGKSVEDAQEASMALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFR 1797 Query: 345 VQNSRKLHD 353 V N ++L + Sbjct: 1798 VGNVQELSE 1806 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 85/223 (38%), Gaps = 26/223 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 I C SS AP +I + D++ ++D S G V Sbjct: 208 IVGNLVACVRSSMAPERAGG-TEIPKDITAQDSADIIFLIDGSN------NTGSVNFAVI 260 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIFGSTT 253 + +L+ + +R G+V +S + F L ++ + + + + L F Sbjct: 261 LDFLVNLLERLSI---GTQQIRVGVVQYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGE 317 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + GL A + + + ++ + + ++ ++ G +S D +L + Sbjct: 318 LANVGL--ALDFVVENHFTRAGGSRAEEGVPQVLVLISAGPSSDEIRDGVIALKQAS--- 372 Query: 314 RRGAIVYAIGVQAEAADQF-LKNCASPDR-FYSVQNSRKLHDA 354 V++ G+ A+AA + L++ A+ D ++V R L D Sbjct: 373 -----VFSFGLGAQAASKAELQHIATNDNLVFTVPEFRSLGDV 410 >gi|156409361|ref|XP_001642138.1| predicted protein [Nematostella vectensis] gi|156229279|gb|EDO50075.1| predicted protein [Nematostella vectensis] Length = 989 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 68/208 (32%), Gaps = 18/208 (8%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S Y+ P P K ++ ++ +D+++++D S S+ G+ Sbjct: 304 SAYDYPPRSSCVPCPPGPELPISNTAVPAKTAASAN--VDLVILIDGSRSVERS---GVG 358 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 A R++ P R GL+ + + + F NRL G Sbjct: 359 NFRRAIDFARDLTSSFVVSP---RHTRVGLMVYGKRAYKVFGFND-----YRDNNRLFTG 410 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 E A + +K ++ +TDG I + ++ Sbjct: 411 FNKPIRYPRERAQTATALRSAYRTFFGRNKRSAQKVLVLVTDG-----KIRDAKAKRQSQ 465 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCAS 338 KRRG +Y +G + L+ AS Sbjct: 466 SIKRRGVKIYVVGAGKYFNIKQLEAMAS 493 >gi|170743237|ref|YP_001771892.1| cell wall anchor domain-containing protein [Methylobacterium sp. 4-46] gi|168197511|gb|ACA19458.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium sp. 4-46] Length = 761 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 39/251 (15%), Positives = 79/251 (31%), Gaps = 34/251 (13%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 R +L+ +D+ L+ + F L + + + + + Sbjct: 314 RRVTLADGPVPADRDFALTWRAAPSAAPAVGLFRERVGEDEYLLAVVTPPEGRAPARRPR 373 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+D S SM + A S+ LD + R ++ F + Sbjct: 374 EVTFVIDNSGSMAGA------SMRQAKASLLVALDRLGPAD------RFNVIRFDDTMDL 421 Query: 230 TFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 FP + + L T+ P L A + Sbjct: 422 LFPAPVPADEAHRDAARRFVAALEARGGTEMLPPLRAALADPHPEEGDRVRQ-------- 473 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 I+FLTDG I N+E +F A R + ++ IG+ + + + A + Sbjct: 474 --IVFLTDG-----AIGNEEQIFSAISAGRGRSRLFMIGIGSAPNGHLMTHAAELGGGSY 526 Query: 343 YSVQNSRKLHD 353 ++ ++ + Sbjct: 527 TAIGTIDQVAE 537 >gi|296232327|ref|XP_002807820.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Callithrix jacchus] Length = 1018 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 73/214 (34%), Gaps = 16/214 (7%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D +++ VLD S S M + + L + V R G Sbjct: 38 PEKTDCPINVYFVLDTSESVAMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I H +K Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKSLQGISSFRRGTFTDCALANMTEQI------RLHGSK 151 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 152 GTVH---FAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 P Y + L D+ I ++ + + I K Sbjct: 207 TPHELYRSDYATMLPDS-TEIDQDTINRIIQVMK 239 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|162419860|ref|YP_001607152.1| von Willebrand factor type A domain-containing protein [Yersinia pestis Angola] gi|162352675|gb|ABX86623.1| von Willebrand factor type A domain protein [Yersinia pestis Angola] Length = 472 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 71/200 (35%), Gaps = 26/200 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S +++ +V+D S SM+ G ++K ML+I ++ +V Sbjct: 90 STRRSPINLALVIDRSTSMS---GERIEKAREEAILAVNMLNITDTLS---------VVA 137 Query: 223 FSSKIVQTFPLA--WGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P + I + + T G+ ++ +H+ + Sbjct: 138 YDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQV------DKHLNRE 191 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + II ++DG+ ++ E A ++G + IG+ + + + A Sbjct: 192 QVNR---IILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGY 248 Query: 338 SPDRFYSVQNSRKLHDAFLR 357 S V NS L AF + Sbjct: 249 SDGNHTFVANSADLEKAFTK 268 >gi|74196449|dbj|BAE34363.1| unnamed protein product [Mus musculus] Length = 650 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 71/202 (35%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 467 DIGFVIDGSSSV------GTSNFRTVLQFVANL---SKEFEISDTDTRVGAVQYTYEQR- 516 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 517 ---LEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 564 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A ++G I YAIG+ A D+ P D Sbjct: 565 KRKVMIIITDGRSYD------DVRIPAMAAYQKGVITYAIGIAWAAQDELEVMATHPAKD 618 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V + L+ RI + + Sbjct: 619 HSFFVDDFDNLYKIAPRIIQNI 640 >gi|301026928|ref|ZP_07190323.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|299879508|gb|EFI87719.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] Length = 575 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 49/336 (14%), Positives = 101/336 (30%), Gaps = 45/336 (13%) Query: 39 SHKFFVKAKLHYILDHS-LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR 97 ++ K L L + A K N G + F +K + Q Sbjct: 67 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 98 ENGFAQDINNIE----------RSTSLSIIID--------DQHKDYNLSAVSRYEMPFIF 139 + N + + I++ + S + M + Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 PW + + I + S+ +++ ++D S SM ++L + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTP 257 + ++ ++ + +VT++ P G I I+ L +T Sbjct: 241 KLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GLE AY + KG + I+ TDG+ + D K + + G Sbjct: 295 GLELAYQQATKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 Query: 318 IVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRK 350 + GV + + + A + + + Sbjct: 346 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 >gi|150007596|ref|YP_001302339.1| hypothetical protein BDI_0949 [Parabacteroides distasonis ATCC 8503] gi|255013875|ref|ZP_05286001.1| hypothetical protein B2_08202 [Bacteroides sp. 2_1_7] gi|149936020|gb|ABR42717.1| conserved hypothetical protein BatB [Parabacteroides distasonis ATCC 8503] Length = 339 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 49/137 (35%), Gaps = 15/137 (10%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+++M+ LDVS SM ++L A ++ + D + Sbjct: 80 SKLETVKRQGVEIMVCLDVSNSMLAEDVSP-NRLDKAK-------QMLSRLTDGFTNDKV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+ F+ P+ + ++ + + + A N Sbjct: 132 GLIVFAGDAFTQLPITSDYISAKMFLSSINPSMVSTQGTAIGAAIN-------LAARSFT 184 Query: 279 GHDDYKKYIIFLTDGEN 295 + K II +TDGEN Sbjct: 185 PDETTDKAIILITDGEN 201 >gi|156742635|ref|YP_001432764.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233963|gb|ABU58746.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 777 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 55/160 (34%), Gaps = 24/160 (15%) Query: 216 VRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 VR G + + AW Q I+ L T GL+ + + + A Sbjct: 353 VRDG--WYIDDVAL--GPAWDDVRARAQAAIDTLNSRGATSIGGGLQRSQHLLTSANPA- 407 Query: 274 EHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + I+ L+DG EN+SP + + + V+ IGV +A + Sbjct: 408 ---------IPRAIVLLSDGQENTSPYVADVLP-----PIRDAQTTVHTIGVGQDADQRL 453 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 + + A + + +L + I + ++ L Sbjct: 454 MLSIAAQTGGTYNYAPTPDQLARIYNTISGNVSNRQTLAT 493 >gi|314988185|gb|EFT32276.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] Length = 322 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 67/204 (32%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D SLSM + + D I S+P N +V+ S Sbjct: 96 IVVAIDSSLSMKADDVSP----TRLAAAKAKAKDFINSLPTGFN---VAVVSISEHPEIR 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ + T ++ + + A ++ A I+ L Sbjct: 149 MPPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVML 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG+N+ L N A VY I E + L Sbjct: 203 SDGDNTQGG----SPLVAANRAAAAKVPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTV 258 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + + ++ ++ KL + + ++ Sbjct: 259 ADRTHAKSWTADSADKLREVYQQV 282 >gi|282896561|ref|ZP_06304580.1| hypothetical protein CRD_00534 [Raphidiopsis brookii D9] gi|281198552|gb|EFA73434.1| hypothetical protein CRD_00534 [Raphidiopsis brookii D9] Length = 587 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 60/177 (33%), Gaps = 27/177 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + +M V+D S SM+ L +R I + N V G Sbjct: 403 KTQKDAGKTVYLMTVIDTSGSMSG------GPLEAVKNGLRIASQQI----NPGNYV--G 450 Query: 220 LVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LV++ + + LA + I+ L T G+ +++ + Sbjct: 451 LVSYGDQPINLVKLAPFDDLQHKRFLAGIDGLEADGATAMYDGVMVGLSELLQQR----- 505 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K + + K Y++ LTDG+ + + ++ G VY I + Sbjct: 506 --KTNPNGKFYLLLLTDGQTNQG-FNFEQVKEIIEY---SGVRVYPIAYGEVNEAEL 556 >gi|256839783|ref|ZP_05545292.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|256738713|gb|EEU52038.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 339 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 49/137 (35%), Gaps = 15/137 (10%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+++M+ LDVS SM ++L A ++ + D + Sbjct: 80 SKLETVKRQGVEIMVCLDVSNSMLAEDVSP-NRLDKAK-------QMLSRLTDGFTNDKV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+ F+ P+ + ++ + + + A N Sbjct: 132 GLIVFAGDAFTQLPITSDYISAKMFLSSINPSMVSTQGTAIGAAIN-------LAARSFT 184 Query: 279 GHDDYKKYIIFLTDGEN 295 + K II +TDGEN Sbjct: 185 PDETTDKAIILITDGEN 201 >gi|323138937|ref|ZP_08073998.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] gi|322395783|gb|EFX98323.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] Length = 482 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 63/488 (12%), Positives = 140/488 (28%), Gaps = 141/488 (28%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSL---LYT 59 F I F + KG ++I+ + + + + GL + + K++L D + + T Sbjct: 4 FGKIAGFTRDDKGGVAIIMGLAVIPLVLASGLAADYAIVQAAKSRLDASADAAALAAIKT 63 Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST-SLSIII 118 A + + + N + + + ++ ++ +AQ +I + Sbjct: 64 AQTTIAELSATNPNPRPQAIAAAM----------SQAEKSFYAQAGKRAADLLGKPAIDV 113 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 + ++ + MP F ++ LD ++LDVS Sbjct: 114 QIKGQEVTANVAYSAAMPSNFGRIAGVKLMNYNGGAGAQLTMAK-----FLDFYLLLDVS 168 Query: 179 LSMN-------------------DHFGPGMD---------------------KLGVATRS 198 SM + G ++ + Sbjct: 169 GSMGLPSTPAGEAALAAKNPDDLAQYPTGCRFACHFAGSQGYNVSRANNIQLRIDAVGAA 228 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH----IQEKINRLIFGSTTK 254 + ++++ K + R G+ F + L ++ ++ IN TT Sbjct: 229 VAQLMEKAKDTATLPKQYRVGVYPFVTHANAFVDLTDNLRGDQYSVESAINYDPATRTTD 288 Query: 255 STPGLEYAYNKIF-----------------------------------DAKEKLEHIAKG 279 L+ + +F +AK G Sbjct: 289 FGRLLDAGKDWVFARDLNPNYKANPNIPADVTPMGAGGSHIHNIFQDINAKIPSVGDGSG 348 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESL----------------------FYCNEAKRRGA 317 + ++ F++DG +S + + CN K RG Sbjct: 349 ASSPQPFVFFVSDGMQNSQSFVSATGTWPGVTPYPTPPGQTVSIRAMDPTLCNVLKARGI 408 Query: 318 IV---------------YAIGVQAEAADQF------LKNCASPDRFYSVQNSRKLHDAFL 356 V +A + +A D ++ CASP+ ++ + DA Sbjct: 409 TVSVLEIPYPTFTNPKPFAAAQEFKANDAVPNLSGAMRACASPNFYFMADTPEGIADAMK 468 Query: 357 RIGKEMVK 364 ++ ++ V+ Sbjct: 469 KMFEQAVQ 476 >gi|281354485|gb|EFB30069.1| hypothetical protein PANDA_020540 [Ailuropoda melanoleuca] Length = 1096 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 74/206 (35%), Gaps = 32/206 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ G + + + ++D + P+ R G+V +S + Sbjct: 550 DLVFLLDTSSSV------GKEDFEKVRQWVANLVDTFEVGPER---TRVGVVRYSDQPTT 600 Query: 230 TFPLA-WGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +G + + R + G T + L + F G +K+ Sbjct: 601 AFELGLFGSREAVKAAARHLAYHGGNTNTGDALRFITRHSFSP---QAGGRPGDRAFKQV 657 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I L G + +D + R G ++A+GV A A + L+ AS + Sbjct: 658 AILLPAGRSQDLVLDAAAAAH------RAGIRIFAVGVGA-ALKEELEEIASEPKSAHVF 710 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + F I K K R Sbjct: 711 HVSD-------FNAIDKIRGKLRRRL 729 >gi|198426775|ref|XP_002120099.1| PREDICTED: similar to SD03168p [Ciona intestinalis] Length = 1474 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 39/240 (16%), Positives = 76/240 (31%), Gaps = 35/240 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 +C + S +I +D++ +LD S S+ + + I Sbjct: 140 LGYCYAGTEFGKTWMSISRIDPFECPKVDILFLLDGSGSIVES------DFEIMKEWIEN 193 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-----NRLIFGSTTKST 256 + + + V GL+ FS + G+ +E+I N I T + Sbjct: 194 ITLSFDISSNGS--VAVGLMQFSHFSLTKTEFQIGMFTTKEEIMAAMKNVTIKKGNTYTA 251 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L + + + +K I+ LTDGE + D + + +G Sbjct: 252 DALRRSIAVFQKSSRY------NDTNTRKVIVLLTDGEAT----DTASLSSTADLVRSQG 301 Query: 317 AIVYAIGV----------QAEAADQFLKN--CASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + A+ + A A Q + N +P + V + L I + + Sbjct: 302 ITITAVLITEKVLPSERSAAVAQMQLIVNGVAGNPSGVFVVGTTANLDSVIRAITQRIQS 361 >gi|167757049|ref|ZP_02429176.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] gi|167703224|gb|EDS17803.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] Length = 965 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 56/269 (20%), Positives = 93/269 (34%), Gaps = 68/269 (25%) Query: 163 SKSDIGLDMMMVLDVSLSM-NDHFGPGMD-----KLGVATRSIREMLDIIKSIPDVNNVV 216 S IG D+++V D+S SM D G +L A + E L+ K + N Sbjct: 80 SVEAIGNDIVLVFDISNSMAEDEHGNSTSSNDKKRLTKAKNAAIEFLNNSKISGNKKN-- 137 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAK-- 270 R +VTF+ L ++ ++ I + G T GL A + +AK Sbjct: 138 RYSIVTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNIQAGLYKARTVLKNAKSE 197 Query: 271 ----------EKLEHIAKGHDDYKKYII------FLTD-----------GEN-------- 295 ++ Y++ TD GEN Sbjct: 198 NGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGYSGRYTFGENAINYDSVI 257 Query: 296 ---------------SSPNIDNKESLFYCNEA---KRRGAIVYAIGVQAEAA-DQFLKNC 336 S +++N + NEA K+ G ++ IG ++ + FLKN Sbjct: 258 KGGRNDFTLDLYLNNSHYSLNNAAATLAENEALLAKKSGNTIFTIGYTTGSSVNSFLKNV 317 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A+ Y+ +S L + I E+V + Sbjct: 318 ATQGEGYAYSSSSDLSGIYENIANEIVTR 346 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 44/271 (16%), Positives = 82/271 (30%), Gaps = 77/271 (28%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-- 223 D++++LD S SM++ G +L ++ + I + + N R ++TF Sbjct: 610 KAPQDVVLLLDKSGSMDESMN-GSSRLTHLKNNVIKF---ITKLYEHNPDSRVSVITFAY 665 Query: 224 -------SSKIVQTFPLAWGVQHIQEKINR-------LIFGSTTKSTPGL---------- 259 ++ V+ + G + + + + T+ GL Sbjct: 666 SADGSITNNNFVKLSDIKSGNETWYTYLTKNNGGIKNIKASGGTQIDLGLYEVRNQLSSA 725 Query: 260 ---------------------EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS- 297 +YN D ++ A D+ K+ LT G N+ Sbjct: 726 TGENNRSVIVFTDGQPGNKGFNTSYNDYDDNGYRVGAEALNQADFIKFSGNLT-GINNYI 784 Query: 298 ---------------------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFL 333 N N K G ++ IG+ + + D FL Sbjct: 785 ESSNGSKYYGHKNDDITKNRSNNNSNDAGNRTNRSGKGLGKTIFTIGLNSNNSSLFDSFL 844 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 AS + NS + +AF I + Sbjct: 845 TRLASEGHYTKANNSSAMENAFNSIFTSITT 875 >gi|16130205|ref|NP_416773.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|89109088|ref|AP_002868.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|157161758|ref|YP_001459076.1| von Willebrand factor type A domain-containing protein [Escherichia coli HS] gi|238901445|ref|YP_002927241.1| hypothetical protein BWG_2044 [Escherichia coli BW2952] gi|256022046|ref|ZP_05435911.1| hypothetical protein E4_01620 [Escherichia sp. 4_1_40B] gi|300948978|ref|ZP_07163036.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|300956471|ref|ZP_07168759.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|301647634|ref|ZP_07247429.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|307138934|ref|ZP_07498290.1| hypothetical protein EcolH7_12533 [Escherichia coli H736] gi|331642908|ref|ZP_08344043.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] gi|2495629|sp|P76481|YFBK_ECOLI RecName: Full=Uncharacterized protein yfbK gi|1788606|gb|AAC75330.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|85675335|dbj|BAE76678.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|157067438|gb|ABV06693.1| von Willebrand factor type A domain protein [Escherichia coli HS] gi|238860346|gb|ACR62344.1| conserved protein [Escherichia coli BW2952] gi|260448637|gb|ACX39059.1| von Willebrand factor type A [Escherichia coli DH1] gi|300316719|gb|EFJ66503.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|300451549|gb|EFK15169.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|301074238|gb|EFK89044.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|309702582|emb|CBJ01910.1| putative lipoprotein [Escherichia coli ETEC H10407] gi|315136904|dbj|BAJ44063.1| hypothetical protein ECDH1ME8569_2207 [Escherichia coli DH1] gi|331039706|gb|EGI11926.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] Length = 575 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 49/336 (14%), Positives = 101/336 (30%), Gaps = 45/336 (13%) Query: 39 SHKFFVKAKLHYILDHS-LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR 97 ++ K L L + A K N G + F +K + Q Sbjct: 67 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 98 ENGFAQDINNIE----------RSTSLSIIID--------DQHKDYNLSAVSRYEMPFIF 139 + N + + I++ + S + M + Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 PW + + I + S+ +++ ++D S SM ++L + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTP 257 + ++ ++ + +VT++ P G I I+ L +T Sbjct: 241 KLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GLE AY + KG + I+ TDG+ + D K + + G Sbjct: 295 GLELAYQQATKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 Query: 318 IVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRK 350 + GV + + + A + + + Sbjct: 346 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 >gi|237735881|ref|ZP_04566362.1| predicted protein [Mollicutes bacterium D7] gi|229381626|gb|EEO31717.1| predicted protein [Coprobacillus sp. D7] Length = 965 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 56/269 (20%), Positives = 93/269 (34%), Gaps = 68/269 (25%) Query: 163 SKSDIGLDMMMVLDVSLSM-NDHFGPGMD-----KLGVATRSIREMLDIIKSIPDVNNVV 216 S IG D+++V D+S SM D G +L A + E L+ K + N Sbjct: 80 SVEAIGNDIVLVFDISNSMAEDEHGNSTSSNDKKRLTKAKNAAIEFLNNSKISGNKKN-- 137 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAK-- 270 R +VTF+ L ++ ++ I + G T GL A + +AK Sbjct: 138 RYSIVTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNIQAGLYKARTVLKNAKSE 197 Query: 271 ----------EKLEHIAKGHDDYKKYII------FLTD-----------GEN-------- 295 ++ Y++ TD GEN Sbjct: 198 NGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGYSGRYTFGENAINYDSVI 257 Query: 296 ---------------SSPNIDNKESLFYCNEA---KRRGAIVYAIGVQAEAA-DQFLKNC 336 S +++N + NEA K+ G ++ IG ++ + FLKN Sbjct: 258 KGGRNDFTLDLYLNNSHYSLNNAAATLAENEALLAKKSGNTIFTIGYTTGSSVNSFLKNV 317 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A+ Y+ +S L + I E+V + Sbjct: 318 ATQGEGYAYSSSSDLSGIYENIANEIVTR 346 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 44/271 (16%), Positives = 82/271 (30%), Gaps = 77/271 (28%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-- 223 D++++LD S SM++ G +L ++ + I + + N R ++TF Sbjct: 610 KAPQDVVLLLDKSGSMDESMN-GSSRLTHLKNNVIKF---ITKLYEHNPDSRVSVITFAY 665 Query: 224 -------SSKIVQTFPLAWGVQHIQEKINR-------LIFGSTTKSTPGL---------- 259 ++ V+ + G + + + + T+ GL Sbjct: 666 SADGSITNNNFVKLSDIKSGNETWYTYLTKNNGGIKNIKASGGTQIDLGLYEVRNQLSSA 725 Query: 260 ---------------------EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS- 297 +YN D ++ A D+ K+ LT G N+ Sbjct: 726 TGENNRSVIVFTDGQPGNKGFNTSYNDYDDNGYRVGAEALNQADFIKFSGNLT-GINNYI 784 Query: 298 ---------------------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFL 333 N N K G ++ IG+ + + D FL Sbjct: 785 ESSNGSKYYGHKNDDITKNRSNNNSNDAGNRTNRSGKGLGKTIFTIGLNSNNSSLFDSFL 844 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 AS + NS + +AF I + Sbjct: 845 TRLASEGHYTKANNSSAMENAFNSIFTSITT 875 >gi|260797293|ref|XP_002593638.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] gi|229278864|gb|EEN49649.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] Length = 388 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 72/193 (37%), Gaps = 25/193 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ G + ++ + P G++ +S++ Sbjct: 6 PLDIIFLLDGSGSV------GASNFEKVKQFTKKAISGFDISPSGTQ---VGVIQYSTRT 56 Query: 228 VQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F + + + I+ + T + + Y F + A+ Sbjct: 57 RQEFSMNSFVTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKSD----GARPGVP-- 110 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K +I +TDG + EA+++G VYAIGV ADQ L+ AS + + Sbjct: 111 KVVIVVTDGVSYDAVAAP------ALEAQQKGITVYAIGVSGYDADQ-LEQIASNNNTLA 163 Query: 345 VQNSRKLHDAFLR 357 ++ L D Sbjct: 164 FVDNFNLLDNLRN 176 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 62/168 (36%), Gaps = 24/168 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ G + ++ + P G++ +S++ Sbjct: 237 PLDIIFLLDGSGSV------GASNFEKVKQFTKKAISGFDISPSGTQ---VGVIQYSTRT 287 Query: 228 VQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F + + + I+ + T + + Y F + A+ Sbjct: 288 RQEFSMNSFVTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKSD----GARPGVP-- 341 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K +I +TDG + EA+++G VYAIGV ADQ Sbjct: 342 KVVIVVTDGVSYDAVAAP------ALEAQQKGITVYAIGVSGYDADQL 383 >gi|262381905|ref|ZP_06075043.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|301310438|ref|ZP_07216377.1| BatB protein [Bacteroides sp. 20_3] gi|262297082|gb|EEY85012.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|300832012|gb|EFK62643.1| BatB protein [Bacteroides sp. 20_3] Length = 339 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 49/137 (35%), Gaps = 15/137 (10%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ + G+++M+ LDVS SM ++L A ++ + D + Sbjct: 80 SKLETVKRQGVEIMVCLDVSNSMLAEDVSP-NRLDKAK-------QMLSRLTDGFTNDKV 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GL+ F+ P+ + ++ + + + A N Sbjct: 132 GLIVFAGDAFTQLPITSDYISAKMFLSSINPSMVSTQGTAIGAAIN-------LAARSFT 184 Query: 279 GHDDYKKYIIFLTDGEN 295 + K II +TDGEN Sbjct: 185 PDETTDKAIILITDGEN 201 >gi|295669664|ref|XP_002795380.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] gi|226285314|gb|EEH40880.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] Length = 773 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 78/203 (38%), Gaps = 27/203 (13%) Query: 170 DMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTF 223 D+++ +DVS SM P D+ G + +LD+ K I +N R G+VTF Sbjct: 75 DIVLCIDVSGSMQLSAPLPTTDESGKREETGLSVLDLTKHAARTIIETLNENDRLGVVTF 134 Query: 224 SSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S+ + ++ + E + L ++T GL+ + + + +++ Sbjct: 135 SNDAEVAYKISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGLSVLGKVDLRPQNV--- 191 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKES-----LFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + + LTDG+ + + ++ K R +++ G + L+ Sbjct: 192 -----QALYVLTDGQPNH--MCPRQGYVPKLRPILERQKDRLPLIHTFGFGYDIRSGLLQ 244 Query: 335 NCA--SPDRFYSVQNSRKLHDAF 355 + A + + ++ + F Sbjct: 245 SIAEVGGGTYSFIPDAGMIGTVF 267 >gi|221135528|ref|XP_002156134.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 194 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 70/196 (35%), Gaps = 27/196 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + D+ +LD S S+ + D L S N +G+V Sbjct: 6 TPDCEGFFDVGFILDSSGSLKSQYWKEKDFLKKLANSFGI----------SNKGSHAGVV 55 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 TFS + L + + ++R+ S T+ L A + Sbjct: 56 TFSHYAELSIRLDAFYSSIDFNDAVDRISHMDSFTRIDLALAKALELF--DIKNGARNDV 113 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAEAAD-QFLKNC 336 + + LTDG+ + + L +E K++G ++A+G+ A A + K Sbjct: 114 PNLLF-----LLTDGKQ-----EPEMPLTHISDEIKQKGIQLFAVGIGAGANKTELEKIV 163 Query: 337 ASPDRFYSVQNSRKLH 352 +P+ + V + KL Sbjct: 164 GNPENVFMVDDFDKLL 179 >gi|149437043|ref|XP_001515962.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 948 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 74/201 (36%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N+ VR ++ Sbjct: 314 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDQFSVIDFNHNVR----SWKD 363 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 +V L ++ I ++ T L A + + ++ + Sbjct: 364 NLVPATDLM--TTDAKKYIEKIQPNGGTNINEALLRAIFIL----REASNLGMLDPNSVS 417 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + R ++++G+ + FL+ + + Sbjct: 418 LIILVSDGDPTVGELKPTVIQKNVKKNMRDNISLFSLGIGFDVDYDFLERLSRENHGMAQ 477 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 478 RIYGNQDTSSQLKQFYNQVST 498 >gi|194386850|dbj|BAG59791.1| unnamed protein product [Homo sapiens] Length = 543 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 70/201 (34%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N +R+ Sbjct: 73 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRT------W 120 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + I ++ T L A + +A Sbjct: 121 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS---- 176 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + E + ++++G+ + FLK ++ + Sbjct: 177 LIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQ 236 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 237 RIYGNQDTSSQLKKFYNQVST 257 >gi|260813586|ref|XP_002601498.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] gi|229286795|gb|EEN57510.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] Length = 384 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 72/193 (37%), Gaps = 25/193 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ G + ++ + P G++ +S++ Sbjct: 6 PLDIIFLLDGSGSV------GASNFEKVKQFTKKTISGFDISPSGTQ---VGVIQYSTRT 56 Query: 228 VQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F + + + I+ + T + + Y F + A+ Sbjct: 57 RQEFSMNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKSD----GARPGVP-- 110 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K +I +TDG + EA+++G VYAIGV ADQ L+ AS + + Sbjct: 111 KVVIVVTDGVSYDAVAAP------ALEAQQKGITVYAIGVSGYDADQ-LEQIASNNNTLA 163 Query: 345 VQNSRKLHDAFLR 357 ++ L D Sbjct: 164 FVDNFNLLDNLRN 176 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 62/168 (36%), Gaps = 24/168 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ G + ++ + P G++ +S++ Sbjct: 233 PLDIIFLLDGSGSV------GASNFEKVKQFTKKTISGFDISPSGTQ---VGVIQYSTRT 283 Query: 228 VQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F + + + I+ + T + + Y F + A+ Sbjct: 284 RQEFSMNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKSD----GARPGVP-- 337 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K +I +TDG + EA+++G VYAIGV ADQ Sbjct: 338 KVVIVVTDGVSYDAVAAP------ALEAQQKGITVYAIGVSGYDADQL 379 >gi|71280576|ref|YP_269044.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71146316|gb|AAZ26789.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 618 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 76/197 (38%), Gaps = 15/197 (7%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + ++ L +++ D+S+S + H + V S+ + ++S+ D Sbjct: 418 QSYRGNNRDLSCLLLADLSMSTDSHLDNDNRVIDVVQDSLLLFGEALQSVGDN-----FA 472 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + FSS +++ EK N + G TPG Y ++ A + + Sbjct: 473 MYGFSSVKRSNIRFTM-LKNFNEKYNDHVRGRIQAITPGF---YTRMGAAIRQATKVISE 528 Query: 280 HDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 K ++ LTDG+ + +++ NEAKR G + I + +A ++L Sbjct: 529 QKTADKLLLILTDGKPNDIDHYEGRFGIEDTHQAINEAKRLGIKPFCITIDVDA-QEYLP 587 Query: 335 NCASPDRFYSVQNSRKL 351 D F + +L Sbjct: 588 YLFGNDGFTQILRPAQL 604 >gi|328712312|ref|XP_003244777.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Acyrthosiphon pisum] Length = 919 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 52/335 (15%), Positives = 102/335 (30%), Gaps = 43/335 (12%) Query: 57 LYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 L I N + + I I + E+G + I S ++ Sbjct: 250 LQVIVDIEESSNITTLEVPDIKTANEIETTISKNKLAKISYESG---NKATITWSPTVKE 306 Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 + + Y++ + + + + ++ VLD Sbjct: 307 QLTFTEHGVKGQFIVHYDVDHKSAPNQVLIDDGYFVHFFA----PTDLKPLRTHVIFVLD 362 Query: 177 VSLSM--------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG-----LVTF 223 VS SM + G + ++ + + +N + V+ Sbjct: 363 VSGSMVGQKLPQVKEAMGQILSEIHSEDFFTLILFSDFAQVWTINATQETSNHWDEKVS- 421 Query: 224 SSKIVQTFPLAW-------------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + K L VQ+ ++ I L S+T L A+ AK Sbjct: 422 NWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQDLQSESSTNMEDALNKAHLI---AK 478 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 G + K I+FLTDGE ++ + +E + Y + +Y++G A Sbjct: 479 LGETRFKDGANTPKPIIVFLTDGEPTTGITEPQELIKYVSNTNEEKYPIYSLGFGEGADI 538 Query: 331 QFLKNCASPDR-----FYSVQNSR-KLHDAFLRIG 359 FLK + + Y ++ +L + + I Sbjct: 539 DFLKKLSLNNTGFARVIYEASDASLQLRNFYKEIS 573 >gi|304437812|ref|ZP_07397761.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369169|gb|EFM22845.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 255 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 68/188 (36%), Gaps = 27/188 (14%) Query: 168 GLDMMMVLDVSLSMN---------------------DHFGPGMDKLGVATRSIREMLDII 206 + + + LD S SM G ++ + I + I Sbjct: 19 RVPVCLCLDTSGSMAAVEAGSYVGTGETIRQDGKLWQIVEGGKSRIQELQKGIEMFFEAI 78 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 ++ + +VTF ++ A ++ + +L T G+ A + + Sbjct: 79 RTDILAADSAEISIVTFDNEAKCLLDFA---NIERQTVPQLHANGLTAMGEGVNLALDLL 135 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQ 325 A+ K E+ KG D Y+ +++ ++DGE + + + + E G V+ IG+ Sbjct: 136 --AQRKKEYQDKGVDYYQPWLVLMSDGEPNGDPTELRRATQRVTELVNAGKLTVFPIGIG 193 Query: 326 AEAADQFL 333 +E L Sbjct: 194 SEPGMDAL 201 >gi|257462368|ref|ZP_05626782.1| magnesium chelatase [Fusobacterium sp. D12] gi|317060032|ref|ZP_07924517.1| magnesium chelatase [Fusobacterium sp. D12] gi|313685708|gb|EFS22543.1| magnesium chelatase [Fusobacterium sp. D12] Length = 605 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 72/206 (34%), Gaps = 30/206 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + IG ++ V+D S SM ++ +I +L + LV Sbjct: 417 REKRIGTHILFVVDSSGSMG-----AKKRMRAVKGAIFSLLQ-----DAYEKRDKVALVA 466 Query: 223 FSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F K + + ++ ++++ L G T GL AY I + K + Sbjct: 467 FRKKSAEELLSMTRSIELAKKQLQNLATGGKTPLAEGLFKAYQLI------RQLKKKDGE 520 Query: 282 DYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-- 336 Y ++ ++DG + +ESL + K+ G ++ + E L+ Sbjct: 521 IYP-LLVLISDGRANISLHGRDPIEESLEMARKIKKEGIS--SVVIDTEEGFTLLEMAKN 577 Query: 337 ---ASPDRFYSVQN--SRKLHDAFLR 357 A +Y ++N + + + Sbjct: 578 ISEAMGAEYYRLENIQAEDMLKLLKK 603 >gi|291242484|ref|XP_002741138.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 765 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 70/202 (34%), Gaps = 35/202 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM+ D++ +S ++ + + G+V FSS V Sbjct: 284 VVLVLDTSGSMDG------DRIQRLHQSATYFIET-----RIEDGSFVGIVGFSSYAVIH 332 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + I + + T GL A + D E + Sbjct: 333 SGITEIKYGFQRGEIASNV-PQVASGATSIGDGLRVALQVLQDGNVTSEGAS-------- 383 Query: 286 YIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 ++ +TDG EN+ P + + E G V I EAA L+ + + + Sbjct: 384 -LLLITDGIENTYPLL-----MNVMQEVYDSGVRVDTIAY-TEAAQSTLQELSDNTGGLY 436 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 + V ++ + + + Sbjct: 437 FYVPDNDTSTAFIDSLAATISE 458 >gi|238782874|ref|ZP_04626903.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] gi|238716297|gb|EEQ08280.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] Length = 530 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 46/234 (19%), Positives = 86/234 (36%), Gaps = 26/234 (11%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F N KG+I + I+LP ++ L E S KAKL ++ + L L Sbjct: 31 RKFIKNDKGAILLPFIIILPFFIALLFLSFEISQLLQKKAKLSDAIEQATLA-----LTV 85 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 EN + + + + ++ N L F+ +I+ + Y Sbjct: 86 ENDDLPDELQMRKNVDLVSNFSSA----YLPLEHFSVPEIDIKNNCGQLTYNAKITMSYF 141 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 + +S+ M T N + + T D D++ V D S SMN+ F Sbjct: 142 ANFLSKTAMTNAITTIGTEDNGAAIKQVSTI-------QDKATDVIFVADYSGSMNEGFH 194 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 + + G ++R++ + + N+ + ++ P +WG + I Sbjct: 195 GKVPR-GEKINALRDVFNRLNGSILKNS---------NINLIGFVPFSWGTKRI 238 Score = 38.3 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 43/139 (30%), Gaps = 26/139 (18%) Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + T + G+ YA N +F + + K ++ ++DG + N + Sbjct: 394 AMSPLGQTLVSSGILYA-NTLFKKESNNSNN--------KLMVIISDGIDVFINDTTIQQ 444 Query: 306 LFY----------CNEAKRRGAIVYAIGV-------QAEAADQFLKNCASPDRFYSVQNS 348 Y C K + I + A K C D +Y V ++ Sbjct: 445 SIYISKTLIDKGMCERIKENNIKMVFIAIKDGSNETNEPANYIDWKKCVGEDNYYYVSDA 504 Query: 349 RKLHDAFLRIGKEMVKQRI 367 +L A + + + Sbjct: 505 HELEAALRQSLTTTSSEVV 523 >gi|326426681|gb|EGD72251.1| hypothetical protein PTSG_11571 [Salpingoeca sp. ATCC 50818] Length = 1748 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 41/192 (21%), Positives = 68/192 (35%), Gaps = 28/192 (14%) Query: 154 LITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +S V + + D+++VLD S S+ L A+ ++++ Sbjct: 30 PSSSLVAATPDCTRTDFDLVLVLDESGSVG--MEDWQHTLTFASHF-------VRALDTE 80 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLI----FGSTTKSTPGLEYAYNKIF 267 N ++ VTFS+ V F L +E I N L G +T + LE + + Sbjct: 81 TNSIQVAAVTFSTDPVLQFSL--NTYSTEEHITNALTSLPYAGKSTDTGAALELVLSGVL 138 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D + ++ +TDG P E L + A V+A+GV Sbjct: 139 DNPAGGYRGGRA------VVVVMTDGHTQDP-----ERLQLAAPLLKARADVFAVGVGDN 187 Query: 328 AADQFLKNCASP 339 L AS Sbjct: 188 INVPELFLIASA 199 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 63/182 (34%), Gaps = 29/182 (15%) Query: 166 DIGLDMMMVLDVS---LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D D++ LD S S + + ++ + +++ R V+ Sbjct: 410 DTAFDVVFALDASRAVSSADWSY------------TLEHLRTTMRAFDFGEGATRVAAVS 457 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F + F L I I RL GS T + LE ++ + + + G Sbjct: 458 FGDRTRVGFTLEHVASPADIDSAIARLAYIGSATDTGLALEVVHSSVLE-------LVGG 510 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 ++ +T ++P D L R G +V +IG+ A L + ++P Sbjct: 511 RVTRPTMVVLVT----AAPASDEARLLSAAGALMRSGVLVSSIGIGAGVDAAQLTSISTP 566 Query: 340 DR 341 D Sbjct: 567 DN 568 >gi|295093780|emb|CBK82871.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 549 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 43/179 (24%), Positives = 67/179 (37%), Gaps = 29/179 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + + + V D S SM+ D + S+ I + NN V G Sbjct: 364 KKTKDNGKDIIAVFVADCSGSMDG------DPMNQLKNSLTNGAQYI----NDNNYV--G 411 Query: 220 LVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LV++SS + P+A + Q +N L+ T S + A I DAK Sbjct: 412 LVSYSSSVTVEVPIAQFDLNQRSYFQGSVNNLMASGGTASYDAVVVAMKMITDAK----- 466 Query: 276 IAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 H D K + L+DG N ++D + ++ VY IG +A L Sbjct: 467 --AEHPDAKCMLFLLSDGYANVGYSMD-----EITSALRQSNIPVYTIGYGGDADTDEL 518 >gi|149919617|ref|ZP_01908096.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] gi|149819560|gb|EDM78988.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] Length = 1168 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 68/195 (34%), Gaps = 20/195 (10%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+ P S +T + + D + +VLD S SM+ G ++ A Sbjct: 550 FMPAGTPNSDPRSPLTDCVTHTATVHHDFDAYDTVALVLDRSKSMDQDQGSR-KRIEWAQ 608 Query: 197 RSIREMLDIIKSIPDVNNVVRS-------GLVTFSSKIVQ----TFPLAWGVQHIQEKIN 245 R+I + D + V +R + F + + I+E + Sbjct: 609 RAILKWTDHVADGGSVQASLRKFNQDAPPAVFGFKTVLDAVVGGETATEIDSTAIEEYLE 668 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + +T ++ A + + + + + I +TDGE +S + D ++ Sbjct: 669 DIEPDGSTAIGDAIDAAVAALMAHDDLDPNSSNNNA-----IFLITDGEQTSGDKDVCDA 723 Query: 306 LFYCNEAKRRGAIVY 320 L + AK VY Sbjct: 724 LE--DAAKDD-VPVY 735 >gi|254459050|ref|ZP_05072473.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084321|gb|EDZ61610.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 615 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 71/182 (39%), Gaps = 28/182 (15%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P++ V + +KS D+M+ LD+S SM + P ++L +A +L S Sbjct: 72 PVIDDGKVVVKAKSA---DIMIALDISDSMLAEDVYP--NRLELAKEKALTLLSEAPSE- 125 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 R G++ F+ PL++ + + +L S T+ + ++ Sbjct: 126 ------RVGIMAFAKNSYLVSPLSFDTGAVSFLLKQLDTTSITQKGTDFLSILDVFNTSQ 179 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E +KY++ L+DG D+KE AK+ +V+ +GV Sbjct: 180 ENDG---------EKYLLILSDG------GDSKEFSKEIELAKKSNIVVFILGVGTVKGA 224 Query: 331 QF 332 Sbjct: 225 PI 226 >gi|15921064|ref|NP_376733.1| hypothetical protein ST0830 [Sulfolobus tokodaii str. 7] gi|15621848|dbj|BAB65842.1| 381aa long hypothetical protein [Sulfolobus tokodaii str. 7] Length = 381 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 44/172 (25%), Positives = 65/172 (37%), Gaps = 30/172 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G +++LD S SM K+ A + ++L+ IP N + +TFSS Sbjct: 39 TGFHYIILLDTSGSMAGI------KIETAKQGALQLLN---KIPPGNKIT---FITFSST 86 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + A + E I+ + T L A IAK H Y Sbjct: 87 VNTLIEFADTSGSVGETISSVTAQGNTVLYTALSTAIQ-----------IAKKHGIPG-Y 134 Query: 287 IIFLTDGENSS-PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 II LTDG + N D E L + + G V + G+ + +Q LK A Sbjct: 135 IILLTDGNPTDLTNTDAYEKLQFPD-----GFKVISFGIGDDYNEQLLKVLA 181 >gi|73992734|ref|XP_543096.2| PREDICTED: similar to Protein KIAA1510 precursor [Canis familiaris] Length = 1405 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 58/173 (33%), Gaps = 32/173 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +DM+ ++D S S+ + +++ + P V+ GL +S Sbjct: 203 SPVDMIFLVDGSWSIGHSH------FQQVKDFLASVIEPFEIGPSK---VQVGLTQYSGD 253 Query: 227 IVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 W + ++ K + L G T + L + L A Sbjct: 254 PQTE----WDLNALRTKEDVLAAVRRLRYKGGNTFTGLALTHVLEH------NLRPAAGP 303 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K +I +TDG++ ++ K G ++A+GV+ + Sbjct: 304 RPEATKVLILVTDGKSQD------DARAAGRILKDLGVAIFAVGVKNADEAEL 350 >gi|262091909|gb|ACY25458.1| putative von Willebrand factor type A domain-containing protein [uncultured microorganism] Length = 621 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 73/195 (37%), Gaps = 24/195 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 ++++ V D S SM + G +D A +++ L V ++ R G++ FS Sbjct: 259 PVNVIFVADASGSMAE--GNRIDIARAALQALWASL--------VPDLDRVGMIQFSVDP 308 Query: 228 VQTF--PLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + +Q I+RL+ T G++ DA++ D Sbjct: 309 IPASFVPHTRPDSEFLQASIDRLLPYYGTNVQAGIDLGVQLANDARQAWP-------DSD 361 Query: 285 KYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAE-AADQFLKNCASPDR- 341 Y++ ++DG + D L + + + IGV D L+ A Sbjct: 362 NYVVLISDGVANVDATDPFAILRSAGEDDESNPIRLITIGVGIGHYNDVLLEQLAQYGNG 421 Query: 342 -FYSVQNSRKLHDAF 355 +Y + + + + F Sbjct: 422 WYYYIDSPEQAWETF 436 >gi|194433366|ref|ZP_03065646.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|194418460|gb|EDX34549.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|320178755|gb|EFW53718.1| hypothetical protein SGB_04028 [Shigella boydii ATCC 9905] gi|332090753|gb|EGI95846.1| von Willebrand factor type A domain protein [Shigella dysenteriae 155-74] Length = 575 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 49/336 (14%), Positives = 101/336 (30%), Gaps = 45/336 (13%) Query: 39 SHKFFVKAKLHYILDHS-LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR 97 ++ K L L + A K N G + F +K + Q Sbjct: 67 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 98 ENGFAQDINNIE----------RSTSLSIIID--------DQHKDYNLSAVSRYEMPFIF 139 + N + + I++ + S + M + Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 PW + + I + S+ +++ ++D S SM ++L + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTP 257 + ++ ++ + +VT++ P G I I+ L +T Sbjct: 241 KLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GLE AY + KG + I+ TDG+ + D K + + G Sbjct: 295 GLELAYQQAAKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 Query: 318 IVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRK 350 + GV + + + A + + + Sbjct: 346 TLSTFGVGDDNYNEAMMVRIADVGNGNYSYIDTLSE 381 >gi|149440247|ref|XP_001521494.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3, partial [Ornithorhynchus anatinus] Length = 390 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 65/172 (37%), Gaps = 13/172 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVTFSSKIVQ 229 ++ V+DVS SM KL ++ ++L+ +K +N + S + T+ +++ Sbjct: 214 VVFVIDVSGSM------YGRKLVQTKEALLKILEDMKEEDYLNFILFSSEITTWKDTLIK 267 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P ++ +E + + T GL + +A+E + II Sbjct: 268 ATPE--NLKKAKEFVKNIKDEGLTNINDGLMRGIKMLNEARETNVVPKRSTS----LIIM 321 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 LTDGE + I + A +Y +G + L+ A + Sbjct: 322 LTDGEANVGEIRADKIQENVRNAIGGKFPLYNLGFGYDLNYNLLEKMALENH 373 >gi|327490425|gb|EGF22209.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1058] Length = 462 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 49/244 (20%), Positives = 83/244 (34%), Gaps = 29/244 (11%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N +KQ D S K+++ E + G QD I + + D +K +A+ Sbjct: 106 NKQKQDWDVSELGTKSLYNMKLDLEFKTEGAYQDNRLISYNLTGK-YPDTNNKLGIDTAI 164 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S +F + + + + V D S SMN Sbjct: 165 SALNTKQVFSKVAKGKKGVAIAYRTD-----PIQGQMNIAVSFVFDTSGSMNWDLQGRET 219 Query: 191 KLGVATRSI----REMLDIIKSIPDVNNVVRSGLVTFSSKI----VQTFPLAWGVQHIQE 242 K + ++ + +IK + ++ N + LV FS+ L G I Sbjct: 220 KKSGNESRMDILRKKSVIMIKDLAEIGN-ISVNLVGFSTSAKYIQQNFSNLDNGTNTIIA 278 Query: 243 KINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 IN+ L T GL Y + +L KYI+ LTDG ++ Sbjct: 279 TINKRENLNPDGVTNPGDGLRYGMISLQSQPAQL-----------KYIVLLTDGIPNAYL 327 Query: 300 IDNK 303 +D++ Sbjct: 328 VDSR 331 >gi|301609304|ref|XP_002934186.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 934 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 43/195 (22%), Positives = 76/195 (38%), Gaps = 35/195 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQ 229 + +VLDVS SM+ G +L + ++++ N G+V F SS V Sbjct: 305 VSLVLDVSGSMSSSNRIG-RQLQAVELFVVQIIE---------NGAHVGIVKFSSSASVV 354 Query: 230 TFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + + Q ++++ LI G T G+ + Y Sbjct: 355 SSLVKINTQAQRDQLKSLIPRTAGGGTNICAGIRAGI---------ALNKNFDGSSYGTE 405 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYS 344 I+ LTDGE DN ++ + GAI++ I + AA + L+ A + ++ Sbjct: 406 IVLLTDGE------DNLDTSLCFKDITDSGAIIHVIALGPNAAKE-LETIANMTGGLRFN 458 Query: 345 VQN---SRKLHDAFL 356 + + +L DAF Sbjct: 459 ALDKVEANELIDAFS 473 >gi|295395241|ref|ZP_06805449.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] gi|294972003|gb|EFG47870.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] Length = 324 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 71/235 (30%), Gaps = 46/235 (19%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + + +T+ + D+M+ LD S SM + +D +S + Sbjct: 67 IGGASALAGISRPVTTETLNPEQKQ--RDVMLCLDASGSMASYNAKILDTYADLIKSFKG 124 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 R G+ F+S V FPL + E + G + G+ + Sbjct: 125 E--------------RIGMTVFNSAAVSVFPLTTDYEMASEFLEDAQLGFESNGLRGINF 170 Query: 262 AYNKIFDAKEKLE------------HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 Y D + + ++ + +IF TD N E + Sbjct: 171 -YQGTVDRSIDGSSLIGDGLASCLNNFDRNDEERSRSVIFATD--NQLAGNPIYELMEAA 227 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 AK+ VYA+ P F++ +L A G EM Sbjct: 228 ELAKKHKVRVYALA---------------PKGFFAASKLDELKQAAELTGGEMFT 267 >gi|149050644|gb|EDM02817.1| similar to vitrin (predicted) [Rattus norvegicus] Length = 427 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 70/202 (34%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 244 DIGFVIDGSSSV------GTSNFRTVLQFVANL---SKEFEISDTDTRIGAVQYTYEQR- 293 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 294 ---LEFGFDKYNSKADVLSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 341 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A ++G I YAIG+ A D+ P D Sbjct: 342 KRKVMILITDGRSYD------DVRIPAMAAYQKGVITYAIGIAWAAQDELEVIATHPARD 395 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 396 HSFFVDEFDNLYKFVPRIIRNI 417 >gi|254412101|ref|ZP_05025876.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196181067|gb|EDX76056.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 570 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 70/203 (34%), Gaps = 30/203 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +++V+D S SM + KL +++ ++ + S + L+ F+ Sbjct: 385 AKKPSQVVVVVDTSGSMQGN------KLPAVQNTLQNYINSLGSKD------KIALIDFN 432 Query: 225 SKIVQTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +I Q G E I+ L TK YA N + D A Sbjct: 433 DEISQPVLVEGTDAGRNRGLEFISGLQAYGGTKLYDAALYARNWLQDNPRPDAINA---- 488 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA---KRRGAIVYAIGVQAEA--ADQFLKNC 336 ++ LTDGE+S I+ + ++ + + IG E + LK Sbjct: 489 -----VLILTDGEDSGSQINLNQLEQELQQSGFNSDQRIAFFTIGYGKEGDFDPEALKAI 543 Query: 337 ASPDR-FYSVQNSRKLHDAFLRI 358 A + +Y + + + Sbjct: 544 ADLNAGYYRKGDPETIATVMDDL 566 >gi|239981840|ref|ZP_04704364.1| von Willebrand factor type A [Streptomyces albus J1074] gi|291453698|ref|ZP_06593088.1| von Willebrand factor [Streptomyces albus J1074] gi|291356647|gb|EFE83549.1| von Willebrand factor [Streptomyces albus J1074] Length = 221 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 41/200 (20%), Positives = 70/200 (35%), Gaps = 14/200 (7%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L ++ D S SM D + R++ ++ I + P V + R L+ FS Sbjct: 4 LPFYLLCDESGSMTG------DPIDAINRALPDLHHEISTNPTVADKTRFCLIGFSDDAS 57 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL + I +++ L G T + K+ E A+GH+ Y+ Sbjct: 58 VLQPLV-DLSDI-DEVPALSAGGLTDYGTAFRTLLRSV--EKDVAELKAQGHEVYRPVAF 113 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 FL+DG + + N I + IG +A Q + A+ F NS Sbjct: 114 FLSDGIPTDEDW-PTAHRELLNSRYAPKIIAFGIG---DAEAQIIGQVANFRAFIQKDNS 169 Query: 349 RKLHDAFLRIGKEMVKQRIL 368 A + + + Sbjct: 170 VSPAQALREFASSLTRSIVR 189 >gi|239617869|ref|YP_002941191.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] gi|239506700|gb|ACR80187.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] Length = 1706 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 54/332 (16%), Positives = 111/332 (33%), Gaps = 43/332 (12%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIW--QTDFRN 94 +T H F + KL+ +D T I ++ + N + + +FS + + R Sbjct: 166 DTVHVFEISTKLYSFVDPD-----TPIYSRSSKVNVEGLRENFSQDFMVTFLSHLGEIRY 220 Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNL-SAVSRYEMPFIF-CTFPWCANSSHAP 152 +L +++ IE L+I I+D + L A + P A Sbjct: 221 QLLNRVYSERTGKIEF---LNINIEDYPEIKILFRAYTDINKPISEEVLLHSDAYILEPS 277 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 I + K + L+ ++ +D S SM A + LD +P+ Sbjct: 278 GRIDLQSLEALKKEPSLNFVLEVDRSGSMKPVM-------EKAKDAASYFLD---LLPEN 327 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + L+ F ++I + ++ + + T + + + Sbjct: 328 SE---LALIAFDTEIEVLKNFTRDREQLKRALAIIKARGATPLYDTVAKGIELLSERSG- 383 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----AKRRGAIVYAIGVQAEA 328 +++I +TDG +++ S +E A+ +++AIG+ Sbjct: 384 -----------PRFLILVTDGVDANYGDTAPGSEKTLSEVIRLARENNVVIFAIGLGTRI 432 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + L A + F L AF + Sbjct: 433 DEFSLGTLARSTGGMFLKSPTIDNLKTAFNSL 464 >gi|47229708|emb|CAG06904.1| unnamed protein product [Tetraodon nigroviridis] Length = 990 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 70/189 (37%), Gaps = 34/189 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + D NV R F+ K Sbjct: 222 DMVILVDVSGSVSGL------TLKLIKASVMEMLDTLSD-DDYVNVAR-----FNEKAEA 269 Query: 230 TFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 P + ++ + ++ TT G +A+N++ + Sbjct: 270 VVPCFKHLVQANVRNKKIFKDAVQQMQAKGTTDYKSGFHFAFNQLLN------KTNVPRA 323 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCAS 338 + K I+ TDG D + +F + V+ V D L+ C + Sbjct: 324 NCNKIIMLFTDG-----GEDRAQDVFMQYNWPNKTVRVFTFSVGQHNYDVTPLQWIACTN 378 Query: 339 PDRFYSVQN 347 ++ +++ Sbjct: 379 KGYYFEIRS 387 >gi|86147193|ref|ZP_01065509.1| TadG-like protein [Vibrio sp. MED222] gi|85835077|gb|EAQ53219.1| TadG-like protein [Vibrio sp. MED222] Length = 435 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 69/450 (15%), Positives = 132/450 (29%), Gaps = 125/450 (27%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 G ++L AI++P +F V L + + KA+L + ++L + K + Sbjct: 10 GHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAVSAK----------DE 59 Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 Q + + R I++ D + L + I + + + ++ +Y + Sbjct: 60 QDHQLAERYIQHYLY-DMDSILDIEVKKLGCDEIPECIAATERGEARYFEYRV-----AG 113 Query: 135 MPFIFCTFPW--CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-FGPGMDK 191 FP + +T S K +D+ ++D S SMND G K Sbjct: 114 QTLHKSWFPGNDVISGFGDSFNVTGSSKARRYQSQPIDITFIVDFSESMNDSWSGGRHSK 173 Query: 192 LGVATRSIREMLDIIKSIPDV--NNVVRSGLVTFS-----------------------SK 226 L I ++ D + + D+ + R L F+ + Sbjct: 174 LNDLKDIIEDVADELGAYNDLYPEHPHRVALTGFNRRTINKDKNDNLVVRDQRVVSREGE 233 Query: 227 IVQTFPLAWGVQHIQEKINRLIF---------------------------------GSTT 253 + + + Q+ I + G T Sbjct: 234 YDKDDTVNFNKTIAQQFIVKGEASRVPNSDDDARFYDLYFTTDFSSFTKKVKKFKAGGGT 293 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS---PNIDNKESLFYCN 310 S G+ A + + K+ II L+DGE+ + + S C+ Sbjct: 294 ASLQGIIRAGQIVTSMSKNQ----------KQLIIILSDGEDWNHYAGQTNKLVSKGMCS 343 Query: 311 EAKR--RGAIV--------------YAIGVQAEAADQF------------------LKNC 336 G V + G+ ++ L+NC Sbjct: 344 NILNMVNGGKVSADNTHDDIEVIGGVSQGMMTPDGERMNARMAVIGFDYELNKNVGLRNC 403 Query: 337 ASPDRFYSVQNSRK-LHDAFLRIGKEMVKQ 365 D Y +N L+ I +E+ Sbjct: 404 VGRDNVYKAENKEDILNKILGLITEEVGHL 433 >gi|326434662|gb|EGD80232.1| hypothetical protein PTSG_10910 [Salpingoeca sp. ATCC 50818] Length = 5100 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 63/179 (35%), Gaps = 23/179 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D++ ++D+S SM+ +F D + +M++ + + R ++ F + Sbjct: 4903 GVDLVCIVDLSGSMSGYFQQLCD-------FLHQMMETFEVGVGASRN-RMAIIGFGTHS 4954 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L + I L T P L A + + + + + + Sbjct: 4955 RIDTWLTNDQSTLSRCIRSLDCRGGTLFQPPLRQALDALRRDQSDYATHNR------RIL 5008 Query: 288 IFLTDGENSSPNID----NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +F TDG N +++ + CN G +V A D+ C R+ Sbjct: 5009 MFQTDGMNGDAGPGVGQLSRQIVDGCNTT-SYGIVV----GDRNAMDRVRDICGPAGRY 5062 >gi|315105441|gb|EFT77417.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] Length = 322 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 67/204 (32%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D SLSM + + D I S+P N +V+ S Sbjct: 96 IVVAIDSSLSMKADDVSP----TRLAAAKAKAKDFINSLPTGFN---VAVVSISEHPEIR 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ + T ++ + + A ++ A I+ L Sbjct: 149 MPPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVML 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG+N+ L N A VY I E + L Sbjct: 203 SDGDNTQGG----SPLVAANRAAAAKVPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTV 258 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + + ++ ++ KL + + ++ Sbjct: 259 ADRTHAKSWTADSADKLQEGYQQV 282 >gi|119890597|ref|XP_001256059.1| PREDICTED: alpha 3 type VI collagen, partial [Bos taurus] Length = 1632 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 71/201 (35%), Gaps = 22/201 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +T + ++ D++ +LD S ++ + P + +++++ S+ Sbjct: 624 RTLTGTTEVRVNKR---DIIFLLDGSSNVGETNFPYVRDF---------VMNLVNSLDVG 671 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ +R GLV FS V F L + + ++ + G +Y Sbjct: 672 SDHIRVGLVQFSDTPVTEFSLNTYPTKSELLAHLRQMQLQGGSVLNTGAALSYVHANHFT 731 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + H + ++ LT G++ L N R G + + +G Sbjct: 732 EAGGSRIQDHVP--QLLLLLTAGQSED------SYLQAANALARAGILTFCVGTSQADRA 783 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + +P Y + + L Sbjct: 784 ELEEIAFNPGLVYLMDDFSSL 804 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 85/223 (38%), Gaps = 26/223 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 I C SS AP +I + D++ ++D S G V Sbjct: 208 IVGNLVACVRSSMAPERAGG-TEIPKDITAQDSADIIFLIDGSN------NTGSVNFAVI 260 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIFGSTT 253 + +L+ + +R G+V +S + F L ++ + + + + L F Sbjct: 261 LDFLVNLLERLSI---GTQQIRVGVVQYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGE 317 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + GL A + + + ++ + + ++ ++ G +S D +L + Sbjct: 318 LANVGL--ALDFVVENHFTRAGGSRAEEGVPQVLVLISAGPSSDEIRDGVIALKQAS--- 372 Query: 314 RRGAIVYAIGVQAEAADQF-LKNCASPDR-FYSVQNSRKLHDA 354 V++ G+ A+AA + L++ A+ D ++V R L D Sbjct: 373 -----VFSFGLGAQAASKAELQHIATNDNLVFTVPEFRSLGDV 410 >gi|73990549|ref|XP_542778.2| PREDICTED: similar to alpha 3 type VI collagen isoform 4 precursor [Canis familiaris] Length = 1320 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 43/198 (21%), Positives = 77/198 (38%), Gaps = 33/198 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ D M M + IK + V+ G+V +S I Sbjct: 843 DIYFLIDGSSSIKDANFLEMKVF---------MNEAIKRFQIGPDRVQFGVVQYSDGINI 893 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L+ + ++ I+ + T + L D + Y Sbjct: 894 QFALSQYSSMAELKAAIDDIQQRKGGTMTGEALSRMAQVFVDTAR---------SNVPWY 944 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I +TDG++ P + E+L + G I+YAIGV+ A LK A F++ + Sbjct: 945 LIIITDGKSEDPVAEPAEAL------RGEGVIIYAIGVK-NANVMELKEIAKDKTFFTPE 997 Query: 347 NSRKLHDAFLRIGKEMVK 364 D+ I +++V+ Sbjct: 998 -----FDSLKVIQRDVVQ 1010 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 77/203 (37%), Gaps = 29/203 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D++ ++D S S++ + ++ M++ D ++ GL+ FS Sbjct: 1019 KNRQADIIFLIDGSESISPN------DFEKMKGFVKRMVNQANIGADE---IQIGLLQFS 1069 Query: 225 SKIVQTFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S + F L + I+R + T++ L + ++ G Sbjct: 1070 SSPQEEFRL--NQYSSKADIHRAISKVVQMNDGTRTGKALTFTLPFFDSSRG-------G 1120 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +Y+I +TDG ++L + R +++AIGV + Q L+ Sbjct: 1121 RPNVHQYLIVITDGVAQDDVAIPAKAL------RDRNIVIFAIGVGEAKSAQLLQITDDV 1174 Query: 340 DRFYSVQNSRKLHDAFLRIGKEM 362 + Y +N L + +I ++ Sbjct: 1175 QKVYYEENFESLQNLEKKILLKV 1197 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 75/200 (37%), Gaps = 26/200 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +V + S +F +D L S+ D VR GLV +S + Sbjct: 635 VFLVEEFSRDKQWNFQQVIDFLKTTVSSLNVHPD----------GVRIGLVFYSEEPRLE 684 Query: 231 FPLAW--GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + E +++L + TK+ L++ N++F E ++ + Sbjct: 685 FSLDTFQTPAKMLEHLDKLTYRRRSGRTKTGAALDFLRNEVF----VEERGSRSKQGVLQ 740 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +T+G + + L +R G +YA+G + + L+N AS + V Sbjct: 741 MAVVITEGFSQDQLSEPASLL------RRAGVTIYAVGTHRASESKDLENIASYPPWKHV 794 Query: 346 QNSRKLHDAFLRIGKEMVKQ 365 + +G ++ Q Sbjct: 795 ISLESFLQ-LSVVGSKIKNQ 813 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 27/142 (19%), Positives = 51/142 (35%), Gaps = 16/142 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + ++ + D VR GL ++ I Sbjct: 235 DIVFLVDSSTSIGPQ------NFQKVKNFLYSVILGLDISSDQ---VRVGLAQYNDNIYP 285 Query: 230 TFPLAWG--VQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + E+I L + T + LE + ++ E AK + Sbjct: 286 AFQLNQYPLKSVVLEQIQNLPYRTGDTNTGSALE--FIRMHYLTEAAGSRAKDSVP--QI 341 Query: 287 IIFLTDGENSSPNIDNKESLFY 308 +I +TDGE++ + L Sbjct: 342 VILVTDGESNDEVQEAANKLKE 363 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 77/226 (34%), Gaps = 29/226 (12%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 W A S +I + D++ ++D SL+ D RS+R L Sbjct: 8 WEFIFLAASFGFIKSQRIVCREASVGDVVFLVDTSLNTQD------------IRSVRNFL 55 Query: 204 D-IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLE 260 ++ S + +R GL + F L+ + + + I +L S + Sbjct: 56 YIMVNSFNVSKDSIRVGLAQYGDVPRSEFLLSTYPRKGDVLKHIQKLQPKSW---GHKMG 112 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + D + ++ + + ++ +SP D + KR G ++Y Sbjct: 113 LALQFLLDHHFQATAGSRASQGVPQMAMVIS----NSPAEDPVQ--EAAKALKRAGVLLY 166 Query: 321 AIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 +GV+ + + +SP + F +G K R Sbjct: 167 TVGVKDAVLAELKEIASSP-----AEKFTSFVPNFPDLGSHAQKLR 207 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 73/212 (34%), Gaps = 24/212 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 L KI + D++ + D S + + + +LD+ Sbjct: 412 TLCSAVEGKIKEFTQAYADVVFLADTS---QNTSQASFQWMQNFISRVVGLLDV------ 462 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIF 267 + + GL + + F L + I L+ G + ++ L Y + F Sbjct: 463 GRDKYQIGLAQYGGQGHTEFLLNTYHTRDEMIAHIREHFLLRGGSRRTGKALRYLHQTFF 522 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + ++ +Y + +T G++ D ++L + +G V ++GVQ + Sbjct: 523 ----QEAAGSRFLQGIPQYAVVMTSGKSEDEVWDAAQTL------REKGVKVMSVGVQ-D 571 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 + L+ A+P Y +Q + + Sbjct: 572 FDRKELEGMATPPLIYEMQGEDGVRQLMQDVS 603 >gi|83423290|emb|CAI67595.1| collagen, type XXVIII [Homo sapiens] gi|223462744|gb|AAI36893.1| Collagen, type XXVIII, alpha 1 [Homo sapiens] Length = 1125 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 63/179 (35%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + ++ ++ + FSS Sbjct: 47 IDIVFIVDSSES------SKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNATRLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K ++ +TDG + N D + +A+ G IG+ + L+ + Sbjct: 153 -VKVVLLMTDGIDHPKNPDVQS---ISEDARISGISFITIGLSTVVNEAKLRLISGDSS 207 >gi|290769918|gb|ADD61688.1| putative protein [uncultured organism] Length = 570 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 43/231 (18%), Positives = 86/231 (37%), Gaps = 33/231 (14%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 + L+ ++ I KS +++ ++DVS SM D +KL + +++ + Sbjct: 180 TKLALVSMNTQAIDFKSAPASNLVFLIDVSGSMFDD-----NKLPLVQQALTML------ 228 Query: 209 IPDVNNVVRSGLVTF--SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 ++ R +VT+ S ++V I I L +T + G+E AY Sbjct: 229 AENLTEKDRVSIVTYAGSDEVVLQGVSGDDYHEISSAIEGLEAYGSTNGSAGIETAYAL- 287 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ KG ++ +I TDG+ + + E K G + GV Sbjct: 288 -----AKKYFIKGGNNR---VILCTDGDLNVGLTSEGQLEKLITEKKDSGVFLSTFGVGY 339 Query: 327 ----EAADQFLK-----NCASPDRFYSVQNS--RKLHDAFLRIGKEMVKQR 366 + + L N A D + + + +L + + K++ Q Sbjct: 340 GNYKDNKLELLADKGNGNYAYIDSMFEAKKALVDELGANMVTVAKDVKLQV 390 >gi|148706512|gb|EDL38459.1| vitrin, isoform CRA_a [Mus musculus] Length = 650 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 71/202 (35%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 467 DIGFVIDGSSSV------GTSNFRTVLQFVANL---SKEFEISDTDTRVGAVQYTYEQR- 516 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 517 ---LEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 564 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A ++G I YAIG+ A D+ P D Sbjct: 565 KRKVMIIITDGRSYD------DVRIPAMAAYQKGVITYAIGIAWAAQDELEVMATHPAKD 618 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V + L+ RI + + Sbjct: 619 HSFFVDDFDNLYKIAPRIIQNI 640 >gi|90412167|ref|ZP_01220173.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] gi|90326891|gb|EAS43276.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] Length = 504 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 81/223 (36%), Gaps = 19/223 (8%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 + +G+ I TA+ L +F ++ +E + K +L + + L T NQ++ Sbjct: 15 RHQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAITTA--NQDDKT 72 Query: 71 NGKKQKNDFSYRIIKNIWQ-TDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + + I+NI + + E E I+ + + ++ Y ++A Sbjct: 73 YENQLATGYIQAYIRNITSINNIKIERSEG--------IDNYPTPDGNEEREYFQYRVTA 124 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 + + P A + + D +D++ V D S SM + Sbjct: 125 KTNHISWLSSDIIPSFAPTETVANRALARNYPIYLGDKDIDIVFVSDFSGSMKGN----- 179 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVV--RSGLVTFSSKIVQT 230 K+ +I+ + + I D V R V ++ ++ + Sbjct: 180 -KIRALKDAIQAIANEILVPRDGEVEVTNRIAFVPYNMRVQEK 221 >gi|167521285|ref|XP_001744981.1| hypothetical protein [Monosiga brevicollis MX1] gi|163776595|gb|EDQ90214.1| predicted protein [Monosiga brevicollis MX1] Length = 2728 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 55/162 (33%), Gaps = 13/162 (8%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 F + L ++ D++MV+DVS SM D+ A +R Sbjct: 2495 LFAGRQGNPERALRYVQQYSSHAEYAEAADVLMVVDVSGSMTDYM-------EQARAFVR 2547 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQ---TFPLAWGVQHIQEKINRLIFGSTTKSTP 257 + + + R L TF + +Q+++ ++ T P Sbjct: 2548 TIAREGFHLDSTASQHRMALFTFGTTATALGGEPLFTSDWAQLQQRVAQIAVNGATNYLP 2607 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 L+ + D ++ ++ ++F TDG NS N Sbjct: 2608 ALKLVEQSLRD---LKASDPARYNASRRIVLFQTDGSNSDRN 2646 >gi|313241793|emb|CBY34008.1| unnamed protein product [Oikopleura dioica] Length = 694 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 59/183 (32%), Gaps = 27/183 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ V+D S S G + + + D R + FSS Sbjct: 180 ALDIVFVVDESGS------IGTNNFQLIKDFLEHFASDSTIAADA---TRIAIRPFSSSN 230 Query: 228 VQTFPL-AWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + ++I +I + + T + L+ A + + K Sbjct: 231 YLYFSLNDFKTKNIINEIKNMPYNEGGTNTADALDAALTDYGTDRP----------ESVK 280 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ +TDG ++S + + K R +AIGV + S + Sbjct: 281 VMVTITDGASNSF----LSTSAAADRVKNDLRNIQSFAIGVSGANMAELNAIAISAKHVF 336 Query: 344 SVQ 346 + Sbjct: 337 MLN 339 >gi|301772356|ref|XP_002921600.1| PREDICTED: integrin alpha-1-like, partial [Ailuropoda melanoleuca] Length = 1160 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 80/227 (35%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 138 VVNSIAPVRECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLERMDIGPKQTQ 189 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + N++I T + G++ A + F Sbjct: 190 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAANKIIQRGGRQTMTALGIDTARKEAFTEA 246 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + + ++I + Sbjct: 247 RGARRGVK------KVMVIVTDGE----SHDNHQLNKVIQDCEDENIQRFSIAILGSYNR 296 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 297 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFAL 343 >gi|226314649|ref|YP_002774545.1| hypothetical protein BBR47_50640 [Brevibacillus brevis NBRC 100599] gi|226097599|dbj|BAH46041.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 513 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 41/271 (15%), Positives = 88/271 (32%), Gaps = 34/271 (12%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + + I+ Y + + P+ N + I ++S K Sbjct: 119 AEAVRVEEFINFFPTSYPAPTNQTFAIQADSGPSPFQKNLQIVRIGIKGK-ELSPKERKP 177 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--SSK 226 +++ V+DVS SMN ++L + +S+ ++D ++ V G+V + + Sbjct: 178 ANLVFVIDVSGSMNQE-----NRLELVKKSLHVLVDQLQPTDSV------GIVVYGSEGR 226 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 ++ Q I I+ L +T + GL Y E K + Sbjct: 227 VLLPPTSTEDKQAILSAIDELQPEGSTNAEQGLVLGY-------EMAARSFKPPAINR-- 277 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFY 343 +I +DG + + L + R+ + + G D ++ A + Sbjct: 278 VILCSDGVANVGETGAEGILRSIEDYARKDIYLSSFGFGMGNYNDVMMEQLANKGEGSYA 337 Query: 344 SVQN--------SRKLHDAFLRIGKEMVKQR 366 + + L I +++ Q Sbjct: 338 YIDTFSEARRIFTESLTGTLQTIARDVKIQV 368 >gi|171681714|ref|XP_001905800.1| hypothetical protein [Podospora anserina S mat+] gi|170940816|emb|CAP66465.1| unnamed protein product [Podospora anserina S mat+] Length = 648 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 64/188 (34%), Gaps = 16/188 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVT 222 LD+++ +DVS SM +LD+++ + +++ R G+VT Sbjct: 70 PLDLVLSIDVSGSMGADAPVPAKNGTEGEHYGLSVLDLVRHAAKTILETLDDHDRLGIVT 129 Query: 223 FSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FS+ L + I ++++ L S T G+ + + + + Sbjct: 130 FSTSSKVVRELTYMTPANKAKILKQLDALQPLSMTNLWHGIRDGLSLFNNNLKAVNDRRN 189 Query: 279 GHDDYKKYIIFLTDGENSS--PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 ++ LTDG + PN L + ++ G LK+ Sbjct: 190 PGSGRVPALLVLTDGMPNHQCPNQGYVAKLRQWSTLPAS---IHTFGFGYSLRSGLLKSI 246 Query: 337 A--SPDRF 342 A + Sbjct: 247 AEVGGGNY 254 >gi|162452306|ref|YP_001614673.1| glycine-rich protein [Sorangium cellulosum 'So ce 56'] gi|161162888|emb|CAN94193.1| glycine-rich protein [Sorangium cellulosum 'So ce 56'] Length = 408 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 82/252 (32%), Gaps = 70/252 (27%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-- 226 + +++VLD S SM + G D+ + ++ LD+I + + GL F S Sbjct: 96 VHLVIVLDRSDSMTKDWE-GSDRWTMMKGALGTALDVI------RDRMSVGLQLFPSDEH 148 Query: 227 ---------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V P A V I+ ++ G T + L A + Sbjct: 149 CGMPAGEDLSVAVAPGATSVPAIKTLLDGTDPGGATPTADALARALGYLTVGAG------ 202 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----------------------------------IDN 302 G + KY++ TDG + +D+ Sbjct: 203 -GALEGDKYVLLATDGGPNCNQDPAMTCEAATCTTNMDGDCPSGVPNCCVAGLTDVCLDD 261 Query: 303 KESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA---------SPDRFYSVQNSRKLH 352 ++ + + G + +G+ A A L A + R++ V ++ L Sbjct: 262 VRTVQRVKDLRAAGIKTFVVGIPGATAYAHVLDQLAVEGDTATSETSPRYFEVVDAASLG 321 Query: 353 DAFLRIGKEMVK 364 D I +++V+ Sbjct: 322 DTLTGITRDLVR 333 >gi|3850207|gb|AAC72024.1| alpha-1 type VII collagen non-collagenous domain [Canis lupus familiaris] Length = 1253 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 78/224 (34%), Gaps = 27/224 (12%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 +P + + + +++ D++ +LD S S+ Sbjct: 2 RLPLLVAPLCAGILAGALRVRAQQRERVTCTRLYAADIVFLLDGSSSIGR------GNFR 55 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF-G 250 + ++ VR V +S F L A G + I L + G Sbjct: 56 EVRGFLEGLVWPFSGAASA-QGVRFAAVQYSDDPRTEFGLGALGSGGDVIRAIRELSYKG 114 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T++ + + + +F L +A+ K I +TDG++ + L Sbjct: 115 GNTRTGAAILHVADHVF-----LPQLARPGVP--KVCILITDGKSQDLVDTAAQRL---- 163 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFYSVQNSRKL 351 K +G ++A+G++ A + LK AS D F+ V + L Sbjct: 164 --KGQGVKLFAVGIK-NADPEELKRVASQPTSDFFFFVNDFSIL 204 >gi|322434933|ref|YP_004217145.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162660|gb|ADW68365.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 347 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 83/225 (36%), Gaps = 37/225 (16%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 +K S D + + +V D+S SM FG R+ + + + +++ + Sbjct: 108 NTGQVIKTFSTQDAPVTIGIVFDLSGSMTSKFG----------RARKALSEFLRTSNPAD 157 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +V F+ K V+ ++ ++ L + T + NK+ +AK Sbjct: 158 EFF---VVGFNDKPAVIVDYTSDVEDVEARMVMLKPENRTALIDAVYLGVNKLKEAKYDR 214 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAAD 330 K ++ ++DG ++ E + +Y+IG+ A + Sbjct: 215 -----------KALLIVSDGGDNRSRYTEGELRRV---VRESDVQIYSIGIYDAYAPTEE 260 Query: 331 Q-----FLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + LK+ + + R + V + + D RI E+ + ++ Sbjct: 261 EQLGPVLLKDISEMTGGRMFPVTDIADMADIASRISAELRNEYVI 305 >gi|289606823|emb|CBI60997.1| unnamed protein product [Sordaria macrospora] Length = 599 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 38/133 (28%), Gaps = 41/133 (30%) Query: 235 WGVQHIQEKINRLIFGSTT-------------KSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +N L+ T +T + A + D + Sbjct: 107 NNRSGFVSYLNGLVARGGTYHDIGMIWGARFLSTTGLFKSATPETNDVTDPDNPAKIRGF 166 Query: 282 DYKKYIIFLTDGE--------------------NSSPNIDN--------KESLFYCNEAK 313 KKY+IF+TDG+ N SP DN + CN AK Sbjct: 167 SVKKYMIFMTDGDMSPTWNDYSAYGIEYLDGRVNGSPTTDNAALLARHLQRFRMACNAAK 226 Query: 314 RRGAIVYAIGVQA 326 +G ++ I Sbjct: 227 AKGIDIWVIAFST 239 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 47/134 (35%), Gaps = 7/134 (5%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 I F +G+ + A++LPV+ ++ E ++ +V A L L + + Sbjct: 249 ITRFLTATRGATVLEFALILPVLCALLAGGFELGYRAYVNAILQGALLEASRQATVGDRS 308 Query: 66 QE--NGNNGKKQKNDFSYRIIKNIWQTDFRNE-----LRENGFAQDINNIERSTSLSIII 118 + + I++I + F N + F ++ + ST Sbjct: 309 GAQIDKTITDRMATLSGSISIQSIKKESFYNFSNVGKPEKLTFDRNGDGAYDSTQDCYED 368 Query: 119 DDQHKDYNLSAVSR 132 + + Y++ S Sbjct: 369 ANNNGAYDVKTNSG 382 >gi|218708116|ref|YP_002415737.1| hypothetical protein VS_0028 [Vibrio splendidus LGP32] gi|218321135|emb|CAV17085.1| Conserved hypothetical protein, putative exported, TadG [Vibrio splendidus LGP32] Length = 435 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 68/459 (14%), Positives = 135/459 (29%), Gaps = 125/459 (27%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +++ G ++L AI++P +F V L + + KA+L + ++L + K Sbjct: 1 MKHAMRKQSGHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAVSAK--- 57 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 +Q + + R I++ D + L + + + + + ++ +Y Sbjct: 58 -------DEQDHQLAERYIQHYLY-DMDSILDIEVKKLGCDEMPECIAATERGEARYFEY 109 Query: 126 NLSAVSRYEMPFIFCTFPW--CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + FP + +T S K +D+ ++D S SMND Sbjct: 110 RV-----AGQTLHKSWFPGNDVISGFGDSFNVTGSSKARRYQSQPIDITFIVDFSESMND 164 Query: 184 H-FGPGMDKLGVATRSIREMLDIIKSIPDV--NNVVRSGLVTFS---------------- 224 G KL I ++ D + + D+ + R L F+ Sbjct: 165 SWSGGRHSKLNDLKDIIEDVADELGAYNDLYPEHPHRVALTGFNRRTINKDKNDNLVVRD 224 Query: 225 -------SKIVQTFPLAWGVQHIQEKINRLIF---------------------------- 249 + + + + Q+ I + Sbjct: 225 QRVVSREGEYDKDDTVNFNKTIAQQFIVKGEASRVPNGDDDARFYDLYFTTDFSSFTKKV 284 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS---PNID 301 G T S G+ A + + K+ II L+DGE+ + + Sbjct: 285 KKFKAGGGTASLQGIIRAGQIVTSMSKNQ----------KQLIIILSDGEDWNHYAGQTN 334 Query: 302 NKESLFYCNEAKR--RGAIV--------------YAIGVQAEAADQF------------- 332 S C+ G V + G+ ++ Sbjct: 335 KLVSKGMCSNILNMVNGGKVSADNTHDDVEVIGGVSQGMMTPDGERMNARMAVIGFDYEL 394 Query: 333 -----LKNCASPDRFYSVQNSRK-LHDAFLRIGKEMVKQ 365 L+NC D Y +N L+ I +E+ Sbjct: 395 NKNVGLRNCVGRDNVYKAENKEDILNKILGLITEEVGHL 433 >gi|164687487|ref|ZP_02211515.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] gi|164603261|gb|EDQ96726.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] Length = 273 Score = 56.0 bits (133), Expect = 8e-06, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 85/210 (40%), Gaps = 18/210 (8%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +++ S L + ++D S SM+ K+G ++ E+L ++ + ++ Sbjct: 23 PLEVKPISKKNLVIFFLVDTSGSMSGK------KIGTLNTTMEELLPELRGLGGATTDIK 76 Query: 218 SGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++TFSS + + P++ + RL T L A+ ++ + + E Sbjct: 77 LAVMTFSSGCEWITKEPMSVDDY---QYWTRLKAEGLTD----LGEAFTELSNKLSRKEF 129 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + Y I LTDG + ++ ++L + N + G V A+G+ + ++ LK Sbjct: 130 LNAPSLSYAPVIFLLTDGYATDDALEGLKTLQH-NNWYKYGLKV-ALGLGEKFDEELLKK 187 Query: 336 CAS-PDRFYSVQNSRKLHDAFLRIGKEMVK 364 P+ + + S +L I + Sbjct: 188 FTGNPELVVTAKTSDQLSKLVKTIAVTSSQ 217 >gi|320589835|gb|EFX02291.1| von willebrand factor type a domain containing protein [Grosmannia clavigera kw1407] Length = 735 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 38/262 (14%), Positives = 79/262 (30%), Gaps = 35/262 (13%) Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 + + S SL++ + P P ++ + + Sbjct: 45 LRDSSSSASLNVYRLPSDDGILIKVKP----PAYPGNIPSISSQRGL----------AGQ 90 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSG 219 + +++V+DVS SM + K + +LD++K + +N G Sbjct: 91 GHVPCSIVLVIDVSGSMQEDAPVPATKGEPMESNGLTVLDLVKHAARTILETLNEHDCLG 150 Query: 220 LVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +VTFS L + I L + T + + E Sbjct: 151 IVTFSEDANVLLMLTPMTQVNKAKALQVILDLEPLTVTNL-------WKGLTAGIEIFSS 203 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 A+ I+ LTDG + + + + A ++ G LK+ Sbjct: 204 KAQFSSVPS--IMLLTDGLPN-FMHPPQGYIPKLRTFGKLPAPIHTFGFGYNLRSGLLKS 260 Query: 336 CA--SPDRFYSVQNSRKLHDAF 355 + + + + ++ L F Sbjct: 261 ISELTGGNYAFISDAGMLGTVF 282 >gi|302917449|ref|XP_003052439.1| hypothetical protein NECHADRAFT_36251 [Nectria haematococca mpVI 77-13-4] gi|256733379|gb|EEU46726.1| hypothetical protein NECHADRAFT_36251 [Nectria haematococca mpVI 77-13-4] Length = 764 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 76/204 (37%), Gaps = 38/204 (18%) Query: 170 DMMMVLDVSLSMND---------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 D+++V+DVS SM + G+ L + + R +++ + N R G+ Sbjct: 86 DIVLVIDVSGSMAGAAPVPGEETNESTGLSILDLTKHAARTIIETM------NESDRLGI 139 Query: 221 VTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 VTF+SK PL + + + + + T ++ + + + +++ Sbjct: 140 VTFASKAKVVQPLLSMTSENKERSRGNVTSMRPIDATNL-------WHGLLEGIKLFKNV 192 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG---AIVYAIGVQAEAADQFL 333 + I+ LTDG + N + + + + G A ++ G L Sbjct: 193 KSSNVPA---IMVLTDGMPNHMN----PAAGFVPKLRAMGQLPASIHTFGFGYHLRSGLL 245 Query: 334 KNCA--SPDRFYSVQNSRKLHDAF 355 K+ A + + ++ + F Sbjct: 246 KSIAEIGGGNYAFIPDAGMIGTVF 269 >gi|167526012|ref|XP_001747340.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774175|gb|EDQ87807.1| predicted protein [Monosiga brevicollis MX1] Length = 1632 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 47/233 (20%), Positives = 86/233 (36%), Gaps = 30/233 (12%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + I F W + + L ++ + +D++ VLD S S++ G Sbjct: 300 LAAIRSKFDWISYVDNTTLRVSVIREG---CKSAIDLIFVLDGSGSIDRESYGG-----T 351 Query: 195 ATRSIREMLDIIKS----IPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI 248 ++LD +K N R G++TFSS V F L + + + I+ + Sbjct: 352 PGNFQYKILDFVKQVVSYFDISANATRVGVITFSSSAVINFNLNSFYDKSDMLDAIDNIN 411 Query: 249 -FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-YKKYIIFLTDGENSSPNIDNKESL 306 S+T+ + L + K + + + K ++ LTDG+ SS E Sbjct: 412 YPASSTRISLALASVRQNML----KEYNGMRPESEGVPKVVVVLTDGQASSG----YEPA 463 Query: 307 FYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 + K G +++IG+ + DQ +P + L F I Sbjct: 464 YEAALLKDMGVNMFSIGIGSSIDTDQLEDMATAP----LASHMH-LLKNFDAI 511 >gi|154759255|ref|NP_001032852.2| collagen alpha-1(XXVIII) chain precursor [Homo sapiens] gi|167009138|sp|Q2UY09|COSA1_HUMAN RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor Length = 1125 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 62/179 (34%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + ++ ++ + FSS Sbjct: 47 IDIVFIVDSSES------SKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNATRLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K ++ +TDG + N D + +A+ G I + + L+ + Sbjct: 153 -VKVVLLMTDGIDHPKNPDVQS---ISEDARISGISFITIALSTVVNEAKLRLISGDSS 207 >gi|119614001|gb|EAW93595.1| hCG2042895 [Homo sapiens] Length = 713 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 62/179 (34%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + ++ ++ + FSS Sbjct: 47 IDIVFIVDSSES------SKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNATRLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K ++ +TDG + N D + +A+ G I + + L+ + Sbjct: 153 -VKVVLLMTDGIDHPKNPDVQS---ISEDARISGISFITIALSTVVNEAKLRLISGDSS 207 >gi|51095061|gb|EAL24305.1| similar to Matn2-prov protein [Homo sapiens] Length = 651 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 62/179 (34%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + ++ ++ + FSS Sbjct: 47 IDIVFIVDSSES------SKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNATRLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K ++ +TDG + N D + +A+ G I + + L+ + Sbjct: 153 -VKVVLLMTDGIDHPKNPDVQS---ISEDARISGISFITIALSTVVNEAKLRLISGDSS 207 >gi|51095062|gb|EAL24306.1| similar to Matn2-prov protein [Homo sapiens] Length = 668 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 62/179 (34%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + ++ ++ + FSS Sbjct: 47 IDIVFIVDSSES------SKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNATRLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K ++ +TDG + N D + +A+ G I + + L+ + Sbjct: 153 -VKVVLLMTDGIDHPKNPDVQS---ISEDARISGISFITIALSTVVNEAKLRLISGDSS 207 >gi|260837292|ref|XP_002613639.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] gi|229299025|gb|EEN69648.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] Length = 240 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 73/187 (39%), Gaps = 25/187 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S D+ + ++ P + G+V +SS Q Sbjct: 4 DILFVVDGSSS------IPADEFEKVKTFLNSIVGHFDIGP---TATQVGVVQYSSSPQQ 54 Query: 230 TFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + N +I G T + L +A + A + A+ K Sbjct: 55 EFALNAHSSLVSLQQAITNIIIIGRGTNTGSALTFARDVALTA----ANGARPGLP--KI 108 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDRFYSV 345 ++ +TDG +S + ++L + G I +AIGV + A+D Q + S DR ++ Sbjct: 109 VVTMTDGASSEDVLTPSQNL------RNDGVITFAIGVTSRASDWQVEEIAGSLDRVFTA 162 Query: 346 QNSRKLH 352 + L Sbjct: 163 SDFDALD 169 >gi|268580761|ref|XP_002645363.1| Hypothetical protein CBG15420 [Caenorhabditis briggsae] Length = 862 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 70/190 (36%), Gaps = 27/190 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++++ D S + F + + I++ +P + VR G+V +S + Sbjct: 33 LDIIILFDTSGGNDTVFE----------QQKNWTIKIVRDLPVHEDAVRVGIVQYSDEAK 82 Query: 229 QTFPLAW--GVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L+ I + L F G T++ L A ++IFD A Sbjct: 83 TEFNLSRYSERNDIITHLETLKFMPGEDTRTGVALSKADDEIFDYDGGARLKAT------ 136 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDRFY 343 + II TDG + + +R+G +Y I V + + L D + Sbjct: 137 RLIIVFTDGLSMD------KPTLAAKALRRKGVKIYTISVNSIGFVPEMLGIVGDADNVF 190 Query: 344 SVQNSRKLHD 353 + ++ + Sbjct: 191 GPTDENRIEE 200 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 68/199 (34%), Gaps = 31/199 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ +D++ V+D S S+ + D L +IK + + R GL+ Sbjct: 678 SASVQCPMDILFVVDSSGSIARTYDTQKDYLT----------QLIKKVEPSRSH-RVGLI 726 Query: 222 TFSSKIVQTFPLAWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F+ +Q ++ ++ + TT LE + + ++ E Sbjct: 727 QFAGPHIQKMEWSFDTHSKNSQLLSAIRSVRHLTGTTYIGAALELSLILLDSRRKHTETT 786 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 +I ++DG + + + L K +YAI + ++L + Sbjct: 787 ----------VILISDGFSQDDSTQQAKLLRQLPNVK-----MYAISLNKLTNTKYLTDI 831 Query: 337 ASPDRFYSVQNSRKLHDAF 355 + + + + + F Sbjct: 832 VGDRKNLFINDESQWFEEF 850 >gi|189501233|ref|YP_001960703.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496674|gb|ACE05222.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 339 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 39/164 (23%), Positives = 59/164 (35%), Gaps = 23/164 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LDM+ +LDVS SM D+L A + I I R GLV F+ Sbjct: 91 ALDMVYLLDVSNSML-ARDISPDRLERAR-------EEIVRISRGIERGRRGLVAFAGSG 142 Query: 228 VQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V PL Q + + LI T + ++ A K+F + E + G Sbjct: 143 VVQCPLTTDQQAFETMLGIASPDLIEAQGTDISAAMDVA-QKMFSGSKTEEKVKAGG--- 198 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + ++DGE K + K + + +GV E Sbjct: 199 --VAVLVSDGE-----AHEKGFSAAARKLKEKDVRLIVVGVGEE 235 >gi|257883753|ref|ZP_05663406.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] gi|257819591|gb|EEV46739.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] Length = 1475 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 68/192 (35%), Gaps = 24/192 (12%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 D + Y V +P + +++ + + K +D++ VLD S Sbjct: 305 DYNGAYIKKWV-EPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSA 363 Query: 180 SMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--------SSKIVQ 229 SM++ K ++ EM + D + +R G+V F + + + Sbjct: 364 SMSELTAGTNSQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHEQIS 421 Query: 230 T--FPLAWGVQHIQ-EKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + FPL + + + L T T GL+ Y ++ + + + Sbjct: 422 SDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGGE------NRNPE 475 Query: 285 KYIIFLTDGENS 296 K +I + DG + Sbjct: 476 KILIVVGDGTPT 487 >gi|218462279|ref|ZP_03502370.1| hypothetical protein RetlK5_23628 [Rhizobium etli Kim 5] Length = 347 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 32/245 (13%), Positives = 77/245 (31%), Gaps = 35/245 (14%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHY----------ILDHSLLY 58 F + G+ I+TA+L+ + G+ ++ +H ++ +L+ + Sbjct: 8 FISDRSGNFGIMTALLMVPLLGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII 118 A + + + GK +NI+ + EL E ++ + + Sbjct: 68 AAMAMNSNGTVSLGKTDA--------RNIFMSQMSGELAE---------VQVDLGIDVTK 110 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 + +S + F+ + I+ + ++ +D ++LD + Sbjct: 111 TANKLNSQVSFTATVPTTFMQIL-------GRDSITISGTATAEYQTAAFMDFYILLDNT 163 Query: 179 LSMNDHFGP-GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 SM P + KL + K+I + G+ + Q Sbjct: 164 PSMGVGATPDDVSKLEAKAGCAFACHQMDKTINNYTIAKSLGVAMRIDVVRQATQALTDT 223 Query: 238 QHIQE 242 + Sbjct: 224 AKTER 228 >gi|260836190|ref|XP_002613089.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] gi|229298473|gb|EEN69098.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] Length = 267 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 66/207 (31%), Gaps = 41/207 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D++ +LD S S+ + ++ + G+V + Sbjct: 2 PVDLVFLLDGSGSITAP------NFEITKSFVQN---TTSDFQIGTAHTQVGVVQYEDNP 52 Query: 228 VQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FPL + E + + G T++ A + + D H A+ Sbjct: 53 YDEFPL-NQYATLDELLTAIRNITYRGGGTQTG----KAIDHVVDNSLTESHGARPGVP- 106 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--- 340 K +I +TDG++ + A G I+ AIGV + L AS + Sbjct: 107 -KVVIVVTDGQS------WDSVVAPAQRANHSGIIMVAIGVGSGYDINELMEIASSNDTL 159 Query: 341 ---------RFYSVQNSRKLHDAFLRI 358 ++ V N L F I Sbjct: 160 GTIEYFLRCKYLKVNNLTFL---FQDI 183 >gi|159898662|ref|YP_001544909.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159891701|gb|ABX04781.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 610 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 66/220 (30%), Gaps = 33/220 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 I + V+D S SM ++L + ++ + ++ Sbjct: 257 SIEVADRKPAALTFVIDTSGSMAQD-----NRLEMVKNALIYLAGQLEPDDS------LA 305 Query: 220 LVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +V F+ + G I IN L +T + GL + E Sbjct: 306 IVAFNDGMRVVLNPTSGENQMDIITAINSLEPAGSTNAEAGLYKGF-------ELAWQAF 358 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNC 336 K + I+ +DG +S + + L + G + GV D L+ Sbjct: 359 KPEGINR--ILLCSDGVANSGMTEPSQLLATFQQYLDAGVQLSTYGVGMGNYNDILLEQL 416 Query: 337 A--SPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 A + ++ +L + IG+E Q Sbjct: 417 ADKGDGNYAYFDSADEAQRLFGEQLTGSLQTIGREAKIQV 456 >gi|3183041|sp|Q90615|ITA1_CHICK RecName: Full=Integrin alpha-1; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1 gi|497990|gb|AAA59067.1| alpha 1 integrin [Gallus gallus] Length = 285 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 42/265 (15%), Positives = 92/265 (34%), Gaps = 33/265 (12%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 +++ + + + Y+ + T C+N S + + + LD+++V Sbjct: 8 TLVTNPKGGFLACGPLYAYKCGRLHYTTGVCSNVSSTFETVKAVAPSVQECKTQLDIVIV 67 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S S + T + +L + P G+V + +V F L Sbjct: 68 LDGSNS--------IYPWESVTAFLNSLLRNMDIGPQQTQ---VGIVQYGQTVVHEFYLN 116 Query: 235 W--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + + R+ T++ L + + H A+ +K ++ +TD Sbjct: 117 TYSTTEEVMDAALRIRQRGGTQTMTAL--GIDTAREEAFTEAHGARRG--VQKVMVIVTD 172 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---------QAEAADQFLKNCAS---PD 340 GE + DN ++ + +AI + E + +K+ AS Sbjct: 173 GE----SHDNYRLQEVIDKCEDENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEK 228 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQ 365 F++V + L +G+ + Sbjct: 229 HFFNVSDELALVTIVEALGERIFAL 253 >gi|293569033|ref|ZP_06680345.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] gi|291588214|gb|EFF20050.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] Length = 1502 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 68/192 (35%), Gaps = 24/192 (12%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 D + Y V +P + +++ + + K +D++ VLD S Sbjct: 332 DYNGAYIKKWV-EPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSA 390 Query: 180 SMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--------SSKIVQ 229 SM++ K ++ EM + D + +R G+V F + + + Sbjct: 391 SMSELTAGTNSQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHEQIS 448 Query: 230 T--FPLAWGVQHIQ-EKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + FPL + + + L T T GL+ Y ++ + + + Sbjct: 449 SDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGGE------NRNPE 502 Query: 285 KYIIFLTDGENS 296 K +I + DG + Sbjct: 503 KILIVVGDGTPT 514 >gi|239627294|ref|ZP_04670325.1| von Willebrand factor [Clostridiales bacterium 1_7_47_FAA] gi|239517440|gb|EEQ57306.1| von Willebrand factor [Clostridiales bacterium 1_7_47FAA] Length = 681 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 92/258 (35%), Gaps = 37/258 (14%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP----LLITSSV 159 +I S++ + D N + RY++ N+ L++ Sbjct: 244 NIEQSADSSAHITLKDPADYGGNRDFILRYQLAGQTVNSGLMLNTGEKENFFLLMVQPPE 303 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++ +++ + + VLDVS SM FG +D A IR M+ ++ N Sbjct: 304 RVPAEAIPPREYIFVLDVSGSM---FGYPLD---TAKELIRNMVSNLRETDTFN------ 351 Query: 220 LVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 L+ FS+ ++ + V+ INR G T+ P LE A D+ Sbjct: 352 LILFSNDAIRMSARSLPATDENVERAINLINRQKGGGGTELAPALEKAVGIPMDS----- 406 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 G + ++ +TDG S ++++F ++ G+ ++ Sbjct: 407 ----GAGSVSRSVVVITDGYMSD-----EQAIFDIVAGNLDTTSFFSFGIGTSVNRYLIE 457 Query: 335 NCA--SPDRFYSVQNSRK 350 A + V +S + Sbjct: 458 GIARTGGGESFVVTDSSE 475 >gi|326669364|ref|XP_695742.5| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Danio rerio] Length = 3651 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 79/220 (35%), Gaps = 42/220 (19%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V++ + LD++ ++D S S+ G R +R+ML P R Sbjct: 66 VRLLRERGGCLDLVFLVDESSSV------GASNFKSELRFVRKMLSDFPVAP---EATRV 116 Query: 219 GLVTFSSKI-------VQTFPLAWGVQ--HIQEKINRLIF-GSTTKSTPGLEYAYNKIFD 268 LVTFSSK + P A + ++I + + G T + + A + Sbjct: 117 ALVTFSSKSHVVTRADYVSAPKAHQHKCSLFSKEIPSITYRGGGTYTRGAFQRAAQILRQ 176 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 ++E K I +TDG ++ + + RG ++ +G+ + Sbjct: 177 SRENA----------TKVIFLITDGYSNGG-----DPRPVAAALRERGVEIFTLGI-WQG 220 Query: 329 ADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + L AS + V N + F + + + + Sbjct: 221 NIRELHEMASQPKDQHCFFVHNFAE----FEALARRALHE 256 >gi|218672263|ref|ZP_03521932.1| hypothetical protein RetlG_11787 [Rhizobium etli GR56] Length = 256 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 32/237 (13%), Positives = 76/237 (32%), Gaps = 19/237 (8%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + G+ I+TA+L+ + G+ ++ +H ++ +L+ D + + + + + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSSAVA 67 Query: 69 GNNGKKQKNDFSYRII--KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 S +NI+ + EL E + + + + Sbjct: 68 AAMAMNGNGTISLGKTDARNIFMSQVSGELAE---------VHVDLGIDVTKTANKLNSQ 118 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 +S + F+ + I+ + ++ +D ++LD + SM Sbjct: 119 VSFTATVPTTFMQIF-------GRDSITISGTATAEYQTAAFMDFYILLDNTPSMGVGAT 171 Query: 187 PG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 P + KL T + KS + G+ + Q + Sbjct: 172 PSDVSKLEAKTGCAFACHQMDKSTNNYTIAKSLGVAMRIDVVRQATQALTDTAKTER 228 >gi|281339017|gb|EFB14601.1| hypothetical protein PANDA_010505 [Ailuropoda melanoleuca] Length = 1153 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 80/227 (35%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 139 VVNSIAPVRECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLERMDIGPKQTQ 190 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + N++I T + G++ A + F Sbjct: 191 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAANKIIQRGGRQTMTALGIDTARKEAFTEA 247 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + + ++I + Sbjct: 248 RGARRGVK------KVMVIVTDGE----SHDNHQLNKVIQDCEDENIQRFSIAILGSYNR 297 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 298 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFAL 344 >gi|169234588|ref|NP_001038425.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Danio rerio] gi|169154233|emb|CAH68946.2| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 1 (CACNA2D1) [Danio rerio] Length = 1069 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++D S S++ L + S+ EML+ + VN +V+F++ Sbjct: 251 DMLILVDASGSVSGL------TLKLIRTSVSEMLETLSDDDYVN------IVSFNNSAKS 298 Query: 230 TFPLAW-------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + ++E + ++ TT G + A+N++ + Sbjct: 299 VACFENLVQANVRNKKTLKEAVQKITANGTTDYKIGFKEAFNQLASMNVSRANCN----- 353 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKN--CASP 339 K I+ TDG + E + ++ V ++ C + Sbjct: 354 --KIIMLFTDGGEDKASEIFDEYNS------DKRVRIFTFSVGQHNYDKAPIQYMACHNK 405 Query: 340 DRFYSV 345 +Y + Sbjct: 406 GYYYEI 411 >gi|109077202|ref|XP_001094788.1| PREDICTED: integrin alpha-1 [Macaca mulatta] Length = 1179 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 157 VVNSIAPVRECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ 208 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + +++ T + G++ A + F Sbjct: 209 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEA 265 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + ++I + Sbjct: 266 RGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNR 315 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 316 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 362 >gi|325282943|ref|YP_004255484.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] gi|324314752|gb|ADY25867.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] Length = 535 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 72/236 (30%), Gaps = 34/236 (14%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P + P+ A++ ++ S + + + + VLDVS SM +L Sbjct: 317 PGMLVELPFPASAGTIDAILGSYL---NDVRRPANTIFVLDVSGSMEGK------RLEAL 367 Query: 196 TRSIREMLDIIKS----IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------IQEKI 244 ++ + S + R L+ FS + I Sbjct: 368 KAALGNLSGADTSLGWRFAAFADRERVTLIPFSGDVEAVRSFQVNKASRAADLQAIAAAG 427 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS-SPNIDNK 303 L G T L AY + A ++ +TDGE + P+++ Sbjct: 428 GALQAGGGTNIYGALSEAYRQAAAAPAGSYTS----------VVLMTDGEGTAGPSLNEF 477 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHDAFLRI 358 + A R + + + + A + R + Q + L AF I Sbjct: 478 RDFYAALPAGARSVKTFTVLFGDSDVQEMNEVAALTGGRTFDGQ--QNLAAAFKEI 531 >gi|47219204|emb|CAG11222.1| unnamed protein product [Tetraodon nigroviridis] Length = 4421 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 44/261 (16%), Positives = 78/261 (29%), Gaps = 20/261 (7%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 + L++ G S + Y L+ ++P I ++ Sbjct: 2251 ASSLKQQGVFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLK 2310 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + G D++ +LD S F D + ++R +D K V Sbjct: 2311 DTAMPPTVTVERQPRGKDVVFLLDGSDGTRSGFPAMRDFVQRVVETLR--VDDKKDRVSV 2368 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT--KSTPGLEYAYNKIFDAK 270 R V F T + I + + L + L+Y N +F A Sbjct: 2369 VQYSRDAAVHFYLNTYTTK------REILDALRGLRHKGGRALNTGEALQYLRNNVFTAS 2422 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + +I LT G +S L K+ G +++AIG + Sbjct: 2423 AGSRRT----EGVPQVLILLTGGRSSDSVDSPASDL------KQLGVLIFAIGSRGSDNR 2472 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + SP V L Sbjct: 2473 EIQRISHSPTSALVVPEFTDL 2493 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 44/261 (16%), Positives = 77/261 (29%), Gaps = 20/261 (7%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 + L++ G S + Y L+ ++P I ++ Sbjct: 336 ASSLKQQGVFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLK 395 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + G D++ +LD S F D + ++R +D K V Sbjct: 396 DTAMPPTVTVERQPRGKDVVFLLDGSDGTRSGFPAMRDFVQRLVETLR--VDDKKDRVSV 453 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT--KSTPGLEYAYNKIFDAK 270 R V F T + I + + L + L+Y N +F A Sbjct: 454 VQYSRDAAVHFYLNTYTTK------REILDALRGLRHKGGRALNTGEALQYLRNNVFTAS 507 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + +I LT G +S L K G +++AIG + Sbjct: 508 AGSRRT----EGVPQVLILLTGGRSSDSVDSPASDL------KPLGVLIFAIGSRGSDNR 557 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + SP V L Sbjct: 558 EIQRISHSPTSALVVPEFTDL 578 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/261 (16%), Positives = 78/261 (29%), Gaps = 20/261 (7%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 + L++ G S + Y L+ ++P I ++ Sbjct: 2804 ASSLKQQGVFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLK 2863 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + G D++ +LD S F D + ++R +D K V Sbjct: 2864 DTAMPPTVTVERQPRGKDVVFLLDGSDGTRSGFPAMRDFVQRVVETLR--VDDKKDRVSV 2921 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT--KSTPGLEYAYNKIFDAK 270 R V F T + I + + L + L+Y N +F A Sbjct: 2922 VQYSRDAAVHFYLNTYTTK------REILDALRGLRHKGGRALNTGEALQYLRNNVFTAS 2975 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + +I LT G +S L K+ G +++AIG + Sbjct: 2976 AGSRRTER----VPQLLILLTGGRSSDSVDSPASDL------KQLGVLIFAIGSRGSDNR 3025 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + SP V L Sbjct: 3026 EIQRISHSPTSALVVPEFTDL 3046 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 44/261 (16%), Positives = 77/261 (29%), Gaps = 26/261 (9%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 + L++ G S + Y L+ ++P I ++ Sbjct: 1372 ASSLKQQGVFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLK 1431 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 G D++ +LD S F D + ++R +D K V Sbjct: 1432 DTAMPPTVT------GKDVVFLLDGSDGTRSGFPAMRDFVQRVVETLR--VDDKKDRVSV 1483 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT--KSTPGLEYAYNKIFDAK 270 R V F T + I + + L + L+Y N +F A Sbjct: 1484 VQYSRDAAVHFYLNTYTTK------REILDALRGLRHKGGRALNTGEALQYLRNNVFTAS 1537 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + +I LT G +S L K+ G +++AIG + Sbjct: 1538 AGSRRT----EGVPQVLILLTGGRSSDSVDSPASDL------KQLGVLIFAIGSRGSDNR 1587 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + SP V L Sbjct: 1588 EIQRISHSPTSALVVPEFTDL 1608 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 69/227 (30%), Gaps = 20/227 (8%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 ++ + + + T + + + D++ +LD S F Sbjct: 1732 TNSSGSRRLQGVPQMLILLNGGRSYDSVDTPASSLKQQDQQEKDVVFLLDGSDGTRSGFP 1791 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 D + ++R +D K V R V F T + I + + Sbjct: 1792 AMRDFVQRVVETLR--VDDKKDRVSVVQYSRDAAVHFYLNTYTTK------REILDALRG 1843 Query: 247 LIFGSTT--KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 L + L+Y N +F A + + +I LT G +S Sbjct: 1844 LRHKGGRALNTGEALQYLRNNVFTASAGSRRT----EGVPQVLILLTGGRSSDSVDSPAS 1899 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 L K+ G +++AIG + + + SP V L Sbjct: 1900 DL------KQLGVLIFAIGSRGSDNREIQRISHSPTSALVVPEFTDL 1940 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 60/184 (32%), Gaps = 20/184 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S F D + ++R +D K V R V F Sbjct: 1 DVVFLLDGSDGTRSGFPAMRDFVQRVVETLR--VDDKKDRVSVVQYSRDAAVHFYLNTYT 58 Query: 230 TFPLAWGVQHIQEKINRLIFGSTT--KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 T + I + + L + L+Y N +F A + + + Sbjct: 59 TK------REILDALRGLRHKGGRALNTGEALQYLRNNVFTASAGSRRT----EGVPQVL 108 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I LT G +S L K+ G +++AIG + + + SP V Sbjct: 109 ILLTGGRSSDSVDSPASDL------KQLGVLIFAIGSRGSDNREIQRISHSPTSALVVPE 162 Query: 348 SRKL 351 L Sbjct: 163 FTDL 166 >gi|116624980|ref|YP_827136.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228142|gb|ABJ86851.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 331 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 76/209 (36%), Gaps = 28/209 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ L +++++D S S D ++R +L + N + L TF Sbjct: 79 QTEQKLSVVLLVDTSGSTAKELKYESD---SSSRFFHVLL------GEGNPEDMAALYTF 129 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + +I + P + ++ ++ + + T + A ++ D Sbjct: 130 NWEIREQQPFSRDLRAFDNRLKMMHGEAGTAMYDAVYLAAQRL------------EPRDG 177 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCA- 337 +K I+ +TDG ++ + +++L A + + + +A L A Sbjct: 178 RKVIVVVTDGGDTVSRLSVQKALEAAQLADAVIYAIVVVPITNDAGRNIGGEHALDFMAK 237 Query: 338 -SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + R + +L AF I E+ Q Sbjct: 238 GTGGRIFMPTLGAELDKAFADIITELRTQ 266 >gi|261854814|ref|YP_003262097.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261835283|gb|ACX95050.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 339 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 80/234 (34%), Gaps = 30/234 (12%) Query: 144 WCANSSHAPLLITSSV---KISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSI 199 W A+ + + +T V + +++++D S SM + F G D+ ++ Sbjct: 79 WLASYAALVIALTHPVWQGEFLPAPPPARSILLLVDASPSMQAEDFPAGKDRFIARIDAM 138 Query: 200 R-EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKS 255 + +L I + P R ++ ++ P+ + I +L GS T Sbjct: 139 KQGLLRFIAARPQ----DRFSVIVVTNSAGTLVPMTTDHAVLDYWIRQLRAGINGSDTAL 194 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 GL A I + + ++ TDG ++ + E+L A+ Sbjct: 195 GDGLAMAIRSIAAQSQAGQ--------PAPLLVVWTDGFSTGGLMTPAEALAL---ARAY 243 Query: 316 GAIVYAI-----GVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 G ++ + G + L A + + + ++ +I + Sbjct: 244 GIKLFTVNLAPKGSPPDQGQPSLAQLADLTGGKPILASDLAAMNAVTDQIAASV 297 >gi|224368584|ref|YP_002602747.1| hypothetical protein HRM2_14740 [Desulfobacterium autotrophicum HRM2] gi|223691300|gb|ACN14583.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 222 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 77/210 (36%), Gaps = 18/210 (8%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++++ + ++++ D S SM+ K+ +++R+++D +N + Sbjct: 3 KFAARTARPIPVIILADTSGSMSVD-----GKIDAMNQALRDLIDTFSGESRLNAEIHLS 57 Query: 220 LVTFS-SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 ++TF + PL H + L T A I D ++ Sbjct: 58 VITFGGDGAKEHLPLT--CAHTISGFSDLQAHGMTPMGGAFRIAKELIEDKEK------I 109 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 Y+ I+ ++DG + SL A++ A+ + ++A + LK+ + Sbjct: 110 PSRAYRPVIVLVSDGYPNDDWEAAFSSLRGSERAQKA--TRMAMAIGSDADENMLKDFIN 167 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMVKQR 366 + +R + F + + + Sbjct: 168 DPETPVFRANGARDIIRFFRAVSMSVTSRS 197 >gi|18042139|gb|AAL57848.1|AF454755_1 vitrin [Mus musculus] Length = 650 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 71/202 (35%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 467 DIGFVIDGSSSV------GTSNFRTVLQFVANL---SKEFEISDTDTRVGAVQYTYEQR- 516 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 517 ---LQFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 564 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A ++G I YAIG+ A D+ P D Sbjct: 565 KRKVMIIITDGRSYD------DVRIPAMAAYQKGVITYAIGIAWAAQDELEVMATHPAKD 618 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V + L+ RI + + Sbjct: 619 HSFFVDDFDNLYKIAPRIIQNI 640 >gi|119575262|gb|EAW54867.1| hCG2002731, isoform CRA_d [Homo sapiens] Length = 768 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 157 VVNSIAPVQECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ 208 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + +++ T + G++ A + F Sbjct: 209 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEA 265 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + ++I + Sbjct: 266 RGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNR 315 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 316 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 362 >gi|296446540|ref|ZP_06888482.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] gi|296255894|gb|EFH02979.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] Length = 333 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 77/216 (35%), Gaps = 30/216 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G D++ ++D S SMN+ F + R +L D R G+V F Sbjct: 80 GEGADVVFLIDRSGSMNETFAGRTPSGSEESKASAARRLLQ---GFVDRRGHDRVGVVGF 136 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ + P++ + I I+ + T GL A I Sbjct: 137 STAPMLLMPMSDHREAIAAAIDAVDRPGLDYTNIGRGLAMALALIGSGAPDRSRA----- 191 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY--------AIGVQAEAADQFL 333 I+ ++DG + ID + E K+ +Y + G+ + +F Sbjct: 192 -----IVLVSDG---AGVIDPRIQDDLRAEMKKANVNLYWLFLRTAGSAGIYDKPDPEFD 243 Query: 334 KNCASPDRFYSV--QNSRKLHDAFLRIGKEMVKQRI 367 A+P+R + ++ R + AF G E +Q I Sbjct: 244 TPQAAPERHLDLFFKSLRVPYRAFEAEGPEATEQAI 279 >gi|194016356|ref|ZP_03054970.1| YwmC [Bacillus pumilus ATCC 7061] gi|194011829|gb|EDW21397.1| YwmC [Bacillus pumilus ATCC 7061] Length = 233 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 72/228 (31%), Gaps = 15/228 (6%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F T S + + K + + + ++LD S SM G K +A Sbjct: 7 FTLVTLAVLTLSMSISSPVFAKASTVKKHNKDVRVTILLDASGSMARKVE-GERKFDLAK 65 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTF-----SSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + + + + + +R L +S Q+ + GV +Q S Sbjct: 66 QEVFKFAQSL----PKDAKIRMSLFGSEGNNKNSGKAQSCEVIRGVYGVQPYEKESFENS 121 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL-TDGENSSPNIDNKESLFYCN 310 + P + I A E + + ++ K+I++L TDGE + K + N Sbjct: 122 LNELGP---NGWTPIARALEHAKQADEQLNNGTKHIVYLITDGEETCGGDPVKVAKELHN 178 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLR 357 V + Q + A +Y +++ Sbjct: 179 SKGSTVVNVIGLDFNDGYEGQLKQVAKAGKGHYYQASTGKEMGSILSA 226 >gi|297287371|ref|XP_001099130.2| PREDICTED: collagen alpha-2(VI) chain-like isoform 3 [Macaca mulatta] Length = 1029 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 68/213 (31%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I +H K Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKSLQGISSFRRGTFTDCALANMTEQI------RQHGTK 151 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 152 GTVH---FAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNRNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 622 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 678 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 679 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 735 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 736 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 770 >gi|197336765|ref|YP_002158569.1| transporter [Vibrio fischeri MJ11] gi|197314017|gb|ACH63466.1| transporter [Vibrio fischeri MJ11] Length = 591 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 58/169 (34%), Gaps = 18/169 (10%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMD 190 ++P + L+ S G D +++VLD S SM + Sbjct: 71 QLPLKLLFIVIFLSI----LICAGPTWQKQASPFGEDKAPLLIVLDTSNSMLEKDVLPNR 126 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 + + I+ + + + ++GL+ +S PL + + Sbjct: 127 LIRAKQK--------IQDLIALRDGGKTGLIVYSGTAHLAMPLTQDSAVFSPYLAAIEPK 178 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + AY + K++ ++K + + +I LTDG +S N Sbjct: 179 IMPVEG---KSAYKTLPLIKQQFSTLSKSNLPVRGTVILLTDGVTTSDN 224 >gi|167623667|ref|YP_001673961.1| cell wall anchor domain-containing protein [Shewanella halifaxensis HAW-EB4] gi|167353689|gb|ABZ76302.1| LPXTG-motif cell wall anchor domain [Shewanella halifaxensis HAW-EB4] Length = 850 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 77/199 (38%), Gaps = 42/199 (21%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S I ++++V+D S SM+ D + A +++ L ++ N ++ F+ Sbjct: 451 SSIARELVLVIDTSGSMSG------DAIIQAKSALKYALAGLRPQDSFN------VLQFN 498 Query: 225 SKIVQTF-----PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S + + A + Q IN L T+ + L+ A K+ + H +K Sbjct: 499 STVERWSRHVMPATAINLGRAQNYINGLQADGGTEMSLALDAALTKLDN---DRGHNSKP 555 Query: 280 HDDYKKY-----------------IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 D +Y ++F+TDG ++ + ++ E++ ++ I Sbjct: 556 VHDDDRYQSSNETLEQSAATPLRQVLFITDGAVANESRLFEQIKNQLGESR-----LFTI 610 Query: 323 GVQAEAADQFLKNCASPDR 341 G+ + F++ A R Sbjct: 611 GIGSAPNAHFMQRAAEVGR 629 >gi|90406967|ref|ZP_01215158.1| putative RTX toxin [Psychromonas sp. CNPT3] gi|90312009|gb|EAS40103.1| putative RTX toxin [Psychromonas sp. CNPT3] Length = 3350 Score = 56.0 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 60/298 (20%), Positives = 112/298 (37%), Gaps = 45/298 (15%) Query: 47 KLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDIN 106 ++H + D +++ ATK ++E K + + + +I + +N N F DI Sbjct: 2664 EIHDVPDAAIVIGATK--SEEGVWVIKIDEGQSDFDGVVSIRLPEDQN----NMFTLDIK 2717 Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 +T + + Q+ +V+ + L+I+ + Sbjct: 2718 --VTATEQNDNENGQNTTSTTQSVTGTPI----------IVEETVNLVISEPEIAETN-- 2763 Query: 167 IGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +++VLD+S SMN G ++L A ++ +L+ I + VV LV F + Sbjct: 2764 ----LILVLDISGSMNGSIEGSDQNRLDFAKTALSNLLE----IQNTLGVVNVNLVAFEN 2815 Query: 226 KIVQTFPLAW-----GVQHIQEKINRLIFG--STTKSTPGLEYAYNKIFDAKEKLEHIAK 278 I + + G+ I + I L T L+ E + +A Sbjct: 2816 NIFSSHWVTLDGGPEGLASILQYIENLNADAYGGTNYQDALKTV------MSEFEQGVAS 2869 Query: 279 GHDDYKK--YIIFLTDGENSSPNIDNKESLFYCNEAKRRGA-IVYAIGVQAEAADQFL 333 G D K I+FL+DG+ I+N + + + +G+Q A DQ L Sbjct: 2870 GDIDVSKDTNIVFLSDGKPGQSIINNPVEQEWNDFTSNYNIDSINTVGIQISAGDQVL 2927 >gi|301788660|ref|XP_002929747.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Ailuropoda melanoleuca] Length = 946 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 73/201 (36%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + ++ F+ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAEDQFS------VIDFNHNVRTW 358 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + ++ I ++ T L A + +A + Sbjct: 359 RNDLVSATKTQIVDAKKYIEKIQPSGGTNINEALLRAIFILNEANNLGMLDPESVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + + R ++++G+ + FLK ++ +R Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKQNIRDNIALFSLGIGFDVDYDFLKRLSNENRGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|94968893|ref|YP_590941.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94550943|gb|ABF40867.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 628 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/159 (20%), Positives = 60/159 (37%), Gaps = 22/159 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++ + L + +V+D S S+ F D + E L + + P+ V V F Sbjct: 393 EAQLPLRIGLVIDTSASIAGRFKFEQD-------AAGEFLQRVLTGPEDLGFV----VGF 441 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S+ I+ + I I T + +A K+ E+ Sbjct: 442 SNSILMAQDFTHDSKQIAHSIQAFAPSGGTALWDAVNFAAEKLASHPER--------QPV 493 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 K +I ++DGE++S K+++ A+ VYAI Sbjct: 494 AKILIVISDGEDNSSATTAKQAI---QRAQSEEVAVYAI 529 >gi|332254886|ref|XP_003276564.1| PREDICTED: integrin alpha-1 [Nomascus leucogenys] Length = 1179 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 157 VVNSIAPVRECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ 208 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + +++ T + G++ A + F Sbjct: 209 ---VGIVQYGENVTHEFNLNKYSSTKEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEA 265 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + ++I + Sbjct: 266 RGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNR 315 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 316 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 362 >gi|258615515|ref|ZP_05713285.1| hypothetical protein EfaeD_07377 [Enterococcus faecium DO] gi|293563519|ref|ZP_06677967.1| Bee1, putative [Enterococcus faecium E1162] gi|294622786|ref|ZP_06701740.1| Bee1, putative [Enterococcus faecium U0317] gi|291597744|gb|EFF28882.1| Bee1, putative [Enterococcus faecium U0317] gi|291604521|gb|EFF34007.1| Bee1, putative [Enterococcus faecium E1162] Length = 1344 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 68/192 (35%), Gaps = 24/192 (12%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 D + Y V +P + +++ + + K +D++ VLD S Sbjct: 174 DYNGAYIKKWV-EPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSA 232 Query: 180 SMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--------SSKIVQ 229 SM++ K ++ EM + D + +R G+V F + + + Sbjct: 233 SMSELTAGTNSQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHEQIS 290 Query: 230 T--FPLAWGVQHIQ-EKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + FPL + + + L T T GL+ Y ++ + + + Sbjct: 291 SDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGGE------NRNPE 344 Query: 285 KYIIFLTDGENS 296 K +I + DG + Sbjct: 345 KILIVVGDGTPT 356 >gi|119575261|gb|EAW54866.1| hCG2002731, isoform CRA_c [Homo sapiens] Length = 766 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 157 VVNSIAPVQECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ 208 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + +++ T + G++ A + F Sbjct: 209 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEA 265 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + ++I + Sbjct: 266 RGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNR 315 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 316 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 362 >gi|69244819|ref|ZP_00603043.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257882064|ref|ZP_05661717.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257889959|ref|ZP_05669612.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260560224|ref|ZP_05832401.1| von Willebrand factor [Enterococcus faecium C68] gi|314947791|ref|ZP_07851198.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] gi|68196173|gb|EAN10603.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257817722|gb|EEV45050.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257826319|gb|EEV52945.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260073791|gb|EEW62116.1| von Willebrand factor [Enterococcus faecium C68] gi|313645771|gb|EFS10351.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] Length = 1345 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 68/192 (35%), Gaps = 24/192 (12%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 D + Y V +P + +++ + + K +D++ VLD S Sbjct: 175 DYNGAYIKKWV-EPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSA 233 Query: 180 SMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--------SSKIVQ 229 SM++ K ++ EM + D + +R G+V F + + + Sbjct: 234 SMSELTAGTNSQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHEQIS 291 Query: 230 T--FPLAWGVQHIQ-EKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + FPL + + + L T T GL+ Y ++ + + + Sbjct: 292 SDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGGE------NRNPE 345 Query: 285 KYIIFLTDGENS 296 K +I + DG + Sbjct: 346 KILIVVGDGTPT 357 >gi|156402981|ref|XP_001639868.1| predicted protein [Nematostella vectensis] gi|156226999|gb|EDO47805.1| predicted protein [Nematostella vectensis] Length = 240 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 72/228 (31%), Gaps = 32/228 (14%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 +M + V+ ++ +++ ++D S S+ Sbjct: 2 KMSSFLLLAAIISCMLARKAYGVELVEPYNRCYGQVELGFIVDGSRSIE----------A 51 Query: 194 VATRSIREMLDIIKSIPDV----NNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL 247 A + + MLD + I ++ R G+ +S+ P +Q E I +L Sbjct: 52 SACGNFKRMLDFTQRIASGFGIASSQTRVGVGLYSTFASVPIPFGKYTSLQETVEGIKKL 111 Query: 248 I-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 G T++ L+ +F H K +I LTDG + Sbjct: 112 RYPGEGTRTGRALKLMKTHLFSQSRPKAH---------KVLIVLTDG------TSVDDVK 156 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 + G V+A+G+ + LK+ A+ R L Sbjct: 157 APAKALRESGVEVFAVGIGEHYRPRELKDIATDTGHVLTAGFRDLMSV 204 >gi|326789198|ref|YP_004307019.1| hypothetical protein Clole_0061 [Clostridium lentocellum DSM 5427] gi|326539962|gb|ADZ81821.1| Protein of unknown function DUF3520 [Clostridium lentocellum DSM 5427] Length = 670 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 76/207 (36%), Gaps = 29/207 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LDVS SM+D +KL + +S + +K ++ VV +G +V Sbjct: 170 LVFLLDVSGSMSD-----TNKLPLLKKSFNILTSNLKESDCISIVVYAGASG----VVLD 220 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I E + L G +T G+ AY +H K ++ +I Sbjct: 221 GVAGNDESLINEALESLEAGGSTAGAEGIAMAYEL------AEKHFIKDGNNR---VILA 271 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQN 347 TDG+ + + + + + +G + +G+ D +++ A + + + Sbjct: 272 TDGDFNVGPNSESDLIRIIEKKREKGIFLSVLGLGMGNYKDDKMESLADHGNGNYAYIDS 331 Query: 348 S--------RKLHDAFLRIGKEMVKQR 366 +L I K++ Q Sbjct: 332 LQEAKKVLGEQLTGTLFTIAKDVKIQV 358 >gi|312092300|ref|XP_003147289.1| hypothetical protein LOAG_11723 [Loa loa] gi|307757546|gb|EFO16780.1| hypothetical protein LOAG_11723 [Loa loa] Length = 422 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 64/187 (34%), Gaps = 29/187 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D+S + N + ++ S+ R L+TFSS Sbjct: 233 DVVLVMDLSTTTNPIYRKYIEMAEELVNSLVI----------GRRFSRIALITFSSVGKS 282 Query: 230 TFPLAWGV----QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + I I RL G TT G+ + ++ +H + + K Sbjct: 283 RTQFNLDRYFDGKDIVTAIRRLESSGGTTAIGEGIR-----LGTEQKDKQHGGRPVEIAK 337 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFY 343 K ++ TDG ++ + + AK G +Y I + SP Y Sbjct: 338 KIMLVFTDGWSNKGPDVEEMTR----NAKGAGFTLYTIVYEGNGRVD----ANSPGLNLY 389 Query: 344 SVQNSRK 350 +++ Sbjct: 390 TIETMVD 396 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 49/144 (34%), Gaps = 25/144 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L+++++LD S S+ F + + + + + P + R L+ +S Sbjct: 23 LNVLVILDRSDSVKGGFN-------KSRNFVVNVSEELDIGPSTH---RVALIVYSGLSY 72 Query: 229 QTFPLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + W ++ +++ G TT + LE + ++ Sbjct: 73 RREVFKWNFAKSNDEFKKIVLGLRAIGGTTNTKKALELGLELM---------DSRNKSIP 123 Query: 284 KKYIIFLTDGENSSPNIDNKESLF 307 ++ TDG ++ + L Sbjct: 124 TLIMVV-TDGRSADDPKIPAQQLQ 146 >gi|293604651|ref|ZP_06687053.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292816982|gb|EFF76061.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 3744 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 40/265 (15%), Positives = 77/265 (29%), Gaps = 20/265 (7%) Query: 46 AKLHYILDHSLLYTATKILNQENGNNGKK------QKNDFSYRIIKNIWQTDFRNELREN 99 L ++ L T LN + N T ++R+N Sbjct: 2958 NSLQSVIISRLPTDGTLTLNGNPVTVNTAVSAADIAAGKLVFTPSANGLDTSIGFQVRDN 3017 Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 G T ++ D P + + + L Sbjct: 3018 GGTDHGGQNTSGTYNFVLNTDNIVTGENVGSGTGNTPVL-----NGGSGNDIILGDKGGT 3072 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHF-----GPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ + ++ +V+D S SM G G ++ + ++ + + + + N Sbjct: 3073 VVTVEPGKNYNIALVVDTSGSMAYKLDGSTNGSGQSRIALVKDALTNLANQLVGHDGIVN 3132 Query: 215 VVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 V G T + V L VQ + I L T A + Sbjct: 3133 VTLIGFATTAGTPVTLQNLTSANVQTLLTAITNLSATGGTNYEAAFNSAVSWFNSQTAAG 3192 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSP 298 + +A G+++ FLTDG+ + Sbjct: 3193 KSVAAGYENVT---FFLTDGDPTYY 3214 >gi|298491707|ref|YP_003721884.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233625|gb|ADI64761.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 426 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 70/225 (31%), Gaps = 36/225 (16%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + SS+ + L++ ++LD S SM G ++ + A + L I Sbjct: 27 ISISSIADELDPSLPLNLCLILDKSGSM---HGEPINTVIQAVEQLLAQLQPGDHIS--- 80 Query: 214 NVVRSGLVTFSSKIVQTFP--LAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAK 270 +V F+ P + + I+ ++ RL G T GL ++ Sbjct: 81 ------IVAFAGTSEVIIPNQIVQDAESIKCQLHKRLKAGGGTIIAEGLSLGITELLKGT 134 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSS---------PNIDNKESLFYCNEAKRRGAIVYA 321 + A LTDG D K L +A R + Sbjct: 135 KGAVSQA----------FLLTDGHGDRGLKIWKWEMGPNDKKRCLELAQKATRVSLTLNT 184 Query: 322 IGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 G + L+ A ++ ++ D F R+ K + Sbjct: 185 FGFGNDWNQDLLEKIADAGGGTLAYIERPQQAVDQFSRLLKRIQS 229 >gi|260837294|ref|XP_002613640.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] gi|229299026|gb|EEN69649.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] Length = 216 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 75/188 (39%), Gaps = 26/188 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +++ S S++ + + ++ P + G+V +S I Q Sbjct: 4 DILFLVEGSRSVSAL------EFEKMKTFLNNIVGQFDIGP---TATQVGVVQYSWFIRQ 54 Query: 230 TFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + +Q+ I+ + + G T + L +A N A + A+ K Sbjct: 55 ECALNAHSSLASLQQAISNITVLGLGTHTGAALTFARNTALTA----ANGARPGVP--KI 108 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCA-SPDRFYS 344 ++ +TDG + ++L + G I +AI V D+ L++ A SPDR ++ Sbjct: 109 VVVMTDGASEDDVTLPSQNL------RNDGVITFAISVSWSLPNDRLLQDIAGSPDRIFA 162 Query: 345 VQNSRKLH 352 + L Sbjct: 163 ATDFDALD 170 >gi|149636044|ref|XP_001506552.1| PREDICTED: similar to collagen type XX alpha 1 [Ornithorhynchus anatinus] Length = 1500 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 60/182 (32%), Gaps = 33/182 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 S +D++ ++D S S+ + + ++ D + GL + Sbjct: 230 NSSAPVDIIFLVDGSWSIGRS------NFRLVREFLASLISPFNIARDK---ISIGLSQY 280 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 S W + K L G T + L + + L+ Sbjct: 281 SGDPRTE----WDLNKFASKDKVLEAVRNLRYKGGNTFTGLALTHVLE------QNLKLE 330 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A + K +I LTDG++ E+ K G ++AIGV+ + L+ Sbjct: 331 AGPRPEADKIVILLTDGKSQD------EANAAAQALKDLGISIFAIGVKNADEAE-LRQV 383 Query: 337 AS 338 AS Sbjct: 384 AS 385 >gi|308466921|ref|XP_003095711.1| hypothetical protein CRE_10578 [Caenorhabditis remanei] gi|308244476|gb|EFO88428.1| hypothetical protein CRE_10578 [Caenorhabditis remanei] Length = 637 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 61/170 (35%), Gaps = 45/170 (26%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D+S D +D A S+ P VR GL+++S Sbjct: 458 DVFFLVDLSQGTGDKSQQYLDIAASAISSL----------PISQEAVRVGLISYSGP--- 504 Query: 230 TFPLAWGVQHIQEKINRLIF--------------GSTTKSTPGLEYAYNKIFDAKEKLEH 275 G H++ +++ G TT++ + YA E + H Sbjct: 505 ------GRTHVRVYLDKHNEKEKLIEEMFLMERHGGTTRTADAIRYATKIF----EGMAH 554 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 A+ + KK ++ TDG + D A+ +G + A+ V+ Sbjct: 555 PAR--RNVKKVLVVFTDGYSQDSPRDAARV------ARAKGLQLIAVAVK 596 >gi|332088403|gb|EGI93521.1| von Willebrand factor type A domain protein [Shigella boydii 5216-82] Length = 575 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 49/336 (14%), Positives = 101/336 (30%), Gaps = 45/336 (13%) Query: 39 SHKFFVKAKLHYILDHS-LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR 97 ++ K L L + A K N G + F +K + Q Sbjct: 67 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANLGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 98 ENGFAQDINNIE----------RSTSLSIIID--------DQHKDYNLSAVSRYEMPFIF 139 + N + + I++ + S + M + Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 PW + + I + S+ +++ ++D S SM ++L + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTP 257 + ++ ++ + +VT++ P G I I+ L +T Sbjct: 241 KLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GLE AY + KG + I+ TDG+ + D K + + G Sbjct: 295 GLELAYQQAAKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 Query: 318 IVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRK 350 + GV + + + A + + + Sbjct: 346 TLSTFGVGDDNYNEAMMVRIADVGNGNYSYIDTLSE 381 >gi|323345326|ref|ZP_08085549.1| aerotolerance protein BatB [Prevotella oralis ATCC 33269] gi|323093440|gb|EFZ36018.1| aerotolerance protein BatB [Prevotella oralis ATCC 33269] Length = 340 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 64/210 (30%), Gaps = 34/210 (16%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+S + G++ ++ LD+S SM +L + + ++D + + Sbjct: 80 TKVSHEKRNGIEAIIALDISNSMMAEDVTP-SRLAKSKLLVENLVDNFTN-------DKI 131 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 GL+ F+ P+ + + LI T + A Sbjct: 132 GLIVFAGDAFVQLPITSDYVSAKMFLQNIDPSLIATQGTDIAGAINLASKSFTQ------ 185 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 D K II +TDGE+ +G ++ +G+ + Sbjct: 186 -----QDKVGKAIIVITDGEDHEGGAIEAAKAAR-----AKGYNIFILGIGSTNGAPI-- 233 Query: 335 NCASPDRFYSV---QNSRKLHDAF-LRIGK 360 A+ KL++ I + Sbjct: 234 PMANGGYLQDASGQTVMTKLNEQMCKEIAQ 263 >gi|152993598|ref|YP_001359319.1| hypothetical protein SUN_2020 [Sulfurovum sp. NBC37-1] gi|151425459|dbj|BAF72962.1| hypothetical protein [Sulfurovum sp. NBC37-1] Length = 940 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 77/204 (37%), Gaps = 36/204 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK----- 226 +++ D S SM G+ K+ +A +++ ++ N V GL + + Sbjct: 28 VIIFDASGSM-WGQINGVTKIEIARDALKNVVREW------NPNVELGLTVYGHRSKGDC 80 Query: 227 --IVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I P+ + + + + ++ T + L A ++ +EK Sbjct: 81 NDIEVVIPIGKVDKKRVIDTVMKIKPKGKTPISRSLRKAAGELKYTEEKAT--------- 131 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLKNC---AS 338 II ++DG+ + + + E K+ G + + +G + C A+ Sbjct: 132 ---IILISDGKETC----DPDPCATAKELKKEGIDFVAHVVGFNVDKKTDKQLECIANAT 184 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEM 362 ++S +N+ L+ A I K++ Sbjct: 185 GGEYFSAKNAAALNKAMKTIVKKV 208 >gi|119606782|gb|EAW86376.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] gi|119606783|gb|EAW86377.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] Length = 947 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 70/204 (34%), Gaps = 31/204 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N +R+ Sbjct: 311 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRT------W 358 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + V + I ++ T L A + +A Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRR---GAIVYAIGVQAEAADQFLKNCASPDR- 341 II ++DG+ + + K S K ++++G+ + FLK ++ + Sbjct: 415 LIILVSDGDPTVGKCELKLSKIQ-KNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHG 473 Query: 342 -----FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 474 IAQRIYGNQDTSSQLKKFYNQVST 497 >gi|332817190|ref|XP_003309914.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Pan troglodytes] Length = 1241 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 339 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 386 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 387 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 441 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 442 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 494 Query: 340 DRFYSV 345 ++ + Sbjct: 495 GYYFEI 500 >gi|332216457|ref|XP_003257368.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Nomascus leucogenys] Length = 1094 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 226 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 273 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 274 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 328 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 329 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 381 Query: 340 DRFYSV 345 ++ + Sbjct: 382 GYYFEI 387 >gi|297671247|ref|XP_002813757.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Pongo abelii] Length = 1074 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 222 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 269 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 270 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 324 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 325 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 377 Query: 340 DRFYSV 345 ++ + Sbjct: 378 GYYFEI 383 >gi|297671245|ref|XP_002813756.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Pongo abelii] Length = 1081 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 222 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 269 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 270 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 324 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 325 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 377 Query: 340 DRFYSV 345 ++ + Sbjct: 378 GYYFEI 383 >gi|297285706|ref|XP_001090735.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Macaca mulatta] Length = 1417 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 558 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 605 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 606 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 660 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 661 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 713 Query: 340 DRFYSV 345 ++ + Sbjct: 714 GYYFEI 719 >gi|296225305|ref|XP_002758277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Callithrix jacchus] Length = 1251 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 261 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 308 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 309 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 363 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 364 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 416 Query: 340 DRFYSV 345 ++ + Sbjct: 417 GYYFEI 422 >gi|291290994|ref|NP_001167522.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform c [Homo sapiens] Length = 1150 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 291 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 338 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 339 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 393 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 394 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 446 Query: 340 DRFYSV 345 ++ + Sbjct: 447 GYYFEI 452 >gi|288925757|ref|ZP_06419688.1| BatB protein [Prevotella buccae D17] gi|288337412|gb|EFC75767.1| BatB protein [Prevotella buccae D17] Length = 342 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 57/180 (31%), Gaps = 32/180 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++ G++ ++ LD+S SM D +DK + S+ + Sbjct: 80 AEVQRDKRNGIEAIICLDISNSMLAQDVAPSRLDKSKLLVESLVDRFTN----------D 129 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + GL+ F+ P+ + + LI T + + A + Sbjct: 130 KIGLIVFAGDAYVQLPITSDYVSAKMFLQNIDPSLIQTQGTDIAQAINLGLHSFTQADKI 189 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + II +TDGE+ + +++G V+ +GV Sbjct: 190 G-----------RAIIVITDGEDHEGGAVEAAAEA-----RKKGVNVFILGVGDTKGAPI 233 >gi|194221273|ref|XP_001915997.1| PREDICTED: similar to Voltage-dependent calcium channel subunit alpha-2/delta-2 precursor (Voltage-gated calcium channel subunit alpha-2/delta-2) [Equus caballus] Length = 1127 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 297 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 344 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 345 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 399 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 400 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 452 Query: 340 DRFYSV 345 ++ + Sbjct: 453 GYYFEI 458 >gi|170574976|ref|XP_001893043.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158601129|gb|EDP38122.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 664 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/146 (20%), Positives = 49/146 (33%), Gaps = 18/146 (12%) Query: 218 SGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEK 272 L+T+S + F + +N L TT + L AY + D Sbjct: 1 LALITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLTD--TD 58 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAA 329 E+ + KK II TDG + D L K +G ++AI + Sbjct: 59 NENGVREG--VKKMIIIFTDGHSQRSPQDMALRL------KDKGVEIFAITLTPAPYADE 110 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAF 355 + L + D ++ N + F Sbjct: 111 GELLSITQNTDHIFTPVNLKDFEIKF 136 >gi|119585523|gb|EAW65119.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Homo sapiens] Length = 1146 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 291 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 338 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 339 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 393 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 394 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 446 Query: 340 DRFYSV 345 ++ + Sbjct: 447 GYYFEI 452 >gi|119585524|gb|EAW65120.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_c [Homo sapiens] Length = 664 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 291 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 338 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 339 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 393 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 394 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 446 Query: 340 DRFYSV 345 ++ + Sbjct: 447 GYYFEI 452 >gi|2781441|gb|AAB96914.1| alpha 2 delta calcium channel subunit isoform II [Homo sapiens] Length = 1076 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 222 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 269 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 270 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 324 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 325 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 377 Query: 340 DRFYSV 345 ++ + Sbjct: 378 GYYFEI 383 >gi|54112394|ref|NP_006021.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform b [Homo sapiens] gi|7414316|emb|CAB86192.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] gi|119585522|gb|EAW65118.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Homo sapiens] Length = 1143 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 291 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 338 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 339 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 393 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 394 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 446 Query: 340 DRFYSV 345 ++ + Sbjct: 447 GYYFEI 452 >gi|54112392|ref|NP_001005505.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform a [Homo sapiens] gi|2781439|gb|AAB96913.1| alpha 2 delta calcium channel subunit isoform I [Homo sapiens] gi|3043640|dbj|BAA25484.1| KIAA0558 protein [Homo sapiens] gi|3695006|gb|AAC70914.1| putative tumor suppressor gene 26 protein alpha 2 delta calcium channel subunit [Homo sapiens] gi|119585525|gb|EAW65121.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_d [Homo sapiens] gi|156230959|gb|AAI52439.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Homo sapiens] gi|168267416|dbj|BAG09764.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 isoform b [synthetic construct] Length = 1145 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 291 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 338 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 339 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 393 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 394 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 446 Query: 340 DRFYSV 345 ++ + Sbjct: 447 GYYFEI 452 >gi|74725352|sp|Q9NY47|CA2D2_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|7414318|emb|CAB86193.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] Length = 1150 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 70/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 291 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 338 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 339 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 393 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ CA+ Sbjct: 394 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANK 446 Query: 340 DRFYSV 345 ++ + Sbjct: 447 GYYFEI 452 >gi|302336645|ref|YP_003801851.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301633830|gb|ADK79257.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 474 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 45/221 (20%), Positives = 83/221 (37%), Gaps = 27/221 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNN 214 S+ + GL +++++D S SM D G L T + + + S + Sbjct: 82 SLAPNPHESQGLSILLLMDNSGSMYDTLSGDPTGDPALMRTTYARNALRTFVGSSFHAGD 141 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 V TF++ +V A + ++ + T +S L Y+ + D Sbjct: 142 SV--SFATFNTNVVLHADEAGDPVVMDMLLSGIRRPGTDESYTEL---YHALADMA---- 192 Query: 275 HIAKGHDDYKKYIIFLTDGE---------NSSPNIDNKE--SLFYCNEAKRRGAIVYAIG 323 + G ++ +I L+DGE N P N++ E R G +YAI Sbjct: 193 -LPVGERSGRRAVIVLSDGEDYSYATHSGNPHPIYGNQQLSPDEVVEEYIRNGVTLYAIH 251 Query: 324 VQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 E DQ+L A + Y ++ +L + I +++ Sbjct: 252 FGLEK-DQYLGEMALKTGGAVYDAKDQEELTGIYHDIRQKI 291 >gi|293377912|ref|ZP_06624093.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] gi|292643459|gb|EFF61588.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] Length = 1498 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/153 (22%), Positives = 64/153 (41%), Gaps = 25/153 (16%) Query: 161 ISSKSD--IGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 I S+ +D++ VLD S SMN+ G G K ++ E+ + + S P+++ + Sbjct: 370 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 427 Query: 217 RSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFG----------STTKSTPGLEYAY 263 R G+V F S+ I ++ + + INRL T T GL+ Y Sbjct: 428 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 487 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 ++ + + +K +I + DG + Sbjct: 488 ETLYADNGGE------NRNPEKILIVVGDGTPT 514 >gi|293415564|ref|ZP_06658207.1| yfbK protein [Escherichia coli B185] gi|291433212|gb|EFF06191.1| yfbK protein [Escherichia coli B185] Length = 575 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 49/336 (14%), Positives = 101/336 (30%), Gaps = 45/336 (13%) Query: 39 SHKFFVKAKLHYILDHS-LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR 97 ++ K L L + A K N G + F +K + Q Sbjct: 67 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 98 ENGFAQDINNIE----------RSTSLSIIID--------DQHKDYNLSAVSRYEMPFIF 139 + N + + I++ + S + M + Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 PW + + I + S+ +++ ++D S SM ++L + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTP 257 + ++ ++ + +VT++ P G I I+ L +T Sbjct: 241 KLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GLE AY + KG + I+ TDG+ + D K + + G Sbjct: 295 GLELAYQQAAKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMIKKQRESGV 345 Query: 318 IVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRK 350 + GV + + + A + + + Sbjct: 346 TLSTFGVGDDNYNEAMMVRIADVGNGNYSYIDTLSE 381 >gi|227552322|ref|ZP_03982371.1| von Willebrand factor, type A [Enterococcus faecium TX1330] gi|227178545|gb|EEI59517.1| von Willebrand factor, type A [Enterococcus faecium TX1330] Length = 1518 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/153 (22%), Positives = 64/153 (41%), Gaps = 25/153 (16%) Query: 161 ISSKSD--IGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 I S+ +D++ VLD S SMN+ G G K ++ E+ + + S P+++ + Sbjct: 390 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 447 Query: 217 RSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFG----------STTKSTPGLEYAY 263 R G+V F S+ I ++ + + INRL T T GL+ Y Sbjct: 448 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 507 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 ++ + + +K +I + DG + Sbjct: 508 ETLYADNGGE------NRNPEKILIVVGDGTPT 534 >gi|260818212|ref|XP_002604277.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] gi|229289603|gb|EEN60288.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] Length = 1119 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 71/201 (35%), Gaps = 28/201 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D +D+ VLD S S++ + + VA S + R G++ +S Sbjct: 748 DESVDLFFVLDGSDSVSLADFDIVKEFVVAVVSGFTI---------SLTDTRVGVLQYSD 798 Query: 226 KIVQTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +N + G T + LE+A +L + Sbjct: 799 GSTLECNLGDHPDWSSFVNSMNTMARQGGGTSTGAALEFA---------RLIAAWRPAPV 849 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DR 341 + +I LTDG++ + ++L V+AIGV + + L+ + DR Sbjct: 850 VPRIMIVLTDGDSEDSVVTPAQALAT------EQVTVFAIGVGSFNRSELLQITNNNQDR 903 Query: 342 FYSVQNSRKLHDAFLRIGKEM 362 + + + + + RI + Sbjct: 904 VFELADFNAIANIMNRIIQAA 924 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 58/174 (33%), Gaps = 28/174 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D+ VLD S S+ G+ + + ++ + N R G++ +SS Sbjct: 937 DVTTDLFFVLDGSGSV------GLYNFNTVKQFVVTLVSAFTIGLNDVNDTRVGVLQYSS 990 Query: 226 KIVQTFPLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 L + +N + G +T++ L+ A A Sbjct: 991 SNTLGCNLG-DHPDLSSFVNAMNAMRYHYGPSTQTGAALQAAGQI----------AAWRP 1039 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + ++ +TDG + + L V+AIGV + L+ Sbjct: 1040 APVPRIMVVVTDGMAHDSVVAPSQGLA------ADQVNVFAIGVGNYVRSELLQ 1087 >gi|170041024|ref|XP_001848278.1| sushi [Culex quinquefasciatus] gi|167864620|gb|EDS28003.1| sushi [Culex quinquefasciatus] Length = 2239 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 79/220 (35%), Gaps = 41/220 (18%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 SV+ + +D++ ++D S S+ + ++++L N R Sbjct: 124 SVEKIKTKNKRVDIVFLIDASSSVGRQ------NFASEIKFVKKLLSDFNV---SYNYTR 174 Query: 218 SGLVTFS---------SKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIF 267 ++TFS +I Q+ + ++ R+ F G T + L+ A Sbjct: 175 VAVITFSSQKKIFRHIDQISQSVEDNDKCLLLNYQVPRIAFSGGGTYTYGALKEAEEIFK 234 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 +A+ D KK I +TDG ++ + K ++Y+IG+Q Sbjct: 235 NAR----------LDSKKIIFLITDGFSNGRDPIPLAGRLK----KDNNVVIYSIGIQ-S 279 Query: 328 AADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 L AS D Y + + D F + ++ + Sbjct: 280 GNYAELHAIASAPEGDHCYLLDS----FDHFETLARKALH 315 >gi|116003875|ref|NP_001070294.1| anthrax toxin receptor 2 [Bos taurus] gi|115305014|gb|AAI23758.1| Anthrax toxin receptor 2 [Bos taurus] gi|296486409|gb|DAA28522.1| anthrax toxin receptor 2 [Bos taurus] Length = 488 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 68/192 (35%), Gaps = 26/192 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S + D+ VLD S S+ +++ + + T V+ +R Sbjct: 32 SAQEQPSCHGAFDLYFVLDKSGSVANNWIEIYNFVQQLTERF------------VSPQMR 79 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + FSS+ PL I E ++ L T GL+ A +I A++ Sbjct: 80 LSFIVFSSQATIILPLTGDRGKISEGLDNLKHVSPVGETYIHEGLKLANEQIEKARDLKT 139 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 II LTDG+ + + ++ GA VY +GV Q + Sbjct: 140 SS---------IIIALTDGKLDG--LVPSYAEKEAKISRSLGARVYCVGVLDFEQAQLER 188 Query: 335 NCASPDRFYSVQ 346 S ++ + V+ Sbjct: 189 IADSKEQVFPVK 200 >gi|17537921|ref|NP_496259.1| C-type LECtin family member (clec-60) [Caenorhabditis elegans] gi|3881710|emb|CAA88985.1| C. elegans protein ZK666.6, confirmed by transcript evidence [Caenorhabditis elegans] Length = 406 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 75/239 (31%), Gaps = 27/239 (11%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSK-----SDIGLDMMMVLDVSLSMNDHFGPG 188 ++ F+ A S+ S + ++ LD++ V+D S+ M G Sbjct: 2 KLTFLIVLIGVYACSAQDSSTTPSPSYTDRRCGEDLGNLWLDVVAVVDNSIGMT---NGG 58 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS------SKIVQTFPLAWGVQHIQE 242 + + S+ I + P R GLVT++ + + Q L ++ Sbjct: 59 LTSIAANIASVVSSGTRIGTNPSEPRTTRLGLVTYNKAAAIQADLNQYQSLDDVYDNVFR 118 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 ++ + + GL A + + K+ Y++ +I S +D Sbjct: 119 ALSSVSTSEESYLANGLARAEDVLEAGKQGY-----NRTHYQRVVIVYASAYKGSGALDP 173 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNCASPDRFYSVQNSRKLHDAFLRI 358 + K G V + + L ASP + N+ I Sbjct: 174 ---VPVAERLKTSGVTVITVAYDQDGDGALLADLAKIASPPYNF--TNTEDNGQVIGEI 227 >gi|73958316|ref|XP_848776.1| PREDICTED: similar to integrin, alpha D precursor [Canis familiaris] Length = 1166 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 84/224 (37%), Gaps = 24/224 (10%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 SH + T + +D++ ++D S S+ + ++ R++ + + Sbjct: 139 SHLQTIWTVPAALPECPSQEMDIVFLIDGSGSI---YESSFKQMKDFVRALMGHFEGTNT 195 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + + + F+ Q +W + + I +L T + G+ ++F Sbjct: 196 LFSLIQYSHLLKIHFTFTQFQN---SWNPLSLVDPIVQLK--GLTYTATGIRKVVEELFH 250 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--- 325 +K AK K +I +TDG+ D E +A+R G I YAIGV Sbjct: 251 SKNGARKSAK------KILIVITDGQ---KYKDPLEYSDVIPQAERAGIIRYAIGVGDAF 301 Query: 326 -AEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +A Q L N S D + V N L ++ +++ Sbjct: 302 WKPSAKQELDNIGSEPAQDHVFRVDNFAALSSIQEQLQEKIFAL 345 >gi|317133199|ref|YP_004092513.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] gi|315471178|gb|ADU27782.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] Length = 535 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 61/186 (32%), Gaps = 18/186 (9%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM D+ G + A I + ++ FS + + Sbjct: 358 LDYSGSMGDNGGE--TGVKKAMDMILNQSTAKLYFLQATPQDKIAVIAFSDSVKAEWYAT 415 Query: 235 WG----VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 G + + + I +L G T + A ++ A + A +I + Sbjct: 416 GGDLSSMSTLDQNIQKLQAGGGTDIYTPVMTALQQLAGADVSQCNPA---------VILM 466 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 TDG++++ Y + K +++I A Q LK + + + Sbjct: 467 TDGQSNTGRTFTNVQSTYKSIGKD--IPIFSIEFGAADPTQ-LKQFGTLSKAALFDGRKD 523 Query: 351 LHDAFL 356 L AF Sbjct: 524 LVAAFK 529 >gi|321460552|gb|EFX71593.1| hypothetical protein DAPPUDRAFT_255504 [Daphnia pulex] Length = 983 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 72/197 (36%), Gaps = 29/197 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++V+D+S SM + D++G + SIR + D+ N G+V FS+ Sbjct: 326 TGTRFVVVMDISGSMKEF-----DRIGKLSESIRSWIKT-----DLRNGSHLGMVQFSAT 375 Query: 227 IVQTFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L + +E I +L + T GLE A + + KG + Sbjct: 376 AEILSELTMISDEKSREEMIAKLPKQLQAATCIGCGLELAVQML--------NENKGTSE 427 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPD 340 I+ +TDG+NS + + +AK V I +A L Sbjct: 428 TGGVIVLVTDGKNSPGYLHISDVQEDILKAK---IRVITIAFGEKADKNLEDLARQTDGK 484 Query: 341 RFY--SVQNSRKLHDAF 355 ++ L +AF Sbjct: 485 SYFVKDEDGGAALQEAF 501 >gi|291225695|ref|XP_002732834.1| PREDICTED: MUscle Positioning family member (mup-4)-like [Saccoglossus kowalevskii] Length = 317 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 82/199 (41%), Gaps = 37/199 (18%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D++ V+D S S G I+E++D+ V+ R +++SS Sbjct: 108 PIDLLFVIDKSGS------IGQSDFNKIIEHIKELVDLFTVEISVDKT-RVSAISYSSS- 159 Query: 228 VQTFPLAWGVQH------------IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLE 274 L + + I+ +I+++ F G +T +T L A ++ F + Sbjct: 160 -NKVDLDFNFRRCLFESSSASKTCIKSEIDKIDFEGGSTHTTKALVKARDEAFKS----F 214 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H ++ + K + +TDG ++ N + N K +YA+GV ++ + L+ Sbjct: 215 HGSRTNSH--KVLFLVTDGRSNG----NGPLVETANSLKNDDVEIYALGVTSDVVEAELR 268 Query: 335 NCAS---PDR--FYSVQNS 348 + S PD +Y N+ Sbjct: 269 SIVSDPIPDHLFYYDTFNA 287 >gi|221127354|ref|XP_002168164.1| PREDICTED: similar to microneme 1, partial [Hydra magnipapillata] Length = 285 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 67/152 (44%), Gaps = 17/152 (11%) Query: 216 VRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAK 270 VR ++ +S + ++ W + EK++ + + T++ L+ A +F + Sbjct: 6 VRFAVIDYSDDAILQISVSDPRFWDHETFGEKVSSIEYSHGKTRTDLALKVARKHVFCNE 65 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEAA 329 L+H K +I LTDG+++ P K + F + K + ++GV + Sbjct: 66 CSLQHNI------PKLLIVLTDGQSTFP----KSTQFEAHLIKVENDLTIISVGVSDQVD 115 Query: 330 DQFLKNCASP-DRFYSVQNSRKLHDAFLRIGK 360 + LK+ A+ D + + + L+D +I K Sbjct: 116 IEELKSLATDRDHVFLLNSYSYLNDKINKILK 147 >gi|163788218|ref|ZP_02182664.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] gi|159876538|gb|EDP70596.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] Length = 688 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 78/214 (36%), Gaps = 22/214 (10%) Query: 148 SSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + + I K + D+ + ++DVS SM+ H +KL + + + +++ + Sbjct: 308 NKTQLVRIGLQGKSYADKDLPASNLTFLIDVSGSMSSH-----NKLPLLKSAFKLLVNQL 362 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + V+ VV +G +V + I +N L G +T G++ AY Sbjct: 363 REKDKVSIVVYAGAAG----VVLEPTSGNNKEKIISALNNLQSGGSTAGGAGIKLAYKLA 418 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +K + +I TDG+ + + + E ++ G + +G Sbjct: 419 EKNFKKKGNNR---------VILATDGDFNVGASSDNDMKTLIEEKRKSGVFLSVLGFGY 469 Query: 327 EA-ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 D L+ A + N ++ F + Sbjct: 470 GNYKDSKLETLADKGNGNHAYIDNMQEAQKVFGK 503 >gi|21228105|ref|NP_634027.1| magnesium-chelatase subunit [Methanosarcina mazei Go1] gi|20906546|gb|AAM31699.1| Magnesium-chelatase subunit [Methanosarcina mazei Go1] Length = 692 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 56/163 (34%), Gaps = 21/163 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + IG ++ V+D S SM ++ + ++ ML + R G Sbjct: 488 KKIREKKIGNLVLFVVDASGSMG-----ARQRMVASKGAVLSML-----MDAYQKRDRVG 537 Query: 220 LVTFS-SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L+ F P ++ Q+ + + G T + GL Y I + Sbjct: 538 LIAFKGDSAELLLPPTSSIELAQKYLQEMPTGGKTPISRGLVKGYEIIKSELRRD----- 592 Query: 279 GHDDYKKYIIFLTDG-ENSSPNIDN--KESLFYCNEAKRRGAI 318 +++ ++DG N S N + E + + K G Sbjct: 593 PDTCP--FMVLISDGRANVSMNGEPPLHEIITIASRLKEEGIQ 633 >gi|307719356|ref|YP_003874888.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] gi|306533081|gb|ADN02615.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] Length = 331 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 76/243 (31%), Gaps = 57/243 (23%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+ + P + D++++ D+S SM P +L VA Sbjct: 59 FLLLGIAGLLVAYAEPFWGMEQETV---KRRNADIVLLFDISRSMLVRDVPP-SRLEVAK 114 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GST 252 ++ I R G+V F K PL + +++ I L Sbjct: 115 EIALMLVSRISGA-------RWGVVAFKGKGELLLPLTPDLLGLEDAIGLLTPVLLRSPG 167 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GL A + ++ +I L+DGE + I L A Sbjct: 168 TDVASGLSRALEAFPQ-----------QSNRQRLVILLSDGEALTGEIGPVLEL-----A 211 Query: 313 KRRGAIVYAIGVQAEAADQ------------------------FLKNCA--SPDRFYSVQ 346 + G V+ +G+ E+ LK A + RF+SV+ Sbjct: 212 RNLGVAVHTVGIGTESGGPVPLEGEDVLKKPSGEPVISRLDASLLKRIAEITGGRFFSVR 271 Query: 347 NSR 349 ++ Sbjct: 272 DAE 274 >gi|296127472|ref|YP_003634724.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296019288|gb|ADG72525.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 338 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 67/217 (30%), Gaps = 32/217 (14%) Query: 121 QHKDYNLSAVSR-YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 K Y + R + + F+ + + P KI + + + + + LD+S Sbjct: 44 NDKAYKRISNLRIFSIIFMILSAAVLIFALMQPKWGIIEQKIKTDNYM---ITIALDLSR 100 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 SM+ +L A I + + ++ LV F+ P ++ Sbjct: 101 SMDADDVWP-SRLERAKLEIEKFVKNTDNLA-------VSLVGFAGTSFIACPFTQDMET 152 Query: 240 IQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 ++ L S T+ L A N KK I+ +TDGE+ Sbjct: 153 FSYILDNLSTKSVTLQGTRIADALVTAKNTFNVDA-----------VSKKSIVLITDGED 201 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 D E K VY IGV Sbjct: 202 HGGYFD-----EVLKELKDMNISVYTIGVGTSQGAAI 233 >gi|270487809|ref|ZP_06204883.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|270336313|gb|EFA47090.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] Length = 207 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 65/196 (33%), Gaps = 16/196 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM + ++ +L ++ P ++TF S Sbjct: 3 RLPVYLLLDTSGSMTGE------PIEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + K+ L+ TT L I + +K KG +I Sbjct: 57 RQAVPLT---DLLNFKLPELVANGTTALGDALSLTAKCIGNEVQKTTADTKGDWRPLVFI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG SP D ++ L A+ G V A + L+ + Sbjct: 114 --MTDG---SPTDDWRKGLSDFKAART-GV-VVACAAGHAVETKVLQEITEIVLQLDTAD 166 Query: 348 SRKLHDAFLRIGKEMV 363 S + F + + Sbjct: 167 SSSIKAFFKWVSASIS 182 >gi|325473817|gb|EGC77005.1| BatB protein [Treponema denticola F0402] Length = 286 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 71/194 (36%), Gaps = 27/194 (13%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F + + PL + V + G+ +M V D+S SM+ ++ V Sbjct: 17 FFSVAWIFLILGLACPLWGSKPVSV---RRRGVSVMFVSDISKSMS-LQDIQPSRIAVQR 72 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GST 252 + ++ +L+ + + GLV + V + PL++ + IN L + Sbjct: 73 QFLKILLEKMHKTSPESA---VGLVITKGEGVLSVPLSFEKNALSSAINALSPLILSSTG 129 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G+ A + + + K I+ TDG +S ++ L + Sbjct: 130 TNLEAGVLRALDSFGENRGN-----------SKIIVLCTDGGETSGSL-----LHAAEKI 173 Query: 313 KRRGAIVYAIGVQA 326 K+ AI+ +G Sbjct: 174 KKTDAILIIVGFGT 187 >gi|116329599|ref|YP_799318.1| BatB [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332488|ref|YP_802205.1| BatB [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122492|gb|ABJ80385.1| BatB [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127355|gb|ABJ77447.1| BatB [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 347 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 56/168 (33%), Gaps = 25/168 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +S G+D++ ++DVSLSM P +L + ML + R G Sbjct: 82 EKKEESFKGVDILFLVDVSLSMQAIDSPP-TRLARFKEVLLRMLPALSGN-------RFG 133 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ F+ P+ V + + L + T A + K Sbjct: 134 MIVFAGSPFLYCPMTSDVSAFSDYVRGLDVDMVGDRGTDLDGAFSKADALLGSEK----- 188 Query: 276 IAKGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRGAIVY 320 + +I +TDGE+ P + + G IVY Sbjct: 189 -----VFRNRILILVTDGEDQNDPDPVSFPASFQVWAAGTEAGGPIVY 231 >gi|332823604|ref|XP_003311225.1| PREDICTED: complement C2 isoform 2 [Pan troglodytes] Length = 620 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 86 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 139 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 140 IFKESASLMVDRIFSFEIN---VSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANY 196 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 197 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 256 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 257 VDHIREILNINQKRNDYLDIYAIGVG 282 >gi|332823602|ref|XP_003311224.1| PREDICTED: complement C2 isoform 1 [Pan troglodytes] Length = 752 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVG 414 >gi|284053937|ref|ZP_06384147.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] Length = 339 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/127 (20%), Positives = 40/127 (31%), Gaps = 14/127 (11%) Query: 216 VRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R +V F + P I++KI+ L T GL+ ++ K+ Sbjct: 4 DRISVVAFDHRAKVLVPNQDIADPDGIKKKIDGLRCSGGTAIDEGLKLGIEELGKGKQDR 63 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 LTDGEN DNK L A + ++G + L Sbjct: 64 ISQG----------FLLTDGENEHG--DNKRCLKLAKLATEYKLTINSLGFGDDWNQDIL 111 Query: 334 KNCASPD 340 + A Sbjct: 112 EKIADAG 118 >gi|260459671|ref|ZP_05807925.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] gi|259034473|gb|EEW35730.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] Length = 718 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 81/215 (37%), Gaps = 20/215 (9%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 SA + + PW ++ + I ++ +++ ++DVS SM++ Sbjct: 312 SASTPFNSTVSVMPTPWNTHTKLMHVAIKGFDVKPTEQPKA-NLVFLIDVSGSMDEP--- 367 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 DKL + + R ++ +K+ ++ V +G + ++ +A I I+ L Sbjct: 368 --DKLPLLKSAFRLLVSKLKADDTISIVTYAGD---AGTVLMPTKIA-EKDKILNAIDNL 421 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G +T G++ AY + K + ++ TDG+ + D+ + Sbjct: 422 QPGGSTAGEAGIKEAYKL------AQQSFIKDGVNR---VMLATDGDFNVGQTDDDDLKR 472 Query: 308 YCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 + ++ G + G D+ ++ A Sbjct: 473 LIEQERKTGVFLSVFGFGRGNLNDEMMQTIAQNGN 507 >gi|225543438|ref|NP_001139375.1| complement C2 isoform 2 preproprotein [Homo sapiens] Length = 620 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 86 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 139 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 140 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANY 196 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 197 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 256 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 257 VDHIREILNINQKRNDYLDIYAIGVG 282 >gi|194374835|dbj|BAG62532.1| unnamed protein product [Homo sapiens] Length = 620 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 86 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 139 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 140 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANY 196 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 197 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 256 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 257 VDHIREILNINQKRNDYLDIYAIGVG 282 >gi|119593589|gb|EAW73183.1| hCG25234 [Homo sapiens] Length = 195 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 57/168 (33%), Gaps = 34/168 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKL-GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +VLD S SMN D+L + + ++ II+ V GLVTF S Sbjct: 43 VCLVLDKSGSMNAE-----DRLFRMNQAAELYLIQIIEKGSLV------GLVTFDSFAKI 91 Query: 230 TFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L T GL+ + I + + Sbjct: 92 QSKLIKIIDDNTYQKITANLPQEADGGTSICRGLKAGFQAIPQSNQSTFGSE-------- 143 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF 332 II LTDGE+ + C E K+ G +++ I + A ++ Sbjct: 144 -IILLTDGEDYQ--------ISLCFGEVKQSGTVIHTIALGPSADEEL 182 >gi|62897125|dbj|BAD96503.1| complement component 2 precursor variant [Homo sapiens] Length = 752 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVG 414 >gi|34628|emb|CAA28169.1| unnamed protein product [Homo sapiens] Length = 752 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVG 414 >gi|7145102|gb|AAA36225.2| MHC serum complement factor B [Homo sapiens] Length = 677 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 80/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A + + +++ + S R GLVT+++ Sbjct: 174 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKLASYGVKP---RYGLVTYATX 230 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + ++ + +++N + S T + L+ Y+ + + Sbjct: 231 XXIWVKVSEAVSSNADWVTKQLNEINYEDHKLKSGTNTEEALQAVYSMMSWPDDVP---P 287 Query: 278 KGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEA 328 +G + + II +TDG ++ ID Y + ++ VY GV Sbjct: 288 EGWNRTRHVIILMTDGLHNMGGDPITVIDXXXXXXYIGKDRKNPREDYLDVYVFGVGPLV 347 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 348 NQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSL 390 >gi|15277207|dbj|BAB63292.1| C2 [Homo sapiens] Length = 577 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVG 414 >gi|14550407|ref|NP_000054.2| complement C2 isoform 1 preproprotein [Homo sapiens] gi|3915642|sp|P06681|CO2_HUMAN RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|298124|gb|AAB97607.1| complement component C2 [Homo sapiens] gi|2347131|gb|AAB67975.1| complement component C2 [Homo sapiens] gi|28175369|gb|AAH43484.1| Complement component 2 [Homo sapiens] gi|33346923|gb|AAQ15273.1| complement component 2 [Homo sapiens] gi|55961814|emb|CAI17451.1| complement component 2 [Homo sapiens] gi|57209923|emb|CAI41858.1| complement component 2 [Homo sapiens] gi|119623954|gb|EAX03549.1| complement component 2, isoform CRA_b [Homo sapiens] gi|123857990|emb|CAM25860.1| complement component 2 [Homo sapiens] gi|168983782|emb|CAQ06833.1| complement component 2 [Homo sapiens] gi|168984416|emb|CAQ09272.1| complement component 2 [Homo sapiens] gi|168985077|emb|CAQ07481.1| complement component 2 [Homo sapiens] gi|168985955|emb|CAQ07111.1| complement component 2 [Homo sapiens] gi|189069137|dbj|BAG35475.1| unnamed protein product [Homo sapiens] Length = 752 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVG 414 >gi|38257345|sp|Q8SQ74|CO2_PANTR RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|19110330|gb|AAL82821.1| complement C2 [Pan troglodytes] Length = 752 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVG 414 >gi|257895102|ref|ZP_05674755.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] gi|257831667|gb|EEV58088.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] Length = 1341 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/153 (22%), Positives = 64/153 (41%), Gaps = 25/153 (16%) Query: 161 ISSKSD--IGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 I S+ +D++ VLD S SMN+ G G K ++ E+ + + S P+++ + Sbjct: 213 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 270 Query: 217 RSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFG----------STTKSTPGLEYAY 263 R G+V F S+ I ++ + + INRL T T GL+ Y Sbjct: 271 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 330 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 ++ + + +K +I + DG + Sbjct: 331 ETLYADNGGE------NRNPEKILIVVGDGTPT 357 >gi|257879128|ref|ZP_05658781.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257813356|gb|EEV42114.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1258 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 68/192 (35%), Gaps = 24/192 (12%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 D + Y V +P + +++ + + K +D++ VLD S Sbjct: 88 DYNGAYIKKWV-EPVLPSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSA 146 Query: 180 SMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--------SSKIVQ 229 SM++ K ++ EM + D + +R G+V F + + + Sbjct: 147 SMSELTAGTNSQTKNAALIEAVNEM--SKDLLSDPSLDIRIGMVNFYHNSTAINNHEQIS 204 Query: 230 T--FPLAWGVQHIQ-EKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + FPL + + + L T T GL+ Y ++ + + + Sbjct: 205 SDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGGE------NRNPE 258 Query: 285 KYIIFLTDGENS 296 K +I + DG + Sbjct: 259 KILIVVGDGTPT 270 >gi|188990634|ref|YP_001902644.1| putative secreted protein [Xanthomonas campestris pv. campestris str. B100] gi|167732394|emb|CAP50588.1| putative secreted protein [Xanthomonas campestris pv. campestris] Length = 597 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/238 (18%), Positives = 85/238 (35%), Gaps = 34/238 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW ++ + + + + + +++ ++DVS SM DKL + S++ Sbjct: 202 TPWNTDTLLLRIGVAGR-DVPTAALPPANLVFLVDVSGSMG-----APDKLPLLQSSLKL 255 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGL 259 + + + R LVT++ P G Q I E I+ L G T G+ Sbjct: 256 L------VRQLRKQDRITLVTYAGSTAVVLPPTSGAQQTRIVEAIDSLQSGGGTAGASGI 309 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E AY A +G + I+ TDG+ + D + E +R G + Sbjct: 310 ELAYKAAQQA------YLRGGINR---ILLATDGDFNVGVTDFDQLKGMVAEKRRSGVAL 360 Query: 320 YAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 +G D ++ A + + ++ +L I +++ Q Sbjct: 361 STLGFGTGNYNDTLMEQLADAGDGAYAYIDSALEARKVLTHELGSTLATIARDVKIQV 418 >gi|157817857|ref|NP_001102478.1| procollagen, type VI, alpha 3 [Rattus norvegicus] gi|149037629|gb|EDL92060.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_a [Rattus norvegicus] Length = 2207 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 22/201 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +T + ++ D++ +LD S ++ + P + +++ S+ Sbjct: 625 RTLTGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVG 672 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ +R GLV FS V F L + + RL T G +Y Sbjct: 673 SDNIRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGTGLNAGSALSYVHANHFT 732 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + H + ++ + + P+ D L N R G + + +G Sbjct: 733 EAGGSRIREH-VPQLLLLLM-----AGPSEDVY--LQAANALVRSGVLTFCVGTNQADKA 784 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + +P Y + + L Sbjct: 785 ELERIAFNPSLVYLMDDFSAL 805 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 23/199 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + K+ D++ ++D S S G D+ + + D+++S+ +N Sbjct: 28 QSDVKNGAAADILFLVDSSWS------AGKDRFLLVQEFLS---DVVESLSVGDNDFHFA 78 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 LV + F L Q + I + + + T I + + Sbjct: 79 LVRLNGNPHTEFLLNAYHSKQEVLSHILNMSYIGESNQTG---KGLEYIIHSHLTEASGS 135 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + D + I+ LTDG++ + E K V+A+GV+ +A ++ L+ A Sbjct: 136 RAADGVPQVIVVLTDGQS-----EEDGFALPSAELKSADVNVFAVGVE-DADERTLREIA 189 Query: 338 S---PDRFYSVQNSRKLHD 353 S ++++N LHD Sbjct: 190 SEPLSMHVFNLENVTSLHD 208 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 126/359 (35%), Gaps = 57/359 (15%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K +S+L ++ + + +F +N Sbjct: 1471 VRIGVVQFSNDVFPEFYLKTHKSQ---NSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1524 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL---------SIIIDDQHKDYNLSA 129 ++ + + + + +++ R + I + + + Sbjct: 1525 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTI 1584 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-----------SDIGLDMMMVLDVS 178 + + F F N +L + + D++ +LD Sbjct: 1585 TNDPRLVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEKKKADIVFLLD-- 1642 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--G 236 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFST 1695 Query: 237 VQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + I + IN++I+ + G+E+ + E ++ + + +T G+ Sbjct: 1696 KRQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDERVPQIAFVITGGK 1751 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1752 SVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 67/169 (39%), Gaps = 16/169 (9%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ ++ G V + +L+ + N VR G+V +S + F L Sbjct: 242 DIIFLIDGSQNTGKANFDVIRDFLVNVLERLSV---GNQQVRVGVVQYSDEPRTMFSLDS 298 Query: 236 --GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + + RL F + G A + + + ++ + + ++ ++ G Sbjct: 299 YPSKAAVLDAVKRLSFAGGELANIG--QALDFVVENHFTRTGGSRVEEGVPQVLVLISAG 356 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDR 341 +S D +L ++G V++ G+ A+AA + L++ A+ D Sbjct: 357 PSSDEIRDAVVAL-------KQG-SVFSFGLGAQAASRVELQHIATDDN 397 >gi|149037631|gb|EDL92062.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_c [Rattus norvegicus] Length = 2862 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 22/201 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +T + ++ D++ +LD S ++ + P + +++ S+ Sbjct: 625 RTLTGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVG 672 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ +R GLV FS V F L + + RL T G +Y Sbjct: 673 SDNIRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGTGLNAGSALSYVHANHFT 732 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + H + ++ + + P+ D L N R G + + +G Sbjct: 733 EAGGSRIREH-VPQLLLLLM-----AGPSEDVY--LQAANALVRSGVLTFCVGTNQADKA 784 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + +P Y + + L Sbjct: 785 ELERIAFNPSLVYLMDDFSAL 805 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 23/199 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + K+ D++ ++D S S G D+ + + D+++S+ +N Sbjct: 28 QSDVKNGAAADILFLVDSSWS------AGKDRFLLVQEFLS---DVVESLSVGDNDFHFA 78 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 LV + F L Q + I + + + T I + + Sbjct: 79 LVRLNGNPHTEFLLNAYHSKQEVLSHILNMSYIGESNQTG---KGLEYIIHSHLTEASGS 135 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + D + I+ LTDG++ + E K V+A+GV+ +A ++ L+ A Sbjct: 136 RAADGVPQVIVVLTDGQS-----EEDGFALPSAELKSADVNVFAVGVE-DADERTLREIA 189 Query: 338 S---PDRFYSVQNSRKLHD 353 S ++++N LHD Sbjct: 190 SEPLSMHVFNLENVTSLHD 208 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 126/359 (35%), Gaps = 57/359 (15%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K +S+L ++ + + +F +N Sbjct: 1471 VRIGVVQFSNDVFPEFYLKTHKSQ---NSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1524 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL---------SIIIDDQHKDYNLSA 129 ++ + + + + +++ R + I + + + Sbjct: 1525 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTI 1584 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-----------SDIGLDMMMVLDVS 178 + + F F N +L + + D++ +LD Sbjct: 1585 TNDPRLVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEKKKADIVFLLD-- 1642 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--G 236 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFST 1695 Query: 237 VQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + I + IN++I+ + G+E+ + E ++ + + +T G+ Sbjct: 1696 KRQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDERVPQIAFVITGGK 1751 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1752 SVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 67/169 (39%), Gaps = 16/169 (9%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ ++ G V + +L+ + N VR G+V +S + F L Sbjct: 242 DIIFLIDGSQNTGKANFDVIRDFLVNVLERLSV---GNQQVRVGVVQYSDEPRTMFSLDS 298 Query: 236 --GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + + RL F + G A + + + ++ + + ++ ++ G Sbjct: 299 YPSKAAVLDAVKRLSFAGGELANIG--QALDFVVENHFTRTGGSRVEEGVPQVLVLISAG 356 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDR 341 +S D +L ++G V++ G+ A+AA + L++ A+ D Sbjct: 357 PSSDEIRDAVVAL-------KQG-SVFSFGLGAQAASRVELQHIATDDN 397 >gi|149037630|gb|EDL92061.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_b [Rattus norvegicus] Length = 2867 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 22/201 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +T + ++ D++ +LD S ++ + P + +++ S+ Sbjct: 625 RTLTGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVG 672 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ +R GLV FS V F L + + RL T G +Y Sbjct: 673 SDNIRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGTGLNAGSALSYVHANHFT 732 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + H + ++ + + P+ D L N R G + + +G Sbjct: 733 EAGGSRIREH-VPQLLLLLM-----AGPSEDVY--LQAANALVRSGVLTFCVGTNQADKA 784 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + +P Y + + L Sbjct: 785 ELERIAFNPSLVYLMDDFSAL 805 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 23/199 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + K+ D++ ++D S S G D+ + + D+++S+ +N Sbjct: 28 QSDVKNGAAADILFLVDSSWS------AGKDRFLLVQEFLS---DVVESLSVGDNDFHFA 78 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 LV + F L Q + I + + + T I + + Sbjct: 79 LVRLNGNPHTEFLLNAYHSKQEVLSHILNMSYIGESNQTG---KGLEYIIHSHLTEASGS 135 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + D + I+ LTDG++ + E K V+A+GV+ +A ++ L+ A Sbjct: 136 RAADGVPQVIVVLTDGQS-----EEDGFALPSAELKSADVNVFAVGVE-DADERTLREIA 189 Query: 338 S---PDRFYSVQNSRKLHD 353 S ++++N LHD Sbjct: 190 SEPLSMHVFNLENVTSLHD 208 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 126/359 (35%), Gaps = 57/359 (15%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K +S+L ++ + + +F +N Sbjct: 1471 VRIGVVQFSNDVFPEFYLKTHKSQ---NSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1524 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL---------SIIIDDQHKDYNLSA 129 ++ + + + + +++ R + I + + + Sbjct: 1525 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTI 1584 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-----------SDIGLDMMMVLDVS 178 + + F F N +L + + D++ +LD Sbjct: 1585 TNDPRLVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEKKKADIVFLLD-- 1642 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--G 236 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFST 1695 Query: 237 VQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + I + IN++I+ + G+E+ + E ++ + + +T G+ Sbjct: 1696 KRQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDERVPQIAFVITGGK 1751 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1752 SVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 67/169 (39%), Gaps = 16/169 (9%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ ++ G V + +L+ + N VR G+V +S + F L Sbjct: 242 DIIFLIDGSQNTGKANFDVIRDFLVNVLERLSV---GNQQVRVGVVQYSDEPRTMFSLDS 298 Query: 236 --GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + + RL F + G A + + + ++ + + ++ ++ G Sbjct: 299 YPSKAAVLDAVKRLSFAGGELANIG--QALDFVVENHFTRTGGSRVEEGVPQVLVLISAG 356 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDR 341 +S D +L ++G V++ G+ A+AA + L++ A+ D Sbjct: 357 PSSDEIRDAVVAL-------KQG-SVFSFGLGAQAASRVELQHIATDDN 397 >gi|149037632|gb|EDL92063.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_d [Rattus norvegicus] gi|149037633|gb|EDL92064.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_d [Rattus norvegicus] Length = 2140 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 22/201 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +T + ++ D++ +LD S ++ + P + +++ S+ Sbjct: 625 RTLTGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVG 672 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ +R GLV FS V F L + + RL T G +Y Sbjct: 673 SDNIRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGTGLNAGSALSYVHANHFT 732 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E + H + ++ + + P+ D L N R G + + +G Sbjct: 733 EAGGSRIREH-VPQLLLLLM-----AGPSEDVY--LQAANALVRSGVLTFCVGTNQADKA 784 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + +P Y + + L Sbjct: 785 ELERIAFNPSLVYLMDDFSAL 805 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 76/199 (38%), Gaps = 23/199 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + K+ D++ ++D S S G D+ + + D+++S+ +N Sbjct: 28 QSDVKNGAAADILFLVDSSWS------AGKDRFLLVQEFLS---DVVESLSVGDNDFHFA 78 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 LV + F L Q + I + + + T I + + Sbjct: 79 LVRLNGNPHTEFLLNAYHSKQEVLSHILNMSYIGESNQTG---KGLEYIIHSHLTEASGS 135 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + D + I+ LTDG++ + E K V+A+GV+ +A ++ L+ A Sbjct: 136 RAADGVPQVIVVLTDGQS-----EEDGFALPSAELKSADVNVFAVGVE-DADERTLREIA 189 Query: 338 S---PDRFYSVQNSRKLHD 353 S ++++N LHD Sbjct: 190 SEPLSMHVFNLENVTSLHD 208 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 126/359 (35%), Gaps = 57/359 (15%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K +S+L ++ + + +F +N Sbjct: 1471 VRIGVVQFSNDVFPEFYLKTHKSQ---NSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1524 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL---------SIIIDDQHKDYNLSA 129 ++ + + + + +++ R + I + + + Sbjct: 1525 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTI 1584 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-----------SDIGLDMMMVLDVS 178 + + F F N +L + + D++ +LD Sbjct: 1585 TNDPRLVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEKKKADIVFLLD-- 1642 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--G 236 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFST 1695 Query: 237 VQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + I + IN++I+ + G+E+ + E ++ + + +T G+ Sbjct: 1696 KRQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDERVPQIAFVITGGK 1751 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1752 SVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 67/169 (39%), Gaps = 16/169 (9%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+ ++ G V + +L+ + N VR G+V +S + F L Sbjct: 242 DIIFLIDGSQNTGKANFDVIRDFLVNVLERLSV---GNQQVRVGVVQYSDEPRTMFSLDS 298 Query: 236 --GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + + RL F + G A + + + ++ + + ++ ++ G Sbjct: 299 YPSKAAVLDAVKRLSFAGGELANIG--QALDFVVENHFTRTGGSRVEEGVPQVLVLISAG 356 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDR 341 +S D +L ++G V++ G+ A+AA + L++ A+ D Sbjct: 357 PSSDEIRDAVVAL-------KQG-SVFSFGLGAQAASRVELQHIATDDN 397 >gi|332206577|ref|XP_003252372.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 [Nomascus leucogenys] Length = 1107 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 257 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 304 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 305 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 359 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 360 --KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 411 Query: 340 DRFYSV 345 +Y + Sbjct: 412 GYYYEI 417 >gi|156382085|ref|XP_001632385.1| predicted protein [Nematostella vectensis] gi|156219440|gb|EDO40322.1| predicted protein [Nematostella vectensis] Length = 1221 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 75/201 (37%), Gaps = 19/201 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--SSK 226 +D+ + +D S M+D L + +++ D N +R G+ T+ ++ Sbjct: 69 MDVALAVDTSDGMSDA------DLAKTKSLVTTLVNQ---FSDSENSIRFGITTYGQEAR 119 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + F + ++ I + + + + + ++ Sbjct: 120 TLANFKQNFDEAKLRTAIKGIQKTGV----QARRHDLAAMAVKNDLFSLEGGMRQGHPRF 175 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-YSV 345 +IF + G N+ D K++ G + AIGV + A L AS +RF +S Sbjct: 176 VIFFSAGANTGTADDLKKASKP---LTDLGVNMIAIGVNSNADQASLAELASENRFIFSA 232 Query: 346 QNSRKLHDAFLRIGKEMVKQR 366 + +L + I +M +++ Sbjct: 233 NSPAELDALWPSIEAQMCQEK 253 >gi|297675239|ref|XP_002815595.1| PREDICTED: integrin alpha-1-like [Pongo abelii] Length = 1179 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 157 VVNSIAPVRECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ 208 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + +++ T + G++ A + F Sbjct: 209 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEA 265 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + ++I + Sbjct: 266 RGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEEENIQRFSIAILGSYNR 315 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 316 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 362 >gi|283779907|ref|YP_003370662.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438360|gb|ADB16802.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1040 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/272 (16%), Positives = 85/272 (31%), Gaps = 30/272 (11%) Query: 92 FRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA 151 N R +G D + I + ++ + + P F W Sbjct: 378 MANVPRASGGETDNKEATNFSDEQISMLVRNTEQFGCGLVMLGGPNSFGAGGWANTELEK 437 Query: 152 PLLITSSVKISSKSDIG--LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + +K +G + MM +++ G + L V S Sbjct: 438 AMPVDFQIKNEKVKAVGALVMMMHASELAQGNYWQKVIGQEALKVLGPSDYCGCVHWDDF 497 Query: 210 PDVNNVV-RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 +N + R G +I G ++R+ G + P + A + Sbjct: 498 TGRDNWLWRDGAGKGLVRIGGQQKSMLGR------LDRMAPGDMPQFEPAMTMALKDL-- 549 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-- 326 K + K++I ++DG+ S P+ N+ K+ G + + V Sbjct: 550 ---------KPNPASVKHMIIISDGDPSPPSG------TILNQYKQAGIKITTVAVGTHG 594 Query: 327 EAADQFLKNC--ASPDRFYSVQNSRKLHDAFL 356 A L+N A+ ++Y N + L F Sbjct: 595 PAGSTPLQNIANATGGKYYVATNPKALPRIFQ 626 >gi|183985704|gb|AAI66222.1| LOC100158554 protein [Xenopus (Silurana) tropicalis] Length = 899 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 73/204 (35%), Gaps = 38/204 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD+S SM + + +L A+ + + V G+VTFS+ Sbjct: 309 VSLVLDISGSMTN--ANRITRLYQASE--------VYIMQIVEQGAYVGIVTFSNVAEIK 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + ++ K+ + T G++ + + Sbjct: 359 SQLVKITDTFQRESLKLKL-PTVATGGTNICAGVQQGLQVNRNLDQSTHGTE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGV--QAEAADQFLKNCASPDRF 342 I+ LTDGE+S + C + + GAI++ I + A+ + L + + Sbjct: 410 -IVLLTDGEDS--------GISSCFPDITKSGAIIHTIALGNNADPGLEKLADLTGGLKL 460 Query: 343 YSVQ--NSRKLHDAFLRIGKEMVK 364 Y+ + L D+F I Sbjct: 461 YASDKVDVNGLIDSFSGIVSNTGN 484 >gi|149176271|ref|ZP_01854886.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Planctomyces maris DSM 8797] gi|148844873|gb|EDL59221.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Planctomyces maris DSM 8797] Length = 715 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 79/263 (30%), Gaps = 53/263 (20%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYE-MPFIFCTFPWCANSSHAPLLITSSVKISSKS 165 ++ S ++ + H + ++ S E +P F P H Sbjct: 474 DLSGSVQVTARLASDHTELTVNVESLPEDIPARFMEPPPKPVIPHVT------------- 520 Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + + D+S SM+ L + ++ L+ + G++ + Sbjct: 521 -----VYLAFDLSGSMSGE------PLAESQKAALAFLEQVDL-----THCSMGVIAVAD 564 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFG--STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I++ + L G S + A K+ + Sbjct: 565 STQTVLDACQNASKIEKAVKSLSIGMVGCGNSAQPFDTAMKKLKK------------VEG 612 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDR 341 +++I L DG D ++ V AIG +A FL++ AS Sbjct: 613 PRFVITLADGV----WADQPHAVNRAKSLHSAEIDVIAIGFG-DADKNFLRDIASCDEGS 667 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 F+ + L F I + + K Sbjct: 668 FF--TSLSGLSATFSSIAQVITK 688 >gi|146304836|ref|YP_001192152.1| protoporphyrin IX magnesium-chelatase [Metallosphaera sedula DSM 5348] gi|145703086|gb|ABP96228.1| protoporphyrin IX magnesium-chelatase [Metallosphaera sedula DSM 5348] Length = 600 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 58/139 (41%), Gaps = 18/139 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V S ++ + ++++LD S SM+ ++ VA +RE+L + Sbjct: 419 VMKSLETQGAIPILLLLDSSRSMDFSR-----RILVAKAILRELLQKAYQVRSK-----V 468 Query: 219 GLVTFS-SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 GLVTFS S+ PL ++ ++E +N + T + L A + +E+ Sbjct: 469 GLVTFSGSEARYDVPLTRNLRKVEEFVNGVRPAGKTPMSMALYLALQIV--NRERRSRRK 526 Query: 278 KGHDDYKKYIIFLTDGENS 296 + ++DG+ + Sbjct: 527 LNPLVF-----LISDGKAN 540 >gi|94499792|ref|ZP_01306328.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] gi|94427993|gb|EAT12967.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] Length = 731 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 68/210 (32%), Gaps = 36/210 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G D ++LD+S SM F ++ + K + N R +V F+ Sbjct: 348 QRGTDWTLLLDISGSMQGKFQTLIEGVK-------------KGLKRFNPQDRVRVVLFND 394 Query: 226 KIVQTF----PLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P + I K++ ++ T G+ +A + + + Sbjct: 395 YASNLTGGFLPATQKNIAEIIRKLDLVLPNGGTHLMDGVRFALSGLDADRTSA------- 447 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 I +TDG + ++ + + K++ V+ + A LK Sbjct: 448 ------IWLVTDGVTNVGETKQRKFV---DLLKQKDIRVFTFIMGNGANRPLLKAITKAS 498 Query: 341 RFYSVQ--NSRKLHDAFLRIGKEMVKQRIL 368 +++ NS + + ++ + + Sbjct: 499 NGFAINVSNSDDIIGQLEKAASKVTHEALR 528 >gi|332970076|gb|EGK09074.1| D-amino acid dehydrogenase large subunit [Desmospora sp. 8437] Length = 454 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 53/325 (16%), Positives = 105/325 (32%), Gaps = 27/325 (8%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 + E + K+K+ + I+ + + + + +N R + D Sbjct: 32 TACSSEAVDEKPKKKSFQAATEIEGMLREGPGKFAGDRYDEEKVNKALRKLPDDLTADQA 91 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT----SSVKISSKSDIGLDMMMVLDV 177 + Y P + + + L T K + + + ++LD Sbjct: 92 YTRLIQLLAEDYG-PALREIEEFDPSLQIGELKFTDKEDEDQKQGKEQAKQVHVEILLDA 150 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 S SM G+ K+ +A +I + + ++ V G + K Q A Sbjct: 151 SGSMAGRIRDGV-KMDLAKEAIENFVSDMPENAKISLRVY-GHKGSNRKQDQKESCA-ST 207 Query: 238 QHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG--HDDYKKYIIFLTD 292 + + + FG S + + A E+ K +D + I ++D Sbjct: 208 EVVYPHGSYVKG-KFGKALNSFEP--TGWTPLAAAMEEARQDLKPYAGEDAENIIYVVSD 264 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNC--ASPDRFYSVQNSR 349 G + K + N + + IG +A Q LK A + + Sbjct: 265 GIETCGGDPVKAAKSLYNSDIQAVVNI--IGFDVDDAGQQALKKVAEAGGGEYKTANTRE 322 Query: 350 KLHDAF----LRIGKEMVKQRILYN 370 +L+ +F I KE+ K + YN Sbjct: 323 ELNQSFGIDWDEIEKEVSK--VWYN 345 >gi|227833260|ref|YP_002834967.1| hypothetical protein cauri_1436 [Corynebacterium aurimucosum ATCC 700975] gi|262184244|ref|ZP_06043665.1| hypothetical protein CaurA7_09649 [Corynebacterium aurimucosum ATCC 700975] gi|227454276|gb|ACP33029.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 688 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 68/227 (29%), Gaps = 50/227 (22%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 F P S P + ++ S G M+VLD S SMN G +L A + Sbjct: 38 FAQNPTGVTPSAEPSSEAAPNGGAANSQSG-ATMLVLDSSGSMNVQDAGGQTRLDAAKDA 96 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF---------------PLAWGVQHIQEK 243 ++ + + GLVT+ + + P Sbjct: 97 TKKFVSELGGTIP------LGLVTYGGTVDEAPENQEAGCQDIHVVSGPKEDVGDSFTGP 150 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 I+ L T L+ A ++ I+ ++DG ID Sbjct: 151 IDALQAKGYTPIGDSLKKAAEELGGQHGT--------------IVLVSDG------IDTC 190 Query: 304 ESLFYCNEAK---RRGA--IVYAIGVQAEAADQFLKNC---ASPDRF 342 C AK +G ++ IG + + +C A+ + Sbjct: 191 APPPVCEVAKELHEQGIDLVINTIGFNVDEEARKELSCIAEAAGGEY 237 >gi|326674791|ref|XP_001922046.3| PREDICTED: collagen alpha-1(XIV) chain [Danio rerio] Length = 1852 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 70/198 (35%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ D D R + + I + + FS Sbjct: 1020 DLTFLVDGSWSIGD------DNFQKIIRFLYSTTGALDVIGP--EGTQVAIAQFSDDART 1071 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + + R+ + G TK+ +++ IF A K Sbjct: 1072 EFKLNSYSDKEALLDAVQRISYKGGNTKTGRAIKHVKEAIFSE------DAGVRRGIPKV 1125 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 ++ LTDG + + E + G I++AIG + + + P + Sbjct: 1126 LVVLTDGRSQD------DVNKISKEMQMEGYIIFAIGFADADYGELVNIASKPSERHVFF 1179 Query: 345 VQNSRKLHDAFLRIGKEM 362 V + DAF +I +++ Sbjct: 1180 VDD----LDAFKKIEEQL 1193 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 63/187 (33%), Gaps = 34/187 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D S S+ + + +++ D R GL +S Sbjct: 158 DIVILVDGSWSIGRI------NFRLVRMFLENLVNAFDVGIDK---TRIGLAQYSGDPR- 207 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + W + K + G T + L Y F E Sbjct: 208 ---IEWHLNGFSTKEAVIDAVKNLPYKGGNTLTGLALTYVLENSFKP-ESGARDNIP--- 260 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 K I +TDG++ I ++L + G ++AIGV+ ++ + P+ Sbjct: 261 --KIGILITDGKSQDDVISPAQTL------RSSGVELFAIGVKNADENELKAIASEPEDT 312 Query: 341 RFYSVQN 347 Y+V + Sbjct: 313 HVYNVAD 319 >gi|315608293|ref|ZP_07883283.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] gi|315250074|gb|EFU30073.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] Length = 342 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 57/180 (31%), Gaps = 32/180 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++ G++ ++ LD+S SM D +DK + S+ + Sbjct: 80 AEVQRDKRNGIEAIICLDISNSMLAQDVAPSRLDKSKLLVESLVDRFTN----------D 129 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + GL+ F+ P+ + + LI T + + A + Sbjct: 130 KIGLIVFAGDAYVQLPITSDYVSAKMFLQNIDPSLIQTQGTDIAQAINLGLHSFTQADKI 189 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + II +TDGE+ + +++G V+ +GV Sbjct: 190 G-----------RAIIVITDGEDHEGGAVEAAAEA-----RKKGVNVFILGVGDTKGAPI 233 >gi|293571291|ref|ZP_06682325.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608698|gb|EFF37986.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1364 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/153 (22%), Positives = 64/153 (41%), Gaps = 25/153 (16%) Query: 161 ISSKSD--IGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 I S+ +D++ VLD S SMN+ G G K ++ E+ + + S P+++ + Sbjct: 232 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 289 Query: 217 RSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFG----------STTKSTPGLEYAY 263 R G+V F S+ I ++ + + INRL T T GL+ Y Sbjct: 290 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 349 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 ++ + + +K +I + DG + Sbjct: 350 ETLYADNGGE------NRNPEKILIVVGDGTPT 376 >gi|83717579|ref|YP_440458.1| hypothetical protein BTH_II2270 [Burkholderia thailandensis E264] gi|167579118|ref|ZP_02371992.1| hypothetical protein BthaT_13315 [Burkholderia thailandensis TXDOH] gi|257141105|ref|ZP_05589367.1| hypothetical protein BthaA_18159 [Burkholderia thailandensis E264] gi|83651404|gb|ABC35468.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 418 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 43/126 (34%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L D L A + N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLT--GAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS-IIIDDQHKDYNLSA 129 + + + F +L+ N +++ I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVKCM 134 Query: 130 VSRYEM 135 S+ + Sbjct: 135 TSQTGI 140 >gi|299138149|ref|ZP_07031329.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600079|gb|EFI56237.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 349 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 76/222 (34%), Gaps = 31/222 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + ++ L + ++LD S S + D R + E+L L+ Sbjct: 96 TQEKNLPLTIGILLDTSGSQKNVLPLEQD---SGARFLSEVLKPKDEAF---------LI 143 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAY---NKIFDAKE 271 +F + ++ I++ G ++ ++DA Sbjct: 144 SFDVNVDLLSDYTNSAHELKRAIDKASINAASSSAGVPGIGGGPFPTSHPRGTLLYDAVY 203 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV-------YAIGV 324 H +K ++ LTDG + K + +A ++ ++ G Sbjct: 204 LAAHDKLQSQTGRKILVLLTDGGDQGSQETLKSATEAAQKANAILYVILIADRANFSYGF 263 Query: 325 QAEAADQFLKNCASPDRFYSV-QNSRKLHDAFLRIGKEMVKQ 365 A+ + L + + R +V N +KL +AF +I E+ Q Sbjct: 264 NADGQMEQLAH-ETGGRVINVGNNGKKLEEAFDQIQDELRTQ 304 >gi|301109920|ref|XP_002904040.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262096166|gb|EEY54218.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 2146 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 66/208 (31%), Gaps = 28/208 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + S + + VLD S SMN P D + + I D + +VT Sbjct: 1896 AASGGKMHHVFVLDCSGSMNG--QPWNDLMAAWKEYVYNR------IADGATLDLVSVVT 1947 Query: 223 FSSK---IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F + + + + + G T GL A + Sbjct: 1948 FDNSAQIVYEARSITTVTNARIQY-----RGGGTNYAAGLRSANEVLS---------RVN 1993 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 D +K I+F +DG P + + +R G +A+G + + L+ A Sbjct: 1994 FDMFKPAIVFFSDGHPCDPLQGEELATHIRGCYERNGLQAFAVGFGSINLN-MLERVAEK 2052 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 ++ V +L F I + + Sbjct: 2053 LGGTYHHVLTGNELKATFFSISASLSTR 2080 >gi|89053332|ref|YP_508783.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88862881|gb|ABD53758.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 686 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 66/184 (35%), Gaps = 19/184 (10%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW ++ + I + + + L+++ ++D S SMND KL + +S R Sbjct: 301 TPWNPDTQLVHIGIQGDLPV-VEDRPPLNLVFLIDTSGSMNDPA-----KLPLLIQSFRL 354 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 ML+ + +V V +G + + I + L G +T GLE Sbjct: 355 MLNRLSPEDEVAIVTYAGSAGVALEPTAAS----DTATINAALTTLQAGGSTNGVGGLEE 410 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 AY + E ++ TDG+ + D Y E + G + Sbjct: 411 AYRLAGEMMVDGEVSR---------VLLATDGDFNVGLSDAGALEDYIAEQRDTGIYLSV 461 Query: 322 IGVQ 325 +G Sbjct: 462 LGFG 465 >gi|257892717|ref|ZP_05672370.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257829096|gb|EEV55703.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1347 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/153 (22%), Positives = 64/153 (41%), Gaps = 25/153 (16%) Query: 161 ISSKSD--IGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 I S+ +D++ VLD S SMN+ G G K ++ E+ + + S P+++ + Sbjct: 213 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 270 Query: 217 RSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFG----------STTKSTPGLEYAY 263 R G+V F S+ I ++ + + INRL T T GL+ Y Sbjct: 271 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 330 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 ++ + + +K +I + DG + Sbjct: 331 ETLYADNGGE------NRNPEKILIVVGDGTPT 357 >gi|212635869|ref|YP_002312394.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] gi|212557353|gb|ACJ29807.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] Length = 710 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 70/180 (38%), Gaps = 33/180 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 ++++V+D S SM+ + A SI L + + N ++ F+S Sbjct: 347 APRELILVIDTSGSMSGE------AIEQAKASIIYALAGLSAQDSFN------ILQFNSN 394 Query: 227 IV--QTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + PL A + Q + RL T+ + L+ A ++ +E+L Sbjct: 395 VYALSDTPLNASAKNIGRAQAYVQRLQANGGTEMSLALDKALSQQDANRERL-------- 446 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + ++F+TDG + + + N+ ++ ++ IG+ F++ A R Sbjct: 447 ---RQVLFITDG---AVGNEPQLFTQIRNQLQQS--RLFTIGIGDAPNAHFMQRAAELGR 498 >gi|189465624|ref|ZP_03014409.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224539998|ref|ZP_03680537.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] gi|189437898|gb|EDV06883.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224518388|gb|EEF87493.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] Length = 348 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 64/196 (32%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F A P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 LVFAAIGLFAVLLARPQFGS---KLETVKRQGVEVMIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R + +++D +++ + G++ F+ P+ + + + +K Sbjct: 117 RLVAQLVDKMEN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + +I +TDGEN +G Sbjct: 170 TAIGAAIN-------LATRSFTPQEGVGRAVIVITDGENHEGGAVEAAKAAA-----EKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVSVLGVGMPDGAPI 233 >gi|218198427|gb|EEC80854.1| hypothetical protein OsI_23472 [Oryza sativa Indica Group] Length = 604 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 80/209 (38%), Gaps = 29/209 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMND------HFGPGMDKLGVATRSIREMLDII 206 L + + + +D++ VLDVS SMND +L V S++ ++ + Sbjct: 30 LRVEAPPAADLNGHVPIDVVAVLDVSGSMNDPVAASPESNLQATRLDVLKASMKFIIRKL 89 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFP------LAWGVQHIQEKINRLIFGSTTKSTPGLE 260 + R +V F+ V+ + G +KI+RL + S LE Sbjct: 90 D------DGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGSGSALMLE 143 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 + +A + L+ + +I+ LTDG++++ +++ + V+ Sbjct: 144 -----LQEAVKILDERQGNSRNRVGFILLLTDGDDTTGFRWSRDVIHGA----VGKYPVH 194 Query: 321 AIGVQAEAADQFLKNCA--SPDRFYSVQN 347 + A + L + A S + V + Sbjct: 195 TFALGAAHDPEALLHIAQESRGTYSFVDD 223 >gi|198426873|ref|XP_002129255.1| PREDICTED: similar to Collagen alpha-1(XIV) chain [Ciona intestinalis] Length = 725 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 51/311 (16%), Positives = 98/311 (31%), Gaps = 42/311 (13%) Query: 38 TSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR 97 + + + L SL + I+N+ G++ N + T R+ Sbjct: 82 LGGIWEITTSIQATLRGSLSSSVQTIVNEAGCILGQENTNLT-----NTDFSTPIRSAFA 136 Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 F I S+++ I + Q ++ P + L+ Sbjct: 137 ARIF---IQQSVGSSAIPITLTQQATWWSTVYR------------PGANATKFIELVSQV 181 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + LD+ V+D S S+ F D L + K + VR Sbjct: 182 ENVTVGCATHKLDLWFVIDGSGSVG--FSNFQDSLRFLAS-------LTKRFTIGPDDVR 232 Query: 218 SGLVTFSSK--IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 G +SS I F ++ +I + ST A N + + + Sbjct: 233 VGFSVYSSTSTIHSHFNQHMNNSALEAEILGTSYTGGGTSTG---RAINDVLNNGFVERN 289 Query: 276 IAKGHDD-YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 A+ + + ++ +TDG++ + K G V+ +G+ + + Sbjct: 290 GARPASEGVPRILVVMTDGQSGDSVKTPS------DNVKAAGITVFGVGIGSGIDIAEVN 343 Query: 335 NCAS-PDRFYS 344 AS PD Y+ Sbjct: 344 EIASNPDSRYA 354 >gi|260062899|ref|YP_003195979.1| hypothetical protein RB2501_14954 [Robiginitalea biformata HTCC2501] gi|88784467|gb|EAR15637.1| hypothetical protein RB2501_14954 [Robiginitalea biformata HTCC2501] Length = 378 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 66/200 (33%), Gaps = 18/200 (9%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT- 230 ++VLD+S S+ L + +D + + + + F + V Sbjct: 142 LLVLDLSNSVLS------GSLTELKSASASFIDNVMPAVPAESF-QMAIYWFDGEDVLHE 194 Query: 231 -FPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL + + + + +T + + + D E Sbjct: 195 LNPLTSSREELIAAVESIDSDFSNDPSTDLYGAVIKSTDLATDLLRDSEQNNTIGAAS-- 252 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 I+ TDG + + ++L +A + IG+ AE + L+ + Sbjct: 253 -IVLFTDGTDQASRYSESQALDKVEKA-NSNISFFTIGLGAEIDSEVLEEIGKTFSVF-A 309 Query: 346 QNSRKLHDAFLRIGKEMVKQ 365 N +L F ++ +++ ++ Sbjct: 310 GNKEELEVTFNQLSQKVSER 329 >gi|221126641|ref|XP_002157470.1| PREDICTED: similar to microneme 1 [Hydra magnipapillata] Length = 403 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 67/152 (44%), Gaps = 17/152 (11%) Query: 216 VRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAK 270 VR ++ +S + ++ W + EK++ + + T++ L+ A +F + Sbjct: 6 VRFAVIDYSDDAILQISVSDPRFWDHETFGEKVSSIEYSHGKTRTDLALKVARKHVFCNE 65 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEAA 329 L+H K +I LTDG+++ P + + F + K + ++GV + Sbjct: 66 CSLQHNI------PKLLIVLTDGQSTFPKL----TQFEAHLIKVENNLTIISVGVSDQVD 115 Query: 330 DQFLKNCASP-DRFYSVQNSRKLHDAFLRIGK 360 + LK+ A+ D + + + L+D +I K Sbjct: 116 IEELKSLATDRDHVFLLNSYSYLNDKINKILK 147 >gi|194374891|dbj|BAG62560.1| unnamed protein product [Homo sapiens] Length = 723 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 189 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 242 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 243 IFKESASPMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANY 299 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 300 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 359 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 360 VDHIREILNINQKRNDYLDIYAIGVG 385 >gi|194220813|ref|XP_001500011.2| PREDICTED: vitrin [Equus caballus] Length = 662 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 71/198 (35%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 479 DIGFVIDGSSSV------GTSNFRTVLQFVANL---SKEFEISDTDTRIGAVQYTYEQRL 529 Query: 230 TFPLA-WGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + I I R+ + T + + YA ++F K + +K Sbjct: 530 EFGFDDYNTKPDILNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKRKL 580 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 +I +TDG + + A +G I YAIGV A D+ P D + Sbjct: 581 MILITDGRSYD------DVRIPAMVAHHKGVITYAIGVAWAAQDELEVIATHPAKDHSFF 634 Query: 345 VQNSRKLHDAFLRIGKEM 362 V L+ + +I + + Sbjct: 635 VDEFDNLYKSVPKIIQNI 652 >gi|291569722|dbj|BAI91994.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 213 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 64/172 (37%), Gaps = 18/172 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S L + +VLD S SM G ++ + +++ L S P ++TF Sbjct: 2 SSRRLPVYLVLDCSGSM---CGEPIEAVNQGIKALVAELQ---SEPYAIETAYLSVITFE 55 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S Q FPL + ++ + L G TT L+ +K KG Sbjct: 56 STAQQVFPLT---ELMKFQPPVLSAGGTTSLGDALKLLTQCFDKEVKKASDTQKGDWKP- 111 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKN 335 + +TDG + D E +E K++ A + A + A + LK Sbjct: 112 -LVFLMTDGMPT----DTWE--KAADELKQKKPANIIACAAGSGADEYTLKK 156 >gi|119775138|ref|YP_927878.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella amazonensis SB2B] gi|119767638|gb|ABM00209.1| inter-alpha-trypsin inhibitor domain protein [Shewanella amazonensis SB2B] Length = 753 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 70/189 (37%), Gaps = 30/189 (15%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + + + ++++V+D S SM D + A ++ L + N Sbjct: 386 PPQPNLANRLARELVLVIDTSGSMAG------DSMVQARSALIHALGGLGPQDSFN---- 435 Query: 218 SGLVTFSSKI-----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 ++ FSS A+ + Q+ + L T+ LE A E Sbjct: 436 --IIAFSSDARPLWPDAKPATAFNLGAAQQFVRSLEADGGTEMASALELALKTPSVVDED 493 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + + ++F+TDG ++ +++LF E + + ++ + + A F Sbjct: 494 TKRL--------RQVLFITDG-----AVNGEDALFNLIERRLGTSRLFPVAIGAAPNGYF 540 Query: 333 LKNCASPDR 341 + A+ R Sbjct: 541 MSRAAAAGR 549 >gi|221130232|ref|XP_002156394.1| PREDICTED: similar to tyrosine kinase receptor [Hydra magnipapillata] Length = 1746 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 76/223 (34%), Gaps = 31/223 (13%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGL---------DMMMVLDVSLSMNDHFGPGMDKLGV 194 W +S H KI+S +++ L D+ +LD S S+ + Sbjct: 1112 WSQSSYHWNHTA-VPYKINSDTNLNLKSPDCVGIVDVGFILDSSGSLEKEYPQE------ 1164 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGS 251 + + P + +VTFSS V + L + + + + Sbjct: 1165 -KEFLINLASTFGINPSGAHA---AVVTFSSDAVLSIKLNDYFEQAPFNKSVYEIEHMNG 1220 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T+ L + + +++ + + LTDG+ + ++ + Sbjct: 1221 WTRIDLALRKSLEMFEEINGARKNVP-------RLLFLLTDGKQETNEGGAEDPVNVAQL 1273 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRKLHD 353 + RG + A+G+ L N A S D+ + +N +L Sbjct: 1274 LRDRGVEIIAVGIGKGVNRLELNNIAGSSDKVFLAENFDELIK 1316 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 47/297 (15%), Positives = 94/297 (31%), Gaps = 42/297 (14%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNI--WQTDFRNELRENGFAQDI- 105 L ++ ++ + + K + +F Q + N ++ +G + Sbjct: 1470 QDALVRFIVKQEKPLIEKAVITSLPKLEKEFVLTFDVQFESVQDLYYNVIKFSGSSVSSI 1529 Query: 106 ------NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 + ST L+ +K ++ W T + Sbjct: 1530 YPDFWFHKSRNSTMLTASTLINNKLVSIDQNP-------VAFDMWNKVVLSQTFNGTVHL 1582 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S D+ +LD S S+ ++ +D L S +G Sbjct: 1583 EDPSCEGF-FDVGFILDSSGSLESNYSQEVDFLKQLASSFGI----------SKQGSHAG 1631 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +VTFSS+ + L + + ++ + G T+ LE A K Sbjct: 1632 VVTFSSEAKLSIQLDKYFTDADFNKAVDDIPYMGGGTRIDLALEKAIELFDTKKGSR--- 1688 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ K + LTDG + K + NE K++ ++AIG+ + L Sbjct: 1689 ----NEAPKLLFLLTDGVQ-----EPKMEIPVPNEIKQKIIQLFAIGIGSNVNKDEL 1736 >gi|297288988|ref|XP_002803438.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Macaca mulatta] Length = 997 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|227533248|ref|ZP_03963297.1| yvcC protein [Lactobacillus paracasei subsp. paracasei ATCC 25302] gi|227189098|gb|EEI69165.1| yvcC protein [Lactobacillus paracasei subsp. paracasei ATCC 25302] Length = 596 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 86/285 (30%), Gaps = 33/285 (11%) Query: 21 TAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFS 80 T L V+ I+M ++ + + NG Sbjct: 8 TGHLFAVLLILMSMLTGLVTSG-------SSVVTAAANIRPTYQTDANGTYPTNSWQVTG 60 Query: 81 YRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC 140 + + N D + N + + + + S D + DY + + + Sbjct: 61 QQNVINQRGGDQVSGWDNN-TIWNGDATDTTNSYLKFGDPNNPDYQIRKYA--KETNTPG 117 Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 + N V D+++V+D+S SM G D+ G ++ Sbjct: 118 LYDVYLNVKGNTQQNVKPV----------DIVLVVDMSGSMESKNNGGTDRAGAVRTGVK 167 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGSTTKSTPGL 259 L I++ + + V GL+ FSS G +I + + T + Sbjct: 168 NFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIHVGLGK---AGNTSQQQAI 223 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDY--------KKYIIFLTDGENS 296 A + F+ + + KK +I LTDG + Sbjct: 224 NSALSPTFNGGTYTQIGLRQGSAMLNEDTSGNKKMMILLTDGVPT 268 >gi|149377596|ref|ZP_01895335.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893] gi|149358138|gb|EDM46621.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893] Length = 718 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 61/178 (34%), Gaps = 19/178 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + +++ V+D S SM + A +++ L + D NV++ T Sbjct: 349 NRSLPRELVFVIDTSGSMAGE------SIRQARQALLRGLGTLD-ADDRFNVIQFNSQTH 401 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S + + + + L T+ P L+ A D E Sbjct: 402 SLFMESVPASGNNIARARRYVKGLNADGGTEMAPALDAALETNGDGGEA-------SRAR 454 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + ++F+TDG + + + ++ ++ +G+ + F++ A R Sbjct: 455 VRQVVFITDGAVGNESALFGKIRDGLGSSR-----LFTVGIGSAPNMHFMREAARYGR 507 >gi|197102272|ref|NP_001124862.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Pongo abelii] gi|55726163|emb|CAH89855.1| hypothetical protein [Pongo abelii] Length = 1079 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|114612128|ref|XP_518969.2| PREDICTED: collagen alpha-1(XXVIII) chain [Pan troglodytes] Length = 1125 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 63/179 (35%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + ++ ++ + FSS Sbjct: 47 IDIVFIVDSSES------SKIVLFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNATRLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K ++ +TDG + N D + +A+ G IG+ + L+ + Sbjct: 153 -VKVVLLMTDGIDHPKNPDVQS---ISEDARISGISFITIGLSTVVNEAKLRLISGDSS 207 >gi|301606773|ref|XP_002932992.1| PREDICTED: integrin alpha-11-like [Xenopus (Silurana) tropicalis] Length = 1188 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 79/213 (37%), Gaps = 39/213 (18%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P ++ + +L P ++ G++ + +V Sbjct: 166 MDIVIVLDGSNSI----YPWVE----VQSFLISILQKFYIAPGQ---IQVGVLQYGETVV 214 Query: 229 QTFPLAWGVQH--IQEKINRLIFGSTTKS--TPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + E R+ T++ G+E A + F + K Sbjct: 215 HEFYLNNYRSVTDVVEAAKRIEQRGGTETRTALGIEKAVTEAFQRGGRKG--------AK 266 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL---KN 335 K +I +TDGE + D+ + +++ YA+ V + FL K Sbjct: 267 KVMIVITDGE----SHDSPDLQRVIESSEKDNITRYAVAVLGYYNRRGINPEAFLNEIKY 322 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 AS F++V + L D +G+ + Sbjct: 323 IASDPDDKHFFNVTDEAALKDIVDALGERIFSL 355 >gi|296209823|ref|XP_002807089.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Callithrix jacchus] Length = 1094 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|221104611|ref|XP_002170515.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Hydra magnipapillata] Length = 1137 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 77/188 (40%), Gaps = 24/188 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +++ ++LD S S+ L ++I + D K VR L+ FS + Sbjct: 719 IEIGILLDASTSV---------TLSNWKKTIDFVQDFSKQFKMGPTGVRFALIDFSDDAI 769 Query: 229 QTFPLA----WGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 ++ W + EK++ + + T++ LE A +F + L H Sbjct: 770 LQISISDPRFWDQETFGEKVSSIEYSQGKTRTDLALEVARKHVFCNECGLRHNT------ 823 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRF 342 + +I LTDG+++ P + E+ +A + ++GV + + LK+ A+ D Sbjct: 824 PRLLIVLTDGQSTFPKLTQFEAQLI--KA-ENNLTIISVGVSDQVDIEELKSLATDRDHV 880 Query: 343 YSVQNSRK 350 + + + Sbjct: 881 FLLNEQKS 888 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 78/202 (38%), Gaps = 31/202 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +++ ++LD S S+ +++ + K VR ++ F+++ Sbjct: 297 MEVAILLDASTSVTS---------TNWKKTVSFVQSFTKEFVMGPTGVRFAVIDFANEAQ 347 Query: 229 QTF----PLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P W + K+ + + TK+ ++ A KIF K L Sbjct: 348 IQINILDPKYWSQEAFSRKVGSIEYSRGRTKTDLAIKLAREKIFCDKCNLRR------TV 401 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRF 342 K +I LTDG+++ P++ KE+ ++K + +GV + L + AS D Sbjct: 402 PKLLIVLTDGQSTDPDLTEKEANLIKTQSK---VSIITMGVGDKIDKNELTSMASNSDYV 458 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 + L + + I ++ + Sbjct: 459 F-------LLNGYKYINDKINQ 473 >gi|198426242|ref|XP_002124410.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 402 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 71/202 (35%), Gaps = 32/202 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D +++LD S S+ P D++ + ML +R G ++ Sbjct: 208 ARADAVLILDSSSSVRK---PNWDRM---IEFVVSMLTQFVVNESS---LRVGAFRYNRA 258 Query: 227 IVQTFPL-----AWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + + + I + + GS T++ + +A + + E+ + + Sbjct: 259 VDSDTQILLNGFTNDKTGLVQAIQDIPYRGSGTRTGNAIRHAKDVLL----LPENGNRPN 314 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----DQFLKNC 336 +++ TDG + + E + GA+ + I V + + DQ L Sbjct: 315 VTDLVFVV--TDGRSQDAVAEVAR------ELRATGAVTFVIAVIIQGSTIERDQMLDIA 366 Query: 337 ASPDRFYSVQNS-RKLHDAFLR 357 SPDR + V L F Sbjct: 367 GSPDRLFEVTGGFDDLDSVFAD 388 >gi|145491135|ref|XP_001431567.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398672|emb|CAK64169.1| unnamed protein product [Paramecium tetraurelia] Length = 590 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/306 (13%), Positives = 106/306 (34%), Gaps = 36/306 (11%) Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII 118 ++ ++L + NG + Q + + N +++ + D ++ + + Sbjct: 60 SSNQLLFKNNGPIQQVQLRQYKVYQPDQYVK-QSGNAIQKITYNDD--DMIQPKKYTPNK 116 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFP-WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 + ++ + + Y+ + T P + + + +G+D++ ++D+ Sbjct: 117 YNLNEQLSFEVKALYKKGKLPQTRPQYLPGIVSLKAQDQNIFQKQESQRVGVDLICLIDI 176 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S SM K+ + S+ +L + R L+TF + + PL Sbjct: 177 SGSMIGV------KIEMVKASLIVLLQFLGDND------RLQLITFDNDAHRLTPLKTVT 224 Query: 236 --GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + I ++ + + + + ++ + + L+DG Sbjct: 225 NQNKSYFTQIIKQIQADGGNRISEATKMTFYQL------KGRKYINNVTS---VFLLSDG 275 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-KNC-ASPDRFYSVQNSRKL 351 + + + K+ + NE ++ G + Q + + C FY VQ+ L Sbjct: 276 VDVTY-PEVKKQIKTVNEV----FTLHTFGFGEDHDAQMMTQLCNLKSGSFYFVQDVTLL 330 Query: 352 HDAFLR 357 + F Sbjct: 331 DEFFAD 336 >gi|114614242|ref|XP_001160235.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 1 [Pan troglodytes] Length = 1110 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|71900686|ref|ZP_00682810.1| von Willebrand factor, type A [Xylella fastidiosa Ann-1] gi|71729565|gb|EAO31672.1| von Willebrand factor, type A [Xylella fastidiosa Ann-1] Length = 414 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 60/203 (29%), Gaps = 12/203 (5%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 K A++ Y P + H I + + + +D+S SM+ Sbjct: 107 KGGKYGAMNPYPRPASYKIRRILKGWDHDACWYPEKAAIGMQMPPSVAVYFAIDLSGSMD 166 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 D G G +L ++ LD + V LV F L + Sbjct: 167 DVGGNGRSRLDNMKTALNAALDQLGQSIASGTAVDIMLVGFGDAPDHRQTLLR-RNCTAQ 225 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKL--EHIAKGHDDYKKYIIFLTDGENSSPNI 300 I L T+ + Y F A A + + F+TDGE P+ Sbjct: 226 GIAELKSWVATR-----QALYGTYFPAGTMDMPSFYAAASSNAVRVAFFITDGEPDPPSA 280 Query: 301 DNKES----LFYCNEAKRRGAIV 319 ++ + + G + Sbjct: 281 TLAQAARADVDQVAHLRCYGITI 303 >gi|327271908|ref|XP_003220729.1| PREDICTED: collagen alpha-1(XX) chain-like [Anolis carolinensis] Length = 1480 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 66/180 (36%), Gaps = 27/180 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + D+++++D S S+ + FG + L +D I R GL Sbjct: 241 DTTAMTDIILLVDGSWSIGRNNFGLIREFLASLVAPFNVAMDKI----------RVGLTQ 290 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +SS + L + E + L + G T + L + L+ Sbjct: 291 YSSDPRTEWDLNTYATRDEVLEALRSLRYKGGNTFTGLALTHVLEH------NLKADTGA 344 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + K II LTDG++ + L K G ++A+GV+ A + L+ AS Sbjct: 345 RSEAPKLIILLTDGKSQDDANPPAQVL------KNMGIQIFAVGVK-NADETELRQVASD 397 >gi|324515146|gb|ADY46105.1| C-type lectin protein 160 [Ascaris suum] Length = 309 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 59/164 (35%), Gaps = 23/164 (14%) Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGS--TTKSTPGLE 260 I PD VR LVTFS L + + + R+ F T L+ Sbjct: 35 TISLGPDPGQFVRVALVTFSGHATIAGDLRTFGNYSSLVDALFRMPFHGDSTIDIVKALQ 94 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + + D+++ Y K ++ L SS + N+ K G + Sbjct: 95 TASSILEDSRK-----------YVKTVVLLYSSAFSSGGF--TDPTAIANQLKESGTTIV 141 Query: 321 AIGVQAEAADQF---LKNCASPDRFYSVQNSR---KLHDAFLRI 358 + + + L + ASP ++ +S +L DAF R+ Sbjct: 142 TVAFRQQPEGSLVEKLGSIASPHFAFNSMDSDIIGELLDAFCRV 185 >gi|170681089|ref|YP_001744470.1| von Willebrand factor type A domain-containing protein [Escherichia coli SMS-3-5] gi|218700745|ref|YP_002408374.1| hypothetical protein ECIAI39_2418 [Escherichia coli IAI39] gi|170518807|gb|ACB16985.1| von Willebrand factor type A domain protein [Escherichia coli SMS-3-5] gi|218370731|emb|CAR18544.1| conserved hypothetical protein [Escherichia coli IAI39] Length = 588 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 48/341 (14%), Positives = 107/341 (31%), Gaps = 50/341 (14%) Query: 39 SHKFFVKAKLHYILDHS--LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 + ++ K L L + + A + + N G + F +K + Q Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFS 134 Query: 97 RENGFAQDINNIE----------RSTSLSIIIDDQHKDYNLSAVSR------------YE 134 + N + + +++ D+ ++ S + Sbjct: 135 LDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNYFPSDWVINDKSNNKEPVPASKPIPFA 194 Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M + PW + + I + S+ +++ ++D S SM ++L + Sbjct: 195 MRYELAPAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPL 248 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGST 252 S++ ++ ++ + +VT++ P G I I+ L + Sbjct: 249 IQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDADGS 302 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T GLE AY + KG + I+ TDG+ + D K + Sbjct: 303 TNGGAGLELAYQQAAKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQ 353 Query: 313 KRRGAIVYAIGVQ-AEAADQFLKNCA--SPDRFYSVQNSRK 350 + G + GV + + + A + + + Sbjct: 354 RESGVSLSTFGVGDSNYNEAMMVRIADVGNGNYSYIDTLAE 394 >gi|154497289|ref|ZP_02035985.1| hypothetical protein BACCAP_01582 [Bacteroides capillosus ATCC 29799] gi|150273688|gb|EDN00816.1| hypothetical protein BACCAP_01582 [Bacteroides capillosus ATCC 29799] Length = 234 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/166 (23%), Positives = 70/166 (42%), Gaps = 12/166 (7%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 + LDVS SM +G +++L +R+ LD ++ + +VTF S Sbjct: 25 LCLDVSSSM---YGQPIEELNA---GLRQFLDELRKDELTCTSAETAVVTFGSSAQCVAD 78 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 A Q ++ L T GL A + + KE+ + A G D Y+ ++ ++D Sbjct: 79 FATADQI---QVEPLEANGLTYMGEGLTMALDLLEQRKER--YKAAGVDYYQPILVVMSD 133 Query: 293 G-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 G N P + + + C + R V A+G+ A + L+ + Sbjct: 134 GCPNGDPRVLREAAQRICQMVEARRLTVVAVGIGEGADMEQLRRIS 179 >gi|329744564|ref|NP_001192916.1| voltage-dependent calcium channel subunit alpha-2/delta-1 [Bos taurus] Length = 1091 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|293347920|ref|XP_001064219.2| PREDICTED: vitrin-like [Rattus norvegicus] Length = 648 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 70/202 (34%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 465 DIGFVIDGSSSV------GTSNFRTVLQFVANL---SKEFEISDTDTRIGAVQYTYEQR- 514 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 515 ---LEFGFDKYNSKADVLSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 562 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A ++G I YAIG+ A D+ P D Sbjct: 563 KRKVMILITDGRSYD------DVRIPAMAAYQKGVITYAIGIAWAAQDELEVIATHPARD 616 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 617 HSFFVDEFDNLYKFVPRIIRNI 638 >gi|239629502|ref|ZP_04672533.1| von Willebrand factor domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] gi|239528188|gb|EEQ67189.1| von Willebrand factor domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] Length = 909 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 87/285 (30%), Gaps = 33/285 (11%) Query: 21 TAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFS 80 T L V+ I+M ++ + + NG Sbjct: 11 TGHLFVVLLILMSMLTGLVTSG-------SSVVTAAANIRPTYQTDANGTYPTNSWQVTG 63 Query: 81 YRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC 140 + + N D + N + + + + S D + DY + + + Sbjct: 64 QQNVINQRGGDQVSGWDNN-TIWNGDATDTTNSYLKFGDPNNPDYQIRKYA--KETNTPG 120 Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 + N V D+++V+D+S SM G D+ G ++ Sbjct: 121 LYDVYLNVKGNTQQNVKPV----------DIVLVVDMSGSMESKNNGGTDRAGAVRTGVK 170 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGSTTKSTPGL 259 L I++ + + V GL+ FSS G +I+ + + T + Sbjct: 171 NFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIRVGLGK---AGNTSQQQAI 226 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDY--------KKYIIFLTDGENS 296 A + F+ + + KK +I LTDG + Sbjct: 227 NSALSPTFNGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDGVPT 271 >gi|254443704|ref|ZP_05057180.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258012|gb|EDY82320.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 344 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 57/167 (34%), Gaps = 29/167 (17%) Query: 168 GLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 ++++ +D+S SM D P +L A + MLD ++ GLV F+ Sbjct: 92 SREVIIAMDLSKSMLADDMKP--SRLDRAKLVVESMLDTLEGES-------VGLVVFAGT 142 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P++ Q ++ + + T L A + + Sbjct: 143 AFLQSPMSPDYQILRGFLKEINPSFIPQGGTNYEAMLTTALDSFEQSDGMA--------- 193 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +++I ++DGE+ N K E K + +G + Sbjct: 194 -DRFLIIISDGESLDSNWKAK-----AEELKEQNVRAICLGFGTKEG 234 >gi|42526760|ref|NP_971858.1| batB protein, putative [Treponema denticola ATCC 35405] gi|41817075|gb|AAS11769.1| batB protein, putative [Treponema denticola ATCC 35405] Length = 322 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 71/194 (36%), Gaps = 27/194 (13%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F + + PL + V + G+ ++ V D+S SM+ ++ V Sbjct: 53 FFSAAWIFLILGLACPLWGSKPVSV---RRRGVSVIFVSDISKSMS-LQDIHPSRIAVQR 108 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GST 252 + ++ +L+ + + GLV + V + PL++ + IN L + Sbjct: 109 QFLKILLEKMHKTSPESA---VGLVITKGEGVLSVPLSFEKNALSSAINALSPLILSSTG 165 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G+ A + + + K I+ TDG +S ++ L + Sbjct: 166 TNLEAGVLRALDSFGENRGN-----------SKIIVLCTDGGETSGSL-----LHAAEKI 209 Query: 313 KRRGAIVYAIGVQA 326 K+ AI+ +G Sbjct: 210 KKTDAILIIVGFGT 223 >gi|149919202|ref|ZP_01907685.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149819916|gb|EDM79338.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 877 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 49/259 (18%), Positives = 103/259 (39%), Gaps = 40/259 (15%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS-SKSDIGL---DMMMVLDVSLSMND 183 S ++ + F + + P T V+ S D + +M+ V+D S SM+ Sbjct: 328 STAAQAKATAYFGPQLSQSVAGAQPGHFTLVVEPPQSDLDSLVGQREMIFVIDRSGSMSG 387 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF---SSKIVQTFPLAW--GVQ 238 L +A +++RE L ++ + N +++F ++ + + A + Sbjct: 388 V------PLALAKQTLREALSHLRPVDTFN------VISFESSTAMLYEAAVPANEQNLV 435 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 H + I+ L G T + ++ A + H +Y+ F+TDG S+ Sbjct: 436 HAERFIDGLQAGGGTMMSGAVDAA---LSPEIGLGRH---------RYVFFVTDGFISNE 483 Query: 299 NIDNKESLFYCNEAKRRG--AIVYAIGVQAEAADQFLKNC--ASPDRFYSVQN---SRKL 351 + +++ A + G A V+ +G+ + + L + A R+ +V N R+ Sbjct: 484 DEIARQASALVRAADKAGQRARVFGMGIGSSPNRELLASLSKAGKGRYLAVGNREHPREA 543 Query: 352 HDAFLRIGKEMVKQRILYN 370 +A+ R+ V I + Sbjct: 544 VEAYTRMVDSAVLTDIHID 562 >gi|114600333|ref|XP_517769.2| PREDICTED: integrin alpha-1 [Pan troglodytes] Length = 1179 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 157 VVNSIAPVQECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ 208 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + +++ T + G++ A + F Sbjct: 209 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEA 265 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + ++I + Sbjct: 266 RGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNR 315 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 316 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 362 >gi|83647468|ref|YP_435903.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83635511|gb|ABC31478.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 659 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 65/200 (32%), Gaps = 32/200 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + P ++ K + +++ LD+SLSM ++L A Sbjct: 66 LWPLILVLSVVALAGPTWSKQETPVTLKQEA---LVVTLDLSLSMLATDLTP-NRLTRAR 121 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----ST 252 + + ++LD K + LV FS PL ++ + L Sbjct: 122 QKVYDLLDARKEGQ-------TALVAFSGSGHVVAPLTEDSNTLRAMLPALDPFIMPEMG 174 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + + G+E A N I A I+ +TDG ++ + + Sbjct: 175 SNAAAGIESALNVIKQAGAVNAR-----------ILLITDG------VEEVDVQPINDLL 217 Query: 313 KRRGAIVYAIGVQAEAADQF 332 + G + +GV A+ Sbjct: 218 SKAGVSISVLGVGADDGGPI 237 >gi|13603394|gb|AAA52056.2| type VI collagen alpha 2 chain precursor [Homo sapiens] Length = 1019 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEINQDTINRIIKVMK 239 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|262527579|sp|P54289|CA2D1_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor Length = 1103 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 65/187 (34%), Gaps = 37/187 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGV---QAEAADQFLKNCAS 338 K I+ TDG + + N+ K + V+ V + C + Sbjct: 356 --KIIMLFTDG-------GEERAQEIFNKYNKDKKVRVFTFSVGQHNYDRGPIQWMACEN 406 Query: 339 PDRFYSV 345 +Y + Sbjct: 407 KGYYYEI 413 >gi|297673847|ref|XP_002814961.1| PREDICTED: anthrax toxin receptor 2 isoform 1 [Pongo abelii] gi|261858584|dbj|BAI45814.1| anthrax toxin receptor 2 [synthetic construct] Length = 488 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 197 FPVKGG---FQALKGIINSILAQS 217 >gi|223670962|dbj|BAH22728.1| complement factor B precursor [Nematostella vectensis] Length = 858 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 74/205 (36%), Gaps = 28/205 (13%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 I GLD++ V D S S MD + E++ ++ + Sbjct: 384 IELNEAGGLDVVFVFDASSS------IKMDDFRLGLDFSIELVKLLGT-SWKPGGTHVAA 436 Query: 221 VTFSSKIVQTFPL----AWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +T+ ++ F L A + + KI ++ G T S L+ ++ + Sbjct: 437 ITYGTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQVVPFTREGS- 495 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +K + F+TDG ++ K +G +YAIGV + + L Sbjct: 496 --------QKALFFITDGHSNIGG----SPRKAAKILKDKGFQIYAIGVGKKVRRRELME 543 Query: 336 CAS---PDRFYSVQNSRKLHDAFLR 357 AS + SV+ ++L A + Sbjct: 544 IASEPEDEYVISVRKYKQLLSAVKK 568 >gi|224809466|ref|NP_001139266.1| anthrax toxin receptor 2 isoform 2 [Homo sapiens] gi|306526289|sp|P58335|ANTR2_HUMAN RecName: Full=Anthrax toxin receptor 2; AltName: Full=Capillary morphogenesis gene 2 protein; Short=CMG-2; Flags: Precursor Length = 489 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 197 FPVKGG---FQALKGIINSILAQS 217 >gi|156390865|ref|XP_001635490.1| predicted protein [Nematostella vectensis] gi|156222584|gb|EDO43427.1| predicted protein [Nematostella vectensis] Length = 851 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 74/205 (36%), Gaps = 28/205 (13%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 I GLD++ V D S S MD + E++ ++ + Sbjct: 377 IELNEAGGLDVVFVFDASSS------IKMDDFRLGLDFSIELVKLLGT-SWKPGGTHVAA 429 Query: 221 VTFSSKIVQTFPL----AWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +T+ ++ F L A + + KI ++ G T S L+ ++ + Sbjct: 430 ITYGTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQVVPFTREGS- 488 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +K + F+TDG ++ K +G +YAIGV + + L Sbjct: 489 --------QKALFFITDGHSNIGG----SPRKAAKILKDKGFQIYAIGVGKKVRRRELME 536 Query: 336 CAS---PDRFYSVQNSRKLHDAFLR 357 AS + SV+ ++L A + Sbjct: 537 IASEPEDEYVISVRKYKQLLSAVKK 561 >gi|119626254|gb|EAX05849.1| anthrax toxin receptor 2, isoform CRA_b [Homo sapiens] Length = 322 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 197 FPVKGG---FQALKGIINSILAQS 217 >gi|62089412|dbj|BAD93150.1| anthrax toxin receptor 2 variant [Homo sapiens] Length = 502 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 56 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 101 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 102 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 153 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 210 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 211 FPVKGG---FQALKGIINSILAQS 231 >gi|30013741|gb|AAP04016.1| capillary morphogenesis protein 2 [Homo sapiens] Length = 489 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 197 FPVKGG---FQALKGIINSILAQS 217 >gi|50513243|ref|NP_477520.2| anthrax toxin receptor 2 isoform 1 [Homo sapiens] gi|21750159|dbj|BAC03731.1| unnamed protein product [Homo sapiens] Length = 488 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 197 FPVKGG---FQALKGIINSILAQS 217 >gi|323701386|ref|ZP_08113060.1| von Willebrand factor type A [Desulfotomaculum nigrificans DSM 574] gi|323533645|gb|EGB23510.1| von Willebrand factor type A [Desulfotomaculum nigrificans DSM 574] Length = 602 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 93/242 (38%), Gaps = 25/242 (10%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 +++Y +P ++++ L S I +++ LD+++++D S SM G Sbjct: 381 QVINQYNLPSGELDEDALYQAAYSNRLFKRSEIIETRTR-NLDIVLLIDSSASMVYPAGE 439 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQTFPLAWGVQHIQEKINR 246 G+ ++ +A ++ ++ + V V F+ K V L + +I Sbjct: 440 GISRVELARNLAALFVEALEPVDSVKTWV----FGFNLKGAVNLMELYSPHLTNKARIGM 495 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + TT L+YA ++ + K +I + DG N +P + K Sbjct: 496 SVAEGTTPEGSALKYAALRLMSEGRRF---------VPKVLIVIADG-NPNPGPETKLVK 545 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV--QNSRKLHDAFLRIGKEMVK 364 + K G I V +++ YS+ + + F R+ K++V+ Sbjct: 546 EQVRKMKALGCKTINISVGDRPGEEY-------GYEYSIPWTDYNTVIIEFGRLLKKLVE 598 Query: 365 QR 366 +R Sbjct: 599 ER 600 >gi|119026487|ref|YP_910332.1| truncated clumping factor [Bifidobacterium adolescentis ATCC 15703] gi|118766071|dbj|BAF40250.1| truncated clumping factor [Bifidobacterium adolescentis ATCC 15703] Length = 431 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 72/244 (29%), Gaps = 21/244 (8%) Query: 40 HKFFVKAKLHYILDHSLLY--TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR 97 H + D T + + + + D + + KN D + Sbjct: 188 HAGENRTDAVANGDAGATADDGKTGDADDADNDGNTAEDADDADKNGKNDADKDAASAQN 247 Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSA------VSRYEMPFIFCTFPWCANSSHA 151 N ++ + + + I+ D+ + V+ + Sbjct: 248 GNDNDKNDAAKDDTDADEHIMRDRFTFNRKTVMRAARNVAVPQPDHTKSITYNNGGKYTL 307 Query: 152 PLLITSSVKISS-KSDIGLDMMMVLDVSLSMNDHFGPGMDK--------LGVATRSIREM 202 L + S ++ +++++VLD S SMN + L + Sbjct: 308 NLNVVGKDTRESHETTEKIEVVLVLDTSGSMNYCMDGSQRRCNKSNPKRLTALKEAATSF 367 Query: 203 LDII----KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 +D +I D N+ VR + F L ++ ++RL T + G Sbjct: 368 IDATETTNDTIQDENSKVRIAIAQFGQTSGVVSSLTSDTAALKSSVSRLSANGATPADKG 427 Query: 259 LEYA 262 + A Sbjct: 428 MAAA 431 >gi|90423420|ref|YP_531790.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB18] gi|90105434|gb|ABD87471.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB18] Length = 333 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 76/226 (33%), Gaps = 39/226 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S + G +++V+D S SM+D F A++S ++S + G Sbjct: 74 QSSQQIGSGAHIVLVIDRSSSMDDSFAGSRPTAEQASKSAEAR-RFLRSFVSIGEHDMFG 132 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIA 277 + FS+ +Q PL + + I+ + T G+ A D Sbjct: 133 VAIFSTAPLQALPLTSHREAVLAAIDAIDRPGLSETDIARGIAMALAMHDDDP------- 185 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------- 329 + I+ ++DG + ID + ++R +Y + ++ A Sbjct: 186 ---SAASRAIVLVSDG---AGVIDRRVQEKLRAAFRKRPINLYWVFLRTANARGIFEPPG 239 Query: 330 ---------------DQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 +F ++ P R + + + DA I + Sbjct: 240 AGERDVPQVAPERHLHRFFQSLKIPYRAFEAERPESIGDAIAEIAR 285 >gi|291242941|ref|XP_002741338.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 788 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 68/203 (33%), Gaps = 40/203 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLDVS SM + KL A + + +++ + G VTFS K Sbjct: 298 IVLVLDVSGSM--SLKSRIIKLQQAVYTFI--------MDEISLGIDVGCVTFSDKANI- 346 Query: 231 FPLAW--------GVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++W + + L G T L + + Sbjct: 347 --ISWLMPINSDEDREEFLALVMPTLNTGGNTAIGSALIAGVQVLSQNDTQPADGG---- 400 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + +TDG+ + P + + G IV + A ++ L+ AS Sbjct: 401 ----ILFLVTDGQENVPQFIEE----VIDNVIESGVIVDTLAWGLFAEEK-LETIASGTK 451 Query: 339 PDRFYSVQNSRK--LHDAFLRIG 359 +Y + ++ +AF+ + Sbjct: 452 GSSYYYSEQAQSNAHVEAFMEVA 474 >gi|194390782|dbj|BAG62150.1| unnamed protein product [Homo sapiens] Length = 1173 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 157 VVNSIAPVQECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ 208 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + +++ T + G++ A + F Sbjct: 209 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEA 265 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + ++I + Sbjct: 266 RGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNR 315 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 316 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 362 >gi|125586597|gb|EAZ27261.1| hypothetical protein OsJ_11198 [Oryza sativa Japonica Group] Length = 405 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 75/227 (33%), Gaps = 55/227 (24%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 S+ S LD++ VLDVS SM +D++ A + L + + Sbjct: 60 TSSATSRAALDLIAVLDVSTSMAGD---KLDRMKAALLFVIRKLADVDCLS--------- 107 Query: 220 LVTFSSKIVQTFPLAWG-----VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 +VTFS+ + +PL + ++ ++ L+ T GLE + + Sbjct: 108 IVTFSNDAARLYPLRFDAGDAAWADLKALVDGLVADGNTNIRAGLEIGL-AVAAGRRLTV 166 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 A+ D + +D G V+ G+ A+ L+ Sbjct: 167 GRAQNRGDATR--------------LDPG------------GVPVHTFGLGADHDPAVLQ 200 Query: 335 NCASPDR---FYSVQNSRKLHDAFLR--------IGKEMVKQRILYN 370 A R F+ V + L F + I +++ + Sbjct: 201 AIAGKSREGMFHYVADDVNLTAPFSQLLGGLLTIIAQDLELMVTRVD 247 >gi|28198673|ref|NP_778987.1| hypothetical protein PD0767 [Xylella fastidiosa Temecula1] gi|182681364|ref|YP_001829524.1| von Willebrand factor type A [Xylella fastidiosa M23] gi|28056764|gb|AAO28636.1| conserved hypothetical protein [Xylella fastidiosa Temecula1] gi|182631474|gb|ACB92250.1| von Willebrand factor type A [Xylella fastidiosa M23] gi|307579810|gb|ADN63779.1| von Willebrand factor type A [Xylella fastidiosa subsp. fastidiosa GB514] Length = 941 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 59/203 (29%), Gaps = 12/203 (5%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 K A++ Y P + H I + + + +D+S SM+ Sbjct: 107 KGGKYGAMNPYPRPASYKIRRILKGWDHDACWYPEKAAIGMQMPPSVAVYFAIDLSGSMD 166 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 G G +L ++ LD + V LV F L + Sbjct: 167 YVGGNGRSRLDNMKTALNAALDQLGQSIASGTAVDIMLVGFGDAPDHRQTLLR-RNCTAQ 225 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKL--EHIAKGHDDYKKYIIFLTDGENSSPNI 300 I L T+ + Y F A A + + F+TDGE P+ Sbjct: 226 GIAELKSWVATR-----QALYGTYFPAGTMDMPSFYAAASSNAVRVAFFMTDGEPDPPSA 280 Query: 301 DNKES----LFYCNEAKRRGAIV 319 ++ + + G + Sbjct: 281 TLAQAARADVDQVAHLRCYGITI 303 >gi|293359740|ref|XP_233802.5| PREDICTED: vitrin [Rattus norvegicus] Length = 648 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 70/202 (34%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 465 DIGFVIDGSSSV------GTSNFRTVLQFVANL---SKEFEISDTDTRIGAVQYTYEQR- 514 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 515 ---LEFGFDKYNSKADVLSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 562 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A ++G I YAIG+ A D+ P D Sbjct: 563 KRKVMILITDGRSYD------DVRIPAMAAYQKGVITYAIGIAWAAQDELEVIATHPARD 616 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 617 HSFFVDEFDNLYKFVPRIIRNI 638 >gi|55962354|emb|CAI11851.1| novel protein (zgc:56119) [Danio rerio] gi|56207241|emb|CAI21014.1| novel protein (zgc:56119) [Danio rerio] Length = 946 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 42/194 (21%), Positives = 74/194 (38%), Gaps = 20/194 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREML--DIIKSIPDVNNVVRSGLVTFSSKIV 228 ++ V+DVS SM +G M + A ++I + L D SI D N+ VR +S +V Sbjct: 306 IVFVIDVSGSM---WGLKMKQTVEAMKAILDDLSIDDYFSIIDFNHNVRC----WSEDLV 358 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 Q + V ++ I + T L A + A H II Sbjct: 359 QASSIQ--VDEAKKYIQNIKPNGGTNINEALLRAIQMLIKAS---HHGLIDPRSVS-MII 412 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-----RFY 343 ++DG+ + I + ++++G+ + FL+ A + R Y Sbjct: 413 LVSDGDPTVGEIKLSTIQKNVKLRMKEEFSLFSLGIGFDVDFDFLERIAMDNRGIAQRIY 472 Query: 344 SVQNSRKLHDAFLR 357 + QN+ + F Sbjct: 473 ANQNAAEQLKTFYS 486 >gi|327299330|ref|XP_003234358.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] gi|326463252|gb|EGD88705.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] Length = 741 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 72/203 (35%), Gaps = 27/203 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTFS 224 D+++V+D+S SMN + + +LD+ K I +N R +VTF Sbjct: 71 DIVLVIDISGSMNSAAPIPTGE-RGGEDTGLSILDLTKHAAKTIIETLNEKDRLAVVTFC 129 Query: 225 SKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +++ F L + I++L S+T G++ N + Sbjct: 130 TEVNVAFELDSMNKENKSTVLGAIDKLYGKSSTNLWHGMKKGLNILATN---------PA 180 Query: 281 DDYKKYIIFLTDGENSS------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + ++ LTDG + +++L + +++ G L+ Sbjct: 181 QGKIQSLLVLTDGAPNHMCPAQGYVPKLRQTLLDHHNLTGTLPLIHTFGFGYYLRSPLLQ 240 Query: 335 NCA--SPDRFYSVQNSRKLHDAF 355 + A F + ++ + F Sbjct: 241 SIAEIGGGTFAFIPDAGMIGTVF 263 >gi|119575263|gb|EAW54868.1| hCG2002731, isoform CRA_e [Homo sapiens] Length = 1177 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 157 VVNSIAPVQECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ 208 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + +++ T + G++ A + F Sbjct: 209 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEA 265 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + ++I + Sbjct: 266 RGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNR 315 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 316 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 362 >gi|313903839|ref|ZP_07837228.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] gi|313466027|gb|EFR61552.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] Length = 1151 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 72/222 (32%), Gaps = 34/222 (15%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 MP F + L + S ++ + K+ + + +V+D S SM KL Sbjct: 436 GMPDTFGPGGYTGTPVERALPVHSDLR-NRKNLPTVALTLVIDRSGSMAGL------KLQ 488 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGS 251 +A + R + ++ R +V F S+ T PL + Sbjct: 489 MAVEAARRVAQLLTPAD------RLAVVLFDSQAYVTRPLEPVRNPGEVDRAFPA-AAQG 541 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T GL A + K + H +I LTDG + + Sbjct: 542 GTSLGSGLAAALPLMEGVKADVRH-----------VIALTDG-----VSEPFDVTGLARA 585 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKL 351 +R+G + A+ + +A L A Y + +L Sbjct: 586 FRRQGVTLSAVAIGPDADRNTLAQLAREGGGALYEAADPGQL 627 >gi|31657142|ref|NP_852478.1| integrin alpha-1 precursor [Homo sapiens] gi|124056463|sp|P56199|ITA1_HUMAN RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|187951605|gb|AAI37122.1| Integrin, alpha 1 [Homo sapiens] gi|187957526|gb|AAI37123.1| Integrin, alpha 1 [Homo sapiens] Length = 1179 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 157 VVNSIAPVQECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ 208 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + +++ T + G++ A + F Sbjct: 209 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEA 265 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE + DN + + ++I + Sbjct: 266 RGARRGVK------KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNR 315 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 316 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 362 >gi|6090615|gb|AAF03259.1| dihydropyridine receptor alpha 2 subunit [Homo sapiens] Length = 1110 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 65/187 (34%), Gaps = 37/187 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGV---QAEAADQFLKNCAS 338 K I+ TDG + + N+ K + V+ V + C + Sbjct: 356 --KIIMLFTDG-------GEERAQEIFNKYNKDKKVRVFTFSVGQHNYDRGPIQWMACEN 406 Query: 339 PDRFYSV 345 +Y + Sbjct: 407 KGYYYEI 413 >gi|328953621|ref|YP_004370955.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] gi|328453945|gb|AEB09774.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] Length = 376 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 35/60 (58%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 N + +G+I+++TA+LLPV+ GL I+ + + +K ++ +D ++ K+ NQ Sbjct: 4 NLGRHEEGAIAVITALLLPVLIGFTGLAIDIGNLYVIKTRMQSAVDAAVCGGGLKLPNQG 63 >gi|110556625|ref|NP_997091.3| calcium-activated chloride channel regulator 4 [Mus musculus] gi|148680067|gb|EDL12014.1| mCG119588 [Mus musculus] gi|148922513|gb|AAI46305.1| Chloride channel calcium activated 6 [synthetic construct] gi|151556758|gb|AAI48748.1| Chloride channel calcium activated 6 [synthetic construct] Length = 925 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 70/200 (35%), Gaps = 36/200 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M +VLDVS SM D+L ++ + L I + N G+V FSS+ Sbjct: 309 MCLVLDVSGSM-----TSYDRLNRMNQAAKYFLSQI-----IENRSWVGMVHFSSQATIV 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + T G++ A+ + + + + Sbjct: 359 HELIQINSDIERNQLLQTL-PTSANGGTSICSGIKAAFQVFKNGEYQTDGTE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYS 344 I+ L+DGE+S+ +E K G+IV+ I + A + Sbjct: 410 -ILLLSDGEDSTAKD-------CIDEVKDSGSIVHFIALGPSADLAVTNMSILTGGNHKL 461 Query: 345 VQNSRK---LHDAFLRIGKE 361 + + L DAF + E Sbjct: 462 ATDEAQNNGLIDAFGALASE 481 >gi|41386751|ref|NP_958822.1| calcium-activated chloride channel regulator 4 [Rattus norvegicus] gi|37703077|gb|AAR01113.1| parturition-related protein PRP3 [Rattus norvegicus] Length = 923 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 71/201 (35%), Gaps = 38/201 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM ++++ A + + + +S G+V F S Sbjct: 308 VCLVLDVSGSMGS--YDRLNRMNQAAKFFLQQILESRSWA--------GMVHFHSSATVK 357 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + E + T G+ A+ + KG+ Sbjct: 358 SELIQINSDVERNQLLETL-PTSASGGTSICSGIRTAFQVFKN---------KGYQTGGN 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 I+ L+DGE+S+ C +E K GA+V+ I + + N + Sbjct: 408 DILLLSDGEDSTAKD--------CLDEVKDSGAVVHFIALGKAFDQSISNMANVTGGKQL 459 Query: 343 YSVQNSRK--LHDAFLRIGKE 361 ++ ++ L DAF + E Sbjct: 460 FATDEAQNNGLIDAFGALASE 480 >gi|297606054|ref|NP_001057930.2| Os06g0578100 [Oryza sativa Japonica Group] gi|255677166|dbj|BAF19844.2| Os06g0578100 [Oryza sativa Japonica Group] Length = 622 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 79/211 (37%), Gaps = 33/211 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMND------HFGPGMDKLGVATRSIREMLDII 206 L + + + +D++ VLDVS SMND +L V S++ ++ + Sbjct: 54 LRVEAPPAADLNGHVPIDVVAVLDVSGSMNDPVAASPESNLQATRLDVLKASMKFIIRKL 113 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFP------LAWGVQHIQEKINRLIFGSTTKST--PG 258 + R +V F+ V+ + G +KI+RL + S P Sbjct: 114 D------DGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGSGSALMPE 167 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L+ A + + + + +I+ LTDG++++ +++ + Sbjct: 168 LQEAVKILDERQGNSRNRVG-------FILLLTDGDDTTGFRWSRDVIHGA----VGKYP 216 Query: 319 VYAIGVQAEAADQFLKNCA--SPDRFYSVQN 347 V+ + A + L + A S + V + Sbjct: 217 VHTFALGAAHDPEALLHIAQESRGTYSFVDD 247 >gi|294997271|ref|NP_001171103.1| integrin alpha-D isoform 2 [Mus musculus] Length = 1169 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 82/221 (37%), Gaps = 24/221 (10%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + +D+ ++D S S++ ++ ++ + S ++++ Sbjct: 145 PATMPECPGQEMDIAFLIDGSGSID------QSDFTQMKDFVKALMGQLASTSTSFSLMQ 198 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + F + Q + + I +L T + G++ ++F +K A Sbjct: 199 YSNILKTHFTFTEFKSSLSPQSLVDAIVQLQ--GLTYTASGIQKVVKELFHSKNGARKSA 256 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFL 333 K K +I +TDG+ D E EA++ G I YAIG + A Q L Sbjct: 257 K------KILIVITDGQKFR---DPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQEL 307 Query: 334 KNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 S D + V N L +I +++ +LY + Sbjct: 308 NTIGSAPSQDHVFKVGNFVALRSIQRQIQEKIFAIEVLYKQ 348 >gi|151555227|gb|AAI48416.1| Integrin, alpha D [synthetic construct] Length = 1164 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 82/221 (37%), Gaps = 24/221 (10%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + +D+ ++D S S++ ++ ++ + S ++++ Sbjct: 140 PATMPECPGQEMDIAFLIDGSGSID------QSDFTQMKDFVKALMGQLASTSTSFSLMQ 193 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + F + Q + + I +L T + G++ ++F +K A Sbjct: 194 YSNILKTHFTFTEFKSSLSPQSLVDAIVQLQ--GLTYTASGIQKVVKELFHSKNGARKSA 251 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFL 333 K K +I +TDG+ D E EA++ G I YAIG + A Q L Sbjct: 252 K------KILIVITDGQKFR---DPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQEL 302 Query: 334 KNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 S D + V N L +I +++ +LY + Sbjct: 303 NTIGSAPSQDHVFKVGNFVALRSIQRQIQEKIFAIEVLYKQ 343 >gi|54290564|dbj|BAD61973.1| zinc finger-like [Oryza sativa Japonica Group] gi|54291279|dbj|BAD62048.1| zinc finger-like [Oryza sativa Japonica Group] Length = 598 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 79/211 (37%), Gaps = 33/211 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMND------HFGPGMDKLGVATRSIREMLDII 206 L + + + +D++ VLDVS SMND +L V S++ ++ + Sbjct: 30 LRVEAPPAADLNGHVPIDVVAVLDVSGSMNDPVAASPESNLQATRLDVLKASMKFIIRKL 89 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFP------LAWGVQHIQEKINRLIFGSTTKST--PG 258 + R +V F+ V+ + G +KI+RL + S P Sbjct: 90 D------DGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGSGSALMPE 143 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L+ A + + + + +I+ LTDG++++ +++ + Sbjct: 144 LQEAVKILDERQGNSRNRVG-------FILLLTDGDDTTGFRWSRDVIHGA----VGKYP 192 Query: 319 VYAIGVQAEAADQFLKNCA--SPDRFYSVQN 347 V+ + A + L + A S + V + Sbjct: 193 VHTFALGAAHDPEALLHIAQESRGTYSFVDD 223 >gi|327265809|ref|XP_003217700.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Anolis carolinensis] Length = 885 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 70/207 (33%), Gaps = 27/207 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRSG 219 + + ++ V+DVS SM K+ A ++ ++++ +K N + VR Sbjct: 278 AHLPKNVAFVIDVSGSMWGS------KIRQAKEAMIKIVEDLKEDDHFNIILFESEVR-- 329 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + I++ P VQ + I + T GL + +A + + Sbjct: 330 --KWKDGIIKATPE--NVQEAKYFIGNITESGLTNFNGGLMAGIEMLNNAHKLKIVPERS 385 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 I L+DGE + D A + +Y++G FL+ + Sbjct: 386 ASLT----IMLSDGEANVGETDQFRIQENAKNASQGKYPLYSLGFGYNLDYGFLERLSKV 441 Query: 340 DR------FYSVQNSRKLHDAFLRIGK 360 + + + +L + + Sbjct: 442 NNGVARRIYDDSDAALQLQGFYDEVAN 468 >gi|313226334|emb|CBY21478.1| unnamed protein product [Oikopleura dioica] Length = 367 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 86/223 (38%), Gaps = 31/223 (13%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M + F A +TS D +D++ ++D S S+ G + Sbjct: 1 MQLLRSFFLLAA-------ALTSPATADCPPDAKMDLVFLVDTSSSIR---KAGHKAIES 50 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF--G 250 I +++D PD + G ++++ V F L + + ++ ++ + F G Sbjct: 51 IRSFIYKVVDGFTMGPDHTSF---GAISYNKDPVINFVLNEHYNQEGVKMAVDTIDFESG 107 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T++ + + I + +D K I +TDG +S + E+ Sbjct: 108 KGTETGKAMNFMAQMIDMGFGQR-------NDSKVVAIVITDGRSSEKHDFVAEASK--- 157 Query: 311 EAKRRGAIVYAIGVQAEAADQF---LKNCAS-PDRFYSVQNSR 349 K+ IV A+GV + ++ +K AS PD Y+++ Sbjct: 158 NLKKVVDIVIAVGVNMKKENELSREIKTIASEPDEHYAIEAES 200 >gi|126331114|ref|XP_001371712.1| PREDICTED: similar to Anthrax toxin receptor 2 precursor (Capillary morphogenesis gene 2 protein) (CMG-2) [Monodelphis domestica] Length = 610 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 70/197 (35%), Gaps = 30/197 (15%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--V 212 S + D+ VLD S S+ ++ E+ + +K + + V Sbjct: 152 SLSQAEEQPSCHGVFDLYFVLDKSGSVAQNW--------------IEIYNFVKQLTERFV 197 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDA 269 + +R + FSS+ PL + I E + L T GL A +I +A Sbjct: 198 SPGMRLSFIVFSSQATIILPLTGDSKKITEGLKDLKEVQPVGETYIHEGLRLANEQIKNA 257 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 G +I LTDG+ + K + N ++ GA VY +GV Sbjct: 258 ---------GGLKTSSIVIALTDGKLD--KLVPKYAAKEANISRTLGARVYCVGVLDFDQ 306 Query: 330 DQFLKNCASPDRFYSVQ 346 Q S D+ + V Sbjct: 307 AQLENIADSKDQVFPVT 323 >gi|114614244|ref|XP_001160279.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Pan troglodytes] gi|114614246|ref|XP_519175.2| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 3 [Pan troglodytes] Length = 1091 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + + + K + V+ V + C + Sbjct: 356 --KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENK 407 Query: 340 DRFYSV 345 +Y + Sbjct: 408 GYYYEI 413 >gi|149419345|ref|XP_001517573.1| PREDICTED: similar to anthrax toxin receptor, partial [Ornithorhynchus anatinus] Length = 139 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/134 (23%), Positives = 52/134 (38%), Gaps = 12/134 (8%) Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FS++ L + I++ ++ +++ G T G E A +I+ Sbjct: 4 RMSFIVFSTRGSTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIY------ 57 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 H II LTDGE + E N ++ GA VY +GV+ Q Sbjct: 58 -HENWQGYRTASVIIALTDGELHENLFFHAE--QEANRSRDFGATVYCVGVKDFNETQLA 114 Query: 334 KNCASPDRFYSVQN 347 + S D + V + Sbjct: 115 RIADSKDHVFPVND 128 >gi|170740935|ref|YP_001769590.1| hypothetical protein M446_2717 [Methylobacterium sp. 4-46] gi|168195209|gb|ACA17156.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 57/417 (13%), Positives = 134/417 (32%), Gaps = 55/417 (13%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI--- 63 R F + GSI ++ + L + +++G ++ + + +L + D ++L + Sbjct: 17 RVFAADRSGSIGMMFVVTLVPVLLLVGAAVDFTSYQKARTELDAVADQAVLAAVSAAGMK 76 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 ++Q + + + + N R ++ S ++ I Sbjct: 77 MSQADAEAAMAKLFTDAAAALPN-VSASPRAATAPTTDGVRTASLTYSATIRTGIMRLAG 135 Query: 124 DYNL----SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL-DVS 178 + +A + P + NS + T++ + ++ D+S Sbjct: 136 FSTVAFGGTATAASPNPIFTDFYLLLDNSPSMGVAATTADIATMVANTSDQCAFACHDMS 195 Query: 179 LSMNDHFGPGMD-----KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-----IV 228 ND++ + ++ V + ++++D + R + +F + + Sbjct: 196 AGGNDYYAKAKNLGVKMRIDVVRDATQQLMDTASAKAIAAGQYRMAIYSFGTSCSGIGLN 255 Query: 229 QTFPLAWGVQHIQEKINRLIF------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 Q L + + L + + ++ A A Sbjct: 256 QVSALTANLSTSKTDAGALDLMTVPYQNYNNDQCTDFDGIFARLNSAVPNPGSGA-SAAS 314 Query: 283 YKKYIIFLTDG--ENSSPNIDNKES----------LFYCNEAKRRGAIV---YAIGVQAE 327 +K + F++DG + + P+ K + L C K RG V Y + Sbjct: 315 PQKVVFFVSDGVADANYPSTCTKPTTNGRCQEPITLANCQALKDRGIRVAVLYTTYLPLP 374 Query: 328 AADQF--------------LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 + + CASPD ++ V S + DA + K++V + Sbjct: 375 TNGWYNTWIAPFSSQIATNMAACASPDLYWPVSPSEGIADAMKGLFKKVVDSQRRIT 431 >gi|257470753|ref|ZP_05634843.1| von Willebrand factor type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] gi|317064958|ref|ZP_07929443.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313690634|gb|EFS27469.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 376 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 69/192 (35%), Gaps = 19/192 (9%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ + + ++++ VLD + SM + A I +++ + ++ V+ Sbjct: 30 VEQTRAIEKDVEIVFVLDTTGSMG-------GLIQGAKTKIWSIVNEVMQ-NHKDSKVKI 81 Query: 219 GLVTFSSK----IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 GLV + + + + L + I + + A ++ + + Sbjct: 82 GLVAYRDRGDVYVTKVTQLNENLDEIYSVLMDYKAQGGGDDPEDVRKALHESLEIIQWSA 141 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFL 333 ++ + I + D D+ +++ +AK +G I+ I + D++ Sbjct: 142 ----PRENLSQIIFLVGDAPPHDDYNDSPDTVVTAKKAKSKGIIINTIQCGNMPSTDRYW 197 Query: 334 KNCA--SPDRFY 343 K A ++ Sbjct: 198 KAIAQFGGGEYF 209 >gi|269125745|ref|YP_003299115.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310703|gb|ACY97077.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 228 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 63/172 (36%), Gaps = 16/172 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +V D S SM + L + +++ I S P V + R +++FS Sbjct: 6 LPFYLVCDESYSMAGN------PLQEINDQLPQIVTEIASNPTVADKARLCIISFSDTAE 59 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PLA + + + + +L T + I L A GH ++ + Sbjct: 60 VLLPLA-DLNDVHQ-VPQLAPKGATSYGAAFTLLRDTIERDIRDL--KAAGHVPFRPTVF 115 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRG--AIVYAIGVQAEAADQFLKNCAS 338 FLTDG+ + + AK G + A G + + L+ A+ Sbjct: 116 FLTDGQPTDSDWATAHQRL---TAKDFGPRPTILAFGFG-DVRPETLRAVAT 163 >gi|37676927|ref|NP_937323.1| hypothetical protein VVA1267 [Vibrio vulnificus YJ016] gi|37201471|dbj|BAC97293.1| uncharacterized protein [Vibrio vulnificus YJ016] Length = 688 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 52/292 (17%), Positives = 92/292 (31%), Gaps = 54/292 (18%) Query: 87 IWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF---- 142 +WQ + N+ +N Q N + + +L KD + + +P Sbjct: 233 VWQANVTNQ--QNAEEQSPNAKQTAFTLD-------KDITVYWRLQEGLPGRLEAVSYRD 283 Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-------GMDKLGVA 195 P + L T S G D + VLD S SM+ G+ KL Sbjct: 284 PQQSERGTIKLTFT-PGDDLSAIQQGRDWVFVLDKSGSMSGKHATLTEGVKRGLGKLPSG 342 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 R M D + + +G + + V E IN++ G T Sbjct: 343 DRFRILMFD------NRVQEITNGFIAVNQN---------NVTQAIETINQIATGGGTNL 387 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 LE A + + + II +TDG + + K+ L +R Sbjct: 388 YDALERAVSGLDSDRTTG-------------IILVTDGVANVGVTEKKQFLKL---MQRY 431 Query: 316 GAIVYAIGVQAEAADQFLKNCASPDRFYS--VQNSRKLHDAFLRIGKEMVKQ 365 +Y + A L+ ++ + NS + + + ++ Q Sbjct: 432 DVRLYTFIMGNSANTPLLEPMTQVSNGFATSISNSDDILGHIMNVTSKLTHQ 483 >gi|160889564|ref|ZP_02070567.1| hypothetical protein BACUNI_01988 [Bacteroides uniformis ATCC 8492] gi|270296688|ref|ZP_06202887.1| conserved hypothetical protein [Bacteroides sp. D20] gi|317480056|ref|ZP_07939168.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861081|gb|EDO54512.1| hypothetical protein BACUNI_01988 [Bacteroides uniformis ATCC 8492] gi|270272675|gb|EFA18538.1| conserved hypothetical protein [Bacteroides sp. D20] gi|316903798|gb|EFV25640.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 342 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 LVFAAIGLFTVLLARPQFGS---KLETVKRQGVEVMIALDISNSMLAQDVQP-SRLQKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R + +++D + + G++ F+ P+ + + + +K Sbjct: 117 RLVAQLVDKM-------QNDKVGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + A N + + +I +TDGEN Sbjct: 170 TAIGAAIN-------LAARSFTPQEGVGRTVIVITDGEN 201 >gi|254420639|ref|ZP_05034363.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] gi|196186816|gb|EDX81792.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] Length = 613 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 66/185 (35%), Gaps = 23/185 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++ L++ ++DVS SM DKL +A +++ +D ++ ++ Sbjct: 239 ELPQGEQRPLNLTFLVDVSGSMR-----SPDKLDLAKQAMNLAIDRLRPQDTLS------ 287 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEK--INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + ++ T G Q ++ + + L T G+ AY + Sbjct: 288 VTYYAEGAGTTLQPTPGDQKLKMRCAVASLRASGGTAGATGMTNAY--------DQAQAS 339 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNC 336 D + ++F TDG+ + DNK Y E + G + G D ++ Sbjct: 340 FARDKVNRILMF-TDGDFNVGVTDNKRLEDYVAEKRGTGVYLSVYGFGRGNYQDARMQTI 398 Query: 337 ASPDR 341 A Sbjct: 399 AQAGN 403 >gi|118579649|ref|YP_900899.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] gi|118502359|gb|ABK98841.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] Length = 337 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 40/167 (23%), Positives = 65/167 (38%), Gaps = 19/167 (11%) Query: 166 DIGLDMMMVLDVSLSM---NDHFGPGMDKL----GVATRSIREMLDIIKSIPDVNNVVRS 218 G+D+ + +DVS SM ++ P KL R+ L I+ ++ R Sbjct: 78 RSGIDLAIGIDVSKSMLAEDETLPPEGKKLFSIPNRLNRARYCALTILSAL----KGERV 133 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G+ F+SK V PL + I + +T STPG + I E ++ Sbjct: 134 GVFLFASKGVPIVPLTND-YGYCQYILK-HANDSTISTPGSDLG-QAITTGIYLFEESSR 190 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 K I+ ++DGE N D+ A +G +Y +G Sbjct: 191 TSV---KSIVLISDGE--DINEDSSVMHEAAQRAAAKGIAIYTVGTG 232 >gi|297560582|ref|YP_003679556.1| von Willebrand factor A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296845030|gb|ADH67050.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 699 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 54/165 (32%), Gaps = 20/165 (12%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQTFPL 233 +D S SM ++ +I +L + + GLVTF + T P Sbjct: 525 VDASGSMA-----ARRRMTEVKTAILSLL-----LDAYRRRDKVGLVTFRGREAELTLPP 574 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 V +++ L G T GLE A + + E + ++ +TDG Sbjct: 575 TRSVDVAAARLDDLPAGGRTPLAEGLEEAARVL-----RRERLRDPRLRP--LLVVVTDG 627 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + ++ + G + V E+ L AS Sbjct: 628 RATGGKGAVGRAMAAADHVAGLGVT--TVVVDGESGPLRLGLAAS 670 >gi|123283202|emb|CAM24859.1| complement component 2 [Homo sapiens] gi|123857989|emb|CAM25859.1| complement component 2 [Homo sapiens] gi|168983781|emb|CAQ06832.1| complement component 2 [Homo sapiens] gi|168984348|emb|CAQ08707.1| complement component 2 [Homo sapiens] gi|168984415|emb|CAQ09271.1| complement component 2 [Homo sapiens] gi|168985076|emb|CAQ07480.1| complement component 2 [Homo sapiens] gi|168985954|emb|CAQ07110.1| complement component 2 [Homo sapiens] Length = 353 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 86 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 139 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 140 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANY 196 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 197 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 256 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 257 VDHIREILNINQKRNDYLDIYAIGVG 282 >gi|54112390|ref|NP_000713.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Homo sapiens] gi|109658756|gb|AAI17469.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|109659118|gb|AAI17471.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|119597396|gb|EAW76990.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] Length = 1091 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 65/187 (34%), Gaps = 37/187 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGV---QAEAADQFLKNCAS 338 K I+ TDG + + N+ K + V+ V + C + Sbjct: 356 --KIIMLFTDG-------GEERAQEIFNKYNKDKKVRVFTFSVGQHNYDRGPIQWMACEN 406 Query: 339 PDRFYSV 345 +Y + Sbjct: 407 KGYYYEI 413 >gi|311696337|gb|ADP99210.1| protein containing a von Willebrand factor type A (vWA) domain [marine bacterium HP15] Length = 704 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 65/191 (34%), Gaps = 33/191 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + D++ V+D S SM G + + A ++ L R ++ F+S+ Sbjct: 341 LPRDLVFVIDTSGSMA---GESIRQARDALQAGLGTLTPRD---------RFNVIQFNSQ 388 Query: 227 IVQTF-----PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + ++ ++RL T+ P L A E + Sbjct: 389 THSLFMQPEVATGNNLARARQYVDRLRADGGTEMAPALSRALE-GGGETEDGARV----- 442 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + +IF+TDG + ++ + ++ + + + F++ A R Sbjct: 443 ---RQVIFITDGAVGNEAALFRQIRQQLGNQR-----LFTVAIGSAPNRHFMREAARWGR 494 Query: 342 --FYSVQNSRK 350 + ++ + Sbjct: 495 GTYTAIHSPSD 505 >gi|160896215|ref|YP_001561797.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160361799|gb|ABX33412.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 536 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 63/201 (31%), Gaps = 28/201 (13%) Query: 174 VLDVSLSMNDHFGPGMDK-LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF- 231 VLDVS SM M + L + + + + R L+ FS + Q Sbjct: 344 VLDVSGSMKGARLAQMKEALKLLSGAEASAASQRYAAFQARE--RVLLIPFSGLVGQPAR 401 Query: 232 ------PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + L+ T L A + D ++ Sbjct: 402 VQFAAGDLQAASAQVLAYADSLVADGGTAIYDALTLAQQQARQELR---------ADPER 452 Query: 286 Y--IIFLTDGENSSPNIDNKESLFYCNEAKRRG---AIVYAIGVQAEAADQFLKNCA-SP 339 + I+ LTDG N++ D A+ G V+ I + + A + Sbjct: 453 FVSIVLLTDGANTAGR-DWAAFEREQRMARDGGAPLVRVFPIIFGEAQSGEMQALAALTG 511 Query: 340 DRFYSVQNSRK--LHDAFLRI 358 R + +N+ K L F I Sbjct: 512 GRAFDARNTGKSGLPLVFKEI 532 >gi|298709908|emb|CBJ31633.1| conserved unknown protein [Ectocarpus siliculosus] Length = 304 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 67/190 (35%), Gaps = 24/190 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII--KSIPDVNNVVRSGL 220 + S D+++VLDVS SM+ + +L +A + +++ + S +V + Sbjct: 108 AASSGPKDVVIVLDVSGSMSQY-----GRLDLAKEAAETVINTLGADSFVNVVTFSETAR 162 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V ++ + + + L T E ++ + E Sbjct: 163 VLLTNSTTLVRATEDNLGELVSLVQNLEFDLANVGTNFGAAFETTFDIL----EASRTSE 218 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLF--YCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + + + I+FLTDG + ++ + C+ G + E + Sbjct: 219 ETSSNCQTAIVFLTDGNTNVGLSTDEVTSKQIACDT---GGIYEH-----VEDGGDLSQA 270 Query: 336 CASPDRFYSV 345 A R+YS+ Sbjct: 271 MAFFYRYYSI 280 >gi|260814261|ref|XP_002601834.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] gi|229287136|gb|EEN57846.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] Length = 863 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 72/203 (35%), Gaps = 29/203 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV--TFSSKIV 228 ++ ++D S SM K+ +++ +L D+ + R ++ ++SS + Sbjct: 237 IVFIIDKSGSMGG------TKMRQTKQAMNTILK------DLRDHDRFNVMPFSYSSTMW 284 Query: 229 QTFPLAW----GVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + ++ + + R + G T + A + + + + + Sbjct: 285 RPNEMVLATRENIESARTYVRRSINAGGGTNINQAIIDAADLLRRVTDDQPNSPRSAS-- 342 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-- 341 IIFLTDG S + + A R ++ +G + FL+ A +R Sbjct: 343 --LIIFLTDGLPSVGESKPRNIMVNVKNAIREQVSLFCLGFGKDVDFPFLEKMALENRGL 400 Query: 342 ----FYSVQNSRKLHDAFLRIGK 360 + + +L + + Sbjct: 401 ARRIYEDSDAALQLKGFYDEVAT 423 >gi|152991131|ref|YP_001356853.1| hypothetical protein NIS_1388 [Nitratiruptor sp. SB155-2] gi|151422992|dbj|BAF70496.1| hypothetical protein [Nitratiruptor sp. SB155-2] Length = 928 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 42/162 (25%), Positives = 64/162 (39%), Gaps = 15/162 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI- 227 ++M++LDVS SM G G +L + +++L + D V LV FSS + Sbjct: 562 FNIMLILDVSGSMGWDSGDGTTRLSKEVEAAQKLLQEYSKLGD----VAVKLVLFSSDVS 617 Query: 228 --VQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q P W V ++ L TT ++ A +FD K+ + Y Sbjct: 618 NQAQYIPQNWMSVDKAIGMLDNLYADGTTDYVNAIDGAMQ-LFDQKDGTFFDNGANRVY- 675 Query: 285 KYIIFLTDGENSSPN-IDNKESLFYCNEAKRRGAIVYAIGVQ 325 F++DGE S ID + N + I AIG Sbjct: 676 ----FMSDGEPSYGGEIDGTLQHQWENFLIQHDIIANAIGFG 713 >gi|116625432|ref|YP_827588.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228594|gb|ABJ87303.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 78/214 (36%), Gaps = 29/214 (13%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + + +D L +V D S SM ++L A +S+ + L + + Sbjct: 78 VEKPISAFFTADTPLSTGVVFDSSRSMK-------NRLQDARQSVEQFLRTGSTGDEYF- 129 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 L+ FS + P + I ++ + + + D+ Sbjct: 130 -----LIRFSDEAKMLAPFTADTEEIARQLGSIEAKG-----------WTALNDSIVLAA 173 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + ++ +++K ++ ++DG +++ E + + VYA+ + E + K Sbjct: 174 NQSRKARNHRKALLVISDGGDNNSRYTVGEMISI---LREADLRVYAVSI-FERSQLLEK 229 Query: 335 NC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 C + R V+ L D R+ +EM + I Sbjct: 230 ICEETGGRALWVRKLGDLPDIMERLSQEMRSEYI 263 >gi|301780322|ref|XP_002925578.1| PREDICTED: collagen alpha-1(XIV) chain-like [Ailuropoda melanoleuca] Length = 1796 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 80/199 (40%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F ++ L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDENFNKIINFLYSTVGALNKI---------GADGTQVAMVQFTDDPR 1082 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I R+ + G TK+ +++ + +F A E K Sbjct: 1083 TEFKLNAYNTKETLLDAIKRISYKGGNTKTGKAIKHVRDSLFTA-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++A+GV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISGEMQSNGYNIFAVGVADADYSELVSIGSKPSSRHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIV 228 D+++++D S S+ R +R L+ + + + R GL +S Sbjct: 158 DIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFNVGSEKTRIGLAQYSGDPR 207 Query: 229 QTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + L A+ + + E + L + T A N IF+ K E A+ K Sbjct: 208 IEWHLNAFNTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKPEAGARAG--VSKI 262 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I +TDG++ I +L + G ++AIGV+ ++ + + PD Y+ Sbjct: 263 GILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADENELREIASEPDSTHVYN 316 Query: 345 VQNSRKLHDAFLRIGKEMVKQR 366 V +H + + + + Sbjct: 317 VAEFDLMHTVVESLTRTVCSRV 338 >gi|281350861|gb|EFB26445.1| hypothetical protein PANDA_015099 [Ailuropoda melanoleuca] Length = 1741 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 80/199 (40%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F ++ L ++ ++ + + +V F+ Sbjct: 1004 DLVFMVDGSWSIGDENFNKIINFLYSTVGALNKI---------GADGTQVAMVQFTDDPR 1054 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I R+ + G TK+ +++ + +F A E K Sbjct: 1055 TEFKLNAYNTKETLLDAIKRISYKGGNTKTGKAIKHVRDSLFTA-ESGTRRGIP-----K 1108 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++A+GV + + + P + Sbjct: 1109 VIVVITDGRSQD------DVNKISGEMQSNGYNIFAVGVADADYSELVSIGSKPSSRHVF 1162 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1163 FVDD----FDAFKKIEDEL 1177 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 43/225 (19%), Positives = 84/225 (37%), Gaps = 26/225 (11%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI- 205 + + P VK + D+++++D S S+ R +R L+ Sbjct: 107 GNGNKPTPPEEEVKFFCEIPAIADIVILVDGSWSIGRF----------NFRLVRLFLENL 156 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAY 263 + + + R GL +S + L A+ + + E + L + T A Sbjct: 157 VTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIEAVRNLPYKGGNTLTG---LAL 213 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N IF+ K E A+ K I +TDG++ I +L + G ++AIG Sbjct: 214 NYIFENSFKPEAGARAG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIG 265 Query: 324 VQAEAADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 V+ ++ + + PD Y+V +H + + + + Sbjct: 266 VKNADENELREIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 310 >gi|189536038|ref|XP_693697.3| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Danio rerio] Length = 1157 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D+S SM K+ + ++ + D+ L+TFS + Sbjct: 293 DVIFVIDISGSMIG------TKIKQTKAA------MVSILSDLREGDYFNLITFSDDVHT 340 Query: 230 TFPLAW------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA-KGHDD 282 V+ +E + ++I T L A + + Sbjct: 341 WKKDRTVRATRQNVRDAKEFVRKIIAAGWTNINAALLSAAKLLNPSTRSSSSTGRAPSSQ 400 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKNCASPDR 341 IIFLTDGE + + L N K G ++ + +A L+ A +R Sbjct: 401 RVPMIIFLTDGEATIGETETDVILH--NAQKSLGLVSLFGLAFGDDADFPMLRRLALENR 458 Query: 342 -----FYSVQNSRKLHDAFLR 357 Y ++ F Sbjct: 459 GVARMVYEDDDAAIQLKGFYD 479 >gi|85701714|ref|NP_001028371.1| calcium activated chloride channel [Mus musculus] gi|74202052|dbj|BAE23018.1| unnamed protein product [Mus musculus] gi|148680069|gb|EDL12016.1| mCG120741 [Mus musculus] gi|187951335|gb|AAI39089.1| Expressed sequence AI747448 [Mus musculus] gi|187957592|gb|AAI39090.1| Expressed sequence AI747448 [Mus musculus] Length = 925 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 71/203 (34%), Gaps = 36/203 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM+ D+L ++ + L I + N G+V FSS+ Sbjct: 309 VCLVLDVSGSMSSS-----DRLNRMNQAAKYFLSQI-----IENRSWVGMVHFSSQATIV 358 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + T G++ A+ + + + + Sbjct: 359 HELIQMNSDIERNKLLQTL-PTSAIGGTSICSGIKTAFQVFKNGEYQTDGTE-------- 409 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYS 344 I+ L+DGE+S+ +E K G+IV+ I + A + Sbjct: 410 -ILLLSDGEDSTAKD-------CIDEVKDSGSIVHFIALGPSADLAVTNMSILTGGNHKL 461 Query: 345 VQNSRK---LHDAFLRIGKEMVK 364 + + L DAF + E Sbjct: 462 ATDEAQNNGLIDAFGALASENTD 484 >gi|48427894|sp|Q8SQ75|CO2_PONPY RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|19110309|gb|AAL82820.1| complement C2 [Pongo pygmaeus] Length = 752 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 80/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + + I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTDVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVG 414 >gi|218131125|ref|ZP_03459929.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|317476997|ref|ZP_07936239.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986645|gb|EEC52979.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|316906790|gb|EFV28502.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 342 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 58/159 (36%), Gaps = 18/159 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F A P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 MVFAAIGLFAVLLARPQFGS---KLETVKRQGVEVMIALDISNSMLAQDVQP-SRLQKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R + +++D +++ + G++ F+ P+ + + + +K Sbjct: 117 RLVAQLVDKMEN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + A N + + II +TDGEN Sbjct: 170 TAIGAAIN-------LASRSFTPQEGVGRAIIVITDGEN 201 >gi|73972314|ref|XP_860410.1| PREDICTED: similar to complement component 2 precursor isoform 5 [Canis familiaris] Length = 748 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 80/217 (36%), Gaps = 28/217 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + V S M+D I S V Sbjct: 243 KIQIQRSGHLNLYLLLDASQSVKEE------DFHVFKESAILMVDRIFSFEIN---VSVA 293 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK L + E IN L G+ T + L + + + Sbjct: 294 IITFASKPKIIMSVLNDNSRDATEVINSLNKVNYKDHENGTGTNTYAALNSVHIMMNNQM 353 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ DN + + N+ + +YAIGV Sbjct: 354 DRLGMKTAAWQEIRHAIILLTDGKSNMGGSPKLAVDNIKEILNINQQRSDYLDIYAIGVG 413 Query: 326 AEAAD-----QFLKNCASPDRFYSVQNSRKLHDAFLR 357 D + + +Q++ L+ F Sbjct: 414 KLDVDWRELNELGSKKDGERHAFILQDTEALYQVFEH 450 >gi|73972312|ref|XP_860371.1| PREDICTED: similar to complement component 2 precursor isoform 4 [Canis familiaris] Length = 682 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 80/217 (36%), Gaps = 28/217 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + V S M+D I S V Sbjct: 177 KIQIQRSGHLNLYLLLDASQSVKEE------DFHVFKESAILMVDRIFSFEIN---VSVA 227 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK L + E IN L G+ T + L + + + Sbjct: 228 IITFASKPKIIMSVLNDNSRDATEVINSLNKVNYKDHENGTGTNTYAALNSVHIMMNNQM 287 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ DN + + N+ + +YAIGV Sbjct: 288 DRLGMKTAAWQEIRHAIILLTDGKSNMGGSPKLAVDNIKEILNINQQRSDYLDIYAIGVG 347 Query: 326 AEAAD-----QFLKNCASPDRFYSVQNSRKLHDAFLR 357 D + + +Q++ L+ F Sbjct: 348 KLDVDWRELNELGSKKDGERHAFILQDTEALYQVFEH 384 >gi|332872319|ref|XP_003319171.1| PREDICTED: collagen alpha-2(VI) chain isoform 2 [Pan troglodytes] Length = 828 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|332872317|ref|XP_003319170.1| PREDICTED: collagen alpha-2(VI) chain isoform 1 [Pan troglodytes] Length = 918 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|332872315|ref|XP_531504.3| PREDICTED: collagen alpha-2(VI) chain isoform 3 [Pan troglodytes] Length = 1019 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|332256729|ref|XP_003277468.1| PREDICTED: collagen alpha-2(VI) chain [Nomascus leucogenys] Length = 1124 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 171 PEKTDCPIHVYFVLDTSESVAMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 230 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 231 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 282 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 283 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 339 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 340 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 372 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 747 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 803 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 804 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 860 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 861 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 895 >gi|297708173|ref|XP_002830853.1| PREDICTED: hypothetical protein LOC100461231, partial [Pongo abelii] Length = 885 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 660 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 719 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 720 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 771 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 772 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 828 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 829 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 861 >gi|260837139|ref|XP_002613563.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] gi|229298948|gb|EEN69572.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] Length = 195 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 64/178 (35%), Gaps = 28/178 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +D++ +LD S S+ D V + + P+ G+V +++ Sbjct: 41 RMPVDLVFLLDGSGSIGDS------NFQVTKNFVATTTSDFQIGPNNAQ---VGIVQYAN 91 Query: 226 KIVQTFPLAWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + L +++ N +G T + ++Y N Sbjct: 92 WLYEEVSL--NQYKTLDELLPAIYNISYWGGGTYTGWAIDYVVNATLTESRGAR------ 143 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 D K +I +TDG+++ + AK+ G I+ AIGV + L A+ Sbjct: 144 QDVPKVVIVVTDGQSAD------DVRQPALRAKQSGIIMVAIGVGSIYDGTELVEIAT 195 >gi|119629723|gb|EAX09318.1| collagen, type VI, alpha 2, isoform CRA_c [Homo sapiens] Length = 1019 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|119629721|gb|EAX09316.1| collagen, type VI, alpha 2, isoform CRA_a [Homo sapiens] Length = 828 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|119629722|gb|EAX09317.1| collagen, type VI, alpha 2, isoform CRA_b [Homo sapiens] Length = 918 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|115527066|ref|NP_478054.2| collagen alpha-2(VI) chain isoform 2C2a precursor [Homo sapiens] Length = 918 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|115527070|ref|NP_478055.2| collagen alpha-2(VI) chain isoform 2C2a' precursor [Homo sapiens] Length = 828 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|115527062|ref|NP_001840.3| collagen alpha-2(VI) chain isoform 2C2 precursor [Homo sapiens] gi|125987812|sp|P12110|CO6A2_HUMAN RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor Length = 1019 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|41350923|gb|AAH65509.1| Collagen, type VI, alpha 2 [Homo sapiens] gi|190690005|gb|ACE86777.1| collagen, type VI, alpha 2 protein [synthetic construct] gi|190691377|gb|ACE87463.1| collagen, type VI, alpha 2 protein [synthetic construct] Length = 1019 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 38 PEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYG 97 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I + Sbjct: 98 GLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR-------- 149 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +TDG + + A+ G ++A+ +Q L++ AS Sbjct: 150 -SKGTVHFAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVAPNQNLKEQGLRDIAS 206 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + I ++ + + I K Sbjct: 207 TPHELYRNDYATMLPDSTEIDQDTINRIIKVMK 239 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AY+++ + + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 760 >gi|85706702|ref|ZP_01037794.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] gi|85668760|gb|EAQ23629.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] Length = 240 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 67/219 (30%), Gaps = 42/219 (19%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + + D M+V D S SM + TR I + + + +P++ R GL++ Sbjct: 20 AATGCATDAMLVFDGSGSMAEVGHDP----TAPTRIIEARVALRRVMPEIAPYRRIGLLS 75 Query: 223 F--------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + S P+ + I L G T + A + Sbjct: 76 YGAGGSHPCSGITRHFGPMPDAGAAVVAGIEALTPGGLTPIAASVAAAAEVL-------- 127 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-------- 326 G+ + ++ +TDG + EA+ V+ IG + Sbjct: 128 ----GYRTHPGIVVLVTDGNETCGGTPCALGTALAAEARD--LTVHVIGFRVVHDPFSWN 181 Query: 327 -----EAADQFLKNC---ASPDRFYSVQNSRKLHDAFLR 357 Q + C A+ F S + +L A Sbjct: 182 SPEAKGYDGQTVAKCLADATGGLFVSTETVDELVAALRE 220 >gi|197118197|ref|YP_002138624.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087557|gb|ACH38828.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 318 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 55/157 (35%), Gaps = 20/157 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S SM G +L A ++R+ + +K R GLV F+ Sbjct: 82 DILFLLDTSKSML-TRDLGQSRLAAAKEAVRQAMAGLKGE-------RVGLVVFAGSAFL 133 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 PL + + G T PG A A ++ +G D K ++ Sbjct: 134 VCPLTTDYALFDQVLKE--AGEETLPLPGTSLA-----AALKEARRALQGEGDEPKVVVL 186 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 L+DGE+ G +YA+ Sbjct: 187 LSDGEDHEGEYVAAARALN-----AAGVKLYAVAAGT 218 >gi|332365023|gb|EGJ42788.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK355] Length = 462 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 49/247 (19%), Positives = 80/247 (32%), Gaps = 35/247 (14%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N KKQ D S K+++ E + G QD I + S D +K +A+ Sbjct: 106 NKKKQDWDVSELGTKSLYNMKLDLEFKTEGAYQDNRLISYNLSGK-YPDTNNKLSIDTAI 164 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM- 189 S +F + + + + V D S SMN Sbjct: 165 SALNTKQVFSKVAKGKKGIAIAYRTD-----PIQGQMNIAVSFVFDTSGSMNWDLQGRNV 219 Query: 190 ------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI----VQTFPLAWGVQH 239 ++ + + M+ + I +V+ LV FS+ L G Sbjct: 220 EKTGNESRMDILRKKSVIMIKDLAEIGNVS----VNLVGFSTSAKYIQQNFSNLDNGTNT 275 Query: 240 IQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 I I + L T GL Y + +L KYI+ LTDG + Sbjct: 276 IIATITKRENLNPDGVTNPGDGLRYGMISLQSQPAQL-----------KYIVLLTDGIPN 324 Query: 297 SPNIDNK 303 + +D++ Sbjct: 325 AYLVDSR 331 >gi|85705211|ref|ZP_01036310.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] gi|85670084|gb|EAQ24946.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] Length = 580 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 51/354 (14%), Positives = 111/354 (31%), Gaps = 55/354 (15%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F G++++L+ + + ++ G+ ++T + +A L LD ++L AT Sbjct: 19 FIQEEDGTVTVLSFFIFVMFLMMGGIGLDTMRQEMARASLQATLDRAVLAGATASTEAGA 78 Query: 69 GNN-----GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 K ++D+ T G ++ + + Sbjct: 79 RTIVEDYFAKSGQSDYLLAQKDGDISTTLNAAKVTAGAELSLDTYLMKLAGVPTLSASGT 138 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS----SVKISSKSDIGLDMMMVLDVSL 179 + + E + A++S L T+ + + S G +M ++ S Sbjct: 139 ATAEVRIPKLEAILVLDVSGSMASNSKIQNLQTAAKDFVTTVMNSSKPGDTVMSIVPFSF 198 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----------------DVNNVVRSGLVT 222 S+ D L V K VN +V + + Sbjct: 199 SVTPP-QSVFDALAVEETHNYSTCLEFKENDYQHATLSSGSSSLSSGIPVNQMVYTSVYG 257 Query: 223 ------------FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI---- 266 ++ + ++ P + + + KI+ L T G+ + + Sbjct: 258 DFDNLDSGWRSCYTDEYIRILPYSTSITDLHAKIDALQPAGNTSGNEGMNWGAALLDPTF 317 Query: 267 ----------FDAKEKLEHIAKGHDDYK--KYIIFLTDGENSSPNIDNKESLFY 308 E L ++ +D+ + K IIF+ DG N++ ++ S Y Sbjct: 318 REVTASMIAAGHLSETLANVPSDYDEPETLKAIIFMGDGANTTSYFFDRSSPKY 371 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 16/65 (24%), Positives = 26/65 (40%), Gaps = 5/65 (7%) Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF---LKNCAS-PDRFYSVQNSRKLHDAFLRIGKEM 362 C K G +VY+IG + L CAS P ++ + + AF I + Sbjct: 515 NVCKATKTEGVVVYSIGFEVPVNGTAENQLSACASSPAHYFRA-SGTDIKSAFSAIAANV 573 Query: 363 VKQRI 367 + R+ Sbjct: 574 KQLRL 578 >gi|118579647|ref|YP_900897.1| hypothetical protein Ppro_1218 [Pelobacter propionicus DSM 2379] gi|118502357|gb|ABK98839.1| conserved hypothetical protein [Pelobacter propionicus DSM 2379] Length = 367 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 52/283 (18%), Positives = 87/283 (30%), Gaps = 68/283 (24%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 K Y ++ M I SS+ K +S +M+V D+S SM Sbjct: 47 KAYGADLLAATAMVLIVLAIANVQYSSYWQ-------KTYPESRW---IMLVQDLSGSMG 96 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G + LD S D+ G++ FS P ++ + ++ Sbjct: 97 RSGEEG-----ASQTLGDVALDGASSFIDMRKKDDLIGIIAFSDVAQLVAPPSFDREILK 151 Query: 242 EKIN------------RLIFGSTTKSTPGLEYA---------------YNKIFDAK---- 270 K+ L+ G T ++ A Y +I D + Sbjct: 152 RKLELLRRKNDSPLFRDLVLGGETNASYATWLAVCVFFMFLPEENQPSYEQINDMRYSLM 211 Query: 271 EKLEHIAKGHDDYKKY-------IIFLTDG--ENSSPNIDNKESLF----YCNEAKRRGA 317 + K I+ TDG E+S N D + L + +R G Sbjct: 212 GRSGTALHIPAQLKDIDFGRGMAIVLFTDGRIESSQGNGDVESGLLNFINVISLVRRLGI 271 Query: 318 IVYAIGVQAEAADQFLKNC------ASPDRFYSVQNSRKLHDA 354 +Y + V E A+ + +S +S+ R L A Sbjct: 272 RLYLVVVGGEVANDVRQAIEEPVGGSSAGHVFSM--PRSLDKA 312 >gi|198421591|ref|XP_002123589.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 401 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 48/306 (15%), Positives = 93/306 (30%), Gaps = 45/306 (14%) Query: 72 GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS 131 D + N++ ++ N + LS+ + + S Sbjct: 100 AAPATGDLLECPVSNLFSSNPPPRPACNSRNPPGAQRGDAFGLSVDVSPNGRLSACSPTK 159 Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI------GLDMMMVLDVSLSMNDHF 185 + P P + + S+ +++ LDM+ VLD S S+ Sbjct: 160 QQNCPPDSIYSPGYC---YNSMNRGSTWAPGPETNKIRCPIIDLDMLFVLDGSGSV---- 212 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-----------IVQTFPLA 234 G D + ++ + + G++ +S I PL Sbjct: 213 --GKDNFEIVKNWTIKVANSFDISD---GYTQVGVIQYSHYWATEPLDKQSYIKTEVPLG 267 Query: 235 --WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 Q + + T T +A NK ++ + K +I LTD Sbjct: 268 KYRNKQEFSAAVRNISLHEYTTYT---AHALNKTVFDFQQSSRWNRPKT--SKVLILLTD 322 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA----SPDRFYSVQNS 348 G ++ + N + +A+GV EA ++ L+ A + +R Y N Sbjct: 323 GLSTDKQLLPSS----ANYVRSLNITTFAVGVG-EANEKELQEIANGQGTNERVYYTSNF 377 Query: 349 RKLHDA 354 L+ Sbjct: 378 AGLNKI 383 >gi|118138230|pdb|2I6Q|A Chain A, Complement Component C2a gi|118138231|pdb|2I6S|A Chain A, Complement Component C2a Length = 517 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/181 (21%), Positives = 71/181 (39%), Gaps = 23/181 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 KI + L++ ++LD S S++++ + S M+D I S V Sbjct: 8 SKIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKESASLMVDRIFSFEIN---VSV 58 Query: 219 GLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDA 269 ++TF+S+ V L + + E I+ L G+ T + L Y + + Sbjct: 59 AIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQ 118 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGV 324 L + + II LTDG+++ D+ + N+ + +YAIGV Sbjct: 119 MRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGV 178 Query: 325 Q 325 Sbjct: 179 G 179 >gi|281342668|gb|EFB18252.1| hypothetical protein PANDA_020017 [Ailuropoda melanoleuca] Length = 885 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 75/203 (36%), Gaps = 29/203 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + ++ F+ + Sbjct: 248 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAEDQFS------VIDFNHNVRTW 295 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + ++ I ++ T L A + +A + Sbjct: 296 RNDLVSATKTQIVDAKKYIEKIQPSGGTNINEALLRAIFILNEANNLGMLDPESVS---- 351 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCASPDR-- 341 II ++DG+ + N + K S N + ++++G+ + FLK ++ +R Sbjct: 352 LIILVSDGDPTVGNCELKLSKIQKNVKQNIRDNIALFSLGIGFDVDYDFLKRLSNENRGI 411 Query: 342 ----FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 412 AQRIYGNQDTSSQLKKFYNQVST 434 >gi|47218290|emb|CAG04122.1| unnamed protein product [Tetraodon nigroviridis] Length = 993 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 70/204 (34%), Gaps = 26/204 (12%) Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW------ 235 D+FGP + ++ +L D+ R ++FSS+I P Sbjct: 316 RDYFGPPCGRKT--KEALLTIL------GDLRPADRFNFISFSSRIRVWQPGRLVPATPS 367 Query: 236 GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 V+ ++ + L G T ++ + + + IIFLTDG+ Sbjct: 368 AVRDAKKFVVMLPTSGGGTDIDGAIQTGSSLL----RDHLSGRDAGPNSVSLIIFLTDGQ 423 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-----RFYSVQNSR 349 + + L A R ++ IG+ + + L+ A + R ++ Sbjct: 424 PTVGEVRPGAILGNARAAVRDKFCIFTIGMGDDVDYRLLERMALDNCGMMRRIPEEADAS 483 Query: 350 KLHDAFLR-IGK-EMVKQRILYNK 371 + F IG + R+ Y + Sbjct: 484 SMLKGFYDEIGTPLLSDIRVNYTQ 507 >gi|260810969|ref|XP_002600195.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] gi|229285481|gb|EEN56207.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] Length = 323 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 78/216 (36%), Gaps = 36/216 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-KSIPDVNNVVRSGL 220 I D++++LD S S+ D A SI +LD I I + + L Sbjct: 108 QRNPAIKRDLLIILDDSGSIGF------DAFQKAKASIATVLDYICPGIGLYSPYHQVAL 161 Query: 221 VTFSSKIVQTFPLAWGVQH---------------IQEKINRLIFGSTTKSTPGLEYAYNK 265 +TF S + F + + +N+ G T + L+YA Sbjct: 162 MTFHSTPTKQFDFNDHGSYAELKEAILAVPYEVLMTNFVNK-QGGPRTDTHEALDYARTT 220 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 +F ++ + + ++ LTDG+ + ++ ++ + G V+A+G+ Sbjct: 221 MFTSR----TGLRPGSLRE--VLLLTDGQPN----EDDLTVQAAERLRNSGITVFALGIA 270 Query: 326 AEAADQFLKNCASPDRF---YSVQNSRKLHDAFLRI 358 ++ L+ S + + + L D + Sbjct: 271 DGVDNEHLEQLVSDPEYKHIFHLNTFEDLADMVANV 306 >gi|149412375|ref|XP_001507696.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 691 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 50/315 (15%), Positives = 105/315 (33%), Gaps = 41/315 (13%) Query: 53 DHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST 112 + ++ + ++ + + NI+ E + N+ Sbjct: 403 NRGGAANVAVVMVDGWPTDRVEESSRLARESGINIFFITI-----EGAAESEKQNVVEPN 457 Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 + + ++ Y+L+ S + + + +H L S ++S D+ Sbjct: 458 FVDKAVCRRNGFYSLNVPSWFGLQKVARPLAKRVCDTHR--LACSKTCLNSA-----DVG 510 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 V+D S S+ G + + I K + R G V ++ + F Sbjct: 511 FVIDGSSSV------GTGNFRTVLQFVAN---ISKEFEVSDTDTRVGAVQYTYEQRLEFG 561 Query: 233 LA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + I R+ + T + + YA ++F+ K + +K +I Sbjct: 562 FDQHRTKSDLLSAIKRVNYWSGGTSTGAAIRYALERLFE---------KSKPNKRKLMIV 612 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN 347 +TDG + + A R+G I YAIG+ A D+ + P D + V Sbjct: 613 ITDGRSYD------DVRIPALAAHRKGVITYAIGITWAAQDELEVMASDPDKDHAFFVDE 666 Query: 348 SRKLHDAFLRIGKEM 362 L+ +I + + Sbjct: 667 FDNLYTFVPQIIQNI 681 >gi|126277540|ref|XP_001376725.1| PREDICTED: similar to integrin alpha 11 subunit [Monodelphis domestica] Length = 1530 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 502 QTYMDIIIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 550 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + ++E + + T++ + G Sbjct: 551 DVVHEFHL-NDYRSVKEVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 603 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + ++++ YA+ V + FL Sbjct: 604 AKKVMIVITDGE----SHDSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGINPEAFLNEI 659 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 660 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 694 >gi|73542340|ref|YP_296860.1| hypothetical protein Reut_A2655 [Ralstonia eutropha JMP134] gi|72119753|gb|AAZ62016.1| putative membrane protein [Ralstonia eutropha JMP134] Length = 412 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 48/140 (34%), Gaps = 2/140 (1%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G I + + L V+ + GLVI+ F K +L +D L A ++ + Sbjct: 7 KKERGVILPIVGLTLAVLLGMAGLVIDLGAMFVAKTELQSAVDSCALAAAQELDGAADAL 66 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK--DYNLS 128 + K +Q + + + D S++ + + + ++ + + Sbjct: 67 TRATSAGLTAGNANKVQYQKASASLIDTDVTFSDSLTGAFSSTFTPVANARYAKCGHLTT 126 Query: 129 AVSRYEMPFIFCTFPWCANS 148 + Y + + + Sbjct: 127 GILAYLIQMVGGPTSNAVAA 146 >gi|62286486|sp|Q863A0|CO2_GORGO RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|29690202|gb|AAM10001.1| complement C2 [Gorilla gorilla] Length = 752 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 79/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+SK V L + + E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASKPKVLMSVLNDNSRDMTEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRILGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D + N+ + +YAIGV Sbjct: 389 VDRIREILNINQKRNDYLDIYAIGVG 414 >gi|229577024|ref|NP_001153314.1| collagen type XXVIII alpha 1 a [Danio rerio] gi|228007387|emb|CAQ51228.1| collagen type XXVIII alpha 1 a [Danio rerio] Length = 1208 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 78/211 (36%), Gaps = 26/211 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D L++ +LD S + D+ T I E L I+ R+ L+ + Sbjct: 55 DEDCSLELAFLLDSSETAKDNHQQE----KKFTMDIIEGLQSIRLDTGRKLSWRAALLQY 110 Query: 224 SSKIVQTFPLA-W-GVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 SS ++ L W G ++ + I + + T +T + + + Sbjct: 111 SSHVIIEQTLKQWKGTENFKSSIAPMAYIGHGTYTTYAI---------TNMTKIFVEESS 161 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-------QFL 333 + K + LTDG N D ++ +AK +G V+ IG+ A D + L Sbjct: 162 PERIKIALLLTDGFFHPRNPDIFSAMA---DAKNQGVKVFTIGITRTANDPVNAANLRLL 218 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + Y++Q++ + +I + Sbjct: 219 SSTPASRFLYNLQDTNVMEKVITQIAQLAND 249 >gi|125719088|ref|YP_001036221.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain-containing protein [Streptococcus sanguinis SK36] gi|125499005|gb|ABN45671.1| Nitric oxide reductase NorD / Von Willebrand factor type A (vWA) domain protein, putative [Streptococcus sanguinis SK36] gi|324989618|gb|EGC21563.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK353] Length = 444 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 50/354 (14%), Positives = 113/354 (31%), Gaps = 74/354 (20%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETS-HKFF---------------VKAKLHYIL 52 + ++ I+ ++ ++GL+I + F ++ + Y + Sbjct: 1 MMKKIQKGFTLTEIIIAIILTSMVGLLIGLVFNTMFSGRNIIEREASIQSEMRTSMQY-V 59 Query: 53 DHSLLYTATKILNQENGNNGKKQK---------NDFSYRIIKNIWQTDFRNELRENGFAQ 103 D ++ + + E+ +K + ++I IW ++ Sbjct: 60 DRTIGKATSVFVLDESKYGKDVRKTEGWNYIGLSPDGKKVINYIWNKSTKSWDESVLGTN 119 Query: 104 DINNIERSTSLSI---IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 + +++ D++ +YNL+ +Y+ + ++ + +I+ K Sbjct: 120 SLYDMQLDLEFKADESYQDNRLINYNLT--GQYKNSKNKLSIDTAISALNTKQVISKVAK 177 Query: 161 I-----------SSKSDIGLDMMMVLDVSLSMND-----------------HFGPGMDKL 192 + + + V D S SM ++ Sbjct: 178 GKKGVALAYRNDPIEGQVNTAVSFVFDTSGSMAYGLRNEGKRNSQGKWGPLDADNPRARM 237 Query: 193 GVATRSIREMLDIIKSIPDVN-NVVR-SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 + + ++D +K I +V+ N+VR SG ++ L I+EKI L Sbjct: 238 NILKKKANLLVDDLKEIGNVSVNLVRFSGSASY--IQEDFVELDKDTGKIKEKIKSLPTS 295 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 T GL Y + +L KY++ LTDG ++ Sbjct: 296 WITNPGDGLRYGLVSLQRNPAQL-----------KYVVLLTDGIPNAYTGSPDG 338 >gi|49809|emb|CAA46541.1| alpha-2 collagen [Mus musculus] Length = 1029 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 71/213 (33%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D +++ VLD S S M + + L + V R G Sbjct: 48 PEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLDQVALSWRYG 107 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I +H+ K Sbjct: 108 GLHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGK 161 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G + + + +TDG + + A+ G ++A+ +Q L++ A+ Sbjct: 162 GVVN---FAVVITDGHVTGSPCGGIK--MQAERAREEGIRLFALAPNRNLNEQGLRDIAN 216 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 N + I ++ + + I K Sbjct: 217 SPHELYRNNYATMRPDSTEIDQDTINRIIKVMK 249 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 622 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 678 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AYN++ + + Sbjct: 679 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRRQKTRV--- 735 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 736 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 770 >gi|308502223|ref|XP_003113296.1| CRE-DIG-1 protein [Caenorhabditis remanei] gi|308265597|gb|EFP09550.1| CRE-DIG-1 protein [Caenorhabditis remanei] Length = 13921 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 45/237 (18%), Positives = 88/237 (37%), Gaps = 36/237 (15%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLIT------SSVKISSKSDIGLDMMMVLDVSLSMN 182 + Y + + P ++ P T S + S D D+++VLD S Sbjct: 13158 SYPHYSLQIV----PNVESARTWPTPRTKATTPAGSGRSCSSIDFESDVIIVLDSS---- 13209 Query: 183 DHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 ++F P D+ ++ ++D PDV+ G V +S K+ P+A G + Sbjct: 13210 ENFTP--DEFDSMKDAVASIVDTGFDLAPDVSK---IGFVIYSDKV--AVPVALGHYEDK 13262 Query: 242 -EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 E I +++ + + + G ++ K +I +T+G+N Sbjct: 13263 IELIEKIVDAEKINDGVAIAL----YGLNAARQQFQLHGRENATKIVILITNGKNRGNAA 13318 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASPDRFYSVQNSRKLHD 353 E L G ++A+ V + + + L ++PD V S ++ D Sbjct: 13319 AAAEDLRD-----MYGVQLFAVAVGSNPDELATIKRLVGNSNPDHVIEVAQSTEIDD 13370 >gi|225310541|emb|CAQ19231.1| collagen type XXVIII alpha 1 b precursor [Danio rerio] Length = 806 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 73/199 (36%), Gaps = 32/199 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + ++++ V+D S S+ + A ++ + N R GLV +S Sbjct: 437 KERPMELVFVVDSSESIGPENFEIIKDFVAA---------LVDRLTIGRNATRVGLVLYS 487 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++ F LA Q I+E I R+ G T + + A ++ F + Sbjct: 488 LEVQLEFNLARYTTKQDIKEAIRRVHYIGEGTYTGSAIHNATHEAFYSAR---------T 538 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD------QFLKN 335 KK I +TDG+ + + EA +YA+G+ + Q L Sbjct: 539 GVKKVAIVITDGQTD--KREPVKLEIAVREAHAANIEMYALGIMNISNPTQNEFLQELSL 596 Query: 336 CAS---PDRFYSVQNSRKL 351 AS + Y +++ L Sbjct: 597 IASHPDSEHMYYIKDFNTL 615 >gi|298704728|emb|CBJ28324.1| conserved unknown protein [Ectocarpus siliculosus] Length = 877 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 80/226 (35%), Gaps = 35/226 (15%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML--D 204 N+ P+ + +S +D G + + VLD+ S + + +L + + + L Sbjct: 246 NAGCPPVPDAVASPTASAADGGSEGLCVLDLQRS-DPYRSRKQSRLSLTRMDVSKQLFHA 304 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 I + V GLV+F + ++ +++++ L TK + A Sbjct: 305 FINRQQAYDLPVEVGLVSFGDDVDVPCEPTPLFENFRDEVDTLTPAGNTKLFDAISEACT 364 Query: 265 KI-------------FDAKEKLEHIAKGHD-----------DYKKYIIFLTDGENSSPNI 300 + +E + A G D ++ L+DG+++ I Sbjct: 365 LLEKWQTEWVEKADKRKEEENRKRKAAGGDQANHGPVPDEKRPVLRVVVLSDGKDTKSTI 424 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---ASPDRFY 343 + C ++ G IV +I V E +Q C A+ + Sbjct: 425 ---SAHAVCGRLQKVGVIVDSITVGTEKNNQL--KCLSLATGGYAF 465 >gi|149197817|ref|ZP_01874866.1| von Willebrand factor type A domain protein [Lentisphaera araneosa HTCC2155] gi|149139038|gb|EDM27442.1| von Willebrand factor type A domain protein [Lentisphaera araneosa HTCC2155] Length = 1078 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/269 (14%), Positives = 95/269 (35%), Gaps = 32/269 (11%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + I++ Y++ +++ A + + ++ + S Sbjct: 677 ASHVRIEEFINNFDYHYSVPKKEAFKIDSELSDHKVYAGVKLLRVGVQGQ-RLGADSQKP 735 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKI 227 V+D S SM ++L + +++ M + +V + G+ +++I Sbjct: 736 GSYTFVIDNSGSMAAE-----NRLPLIQKTLPNMFKAMNQDDEVTILSCEGGVTNLANRI 790 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + ++ + + G+ + G+E AY + Sbjct: 791 TAS-----NHSQLETAVKNIEAGTVANLSVGIEEAYKLAAQNFRSGAVNR---------V 836 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYS 344 I L+DG S + +E L ++ +++G IGV +E D FL+ A +Y Sbjct: 837 ILLSDGIASLGEKEAQEVLKTVSQYRKQGIGNTVIGVGSEDYDDSFLETLANKGDGVYYF 896 Query: 345 VQNSRKLHD--------AFLRIGKEMVKQ 365 + +++D +F I +++ Q Sbjct: 897 GDSKEQMNDILVNNFEASFKTIARDVKIQ 925 >gi|332233322|ref|XP_003265851.1| PREDICTED: anthrax toxin receptor 2 [Nomascus leucogenys] Length = 488 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 44/231 (19%), Positives = 78/231 (33%), Gaps = 33/231 (14%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 FP + L + D+ VLD S S+ +++ Sbjct: 15 LFPGLWLLVLSGLGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW--------------I 60 Query: 201 EMLDIIKSIPD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKS 255 E+ + ++ + + V+ +R + FSS+ PL I E + + T Sbjct: 61 EIYNFVQQLAERFVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKHVSPVGETYI 120 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 GL+ A +I A G II LTDG+ + + ++ Sbjct: 121 HEGLKLANEQIQKA---------GGLKTSSIIIALTDGKLDG--LVPSYAEKEAKISRSL 169 Query: 316 GAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 GA VY +GV Q + S ++ + V+ A I ++ Q Sbjct: 170 GASVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQS 217 >gi|310825488|ref|YP_003957846.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309398560|gb|ADO76019.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 414 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 60/193 (31%), Gaps = 29/193 (15%) Query: 166 DIGLDMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +++D S SM +D GL+ F Sbjct: 32 RAPVAVNLLIDRSASMRGAPLVAAVDAAQSLVAQAGPR-------------DYIGLLAFD 78 Query: 225 SKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 Q P+ + E+++ L GS T +E + + + Sbjct: 79 GVPEQLLPVRAMEPDAKTELSERLSSLETGSGTALHEAVELGSSSLH-------RVLIPG 131 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 K ++ LTDGE S + + G ++A+G+ + L+ +SP Sbjct: 132 ARRK--LLLLTDGEPSVGPAALADFKTLGAKVAESGVTLHALGLGRHYIPEMLEALSSPS 189 Query: 341 --RFYSVQNSRKL 351 F ++ L Sbjct: 190 GTGFAHADDAEAL 202 >gi|115380522|ref|ZP_01467490.1| hypothetical membrane associated protein [Stigmatella aurantiaca DW4/3-1] gi|115362473|gb|EAU61740.1| hypothetical membrane associated protein [Stigmatella aurantiaca DW4/3-1] Length = 424 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 60/193 (31%), Gaps = 29/193 (15%) Query: 166 DIGLDMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +++D S SM +D GL+ F Sbjct: 42 RAPVAVNLLIDRSASMRGAPLVAAVDAAQSLVAQAGPR-------------DYIGLLAFD 88 Query: 225 SKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 Q P+ + E+++ L GS T +E + + + Sbjct: 89 GVPEQLLPVRAMEPDAKTELSERLSSLETGSGTALHEAVELGSSSLH-------RVLIPG 141 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 K ++ LTDGE S + + G ++A+G+ + L+ +SP Sbjct: 142 ARRK--LLLLTDGEPSVGPAALADFKTLGAKVAESGVTLHALGLGRHYIPEMLEALSSPS 199 Query: 341 --RFYSVQNSRKL 351 F ++ L Sbjct: 200 GTGFAHADDAEAL 212 >gi|262173885|ref|ZP_06041562.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] gi|261891243|gb|EEY37230.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] Length = 403 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 70/206 (33%), Gaps = 20/206 (9%) Query: 23 ILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK--NDFS 80 ++ P + +++ ++ S +F A+L + + L + N +K + + Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKEDDENNVSYARKLVDRYV 60 Query: 81 YRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIF 139 I +I + ++G Q T ++ +HK +S + Sbjct: 61 VDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKS----WISYENISLKP 116 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 ++ + L +D+ ++D+S SM + G ++ I Sbjct: 117 EFTVNGSSVTRKFLP------------QPVDVYFIVDMSASMRATWQNGKSQIDEVKNVI 164 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSS 225 +++ +K R L+ + + Sbjct: 165 TRVVNDLKDFDTEVKS-RVALLGYHN 189 >gi|157694070|ref|YP_001488532.1| hypothetical protein BPUM_3319 [Bacillus pumilus SAFR-032] gi|157682828|gb|ABV63972.1| hypothetical protein YwmC [Bacillus pumilus SAFR-032] Length = 233 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 67/235 (28%), Gaps = 29/235 (12%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F T S + + K + + + ++LD S SM G K +A Sbjct: 7 FTLVTLAVLTLSMSISSPVFAKASTVKKHNKDVRVTILLDASGSMARKVE-GERKFDLAK 65 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTF-----SSKIVQTFPLAWGVQHIQEK-------- 243 + + + + + +R L +S Q+ + GV +Q Sbjct: 66 QEVFKFAQSL----PKDAKIRMSLFGSEGNNKNSGKAQSCEVIRGVYGVQPYEKESFENS 121 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 +N L T LE+A +H + +TDGE + K Sbjct: 122 LNGLGPNGWTPIARALEHAKQTDEQLNNGTKH----------IVYLITDGEETCGGDPVK 171 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLR 357 + N V + Q + A +Y +++ Sbjct: 172 VAKELHNSKGSTVVNVIGLDFNDGYEGQLKQVAKAGKGHYYQASTGKEMGSILSA 226 >gi|94971019|ref|YP_593067.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94553069|gb|ABF42993.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 391 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 60/179 (33%), Gaps = 19/179 (10%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + + L + +++D S S+ D+L + + LD + + P Sbjct: 90 ITQFRRDTKLPLTLGLLVDTSYSVR-------DELPAEKTASEKFLDDMLAQPKDQAF-- 140 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-----TTKSTPGLEYAYNKIFDAKEK 272 L+ F ++ L + I L ++ +++DA Sbjct: 141 --LIHFDREVELMTDLTSSKDKLHRGIGELETSGPPSQSSSDDGQRHRRGGTQLYDAIYL 198 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 +K I+ LTDGE+ +++ A AIVYAI + E Sbjct: 199 AASEILQKQQGRKAIVVLTDGEDRGSKETLTDAVEAAQRAD---AIVYAIYFKGEQEQS 254 >gi|296100188|ref|YP_003617105.1| hypothetical protein pDK1_p020 [Pseudomonas putida] gi|295443554|dbj|BAJ06433.1| hypothetical protein [Pseudomonas putida] Length = 604 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 77/208 (37%), Gaps = 29/208 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 S + ++LD S SM + A ++ +L ++ +P +V +G Sbjct: 418 SKSRAERQSASIQILLDKSGSMK-------SAMDQAEAAVYAVLSALEGLP----LVTTG 466 Query: 220 LVTF----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++F + + + + + + ++ FG+ ++ L A Sbjct: 467 AMSFPNKANDGVERCALIKSPKERLIRAVSEGGFGAMSEGGTPLAQA----LWPAAVEVL 522 Query: 276 IAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 AKG KK + +TDGE N+ KE + C G V +G A + LK Sbjct: 523 RAKGE---KKILFVITDGEPNAGTTHAAKEFIQRCEV---SGIEVIGLGFG-SANEHILK 575 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEM 362 S ++ +V L ++ + +E Sbjct: 576 ALFS--QYRAVGEVANLKNSLFELVREA 601 >gi|257897710|ref|ZP_05677363.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835622|gb|EEV60696.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 1104 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/153 (23%), Positives = 64/153 (41%), Gaps = 25/153 (16%) Query: 161 ISSKSD--IGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 I S+ +D++ VLD S SMN+ G G K S+ E+ + + S P+++ + Sbjct: 390 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIESVNEISENLLSDPNMD--I 447 Query: 217 RSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFG----------STTKSTPGLEYAY 263 R G+V F S+ I ++ + + INRL T T GL+ Y Sbjct: 448 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 507 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 ++ + + +K +I + DG + Sbjct: 508 ETLYADNGGE------NRNPEKILIVVGDGTPT 534 >gi|327188854|gb|EGE56047.1| putative vault protein inter-alpha-trypsin domain-containing protein [Rhizobium etli CNPAF512] Length = 794 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 53/300 (17%), Positives = 102/300 (34%), Gaps = 31/300 (10%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N K + + D + QD + R+ SL KD+ L+ Sbjct: 255 NAKINPVSLTVNLKAGFPLGDVNSSFHAVDIRQDSDQ-ARTISLKGDAVPADKDFELTWK 313 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG--LDMMMVLDVSLSMNDHFGPG 188 + F + L + + +++ V+D S SM+ Sbjct: 314 AAPGKTPSAGLFREVKDGKTYLLAFVTPPTAPDAAAAPTKREVVFVIDNSGSMSGQ---- 369 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVR--SGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + A +S+ + + D NV+R + + + +V P + + Sbjct: 370 --SIEQARQSLALAISRLSK-DDRFNVIRFDDTMTDYFNGLVAASP--DNREKAITYVRG 424 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 L T+ P LE DA +A G + ++FLTDG I N++ L Sbjct: 425 LTADGGTEMLPALE-------DALRNQGPVASGAL---RQVVFLTDG-----AIGNEQQL 469 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 F A R A V+ +G+ + F+ A R F ++ ++ ++ + ++ Sbjct: 470 FQEISANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQVASRMGELFAKLQN 529 >gi|320169699|gb|EFW46598.1| hypothetical protein CAOG_04556 [Capsaspora owczarzaki ATCC 30864] Length = 501 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 61/162 (37%), Gaps = 26/162 (16%) Query: 152 PLLITSSVKISSKSDIG---LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 + +++K + +I LD++ V+D S SM KL + +++ + Sbjct: 23 TVWAMTTLKAPTFENIRRPTLDIVAVIDKSGSMAG------TKLELVKKTLETL------ 70 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + + R LVT+ +++ L+ G + +N + GS+T + GL N Sbjct: 71 VAQLRACDRLALVTYDTEVTLDLALSPMDDKGRSKATQVVNGIRDGSSTNLSGGLLEGLN 130 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + + ++ LTDG + + + Sbjct: 131 ILRNRPTDSRREVSS-------VLLLTDGLANVGISTTEGII 165 >gi|99031847|pdb|2B2X|A Chain A, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab gi|99031850|pdb|2B2X|B Chain B, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab Length = 223 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 75/216 (34%), Gaps = 37/216 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++VLD S S + + ++L + P G+V + + Sbjct: 23 LDIVIVLDGSNS--------IYPWESVIAFLNDLLKRMDIGPKQTQ---VGIVQYGENVT 71 Query: 229 QTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + + N+++ T + G++ A + F K Sbjct: 72 HEFNLNKYSSTEEVLVAANKIVQRGGRQTMTALGIDTARKEAFTEARGARRGVK------ 125 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---------QAEAADQFLKN 335 K ++ +TDGE S N K+ + C ++I + E + +K+ Sbjct: 126 KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKS 181 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 AS F++V + L +G+ + L Sbjct: 182 IASEPTEKHFFNVSDELALVTIVKALGERIFALEAL 217 >gi|89094360|ref|ZP_01167301.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Oceanospirillum sp. MED92] gi|89081419|gb|EAR60650.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Oceanospirillum sp. MED92] Length = 707 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/273 (13%), Positives = 91/273 (33%), Gaps = 37/273 (13%) Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 +G K N ++ + W + N + ++ I SLS +D + Sbjct: 210 DGLRLPKHPNASLQQLSNHEWTANLANTAAASSTDEEGQPINTD-SLSGPATTLDQDILM 268 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPL--LITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + +P + + ++ L + + G D + VLD+S SM F Sbjct: 269 YWRHQPNLPASVDMLAYKESGNNKGTYKLTLTPGTDLPAFNQGRDWVFVLDISGSMKGKF 328 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-----LAWGVQHI 240 ++ + + + +++ R +V F+++ V++ Sbjct: 329 AALVEGVR-------------EGLSNLSPNDRFRIVLFNNQARSFTQGYLPADKTTVENT 375 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 +++++ G T GL+ N++ + I+ +TDG + Sbjct: 376 LNQLDQIQPGQGTNLYAGLQTGINQLDSDRSTA-------------IVLVTDGVANVGTT 422 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + L N +++ ++ + A L Sbjct: 423 HKSKFL---NLLEQKDVRLFTFIMGNSANRPLL 452 >gi|22127455|ref|NP_670878.1| hypothetical protein y3581 [Yersinia pestis KIM 10] gi|45442682|ref|NP_994221.1| putative tellurium resistance protein [Yersinia pestis biovar Microtus str. 91001] gi|51597754|ref|YP_071945.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 32953] gi|108809183|ref|YP_653099.1| putative tellurium resistance protein [Yersinia pestis Antiqua] gi|108810630|ref|YP_646397.1| tellurium resistance protein [Yersinia pestis Nepal516] gi|145597696|ref|YP_001161772.1| tellurium resistance protein [Yersinia pestis Pestoides F] gi|150260367|ref|ZP_01917095.1| putative tellurium resistance protein [Yersinia pestis CA88-4125] gi|153949002|ref|YP_001399501.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162420775|ref|YP_001605636.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165928282|ref|ZP_02224114.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165937805|ref|ZP_02226366.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|166008850|ref|ZP_02229748.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211902|ref|ZP_02237937.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167398450|ref|ZP_02303974.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422005|ref|ZP_02313758.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167426387|ref|ZP_02318140.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167467477|ref|ZP_02332181.1| putative tellurium resistance protein [Yersinia pestis FV-1] gi|170022821|ref|YP_001719326.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896945|ref|YP_001874057.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|218927794|ref|YP_002345669.1| putative tellurium resistance protein [Yersinia pestis CO92] gi|229837273|ref|ZP_04457436.1| putative tellurium resistance protein [Yersinia pestis Pestoides A] gi|229840487|ref|ZP_04460646.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842969|ref|ZP_04463120.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. India 195] gi|229900823|ref|ZP_04515947.1| putative tellurium resistance protein [Yersinia pestis Nepal516] gi|294502682|ref|YP_003566744.1| putative tellurium resistance protein [Yersinia pestis Z176003] gi|21960549|gb|AAM87129.1|AE013961_1 hypothetical [Yersinia pestis KIM 10] gi|45437548|gb|AAS63098.1| putative tellurium resistance protein [Yersinia pestis biovar Microtus str. 91001] gi|51591036|emb|CAH22700.1| Putative tellurium resistance protein [Yersinia pseudotuberculosis IP 32953] gi|108774278|gb|ABG16797.1| tellurium resistance protein [Yersinia pestis Nepal516] gi|108781096|gb|ABG15154.1| putative tellurium resistance protein [Yersinia pestis Antiqua] gi|115346405|emb|CAL19277.1| putative tellurium resistance protein [Yersinia pestis CO92] gi|145209392|gb|ABP38799.1| tellurium resistance protein [Yersinia pestis Pestoides F] gi|149289775|gb|EDM39852.1| putative tellurium resistance protein [Yersinia pestis CA88-4125] gi|152960497|gb|ABS47958.1| putative tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162353590|gb|ABX87538.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165914217|gb|EDR32833.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|165919724|gb|EDR37057.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165992189|gb|EDR44490.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206648|gb|EDR51128.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166960142|gb|EDR56163.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167050954|gb|EDR62362.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167054610|gb|EDR64417.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169749355|gb|ACA66873.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699971|gb|ACC90600.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|229682162|gb|EEO78254.1| putative tellurium resistance protein [Yersinia pestis Nepal516] gi|229690035|gb|EEO82093.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. India 195] gi|229696853|gb|EEO86900.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705396|gb|EEO91406.1| putative tellurium resistance protein [Yersinia pestis Pestoides A] gi|262360712|gb|ACY57433.1| putative tellurium resistance protein [Yersinia pestis D106004] gi|262364659|gb|ACY61216.1| putative tellurium resistance protein [Yersinia pestis D182038] gi|294353141|gb|ADE63482.1| putative tellurium resistance protein [Yersinia pestis Z176003] Length = 212 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 65/196 (33%), Gaps = 16/196 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM + ++ +L ++ P ++TF S Sbjct: 3 RLPVYLLLDTSGSMTGE------PIEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + K+ L+ TT L I + +K KG +I Sbjct: 57 RQAVPLT---DLLNFKLPELVANGTTALGDALSLTAKCIGNEVQKTTADTKGDWRPLVFI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG SP D ++ L A+ G V A + L+ + Sbjct: 114 --MTDG---SPTDDWRKGLSDFKAART-GV-VVACAAGHAVETKVLQEITEIVLQLDTAD 166 Query: 348 SRKLHDAFLRIGKEMV 363 S + F + + Sbjct: 167 SSSIKAFFKWVSASIS 182 >gi|303241024|ref|ZP_07327534.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302591449|gb|EFL61187.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 569 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 64/198 (32%), Gaps = 10/198 (5%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + +++ V+D S SM+ G +D A +++ K+ +++ L+ Sbjct: 377 QTNMKKRANVLFVIDTSGSMS---GEPIDNARSAIQNLFNKEAQEKNYTSIDDEDTISLM 433 Query: 222 TFSSKI-VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 TF++ + + + I+ L T ++ A E Sbjct: 434 TFNTDVSDVYTVKGKDISEMSVVIDSLSASGNTHLYDAVDKAIT------EHQALKQSES 487 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + I+ L+DG +++ I + + + ++ IG D Sbjct: 488 EKKIDIIVVLSDGADTNSQIQFSQLESMLKQKEGNLPVIITIGYGNVDKDVLESISDKTG 547 Query: 341 RFYSVQNSRKLHDAFLRI 358 Y N + F I Sbjct: 548 GKYYEGNPDTIKKVFEEI 565 >gi|328675376|gb|AEB28051.1| hypothetical protein FN3523_0194 [Francisella cf. novicida 3523] Length = 332 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 78/235 (33%), Gaps = 42/235 (17%) Query: 134 EMPFIFCTFPWCAN-SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P IF + S P V + + ++ LDVS SM+ +L Sbjct: 58 LIPLIFLFIWFVTIFSLAGPTWKYKDVPVY---QKNISRVIALDVSQSMDTTDVSP-SRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 A I ++L IK G++ FSS+ PL I+ + + Sbjct: 114 ERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTIENLVTVINSDIV 166 Query: 251 --STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 L+ + I A + II +TD + + +++ Sbjct: 167 PVQGHNIYKALKKSAQLIEQAGAQQGQ-----------IILITD------SSPSPQAISQ 209 Query: 309 CNEAKRRGAI--VYAI-----GVQAEAADQFLKNCASPDRFYSVQ--NSRKLHDA 354 + ++G VYAI G+ + +LK+ +++ + L A Sbjct: 210 AKQLAQQGIKTDVYAIGTPMGGIAKDEKGNYLKDSRGNIQYFGIDLSKLESLATA 264 >gi|326674787|ref|XP_003200204.1| PREDICTED: integrin alpha-10-like [Danio rerio] Length = 1170 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 52/309 (16%), Positives = 106/309 (34%), Gaps = 34/309 (11%) Query: 72 GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER-STSLSIIIDDQHKDYNLSAV 130 +K D I+++ + L EN F N++ +++ D + + Sbjct: 66 QNNRKGDIYKCIVEDENSNCSKVNLGENAFQNVSRNLKNSHLGMTLTPTDSDGFLACAPL 125 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 E + CA+ + + + +D+++VLD S S+ + Sbjct: 126 WSQECGTSLFSTGICASVTSDMEPNDVIAPTAQRCTTYMDIVIVLDGSNSIYPWYE---- 181 Query: 191 KLGVATRSIREMLDIIKSIPDVNNV--VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 + +L P+ V ++ G ++ ++ + V + I+R Sbjct: 182 ----VQNFLSNILSKFHISPEQMQVGVLQYGEISVHEWSLRDYQTTADVVEAAKNISRQE 237 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G T++ ++ A + F A K +I +TDGE+ D E+L Sbjct: 238 -GRETRTAYAIQMACTEAFSPDRGAREGAT------KVMIVVTDGESHDGE-DLPEALIE 289 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF---------LKNCASP---DRFYSVQNSRKLHDAFL 356 C +R YAI V + +K +S F++V + L+D Sbjct: 290 CE---KRNITRYAIAVLGHYIRRQQDPETFINEIKYISSDPDEKYFFNVTDEAALNDIVD 346 Query: 357 RIGKEMVKQ 365 +G + Sbjct: 347 ALGDRIFSL 355 >gi|148692793|gb|EDL24740.1| calcium channel, voltage-dependent, alpha2/delta subunit 3 [Mus musculus] Length = 1091 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 74/193 (38%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++T++ ++ Sbjct: 256 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYNEELHY 303 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 304 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 358 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 359 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 413 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 414 NKGFFTQISTLAD 426 >gi|6753236|ref|NP_033915.1| voltage-dependent calcium channel subunit alpha-2/delta-3 [Mus musculus] gi|81872883|sp|Q9Z1L5|CA2D3_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|4186073|emb|CAA09423.1| calcium channel alpha-2-delta-C subunit [Mus musculus] gi|147898121|gb|AAI40360.1| Calcium channel, voltage-dependent, alpha2/delta subunit 3 [synthetic construct] gi|151555329|gb|AAI48701.1| Calcium channel, voltage-dependent, alpha2/delta subunit 3 [synthetic construct] Length = 1091 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 74/193 (38%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++T++ ++ Sbjct: 256 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYNEELHY 303 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 304 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 358 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 359 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 413 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 414 NKGFFTQISTLAD 426 >gi|308473011|ref|XP_003098732.1| hypothetical protein CRE_04177 [Caenorhabditis remanei] gi|308268332|gb|EFP12285.1| hypothetical protein CRE_04177 [Caenorhabditis remanei] Length = 396 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 76/226 (33%), Gaps = 28/226 (12%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 ++ + + + + ++PL + S++ LD++ V+D S M + G+ + Sbjct: 2 KLGILILLYLFGVYADYSPLSYVDRPCGTDLSNLWLDVIAVVDNSRGMTNK---GLSNVA 58 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----------AWGVQHIQEK 243 + S+ I S R GLVT++S Q L + Sbjct: 59 SSILSVFGENTRIGSNSVEPRTTRLGLVTYNSVASQKADLNQYQSIADAYTGVFDALSTT 118 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 ++ + T GL A + D + + YK+ +I N + D Sbjct: 119 VDTIQSYLAT----GLALAERMLVD-----QTVNSTRAHYKRVMIVYASEYNGNGESDP- 168 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQFLK---NCASPDRFYSVQ 346 L K + + + + L+ ASP +S + Sbjct: 169 --LPLAERLKLSNINIITVAYEQPGSVGLLQGLTQIASPGFSFSSE 212 >gi|268611865|ref|ZP_06145592.1| von Willebrand factor type A [Ruminococcus flavefaciens FD-1] Length = 550 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 66/183 (36%), Gaps = 19/183 (10%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + I ++ + +++ ++D S SM DKL + + + Sbjct: 164 PWNRDHKLMMVGIQGK-ELQQQETPPSNLVFLIDSSGSM-----NSYDKLPLVQSAFSML 217 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + + ++ V +G SS ++ I E++ + +T G++ A Sbjct: 218 AEQLDKNDRISIVTYAG----SSAVLLDGEKGSNTDEILEQLYSITASGSTNGEGGIKTA 273 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y EH KG ++ +I TDG+ + +E + G + + Sbjct: 274 YEL------AEEHFIKGGNNR---VILATDGDLNVGASSEEELTRLIETKRDNGIYLSVL 324 Query: 323 GVQ 325 G Sbjct: 325 GFG 327 >gi|225024147|ref|ZP_03713339.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] gi|224943172|gb|EEG24381.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] Length = 573 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 74/213 (34%), Gaps = 26/213 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P+ + + + I + ++S + +++ ++DVS SM DKL + ++ + Sbjct: 186 PFRSGAKLIRIGI-QAKEVSQAALPPANLVFLVDVSGSM-----YSRDKLPMVKYTLCTL 239 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLE 260 R LVT++ P G Q I ++ L G +T ++ Sbjct: 240 ------AHQTRAQDRITLVTYADGNKVVLPPTPGNQRQKILAALDSLKAGGSTAGENAIQ 293 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 + + + I+ TDG+ + D E ++ G + Sbjct: 294 ------QAYQAAQRAYIRNGINR---ILLATDGDFNVGITDFNTLRSMVAEKRKSGISLT 344 Query: 321 AIGVQAEA-ADQFLKNC--ASPDRFYSVQNSRK 350 +G + ++ ++ A + + + + Sbjct: 345 TLGFGSGNYNERLMEQLADAGDGNYSYIDSPEE 377 >gi|313213359|emb|CBY37183.1| unnamed protein product [Oikopleura dioica] Length = 435 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 65/178 (36%), Gaps = 25/178 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + LD++ V+D S S+ D + + + + D R + T+S Sbjct: 177 TSKALDIVFVVDESGSVGP------DNFELVKQFLVDYAQDSNIAADA---TRIAIRTYS 227 Query: 225 SKIVQTFPL-AWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L + + I IN L++ T + + N + + + Sbjct: 228 TNSDLDFSLNDYKTKDIISVINNLVWNGLGTNTADAITNGLNDFGNDR----------SE 277 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 K ++ +TDG+++ + L + R +AIG+ + L+ A+ D Sbjct: 278 SVKIMVTITDGQSNYNQVKAAADLLKADP---RNIQSFAIGIDGAHMAE-LQAIATTD 331 >gi|311245368|ref|XP_003121804.1| PREDICTED: integrin alpha-11-like [Sus scrofa] Length = 1055 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 166 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 214 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 215 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 267 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + ++++ YA+ V + FL Sbjct: 268 AKKVMIVITDGE----SHDSPDLEKVIQQSEKDNVTRYAVAVLGYYNRRGINPEAFLNEI 323 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 324 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 358 >gi|114594050|ref|XP_526573.2| PREDICTED: anthrax toxin receptor 2 isoform 6 [Pan troglodytes] Length = 489 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 197 FPVKGG---FQALKGIINSILAQS 217 >gi|114594046|ref|XP_001145091.1| PREDICTED: hypothetical protein isoform 4 [Pan troglodytes] gi|114594048|ref|XP_001145172.1| PREDICTED: anthrax toxin receptor 2 isoform 5 [Pan troglodytes] Length = 488 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 197 FPVKGG---FQALKGIINSILAQS 217 >gi|114594044|ref|XP_001144933.1| PREDICTED: hypothetical protein isoform 2 [Pan troglodytes] Length = 508 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 197 FPVKGG---FQALKGIINSILAQS 217 >gi|320159019|ref|YP_004191397.1| hypothetical protein VVM_02412 [Vibrio vulnificus MO6-24/O] gi|319934331|gb|ADV89194.1| uncharacterized protein [Vibrio vulnificus MO6-24/O] Length = 688 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 84/281 (29%), Gaps = 45/281 (16%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF----PWCANSSHAPL 153 + N E+S + KD + + +P P + L Sbjct: 235 QASVTNQQNAEEQSPNAKQTAFTLDKDITVYWRLQEGLPGRLEAVSYRDPQQSERGTIKL 294 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-------GMDKLGVATRSIREMLDII 206 T S G D + VLD S SM+ G+ KL R M D Sbjct: 295 TFT-PGDDLSAIQQGRDWVFVLDKSGSMSGKHATLTEGVKRGLGKLPSGDRFRILMFD-- 351 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + + +G + + V E IN++ G T LE A + + Sbjct: 352 ----NRVQEITNGFIAVNQN---------NVTQAIETINQIATGGGTNLYDALERAVSGL 398 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + II +TDG + + K+ L +R +Y + Sbjct: 399 DSDRTTG-------------IILVTDGVANVGVTEKKQFLKL---MQRYDVRLYTFIMGN 442 Query: 327 EAADQFLKNCASPDRFYS--VQNSRKLHDAFLRIGKEMVKQ 365 A L+ ++ + NS + + + ++ Q Sbjct: 443 SANTPLLEPMTQVSNGFATSISNSDDILGHIMNVTSKLTHQ 483 >gi|308050057|ref|YP_003913623.1| hypothetical protein Fbal_2347 [Ferrimonas balearica DSM 9799] gi|307632247|gb|ADN76549.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799] Length = 457 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 73/217 (33%), Gaps = 13/217 (5%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G++ I+ I + I + L ++ H KA+L +D + L A I + + Sbjct: 12 RRQRGAVIIMITIAMFAILAMGALALDGGHLLLNKARLQNAVDAAALSGAVAIQKEYDYL 71 Query: 71 NGKKQKNDFSYRII--KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD------QH 122 +++ + ++ + + R L FA D + + + S D Sbjct: 72 RARQEGLVTFTSALGAQDFAELNDRVSLSVLNFASDEVSPQITVEFSERPDPFVPVLTPG 131 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Y VS + F A + S S L MM+ + Sbjct: 132 AQYVRVTVSDVPLNNFFAQVMGVDKRVSAVAVAGPSTSTPQCSTDLLPMMVCAEDLG--E 189 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 D+FG ++K+ S + P ++R G Sbjct: 190 DNFGYPLNKMMAMKISSQ---QNTPIGPGNFQLIRLG 223 >gi|251799254|ref|YP_003013985.1| von Willebrand factor A [Paenibacillus sp. JDR-2] gi|247546880|gb|ACT03899.1| von Willebrand factor type A [Paenibacillus sp. JDR-2] Length = 562 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 74/205 (36%), Gaps = 31/205 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + + V DVS SM+ G +++L + ++ L SI G Sbjct: 379 KEKKNGNKPITAVFVTDVSGSMD---GEPLNRLKESLLKGQKYLGRDNSI---------G 426 Query: 220 LVTFSSKIVQTFPLA-WGVQH---IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LV++S + P+A + +N L G T + G+ A + D Sbjct: 427 LVSYSDGVTIKLPIAKYDTNQQSLFVGAVNSLQSGGGTATFDGIVVALKLLED------- 479 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + K I L+DGE + ++ K +Y IG A Q L+N Sbjct: 480 QLATDPNTKPVIFVLSDGETNQG-YTLQDIKGLIETYK---IPIYTIGYNA--NIQALQN 533 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGK 360 +S + + + D +IG Sbjct: 534 ISSINE--AASINADTDDVVYKIGN 556 >gi|27367212|ref|NP_762739.1| hypothetical protein VV2_0803 [Vibrio vulnificus CMCP6] gi|27358780|gb|AAO07729.1| Uncharacterized protein [Vibrio vulnificus CMCP6] Length = 688 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 84/281 (29%), Gaps = 45/281 (16%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF----PWCANSSHAPL 153 + N E+S + KD + + +P P + L Sbjct: 235 QASVTNQQNAEEQSPNAKQTAFTLDKDITVYWRLQEGLPGRLEAVSYRDPQQSERGTIKL 294 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-------GMDKLGVATRSIREMLDII 206 T S G D + VLD S SM+ G+ KL R M D Sbjct: 295 TFT-PGDDLSAIQQGRDWVFVLDKSGSMSGKHATLTEGVKRGLGKLPSGDRFRILMFD-- 351 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + + +G + + V E IN++ G T LE A + + Sbjct: 352 ----NRVQEITNGFIAVNQN---------NVTQAIETINQIATGGGTNLYDALERAVSGL 398 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + II +TDG + + K+ L +R +Y + Sbjct: 399 DSDRTTG-------------IILVTDGVANVGVTEKKQFLKL---MQRYDVRLYTFIMGN 442 Query: 327 EAADQFLKNCASPDRFYS--VQNSRKLHDAFLRIGKEMVKQ 365 A L+ ++ + NS + + + ++ Q Sbjct: 443 SANTPLLEPMTQVSNGFATSISNSDDILGHIMNVTSKLTHQ 483 >gi|325661940|ref|ZP_08150560.1| hypothetical protein HMPREF0490_01298 [Lachnospiraceae bacterium 4_1_37FAA] gi|325471792|gb|EGC75010.1| hypothetical protein HMPREF0490_01298 [Lachnospiraceae bacterium 4_1_37FAA] Length = 1321 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 48/337 (14%), Positives = 109/337 (32%), Gaps = 41/337 (12%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 E + K + + + T K+ + + +Q + E Sbjct: 385 ELNRILSEKEDKEQLKE---INTYAKLTVDHI--VPIADNSQDIDGQKEETFQEN--KEE 437 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 R+ F Q + + + S+ I + + + + L Sbjct: 438 RQEDFTQYVTDTVNQKAASVSISGIDTKEFETVRAVVSLEEGIADTEEKFRENVEILDCG 497 Query: 157 SSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + + D + + D S SM K+ +++ + + Sbjct: 498 VEIPDYKVKKLEYDTVNIALCCDNSGSMEGE------KIENLKKAVSTFVGKLA------ 545 Query: 214 NVVRSGLVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + V G+V F S + + +++ + S T G+EY + + K+ Sbjct: 546 DEVNIGIVPFGSGVLEGVCEPGSSREKLEQSVESFRSDSGTNIYSGVEYTLSMLAKEKDA 605 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQ 331 L + ++DG++S P+ +E L A G ++Y++G+ A+ + Sbjct: 606 LN-----------IAVIMSDGQDSIPS---EEQLQKITSACENGNILLYSMGLGADVESE 651 Query: 332 FLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVKQR 366 L + + V +S L+ + I + + K R Sbjct: 652 VLSTYSDAGNGAYVFVSDSNSLYSFYQYIYQ-ISKNR 687 >gi|255693879|ref|ZP_05417554.1| BatB protein [Bacteroides finegoldii DSM 17565] gi|260620308|gb|EEX43179.1| BatB protein [Bacteroides finegoldii DSM 17565] Length = 342 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 62/196 (31%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F + P + K+ + G+++++ LD+S SM +L A Sbjct: 61 ILFAAIGLFSVLLARPQFGS---KLETVKRKGVEVIIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R I ++D + + + G++ F+ P+ + + + +K Sbjct: 117 RLISRLVDELDN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + II +TDGEN +G Sbjct: 170 TAIGEAIN-------LAARSFTPQEGVGRAIIVITDGENHEGGAVEAAKAAA-----EKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVNVLGVGMPDGAPI 233 >gi|116876155|gb|ABK30937.1| complement component 2/factor B variant 2 [Carcinoscorpius rotundicauda] Length = 889 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 17/185 (9%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + V D S S+ + + A + M +K R G V+FSS + + Sbjct: 427 IYFVFDASGSIGRKYFN--SSIKFAKGLVTRM--GVKEF-----GTRFGAVSFSSTVSAS 477 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 F L +E +N L T+ + A + + L K I L Sbjct: 478 F-LPQDYTTEEEVLNALDKFDFTEGGTAISSALDFVKTQMIPLSKHTFADRAMKTIIFLL 536 Query: 291 TDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN 347 TDG+ + + E K A +Y+I + + L+ AS D Y +++ Sbjct: 537 TDGKANMRG----DPKQVAKELKADVKAEIYSIALTGDYDINKLREVASSKKDHVYILKD 592 Query: 348 SRKLH 352 L Sbjct: 593 YETLD 597 >gi|115315541|gb|AAV65032.2| complement component 2/factor B variant 1 [Carcinoscorpius rotundicauda] Length = 889 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 17/185 (9%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + V D S S+ + + A + M +K R G V+FSS + + Sbjct: 427 IYFVFDASGSIGRKYFN--SSIKFAKGLVTRM--GVKEF-----GTRFGAVSFSSTVSAS 477 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 F L +E +N L T+ + A + + L K I L Sbjct: 478 F-LPQDYTTEEEVLNALDKFDFTEGGTAISSALDFVKTQMIPLSKHTFADRAMKTIIFLL 536 Query: 291 TDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQFLKNCASP--DRFYSVQN 347 TDG+ + + E K A +Y+I + + L+ AS D Y +++ Sbjct: 537 TDGKANMRG----DPKQVAKELKADVKAEIYSIALTGDYDINKLREVASSKKDHVYILKD 592 Query: 348 SRKLH 352 L Sbjct: 593 YETLD 597 >gi|317419026|emb|CBN81064.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Dicentrarchus labrax] Length = 836 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 58/197 (29%), Gaps = 27/197 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ V+D S SM + L + + D++ L+ F +I Sbjct: 223 VVFVIDRSGSMRRKMVQTREAL-------------LAILKDLHEEDYFALIQFDDRIDSW 269 Query: 230 TFPLAWGVQHIQE----KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + ++ + T + + + + K + Sbjct: 270 QKSLTKATKENVDQAMIYVQQINYRGGTDINQAVLTGVEMLLKDRREK----KLPERSVD 325 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD----- 340 II LTDG +S A R ++ +G + FL + + Sbjct: 326 MIILLTDGMPNSGESHLPRIQENVRSAIRGNMSLFCLGFGNDVDYSFLDVMSKQNKGLAR 385 Query: 341 RFYSVQNSRKLHDAFLR 357 R + ++ F Sbjct: 386 RIFEGSDATLQLQGFYD 402 >gi|160837835|ref|NP_001104272.1| integrin, alpha D [Canis lupus familiaris] Length = 1168 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 84/224 (37%), Gaps = 24/224 (10%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 SH + T + +D++ ++D S S+ ++ R++ + + Sbjct: 137 SHLQTIWTVPAALPECPSQEMDIVFLIDGSGSIE---QSDFKQMKDFVRAVMGQFEGTNT 193 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + + + F+ Q+ +W + + I +L T + G+ ++F Sbjct: 194 LFSLIQYSHLLKIHFTFTQFQS---SWNPLSLVDPIVQL--DGLTYTATGIRKVVEELFH 248 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--- 325 +K AK K +I +TDG+ D E +A+R G I YAIGV Sbjct: 249 SKNGARKSAK------KILIVITDGQ---KYKDPLEYSDVIPQAERAGIIRYAIGVGDAF 299 Query: 326 -AEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +A Q L N S D + V N L ++ +++ Sbjct: 300 WKPSAKQELDNIGSEPAQDHVFRVDNFAALSSIQEQLQEKIFAL 343 >gi|327463764|gb|EGF10080.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1057] Length = 462 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 48/247 (19%), Positives = 80/247 (32%), Gaps = 35/247 (14%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N +KQ D S K+++ E + G QD I + S D +K +A+ Sbjct: 106 NKQKQDWDVSELGTKSLYNMKLDLEFKTEGAYQDNRLISYNLSGK-YPDTNNKLSIDTAI 164 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM- 189 S +F + + + + V D S SMN Sbjct: 165 SALNTKQVFSKVAKGKKGIAIAYRTD-----PIQGQMNIAVSFVFDTSGSMNWDLQGRNV 219 Query: 190 ------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI----VQTFPLAWGVQH 239 ++ + + M+ + I +V+ LV FS+ L G Sbjct: 220 EKTGNESRMDILRKKSVIMIKDLAEIGNVS----VNLVGFSTSAKYIQQNFSNLDNGTNT 275 Query: 240 IQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 I I + L T GL Y + +L KYI+ LTDG + Sbjct: 276 IIATITKPENLNPDGVTNPGDGLRYGMISLQSQPAQL-----------KYIVLLTDGIPN 324 Query: 297 SPNIDNK 303 + +D++ Sbjct: 325 AYLVDSR 331 >gi|309361123|emb|CAP30209.2| hypothetical protein CBG_10938 [Caenorhabditis briggsae AF16] Length = 579 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 35/206 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T +D D+ ++ D S S+ +F + N Sbjct: 398 VTEPTDKLPVNDCQYDVGIIFDSSGSLEKNFQT--------------------QLQIANK 437 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + + +V F+ K + ++++ ++I S S N+ L Sbjct: 438 LFQVAIVQFAGKSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTF---TNQALKRMALLF 494 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-- 332 +K + K ++F TDG ++ + E+L KR+G VY +G+ + Sbjct: 495 EASKRDNCKMKLLVF-TDGYSAEDTAEGIEAL------KRQGITVYTVGISTDKNAGLNV 547 Query: 333 --LKNCA-SPDRFYSVQNSRKLHDAF 355 LK A SP ++ + L F Sbjct: 548 SELKGMATSPSHYFDSSDFDNLLKHF 573 >gi|218671335|ref|ZP_03521005.1| hypothetical protein RetlG_06538 [Rhizobium etli GR56] Length = 49 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 11/45 (24%), Positives = 16/45 (35%), Gaps = 1/45 (2%) Query: 327 EAADQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 E L+ CAS ++ + L AF IG + Q Sbjct: 4 EGGQALLQYCASDASHYFQAEKMEDLFAAFKAIGAKASTQVTRLT 48 >gi|116494017|ref|YP_805751.1| von Willebrand factor domain-containing protein [Lactobacillus casei ATCC 334] gi|116104167|gb|ABJ69309.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Lactobacillus casei ATCC 334] Length = 909 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 47/292 (16%), Positives = 91/292 (31%), Gaps = 33/292 (11%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT---KILNQENGNNGK 73 ++ +TA + + L+I + L + TA NG Sbjct: 1 MTKMTAKVARTGHLFAVLLI----LMSMLTGLVTSGSSVVTATANIRPTYQTDANGTYPT 56 Query: 74 KQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY 133 + + N D + N + + + + S D + DY + + Sbjct: 57 NSWQVTGQQNVINQRGGDQVSGWDNN-TIWNGDATDTTNSYLKFGDPNNPDYQIRKYA-- 113 Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + + N V D+++V+D+S SM G D+ G Sbjct: 114 KETNTPGLYDVYLNVKGNTQQNVKPV----------DIVLVVDMSGSMESKNNGGTDRAG 163 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGST 252 ++ L I++ + + V GL+ FSS G +I+ + + Sbjct: 164 AVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIRVGLGK---AGN 219 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY--------KKYIIFLTDGENS 296 T + A + F+ + + KK +I LTDG + Sbjct: 220 TSQQQAINSALSPTFNGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDGVPT 271 >gi|313837215|gb|EFS74929.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314971986|gb|EFT16084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 322 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LD SLSM + + D + S+P N +V+ S+ Sbjct: 96 IVVALDSSLSMKADDASP----NRLAAAKAKAKDFVNSLPTGFN---VAVVSISAHPEIR 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ + T ++ + + A + I+ L Sbjct: 149 MPPSTDRPTVLRALDGIELQDGTALGEAIDKSLQAVKMAPGGSKDRVPAA------IVML 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG N+ L A VY I E + L + Sbjct: 203 SDGGNTQGG----SPLVAATHAAAAKVPVYTIAFGTETGYVDLDGQRERVAPDTKLLSDV 258 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + + ++ ++ KL + + ++ Sbjct: 259 ADRTDAKSWTADSADKLQEVYKQV 282 >gi|15418999|gb|AAK77222.1| capillary morphogenesis protein-2 [Homo sapiens] gi|119626253|gb|EAX05848.1| anthrax toxin receptor 2, isoform CRA_a [Homo sapiens] Length = 386 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 39/184 (21%), Positives = 69/184 (37%), Gaps = 30/184 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQ 346 + V+ Sbjct: 197 FPVK 200 >gi|196007112|ref|XP_002113422.1| hypothetical protein TRIADDRAFT_57573 [Trichoplax adhaerens] gi|190583826|gb|EDV23896.1| hypothetical protein TRIADDRAFT_57573 [Trichoplax adhaerens] Length = 949 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 71/201 (35%), Gaps = 29/201 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 L + MVLD S SM+ + + +++ ++ ++ G+VTFS Sbjct: 314 KSEPLRIAMVLDKSGSMSG------RNMQLLSQAAINVIAQSRNFDGK-----LGIVTFS 362 Query: 225 SKIVQTFPLAWGV--QHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + T PL G Q + IN L T G+ + +K+ G Sbjct: 363 TNATVTCPLTAGESDQEKNKLINCLPSEAEGETSIGSGILKGIELLRKSKD-------GR 415 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCAS 338 ++I ++DG+ + ++ +I + A++ L Sbjct: 416 KPSGGHLIVMSDGQENYRPYIE----NIMTNITENEIVITSISLGQHASENLEDLSKLTG 471 Query: 339 PDRFYSVQNSR-KLHDAFLRI 358 +++ NS L +AF I Sbjct: 472 GLSYFASTNSTLTLINAFTTI 492 >gi|308472851|ref|XP_003098652.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] gi|308268252|gb|EFP12205.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] Length = 396 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/195 (21%), Positives = 69/195 (35%), Gaps = 21/195 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S++ LD++ V+D S M PG+ + S+ I S R GLVT++ Sbjct: 35 SNLWLDVVAVVDNSQGMT---NPGLINVASDIYSVFSSGTRIGSNSSEPRTTRVGLVTYN 91 Query: 225 SKIVQTFPL------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 S Q L + I + I+ ++ + + GL+ A + D E Sbjct: 92 SNATQKADLNKYQSIDDVLNEIYDDISTVVNTADSYLATGLQLAEKMLIDQSEN-----T 146 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKN 335 YK+ +I +D L N K + + + + D L Sbjct: 147 NRAHYKRVVIVYASEYKGEGELDP---LNVANRLKLSDINIITVAYEQKGDDGLFHDLSQ 203 Query: 336 CASPDRFYSVQNSRK 350 ASP + V N+ Sbjct: 204 IASPGFSF-VNNASD 217 >gi|309789715|ref|ZP_07684295.1| magnesium chelatase [Oscillochloris trichoides DG6] gi|308228201|gb|EFO81849.1| magnesium chelatase [Oscillochloris trichoides DG6] Length = 603 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 65/188 (34%), Gaps = 24/188 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIV 228 + V+D S SM +++ ++ +L + GLV+F Sbjct: 420 AVCFVVDASWSMAAE-----ERMQATKAAVLSLLR-----DAYQRRDQVGLVSFQRDYAR 469 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL V+ Q ++ + G T G+ AY + A+ + + ++ Sbjct: 470 VLLPLTNSVELAQRRLQTMPTGGKTPLARGMLTAYELLERARRQDH-------EVVPLMV 522 Query: 289 FLTDGENSSPNIDN---KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRF 342 LTDG+ + + +E+ + R I + ++ L S R+ Sbjct: 523 LLTDGQANVAIGNAPPQQEAYAIADLIAARDIRAIVIDTEHPNFERGLSRRLSEHLKGRY 582 Query: 343 YSVQNSRK 350 Y +++ Sbjct: 583 YRLEDLHD 590 >gi|237716506|ref|ZP_04546987.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262408104|ref|ZP_06084652.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645098|ref|ZP_06722824.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809499|ref|ZP_06768202.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|229444153|gb|EEO49944.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262354912|gb|EEZ04004.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639604|gb|EFF57896.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443317|gb|EFG12081.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] Length = 342 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 62/196 (31%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 IF + P + K+ + G+++++ LD+S SM +L A Sbjct: 61 IIFVAIGLFSVLLARPQFGS---KLETVKRKGVEVIIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R I ++D + + + G++ F+ P+ + + + +K Sbjct: 117 RLISRLVDELDN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + II +TDGEN +G Sbjct: 170 TAIGEAIN-------LAARSFTPQEGVGRAIIVITDGENHEGGAVEAAKAAA-----EKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVSVLGVGMPDGAPI 233 >gi|196230799|ref|ZP_03129660.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] gi|196225140|gb|EDY19649.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] Length = 1545 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 67/203 (33%), Gaps = 21/203 (10%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + L+++++LD S SM ++ + A + + L + V R Sbjct: 1189 TAAAGRQPGRPLNIVLLLDRSGSMER--ADRVNIVREALSVLAKHLQPQDKLSIV-TFAR 1245 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + ++ + V + ++N + T L+ AY H A Sbjct: 1246 TPHL-WADAVAGDK-----VHDVIARVNEITPEGGTNLEAALDLAYETAHH------HFA 1293 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNC 336 + +I TDG + +++ +++G + G+ E D L+ Sbjct: 1294 VDSTNR---VILFTDGAANLGDVNPDALTKKVEAQRKQGIALDCFGIGWEGYNDDLLEQL 1350 Query: 337 A--SPDRFYSVQNSRKLHDAFLR 357 + R+ + F Sbjct: 1351 TRNADGRYGFINTPEDAAANFAT 1373 >gi|298484180|ref|ZP_07002346.1| BatB protein [Bacteroides sp. D22] gi|298269684|gb|EFI11279.1| BatB protein [Bacteroides sp. D22] Length = 342 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 62/196 (31%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 IF + P + K+ + G+++++ LD+S SM +L A Sbjct: 61 IIFVAIGLFSVLLARPQFGS---KLETVKRKGVEVIIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R I ++D + + + G++ F+ P+ + + + +K Sbjct: 117 RLISRLVDELDN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + II +TDGEN +G Sbjct: 170 TAIGEAIN-------LAARSFTPQEGVGRAIIVITDGENHEGGAVEAAKAAA-----EKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVSVLGVGMPDGAPI 233 >gi|257870337|ref|ZP_05649990.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] gi|257804501|gb|EEV33323.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] Length = 1169 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/134 (23%), Positives = 55/134 (41%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + + +D + +N + G V +SS Sbjct: 335 APLDLVLVVDWSGSMNDN-----NRIGEVQKGVDRFVDTLAESGITDN-IHMGYVGYSSD 388 Query: 227 IVQTFPLAWG----VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +A G V++ + I T + L A N + Sbjct: 389 GYKNDSVAMGPFDSVKNAIKTITPSSTTGGTFTQKALRDAGNMLATPNGH---------- 438 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 439 -KKVIVLLTDGVPT 451 >gi|328872160|gb|EGG20527.1| hypothetical protein DFA_00388 [Dictyostelium fasciculatum] Length = 2097 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/310 (12%), Positives = 96/310 (30%), Gaps = 41/310 (13%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF-AQDINNIERSTSLSIIIDD 120 ++ ++ QK + N + + T L+ + + Sbjct: 1668 QVSVDQSLYTSNAQKKQLAQDTASNEVDAVGGSSTAVTTTHVSNSLTNREKTQLNEMELN 1727 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 Y + W + + I LD++ ++D + S Sbjct: 1728 ILFGYVKDLST--LKACSLVNKLWRKVTLYPTHWTRFIGNIGGDER-DLDLVFLVDNTGS 1784 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT----FPLAWG 236 M+ ++ I+E++D I +I +VN VR G+V ++ V F Sbjct: 1785 MS-------GEIEQCKDKIKEIVDDISAIGNVN--VRVGMVFYNDHPVSNVCQVFDFTDN 1835 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + ++ +++ + + + ++ + + + ++ + D Sbjct: 1836 IAKMKTQLSSVTVYGGDDEPEAMADGFYEV--------NKLSFTPNSTRVLVLIGDANPH 1887 Query: 297 --SPNIDNKESLFYCNE---------AKRRGAIVYAIGVQAEAADQFLKNCA----SPDR 341 N D+ CN + +Y++ + A + A S R Sbjct: 1888 GFGGNGDHYPGGCPCNHDIIELARKLVIEKRVTIYSV-LCRPANHTYQVFSALSDLSEGR 1946 Query: 342 FYSVQNSRKL 351 +++ N+ +L Sbjct: 1947 LFTLSNAAEL 1956 >gi|312108072|ref|XP_003151046.1| hypothetical protein LOAG_15508 [Loa loa] gi|307753789|gb|EFO13023.1| hypothetical protein LOAG_15508 [Loa loa] Length = 203 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 72/196 (36%), Gaps = 22/196 (11%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y ++++ P + + K D+M ++D S S+ + + Sbjct: 24 SYTCQCYAGFVDVSSSANLQPGRVCTVQTTCPKQKT--DLMFLIDGSGSIG-SYVFKNEV 80 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI- 248 L + ++ + R GL+ +S +I F L+ + + I+++ Sbjct: 81 LRFIK-------EFVELFDIGLDNTRVGLIQYSDQIRHEFDLSQYTDKESVINAISQVQY 133 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T++ +++ + F + K DD + I +TDG + + S Sbjct: 134 LTGLTRTGAAIQHMVMEGFSERR---GARKQGDDVARVSIVITDGRSQDNVTEAAIS--- 187 Query: 309 CNEAKRRGAIVYAIGV 324 A++ ++++GV Sbjct: 188 ---ARKSHINMFSVGV 200 >gi|189518186|ref|XP_001331201.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Danio rerio] Length = 969 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 80/216 (37%), Gaps = 32/216 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM K+ +++ +++ ++ + N VTFS++I Sbjct: 317 VVFVIDTSASMLG------TKMKQTKQALFTIINELRPNDNFN------FVTFSNRIRVW 364 Query: 231 FPLAW------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD--AKEKLEHIAKGHDD 282 P ++ ++ I + T G++ + D + + H Sbjct: 365 QPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALLSDYLSSKDESHHHSVSL- 423 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCA 337 IIFLTDG + + + + A + ++ IG+ + + L NC Sbjct: 424 ----IIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGMGDDVDYRLLERMSLDNCG 479 Query: 338 SPDRFYSVQNSRKLHDAFLR-IGK-EMVKQRILYNK 371 + R ++ + F IG + R+ Y++ Sbjct: 480 TMRRIPEDADASLMLKGFYDEIGTPLLSDIRVEYSE 515 >gi|148258701|ref|YP_001243286.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] gi|146410874|gb|ABQ39380.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] Length = 511 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 72/210 (34%), Gaps = 23/210 (10%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 ++ + +ISI+ A+ L I +G ++ + +++KL D + + + ++ Sbjct: 27 RLGHDERANISIIFAMALLPILSAIGCAVDYTQATRLRSKLQSAADAASVASISQQSLGY 86 Query: 68 NGNNGKKQKN--DFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 N + ++ + N L + + ++ L Sbjct: 87 NAALQMTSDGTVQVAVEEATKLFNGNAANSLGYTNLSLNAQVMKTGVKL----------- 135 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + ++P F T + L +T + K +S LD ++LDVS SM Sbjct: 136 AATVAFSADVPTTFMT-----VVGYRKLTVTGTSKSTSSLPPYLDFYLMLDVSGSMG--- 187 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 P D +I D K P+ Sbjct: 188 LPSTDAEQTRLAAINP--DNYKQYPNGCTF 215 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 55/216 (25%), Gaps = 60/216 (27%) Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH----------- 239 + +++++L + N R GL F + PL + Sbjct: 288 RADAVGAAVQQLLVTANATQKTPNQFRIGLYPFVRYLYAYSPLTASINGSPTTPGTINHA 347 Query: 240 -------IQEKINRLIFGSTT---KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + N + T + P + + D + Y+ Sbjct: 348 AANLASQLDTGANASLGSGGTHFENAFPTMNGIITSVGDGSASNKTQP--------YVFL 399 Query: 290 LTDGENS----------SPNIDNKESLFYCNEAKRRGAIVYAIGV--------------- 324 +TDG + N C K RG IV + + Sbjct: 400 ITDGAQNPQVYWNGSWSGSNSATTMDTSKCTTLKSRGIIVSVLYIPYQPIQNPTSFANSE 459 Query: 325 ------QAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 L+ CASP FY+ + + A Sbjct: 460 DFYANANIPKIPPSLQACASPGYFYTANSPADITAA 495 >gi|88858354|ref|ZP_01132996.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] gi|88819971|gb|EAR29784.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] Length = 684 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 52/312 (16%), Positives = 105/312 (33%), Gaps = 46/312 (14%) Query: 46 AKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNI--------WQTDFRNELR 97 A L + T+ +++ + K S IK ++ D Sbjct: 197 ADLQGNWPSASEITSAPFIDELETQSAKAVGQGMSDAKIKQSVSIALNLGFELDTIMSPY 256 Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS--RYEMPFIFCTFPWCANSSHAPLLI 155 Q I N SL ++D+ L + + + + +L+ Sbjct: 257 HEINQQLIGNNHYQVSLKQGTTFANRDFVLRVKPKNQAAIQAAVFKEHFENDDYALVMLM 316 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S + + + +++ V+D S SM+ L A ++ L + N Sbjct: 317 PPSDEFIAAQRLPREVIFVIDTSGSMHGE------SLEQAKSALFFALANLDPQDSFN-- 368 Query: 216 VRSGLVTFSSKIV----QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ F+SK+ Q P + ++ + + L T+ A+ ++ D Sbjct: 369 ----IIEFNSKVNALNAQALPANDFNIRRARNFVYGLKADGGTEIG----LAFEQVLDNS 420 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAA 329 E H DY + I+FLTDG + E+ + G + ++ IG+ + Sbjct: 421 E--------HADYLRQIVFLTDG------SISNETEVFAQIKGSLGDSRIFTIGIGSAPN 466 Query: 330 DQFLKNCASPDR 341 F+ A+ R Sbjct: 467 SYFMTRAATLGR 478 >gi|260904335|ref|ZP_05912657.1| von Willebrand factor type A domain-containing protein [Brevibacterium linens BL2] Length = 324 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 39/240 (16%), Positives = 83/240 (34%), Gaps = 36/240 (15%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + I T P + ++ + + D+M+ LDVS SM + Sbjct: 62 LAVIALTGLSTLAGISRPAWVE---TVNPEKKLR-DVMLCLDVSGSMLGYDA-------- 109 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 ++L+ + + D + R G+ F+S V FPL + +Q + G T Sbjct: 110 ------DLLEAYQELVDRFDGERIGMTVFNSTAVSAFPLTDDYEMVQNYLEEAEEGFRTW 163 Query: 255 STPGLEYAYNKIFDAKEKLE-----------HIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + G +Y+++ + + +D + I+F TD N Sbjct: 164 GSEGTDYSWSTSPPNIGGSSLIGDGLVSCVDNFDRQDEDRSRSIVFATD--NMLAGDPLF 221 Query: 304 ESLFYCNEAKRRGAIVYAI---GVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + + A VY++ V + LK+ + + + + + ++ + I Sbjct: 222 DLDDATDIAVESDVRVYSLSPPSVLTNPQTKELKSVSDRTGGKQFDMGSASTIDRIVSEI 281 >gi|27228596|ref|NP_758646.1| hypothetical protein pCAR1_p105 [Pseudomonas resinovorans] gi|219857018|ref|YP_002474050.1| hypothetical protein pCAR12_p105 [Pseudomonas sp. CA10] gi|26106184|dbj|BAC41624.1| hypothetical protein [Pseudomonas resinovorans] gi|219688946|dbj|BAH10037.1| hypothetical protein [Pseudomonas putida] Length = 604 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 78/208 (37%), Gaps = 29/208 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 S + ++LD S SM + A ++ +L ++ +P +V +G Sbjct: 418 SKSRAERQSASIQILLDKSGSMK-------SAMDQAEAAVYAVLSALEGLP----LVTTG 466 Query: 220 LVTF----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++F + + + + + + + ++ FG+ ++ L A Sbjct: 467 AMSFPNKANDGVERCALIKSPKERLIKAVSEGGFGAMSEGGTPLAQA----LWPAAVEVL 522 Query: 276 IAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 AKG KK + +TDGE N+ KE L C G V +G A + LK Sbjct: 523 RAKGE---KKILFVITDGEPNAGTTHAAKEFLQRCEV---SGIEVIGLGFG-SANEHILK 575 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEM 362 S ++ +V L ++ + +E Sbjct: 576 ALFS--QYRAVGEVANLKNSLFELVREA 601 >gi|311252833|ref|XP_003125290.1| PREDICTED: vitrin-like isoform 2 [Sus scrofa] Length = 634 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 69/198 (34%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 451 DIGFVIDGSSSV------GTGNFRTVLQFVANL---SKEFDISDTDTRVGAVQYTYEQRL 501 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + I R+ + T + + YA ++F K + +K Sbjct: 502 EFGFDQYTTKPDVLNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKRKL 552 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 +I +TDG + + A +G I YAIGV A ++ P D + Sbjct: 553 MILITDGRSYD------DVRIPAMVAHHKGVITYAIGVAWAAQEELEIIATHPARDHAFF 606 Query: 345 VQNSRKLHDAFLRIGKEM 362 V L+ + +I + + Sbjct: 607 VDEFDNLYKSVPKIIQNI 624 >gi|148666786|gb|EDK99202.1| anthrax toxin receptor 1 [Mus musculus] Length = 533 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 35/149 (23%), Positives = 58/149 (38%), Gaps = 13/149 (8%) Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FS++ L + I++ ++ +++ G T G E A +I+ + Sbjct: 75 RMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQG 134 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 A II LTDGE E N ++ GAIVY +GV+ Q Sbjct: 135 YRTAS-------VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLA 185 Query: 334 KNCASPDRFYSVQNS-RKLHDAFLRIGKE 361 + S D + V + + L I K+ Sbjct: 186 RIADSKDHVFPVNDGFQALQGIIHSILKK 214 >gi|119946441|ref|YP_944121.1| TPR repeat-containing protein [Psychromonas ingrahamii 37] gi|119865045|gb|ABM04522.1| protein containing tetratricopeptide (TPR) repeat [Psychromonas ingrahamii 37] Length = 657 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 61/165 (36%), Gaps = 29/165 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++ LD+S SM K +++ +++D++KS + GL+ ++ Sbjct: 97 VIALDLSYSM----YATDAKPDRLSQARYKVIDLVKSWEEGEK----GLIAYAGDAFTIS 148 Query: 232 PLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL I I L + +++ LE A + +A + H I Sbjct: 149 PLTTDGNAIINHIPSLSPTIMPVTGSRADLALEQAITLLKNAGYQQGH-----------I 197 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +F+TDG ID + N K IV + + ++ Sbjct: 198 VFITDG------IDPNSAAIMINRLKGSPWIVSILAMGSQQGAPI 236 >gi|54124354|gb|AAV29939.1| anthrax toxin receptor [Rattus norvegicus] Length = 245 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 35/149 (23%), Positives = 58/149 (38%), Gaps = 13/149 (8%) Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FS++ L + I++ ++ +++ G T G E A +I+ + Sbjct: 18 RMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQG 77 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 A II LTDGE E N ++ GAIVY +GV+ Q Sbjct: 78 YRTAS-------VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLA 128 Query: 334 KNCASPDRFYSVQNS-RKLHDAFLRIGKE 361 + S D + V + + L I K+ Sbjct: 129 RIADSKDHVFPVNDGFQALQGIIHSILKK 157 >gi|73970245|ref|XP_855427.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 555 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 35/149 (23%), Positives = 58/149 (38%), Gaps = 13/149 (8%) Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FS++ L + I++ ++ +++ G T G E A +I+ + Sbjct: 70 RMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQG 129 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 A II LTDGE E N ++ GAIVY +GV+ Q Sbjct: 130 YRTAS-------VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLA 180 Query: 334 KNCASPDRFYSVQNS-RKLHDAFLRIGKE 361 + S D + V + + L I K+ Sbjct: 181 RIADSKDHVFPVNDGFQALQGIIHSILKK 209 >gi|254480861|ref|ZP_05094107.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038656|gb|EEB79317.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 726 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 85/241 (35%), Gaps = 22/241 (9%) Query: 138 IFCTFPWCANSSHAPLLITSSV------KISSKSDI-GLDMMMVLDVSLSMNDHFGPGMD 190 + P A S+ PL+ V + ++ L + ++ D S SM G Sbjct: 223 LPAALPDPATSTKVPLIALPPVAKMLLPEPQVTTNAPALAVEIIFDASGSMAARLQ-GQT 281 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIF 249 KL +A R++ + + + + +V F + +T + + + Sbjct: 282 KLSLARRALAA---AVPGLENPSILVGMRAYGFDQSLNKTPDASCPNTELVLPFTANRQA 338 Query: 250 GSTTKSTPGL-EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 + ++ L Y Y I D+ H D K II ++DGE + Sbjct: 339 TAINRTADALSAYGYTPIADSLTLAGHDLLAIDAQKHMIILISDGEETCGGF----PAAV 394 Query: 309 CNEAKRRGA--IVYAIGVQAEA-ADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMV 363 + G + IG +A A Q ++ AS +++ + +L + +R+ Sbjct: 395 AANLRSLGIDLQTHVIGFDLDATAQQQMQAIASAGGGQYFDAADGDELGASLMRVIDLAQ 454 Query: 364 K 364 + Sbjct: 455 E 455 >gi|326444122|ref|ZP_08218856.1| von Willebrand factor, type A [Streptomyces clavuligerus ATCC 27064] Length = 519 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 69/199 (34%), Gaps = 28/199 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN--NVVR------SGLVT 222 ++ +LD S SM R++R D + D R ++ Sbjct: 331 VLFLLDFSGSMRG----------AGIRALRTTFDGLSGADDSRTGKFARFHRGETLTVLR 380 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F K+++ + + + E++ L+ +T + ++ + A + + Sbjct: 381 FGGKVLERRTVTYRGERDLERLRSLVASEGFDTTTAI---WSGLDAAYRTAAGMLREDPA 437 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKR---RGAIVYAIGVQAEAADQFLKNC-AS 338 I+ +TDG N++ + ++ A+ YA+ AD+ + A+ Sbjct: 438 RPLSIVLMTDGRNNAGM--SLDAFLRAQRARTGPVASVRTYAVRYGEADADELGRAARAT 495 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R R L DAF Sbjct: 496 GGRLLDATE-RPLLDAFKE 513 >gi|320016843|gb|ADW00415.1| putative tellurium resistance protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 212 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 65/195 (33%), Gaps = 16/195 (8%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L + ++LD S SM + ++ +L ++ P ++TF S Sbjct: 4 LPVYLLLDTSGSMTGE------PIEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSAR 57 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 Q PL + K+ L+ TT L I + +K KG +I Sbjct: 58 QAVPLT---DLLNFKLPELVANGTTALGDALSLTAKCIGNEVQKTTADTKGDWRPLVFI- 113 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 +TDG SP D ++ L A+ G V A + L+ +S Sbjct: 114 -MTDG---SPTDDWRKGLSDFKAART-GV-VVACAAGHAVETKVLQEITEIVLQLDTADS 167 Query: 349 RKLHDAFLRIGKEMV 363 + F + + Sbjct: 168 SSIKAFFKWVSASIS 182 >gi|294815777|ref|ZP_06774420.1| von Willebrand factor [Streptomyces clavuligerus ATCC 27064] gi|294328376|gb|EFG10019.1| von Willebrand factor [Streptomyces clavuligerus ATCC 27064] Length = 568 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 69/199 (34%), Gaps = 28/199 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN--NVVR------SGLVT 222 ++ +LD S SM R++R D + D R ++ Sbjct: 380 VLFLLDFSGSMRG----------AGIRALRTTFDGLSGADDSRTGKFARFHRGETLTVLR 429 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F K+++ + + + E++ L+ +T + ++ + A + + Sbjct: 430 FGGKVLERRTVTYRGERDLERLRSLVASEGFDTTTAI---WSGLDAAYRTAAGMLREDPA 486 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKR---RGAIVYAIGVQAEAADQFLKNC-AS 338 I+ +TDG N++ + ++ A+ YA+ AD+ + A+ Sbjct: 487 RPLSIVLMTDGRNNAGM--SLDAFLRAQRARTGPVASVRTYAVRYGEADADELGRAARAT 544 Query: 339 PDRFYSVQNSRKLHDAFLR 357 R R L DAF Sbjct: 545 GGRLLDATE-RPLLDAFKE 562 >gi|282854078|ref|ZP_06263415.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583531|gb|EFB88911.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|314981158|gb|EFT25252.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091981|gb|EFT63957.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 322 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 66/204 (32%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D SLSM + + D I S+P N +++ S Sbjct: 96 IVVAIDSSLSMKADDVSP----TRLAAAKAKAKDFINSLPTGFN---VAVMSISEHPEIR 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ + T ++ + + A ++ A I+ L Sbjct: 149 MPPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVML 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG N+ L N A VY I E + L Sbjct: 203 SDGNNTQGG----SPLVAANRAAAAKVSVYTIAFGTETGYVDLDGQRERVAPDTKLLSTV 258 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + + ++ ++ KL + + ++ Sbjct: 259 ADRTHAKSWTADSADKLQEVYQQV 282 >gi|198430849|ref|XP_002120173.1| PREDICTED: similar to cGMP-dependent protein kinase 1, alpha isozyme (CGK 1 alpha) (cGKI-alpha) [Ciona intestinalis] Length = 1896 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 45/190 (23%), Positives = 75/190 (39%), Gaps = 24/190 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ VLD S S++ + A R + E+L + D N V G+V F S Sbjct: 839 VDIIFVLDESGSVS------LSSYKQALRWVVELLTSFREDVDKGN-VHVGVVAFHSWAG 891 Query: 229 QTFPL-AWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L A+ ++Q +I L G T P + + E G + Sbjct: 892 TRIALGAFEFSNLQARIIALSNGRNYGGTNIAPAI---------DETLREFNRNGRTGIQ 942 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K +I +TDG +S PN + A+ +G + A+GV + Q L + R + Sbjct: 943 KQMILMTDGYSSYPNAISP----AAQRARAQGVVTVAVGVGGSSYAQLLNIAGNQTRVFY 998 Query: 345 VQNSRKLHDA 354 N +L + Sbjct: 999 ATNFNRLGEV 1008 >gi|149034246|gb|EDL89016.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_b [Rattus norvegicus] Length = 924 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 74/193 (38%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++T++ ++ Sbjct: 79 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYNEELHY 126 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 127 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNVLSDFNHTGQ-----G 181 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 182 SICSQAIMLITDG-----AVDTYDTIFAKYNWPERKVRIFTYLIGREAAFADNLKWMACA 236 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 237 NKGFFTQISTLAD 249 >gi|149034245|gb|EDL89015.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_a [Rattus norvegicus] Length = 930 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 74/193 (38%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++T++ ++ Sbjct: 79 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYNEELHY 126 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 127 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNVLSDFNHTGQ-----G 181 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 182 SICSQAIMLITDG-----AVDTYDTIFAKYNWPERKVRIFTYLIGREAAFADNLKWMACA 236 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 237 NKGFFTQISTLAD 249 >gi|28212256|ref|NP_783185.1| voltage-dependent calcium channel subunit alpha-2/delta-3 [Rattus norvegicus] gi|81871225|sp|Q8CFG5|CA2D3_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|27450708|gb|AAO14654.1|AF486278_1 calcium channel alpha-2 delta-3 subunit [Rattus norvegicus] Length = 1085 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 74/193 (38%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++T++ ++ Sbjct: 256 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYNEELHY 303 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 304 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNVLSDFNHTGQ-----G 358 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 359 SICSQAIMLITDG-----AVDTYDTIFAKYNWPERKVRIFTYLIGREAAFADNLKWMACA 413 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 414 NKGFFTQISTLAD 426 >gi|331085807|ref|ZP_08334890.1| hypothetical protein HMPREF0987_01193 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406730|gb|EGG86235.1| hypothetical protein HMPREF0987_01193 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 1321 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 48/337 (14%), Positives = 110/337 (32%), Gaps = 41/337 (12%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 E + K + + + T K+ + + + +Q + E Sbjct: 385 ELNRILSEKEDKEQLKE---IDTYAKLTVDHI--VPIVDHSQDTDGQKEETFQEN--KEE 437 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 R+ F Q + + + S+ I + + + + L Sbjct: 438 RQEDFTQYVTDTVNQKAASVSISGIDTKEFETVRAVVSLEEGIADTEEKFRGNVEILDCG 497 Query: 157 SSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + + D + + D S SM K+ +++ + + Sbjct: 498 VEIPDYKVKKLEYDTVNIALCCDNSGSMEGE------KIENLKKAVSTFVGKLA------ 545 Query: 214 NVVRSGLVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + V G+V F S + + +++ + S T G+EY + + K+ Sbjct: 546 DEVNIGIVPFGSGVLEGVCEPGSSREKLEQSVESFRSDSGTNIYSGVEYTLSMLAKEKDA 605 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQ 331 L + ++DG++S P+ +E L A G ++Y++G+ A+ + Sbjct: 606 LN-----------IAVIMSDGQDSIPS---EEQLQKITSACENGNILLYSMGLGADVESE 651 Query: 332 FLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVKQR 366 L + + V +S L+ + I + + K R Sbjct: 652 VLSTYSDAGNGAYVFVSDSNSLYSFYQYIYQ-ISKNR 687 >gi|162449101|ref|YP_001611468.1| hypothetical protein sce0831 [Sorangium cellulosum 'So ce 56'] gi|161159683|emb|CAN90988.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 377 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 60/202 (29%), Gaps = 16/202 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVA---TRSIREMLDIIKSIPDVNNVVRS-GLVTF 223 LDM+++LD S SM G+ A + M + P +N ++ Sbjct: 91 PLDMLILLDRSGSMLGSKWVGVTNALSAFVTDAASAGMNVGLTYFPRSSNGQSDCNHTSY 150 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V L + I G T P L+ K + Sbjct: 151 DELAVAIGELPVNTPELTASIQSTSPGGGTPMRPALQ--------GVLTNATAYKDANPS 202 Query: 284 KKYIIFL-TDGENSSPNIDN-KESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPD 340 K I+ L TDG+ S + + + A + G Y + VQ + A+ Sbjct: 203 HKVIVVLATDGDPSGCSGNTVASTAEMAQRALRYNGVQTYVVAVQGSTLTNLDQIAAAGG 262 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + AF E+ Sbjct: 263 T-TRAFDVTADITAFSAKMAEI 283 >gi|75750453|ref|YP_319892.1| hypothetical protein ATV_gp61 [Acidianus two-tailed virus] gi|123849288|sp|Q3V4Q4|Y892_ATV RecName: Full=Putative VWFA domain-containing protein ORF892 gi|74474836|emb|CAI59910.1| hypothetical protein [Acidianus two-tailed virus] Length = 892 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 67/188 (35%), Gaps = 25/188 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ D + V+D S SM + ++ L K N++ + Sbjct: 715 KSEEKKQGDFLFVIDSSGSMEGN---------KIATALAIPLVTYKKYKGKRNIL---VE 762 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 TFS + P+ + +++I + + FG T + YA I + + Sbjct: 763 TFSDE---PSPI-YNIKNIANVLGSMKFGG-TNIGSAVLYALKNIDKPDSDYDRKLRESL 817 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV--YAIGVQAEAADQFLKNCASP 339 + +I LTDGE+ P+ +E K+ + Y I + + C Sbjct: 818 RKTRTLILLTDGEDEIPDDIAREINSL---KKKNKVELLCYGIDLGERGLKTLKEIC--- 871 Query: 340 DRFYSVQN 347 D Y+V + Sbjct: 872 DEVYAVGS 879 >gi|89100218|ref|ZP_01173085.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085068|gb|EAR64202.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 456 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 83/222 (37%), Gaps = 28/222 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR---- 217 S +S ++ +VLD S SM + G ++ +A SI+E + +++ V Sbjct: 145 SQESPASFNVEIVLDASGSMANKLGSK-TRMELAKESIKEFASSLPEEANISLRVYGHKG 203 Query: 218 SGL-----VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +G ++ SS + P + + +++ T L A + Sbjct: 204 TGSDSDKKMSCSSNELVYPPQPYNEGELNSALDKFNPAGWTPLAQSLIEAQKDL------ 257 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAAD 330 D K + ++DG + + + + K G +V IG + D Sbjct: 258 ---AQFEGQDNKNMVYVVSDGIETC----DGNPVEAAKDLKDSGVAPVVNIIGFDVKGKD 310 Query: 331 -QFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 Q L+ A+ + +V + ++L + + +E +K R Y Sbjct: 311 QQQLEEVAKAAGGTYQNVTSQQQLQNELNKAVEESLKWRTWY 352 >gi|284922261|emb|CBG35346.1| putative lipoprotein [Escherichia coli 042] Length = 588 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 49/340 (14%), Positives = 107/340 (31%), Gaps = 48/340 (14%) Query: 39 SHKFFVKAKLHYILDHS--LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 + ++ K L L + + A + + N G + F +K + Q Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFS 134 Query: 97 RENGFAQDINNIE----------RSTSLSIIIDDQHKDYNLSAVSRYEMPF--------- 137 + N + + +++ D+ ++ S + P Sbjct: 135 LDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNYFPSDWVINDKSNNKEPVPASKPIPFA 194 Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDI--GLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 + C + LL + KS+ +++ ++D S SM ++L + Sbjct: 195 MRCELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD-----ERLPLI 249 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTT 253 S++ ++ ++ + +VT++ P G I I+ L +T Sbjct: 250 QSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGST 303 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 GLE AY + KG + I+ TDG+ + D K + + Sbjct: 304 NGGAGLELAYQQAAKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQR 354 Query: 314 RRGAIVYAIGVQ-AEAADQFLKNCA--SPDRFYSVQNSRK 350 G + GV + + + A + + + Sbjct: 355 ESGVSLSTFGVGDSNYNEAMMVRIADVGNGNYSYIDTLAE 394 >gi|73976419|ref|XP_852853.1| PREDICTED: hypothetical protein XP_847760 [Canis familiaris] Length = 642 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 63/177 (35%), Gaps = 24/177 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV---NNVVRSGLVTFSS 225 +D++ V+D S S + + + D + + V ++ + FSS Sbjct: 47 IDVVFVVDSSES------SKIALFDKQKDFVNSLSDKVFQLTPVGFLKYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + F G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNFIGQGTFSYYAISNATMLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K + +TDG + N D + +A+ G + IG+ + L+ + Sbjct: 153 -VKVALLMTDGIDHPKNPDVQS---ISEDARTAGILFITIGLSTVVNEAKLRLISGD 205 >gi|73972306|ref|XP_860269.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 3 [Canis familiaris] Length = 549 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 73/221 (33%), Gaps = 34/221 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +R+ ++ + S GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGAGNFTRAKNCLRDFIEKVASYGVKPKY---GLVTYATN 317 Query: 227 IVQTFPLAWGVQHIQEKINRL---------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ + S T + LE Y+ + E Sbjct: 318 PKVWVRVRDKNSSDADWVTKILNQVSYEDHMLKSGTNTKKALEAVYSMMNWPGETP---P 374 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 + + II +TDG N + + + L + K VY GV Sbjct: 375 ADWNRTRHVIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + AS + V++ L D F+++ E Sbjct: 435 NQENINALASKKDKEQHVFKVKDMENLEDVFIQMLDETRTL 475 >gi|73972310|ref|XP_849985.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 2 [Canis familiaris] Length = 1112 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 73/221 (33%), Gaps = 34/221 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +R+ ++ + S GLVT+++ Sbjct: 606 IVLDPSGSMNIYLVLDGSDSIGAGNFTRAKNCLRDFIEKVASYGVKPKY---GLVTYATN 662 Query: 227 IVQTFPLAWGVQHIQEKINRL---------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ + S T + LE Y+ + E Sbjct: 663 PKVWVRVRDKNSSDADWVTKILNQVSYEDHMLKSGTNTKKALEAVYSMMNWPGETP---P 719 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 + + II +TDG N + + + L + K VY GV Sbjct: 720 ADWNRTRHVIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHKNPREDYLDVYVFGVGPLV 779 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + AS + V++ L D F+++ E Sbjct: 780 NQENINALASKKDKEQHVFKVKDMENLEDVFIQMLDETRTL 820 >gi|73972308|ref|XP_532086.2| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 1 [Canis familiaris] Length = 767 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 73/221 (33%), Gaps = 34/221 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +R+ ++ + S GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGAGNFTRAKNCLRDFIEKVASYGVKPKY---GLVTYATN 317 Query: 227 IVQTFPLAWGVQHIQEKINRL---------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ + S T + LE Y+ + E Sbjct: 318 PKVWVRVRDKNSSDADWVTKILNQVSYEDHMLKSGTNTKKALEAVYSMMNWPGETP---P 374 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 + + II +TDG N + + + L + K VY GV Sbjct: 375 ADWNRTRHVIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHKNPREDYLDVYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + AS + V++ L D F+++ E Sbjct: 435 NQENINALASKKDKEQHVFKVKDMENLEDVFIQMLDETRTL 475 >gi|54309668|ref|YP_130688.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] gi|46914106|emb|CAG20886.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] Length = 494 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 76/226 (33%), Gaps = 25/226 (11%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G+ I TA+ L +F ++ +E + K +L + + L T NQ++ Sbjct: 5 RQQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAVTTA--NQDDKT 62 Query: 71 NGKKQKNDFSYRIIKNI-WQTDFRNELRENGF---AQDINNIERSTSLSIIIDDQHKDYN 126 + ++ I+NI D + E E D N + H Sbjct: 63 YENQLATNYVQTYIRNIAIINDIKVERSEGIDYYPTPDGNEEREYFQYRVTAKTDHTS-- 120 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 +S +P T + I D +D++ V D S SM + Sbjct: 121 --WLSSDIIPSFSPTETVANRALARNYPIYL-------GDKDIDIVFVSDFSGSMKGN-- 169 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVV--RSGLVTFSSKIVQT 230 K+ +I+ + + I D V R V ++ ++ + Sbjct: 170 ----KIRALKDAIQAIANEILVPRDGEVEVTNRIAFVPYNMRVQEK 211 >gi|240172543|ref|ZP_04751202.1| hypothetical protein MkanA1_24733 [Mycobacterium kansasii ATCC 12478] Length = 963 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++++LD S SM K+ A R ++D + S R ++TF +I Sbjct: 294 PRDVVLLLDRSHSMAGW------KIVAARRVSTRIVDTLSSTD------RFAVLTFGDRI 341 Query: 228 VQTFPLAWG--------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + LA G E + R+ T+ L + + ++ + Sbjct: 342 DRAGGLADGLVEASDRHRYRAIEHLARVDARGDTELLAPLRQGLSLLRGSQGRDA----- 396 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 ++ +TDG+ + +++ +E + V+ +GV FL A Sbjct: 397 ------VLVLITDGQVGN---EDQLLRELSDELQH--VRVHTVGVDQAVNAGFLDRLANI 445 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 V++ +L +A + + + Sbjct: 446 GGGHCELVESEDRLDEAMHAMQRRIGA 472 >gi|311268952|ref|XP_003132278.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Sus scrofa] Length = 1146 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 68/186 (36%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG D + +F R V+ V + CA+ Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTGRVFTFSVGQHNYDVTPPAXXGCATK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|311268950|ref|XP_003132277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Sus scrofa] Length = 1153 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 68/186 (36%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG D + +F R V+ V + CA+ Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTGRVFTFSVGQHNYDVTPPAXXGCATK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|220933244|ref|YP_002512143.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] gi|219994554|gb|ACL71156.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] Length = 589 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 69/187 (36%), Gaps = 24/187 (12%) Query: 157 SSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + V+++ + + +M+VLDVS +M +L A + ML+ ++ Sbjct: 83 APVQVAGEGHRGDVSLMLVLDVSATMQAQDLAP-RRLTRALLEVDGMLEGLRGE------ 135 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 R GLV F+ + + P + + ++R G + P A + + Sbjct: 136 -RVGLVAFAGRALMLAPPTHDRRLLSHYLSR---GPEALADPAGLSASRAVAEGLRLAGE 191 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-CNEAKRRGAIVYAIGVQA-------E 327 +G ++ +TDG+ + ++ + G +Y +GV + Sbjct: 192 ALEGSGA----VVLITDGDARAFAGARLAAMQTQARALRDAGHTLYVLGVGGTEPVPVPD 247 Query: 328 AADQFLK 334 A L+ Sbjct: 248 GAGGLLR 254 >gi|291386938|ref|XP_002709809.1| PREDICTED: vitrin [Oryctolagus cuniculus] Length = 869 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 69/198 (34%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K R G V ++ + Sbjct: 686 DLGFVIDGSSSV------GTGNFRTVLQFVANL---SKEFEISETDTRIGAVQYTYEQRL 736 Query: 230 TFPLA-WGVQ-HIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + I I R+ + T + + YA ++F K + +K Sbjct: 737 EFGFDKYNTKPDILNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKRKL 787 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 +I +TDG + + A +G I YAIGV A D+ P D + Sbjct: 788 MILITDGRSYD------DVRIPAMAAHHKGVITYAIGVAWAAQDELEVIATYPAKDHSFF 841 Query: 345 VQNSRKLHDAFLRIGKEM 362 V L+ RI + + Sbjct: 842 VDEFDNLYKFVPRIIQNI 859 >gi|262371960|ref|ZP_06065239.1| von Willebrand factor type A domain-containing protein [Acinetobacter junii SH205] gi|262311985|gb|EEY93070.1| von Willebrand factor type A domain-containing protein [Acinetobacter junii SH205] Length = 537 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 44/237 (18%), Positives = 89/237 (37%), Gaps = 34/237 (14%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW N+ + I + +S K +++ ++DVS SM+D DKL + +++R + Sbjct: 150 PWKENAKLIKIGI-QAKDLSVKQLPAANLVFLVDVSGSMDDP-----DKLPLVKQTLRIL 203 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLE 260 + ++ V ++T++S G Q I I+ L G T ++ Sbjct: 204 TEQLRPQDKVT------IITYASGEKLVLEPTSGDQKDKILRVIDELRAGGATSGEQAIQ 257 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + A K + I+ TDG+ + D E ++ G + Sbjct: 258 LAYKQAEKA------FIKNGINR---ILLATDGDFNVGITDFSTLKGMVAEKRKSGVSLT 308 Query: 321 AIGVQAEA-ADQFLKNC--ASPDRFYSVQN--------SRKLHDAFLRIGKEMVKQR 366 A+G ++ ++ A + + N R+L + +++ Q Sbjct: 309 ALGYGTGNYNEELMEQIADAGDGNYSYIDNKNEAKKVVQRQLSSTLATVAQDVKIQV 365 >gi|258651507|ref|YP_003200663.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554732|gb|ACV77674.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 593 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 81/226 (35%), Gaps = 31/226 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIKSI 209 P + +++ ++ G M+ V+D+S SM G D++ +A + L + ++ Sbjct: 368 PQAVDDAIRSVQVTNEGTRMLAVMDISGSMLAQVPGTNGADRIDLAKDAAARGLGLYRAD 427 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEYAYN---- 264 D GL FS+++ + + I+ L + L A N Sbjct: 428 SD------IGLWEFSTRL----SPTSDHRELIP-ISSLGPDGQGSTGAARLAAALNGLQA 476 Query: 265 ------KIFDAKEKLEHIAKGHDDYKK--YIIFLTDGENSSPNI---DNKESLFYCNEAK 313 ++D + D + ++ LTDG N N D S + Sbjct: 477 IPDGGTGLYDTVLDATRTVRAGYDPDRVNVVLLLTDGMNDDVNSITMDQLLSTLAAEQDP 536 Query: 314 RRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 R V +I ++ L+ A+ Y Q+ R++ + FL Sbjct: 537 ARPVPVISIAFGPDSDVAALQQISRATGGATYLSQDPRQIGEIFLD 582 >gi|260797295|ref|XP_002593639.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] gi|229278865|gb|EEN49650.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] Length = 373 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 62/168 (36%), Gaps = 24/168 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ G + ++ + P G++ +S++ Sbjct: 222 PLDIIFLLDGSGSV------GASNFEKVKQFTKKTISGFDISPSGTQ---VGVIQYSTRT 272 Query: 228 VQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F + + + I+ + T + + Y F + A+ Sbjct: 273 RQEFSMNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKSD----GARPGVP-- 326 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K +I +TDG + EA+++G VYAIGV ADQ Sbjct: 327 KVVIVVTDGVSYDAVAAP------ALEAQQKGITVYAIGVSGYDADQL 368 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 72/193 (37%), Gaps = 25/193 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ G + ++ + P G++ +S++ Sbjct: 6 PLDIIFLLDGSGSV------GASNFVKVKQFTKKTISGFDISPSGTQ---VGVIQYSTRT 56 Query: 228 VQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F + + + I+ + T + + Y F + A+ Sbjct: 57 RQEFSMNSFVTKETLSAAIDEVQYMRGGTLTGKAIRYVTKYGFGKSD----GARPGVP-- 110 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K +I +TDG + EA+++G VYAIGV ADQ L+ AS + + Sbjct: 111 KVVIVVTDGVSYDAVAAP------ALEAQQKGITVYAIGVSGYDADQ-LEQIASNNNTLA 163 Query: 345 VQNSRKLHDAFLR 357 ++ L D Sbjct: 164 FVDNFNLLDNLRN 176 >gi|194335402|ref|YP_002017196.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307879|gb|ACF42579.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 343 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 40/194 (20%), Positives = 64/194 (32%), Gaps = 23/194 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 IFC + P + + K G D++ +LDVS SM D+LG A Sbjct: 60 LIFCGIALLLFALAGPRFCSGGRPVLRK---GADIVFMLDVSRSMRARDVLP-DRLGQAK 115 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 I SI R ++ F++ + PL + LI Sbjct: 116 -------QEITSISRAVTGGRMSILLFAASPLVQCPLTTDRDAFDALLGMASPDLIEEQG 168 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T E A + E +K ++ L+DGE+ + E + Sbjct: 169 TSFRAAFELAGRLLEPTLEDR---MASGVKGEKIVVLLSDGEDHTG-----EVRSAVQQL 220 Query: 313 KRRGAIVYAIGVQA 326 K+ ++ IGV Sbjct: 221 KKANVHLFVIGVGM 234 >gi|330839962|ref|YP_004414542.1| von Willebrand factor type A [Selenomonas sputigena ATCC 35185] gi|329747726|gb|AEC01083.1| von Willebrand factor type A [Selenomonas sputigena ATCC 35185] Length = 215 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 63/174 (36%), Gaps = 16/174 (9%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + L + ++LD S SM G ++ + +++ L ++ P ++T Sbjct: 2 ADVGRRLPVYLLLDCSGSM---MGEPIEAVRQGIKAL---LSELRGDPQALETAYLSVIT 55 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F+S++ QT L + + K RL T L+ + K KG Sbjct: 56 FASQVRQTTKLT---ELMLFKEPRLEAEGCTLMGGALKLLAECVRTEVRKNTETQKGDWR 112 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA-IVYAIGVQAEAADQFLKN 335 + LTDG + D ++ E K + A A+A +LK Sbjct: 113 P--LVFLLTDGSPT----DLEDFRQAAAEIKSLKLGNIIACAAGADADTSYLKQ 160 >gi|326789712|ref|YP_004307533.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540476|gb|ADZ82335.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 404 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 78/219 (35%), Gaps = 34/219 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K D+++ +D S SM + AT S+ + L+ + R +TF Sbjct: 108 KEKAFKDIVIAIDTSGSMEQS--DPNGERFKATSSLIDNLEGNR---------RIAFMTF 156 Query: 224 SSKIVQTFP-LAWGVQHIQEKINRLIFG------STTKSTPGLEYAYNKIFDAKEKLEHI 276 + F + + +E + I T + AY I + + Sbjct: 157 DDSPILQFDFMEATTKEQKEVVKAKIASYQQNDDGQTGVRDMINEAYELIQNNSKNHSGS 216 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--QFLK 334 +I ++DG S + N +L N + +Y IG+ Q+L Sbjct: 217 ----------LIMISDGAPSDDSASNIPAL-VSNYVQNN-IPIYTIGMMYGDNSAEQYLI 264 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + A + + YS ++ + AF +I + K+ ++ + Sbjct: 265 DIANLTGGQHYSTSDTTMIAGAFGQIRYDEGKRELMTER 303 >gi|294651171|ref|ZP_06728503.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] gi|292822924|gb|EFF81795.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] Length = 446 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 76/209 (36%), Gaps = 33/209 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++DVS SM DKL + +++R + + +++ V ++T++S Sbjct: 176 LVFLVDVSGSM-----SAADKLPLVKQTLRILTEQLRAQDKVT------IITYASGEKLV 224 Query: 231 FPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 G + I IN L +T ++ AY + A K + I+ Sbjct: 225 LEPTSGEQKEKILAVINGLRARGSTAGEQAIQLAYKQAEKA------FVKNGINR---IL 275 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNC--ASPDRFYSV 345 TDG+ + D E ++ G + +G +Q ++ A + + Sbjct: 276 LATDGDFNVGITDFNTLKGMVAEKRKSGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYI 335 Query: 346 QN--------SRKLHDAFLRIGKEMVKQR 366 N R+L + +++ Q Sbjct: 336 DNKNEAKKVVQRQLSSTLATVAQDVKIQV 364 >gi|114586161|ref|XP_001172548.1| PREDICTED: hypothetical protein [Pan troglodytes] Length = 543 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 75/206 (36%), Gaps = 25/206 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + D++ ++D S S++ R + M++ D ++ G Sbjct: 86 SSEACKNSKADIIFLIDGSESISPK------DFEKMKRFVESMVNQSNIGTDG---IQIG 136 Query: 220 LVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L+ FSS ++ F L + + + T++ L + +K Sbjct: 137 LLQFSSIPLEEFRLNQYSSKVDIYRATFDVQQMRDGTRTGKALNFTLPFFDSSKG----- 191 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 G ++Y+I +TDG I ++L + + I++AIGV Q L+ Sbjct: 192 --GRPSVQQYLIVITDGVAQDNVIIPAKAL------RDKNIIIFAIGVGEAKKSQLLEIT 243 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEM 362 D+ Y N L + I ++ Sbjct: 244 NDEDKVYHDVNFEALQNLEKEILSKV 269 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 12/83 (14%) Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + +Y+I +TDG++S + E L + G +YAIG++ EA LK A Sbjct: 12 NVARYLIVITDGKSSDSVAEAAEGL------RANGVNIYAIGIR-EANIDELKEIAKDKI 64 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 F+ + D I KE+V+ Sbjct: 65 FFVYE-----FDLLKDIQKEVVQ 82 >gi|123228966|emb|CAI21016.2| novel protein similar to vertebrate inter-alpha (globulin) inhibitor H5 (ITIH5) [Danio rerio] Length = 906 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 80/216 (37%), Gaps = 32/216 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SM K+ +++ +++ ++ + N VTFS++I Sbjct: 254 VVFVIDTSASMLG------TKMKQTKQALFTIINELRPNDNFN------FVTFSNRIRVW 301 Query: 231 FPLAW------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD--AKEKLEHIAKGHDD 282 P ++ ++ I + T G++ + D + + H Sbjct: 302 QPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALLSDYLSSKDESHHHSVSL- 360 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCA 337 IIFLTDG + + + + A + ++ IG+ + + L NC Sbjct: 361 ----IIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGMGDDVDYRLLERMSLDNCG 416 Query: 338 SPDRFYSVQNSRKLHDAFLR-IGK-EMVKQRILYNK 371 + R ++ + F IG + R+ Y++ Sbjct: 417 TMRRIPEDADASLMLKGFYDEIGTPLLSDIRVEYSE 452 >gi|124008260|ref|ZP_01692956.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123986209|gb|EAY26038.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 552 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 52/306 (16%), Positives = 116/306 (37%), Gaps = 39/306 (12%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 KK N+ ++ +K + F ++ +++ +I + L + +++ Sbjct: 85 KEKKPANENTFLSVKTAPLSTFSIDVDNASYSRARKSI-NNGQLPSTSSVRLEEFINYFN 143 Query: 131 SRYEMPFIFCTF---------PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 +Y+ P F PW + + + S K + +++ ++DVS SM Sbjct: 144 YQYKQPEGQHPFSVNTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLS-NLVFLIDVSGSM 202 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV--QH 239 DKL + ++ + +++ ++ R +V ++ P G Q Sbjct: 203 -----SAPDKLPLLRKAFKMLVN------NLGEEDRVAIVVYAGNAGLVLPATQGTDKQK 251 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I E +++L G +T G++ AY K + II TDG+ + Sbjct: 252 IMEALDKLQSGGSTAGGAGIKLAYKIAKQNFIKEGNNR---------IILATDGDFNLGA 302 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 ++ E ++ G + +G+ D ++ A +Y + N L++A+ Sbjct: 303 SSDQAMQNLIEEKRKEGVFITVLGLGMGNYRDSKMEIIADKGNGNYYYLDN---LNEAYK 359 Query: 357 RIGKEM 362 GK++ Sbjct: 360 VFGKDL 365 >gi|326932352|ref|XP_003212283.1| PREDICTED: von Willebrand factor A domain-containing protein 1-like [Meleagris gallopavo] Length = 450 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 83/212 (39%), Gaps = 36/212 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S S+ D++++LD S S++ + + + +++ P N V++ ++ Sbjct: 28 PSISNSEGDILLLLDSSGSVSYY------EFSKVKEFMWDLMQPFTFGP---NDVQTSII 78 Query: 222 TFSSKIVQTFPLAW--GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 S+ FP +Q+ I N T + L +A K+F + A Sbjct: 79 HISTTPTMEFPFDRYLSRGTVQQAIRNTRQLMGDTNTGKALSFAKEKLFS------NDAG 132 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCA 337 D K ++++TDG +S + + L K G V+ + + L A Sbjct: 133 ARPDVPKVLVWVTDGFSSDDISEPMQLL------KDMGVTVFIVSTG--RGNYLELSAAA 184 Query: 338 S--PD---RFYSVQN----SRKLHDAFLRIGK 360 S P+ F V + +++L DA L + + Sbjct: 185 SQPPEKHLHFVDVDDLPIITKELRDAMLDVIQ 216 >gi|123781093|sp|Q3V3R4|ITA1_MOUSE RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|74186862|dbj|BAE20498.1| unnamed protein product [Mus musculus] Length = 1179 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 97/285 (34%), Gaps = 49/285 (17%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKIS---- 162 ++ ++ ++ + + V+ + F+ C + + H I S V + Sbjct: 99 VNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVV 158 Query: 163 ------SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + LD+++VLD S S + T + ++L + P Sbjct: 159 NSFAPVQECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLKRMDIGPKQTQ-- 208 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEK 272 G+V + + + F L + + N++ T + G++ A + F Sbjct: 209 -VGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEAFTEARG 267 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-------- 324 K K ++ +TDGE S N K+ + C ++I + Sbjct: 268 ARRGVK------KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGN 317 Query: 325 -QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 E + +K+ AS F++V + L +G+ + Sbjct: 318 LSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFAL 362 >gi|153791389|ref|NP_001028400.2| integrin alpha-1 precursor [Mus musculus] gi|189442109|gb|AAI67237.1| Integrin alpha 1 [synthetic construct] Length = 1179 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 97/285 (34%), Gaps = 49/285 (17%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKIS---- 162 ++ ++ ++ + + V+ + F+ C + + H I S V + Sbjct: 99 VNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVV 158 Query: 163 ------SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + LD+++VLD S S + T + ++L + P Sbjct: 159 NSFAPVQECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLKRMDIGPKQTQ-- 208 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEK 272 G+V + + + F L + + N++ T + G++ A + F Sbjct: 209 -VGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEAFTEARG 267 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-------- 324 K K ++ +TDGE S N K+ + C ++I + Sbjct: 268 ARRGVK------KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGN 317 Query: 325 -QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 E + +K+ AS F++V + L +G+ + Sbjct: 318 LSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFAL 362 >gi|33985|emb|CAA30160.1| trypsin inhibitor [Homo sapiens] Length = 946 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 71/201 (35%), Gaps = 27/201 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N +R+ Sbjct: 311 ILFVIDVSGSMWGV------KMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRT------W 358 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +Q + I ++ T L A + +A Sbjct: 359 RNDLFQLQKHRLQIAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVS---- 414 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR---- 341 II ++DG+ + + + E + ++++G+ + FLK ++ + Sbjct: 415 LIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQ 474 Query: 342 --FYSVQNSRKLHDAFLRIGK 360 + + S +L + ++ Sbjct: 475 RIYGNQDTSSQLKKFYNQVST 495 >gi|326916310|ref|XP_003204451.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 2040 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/262 (13%), Positives = 91/262 (34%), Gaps = 31/262 (11%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYE----MPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 +++ + Y ++ ++ P + ++++ P L + ++ Sbjct: 55 DSTVVLEELRAGTTYKVNVFGMFDGGESNPLVGQEMTTLSDTTTEPFLSRG---LECRTR 111 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+++++D S S+ I ++++ PD V+ GL +S Sbjct: 112 AEADIVLLVDGSWSIGRP------NFKTVRNFISRIVEVFDIGPDK---VQIGLAQYSGD 162 Query: 227 IVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + L + + E + L + + G+ + + K+ + + Sbjct: 163 PRTEWNLNAYRTKEALLEAVTNLPYKGG-NTLTGMALDFILKNNFKQDAGLRPR----AR 217 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RF 342 K + +TDG++ + L + G +YAIG++ ++ + PD Sbjct: 218 KIGVLITDGKSQDDVVTPSRRL------RDEGVELYAIGIKNADENELKQIATDPDDIHA 271 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 Y+V + L + + Sbjct: 272 YNVADFSFLASIVEDVTTNLCN 293 >gi|261876473|dbj|BAI47562.1| collagen type VI alpha 2 subunit [Mesocricetus auratus] Length = 1026 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 71/213 (33%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D +++ VLD S S M + + L + V R G Sbjct: 45 PEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLDQVALSWRYG 104 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I +H+ + Sbjct: 105 GLHFSDQVEVFSPPGSDRASFTKSLQSIRSFRRGTFTDCALANMTQQI------RQHVGR 158 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G + + + +TDG + + A+ G ++A+ +Q L++ A+ Sbjct: 159 GVVN---FAVVITDGHVTGSPCGGIK--MQAERAREEGIRLFAVAPNRNLNEQGLRDIAN 213 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 N + I ++ + + I K Sbjct: 214 TPHELYRNNYATMRPDSTEIDQDTINRIIKVMK 246 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 619 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 675 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AYN++ + + Sbjct: 676 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRRQKTRV--- 732 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 733 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 767 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 65/180 (36%), Gaps = 17/180 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ +LD S + + + + L + + D N R L+ F Sbjct: 835 TQRPVDIVFLLDGSERLGEQNFHKARRF---VEEVSRRLTLARKDDDALNA-RMALLQFG 890 Query: 225 SKIVQT--FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ Q FPL + + I E + R S + G+ +A N + Sbjct: 891 SQNQQEVVFPLTYNLTTIHEALERTTYLNSFSHVGAGIVHAINNVVRGARGGARRHAELS 950 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +FLTDG + +++ + +++ + + V ++ L + DR Sbjct: 951 -----FVFLTDGVTGNDSLEES-----VHSMRKQNVVPTVVAVGSDVDMDVLTKISLGDR 1000 >gi|51597757|ref|YP_071948.1| hypothetical protein YPTB3465 [Yersinia pseudotuberculosis IP 32953] gi|153947845|ref|YP_001399498.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162419629|ref|YP_001605639.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|170022818|ref|YP_001719323.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896948|ref|YP_001874060.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|229837270|ref|ZP_04457433.1| hypothetical protein YPS_1184 [Yersinia pestis Pestoides A] gi|51591039|emb|CAH22703.1| Conserved hypothetical protein [Yersinia pseudotuberculosis IP 32953] gi|152959340|gb|ABS46801.1| putative tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162352444|gb|ABX86392.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|169749352|gb|ACA66870.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699974|gb|ACC90603.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|229705393|gb|EEO91403.1| hypothetical protein YPS_1184 [Yersinia pestis Pestoides A] Length = 346 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 58/197 (29%), Gaps = 14/197 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L ++ +++ +K P ++ F+ Sbjct: 3 RLPIFFVLDCSESMIGE------NLKKMNDGLQMIINDLKKDPHALETAWISVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL V+ + RL G T L+ +I K KG + Sbjct: 57 KTIVPL---VEVVSFYPPRLPIGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 LTDG P D + R + AIG+ A L+ ++ Sbjct: 112 YLLTDG---RPTDDTTAEITRWKTHYARKVNLIAIGLGPSADLNILRQLTENVLLFNDTQ 168 Query: 348 SRKLHDAFLRIGKEMVK 364 I + Sbjct: 169 EGDFTQFIKWITASVSA 185 >gi|126272975|ref|XP_001371818.1| PREDICTED: similar to anthrax toxin receptor [Monodelphis domestica] Length = 858 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 52/254 (20%), Positives = 81/254 (31%), Gaps = 32/254 (12%) Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL-------LITSSVKISSKSDIGLDM 171 + Q K + ++ C+ P ++ P T S Sbjct: 288 ERQRKWRVVGLTCGPQLSSYCCSSPLGVVAAPKPRRRWRNSAPATVPSISISSWTNSSST 347 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++LD S S+ H+ + ++ ++R + FSSK Sbjct: 348 CVILDKSGSVKHHWIEIYSFVESLAEKF------------ISPMLRMSFIVFSSKGTTIM 395 Query: 232 PLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + I++ + L + G T G E A E++ H G II Sbjct: 396 KLTEDREAIRQGLEVLRYEVPGGDTFMHKGFERA-------NEQIYHENYGGLRTASVII 448 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 LTDGE E N A+ GAIVY +GV+ Q S D + V Sbjct: 449 ALTDGELQKEQFYFAE--KEVNRARTFGAIVYCVGVKDFNETQLSTIADSIDHVFPVTGG 506 Query: 349 -RKLHDAFLRIGKE 361 L I K+ Sbjct: 507 FHALRGVIDSILKK 520 >gi|83951473|ref|ZP_00960205.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] gi|83836479|gb|EAP75776.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] Length = 550 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 32/74 (43%), Gaps = 3/74 (4%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---QFLKNCASPDRFYSVQNSRKLHD 353 + + C K G +V++IG + + Q LKNCAS + Y ++D Sbjct: 475 DGSEKDTRMKASCTATKNEGVVVFSIGFEIDQGGTAEQVLKNCASSENHYFRAEGININD 534 Query: 354 AFLRIGKEMVKQRI 367 AF I +V R+ Sbjct: 535 AFSAIASNVVNLRL 548 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 72/208 (34%), Gaps = 37/208 (17%) Query: 20 LTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDF 79 + + ++ G+ ++ +A++ LD ++L +A G++ + D+ Sbjct: 1 MALVFFLIMIAAGGIAVDMMRYEMKRAQIQSTLDSAVLASAGA----PYGSDHRAIIEDY 56 Query: 80 SYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIF 139 + + L +I S S++ D Y + E+ Sbjct: 57 -------FRVANMTDYLAAEK-EGEIVVTVNSASVTANADMTMDTYLMKLSGIKEL---- 104 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 S + + L++++VLDVS SM + KL ++ Sbjct: 105 -------------RTTGGSTAV--RKVPKLEVVLVLDVSGSMGSN-----SKLVNLKKAA 144 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +E + + + + N V +V FS + Sbjct: 145 KEFVTSLLNGSEPGNTV-ISIVPFSWSV 171 >gi|198421549|ref|XP_002127942.1| PREDICTED: similar to calcium activated chloride channel 4 [Ciona intestinalis] Length = 1075 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 60/198 (30%), Gaps = 33/198 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++VLD S SM+ M + AT I IP G+V FS Sbjct: 325 VLVLDTSGSMSGSNYEYM--MQAATDFIMTY------IPKGAEA---GIVEFSYTATTLS 373 Query: 232 PL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L +++ ++ +T G+ + + Sbjct: 374 QLVSIENKADREYLASRLPG-QPDGSTCIGCGILNGIEVLSNQGRDPAGGQ--------- 423 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY- 343 +I LTDGE + N + A +V +I A L +Y Sbjct: 424 LIVLTDGEENYSPYVNDVR----DNAIEAHVVVDSIFFGASGNGALQQLTEDTKGTMYYN 479 Query: 344 SVQNSRKLHDAFLRIGKE 361 V + L + F ++ + Sbjct: 480 DVTDITGLKETFKQLAES 497 >gi|160882769|ref|ZP_02063772.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|237720675|ref|ZP_04551156.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|260170238|ref|ZP_05756650.1| hypothetical protein BacD2_00060 [Bacteroides sp. D2] gi|293373991|ref|ZP_06620332.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145609|ref|ZP_07038677.1| BatB protein [Bacteroides sp. 3_1_23] gi|315918601|ref|ZP_07914841.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111793|gb|EDO13538.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|229449510|gb|EEO55301.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|292631067|gb|EFF49704.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516100|gb|EFI39981.1| BatB protein [Bacteroides sp. 3_1_23] gi|313692476|gb|EFS29311.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 342 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 62/196 (31%), Gaps = 23/196 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 IF + P + K+ + G+++++ LD+S SM +L A Sbjct: 61 IIFVAIGLFSVLLARPQFGS---KLETVKRKGVEVIIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R I ++D + + + G++ F+ P+ + + + +K Sbjct: 117 RLISRLVDELDN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A N + + II +TDGEN +G Sbjct: 170 TAIGEAIN-------LAARSFTPQEGVGRAIIVITDGENHEGGAVEAAKAAA-----EKG 217 Query: 317 AIVYAIGVQAEAADQF 332 V +GV Sbjct: 218 IQVSVLGVGMPDGAPI 233 >gi|34783791|gb|AAH56811.1| Zgc:112265 protein [Danio rerio] Length = 927 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 65/197 (32%), Gaps = 31/197 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ ++D S SM+ K+ ++ +L + GL+TF ++I Sbjct: 277 VVFIIDRSGSMHG------RKIRQTRSALLTILKDLDEDDHF------GLITFDAEIDFW 324 Query: 229 ---QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ + + R+ T + + I K Sbjct: 325 RRELLQATKANRENAESFVKRIQDRGATNINDAVLAGVDMINRNPRKGTAS--------- 375 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 +I LTDG+ ++ + ++ + EA +Y +G + F L+N A Sbjct: 376 ILILLTDGDPTAGETNIEKIMANVKEAIGSKFPLYCLGFGYDVNFDFLTKMSLENNAVAR 435 Query: 341 RFYSVQNSRKLHDAFLR 357 R Y ++ F Sbjct: 436 RIYEDSDADIQLQGFYD 452 >gi|13591884|ref|NP_112256.1| integrin alpha-1 precursor [Rattus norvegicus] gi|124941|sp|P18614|ITA1_RAT RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|56494|emb|CAA36384.1| unnamed protein product [Rattus norvegicus] gi|149059385|gb|EDM10392.1| integrin alpha 1, isoform CRA_b [Rattus norvegicus] Length = 1180 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 94/285 (32%), Gaps = 49/285 (17%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKIS---- 162 ++ ++ ++ + + V+ F+ C + + H I S V + Sbjct: 99 VNTSIPNVTEIKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVV 158 Query: 163 ------SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + LD+++VLD S S + + ++L + P Sbjct: 159 NSFAPVQECSTQLDIVIVLDGSNS--------IYPWESVIAFLNDLLKRMDIGPKQTQ-- 208 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEK 272 G+V + + F L + + N++ T + G++ A + F Sbjct: 209 -VGIVQYGENVTHEFNLNKYSSTEEVLVAANKIGRQGGLQTMTALGIDTARKEAFTEARG 267 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-------- 324 K K ++ +TDGE S N K+ + C ++I + Sbjct: 268 ARRGVK------KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGN 317 Query: 325 -QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 E + +K+ AS F++V + L +G+ + Sbjct: 318 LSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFAL 362 >gi|320352629|ref|YP_004193968.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121131|gb|ADW16677.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 577 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 75/194 (38%), Gaps = 20/194 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++DVS SM D +KL + +++ ++ + + V VV +G +V Sbjct: 203 LVFLIDVSGSMQDG-----NKLPLLKQALPLVVRQLGARDRVALVVYAGA----DSVVLP 253 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 Q I +++L G +T ++ G+ AY + KG ++ +I Sbjct: 254 PTPGDRQQEILAALDQLQAGGSTHASSGIRTAYEL------ARKSFIKGGNNR---VILA 304 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASPDRFYSVQNSR 349 +DG+ + E E ++ G + +G+ D ++ A Sbjct: 305 SDGDFNVGVTSRDELTRLIEEERKDGIYLTVLGLGMGNYHDDTMEVLADKGNGNYAY-ID 363 Query: 350 KLHDAFLRIGKEMV 363 L +A + KEM Sbjct: 364 SLLEAKKVLVKEMS 377 >gi|257876693|ref|ZP_05656346.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] gi|257810859|gb|EEV39679.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] Length = 1195 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 53/137 (38%), Gaps = 23/137 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK- 226 +D+++V+D S SMN+ M ++ + + L+ I+ + + V G V +SS Sbjct: 354 PIDVVLVVDWSGSMNE-----MGRIAEVKKGVDRFLNQIEG-SGIQDSVYMGYVGYSSDG 407 Query: 227 -IVQTFPLAWGV-QHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 Q G ++E I + T + GL A + + Sbjct: 408 SNYQNKTCQLGKFSEVKETIRSMTPETAAGGTFTQRGLRQAGDMLSTQNGH--------- 458 Query: 282 DYKKYIIFLTDGENSSP 298 KK I+ LTDG + Sbjct: 459 --KKVIVLLTDGVPTYS 473 >gi|198435757|ref|XP_002131969.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 594 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 66/166 (39%), Gaps = 27/166 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++++ D S S+ L + ++ +I+ S +++R G ++ ++ Sbjct: 400 LDLVLIFDSSSSVG---------LENWKKLMKFCAEIVGSFTIGRDLMRVGAFRYNQRVD 450 Query: 229 QTFPLAWG----VQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + G ++ K++++ + T++ L +AYN +A + Sbjct: 451 TATEVLLGEIDTFDELKTKVHKIPYNGSGTRTGNALLHAYNHSLNAPGNRPN-------V 503 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + ++ TDG + I+ L RGA + +G++ Sbjct: 504 RDIVLVFTDGVSHDDVIEPARLLQ------SRGADINVVGIKNSRG 543 >gi|167045536|gb|ABZ10188.1| putative von Willebrand factor type A domain protein [uncultured marine microorganism HF4000_APKG10H12] Length = 356 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 58/165 (35%), Gaps = 25/165 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + +VLD S SM+ K+ A R++ L + + + L F Sbjct: 91 NERVPVSLGIVLDTSESMDGQ------KMAAAQRALDRFL--FDLLGPDDEIF---LYRF 139 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + I + T ++DA + G + Sbjct: 140 DYTPELLQDWTVDRIRLSRAIRDIRPRGNT-----------ALYDAVAESVPRVAGGQHF 188 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 KK ++ ++DG +++ D +E + A++YAIG+ + Sbjct: 189 KKALLIISDGNDNNSETDVRELREL---IRESEALIYAIGIDGPS 230 >gi|207079949|ref|NP_001128930.1| DKFZP469A1324 protein [Pongo abelii] gi|55726315|emb|CAH89929.1| hypothetical protein [Pongo abelii] Length = 740 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 79/206 (38%), Gaps = 25/206 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L+ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNPYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + I E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDITEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---- 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTA 388 Query: 301 -DNKESLFYCNEAKRRGAIVYAIGVQ 325 D+ + N+ + +YAIGV Sbjct: 389 VDHIREILNINQKRNDYLDIYAIGVG 414 >gi|306840900|ref|ZP_07473644.1| norD protein [Brucella sp. BO2] gi|306289103|gb|EFM60361.1| norD protein [Brucella sp. BO2] Length = 633 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 78/205 (38%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + YA K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRYATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S EA+ +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|304312981|ref|YP_003812579.1| hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] gi|301798714|emb|CBL46947.1| Hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] Length = 979 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 81/210 (38%), Gaps = 37/210 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM +++D+S SM D + + + +L++ K R G+ TF + + Sbjct: 88 DMRVLIDISGSMK-----KTDPQNLRVPATKLLLNLAKPGS------RMGIWTFGQHVDR 136 Query: 230 TFPLA-----WGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PLA W Q + NR+ S T L+ A K + Sbjct: 137 LVPLATVDAKW-KQAAAREANRISSSSLYTAIGDALDAAIQ----------GDLKPDPAW 185 Query: 284 KKYIIFLTDG-----ENSSPNIDNKESLF--YCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 ++ ++ L+DG +N + N ++ +F G V+A+ + +A +FLK Sbjct: 186 ERSVVLLSDGMVDISKNPADNQREQQRIFQEVVPRLVAGGYKVHAVALSEQADIEFLKRL 245 Query: 337 --ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A+ F ++ +L F+ + + Sbjct: 246 AEATKGHFSIAHSADQLMHVFVDASDRVNQ 275 >gi|296209627|ref|XP_002751626.1| PREDICTED: collagen alpha-1(XXVIII) chain [Callithrix jacchus] Length = 1125 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 63/179 (35%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + +V ++ + FSS Sbjct: 47 IDIVFIVDSSES------SKIVLFDKQKDFVDGLSDKIFRLTPRRSVEYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNATKLLKREGRKGS-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K + +TDG + N D + ++A+ G IG+ + L+ + Sbjct: 153 -VKVALLMTDGIDHPKNPDVQS---ISDDARISGISFITIGLSTVVNEAKLRLISGDSS 207 >gi|20091285|ref|NP_617360.1| hypothetical protein MA2454 [Methanosarcina acetivorans C2A] gi|19916409|gb|AAM05840.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 551 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 59/169 (34%), Gaps = 31/169 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + V DVS SM G ++ L + + + I GLV++ Sbjct: 372 EKKKPICAVFVADVSGSM---MGEPLNNLKDSLLRGQYYIGEDNMI---------GLVSY 419 Query: 224 SSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S+ + P+A +N L G T + G+ A I + + A Sbjct: 420 SNDVNIDLPIAKFDLNQRASFAGAVNDLQAGGGTATFDGIAVAMKMI-----QEQRAADP 474 Query: 280 HDDYKKYIIFLTDGENS--SPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + I L+DGE + P D K + G +Y IG A Sbjct: 475 NIRP--VIFVLSDGETNKGHPLNDIKGIVE------DTGIPIYTIGYNA 515 >gi|269796840|ref|YP_003316295.1| von Willebrand factor type A-like protein [Sanguibacter keddieii DSM 10542] gi|269099025|gb|ACZ23461.1| von Willebrand factor type A-like protein [Sanguibacter keddieii DSM 10542] Length = 341 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 57/160 (35%), Gaps = 10/160 (6%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIR 200 F L + + + +D+ V+D + SM + + +L A + Sbjct: 49 FGLVLVVGVMGLGPSVPRTSTDSAAAAVDVFFVVDRTGSMAAEDYDGESKRLDGAKADVL 108 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 +++ I R +++F S+ + PL + ++ T +S L Sbjct: 109 SVVEEIPGA-------RYSVISFDSQATRQLPLTTDTRAVRAWTETADRELTYRSRGSLV 161 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 + + L+ + + + FL+DGEN++ Sbjct: 162 D--RPLDELTRALQGSVEQRPANVRLVFFLSDGENTASGT 199 >gi|118593079|ref|ZP_01550466.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] gi|118434386|gb|EAV41040.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] Length = 858 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 75/244 (30%), Gaps = 37/244 (15%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 MP + A S+ SS +++LD S SM G ++ V Sbjct: 1 MPLSLPRLAVATSLLVASSAF--SLVPSSAQAADRATILILDASGSMWAQLPEGRSRIEV 58 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-------IVQTFPLA-WGVQHIQEKINR 246 A + + L G++ + I P+ + ++N Sbjct: 59 ARDVLGDYLRSRDGSRP------LGVIAYGHNRKGDCQDIETISPVGVQDPASLGSRLNG 112 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 L T L A I E+ + I+ +TDG + + Sbjct: 113 LSPRGKTPLAGSLRRAATLIPKTSEEAD------------IVLVTDGLETCG----LDPC 156 Query: 307 FYCNEAKRRGAIV--YAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLRIGKE 361 + G V + +G + +C + + Q+ +L DA +R+ + Sbjct: 157 AVAASLAQEGIPVRAHVVGFGLTEGEVRQISCIAETTGGMVLAPQSGAELADALVRVTEP 216 Query: 362 MVKQ 365 + ++ Sbjct: 217 VTRE 220 >gi|332216482|ref|XP_003257380.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Nomascus leucogenys] Length = 1398 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 599 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 646 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 647 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 701 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 702 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 756 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 757 NKGFFTQISTLAD 769 >gi|297671074|ref|XP_002813673.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Pongo abelii] Length = 987 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 246 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 293 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 294 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 348 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 349 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 403 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 404 NKGFFTQISTLAD 416 >gi|296225455|ref|XP_002758485.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 isoform 1 [Callithrix jacchus] gi|296225457|ref|XP_002758486.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 isoform 2 [Callithrix jacchus] Length = 1091 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 256 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 303 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 304 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 358 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 359 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 413 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 414 NKGFFTQISTLAD 426 >gi|291393868|ref|XP_002713441.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta 3 subunit [Oryctolagus cuniculus] Length = 1352 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 523 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 570 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 571 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 625 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 626 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 680 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 681 NKGFFTQISTLAD 693 >gi|194389314|dbj|BAG61618.1| unnamed protein product [Homo sapiens] Length = 525 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 162 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 209 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 210 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 264 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 265 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 319 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 320 NKGFFTQISTLAD 332 >gi|119585707|gb|EAW65303.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_a [Homo sapiens] Length = 992 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 162 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 209 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 210 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 264 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 265 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 319 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 320 NKGFFTQISTLAD 332 >gi|109039062|ref|XP_001082066.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like, partial [Macaca mulatta] Length = 691 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 256 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 303 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 304 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 358 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 359 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 413 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 414 NKGFFTQISTLAD 426 >gi|73985413|ref|XP_533789.2| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta 3 subunit [Canis familiaris] Length = 1128 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 293 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 340 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 341 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 395 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 396 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 450 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 451 NKGFFTQISTLAD 463 >gi|54112397|ref|NP_060868.2| voltage-dependent calcium channel subunit alpha-2/delta-3 [Homo sapiens] gi|74723683|sp|Q8IZS8|CA2D3_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|22770596|gb|AAN06673.1| voltage-gated calcium channel alpha(2)delta-3 subunit [Homo sapiens] gi|187950675|gb|AAI37506.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Homo sapiens] gi|187953583|gb|AAI37503.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Homo sapiens] Length = 1091 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 256 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 303 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 304 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 358 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 359 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 413 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 414 NKGFFTQISTLAD 426 >gi|7105926|emb|CAB75962.1| calcium channel alpha2-delta3 subunit [Homo sapiens] Length = 997 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 162 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 209 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 210 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 264 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 265 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 319 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 320 NKGFFTQISTLAD 332 >gi|7024361|emb|CAB75878.1| calcium channel alpha2-delta3 subunit [Homo sapiens] Length = 519 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 162 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 209 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 210 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 264 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 265 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 319 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 320 NKGFFTQISTLAD 332 >gi|37676262|ref|NP_936658.1| hypothetical protein VVA0602 [Vibrio vulnificus YJ016] gi|37200803|dbj|BAC96628.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 442 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 74/224 (33%), Gaps = 19/224 (8%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G I+ +LP++ I+M ++ + + AK+ + + L + N Sbjct: 20 KKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLALIASPKEGDEKN 79 Query: 71 NGKKQK--NDFSYRIIKNIWQTDF-RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 QK + + + F R ++G Q + T + +H + Sbjct: 80 QEYAQKIVDHYIPDNKGEVVARVFHRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDSWIS 139 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-G 186 + F + P LD+ ++D+S SM + + G Sbjct: 140 YNEGEMGLTKDFEVMGTSTSRKFLPQP--------------LDIYFIIDMSGSMVNPWGG 185 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 G K V +I ++D ++ R ++ F V+ Sbjct: 186 SGKTKYDVVADTINRIVDDLREFKTDRKS-RVAVIGFHHTAVKQ 228 >gi|316973220|gb|EFV56840.1| putative transmembrane cell adhesion receptor mua-3 [Trichinella spiralis] Length = 3249 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 75/207 (36%), Gaps = 29/207 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIKSIPDVNNVVRSGLVTFSSKI- 227 D+M VLD S S+ V I L + I +N R ++ +S +I Sbjct: 889 DLMFVLDGSGSIGS---------AVFKNEILRFLREFINLFTIGSNHTRLAIIQYSDQIR 939 Query: 228 -VQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD--- 282 F A + E +NR+ TK+ L + F + + + Sbjct: 940 HELDFKEANSKAEVDEALNRVEYLTGLTKTGDALTDMFKIGFSSTFATKFFTILSESRGA 999 Query: 283 ------YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKN 335 + I +TDG + F NEAK+ +++A+GV + + ++ Sbjct: 1000 RPIETGVHRVAIVITDGRSQDIVS------FSANEAKKSNVLMFAVGVTDHVSEAELVEI 1053 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEM 362 S DR + V+ L+ + ++ Sbjct: 1054 AGSKDRVFLVKEFTDLNVRLRSLIQKA 1080 >gi|260837284|ref|XP_002613635.1| hypothetical protein BRAFLDRAFT_93676 [Branchiostoma floridae] gi|229299021|gb|EEN69644.1| hypothetical protein BRAFLDRAFT_93676 [Branchiostoma floridae] Length = 2411 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 72/176 (40%), Gaps = 20/176 (11%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGP-GMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +S+ S ++++V+DVS SM + GP ++L +A ++ +LD + G Sbjct: 131 VSAASPKKKNVVIVIDVSGSMREPPGPEEQNRLNLAKQAALTVLDTLTPRDWG------G 184 Query: 220 LVTFSSKIVQTFPL---------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 +V+FS++ + +Q+ IN+ + + T G A++ Sbjct: 185 VVSFSARAETPEGCLGDSLGEANPTNIGIMQDFINQRVPETITMYGVGFRKAFDMF---A 241 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 E + +D IIFL+DG + E + E R ++ G+ A Sbjct: 242 EARNKKPEQFEDCYNIIIFLSDGSPTDKAFALDE-ITKGQELMDRSVYIFTYGLGA 296 >gi|326434435|gb|EGD80005.1| hypothetical protein PTSG_10281 [Salpingoeca sp. ATCC 50818] Length = 736 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 64/180 (35%), Gaps = 24/180 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D++ +LD S S+ M + ++ + R ++ ++ Sbjct: 177 GNAAADLLFILDGSGSVGSSNFQTMLSFTRTVATFFDV---------SADTTRIAVMVYA 227 Query: 225 SKIVQTFP----LAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S F L + + I+ + T++ L+YA +F A + Sbjct: 228 SYNYLIFDFNYILTHTKDELLDAISAINYPYGGTRTGGALDYARTVMFTADR---GVRPS 284 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + + +TDG ++ + + G +YAIG+ A A + L ASP Sbjct: 285 SEGIPRVAMVITDGASADDVAAP------AQQLRDEGVTLYAIGI-AGANENELNEIASP 337 >gi|149176499|ref|ZP_01855112.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] gi|148844612|gb|EDL58962.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] Length = 598 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 53/323 (16%), Positives = 105/323 (32%), Gaps = 58/323 (17%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKL------------HYILDHSLLY 58 + +G+ ++ A L M I+ + K ++ I D ++ Sbjct: 18 QSRRGAFMVMAAPFLVATMGFMAFGIDIAVITMTKTRMRNAVEAAALAAAQQITD-AVQT 76 Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQT---------------DFRNELRENGFAQ 103 TA I +N + + N + + + + +F +++G Sbjct: 77 TADGIGGSDNVSGDVQDANSIAIDTARAVAEKVARLNGVYIDPETDVEFGKRYQDSGGTF 136 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 + E + +++ KD + F F + +T+S Sbjct: 137 HMVWGENAKPYNVVKVTARKDNATEGQPDSRLQLFFAGFMSEKTA-----AVTTSAIAFI 191 Query: 164 KSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ D+++VLD S SM+ D M + ++ LD I +V SG T Sbjct: 192 EAR---DIVLVLDYSGSMSYDSEFDAMSSYRLGKSAVEANLDDI-----WETLVDSGA-T 242 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +S FP A G I ++ I + Y Y + +E K Sbjct: 243 YSDSGKLKFP-ATGYGRINSEVGTYISSTNDD------YIYRALDLDEEDSSGNLKYPFP 295 Query: 283 YKKYIIFLTDGENSSPNIDNKES 305 + G+N N++ + S Sbjct: 296 QE--------GKNYYGNLNGEPS 310 >gi|326675074|ref|XP_003200270.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 164 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 52/126 (41%), Gaps = 13/126 (10%) Query: 221 VTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V ++ + F L I I + + G T + L++A N +F K + Sbjct: 2 VLYNDRPSAEFYLDTFANKNDIMNYIKIIPYRGGGTATGAALKFAQNNLFTQKR----GS 57 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + K+ I +TDGE+ + E +R G VYA+GV+ + ++ K + Sbjct: 58 RKALGVKQIAIVMTDGESED------DVTTTAAELRRSGVTVYALGVKNASVEELKKIGS 111 Query: 338 SPDRFY 343 P+ + Sbjct: 112 YPEHEF 117 >gi|156741691|ref|YP_001431820.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233019|gb|ABU57802.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 698 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 85/243 (34%), Gaps = 20/243 (8%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI----GLDMMMVLDVSLSMNDHFGPGM 189 MP + W A LL TS D+++++D S SM + Sbjct: 13 RMPAVRML--WVAMLIAILLLPTSPATAQQTGQALDSGNSDVVLIIDNSGSMKQN--DPQ 68 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-- 247 + A ++ D I V R + + +V+ + ++ ++ L Sbjct: 69 NLRLAAANLFIDLSDPRDKIGIVVLSDRMRTRSLTKNLVRIGS-RQDIDELKGLVDALRN 127 Query: 248 -IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 G T L+ AY+ + + + +++++ L+DG + + Sbjct: 128 ETKGQETHMGTALDLAYDLL-----DATPGSNRGANQRQFVVLLSDGLPTGVGQRERVDQ 182 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLK---NCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 +RR +++I + EA +L + S + +++ +L D +L + Sbjct: 183 AVQRFRERRYWKIFSIALGDEADPAYLDEKVSSPSGGQVVVARHAGELLDRYLDVYARAG 242 Query: 364 KQR 366 R Sbjct: 243 DDR 245 >gi|113682008|ref|NP_001038479.1| hypothetical protein LOC563353 [Danio rerio] gi|94732542|emb|CAK03688.1| novel collagen protein [Danio rerio] Length = 873 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 78/204 (38%), Gaps = 26/204 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ G+ A R + ++I K + + +V +S Sbjct: 14 DLAFIIDGSSSL------GVPNFETAKRWL---INITKGFDVSSRHTQVAVVQYSDTPRL 64 Query: 230 TFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL Q + E + + G T++ +++A + +F + Sbjct: 65 EIPLGKHQNSQELVEAVGSVSYLGGNTRTGRAIKFATDHVFGMP-----NHTSQSPRNRI 119 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA---SPDRFY 343 + LTDG + D EA+ + +++A+GV E + L + A + Sbjct: 120 AVVLTDGRSQDDVED------AAMEARAQNIVLFAVGVGNEITNSELVSMANKPASTYVL 173 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRI 367 V++ + + + +++ ++ + Sbjct: 174 HVEDYNSIASIWDLMEQKLCEESV 197 >gi|160892883|ref|ZP_02073672.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] gi|156865442|gb|EDO58873.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] Length = 596 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 72/191 (37%), Gaps = 23/191 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW ++ + + ++ S+ +++ ++D S SM + +KL +A ++ + + Sbjct: 190 PWNKDTKLMMVGLNTAAIDMSEKKAS-NLVFLIDTSGSMYEE-----NKLPLAQKAFKML 243 Query: 203 LDIIKSIPDVNNVVRSGLVTF--SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 +++ R +VT+ S +V I E ++ L +T + GL Sbjct: 244 ------AENLDENDRISIVTYAGSDTVVLNGVAGSEAYTICEALDSLEASGSTNGSAGLI 297 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY E+ + +I TDG+ + + + E K G + Sbjct: 298 TAYEI----AEQQFIKDGNNR-----VILATDGDLNVGLTSESDLVGLITEEKDSGIFLS 348 Query: 321 AIGVQAEAADQ 331 +G ++ Sbjct: 349 VLGFGSDNLKD 359 >gi|146298483|ref|YP_001193074.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152901|gb|ABQ03755.1| BatB-like protein [Flavobacterium johnsoniae UW101] Length = 344 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 52/176 (29%), Gaps = 24/176 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 K+ + G+D++ +DVS SM D +DK I L Sbjct: 79 TKMETVKREGIDIVFAVDVSKSMLAEDVAPSRLDKSKQLVSQIINSLGN----------D 128 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G+V ++ P+ + + + + L+ A Sbjct: 129 RIGIVAYAGSAFPVLPITSDYSVAKMFLQSMSPDMVSSQGTSLDEAI-------RLSATY 181 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K +I ++DGE+ S + G + +GV E Sbjct: 182 FDEKSKTSKLLILISDGEDHSEGASAAAEEA-----NKIGMKIITVGVGTEKGGTI 232 >gi|189347764|ref|YP_001944293.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341911|gb|ACD91314.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 340 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 70/204 (34%), Gaps = 31/204 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G D++ +LDVS SM D+L A + + ++ R L+ F+ Sbjct: 86 RKGADLVFMLDVSNSMLARDVLP-DRLEAARDAALRIGGSVREG-------RRALLLFAG 137 Query: 226 KIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + PL + + + LI T P +E A + G Sbjct: 138 SPLVQCPLTYDREAFSALLGMATPALIEEQGTSFLPAVELALKLFTGSVPLDSD---GTA 194 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-------LK 334 + ++ ++ L+DGE+ + + +R G ++ +G + LK Sbjct: 195 EGERIVVLLSDGEDHEGATAAAAA-----KLRRNGVSLFVLGFGSRNGADIPDPLRPGLK 249 Query: 335 NCASPDRF----YSVQNSRKLHDA 354 R +S Q R+L A Sbjct: 250 KLDGAGRVVTTRFSPQTLRQLASA 273 >gi|327467286|gb|EGF12786.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK330] Length = 470 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 51/148 (34%), Gaps = 17/148 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRSG 219 + + + + V D S SM + R I+ + ++ + V Sbjct: 191 PIEGQVNVAISFVFDSSGSMERDMKGRNTNVVKERRISILREKAIEMVKELKEIGNVSVN 250 Query: 220 LVTFSSK----IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 L FS L G + I++ IN L T GL Y + + +H Sbjct: 251 LSGFSHYGFYVQKDFSQLDKGTEQIEKSINSLPTRGVTNPGDGLRYGMVSL-----QQQH 305 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + KY++ LTDG ++ +D Sbjct: 306 VQL------KYVVLLTDGIPNAYIVDPS 327 >gi|311273682|ref|XP_003133985.1| PREDICTED: integrin alpha-1-like, partial [Sus scrofa] Length = 506 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/280 (15%), Positives = 92/280 (32%), Gaps = 49/280 (17%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKIS--------- 162 ++ ++ + + V+ + F+ C + + H I S V + Sbjct: 126 PNVTEVKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVLNSIAP 185 Query: 163 -SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + LD+++VLD S S + T + ++L + P G+V Sbjct: 186 VQECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLKRMDIGPKQTQ---VGIV 234 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIA 277 + + F L + + N ++ T + G++ A + F Sbjct: 235 QYGENVTHEFNLNKYSSTEEVLVAANEIVQRGGRQTMTALGIDTARKEAFTEARGARRGV 294 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD------- 330 K K ++ +TDGE + DN + + ++I + Sbjct: 295 K------KVMVIVTDGE----SHDNHRLNKVIQDCEDESIQRFSIAILGSYNRGNLSTEK 344 Query: 331 --QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 345 FVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFAL 384 >gi|291236748|ref|XP_002738300.1| PREDICTED: complement factor B-like, partial [Saccoglossus kowalevskii] Length = 648 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 73/212 (34%), Gaps = 25/212 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 I GLD+ +LD S S+ + + ++ +++ + D R G Sbjct: 234 TIDLNHAGGLDLYFMLDASASVGEE------NFKIGLNFVKRLVEKVGVSAD-KGGTRVG 286 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL--------IFGSTTKSTPGLEYAYNKIFDAKE 271 ++T+ S ++ F L+ + + + L T + L+ + Sbjct: 287 VLTYGSDVIINFHLSDDLTSTELVVQALDNIDYATHQGRRGTATKDALKTVREIMIP--- 343 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAAD 330 + KK + +TDG+++ + ++ K V+ +G+ + Sbjct: 344 QAAASLVDRSFAKKALFLITDGKSNIGG----DPANEADKLKTEFHVDVHCVGISQASKK 399 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGK 360 Q + + P + +++ +L I Sbjct: 400 QLVDIASKPSRQHLFFIEDYERLEWLISAITD 431 >gi|116283392|gb|AAH14765.1| Itga1 protein [Mus musculus] Length = 685 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 97/285 (34%), Gaps = 49/285 (17%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKIS---- 162 ++ ++ ++ + + V+ + F+ C + + H I S V + Sbjct: 99 VNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVV 158 Query: 163 ------SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + LD+++VLD S S + T + ++L + P Sbjct: 159 NSFAPVQECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLKRMDIGPKQTQ-- 208 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEK 272 G+V + + + F L + + N++ T + G++ A + F Sbjct: 209 -VGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEAFTEARG 267 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-------- 324 K K ++ +TDGE S N K+ + C ++I + Sbjct: 268 ARRGVK------KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGN 317 Query: 325 -QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 E + +K+ AS F++V + L +G+ + Sbjct: 318 LSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFAL 362 >gi|73953968|ref|XP_853856.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 924 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 2/84 (2%) Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + E++E G + II LTDG + +E+ +++ GA +Y+IG Sbjct: 465 QAGSNVNEQIEEANSGGKKFPSMIIALTDG--TLMPEPYEETKIEAENSRQLGATIYSIG 522 Query: 324 VQAEAADQFLKNCASPDRFYSVQN 347 V DQ L SPD + V N Sbjct: 523 VMDYRRDQLLSIADSPDHVFGVDN 546 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 68/200 (34%), Gaps = 34/200 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFS 224 + D+ +LD S + D + + I ++ ++K P+ +R +T+S Sbjct: 49 EGAFDLYFILDTSETAKDSW-----------KDIYTFVEKMVKKYPNPK--LRVSFITYS 95 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 ++ L I++ + L T +T + E++E G + Sbjct: 96 TEGHTLMKLTSDKNKIRDGLAELQNVVPTGAT----HLQEGFIKVNEQIEEANSGGELPG 151 Query: 285 KYIIFLTDGENSSPNIDNKESL------------FYCNEAKRRGAIVYAIGVQAEAADQF 332 + + G D + F N+A R + VY + + ++ Sbjct: 152 DFTV----GRPKYFQPDITVTTAPIPSTSLPMINFEVNKAVRMKSTVYFVAYKDYKKEEL 207 Query: 333 LKNCASPDRFYSVQNSRKLH 352 +P + Y + + L+ Sbjct: 208 RNIVQTPSQVYKAERFQSLN 227 >gi|330469792|ref|YP_004407535.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812763|gb|AEB46935.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 565 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 43/228 (18%), Positives = 82/228 (35%), Gaps = 30/228 (13%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP--GMDKLGVATRSIREML 203 A AP + +V S + + M+ ++DVS SM G + V + R L Sbjct: 343 AAGGLAPDAVERAVSSWSIATLSGRMLCIIDVSGSMKKAVPTANGATRQQVTAEAARRGL 402 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQT---------FPLAWGVQHIQEKINRLIFGSTTK 254 ++ GL FS+++ + PL+ ++ ++ + S Sbjct: 403 NLFDDSWS------IGLWVFSTRLDGSRDYRQVVPTGPLSRQRSTLERSLDTITSSSG-D 455 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAK 313 + Y+ + A + ++ + I+ TDG+N I ++ L K Sbjct: 456 TGL-----YDTLLAAYKDVQQNWEPGKVNS--IVLFTDGKNEDADGISQRQLLAELKRIK 508 Query: 314 --RRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + V IG+ E + L A + + + K+ D FLR Sbjct: 509 DPDQPIQVIIIGIGTEVSKAELDTIAQSAGGGAFVAADPTKIGDIFLR 556 >gi|324991934|gb|EGC23857.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK405] gi|332363536|gb|EGJ41317.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1059] Length = 462 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 48/247 (19%), Positives = 80/247 (32%), Gaps = 35/247 (14%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N +KQ D S K+++ E + G QD I + S D +K +A+ Sbjct: 106 NKQKQDWDVSELGTKSLYNMKLDLEFKTEGAYQDNRLISYNLSGK-YPDTNNKLSIDTAI 164 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM- 189 S + L + + + + V D S SMN Sbjct: 165 SA--LNTKQVFSKVAKGKKGIALAYR---TDPIQGQMNIAVSFVFDTSGSMNWDLQGRNV 219 Query: 190 ------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI----VQTFPLAWGVQH 239 ++ + + M+ + I +V+ LV FS+ L G Sbjct: 220 EKTGNESRMDILRKKSVIMIKDLAEIGNVS----VNLVGFSTSAKYIQQNFSNLDNGTNT 275 Query: 240 IQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 I I + L T GL Y + +L KYI+ LTDG + Sbjct: 276 IIATITKRENLNPDGVTNPGDGLRYGMISLQSQPAQL-----------KYIVLLTDGIPN 324 Query: 297 SPNIDNK 303 + +D++ Sbjct: 325 AYLVDSR 331 >gi|328907234|gb|EGG27000.1| aerotolerance protein BatA [Propionibacterium sp. P08] Length = 307 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LD SLSM + + D + S+P N +V+ S+ Sbjct: 81 IVVALDSSLSMKADDASP----NRLAAAKAKAKDFVNSLPTGFN---VAVVSISAHPEIR 133 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ + T ++ + + A + I+ L Sbjct: 134 MPPSTDRPTVLRALDGIELQDGTALGEAIDKSLQAVKMAPGGSKDRVPAA------IVML 187 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG N+ L A VY I E + L + Sbjct: 188 SDGGNTQGG----SPLVAATHAAAAKVPVYTIAFGTETGYVDLDGQRERVAPDTKLLSDV 243 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + + ++ ++ KL + + ++ Sbjct: 244 ADRTDAKSWTADSADKLQEVYKQV 267 >gi|300772296|ref|ZP_07082166.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] gi|300760599|gb|EFK57425.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] Length = 212 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 66/198 (33%), Gaps = 20/198 (10%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +VLD S SM+ + ++ M+ ++ P ++TF S Sbjct: 3 RLPVYLVLDTSGSMSGE------PIEAVKNGVQVMISSLRQNPQAIETAFLSVITFDSSA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + ++ + TT L+ N I + K +KG +I Sbjct: 57 RQLIPLTD--LGAFQMVD-IRATGTTSLGEALKVVSNCIDNEVAKTTSESKGDWKPLVFI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCASPDRFYSV 345 +TDG P D + L + K Y I A + LK Sbjct: 114 --MTDGI---PTDDWQSGLREFQKRKTA----YTIACAAGSGADTSVLKQITENVVSLDT 164 Query: 346 QNSRKLHDAFLRIGKEMV 363 +S+ + F + + Sbjct: 165 ADSQSISKFFAWVTASIG 182 >gi|152207243|gb|ABS30732.1| voltage-gated calcium channel alpha2-delta subunit 1 [Anopheles gambiae] Length = 1256 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 59/368 (16%), Positives = 122/368 (33%), Gaps = 64/368 (17%) Query: 28 IFIVMGLVIETSHKFFVK-AKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 I ++ L E + K + I+D + ++ + K + F N Sbjct: 91 ILLLRELAAEVKNFMDFKMNAVMRIMDSAEQAALSE--SDPESATSKAHPSAFYDARRIN 148 Query: 87 IWQTD-----------FRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM 135 +Q+D R+ R G+ +I+ + +DD + S + Sbjct: 149 EYQSDGRLAEGSRQMLLRHMRRFEGYPVNISLSSVLLPAGVSLDDPETQSAIKWSSHLD- 207 Query: 136 PFI------------------FCTFPWCANSSHAPLLITSSVKIS---------SKSDIG 168 P ++ P S +I+ + Sbjct: 208 PLFANNIERDSALSWQYFGSSTGFLRRFPGTAWPPETSYGSKEINDFRSEDWFIQAASSP 267 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREML--DIIKSIPDVNNVVRSGLVTFSS 225 D++++LD S SM+ +L VAT +I + L D ++ ++ R + F Sbjct: 268 KDVIILLDSSGSMSGKEY----QLAVATASAILDTLGDDDFFNLISFSDQSRVIVPCFQD 323 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 K+V+ P V+ ++ IN + +T + LE A+ + + + + + Sbjct: 324 KMVRATP--DNVKEVKTAINAVECENTANFSAALETAFELL-----RKYNQSSQGSQCNQ 376 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFLK-NCASPDRF 342 I+ +TDG P+ E + + N ++ IG K C + F Sbjct: 377 AIMLITDG----PSDTFMEVIKHYNHP-HMPVRIFTYLIGTDKSGGKNLYKMACENKGFF 431 Query: 343 YSVQNSRK 350 + + + Sbjct: 432 VQINSPEE 439 >gi|314927767|gb|EFS91598.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] Length = 322 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LD SLSM + + D + S+P N +V+ S+ Sbjct: 96 IVVALDSSLSMKADDASP----NRLAAAKAKAKDFVNSLPTGFN---VAVVSISAHPEIR 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ + T ++ + + A + I+ L Sbjct: 149 MPPSTDRPTVLRALDGIELQDGTALGEAIDKSLQAVKMAPGGSKDRVPAA------IVML 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG N+ L A VY I E + L + Sbjct: 203 SDGGNTQGG----SPLVAATHAAAAKVPVYTIAFGTETGYVDLDGQRERVAPDTKLLSDV 258 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + + ++ ++ KL + + ++ Sbjct: 259 ADRTDAKSWTADSADKLQEVYKQV 282 >gi|297296749|ref|XP_001083531.2| PREDICTED: integrin alpha-11 [Macaca mulatta] Length = 1149 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 40/221 (18%), Positives = 79/221 (35%), Gaps = 37/221 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 K AS F++V + L D +G + +NK Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSLEGEHNK 358 >gi|301606537|ref|XP_002932887.1| PREDICTED: cochlin-like [Xenopus (Silurana) tropicalis] Length = 789 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 66/179 (36%), Gaps = 30/179 (16%) Query: 168 GLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +++ ++D S S+ D +F +D + D++++ + + G + F+ Sbjct: 604 SVNIGFLIDGSSSVGDLNFRIMLDFMA----------DVVRAFEVSDVGTKVGAIQFTYD 653 Query: 227 IVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L + + +N L T + + +A +F + + Sbjct: 654 QRLEFGL-NDHSNKDDVLNALRSIRYMSGGTATGDAINFAVRNLFQPTKDGHN------- 705 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K ++I +TDG+ + D+ + A G ++++GV + + P Sbjct: 706 -KNFLIIVTDGQ----SYDDVRG--PASSAHISGVTIFSVGVAWAPLEDLKDMASEPKN 757 >gi|149178272|ref|ZP_01856865.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] gi|148842921|gb|EDL57291.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] Length = 169 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 59/156 (37%), Gaps = 15/156 (9%) Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 P+ + ++ F + + + ++ +I G T PG +F + Sbjct: 8 PENHGGFEVAVIDFKTNAKLCHDVPTATD-LLGHLHPVIGGGGTNMAPG-------LFIS 59 Query: 270 KEKLEHIAKGHDDYKK-YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 +E LE Y + ++ L+DG S P ++ + K+ A + + +A Sbjct: 60 REILERPIFPSQIYLRPVVVVLSDGLTSHP----AKTSEIATQLKKD-ADIVTVAFGDDA 114 Query: 329 ADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMV 363 + +L + A S + FY + L F +G + Sbjct: 115 DEPYLISLATSSEHFYHCRTGTDLRAFFASVGTTLS 150 >gi|126665799|ref|ZP_01736780.1| von Willebrand factor type A domain protein [Marinobacter sp. ELB17] gi|126629733|gb|EBA00350.1| von Willebrand factor type A domain protein [Marinobacter sp. ELB17] Length = 607 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 74/203 (36%), Gaps = 35/203 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD----------VN 213 +S L +++ DVSLS + + +A ++ + + +++ D Sbjct: 412 QSQRDLACLVLADVSLSTETYINNHQRVIDIARDGLQLLSEALQASRDPFALFAFSSRRR 471 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + VR + + I+ +I L G T+ + A + E+ Sbjct: 472 DHVRFHHLKGFDEAY--------TSTIRGRIQALEPGFYTRMGAAIRQATKLLGARHEQ- 522 Query: 274 EHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 +K ++ LTDG+ + +++ EA + G + + + EA Sbjct: 523 ----------QKVLLLLTDGKPNDLDLYEGRYGVEDTRMAVQEAIKAGLTPFCVTIDDEA 572 Query: 329 ADQFLKNCASPDRFYSVQNSRKL 351 + ++L + + +++ +L Sbjct: 573 S-EYLPYVFGSNNYVVIRDPAQL 594 >gi|145540134|ref|XP_001455757.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124423565|emb|CAK88360.1| unnamed protein product [Paramecium tetraurelia] Length = 522 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 40/305 (13%), Positives = 93/305 (30%), Gaps = 30/305 (9%) Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + + I+ + + N I ++ Q+ Sbjct: 9 DNDSYANTTKAIVINEQFIEGERPAICQEHGKFNDDDAIDVVITNESNYGRKSLSQNYMK 68 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + V + + + A LL + + G+D++ ++D S SM+ Sbjct: 69 QANYVLQDNVELKLSYSGLPTQGTQAVLLSVQTKNQAITIRQGIDLICLIDHSGSMSGE- 127 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ----HIQ 241 K+ + +S++ +L +++ R L+ F + + L Q Sbjct: 128 -----KMHLVKKSLKHLLKMLQPND------RLCLIEFDDQNYRLTRLMRATQENMYKFL 176 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 I+ + T ++ A + + K I L+DGE+ Sbjct: 177 IAIDTIEANGATDIGNAMKMALSIL------KHRRFKNPIAS---IFLLSDGEDEGAAGR 227 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDA-FLRI 358 + N + + G + + + A +FY + K+ + F + Sbjct: 228 VWNDIQSKN--IKEPFTINTFGFGRDCCPKIMSEIAHFKEGQFYYISEISKIDECFFEAL 285 Query: 359 GKEMV 363 G E Sbjct: 286 GGEAS 290 >gi|313225346|emb|CBY06820.1| unnamed protein product [Oikopleura dioica] Length = 369 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 37/185 (20%), Positives = 74/185 (40%), Gaps = 30/185 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + A ++ +++ + + R ++ +S Sbjct: 90 DLVFIIDGSWSVGNV------NFRKAKDFMKSLVNPFEI---GWDYTRVSVLQYSDDPRI 140 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY-KK 285 F L + I+ + + G T++ + Y +IF + G Y KK Sbjct: 141 EFYLKDYQDKTTLLNAIDAITYKGGNTRTGEAIRYMMGQIFS-------VEAGSRPYVKK 193 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRF 342 +++ LTDG++ + AK +AIGV +A + LK A+P D Sbjct: 194 HMVLLTDGQSQDDVGAPARA------AKNFNIRTFAIGVG-DAIEDELKLVATPPFSDTL 246 Query: 343 YSVQN 347 Y V++ Sbjct: 247 YHVED 251 >gi|194226347|ref|XP_001489610.2| PREDICTED: similar to Collagen, type VI, alpha 2 [Equus caballus] Length = 1019 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 83/247 (33%), Gaps = 20/247 (8%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISS--KSDIGLDMMMVLDVSLS--MNDHFG 186 S Y+ P + P + + +S K+D +++ VLD S S M Sbjct: 7 STYQ-PLLQLGMPMSYQWKCGDIQGGRPEEANSSEKADCPVNVYFVLDTSESVTMQSPTD 65 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + + L + V R G + FS ++ P + + Sbjct: 66 SLLYHMQQFVPQFISQLQDEFYLDQVALSWRYGGLHFSDQVEVFSPPNSDRASFIKSLQS 125 Query: 247 L-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + F T + L +I H+ KG + + + +TDG + + Sbjct: 126 ISSFRRGTFTDCALANMTQEI------RRHVKKGVN----FAVVITDGHVTGSPCGGIK- 174 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVK 364 A+ G ++A+ +Q L++ AS P Y + D+ I ++ + Sbjct: 175 -RQAERAREEGIRLFAVAPNRNLHEQGLRDIASTPLELYRSNYATMRPDS-TDIDQDTIN 232 Query: 365 QRILYNK 371 + I K Sbjct: 233 RIIKVMK 239 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 57/165 (34%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 612 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AYN++ + + Sbjct: 669 GTFEAIQLDDERINSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRRQKTRV--- 725 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ CN V AIG+ Sbjct: 726 -----FAVVITDGRH-DPRDDDLNLRALCNH----DVTVTAIGIG 760 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 61/179 (34%), Gaps = 17/179 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ +LD S + + + + L + + D N R L+ F Sbjct: 828 TQRPVDIVFLLDGSERLGEQNFHKARRF---VEEVSRRLTLARREDDPLNA-RVALLQFG 883 Query: 225 --SKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + FPL + I E + + S + G+ +A N + Sbjct: 884 GRDEQQVAFPLTSNLTAIHEALASARYLNSFSHVGAGIVHAINYVVQDARAGARRHAELA 943 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 +FLTDG + +++ + +++ + + V ++ L + D Sbjct: 944 -----FVFLTDGVTGNDSLE-----EAVHSMRKQNVVPTVVAVGSDVDADVLSKISLGD 992 >gi|48425689|pdb|1SHU|X Chain X, Crystal Structure Of The Von Willebrand Factor A Domain Of Human Capillary Morphogenesis Protein 2: An Anthrax Toxin Receptor Length = 182 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 6 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 51 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 52 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 102 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 103 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 160 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 161 FPVKGG---FQALKGIINSILAQS 181 >gi|14042797|dbj|BAB55397.1| unnamed protein product [Homo sapiens] Length = 397 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 65/180 (36%), Gaps = 24/180 (13%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-------VQTFPLAWGVQHIQE 242 KL ++ +L D+ R ++ FS++I + P + ++ + Sbjct: 4 TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVWKDHLISVTPDS--IRDGKV 55 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 I+ + T L+ A + + + H G I+FLTDG+ + Sbjct: 56 YIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHT 111 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPDRFYSVQNSRKLHDAFLR 357 + L EA R ++ IG+ + + L+NC R + +++ F Sbjct: 112 LKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYD 171 >gi|14042827|dbj|BAB55409.1| unnamed protein product [Homo sapiens] Length = 397 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 65/180 (36%), Gaps = 24/180 (13%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-------VQTFPLAWGVQHIQE 242 KL ++ +L D+ R ++ FS++I + P + ++ + Sbjct: 4 TKLRQTKDALFTILH------DLRPQDRFSIIGFSNRIKVWKDHLISVTPDS--IRDGKV 55 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 I+ + T L+ A + + + H G I+FLTDG+ + Sbjct: 56 YIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHT 111 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPDRFYSVQNSRKLHDAFLR 357 + L EA R ++ IG+ + + L+NC R + +++ F Sbjct: 112 LKILNNTREAARGQVCIFTIGIGNDVDFRLLEKPSLENCGLTRRVHEEEDAGSQLIGFYD 171 >gi|320529910|ref|ZP_08030987.1| von Willebrand factor type A domain protein [Selenomonas artemidis F0399] gi|320137928|gb|EFW29833.1| von Willebrand factor type A domain protein [Selenomonas artemidis F0399] Length = 643 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 58/176 (32%), Gaps = 25/176 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + G +++ V+D S SM ++ ++ +L D R GLV Sbjct: 450 RERKRGANILFVVDASGSMA-----ARARMRAVKGAMLALLREAYVRRD-----RVGLVA 499 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + PL V+ Q + L G T GL A + A + + Sbjct: 500 FRRDRAETLLPLTRSVELAQRLLRELPTGGRTPLAAGLSEALLHLAGAARR-------GE 552 Query: 282 DYKKYIIFLTDGENSSP--NIDNKE-SLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + ++ LTDG ++ D + +L G + + ++ Sbjct: 553 LAETLLVLLTDGRATAAPEGEDPAQAALTAAETIGNTGVR----ALVLDTEQDLVR 604 >gi|325570952|ref|ZP_08146571.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] gi|325156278|gb|EGC68462.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] Length = 1176 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 53/137 (38%), Gaps = 23/137 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-- 225 +D+++V+D S SMN+ M ++ + + L+ I+ + + V G V +SS Sbjct: 335 PIDVVLVVDWSGSMNE-----MGRITEVKKGVDRFLNQIEG-SGIQDSVYMGYVGYSSDG 388 Query: 226 KIVQTFPLAWGV-QHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 Q G ++E I + T + GL A + + Sbjct: 389 NNYQNKTCQLGKFSEVKETIRTMTPETAAGGTFTQRGLRQAGDMLSTQNGH--------- 439 Query: 282 DYKKYIIFLTDGENSSP 298 KK I+ LTDG + Sbjct: 440 --KKVIVLLTDGVPTYS 454 >gi|308497300|ref|XP_003110837.1| CRE-CUT-6 protein [Caenorhabditis remanei] gi|308242717|gb|EFO86669.1| CRE-CUT-6 protein [Caenorhabditis remanei] Length = 576 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 81/234 (34%), Gaps = 28/234 (11%) Query: 130 VSRYEMPFIFCTF-PWCANSSHAPL---LITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + RY++ TF +S P+ L+ S + + ++++++LD S S+ D Sbjct: 4 IPRYDIIISIITFLTLIQTNSANPIDNGLVDSELIHECVTHKAVEVILLLDASGSIGDDT 63 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQTFPLAWGVQHIQEKI 244 + S + + + + + + FS + L W +Q I+ + Sbjct: 64 FKKQLSFAMHLASRLNISEEGSHMALI-QYAETPKLEFSLGQFNHPTQLEWAIQRIEYQ- 121 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 T + L K K I +TDG++ + + Sbjct: 122 -----SGATNTGQALRLTLEKGLQGARPGI---------PKVAIVITDGQSQDDVSEPSQ 167 Query: 305 SLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 L + +VYAIGV Q + +P R ++V+ +L A Sbjct: 168 LLRDAD------VMVYAIGVTNLVNVHQLHQMTGNPVRVFTVETFEQLDRALAD 215 >gi|299116460|emb|CBN76178.1| similar to integrin alpha Hr1 precursor-like [Ectocarpus siliculosus] Length = 353 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 78/220 (35%), Gaps = 37/220 (16%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 C + + +++ + +++ +++D S S++D D+ ++ + D Sbjct: 39 CDSDGQYSFTMEGKTTVTANT---VNVAVIIDSSGSVDD------DEWDMSMAFAK---D 86 Query: 205 IIKSIPDVNNVVRSGLVT---FSSKIVQTFPLAWGVQHIQEKINRLI--FGSTTKSTPGL 259 + S D N G + FSS + + ++ ++ T G+ Sbjct: 87 AVSSFADQNLFTNGGSASIAQFSSSASEGGTF-YSLEDFNAFVDGNTKYSSGGTDIIDGI 145 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + + ++I TDG++SSP + + A+ G IV Sbjct: 146 AKGRELLKASPATTS-----------FMIVTTDGQSSSPKAE-------ADAARDEGTIV 187 Query: 320 YAIGVQAEAADQFLKNCASPD-RFYSVQNSRKLHDAFLRI 358 YA+GV + L + + + V +L A I Sbjct: 188 YAVGVGTGPTQEILLDIGGEEANVFDVDGFDELDVALAGI 227 >gi|332164672|ref|NP_001193680.1| calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Bos taurus] gi|296474881|gb|DAA16996.1| calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Bos taurus] Length = 1091 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +++ +LD + N ++ ++ ++ Sbjct: 256 DVVILVDVSGSMKGL------RMTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 303 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 304 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 358 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 359 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 413 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 414 NKGFFTQISTLAD 426 >gi|284029817|ref|YP_003379748.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809110|gb|ADB30949.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 550 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 47/241 (19%), Positives = 77/241 (31%), Gaps = 36/241 (14%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM--MMVLDVSLSMNDHF-GPGMDKL 192 P P + + S + + L + V+DVS SM + G ++ Sbjct: 318 PLGEGRGEGEVTQLTKPTVAAVEKILQSWTTLSLSAHSLAVIDVSGSMAEKVAGSAKTRM 377 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF----------PLAWG-VQHIQ 241 + + L K PD GL FS+KI L G Q + Sbjct: 378 QLTIEAADNGL---KMFPDSAE---LGLWVFSTKIGPDSADFRQLVPIGKLTPGHRQRMI 431 Query: 242 EKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP- 298 ++ G T A + + ++ TDG+N P Sbjct: 432 GQLKAQSARVGGGTGLYDTAIAAVRAVRSSYNSGAVNT---------VLLFTDGKNDDPG 482 Query: 299 NIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDA 354 ++ +L N K R + A+G+ +A LK A+ + Y +N L D Sbjct: 483 SLSLDRTLQILNGLKDPARPVRIIALGIGPDADADELKRLAQATGGQAYVARNPTDLKDV 542 Query: 355 F 355 F Sbjct: 543 F 543 >gi|156350148|ref|XP_001622163.1| hypothetical protein NEMVEDRAFT_v1g221080 [Nematostella vectensis] gi|156208611|gb|EDO30063.1| predicted protein [Nematostella vectensis] Length = 2040 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 71/219 (32%), Gaps = 28/219 (12%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S + + + +D+ +D S + + KL + D++ S P + Sbjct: 188 GGSRRQQTAPALQMDIAFAVDGSDATENQGSGNFRKLLQFVK------DVMSSFPLSESG 241 Query: 216 VRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEK 272 + +G+V++ +K + + I+ + F G T + G+ +F ++ Sbjct: 242 IHTGVVSYGTKAKTNLGFDVHFSQANFNSAIDGINFPGGATDTGNGIRKVMKSLFKTSKR 301 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ +T G+++ + E + G ++ +GV Sbjct: 302 RS--------IPHILVLITAGKSTG------DPSLNAEELRASGVRLFCVGVGGAYDRTQ 347 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 L AS + L F + + K Sbjct: 348 LDAIASS-----PSTTYVLTAGFDDLSGLVPTLVSRITK 381 Score = 39.4 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 47/129 (36%), Gaps = 14/129 (10%) Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ R G+V F+S FPL + + I+ + ++ S PG I A Sbjct: 1684 SDKTRVGVVVFNS-PSVQFPLDNYSSKEEVYSAIDTIT--QSSNSGPG--TVGEAIAFAN 1738 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 L K +I +T ++ + +AKR + +GV A+ Sbjct: 1739 TNLFKGKTRSQTP-KILIVITSKKSGDDVTSPSQ------DAKRENITMLVVGVGPGASK 1791 Query: 331 QFLKNCASP 339 + A+P Sbjct: 1792 SDMDVIAAP 1800 >gi|114594052|ref|XP_001144775.1| PREDICTED: anthrax toxin receptor 2 isoform 1 [Pan troglodytes] Length = 386 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 39/184 (21%), Positives = 69/184 (37%), Gaps = 30/184 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 88 QASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 138 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 139 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQ 346 + V+ Sbjct: 197 FPVK 200 >gi|302670289|ref|YP_003830249.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302394762|gb|ADL33667.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 568 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 73/177 (41%), Gaps = 22/177 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++DVS SM +KL + +S E++D + ++ V SG K+V + Sbjct: 212 LVFLIDVSGSM-----SSRNKLPLLQKSFDELVDSLPDEGTISIVTYSG----EEKVVLS 262 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + I++ I++L T G++ AY ++ +G ++ +I Sbjct: 263 GEPMSNKKGIKKAIDKLHANGCTNGQAGMQKAYEI------AQKYFIEGGNNR---VIMA 313 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ----AEAADQFLKNCASPDRFY 343 TDG+ + D + + + K G + +G + Q L +C + + Y Sbjct: 314 TDGDLNVGISDLDDLEKFITDKKDEGVFLSILGFGEGNYKDDKMQTLADCGNGNYSY 370 >gi|47219917|emb|CAF97187.1| unnamed protein product [Tetraodon nigroviridis] Length = 1251 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 40/275 (14%), Positives = 95/275 (34%), Gaps = 28/275 (10%) Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 ++ K+ D + T + EL +N +++ ++ + ++ + Sbjct: 36 LIGSPLLGQPAKRTGDVYKCPVDKEHNTCVKLELPKNTTVPNLHEVKENMAMGTTLVTNP 95 Query: 123 KDYNLSAVSRYE-MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 L+ +Y M + + + +S+ + LD++++LD S S+ Sbjct: 96 SGGFLACGPQYGYMCGQQQYISGVCANVSSSFQVLNSIASVQECAKELDIVILLDGSNSI 155 Query: 182 NDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQ 238 + SI + L+ IK+I + + G+V++ + L+ Sbjct: 156 ---YPWP---------SITDFLERFIKTIDIGPKLSQVGIVSYGETVTHNVNLSQFDNTA 203 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH--DDYKKYIIFLTDGENS 296 + + + +L G + + + + KK ++ +TDGE+ Sbjct: 204 ALMDFVRQLP------QQTGFKTMTFMGINTTRLEAFLPERGARQGVKKVMVIVTDGESH 257 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 E + C+ R G + I V + Q Sbjct: 258 DF-HALDEVIAACD---RDGIERFGIAVLGDYNRQ 288 >gi|260797475|ref|XP_002593728.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] gi|229278956|gb|EEN49739.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] Length = 186 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 68/193 (35%), Gaps = 21/193 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ VLD S S+ D + + +D++ + R G+V +S Sbjct: 2 DLFFVLDGSGSVGP------DNFETVKQFV---VDVVSAFTISLTDTRVGVVQYSDFNTL 52 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L IN + + G T + +EYA K+ K + Sbjct: 53 ACNLGDHPDEASFVTAINTMQYQGGGTATGDAMEYARVKLQAVWRPAPTPRK--FPLFQI 110 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYSV 345 +I LTDG++ + ++L G VYAIGV + L+ DR + Sbjct: 111 MIVLTDGKSGDDVVAAAQALAA------DGVTVYAIGVANFDTAELLEITNGNQDRVIEL 164 Query: 346 QNSRKLHDAFLRI 358 ++ L + I Sbjct: 165 KDYTALTASINSI 177 >gi|198419582|ref|XP_002123026.1| PREDICTED: similar to CLCA family member 1, chloride channel regulator [Ciona intestinalis] Length = 1001 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 46/212 (21%), Positives = 82/212 (38%), Gaps = 41/212 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++VLDVS SM +L + +S E + + +++ + G+V F S Sbjct: 236 FVVVLDVSGSMRGK------RLLMMRQSTSEFISSL-----LSDGDKIGIVQFHSFAQTL 284 Query: 231 FPLAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P+ +H+ + +R G +T G++ A ++ + Sbjct: 285 LPI----RHVNSQTDRFDICSRFPNRTGGSTCIGCGIQAAMQEMERD---------DPTE 331 Query: 283 YKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPD 340 +I+ LTDG EN SP + S A RG + AI + ++ + Sbjct: 332 PCGHIVVLTDGMENRSPYTVDVSSR-----AVNRGCTIDAIFLTTTQNTALVQLVNRTSG 386 Query: 341 RFYSVQ--NSRKLHDAFLRIGKEMVKQRILYN 370 R++ Q + R+L AF I E R L + Sbjct: 387 RWFFAQDRDLRRLTGAFAVIADEDGDIRNLVS 418 >gi|260581676|ref|ZP_05849473.1| conserved hypothetical protein [Haemophilus influenzae NT127] gi|260095269|gb|EEW79160.1| conserved hypothetical protein [Haemophilus influenzae NT127] Length = 345 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 70/200 (35%), Gaps = 15/200 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +V+D+S SM L I +++ ++ P V ++ F+ Sbjct: 3 RLPVYLVVDISESMAGE------NLRQMQEGISHLVNQLRRDPYALESVYLSVIAFAGAA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + RL GS T L + + + + KG YI Sbjct: 57 GTLAPLT---ELMSFYPPRLPIGSGTSIGAALNHLMDSLEKDIVRSTPEKKGDWKPLIYI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 ++DG SP D +++ + A + IG+ A L + ++ Sbjct: 114 --MSDG---SPTDDPAQAISRWKHHFQNKAKLINIGIGKFANLDTLSEISDLTYRLDDED 168 Query: 348 SRKLHDAF-LRIGKEMVKQR 366 K++ A + ++ Q Sbjct: 169 IEKVYRALCESVADSILSQS 188 >gi|257867801|ref|ZP_05647454.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257874128|ref|ZP_05653781.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] gi|257801884|gb|EEV30787.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257808292|gb|EEV37114.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] Length = 1191 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 53/137 (38%), Gaps = 23/137 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK- 226 +D+++V+D S SMN+ M ++ + + L+ I+ + + V G V +SS Sbjct: 350 PIDVVLVVDWSGSMNE-----MGRIAEVKKGVDRFLNQIEG-SGIQDSVYMGYVGYSSDG 403 Query: 227 -IVQTFPLAWGV-QHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 Q G ++E I + T + GL A + + Sbjct: 404 SNYQNKTCQLGKFSEVKETIRSMTPETAAGGTFTQRGLRQAGDMLSTQNGH--------- 454 Query: 282 DYKKYIIFLTDGENSSP 298 KK I+ LTDG + Sbjct: 455 --KKVIVLLTDGVPTYS 469 >gi|149481218|ref|XP_001506719.1| PREDICTED: similar to matrilin 4, partial [Ornithorhynchus anatinus] Length = 312 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 37/175 (21%), Positives = 70/175 (40%), Gaps = 29/175 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S S+ + + R + E++ + P R GLV FSS++ Sbjct: 134 DLVLVIDGSKSVGAQ------QFELVKRWVGELVGSLDVSPAG---TRVGLVQFSSRVRT 184 Query: 230 TFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 FPL G + + T + L + + F E A+ + Sbjct: 185 EFPL--GRHGTKAEAEAAVRAVTPMDKGTMTGLALRHLVERGFSEAE----GARPGS--R 236 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +TDG + + + AK RG +++A+GV ++ L+ AS Sbjct: 237 RVGLLVTDGRSQD------DVSPWAARAKDRGIVMFAVGVGKAVEEE-LREIASD 284 >gi|74193348|dbj|BAE20643.1| unnamed protein product [Mus musculus] Length = 682 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 97/285 (34%), Gaps = 49/285 (17%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKIS---- 162 ++ ++ ++ + + V+ + F+ C + + H I S V + Sbjct: 99 VNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVV 158 Query: 163 ------SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + LD+++VLD S S + T + ++L + P Sbjct: 159 NSFAPVQECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLKRMDIGPKQTQ-- 208 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEK 272 G+V + + + F L + + N++ T + G++ A + F Sbjct: 209 -VGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEAFTEARG 267 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-------- 324 K K ++ +TDGE S N K+ + C ++I + Sbjct: 268 ARRGVK------KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGN 317 Query: 325 -QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 E + +K+ AS F++V + L +G+ + Sbjct: 318 LSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFAL 362 >gi|301065602|ref|YP_003787625.1| hypothetical protein LCAZH_0484 [Lactobacillus casei str. Zhang] gi|300438009|gb|ADK17775.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Lactobacillus casei str. Zhang] Length = 909 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 86/283 (30%), Gaps = 27/283 (9%) Query: 20 LTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDF 79 +T L V+ I+M ++ + + NG Sbjct: 10 ITGHLFAVLLILMSMLTGLVTSG-------SSVVTAAANIRPTYQTDANGTYPTNSWQVT 62 Query: 80 SYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIF 139 + + N D + N + + + + S D + DY + + + Sbjct: 63 GQQNVINQRGGDQVSGWDNN-TIWNGDATDTTNSYLKFGDPNNPDYQIRKYA--KETNTP 119 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 + N V D+++V+D+S SM G D+ G + Sbjct: 120 GLYDVYLNVKGNTQQNVKPV----------DIVLVVDMSGSMESKNNGGTDRAGAVRTGV 169 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINR-----LIFGSTT 253 + L I++ + + V GL+ FSS G +I+ + + + Sbjct: 170 KNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIRVGLGKAGNISQQQAINS 228 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 +P ++ + KK +I LTDG + Sbjct: 229 ALSPTFNGGTYTQIGLRQGSAMLNTDTSGNKKMMILLTDGVPT 271 >gi|57163755|ref|NP_001009219.1| integrin alpha-E [Felis catus] gi|20530606|gb|AAM27173.1|AF420018_1 alpha E integrin [Felis catus] Length = 1160 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 69/222 (31%), Gaps = 20/222 (9%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + + G ++ ++LD S S++ P A I M+ Sbjct: 169 KTARWRRALKTQERQEDEAAGTEIAIILDGSGSID----PP--DFQKAKDFISNMMKNFY 222 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + N LV + I F L + T A + Sbjct: 223 AKCFECNF---ALVQYGEVIQTEFDLRDSQDALASLARVQNITQVKNVTKT-ASAIQHVL 278 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D H ++ + K I+ +TDG+ D + K +G +AI V E Sbjct: 279 DNIFTPSHGSRKNAP--KVIVVITDGDIFG---DPLNLTTVISSPKMQGVERFAIRVGNE 333 Query: 328 AADQF--LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 + LK AS ++V N L ++ + ++ Sbjct: 334 STKTLKELKLIASDPFERHAFTVTNYSALDGLLSKLQQNIIH 375 >gi|52695907|pdb|1TZN|AA Chain a, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695909|pdb|1TZN|BB Chain b, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695911|pdb|1TZN|CC Chain c, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695913|pdb|1TZN|DD Chain d, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695915|pdb|1TZN|EE Chain e, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695917|pdb|1TZN|FF Chain f, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695919|pdb|1TZN|GG Chain g, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695921|pdb|1TZN|HH Chain h, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695923|pdb|1TZN|II Chain i, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695925|pdb|1TZN|JJ Chain j, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695927|pdb|1TZN|KK Chain k, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695929|pdb|1TZN|LL Chain l, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695931|pdb|1TZN|MM Chain m, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695933|pdb|1TZN|OO Chain o, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor Length = 181 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 74/204 (36%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 5 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 50 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 51 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 101 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 102 TSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 159 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 160 FPVKGG---FQALKGIINSILAQS 180 >gi|309791907|ref|ZP_07686390.1| magnesium chelatase ATPase subunit D [Oscillochloris trichoides DG6] gi|308226079|gb|EFO79824.1| magnesium chelatase ATPase subunit D [Oscillochloris trichoides DG6] Length = 657 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 55/172 (31%), Gaps = 33/172 (19%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G V+D S SM H ++ A ++ +L D L+ F Sbjct: 459 SKAGTLFCFVVDASGSMALH------RMRQAKGAVNALLQQAYVHRDH-----VALLAFR 507 Query: 225 S-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + V+ + ++ L G T L AY A+ + H Sbjct: 508 GERADLLLPPSQSVELAKRALDVLPTGGGTPLAAALLSAYQVAEQARSRGIHRTT----- 562 Query: 284 KKYIIFLTDGENSSP-------NIDNK------ESLFYCNEAKRRGAIVYAI 322 ++ +TDG + P + D + E C+ K G I Sbjct: 563 ---LVLITDGRPNVPLQPMPGQSKDERMAQARTEVQTLCSRLKSAGIGAVVI 611 >gi|306845564|ref|ZP_07478133.1| norD protein [Brucella sp. BO1] gi|306273885|gb|EFM55712.1| norD protein [Brucella sp. BO1] Length = 633 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 78/205 (38%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S EA+ +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|291405306|ref|XP_002719067.1| PREDICTED: integrin, alpha E [Oryctolagus cuniculus] Length = 1187 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 61/203 (30%), Gaps = 22/203 (10%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ +VLD S S++ P A I M+ + LV + Sbjct: 215 AGTEIAIVLDGSGSID----PP--DFQRAKDFIANMMTNFSEKCFECSF---ALVQYGRV 265 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 I F L T A + D H ++ K Sbjct: 266 IQTEFNLQDSQNMTASLAKVQNITQVRNVTRT-ASAIQHVLDDIFTQRHGSRKKA--SKV 322 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP--- 339 I+ LTDG+ D N K +G +AIGV Q LK AS Sbjct: 323 IVVLTDGDTFE---DPLNLTVVINSPKMQGIERFAIGVGDAFKKHQTEQELKLIASDPDE 379 Query: 340 DRFYSVQNSRKLHDAFLRIGKEM 362 + V N L R+ + + Sbjct: 380 THAFKVTNYSALDGLLSRLQQSI 402 >gi|261313650|ref|ZP_05952847.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261302676|gb|EEY06173.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] Length = 391 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 78/205 (38%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 201 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 260 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 261 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 304 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S EA+ +G V+A+ V EA+ +L Sbjct: 305 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDREAS-AYL 360 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 361 PALFGRGGYALVANLAKLPVALPAI 385 >gi|256015034|ref|YP_003105043.1| nitric-oxide reductase NorD protein [Brucella microti CCM 4915] gi|255997694|gb|ACU49381.1| nitric-oxide reductase NorD protein [Brucella microti CCM 4915] Length = 633 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 78/205 (38%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S EA+ +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|254719982|ref|ZP_05181793.1| hypothetical protein Bru83_10644 [Brucella sp. 83/13] gi|265984989|ref|ZP_06097724.1| norD [Brucella sp. 83/13] gi|306837885|ref|ZP_07470746.1| norD protein [Brucella sp. NF 2653] gi|264663581|gb|EEZ33842.1| norD [Brucella sp. 83/13] gi|306407055|gb|EFM63273.1| norD protein [Brucella sp. NF 2653] Length = 633 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 78/205 (38%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S EA+ +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|254712101|ref|ZP_05173912.1| hypothetical protein BcetM6_01657 [Brucella ceti M644/93/1] gi|254715172|ref|ZP_05176983.1| hypothetical protein BcetM_01672 [Brucella ceti M13/05/1] gi|261216880|ref|ZP_05931161.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261319749|ref|ZP_05958946.1| conserved hypothetical protein [Brucella ceti M644/93/1] gi|260921969|gb|EEX88537.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261292439|gb|EEX95935.1| conserved hypothetical protein [Brucella ceti M644/93/1] Length = 633 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 78/205 (38%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S EA+ +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|161620327|ref|YP_001594213.1| von Willebrand factor type A [Brucella canis ATCC 23365] gi|163844435|ref|YP_001622090.1| hypothetical protein BSUIS_B0256 [Brucella suis ATCC 23445] gi|225628708|ref|ZP_03786742.1| Protein norD [Brucella ceti str. Cudo] gi|254699541|ref|ZP_05161369.1| hypothetical protein Bsuib55_01579 [Brucella suis bv. 5 str. 513] gi|254702662|ref|ZP_05164490.1| hypothetical protein Bsuib36_01702 [Brucella suis bv. 3 str. 686] gi|254706206|ref|ZP_05168034.1| hypothetical protein BpinM_04195 [Brucella pinnipedialis M163/99/10] gi|254711500|ref|ZP_05173311.1| hypothetical protein BpinB_14857 [Brucella pinnipedialis B2/94] gi|256029869|ref|ZP_05443483.1| hypothetical protein BpinM2_04290 [Brucella pinnipedialis M292/94/1] gi|256059516|ref|ZP_05449716.1| hypothetical protein Bneo5_04090 [Brucella neotomae 5K33] gi|256158038|ref|ZP_05455956.1| hypothetical protein BcetM4_04255 [Brucella ceti M490/95/1] gi|256253005|ref|ZP_05458541.1| hypothetical protein BcetB_01592 [Brucella ceti B1/94] gi|260167029|ref|ZP_05753840.1| hypothetical protein BruF5_01342 [Brucella sp. F5/99] gi|260568430|ref|ZP_05838899.1| von Willebrand factor [Brucella suis bv. 4 str. 40] gi|261220099|ref|ZP_05934380.1| protein norD [Brucella ceti B1/94] gi|261319110|ref|ZP_05958307.1| norD [Brucella pinnipedialis B2/94] gi|261323486|ref|ZP_05962683.1| protein norD [Brucella neotomae 5K33] gi|261749993|ref|ZP_05993702.1| protein norD [Brucella suis bv. 5 str. 513] gi|261753246|ref|ZP_05996955.1| protein norD [Brucella suis bv. 3 str. 686] gi|261756415|ref|ZP_06000124.1| von Willebrand factor [Brucella sp. F5/99] gi|265986887|ref|ZP_06099444.1| protein norD [Brucella pinnipedialis M292/94/1] gi|265996553|ref|ZP_06109110.1| protein norD [Brucella ceti M490/95/1] gi|294853265|ref|ZP_06793937.1| norD [Brucella sp. NVSL 07-0026] gi|161337138|gb|ABX63442.1| von Willebrand factor type A [Brucella canis ATCC 23365] gi|163675158|gb|ABY39268.1| Hypothetical protein, conserved [Brucella suis ATCC 23445] gi|225616554|gb|EEH13602.1| Protein norD [Brucella ceti str. Cudo] gi|260155095|gb|EEW90176.1| von Willebrand factor [Brucella suis bv. 4 str. 40] gi|260918683|gb|EEX85336.1| protein norD [Brucella ceti B1/94] gi|261298333|gb|EEY01830.1| norD [Brucella pinnipedialis B2/94] gi|261299466|gb|EEY02963.1| protein norD [Brucella neotomae 5K33] gi|261736399|gb|EEY24395.1| von Willebrand factor [Brucella sp. F5/99] gi|261739746|gb|EEY27672.1| protein norD [Brucella suis bv. 5 str. 513] gi|261742999|gb|EEY30925.1| protein norD [Brucella suis bv. 3 str. 686] gi|262550850|gb|EEZ07011.1| protein norD [Brucella ceti M490/95/1] gi|264659084|gb|EEZ29345.1| protein norD [Brucella pinnipedialis M292/94/1] gi|294818920|gb|EFG35920.1| norD [Brucella sp. NVSL 07-0026] Length = 633 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 78/205 (38%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S EA+ +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|148558518|ref|YP_001257290.1| norD protein [Brucella ovis ATCC 25840] gi|148369803|gb|ABQ62675.1| norD protein [Brucella ovis ATCC 25840] Length = 633 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 78/205 (38%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S EA+ +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|313238855|emb|CBY13854.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 70/201 (34%), Gaps = 30/201 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S+ M + + ++ N R +V ++ Sbjct: 360 DIVFVVDSSGSIGPKRFDYMKNWVKSIAASFKV---------GENFARFSVVQYTKTAKT 410 Query: 230 TFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-YKKYI 287 + I +KI+ +I+ G + N + E+ + K + K+ + Sbjct: 411 VVDFQTLDLSSISQKIDSMIYFQGRNGRGGKTFTGNAL----ERAHTLLKESEPGRKRIV 466 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + LTDG + E + ++A+GV D+ ++ A R + Sbjct: 467 LLLTDGSSDD------EYGSVAKAIRDDKVDIFAVGVGRARKDELVEITADEQRVWQ--- 517 Query: 348 SRKLHDAFLRIGKEMVKQRIL 368 F IG+ Q++L Sbjct: 518 ----TRTFNNIGQ--FNQKLL 532 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 72/207 (34%), Gaps = 38/207 (18%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--- 232 D+S ++ + ++ +++ ++ + + R ++ FS + + Sbjct: 152 DLSFIVDSSSSITISDYQKLKTWMKSIIEKLEIGDNAS---RVSILQFSGQSARPQGRWI 208 Query: 233 ---LAWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + +E + T L+Y Y +F ++ A D + Sbjct: 209 NPVLTFDRSTSKEAVIGAIDGMKKLNGDTCIGEALDYFYRNMFTSQ------AGQRSDVE 262 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---------DQFLKN 335 + +I +TDG+ + P E + + A +YAIG+ + Q L Sbjct: 263 QRVIVMTDGKRNCP----AEIAKPAELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHE 318 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRIG 359 AS + + N +L RIG Sbjct: 319 IASKPADKYVFEINNFDQLI--LKRIG 343 >gi|238064101|ref|ZP_04608810.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885912|gb|EEP74740.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 626 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 59/175 (33%), Gaps = 26/175 (14%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM D+ G + A + + +++ + + + F+ + P Sbjct: 450 LDGSGSMGDN--DGWTGIEAAATQVFDPEQAAQNLLQTHPQDVTTVAIFNGGVTGGSP-- 505 Query: 235 WGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 W V+ + + T L A ++ + + D K+ Sbjct: 506 WQVRGNDGDALRDLARSVADYEPEGGTNMYACLLRATTELTGQQ---------NGDRKRL 556 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ +TDG++ + D+ + + V AI +A + L+ A Sbjct: 557 VVLMTDGQSGAEQRDDAL-----DALRSADVPVVAIAFGRDADPRQLEEVAKATN 606 >gi|242097658|emb|CAY86115.1| truncated collagen type VI alpha 4 precursor [Homo sapiens] Length = 385 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 51/156 (32%), Gaps = 28/156 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + ++ + D VR GL ++ I Sbjct: 234 DIVFLVDSSTSIGPQ------NFQKVKNFLYSVILGLDISSDR---VRVGLAQYNDNIYP 284 Query: 230 TFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYA-YNKIFDAKEKLEHIAKGHDDY 283 F L ++ I I T + LE+ N + + Sbjct: 285 AFQL--NQHPLKSMILEQIQNLPYRTGGTNTGSALEFIRTNYLTEESGSRAKDRVP---- 338 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + +I +TDGE++ E + K G +V Sbjct: 339 -QIVILVTDGESND------EVQEVADRLKEDGVVV 367 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 61/174 (35%), Gaps = 17/174 (9%) Query: 196 TRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGST 252 RS+R L I + S + +R GL +S F L+ + + I + F Sbjct: 47 ARSVRNFLYILVNSFNVSSETIRVGLAKYSDVPHSEFLLSTYHRKGDVLRHIRQFQFKPG 106 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + A I D + ++ + + + ++ G ++L Sbjct: 107 ---GKKMGLALKFILDHHFQEASGSRASQEVPQIAVVISSGPVEDHVHGPAKAL------ 157 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 ++ G ++YAIGV+ + + +SP +N F + K R Sbjct: 158 RKAGILLYAIGVRDAVWAELREIASSPQ-----ENFTSFVPNFSGLSNLAQKLR 206 >gi|194209663|ref|XP_001495019.2| PREDICTED: similar to collagen, type XXVIII [Equus caballus] Length = 1127 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 63/179 (35%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D + + V ++ ++ + FSS Sbjct: 47 IDVVFIVDSSES------SKIVLFDKQKDFVDSLSDKVFQLTPVRSLKYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++++ + G T S + A + K Sbjct: 101 SVQIDSPFSSWKDLQTFKQRVKSMNLIGQGTFSYYAISNATRLLQREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K + +TDG + N D + +A+ G + IG+ + L + Sbjct: 153 -VKVALLMTDGIDHPKNPDVQS---ISEDARTAGILFITIGLSTVVNEAKLHLISGDSS 207 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 59/164 (35%), Gaps = 26/164 (15%) Query: 165 SDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L+++ V+D S S+ ++F + + T + LD R G++ + Sbjct: 793 KETPLELLFVIDSSESVGPENFQIIKNFVKTLTDQVALDLDT----------ARIGIINY 842 Query: 224 SSKIVQTFPLAWGVQH--IQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S K+ L + ++ + G T + L A N +F+A Sbjct: 843 SHKVEMVAHLTQFSSKDDFKLAVDNMQYLGEGTYTATALHAA-NHMFEAARPG------- 894 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 KK + +TDG+ D K A ++ IGV Sbjct: 895 --VKKVALVITDGQTD--TRDEKNLTEVVKNASDASVEIFVIGV 934 >gi|161408065|dbj|BAF94136.1| Dual Intracellular Von Willebrand factor domain A [Homo sapiens] Length = 276 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 51/156 (32%), Gaps = 28/156 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + ++ + D VR GL ++ I Sbjct: 125 DIVFLVDSSTSIGPQ------NFQKVKNFLYSVILGLDISSDR---VRVGLAQYNDNIYP 175 Query: 230 TFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYA-YNKIFDAKEKLEHIAKGHDDY 283 F L ++ I I T + LE+ N + + Sbjct: 176 AFQL--NQHPLKSMILEQIQNLPYRTGGTNTGSALEFIRTNYLTEESGSRAKDRVP---- 229 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + +I +TDGE++ E + K G +V Sbjct: 230 -QIVILVTDGESND------EVQEVADRLKEDGVVV 258 >gi|149732356|ref|XP_001492551.1| PREDICTED: similar to complement component 2 [Equus caballus] Length = 751 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 82/217 (37%), Gaps = 28/217 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+++ G+ S M+D I S V Sbjct: 244 KIQIQRSGHLNLYLLLDASQSVSEK------DFGIFKNSAILMVDRIFSFEIN---VSVA 294 Query: 220 LVTFSSKI-VQTFPLAWGVQHIQEKINRL--------IFGSTTKSTPGLEYAYNKIFDAK 270 ++TF+S+ + L + + E IN L G+ T + L Y + + Sbjct: 295 IITFASRPRIVMSVLHHNSRDVMEVINSLDNIHYKDHENGTGTNTYEALNSVYIMMNNQM 354 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + +I LTDG+++ DN + L + + +YAIGV Sbjct: 355 QRLGMNTVAWQEIRHAVILLTDGKSNMGGSPKLAVDNIKELLNIKQKRNDYLDIYAIGVG 414 Query: 326 AEAAD-----QFLKNCASPDRFYSVQNSRKLHDAFLR 357 D + + ++++ L F Sbjct: 415 NLDVDWRELNELGSKKDGERHAFILKDAEALSQVFEH 451 >gi|94986631|ref|YP_594564.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] gi|94730880|emb|CAJ54243.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] Length = 530 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 50/321 (15%), Positives = 112/321 (34%), Gaps = 43/321 (13%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L + ++ + +G +Q S + + + + L + Sbjct: 249 LQATEEEMPTSMGKQLSDMISGQCSPQQHKGMSVAVTGKLLTAELPDAL-----IMEAQA 303 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 I R+ + Q + S+ SR+ C + H P L +K S + I Sbjct: 304 ISRALRTKLQGLLQSQILRRSSPSRHG---KLCGHGLYRIAVHDPRLF---MKTESVTGI 357 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + ++LD+S SM + +A + + + +IP ++ G+ F + Sbjct: 358 DTAVHILLDISGSM-------TSCIELAGAACYSVALALAAIPGIS----VGVSAFPADY 406 Query: 228 VQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + + L+ T S + + +A + + +++K Sbjct: 407 ---------KEDVAATVYPLLRHGKRITNSFAAEAHGSTPMTEALWWVLGMLSTRPEHRK 457 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +TDG D + + AKR G V IG+ A A + + + Sbjct: 458 IVFVVTDGYPD----DPETAKETIAVAKRMGIEVLGIGIDAPA---IISMIPGSEN---I 507 Query: 346 QNSRKLHDAFLRIGKEMVKQR 366 + R+L A R+ ++++ ++ Sbjct: 508 TDIRELAPAMFRLLQQIMTEK 528 >gi|22127459|ref|NP_670882.1| hypothetical protein y3585 [Yersinia pestis KIM 10] gi|108809186|ref|YP_653102.1| hypothetical protein YPA_3195 [Yersinia pestis Antiqua] gi|108810627|ref|YP_646394.1| hypothetical protein YPN_0462 [Yersinia pestis Nepal516] gi|150260370|ref|ZP_01917098.1| hypothetical protein YPE_2671 [Yersinia pestis CA88-4125] gi|218927791|ref|YP_002345666.1| hypothetical protein YPO0594 [Yersinia pestis CO92] gi|229840484|ref|ZP_04460643.1| hypothetical protein YPH_2826 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842972|ref|ZP_04463123.1| hypothetical protein YPF_1320 [Yersinia pestis biovar Orientalis str. India 195] gi|229900820|ref|ZP_04515944.1| hypothetical protein YP516_0479 [Yersinia pestis Nepal516] gi|270487812|ref|ZP_06204886.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294502679|ref|YP_003566741.1| hypothetical protein YPZ3_0569 [Yersinia pestis Z176003] gi|21960553|gb|AAM87133.1|AE013961_5 hypothetical [Yersinia pestis KIM 10] gi|108774275|gb|ABG16794.1| hypothetical protein YPN_0462 [Yersinia pestis Nepal516] gi|108781099|gb|ABG15157.1| hypothetical protein YPA_3195 [Yersinia pestis Antiqua] gi|115346402|emb|CAL19274.1| conserved hypothetical protein [Yersinia pestis CO92] gi|149289778|gb|EDM39855.1| hypothetical protein YPE_2671 [Yersinia pestis CA88-4125] gi|229682159|gb|EEO78251.1| hypothetical protein YP516_0479 [Yersinia pestis Nepal516] gi|229690038|gb|EEO82096.1| hypothetical protein YPF_1320 [Yersinia pestis biovar Orientalis str. India 195] gi|229696850|gb|EEO86897.1| hypothetical protein YPH_2826 [Yersinia pestis biovar Orientalis str. PEXU2] gi|270336316|gb|EFA47093.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294353138|gb|ADE63479.1| hypothetical protein YPZ3_0569 [Yersinia pestis Z176003] gi|320016846|gb|ADW00418.1| hypothetical protein YPC_3986 [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 327 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 58/197 (29%), Gaps = 14/197 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L ++ +++ +K P ++ F+ Sbjct: 3 RLPIFFVLDCSESMIGE------NLKKMNDGLQMIINDLKKDPHALETAWISVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL V+ + RL G T L+ +I K KG + Sbjct: 57 KTIVPL---VEVVSFYPPRLPIGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 LTDG P D + R + AIG+ A L+ ++ Sbjct: 112 YLLTDG---RPTDDTTAEITRWKTHYARKVNLIAIGLGPSADLNILRQLTENVLLFNDTQ 168 Query: 348 SRKLHDAFLRIGKEMVK 364 I + Sbjct: 169 EGDFTQFIKWITASVSA 185 >gi|90406741|ref|ZP_01214934.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] gi|90312194|gb|EAS40286.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] Length = 404 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 58/383 (15%), Positives = 125/383 (32%), Gaps = 36/383 (9%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +GSISI+ LLP + ++ L I T+ + D S + A + Sbjct: 6 KAQQGSISIVFIFLLPAMLAMLALSILTAMYLLSVTRASQASDVSSIACAYSQRANVSLT 65 Query: 71 NGKKQ--KNDFSY--RIIKNIWQTDFRNELREN-GFAQDINNIERSTSLSIIIDDQHKDY 125 G Q K +F + +++ F + ++ ++S + + Sbjct: 66 QGFAQYYKPNFISHVNAQSTFLSGQKQCKIQIGYAFTPLLKDLLPASSQNKVHASVQIQS 125 Query: 126 NLSAVSRYEMPFIFCTFPWCAN----------SSHAPLLITSSVKISSKSDIGLDMMMVL 175 + E+ + + + I + + ++K++ + +V Sbjct: 126 TSTLTVHSEIKPMDLSLVLDISGSMSGRIGLLKRIINQAIQNIEQQNTKNNTQIRFSIVP 185 Query: 176 DVSL-SMNDHFGPGMDKLGV----ATRSIREMLDIIKSIPDVNNV-----VR----SGLV 221 S S+++ K A +L+ +++ D++ +R L+ Sbjct: 186 FSSGVSISNAPWLAKSKGKALCVDAMSYPGNVLNTAQTVADIDTHPSKLNIRAKEPLSLI 245 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH-IAKGH 280 + PL + +++ ++ L +T S G + + +K + + Sbjct: 246 NDCNVYSLLLPLTNNLSKVRKHVDSLSILGSTASYQGFIWGVRTLLPNWQKAWNLQPETS 305 Query: 281 DDYKKYIIFLTDGENSSPN-IDNKESLFYCNEAKRR-GAIVYAIGVQAEAA--DQFLKNC 336 + +I TDGE+ S + D C + + IG DQF K Sbjct: 306 SLLSQRLILFTDGEDDSRDQFDKLVRSGMCQRIQDDFNIDISFIGFGLSPRRLDQFKKCI 365 Query: 337 ASPDR--FYSVQNSRKLHDAFLR 357 S + Y +N L F Sbjct: 366 GSNGKGVVYDAKNGSDLEKFFAE 388 >gi|81892746|sp|Q6Q473|CLCA4_MOUSE RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel regulator 6; Short=mClca6; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|50882459|gb|AAS86332.2| calcium activated chloride channel [Mus musculus] Length = 924 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 70/200 (35%), Gaps = 36/200 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M +VLDVS SM D+L ++ + L I + N G+V FSS+ Sbjct: 308 MCLVLDVSGSM-----TSYDRLNRMNQAAKYFLSQI-----IENRSWVGMVHFSSQATIV 357 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + T G++ A+ + + + + Sbjct: 358 HELIQINSDIERNQLLQTL-PTSANGGTSICSGIKAAFQVFKNGEYQTDGTE-------- 408 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYS 344 I+ L+DGE+S+ +E K G+IV+ I + A + Sbjct: 409 -ILLLSDGEDSTAKD-------CIDEVKDSGSIVHFIALGPLADLAVTNMSILTGGNHKL 460 Query: 345 VQNSRK---LHDAFLRIGKE 361 + + L DAF + E Sbjct: 461 ATDEAQNNGLIDAFGALASE 480 >gi|296327481|ref|ZP_06870027.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] gi|296155307|gb|EFG96078.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] Length = 530 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 57/324 (17%), Positives = 109/324 (33%), Gaps = 33/324 (10%) Query: 57 LYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 +YT I+++ G Q N+ +Y +IW D + EL+ A D E ++ I Sbjct: 74 VYTYEAIVDEAGGLYQSPQPNEDNYVKKHDIWTEDVQKELKTIKPALD----ENASEEEI 129 Query: 117 IIDDQHKDYNLSAVSRYEMPF-IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL 175 Y + PF F + + T + ++ +++ +VL Sbjct: 130 QHLFNQFLYIVGYDYT---PFETIDRFSYVIFKNDMENPFTHE---KIEENMNVNVEIVL 183 Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF---SSKIVQTFP 232 D S SM G + +A SI+++L + + + G+ F Sbjct: 184 DASGSMVKKIGDK-TMMEIAKESIKKVLSEMPANA------KVGIRVFGHKGDNTASKKD 236 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLE-YAYNKIFDAKEKLEHIAK--GHDDYKKYIIF 289 + G + I L K+ ++ + I + E K + + Sbjct: 237 ESCGSNELIYPIGDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDLKALDGEKTLNILYI 296 Query: 290 LTDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAAD-QFLKNC--ASPDRFYS 344 +TDG + + + K ++ IG +A + LK A+ + S Sbjct: 297 ITDGIETCGG----NPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQIADAAGGYYSS 352 Query: 345 VQNSRKLHDAFLRIGKEMVKQRIL 368 V ++ KL RI + Sbjct: 353 VNDANKLTGELYRINELAFSDYKW 376 >gi|170015985|ref|NP_001116166.1| complement component 2 [Xenopus laevis] gi|169642443|gb|AAI60743.1| LOC734198 protein [Xenopus laevis] Length = 662 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 86/240 (35%), Gaps = 31/240 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + N P ++KI+ D L++ +LD S S+ D + Sbjct: 133 FKASLSGILNIMERPASFGRTIKIT--KDGILNVYFLLDASQSVG---QANFDIYKACSE 187 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKI-NRLIFG-- 250 + + L + ++ G++++++ P+ + H+ I N L + Sbjct: 188 YLVDELALFDM------TIQFGIISYATVPKVIIPIYDEESDNNDHVLTLIRNGLKYSDH 241 Query: 251 ---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP-----NIDN 302 + T + LE Y+ + KE ++ + + + II LTDG+ + I Sbjct: 242 KDKTGTNTKAALEEIYSMMSSQKETYKNESVWNSIHH-IIILLTDGKANLGGRPAHTIKR 300 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS----PDRFYSVQNSRKLHDAFLRI 358 E + VY G+ E L AS + ++++ ++ F +I Sbjct: 301 IEDFLDIKHKREDYLDVYTFGIGPEVDMADLSEMASKKDGETHVFRMESANEMKTVFQKI 360 >gi|82619290|gb|ABB85337.1| complement C2 [Xenopus laevis] Length = 753 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 86/240 (35%), Gaps = 31/240 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + N P ++KI+ D L++ +LD S S+ D + Sbjct: 224 FKASLSGILNIMERPASFGRTIKIT--KDGILNVYFLLDASQSVG---QANFDIYKACSE 278 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKI-NRLIFG-- 250 + + L + ++ G++++++ P+ + H+ I N L + Sbjct: 279 YLVDELALFDM------TIQFGIISYATVPKVIIPIYDEESDNNDHVLTLIRNGLKYSDH 332 Query: 251 ---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP-----NIDN 302 + T + LE Y+ + KE ++ + + + II LTDG+ + I Sbjct: 333 KDKTGTNTKAALEEIYSMMSSQKETYKNESVWNSIHH-IIILLTDGKANLGGRPAHTIKR 391 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS----PDRFYSVQNSRKLHDAFLRI 358 E + VY G+ E L AS + ++++ ++ F +I Sbjct: 392 IEDFLDIKHKREDYLDVYTFGIGPEVDMADLSEMASKKDGETHVFRMESANEMKTVFQKI 451 >gi|268574618|ref|XP_002642288.1| C. briggsae CBR-CUT-6 protein [Caenorhabditis briggsae] gi|187025290|emb|CAP35757.1| CBR-CUT-6 protein [Caenorhabditis briggsae AF16] Length = 357 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 81/235 (34%), Gaps = 30/235 (12%) Query: 130 VSRYE--MPFIFCTFPWCANSSHAPL---LITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + RY+ +P IF P+ L+ S + + ++++++LD S S+ D Sbjct: 4 IPRYDFIIP-IFTFLSLIPTHLANPIDNGLVDSELIHECVTHKAVEVILLLDASGSIGDD 62 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQTFPLAWGVQHIQEK 243 + S + + + + + + FS + L W +Q I+ + Sbjct: 63 TFKKQLSFAMHLASRLNISEEGSHMALI-QYAETPKLEFSLGQFNHPTQLEWAIQRIEYQ 121 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 T + L K K I +TDG++ + Sbjct: 122 ------SGATNTGQALRLTLEKGLQGARTGI---------PKVAIVITDGQSQDDVSEPS 166 Query: 304 ESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + L + +VYAIGV Q + +P R ++V++ +L A Sbjct: 167 QLLRDAD------VMVYAIGVTNLVNVHQLHQMTGNPVRVFTVESFEQLDRALAD 215 >gi|301612325|ref|XP_002935678.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Xenopus (Silurana) tropicalis] Length = 524 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + ++ + +F K Sbjct: 221 DMVIIVDVSGSVSGL------TLKLMKTSVMEMLDTL------SDDDYVTVASFHEKADP 268 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + I+E + ++ TT G EYA++++ + + Sbjct: 269 VSCFRQLVQANVRNKKVIKEAVQEMVARGTTDYKAGFEYAFSQLQNTSITRANCN----- 323 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F + V+ V D L+ CA+ Sbjct: 324 --KMIMMFTDG-----GEDRVQDVFEKYNWPNKTVRVFTFSVGQHNYDVTPLQWMACANK 376 Query: 340 DRFYSV 345 ++ + Sbjct: 377 GYYFEI 382 >gi|291395815|ref|XP_002714336.1| PREDICTED: complement component 2 isoform 2 [Oryctolagus cuniculus] Length = 613 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 85/217 (39%), Gaps = 28/217 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+++ + S M+D I S + Sbjct: 112 KIQIQRSGHLNLYLLLDASQSVSEE------DFKIFKDSATHMVDRIFSFEINLS---VA 162 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAK 270 ++TF+S+ L + + E IN L G+ T L + + Sbjct: 163 VITFASQPKVIMSVLHDNSRDMTEVINSLENAKYTDHENGTGTNIYKALNAVNIMMNNQM 222 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 ++L G + + II LTDG+++ P +D+ + + E + +YAIGV Sbjct: 223 QRLGMETTGWQEIRHAIILLTDGKSNMGGSPKPAVDSIKDVLNIKEKRNDYLDIYAIGVG 282 Query: 326 -AEAADQFLKNCASPD----RFYSVQNSRKLHDAFLR 357 + + L AS + +Q++R L F Sbjct: 283 KLDVDWRELNELASKKDGERHAFILQDARALQQVFEH 319 >gi|291395813|ref|XP_002714335.1| PREDICTED: complement component 2 isoform 1 [Oryctolagus cuniculus] Length = 744 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 85/217 (39%), Gaps = 28/217 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+++ + S M+D I S + Sbjct: 243 KIQIQRSGHLNLYLLLDASQSVSEE------DFKIFKDSATHMVDRIFSFEINLS---VA 293 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAK 270 ++TF+S+ L + + E IN L G+ T L + + Sbjct: 294 VITFASQPKVIMSVLHDNSRDMTEVINSLENAKYTDHENGTGTNIYKALNAVNIMMNNQM 353 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 ++L G + + II LTDG+++ P +D+ + + E + +YAIGV Sbjct: 354 QRLGMETTGWQEIRHAIILLTDGKSNMGGSPKPAVDSIKDVLNIKEKRNDYLDIYAIGVG 413 Query: 326 -AEAADQFLKNCASPD----RFYSVQNSRKLHDAFLR 357 + + L AS + +Q++R L F Sbjct: 414 KLDVDWRELNELASKKDGERHAFILQDARALQQVFEH 450 >gi|113475854|ref|YP_721915.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166902|gb|ABG51442.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 460 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 86/250 (34%), Gaps = 34/250 (13%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP---FIFCTFPWCANSSHAPLLITSSVK 160 +I I + ++ + + K YN + + F T + + Sbjct: 50 EITRIITAEKNTVKM--RIKVYNKAGNFNPNLQESDFSVETISEFGSKKTIKPTVILPTD 107 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMD---KLGVATRSIREMLDIIKSIPDVNNVVR 217 +K+ D++++LD+S SM K A +I + +D P++ VR Sbjct: 108 TRAKT-TPADIIIMLDMSGSMKFRDSSPGRRRIKFKGAINAIYKFIDAANDKPNL--TVR 164 Query: 218 SGLVTF----------SSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYN 264 GL F + + F + + ++EKI L ++T LE A Sbjct: 165 IGLAPFGKGGNQFKVSNKSLDANFYPS-NSEKLKEKIEELANQELSASTNLYQPLETAVK 223 Query: 265 KIF---DAKEKLEHIAKGHDDYKKYIIFLTDG------ENSSPNIDNKESLFYCNEAKRR 315 + ++ + + +I L+DG + + +++ +++ Sbjct: 224 YLINSVNSTSDSNNKTDDSQSKQLVVIVLSDGFHNHDRDTEEIQFERLKNILQPQDSQMP 283 Query: 316 GAIVYAIGVQ 325 V+ +G Sbjct: 284 KVKVHTLGYG 293 >gi|256374467|ref|YP_003098127.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918770|gb|ACU34281.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 564 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 56/338 (16%), Positives = 107/338 (31%), Gaps = 30/338 (8%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNI--WQTDFRNELRENGFAQDI 105 L + ++L + + + I + +GF Q + Sbjct: 237 LFTASEQAVLAANRAAGGLRVAASYPAEGTMLLDYPVVRIKRASDQPGTGVAASGFEQAL 296 Query: 106 NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS 165 + + + + D +A E N+ P S + + Sbjct: 297 RSAKTRERF-VDAGFRTPDGQAAAGLSAE---RDGVGGDAVNAMPKPSPAEVSELLGTWG 352 Query: 166 DIGLD--MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + LD M+ VLDVS SM + G G ++ A+ + L +PD + GL F Sbjct: 353 AVSLDSRMLAVLDVSGSMTELMGNGQTRMAAASEAALTAL---GMLPDTSE---IGLWAF 406 Query: 224 SSKI------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 S+ V+ PL + + R K L + D Sbjct: 407 STNKRPPNDWVELVPLGPLGEVLGSAPRRTRLQQGAKGLAALVGGGTALNDTTLAAFRRM 466 Query: 278 KGHDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKR---RGAIVYAIGVQAEAADQF 332 + D +K ++ +TDG N L E++ R + +G+ EA + Sbjct: 467 QSTYDPEKINSVVLITDGRNDDYASITTAQLLQALESESDPARPIPLIMVGLGQEADMEA 526 Query: 333 LK--NCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 L+ + A+ + Y + + L + ++R Sbjct: 527 LQEISSATGGKAYQALEAADIRSVLLD---AISQRRCR 561 >gi|220905850|ref|YP_002481161.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219862461|gb|ACL42800.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 236 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 60/188 (31%), Gaps = 18/188 (9%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS-IPDVNNVVRSGLVTFSSK 226 L + + D S SM + L A + ML + P+ +VR + FS Sbjct: 19 PLHFIWLCDCSGSMVS--QGKIQSLNAAIKETIPMLQQTAADNPNAQVLVR--AIKFSDG 74 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P V + L G T LE E+L Sbjct: 75 AEWHIPTPTPVDQFRWT--DLTAGGVTDLGMALEMV-------AEQLRVPPMSERALPPV 125 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 ++ ++DG+ + ++L ++ V AI V +A + L+ P R Sbjct: 126 LVLISDGQPTDDFGSGLKALM-AQPWGQKAVRV-AIAVGQDANHEVLQKFIGPSELRVLQ 183 Query: 345 VQNSRKLH 352 N +L Sbjct: 184 ANNPDQLV 191 >gi|297716667|ref|XP_002834627.1| PREDICTED: collagen alpha-2(VI) chain-like, partial [Pongo abelii] Length = 279 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 65/189 (34%), Gaps = 24/189 (12%) Query: 145 CANSSHAPLLITSSVKIS--SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 C S +P+ + K LD++ V+D S S+ ++ Sbjct: 47 CCGSEVSPVPLNDPATPPDCEKRCGALDVVFVIDSSESIG---YTNFTLEKNFVINVVNR 103 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSK-IVQTFPLAWGV----QHIQEKINRLI-FGSTTKST 256 L I P R G+V +S + + L +E + L T + Sbjct: 104 LGAIAKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTP 163 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L++AY+++ + + + + +TDG + P D+ C+ R Sbjct: 164 SALKFAYDRLIKESRRQKTRV--------FAVVITDGRH-DPRDDDLNLRALCD----RD 210 Query: 317 AIVYAIGVQ 325 V AIG+ Sbjct: 211 VTVTAIGIG 219 >gi|68448495|ref|NP_001020335.1| inter-alpha (globulin) inhibitor H4 [Danio rerio] gi|67677852|gb|AAH96879.1| Zgc:112265 [Danio rerio] Length = 915 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 65/197 (32%), Gaps = 31/197 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 ++ ++D S SM+ ++ ++ +L + GL+TF ++I Sbjct: 271 VVFIIDRSGSMHG------RRIRQTRSALLTILKDLDEDDHF------GLITFDAEIDFW 318 Query: 229 ---QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ + + R+ T + + I K Sbjct: 319 KRELLQATKANRENAESFVKRIQDRGATNINDAVLAGVDMINRNPRKGTAS--------- 369 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASPD 340 +I LTDG+ ++ + ++ + EA +Y +G + F L+N A Sbjct: 370 ILILLTDGDPTAGETNIEKIMANVKEAIGSKFPLYCLGFGYDVNFDFLTKMSLENNAVAR 429 Query: 341 RFYSVQNSRKLHDAFLR 357 R Y ++ F Sbjct: 430 RIYEDSDADIQLQGFYD 446 >gi|330798620|ref|XP_003287349.1| hypothetical protein DICPUDRAFT_94383 [Dictyostelium purpureum] gi|325082616|gb|EGC36092.1| hypothetical protein DICPUDRAFT_94383 [Dictyostelium purpureum] Length = 559 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/252 (13%), Positives = 84/252 (33%), Gaps = 12/252 (4%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM--M 172 +I I + + + + +++ + ++ + ++ ++ M Sbjct: 80 NINIPENNTTIAETNSGNAGVSITTFQNSTSNINNNNNSSVNNNEIKNQNGNVTINCIDM 139 Query: 173 MVLDVSLSM-----NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +VLD+S SM PG ++ + I + GLV F S Sbjct: 140 IVLDLSGSMRLSAFKGSKVPGELEMTRIEFAQAIFQTFIDKMVSYELSAACGLVCFGSSA 199 Query: 228 VQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 TF + ++ + T + A I D + + Sbjct: 200 QLTFGITRNFDSFSNELGEIQANMGNTHLWEAIILAAKTIVDFRNNPNIKLAAPEKLLCR 259 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDRFYSV 345 + L+DGE++S + ++ + K+ ++ I + E + + A+ + Sbjct: 260 VFCLSDGEDNSNSSTMLDAY---DYLKKNNVVLDCIPIGLEGRSRLSALSTATGGSCFIA 316 Query: 346 QNSRKLHDAFLR 357 +S++ + F R Sbjct: 317 DSSQEGVELFER 328 >gi|320158392|ref|YP_004190770.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] gi|319933704|gb|ADV88567.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] Length = 442 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 74/224 (33%), Gaps = 19/224 (8%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G I+ +LPV+ I+M ++ + + AK+ + + L + N Sbjct: 20 KKQQGVAGIIFMGMLPVLVIIMVFSMQMTQRHMAHAKITEAAEVASLALIASPKEGDEKN 79 Query: 71 NGKKQK--NDFSYRIIKNIWQTDF-RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 QK + + + F R ++G Q + T + +H + Sbjct: 80 QEYAQKIVDHYIPDNKGEVVARVFNRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDSWIS 139 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-G 186 + F + P LD+ ++D+S SM + + G Sbjct: 140 YNDGEMGLTKDFEVMGTSTSRKFLPQP--------------LDIYFIIDMSGSMVNPWGG 185 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 G K V +I ++D ++ R ++ F V+ Sbjct: 186 SGKTKYDVVADTINRIVDDLREFKTDRKS-RVAVIGFHHTAVKK 228 >gi|312072922|ref|XP_003139287.1| von Willebrand factor domain-containing protein [Loa loa] gi|307765555|gb|EFO24789.1| von Willebrand factor domain-containing protein [Loa loa] Length = 2142 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 80/198 (40%), Gaps = 45/198 (22%) Query: 157 SSVKISSKSDIG-----LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 SS +I SK+ I +D++ ++D S S+ F + ++ + +IK IP Sbjct: 884 SSAEIKSKAQIQDPNCLVDLIFIVDTSQSVEKTF----------QKQLQFAVTLIKQIPP 933 Query: 212 --VNNVVRSGLVTFSSKIVQTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 NN +R ++FSS+ F + + ++ G T S G+ A +I Sbjct: 934 SAFNNRIRVAAISFSSEAQINFQFNEFNNRTEILNALLSLTHSGGNTSSVSGINLAIKEI 993 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + +G +D ++ I+ ++DG + ++ L + IVYAI Sbjct: 994 LE---------RGREDVRRMIVLMSDGNSQDC---WEDLLDASDRLHATNTIVYAIA--- 1038 Query: 327 EAADQFLKNCASPDRFYS 344 A+PD ++ Sbjct: 1039 ----------ANPDYYFR 1046 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 77/197 (39%), Gaps = 23/197 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--K 226 +D+M+ LD S S+ F D+ +A +++D + + + ++ ++ F+S + Sbjct: 122 IDIMIALDSSGSV---FNVFEDERKLA----HDLIDSLVPVTLKDGRIQVSVMRFASSAE 174 Query: 227 IVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 +V F ++ I EK++++ F G +T+ ++ A + + + Sbjct: 175 VVIPFKISRTPNEIMEKLDKIKFTGGSTRIAKAVDLALTDLSRWRRNDAI---------Q 225 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPDRFYS 344 I ++DG N ++ + + V+A+ V + ++ + R Y Sbjct: 226 IFILISDG-NGHELWHVAQTAG--RKLQNANIEVFAVPVSQDHNLNELILYTGDAKRVYV 282 Query: 345 VQNSRKLHDAFLRIGKE 361 + + + Sbjct: 283 GAKQSQFVHTISSLINK 299 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 69/188 (36%), Gaps = 25/188 (13%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + ++ + +D+++++D S P V+ + + E+L ++ I Sbjct: 631 IDAGFSPIIESIKAHNDPVDLIILVDTS-------TPADQDFEVSKKFLAELLRSLQVI- 682 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIF 267 D + VR L TF+ L ++I + +L S + ++ A +I Sbjct: 683 DFQSRVRISLTTFTDNAHIEIELRKPTAKENILYAVGKLQNEYSNASVSAAVDVALAQIS 742 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL-FYCNEAKRRGAIVYAIGVQA 326 E ++ + LTDG D+ +++ + ++ A VY + + Sbjct: 743 VPGEG---------PRQRIFVILTDGSTQ----DSMQTITTAAAKLRQTDAEVYVVPITE 789 Query: 327 EAADQFLK 334 + L Sbjct: 790 NYSKDELS 797 >gi|296474257|gb|DAA16372.1| complement component 2 precursor [Bos taurus] Length = 750 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 71/180 (39%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S++ D + S M+D I S V Sbjct: 245 KIQIQRSGHLNLYLLLDASQSVS------KDDFEIFKDSASRMVDRIFSFEIK---VSVA 295 Query: 220 LVTFSSKIVQTFPL----AWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK + + V ++ + + + G+ T L Y + + Sbjct: 296 IITFASKPKIIMSVLEDRSRDVTEVENSLRNINYKDHENGTGTNIYEALHAVYIMMNNQM 355 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 + + + II LTDG+++ DN + + N+ ++ +YAIGV Sbjct: 356 NRPHMNPGAWQEIRHAIILLTDGKSNMGGSPKVAVDNIKEVLNINQKRKDYLDIYAIGVG 415 >gi|198425808|ref|XP_002121992.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 431 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 82/219 (37%), Gaps = 36/219 (16%) Query: 154 LITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 T + + SK + LD++ +LD S S+ D++ +++ + D + Sbjct: 160 WSTDTKQADSKCPVEDLDLIFLLDGSGSVTVPDPLNFDRVKQWVKNVTDRFD-------I 212 Query: 213 NNVVRSGLVTFSSKIVQTF-------PLAWGVQHIQEK----INRLIFGS-TTKSTPGLE 260 + G++ +S + G Q + ++ + F TT + L Sbjct: 213 STFANVGVIQYSHYYETRTVQPYMKVEIGLGQYKTQAEFQMAVDSIQFQGFTTFTAHALN 272 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 + ++ +K I+ LTDG+++ + S + A+ G ++ Sbjct: 273 RTVEEFMNSTRY------SDPTTRKVIVLLTDGQSNDREFLEETSAY----ARGLGITIF 322 Query: 321 AIGVQAEAADQFLKNCASP-----DRFYSVQNSRKLHDA 354 A+GV+ + ++ L+ S +R + + L+ Sbjct: 323 AVGVEGYSEEE-LQIITSGELGNNERVFGLDTFSDLNKV 360 >gi|111120280|gb|ABH06325.1| complement component 2 precursor [Bos taurus] Length = 787 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 71/180 (39%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S++ D + S M+D I S V Sbjct: 245 KIQIQRSGHLNLYLLLDASQSVS------KDDFEIFKDSASRMVDRIFSFEIK---VSVA 295 Query: 220 LVTFSSKIVQTFPL----AWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK + + V ++ + + + G+ T L Y + + Sbjct: 296 IITFASKPKIIMSVLEDRSRDVTEVENSLRNINYKDHENGTGTNIYEALHAVYIMMNNQM 355 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 + + + II LTDG+++ DN + + N+ ++ +YAIGV Sbjct: 356 NRPHMNPGAWQEIRHAIILLTDGKSNMGGSPKVAVDNIKEVLNINQKRKDYLDIYAIGVG 415 >gi|71988814|ref|NP_499400.2| CUTiclin family member (cut-6) [Caenorhabditis elegans] gi|35210138|emb|CAA97806.2| C. elegans protein M142.2, confirmed by transcript evidence [Caenorhabditis elegans] Length = 572 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 75/229 (32%), Gaps = 24/229 (10%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 + C+ + L+ S + + ++++++LD S S+ D Sbjct: 8 ISLSITSFLSLILICSANPIDNGLVDSELIHECVTHKAVEVILLLDASGSIGDDTFKKQL 67 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQTFPLAWGVQHIQEKINRLIF 249 + S + + + + + + FS + L W +Q I+ + Sbjct: 68 SFAMHLASRLNISEDGSHMALI-QYAETPKLEFSLGQFNHPTQLEWAIQRIEYQ------ 120 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T + L K K I +TDG++ + + L Sbjct: 121 SGATNTGQALRLTLEKGLQGARPGI---------PKVAIVITDGQSQDDVSEPSQLLRDA 171 Query: 310 NEAKRRGAIVYAIGV-QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + +VYAIGV Q + +P R ++V++ +L A Sbjct: 172 D------VMVYAIGVTNLVNVHQLHQMTGNPVRVFTVESFEQLDRALAD 214 >gi|77735935|ref|NP_001029664.1| complement C2 precursor [Bos taurus] gi|115311857|sp|Q3SYW2|CO2_BOVIN RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|74267667|gb|AAI03358.1| Complement component 2 [Bos taurus] Length = 750 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 71/180 (39%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S++ D + S M+D I S V Sbjct: 245 KIQIQRSGHLNLYLLLDASQSVS------KDDFEIFKDSASRMVDRIFSFEIK---VSVA 295 Query: 220 LVTFSSKIVQTFPL----AWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK + + V ++ + + + G+ T L Y + + Sbjct: 296 IITFASKPKIIMSVLEDRSRDVTEVENSLRNINYKDHENGTGTNIYEALHAVYIMMNNQM 355 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 + + + II LTDG+++ DN + + N+ ++ +YAIGV Sbjct: 356 NRPHMNPGAWQEIRHAIILLTDGKSNMGGSPKVAVDNIKEVLNINQKRKDYLDIYAIGVG 415 >gi|313225343|emb|CBY06817.1| unnamed protein product [Oikopleura dioica] Length = 321 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 78/206 (37%), Gaps = 28/206 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D S S+ + +L K ++ G+ +S Sbjct: 135 DLVILIDGSWSVTP------TNFERVKIFLSALL---KHFSIGHDASMIGIAQYSDNPRL 185 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + +NR+ + G T + L +A + +F + + +K Sbjct: 186 EFGLNEHYDFPSLNAAVNRMKYKGGNTATGKALTFALDHVFGRSSRP--------NAQKV 237 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 ++ +TDGE+ + + G +++IGV E + LK+ A+ + + Sbjct: 238 VLIITDGESLQDTVT-----EPARRLRENGVEIFSIGVGDEINLEELKDMATDPDSNHVF 292 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILY 369 V + ++ K++ + ++ Sbjct: 293 QVGGYNAITGITTQVLKDICRIKVRV 318 >gi|47211020|emb|CAF94689.1| unnamed protein product [Tetraodon nigroviridis] Length = 2225 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 75/203 (36%), Gaps = 29/203 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ ++D S S+ D + + S LD I + +V FS + Sbjct: 1265 ARADLAFLVDGSWSIGDDNFLKITRF---LYSAVGALDRI-----GPEGTQVAIVQFSDE 1316 Query: 227 IVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + E I+ + + G TK+ +++ + +F + Sbjct: 1317 PRTEVQLKSYRKKERLLEAISSISYKGGNTKTGRAIQHMKDSVFTEEGGAR------TAV 1370 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--R 341 K ++ LTDG + + E +++G +V+AIG + + + P Sbjct: 1371 PKVLVLLTDGRSQD------DVSKVSKELQKQGFVVFAIGFADADYGELVNVASKPSSTH 1424 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 + V + DAF + +E+V Sbjct: 1425 VFFVDD----LDAFKEMEEELVA 1443 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 81/215 (37%), Gaps = 30/215 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVT 222 +++ DM+ ++D S S+ R +R L+ + DV+ + R GL Sbjct: 2 QTEAVADMVFLVDGSWSIG----------RTNFRLVRVFLESLVKAFDVDLDRTRIGLAQ 51 Query: 223 FSSKIVQTFPLAWGVQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 FS + + L +E + R + + GL A I + E ++ Sbjct: 52 FSGEPRIEWHL--NTHTTKEAVMEAARNLPYKGGNTLTGL--ALTFILENSFSPESGSRP 107 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 K + LTDG++ + + + G V+AIGV+ + L+ AS Sbjct: 108 GIP--KIGVLLTDGKSQD------DVIPPAQRLRDAGVEVFAIGVKNADEGE-LRAIASV 158 Query: 339 --PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 Y+V + + D + + + ++ +K Sbjct: 159 SEDTHVYNVADFHLMADIVDVLTRTICERMETLHK 193 >gi|52082207|ref|YP_080998.1| YwmC protein [Bacillus licheniformis ATCC 14580] gi|52787598|ref|YP_093427.1| YwmC [Bacillus licheniformis ATCC 14580] gi|319648080|ref|ZP_08002297.1| YwmC protein [Bacillus sp. BT1B_CT2] gi|52005418|gb|AAU25360.1| YwmC [Bacillus licheniformis ATCC 14580] gi|52350100|gb|AAU42734.1| YwmC [Bacillus licheniformis ATCC 14580] gi|317389715|gb|EFV70525.1| YwmC protein [Bacillus sp. BT1B_CT2] Length = 228 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 69/208 (33%), Gaps = 32/208 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS----GLVTFSSK 226 + ++LD S SM G G+ K +A + + N ++R G S K Sbjct: 39 VAIMLDASGSMAKKIG-GVSKYELAKNEAFSF---GSKLENANVLMRVFGSEGNNKNSGK 94 Query: 227 IVQTFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + + Q + +N + T L+ A N + + Sbjct: 95 VQSCNAIRGVYGFQTYDEQSFRNSLNGIGPTGWTPIANALQDAKNAL----------DQL 144 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLK-NC 336 ++ K + LTDGE + + E ++ A+V IG E Q Sbjct: 145 DNNGKNVVYLLTDGEETCGG----NPVKVATELRKSNAVVNVIGFDYEGDFHGQLTSIAA 200 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A ++ + + F + E+ K Sbjct: 201 AGGGEYFQAKTKNDIKRIFTQEAIELSK 228 >gi|311252831|ref|XP_003125289.1| PREDICTED: vitrin-like isoform 1 [Sus scrofa] Length = 656 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 69/198 (34%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 473 DIGFVIDGSSSV------GTGNFRTVLQFVANL---SKEFDISDTDTRVGAVQYTYEQRL 523 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + I R+ + T + + YA ++F K + +K Sbjct: 524 EFGFDQYTTKPDVLNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKRKL 574 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 +I +TDG + + A +G I YAIGV A ++ P D + Sbjct: 575 MILITDGRSYD------DVRIPAMVAHHKGVITYAIGVAWAAQEELEIIATHPARDHAFF 628 Query: 345 VQNSRKLHDAFLRIGKEM 362 V L+ + +I + + Sbjct: 629 VDEFDNLYKSVPKIIQNI 646 >gi|66472570|ref|NP_001018424.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|63100652|gb|AAH95235.1| Zgc:110377 [Danio rerio] Length = 868 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 57/176 (32%), Gaps = 21/176 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-----S 225 ++ V+D S SM + K+ ++ +L ++ ++ FS Sbjct: 257 VVFVIDNSYSMYGN------KMAQTKEALGTIL------GELPEDDYFAIIVFSTTFVVW 304 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + V+ QE + + T+ + ++ A+ + Sbjct: 305 RPYLSKATEENVKEAQEYVKTIEVIGGTELHDATIHGVEMLYAAQRN----GTAPKNMVL 360 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +I LTDG+ + E +A ++ + +A FL + + Sbjct: 361 MMILLTDGQPNQYPRSLPEIQESIRKAIDGNITLFGLAFGNDADYGFLDTLSKQNN 416 >gi|218709385|ref|YP_002417006.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] gi|218322404|emb|CAV18557.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] Length = 1883 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 37/185 (20%), Positives = 72/185 (38%), Gaps = 20/185 (10%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ + +S ++ ++ ++LD S KL + ++ +MLD + D Sbjct: 1236 PVAKNIDISVSPETKSNTNVQLILDTS---GSMSNSSNGKLAIMKAAVSKMLDQYHDMGD 1292 Query: 212 VNNVVRSGLVTFSSKIV--QTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 VR L+ F+S+ + AW V + +NRL G T ++ A Sbjct: 1293 ----VRVQLIDFNSRSTRLEFNGRAWMTVSEAKYLVNRLTAGGGTDYDDAVKKARQSWDH 1348 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID----NKESLFYCNEAKRRGAIVYAIGV 324 EH+ + + Y F++DG+ + D + E + N G +IG+ Sbjct: 1349 D----EHLQLDNANNVSY--FISDGKPQDGHDDATISDNEETKWANHLISNGITSQSIGI 1402 Query: 325 QAEAA 329 + + Sbjct: 1403 NSSGS 1407 >gi|47220812|emb|CAG00019.1| unnamed protein product [Tetraodon nigroviridis] Length = 1557 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 76/203 (37%), Gaps = 29/203 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D++ ++D S S+ D + + S LD+I + + + + FS Sbjct: 886 KEAKADLVFLVDGSWSIGDENFMKITRF---LHSTVGSLDLIGT-----DGTQVAIAQFS 937 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F L+ + + E I ++ + G TK+ +++ IF + Sbjct: 938 DDARTEFQLSSHSNKEALLEAIQKISYKGGNTKTGRAIKHVKESIFSLEAGARRG----- 992 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD- 340 K ++ LTDG + + E + G I++AIG + + + P Sbjct: 993 -VPKVLVVLTDGRSQD------DVNKVSKEMQMDGYIIFAIGFADADYGELVNIASKPSD 1045 Query: 341 -RFYSVQNSRKLHDAFLRIGKEM 362 + V + DA +I +++ Sbjct: 1046 RHVFFVDD----LDAVKKIEEQL 1064 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 67/161 (41%), Gaps = 17/161 (10%) Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 V GL +S + L A+ + + + + L + T A N I + K Sbjct: 83 VFLGLAQYSGDPRIEWHLNAYSTKDAVIDAVRNLPYKGGNTLTG---LALNFILENCFKP 139 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E ++ K I +TDG++ + ESL + G ++AIGV+ ++ L Sbjct: 140 ESGSREGLP--KIGILITDGKSQDDVVPPAESL------RNAGIELFAIGVKNADENE-L 190 Query: 334 KNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 ++ ASP Y+V + ++ + + + +Q + +K Sbjct: 191 QSIASPPEDTHVYNVADFSVMNSIVEALTRTVCEQVVQQDK 231 >gi|302336994|ref|YP_003802200.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301634179|gb|ADK79606.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 324 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 66/186 (35%), Gaps = 31/186 (16%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + T + S + D++ +D+S SM +D +++ V R++ Sbjct: 71 MGFTGTGIDSEQFPEKKDVVFAVDLSRSMLASDVVPSRLERTKVLIRTVL---------- 120 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKI 266 D ++ R GLV F+ + P+ VQ + ++ L + T G+ A Sbjct: 121 DNSSGNRYGLVVFTDLGLVMVPVTEDVQSLVSAVDALSPDLLSSAGTNIAAGISAAGQAF 180 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + + + ++ I+ +DGE S D KE + + I + + Sbjct: 181 PEGERR-----------QRLIVVFSDGEEHSG--DPKE--ITASLRRDHHITTSVIALGS 225 Query: 327 EAADQF 332 Sbjct: 226 ADGAAV 231 >gi|89100236|ref|ZP_01173103.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085086|gb|EAR64220.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 476 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 70/198 (35%), Gaps = 27/198 (13%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK----- 226 +++LD S SM G K+ +A ++R I DV+ V T K Sbjct: 174 LLLLDASSSMLLDVD-GKQKMEIAKSAVRSFAKTIGEENDVSLYVYGHAGTQEDKDKQIS 232 Query: 227 ---IVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 I + +PL ++ + + + + T ++ A D + Sbjct: 233 CTTIDEVYPLQSYNEESFFKAVEGVEAKGWTPLAGAIKAAREASMDYEGD---------- 282 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNC-ASP 339 YI+ +DG + +E+ + + R + IG A+A DQ K A Sbjct: 283 ITLYIV--SDGAETCDGNPVEEARLFAETNESRMVNI--IGFNADAKAEDQLKKVAEAGK 338 Query: 340 DRFYSVQNSRKLHDAFLR 357 + N+ +L+ + Sbjct: 339 GEYIGADNADQLNSSISN 356 >gi|226310161|ref|YP_002770055.1| hypothetical protein BBR47_05740 [Brevibacillus brevis NBRC 100599] gi|226093109|dbj|BAH41551.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 477 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 71/212 (33%), Gaps = 21/212 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS--GLV 221 K ++ ++LD S SM G ++ A +I+ + + +V V G Sbjct: 167 KFKESYNVEIILDASGSMAAK-SNGKTRMDAAKEAIQAFAESLPEQANVALRVYGHKGSG 225 Query: 222 TFSSKIVQ--TFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 S K + + L + + + + +N+ T L+ A + Sbjct: 226 KESDKTLSCGSSELVYGMQTYNKEKLTQSLNQFQPTGYTPIAYSLQEAKKDLSKLPGDKN 285 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 I ++DG + + + V GV Q + Sbjct: 286 TNM---------IFLVSDGIETCDGDPVEAAKQLAQSEITPIINVIGFGVDGPGQQQLKE 336 Query: 335 NC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A+ R+ +Q+ ++L D F R GKE+ + Sbjct: 337 VAKAAGGRYVLIQDQKELQDEFNR-GKEIANK 367 >gi|51247575|pdb|1T6B|Y Chain Y, Crystal Structure Of B. Anthracis Protective Antigen Complexed With Human Anthrax Toxin Receptor Length = 189 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 70/189 (37%), Gaps = 30/189 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGL 220 + D+ VLD S S+ +++ E+ + ++ + + V+ +R Sbjct: 8 PRGRRAFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSF 53 Query: 221 VTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + FSS+ PL I E + R+ T GL+ A +I A Sbjct: 54 IVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA-------- 105 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 G II LTDG+ + + ++ GA VY +GV Q + Sbjct: 106 -GGLKTSSIIIALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIAD 162 Query: 338 SPDRFYSVQ 346 S ++ + V+ Sbjct: 163 SKEQVFPVK 171 >gi|110798896|ref|YP_694951.1| von Willebrand factor type A domain-containing protein [Clostridium perfringens ATCC 13124] gi|110673543|gb|ABG82530.1| von Willebrand factor type A domain protein [Clostridium perfringens ATCC 13124] Length = 580 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 44/252 (17%), Positives = 79/252 (31%), Gaps = 73/252 (28%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D++MV D S SM R+++ LD I+S + L+ FS + Sbjct: 76 PVDIIMVADKSGSMEYEMP-------TLKRAMKNFLDDIES--SFGDRANISLIEFSGEN 126 Query: 228 VQTFP----------------------------LAWGVQHIQEKINRLIFGSTTKSTPGL 259 ++ KI+++ T L Sbjct: 127 KTYIGRYCDFNGFNCYEKYFIKGSIDDAKVLCDYTSEYSTVKSKIDKITAYGRTDIEAAL 186 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-PNIDNKE--SLFYCNE----- 311 E K+ D KKY++F TDG NI+ +E SL Y + Sbjct: 187 ELVKKKLDDRNSNN----------KKYVVFFTDGLPIQLLNIETREYPSLDYIEKYIIPH 236 Query: 312 ----------AKRRGAIVYAIGVQAEA--------ADQFLKNCASPDRFYSVQNSRKLHD 353 + Y+IG+ A F+K+ + ++ +S +L Sbjct: 237 TKEYFYEKGFLDKNKVNFYSIGLFTGRRFDSEKKIAKDFIKSINNSGSYFITDDSNRLDS 296 Query: 354 AFLRIGKEMVKQ 365 + I ++ + Sbjct: 297 VYNDIAMNIINE 308 >gi|329954839|ref|ZP_08295856.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526943|gb|EGF53954.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 342 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 58/159 (36%), Gaps = 18/159 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F A P + K+ + G+++M+ LD+S SM +L A Sbjct: 61 MVFAAIGLFAVLLARPQFGS---KLETVKRQGVEVMIALDISNSMLAQDVQP-SRLQKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R + +++D +++ + G++ F+ P+ + + + +K Sbjct: 117 RLVAQLVDKMEN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + A N + + +I +TDGEN Sbjct: 170 TAIGAAIN-------LASRSFTPQEGVGRAVIVITDGEN 201 >gi|48425688|pdb|1SHT|X Chain X, Crystal Structure Of The Von Willebrand Factor A Domain Of Human Capillary Morphogenesis Protein 2: An Anthrax Toxin Receptor Length = 181 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 73/204 (35%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 6 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 51 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ A +I A G Sbjct: 52 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLK 102 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDG+ + ++ GA VY +GV Q + S ++ Sbjct: 103 TSSIIIALTDGKLDGLV--PSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 160 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 161 FPVKGG---FQALKGIINSILAQS 181 >gi|162447313|ref|YP_001620445.1| surface-anchored VWFA domain-containing protein [Acholeplasma laidlawii PG-8A] gi|161985420|gb|ABX81069.1| surface-anchored VWFA domain protein [Acholeplasma laidlawii PG-8A] Length = 486 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 58/294 (19%), Positives = 110/294 (37%), Gaps = 35/294 (11%) Query: 76 KNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM 135 +N F + N F + N R+ + + ++ ++ Y Sbjct: 48 ENPFIDVSVNNKSNISLSANTASYSFIRSQINSGRAVDRNAVRIEEMVNF---FNYNYNQ 104 Query: 136 PFIFCTFPWCANSSHAP-------LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 P TF + + P LLI K DI +++++LDVS SM Sbjct: 105 PETDKTFGFKSELIQTPWNNETHLLLIGLETKQVDLGDIPSNIVILLDVSGSM-----SA 159 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINR 246 +KL +A +++ +++ +K ++ LVT+SS K+V + ++ +I Sbjct: 160 TNKLSLAKKAMELLIEQMKPNDVIS------LVTYSSGEKVVFKGKSIDDMAYMTSQIRL 213 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 L +T GL+ AY E+ +G ++ II TDG+ + + Sbjct: 214 LKASGSTAGKKGLDMAYKV------AEEYFIEGGNNR---IILATDGDFNVGISSTDMLI 264 Query: 307 FYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR--FYSVQNSRKLHDAFLR 357 Y +E + G A G D+ L+ A ++ + + AF+ Sbjct: 265 EYISEKRESGIYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARKAFVD 318 >gi|254373669|ref|ZP_04989153.1| hypothetical protein FTDG_01677 [Francisella novicida GA99-3548] gi|151571391|gb|EDN37045.1| hypothetical protein FTDG_01677 [Francisella novicida GA99-3548] Length = 332 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 78/235 (33%), Gaps = 42/235 (17%) Query: 134 EMPFIFCTFPWCAN-SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P IF S P V + + ++ LDVS SM+ +L Sbjct: 58 LVPLIFLLIWLVTIFSLAGPTWKYKDVPVY---QKNISRVIALDVSQSMDTTDVSP-SRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 A I ++L IK G++ FSS+ PL I+ + + Sbjct: 114 ERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTIENLVTVINSDIV 166 Query: 251 --STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 L+ + I A + II +TD + + +++ Sbjct: 167 PVQGHNIYKALKKSAQLIEQAGVQQGQ-----------IILITD------SSPSPQAISQ 209 Query: 309 CNEAKRRGAI--VYAI-----GVQAEAADQFLKNCASPDRFYSVQ--NSRKLHDA 354 + ++G VYAI G+ + +LK+ +++ + +L A Sbjct: 210 AKQLAQQGIKTDVYAIGTPMGGIAKDEKGNYLKDSQGNIQYFGIDLSKLEELATA 264 >gi|145634726|ref|ZP_01790434.1| hypothetical protein CGSHiAA_03958 [Haemophilus influenzae PittAA] gi|229844728|ref|ZP_04464867.1| hypothetical protein CGSHi6P18H1_03939 [Haemophilus influenzae 6P18H1] gi|145267892|gb|EDK07888.1| hypothetical protein CGSHiAA_03958 [Haemophilus influenzae PittAA] gi|229812442|gb|EEP48132.1| hypothetical protein CGSHi6P18H1_03939 [Haemophilus influenzae 6P18H1] Length = 345 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 70/200 (35%), Gaps = 15/200 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +V+D+S SM L I +++ ++ P V ++ F+ Sbjct: 3 RLPVYLVVDISESMAGE------NLRQMQEGISRLVNQLRRDPYALESVYLSVIAFAGAA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + RL GS T L + + + + KG YI Sbjct: 57 GTLAPLT---ELMSFYPPRLPIGSGTSIGAALNHLMDSLEKDIMRSTPEKKGDWKPLIYI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 ++DG SP D +++ + A + IG+ A L + ++ Sbjct: 114 --MSDG---SPTDDPAQAISRWKHHFQNKAKLINIGIGKFANLDTLNEISDLTYRLDDED 168 Query: 348 SRKLHDAF-LRIGKEMVKQR 366 K++ A + ++ Q Sbjct: 169 IEKVYRALCESVADSILSQS 188 >gi|145632218|ref|ZP_01787953.1| hypothetical protein CGSHi3655_07184 [Haemophilus influenzae 3655] gi|144987125|gb|EDJ93655.1| hypothetical protein CGSHi3655_07184 [Haemophilus influenzae 3655] Length = 345 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 70/200 (35%), Gaps = 15/200 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +V+D+S SM L I +++ ++ P V ++ F+ Sbjct: 3 RLPVYLVVDISESMAGE------NLRQMQEGISRLVNQLRRDPYALESVYLSVIAFAGAA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + RL GS T L + + + + KG YI Sbjct: 57 GTLAPLT---ELMSFYPPRLPIGSGTSIGAALNHLMDSLEKDIMRSTPEKKGDWKPLIYI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 ++DG SP D +++ + A + IG+ A L + ++ Sbjct: 114 --MSDG---SPTDDPAQAISRWKHHFQNKAKLINIGIGKFANLDTLNEISDLTYRLDDED 168 Query: 348 SRKLHDAF-LRIGKEMVKQR 366 K++ A + ++ Q Sbjct: 169 IEKVYRALCESVADSILSQS 188 >gi|29346316|ref|NP_809819.1| hypothetical protein BT_0906 [Bacteroides thetaiotaomicron VPI-5482] gi|253568263|ref|ZP_04845674.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298385670|ref|ZP_06995228.1| BatB protein [Bacteroides sp. 1_1_14] gi|29338211|gb|AAO76013.1| BatB, conserved hypothetical protein [Bacteroides thetaiotaomicron VPI-5482] gi|251842336|gb|EES70416.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298261811|gb|EFI04677.1| BatB protein [Bacteroides sp. 1_1_14] Length = 342 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 IF + P + + K G+++++ LD+S SM +L A Sbjct: 61 LIFAAIGLFSVLLARPQFGSKQETVKRK---GVEVIIALDISNSMLAQDVQP-SRLEKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 R I ++D + + + G++ F+ P+ + + + +K Sbjct: 117 RLISRLVDELDN-------DKVGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + A N + + II +TDGEN Sbjct: 170 TAIGEAIN-------LATRSFTPQEGVGRAIIVITDGEN 201 >gi|308472823|ref|XP_003098638.1| hypothetical protein CRE_04225 [Caenorhabditis remanei] gi|308268238|gb|EFP12191.1| hypothetical protein CRE_04225 [Caenorhabditis remanei] Length = 417 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 73/213 (34%), Gaps = 14/213 (6%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 ++ ++P + +++ LD++ V+D S M G+ + S+ I Sbjct: 18 SAQYSPQSYVDRQCGTDLNNLWLDVIAVVDNSHGMT---NGGVQSVAANIASVFSSGTRI 74 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 S R GLVT++S L + + + + + + L N Sbjct: 75 GSNSTEPRTTRVGLVTYNSGAKLDADLNKFQDLDGLYNGVFKDLSDVVDTTDSYLATGLN 134 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + + + D YKK II S +D + N K G ++ + Sbjct: 135 AAEELLQSQS-LNTTRDHYKKVIIVYASEYKGSGELDP---VPVANRLKGSGVVIVTVAY 190 Query: 325 QAEAADQFLK---NCASPDRFY--SVQNSRKLH 352 + L+ N ASP Y + N+ L Sbjct: 191 DQGGDEGLLRDLANIASPGFAYSNAPNNAGNLV 223 >gi|260800509|ref|XP_002595172.1| hypothetical protein BRAFLDRAFT_241020 [Branchiostoma floridae] gi|229280415|gb|EEN51183.1| hypothetical protein BRAFLDRAFT_241020 [Branchiostoma floridae] Length = 299 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 68/188 (36%), Gaps = 25/188 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ VLD + S+ G +++M+ + P R G+V +S++ Sbjct: 18 VDLVFVLDGTGSV------GATNFERMKTFVQKMISDFELGP---EATRIGVVVYSNRAS 68 Query: 229 QTFPLA--WGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + +Q+ + + + T + ++Y F + + +K Sbjct: 69 LEISLDAFEDQESLQDAVAGIAYPGGYTLTGAAIDYTTTFAFSTRNGAR------EGVRK 122 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYS 344 + LTDG + + ++ I YA+G+ + L A PD + Sbjct: 123 VAVILTDG------VSYDDPAEPAQSMRKAAIITYAVGIGSNLDRDQLDVIAGVPDNLFV 176 Query: 345 VQNSRKLH 352 + + L Sbjct: 177 LDDFSMLD 184 >gi|156120138|ref|NP_001095285.1| complement C2 [Sus scrofa] gi|148724911|emb|CAN87699.1| complement component 2 [Sus scrofa] Length = 752 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 41/180 (22%), Positives = 74/180 (41%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+++ GV RS ++D I S + Sbjct: 245 KIQIQRSGHLNLYLLLDASQSVSEE------DFGVFKRSASLLVDRIFSFEIN---ISVA 295 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK L + + E ++ L G+ T L Y + + Sbjct: 296 IITFASKPKIIMSVLKDKSRDVTEVVHSLENINYKDHENGTGTNIYEALNSVYIMMNNQM 355 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + L + + II LTDG+++ P +DN + + E ++ +YAIGV Sbjct: 356 QNLGMNTMAWQEIRHAIILLTDGKSNMGGSPKPAVDNIKEILNIKEKRKDYLDIYAIGVG 415 >gi|115976464|ref|XP_001187868.1| PREDICTED: similar to Synaptotagmin IX [Strongylocentrotus purpuratus] Length = 1426 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 75/220 (34%), Gaps = 33/220 (15%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P+ T I + + ++VLD+S SM +L + ++ +++ + +P Sbjct: 233 KPVRSTDPEFIIVRQKLQTTTVLVLDISGSMEGE------RLVLLQQASANIIENV--LP 284 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKI 266 + R G++ FS + + +N L T G+E A K+ Sbjct: 285 MGS---RLGIIAFSEYATVRHNIVEIDSQDTRRSLLNSLPKKADGRTSIGRGVELAVQKL 341 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + + +I +TDGE +S ++ L K V + V Sbjct: 342 REYESDPAGST---------LIVITDGEQNSHPYI-QDVLQ-----KTDNLTVNTVAVGG 386 Query: 327 EAAD--QFLKNCASPDRFYSVQNSRKL-HDAFLRIGKEMV 363 EA + L D F S ++ F +M Sbjct: 387 EADGELELLAVHTGGDSFSHTDKSSEIYLSFFKAAAAQMS 426 >gi|38455778|gb|AAR20890.1| complement C2 [Sus scrofa] Length = 734 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 41/180 (22%), Positives = 74/180 (41%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+++ GV RS ++D I S + Sbjct: 245 KIQIQRSGHLNLYLLLDASQSVSEE------DFGVFKRSASLLVDRIFSFEIN---ISVA 295 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAK 270 ++TF+SK L + + E ++ L G+ T L Y + + Sbjct: 296 IITFASKPKIIMSVLKDKSRDVTEVVHSLENINYKDHENGTGTNIYEALNSVYIMMNNQM 355 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + L + + II LTDG+++ P +DN + + E ++ +YAIGV Sbjct: 356 QNLGMNTMAWQEIRHAIILLTDGKSNMGGSPKPAVDNIKEILNIKEKRKDYLDIYAIGVG 415 >gi|332844134|ref|XP_510503.3| PREDICTED: integrin alpha-11 [Pan troglodytes] Length = 1188 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|332236004|ref|XP_003267196.1| PREDICTED: integrin alpha-11 [Nomascus leucogenys] Length = 1188 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|296213559|ref|XP_002753319.1| PREDICTED: integrin alpha-11 [Callithrix jacchus] Length = 1188 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|292669496|ref|ZP_06602922.1| magnesium-chelatase subunit [Selenomonas noxia ATCC 43541] gi|292648859|gb|EFF66831.1| magnesium-chelatase subunit [Selenomonas noxia ATCC 43541] Length = 636 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 57/169 (33%), Gaps = 20/169 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++ ++D S SM +++ + +I +L R GL+ Sbjct: 445 RAKRSAANILFLVDASGSMG-----ARERMRMVKGAILALLQEA-----YQKRDRVGLIA 494 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ ++++ L G T GL A + + + + Sbjct: 495 FRRDRAETLLPMTRSVELAEKQLRDLPTGGRTPLAEGLACALQTLRELERRGSE------ 548 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAA 329 K ++ +TDG ++ + A+ G + + E Sbjct: 549 --KTVLVLITDGRTNTARDGDGGVQRALRAAEEIAGTQALTLVLDTERG 595 >gi|291290987|ref|NP_001167519.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 3 [Mus musculus] Length = 1150 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|291290985|ref|NP_001167518.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 1 [Mus musculus] Length = 1156 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|194388296|dbj|BAG65532.1| unnamed protein product [Homo sapiens] Length = 650 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|291290989|ref|NP_001167520.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 4 [Mus musculus] gi|187957756|gb|AAI58059.1| Cacna2d2 protein [Mus musculus] Length = 1147 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|192290043|ref|YP_001990648.1| von Willebrand factor type A [Rhodopseudomonas palustris TIE-1] gi|192283792|gb|ACF00173.1| von Willebrand factor type A [Rhodopseudomonas palustris TIE-1] Length = 636 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 69/196 (35%), Gaps = 31/196 (15%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F A L +T V +S +D +D VLDV + L + + Sbjct: 436 FHLAARPQARDLAVTLLVDVSLSTDAWIDNRRVLDVE-------KEALTVLAHGIEACGD 488 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 I+ + VR V + PL ++ +I L G T+ L + Sbjct: 489 QHSILTFTSRRRDWVRVETVK--GFGERMSPL------VERRIVALKPGYYTRIGAALRH 540 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A ++ ++ KK ++ LTDG+ + ++S EA+R G Sbjct: 541 ASAELAKQPQR-----------KKLLLVLTDGKPNDVDHYEGRFALEDSRRAVQEARRSG 589 Query: 317 AIVYAIGVQAEAADQF 332 V+ + + +A F Sbjct: 590 IAVFGVTIDVDAQSYF 605 >gi|158258322|dbj|BAF85134.1| unnamed protein product [Homo sapiens] Length = 1188 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|149692293|ref|XP_001495918.1| PREDICTED: integrin, alpha 11 [Equus caballus] Length = 1188 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|149018615|gb|EDL77256.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Rattus norvegicus] Length = 1157 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|148689241|gb|EDL21188.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Mus musculus] Length = 1149 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 295 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 342 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 343 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 397 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 398 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 450 Query: 340 DRFYSV 345 ++ + Sbjct: 451 GYYFEI 456 >gi|119901955|ref|XP_602058.3| PREDICTED: integrin, alpha 11 [Bos taurus] gi|297479009|ref|XP_002690571.1| PREDICTED: integrin alpha 11 subunit-like [Bos taurus] gi|296483745|gb|DAA25860.1| integrin alpha 11 subunit-like [Bos taurus] Length = 1194 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 166 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 214 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 215 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 267 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 268 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPEAFLNEI 323 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 324 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 358 >gi|119598226|gb|EAW77820.1| integrin, alpha 11, isoform CRA_a [Homo sapiens] Length = 1189 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|40737976|gb|AAR89454.1| voltage-gated calcium channel alpha2-delta2 subunit [Mus musculus] Length = 1186 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 333 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 380 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 381 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 435 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 436 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 488 Query: 340 DRFYSV 345 ++ + Sbjct: 489 GYYFEI 494 >gi|161353447|ref|NP_064659.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 2 [Mus musculus] gi|81892698|sp|Q6PHS9|CA2D2_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Protein ducky; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|38614142|gb|AAH56389.1| Cacna2d2 protein [Mus musculus] gi|148689240|gb|EDL21187.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Mus musculus] Length = 1154 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|291290991|ref|NP_001167521.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 5 [Mus musculus] gi|26336631|dbj|BAC31998.1| unnamed protein product [Mus musculus] Length = 1148 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|15553133|gb|AAL01650.1|AF247141_1 voltage-dependent calcium channel alpha-2-delta-2 mutant subunit 2 [Mus musculus] Length = 1084 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 222 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 269 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 270 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 324 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 325 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 377 Query: 340 DRFYSV 345 ++ + Sbjct: 378 GYYFEI 383 >gi|5915662|gb|AAD51919.2|AF137378_1 integrin alpha 11 subunit precursor [Homo sapiens] gi|119598227|gb|EAW77821.1| integrin, alpha 11, isoform CRA_b [Homo sapiens] Length = 1188 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|6013141|gb|AAF01258.1|AF109681_1 integrin alpha-11 subunit precursor [Homo sapiens] Length = 1189 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|74000923|ref|XP_535527.2| PREDICTED: similar to integrin, alpha 11 precursor [Canis familiaris] Length = 1183 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 155 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 203 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 204 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 256 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 257 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPEAFLNEI 312 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 313 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 347 >gi|52485853|ref|NP_001004439.1| integrin alpha-11 precursor [Homo sapiens] gi|313104119|sp|Q9UKX5|ITA11_HUMAN RecName: Full=Integrin alpha-11; Flags: Precursor gi|189442879|gb|AAI67840.1| Integrin, alpha 11 [synthetic construct] Length = 1188 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|12044402|gb|AAG47846.1|AF247139_1 voltage-dependent calcium channel alpha-2-delta-2 subunit [Mus musculus] Length = 1156 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|28212250|ref|NP_783182.1| voltage-dependent calcium channel subunit alpha-2/delta-2 [Rattus norvegicus] gi|81871226|sp|Q8CFG6|CA2D2_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|27450706|gb|AAO14653.1|AF486277_1 calcium channel alpha-2 delta-2 subunit [Rattus norvegicus] Length = 1157 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 294 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 341 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 342 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 396 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 397 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 449 Query: 340 DRFYSV 345 ++ + Sbjct: 450 GYYFEI 455 >gi|26006175|dbj|BAC41430.1| mKIAA0558 protein [Mus musculus] Length = 1098 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 69/186 (37%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 242 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 289 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ ++ + Sbjct: 290 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCN----- 344 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 345 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 397 Query: 340 DRFYSV 345 ++ + Sbjct: 398 GYYFEI 403 >gi|116623354|ref|YP_825510.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226516|gb|ABJ85225.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 328 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 75/213 (35%), Gaps = 31/213 (14%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + V SK + + + ++D S SM +++ + +I++ L + + Sbjct: 100 VEQHVVSLSKEEGPVSVGFIIDASSSMK-------NRMDKSVAAIQQFLRTNMAGDEYF- 151 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 LV FS K I ++ + T + ++ AK Sbjct: 152 -----LVRFSDKPTMVTTFTHNPAEIASELTLIKPLGWTALHDAIYLGTQQMRKAK---- 202 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + ++ + LTDG +++ E N VYAIG+ +FL+ Sbjct: 203 -------NSRRALFVLTDGGDNNSRYTEAEVK---NFVVESDVRVYAIGLFERP--KFLE 250 Query: 335 NCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A + ++ + L DA RI +E Q Sbjct: 251 RLAALTGGEALWAKSLKDLPDAIERISREFRNQ 283 >gi|327310820|ref|YP_004337717.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] gi|326947299|gb|AEA12405.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] Length = 509 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 71/203 (34%), Gaps = 19/203 (9%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + +++D S SM D DKL A + + +++ + V V Sbjct: 33 GPAPETNTAFAILIDKSKSMADF-----DKLAHAIEAAKGLVESMAPEDIVAVYV----- 82 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE---KLEHI 276 F K+ P + + KI ++ G+ T L + + K + Sbjct: 83 -FDEKVKALVPPTPVEKARKMLGKIEKIKPGTYTLLYQALLQVIDDLRGIKRGLPLMPRR 141 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A + K+ I+ +TDGE + + A R G + AIG+ + ++ L Sbjct: 142 AVPENIPKR-IVVITDGEPWPYYTEERWYEHLGKAAARYGITISAIGIGDDYNEKILYAL 200 Query: 337 A--SPDRFYSVQNSRKLHDAFLR 357 A S +Y + R + Sbjct: 201 ANSSGGAWYHISQIRDISQVLAN 223 >gi|73974565|ref|XP_532319.2| PREDICTED: similar to collagen, type XIV, alpha 1 [Canis familiaris] Length = 1796 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 79/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F ++ L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDENFNKIINFLYSTVGALNKI---------GADGTQVAMVQFTDDPR 1082 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ +++ + +F A E K Sbjct: 1083 TEFKLNAYNTKETLLDAIKHISYKGGNTKTGKAIKHVRDSLFTA-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++A+GV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISGEMQSNGYSIFAVGVADADYSELVSIGSKPSSRHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 44/225 (19%), Positives = 84/225 (37%), Gaps = 27/225 (12%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI- 205 + H P VK + D+++++D S S+ R +R L+ Sbjct: 136 GNGHKPSP-PEEVKFFCEIPAIADIVILVDGSWSIGRF----------NFRLVRLFLENL 184 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAY 263 + + + R GL +S + L A+ + + E + L + T A Sbjct: 185 VTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIEAVRNLPYKGGNTLTG---LAL 241 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N IF+ K E A+ K I +TDG++ I +L + G ++AIG Sbjct: 242 NYIFENSFKPEAGARTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIG 293 Query: 324 VQAEAADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 V+ ++ + + PD Y+V +H + + + + Sbjct: 294 VKNADENELREIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 >gi|121583393|ref|YP_973824.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596647|gb|ABM40082.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 350 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 37/168 (22%), Positives = 61/168 (36%), Gaps = 14/168 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM G + K+ A +I + L ++ P V ++ F+ Sbjct: 3 RLPVFFVLDCSESM---VGANLKKMEGAVAAIVKSL---RTDPQALETVFFSVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL V+ + +L G T L+ +I + K KG I Sbjct: 57 RTIAPL---VEIVSFYPPKLPLGGGTNLGSALDALMGEIDRSVIKTTAERKGDWRP--II 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +TDG P + ++ N + A + AIG+ L+ Sbjct: 112 YLVTDG---RPTDNPSRAIERWNSHYAKKATLIAIGLGRSVDFTALRR 156 >gi|296135239|ref|YP_003642481.1| von Willebrand factor type A [Thiomonas intermedia K12] gi|295795361|gb|ADG30151.1| von Willebrand factor type A [Thiomonas intermedia K12] Length = 336 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 73/218 (33%), Gaps = 38/218 (17%) Query: 163 SKSDIGLDMMMVLDVSLSMNDH----FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + G D++++LD SL+M+ H G +L VA R + R Sbjct: 93 AAAPEGRDILVLLDTSLTMSLHDLTWAGKPASRLAVAQR-------VFADFARARQGDRF 145 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LV F S P + ++ L+ L A + + H K Sbjct: 146 ALVAFGSHAATLLPPTFD-ARAAGQMAGLLAVGQLGPDTALGDAI-ALGLRQAGALHGLK 203 Query: 279 GHDDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQF--- 332 +I TDG ++++ I +++ A+ G +Y + V + + Sbjct: 204 P------VLILYTDGGQSNTGAISPADAVAL---ARHLGVRIYTVEVGTTPDPGRPYTVP 254 Query: 333 ----------LKNCASPDRFYSVQNSRKLHDAFLRIGK 360 L A+ RFY ++ A IG Sbjct: 255 AYAGPQPDLRLIAEATGGRFYFAASAGAQQAAVRDIGA 292 >gi|313200528|ref|YP_004039186.1| von willebrand factor type a [Methylovorus sp. MP688] gi|312439844|gb|ADQ83950.1| von Willebrand factor type A [Methylovorus sp. MP688] Length = 333 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 69/221 (31%), Gaps = 39/221 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G +M+VLD S SMND F ++ +++ + GL+T Sbjct: 81 GHGAHIMVVLDRSASMNDDFAGKYIQSTDAVQNPSKLKAAREVLAKFIESRQQDLVGLIT 140 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS+ V PL ++ ++ T GL A + Sbjct: 141 FSTSPVFVLPLTHDQVALKAALDSAEAQGMGFTAVARGLGMALDYFKGKPVTGA------ 194 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY-------AIGVQAEAADQFL 333 + I+ ++DG + + A +Y +I VQ D Sbjct: 195 ----RLILLVSDGGAHIDSQTQDRIRSWFASEHAALAWIYIRGTNSPSIFVQPGEGDD-- 248 Query: 334 KNCASPDRF--------------YSVQNSRKLHDAFLRIGK 360 ++P+ F Y +N R L A +GK Sbjct: 249 -AASAPEYFLHEYFKTLGVPYQAYEAENPRALEAALRDVGK 288 >gi|253998453|ref|YP_003050516.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] gi|253985132|gb|ACT49989.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] Length = 333 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 69/221 (31%), Gaps = 39/221 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 G +M+VLD S SMND F ++ +++ + GL+T Sbjct: 81 GHGAHIMVVLDRSASMNDDFAGKYIQSTDAVQNPSKLKAAREVLAKFIESRQQDLVGLIT 140 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS+ V PL ++ ++ T GL A + Sbjct: 141 FSTSPVFVLPLTHDQVALKAALDSAEAQGMGFTAVARGLGMALDYFKGKPVTGA------ 194 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY-------AIGVQAEAADQFL 333 + I+ ++DG + + A +Y +I VQ D Sbjct: 195 ----RLILLVSDGGAHIDSQTQDRIRSWFASEHAALAWIYIRGTNSPSIFVQPGEGDD-- 248 Query: 334 KNCASPDRF--------------YSVQNSRKLHDAFLRIGK 360 ++P+ F Y +N R L A +GK Sbjct: 249 -AASAPEYFLHEYFKTLGVPYQAYEAENPRALEAALRDVGK 288 >gi|126031547|pdb|2ODP|A Chain A, Complement Component C2a, The Catalytic Fragment Of C3- And C5-Convertase Of Human Complement gi|126031548|pdb|2ODQ|A Chain A, Complement Component C2a, The Catalytic Fragment Of C3- And C5-Convertase Of Human Complement Length = 509 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 71/180 (39%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S++++ + S M+D I S V Sbjct: 1 KIQIQRSGHLNLYLLLDASQSVSEN------DFLIFKESASLMVDRIFSFEIN---VSVA 51 Query: 220 LVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAK 270 ++TF+S+ V L + + E I+ L G+ T + L Y + + Sbjct: 52 IITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQM 111 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 L + + II LTDG+++ D+ + N+ + +YAIGV Sbjct: 112 RLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVG 171 >gi|331658353|ref|ZP_08359315.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] gi|331056601|gb|EGI28610.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] Length = 574 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 68/194 (35%), Gaps = 25/194 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ +++ ++D S SM ++L + S++ ++ ++ + +V Sbjct: 207 KSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDN------IAIV 255 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 T++ P G I I+ L +T GLE AY + KG Sbjct: 256 TYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKG 309 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA- 337 + I+ TDG+ + D K + + G + +GV + + + A Sbjct: 310 GINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIAD 366 Query: 338 -SPDRFYSVQNSRK 350 + + + Sbjct: 367 VGNGNYSYIDTLSE 380 >gi|294054128|ref|YP_003547786.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293613461|gb|ADE53616.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 702 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 58/161 (36%), Gaps = 25/161 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F + P K+ S S G D++ LDVS SM +L VA Sbjct: 53 LRVSAFVLLVVALARP---GYDPKMESVSRTGRDVVFALDVSRSMLAEDVLP-SRLEVAK 108 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG----VQHIQEKINRLIFGST 252 ++IR+ LD++ + R GLV ++ PL + +++ R + Sbjct: 109 QAIRDALDVMGNE-------RVGLVAYAGSASILCPLTYDYDFVRYSLEQAHPRTVDFGG 161 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 T +E +++F ++ LTDG Sbjct: 162 TTLQSAVEKTVDQVFLDGRAGVQD----------LVVLTDG 192 >gi|21675085|ref|NP_663150.1| hypothetical protein CT2279 [Chlorobium tepidum TLS] gi|21648325|gb|AAM73492.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 335 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 60/164 (36%), Gaps = 25/164 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G+D++ +LD+S SM D+L A E+L I + + D R L+ F+ Sbjct: 86 QKGIDIIFMLDISNSML-ARDTAPDRLTHAKT---ELLQISRRLGDG----RKALLLFAG 137 Query: 226 KIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 V PL + + ++ LI T + A E + Sbjct: 138 TPVVQCPLTDDEEDFEILLDMAAPELITTQGTDYRRAFDAALKLTNSGGELSSNET---- 193 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 ++ +DGE+ ++ + K RG ++ IGV Sbjct: 194 ----VLVLASDGEDHGNDLGD-----IATAMKTRGVHLHVIGVG 228 >gi|163748339|ref|ZP_02155613.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] gi|161378385|gb|EDQ02880.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] Length = 405 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 34/69 (49%), Gaps = 4/69 (5%) Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAAD--QFLKNCASPD-RFYSVQNSRKLHDAFLRI 358 ++ C A+R G ++YAIG+ ++ + LK CAS + ++ V ++ AF I Sbjct: 336 DRRLRNICAAAQRAGIVIYAIGMDVDSQNSLDLLKECASTEAHYFDVDGL-EIQTAFDMI 394 Query: 359 GKEMVKQRI 367 + R+ Sbjct: 395 AASISMLRL 403 >gi|315122409|ref|YP_004062898.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495811|gb|ADR52410.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 411 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 50/399 (12%), Positives = 121/399 (30%), Gaps = 49/399 (12%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + GS I++A+LL +M ++ + + V+ L +D++L ++ Sbjct: 11 FSRLNHCTNGSFLIVSAVLLSSFLTIMDIMRDYTDMIRVRNMLQSSIDYALHNNPNELSV 70 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH-KD 124 KQ+ + I ++++ L + I I +++SI + + Sbjct: 71 G-----TIKQREMLIKKRIGYFLDSNYKGTL---LTEEQIKLIVNQSTVSITERSFYPQQ 122 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSH--------------------APLLITSSVKISSK 164 ++++ + N P T S Sbjct: 123 FHINIELHKNIQLKSLILHMAMNPKKDFNISQRKSSLYKKNVALMVVPFTWTGEWIPPSL 182 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + L + + + + + L IK + + Sbjct: 183 FTTQFTVSQDL-LPSDLKTEHFKKTEYFNKRNQFFKMFLSKIKENNLCIAPYHYSAIVYW 241 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-- 282 S+ + ++ L + + + + +T +F E + D Sbjct: 242 SEGIFSYKLPFSTTFLYSFRDIYVKQYSTIWDMKPSNYILDLFAGAELHSNRLTPADPCF 301 Query: 283 -----YKKYIIFLTDGENSSPNIDNKESLFY---CNEA-KRRG------AIVYAIGVQAE 327 KK+++ + G S ++ E C K G VY++G+ + Sbjct: 302 RRGVIQKKFMLIIAAGNQISDRKNSAEYFKMKHGCTLMGKNMGKNPQEEITVYSLGISPD 361 Query: 328 A--ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 ++ PDR+Y +Q+ + + R+ + + Sbjct: 362 PDTKRDLIQCTRHPDRYYEIQSYKDIAPVIDRLERNISS 400 >gi|290995707|ref|XP_002680424.1| vWFA domain-containing protein [Naegleria gruberi] gi|284094045|gb|EFC47680.1| vWFA domain-containing protein [Naegleria gruberi] Length = 382 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 55/142 (38%), Gaps = 25/142 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----S 224 +D+++V+D + SM+ ++ A ++++ +L +K +R +++ Sbjct: 65 VDLVIVMDCTGSMS-------GEIDAAKKTVQTILSSLK--DHFKTDLRFSAISYRDHSD 115 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 V+ FP +Q + I+ + L A + + + K Sbjct: 116 DYAVREFPFTKDIQKAKSYIDTMSAQGGGDHPEALASALKVVNEMPFNKKG--------K 167 Query: 285 KYIIFLTD----GENSSPNIDN 302 K I++ D G NS+ D+ Sbjct: 168 KICIWIADAPPHGMNSNQGADS 189 >gi|2645090|dbj|BAA23627.1| collagen-like protein [Hydra vulgaris] Length = 354 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 43/188 (22%), Positives = 73/188 (38%), Gaps = 27/188 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M+ +LD S S+ D DK+ +SI D+ N + R G++ F S Sbjct: 1 MLFLLDSSGSVGDE---NFDKMKEFVKSIVLNFDV------DNQLTRIGIIRFDSDAEII 51 Query: 231 FPLAWGVQHIQEKIN-----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L+ + +++ +N R G T+ LE A + D K Sbjct: 52 IQLS-DHKTLKDLLNDIDSIRYNEGIQTRIDKALERAMEAFSEKNGGRA-------DATK 103 Query: 286 YIIFLTDGENS--SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ L DG+NS + D E L +AK Y IG+ E + L++ A+ + Sbjct: 104 ALVLLADGQNSFIEGSQDLNEELKPLIDAK---VFRYVIGIGRELDLKELEDIATNNIAI 160 Query: 344 SVQNSRKL 351 + +L Sbjct: 161 YADSFDEL 168 >gi|253701050|ref|YP_003022239.1| von Willebrand factor A [Geobacter sp. M21] gi|251775900|gb|ACT18481.1| von Willebrand factor type A [Geobacter sp. M21] Length = 315 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 54/163 (33%), Gaps = 20/163 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S SM G +L A ++R+ +K R GLV F+ Sbjct: 81 DILFLLDTSRSML-TRDLGQSRLDAAKEAVRKATAGLKGE-------RVGLVAFAGSAFL 132 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 PL + + G T PG A A ++ +G K ++ Sbjct: 133 VCPLTTDYALFDQVLKE--AGEQTLPLPGTSLA-----AALKEARRALQGEGGEPKVVVL 185 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 L+DGE+ A+ +YA+ Sbjct: 186 LSDGEDHEGEYAAAARALDAAGAR-----LYAVAAGTSPGGPI 223 >gi|126314598|ref|XP_001371349.1| PREDICTED: similar to SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily c, member 2 [Monodelphis domestica] Length = 3274 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 67/197 (34%), Gaps = 25/197 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++ D++ ++D S S+ + + + D+I+S+ + R LV F Sbjct: 30 QTSAAADIIFLVDSSWSIGKEHFQLVREF---------LYDVIESLAVAGSDFRFALVQF 80 Query: 224 SSKIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAY-NKIFDAKEKLEHIAKG 279 + F L + + N G +K+ GL Y N + A Sbjct: 81 NGNPHTEFLLNTYRTNQEVLSHIANMTYLGGDSKTGRGLRYVIQNHLTPAAGSRARDGVP 140 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I+ LTDG + L V AIGVQ + + P Sbjct: 141 -----QVIVVLTDGRSQDDVAGPSAELKSA-----AAVDVVAIGVQDAEEGELKEMATEP 190 Query: 340 --DRFYSVQNSRKLHDA 354 ++++N LHD Sbjct: 191 LDLHVFNLENFTALHDI 207 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 72/181 (39%), Gaps = 22/181 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D++ ++D S + +++ ++ +P ++ G+V +S Sbjct: 235 AQESADLIFLIDGSDKTGSAHFAAIRDF---------LVNFLERLPIGARQIQVGVVQYS 285 Query: 225 SKIVQTFPL-AWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L ++ + + + + L ++ G A + + + ++ + Sbjct: 286 NGPRTAFSLNSYSTKADVLDAVKALRVIGGEETNVG--AALDFVVENHFTQAGGSRVEEG 343 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPDR 341 + ++ ++ +++ D +L + +++ G+ A+ AD+ L++ A+ + Sbjct: 344 VPQVLVLISGSQSTDDIRDGVVALKQAS--------IFSFGLGAQGADRAELQHIATDEN 395 Query: 342 F 342 F Sbjct: 396 F 396 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 75/188 (39%), Gaps = 23/188 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S+N D R + ++D I + + ++ GLV ++S Sbjct: 1636 DIVFLLD--GSINFKR----DNFQEVLRFVSGIVDTI---YEGGDSIQVGLVQYNSDPTD 1686 Query: 230 TFPLAW--GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I + IN++++ + GLE+ + E ++ + Sbjct: 1687 EFFLKDFSTKEEILDAINKVVYKGGRQANTLVGLEH----LRKNHFVPEAGSRIDQRIPQ 1742 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +T G + + +L ++G V+A+GV+ + + ++ + V Sbjct: 1743 IAFVITGGSSVEDVEEATRALS------QKGVKVFAVGVRNVDLREVSRIASNSAIAFRV 1796 Query: 346 QNSRKLHD 353 N ++L + Sbjct: 1797 SNVQELSE 1804 Score = 39.8 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 70/199 (35%), Gaps = 18/199 (9%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 ++ + ++ D++ +LD SL++ + P + +++++ S+ Sbjct: 623 RTLSGTTQVPVNKR---DIIFLLDGSLNVGNANFPYVRDF---------VMNLVNSLDVG 670 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + +R GLV FS F L ST + A + + Sbjct: 671 YDNIRVGLVQFSDTPETEFSLDTYQSKSDLLARLRQLQLKGGSTLNIGSALDFVHSNHFT 730 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ H++ + ++ L G++ + +L R G + + +G + Sbjct: 731 EAGGSRIHENIPQLLLLLAAGQSGDSYLQASNALA------RAGVLTFCVGASQANKAEL 784 Query: 333 LKNCASPDRFYSVQNSRKL 351 + +P Y + + L Sbjct: 785 EQIAFNPSLVYLMDDFSSL 803 >gi|296196207|ref|XP_002745750.1| PREDICTED: anthrax toxin receptor 2-like [Callithrix jacchus] Length = 663 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 43/231 (18%), Positives = 79/231 (34%), Gaps = 33/231 (14%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 FP + + + + D+ VLD S S+ +++ Sbjct: 190 LFPGLWLLTLSGPGVLLRAQEQPSCRRAFDLYFVLDKSGSVANNW--------------I 235 Query: 201 EMLDIIKSIPD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKINRLIFGSTTKS 255 E+ + ++ + + V+ +R + FSS+ PL I E + + T Sbjct: 236 EIYNFVQQLAERFVSPEMRLSFIVFSSQASIILPLTGDRDKISKGLEDLKHVNPVGETYI 295 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 GL+ A +I A G II LTDG+ + + ++ Sbjct: 296 HEGLKLANEQIQKA---------GGLKTSSIIIALTDGKLDG--LVPSYAEKEAKISRSL 344 Query: 316 GAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 GA VY +GV Q + S ++ + V+ A I ++ Q Sbjct: 345 GASVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQS 392 >gi|291223817|ref|XP_002731904.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 978 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 66/197 (33%), Gaps = 35/197 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLDVS SM P L V T IR +D + G+V F Sbjct: 328 VVLVLDVSGSMEGK--PLTQLLQVCTNYIRNTID---------DGSYLGVVKFEGIAETL 376 Query: 231 FPLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L G Q + I L G T G+ + + D Y Sbjct: 377 VDLMLIDGPQIRETIIANLPTYAGGRTSIGGGVLKGIEVLSNFG--------NEDTTGGY 428 Query: 287 IIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD---R 341 II ++DGE +S I D + G ++ + +A + Sbjct: 429 IILVSDGEETSEPIIDDIWGDIELA------GVVIDTVAFSDKADKKLESLATKTGGLSF 482 Query: 342 FYSVQ-NSRKLHDAFLR 357 F+S ++ L+DAF Sbjct: 483 FFSGNDDTTALYDAFTS 499 >gi|218510669|ref|ZP_03508547.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli Brasil 5] Length = 784 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 53/300 (17%), Positives = 102/300 (34%), Gaps = 31/300 (10%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N K + + D + QD + R+ SL KD+ L+ Sbjct: 255 NAKINPVSLTVNLKAGFPLGDVNSSFHAVDIRQDSDQ-ARTISLKGGAVPADKDFELTWK 313 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG--LDMMMVLDVSLSMNDHFGPG 188 + F + L + + +++ V+D S SM+ Sbjct: 314 AALGKTPSAGLFREVKDGKTYLLAFVTPPTAPDAAAAPTKREVVFVIDNSGSMSGQ---- 369 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVR--SGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + A +S+ + + D NV+R + + + +V P + + Sbjct: 370 --SIEQARQSLALAISRLSK-DDRFNVIRFDDTMTDYFNGLVAASP--DNREKAITYVRG 424 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 L T+ P LE DA +A G + ++FLTDG I N++ L Sbjct: 425 LSADGGTEMLPALE-------DALRNQGPVASGAL---RQVVFLTDG-----AIGNEQQL 469 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 F A R A V+ +G+ + F+ A R F ++ ++ ++ + ++ Sbjct: 470 FQEISANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQVASRMGELFAKLQN 529 >gi|149635474|ref|XP_001506111.1| PREDICTED: similar to alpha-1 type VII collagen [Ornithorhynchus anatinus] Length = 2993 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 65/183 (35%), Gaps = 27/183 (14%) Query: 167 IGLDMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D++ ++D S S+ + F D L ++ + VR G V +S Sbjct: 35 YAADVVFLVDGSSSIGRNNFRMVRDFLEGLVLPFVNVV--------RDTGVRFGAVQYSD 86 Query: 226 KIVQTFPLAWGV--QHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L Q + + L + T++ GL + F H Sbjct: 87 DPRTEFALGTHASGQEVMRAVRELNYKRGNTRTGAGLRHVAEHFF-------HPQLARPG 139 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 K I +TDG + D ++ + K + V+A+G++ ++ K +SP + Sbjct: 140 VPKVCILITDGRSQD---DVEQGAL---KLKNQNVKVFAVGIKNAHEEELRKVASSPVEE 193 Query: 341 RFY 343 + Sbjct: 194 YHF 196 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 55/192 (28%), Gaps = 24/192 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F +TS D++ ++ + L ++ Sbjct: 1026 FSLIPIFGSIQGPVTSVTMHPRCQRGRADVVFLVHTTRDSAHSAEAVQGLLSRLVSALG- 1084 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIF--GSTTKSTP 257 P + ++ GLV++S + V PL + + + I L + S Sbjct: 1085 --------PLGPDAIQVGLVSYSYRPVPLLPLNRSHDHNTVLQHIRNLSYADPSGNAIGA 1136 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 + +A + D ++ L DG + +++ +AK G Sbjct: 1137 AISFAQRYMLDPGSHGRRANVPG-----VLVILADGSSGD------DAIGPARDAKAAGL 1185 Query: 318 IVYAIGVQAEAA 329 + + + Sbjct: 1186 AWWTLPLWTSPG 1197 >gi|170724979|ref|YP_001759005.1| hypothetical protein Swoo_0614 [Shewanella woodyi ATCC 51908] gi|169810326|gb|ACA84910.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908] Length = 503 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 38/249 (15%), Positives = 86/249 (34%), Gaps = 23/249 (9%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G+ +I LL +F ++ L +E + K +L + + L A + N+++ Sbjct: 10 KRQRGAAAIYLVFLLIPLFGMVFLALEGTRYIQKKNRLGDATEAASL--AVSMANRDDKG 67 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII--DDQHKDYNLS 128 + D+ ++NI E+ + + ++ D ++ Y ++ Sbjct: 68 YETQLAKDYISSYMRNI------KEISQ--VKVERKEDIDHYPMADGSFEDREYTQYRVT 119 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 A + + P + + D +D++ V D S SM Sbjct: 120 AKTEHTSWLHSDLIPSFKETETLANRALARAYPEYLGDRDVDIVFVSDFSGSMKGS---- 175 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVV--RSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 ++ +I E+ + I D + R LV ++ ++V+ G ++ Sbjct: 176 --RINSLKDAITEISNEILVPRDGETEIRNRIALVPYNMRVVEGDS---GRSVCMTQLKY 230 Query: 247 LIFGSTTKS 255 T S Sbjct: 231 RNPSGKTGS 239 >gi|331647928|ref|ZP_08349020.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] gi|330912098|gb|EGH40608.1| hypothetical protein ECAA86_02478 [Escherichia coli AA86] gi|331043652|gb|EGI15790.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] Length = 580 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 68/192 (35%), Gaps = 21/192 (10%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ +++ ++D S SM ++L + S++ ++ ++ ++ V +G Sbjct: 213 KSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDNIAIVTYAG-- 265 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+I I I+ L +T GLE AY + KG Sbjct: 266 --DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FVKGGV 317 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--S 338 + I+ TDG+ + D K + + G + +GV + + + A Sbjct: 318 NR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVG 374 Query: 339 PDRFYSVQNSRK 350 + + + Sbjct: 375 NGNYSYIDTLSE 386 >gi|268532304|ref|XP_002631280.1| C. briggsae CBR-CLEC-61.1 protein [Caenorhabditis briggsae] gi|268532308|ref|XP_002631282.1| C. briggsae CBR-CLEC-61.2 protein [Caenorhabditis briggsae] gi|187036878|emb|CAP23543.1| CBR-CLEC-61.2 protein [Caenorhabditis briggsae AF16] gi|187036880|emb|CAP23545.1| CBR-CLEC-61.1 protein [Caenorhabditis briggsae AF16] Length = 403 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 77/237 (32%), Gaps = 13/237 (5%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + + + + FI C +S V ++++ LD++ V+D S+ M Sbjct: 3 SKKFLLSFLIGFILCLGIPVIQASSCKDSYVDRVCGEDQTNLWLDIVCVVDNSVGMT--- 59 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEK 243 G+ + + S+ + P+ R G+VT++ + L + K Sbjct: 60 NAGLASVAASISSLFVDGQQLGIQPNNPRTTRIGIVTYNQEAQVVADLNNFTSVDELTNK 119 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + ++ +T L D E+ Y+K +I T + + Sbjct: 120 VYSILNKVSTSQDSYLHSGLEAANDLLEQQSFNTARGH-YQKLVIVYT---SEYKGTGPQ 175 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNCASPDRFYSVQNSRKLHDAFLR 357 + L K+ + + + F L A+P ++ +N Sbjct: 176 DPLPLAMRMKQT-VSIATVAYGQDDILGFLAELTKIATPGYNFTNENGENTISELRS 231 >gi|71898007|ref|ZP_00680212.1| von Willebrand factor, type A [Xylella fastidiosa Ann-1] gi|71732251|gb|EAO34306.1| von Willebrand factor, type A [Xylella fastidiosa Ann-1] Length = 941 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 56/210 (26%), Gaps = 26/210 (12%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 K A++ Y P + H I + + + +D+S SM+ Sbjct: 107 KGGKYGAMNPYPRPASYKIRRILKGWDHDACWYPEKAAIGMQMAPSVAVYFAIDLSGSMD 166 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWG 236 G +L ++ LD + V L F L A G Sbjct: 167 YAGSNGRSRLDNMKTALNAALDQLGQSIASGTAVDIMLAGFGDAPDHRQTLLRRNCTAQG 226 Query: 237 VQHIQEKINRLIFGSTT---KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + ++ + T T + Y + F+TDG Sbjct: 227 IAELKSWVAARQALYGTYFPAGTMDMPSFYAAASSNA-------------VRVAFFMTDG 273 Query: 294 ENSSPNIDNKES----LFYCNEAKRRGAIV 319 E P+ ++ + + G + Sbjct: 274 EPDPPSATLAQAARADVDQVAHLRCYGITI 303 >gi|163845716|ref|YP_001633760.1| magnesium chelatase [Chloroflexus aurantiacus J-10-fl] gi|222523421|ref|YP_002567891.1| magnesium chelatase [Chloroflexus sp. Y-400-fl] gi|163667005|gb|ABY33371.1| Magnesium chelatase [Chloroflexus aurantiacus J-10-fl] gi|222447300|gb|ACM51566.1| Magnesium chelatase [Chloroflexus sp. Y-400-fl] Length = 696 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 49/133 (36%), Gaps = 18/133 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIV 228 + V+D S SM +++ ++ +L + GLV+F Sbjct: 513 AVCFVVDASWSMAAE-----ERMQATKAAVLSLLR-----DAYQRRDQVGLVSFQRDYAR 562 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL V+ Q ++ + G T + G+ A+ + A+ + + ++ Sbjct: 563 VLLPLTNSVELAQRRLQSMPTGGKTPLSRGMLTAFELLERARRRDA-------EVVPLMV 615 Query: 289 FLTDGENSSPNID 301 LTDG+ + D Sbjct: 616 LLTDGQANVSISD 628 >gi|83643000|ref|YP_431435.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83631043|gb|ABC27010.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 733 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 75/207 (36%), Gaps = 39/207 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + +++ V+D S SM + A ++ + LD + R ++ F+S Sbjct: 354 LPRELIWVVDTSGSMEGV------SIQQARDAVLQALDTLTPRD------RFNVIEFNSH 401 Query: 227 IVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + FP A +Q + + L T+ L+ A + Sbjct: 402 ARKLFPQAVPAQERALQQARRFVRGLKADGGTEIAEALDRALSDAAPEG----------- 450 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 Y + ++FLTDG + K+ +++ ++ +G+ F++ A R Sbjct: 451 -YVRQVVFLTDGSVGNELALFKQIDQQLGDSR-----LFTVGIGPSPNRFFMRKAAQFGR 504 Query: 342 --FYSVQNSRKLHDAFLRIGKEMVKQR 366 + + ++ ++ D +I + R Sbjct: 505 GAYSHINDTAEVSD---KIAELTAALR 528 >gi|48428278|sp|P61622|ITA11_MOUSE RecName: Full=Integrin alpha-11; Flags: Precursor gi|35193068|gb|AAH58716.1| Integrin alpha 11 [Mus musculus] Length = 1188 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 83/243 (34%), Gaps = 37/243 (15%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + T C+ ++ + + +D+++VLD S S+ P ++ Sbjct: 132 SYYTTGMCSRANSNFRFSKTVAPALQRCQTYMDIVIVLDGSNSI----YPWVE----VQH 183 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR---LIFGSTTK 254 + +L P ++ G+V + V F L + +++ + + T+ Sbjct: 184 FLINILKKFYIGPGQ---IQVGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTE 239 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + + G KK +I +TDGE + D+ + ++++ Sbjct: 240 TRTAFGIEF------ARSEAFQKGGRKGAKKVMIVITDGE----SHDSPDLEKVIRQSEK 289 Query: 315 RGAIVYAIGVQAEAADQ------FL---KNCAS---PDRFYSVQNSRKLHDAFLRIGKEM 362 YA+ V + FL K AS F++V + L D +G + Sbjct: 290 DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGDRI 349 Query: 363 VKQ 365 Sbjct: 350 FSL 352 >gi|308472943|ref|XP_003098698.1| hypothetical protein CRE_04174 [Caenorhabditis remanei] gi|308268298|gb|EFP12251.1| hypothetical protein CRE_04174 [Caenorhabditis remanei] Length = 416 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 73/213 (34%), Gaps = 14/213 (6%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 ++ ++P + +++ LD++ V+D S M G+ + S+ I Sbjct: 17 SAQYSPQSYVDRQCGTDLNNLWLDVIAVVDNSHGMT---NGGVQSVAANIASVFSSGTRI 73 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 S R GLVT++S L + + + + + + L N Sbjct: 74 GSNSTEPRTTRVGLVTYNSAAKLDADLNKFQDLDGLYNGVFKDLSDVVDTTDSFLATGLN 133 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + + + D YKK II S +D + N K G ++ + Sbjct: 134 AAEELLQSQS-LNTTRDHYKKVIIVYASEYKGSGELDP---VPVANRLKGSGVVIVTVAY 189 Query: 325 QAEAADQFLK---NCASPDRFY--SVQNSRKLH 352 + L+ N ASP Y + N+ L Sbjct: 190 DQGGDEGLLRDLANIASPGFAYSNAPNNAGNLV 222 >gi|254444377|ref|ZP_05057853.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] gi|198258685|gb|EDY82993.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] Length = 808 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 66/197 (33%), Gaps = 42/197 (21%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + G D + LDVS SM KL ++ K+I + R +V F++ Sbjct: 279 EGGADFVFALDVSGSM-------QGKLHTLASGVK------KAIGQLKPEDRFRVVAFNN 325 Query: 226 KIVQTFPLAWGVQHIQE--------KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F L G E ++++L T G+ A ++ + Sbjct: 326 TA---FDLNRGWVSATEANLRETFARLDQLNSNGGTNVYAGVHLALERLDADRVAT---- 378 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK-NC 336 +I +TDG + +D K ++ Y + + ++ C Sbjct: 379 ---------LILVTDGVTNQGIVDPKAFYKL---MHKQDLRFYGFLLGNSSNWPLMQLMC 426 Query: 337 -ASPDRFYSVQNSRKLH 352 AS + +V NS + Sbjct: 427 DASGGSYRAVSNSDDII 443 >gi|159900723|ref|YP_001546970.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893762|gb|ABX06842.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 421 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 77/208 (37%), Gaps = 33/208 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +++ VLD S SM DK+ + + L ++ V+ +V F Sbjct: 41 QMPVNVSFVLDHSGSMKG------DKMRCVREATQRALGLMGPQDIVS------VVIFDH 88 Query: 226 K--IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + + V +Q ++ ++ TK P LE A N+I ++ + Sbjct: 89 RRETIISAQPVRNVAALQAEVGKIKDAGGTKIAPALEAALNEI--------RRSQNANTI 140 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD- 340 + II LTDG+ ++ L E + + A+GV + + L A S Sbjct: 141 SR-IILLTDGQTEG----ERDCLRLAEEIGKASVPLTALGVGDDWNEDLLIEMANRSGGV 195 Query: 341 -RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 ++S N + F ++ + Sbjct: 196 AEYFS--NPNDIASFFQGAVQQAQSAVV 221 >gi|27366553|ref|NP_762080.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] gi|27358119|gb|AAO07070.1| Protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] Length = 426 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 74/224 (33%), Gaps = 19/224 (8%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G I+ +LP++ I+M ++ + + AK+ + + L + N Sbjct: 4 KKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLALIASPKEGDEKN 63 Query: 71 NGKKQK--NDFSYRIIKNIWQTDF-RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 QK + + + F R ++G Q + T + +H + Sbjct: 64 QEYAQKIVDHYIPDNKGEVVARVFHRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDSWIS 123 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-G 186 + F + P LD+ ++D+S SM + + G Sbjct: 124 YNDGEMGLTKDFEVMGTSTSRKFLPQP--------------LDIYFIIDMSGSMVNPWGG 169 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 G K V +I ++D ++ R ++ F V+ Sbjct: 170 SGKTKYDVVADTINRIVDDLREFKTDRKS-RVAVIGFHHTAVKQ 212 >gi|156383257|ref|XP_001632751.1| predicted protein [Nematostella vectensis] gi|156219811|gb|EDO40688.1| predicted protein [Nematostella vectensis] Length = 161 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 41/177 (23%), Positives = 68/177 (38%), Gaps = 24/177 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+ VLD S S+N + M++I+KS + GLV +SS Sbjct: 1 AVDLAFVLDGSTSIN---NADPGNFQLLKNF---MINIVKSFKISSERTHVGLVLYSSFT 54 Query: 228 VQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F I + IN T++ L+ A +++F A + Sbjct: 55 QLKFNFDKYSDSASIVKAINTTDYPKGGTRTGEALKMAKSQLFGASMR---------SVP 105 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K +I LTDG +S ++L K G +++A+GV + L AS + Sbjct: 106 KVLIVLTDGRSSDKVEAPSKAL------KDEGVVIFAVGVGDQIDPSELNVMASDSK 156 >gi|126322632|ref|XP_001381040.1| PREDICTED: similar to collagen, type XIV, alpha 1 [Monodelphis domestica] Length = 1892 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 69/200 (34%), Gaps = 33/200 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ D D + + + I + + +V F+ Sbjct: 1179 DLVFMVDGSWSIGD------DNFNKIINFLYNTVGALDKI--GVDGTQVSIVQFTDDPRT 1230 Query: 230 TFPLAWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L ++ + N G TK+ L++ + +F A+ Sbjct: 1231 EFML--NTYKTKDTLLEGIKNLSYKGGNTKTGKALKHVRDALFTAEGGTRRGI------P 1282 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RF 342 K I+ +TDG + L G ++AIGV + + + P Sbjct: 1283 KVIVVITDGRSQDDVNKISRELQL------EGISIFAIGVADADYAELVSIGSQPSARHV 1336 Query: 343 YSVQNSRKLHDAFLRIGKEM 362 + V + DAF +I E+ Sbjct: 1337 FFVDD----FDAFKKIEDEL 1352 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 44/225 (19%), Positives = 83/225 (36%), Gaps = 26/225 (11%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI- 205 + P L T K+ D+++++D S S+ R +R L+ Sbjct: 282 GNGSKPSLPTEGNTFVCKTPAIADIVILVDGSWSIGRF----------NFRLVRLFLENL 331 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAY 263 + + N R GL +S + L + + + + + L + T A Sbjct: 332 VTAFNVGANKTRIGLAQYSGDPRIEWHLNTFSTKDAVIDAVRNLPYKGGNTLTG---LAL 388 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N IF+ K E A+ K I +TDG++ I ++L + G ++AIG Sbjct: 389 NYIFENSFKPEAGARTG--VSKIGILITDGKSQDDIIPASKTL------RDSGVELFAIG 440 Query: 324 VQAEAADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 V+ + + + PD Y+V +H + K + + Sbjct: 441 VKNADVAELQEIASEPDSTHVYNVAEFDLMHTVVEGLTKTVCTRV 485 >gi|311268857|ref|XP_003132237.1| PREDICTED: collagen alpha-1(VII) chain-like [Sus scrofa] Length = 2945 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YSADIVFLLDGSSSIGRS------NFREVRGFLEGLVLPFSRAASA-QGVRFAAVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L G + I L + G T++ + + +++F L +A+ Sbjct: 88 PRTEFSLDTLGSGGDVIRAIRELSYKGGNTRTGAAILHVADQVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDAAAQRL------KEQGVKLFAVGIK-NADPEELKRIASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|327441394|dbj|BAK17759.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 961 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 59/164 (35%), Gaps = 22/164 (13%) Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGL 259 +++ + D ++ R G V F+ ++ + L + + K+N + + GL Sbjct: 70 EVLSLMDDASSKDRFGFVGFNKEVTKELALTNNIVQAKSKLNEFGKNISPYMANDLSKGL 129 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E A +++ K K I+ +T G +I N+ S +A + Sbjct: 130 EKAVDELT----------KKSTSNDKVIVIMTVG----NSIYNEVSKKLAAKAYEEDITI 175 Query: 320 YAIGVQAE--AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG 359 + I A FL A + + N+ L D ++ Sbjct: 176 HTISFGDPLYADAPFLTEIAKLTGGNYTHSPNAAFLKDVLSKLS 219 >gi|51467747|ref|NP_001003823.1| cochlin [Danio rerio] gi|26788036|emb|CAD58748.1| novel protein similar to human coagulation factor C homolog (cochlin, COCH) [Danio rerio] Length = 553 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 63/180 (35%), Gaps = 37/180 (20%) Query: 168 GLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D+ ++D S S+ D +F +D L RS + R G + F+ Sbjct: 370 SVDLGFLIDGSSSVGDGNFRLVLDLLVSIARS----------FDISDIGSRIGAIQFT-- 417 Query: 227 IVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 Q + +++ N L T + + +A +F + Sbjct: 418 YDQRMEFNFNDHVLKD--NALRALQKIPYMSGGTATGDAINFAVRSLFKPR--------- 466 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +K++I +TDG++ + A+R G VYA+GV + + P Sbjct: 467 SSSNRKFLIIITDGQSYD------DVRVPAMAAQREGITVYAVGVAWAPMEDLKAMASEP 520 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 58/171 (33%), Gaps = 24/171 (14%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 ++ D +DM ++LD S + G + + + ++ ++K + G+ Sbjct: 161 ATAHKDCPVDMALLLDSS------YNIGQRRFNLQKNFVSKLATMLKVGTQGPH---VGV 211 Query: 221 VTFSSKIVQTFPLAWGV--QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V S F L + + I + G T + + + F Sbjct: 212 VQTSETPRTEFYLTNYTTAKDVTFAIKEIPYIGGNTNTGKAILHTVRNFFSP------DF 265 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 Y + I+ DG P+ + +E+ A+ G ++ + V + Sbjct: 266 GVRRGYPRVIVVFVDG---WPSDNVEEAAIL---ARESGINIFFVSVAKPS 310 >gi|291402773|ref|XP_002718214.1| PREDICTED: integrin, alpha 11 [Oryctolagus cuniculus] Length = 1188 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 75/215 (34%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 V F L + +++ + + T++ + G Sbjct: 209 DAVHEFHL-NDYKSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLKKVIRQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|91215374|ref|ZP_01252345.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] gi|91186326|gb|EAS72698.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] Length = 689 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 66/200 (33%), Gaps = 32/200 (16%) Query: 155 ITSSVKISSKSDIGL---DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML---DIIKS 208 V+ S ++ + + ++++D S SM G M + A+ I L D Sbjct: 259 FGLVVEPESNANTEVIEKNFVLIIDSSGSMRG--GNKMAQAKEASEFIVNNLNIGDNFNV 316 Query: 209 IPDVNNVV--RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 I NN+V + LV ++ + + I ++ T + L A N+ Sbjct: 317 IDFDNNIVLFQPELVEYNIQ---------NSNAALDFIENIVALGATNISESLVTAINQF 367 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA---KRRGAIVYAIG 323 E + I+F TDG + + + L + ++ G Sbjct: 368 EAGAEDKAN----------IIVFFTDGGATEGETNTQNILQLAEDTVNQIETEIFLFTFG 417 Query: 324 VQAEAADQFLKNCASPDRFY 343 + + L A + + Sbjct: 418 IGEDVTTDLLTLLAVQNNGF 437 >gi|332206625|ref|XP_003252399.1| PREDICTED: collagen alpha-1(XXVIII) chain [Nomascus leucogenys] Length = 1129 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 62/179 (34%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + ++ ++ + FSS Sbjct: 47 IDIVFIVDSSES------SKIVLFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNATRLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K + +TDG + N D + +A+ G IG+ + L+ + Sbjct: 153 -VKVALLMTDGIDHPKNPDVQS---ISEDARISGISFITIGLSTVVNEAKLRLISGDSS 207 >gi|260785816|ref|XP_002587956.1| hypothetical protein BRAFLDRAFT_87344 [Branchiostoma floridae] gi|229273111|gb|EEN43967.1| hypothetical protein BRAFLDRAFT_87344 [Branchiostoma floridae] Length = 1412 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 32/164 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+++VLDVS S D+ +A + +D + +N +R G++ ++ + Sbjct: 389 AIDIVLVLDVSSS------IPQDQFLLARDFMMAFVD-CAAFQGLN--IRIGVICYNCEA 439 Query: 228 VQTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F P+ G+ + + + G T++ + Y E Sbjct: 440 KTYFGLQPIYNGMSYSIHYV--MYKGGETRTGHAIY--YMTCTSDFEAK----------P 485 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + + LTDG + + A+ G +YA+ + Sbjct: 486 RVAVILTDGRSGD------NEVAEAENARDMGITLYAVRIGDPR 523 Score = 39.4 bits (90), Expect = 0.87, Method: Composition-based stats. Identities = 18/109 (16%), Positives = 45/109 (41%), Gaps = 20/109 (18%) Query: 220 LVTFSSKIVQTF---PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +++++ + F P+ G+ + E + R G T++ + + + H Sbjct: 107 VISYTCEAHTYFSLTPITMGMSYEIEHLMRGDGGGETRTGHAIYH-----------MRHT 155 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 +K + + LTDG++ +++ +A+ G +YA+ V Sbjct: 156 SKFGAESHHAAVILTDGQSDD------DAIAEAEDARDAGIDLYAVVVG 198 >gi|118496822|ref|YP_897872.1| hypothetical protein FTN_0208 [Francisella tularensis subsp. novicida U112] gi|194324497|ref|ZP_03058269.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|254372186|ref|ZP_04987678.1| TPR domain protein [Francisella tularensis subsp. novicida GA99-3549] gi|118422728|gb|ABK89118.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella novicida U112] gi|151569916|gb|EDN35570.1| TPR domain protein [Francisella novicida GA99-3549] gi|194321332|gb|EDX18818.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 332 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 78/235 (33%), Gaps = 42/235 (17%) Query: 134 EMPFIFCTFPWCAN-SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P IF S P V + + ++ LDVS SM+ +L Sbjct: 58 LVPLIFLLIWLVTIFSLAGPTWKYKDVPVY---QKNISRVIALDVSQSMDTTDVSP-SRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 A I ++L IK G++ FSS+ PL I+ + + Sbjct: 114 ERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTIENLVTVINSDIV 166 Query: 251 --STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 L+ + I A + II +TD + + +++ Sbjct: 167 PVQGHNIYKALKKSAQLIEQAGVQQGQ-----------IILITD------SSPSPQAISQ 209 Query: 309 CNEAKRRGAI--VYAI-----GVQAEAADQFLKNCASPDRFYSVQ--NSRKLHDA 354 + ++G VYAI G+ + +LK+ +++ + +L A Sbjct: 210 AKQLAQQGIKTDVYAIGTPMGGIAKDEKGNYLKDSQGNIQYFGIDLSKLEELATA 264 >gi|23500008|ref|NP_699448.1| norD protein [Brucella suis 1330] gi|81751586|sp|Q8FX38|NORD_BRUSU RecName: Full=Protein norD gi|23463592|gb|AAN33453.1| norD protein [Brucella suis 1330] Length = 633 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 69/179 (38%), Gaps = 33/179 (18%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + KK ++ LTDG+ + ++S EA+ +G V+A+ V EA+ Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAHL 602 >gi|167041680|gb|ABZ06425.1| putative von Willebrand factor type A domain protein [uncultured marine microorganism HF4000_009L19] Length = 317 Score = 53.3 bits (126), Expect = 5e-05, Method: Composition-based stats. Identities = 44/232 (18%), Positives = 87/232 (37%), Gaps = 54/232 (23%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ + +++VLD S S+ G + +L +A ++ E L R GLVTFS Sbjct: 81 EDVPITLLLVLDTSGSV---VGAPLAQLLMAAEAVAEAL---------RPDDRVGLVTFS 128 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + + R+ T A+ + + Sbjct: 129 HNVRVVVEPPSLPASLPDALRRVRATGGTALYDATFAAF------------ALRERTVGR 176 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI---GVQAEA------------- 328 ++ +DG++++ +D ++ L N A+R +VYA+ GV ++ Sbjct: 177 TLMLVFSDGDDTTSWLDPRDVL---NTAQRSDVVVYAVNLAGVAPDSWQERQGRRSARRW 233 Query: 329 --------ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 Q+L A + + Q++ +L AF R+ E ++R L Sbjct: 234 FATEPHLFRGQYLPVLAEETGGSVFVAQDTGRLRAAFARVVDE-FRRRYLLT 284 >gi|149043685|gb|EDL97136.1| procollagen, type VI, alpha 2, isoform CRA_b [Rattus norvegicus] Length = 369 Score = 53.3 bits (126), Expect = 5e-05, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 71/213 (33%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D +++ VLD S S M + + L + V R G Sbjct: 46 PEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLEQVALSWRYG 105 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I +H+ + Sbjct: 106 GLHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGR 159 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G + + + +TDG + + A+ G ++A+ +Q L++ A+ Sbjct: 160 GVVN---FAVVITDGHVTGNPCGGIK--MQAERAREEGIRLFAVAPNRNLNEQGLRDIAN 214 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 N + I ++ + + I K Sbjct: 215 TPHELYRNNYATMRPDSTEIDQDTINRIIKVMK 247 >gi|281427229|ref|NP_001094211.1| collagen, type VI, alpha 2 [Rattus norvegicus] gi|149043684|gb|EDL97135.1| procollagen, type VI, alpha 2, isoform CRA_a [Rattus norvegicus] Length = 1027 Score = 53.3 bits (126), Expect = 5e-05, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 71/213 (33%), Gaps = 14/213 (6%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D +++ VLD S S M + + L + V R G Sbjct: 46 PEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLEQVALSWRYG 105 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS ++ P + + + F T + L +I +H+ + Sbjct: 106 GLHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGR 159 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G + + + +TDG + + A+ G ++A+ +Q L++ A+ Sbjct: 160 GVVN---FAVVITDGHVTGNPCGGIK--MQAERAREEGIRLFAVAPNRNLNEQGLRDIAN 214 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 N + I ++ + + I K Sbjct: 215 TPHELYRNNYATMRPDSTEIDQDTINRIIKVMK 247 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 620 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 676 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AYN++ + + Sbjct: 677 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRRQKTRV--- 733 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 734 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 768 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 64/180 (35%), Gaps = 17/180 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ +LD S + + + + L + + D N R L+ + Sbjct: 836 TQRPVDIVFLLDGSERLGEQNFYKARRF---VEEVSRRLTLARRDDDPLNA-RMALLQYG 891 Query: 225 SKIVQT--FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ Q FPL + V I E + R S + G+ +A N + Sbjct: 892 SQNQQQVAFPLTYNVTTIHEALERTTYLNSFSHVGTGIVHAINNVVRGARGGARRHAELS 951 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +FLTDG + +++ + +++ + + V + L + DR Sbjct: 952 -----FVFLTDGVTGNDSLEES-----VHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDR 1001 >gi|190336734|gb|AAI62194.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] gi|190339304|gb|AAI62181.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] Length = 553 Score = 53.3 bits (126), Expect = 5e-05, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 63/180 (35%), Gaps = 37/180 (20%) Query: 168 GLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D+ ++D S S+ D +F +D L RS + R G + F+ Sbjct: 370 SVDLGFLIDGSSSVGDGNFRLVLDLLVSIARS----------FDISDIGSRIGAIQFT-- 417 Query: 227 IVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 Q + +++ N L T + + +A +F + Sbjct: 418 YDQRMEFNFNDHVLKD--NALRALQKIPYMSGGTATGDAINFAVRSLFKPR--------- 466 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +K++I +TDG++ + A+R G VYA+GV + + P Sbjct: 467 SSSNRKFLIIITDGQSYD------DVRVPAMAAQREGITVYAVGVAWAPMEDLKAMASEP 520 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 62/194 (31%), Gaps = 24/194 (12%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 T A ++ ++ D +DM ++LD S + G + + Sbjct: 138 SSATVASGAAKKPVKKIVKKPPPATAHKDCPVDMALLLDSS------YNIGQRRFNLQKN 191 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV--QHIQEKINRLI-FGSTTK 254 + ++ ++K + G+V S F L + + I + G T Sbjct: 192 FVSKLATMLKVGTQGPH---VGVVQTSETPRTEFYLTNYTTAKDVTFAIKEIPYIGGNTN 248 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + + + F Y + I+ DG P+ + +E+ A+ Sbjct: 249 TGKAILHTVRNFFSP------DFGVRRGYPRVIVVFVDG---WPSDNVEEAAIL---ARE 296 Query: 315 RGAIVYAIGVQAEA 328 G ++ + V + Sbjct: 297 SGINIFFVSVAKPS 310 >gi|91776250|ref|YP_546006.1| von Willebrand factor, type A [Methylobacillus flagellatus KT] gi|91710237|gb|ABE50165.1| von Willebrand factor, type A [Methylobacillus flagellatus KT] Length = 326 Score = 53.3 bits (126), Expect = 5e-05, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 66/220 (30%), Gaps = 37/220 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G +++V+D S SM++ F +I + + G+VTFS+ Sbjct: 77 GQGAQLVLVIDRSASMDEAFSGAETSGVAGESKAAAAERLITHFVNERSNDMFGMVTFSN 136 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + PL + I I + T GL A L K D Sbjct: 137 SAMHALPLTDSREAILAAIRAAGGAALFQTNIGSGLTAA----------LAQFDKTPDSG 186 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------------- 329 + II L+DG ++ + +Y I ++ A Sbjct: 187 SRAIILLSDGGGRMGANTQQKIRDWLERM---NVTLYWIVLRQPGATSIFDTSYVPPEDN 243 Query: 330 --------DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 F K + + Y ++ L A I ++ Sbjct: 244 PLPPALELHDFFKTLRTGYQAYEAEDPTSLAAAIEDINRK 283 >gi|294653580|ref|NP_714599.2| von Willebrand factor type A domain-containing protein [Leptospira interrogans serovar Lai str. 56601] gi|293630706|gb|AAN51614.2| BatB [Leptospira interrogans serovar Lai str. 56601] Length = 318 Score = 53.3 bits (126), Expect = 5e-05, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 56/164 (34%), Gaps = 17/164 (10%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +S G+D++ ++DVSLSM +L + ML + R G Sbjct: 53 EKKEESFKGVDILFLVDVSLSMQ-AIDSSPTRLAKFKEVLLRMLPSLSGN-------RFG 104 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ F+ P+ V + + L L A+ K + Sbjct: 105 MIVFAGSPFLYCPMTTDVSAFSDYVRGLDVDMVGDRGTDLSQAFTK------AEALLRSE 158 Query: 280 HDDYKKYIIFLTDGEN-SSPNID--NKESLFYCNEAKRRGAIVY 320 + +I +TDGE+ + P + + G IVY Sbjct: 159 KVFRNRILILVTDGEDQNDPQAISFPASFQVWAAGTESGGPIVY 202 >gi|268608768|ref|ZP_06142495.1| hypothetical protein RflaF_04637 [Ruminococcus flavefaciens FD-1] Length = 453 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 30/241 (12%), Positives = 67/241 (27%), Gaps = 39/241 (16%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 ++ + + + + + + +++LD S SM P + A + + + Sbjct: 159 PGKVTVTVIADAVIPVVNDNSLETYTVLILDASGSMQG--APMTAQKEAAKKFCEDTIGT 216 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 K+ ++T S + + I + +T + L A Sbjct: 217 NKNSNHK-----FAVITLDSGSKTLTDFTNDIDELDSAIAKTTAYGSTNYSAALRNAAEL 271 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDG--------ENSSPNIDNKESLFYCNEA----- 312 + I+ +DG + + Y N A Sbjct: 272 LSKVSADAVRN----------IVLCSDGNPYGGEEKSTGKYTLSDYSDYEYANAAYDIAQ 321 Query: 313 -KRRGAIVYAIGV-------QAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMV 363 ++ +Y +G + +LK+ AS + V L F + V Sbjct: 322 EIKKDYEIYTLGFFHSLSGEDLDFGRTYLKDVASYDSNYAEVNKVDDLQKVFADVAGNAV 381 Query: 364 K 364 Sbjct: 382 S 382 >gi|291395817|ref|XP_002714337.1| PREDICTED: complement factor B-like [Oryctolagus cuniculus] Length = 764 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 75/211 (35%), Gaps = 34/211 (16%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD + SM + G A R + +++ + S R GLVT+++ Sbjct: 261 IVLDPAGSMNIYLVLDGSDSIGASNFTGAKRCLVNLIEKVASYGVRP---RYGLVTYATY 317 Query: 227 IVQ----TFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + P + + EK+N++ + T + L YN + + Sbjct: 318 PNVLVRVSDPKSSDANWVTEKLNQISYEDHKLKTGTNTKRALVEVYNMMSWPGDVP---P 374 Query: 278 KGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRG--------AIVYAIGVQA 326 +G + + II +TDG ++ P E N K R V+ +G Sbjct: 375 EGWNRTRHVIILMTDGLHNMGGDPVTVINEIRDLLNIGKDRKNPREDYLDVYVFGVGPLV 434 Query: 327 EAA--DQFLKNCASPDRFYSVQNSRKLHDAF 355 E A + + + V++ L D F Sbjct: 435 EPANINALASKKENEQHVFRVKDMEHLEDVF 465 >gi|311252837|ref|XP_003125292.1| PREDICTED: vitrin-like [Sus scrofa] Length = 595 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 69/198 (34%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 412 DIGFVIDGSSSV------GTGNFRTVLQFVANL---SKEFDISDTDTRVGAVQYTYEQRL 462 Query: 230 TFPLAWGVQH--IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + I R+ + T + + YA ++F K + +K Sbjct: 463 EFGFDQYTTKPDVLNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKRKL 513 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 +I +TDG + + A +G I YAIGV A ++ P D + Sbjct: 514 MILITDGRSYD------DVRIPAMVAHHKGVITYAIGVAWAAQEELEIIATHPARDHAFF 567 Query: 345 VQNSRKLHDAFLRIGKEM 362 V L+ + +I + + Sbjct: 568 VDEFDNLYKSVPKIIQNI 585 >gi|307943468|ref|ZP_07658812.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773098|gb|EFO32315.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 479 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 19/73 (26%), Positives = 32/73 (43%), Gaps = 6/73 (8%) Query: 303 KESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRI 358 ++L C K + ++Y + A+ +K+CAS PD+FY ++ L AF I Sbjct: 409 TQALALCEAMKEQDVVIYTVYFETTGAKFGKDLMKSCASDPDKFYLAEDRDGLKAAFSAI 468 Query: 359 GKEMVKQRILYNK 371 + I K Sbjct: 469 A--IDNLSIYLAK 479 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 45/233 (19%), Positives = 87/233 (37%), Gaps = 34/233 (14%) Query: 33 GLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDF 92 G I+ + ++K+ LD S L A K+ + ++ + Q N+ + D Sbjct: 22 GSGIDLTSALNARSKMANALDASALKLAGKL-SVAKLSDDEIQAG-LEKMFTANLSRFDL 79 Query: 93 R-NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA 151 + + L E F D L + D K + + Sbjct: 80 KASALSELEFEVDWTKGI----LDVWSDVSVKTHFIGL--------------GGLGPEKL 121 Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + +TS V +S+ L++ +VLDV+ SM+ + L A++ + E L + ++ Sbjct: 122 DVGVTSRVSFASQ---ALELALVLDVTGSMDGD----ISSLKEASQLLFEAL-VPENAGR 173 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAY 263 + +R +V +S + AW V + Q + + T+ P AY Sbjct: 174 HDQRIRVSIVPYSQGVNLGAK-AWKVTNRQSDSSNCVA---TRGGPNAFTDAY 222 Score = 37.9 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 9/72 (12%), Positives = 22/72 (30%), Gaps = 9/72 (12%) Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF-------DAKEKLEHIAKGHDDYK 284 PL + + ++ L T G+ + + + + Sbjct: 266 PLTNSRKTLLAAVDALEAQGGTAGQAGIAWGWKALSWTWHPFWPSGSDPAKSF--SSQVG 323 Query: 285 KYIIFLTDGENS 296 K + +TDG+ + Sbjct: 324 KAAVIMTDGDFN 335 >gi|160858155|emb|CAP19997.1| collagen type VI alpha 5 precursor [Homo sapiens] Length = 591 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 74/232 (31%), Gaps = 29/232 (12%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +T L I+ + + S ++ F + + T + + + Sbjct: 376 ANNTQLEEIVSYPPEQTISTLKSYADLETYSTKFLKKLQNEIWSQISTYAEQRNLDKTGC 435 Query: 169 LD-----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D + ++D S S+ + R + E+ ++ PD VR G+V + Sbjct: 436 VDTKEADIHFLIDGSSSIQKK------QFEQIKRFMLEVTEMFSIGPDK---VRVGVVQY 486 Query: 224 SSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S F + I + N T + L+Y I + + Sbjct: 487 SDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK---- 542 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 Y+I LTDG ++ + + V+A+G+ A + Sbjct: 543 --VPCYLIVLTDGMSTD------RVVEPAKRLRAEQITVHAVGIGAANKIEL 586 >gi|73954248|ref|XP_546328.2| PREDICTED: similar to integrin, alpha 1 precursor [Canis familiaris] Length = 1182 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 79/227 (34%), Gaps = 37/227 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ + LD+++VLD S S + T + ++L+ + P Sbjct: 160 VVNSIAPVRECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLERMDIGPKQTQ 211 Query: 215 VVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAK 270 G+V + + F L + + N++I T + G++ A + F Sbjct: 212 ---VGIVQYGENVTHEFNLNKYSSTEEVLVAANQIIQRGGRQTMTALGIDTARKEAFTEA 268 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K K ++ +TDGE S N + + C ++I + Sbjct: 269 RGARRGVK------KVMVIVTDGE-SHDNHLLNKVIQDCE---DENIQRFSIAILGSYNR 318 Query: 331 ---------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 319 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFAL 365 >gi|313239130|emb|CBY14106.1| unnamed protein product [Oikopleura dioica] Length = 509 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 71/202 (35%), Gaps = 26/202 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + LD ++LD S S G + + +L VR GL+ Sbjct: 314 TKDPNYKLDFFIILDQSSS------IGNENFQKMKNFVINLLMQSNLGQHG---VRVGLI 364 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAK 278 T++ + F + Q ++ +++ T + +++ + + + E + Sbjct: 365 TYNRRPTLRFHMNEMENHQQAINAVDSIVYEGRGTNTGAAIKW----VVENAFRPEFGDR 420 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K ++ +TDG P + +S ++ VYA+G+ + L AS Sbjct: 421 PEVPNK--VLLITDGRARDPPVLKVQSGRLQEQS-----TVYALGIGKQIDYVELNRIAS 473 Query: 339 P---DRFYSVQNSRKLHDAFLR 357 V N L AF + Sbjct: 474 DPSERHVLYVDNFSFLERAFQK 495 >gi|313214907|emb|CBY41128.1| unnamed protein product [Oikopleura dioica] Length = 509 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 71/202 (35%), Gaps = 26/202 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + LD ++LD S S G + + +L VR GL+ Sbjct: 314 TKDPNYKLDFFIILDQSSS------IGNENFQKMKNFVINLLMQSNLGQHG---VRVGLI 364 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAK 278 T++ + F + Q ++ +++ T + +++ + + + E + Sbjct: 365 TYNRRPTLRFHMNEMENHQQAINAVDSIVYEGRGTNTGAAIKW----VVENAFRPEFGDR 420 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K ++ +TDG P + +S ++ VYA+G+ + L AS Sbjct: 421 PEVPNK--VLLITDGRARDPPVLKVQSGRLQEQS-----TVYALGIGKQIDYVELNRIAS 473 Query: 339 P---DRFYSVQNSRKLHDAFLR 357 V N L AF + Sbjct: 474 DPSERHVLYVDNFSFLERAFQK 495 >gi|123232279|emb|CAM16354.1| novel protein similar to vertebrate inter-alpha (globulin) inhibitor H family (plasma Kallikrein-sensitive glycoprotein) (ITIH) [Danio rerio] Length = 860 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 66/201 (32%), Gaps = 27/201 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D+S SM K+ + ++ + D+ L+TFS + Sbjct: 290 DVIFVIDISGSMIG------TKIKQTKAA------MVSILSDLREGDYFNLITFSDDVHT 337 Query: 230 TFPLAW------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK-GHDD 282 V+ +E + ++I T L A + + + Sbjct: 338 WKKDRTVRATRQNVRDAKEFVRKIIAAGWTNINAALLSAAKLLNPSTRSSSSTGRAPSSQ 397 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKNCASPDR 341 IIFLTDGE + + L N K G ++ + +A L+ A +R Sbjct: 398 RVPMIIFLTDGEATIGETETDVILH--NAQKSLGLVSLFGLAFGDDADFPMLRRLALENR 455 Query: 342 -----FYSVQNSRKLHDAFLR 357 Y ++ F Sbjct: 456 GVARMVYEDDDAAIQLKGFYD 476 >gi|114800018|ref|YP_760837.1| hypothetical protein HNE_2140 [Hyphomonas neptunium ATCC 15444] gi|114740192|gb|ABI78317.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 583 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 66/201 (32%), Gaps = 35/201 (17%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-----I 227 ++LD S SM G K+ VA + E + G++ + + Sbjct: 44 LILDASGSMWGQLKGGTTKIEVARDVMSEYFRTRNAAEP------LGVIAYGHRRRGDCA 97 Query: 228 VQTFPLAWGVQ---HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 GVQ + ++N++ T T L A +I E+ + Sbjct: 98 DIEVIAETGVQNAATLSSRVNQIRPNGMTPLTDALRMAQKQIPKTAERAD---------- 147 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV--YAIGVQAEAADQFLKNC---ASP 339 II +TDG + + E G + + +G + +C A+ Sbjct: 148 --IILVTDGLETC----KADPCALAAELAAEGIEIRAHVVGFGLTEQEAASLSCIPEATG 201 Query: 340 DRFYSVQNSRKLHDAFLRIGK 360 Q ++L DA +I + Sbjct: 202 GLLLRPQTGQELSDALGQIAE 222 >gi|7022738|dbj|BAA91707.1| unnamed protein product [Homo sapiens] Length = 218 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/145 (23%), Positives = 57/145 (39%), Gaps = 13/145 (8%) Query: 221 VTFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + FS++ L + I++ ++ +++ G T G E A +I+ + A Sbjct: 4 IVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTA 63 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 II LTDGE E N ++ GAIVY +GV+ Q + Sbjct: 64 S-------VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIAD 114 Query: 338 SPDRFYSVQNS-RKLHDAFLRIGKE 361 S D + V + + L I K+ Sbjct: 115 SKDHVFPVNDGFQALQGIIHSILKK 139 >gi|156405002|ref|XP_001640521.1| predicted protein [Nematostella vectensis] gi|156227656|gb|EDO48458.1| predicted protein [Nematostella vectensis] Length = 308 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 72/198 (36%), Gaps = 35/198 (17%) Query: 170 DMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ VLD S S+ + G+ L R +TFS++ Sbjct: 99 DVVFVLDSSASVGVKDYKNGILALQTLIT-------------RAKEDTRYAGITFSTEAN 145 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 TF + ++ T + L+ +++ K+ +K+ +I Sbjct: 146 ITFYFTDPLDAMKGLGGITYAPGMTNTQAALDICRTQLWLNKKSGFRR----LSFKRILI 201 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--------SPD 340 TDG++ NI+ + +L+ + K G ++ + V ++L+ A + Sbjct: 202 V-TDGQS---NINMERTLYNAFQLKNMGIEIFVVAV-----GKYLRGIAEIVGLASSTDA 252 Query: 341 RFYSVQNSRKLHDAFLRI 358 Y V+N R L + I Sbjct: 253 HLYRVRNLRGLLEVVHLI 270 >gi|308472863|ref|XP_003098658.1| hypothetical protein CRE_04224 [Caenorhabditis remanei] gi|308268258|gb|EFP12211.1| hypothetical protein CRE_04224 [Caenorhabditis remanei] Length = 392 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 41/194 (21%), Positives = 66/194 (34%), Gaps = 13/194 (6%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S++ LD++ V+D S M G+ + S+ I S R GLVT++ Sbjct: 35 SNLWLDVVAVVDNSQGMT---NEGLSNVAADIFSVFSSGTRIGSNSSEPRTTRLGLVTYN 91 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTT--KSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 S Q L Q I + N + +T +T + + + Sbjct: 92 SAATQKADL-NKFQSIGDVANGIGNALSTVVDTTDSYLATGLILAAKMFNEQSVNTNRGH 150 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK---NCASP 339 YK+ +I + I + L N K G + + Q D L+ ASP Sbjct: 151 YKRVVIVF---ASEYKGIGELDPLPVANRLKLSGVNIITVAYQQAGDDGLLQGLSQVASP 207 Query: 340 DRFYSVQNSRKLHD 353 + V N L Sbjct: 208 GFSF-VNNPLNLVT 220 >gi|306814616|ref|ZP_07448778.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] gi|305852010|gb|EFM52462.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] Length = 581 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 68/192 (35%), Gaps = 21/192 (10%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ +++ ++D S SM ++L + S++ ++ ++ ++ V +G Sbjct: 214 KSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDNIAIVTYAG-- 266 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+I I I+ L +T GLE AY + KG Sbjct: 267 --DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGI 318 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--S 338 + I+ TDG+ + D K + + G + +GV + + + A Sbjct: 319 NR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVG 375 Query: 339 PDRFYSVQNSRK 350 + + + Sbjct: 376 NGNYSYIDTLSE 387 >gi|187932171|ref|YP_001892156.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713080|gb|ACD31377.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 332 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 42/235 (17%) Query: 134 EMPFIFCTFPWCAN-SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P IF A S P V + + ++ LDVS SM+ +L Sbjct: 58 LVPLIFLLIWLVAIFSLAGPTWKYKDVPVY---QKNISRVIALDVSQSMDTTDVSP-SRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 A I ++L IK G++ FSS+ PL I+ + + Sbjct: 114 ERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTIENLVTVITSDIV 166 Query: 251 --STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 L+ + I A + II +TD + +++ Sbjct: 167 PVQGHNIYKALKKSAQLIEQAGVQQGQ-----------IILITD------SSPLPQAISQ 209 Query: 309 CNEAKRRGAI--VYAI-----GVQAEAADQFLKNCASPDRFYSVQ--NSRKLHDA 354 + ++G VYAI G+ + +LK+ +++ + +L A Sbjct: 210 AKQLAQQGIKTDVYAIGTPMGGIAKDEKGNYLKDSQGNIQYFGIDLSKLEELATA 264 >gi|270006429|gb|EFA02877.1| hypothetical protein TcasGA2_TC008029 [Tribolium castaneum] Length = 1868 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 71/197 (36%), Gaps = 37/197 (18%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L+++ ++D S S+ + + ++++L + N R + TFSS + Sbjct: 78 LELIFLIDGSSSVGE------TNFRSELKFVKKLLSDVTV---DYNHTRVAIATFSSSVS 128 Query: 229 QTFPLAWGVQH-------IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + + + ++++ + G T + E A ++ E Sbjct: 129 KNIDQISDPRKENNKCFLLSKLLSKIEYTGGGTNTLKAFEVAKEIFTQSRNDSE------ 182 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 K + +TDG ++ + + E K+ ++ IG+ + + ++P Sbjct: 183 ----KVLFLITDGFSNGG-----DPIPLAAELKKDQVKIFTIGIANGNYKELYELASTPG 233 Query: 341 RFYSVQNSRKLHDAFLR 357 Y L D+F Sbjct: 234 EIY-----SYLLDSFEE 245 >gi|211616|gb|AAA48705.1| type VI collagen, alpha-2 subunit [Gallus gallus] Length = 720 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 37/165 (22%), Positives = 59/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+M V+D S S+ V S L I P R G+V +S + Sbjct: 536 GALDIMFVIDSSESIGYTNFTLEKNFVVNVVS---RLGSIAKDPKSETGARVGVVQYSHE 592 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + RL T + L++AYNK+ + + Sbjct: 593 GTFEAIKLDDERINSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREK------ 646 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + + +TDG P D+K C R +V IG+ Sbjct: 647 --AQVFAVVITDGR-YDPRDDDKNLGALC----GRDVLVNTIGIG 684 Score = 43.3 bits (100), Expect = 0.057, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 40/124 (32%), Gaps = 13/124 (10%) Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G + +S + PL K+ + G T + + + + Sbjct: 22 GGLHYSDVVEIYSPLTRSKDTYLTKLRAIRYLGRGTFTDCAISNMTQQFQSQTARDV--- 78 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K+ + +TDG + + A+ G ++A+ + +Q L+ A Sbjct: 79 -------KFAVVITDGHVTGSPCGGMK--MQAERARDMGIKLFAVAPSEDVYEQGLREIA 129 Query: 338 SPDR 341 SP Sbjct: 130 SPPH 133 >gi|89255638|ref|YP_512999.1| TPR repeat-containing protein [Francisella tularensis subsp. holarctica LVS] gi|115314142|ref|YP_762865.1| hypothetical protein FTH_0199 [Francisella tularensis subsp. holarctica OSU18] gi|167009920|ref|ZP_02274851.1| hypothetical protein Ftulh_04147 [Francisella tularensis subsp. holarctica FSC200] gi|169656499|ref|YP_001427653.2| hypothetical protein FTA_0220 [Francisella tularensis subsp. holarctica FTNF002-00] gi|254367032|ref|ZP_04983068.1| TPR (tetratricopeptide repeat) domain protein [Francisella tularensis subsp. holarctica 257] gi|254368551|ref|ZP_04984567.1| hypothetical protein FTAG_01535 [Francisella tularensis subsp. holarctica FSC022] gi|290953464|ref|ZP_06558085.1| hypothetical protein FtulhU_03740 [Francisella tularensis subsp. holarctica URFT1] gi|295313262|ref|ZP_06803899.1| hypothetical protein FtulhU_03725 [Francisella tularensis subsp. holarctica URFT1] gi|89143469|emb|CAJ78645.1| TPR (tetratricopeptide repeat) domain protein [Francisella tularensis subsp. holarctica LVS] gi|115129041|gb|ABI82228.1| hypothetical protein FTH_0199 [Francisella tularensis subsp. holarctica OSU18] gi|134252858|gb|EBA51952.1| TPR (tetratricopeptide repeat) domain protein [Francisella tularensis subsp. holarctica 257] gi|157121454|gb|EDO65645.1| hypothetical protein FTAG_01535 [Francisella tularensis subsp. holarctica FSC022] gi|164551567|gb|ABU60697.2| hypothetical protein with von Willebrand factor type A domain [Francisella tularensis subsp. holarctica FTNF002-00] Length = 332 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 42/235 (17%) Query: 134 EMPFIFCTFPWCAN-SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P IF A S P V + + ++ LDVS SM+ +L Sbjct: 58 LVPLIFLLIWLVAIFSLAGPTWKYKDVPVY---QKNISRVIALDVSQSMDTTDVSP-SRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 A I ++L IK G++ FSS+ PL I+ + + Sbjct: 114 ERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTIENLVTVINSDIV 166 Query: 251 --STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 L+ + I A + II +TD + +++ Sbjct: 167 PVQGHNIYKALKKSAQLIEQAGVQQGQ-----------IILITD------SSPLPQAISQ 209 Query: 309 CNEAKRRGAI--VYAI-----GVQAEAADQFLKNCASPDRFYSVQ--NSRKLHDA 354 + ++G VYAI G+ + +LK+ +++ + +L A Sbjct: 210 AKQLAQQGIKTDVYAIGTPMGGIAKDEKGNYLKDSQGNIQYFGIDLSKLEELATA 264 >gi|198417752|ref|XP_002129197.1| PREDICTED: similar to Clca1 protein [Ciona intestinalis] Length = 1034 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 52/337 (15%), Positives = 99/337 (29%), Gaps = 42/337 (12%) Query: 33 GLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDF 92 L I+ + ++ + + T Q D I+ + + Sbjct: 190 SLGIQGENMIVQNNEIVQDVCNYDPQTLLPNSTDCKFILAWDQDLDLKASIMSYQYVNEI 249 Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 +N D N + + D +K ++ F N ++ Sbjct: 250 NGFCDDND--NDPLNRHNREAPNEHNDKCNKRSVWDVITS------SVDFTGGRNLANPN 301 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + T K ++VLDVS SM +L + + +D V Sbjct: 302 VASTIPTFRVVKPFPYRSFVLVLDVSGSMWG------GRLTKMRQIMNTFVDDF-----V 350 Query: 213 NNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFG--STTKSTPGLEYAYNKIFD 268 G+ FS+ + PL I RL +T G+ A + Sbjct: 351 QRGDYVGITIFSTIARKLSPLTRIRDQSDRASLIRRLPRSVRGSTCIGCGINSAVQIMEQ 410 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 L II TDG EN P++ + ++ ++ V A+ Sbjct: 411 HSPDLCGD----------IIVFTDGEENVEPSVADVH-----DKVVKKKCRVSAVFFTTT 455 Query: 328 AADQFLKNC-ASPDRFY--SVQNSRKLHDAFLRIGKE 361 A ++ A+ ++ + L A+ ++ K Sbjct: 456 ANQALVRLVDATSGTWFYGDTDDITPLIGAYNQLSKS 492 >gi|315298071|gb|EFU57340.1| von Willebrand factor type A domain protein [Escherichia coli MS 16-3] Length = 581 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 53/335 (15%), Positives = 105/335 (31%), Gaps = 43/335 (12%) Query: 39 SHKFFVKAKLHYILDHS-LLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQT------- 90 ++ K L L + A K N G + F +K + Q Sbjct: 73 VQQYSDKQTLQGRLQEAPTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSL 132 Query: 91 --------DFRNELRENG----FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFI 138 + R L + A + I I D+ + S + M + Sbjct: 133 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDK-QSIPASKPIPFAMRYE 191 Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 PW + + I + S+ +++ ++D S SM ++L + S Sbjct: 192 LAPAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSS 245 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++ ++ ++ ++ V +G S+I I I+ L +T G Sbjct: 246 LKLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAG 301 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 LE AY + KG + I+ TDG+ + D K + + G Sbjct: 302 LEMAYQQAAKG------FIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVT 352 Query: 319 VYAIGVQ-AEAADQFLKNCA--SPDRFYSVQNSRK 350 + +GV + + + A + + + Sbjct: 353 LSTLGVGDSNYNEAMMVRIADVGNGNYSYIDTLSE 387 >gi|262164956|ref|ZP_06032694.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] gi|262027336|gb|EEY46003.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] Length = 403 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 70/207 (33%), Gaps = 20/207 (9%) Query: 23 ILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK--NDFS 80 ++ P + +++ ++ S +F A+L + + L + N +K + + Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKEDDENNVSYARKVVDRYV 60 Query: 81 YRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIF 139 I +I + ++G Q T ++ +HK +S + Sbjct: 61 VDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKS----WISYENISLKP 116 Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 ++ + L +D+ ++D+S SM + G ++ I Sbjct: 117 EFTVNGSSVTRKFLP------------QPVDVYFIVDMSASMRATWQNGKSQIDEVKEVI 164 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSK 226 +++ +K R L+ + + Sbjct: 165 TRVVNDLKGFDTEVKS-RVSLLAYHNY 190 >gi|157375479|ref|YP_001474079.1| putative outer membrane adhesin like proteiin [Shewanella sediminis HAW-EB3] gi|157317853|gb|ABV36951.1| putative outer membrane adhesin like proteiin [Shewanella sediminis HAW-EB3] Length = 2812 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 43/291 (14%), Positives = 81/291 (27%), Gaps = 22/291 (7%) Query: 50 YILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIE 109 D S + T + N ++ + + ++ Sbjct: 1972 TAGDGSNITTGNLLDNDSGVSSSTHITEVEGVAAVNGVITVTTALGELAVYSEDSTDHRA 2031 Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 + + S Y + + + + IS Sbjct: 2032 GDYEYKLTA-NSTDGDIASESFDYTLTNSLGSNSSASLTVKISDDAPVVHDISQNLQANA 2090 Query: 170 D-----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D + +VLDVS SM D G G L VA ++ +++ + S +VN + V F Sbjct: 2091 DVVTTNLTLVLDVSGSMGDPVGNGQTYLEVAIDALTALINEVDSTGNVNIQI----VNFH 2146 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S + L V + L+ T L N + + Sbjct: 2147 SNTGSSGWLIDDVAGAISYLESLVTYGPTHYDAALNAVMNS----------GSLPDGADQ 2196 Query: 285 KYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIV-YAIGVQAEAADQFL 333 + F++DG S +D + + G + IG+ + L Sbjct: 2197 SLLYFISDGSPSPGQEVDPALQSIWESYLVNSGYKTAFGIGIGSAGLSDLL 2247 >gi|19552242|ref|NP_600244.1| hypothetical protein NCgl0978 [Corynebacterium glutamicum ATCC 13032] Length = 594 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 64/200 (32%), Gaps = 32/200 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS------- 224 M+VLD S SM G ++ A ++ ++++ I DV G + Sbjct: 1 MIVLDNSGSMTAQDAGGQTRIDAAKQASTQLINDISDRTDVGLTYYGGNTGETEADVEMG 60 Query: 225 --SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + IN L T L ++ + Sbjct: 61 CQDVTILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN----------- 109 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLKNCAS-- 338 I+ ++DG N + E + G ++ IG+ + A + C + Sbjct: 110 ----IVLVSDGI---ANCTPPDVCEVAQELAQSGINLVINTIGLNVDPAAREELECIAGV 162 Query: 339 -PDRFYSVQNSRKLHDAFLR 357 + +++ L DA R Sbjct: 163 GGGTYADASDAQSLTDALTR 182 >gi|281341943|gb|EFB17527.1| hypothetical protein PANDA_002811 [Ailuropoda melanoleuca] Length = 652 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 65/191 (34%), Gaps = 33/191 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV-RSGLVTFSSKIV 228 D+ V+D S S+ R++ + + I +++ R G V ++ + Sbjct: 469 DIGFVIDGSSSVG----------TGNFRTVLQFVANISKEFEISETDTRVGAVQYTYEQR 518 Query: 229 QTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F + +N + T + + YA ++F K + + Sbjct: 519 LEFGFD-DYHTKSDILNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKR 568 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRF 342 K +I +TDG + + A +G YAIGV A D+ P D Sbjct: 569 KLMILITDGRSYD------DVRIPAMVAHHKGVTTYAIGVAWAAQDELEVIATHPASDHS 622 Query: 343 YSVQNSRKLHD 353 + V L+ Sbjct: 623 FFVDEFDNLYK 633 >gi|260808371|ref|XP_002598981.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] gi|229284256|gb|EEN54993.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] Length = 193 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 66/201 (32%), Gaps = 27/201 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LDM+ VLD S S+ + ++ D + P R GL+ ++ Sbjct: 5 LDMVFVLDGSGSIQAV------NFAKVKKFAVDLSDGLNISP---TATRVGLIEYTDSPT 55 Query: 229 QTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK- 284 F LA + IN + + T++ L+ A ++ + + Sbjct: 56 VEFKLADHTNKASLATAINNVSYQSGGTQTGRALDAARTQM-----DWRQPPVPNVCFSL 110 Query: 285 -KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--R 341 + I +TDG + ++L + Y +G+ L A D Sbjct: 111 LQAAIVVTDGMSGDNVQQPAKAL------RDNDISAYGVGIGPAINANELNEIAGGDAGH 164 Query: 342 FYSVQNSRKLHDAFLRIGKEM 362 + + N KL +I + Sbjct: 165 VFYIPNYDKLEKEMEKISNSV 185 >gi|208780563|ref|ZP_03247902.1| TPR domain protein [Francisella novicida FTG] gi|208743538|gb|EDZ89843.1| TPR domain protein [Francisella novicida FTG] Length = 332 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 78/235 (33%), Gaps = 42/235 (17%) Query: 134 EMPFIFCTFPWCAN-SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P IF S P V + + ++ LDVS SM+ +L Sbjct: 58 LVPLIFLLIWLVTIFSLAGPTWKYKDVPVY---QKNISRVIALDVSQSMDTTDVSP-SRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 A I ++L IK G++ FSS+ PL I+ + + Sbjct: 114 ERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTIENLVTVINSDIV 166 Query: 251 --STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 L+ + I A + II +TD + + +++ Sbjct: 167 PVQGHNIYKALKKSAQLIEQAGVQQGQ-----------IILITD------SSPSPQAISQ 209 Query: 309 CNEAKRRGAI--VYAI-----GVQAEAADQFLKNCASPDRFYSVQ--NSRKLHDA 354 + ++G VYAI G+ + +LK+ +++ + +L A Sbjct: 210 AKQLAQQGIKTDVYAIGTPMGGIAKDEKGNYLKDSQGNIQYFGIDLSKLEELATA 264 >gi|291547618|emb|CBL20726.1| fibro-slime domain [Ruminococcus sp. SR1/5] Length = 1928 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 58/221 (26%), Gaps = 59/221 (26%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-------------- 235 +L ++ + +D + G+ FSS P Sbjct: 1184 TRLDALKNAVNQFIDDTAKKSPNSK---IGITVFSSTDDYNRPYGNHGTSVSLGEVGTAD 1240 Query: 236 --GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 V ++ + L T GLE A NK+ + KY++ TDG Sbjct: 1241 SAKVTELKNFVKDLKANGGTDPAVGLEDAKNKLDAMVDTN----------PKYVVLFTDG 1290 Query: 294 ENSSP-----NIDNKESLFYCNEAK----------RRGAIVYAIGVQAEAADQFLKNCAS 338 + + + K + E K + VY IG K S Sbjct: 1291 KPTGGGNKWNSNAQKNAETQAGELKTGLRNNVDNAKNPYTVYTIGFALNDEGDRAKTFLS 1350 Query: 339 PDRF---------------YSVQNSRKLHDAFLRIGKEMVK 364 + + ++ L F I + K Sbjct: 1351 GGTYDGKKDPGIASSSDCAKTADDAASLTQIFQSISSTINK 1391 >gi|218779355|ref|YP_002430673.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218760739|gb|ACL03205.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 504 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 64/204 (31%), Gaps = 32/204 (15%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS------- 224 M+VLD S SM + L +KS+P + G+V + Sbjct: 31 MLVLDCSQSMQRTLDTRPG----MENARPAALRFVKSLPQNSLA---GIVAYGQNAAKGC 83 Query: 225 SKIVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PLA + + + I ++ L A+ + + Sbjct: 84 DNAEVLVPLAPYDRRALISAIKKVQPQGKAPLAAALRKAWEQGAGLSQGC---------- 133 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAEAADQFLKNC-ASPD 340 I +TDG + + S+ +A+ G IV IGV E A + ++ AS Sbjct: 134 --VITLITDGWDDC--WGDPVSMVEDLKARGAGIIVNIIGVAPNREDAAKLMRLARASGG 189 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + + L + + Sbjct: 190 AYRAADTRADLILKAAETAESVSA 213 >gi|115496702|ref|NP_001068594.1| collagen alpha-2(VI) chain [Bos taurus] gi|94574217|gb|AAI16098.1| Collagen, type VI, alpha 2 [Bos taurus] gi|296490819|gb|DAA32932.1| collagen, type VI, alpha 2 [Bos taurus] Length = 917 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 57/165 (34%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 611 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 667 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AYNK+ + + Sbjct: 668 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYNKLIKESRRQKTRV--- 724 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ CN V AIG+ Sbjct: 725 -----FAVVITDGRH-DPRDDDLNLRALCNHE----VTVTAIGIG 759 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 67/212 (31%), Gaps = 12/212 (5%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D + + VLD S S M + + L + V R G Sbjct: 37 PEKADCPVHVYFVLDTSESITMQSPTDSLLYHMQQFVLQFISQLQDELYLDQVALSWRYG 96 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + FS + P + + + T + +E H+ KG Sbjct: 97 GLHFSDLVEVFSPPGSDRASFTKSLQSISSFRRGTFTDCM-----LANMTQEVRRHVGKG 151 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + + +TDG + + A+ G ++A+ + +Q L++ A+ Sbjct: 152 VVN---FAVVITDGHVTGSPCGGIK--LQAERAREEGIRLFAVPPNLKLNEQGLRDIANT 206 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 N + I ++ + + I K Sbjct: 207 PHELYRNNYATMRPDSTEIDQDTINRIIKVMK 238 >gi|323141741|ref|ZP_08076613.1| ATPase family associated with various cellular activities (AAA) [Phascolarctobacterium sp. YIT 12067] gi|322413778|gb|EFY04625.1| ATPase family associated with various cellular activities (AAA) [Phascolarctobacterium sp. YIT 12067] Length = 651 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 60/170 (35%), Gaps = 20/170 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + IG + +D S SM +++ +I ML R G++ Sbjct: 462 REKRIGNTFLFAVDASGSMG-----ARERMRAVKGAIFYMLQEA-----YQKRDRVGMIA 511 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F K P+ V Q+++ + G T GL + + + K Sbjct: 512 FRRDKADMLLPITRSVDLAQKRLAEMPTGGKTPLADGLALSLQTL-------AMMNKRDS 564 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA--KRRGAIVYAIGVQAEAA 329 + + +I +TDG ++ + K+ + K A + ++ + E+ Sbjct: 565 ELEPLLIVVTDGRANAVHEGEKDPVAAAISIAEKIAKAKITSVVIDTESG 614 >gi|148225160|ref|NP_001089228.1| hypothetical protein LOC734275 [Xenopus laevis] gi|58047691|gb|AAH89181.1| MGC98917 protein [Xenopus laevis] Length = 1014 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 79/214 (36%), Gaps = 22/214 (10%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 L + + ++ + +D++ +LD SM G + + L + K D Sbjct: 808 DLPCQTELSVAQCTQRPVDLVFLLD--GSMRTGEQNFKYAAGFVEEAAQ-RLSLAKKHDD 864 Query: 212 VNNVVRSGLVTF--SSKIVQTFPLAWGVQHIQEKINRLIFGSTTK-STPGLEYAYNKIFD 268 +N R LV + K FPLA+ + + + I +L + ++ P + +A N + Sbjct: 865 PSNA-RVSLVQYGGQDKQSVAFPLAFDLTEVSQAIEKLRYNGSSSVIGPAIIHAINNVLQ 923 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + +F+TDGE ++ + ++ + + V E Sbjct: 924 NTGRPARRYAEPS-----FVFITDGETGQEGLEEAVT-----AMRKNNIVSTLVSVGPED 973 Query: 329 ADQF--LKNCASPDR---FYSVQNSRKLHDAFLR 357 + L+ + +R F S+ ++F+ Sbjct: 974 SIDMKVLRQLSMGERAAIFRQADYSKLTENSFMD 1007 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 78/216 (36%), Gaps = 19/216 (8%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV---RS 218 S K +++ ++D S S+ P L I LD ++ ++ V+ + Sbjct: 30 SDKIKCPINVFFIIDTSESIILQTAPIEILLDNMKVFIPRFLDKLEDAAYLDQVILNWQY 89 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G + +S +++ + Q + K+N + G T + L I + Sbjct: 90 GGLHYSDEVIIFSDITTNKQEYKSKLNAVTYIGRGTFTDCALSNMTALIQRQGGDAIN-- 147 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + + +TDG + + + A+ G ++++ + + L+ A Sbjct: 148 --------FAVVITDGHVTGSPCGGM--MHQADRARNAGIKLFSVAASHDVYESGLREIA 197 Query: 338 SPDRFYSVQNSRKLHDAFLR--IGKEMVKQRILYNK 371 + + +NS L A R I + + + I K Sbjct: 198 NAP-YELFRNSYSLTRADDRTVINDKTIDKIIQVMK 232 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 65/181 (35%), Gaps = 18/181 (9%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ ++D S S+ ++ L I P + R G+V +S + Sbjct: 605 GALDIVFIIDSSESIG---YTNFSLEKNFVINVVSRLGSIAKDPKSDTGARVGVVQYSHE 661 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I P + +E + RL T + L++AYNK+ + + Sbjct: 662 GTFEAIQLDDPRIDSLSSFKEAVRRLEWIAGGTWTPSALQFAYNKLIKETRRDK------ 715 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 K + + +TDG + + D + + + + + D+ L + A + Sbjct: 716 --AKVFAVVITDGRHDPRDPDERLQVLCGGDVDVNAIGIGDM-FNKPEEDETLTSIACSN 772 Query: 341 R 341 + Sbjct: 773 K 773 >gi|326927888|ref|XP_003210120.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like, partial [Meleagris gallopavo] Length = 1069 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 234 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 281 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + + + Sbjct: 282 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNMLNEFNHTGQ-----G 336 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 337 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 391 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 392 NKGFFTQISTLAD 404 >gi|313226593|emb|CBY21739.1| unnamed protein product [Oikopleura dioica] Length = 694 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 64/204 (31%), Gaps = 27/204 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ V+D S S G + + D R + FS+ Sbjct: 180 ALDIVFVVDESGS------IGTPNFQLIKDFLEHFASDSTIAADA---TRIAIRPFSTSN 230 Query: 228 VQTFPL-AWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + ++I +I + + T + L+ A + + K Sbjct: 231 YLYFSLNDFKTKNIINEIKNMPYNSGNTNTADALDAALTDYGTDRP----------ESVK 280 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ +TDG ++S + + K R +AIGV + + + Sbjct: 281 VMVTITDGASNSF----LSTSAAADRVKNDLRNIQSFAIGVSGANMAELEAIAITDKHVF 336 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRI 367 + + +++ + I Sbjct: 337 MLNGWADFEPIKSNLLQKVCEGNI 360 >gi|313226592|emb|CBY21738.1| unnamed protein product [Oikopleura dioica] Length = 766 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 64/204 (31%), Gaps = 27/204 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ V+D S S G + + D R + FS+ Sbjct: 252 ALDIVFVVDESGS------IGTPNFQLIKDFLEHFASDSTIAADA---TRIAIRPFSTSN 302 Query: 228 VQTFPL-AWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + ++I +I + + T + L+ A + + K Sbjct: 303 YLYFSLNDFKTKNIINEIKNMPYNSGNTNTADALDAALTDYGTDRP----------ESVK 352 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ +TDG ++S + + K R +AIGV + + + Sbjct: 353 VMVTITDGASNSF----LSTSAAADRVKNDLRNIQSFAIGVSGANMAELEAIAITDKHVF 408 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRI 367 + + +++ + I Sbjct: 409 MLNGWADFEPIKSNLLQKVCEGNI 432 >gi|297488656|ref|XP_002697119.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] gi|296474935|gb|DAA17050.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] Length = 1192 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 68/186 (36%), Gaps = 34/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + VN + +F+ K Sbjct: 340 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSDDDYVN------VASFNEKAQP 387 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +E + ++ TT G EYA++++ + + Sbjct: 388 VSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNPNITRANCN----- 442 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASP 339 K I+ TDG D + +F R V+ V D L+ C + Sbjct: 443 --KMIMMFTDG-----GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACTNK 495 Query: 340 DRFYSV 345 ++ + Sbjct: 496 GYYFEI 501 >gi|224066048|ref|XP_002192868.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha2/delta subunit 3 [Taeniopygia guttata] Length = 1090 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 255 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 302 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + + + Sbjct: 303 VEPCLNGTLVQADRANKEHFREHLDKLFAKGIGMLDIALNEAFNMLNEFNHTGQ-----G 357 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 358 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 412 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 413 NKGFFTQISTLAD 425 >gi|118096863|ref|XP_414338.2| PREDICTED: similar to voltage-gated calcium channel alpha(2)delta-3 subunit [Gallus gallus] Length = 1090 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 255 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 302 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + + + Sbjct: 303 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNMLNEFNHTGQ-----G 357 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 358 SICSQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 412 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 413 NKGFFTQISTLAD 425 >gi|33331711|gb|AAQ11020.1| mesocentin [Caenorhabditis briggsae] Length = 13133 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 88/234 (37%), Gaps = 27/234 (11%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + +RY +P + W + S + S D D+++VLD S ++F Sbjct: 12369 VKNGKARYIVPNVESARTW-PTPRTKATTLAGSRRSCSTIDYESDVIIVLDSS----ENF 12423 Query: 186 GPGMDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ-EK 243 P D+ ++ ++D PDV+ G V +S K+ P+A G + E Sbjct: 12424 TP--DEFDSMKDAVASIVDTGFDLAPDVSK---IGFVIYSDKV--AVPVALGHYEDKIEL 12476 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + +++ + + + G ++ K ++ +T+G+N Sbjct: 12477 LEKIVDAEKINDGVAIAL----YGLNAARQQFQLHGRENATKIVLLITNGKNRGNAAAAA 12532 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASPDRFYSVQNSRKLHD 353 E L G ++A+ V + + + L A+PD V S ++ D Sbjct: 12533 EDLRD-----MYGVQLFAVAVGSNPDELATIKRLVGNANPDNAIEVAQSTEIDD 12581 >gi|88858062|ref|ZP_01132704.1| hypothetical protein PTD2_11769 [Pseudoalteromonas tunicata D2] gi|88819679|gb|EAR29492.1| hypothetical protein PTD2_11769 [Pseudoalteromonas tunicata D2] Length = 637 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 74/235 (31%), Gaps = 34/235 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + P + I + +V+D+SLSM K + Sbjct: 60 LLIVFITLAIVALAGPSWQQQQIPIYQAKQARV---IVMDMSLSM----YSTDIKPNRLS 112 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGST 252 ++ + LD+I+ + + LV +++ PL + I L + Sbjct: 113 QARFKALDMIELFKEGE----TALVAYAADAYVISPLTSDASTLSNLIPSLSPDIMPTKG 168 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + GL A + A II +TDG ID+++ A Sbjct: 169 SNVMAGLTTANELLSQAGYLSGD-----------IILVTDG------IDSEDLSSVQEFA 211 Query: 313 KRRG--AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + G VYA+ + A + + D + + + ++ K Q Sbjct: 212 LQSGHHLHVYAVATEQGAPIELPQGGFLKDNYGQIVVPKAQFTTLKQLAKRGSGQ 266 >gi|260827156|ref|XP_002608531.1| hypothetical protein BRAFLDRAFT_92387 [Branchiostoma floridae] gi|229293882|gb|EEN64541.1| hypothetical protein BRAFLDRAFT_92387 [Branchiostoma floridae] Length = 1634 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 57/175 (32%), Gaps = 27/175 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +S L + V+D S SMN+ G L + E++ + ++ + Sbjct: 180 EAASMRRTPLRFVAVIDESGSMNNKVGEDNMTLIQRMKVFAELM-----VQNLKEDDQMA 234 Query: 220 LVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +VTF++ I P+ G E I L T + GL A L H Sbjct: 235 IVTFATDIQVKLPMTQLNEDGKAQALEAIKTLRTRGQTNLSDGLLAALEMFQSGG--LFH 292 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-------GAIVYAIG 323 I+ TDG + + + NE K + IG Sbjct: 293 NG---------IVLFTDGAANQGITNADHLIQAFNEKKTSVCGEACIPISTFTIG 338 >gi|195539501|ref|NP_001124213.1| inter-alpha (globulin) inhibitor H2 [Gallus gallus] gi|190576833|gb|ACE79193.1| inter-alpha inhibitor heavy chain 2 precursor [Gallus gallus] Length = 948 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 66/173 (38%), Gaps = 15/173 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP--DVNNVVRSGLVTFSSKIV 228 ++ V+DVS SM +G M + A ++I L D N+ VR + +V Sbjct: 313 ILFVIDVSGSM---WGLKMKQTIEAMKAILSELRAADQFSLIDFNHNVRC----WRDNLV 365 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P V+ ++ I + T L A I + + L + I+ Sbjct: 366 SATPAQ--VEDAKKYIQTIHPNGGTNINEALLRA-TFILNEAQNLGMLDPNSVSM---IV 419 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++DG+ + + ++ + ++ +G+ + FL+ A+ +R Sbjct: 420 LVSDGDPTVGELKLTTIQKNVKQSIKDEYSLFCLGIGFDVDYDFLQRIATDNR 472 >gi|89054212|ref|YP_509663.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88863761|gb|ABD54638.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 1356 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 71/201 (35%), Gaps = 38/201 (18%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK------ 226 +VLD S SM G++K+ +A I EML D+ + V GL + + Sbjct: 30 LVLDGSGSM-WGQIDGVNKIVIAREVIAEML------ADMADDVSLGLTVYGHRQRGSCT 82 Query: 227 -IVQTFPLAWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 I A G Q I + +N + T T + A + +E Sbjct: 83 DIETIVAPAPGTQGRILDAVNAINPRGRTPMTDAVIAAAQSLRSTEEAAT---------- 132 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQA--EAADQFLKNCA--- 337 +I ++DG + N + E + G + IG E + C Sbjct: 133 --VILVSDGIENC----NPDPCAIAAELEATGVDFTAHVIGFDVASEPEARAQMQCIADN 186 Query: 338 SPDRFYSVQNSRKLHDAFLRI 358 + +F + N+ +L A ++ Sbjct: 187 TGGQFLTADNATELSQALEQV 207 >gi|315636668|ref|ZP_07891900.1| conserved hypothetical protein [Arcobacter butzleri JV22] gi|315479050|gb|EFU69751.1| conserved hypothetical protein [Arcobacter butzleri JV22] Length = 1209 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 59/159 (37%), Gaps = 17/159 (10%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG------MDKLGVATRSIREMLDIIKSI 209 T +++ ++ ++ +V+D S SM + G + ++ + +++ + D +K Sbjct: 710 TGGTELNVQAGKNYNIALVVDTSGSMKEASGSKTAWGTTISRIDLLKDALKNLADSLKGH 769 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 N ++ F + + + + + KI+ L T E A+ K Sbjct: 770 DGKIN---VSIIDFDTNAKEPITFNDLTSKNISDLITKIDALKAEGGTN----YEDAFLK 822 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 + Y+ FLTDG+ + N D K Sbjct: 823 TTSWFDTQSVTYGKAQGYENLTYFLTDGDPTFSNRDTKN 861 >gi|221486991|gb|EEE25237.1| von willebrand factor type A domain-containing protein, putative [Toxoplasma gondii GT1] Length = 1109 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 63/181 (34%), Gaps = 18/181 (9%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + T + ++ +D + VLD S S++ + + + + Sbjct: 238 VPETPEISTTTCHKGRVDAVAVLDGSGSISRADWKKTRDIAKLFSGALNIAEDQSHV--- 294 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNK-IFDAK 270 +VVR + + P++W + + +I+RL T + LE AY + Sbjct: 295 -SVVRFSTTARADWSLVQ-PVSWTEKQLTNRISRLPQPYGGTNTPAALEEAYKIFVTSMN 352 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSS--------PNIDNKESLFYCNEAKRRGAIVYAI 322 + EH +K + ++ TDG + P + L + K V I Sbjct: 353 NRDEHDSKH---VHRVLLLATDGCVNQWDRFKFRTPEAHLHDVLERMSSLKNLHIKVLGI 409 Query: 323 G 323 G Sbjct: 410 G 410 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 27/129 (20%), Positives = 50/129 (38%), Gaps = 14/129 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D ++VLD S+S+ +L T+ + LD R GLV++S + Sbjct: 20 VDAVVVLDSSMSVGAEHWQ---ELLKLTKQFGDTLDSSAGHS------RLGLVSYSDSVT 70 Query: 229 QTFPLA---WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L G +E++ F T + L+ AY +F + + + Sbjct: 71 VLRKLQKIPSGTAQFEEELGAASFMNGNTFTPKALDSAYE-LFKETIHEDSEGAEDQEKR 129 Query: 285 KYIIFLTDG 293 + ++ TDG Sbjct: 130 RLLLLATDG 138 >gi|110632968|ref|YP_673176.1| hypothetical protein Meso_0611 [Mesorhizobium sp. BNC1] gi|110283952|gb|ABG62011.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 427 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 17/111 (15%), Positives = 40/111 (36%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F R+ + GS++++ AI+ PV+ MGL +E+ + + + KL + D ++ + + Sbjct: 12 FRFFRSLAKDQGGSVAVIAAIVFPVVVGAMGLGVESGYWYLKQRKLQHAADVAVYAASVR 71 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTS 113 + + + ++ A T Sbjct: 72 YRAGDARALMETAALRSARVTGYQPSIGTITTGVQAGSTAGSGTVSVELTE 122 >gi|330818825|ref|YP_004351042.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] gi|327374367|gb|AEA65719.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] Length = 660 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 25/214 (11%), Positives = 63/214 (29%), Gaps = 38/214 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN--------- 214 + + + +V+D+S SM+ G + + + + + + Sbjct: 453 EEGLNTAISVVIDLSPSMSYPLGKVEQVMSAGATPVIDGVAQHQQFTVRRDQAAGATAVA 512 Query: 215 --------VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNK 265 V ++++++ G + + + + + +T + + A + Sbjct: 513 LAPIFEMYDVPFEVISYAAGYQIIKSFDDGWEEVSRRSHAVASVGRSTATGMAMTVALSN 572 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + E + ++ LTDG P + AK G V I + Sbjct: 573 LILRDEDR-----------RMMVLLTDGAAGDPVMTAASY----QAAKEAGVEVVTIFIG 617 Query: 326 AE-----AADQFLKNCASPDRFYSVQNSRKLHDA 354 + L F +V + +L Sbjct: 618 RDIQAIALTRSILNATGFGQHFSNVNSPDELAKG 651 >gi|218680121|ref|ZP_03528018.1| hypothetical protein RetlC8_15005 [Rhizobium etli CIAT 894] Length = 168 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 17/88 (19%), Positives = 42/88 (47%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 +F +R + G++ I+ A+ L + + +G + + V+ K+ LD +L+ Sbjct: 12 AFAALRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVK 71 Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQ 89 +I N + + K + D+ + ++N + Sbjct: 72 QINNTGDTDALKLKVTDWFHAQVENSYT 99 >gi|224456528|ref|ZP_03665001.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis MA00-2987] gi|254370859|ref|ZP_04986864.1| hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874285|ref|ZP_05246995.1| TPR repeat domain-containing protein [Francisella tularensis subsp. tularensis MA00-2987] gi|151569102|gb|EDN34756.1| hypothetical protein FTBG_00674 [Francisella tularensis subsp. tularensis FSC033] gi|254840284|gb|EET18720.1| TPR repeat domain-containing protein [Francisella tularensis subsp. tularensis MA00-2987] gi|282158590|gb|ADA77981.1| hypothetical protein NE061598_01655 [Francisella tularensis subsp. tularensis NE061598] Length = 332 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 78/235 (33%), Gaps = 42/235 (17%) Query: 134 EMPFIFCTFPWCAN-SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P IF A S P V + + ++ LDVS SM+ +L Sbjct: 58 LVPLIFLLIWLVAIFSLAGPTWKYKDVPVY---QKNISRVIALDVSQSMDTTDVSP-SRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 A I ++L IK G++ FSS+ PL I+ + + Sbjct: 114 ERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTIENLVTVINSDIV 166 Query: 251 --STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 L+ + I A + II +TD + +++ Sbjct: 167 PVQGHNIYKALKKSAQLIEQAGVQQGQ-----------IILITD------SSPLPQAISQ 209 Query: 309 CNEAKRRGAI--VYAI-----GVQAEAADQFLKNCASPDRFYSVQ--NSRKLHDA 354 + ++G VYAI G+ + +LK+ +++ + +L A Sbjct: 210 AKQLAQQGIKTDVYAIGTPMGGIAKDEKGNYLKDSQGNIQYFGIDLSKLEELATA 264 >gi|157823041|ref|NP_001101626.1| integrin alpha-11 [Rattus norvegicus] gi|149041917|gb|EDL95758.1| integrin, alpha 11 (predicted) [Rattus norvegicus] Length = 1171 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + E+L P ++ G+V + Sbjct: 136 QTYMDIVIVLDGSNSI----YPWVE----VQHFLIEILTKFYIGPGQ---IQVGIVQYGE 184 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 V F L + +++ + + T++ + G Sbjct: 185 DAVHEFHL-NDYKSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 237 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + ++++ YA+ V + FL Sbjct: 238 AKKVMIVITDGE----SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEI 293 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 294 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 328 >gi|326430405|gb|EGD75975.1| hypothetical protein PTSG_00683 [Salpingoeca sp. ATCC 50818] Length = 762 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 65/184 (35%), Gaps = 16/184 (8%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S+ D +A + +++D + + + G F+S++ Sbjct: 501 DVLFILDNSGSVGP------DNFALAQEFVMDLVDQMTI---SSTAINVGAFLFNSQVQM 551 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 IQ I+ + + + A N D + +G I Sbjct: 552 LTAFTDDKTAIQAAISGYSYPPSNTAGTATGAALNFAVDTMLQSGAGYRGGSVLVYVI-- 609 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNS 348 TDG + D+ GA V ++G+ + + L+ A S + +++ Sbjct: 610 -TDGRSQE---DSSFVASAAANLHATGAEVVSVGITSSVDETQLRTIATSDSNVFVLEDF 665 Query: 349 RKLH 352 L+ Sbjct: 666 SDLN 669 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 70/194 (36%), Gaps = 25/194 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ LD S S+ G+D +A + + ++++ PDV +R + F + Sbjct: 180 SLDVLFTLDASGSV------GVDNFDIAKNFVADSVELMDVDPDV---IRVAGMMFHANP 230 Query: 228 VQTFPLAWGV------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F + ++ + T + L Y + A D Sbjct: 231 LPQFDFDFSFDRDVIADAVRSFVYPTDRNWGTATGAALNYIRKYLLVPS------AGNRD 284 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASPD 340 + F+TDG + +++ + GA V AIG+ A Q +SPD Sbjct: 285 PADTIVYFITDGNSQEALSFVQDA---ADNIHATGARVVAIGITDAIDQSQLEIIASSPD 341 Query: 341 RFYSVQNSRKLHDA 354 V++ L + Sbjct: 342 DVIIVEDFADLDEV 355 >gi|254695378|ref|ZP_05157206.1| Von Willebrand factor, type A [Brucella abortus bv. 3 str. Tulya] gi|261215750|ref|ZP_05930031.1| norD protein [Brucella abortus bv. 3 str. Tulya] gi|260917357|gb|EEX84218.1| norD protein [Brucella abortus bv. 3 str. Tulya] Length = 633 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 77/205 (37%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S E + +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEVRAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVAMPAI 627 >gi|225703035|ref|NP_795896.4| integrin alpha-11 precursor [Mus musculus] Length = 1188 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 75/215 (34%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGIVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 V F L + +++ + + T++ + G Sbjct: 209 DAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + ++++ YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|218559186|ref|YP_002392099.1| hypothetical protein ECS88_2420 [Escherichia coli S88] gi|218365955|emb|CAR03699.1| conserved hypothetical protein [Escherichia coli S88] Length = 580 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 68/192 (35%), Gaps = 21/192 (10%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ +++ ++D S SM ++L + S++ ++ ++ ++ V +G Sbjct: 213 KSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDNIAIVTYAG-- 265 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+I I I+ L +T GLE AY + KG Sbjct: 266 --DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGI 317 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--S 338 + I+ TDG+ + D K + + G + +GV + + + A Sbjct: 318 NR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVG 374 Query: 339 PDRFYSVQNSRK 350 + + + Sbjct: 375 NGNYSYIDTLSE 386 >gi|260841558|ref|XP_002613979.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] gi|229299369|gb|EEN69988.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] Length = 1796 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 55/159 (34%), Gaps = 24/159 (15%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF-- 249 + L + P + R +V+F + ++ + L Sbjct: 150 KQTEITFVENFLSQLTISPQAS---RVAVVSFDNHARTHIDYINSPKNKCSFLRELKAVK 206 Query: 250 --GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 G +T + G A + + + + K+ +++LTDG+ +K+ + Sbjct: 207 YTGGSTNAEDGFRLAQELL-----RPQSAFTNYQPVKQVVVYLTDGKP------DKDPVG 255 Query: 308 YCNEAKR-RGAIVYAIGV-----QAEAADQFLKNCASPD 340 N K A +Y+IGV + E +C+SP Sbjct: 256 RANNLKSVYNAEIYSIGVDPYSKRYETDGVTSADCSSPG 294 >gi|198420236|ref|XP_002121660.1| PREDICTED: similar to collagen type VI alpha 6 [Ciona intestinalis] Length = 1638 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 42/261 (16%), Positives = 84/261 (32%), Gaps = 46/261 (17%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 + +I D K+Y + A F + + + + +++ Sbjct: 372 EIKVIASDPDKNYVIEA-------LNFDIIELKRRGLIKSICTDAEQTCPAATA---ELV 421 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 ++D S S+ D R ++ ++D + P R +V F+++ V F Sbjct: 422 FLIDGSTSIGF------DNFEKLKRWLKSIVDAFQVGPH---YTRVAVVQFTNRPVLEFG 472 Query: 233 LAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L Q + + +T + +E+ N++F ++ K +I Sbjct: 473 L-NDHSTTQATLQAIQRIRYRRGSTSTGRAIEFVMNEVFTHSR---------ENVPKILI 522 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 LTDG++ D ++ EA G G+ Q + + D + Sbjct: 523 ALTDGQSQD---DVTQATASAAEA---GVHTLVFGIGNTRPGQLRQLVSKEDHVFQA--- 573 Query: 349 RKLHDAFLRIGKEMVKQRILY 369 F I K K L Sbjct: 574 ----AGFDVIQKMQSKLVSLI 590 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 59/171 (34%), Gaps = 24/171 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +D++ ++D S S G + I ++ + G+V +S Sbjct: 789 AETEMDLIFLIDGSNS------IGPREFETTKEWIGSF---VREFEIGEYNTKIGVVQYS 839 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S++ + + I+ + T + LEY F + H A+ Sbjct: 840 SRVRSEIDIGDYDSKADLLAAISSIEFAAGNTNTGSALEYVRTVGFSGR----HGARNGV 895 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K +I LTDG +D L R G VYAIGV Q Sbjct: 896 P--KVLIVLTDGNAQDGVLDAASKLH------RDGVAVYAIGVGRPNMGQL 938 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 61/186 (32%), Gaps = 20/186 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+M ++D S S+ + ++ M + PD VR G++ +S Sbjct: 980 ARTATDLMFLVDGSTSVGHESWGII------KSFMQNMTQKFQIGPDA---VRVGMIQYS 1030 Query: 225 SKIVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 ++ + + + ++ + + Y A+ + + Sbjct: 1031 TRPKTNIAIGQYNDKESLQEAFGQVEWQLGDTYTARALRYVSKSYARATT----RENLHA 1086 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDR 341 K +I +TDG+ N E G ++AIGV + L + D Sbjct: 1087 TKLLIIITDGQPQDRN----EVKQAVRNLHSEGWRIFAIGVGQSDISELGILASNPDSDH 1142 Query: 342 FYSVQN 347 + N Sbjct: 1143 VFYANN 1148 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 47/318 (14%), Positives = 109/318 (34%), Gaps = 36/318 (11%) Query: 35 VIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN 94 I+ + L +++ ++ + + S + Sbjct: 100 GIDLIQYMSGNTQTGLALRYAIESVFSRARED-SAKVAIVLSDGRSQDQVNEA-----AT 153 Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL 154 LR +G A I S + + +S + + +F + + L Sbjct: 154 SLRSSGIATFAVGIGDEMS----HERMEELRQISVAAAEDQSSVFMAKDFRSIGQLQERL 209 Query: 155 ITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +++ + + + D+ V+D S ++ + I++++ P+ Sbjct: 210 VSAVCEQTVQECPTSKHDLAFVIDASSTIGYN------DFMKVKAWIKKIVKAFDVGPEE 263 Query: 213 NNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDA 269 R +V +S+ + + F Q + + I+ + T + L Y ++I+ + Sbjct: 264 ---TRVAVVQYSTSVQEEFNFGHLLSKQQVLDAIDNMDYIMGDTHTGAALTYMLDEIYSS 320 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 D I +TDG+ P++ + N G VY +GV A+ + Sbjct: 321 A------NGDRPDVPDLAIVMTDGKAQEPDL----VVEAANRVHEAGVTVYTVGV-ADYS 369 Query: 330 DQFLKNCAS-PDRFYSVQ 346 + +K AS PD+ Y ++ Sbjct: 370 LEEIKVIASDPDKNYVIE 387 >gi|148694080|gb|EDL26027.1| integrin, alpha 11 [Mus musculus] Length = 1172 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 75/215 (34%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 137 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGIVQYGE 185 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 V F L + +++ + + T++ + G Sbjct: 186 DAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 238 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + ++++ YA+ V + FL Sbjct: 239 AKKVMIVITDGE----SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEI 294 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 295 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 329 >gi|32394646|gb|AAM62130.1| a11 integrin [Mus musculus] Length = 1188 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 75/215 (34%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGIVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 V F L + +++ + + T++ + G Sbjct: 209 DAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + ++++ YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|62317822|ref|YP_223675.1| NorD protein [Brucella abortus bv. 1 str. 9-941] gi|83269804|ref|YP_419095.1| von Willebrand factor, type A [Brucella melitensis biovar Abortus 2308] gi|189023075|ref|YP_001932816.1| Von Willebrand factor, type A [Brucella abortus S19] gi|237817362|ref|ZP_04596354.1| Protein norD [Brucella abortus str. 2308 A] gi|254691324|ref|ZP_05154578.1| Von Willebrand factor, type A [Brucella abortus bv. 6 str. 870] gi|254699113|ref|ZP_05160941.1| Von Willebrand factor, type A [Brucella abortus bv. 2 str. 86/8/59] gi|254732556|ref|ZP_05191134.1| Von Willebrand factor, type A [Brucella abortus bv. 4 str. 292] gi|256256509|ref|ZP_05462045.1| Von Willebrand factor, type A [Brucella abortus bv. 9 str. C68] gi|260545056|ref|ZP_05820877.1| von Willebrand factor [Brucella abortus NCTC 8038] gi|260756933|ref|ZP_05869281.1| norD protein [Brucella abortus bv. 6 str. 870] gi|260760365|ref|ZP_05872713.1| nitric oxide reductase activation protein [Brucella abortus bv. 4 str. 292] gi|260763605|ref|ZP_05875937.1| norD protein [Brucella abortus bv. 2 str. 86/8/59] gi|260882746|ref|ZP_05894360.1| protein norD [Brucella abortus bv. 9 str. C68] gi|297249867|ref|ZP_06933568.1| nitric-oxide reductase NorD protein [Brucella abortus bv. 5 str. B3196] gi|75495379|sp|Q576X0|NORD_BRUAB RecName: Full=Protein norD gi|114152095|sp|Q2YJT9|NORD_BRUA2 RecName: Full=Protein norD gi|62198015|gb|AAX76314.1| NorD protein [Brucella abortus bv. 1 str. 9-941] gi|82940078|emb|CAJ13118.1| Von Willebrand factor, type A [Brucella melitensis biovar Abortus 2308] gi|189021649|gb|ACD74370.1| Von Willebrand factor, type A [Brucella abortus S19] gi|237788175|gb|EEP62391.1| Protein norD [Brucella abortus str. 2308 A] gi|260098327|gb|EEW82201.1| von Willebrand factor [Brucella abortus NCTC 8038] gi|260670683|gb|EEX57623.1| nitric oxide reductase activation protein [Brucella abortus bv. 4 str. 292] gi|260674026|gb|EEX60847.1| norD protein [Brucella abortus bv. 2 str. 86/8/59] gi|260677041|gb|EEX63862.1| norD protein [Brucella abortus bv. 6 str. 870] gi|260872274|gb|EEX79343.1| protein norD [Brucella abortus bv. 9 str. C68] gi|297173736|gb|EFH33100.1| nitric-oxide reductase NorD protein [Brucella abortus bv. 5 str. B3196] Length = 633 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 77/205 (37%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S E + +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEVRAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVAMPAI 627 >gi|116619435|ref|YP_821591.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116222597|gb|ABJ81306.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 377 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 64/187 (34%), Gaps = 48/187 (25%) Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L+++ + W I +K+ ++ G Y+ I+ A + E + Sbjct: 142 LISYGNTADIAVNTTWDSDKIADKVRKMKPGGG-------AALYDAIYLACTRRELVKGE 194 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA---EAADQ----- 331 + ++ I+ + DG +++ + +E L A+R +YA+ A DQ Sbjct: 195 PYEPRRVIVVIGDGHDNASKHNLEEVLEL---AQRNLVTIYAVSTMAFGFSNEDQEVLER 251 Query: 332 ----------------------FLKNCASPDRF--------YSVQNSRKLHDAFLRIGKE 361 +L N + + Y+ + S + A IG E Sbjct: 252 LTHKTGGHVEYPLNSLYKGVSGYLSNPSDDGNYALTVGTGGYAAEISNGIIKAVGGIGGE 311 Query: 362 MVKQRIL 368 + Q IL Sbjct: 312 ITTQYIL 318 >gi|302869502|ref|YP_003838139.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504036|ref|YP_004082923.1| von willebrand factor type a [Micromonospora sp. L5] gi|302572361|gb|ADL48563.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315410655|gb|ADU08772.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 572 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 42/219 (19%), Positives = 82/219 (37%), Gaps = 29/219 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + T++ S + G M+ V+DVS SM + A+R + + + + Sbjct: 358 ISTATTTWSVATQSGR-MLCVIDVSGSMKK----PVATANGASREQVTVAAASQGLGLFD 412 Query: 214 NVVRSGLVTFSSKIVQTF---------PLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAY 263 + GL TFS+ + + PL+ +Q + + T + AY Sbjct: 413 DSWSIGLWTFSTNLQGSQDWSELVGIKPLSSNRGSLQRGLASIKPSSGNTGLYDTMLAAY 472 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAKRRG--AIVY 320 K+ E + + I+ TDG+N + ++++L N+ K V Sbjct: 473 KKVQQDWEPGKVNS---------IVLFTDGKNEDDDGISQKALLDQLNKLKDDEQPVQVI 523 Query: 321 AIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 IG+ E L++ + + + K+ + FLR Sbjct: 524 IIGIGTEVNRAELESITKVTGGGAFVTTDPSKIGEIFLR 562 >gi|296111730|ref|YP_003622112.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] gi|295833262|gb|ADG41143.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] Length = 894 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 46/245 (18%), Positives = 84/245 (34%), Gaps = 33/245 (13%) Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ 121 + N G + N D N +N ++ + L D Sbjct: 47 QYTNDNKGVYPTNSWTIPGQNTVINHQGGDASNGWDKNSSWSGDSSDTSKSYLKFGTDTS 106 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 + DY + + + + N+ + K+ +D+++V+D+S SM Sbjct: 107 NPDYQIRKYA--KETSTPGLYDVYLNAKGNEV----------KNIKPIDIVLVVDMSGSM 154 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGV-- 237 N G D++G A + ++ L I + V G+V FSS I + L+ + Sbjct: 155 NSSVNGGNDRVGAARQGVKNFLKTINDA-GIGKYVNVGVVGFSSPGYISSSGTLSENIDA 213 Query: 238 ---QHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 Q +IN L+ T + G+ + + KK +I LT Sbjct: 214 SDNQAHITRINNLLANDFKGGTFTQLGIRTGQSMLAGDSNDH----------KKMMILLT 263 Query: 292 DGENS 296 DG + Sbjct: 264 DGVPT 268 >gi|147898495|ref|NP_001088330.1| inter-alpha (globulin) inhibitor H2 [Xenopus laevis] gi|71051796|gb|AAH98981.1| LOC495168 protein [Xenopus laevis] Length = 935 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 74/209 (35%), Gaps = 29/209 (13%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-- 227 ++ V+DVS SM +D + + D+N+ + G++ F+ I Sbjct: 306 ILFVIDVSGSMWGLKMKQTVDAMKS-------------ILEDLNSDDQFGIIDFNHNIRC 352 Query: 228 ---VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + + R+ T L A + +A + + Sbjct: 353 WKDELVYASSVEKGDASKYVQRIQPNGGTNINDALLRAIFILKEAS----NKGLLEQNSV 408 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 I+ ++DG+ + + + + ++++G+ + FL+ A + Sbjct: 409 SLIVLVSDGDPTVGELKLPKIQKNVRTNIQDDIALHSLGIGFDVDYDFLERLAQENHGMA 468 Query: 342 ---FYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + + +L + + ++ ++K + Sbjct: 469 QRIYGNQDTAAQLKEFYNKVSTPLLKNIV 497 >gi|26248659|ref|NP_754699.1| hypothetical protein c2813 [Escherichia coli CFT073] gi|91211565|ref|YP_541551.1| hypothetical protein UTI89_C2553 [Escherichia coli UTI89] gi|117624462|ref|YP_853375.1| hypothetical protein APECO1_4292 [Escherichia coli APEC O1] gi|227887329|ref|ZP_04005134.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|300983332|ref|ZP_07176546.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|301049025|ref|ZP_07196011.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|26109064|gb|AAN81267.1|AE016763_226 Hypothetical protein yfbK [Escherichia coli CFT073] gi|91073139|gb|ABE08020.1| hypothetical protein YfbK [Escherichia coli UTI89] gi|115513586|gb|ABJ01661.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|227835679|gb|EEJ46145.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|294491429|gb|ADE90185.1| von Willebrand factor type A domain protein [Escherichia coli IHE3034] gi|300299173|gb|EFJ55558.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|300408590|gb|EFJ92128.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|307554335|gb|ADN47110.1| von Willebrand factor type A domain protein [Escherichia coli ABU 83972] gi|307626191|gb|ADN70495.1| hypothetical protein UM146_05450 [Escherichia coli UM146] gi|315285863|gb|EFU45301.1| von Willebrand factor type A domain protein [Escherichia coli MS 110-3] gi|315292192|gb|EFU51544.1| von Willebrand factor type A domain protein [Escherichia coli MS 153-1] gi|323952065|gb|EGB47939.1| von Willebrand protein type A [Escherichia coli H252] gi|323956039|gb|EGB51792.1| von Willebrand protein type A [Escherichia coli H263] Length = 580 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 68/192 (35%), Gaps = 21/192 (10%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ +++ ++D S SM ++L + S++ ++ ++ ++ V +G Sbjct: 213 KSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDNIAIVTYAG-- 265 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+I I I+ L +T GLE AY + KG Sbjct: 266 --DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGI 317 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--S 338 + I+ TDG+ + D K + + G + +GV + + + A Sbjct: 318 NR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVG 374 Query: 339 PDRFYSVQNSRK 350 + + + Sbjct: 375 NGNYSYIDTLSE 386 >gi|328676286|gb|AEB27156.1| hypothetical protein FNFX1_0208 [Francisella cf. novicida Fx1] Length = 332 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 79/235 (33%), Gaps = 42/235 (17%) Query: 134 EMPFIFCTFPWCAN-SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P IF S P V + + ++ LDVS SM+ +L Sbjct: 58 LVPLIFLLIWLVTIFSLAGPTWKYKDVPVY---QKNISRVIALDVSQSMDTTDVSP-SRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 A I ++L IK G++ FSS+ PL I+ + + Sbjct: 114 ERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTIENLVTVINSDIV 166 Query: 251 --STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 L+ + I A + II +TD + + +++ Sbjct: 167 PVQGHNIYKALKKSAQLIEQAGVQQGQ-----------IILITD------SSPSPQAISQ 209 Query: 309 CNEAKRRGAI--VYAI-----GVQAEAADQFLKNCASPDRFYSVQ--NSRKLHDA 354 + ++G VYAI G+ + +LK+ +++ + ++L A Sbjct: 210 AKQLAQQGIKTDVYAIGTPMGGIAKDEKGNYLKDSQGNIQYFGIDLSKLQELATA 264 >gi|255513821|gb|EET90086.1| von Willebrand factor type A [Candidatus Micrarchaeum acidiphilum ARMAN-2] Length = 705 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 72/205 (35%), Gaps = 32/205 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D G ++ M+LD+S SM K+ A R + + D + D + V + F Sbjct: 522 KDAGAEIWMLLDISGSMGGQ------KINAAKRILGSIHDSL----DGSKYVHLRMFGFY 571 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 T + + + L T + + YA + + K ++ Sbjct: 572 GS-DGTHVFEFDRKMLMN----LAAMGDTPTDIAIYYAMDLM----------KKDKSNFD 616 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K + +TDG+ ++ N K V+ I + EAA + + D +++ Sbjct: 617 KTLFIITDGDPNNGQETKNALNSLKNAMKN--VNVFTIFISREAARAV-EIFSPSDWYFN 673 Query: 345 VQNSRK----LHDAFLRIGKEMVKQ 365 V + + L I + KQ Sbjct: 674 VSSMDEVEKVLEKGIKGIVDNIKKQ 698 >gi|218662246|ref|ZP_03518176.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli IE4771] Length = 487 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 77/198 (38%), Gaps = 28/198 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR--SGLVTFSSKIV 228 ++ V+D S SM+ GP +++ + L+ D NV+R + + +V Sbjct: 49 VVFVIDNSGSMS---GPSIEQARQSLALAISRLNP----DDRFNVIRFDDTMTDYFKGLV 101 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P + + L T+ P LE DA +A G + ++ Sbjct: 102 AATP--DNREKAIAYVRGLTADGGTEMLPALE-------DALRNQGPVASGAL---RQVV 149 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQ 346 FLTDG I N++ LF A R A V+ +G+ + F+ A R F ++ Sbjct: 150 FLTDG-----AIGNEQQLFQEITANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIG 204 Query: 347 NSRKLHDAFLRIGKEMVK 364 ++ ++ + ++ Sbjct: 205 STDQVASRMGELFAKLQN 222 >gi|194675927|ref|XP_001788181.1| PREDICTED: integrin, alpha E [Bos taurus] gi|297486662|ref|XP_002695836.1| PREDICTED: integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Bos taurus] gi|296476814|gb|DAA18929.1| integrin, alpha E [Bos taurus] Length = 1163 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 45/207 (21%), Positives = 74/207 (35%), Gaps = 30/207 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ +VLD S S++ P A I +M+ K+ R +V + Sbjct: 181 AGTEIAIVLDGSGSID----PP--DFQKAKDFIYDMM---KNFYTKCFECRFAVVQYGEV 231 Query: 227 IVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 I L Q ++ ++R+ G TK+ +++ + IF + A Sbjct: 232 IQTELDL-LDSQDVRASLDRVKNISQVGKITKTASAMQHVLDNIFTPNQGSRAKA----- 285 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF----LKNCAS 338 K ++ LTDGE D N K G +AIGV LK AS Sbjct: 286 -SKVMVVLTDGE---IFQDPLNLTTVINSPKMHGVERFAIGVGEAFNKSKAYHELKLIAS 341 Query: 339 P---DRFYSVQNSRKLHDAFLRIGKEM 362 D + V N L ++ + + Sbjct: 342 DPDEDHAFKVTNYMALDGLLNKLQESI 368 >gi|108761535|ref|YP_628568.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108465415|gb|ABF90600.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 445 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 68/208 (32%), Gaps = 27/208 (12%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + + + LD S SM + + + + GL+TF + Sbjct: 62 RAPVAINLALDRSASMRG------------VPLLAAVQAAQALVERASPRDYLGLLTFDA 109 Query: 226 KIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q P+ + + + RL G T +E A E + Sbjct: 110 EPEQVLPMRAMDPNARAQLLKVLARLESGEGTALHEAVERA-------SESARRVLVPGA 162 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD- 340 + ++ LTDGE S E G +++A+G+ L+ SP Sbjct: 163 RPQ--VLMLTDGEPSVGPSQLAEFKTQGARIAESGVMLHALGLGRHYLPDILEALTSPSG 220 Query: 341 -RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 F V + L A ++G E+ + + Sbjct: 221 TGFVHVDDPEGLPMAVGQLGAELFGEVV 248 >gi|308509664|ref|XP_003117015.1| CRE-CLEC-60 protein [Caenorhabditis remanei] gi|308241929|gb|EFO85881.1| CRE-CLEC-60 protein [Caenorhabditis remanei] Length = 399 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 67/191 (35%), Gaps = 18/191 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ L+++ V+D S M G+ + S+ I + P+ R GLVT++ Sbjct: 35 GNLWLEVVAVVDNSKGMT---NGGLISIAANIASVFSNNTRIGTNPNEPRTTRLGLVTYN 91 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + L Q + + + + G S+ +S A +I + Sbjct: 92 AVANTVADLD-QFQSLDDVYDGIFGGLAQVSSSDESYLAHGLAQAEIILEAGQTA---VN 147 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNC 336 Y++ +I S ++D + + K G + + + L+ Sbjct: 148 RSHYERVVIVYASTYKGSGSLDP---IPVADRLKTAGVTIITVAYDQDGDGALLHDLQKI 204 Query: 337 ASPDRFYSVQN 347 ++P ++ + Sbjct: 205 STPPYNFANTD 215 >gi|156741405|ref|YP_001431534.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232733|gb|ABU57516.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 329 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 48/133 (36%), Gaps = 18/133 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIV 228 + V+D S SM +++ ++ +L R GLV+F Sbjct: 143 AVCFVVDASWSMAAE-----ERMRATKAAVLSLLR-----DAYQRRDRVGLVSFQRDYAT 192 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL V Q ++ ++ G T + GL Y + A+ + + ++ Sbjct: 193 LLLPLTNSVDLAQRQLQQMPTGGKTPLSRGLLLGYEVLERARRQDPEVMP-------LLV 245 Query: 289 FLTDGENSSPNID 301 LTDG+ + D Sbjct: 246 LLTDGQANVSMSD 258 >gi|120537195|ref|YP_957252.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120327030|gb|ABM21337.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 584 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 69/198 (34%), Gaps = 35/198 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM--LDIIKSIPDVNNVVRSGLVTFSSKI 227 + ++LD S SM +I + DI S+ N G V+ + Sbjct: 418 SVHVLLDCSGSMQHQQEIANQATVSLALAISTIPKCDIAVSMFPGN----GGSVSPMIRR 473 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q G + G T + YA ++ + + + + Sbjct: 474 GQPVRPNLGRFCVDS-------GGGTPLAEAMLYAARELTASHKAR-----------QVM 515 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I +TDG+ ++P +++ Y N + YAIG+ + A +F + + + + Sbjct: 516 IVITDGDPNNP-----QAVNYLNGLIKGHIDTYAIGIGSPAVKRFF------ENWCVISD 564 Query: 348 SRKLHDAFLRIGKEMVKQ 365 +L A I +++ Sbjct: 565 VSQLQSALFSIASNVLEL 582 >gi|118579690|ref|YP_900940.1| magnesium chelatase [Pelobacter propionicus DSM 2379] gi|118502400|gb|ABK98882.1| protoporphyrin IX magnesium-chelatase [Pelobacter propionicus DSM 2379] Length = 689 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 57/157 (36%), Gaps = 18/157 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + +G ++ V+D S SM ++ + ++ +L + R G+++ Sbjct: 497 REKRVGNFLLFVVDASGSMG-----ARGRMAASKGAVMSLL-----LDAYQKRDRVGMIS 546 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F ++ P V+ + + + G T + + +Y ++ + Sbjct: 547 FRKNEAFVNLPPTTSVELAGKLLEEMPVGGRTPLSAAIAKSYEQLRGVLGRDP------- 599 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + +IF+TDG+++ D + A+ Sbjct: 600 TARPIVIFITDGKSNVALGDGRPVDEAMGLARAMAVK 636 >gi|301770509|ref|XP_002920678.1| PREDICTED: integrin alpha-11-like [Ailuropoda melanoleuca] Length = 1203 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 175 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 223 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 224 DVVHEFHL-NDYRSVRDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 276 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK ++ +TDGE + D+ + +++R YA+ V + FL Sbjct: 277 AKKVMVVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPEAFLNEI 332 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 333 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 367 >gi|289425109|ref|ZP_06426886.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154087|gb|EFD02775.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] Length = 322 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 66/204 (32%), Gaps = 33/204 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D SLSM + + D I S+P N +V+ S Sbjct: 96 IVVAIDSSLSMKADDVSP----TRLAAAKAKAKDFINSLPTGFN---VAVVSISEHPEIR 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + ++ + T ++ + + A ++ A I+ L Sbjct: 149 MLPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVML 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC 336 +DG+N+ L N A VY I E + L Sbjct: 203 SDGDNTQGG----SPLVAANRAAAAKVPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTV 258 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + + ++ ++ KL + + ++ Sbjct: 259 ADRTHAKSWTADSADKLQEVYQQV 282 >gi|281343950|gb|EFB19534.1| hypothetical protein PANDA_009430 [Ailuropoda melanoleuca] Length = 1112 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 76/215 (35%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 144 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 192 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 193 DVVHEFHL-NDYRSVRDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 245 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK ++ +TDGE + D+ + +++R YA+ V + FL Sbjct: 246 AKKVMVVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPEAFLNEI 301 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 302 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 336 >gi|229550752|ref|ZP_04439477.1| probable yvcC protein [Lactobacillus rhamnosus LMS2-1] gi|229315887|gb|EEN81860.1| probable yvcC protein [Lactobacillus rhamnosus LMS2-1] Length = 898 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 60/408 (14%), Positives = 111/408 (27%), Gaps = 96/408 (23%) Query: 21 TAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFS 80 T L V+ I+M ++ + + NG Sbjct: 11 TGHLFAVLLILMSMLTGLVTSG-------SSVVTAAANIRPTYQTDANGTYPTNSWQVTG 63 Query: 81 YRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC 140 + + N D + N D + + S D DY + + + Sbjct: 64 QQNVINQRGGDQVSGWDNN-TTWDGDATNTTNSYLKFGDPNKPDYQIRKYA--KETNTPG 120 Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 + N V D+++V+D+S SM + G ++ G ++ Sbjct: 121 LYDVYLNVKGNTQQNVKPV----------DIVLVVDMSGSMESN-SSGTNRAGAVRTGVK 169 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIV-----QTFPLAWGVQHIQEKINRLI------F 249 L I++ + N V GL+ FSS + G + + F Sbjct: 170 NFLTSIQNA-GLGNYVNVGLIGFSSPGYIGGKSGYISVKLGKAGNASQQQAINGALSPRF 228 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN---------- 299 T + GL + KK +I LTDG + N Sbjct: 229 QGGTYTQIGLRQGSAMLNADASGN----------KKMMILLTDGVPTFSNEVINSEWING 278 Query: 300 ---------------------------------IDNKESLFYCNEAKRRGAIVYAIGVQA 326 +L +AK G V+A+G+Q Sbjct: 279 TLYGTNFGSSRDEPGNTAQLGWPYIDSSGNRIYDTWPATLGEAKKAKDSGNEVHALGIQL 338 Query: 327 EAADQFLKN----------CASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +++ SPD + ++ + K+++K Sbjct: 339 ADDRKYMTKEKIRQNMQLITNSPDLYEDADSADAVEAYLNNQAKDIIK 386 >gi|260813584|ref|XP_002601497.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] gi|229286794|gb|EEN57509.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] Length = 260 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 41/193 (21%), Positives = 77/193 (39%), Gaps = 25/193 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ G+ + + + P + G++ +S++ Sbjct: 1 PLDVIFLLDGSGSV------GVVNFEKVKQFTLKTVIGFDIGP---TATQVGIIQYSTRP 51 Query: 228 VQTFPL-AWGV-QHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q F + ++ + + I N T + + Y F + A+ Sbjct: 52 RQEFSMNSFQTKESLSTAIENVNYMAGGTLTGRAIRYVTKYGFGESD----GARPGIP-- 105 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 K +I +TDG +S + EA+++G +YAIGV DQ L+ AS +R + Sbjct: 106 KIVILVTDGVSSD------DIEQPALEAQQKGISLYAIGVSGYDMDQ-LERIASNNRTLA 158 Query: 345 VQNSRKLHDAFLR 357 V + L D+ Sbjct: 159 VAENFNLLDSLRN 171 >gi|194043712|ref|XP_001928130.1| PREDICTED: collagen alpha-3(VI) chain isoform 2 [Sus scrofa] Length = 2972 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 115/310 (37%), Gaps = 32/310 (10%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD + LYT + + N + + + + + L + +Q ++RS Sbjct: 311 LDGAALYTGSALEFVRNNMFTSEAGYRAAEGVPRLLVLITGGKSL--DDVSQPAQELKRS 368 Query: 112 TSLSIIIDDQHKDYNLSAVSRYE-----MPFIFCTFPW--CANSSHAPL-LITSSVKISS 163 + ++ + ++ D ++ +P F P +S APL ++ + ++ + Sbjct: 369 SIMAFAVGNKAADQAELEEIAFDPSLVFIPTEFRASPLQGVLSSLLAPLRTLSGTTEVHA 428 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S+++ P + +++++ S+ + +R GLV F Sbjct: 429 NKR---DIIFLLDGSVNVGRTRFPYVRDF---------VMNLVNSLDVGGDNIRVGLVQF 476 Query: 224 SSKIVQTFPL-AWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S V F L + + + + +L S A + + ++ + Sbjct: 477 SDTPVTEFSLNTYQTKSDLLAHVRQLQLKGG--SGLNTGAALSYVHANHFTEAGGSRIQE 534 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + ++ LT G P+ D+ L N R G + + +G + + +P Sbjct: 535 QVPQLLLLLTAG----PSEDSY--LQAANALARSGILTFCVGASQANKAELEQIAFNPSL 588 Query: 342 FYSVQNSRKL 351 Y + + L Sbjct: 589 VYLMDDFSSL 598 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 44/358 (12%), Positives = 118/358 (32%), Gaps = 54/358 (15%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1266 VRVGVVQFSNDVFPEFYLKTYRSQA---SVLDAIRRLRFKGGSPLNTGKALEFVA---RN 1319 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL---------SIIIDDQHKDYNLSA 129 + + + + + +++ R + + + + + + Sbjct: 1320 FFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRYSQVMGSSGIVRLGVGDRNIDRTELQTI 1379 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK------------SDIGLDMMMVLDV 177 + + F F N + ++ D++ +LD Sbjct: 1380 TNDPRLVFTVREFRELPNIEERVMNSFGPSGVTPAPPGVDILPPSRPEKKKADIVFLLDG 1439 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S D R + E++D + + + ++ GLV ++S F L Sbjct: 1440 S------INFRRDSFQEVLRFVSEIVDTV---YEGGDSIQVGLVQYNSDPTDEFFLKDFS 1490 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 Q I + IN++++ + + + + E ++ + +T G++ Sbjct: 1491 TKQQIIDAINKVVYKGGRHANT--KVGLEHLRLNQFVPEAGSRLEQRVPQIAFVITGGKS 1548 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1549 VEDAQEASLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1600 >gi|194043710|ref|XP_001928122.1| PREDICTED: collagen alpha-3(VI) chain isoform 1 [Sus scrofa] Length = 3178 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 115/310 (37%), Gaps = 32/310 (10%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD + LYT + + N + + + + + L + +Q ++RS Sbjct: 517 LDGAALYTGSALEFVRNNMFTSEAGYRAAEGVPRLLVLITGGKSL--DDVSQPAQELKRS 574 Query: 112 TSLSIIIDDQHKDYNLSAVSRYE-----MPFIFCTFPW--CANSSHAPL-LITSSVKISS 163 + ++ + ++ D ++ +P F P +S APL ++ + ++ + Sbjct: 575 SIMAFAVGNKAADQAELEEIAFDPSLVFIPTEFRASPLQGVLSSLLAPLRTLSGTTEVHA 634 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D++ +LD S+++ P + +++++ S+ + +R GLV F Sbjct: 635 NKR---DIIFLLDGSVNVGRTRFPYVRDF---------VMNLVNSLDVGGDNIRVGLVQF 682 Query: 224 SSKIVQTFPL-AWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S V F L + + + + +L S A + + ++ + Sbjct: 683 SDTPVTEFSLNTYQTKSDLLAHVRQLQLKGG--SGLNTGAALSYVHANHFTEAGGSRIQE 740 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + ++ LT G P+ D+ L N R G + + +G + + +P Sbjct: 741 QVPQLLLLLTAG----PSEDSY--LQAANALARSGILTFCVGASQANKAELEQIAFNPSL 794 Query: 342 FYSVQNSRKL 351 Y + + L Sbjct: 795 VYLMDDFSSL 804 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 44/358 (12%), Positives = 118/358 (32%), Gaps = 54/358 (15%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1472 VRVGVVQFSNDVFPEFYLKTYRSQA---SVLDAIRRLRFKGGSPLNTGKALEFVA---RN 1525 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL---------SIIIDDQHKDYNLSA 129 + + + + + +++ R + + + + + + Sbjct: 1526 FFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRYSQVMGSSGIVRLGVGDRNIDRTELQTI 1585 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK------------SDIGLDMMMVLDV 177 + + F F N + ++ D++ +LD Sbjct: 1586 TNDPRLVFTVREFRELPNIEERVMNSFGPSGVTPAPPGVDILPPSRPEKKKADIVFLLDG 1645 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S D R + E++D + + + ++ GLV ++S F L Sbjct: 1646 S------INFRRDSFQEVLRFVSEIVDTV---YEGGDSIQVGLVQYNSDPTDEFFLKDFS 1696 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 Q I + IN++++ + + + + E ++ + +T G++ Sbjct: 1697 TKQQIIDAINKVVYKGGRHANT--KVGLEHLRLNQFVPEAGSRLEQRVPQIAFVITGGKS 1754 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1755 VEDAQEASLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1806 >gi|118384937|ref|XP_001025607.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89307374|gb|EAS05362.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 670 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 70/211 (33%), Gaps = 34/211 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHF------------GPGMDKLGVATRSIREMLDIIKSIPDV 212 S ++ V+DVS SM+ + L V SI+ +++ + S Sbjct: 29 SRTNSNICCVVDVSGSMSSEAKIINQSSQKSDENYSLSILDVVKHSIKMIVNTLGSED-- 86 Query: 213 NNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 +VTFS F L EKI L T+ GL A N + + Sbjct: 87 ----YLSIVTFSDSANVLFDLLPMNDSNKTMAIEKIENLSTEGGTELWKGLNSALNILLN 142 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 K + + I LTDG+ + ID + + + + G + + Sbjct: 143 NKTPNTNQS---------IFLLTDGQPTDSGIDT-NLVKFKQAYPKLNCTINTFGFSSSS 192 Query: 329 ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + + A F + ++ + AF Sbjct: 193 NSELMNKIAMEYNGMFSFIPDASFIATAFAN 223 >gi|326478189|gb|EGE02199.1| U-box domain-containing protein [Trichophyton equinum CBS 127.97] Length = 741 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 73/215 (33%), Gaps = 26/215 (12%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + N + I +K + + D+++V+D+S SMN + Sbjct: 39 ILSIHPIPNKDSMVVSIQPPLKPENDVPHVPCDIVLVIDISGSMNSAAPIPTGE-RGGED 97 Query: 198 SIREMLDIIK-----SIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLI 248 + +LD+ K I +N R +VTF +++ F L + I++L Sbjct: 98 TGLSILDLTKHAAKTIIETLNEKDRLAVVTFCTEVNVAFELDSMNKENKSTVLSAIDKLY 157 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS------PNIDN 302 S+T G++ N + + ++ LTDG + Sbjct: 158 GKSSTNLWHGIKKGLNVLATN---------PVRGNIQSLLVLTDGAPNHMCPVQGYVPKL 208 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +++L +++ G L++ A Sbjct: 209 RQTLLDHRNLTGTLPLIHTFGFGYYLRSPLLQSIA 243 >gi|326474578|gb|EGD98587.1| hypothetical protein TESG_05957 [Trichophyton tonsurans CBS 112818] Length = 741 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 73/215 (33%), Gaps = 26/215 (12%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + N + I +K + + D+++V+D+S SMN + Sbjct: 39 ILSIHPIPNKDSMVVSIQPPLKPENDVPHVPCDIVLVIDISGSMNSAAPIPTGE-RGGED 97 Query: 198 SIREMLDIIK-----SIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLI 248 + +LD+ K I +N R +VTF +++ F L + I++L Sbjct: 98 TGLSILDLTKHAAKTIIETLNEKDRLAVVTFCTEVNVAFELDSMNKENKSTVLSAIDKLY 157 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS------PNIDN 302 S+T G++ N + + ++ LTDG + Sbjct: 158 GKSSTNLWHGIKKGLNVLATN---------PVRGNIQSLLVLTDGAPNHMCPVQGYVPKL 208 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +++L +++ G L++ A Sbjct: 209 RQTLLDHRNLTGTLPLIHTFGFGYYLRSPLLQSIA 243 >gi|281350503|gb|EFB26087.1| hypothetical protein PANDA_008525 [Ailuropoda melanoleuca] Length = 961 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 66/183 (36%), Gaps = 18/183 (9%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S +D++ ++D S S DK S+ + + + + + ++ + Sbjct: 36 TVDSTCFIDVVFIVDSSES---SKIILFDKQKDFVDSLSDRVFQLTPVRSLKYDIKLAAL 92 Query: 222 TFSSKIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FSS + + W +Q ++++ + F G T S + A + K Sbjct: 93 QFSSSVQIDPSFSSWKDLQTFKQRVKSMNFIGQGTFSYYAISNATGLLKREGRKDG---- 148 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K + +TDG + N D + +A+ G + IG+ + L+ + Sbjct: 149 -----VKVALLMTDGIDHPKNPDVQS---ISEDARTAGILFITIGLSTVVNEAKLRLISG 200 Query: 339 PDR 341 Sbjct: 201 DSS 203 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 62/164 (37%), Gaps = 26/164 (15%) Query: 165 SDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L+++ V+D S S+ ++F + + + L R G++ + Sbjct: 622 KETPLELLFVIDSSESVGLENFEIIKSLVKTLSDQVALDLAA----------ARVGIINY 671 Query: 224 SSKIVQTFPLA-WGVQH-IQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S K+ + L + + + ++R+ G T + L A N +F+A Sbjct: 672 SHKVEKVAHLTQFSTKDDFKLAVDRMQYLGEGTYTASALHEA-NHMFEAARPG------- 723 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 KK + +TDG+ D K +A ++ IGV Sbjct: 724 --VKKVALVITDGQTD--TRDEKNLTEVVKKASDINVEIFVIGV 763 >gi|256084536|ref|XP_002578484.1| vacuolar protein sorting 26 vps26 [Schistosoma mansoni] gi|238663859|emb|CAZ34722.1| vacuolar protein sorting 26, vps26, putative [Schistosoma mansoni] Length = 1295 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 70/202 (34%), Gaps = 36/202 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + S D+ + + ++D S SM D + A S+ L KS+P R + Sbjct: 287 VVSSKDMRYEFVFLIDRSGSMEG------DNISYAKTSLLLFL---KSLPMS---CRFQI 334 Query: 221 VTFSSKIVQTFPLAWG-----VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLE 274 + F S FP + L T++ L+ A Sbjct: 335 IGFGSDFAALFPEPTDYSEGSLNTAMNYQKDLNADMGGTEAYNALKAAL----------- 383 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + + K IIFLTDG+ N D L N K R V+ IG+ + + Sbjct: 384 HSTPSGEGWFKQIIFLTDGD--VGNADEVIGLVRMNVDKAR---VFTIGLGQGVSTALIG 438 Query: 335 NCASPDRFYS--VQNSRKLHDA 354 A + V++ +L A Sbjct: 439 GVARAGNGTAEFVRDPSQLQSA 460 >gi|149732066|ref|XP_001492602.1| PREDICTED: similar to complement factor B [Equus caballus] Length = 768 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 75/222 (33%), Gaps = 36/222 (16%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +R+ ++ + S R GLVT+++ Sbjct: 261 IVLDPSGSMNIYMVLDGSDSIGAHNFTRAKNCLRDFIEKVASYGVKP---RYGLVTYATV 317 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N + + T + L+ Y+ + Sbjct: 318 PKVLIRVSQERSSDADWVTEKLNEISYEDHKLKTGTNTRKALQAVYSMMSWEGNAP---P 374 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG----------AIVYAIGVQAE 327 +G + + I+ +TDG ++ D + + G +Y GV Sbjct: 375 EGWNRTRHVILLMTDGLHNMGG-DPVPVIHEIRDLLDIGRDRKNPREDYLDIYVFGVGPL 433 Query: 328 AADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + AS + V++ L D F ++ E Sbjct: 434 VNQENINALASKKDGEQHVFKVKDMENLEDVFFQMLDETRTL 475 >gi|115623672|ref|XP_785426.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960633|ref|XP_001186588.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 1028 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 58/176 (32%), Gaps = 31/176 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +++ V+DVS SM K R+ +LD ++ I R ++ FSS Sbjct: 306 NTRKNVIFVIDVSGSMYGQ------KTRQTKRAFTTILDDVRPID------RINIILFSS 353 Query: 226 KI------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + + +N L G T L A + LEH Sbjct: 354 YAHVWREDQMVEATSDNIAAAKRHVNGLSVGGGTNIYDSLMKAVEIL------LEHDTGD 407 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 II LTDG+ + ++ +++IG FL+ Sbjct: 408 AMPL---IIMLTDGQVGNAAAIVRDVTSVIG----GRLSLFSIGFGNGVDFPFLEK 456 >gi|297680998|ref|XP_002818254.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Pongo abelii] Length = 1125 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 62/179 (34%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + ++ ++ + FSS Sbjct: 47 IDIVFIVDSSES------SKIVLFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W +Q ++K+ + G T S + A + K Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNATRLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K + +TDG + N D + +A+ G IG+ + L+ + Sbjct: 153 -VKVALLMTDGIDHPRNPDVQS---ISEDARISGISFITIGLSTVVNEAKLRLISGDSS 207 >gi|254447511|ref|ZP_05060977.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] gi|198262854|gb|EDY87133.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] Length = 670 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 82/217 (37%), Gaps = 36/217 (16%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +++S + +++ V+D S SM ++ A +++ +++ ++ R Sbjct: 309 PDEMTSGPRMPREVVFVIDTSGSMAGQ------RMYHAKQALS------QAVERLSPDDR 356 Query: 218 SGLVTFSSK-----IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +V F+++ A V+ + RL G T P +E A + D Sbjct: 357 FNVVEFNNQHSRLFSSMRSASAINVKQALNWVGRLQGGGGTMMLPAVEDALSVRSDPAYL 416 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +I +TD ++ N+ + E +R+GA ++ +G+ Sbjct: 417 RQ------------VILITD-----ASVGNEAEILRVVERQRKGARLFTVGIGVSPNSYL 459 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L+ A + + + +++ R+ ++ + Sbjct: 460 LRKAAQVGQGDYVYIASGQEVKARMQRLFAKLENPVL 496 >gi|20089761|ref|NP_615836.1| magnesium-chelatase subunit [Methanosarcina acetivorans C2A] gi|19914699|gb|AAM04316.1| magnesium-chelatase subunit [Methanosarcina acetivorans C2A] Length = 704 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 59/160 (36%), Gaps = 21/160 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + IG ++ V+D S SM ++ + ++ ML + + GL+ Sbjct: 503 REKKIGNLVLFVVDASGSMG-----ARQRMVASKGAVLSML-----MDAYQKRDKVGLIA 552 Query: 223 FSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P V+ Q+ + + G T + GL Y I + + Sbjct: 553 FKGESAELLLPPTSSVELAQKYLQEMPTGGKTPLSRGLAKGYEVIKNELRRD-----PDT 607 Query: 282 DYKKYIIFLTDG-ENSSPNIDN--KESLFYCNEAKRRGAI 318 +++ ++DG N S N + +E+ + + RG Sbjct: 608 CP--FMVLISDGRANVSMNGEPPLQETKTIASLFRDRGIQ 645 >gi|47224105|emb|CAG13025.1| unnamed protein product [Tetraodon nigroviridis] Length = 983 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 81/221 (36%), Gaps = 45/221 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++D S S++ L + S+ EML+ + VN +V F++++ + Sbjct: 78 DMLILVDASGSVSGL------TLKLIRTSVTEMLETLSDDDYVN------VVYFNTQVKK 125 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 T + +++ + + T T GLE+A+ ++ + Sbjct: 126 TACFDHLVQANVRNKKLLKDAVQNITAKGITNYTKGLEFAFEQLSVTNVSRANCN----- 180 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG + +++ A + ++ V + C++ Sbjct: 181 --KIIMLFTDG-----GEERAQAILEKYNA-DKKVRIFTFSVGQHNYDKGPIQWMACSNK 232 Query: 340 DRFYSV-------QNSRKLHDAFLR---IGKEMVKQRILYN 370 FY + N+++ D R + + KQ N Sbjct: 233 GYFYEIPSIGAIRINTQEYLDVLGRPMVLADKQAKQVQWTN 273 >gi|330834639|ref|YP_004409367.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] gi|329566778|gb|AEB94883.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] Length = 383 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 68/190 (35%), Gaps = 33/190 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + G +++LD S SM K+ A + E+L K IP N +TFS Sbjct: 34 TATGFHYIVLLDTSGSMEGL------KIESAKKGAIELL---KRIPQGNK---VSFITFS 81 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S++ + + +I L G T L A+N Sbjct: 82 SRVNVVREFV-DPEDLTNEIVNLNAGGQTALFTALLTAFN------------LHNKHGVP 128 Query: 285 KYIIFLTDGENS-SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 Y+I LTDG + NI+ L + G+ + + LK+ A S Sbjct: 129 SYVILLTDGNPTDDTNIETYRRLQI-----PSSVQAVSFGLGDDYNESILKSLADKSGGV 183 Query: 342 FYSVQNSRKL 351 FY V ++ ++ Sbjct: 184 FYHVSDAMEI 193 >gi|313233701|emb|CBY09871.1| unnamed protein product [Oikopleura dioica] Length = 663 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 70/211 (33%), Gaps = 38/211 (18%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V + LD+ ++LDVS ++ + D + +L+ ++ N V+ Sbjct: 451 VVCPATLTGRLDLAVLLDVSGTIASNPNKDQDTFD----FFQALLNEFDTV----NQVQL 502 Query: 219 GLVTFSSKIVQTFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + +FS V P+ +G + + RL T GL+ A + + Sbjct: 503 SITSFSDDAVVDLPMGHYNEPDLFGAVKNVDWVGRL-----TDINEGLQTALSTMN---- 553 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 DD +IF++DG + + D + G V ++G Sbjct: 554 -------TTDDVPDIMIFVSDGFD---SFDPGAIGDNAADISNAGVDVVSVGFGLNGFVN 603 Query: 332 FLKNCASPD----RFYSVQNSRKLHDAFLRI 358 F+ D ++ +L I Sbjct: 604 FMALVTVADNEGANVFTASTGDELLSQTTAI 634 >gi|307324435|ref|ZP_07603643.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890166|gb|EFN21144.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 543 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 41/200 (20%), Positives = 71/200 (35%), Gaps = 32/200 (16%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREML--DIIKSIPDVNNVVRSGLVTFSSKI---- 227 VLD S SMN D+L R++ ++ D+ + + L+ F S++ Sbjct: 357 VLDTSGSMNG------DRLAQLKRALGQLAGSDVSPTGDRFRDREEVTLMPFGSRVKGVR 410 Query: 228 ------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + P+ + I+ L T L+ AY+ + + L Sbjct: 411 THTVPGERPAPV---LAAIRADAEALTADGDTAIFSSLQAAYDHLAQRRSALGDDRFTS- 466 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLKNCA--S 338 I+ +TDGEN++ + +Y R A V+ I Q L++ A + Sbjct: 467 -----IVLMTDGENTTGATASDFDAYYRRLRGPERTAPVFPIVFGDSDRSQ-LQSIATLT 520 Query: 339 PDRFYSVQNSRKLHDAFLRI 358 R + L AF I Sbjct: 521 GGRLFDATK-GSLDGAFEEI 539 >gi|218690433|ref|YP_002398645.1| hypothetical protein ECED1_2737 [Escherichia coli ED1a] gi|218427997|emb|CAR08918.2| conserved hypothetical protein [Escherichia coli ED1a] Length = 580 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 68/192 (35%), Gaps = 21/192 (10%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ +++ ++D S SM ++L + S++ ++ ++ ++ V +G Sbjct: 213 KSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDNIAIVTYAG-- 265 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+I I I+ L +T GLE AY + KG Sbjct: 266 --DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGI 317 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--S 338 + I+ TDG+ + D K + + G + +GV + + + A Sbjct: 318 NR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVG 374 Query: 339 PDRFYSVQNSRK 350 + + + Sbjct: 375 NGNYSYIDTLSE 386 >gi|156147106|gb|ABU53697.1| CnPolydom [Hydractinia symbiolongicarpus] Length = 551 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 41/197 (20%), Positives = 68/197 (34%), Gaps = 22/197 (11%) Query: 179 LSM---NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS------KIVQ 229 SM G M ++ ++ +L ++ +N R + TFSS + Sbjct: 1 GSMVSTGSFQGRTMTGFQISKAFVKALLSEVRVA---SNATRIAIGTFSSDHRINFNYIL 57 Query: 230 TFPLAWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFD-AKEKLEHIAKGHDDYKKYI 287 A + ++ F T L+ AYN + + H + + K + Sbjct: 58 NPSYANTKCKFNDDFKKIKFDGFMTNIKGSLQDAYNVFRSLDSDPVTHSRRPRSN--KVV 115 Query: 288 IFLTDGE-NSSPNIDNKESLFYCNEA---KRRG-AIVYAIGVQAEAADQFLKNCAS-PDR 341 I LTDG N N + EA K+ G +Y +GV LK A+ P Sbjct: 116 ILLTDGVGNMVGNRVDSAGADGAPEALRLKQTGYVELYTVGVTHATDQNMLKKIATDPSL 175 Query: 342 FYSVQNSRKLHDAFLRI 358 F ++ L + I Sbjct: 176 FLYSKDFTDLGNLAANI 192 >gi|108803481|ref|YP_643418.1| protoporphyrin IX magnesium-chelatase [Rubrobacter xylanophilus DSM 9941] gi|108764724|gb|ABG03606.1| protoporphyrin IX magnesium-chelatase [Rubrobacter xylanophilus DSM 9941] Length = 616 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 61/160 (38%), Gaps = 24/160 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQ 229 +++V+D S SM ++ ++R +L+ R+ +++F + Sbjct: 450 LVLVVDSSGSMA-----ARSRMSAVKGAVRALLE-----DAYRRRDRAAVISFRGEEARL 499 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P A GV+ ++ L G T GLE A E + A + + ++ Sbjct: 500 LVPPASGVEAAAARLEELPTGGRTPLAAGLELA-------AETVLREASREPERRPLLVV 552 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +TDG ++ ++ L + RG ++ V E Sbjct: 553 ITDGRATAG----EDPLAAARRLRERGVP--SVVVDTEGG 586 >gi|116751108|ref|YP_847795.1| vault protein inter-alpha-trypsin subunit [Syntrophobacter fumaroxidans MPOB] gi|116700172|gb|ABK19360.1| Vault protein inter-alpha-trypsin domain protein [Syntrophobacter fumaroxidans MPOB] Length = 680 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 65/196 (33%), Gaps = 25/196 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSK 226 G+D VLD+S SM K+ + +L + + V + F+ Sbjct: 302 GVDWTFVLDISGSMTG------RKITTLIEGVSRVLGKMSANDRFRIVTFNTTAADFTGG 355 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 V P VQ +++ ++ G +T GL+ AY + + Sbjct: 356 YVPASPE--NVQTWMQRVKQIQAGGSTALFDGLDLAYRLLDGERTTG------------- 400 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 I+ +TDG + + E L K+ ++ + A + A +++ Sbjct: 401 IVLVTDGVCNVGPTRHDEFLGL---LKQHDVRLFTFVIGNSANQPLMDRLAKESGGFAMN 457 Query: 347 NSRKLHDAFLRIGKEM 362 S A I + Sbjct: 458 VSESDDIAGRLIQAKA 473 >gi|291398585|ref|XP_002715927.1| PREDICTED: chloride channel accessory 2 [Oryctolagus cuniculus] Length = 940 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 48/208 (23%), Positives = 79/208 (37%), Gaps = 31/208 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +FSSK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLRLQQAAEFFLMQI---VEIHTFV--GIASFSSKGEIR 361 Query: 231 FPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL ++ ++ L +T++ + K F EKL Y II Sbjct: 362 APLQQINSRDDRKQLVSHLPTTVSTEAEISVCSGLKKGFQVVEKL-----NGKAYGAVII 416 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFYSVQ 346 +T G + L G+ V++I + + AA L +F+ Sbjct: 417 LVTSGADEHVGSCLHSVLT-------SGSTVHSIALGSSAARTLEELSRRTGGLKFFVPD 469 Query: 347 --NSRKLHDAFLRIGK---EMVKQRILY 369 NS L DAF RI ++ +Q I Sbjct: 470 QSNSNSLIDAFSRISSGTGDIFQQSIQL 497 >gi|55377967|ref|YP_135817.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] gi|55230692|gb|AAV46111.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] Length = 394 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 60/170 (35%), Gaps = 26/170 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + + +D S SM G +++ + +LD ++ F +++ Sbjct: 39 QIALCIDASGSMA---GNDIEQARAGAEWVFGLLD---------EDDYVSIIAFDNEVTT 86 Query: 230 T-FPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P WG + ++ + G T GL A + D + Sbjct: 87 VLAPTRWGTISRETAVDAVADISAGGGTDMYSGLLEAKASLQDLPTDDNTARR------- 139 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 ++ L+DG+++ + D + E G + A G+ ++ ++ ++ Sbjct: 140 -VLLLSDGKDN--SHDPEAFGTLAREIDTEGIRIKAAGIGSDYREETIRT 186 >gi|119620823|gb|EAX00418.1| vitrin, isoform CRA_c [Homo sapiens] Length = 700 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 517 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 566 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 567 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 614 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 615 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 668 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V LH RI + + Sbjct: 669 HSFFVDEFDNLHQYVPRIIQNI 690 >gi|332796610|ref|YP_004458110.1| von Willebrand factor type A [Acidianus hospitalis W1] gi|332694345|gb|AEE93812.1| von Willebrand factor type A [Acidianus hospitalis W1] Length = 381 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 67/187 (35%), Gaps = 31/187 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G +++LD S SM K+ A +++ K IP N V + F+S Sbjct: 36 TGFHYIILLDTSGSMEGL------KIEKAKS---GAIELFKRIPQGNKVT---FIKFAST 83 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + + + +I + T L A+ Y Sbjct: 84 VEVVREFS-DPLDLTNEIQNIAANGQTSLFTALLTAFKIAVK------------YSMPAY 130 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYS 344 I+ LTDG + D ++ Y + +G + + G+ + ++ LK + + FY Sbjct: 131 ILLLTDGNPT----DVTDTRTYEKMSIPQGVQIISFGIGDDYNEELLKILSDRTGSTFYH 186 Query: 345 VQNSRKL 351 +Q + ++ Sbjct: 187 IQEASEI 193 >gi|148667418|gb|EDK99834.1| Von Willebrand factor homolog [Mus musculus] Length = 2816 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 30/205 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++++LD S S+ + DK+ ++ I ++ + ++ + S Sbjct: 1692 PLDVVLLLDGSSSLPES---SFDKMKSFAKAF------ISKANIGPHLTQVSVIQYGSIN 1742 Query: 228 VQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P W V + + L G ++ L +A + H A+ Sbjct: 1743 TIDVP--WNVVQEKAHLQSLVDLMQQEGGPSQIGDALAFAVRYVTS----QIHGARPGAS 1796 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II + ++D ++ + A+ V+ +GV + L+ A P Sbjct: 1797 KAVVIIIM------DTSLDPVDT--AADAARSNRVAVFPVGVGDRYDEAQLRILAGPGAS 1848 Query: 343 YSVQNSRKL--HDAFLRIGKEMVKQ 365 +V +++ +G + Sbjct: 1849 SNVVKLQQVEDLSTMATLGNSFFHK 1873 >gi|119913152|ref|XP_616068.3| PREDICTED: integrin, alpha 1 [Bos taurus] gi|297487671|ref|XP_002696385.1| PREDICTED: integrin, alpha 1 [Bos taurus] gi|296475808|gb|DAA17923.1| integrin, alpha 1 [Bos taurus] Length = 1195 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 39/291 (13%), Positives = 95/291 (32%), Gaps = 49/291 (16%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA------------PLLIT 156 ++ ++ ++ + + V+ + F+ C + H + Sbjct: 115 VNTSIPNVTEVKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVV 174 Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 +S+ + LD+++VLD S S + T + ++L + P Sbjct: 175 NSIAPVQECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLKRMDIGPKQTQ-- 224 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEK 272 G+V + + F L + + +++ T + G++ A + F Sbjct: 225 -VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEARG 283 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-- 330 K K ++ +TDGE + DN + + ++I + Sbjct: 284 ARRGVK------KVMVIVTDGE----SHDNHRLNKVIQDCEDESIQRFSIAILGSYNRGN 333 Query: 331 -------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + +K+ AS F++V + L +G+ + ++ Sbjct: 334 LSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFALEATVDQ 384 >gi|115689614|ref|XP_787130.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 variant [Strongylocentrotus purpuratus] gi|115969501|ref|XP_001184100.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 variant [Strongylocentrotus purpuratus] Length = 902 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 62/171 (36%), Gaps = 9/171 (5%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++DVS SM KL ++ +L+ + N + S V F + Sbjct: 347 VIFIIDVSGSMAGV------KLRQVKDALTTILNDMPETDKFNIIPFSDDVNFLDRNKML 400 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 F + V+ + + L T + + D ++ + ++ +I L Sbjct: 401 FSTSSNVRRAKRFVKSLQERDNTNLHKAIIAGVRMLRDESDQN---VRPDENVVSMLIVL 457 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +DG + ID + EA R ++ +G + FL+ A + Sbjct: 458 SDGNPNHGEIDKEIIERNVEEAIRGDFSLFNLGFGEDLDFPFLERMAYQNH 508 >gi|115511022|ref|NP_035838.3| von Willebrand factor precursor [Mus musculus] gi|37784506|gb|AAP41950.1| von Willebrand factor [Mus musculus] gi|225356464|gb|AAI48364.1| Von Willebrand factor homolog [synthetic construct] Length = 2813 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 30/205 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++++LD S S+ + DK+ ++ I ++ + ++ + S Sbjct: 1689 PLDVVLLLDGSSSLPES---SFDKMKSFAKAF------ISKANIGPHLTQVSVIQYGSIN 1739 Query: 228 VQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P W V + + L G ++ L +A + H A+ Sbjct: 1740 TIDVP--WNVVQEKAHLQSLVDLMQQEGGPSQIGDALAFAVRYVTS----QIHGARPGAS 1793 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II + ++D ++ + A+ V+ +GV + L+ A P Sbjct: 1794 KAVVIIIM------DTSLDPVDT--AADAARSNRVAVFPVGVGDRYDEAQLRILAGPGAS 1845 Query: 343 YSVQNSRKL--HDAFLRIGKEMVKQ 365 +V +++ +G + Sbjct: 1846 SNVVKLQQVEDLSTMATLGNSFFHK 1870 >gi|86129844|gb|ABC86574.1| VWF [Mus musculus] Length = 2813 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 30/205 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++++LD S S+ + DK+ ++ I ++ + ++ + S Sbjct: 1689 PLDVVLLLDGSSSLPES---SFDKMKSFAKAF------ISKANIGPHLTQVSVIQYGSIN 1739 Query: 228 VQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P W V + + L G ++ L +A + H A+ Sbjct: 1740 TIDVP--WNVVQEKAHLQSLVDLMQQEGGPSQIGDALAFAVRYVTS----QIHGARPGAS 1793 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II + ++D ++ + A+ V+ +GV + L+ A P Sbjct: 1794 KAVVIIIM------DTSLDPVDT--AADAARSNRVAVFPVGVGDRYDEAQLRILAGPGAS 1845 Query: 343 YSVQNSRKL--HDAFLRIGKEMVKQ 365 +V +++ +G + Sbjct: 1846 SNVVKLQQVEDLSTMATLGNSFFHK 1870 >gi|86129842|gb|ABC86573.1| VWF [Mus musculus] Length = 2813 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 30/205 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++++LD S S+ + DK+ ++ I ++ + ++ + S Sbjct: 1689 PLDVVLLLDGSSSLPES---SFDKMKSFAKAF------ISKANIGPHLTQVSVIQYGSIN 1739 Query: 228 VQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P W V + + L G ++ L +A + H A+ Sbjct: 1740 TIDVP--WNVVQEKAHLQSLVDLMQQEGGPSQIGDALAFAVRYVTS----QIHGARPGAS 1793 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II + ++D ++ + A+ V+ +GV + L+ A P Sbjct: 1794 KAVVIIIM------DTSLDPVDT--AADAARSNRVAVFPVGVGDRYDEAQLRILAGPGAS 1845 Query: 343 YSVQNSRKL--HDAFLRIGKEMVKQ 365 +V +++ +G + Sbjct: 1846 SNVVKLQQVEDLSTMATLGNSFFHK 1870 >gi|56404690|sp|Q8CIZ8|VWF_MOUSE RecName: Full=von Willebrand factor; Short=vWF; Contains: RecName: Full=von Willebrand antigen 2; AltName: Full=von Willebrand antigen II; Flags: Precursor gi|32454887|gb|AAN07781.2| von Willebrand factor [Mus musculus] Length = 2813 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 30/205 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++++LD S S+ + DK+ ++ I ++ + ++ + S Sbjct: 1689 PLDVVLLLDGSSSLPES---SFDKMKSFAKAF------ISKANIGPHLTQVSVIQYGSIN 1739 Query: 228 VQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P W V + + L G ++ L +A + H A+ Sbjct: 1740 TIDVP--WNVVQEKAHLQSLVDLMQQEGGPSQIGDALAFAVRYVTS----QIHGARPGAS 1793 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II + ++D ++ + A+ V+ +GV + L+ A P Sbjct: 1794 KAVVIIIM------DTSLDPVDT--AADAARSNRVAVFPVGVGDRYDEAQLRILAGPGAS 1845 Query: 343 YSVQNSRKL--HDAFLRIGKEMVKQ 365 +V +++ +G + Sbjct: 1846 SNVVKLQQVEDLSTMATLGNSFFHK 1870 >gi|324516166|gb|ADY46443.1| C-type lectin domain-containing protein 160 [Ascaris suum] Length = 397 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 76/241 (31%), Gaps = 41/241 (17%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISS----------KSDIGLDMMMVLDVSLSMNDHFG 186 T S+ T SS S + LD+++ +D SLSM Sbjct: 5 LTPLTILMGLLSTSFAANATVPSTSSSYYAERACACQPSKLYLDIVVAIDSSLSM----- 59 Query: 187 PGMDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 + VA L + + S +R LV F+ V L + + Sbjct: 60 TKEGLIQVAADLATLFLPMNVSSESAQGQFIRVALVAFADNAVIVGDL-NKYHNYASLVE 118 Query: 246 RLIF----GSTT-KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 L G T GL+ A + E H AK I+ + ++ Sbjct: 119 GLFTIDYHGGKTLNIEAGLKAASTVL----ESSRHYAKT------VILLYSSAYSAGGFA 168 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN---CASPDRFYSVQNSR---KLHDA 354 D N+ K G + I + + +K +SP+ + ++ K+ +A Sbjct: 169 DPN---AIANQIKESGTKIITIAFRQQPEGTLVKKLGHLSSPNFAFGSMDTSIIAKITNA 225 Query: 355 F 355 F Sbjct: 226 F 226 >gi|190892740|ref|YP_001979282.1| vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] gi|190698019|gb|ACE92104.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] Length = 794 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 53/300 (17%), Positives = 102/300 (34%), Gaps = 31/300 (10%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N K + + D + QD + R+ SL KD+ L+ Sbjct: 255 NAKINPVSLTVNLKAGFPLGDVNSSFHAVDIRQDSDQ-ARTMSLKGDAVPADKDFELTWK 313 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG--LDMMMVLDVSLSMNDHFGPG 188 + F + L + + +++ V+D S SM+ Sbjct: 314 AALGKTPSAGLFREVKDGKTYLLAFVTPPTAPDAAAAPTKREVVFVIDNSGSMSGQ---- 369 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVR--SGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + A +S+ + + D NV+R + + + +V P + + Sbjct: 370 --SIEQARQSLALAISRLSK-DDRFNVIRFDDTMTDYFNGLVAASP--DNREKAITYVRG 424 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 L T+ P LE DA +A G + ++FLTDG I N++ L Sbjct: 425 LSADGGTEMLPALE-------DALRNQGPVASGAL---RQVVFLTDG-----AIGNEQQL 469 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVK 364 F A R A V+ +G+ + F+ A R F ++ ++ ++ + ++ Sbjct: 470 FQEISANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQVASRMGELFAKLQN 529 >gi|146303120|ref|YP_001190436.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145701370|gb|ABP94512.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 394 Score = 52.9 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 83/248 (33%), Gaps = 63/248 (25%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-------------- 188 + + +++ + LD+ +VLDVS SM P Sbjct: 11 HLFSWNGELKFAFRATIVPERVKPVPLDLFIVLDVSGSMGIIDNPPEVDDSLIAGTAEVD 70 Query: 189 -------------MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV----QTF 231 ++L VA +IR +L+ + + R ++TFS + + Sbjct: 71 GHVVRYLKDDIGVNNRLEVALEAIRNLLE------NADTSTRVTIITFSDHVNVLCRRVT 124 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 P E + ++ T ++ A + I EH A+ ++ +T Sbjct: 125 P-----STALEHLEEIVPDGNTALYSAVKKAISLI------DEHPAR--------VLLIT 165 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSR 349 DG + + + S R + IGV E + L++ A S RFY V + Sbjct: 166 DGYPTDVEDETEYSKLEV----PRFSQFIPIGVG-EYNAKILRSLADLSNGRFYHVNDVS 220 Query: 350 KLHDAFLR 357 ++ Sbjct: 221 EISRIMEE 228 >gi|327439430|dbj|BAK15795.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 986 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 68/192 (35%), Gaps = 31/192 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M V+D S SM D +++ + I + P + F+++ Sbjct: 712 VMFVVDHSGSMK-----ARDAKNYTANKVKQTIKQIGANPSH-------VYRFNNRPNHE 759 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 A I I+ L+ + I A E + K I+ + Sbjct: 760 ---ATDKADIVSSIDSLLTYKNENRS-------TNIVKALETAIGNFTTNQYTSKAIVLV 809 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCAS--PDRFYSVQ 346 TDG ++S ++ +AK +G ++ + V ++ LK+ +S + ++ Sbjct: 810 TDGYSNSNGLE-----QVLRDAKLKGIAIHTVSVGSYTTVNEKLLKDISSETNGTYQNIT 864 Query: 347 NSRKLHDAFLRI 358 + LH + I Sbjct: 865 SIENLHGSLQAI 876 >gi|321460550|gb|EFX71591.1| hypothetical protein DAPPUDRAFT_255495 [Daphnia pulex] Length = 930 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 41/193 (21%), Positives = 74/193 (38%), Gaps = 34/193 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++V D S SMND + +L ++R + D+++ + G+V F++ Sbjct: 319 FVLVSDTSGSMND--YNRIVRLYESSRRWIKY--------DISDGSKLGMVQFANNARIL 368 Query: 231 FPLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ G + I RL T GL+ A + + Sbjct: 369 SPIVEINGDASREALIARLPVTAVGGTCIGCGLQKALDLLRPGG------------PGGV 416 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY- 343 I+ LTDGE + N ++ + GA V +I +A D+ L + ++ Sbjct: 417 ILLLTDGEETDRPFIND----VISDVIKSGARVVSIAFGRKAEDKIEDLATKTNGKSYFI 472 Query: 344 -SVQNSRKLHDAF 355 +S+ L+DAF Sbjct: 473 DDNDSSQGLNDAF 485 >gi|115665362|ref|XP_001180845.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] gi|115941435|ref|XP_001179810.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] Length = 763 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 72/217 (33%), Gaps = 45/217 (20%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 TS I + L +++VLD S SM+ M I + I+SI N+ Sbjct: 153 TSPNFIVVQPSGSLRIVLVLDTSGSMDGERFNKM---------IIGAKNFIQSIVPNNSY 203 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKE 271 V +V F+ + + I K + T G++ A Sbjct: 204 V--AIVEFNYGAIVDSNMTELTSAISRKDLASLLPTYADGATCIGCGIQTAIQVAQYNGM 261 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + Y+I L+DG+ +S + ++ + G IV++I EA Q Sbjct: 262 DSRGV---------YLILLSDGQENSGTLIADTL----DDIEDSGVIVHSIAF-YEADTQ 307 Query: 332 FLK----------NCASPDRFYSVQNSRKLHDAFLRI 358 CA +++ + AF I Sbjct: 308 LEDLAQMTGGISATCADGG------SAQCVISAFESI 338 >gi|15897953|ref|NP_342558.1| hypothetical protein SSO1089 [Sulfolobus solfataricus P2] gi|13814278|gb|AAK41348.1| Hypothetical protein SSO1089 [Sulfolobus solfataricus P2] Length = 436 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 46/213 (21%), Positives = 82/213 (38%), Gaps = 39/213 (18%) Query: 151 APLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 P + V I + + ++++D S SM KL A +S +++L + Sbjct: 98 RPTEVGFIVYIVPQQGAITSSIHYIIMIDNSPSMRGE------KLNTAVQSAQKLLYSL- 150 Query: 208 SIPDVNNVVRSGLVTFSSKIV-QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 N L+ FS+ + A G+ I + + G TT+ + +A N Sbjct: 151 -----NEGDYVTLILFSNHPEIKYQGPAKGI--ITFDVGK---GYTTRLHEAVNFALN-- 198 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +AK K II LTDG+ + D + Y + IG+ + Sbjct: 199 ---------LAKQSQVPNK-IIMLTDGKPT----DKRNVKDYEKFDIPPNTQIITIGIGS 244 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + ++ LK A S +FY +++ +L D F Sbjct: 245 DYNERILKKLADKSSGKFYHLKDISELPDVFES 277 >gi|324006620|gb|EGB75839.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] Length = 580 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 68/192 (35%), Gaps = 21/192 (10%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ +++ ++D S SM ++L + S++ ++ ++ ++ V +G Sbjct: 213 KSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDNIAIVTYAG-- 265 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+I I I+ L +T GLE AY + KG Sbjct: 266 --DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGI 317 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--S 338 + I+ TDG+ + D K + + G + +GV + + + A Sbjct: 318 NR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVG 374 Query: 339 PDRFYSVQNSRK 350 + + + Sbjct: 375 NGNYSYIDTLSE 386 >gi|45655624|ref|YP_003433.1| BatB [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602595|gb|AAS72070.1| BatB [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 347 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 56/164 (34%), Gaps = 17/164 (10%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +S G+D++ ++DVSLSM +L + ML + R G Sbjct: 82 EKKEESFKGVDILFLVDVSLSMQ-AIDSSPTRLAKFKEVLLRMLPSLSGN-------RFG 133 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ F+ P+ V + + L L A+ K + Sbjct: 134 MIVFAGSPFLYCPMTTDVSAFSDYVRGLDVDMVGDRGTDLSQAFTK------AEALLRSE 187 Query: 280 HDDYKKYIIFLTDGEN-SSPNID--NKESLFYCNEAKRRGAIVY 320 + +I +TDGE+ + P + + G IVY Sbjct: 188 KVFRNRILILVTDGEDQNDPQAISFPASFQVWAAGTESGGPIVY 231 >gi|237741293|ref|ZP_04571774.1| conserved hypothetical protein [Fusobacterium sp. 4_1_13] gi|294784500|ref|ZP_06749789.1| von Willebrand factor type A domain protein [Fusobacterium sp. 3_1_27] gi|229430825|gb|EEO41037.1| conserved hypothetical protein [Fusobacterium sp. 4_1_13] gi|294487716|gb|EFG35075.1| von Willebrand factor type A domain protein [Fusobacterium sp. 3_1_27] Length = 228 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 63/171 (36%), Gaps = 11/171 (6%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + LD S SM G +++L +R + IK +VTF Sbjct: 15 NPTARVPVCLCLDTSGSME---GKPIEELN---EGVRLFYEAIKEDETALYSAEISVVTF 68 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + IQ RL T + + + K K E+ KG D Y Sbjct: 69 GGNAECIR--DFYSLEIQPDAPRLSAYGKTPMGEAVNIGLDLL--EKRKEEYKDKGVDYY 124 Query: 284 KKYIIFLTDGENSSPNIDNKESL-FYCNEAKRRGAIVYAIGVQAEAADQFL 333 + +++ +TDG + N + ++ + ++ V+ IG+ EA L Sbjct: 125 QPWLVLMTDGGPNGDNEELSRAIRRTVDLVNQKKLTVFPIGIGEEADMNVL 175 >gi|301758046|ref|XP_002914871.1| PREDICTED: LOW QUALITY PROTEIN: vitrin-like [Ailuropoda melanoleuca] Length = 686 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 66/191 (34%), Gaps = 33/191 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV-RSGLVTFSSKIV 228 D+ V+D S S+ R++ + + I +++ R G V ++ + Sbjct: 503 DIGFVIDGSSSVG----------TGNFRTVLQFVANISKEFEISETDTRVGAVQYTYEQR 552 Query: 229 QTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F + +N + + T + + YA ++F K + + Sbjct: 553 LEFGFD-DYHTKSDILNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKR 602 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRF 342 K +I +TDG + + A +G YAIGV A D+ P D Sbjct: 603 KLMILITDGRSYD------DVRIPAMVAHHKGVTTYAIGVAWAAQDELEVIATHPASDHS 656 Query: 343 YSVQNSRKLHD 353 + V L+ Sbjct: 657 FFVDEFDNLYK 667 >gi|302560610|ref|ZP_07312952.1| von Willebrand factor type A domain-containing protein [Streptomyces griseoflavus Tu4000] gi|302478228|gb|EFL41321.1| von Willebrand factor type A domain-containing protein [Streptomyces griseoflavus Tu4000] Length = 449 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 62/204 (30%), Gaps = 35/204 (17%) Query: 120 DQHKDYNLSAVSRYEMP-----FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 +++ +P +T+ S ++++ Sbjct: 7 SNVPQFSVDVYQNAYLPEGGREVNAIVTVTATGGGTIGSAVTAPHLYSPGEGPSAAVVLM 66 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 +D S SM+ K+ A + +D + + VR ++ + + +P Sbjct: 67 VDCSGSMDY----PPTKMRNARDATAAAIDAL------RDGVRFAVIGGTHVAKEVYPGG 116 Query: 235 WG--------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 G + ++ + +L G T L A + A + H Sbjct: 117 GGLAVAGPGTREQAKQALRKLSAGGGTAIGTWLGLADRLLASADVAIRHG---------- 166 Query: 287 IIFLTDGENSSPNI-DNKESLFYC 309 I LTDG N + D K +L C Sbjct: 167 -ILLTDGRNEHESQEDLKAALEAC 189 >gi|156342098|ref|XP_001620878.1| hypothetical protein NEMVEDRAFT_v1g222619 [Nematostella vectensis] gi|156206294|gb|EDO28778.1| predicted protein [Nematostella vectensis] Length = 252 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 71/193 (36%), Gaps = 24/193 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D+M +LD S S++D + A ++I+ M + R ++T +++ Sbjct: 49 FDIMYILDSSSSVSDW------EFQRAVQAIQTM------VAKSKRDNRHAVITIATRAK 96 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + +++ G T + LE A+ +K + + + ++ Sbjct: 97 TFMNFSSRADAVRKLRGISRSGGKTNTQDALELAFQMFTTSK----YGSTPGGLAR--VL 150 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCASP-DRFYSV 345 +TDG NI+ + + K G V+ I + E D+ + + Y V Sbjct: 151 VVTDG---RSNIEKHRTERKAFKLKANGIEVFVIAIGDYLEGMDELARMANTKYAHMYRV 207 Query: 346 QNSRKLHDAFLRI 358 ++ + L I Sbjct: 208 EDVKGLARVVKLI 220 >gi|311268149|ref|XP_003131915.1| PREDICTED: integrin alpha-E-like, partial [Sus scrofa] Length = 1032 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 46/206 (22%), Positives = 75/206 (36%), Gaps = 30/206 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G ++ ++LD S S++ P A I M+ I + LV + S I Sbjct: 203 GTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNIYEKCFECSF---ALVQYGSVI 253 Query: 228 VQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L Q + ++R+ GS TK+ +++ + IF H +K Sbjct: 254 QTELDLQ-DSQDVAASLDRVQNITQVGSVTKTASAMQHVLDNIFTPS----HGSK--AKA 306 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ----FLKNCASP 339 K ++ LTDG+ D N K +G +AIGV LK AS Sbjct: 307 SKVMVVLTDGD---IFDDPLNLTTVINSPKMQGVERFAIGVGGAFNKSNTYNELKLIASD 363 Query: 340 ---DRFYSVQNSRKLHDAFLRIGKEM 362 D + V N L ++ + + Sbjct: 364 PDEDHAFKVTNYMALDGLLSKLQQRI 389 >gi|224092755|ref|XP_002190101.1| PREDICTED: inter-alpha (globulin) inhibitor H2 [Taeniopygia guttata] Length = 948 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 65/173 (37%), Gaps = 15/173 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP--DVNNVVRSGLVTFSSKIV 228 ++ V+DVS SM +G M + A ++I L D N+ VR + +V Sbjct: 313 ILFVIDVSGSM---WGLKMKQTIEAMKAILSELRAADQFSLIDFNHNVRC----WRDNLV 365 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P V+ ++ I + T L A + +AK I+ Sbjct: 366 SATPSQ--VEDAKKYIQTIHPNGGTNINEALLRATFILNEAKSLGMLDPNSVSM----IV 419 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++DG+ + + ++ + ++ +G+ + FL+ A+ +R Sbjct: 420 LVSDGDPTVGELKLTTIQKNVKQSIKDEFSLFCLGIGFDVDYDFLQRIATDNR 472 >gi|167617233|ref|ZP_02385864.1| hypothetical protein BthaB_13083 [Burkholderia thailandensis Bt4] Length = 396 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 19/120 (15%), Positives = 41/120 (34%), Gaps = 3/120 (2%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK 76 +SIL A++L V+ +GL ++ + +++L D L A + N + Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLT--GAINLSVPEA 58 Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS-IIIDDQHKDYNLSAVSRYEM 135 + + F +L+ N +++ I Y S+ + Sbjct: 59 AGITAGHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVKCMTSQTGI 118 >gi|255522879|ref|NP_001157342.1| collagen alpha-1(XIV) chain [Equus caballus] Length = 1796 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 42/220 (19%), Positives = 84/220 (38%), Gaps = 26/220 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 VK ++ D+++++D S S+ R +R L+ + + + Sbjct: 145 EEVKFFCETPAIADIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFDVGSEK 194 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GL +S + L A+ + + + + L + T A N IF+ K Sbjct: 195 TRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRNLPYKGGNTLTG---LALNYIFENSFKP 251 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E A+ K I +TDG++ I +L + G ++AIGV+ ++ Sbjct: 252 EAGARSG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADENELQ 303 Query: 334 KNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + PD Y+V +H + + + + +K Sbjct: 304 EIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDK 343 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 76/199 (38%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F ++ L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDENFNKIINFLYSTVGALNKI---------GADGTQVAIVQFTDDPR 1082 Query: 229 QTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + + + I + + G TK+ +++ + +F E K Sbjct: 1083 TEFKLDAYKNKETLLDAIKHISYKGGNTKTGKAIKHVRDTLFTE-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++A+GV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISKEMQSDGYSIFAVGVADADYSELVSIGSKPSARHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 >gi|311695165|gb|ADP98038.1| TPR repeat-containing protein [marine bacterium HP15] Length = 604 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 60/176 (34%), Gaps = 29/176 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + G +++ LD+SLSM D+L A R IR++L+ + +GL Sbjct: 86 PTPLKQPGDSLVIALDLSLSMLATDVEP-DRLTRAKRKIRDILE-------LREGSLTGL 137 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + FS PL + I+ +N L + + ++ + A + Sbjct: 138 LVFSGDAHVVTPLTDDSRTIEGMLNVLDPVIMPATGNRADLAVARAKALLEQGAPGEGR- 196 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 I+ +TD N +++L G + + V E Sbjct: 197 ----------ILLITDSLNDDYEGTIRDTLS------GTGYALNTLVVGTENGGPI 236 >gi|301768895|ref|XP_002919864.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Ailuropoda melanoleuca] Length = 1127 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 64/176 (36%), Gaps = 18/176 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ ++D S S DK S+ + + + + + ++ + FSS + Sbjct: 47 IDVVFIVDSSES---SKIILFDKQKDFVDSLSDRVFQLTPVRSLKYDIKLAALQFSSSVQ 103 Query: 229 QTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + W +Q ++++ + F G T S + A + K K Sbjct: 104 IDPSFSSWKDLQTFKQRVKSMNFIGQGTFSYYAISNATGLLKREGRKDG---------VK 154 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + +TDG + N D + +A+ G + IG+ + L+ + Sbjct: 155 VALLMTDGIDHPKNPDVQS---ISEDARTAGILFITIGLSTVVNEAKLRLISGDSS 207 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 62/164 (37%), Gaps = 26/164 (15%) Query: 165 SDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L+++ V+D S S+ ++F + + + L R G++ + Sbjct: 793 KETPLELLFVIDSSESVGLENFEIIKSLVKTLSDQVALDLAA----------ARVGIINY 842 Query: 224 SSKIVQTFPLA-WGVQH-IQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S K+ + L + + + ++R+ G T + L A N +F+A Sbjct: 843 SHKVEKVAHLTQFSTKDDFKLAVDRMQYLGEGTYTASALHEA-NHMFEAARPG------- 894 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 KK + +TDG+ D K +A ++ IGV Sbjct: 895 --VKKVALVITDGQTD--TRDEKNLTEVVKKASDINVEIFVIGV 934 >gi|320006793|gb|ADW01643.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 248 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 62/190 (32%), Gaps = 23/190 (12%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPL--LITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + + P I T L SSV+ + + +VLD S SM + Sbjct: 2 IVSNGSERAPAISLTKVMERAPDLVDLYKAAGSSVRAHGLEGVRAAVYLVLDRSGSMRPY 61 Query: 185 FGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 + G M L S+ LD +P +V FS+ + L G + Sbjct: 62 YRDGTMQHLAEQVLSLSAHLDDDGVVP---------VVFFSTDVDGCTDLTLGRH--RGL 110 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 +++L +A +++ D H + ++F TDG P Sbjct: 111 MDKLHANLGHMGRTNYHWAMDEVID------HYLASGSEAPALVVFQTDG---GPTSRLA 161 Query: 304 ESLFYCNEAK 313 + C A+ Sbjct: 162 AERYLCKAAR 171 >gi|126309708|ref|XP_001376394.1| PREDICTED: similar to B-factor, properdin [Monodelphis domestica] Length = 764 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/230 (13%), Positives = 78/230 (33%), Gaps = 34/230 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI +++ +VLD S S+ + A + + ++D + S R Sbjct: 264 KIVLDPAGSMNIYLVLDASDSIGKN------NFTGAKKCLSSLIDKVASYGVEP---RYA 314 Query: 220 LVTFSSKIVQTFPLAW----GVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAK 270 +VT++++ L+ +++++ ++ + T + L Y + Sbjct: 315 VVTYATEAKAVVKLSDKESSNADWVKQELEKIKYSDHRLKAGTNTKKALTMLYEMMIL-- 372 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRG--------AIV 319 + + + I+ +TDG + P ++ + + + R V Sbjct: 373 -QESQNDINWNKTRHVIVLMTDGNYNMGGDPVAAIEQIREFLDIGRNRKNPRENYLDVYV 431 Query: 320 YAIG--VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + IG V E + + V++ L + F + E + Sbjct: 432 FGIGPLVDQEKINALASKKDGEKHVFKVKDMEDLENVFYMMIDESKALSL 481 >gi|198421751|ref|XP_002123463.1| PREDICTED: similar to cartilage matrix protein [Ciona intestinalis] Length = 272 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 77/186 (41%), Gaps = 19/186 (10%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ ++ DI ++ N ++ G++ F S + Sbjct: 32 PLDIVFMLDGSRSVRPK------NFQTVKDYVKNFTDIFEAFGP--NDMQVGVIQFGSGV 83 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK-GHDDYK 284 + L + + E I+ + + T T A K+ +EH A+ + Sbjct: 84 REEILLNQFYVRHELMEAIDNIRYMETGTMTG---LALRKLVTETLTVEHGARVDNPIVH 140 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPD--R 341 ++ +TDG++ + + EAK RG ++AIG+ +A + L+ + P Sbjct: 141 TVVVIITDGKSQDYSRG--GVTKWTKEAKARGFEIFAIGIGRKANRKELLEMASEPKELH 198 Query: 342 FYSVQN 347 + VQN Sbjct: 199 TFRVQN 204 >gi|114586712|ref|XP_001158576.1| PREDICTED: alpha 1 type VII collagen isoform 1 [Pan troglodytes] Length = 2944 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 71/191 (37%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRSFLEGLVLPFSGAASA-QGVRFATVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A+G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|114586714|ref|XP_516439.2| PREDICTED: alpha 1 type VII collagen isoform 2 [Pan troglodytes] Length = 2912 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 71/191 (37%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRSFLEGLVLPFSGAASA-QGVRFATVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A+G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|307825380|ref|ZP_07655599.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733555|gb|EFO04413.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 600 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 52/141 (36%), Gaps = 23/141 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++ LD+S SM D +L +A I ++L K + L+ ++ Sbjct: 94 ALVIALDLSRSM-DAADIKPSRLIMARYKIADILKQRKDGQ-------TALLVYAGDAFT 145 Query: 230 TFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL + I +++ L + + + LE A + A + Sbjct: 146 VTPLTDDTETIDSQLSALNTDIMPSQGSNTASALEKAVELLKQAGLQKGR---------- 195 Query: 286 YIIFLTDGENSSPNIDNKESL 306 I+ +TDG + + ++L Sbjct: 196 -ILLVTDGVDMDKTLAAVKTL 215 >gi|281179360|dbj|BAI55690.1| conserved hypothetical protein [Escherichia coli SE15] Length = 580 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 68/192 (35%), Gaps = 21/192 (10%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ +++ ++D S SM ++L + S++ ++ ++ ++ V +G Sbjct: 213 KSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDNIAIVTYAG-- 265 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+I I I+ L +T GLE AY + KG Sbjct: 266 --DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKG------FIKGGI 317 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--S 338 + I+ TDG+ + D K + + G + +GV + + + A Sbjct: 318 NR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVG 374 Query: 339 PDRFYSVQNSRK 350 + + + Sbjct: 375 NGNYSYIDTLSE 386 >gi|261252915|ref|ZP_05945488.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] gi|260936306|gb|EEX92295.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] Length = 3332 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 40/194 (20%), Positives = 73/194 (37%), Gaps = 30/194 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHF------------GPGMDKLGVATRSIREMLDIIK 207 K + + ++ +V+D S SM D+ G M ++ + ++ +++ + Sbjct: 2724 KTNVTPGVNYNIALVVDASGSMGDYVYNTDGTVMRNPDGSAMTRMDMMQEALTNLVESLV 2783 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFP---LAWGVQHIQEKINR----LIFGSTTKSTPGLE 260 + N L+ F I TF + + E +++ L G T G E Sbjct: 2784 THDGSIN---IKLIGFDDNIDVTFEALDITNSSDVVAELLSKIENNLPVGGGTDYGVGFE 2840 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A N + + Y+ FLTDGE +S ++N + Y A V Sbjct: 2841 EANNWYASSS-------ISSNGYENMTFFLTDGEPNSGTLNNGLT-EYNELVSTHNAKVM 2892 Query: 321 AIGVQAEAADQFLK 334 A+G+ + D LK Sbjct: 2893 AVGMGNDIDDSVLK 2906 >gi|153002167|ref|YP_001367848.1| putative outer membrane adhesin-like protein [Shewanella baltica OS185] gi|151366785|gb|ABS09785.1| putative outer membrane adhesin like proteiin [Shewanella baltica OS185] Length = 1215 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 74/202 (36%), Gaps = 32/202 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +DM +V+D S SM FG +D A + + + + GLV+FS Sbjct: 309 EGDIDMQIVMDRSGSM---FGSPIDNAKQAAKILVDATAEGSTA--------MGLVSFSG 357 Query: 226 K--------IVQTFPLAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + GV Q ++ I+ + +T G + A + + + Sbjct: 358 RSSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANGSTALFDGSQLALDNLSAYQASAASG 417 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKN 335 A G + L DG++++ + A +++ G A + Sbjct: 418 APG------VVFVLADGDDNNSIKSESSVITAYQNA---NVPIFSFGYGSASPTGPLVTM 468 Query: 336 C-ASPDRFY-SVQNSRKLHDAF 355 A+ +++ S ++ DAF Sbjct: 469 ANATGGKYFSSPTTLAEIIDAF 490 >gi|307133505|dbj|BAJ19017.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 30/139 (21%), Positives = 54/139 (38%), Gaps = 20/139 (14%) Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK-EKLEHIAKGHDDYKKYIIFLTDGENSS 297 ++ E + ++ T T G+ N + E +K + + ++ ++ L+DGE++ Sbjct: 382 NVSEALGKIEPLGGTAVTSGMLIGINLMTSKNSEPEAAPSKLNTNTRRVLLILSDGEDNQ 441 Query: 298 PNIDNKESLF---YCNEAKRR------------GAIVYAIGVQAEAADQFL---KNCASP 339 P+ +L C E K + V I D L K C Sbjct: 442 PSEKTLVNLMGAGLCREIKDKMNSLQDPKYGQVEPRVAFIAFGTNLPDNQLNAWKQCV-G 500 Query: 340 DRFYSVQNSRKLHDAFLRI 358 +YSV + + L DAF +I Sbjct: 501 KHYYSVFSKQGLLDAFKQI 519 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 87/250 (34%), Gaps = 48/250 (19%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL-------- 57 ++ F N G +I+TA+L + + + ++ + KA+L D + L Sbjct: 15 VKQFSKNEHGVYAIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLLIAEDNQ 74 Query: 58 -------------YTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQD 104 + + + +E N + + + + Q + LR + D Sbjct: 75 YRKNKDHSDVKRQNVSQQEIEREGRNFSNAKVQAQWKKRNQELVQGVCKLYLR----SDD 130 Query: 105 INNIERSTSLSII-----------IDDQHKDYNLSAVSRYEMPFIFCTF--PWCANSSHA 151 + S+ ++I ++++ +V+ + F PW + Sbjct: 131 SKGQKNSSPVTIKEPFLAECLEEKTQPKNQNGTAKSVACVVQGSVQRKFWLPWGQTLVSS 190 Query: 152 PLLITSSVKISSKSDIG---------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 L V I+S +D+MMV D+S SM + D R I + Sbjct: 191 NQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSGSMKWYIDRKGDAHKPNRR-IDAL 249 Query: 203 LDIIKSIPDV 212 ++++ + ++ Sbjct: 250 VEVVGEVQNI 259 >gi|198426622|ref|XP_002122673.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 721 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 44/229 (19%), Positives = 78/229 (34%), Gaps = 33/229 (14%) Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 Y + YE+ T C L K S +D++++LD S S+ Sbjct: 476 TYECNKAESYEIFPPNHTLTTCMADLAWDLAPPCCAKKCPPS-APMDLVLILDSSSSVKR 534 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ--TFPLA---WGVQ 238 P + + RSI + N R + ++ ++ L+ Sbjct: 535 ---PNWNTMKQFVRSIITTFNF------GENEARMAVFRYNRQVDTRNQILLSDHINNKT 585 Query: 239 HIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 E ++L + T + L +A N I L + + K I+ +TDG Sbjct: 586 TFLEAYDKLPYNGFGTFTGRALRHAKNVI------LANRNGNRPNVKDVILTITDGR--- 636 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-----DQFLKNCASPDR 341 + DN ++ E + G + IG+Q DQ L +P+ Sbjct: 637 -SQDNVATIS--TELREMGVTTFVIGIQPGNGAGLDQDQLLAMGGTPEN 682 >gi|166796269|gb|AAI59125.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 973 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 74/212 (34%), Gaps = 29/212 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-----S 225 ++ V+D S SM + K+ ++ +L ++ N ++ FS Sbjct: 337 VVFVIDTSASMLGN------KMKQTKEALFTILKDLRPQDHFN------IIGFSKRVKVW 384 Query: 226 KIVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + Q ++ ++ ++ I L T G+ + + K Sbjct: 385 QQNQMVKVSPNNIRDAKKFIYSLYPTGETNINEGIHVGAQLLNNYLASNGKHEKSVS--- 441 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASP 339 +IFLTDG + I++ + L A + +++IG + L+NC Sbjct: 442 -LMIFLTDGRATIGEIESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMM 500 Query: 340 DRFYSVQNSRKLHDAFLR-IGK-EMVKQRILY 369 R +++ F IG + RI Y Sbjct: 501 RRIQENEDAASQLKGFYDEIGTPLLSDIRIDY 532 >gi|21323788|dbj|BAB98414.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 64/200 (32%), Gaps = 32/200 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS------- 224 M+VLD S SM G ++ A ++ ++++ I DV G + Sbjct: 41 MIVLDNSGSMTAQDAGGQTRIDAAKQASTQLINDISDRTDVGLTYYGGNTGETEADVEMG 100 Query: 225 --SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + IN L T L ++ + Sbjct: 101 CQDVTILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN----------- 149 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLKNCAS-- 338 I+ ++DG N + E + G ++ IG+ + A + C + Sbjct: 150 ----IVLVSDGI---ANCTPPDVCEVAQELAQSGINLVINTIGLNVDPAAREELECIAGV 202 Query: 339 -PDRFYSVQNSRKLHDAFLR 357 + +++ L DA R Sbjct: 203 GGGTYADASDAQSLTDALTR 222 >gi|297463635|ref|XP_002702824.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] Length = 2933 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 43/225 (19%), Positives = 79/225 (35%), Gaps = 29/225 (12%) Query: 135 MPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 M CA + + ++S D++ +LD S S+ Sbjct: 1 MRLRLLVAALCAGILAGAPRVWAQPRERVSCTRLYAADIVFLLDGSSSIGRS------NF 54 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF- 249 + ++ VR V +S F L A G + I L + Sbjct: 55 REVRGFLEGLVLPFSGAA-GAQGVRFAAVQYSDDPRTEFDLDALGSGGDVIRAIRELSYK 113 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T++ + + +++F L +A+ K I +TDG++ + L Sbjct: 114 GGNTRTGAAILHVADRVF-----LPQLARPGVP--KVCILITDGKSQDMVDTAAQRL--- 163 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFYSVQNSRKL 351 K +G ++A+G++ A + LK AS D F+ V + L Sbjct: 164 ---KGQGVKLFAVGIK-NADPEELKRIASQPTSDFFFFVNDFSIL 204 >gi|297488708|ref|XP_002697097.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] gi|296474920|gb|DAA17035.1| collagen, type VII, alpha 1 [Bos taurus] Length = 2932 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 43/225 (19%), Positives = 79/225 (35%), Gaps = 29/225 (12%) Query: 135 MPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 M CA + + ++S D++ +LD S S+ Sbjct: 1 MRLRLLVAALCAGILAGAPRVWAQPRERVSCTRLYAADIVFLLDGSSSIGRS------NF 54 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIF- 249 + ++ VR V +S F L A G + I L + Sbjct: 55 REVRGFLEGLVLPFSGAA-GAQGVRFAAVQYSDDPRTEFDLDALGSGGDVIRAIRELSYK 113 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G T++ + + +++F L +A+ K I +TDG++ + L Sbjct: 114 GGNTRTGAAILHVADRVF-----LPQLARPGVP--KVCILITDGKSQDMVDTAAQRL--- 163 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFYSVQNSRKL 351 K +G ++A+G++ A + LK AS D F+ V + L Sbjct: 164 ---KGQGVKLFAVGIK-NADPEELKRIASQPTSDFFFFVNDFSIL 204 >gi|290980233|ref|XP_002672837.1| vWFA domain-containing protein [Naegleria gruberi] gi|284086416|gb|EFC40093.1| vWFA domain-containing protein [Naegleria gruberi] Length = 340 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 58/163 (35%), Gaps = 21/163 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF- 223 SD +D+++V+D + SM+ ++ VA ++ ++ + +R V++ Sbjct: 23 SDKIVDLVIVMDCTGSMS-------GEINVAKNTVATIITTL--HEHFKTDLRFTAVSYR 73 Query: 224 ---SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V+ FP + + +E IN + L A + + + Sbjct: 74 DHTDDYAVKEFPFTKDINNAKEYINTMSAQGGGDYPEALASALKVVNEMPFNKKG----- 128 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 KK ++++ D N C + + + IG Sbjct: 129 ---KKIVVWVADAPPHGMNASGDSYPNGCLDEQGQKIDWVKIG 168 >gi|189347154|ref|YP_001943683.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341301|gb|ACD90704.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 6006 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 61/170 (35%), Gaps = 26/170 (15%) Query: 171 MMMVLDVSLSMN-----DHFGPGMD--KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ LDVS SM+ D G D ++ +A SI EML D V LV F Sbjct: 5453 LLITLDVSGSMSRNLNNDSHPTGNDPTRMDIAVESIAEMLSQY----DYRGDVSVKLVIF 5508 Query: 224 SSKIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 S+ W V+ + +N L+ T +E A + + + + Sbjct: 5509 STNGQSLTTAEWVTVEEAKIMLNSLVANGGTNYDGAIEAADDAFVNTNGMIANA------ 5562 Query: 283 YKKYIIFLTDGENSSPNID-------NKESLFYCNEAKRRGAIVYAIGVQ 325 F++DG S P I E + + YA+G+ Sbjct: 5563 -DNIAYFISDGLPSLPTITAGDIGIQPAEQTIWETFLENNDVTSYAVGIG 5611 >gi|163815330|ref|ZP_02206705.1| hypothetical protein COPEUT_01494 [Coprococcus eutactus ATCC 27759] gi|158449304|gb|EDP26299.1| hypothetical protein COPEUT_01494 [Coprococcus eutactus ATCC 27759] Length = 348 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 78/239 (32%), Gaps = 53/239 (22%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D+M+ +D+S S++ +L+ +K D + R G+V F Sbjct: 92 DERYCRDIMLCIDISTSVDYLNE--------------NLLEELKKTVDELHGERFGIVIF 137 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGS------------TTKSTPGLEYAYNKIFDAKE 271 ++ V PL + I+E+++ + +T + ++ Y + + Sbjct: 138 NTSPVLLSPLTDDYEFIKEQLDMIDKSLKVRNSEDDSDLYSTDLSSMYDWLYYEGYITSG 197 Query: 272 KLEHIAKGHDDY-------------------KKYIIFLTDGE-NSSPNIDNKESLFYCNE 311 L + K +IF TD + +P ++ C Sbjct: 198 TLVGNEQRGSSLIGDGLAAAACDFSDKDKTRTKIMIFSTDNDIQGNPVATLDDAASIC-- 255 Query: 312 AKRRGAIVYAIGV--QAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 K G VY +G + + +K ++ +F+ + S I K Q Sbjct: 256 -KNNGVTVYGVGTKEMTDENRESMKKAVESTGGQFFMEEESGTFDQIVTAIEKSSKNQV 313 >gi|160876887|ref|YP_001556203.1| outer membrane adhesin-like protein [Shewanella baltica OS195] gi|160862409|gb|ABX50943.1| outer membrane adhesin like proteiin [Shewanella baltica OS195] gi|315269091|gb|ADT95944.1| outer membrane adhesin like proteiin [Shewanella baltica OS678] Length = 1215 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 74/202 (36%), Gaps = 32/202 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +DM +V+D S SM FG +D A + + + + GLV+FS Sbjct: 309 EGDIDMQIVMDRSGSM---FGSPIDNAKQAAKILVDATAEGSTA--------MGLVSFSG 357 Query: 226 K--------IVQTFPLAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + GV Q ++ I+ + +T G + A + + + Sbjct: 358 RSSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANGSTALFDGSQLALDNLSAYQASAASG 417 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKN 335 A G + L DG++++ + A +++ G A + Sbjct: 418 APG------VVFVLADGDDNNSIKSESSVITAYQNA---NVPIFSFGYGSASPTGPLVTM 468 Query: 336 C-ASPDRFY-SVQNSRKLHDAF 355 A+ +++ S ++ DAF Sbjct: 469 ANATGGKYFSSPTTLAEIIDAF 490 >gi|268532310|ref|XP_002631283.1| C. briggsae CBR-CLEC-60 protein [Caenorhabditis briggsae] gi|187036877|emb|CAP23542.1| CBR-CLEC-60 protein [Caenorhabditis briggsae AF16] Length = 408 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 54/165 (32%), Gaps = 17/165 (10%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS------KIVQTFPLAWGVQHIQEKIN 245 + + S+ I + P R GLVT+++ + L I ++ Sbjct: 62 IAASIASVFSNGTRIGTDPYEPRTTRLGLVTYNAVANTVANLDTYQSLDDVYDGIFTALS 121 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 ++ + GL A + + D K K Y++ +I ++ + Sbjct: 122 QVSSSDESYIVHGLAQAEDILEDGK-----SNKNRTHYQRVVIVY---ASTYKGTGPLDP 173 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQF---LKNCASPDRFYSVQN 347 + + K G + + + L+ A+P +S + Sbjct: 174 IPVADRLKTAGVTIVTVAYDQDGDGALLADLQLIATPPYNFSNTD 218 >gi|126340361|ref|XP_001365240.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain2 [Monodelphis domestica] Length = 951 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 64/176 (36%), Gaps = 21/176 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVRSGLVTFSS 225 ++ V+DVS SM K+ +++ +LD +++ D N+ VR + Sbjct: 312 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAEDQFSVVDFNHNVR------NW 359 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + ++ I ++ T L A + +A Sbjct: 360 RDDLVLASKAQITDAKKYIEKIQPNGGTNINEALLRAIFILNEASNLGMLDPNSVS---- 415 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 II ++DG+ + + + ++ + ++++G+ + FL+ + + Sbjct: 416 LIILVSDGDPTVGELKLSQIQKNVKQSMQDNISLFSLGIGFDVDYDFLERLSQENH 471 >gi|116284252|gb|AAI24051.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 954 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 74/212 (34%), Gaps = 29/212 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-----S 225 ++ V+D S SM + K+ ++ +L ++ N ++ FS Sbjct: 318 VVFVIDTSASMLGN------KMKQTKEALFTILKDLRPQDHFN------IIGFSKRVKVW 365 Query: 226 KIVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + Q ++ ++ ++ I L T G+ + + K Sbjct: 366 QQNQMVKVSPNNIRDAKKFIYSLYPTGETNINEGIHVGAQLLNNYLASNGKHEKSVS--- 422 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASP 339 +IFLTDG + I++ + L A + +++IG + L+NC Sbjct: 423 -LMIFLTDGRATIGEIESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMM 481 Query: 340 DRFYSVQNSRKLHDAFLR-IGK-EMVKQRILY 369 R +++ F IG + RI Y Sbjct: 482 RRIQENEDAASQLKGFYDEIGTPLLSDIRIDY 513 >gi|62389907|ref|YP_225309.1| hypothetical protein cg1159 [Corynebacterium glutamicum ATCC 13032] gi|41325243|emb|CAF19723.1| putative secreted protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 64/200 (32%), Gaps = 32/200 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS------- 224 M+VLD S SM G ++ A ++ ++++ I DV G + Sbjct: 41 MIVLDNSGSMTAQDAGGQTRIDAAKQASTQLINDISDRTDVGLTYYGGNTGETEADVEMG 100 Query: 225 --SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + IN L T L ++ + Sbjct: 101 CQDVTILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN----------- 149 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLKNCAS-- 338 I+ ++DG N + E + G ++ IG+ + A + C + Sbjct: 150 ----IVLVSDGI---ANCTPPDVCEVAQELAQSGINLVINTIGLNVDPAAREELECIAGV 202 Query: 339 -PDRFYSVQNSRKLHDAFLR 357 + +++ L DA R Sbjct: 203 GGGTYADASDAQSLTDALTR 222 >gi|301604540|ref|XP_002931918.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Xenopus (Silurana) tropicalis] Length = 929 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 74/212 (34%), Gaps = 29/212 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-----S 225 ++ V+D S SM + K+ ++ +L ++ N ++ FS Sbjct: 369 VVFVIDTSASMLGN------KMKQTKEALFTILKDLRPQDHFN------IIGFSKRVKVW 416 Query: 226 KIVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + Q ++ ++ ++ I L T G+ + + K Sbjct: 417 QQNQMVKVSPNNIRDAKKFIYSLYPTGETNINEGIHVGAQLLNNYLASNGKHEKSVS--- 473 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-----LKNCASP 339 +IFLTDG + I++ + L A + +++IG + L+NC Sbjct: 474 -LMIFLTDGRATIGEIESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMM 532 Query: 340 DRFYSVQNSRKLHDAFLR-IGK-EMVKQRILY 369 R +++ F IG + RI Y Sbjct: 533 RRIQENEDAASQLKGFYDEIGTPLLSDIRIDY 564 >gi|296227292|ref|XP_002759309.1| PREDICTED: collagen alpha-1(XIV) chain [Callithrix jacchus] Length = 1796 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 81/199 (40%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ +D+F ++ L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDDNFNKIINFLYSTVGALNKI---------GTDGTQVAMVQFTDDPR 1082 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I R+ + G TK+ ++Y + +F A E K Sbjct: 1083 TEFKLNAYKTKETLLDAIKRISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++A+GV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISKEMQSDGYSIFAVGVADADYSELVSIGSKPSARHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 82/215 (38%), Gaps = 26/215 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 VK ++ D+++++D S S+ R +R L+ + + + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFDVGSEK 194 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GL +S + L A+ + + E + L + T A N IF+ K Sbjct: 195 TRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKP 251 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E ++ K I +TDG++ I +L + G ++AIGV+ ++ Sbjct: 252 EAGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQ 303 Query: 334 KNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + PD Y+V +H + + + + Sbjct: 304 EIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 >gi|269124456|ref|YP_003297826.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268309414|gb|ACY95788.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 432 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 74/230 (32%), Gaps = 40/230 (17%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + T S ++++D S SM KL A R+ R +D ++ Sbjct: 25 VTVEATGSSAPPGGGAAEAAEVIIIDTSGSMASD-----GKLAEAKRAARTAVDTLR--D 77 Query: 211 DVNNVVRSGLV----TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 V+ V +G + A + ++ I RL G T L A+ + Sbjct: 78 GVHFAVIAGFHRAEPVYPGGERLAVASASTKKEAKKAIGRLTSGGGTAIGSWLRMAHGLM 137 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES-------LFYCNEAKRRGAIV 319 ++ ++ I LTDG+N + ++ F C+ Sbjct: 138 ----------SRQGAGGVRHAILLTDGQNQHETAEELDAALRAVSGSFVCDCR------- 180 Query: 320 YAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 GV + L+ AS V + R L F + + + + + Sbjct: 181 ---GVGTDWRVAELRKIASALLGSVDIVADPRDLAADFRAMTENAMGKTV 227 >gi|217974748|ref|YP_002359499.1| outer membrane adhesin-like protein [Shewanella baltica OS223] gi|217499883|gb|ACK48076.1| outer membrane adhesin like proteiin [Shewanella baltica OS223] Length = 1215 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 74/202 (36%), Gaps = 32/202 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +DM +V+D S SM FG +D A + + + + GLV+FS Sbjct: 309 EGDIDMQIVMDRSGSM---FGSPIDNAKQAAKILVDATAEGSTA--------MGLVSFSG 357 Query: 226 K--------IVQTFPLAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + GV Q ++ I+ + +T G + A + + + Sbjct: 358 RSSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANGSTALFDGSQLALDNLSAYQASAASG 417 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKN 335 A G + L DG++++ + A +++ G A + Sbjct: 418 APG------VVFVLADGDDNNSIKSESSVITAYQNA---NVPIFSFGYGSASPTGPLVTM 468 Query: 336 C-ASPDRFY-SVQNSRKLHDAF 355 A+ +++ S ++ DAF Sbjct: 469 ANATGGKYFSSPTTLAEIIDAF 490 >gi|108758937|ref|YP_629042.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108462817|gb|ABF88002.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 860 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 53/173 (30%), Gaps = 40/173 (23%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+DVS SM + L + R + R ++ F ++ Sbjct: 284 VVFVVDVSGSMAGESLPQAQAALRLCLR-------------HLREGDRFNVIAFENRFQS 330 Query: 230 TFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P ++ + L T+ + A D Sbjct: 331 FQPEPVPFTQRTLEEADRWVAALNADGGTELLAPMRAAVQAAPDG--------------- 375 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 I+ LTDG+ + + L A+ VY+ G+ +D L++ A Sbjct: 376 -VIVLLTDGQVGNEAEILRAVLEARKTAR-----VYSFGIGTNVSDVLLRDMA 422 >gi|21359974|ref|NP_444506.2| vitrin isoform 1 [Homo sapiens] gi|62702118|gb|AAF19243.2|AC007363_1 unknown [Homo sapiens] gi|16552271|dbj|BAB71279.1| unnamed protein product [Homo sapiens] gi|119620821|gb|EAX00416.1| vitrin, isoform CRA_a [Homo sapiens] Length = 693 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 510 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 559 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 560 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 607 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 608 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 661 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V LH RI + + Sbjct: 662 HSFFVDEFDNLHQYVPRIIQNI 683 >gi|49907|emb|CAA44206.1| alpha-2 collagen type VI, subunit [Mus musculus] Length = 764 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 357 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 413 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AYN++ + + Sbjct: 414 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRRQKTRV--- 470 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 471 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 505 >gi|86141572|ref|ZP_01060118.1| hypothetical protein MED217_06122 [Leeuwenhoekiella blandensis MED217] gi|85832131|gb|EAQ50586.1| hypothetical protein MED217_06122 [Leeuwenhoekiella blandensis MED217] Length = 397 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 40/324 (12%), Positives = 89/324 (27%), Gaps = 42/324 (12%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 + ++ F +L+ T T N + F N+ D + Sbjct: 69 DLTNWSFW----QNLLNQEEADTFTT-----NWKFNTANRFSFQLTDANNLPAVDISVSV 119 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM-----PFIFCTFPWCANSSHA 151 R+ + + + +D ++ + V+ Y+ P+ + Sbjct: 120 RQGATIIWEAKTDNHGTAELWVDLKNSSQKSTDVATYKFYIGNEQLFSTIKPFSEGVNTI 179 Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 L ++ V +D+ ++D + SM D D L + ++ + Sbjct: 180 KLNTSTPVFTK------VDLAFIVDATGSMGDEMEFLKDDLKQVIQDVKNT--------N 225 Query: 212 VNNVVRSGLVTF----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 N + +G V + +V+ ++ IN + A Sbjct: 226 GNLQITTGTVFYRDVGDDYVVKKSDFTSSLESTLGFINEQKAEGGGDFPEAVHTALKTGI 285 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI-GVQA 326 + + + L D N E A ++G + I Sbjct: 286 SE------LQWSTEARSRIAFLLLDAPPHQENQIIDELHNTIKTAAKKGIKIIPIVASGI 339 Query: 327 EAADQFLKN---CASPDRFYSVQN 347 +FL A+ + + N Sbjct: 340 NKETEFLMRNFSIATNGTYVFITN 363 >gi|260426558|ref|ZP_05780537.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] gi|260421050|gb|EEX14301.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] Length = 1088 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 64/214 (29%), Gaps = 37/214 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDK--LGVATRSIREMLDIIKSIPDVNNVVR---------- 217 +MM++D S SM+ G + L +D VR Sbjct: 454 QIMMIVDRSGSMSWSSNSGQAEVCLNGLDDDNDGTVDEGDCADSRIEFVRAAGRAFVDLQ 513 Query: 218 ------SGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 GL+ F+ P+ Q ++ I+ L G T + + + Sbjct: 514 TSQGIDLGLLEFNEGNTLLRPIDTLNAGNAQDYKDAIDALSPGGDTAIGDAFDASTGEFT 573 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 E Y LTDG N++ + + G ++AI + Sbjct: 574 RVAEV-------GRVRTAY--LLTDGFNTAGG----DPVAAAERLDDIGVRIHAIPAGND 620 Query: 328 AADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIG 359 + L + AS + Y +N L F + Sbjct: 621 VDREELTDIASGTGGQVYEARNVNALTGIFAELA 654 >gi|139948509|ref|NP_001077327.1| complement factor B [Danio rerio] gi|125858059|gb|AAI29235.1| Zgc:158446 protein [Danio rerio] Length = 751 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 85/217 (39%), Gaps = 22/217 (10%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI LD+ + LDVS S+++ A I+ +++ I N Sbjct: 241 KIQVHKGGKLDIYIALDVSDSIDEE------DFERAKDVIKTLIEKISYYEVSPNYEILL 294 Query: 220 LVTFSSKIVQTFPLAWGV-QHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLE 274 T + +I+ G + + + I +L +S + AYNKI+++ Sbjct: 295 FATDTDRIISMREFKNGQGKDLLKIIQKLQDYAYDKKGQRSGTNIAQAYNKIYESMTIEL 354 Query: 275 HIAKGHDDYKKYIIFL-TDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQ--- 325 K ++I+ + TDG+ + P +D +SL N + + +Y G+ Sbjct: 355 MTNKEDFKATQHIVIMFTDGQANMGGSPKPLVDKIKSLVRQNSVEEK-LELYVFGLGNDV 413 Query: 326 -AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 AE + + A+ F+ +++ L + F + E Sbjct: 414 HAEDINDLKTDRANEKFFFKLKSLDDLKETFDNMIDE 450 >gi|146306837|ref|YP_001187302.1| hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] gi|145575038|gb|ABP84570.1| Hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] Length = 3184 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 54/149 (36%), Gaps = 19/149 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +++ +VLD+SLSM DKL +++ + ++ V L+TF+S Sbjct: 2363 VNLTLVLDISLSMAG------DKLTALKQAVISLAQ---GYAGLSAPVHVNLITFNSGAA 2413 Query: 229 QTFPLAW------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + G + +N L T L A ++ A Sbjct: 2414 EIGDFTFSSVGDAGYTALLTAVNGLTASGFTNYEQALSVAKAQVLSDI--SAPGADPAQQ 2471 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +K Y F++DGE + + + N Sbjct: 2472 HKLY--FISDGEPTVGAQGATLTTWIANN 2498 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 58/197 (29%), Gaps = 27/197 (13%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDH------FGPGMDKLGVATRSIREMLDIIKS 208 + + K ++ ++LD S SM D +G + L A + L Sbjct: 2728 LGGAADPVVKPAENYNIALILDRSGSMADDPDGSGGYGSRLALLKDAVNAFIGKLGTHTG 2787 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ + + + + L T ++ A N F Sbjct: 2788 QINIALIS-FSSSASLLLSGTLAQIQTALAAPNNVLMALTASGATNYEAAMQQA-NAWFG 2845 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN-----------EAKRRGA 317 E + + FLTDG+ ++ N DN S N A Sbjct: 2846 GVEVNGYNNLAY--------FLTDGDPTTYNGDNSNSGSTVNFNDVNRALDDATTLMARA 2897 Query: 318 IVYAIGVQAEAADQFLK 334 V+AIG+ L+ Sbjct: 2898 EVHAIGIGTGVNSNVLR 2914 >gi|259505645|ref|ZP_05748547.1| secreted Mg-chelatase subunit [Corynebacterium efficiens YS-314] gi|259166776|gb|EEW51330.1| secreted Mg-chelatase subunit [Corynebacterium efficiens YS-314] Length = 530 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 67/202 (33%), Gaps = 33/202 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR----SGLVTFSS 225 D VLDVS SM ++ + ++ EM+ S + +R ++ F+ Sbjct: 345 DTTFVLDVSGSMAG------TRMELLRSTMLEMISGEASSLTGDVSLRERENVTIIPFNF 398 Query: 226 KIVQTFPLAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + Q + + + L T L AY ++ Sbjct: 399 SPGEPITATVDEVGGPQRQELVDGVTALQAEGGTGIYDALLRAYEQVEPGASIPS----- 453 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLKNCA- 337 I+ +TDGE +S Y +++ V+ I EA ++N A Sbjct: 454 -------IVLMTDGEQTSGLSFGHFQRLYSELPTEKKRIPVFVILYG-EANITEMENLAG 505 Query: 338 -SPDRFYSVQNSRKLHDAFLRI 358 + + + N L +AF I Sbjct: 506 LTGGKTFDAMNGG-LEEAFKEI 526 >gi|307155059|ref|YP_003890443.1| Vault protein inter-alpha-trypsin domain-containing protein [Cyanothece sp. PCC 7822] gi|306985287|gb|ADN17168.1| Vault protein inter-alpha-trypsin domain protein [Cyanothece sp. PCC 7822] Length = 796 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 64/199 (32%), Gaps = 34/199 (17%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 H L + ++K + D++ ++D S S G + + R L+ + Sbjct: 279 GHFALYLIPAIKYQPSQIVPKDVVFLIDTSGS---QMGAPLAQCQELMRHFINGLNPDDT 335 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 ++ FS Q P+ + INRL G T+ G+ Sbjct: 336 FS---------IIDFSDTTQQLSPVPLANTSQNRSLALNYINRLTAGGGTELMRGIRAVL 386 Query: 264 N-KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 N I DA I+ LTDG + N + L + + G +Y+ Sbjct: 387 NFPITDAGRLRS------------IVLLTDGYIGNEN----QILAEVQQHLKPGNRLYSF 430 Query: 323 GVQAEAADQFLKNCASPDR 341 G + L A R Sbjct: 431 GAGSSVNRFLLNRIAEIGR 449 >gi|327458562|gb|EGF04912.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1] Length = 463 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 48/248 (19%), Positives = 81/248 (32%), Gaps = 36/248 (14%) Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 N KKQ D S K+++ E + G QD I + S D +K +A+ Sbjct: 106 NKKKQDWDVSELGTKSLYNMKLDLEFKTEGAYQDNRLISYNLSGK-YPDTNNKLSIDTAI 164 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM- 189 S +F + + + + V D S SM+ Sbjct: 165 SALNTKQVFSKVAKGKKGIAIAYRTD-----PIQGQMNIAVSFVFDTSGSMDWDLQGRNV 219 Query: 190 -------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK----IVQTFPLAWGVQ 238 ++ + + M+ +K I +++ LV FSS + L G Sbjct: 220 NPNSGTESRMTILRKKAEIMIKDLKGIGNIS----VNLVGFSSSGKYIQKEFSNLDNGAD 275 Query: 239 HIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 I I +L+ T GL Y + +L KY++ LTDG Sbjct: 276 TIIGTIKDPKKLVPDGVTNPGDGLRYGLISLQSQPAQL-----------KYVVLLTDGIP 324 Query: 296 SSPNIDNK 303 ++ +D Sbjct: 325 NTYIVDPS 332 >gi|168699404|ref|ZP_02731681.1| von Willebrand factor, type A [Gemmata obscuriglobus UQM 2246] Length = 367 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 60/165 (36%), Gaps = 7/165 (4%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G D ++V+D+S SM + LD++ ++ + R G+V F+++ Sbjct: 110 GRDTVIVIDLSRSMLAEDMADPGAKSRWEAARSGALDLLAAM-ERRGGHRVGVVLFAARP 168 Query: 228 VQTFPLAWGVQH---IQEKINRLIFGSTTKSTP-GLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL +H + +N + P + + A Sbjct: 169 KLVCPLTTDYKHARAVLRAVNGRFPPPECRPGPEADATSGTRFGAALVAAVAAHDPRFVG 228 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + I+ ++DG+ P ++E + N A+ V+ +GV Sbjct: 229 AQDIVLISDGD--DPEESDREWVRGANAARTANVPVHTVGVGNPG 271 >gi|194221204|ref|XP_001915782.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Equus caballus] Length = 1055 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 220 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 267 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 268 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQ-----G 322 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 323 SICIQAIMLITDG-----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 377 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 378 NKGFFTQISTLAD 390 >gi|25028093|ref|NP_738147.1| hypothetical protein CE1537 [Corynebacterium efficiens YS-314] gi|23493377|dbj|BAC18347.1| conserved hypothetical protein [Corynebacterium efficiens YS-314] Length = 531 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 67/202 (33%), Gaps = 33/202 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR----SGLVTFSS 225 D VLDVS SM ++ + ++ EM+ S + +R ++ F+ Sbjct: 346 DTTFVLDVSGSMAG------TRMELLRSTMLEMISGEASSLTGDVSLRERENVTIIPFNF 399 Query: 226 KIVQTFPLAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + Q + + + L T L AY ++ Sbjct: 400 SPGEPITATVDEVGGPQRQELVDGVTALQAEGGTGIYDALLRAYEQVEPGASIPS----- 454 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLKNCA- 337 I+ +TDGE +S Y +++ V+ I EA ++N A Sbjct: 455 -------IVLMTDGEQTSGLSFGHFQRLYSELPTEKKRIPVFVILYG-EANITEMENLAG 506 Query: 338 -SPDRFYSVQNSRKLHDAFLRI 358 + + + N L +AF I Sbjct: 507 LTGGKTFDAMNGG-LEEAFKEI 527 >gi|260813733|ref|XP_002601571.1| hypothetical protein BRAFLDRAFT_141158 [Branchiostoma floridae] gi|229286869|gb|EEN57583.1| hypothetical protein BRAFLDRAFT_141158 [Branchiostoma floridae] Length = 161 Score = 52.5 bits (124), Expect = 9e-05, Method: Composition-based stats. Identities = 37/175 (21%), Positives = 63/175 (36%), Gaps = 29/175 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ G + ++++ P R G+ +S + Sbjct: 1 DLFLLLDGSGSV------GTANFDKVKQFAADVVNSFDVSP---TATRVGVAQYSDRNSL 51 Query: 230 TFPLAW--GVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L IN + + TK+ LE+ ++ A K Sbjct: 52 VFNLGDHADKPSTVSAINGISYQRGGTKTGAALEF----------VRQNAAWRGGAVPKV 101 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +I LTDG++ +SL G VYAIGV + L+ A+ D+ Sbjct: 102 MIVLTDGKSGDAVAAPSQSLA------ADGVAVYAIGVGNFDHAE-LQQIANSDQ 149 >gi|110638307|ref|YP_678516.1| hypothetical protein CHU_1908 [Cytophaga hutchinsonii ATCC 33406] gi|110280988|gb|ABG59174.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406] Length = 319 Score = 52.5 bits (124), Expect = 9e-05, Method: Composition-based stats. Identities = 39/174 (22%), Positives = 63/174 (36%), Gaps = 20/174 (11%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDH 184 A RY + F + LL S K S+ + +++ +LDVSLSM D Sbjct: 35 KTRATLRYRIYIKFALRSIAISCLLIALLGPSFGKSKSEITIRSKNILFILDVSLSM-DA 93 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI---- 240 +L A I +++ N + GL++++S PL + Q Sbjct: 94 RDVSPSRLEKAHTIIHNIVNQ-------NPTDQYGLISYASGATIQCPLTFDTQTFLAFS 146 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 Q L + T + L A N + K + +I L+DGE Sbjct: 147 QTATTSLFDYTGTNTYDALRMANNYLTT-------YKKEGNLKPCVVIMLSDGE 193 >gi|310825891|ref|YP_003958248.1| hypothetical protein ELI_0266 [Eubacterium limosum KIST612] gi|308737625|gb|ADO35285.1| predicted protein [Eubacterium limosum KIST612] Length = 838 Score = 52.5 bits (124), Expect = 9e-05, Method: Composition-based stats. Identities = 44/263 (16%), Positives = 83/263 (31%), Gaps = 72/263 (27%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---- 225 +++VLD S SM P A +++E + + + N+ + GLVT+SS Sbjct: 106 SIVLVLDNSGSMGWGSSP--TPADYARDALKEFANEF--LKNGNSGNKLGLVTYSSGSGV 161 Query: 226 ---------KIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKI--------- 266 K VQ + + + ++ L T G++ A + + Sbjct: 162 PIYDAYDEIKYVQGYSMTENSDIFGQVVDGLQTPSGETDVQMGIKTARDILAADTSGNPQ 221 Query: 267 ----------------FDAKEKLEHIAKGHDDYKKY--IIFLTDG--------------- 293 A E K + F DG Sbjct: 222 FILVFSDGATNRSARPTSAGELGGANISPCTFGDKIYDMTFKFDGFDYGAQGSAVYNDGV 281 Query: 294 --ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--------LKNCASPDRFY 343 N + N ++ AK +G +Y++ A + +KN AS ++ Sbjct: 282 YTTNGNVNAHTVAAVSEALLAKDQGIDIYSVFYHNPALNDLEYGAGVFVMKNSASSGQYT 341 Query: 344 SVQ--NSRKLHDAFLRIGKEMVK 364 + N+ + F I K++ + Sbjct: 342 EISPGNAGAFAEIFTEIEKQIQE 364 >gi|301168170|emb|CBW27759.1| hypothetical protein BMS_2997 [Bacteriovorax marinus SJ] Length = 605 Score = 52.5 bits (124), Expect = 9e-05, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 84/218 (38%), Gaps = 29/218 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + +++ +LDVS SM +KL + SI+ +L +K V+ VV +G Sbjct: 241 TPKTAINSSKNLVFLLDVSGSM-----SSPNKLPLLKESIKLLLRNLKGDDKVSIVVYAG 295 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S +++ ++ I + +N+L G +T G+ AY E K Sbjct: 296 S---SGVVLEPTSVS-DKVKIHKALNQLQSGGSTNGGAGIVAAYKL------AEEEFIKN 345 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA- 337 + +I TDG+ + E + E ++ + +G+ +D L+ + Sbjct: 346 GVNR---VILATDGDFNVGTTSRYELVDLIQEKAKKNIYLTVLGLGMGNYSDSLLEEISN 402 Query: 338 -SPDRFYSVQNSRK--------LHDAFLRIGKEMVKQR 366 + + + + L F+ + K++ Q Sbjct: 403 KGNGNYAYIDSLSEANKILNVDLEKNFVTVAKDVKIQI 440 >gi|297667858|ref|XP_002812180.1| PREDICTED: vitrin-like isoform 1 [Pongo abelii] Length = 693 Score = 52.5 bits (124), Expect = 9e-05, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 510 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 559 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 560 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 607 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 608 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 661 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 662 HSFFVDEFDNLYQYVPRIIQNI 683 >gi|145547190|ref|XP_001459277.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124427101|emb|CAK91880.1| unnamed protein product [Paramecium tetraurelia] Length = 603 Score = 52.5 bits (124), Expect = 9e-05, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 77/217 (35%), Gaps = 29/217 (13%) Query: 147 NSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 N+ P +++ K S + +D++ V+DVS SM K+ + S+R ++ Sbjct: 96 NNKFVPAVLSLKTKKVSNNLDRPPIDLVCVVDVSGSMIG------RKINLVKDSLRYLMK 149 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLE 260 I+ R ++ F++ Q +++ I L ++T + G+ Sbjct: 150 ILGPED------RICIIVFTTVAHIVTSFIRNTQENKPLLKKAILELKGLASTNISDGMN 203 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + + + + + I L+DG++ + + +++ Sbjct: 204 KALWMLKN---------RKYKNPVSCIFLLSDGQDDYKGAEQRVFDQLQLLKIEEKFVIH 254 Query: 321 AIGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAF 355 G + + A FY + N K D F Sbjct: 255 TFGYGQDHDAYVMNQIAKYREGNFYYIDNINKASDYF 291 >gi|329928399|ref|ZP_08282269.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937835|gb|EGG34241.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 562 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 45/213 (21%), Positives = 83/213 (38%), Gaps = 37/213 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + + V DVS SM+ G +++L + + ++ L SI G Sbjct: 379 KEKKDGNKPVAAVFVADVSGSMD---GEPLNRLKESLLTGQKYLGRDNSI---------G 426 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEK------INRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 V++S+ + P+ G + ++ IN L T + G+ A + D Sbjct: 427 FVSYSTDVTINLPI--GKYDLNQQSMFVGAINSLEASGNTATFDGIVVAMKMLQDE---- 480 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + D K I L+DGE + ++D+ L VY IG A+ Q Sbjct: 481 ---MAANPDVKPLIFVLSDGETNVGHSLDDIRGLIQA-----FKIPVYTIGYNAD--IQA 530 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 L++ +S + + + D +IG+ + Q Sbjct: 531 LQSISSINE--AASINADTDDVVYKIGQLLNVQ 561 >gi|330918891|ref|XP_003298384.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] gi|311328422|gb|EFQ93524.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] Length = 1367 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 67/183 (36%), Gaps = 17/183 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + + LD S S +D L + L + ++ V GLVTFS+K + Sbjct: 939 FKVYLGLDSSASKQKSHMTRLDVLKQMFDAYINRL----LAYNFHSHV--GLVTFSTKAL 992 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + V++ + K+N L T + A ++I E+ K II Sbjct: 993 VAQKITNAVENFRHKLNNLKASGDTAIWDSIALAQDQIQQYAEQYPGS-------KLRII 1045 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQN 347 ++DGE++ ++ + R V + + + + C+ S ++ + Sbjct: 1046 CISDGEDNKSQNT---AVDLASRLIRDDITVDSFCLDDHSNKELQTLCSLSGGYSFAPKT 1102 Query: 348 SRK 350 + Sbjct: 1103 LDE 1105 >gi|327260860|ref|XP_003215251.1| PREDICTED: collagen alpha-2(VI) chain-like [Anolis carolinensis] Length = 1019 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 71/187 (37%), Gaps = 28/187 (14%) Query: 165 SDIGLDMMMVLDVSLSMND-------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + +D++ +LD S + HF + + RS + ++ R Sbjct: 825 TQRPVDVVFLLDGSERIGGLNFHKAHHFVEDVARHLTLARSNSDNMNA-----------R 873 Query: 218 SGLVTFSSKIVQ--TFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLE 274 L+ + S+ FPL + + I + + ++ S++ + YA N + + Sbjct: 874 IALLQYGSENEHVVAFPLTYNITEISDALAQIRYLDSSSNLGSAIIYAVNNLVINPRDRQ 933 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 A+ + + +F+TDG + N+D + K++ + + + ++ L Sbjct: 934 RAARRNAELS--FVFITDGITGNKNLD-----EAIDSMKKQNVMPTVVALGSDVDMDVLH 986 Query: 335 NCASPDR 341 + D+ Sbjct: 987 KISLGDQ 993 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 67/185 (36%), Gaps = 26/185 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ ++D S S+ + + S L I P R G+V +S + Sbjct: 609 GPLDIVFIIDSSESIGYNNFSLEKNFVINVVS---RLGSIAKDPKSETGARVGVVQYSHE 665 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + +L T + L++AYN + ++ + Sbjct: 666 GTFEAIQLNDKRIDSLSSFKEAVKKLEWIAGGTWTLSALQFAYNTLIKESQREKARV--- 722 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFLKNC 336 + + +TDG D +++ + R V AIG A+ D+ L++ Sbjct: 723 -----FAVVVTDG-----RHDPRDNDSHLQALCGRNVTVTAIGIGDMFNAKEEDETLRSI 772 Query: 337 ASPDR 341 A D Sbjct: 773 ACNDN 777 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 79/246 (32%), Gaps = 22/246 (8%) Query: 137 FIFCTFPWCANSSHAPL-------LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 F +++ PL +I + + K+D + + V+D S S+ P Sbjct: 2 FSEALLAVLLSATLIPLHAQNDDDVILGASSCADKTDCPVRVYFVIDTSESIALQTVPIQ 61 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVV---RSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + R + E + +++ N V + + FS ++ + +K+N Sbjct: 62 SLVDHIKRFVPEFITRLENELYQNQVSITWQFAGLHFSDVVIFYSDFTNSKEIYLDKLNN 121 Query: 247 LI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + G T + L +I Y + +TDG + + Sbjct: 122 IQYIGRGTFTDCALSNMTAQILAN---------TSPGITNYAVVITDGHVTGSPCGGMK- 171 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 A+ G ++A+ +Q L+ AS N + + E + + Sbjct: 172 -HQAERAREAGIKLFAVAPSQNIYEQGLREIASSPHELYRNNYATTKKHTIEVDTETIDR 230 Query: 366 RILYNK 371 I K Sbjct: 231 IIQVMK 236 >gi|114321541|ref|YP_743224.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] gi|114227935|gb|ABI57734.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] Length = 972 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 81/213 (38%), Gaps = 30/213 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D L++ +++D S SM+ G + A R++ ++++ ++ G+V FS+ Sbjct: 326 DDDLEISLIVDTSGSMS---GAPIINARTAGRTLVDVVEPGRTA--------MGVVRFSA 374 Query: 226 KIVQTFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 P+ +++ I+ L T GL +++ D + A Sbjct: 375 SASVVHPMIAIPDPGTAEKDQLKDAIDSLPASGLTAMFDGLILGLDELQD------YSAA 428 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA- 337 D + L+DG ++S +++ +A + A G + A L+ A Sbjct: 429 NDTDAGQVAFLLSDGGDNSSAATEPQTVQAYQDA---NVPIIAFGYGSFAPTGVLRRLAD 485 Query: 338 -SPDRFYS-VQNSRKLHDAFLRIGKEMVKQRIL 368 + F++ ++ +AFL + L Sbjct: 486 NTGGEFFASPTTLAEIQEAFLAANAAVSDAVNL 518 >gi|332706718|ref|ZP_08426779.1| hypothetical protein LYNGBM3L_23300 [Lyngbya majuscula 3L] gi|332354602|gb|EGJ34081.1| hypothetical protein LYNGBM3L_23300 [Lyngbya majuscula 3L] Length = 972 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 58/172 (33%), Gaps = 22/172 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S S G + K R L+ + ++ R S+K + Sbjct: 296 DVVFVIDTSGS---QMGDPLLKSQELMRRFINGLNPKDTFTIIDVSDR--ATQLSTKPLS 350 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + IN+L T + L ++ + + I+ Sbjct: 351 NSPQ--NCRKAINYINQLKANGGT-----------YLLKGIRHLLNLPAAPEGRLRSIVL 397 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 L+DG S+ N + L + + G +Y+ GV + L A R Sbjct: 398 LSDGYISNEN----QVLAEVQQQLKPGNRIYSFGVGSSPNRFLLNRLAEIGR 445 >gi|326911070|ref|XP_003201885.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Meleagris gallopavo] Length = 948 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 66/173 (38%), Gaps = 15/173 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP--DVNNVVRSGLVTFSSKIV 228 ++ V+DVS SM +G M + A ++I L D N+ VR + +V Sbjct: 313 ILFVIDVSGSM---WGLKMKQTIEAMKAILSELRAADQFSLIDFNHNVRC----WRDNLV 365 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P V+ ++ I + T L A I + + L + I+ Sbjct: 366 SATPAQ--VEDAKKYIQTIHPNGGTNINEALLRA-TFILNEAQNLGMLDPNSVSM---IV 419 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++DG+ + + ++ + ++ +G+ + FL+ A+ +R Sbjct: 420 LVSDGDPTVGELKLTTIQKNVKQSIKDEFSLFCLGIGFDVDYDFLQRIATDNR 472 >gi|221132796|ref|XP_002166108.1| PREDICTED: similar to fibrillar collagen [Hydra magnipapillata] Length = 2213 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 78/217 (35%), Gaps = 38/217 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++VLD D G + + + ML+ + + N ++++S Sbjct: 656 LDILLVLD------DSIKTGQENFKKSKDFSKAMLEWLSIDQNNTN---VAVISYSDIAE 706 Query: 229 QTF---------PLAWGVQHIQEKINRLIFGSTTKSTP--GLEYAYNKIFDAKEKLEHIA 277 PL + +Q KI+ + + + S L A ++F ++ Sbjct: 707 LHISFPVSGSDDPLQ-SLYDLQGKIDSIPYKGGSTSRLDRALSLASTRVFPEGKRT---- 761 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-------RGAIVYAIGVQAEAAD 330 + KK II TDG + + + + K+ + ++ V + Sbjct: 762 ---RNAKKVIILFTDGSTDVSSERLDVASWPLRKQKKRDGENEVNAIRIMSVTVSNKTNS 818 Query: 331 QFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 L N SP + ++ + + ++ +I +E K Sbjct: 819 NGLANVLSPPFIENTFTAADYDDIFNSIQQIAEESCK 855 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 13/81 (16%), Positives = 28/81 (34%), Gaps = 3/81 (3%) Query: 287 IIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCASPDRFY 343 + LTDG + + D + K G +Y++G+ + + L + + Sbjct: 89 LAVLTDGRQTRGSSAPDAVDLHVASRPLKDIGVQIYSLGIGRDYDIGELLDIASDDASVF 148 Query: 344 SVQNSRKLHDAFLRIGKEMVK 364 + +L I + K Sbjct: 149 RSSDVDELVSIVASITETTCK 169 >gi|260793650|ref|XP_002591824.1| hypothetical protein BRAFLDRAFT_125322 [Branchiostoma floridae] gi|229277035|gb|EEN47835.1| hypothetical protein BRAFLDRAFT_125322 [Branchiostoma floridae] Length = 691 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 70/209 (33%), Gaps = 33/209 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S+ GLD++ LD S S++ A + +++ + + + LVT Sbjct: 230 SRGSAGLDLVFALDKSSSIDAV------DFSRAIQFTTSIINEF-GVTNREGGTQVALVT 282 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRL-------IFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 F S+ L W + + K L G T T L+ N++ Sbjct: 283 FGSQAQ----LEWNLGQLDSKRKVLRQLRQLQPEGGGTALTAALQTVLNEVLPVARVGA- 337 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 K+ + +TDG+++ + E V+A+GV A L + Sbjct: 338 --------KRALFIITDGKSNVGASPGVFARRLREE---EAFEVFAVGVGANVDKNELNS 386 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRIGKE 361 AS + + + I ++ Sbjct: 387 VASQPFTSHVFLINDFSNFDTLVNTIAEK 415 >gi|110629868|gb|ABG80450.1| fibrillar collagen [Hydra vulgaris] Length = 1883 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 78/217 (35%), Gaps = 38/217 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++VLD D G + + + ML+ + + N ++++S Sbjct: 326 LDILLVLD------DSIKTGQENFKKSKDFSKAMLEWLSIDQNNTN---VAVISYSDIAE 376 Query: 229 QTF---------PLAWGVQHIQEKINRLIFGSTTKSTP--GLEYAYNKIFDAKEKLEHIA 277 PL + +Q KI+ + + + S L A ++F ++ Sbjct: 377 LHISFPVSGSDDPLQ-SLYDLQGKIDSIPYKGGSTSRLDRALSLASTRVFPEGKRT---- 431 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-------RGAIVYAIGVQAEAAD 330 + KK II TDG + + + + K+ + ++ V + Sbjct: 432 ---RNAKKVIILFTDGSTDVSSERLDVASWPLRKQKKRDGENEVNAIRIMSVTVSNKTNS 488 Query: 331 QFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 L N SP + ++ + + ++ +I +E K Sbjct: 489 NGLANVLSPPFIENTFTAADYDDIFNSIQQIAEESCK 525 >gi|156404157|ref|XP_001640274.1| predicted protein [Nematostella vectensis] gi|156227407|gb|EDO48211.1| predicted protein [Nematostella vectensis] Length = 476 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 61/172 (35%), Gaps = 16/172 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + S D+++V+D S SM P + D + + + V R + Sbjct: 203 VEAASPQPKDVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFLAFESGVRRVKV 262 Query: 221 VT--------FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + F S + + P+ + +++ ++ T A++ + Sbjct: 263 TSGDAKDEKCFESSLAKASPV--NIDILKKFLDGEYASGGTMYAVAFNAAFDIL------ 314 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 ++ + + + I+F+TDG + ++ N+ A + G+ Sbjct: 315 DKYYKEKNTTRRPVILFMTDGAPNDDPGTILNTVKMRNQGLSTKADILTFGL 366 >gi|54038464|gb|AAH84380.1| LOC495168 protein [Xenopus laevis] Length = 554 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 74/209 (35%), Gaps = 29/209 (13%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-- 227 ++ V+DVS SM +D + + D+N+ + G++ F+ I Sbjct: 306 ILFVIDVSGSMWGLKMKQTVDAMKS-------------ILEDLNSDDQFGIIDFNHNIRC 352 Query: 228 ---VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + + R+ T L A + +A + + Sbjct: 353 WKDELVYASSVEKGDASKYVQRIQPNGGTNINDALLRAIFILKEAS----NKGLLEQNSV 408 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 I+ ++DG+ + + + + ++++G+ + FL+ A + Sbjct: 409 SLIVLVSDGDPTVGELKLPKIQKNVRTNIQDDIALHSLGIGFDVDYDFLERLAQENHGMA 468 Query: 342 ---FYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + + +L + + ++ ++K + Sbjct: 469 QRIYGNQDTAAQLKEFYNKVSTPLLKNIV 497 >gi|332288897|ref|YP_004419749.1| hypothetical protein UMN179_00822 [Gallibacterium anatis UMN179] gi|330431793|gb|AEC16852.1| conserved hypothetical protein [Gallibacterium anatis UMN179] Length = 345 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 75/200 (37%), Gaps = 15/200 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +V+D+S SM + + +++ ++ P V ++ F+ Sbjct: 3 RLPVYLVIDISESMAGE------NIRQMQEGMSRLVNQLRRDPYALESVYISVIGFAGAA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + + RL GS T L + ++I KE + A+ D+K + Sbjct: 57 GTLAPLT---ELLNFYLPRLPIGSGTSIGTALNHVMDRI--DKEIIPSTAEQKGDWKPLV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 F++DG ++ D +++ + A + IG+ A L A + Sbjct: 112 YFMSDGSSTD---DTSKAIQRWKSLFKHRAKLINIGIGKFADLSTLNEVADLTYRLDDAD 168 Query: 348 SRKLHDAF-LRIGKEMVKQR 366 +++ A I + Q Sbjct: 169 IERVYQALCETIATSISSQS 188 >gi|198434614|ref|XP_002123557.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1105 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 71/203 (34%), Gaps = 33/203 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+M +LD S S+ + +LD PD +R G+ F+ Sbjct: 853 VDLMFLLDSSSSVGRSNW------NLLINFTVALLDKFVISPD---DMRVGVARFNRHFD 903 Query: 229 QTFPLAWG----VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + G + +++K+ R+ + T N ++ H Sbjct: 904 RDSEILIGNYSNISELRQKLRRMPYRGRGTLTG------NALWHMNNHSLHAPGNRPGVP 957 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASPDR 341 I+ +TDG + + E L N K + +Y +G+ L++ +S Sbjct: 958 DVIVVITDG------LASDEVLRAANALKEQDVKMYVVGLINRMNRMNLAQLQDISSGTE 1011 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 + + + + + R+ E+ Sbjct: 1012 YLQI-----IDNGYERLADELSD 1029 >gi|295789104|ref|NP_001171440.1| vitrin isoform 2 [Homo sapiens] gi|74739159|sp|Q6UXI7|VITRN_HUMAN RecName: Full=Vitrin; Flags: Precursor gi|37181801|gb|AAQ88704.1| VIT [Homo sapiens] gi|119620824|gb|EAX00419.1| vitrin, isoform CRA_d [Homo sapiens] Length = 678 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 495 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 544 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 545 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 592 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 593 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 646 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V LH RI + + Sbjct: 647 HSFFVDEFDNLHQYVPRIIQNI 668 >gi|17538702|ref|NP_499959.1| hypothetical protein C18H7.1 [Caenorhabditis elegans] gi|14573846|gb|AAF98615.2| Hypothetical protein C18H7.1 [Caenorhabditis elegans] Length = 425 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 59/168 (35%), Gaps = 27/168 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI-----REMLDIIKSIPDVNNVVRSG 219 + LD+++VLD S + + + D + + + VR Sbjct: 234 TGCELDLVLVLDFSTTTDPVYNSYKDLSKRLVSQLKIGPHYTQVAAVTFATVGRTRVRFN 293 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L + ++ + + I+ L G TT G+E A ++ +++ IA Sbjct: 294 LKKYQTQ-----------EEVLRGIDNLKSRGGTTAIGAGIEKALTQLDESEGARPGIAT 342 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 K ++ TDG ++ K + +A G +Y + A Sbjct: 343 ------KVMVVFTDGWSNKGPDPEKRAR----DAVSSGFEMYTVAYTA 380 >gi|261404225|ref|YP_003240466.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261280688|gb|ACX62659.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 562 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 45/213 (21%), Positives = 83/213 (38%), Gaps = 37/213 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + + V DVS SM+ G +++L + + ++ L SI G Sbjct: 379 KEKKDGNKPVAAVFVADVSGSMD---GEPLNRLKESLLTGQKYLGRDNSI---------G 426 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEK------INRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 V++S+ + P+ G + ++ IN L T + G+ A + D Sbjct: 427 FVSYSTDVTINLPI--GKYDLNQQSMFVGAINSLEASGNTATFDGIVVAMKMLQDE---- 480 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + D K I L+DGE + ++D+ L VY IG A+ Q Sbjct: 481 ---MAANPDVKPLIFVLSDGETNVGHSLDDIRGLIQA-----FKIPVYTIGYNAD--IQA 530 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 L++ +S + + + D +IG+ + Q Sbjct: 531 LQSISSINE--AASINADTDDVVYKIGQLLNVQ 561 >gi|224054053|ref|XP_002190891.1| PREDICTED: collagen, type VI, alpha 2 [Taeniopygia guttata] Length = 1016 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 70/210 (33%), Gaps = 36/210 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+M V+D S S+ ++ L I P R G+V +S + Sbjct: 606 GALDIMFVIDSSESIG---YTNFTLEKNFVINVVSRLGSIAKDPKSLTGARVGVVQYSHE 662 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + RL T + L++AYNK+ + + Sbjct: 663 GTFEAIKLDDERIDSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREK------ 716 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + + + +TDG P D+K C R +V IG+ D Sbjct: 717 --AQVFAVVITDGR-YDPRDDDKNLGALC----GRDVVVNTIGIG--------------D 755 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 F + S L + + K R+ + Sbjct: 756 MFDQPEQSETLVSIACNEPQRVQKMRLFSD 785 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 70/174 (40%), Gaps = 18/174 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRSGLVT 222 + +D++ +LD S + + A R + ++ + ++ R L+ Sbjct: 822 TQRPVDIVFLLDGSERIGEQ------NFQSAHRFVEDVAQQLMLARSSSDHMNARIALLQ 875 Query: 223 FSSKIVQT--FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + S+ Q FPL + I + + ++ S++ + +A N I + + +A+ Sbjct: 876 YGSERDQDVVFPLTHNLTEISDALAQIKYLDSSSNIGSAIIHAINNIVLSPGNGQRLARR 935 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + + +F+TDG S N++ N K++ + + + ++ L Sbjct: 936 NAELS--FVFITDGITGSKNLE-----EAINSMKKQDVMPTVVALGSDVDMDVL 982 >gi|254481786|ref|ZP_05095029.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] gi|214037915|gb|EEB78579.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] Length = 686 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 63/192 (32%), Gaps = 24/192 (12%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSI--REMLDIIKSI 209 L++ + + +++ V+D S SM L A R + + ++I+ Sbjct: 300 LMLVPPASQRAAETVPREIVFVVDTSGSMGGVSIKQAKGSLTRALRHLGPNDRFNVIEFN 359 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + + + + E + L T+ P L+ A Sbjct: 360 SSHRALFQHAVPASHHNLQLA----------SEYVRHLEASGGTEMMPALQLALKLPGAQ 409 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 E + + +IF+TDG + ++ + G+ ++ +G+ + Sbjct: 410 DELR------PEPALRQVIFITDG---AVGNESALFEHIVDSL--GGSRLFTVGIGSAPN 458 Query: 330 DQFLKNCASPDR 341 F++ A R Sbjct: 459 AWFMRKAAEYGR 470 >gi|226947182|ref|YP_002802255.1| ppkA-like protein [Azotobacter vinelandii DJ] gi|226722109|gb|ACO81280.1| ppkA-related protein [Azotobacter vinelandii DJ] Length = 656 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 44/259 (16%), Positives = 85/259 (32%), Gaps = 46/259 (17%) Query: 118 IDDQHKDYNLSAVSR-YEMPFIFC---TFPWCANSSHAPLLITSSVKISSKSDIGLD--- 170 + + DY + R Y +P + + ++ + S D Sbjct: 157 VVAREPDYAVDLAKRFYLLPVLDAREIMTEKGFRVRELEVASVTAAEPKSAEQAARDKAG 216 Query: 171 -----------MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++ V+D ++SM GP +D+ A R I + ++ K + VR G Sbjct: 217 EVNTLKGFSAAVVFVIDSTISM----GPYIDRTREAIRKIYQRIEQEKLLEQ----VRFG 268 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 LV + S I + L + + ++ + + E G Sbjct: 269 LVAYRSNIKEVPALEY---VSKLYVDPAKVQGGEDFLAKMAALKPATVSSSRFDEDAYAG 325 Query: 280 HDDY----------KKYIIFLTD-GENSSPNIDNKESLFYCN---EAKRRGAIVYAIGVQ 325 +YI+ ++D G + + L EAK RG +Y++ ++ Sbjct: 326 VMRALDGIDWTRFGARYIVLVSDAGALDGTDPLSATGLDAAQVRLEAKHRGVAIYSLHLK 385 Query: 326 AEAADQFLKNCASPDRFYS 344 + KN AS + Y Sbjct: 386 TPSGA---KNHASAEAQYR 401 >gi|308472817|ref|XP_003098635.1| hypothetical protein CRE_04180 [Caenorhabditis remanei] gi|308268235|gb|EFP12188.1| hypothetical protein CRE_04180 [Caenorhabditis remanei] Length = 779 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 67/190 (35%), Gaps = 12/190 (6%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ LD++ V+D S+ M G+ + ++ I + R GLVT++ Sbjct: 27 TNLWLDVVAVVDNSIGMT---NEGLANIAANICTVFSAGTRIGTQASEPQTTRVGLVTYN 83 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA--KGHDD 282 Q L Q + + N + ++ ST + + A+ E+ Sbjct: 84 VNAQQNADL-NKFQSLDDLYNNVFADLSSVSTSAQSFLSTGLAAAESLFEYENFGTNRSH 142 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN---CASP 339 YKK +I + +D L N K G + + LK ASP Sbjct: 143 YKKVVIVYASSYAAGGEMDP---LPVANRLKTSGVNIITVAYDQTGDGLLLKQLAEIASP 199 Query: 340 DRFYSVQNSR 349 +S ++ Sbjct: 200 RFNFSNTDNE 209 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 64/189 (33%), Gaps = 20/189 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S++ LD+++V+D S MN + + S+ I + R GLVT++ Sbjct: 418 SNLWLDVVLVVDNSQGMNTD---RLHTVTANILSVFGSGTRIGIDETYHRTTRLGLVTYN 474 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-- 282 S Q L ++ + + G + ++ + + + E E + Sbjct: 475 SVATQNADL-----YLYQSFDEASDGIIDATRTAVDTSESYLATGLEMAERMFNEQSVNN 529 Query: 283 ----YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKN 335 YK+ +I +D L N K G + + D L Sbjct: 530 VRSHYKRAVIVYASEYKGDGELDP---LPVANRLKLSGVNIITAAYEQSGDDGLFESLSQ 586 Query: 336 CASPDRFYS 344 ASP +S Sbjct: 587 IASPGFSFS 595 >gi|219848104|ref|YP_002462537.1| magnesium chelatase [Chloroflexus aggregans DSM 9485] gi|219542363|gb|ACL24101.1| Magnesium chelatase [Chloroflexus aggregans DSM 9485] Length = 696 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 49/133 (36%), Gaps = 18/133 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIV 228 + V+D S SM +++ ++ +L + GLV+F Sbjct: 513 AVCFVVDASWSMAAE-----ERMQATKAAVLSLLR-----DAYQRRDQVGLVSFQRDYAR 562 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL V+ Q ++ + G T + GL A+ + A+ + + ++ Sbjct: 563 VLLPLTNSVELAQRRLQSMPTGGKTPLSRGLLTAFELLERARRRDA-------EVVPLMV 615 Query: 289 FLTDGENSSPNID 301 LTDG+ + D Sbjct: 616 LLTDGQANVSISD 628 >gi|148657117|ref|YP_001277322.1| peptidase M23B [Roseiflexus sp. RS-1] gi|148569227|gb|ABQ91372.1| peptidase M23B [Roseiflexus sp. RS-1] Length = 982 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 45/127 (35%), Gaps = 18/127 (14%) Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKES 305 L +T GL+ + + + I+ L+DG EN++P + + Sbjct: 570 LTPRGSTSIGGGLQRSQQLLSASAPGRTRA----------IVLLSDGQENTAPYVSDVLP 619 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 + + V+ IG+ +A Q + + A + + +L + I + Sbjct: 620 -----QIRASQITVHTIGLGTDADQQLMLSIAAQTGGTYNYAPRPDQLAGIYNTISGAVS 674 Query: 364 KQRILYN 370 ++ L Sbjct: 675 NRQTLIT 681 >gi|113460775|ref|YP_718842.1| hypothetical protein HS_0630 [Haemophilus somnus 129PT] gi|112822818|gb|ABI24907.1| conserved hypothetical protein, with von Willebrand factor (vWF) domain [Haemophilus somnus 129PT] Length = 343 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 73/200 (36%), Gaps = 17/200 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +V+DVS SM D +I ++ ++ P V ++ F+ Sbjct: 3 RLPIFLVVDVSESMAG------DSHRQMQEAINRLVQRLRCDPYALESVYISVIAFAGAA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + RL GS T L ++I + KG YI Sbjct: 57 GVIAPLT---ELMSFYAPRLPMGSGTSLGAALNLTMDEIQRNVVRSSGDQKGDFKPLVYI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 L+DG + D ++ + + + A+G+ A L A + + + + Sbjct: 114 --LSDGVATD---DPTSAIQRWQQEFKSRTKLIAVGLGNFADLSALNQIA--ELTFRIDD 166 Query: 348 SRKLHDAFLRIGKEMVKQRI 367 + L +A+L + + + + Sbjct: 167 -QDLEEAYLTLTRSIEDSIL 185 >gi|297667860|ref|XP_002812181.1| PREDICTED: vitrin-like isoform 2 [Pongo abelii] Length = 678 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 495 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 544 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 545 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 592 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 593 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 646 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 647 HSFFVDEFDNLYQYVPRIIQNI 668 >gi|145493674|ref|XP_001432832.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124399947|emb|CAK65435.1| unnamed protein product [Paramecium tetraurelia] Length = 618 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 78/218 (35%), Gaps = 34/218 (15%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + + V+I + +++LD S SM+ D+ A + L K N Sbjct: 423 VENQVEIQAAQQPSFHYIILLDDSGSMSG------DRFNQAQNGLISSLSSAKD----NQ 472 Query: 215 VVRSGLVTFSSKIV-QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +R ++ F+ +Q I+ + + G T + AY KI Sbjct: 473 NIRVTIIIFNDNARCVVDSQTINMQTIKNAV--VCNGGGTSFQSAFQLAYQKIA------ 524 Query: 274 EHIAKGHDDYKKYIIFL-TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K + + K++IF TDG +S P + + N + + + I + + Sbjct: 525 --AVKNFEQFNKHVIFFYTDGGDSYPTQALNQ---FANLPQAQRMKIDLIACCLDKQQKT 579 Query: 333 L--------KNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + KNC+ + ++ +A+ +++ Sbjct: 580 MINITDFFNKNCSF-GKLQDQMEPSQIGEAWRNQTRQI 616 >gi|60551291|gb|AAH91051.1| Clca1 protein [Xenopus (Silurana) tropicalis] Length = 937 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 62/195 (31%), Gaps = 37/195 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM G + +L A + + S G+V F S Sbjct: 308 VTLVLDVSGSMGG--GNRIGRLYQAAEVFVMQIVEMGSY--------VGIVQFESTASVR 357 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L ++ + + T G+ + Y Sbjct: 358 SSLLQIVDDTQRNRLKSLLPK-TATGGTNICAGIREGIKV---------NKKYDGSSYST 407 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFY 343 ++ LTDGE++ C + G+I++ I + AA + + + Sbjct: 408 ELVLLTDGEDNYAT-------SLCFPDVTNSGSIIHVIALGPNAAKELETIVDMTGGLRF 460 Query: 344 SVQ---NSRKLHDAF 355 +++ L DAF Sbjct: 461 LATDKVDAQGLIDAF 475 >gi|113971723|ref|YP_735516.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] gi|113886407|gb|ABI40459.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] Length = 1215 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 74/202 (36%), Gaps = 32/202 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +DM +V+D S SM P + + A + + + GLV+FSS Sbjct: 309 EGDIDMQIVMDRSGSMYGS--PINNAIQAAKTLVDATAEGSTA---------MGLVSFSS 357 Query: 226 K---------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + P Q ++ I+ + +T G A + + + Sbjct: 358 RSSVKQDFAVQQIPKPDTGIKQALKAAIDNIYASGSTALFDGSSLALDNLITYQTAAASG 417 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKN 335 A G + L DG+++S +I N+ ++ + +++ G A + Sbjct: 418 APG------VVFVLADGDDNS-SIKNESTVITAYQ--NANVPIFSFGYGSASPTGPLVTM 468 Query: 336 C-ASPDRFY-SVQNSRKLHDAF 355 A+ +++ S ++ DAF Sbjct: 469 ANATGGKYFSSPTTLSEIIDAF 490 >gi|191637338|ref|YP_001986504.1| hypothetical protein LCABL_05200 [Lactobacillus casei BL23] gi|190711640|emb|CAQ65646.1| Putative uncharacterized protein yvcC [Lactobacillus casei BL23] Length = 909 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 86/285 (30%), Gaps = 33/285 (11%) Query: 21 TAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFS 80 T L V+ I+M ++ + + NG Sbjct: 11 TGHLFAVLLILMSMLTGLVTSG-------SSVVTAAANIRPTYQTDANGTYPTNSWQVTG 63 Query: 81 YRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC 140 + + N D + N + + + + S D + DY + + + Sbjct: 64 QQNVINQRGGDQVSGWDNN-TIWNGDATDTTNSYLKFGDPNNPDYQIRKYA--KETNTPG 120 Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 + N V D+++V+D+S SM G D+ G ++ Sbjct: 121 LYDVYLNVKGNTQQNVKPV----------DIVLVVDMSGSMESKNNGGTDRAGAVRTGVK 170 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGSTTKSTPGL 259 L I++ + + V L+ FSS G +I+ + + T + Sbjct: 171 NFLTSIQNA-GLGDYVNVSLIGFSSPGYIGGGNKTTGPGYIRVGLGK---AGNTSQQQAI 226 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDY--------KKYIIFLTDGENS 296 A + F+ + + KK +I LTDG + Sbjct: 227 NSALSPTFNGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDGVPT 271 >gi|189347157|ref|YP_001943686.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] gi|189341304|gb|ACD90707.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] Length = 2825 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 42/189 (22%), Positives = 72/189 (38%), Gaps = 15/189 (7%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP--GMDKLGVATRSIREMLDIIKSIPDVN 213 T + + I ++ ++LDVS SMND G +L A +I +++ ++ DV Sbjct: 2143 TVVSESFTAKPIDTNLTIILDVSGSMNDVIPNSGGKTRLQFAKEAIASLINQYDALGDVK 2202 Query: 214 NVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 V+ + S++ + W +IN L +T L +A + Sbjct: 2203 --VQIITFSASAQPIIVSGQVWLDPGTAITQINGLSASGSTNYDDALVDVMTTYNNAGKL 2260 Query: 273 LEHIAKGHDDYKKYIIFLTDGENS----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 ++ FL+DG + SP I++ E + + YAIGV A A Sbjct: 2261 STPQSQN------VGYFLSDGVPNTPATSPGINSPEEQQWESFLTTNNINNYAIGVGAGA 2314 Query: 329 ADQFLKNCA 337 L A Sbjct: 2315 TQATLDPVA 2323 >gi|295789109|ref|NP_001171442.1| vitrin isoform 4 [Homo sapiens] gi|119620822|gb|EAX00417.1| vitrin, isoform CRA_b [Homo sapiens] Length = 656 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 473 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 522 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 523 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 570 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 571 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 624 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V LH RI + + Sbjct: 625 HSFFVDEFDNLHQYVPRIIQNI 646 >gi|71028596|ref|XP_763941.1| thrombospondin-related protein [Theileria parva strain Muguga] gi|68350895|gb|EAN31658.1| thrombospondin-related protein [Theileria parva] Length = 552 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 51/314 (16%), Positives = 103/314 (32%), Gaps = 42/314 (13%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 ++ M + + S+ F +++ +L L ++ + + + + + D ++ N Sbjct: 76 LLLTGMTRLKDGSNSFNKNSRISSVLPMETLDRLSEAITRSSEHPVTFEALDGGSVVVTN 135 Query: 87 IWQTD-FRNELRENGFAQDINNIERSTSLS-IIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 T + G + + S I H ++ L Sbjct: 136 NSDTFSIKLYPSLPGLNLTPGMLPTNKPNSHINFTGNHNEHAL---------LKHALHDL 186 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS-MNDHFGPGMDKLGVATRSIREML 203 ++ L K SS LD+ +++D S S M + + + L +S Sbjct: 187 STSNYDRGLYPDGIKKPSSYCHRELDLTILVDESSSIMKEEWEKLIPFLKSLVKS----- 241 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--------GVQHIQEKINRLIFGSTTKS 255 I P N V +VTFS+ I ++ I E N T + Sbjct: 242 --ISISP---NYVHLSVVTFSTSIRWLISFLNPSGKDENLALRVIDELKNSKPVFGFTFT 296 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 L + ++ + + K II +TDG ++ PN+ ++ S + Sbjct: 297 GQALNFITEAVYQFGAR--------QNAPKAIILITDGSSTQPNVTSQASAML----REA 344 Query: 316 GAIVYAIGVQAEAA 329 G + +GV Sbjct: 345 GVTILVVGVGMARD 358 >gi|297667864|ref|XP_002812183.1| PREDICTED: vitrin-like isoform 4 [Pongo abelii] Length = 656 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 473 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 522 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 523 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 570 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 571 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 624 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 625 HSFFVDEFDNLYQYVPRIIQNI 646 >gi|194221347|ref|XP_001494879.2| PREDICTED: similar to alpha 1 type VII collagen [Equus caballus] Length = 3065 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRGFLEGLVLPFSGAASA-QGVRFAAVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDALGSGGDVIHAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADHEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 74/234 (31%), Gaps = 31/234 (13%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P I F S + +D++ +L + L Sbjct: 1020 PGISYIFSLTPVREGIQGPEASITQSPVCPHGLMDVVFLLHATRDNAHRAEAVKRALERL 1079 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIF--GS 251 ++ P V+ GL+++S + FPL + I +KI + + S Sbjct: 1080 VSALG---------PLGPQAVQIGLLSYSHRPSPLFPLNSSHDPGVILQKIRSIPYMDPS 1130 Query: 252 TTKSTPGLEYAYNKIF--DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 + A+ + DA + +H+ ++ L D E +I Sbjct: 1131 GNNLGIAVVTAHRHLMAPDAPGRRQHVPG-------VMVLLVD-EPLRGDI-----FNPV 1177 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASPD---RFYSVQNSRKLHDAFLRIGK 360 EA+ G V +G+ +Q + S D F++V + L A + Sbjct: 1178 REAQAAGLKVMMLGLVGADPEQLRRLVPSTDPVQNFFAVNDGSSLDQAVSSLAT 1231 >gi|190149857|ref|YP_001968382.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|307263180|ref|ZP_07544801.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|189914988|gb|ACE61240.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|306871542|gb|EFN03265.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 530 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 59/366 (16%), Positives = 118/366 (32%), Gaps = 36/366 (9%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 +S R F + G +++ +L I ++ + +E++ +A+L L+ ++L Sbjct: 4 ISLSQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLT 63 Query: 61 TKILNQENGNNGKKQKNDFSYRIIK----------NIWQTDFRNELRENGFAQ----DIN 106 + + N+ K ++ + I T + L + + + Sbjct: 64 AENNSGRKANDYKLGGSNPNDDSFNISSEVGKRDHAIVTTFVKTFLPQTNDDKMNLIPVC 123 Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 +TS +S ++ F + S + + Sbjct: 124 KTVNNTSGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVDVASKSKAFKKNTFN 183 Query: 167 IGLDMMMVLDVSLSMNDHFGP-------GMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 I +D+M+V D+S SMN K+ + + E+ D D N R Sbjct: 184 IPIDLMVVADLSGSMNFDLDNNETKKTGKPSKISILKEVLVELADKTLLSEDANQHNRIY 243 Query: 220 LVTFSSKIVQTF-----PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI---FDAKE 271 + F+ P +W ++ + N + S N + KE Sbjct: 244 VTPFALGAEINKNSCALPYSWDIESSSKIENIKKILNKENSQYNRADLINNLVYRISTKE 303 Query: 272 KLEHIA---KGHDDYKKYIIFLTDGENSSPNI----DNKESLFYCNEAKRRGAIVYAIGV 324 L +I K + + K L D + S+ D + Y K GA + + GV Sbjct: 304 TLNNINGKQKYNVTFPKNTFCLKDMKTSNQGWYTRSDKSKFTSYVQSIKASGATLASSGV 363 Query: 325 QAEAAD 330 A + Sbjct: 364 LVAANN 369 Score = 36.3 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 45/113 (39%), Gaps = 9/113 (7%) Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD----GENSSPNIDNKESLFY 308 T+ T L Y ++F ++++ ++ H + Y LTD E ++ L Sbjct: 413 TRITENLIYGKQEVFLSQKQKISLSLSHSTIETY---LTDTQPRNETDGMCKVIRDRLDT 469 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL-RIGK 360 N K + G A A Q ++C +YS + L ++F IG+ Sbjct: 470 LNNDKNTKIVFVEFGY-ASKAKQAWQHCVGNGNYYSANDKESLLNSFKQAIGE 521 >gi|169829413|ref|YP_001699571.1| BatA [Lysinibacillus sphaericus C3-41] gi|168993901|gb|ACA41441.1| BatA [Lysinibacillus sphaericus C3-41] Length = 973 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 76/209 (36%), Gaps = 32/209 (15%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + T+ + + +M V+D S SM + ++M++ I + N Sbjct: 683 IFTNPNFSKNSCSLATEMAYVVDYSSSMK--------AVDPTNYRGKKMIEFINQLKAKN 734 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 N+V + ++K ++ T G++ A K + + Sbjct: 735 NIV----IETNTKATILGEGTTDAVLKKDLYKASKDKGATDIFAGIDIALTKFSNDTKTA 790 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA---AD 330 K I+ ++DG+ S + + N+AK++G +Y + + ++ Sbjct: 791 -----------KAIVVVSDGKTSKSKM-----IKAINDAKKQGVKIYTVSMGKKSQINDA 834 Query: 331 QFLKNCA-SPDRFYSVQNSRKLHDAFLRI 358 ++ + +Y ++ +LH F ++ Sbjct: 835 TLMQLSTETGGAYYHALDNLQLHQVFQKL 863 >gi|308238185|ref|NP_001184129.1| chloride channel accessory 1 [Xenopus (Silurana) tropicalis] Length = 933 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 62/195 (31%), Gaps = 37/195 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM G + +L A + + S G+V F S Sbjct: 304 VTLVLDVSGSMGG--GNRIGRLYQAAEVFVMQIVEMGSY--------VGIVQFESTASVR 353 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L ++ + + T G+ + Y Sbjct: 354 SSLLQIVDDTQRNRLKSLLPK-TATGGTNICAGIREGIKV---------NKKYDGSSYST 403 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFY 343 ++ LTDGE++ C + G+I++ I + AA + + + Sbjct: 404 ELVLLTDGEDNYAT-------SLCFPDVTNSGSIIHVIALGPNAAKELETIVDMTGGLRF 456 Query: 344 SVQ---NSRKLHDAF 355 +++ L DAF Sbjct: 457 LATDKVDAQGLIDAF 471 >gi|196232430|ref|ZP_03131283.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196223502|gb|EDY18019.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 879 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 75/210 (35%), Gaps = 22/210 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + M++VLD S SM G K+ +A + ++ ++ G+V +K Sbjct: 421 VAMLVVLDRSGSMT-AAVAGQTKISLADQGAVFAMNALQPKD------YFGVVAVDTKPH 473 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTK-STPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PLA ++KI + G + A+ ++ D +++H+ D Sbjct: 474 TVVPLAPISAKGAAEQKILSITAGGGGIYIYTSMVEAFQQLRDIPARVKHLLLFSDAADA 533 Query: 286 YIIF---LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCA--S 338 ++DG + N SL + +G+ E FL+ A Sbjct: 534 EEKAAGEMSDGIRTGGN-----SLDLASAMLAAKITTSVVGLGTEQDKDTPFLRQLAERG 588 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 RFY ++ L F ++ + ++ Sbjct: 589 SGRFYLTDDATTLPQIFSTETMKVAQSSLI 618 >gi|62087470|dbj|BAD92182.1| PREDICTED: integrin, alpha D variant [Homo sapiens] Length = 1177 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 81/224 (36%), Gaps = 30/224 (13%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ T +D++ ++D S S++ + +++ +++ + ++ Sbjct: 147 WEIIQTVPDATPECPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLF 203 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDA 269 + + F+ +T P Q ++ + T + G+ ++F Sbjct: 204 ALMQYSNLLKIHFTFTQFRTSP------SQQSLVDPIVQLKGLTFTATGILTVVTQLFH- 256 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQ 325 H KK +I +TDG+ D E +A++ G I YAIG Q Sbjct: 257 -----HKNGARKSAKKILIVITDGQ---KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQ 308 Query: 326 AEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A Q L +S D + V N A I K++ ++ Sbjct: 309 GPTARQELNTISSAPPQDHVFKVDN----FAALGSIQKQLQEKI 348 >gi|1167550|gb|AAB38547.1| leukointegrin alpha d chain [Homo sapiens] Length = 1162 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 81/224 (36%), Gaps = 30/224 (13%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ T +D++ ++D S S++ + +++ +++ + ++ Sbjct: 131 WEIIQTVPDATPECPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLF 187 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDA 269 + + F+ +T P Q ++ + T + G+ ++F Sbjct: 188 ALMQYSNLLKIHFTFTQFRTSP------SQQSLVDPIVQLKGLTFTATGILTVVTQLFH- 240 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQ 325 H KK +I +TDG+ D E +A++ G I YAIG Q Sbjct: 241 -----HKNGARKSAKKILIVITDGQ---KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQ 292 Query: 326 AEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A Q L +S D + V N A I K++ ++ Sbjct: 293 GPTARQELNTISSAPPQDHVFKVDN----FAALGSIQKQLQEKI 332 >gi|62548866|ref|NP_005344.2| integrin alpha-D precursor [Homo sapiens] gi|296434544|sp|Q13349|ITAD_HUMAN RecName: Full=Integrin alpha-D; AltName: Full=ADB2; AltName: Full=CD11 antigen-like family member D; AltName: Full=Leukointegrin alpha D; AltName: CD_antigen=CD11d; Flags: Precursor gi|162317970|gb|AAI56096.1| Integrin, alpha D [synthetic construct] gi|168275856|dbj|BAG10648.1| integrin alpha-D precursor [synthetic construct] Length = 1161 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 81/224 (36%), Gaps = 30/224 (13%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ T +D++ ++D S S++ + +++ +++ + ++ Sbjct: 131 WEIIQTVPDATPECPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLF 187 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDA 269 + + F+ +T P Q ++ + T + G+ ++F Sbjct: 188 ALMQYSNLLKIHFTFTQFRTSP------SQQSLVDPIVQLKGLTFTATGILTVVTQLFH- 240 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQ 325 H KK +I +TDG+ D E +A++ G I YAIG Q Sbjct: 241 -----HKNGARKSAKKILIVITDGQ---KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQ 292 Query: 326 AEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A Q L +S D + V N A I K++ ++ Sbjct: 293 GPTARQELNTISSAPPQDHVFKVDN----FAALGSIQKQLQEKI 332 >gi|325108274|ref|YP_004269342.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324968542|gb|ADY59320.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 1584 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 45/301 (14%), Positives = 93/301 (30%), Gaps = 60/301 (19%) Query: 96 LRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLI 155 L + N+ + + Q + + ++ +P +++ A Sbjct: 988 LPHRFLVKPANDSKLLSEDDAHAQFQFRGHEVATSLSLSVPLGQRL-----SAAIADETS 1042 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIP 210 I S++ + +M+VLD S SMN + + K A + ++L+ + +P Sbjct: 1043 AQVTVIDSQAK-PVSVMVVLDCSQSMNQTIPWEASQQDLTKFDAAITATNQILNALSRLP 1101 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----------NRLIFGSTTKSTPGLE 260 + R GLV + ++ W + + + L+ T+ Sbjct: 1102 NA----RVGLVLYGHRV------GWNTASPSQILRQTGYANPIPDSLMPYEDTEIVLPFG 1151 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKY--------------------IIFLTDGENSSPNI 300 DA EKL K + Y I+ +TDG N N Sbjct: 1152 RFARAERDAVEKLLKTVKPWGETPLYLSIRNALNEMNSKSLGGKQQIVVITDGVNKQLNP 1211 Query: 301 DNKESLFYCNEAKRR-GAIVYAI-GVQAEAA------DQFLKNCA-SPDRFYSVQNSRKL 351 + + G I G + QF + + + + + ++ +L Sbjct: 1212 SASAYVSLSSLLNENFGQTSVNILGFGIDPNESQTAARQFEQLASRTGGEYVEINDAGRL 1271 Query: 352 H 352 Sbjct: 1272 L 1272 >gi|220897447|emb|CAX15331.1| complement component 2 (within H-2S) [Mus musculus] Length = 1276 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A R + +++ + S R GL+T+++ Sbjct: 773 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRP---RYGLLTYATV 829 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + A + Sbjct: 830 PKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAP---P 886 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 +G + + II +TDG N N D + L + K VY GV Sbjct: 887 EGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLV 946 Query: 329 ADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L + F ++ E + Sbjct: 947 DSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 989 >gi|220897445|emb|CAX15329.1| complement component 2 (within H-2S) [Mus musculus] Length = 970 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A R + +++ + S R GL+T+++ Sbjct: 467 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRP---RYGLLTYATV 523 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + A + Sbjct: 524 PKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAP---P 580 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 +G + + II +TDG N N D + L + K VY GV Sbjct: 581 EGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLV 640 Query: 329 ADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L + F ++ E + Sbjct: 641 DSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 683 >gi|218156291|ref|NP_001136178.1| complement factor B isoform 2 [Mus musculus] gi|220897443|emb|CAX15327.1| complement factor B [Mus musculus] Length = 713 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A R + +++ + S R GL+T+++ Sbjct: 260 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRP---RYGLLTYATV 316 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + A + Sbjct: 317 PKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAP---P 373 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 +G + + II +TDG N N D + L + K VY GV Sbjct: 374 EGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLV 433 Query: 329 ADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L + F ++ E + Sbjct: 434 DSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 476 >gi|218156289|ref|NP_032224.2| complement factor B isoform 1 [Mus musculus] gi|220897442|emb|CAX15326.1| complement factor B [Mus musculus] Length = 763 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A R + +++ + S R GL+T+++ Sbjct: 260 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRP---RYGLLTYATV 316 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + A + Sbjct: 317 PKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAP---P 373 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 +G + + II +TDG N N D + L + K VY GV Sbjct: 374 EGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLV 433 Query: 329 ADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L + F ++ E + Sbjct: 434 DSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 476 >gi|148694792|gb|EDL26739.1| complement factor B, isoform CRA_b [Mus musculus] Length = 760 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A R + +++ + S R GL+T+++ Sbjct: 257 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRP---RYGLLTYATV 313 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + A + Sbjct: 314 PKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAP---P 370 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 +G + + II +TDG N N D + L + K VY GV Sbjct: 371 EGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLV 430 Query: 329 ADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L + F ++ E + Sbjct: 431 DSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 473 >gi|148694794|gb|EDL26741.1| complement factor B, isoform CRA_d [Mus musculus] Length = 731 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A R + +++ + S R GL+T+++ Sbjct: 228 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRP---RYGLLTYATV 284 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + A + Sbjct: 285 PKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAP---P 341 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 +G + + II +TDG N N D + L + K VY GV Sbjct: 342 EGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLV 401 Query: 329 ADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L + F ++ E + Sbjct: 402 DSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 444 >gi|116126|sp|P04186|CFAB_MOUSE RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|192412|gb|AAA37379.1| factor B [Mus musculus] gi|192414|gb|AAA63293.1| factor B [Mus musculus] gi|2944424|gb|AAC05283.1| complement factor B [Mus musculus] gi|3986764|gb|AAC84160.1| Bf [Mus musculus] gi|13529437|gb|AAH05451.1| Complement factor B [Mus musculus] gi|74146351|dbj|BAE28943.1| unnamed protein product [Mus musculus] gi|148694791|gb|EDL26738.1| complement factor B, isoform CRA_a [Mus musculus] Length = 761 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A R + +++ + S R GL+T+++ Sbjct: 258 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRP---RYGLLTYATV 314 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + A + Sbjct: 315 PKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAP---P 371 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 +G + + II +TDG N N D + L + K VY GV Sbjct: 372 EGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLV 431 Query: 329 ADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L + F ++ E + Sbjct: 432 DSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 474 >gi|260814488|ref|XP_002601947.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] gi|229287250|gb|EEN57959.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] Length = 1386 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 74/212 (34%), Gaps = 36/212 (16%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P ++S D+ VLD S S++ + + + ++ Sbjct: 256 PFPTAEPCVVTS------DLFFVLDGSGSVS------VSDFETVKQFVVAVVSAFTI--- 300 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIF 267 R G++ +S+ L IN L++ G T + +E+A Sbjct: 301 GLADTRVGVLQYSTSSSLECNLGDHPDEASFVSAINTLVYQKGGNTYTGAAMEFA----- 355 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 ++ A K +I LTDG++S + ++L V+AIGV + Sbjct: 356 -----RQNAAWRPAPVPKIMIVLTDGKSSDSVVAAAQALAA------DQVAVFAIGVGSF 404 Query: 328 AADQFLKNC-ASPDRFYSVQNSRKLHDAFLRI 358 + L+ P R + + + L + RI Sbjct: 405 DHSELLEITNNKPGRVFELDDFDVLAQSINRI 436 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 71/200 (35%), Gaps = 30/200 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ VLD S S++ + + + ++ R G++ +S+ Sbjct: 477 DLFFVLDGSGSVS------VSDFETVKQFVVAVVSAFTI---GLAETRVGVLQYSTSSTL 527 Query: 230 TFPLAW--GVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L IN + + G +T + LE+A ++ A + Sbjct: 528 ACNLGDHPDEASFVSAINTMTYQKGGSTYTGAALEFA----------RQNAAWRPAPVSR 577 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYS 344 +I LTDG++ + ++L V+AIGV + + L+ ++ + Sbjct: 578 IMIVLTDGQSHDSVVAAAQALAA------DQVTVFAIGVGSFDHSELLEITSNKLGHVFE 631 Query: 345 VQNSRKLHDAFLRIGKEMVK 364 + + + +I + + K Sbjct: 632 LDDFNAMAQNITQIVRAVCK 651 >gi|149066378|gb|EDM16251.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1127 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/309 (13%), Positives = 98/309 (31%), Gaps = 45/309 (14%) Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII 118 ++ N + + + +L + Sbjct: 286 AGVELFAIGVKNADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVE-EQDK 344 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 + + +Y ++ Y + + +++ K K+ D++ ++D S Sbjct: 345 EIKDTEYKVTVTPIY--------------TVGEGVSVSAPGKTLCKAAKA-DLVFMVDGS 389 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WG 236 S+ D D + + + I + + +V F+ F L Sbjct: 390 WSIGD------DNFNKIINFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFKLDAYKT 441 Query: 237 VQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + I + + G TK+ +++ + +F A + K I+ +TDG + Sbjct: 442 KETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTA------DSGTRRGIPKVIVVITDGRS 495 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHD 353 + E + G ++AIGV + ++ + P + V + D Sbjct: 496 QD------DVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHVFFVDD----FD 545 Query: 354 AFLRIGKEM 362 AF +I E+ Sbjct: 546 AFKKIEDEL 554 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 83/223 (37%), Gaps = 26/223 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDV 212 +K ++ D+++++D S S+ R +R L+ + + Sbjct: 143 TSPEEIKFFCETPAIADIVILVDGSWSIGRF----------NFRLVRNFLENLVTAFNVG 192 Query: 213 NNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + R GL +S + L A+ + + + + L + T A N IF+ Sbjct: 193 SEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTG---LALNFIFENS 249 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K E ++ K I +TDG++ I +L + G ++AIGV+ Sbjct: 250 FKPEAGSRSG--VSKIGILITDGKSQDDIIPPSRNL------REAGVELFAIGVKNADLS 301 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + PD Y+V +H + + + + +K Sbjct: 302 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDK 344 >gi|326674128|ref|XP_002664631.2| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 540 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 74/205 (36%), Gaps = 22/205 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ VLD S S+ D D A I ++ + N +V + Sbjct: 152 NQGPGTEIAFVLDGSGSIQD------DDFQKAKDFIYNVMSNVWKTCFDCNF---AIVQY 202 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S I L + + T A N + E+ +K + Sbjct: 203 GSLIRTELSLLDNEDRVGSLLKVKQIKQIYNLTKT-ASAINHVLTDIFIPENGSKDNSA- 260 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFLKNCA 337 K II L+DG+ D N+ + +G Y+IGV +A ++ ++ A Sbjct: 261 -KIIIVLSDGKILG---DPMTLDEVLNKPQMKGVTRYSIGVGDGILKNLDATEEMMQ-IA 315 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEM 362 P ++Y+V + R L+D + + + Sbjct: 316 DPGKYYNVSSYRALNDIVSSLERGI 340 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 45/204 (22%), Positives = 80/204 (39%), Gaps = 29/204 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-SIPDVNNVVRSGLVTF 223 D G ++ VLD S S+ D D A I ++ +I +++R+ L+ Sbjct: 344 KDPGTEIAFVLDGSGSIQD------DDFQKAKDFIYNVMSNCNFAIVQYGSLIRTELLLL 397 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 ++ L ++I ++ + T S A N + E +K + Sbjct: 398 DNEDRAGSLLK------VKQIKQIYNLTKTAS------AINHVLTDIFIPEKGSKNNTA- 444 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-----AEAADQFLKNCAS 338 K II L+DGE D N+ + +G Y+IGV A + + A Sbjct: 445 -KIIIVLSDGEILE---DPMTLDEVLNKPQMKGVTRYSIGVGDGILKKPNAVKEMMQIAD 500 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEM 362 P ++YSV + L+D + +E+ Sbjct: 501 PGKYYSVSSYGALNDILSSLEREI 524 >gi|209549179|ref|YP_002281096.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534935|gb|ACI54870.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 522 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 + LD S SM G D+L A R + + K + + R ++ F + TF Sbjct: 345 ALCLDFSGSMQGD---GEDQLQKAMRFLLTPDEASKVLVQWSPADRIIVIPFDGSVRNTF 401 Query: 232 PLAWGV---QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + + +I+R G T A +I + ++ I+ Sbjct: 402 MASGNPLEQEGLLNEISRQKAGGGTDMYTCAAQALQQIARSDRLSTYLPA--------IV 453 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +TDG + D ++ A V+ I Q Sbjct: 454 IMTDGR----SDDQSQAFMSEWNATEPHVPVFGITFGDADKTQL 493 >gi|16416477|gb|AAL18263.1| vitrin [Homo sapiens] Length = 656 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 473 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 522 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 523 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 570 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 571 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 624 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V LH RI + + Sbjct: 625 HSFFVDEFDNLHQYVPRIIQNI 646 >gi|114046070|ref|YP_736620.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] gi|113887512|gb|ABI41563.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] Length = 1215 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 70/202 (34%), Gaps = 32/202 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +DM +V+D S SM P + + A + + + GLV+FSS Sbjct: 309 EGDIDMQIVMDRSGSMYGS--PINNAIQAAKTLVDATAEGSTA---------MGLVSFSS 357 Query: 226 K---------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + P Q ++ I+ + +T G A + + + Sbjct: 358 RSSVKQDFAVQQIPKPDTGIKQALKAAIDNIYASGSTALFDGSSLALDNLITYQTAAASG 417 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKN 335 A G + L DG+++S + + A +++ G A + Sbjct: 418 APG------VVFVLADGDDNSSIKNEATVITAYQNA---NVPIFSFGYGSASPTGPLVTM 468 Query: 336 C-ASPDRFY-SVQNSRKLHDAF 355 A+ +++ S ++ DAF Sbjct: 469 ANATGGKYFSSPTTLSEIIDAF 490 >gi|47575877|ref|NP_757376.2| complement C2 [Rattus norvegicus] gi|46237595|emb|CAE83973.1| complement component 2 [Rattus norvegicus] gi|47477793|gb|AAH70923.1| Complement component 2 [Rattus norvegicus] Length = 758 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 83/218 (38%), Gaps = 28/218 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+++ + S M+D I S V Sbjct: 250 KIQIQRSGHLNLYLLLDASQSVSEK------DFNIFKESAFLMVDRIFSFEIK---VSVA 300 Query: 220 LVTFSSKIVQTFPLAWGV-QHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAK 270 ++TF+S+ + Q++ E ++ L + T + L Y + + Sbjct: 301 IITFASRPKIIMSVLNERSQNVMEVMDSLDSVCYKDHENATGTNTYEALNSVYLMMNNQM 360 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ P +DN + + + +YAIGV Sbjct: 361 DRLGMETSAWQEIRHAIILLTDGKSNMGGSPKPAVDNIREILGISRNRNDYLDIYAIGVG 420 Query: 326 AEAAD-----QFLKNCASPDRFYSVQNSRKLHDAFLRI 358 D + + +Q+++ + F I Sbjct: 421 KLDVDWKELNELGSKKDGERHAFILQDAKAVQQVFEHI 458 >gi|256420212|ref|YP_003120865.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035120|gb|ACU58664.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 212 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 60/197 (30%), Gaps = 16/197 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +VLD S SM+ + ++ ++ ++ P L+TF S Sbjct: 3 RLPVYLVLDTSGSMSGE------PIEAVKNGVQVLISTLRQDPYALETAFLSLITFDSDA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + Q + L T LE + I K KG + Sbjct: 57 RQLVPLT-DLSSFQ--MPELKASGGTSLGSALELVADSINREVAKSTPDVKGDWKP--LV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG + + L K G V A A LK + + Sbjct: 112 FLMTDGIPTDT---WQNGLNAFQNTK-IGITV-ACAAGNGADVNLLKQITNTVVSLDTAD 166 Query: 348 SRKLHDAFLRIGKEMVK 364 + + F + + Sbjct: 167 AATIKAFFKWVSASVST 183 >gi|149184581|ref|ZP_01862899.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] gi|148831901|gb|EDL50334.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] Length = 528 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 44/300 (14%), Positives = 99/300 (33%), Gaps = 31/300 (10%) Query: 1 MSFLNI-RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYT 59 M F + + + G+ +L A+ PV+F GL ++ + + K ++ Y +D L Sbjct: 1 MGFTKLMKRLQADTSGNALMLVAMGAPVLFGSAGLGVDMAQYYMWKREIQYAVDQGALAG 60 Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER-----STSL 114 A N + G K + Y + + + F ++ S L Sbjct: 61 AWSRGNGDMGLEYKTRAKQEFYINLSETKDYLLTHSIELQTFDGTPDSAVYMHATVSAQL 120 Query: 115 SIIIDDQHKDYNLSAVSRY----EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG-- 168 ++ ++ +R + F C + +S + + + +G Sbjct: 121 PFTKVMINEGMTIAVQARATWETQKQFTACLYSLDPSS-TRTMWFNGGPTVDAACGVGAR 179 Query: 169 --LDMMMVLD-VSLSMNDHFGPGMDKLGVATRSIR---------EMLDIIKSIPDVNN-- 214 D +V + S + N ++ + + M+D + + +N Sbjct: 180 SNADNAIVTNGGSGAQNINWVVAGGTINDGAGAFVNAEVVENYDNMVDPWEGLTPPDNAT 239 Query: 215 --VVRSGLVTFSSKIVQTF--PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 V G + + + + + + K + G+ + S PG E AY+ + Sbjct: 240 PRTVTCGSADANWQADEAQLDAITFKYYRGKNKNDAKSAGAISYSGPGSESAYDVTYATN 299 >gi|119572524|gb|EAW52139.1| integrin, alpha D [Homo sapiens] Length = 366 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 81/224 (36%), Gaps = 30/224 (13%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ T +D++ ++D S S++ + +++ +++ + ++ Sbjct: 131 WEIIQTVPDATPECPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLF 187 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDA 269 + + F+ +T P Q ++ + T + G+ ++F Sbjct: 188 ALMQYSNLLKIHFTFTQFRTSP------SQQSLVDPIVQLKGLTFTATGILTVVTQLFH- 240 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQ 325 H KK +I +TDG+ D E +A++ G I YAIG Q Sbjct: 241 -----HKNGARKSAKKILIVITDGQ---KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQ 292 Query: 326 AEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A Q L +S D + V N A I K++ ++ Sbjct: 293 GPTARQELNTISSAPPQDHVFKVDN----FAALGSIQKQLQEKI 332 >gi|261819810|ref|YP_003257916.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261603823|gb|ACX86309.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 403 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 77/220 (35%), Gaps = 48/220 (21%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR------------- 217 ++++LD S SM D+L E++ I+ P ++ R Sbjct: 204 VVIILDASGSMALSMDATPDELERWLE-GDELVRDIEREPRRISLARQSTSNIIDKLPKD 262 Query: 218 --SGLVTFSS----KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 LV + FP + ++ +INR+ T LE A K+ D E Sbjct: 263 MNISLVAAADCRKVTASTPFPPS-KRTALKNQINRIEPIGKTALAEALEQA-GKLVDGVE 320 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG----AIVYAIGVQAE 327 + I+ +TDG+ + + ++ K+ V I + Sbjct: 321 RDA-----------IILLVTDGDETCGG----DPCEVASKLKKSKPRLQINVVDI-LNTG 364 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A + N + ++V N+++ F I + +K+ I Sbjct: 365 AGNCIASN--TGGSVFAVNNTQE----FSNIMNKAMKEYI 398 >gi|255039144|ref|YP_003089765.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951900|gb|ACT96600.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 935 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 72/196 (36%), Gaps = 27/196 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V S +M+++LDVS SMN + K+ + RSI+ +L +++ ++ Sbjct: 747 VTRSLDGFAPNNMVLLLDVSSSMNSPY-----KMPLLKRSIKSLLTLVRPEDMIS----- 796 Query: 219 GLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +V +S K G + I I+ L T G++ AY + + Sbjct: 797 -IVLYSGKARVVLKPTSGAKASEISRMIDLLQSDGDTDGNEGIKLAYKTANKQYIRGGNN 855 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 I+ TDGE P D + N + ++ G + E Q LK Sbjct: 856 R---------IVLATDGE--FPVSDEVMDMIRQNARQDVYLSIFTFG-RHEHTGQKLKKL 903 Query: 337 A--SPDRFYSVQNSRK 350 + + V ++ Sbjct: 904 SELGMGSYAHVTDASA 919 >gi|149920127|ref|ZP_01908600.1| hypothetical protein PPSIR1_33369 [Plesiocystis pacifica SIR-1] gi|149819070|gb|EDM78507.1| hypothetical protein PPSIR1_33369 [Plesiocystis pacifica SIR-1] Length = 400 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 72/225 (32%), Gaps = 55/225 (24%) Query: 170 DMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 ++M+LD S SM D P + ++ +++ D+++ V G TFS+ Sbjct: 122 SVLMLLDRSGSMMQTGFDADEPDKSRWQALHEALGDVM----LADDMDHFVEFGAKTFST 177 Query: 226 KIVQTFPLAWGVQHIQEKIN------------RLIFG------STTKSTPGLEYAYNKIF 267 + WG + +I+ LI G T + L+ + Sbjct: 178 Q-------GWGECGVSPQIDVPMLLDNSELLLELIPGPLEDVNGGTPTLAALDAGLGMMR 230 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----AKR-RGAIVYAI 322 D K ++ +TDG D +L A+ G Y + Sbjct: 231 D----------YEAPGAKAVVLITDGSIGC-TDDQAATLEQITTELTLAREVDGIATYVV 279 Query: 323 GVQAEAADQFLK------NCASPDRFYSVQNSRKLHDAFLRIGKE 361 G+ + D ++ N+ LH A ++ + Sbjct: 280 GISPSYNSAKAQLGAMAEAGGGADDYFEAANAESLHAALEQVVAD 324 >gi|284989144|ref|YP_003407698.1| cobaltochelatase subunit [Geodermatophilus obscurus DSM 43160] gi|284062389|gb|ADB73327.1| cobaltochelatase subunit [Geodermatophilus obscurus DSM 43160] Length = 674 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 45/129 (34%), Gaps = 18/129 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQ 229 ++ V+D S SM ++ ++ +L + + GLVTF + Sbjct: 492 VLFVVDASGSMG-----SRSRMTAVKGAVLSLL-----LDAYQRRDKVGLVTFRGAEAEL 541 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P W V+ ++ L G T GL A+ + + + ++ Sbjct: 542 ALPPTWSVEAAAARLTALPTGGRTPLAAGLLRAHETLRVERVRDAQRRP-------LLVV 594 Query: 290 LTDGENSSP 298 +TDG + Sbjct: 595 VTDGRATGA 603 >gi|198436966|ref|XP_002122845.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H5 [Ciona intestinalis] Length = 1586 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 72/209 (34%), Gaps = 35/209 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++DVS SM FG +D++ A +I L ++ F+S + + Sbjct: 948 VVFLIDVSGSM---FGIKIDQVRQAMNTILHGLAETDFFS---------VIAFNSSVSRW 995 Query: 231 FP------LAWGVQ-HIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 P LA G +I +N L T +E A A G Sbjct: 996 SPSGTAAVLASGTTANINSAMNFLNTTVVTRGGTDILQAVEAAIQLFDSAA------TGG 1049 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +++ LTDG + + + + R + IG L+ A+ Sbjct: 1050 TNTASDFMVLLTDGRPTDGTVSSTAIISAIRNLNRGRFGINTIGFGTLVDMNLLRKIAAQ 1109 Query: 340 DR------FYSVQNSRKLHDAFLRIGKEM 362 + F + + ++ + + I + + Sbjct: 1110 NSGTSIQIFIDLNSYAQISNFYEEISQPI 1138 >gi|149639247|ref|XP_001506420.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Ornithorhynchus anatinus] Length = 1113 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 62/186 (33%), Gaps = 35/186 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 276 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 323 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 324 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 378 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---QAEAADQFLKNCASP 339 K I+ TDG K+ + V+ V + C + Sbjct: 379 --KIIMLFTDGGEERAQEIFKKYNQ------DKKVRVFTFSVGQHNYDRGPIQWMACKNK 430 Query: 340 DRFYSV 345 +Y + Sbjct: 431 GYYYEI 436 >gi|254387393|ref|ZP_05002639.1| conserved hypothetical protein [Streptomyces sp. Mg1] gi|194346184|gb|EDX27150.1| conserved hypothetical protein [Streptomyces sp. Mg1] Length = 206 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 55/177 (31%), Gaps = 14/177 (7%) Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 S SM+ + ++ M I P + R +VTFS PL+ Sbjct: 2 SGSMSG------GPMAAMNTALPAMQRAILDDPTTGEIARVSVVTFSDTAACVLPLS--- 52 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 ++ L T G + D L A+ H + FL+DG+++S Sbjct: 53 DMAHARMPTLSPQGGTDFAEGFRVGREALVDGIGALGRGARYHRP---VVFFLSDGQHNS 109 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 ++ + GA V + G D + S + ++ Sbjct: 110 SQSWKSGFDRLRSKEDKYGAEVVSFGFGQANRDVIAQ--VSTRHAFFAEDMDPAVAV 164 >gi|206575540|ref|YP_002235854.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] gi|206570384|gb|ACI12030.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] Length = 346 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 54/168 (32%), Gaps = 14/168 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L +++++ ++ P V ++ F+ Sbjct: 3 RLPVFFVLDCSESMIGE------NLKRMNDGLQKIVGDLRKDPHALETVWVSVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + RL G T L +I K KG + Sbjct: 57 RTIVPL---HDIVSFYPPRLPVGGGTNLATALRELTTQIDSQVRKTTQEEKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 LTDG P D + NE + + A+G+ + L+ Sbjct: 112 YLLTDG---RPTDDISAEVKRWNEFYAKKVNMIAVGIGSSVDLSVLRQ 156 >gi|304310230|ref|YP_003809828.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] gi|301795963|emb|CBL44164.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] Length = 371 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 78/213 (36%), Gaps = 26/213 (12%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V S + + V+D S S+ ++ + ++ A + ++ Sbjct: 104 VSAEPNSSLPFVTVFVMDYSPSVRNN-ETALKRMEEAAKGFVSLMQPKDKA--------- 153 Query: 219 GLVTFSSKIVQT-FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ F+ ++ L + I+ + + Y+K++DA K ++ Sbjct: 154 AVIKFNDRVEVMGAGLTSNHDTLNAAIDSIPP----------QRGYSKLYDAISKAIEVS 203 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + II L+DG++ + + N+ + G V+ IG L+ A Sbjct: 204 NCNPKLVCSIIVLSDGDDVGSALPLADLH---NQLYQAGTAVFPIGYGDNINVNKLQELA 260 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 S +Y+ ++S + + RI + + +L Sbjct: 261 TNSGGAYYTSEDSSQFSAVYERIWARLSNEFML 293 >gi|326926921|ref|XP_003209645.1| PREDICTED: integrin alpha-11-like [Meleagris gallopavo] Length = 1195 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 81/215 (37%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 163 QTYMDIIIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 211 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ AY F E + + Sbjct: 212 DVVHEFHL-NDYRSVKDVVAAASHIEQRGGTET----RTAYGIEFARSEAFQKGGRKGA- 265 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 K+ +I +TDGE + D+ + ++++ YA+ V + FL Sbjct: 266 -KRVMIVITDGE----SHDSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGINPEAFLNEI 320 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G+ + Sbjct: 321 KFIASDPDDKHFFNVTDEAALKDIVDALGERIFSL 355 >gi|118096010|ref|XP_413930.2| PREDICTED: similar to integrin alpha 11 subunit [Gallus gallus] Length = 1191 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 81/215 (37%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 163 QTYMDIIIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 211 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ AY F E + + Sbjct: 212 DVVHEFHL-NDYRSVKDVVAAASHIEQRGGTET----RTAYGIEFARSEAFQKGGRKGA- 265 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 K+ +I +TDGE + D+ + ++++ YA+ V + FL Sbjct: 266 -KRVMIVITDGE----SHDSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGINPEAFLNEI 320 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G+ + Sbjct: 321 KFIASDPDDKHFFNVTDEAALKDIVDALGERIFSL 355 >gi|73980136|ref|XP_851163.1| PREDICTED: similar to vitrin isoform 2 [Canis familiaris] Length = 645 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 68/199 (34%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 462 DIGFVIDGSSSV------GTGNFRTVLQFVANL---SKEFEISDTDTRIGAVQYTYEQRL 512 Query: 230 TFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F + +N + + T + + YA ++F K + +K Sbjct: 513 EFGFD-DYNTKSDILNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKRK 562 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFY 343 +I +TDG + + A +G I YAIGV A D+ P D + Sbjct: 563 LMILITDGRSYD------DVRIPAMVAHHKGVITYAIGVAWAAQDELEVIATHPANDHSF 616 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V L+ ++ + + Sbjct: 617 FVDEFDHLYKFVPKVIQSI 635 >gi|332970881|gb|EGK09858.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 442 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 38/310 (12%), Positives = 94/310 (30%), Gaps = 31/310 (10%) Query: 56 LLYTATKILNQENGNNGKKQKNDF--SYRIIKNIWQTDFRNELRENGFAQDINNIERSTS 113 ++ + + N +K+K + K I + +N + + Sbjct: 19 IVSGCSTGEKEANQKKPEKKKETIPKAAHEPKEIMKQKPGRFSGDNYDPKKVEEALDQFP 78 Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 ++ ++ + Y P + + + S +++ + Sbjct: 79 DNLSTEEAYDRLVYLLGENYR-PKYEELMSLDPTIQVNEKTPDNKIDVPSIEQ--MNVEI 135 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV---------RSGLVTFS 224 +LD S SM G+ K+ +A ++IR + V+ V + V+ Sbjct: 136 LLDASGSMAGRVDGGV-KMDLAKQAIRAFASDVPEGAQVSLRVYGHKGSNQKKDKAVSCQ 194 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S + ++ ++ +N+ T ++ A + + Sbjct: 195 SNELVYPLKSYDSSQFEQSLNQFKPTGWTPLASAIQAAREDLKEWAGARN---------- 244 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNC--ASPDR 341 + ++DG + +E+ + IG +A Q LK A+ Sbjct: 245 -IVYVVSDGVETCGGDPVREAKKLGESGIEPMVKI--IGFDVDDAGQQQLKKVAEAADGS 301 Query: 342 FYSVQNSRKL 351 + +V + L Sbjct: 302 YQTVTSGDDL 311 >gi|327280282|ref|XP_003224881.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XIV) chain-like [Anolis carolinensis] Length = 1885 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 73/201 (36%), Gaps = 24/201 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIV 228 D+++++D S S+ R +R L+ + + + R GL +S Sbjct: 159 DIVILVDGSWSIGRF----------NFRLVRLFLENLVAAFNVGSEKTRIGLAQYSGDPR 208 Query: 229 QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + L + + R + + GL A I + K E A+ K Sbjct: 209 IEWHLNTYSTKDAVLDAVRNLPYKGGNTLTGL--ALTFILENNFKSEAGARPGVP--KIG 264 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSV 345 I +TDG++ + + K G ++AIGV+ + + + PD Y+V Sbjct: 265 ILITDGKSQD------DVIPPAKNLKDAGIELFAIGVKNADETELKEIASEPDNTHVYNV 318 Query: 346 QNSRKLHDAFLRIGKEMVKQR 366 + ++ + K + + Sbjct: 319 ADFSFMNSIVEGLTKTVCSRV 339 >gi|256111841|ref|ZP_05452806.1| NorD protein [Brucella melitensis bv. 3 str. Ether] gi|265993299|ref|ZP_06105856.1| protein norD [Brucella melitensis bv. 3 str. Ether] gi|262764169|gb|EEZ10201.1| protein norD [Brucella melitensis bv. 3 str. Ether] Length = 633 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 77/205 (37%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S E + +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEVRAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|17989341|ref|NP_541974.1| NorD protein [Brucella melitensis bv. 1 str. 16M] gi|256043169|ref|ZP_05446110.1| NorD protein [Brucella melitensis bv. 1 str. Rev.1] gi|260564388|ref|ZP_05834873.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|265989599|ref|ZP_06102156.1| protein norD [Brucella melitensis bv. 1 str. Rev.1] gi|81850090|sp|Q8YBA4|NORD_BRUME RecName: Full=Protein norD gi|17985210|gb|AAL54238.1| nord protein [Brucella melitensis bv. 1 str. 16M] gi|260152031|gb|EEW87124.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|263000268|gb|EEZ12958.1| protein norD [Brucella melitensis bv. 1 str. Rev.1] Length = 633 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 77/205 (37%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++S E + +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEVRAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|308472805|ref|XP_003098629.1| hypothetical protein CRE_04229 [Caenorhabditis remanei] gi|308268229|gb|EFP12182.1| hypothetical protein CRE_04229 [Caenorhabditis remanei] Length = 399 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 43/219 (19%), Positives = 72/219 (32%), Gaps = 12/219 (5%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 ++ T + A L T + + + LD++ V+D S+ M G+D Sbjct: 2 KLAIAVFTMVLLFYGTSADLSYTDRECGTDLTKLWLDVVAVVDNSIGMT---NEGLDTTA 58 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 S+ I + R GLVT++ Q L Q + + N + T Sbjct: 59 AHIASVFSAGTRIGTQSSEPRTTRVGLVTYNVNAQQNADL-NKFQSVDDLFNGVFADLKT 117 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDD--YKKYIIFLTDGENSSPNIDNKESLFYCNE 311 ST Y + A+ E+ G YKK +I ++ + L Sbjct: 118 VSTSAQSYLSTGLAAAESLFEYENFGTSRSHYKKVVIVY---ASTYVGEGEMDPLPVAIR 174 Query: 312 AKRRGAIVYAIGVQAEAADQFLKN---CASPDRFYSVQN 347 K G + + LK ASP +S + Sbjct: 175 LKTSGVNIITMAYVQNGDGFMLKQLAEIASPRFNFSSTD 213 >gi|307154013|ref|YP_003889397.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306984241|gb|ADN16122.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 349 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 44/244 (18%), Positives = 80/244 (32%), Gaps = 35/244 (14%) Query: 134 EMPFIFCTFPWCANSSHAPLLITS------SVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 + IF T + P L+T+ S S +M++LD S SM Sbjct: 19 SLTLIFLTLLGGCETREDPCLLTANPSPSLSTTTSPTPIAKSPVMIILDASGSMVKEKEK 78 Query: 188 GMDKLG--VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 KL +A SI +++ + ++ G ++ ++ +EK Sbjct: 79 IDGKLKLDIAKESITTIINSSDASNLELSLTALG--------HKSKDCKDNIEIFEEKNK 130 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP-----NI 300 +++ P ++ + K I+ LTDG S N Sbjct: 131 KIL-----DVLPAIKGGTETPLTEAIRQASSKFKDKKQKNIIVLLTDGVESCKKDKNHNP 185 Query: 301 DNKESLFYCNEAKRRG-------AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D K L + K+ G + I E ++ K A + Y +N+ +L+ Sbjct: 186 DQKAPLEEVDSLKKEGFNFILNIITLGKIKTNNEILEELAK--AGGGKLYKPENTEELNK 243 Query: 354 AFLR 357 A Sbjct: 244 ALSE 247 >gi|225028941|ref|ZP_03718133.1| hypothetical protein EUBHAL_03231 [Eubacterium hallii DSM 3353] gi|224953716|gb|EEG34925.1| hypothetical protein EUBHAL_03231 [Eubacterium hallii DSM 3353] Length = 291 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 81/203 (39%), Gaps = 18/203 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + + ++DVS SM K+G ++ E+L + + + + V+ ++ FS Sbjct: 48 AKKSMTIFFMIDVSGSMKG------TKIGSLNSTMEELLPSLIGVGEASTDVKIAIMKFS 101 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + P ++ + NRL T + A+ ++ + ++ + Sbjct: 102 TDVEWVTPEPVKIEE-YQYWNRLEADGLT----FMGDAFMELSKKLSRSTFLSSPSLSFA 156 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAADQFLKNCASPDRF 342 I L+DG SPN D K+ L + K + G + A+G+ ++ L+ D Sbjct: 157 PVIFLLSDG---SPNDDWKKGLDTLKQNKWFQHGLKI-ALGIGSKVNMDVLRAFTGNDEL 212 Query: 343 -YSVQNSRKLHDAFLRIGKEMVK 364 +N+ +L + + + Sbjct: 213 AVQAKNADQLRELIKLLAVTSSQ 235 >gi|77918074|ref|YP_355889.1| hypothetical protein Pcar_0459 [Pelobacter carbinolicus DSM 2380] gi|77544157|gb|ABA87719.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380] Length = 370 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 61/210 (29%), Gaps = 50/210 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQ 229 + + LD+S SM + + + LD I+S D+ GLV FSS Sbjct: 88 LFLALDLSGSMKR----PVSRF-SSQTLGDLALDGIESFIDMRRHEDYIGLVAFSSYAKL 142 Query: 230 TFPLAWGVQHIQEKIN------------RLIFGSTTKSTPG----LEYAYNKIFDAKEKL 273 PL + ++ K+ L G T ++ L ++ + + Sbjct: 143 LAPLTFDKDLLKAKLALVRSKNHSRIYRELGAGGGTNASEAVWLSLSAFFSMLPEDNRLS 202 Query: 274 EHIAKGHDD------------YKKY----------IIFLTDG--ENSSPNIDNKESL--- 306 G + +K +I TDG E + + L Sbjct: 203 VEQIAGMREFLLGEPGALLDIPQKIRNAGLGTGMAVILFTDGRIEPTLRAHVRRGGLPNL 262 Query: 307 -FYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 K G Y I V E K Sbjct: 263 VNIITLMKAVGVRFYIISVGGEVDAAVQKA 292 >gi|30749469|pdb|1MHP|A Chain A, Crystal Structure Of A Chimeric Alpha1 Integrin I-Domain In Complex With The Fab Fragment Of A Humanized Neutralizing Antibody gi|30749472|pdb|1MHP|B Chain B, Crystal Structure Of A Chimeric Alpha1 Integrin I-Domain In Complex With The Fab Fragment Of A Humanized Neutralizing Antibody Length = 192 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 74/211 (35%), Gaps = 37/211 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++VLD S S + + ++L + P G+V + + Sbjct: 3 LDIVIVLDGSNS--------IYPWESVIAFLNDLLKRMDIGPKQTQ---VGIVQYGENVT 51 Query: 229 QTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + + N+++ T + G++ A + F K Sbjct: 52 HEFNLNKYSSTEEVLVAANKIVQRGGRQTMTALGIDTARKEAFTEARGARRGVK------ 105 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---------QAEAADQFLKN 335 K ++ +TDGE S N K+ + C ++I + E + +K+ Sbjct: 106 KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKS 161 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRIGKEMV 363 AS F++V + L +G+ + Sbjct: 162 IASEPTEKHFFNVSDELALVTIVKALGERIF 192 >gi|315644678|ref|ZP_07897809.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315279937|gb|EFU43236.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 562 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 46/213 (21%), Positives = 82/213 (38%), Gaps = 37/213 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + + V DVS SM+ G + +L + + ++ L SI G Sbjct: 379 KEKKDGNKPVAAVFVADVSGSMD---GEPLHRLKESLLTGQKYLGRDNSI---------G 426 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEK------INRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 V++S+ + P+ G + ++ IN L T + G+ A + D Sbjct: 427 FVSYSTDVAINLPI--GKYDLNQQSMFVGAINSLEASGNTATFDGIVVAMKMLHDE---- 480 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + D K I L+DGE + ++D+ L K VY IG A Q Sbjct: 481 ---MAVNPDVKPLIFVLSDGETNVGHSLDDIRELI-----KAFKIPVYTIGYNA--NIQA 530 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 L++ +S + + + D +IG+ + Q Sbjct: 531 LQSISSINE--AASINADTDDVVYKIGQLLNVQ 561 >gi|295789107|ref|NP_001171441.1| vitrin isoform 3 [Homo sapiens] Length = 657 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 474 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 523 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 524 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 571 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 572 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 625 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V LH RI + + Sbjct: 626 HSFFVDEFDNLHQYVPRIIQNI 647 >gi|291229678|ref|XP_002734799.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 1003 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 74/196 (37%), Gaps = 34/196 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+D S SM ++ +DKL A +L+ + +V G+VTFS+ Sbjct: 288 VLVMDTSGSMEEN--GRIDKLHQAVS--NYILNTLDDGEEV------GVVTFSTTATIQS 337 Query: 232 PLAW----GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + ++ + T GL A++ + + + Sbjct: 338 HLVLINNESRTELLSRVPSMQSVGRWTSIGSGLLKAFDVLEEGERNAAGG---------V 388 Query: 287 IIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD----R 341 I+ ++DG EN P I + + +G V +IG+ +A+ A+ D Sbjct: 389 IVVISDGEENRDPLIADIIPMVL-----EKGVTVDSIGIGTDASTNLEVLPAATDGMTFY 443 Query: 342 FYSVQNSRKLHDAFLR 357 + NS L++A Sbjct: 444 YSEDSNSNGLNEALAA 459 >gi|254255255|ref|ZP_04948571.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] gi|124900992|gb|EAY71742.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] Length = 511 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 12/80 (15%), Positives = 34/80 (42%), Gaps = 2/80 (2%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R + +G+++I+ + L V+ +GL ++ + +++L D L A + + Sbjct: 99 VRRGLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS 158 Query: 66 QENGNNGKKQKNDFSYRIIK 85 + + N + + Sbjct: 159 --AISLSVAEANGIAAGHMN 176 >gi|156096226|ref|XP_001614147.1| sporozoite surface protein 2 [Plasmodium vivax SaI-1] gi|148803021|gb|EDL44420.1| sporozoite surface protein 2 [Plasmodium vivax] Length = 556 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 64/181 (35%), Gaps = 30/181 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVV 216 + +D+ +++D S S+ + + K L S+ D I ++ ++ Sbjct: 40 NESVDLYLLVDGSGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELI 97 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G I + L+ + E TT T L D +K + Sbjct: 98 RLGS---GQSIDKRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLND 142 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + +I +TDG +S +L N+ K+R + IGV QF + Sbjct: 143 RVNREKAIQLVILMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLI 198 Query: 337 A 337 A Sbjct: 199 A 199 >gi|221068121|ref|ZP_03544226.1| outer membrane adhesin like proteiin [Comamonas testosteroni KF-1] gi|220713144|gb|EED68512.1| outer membrane adhesin like proteiin [Comamonas testosteroni KF-1] Length = 1268 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 59/180 (32%), Gaps = 21/180 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGM---------DKLGVATRSIREMLDIIKSIPDVNNV 215 + + ++M+VLD+S SM + +L +A +++ +++ + D Sbjct: 845 AKLQTNVMIVLDLSGSMAWDSNGKVLPGGGSNANSRLSLAKKALEALINKYEEYGD---- 900 Query: 216 VRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 V LVTF+ W IN L T + A N + Sbjct: 901 VAVKLVTFNGSTANAHA-TWMSAATAIAIINGLTATGGT----PYKAALNAAMGTNGFAD 955 Query: 275 HIAK-GHDDYKKYIIFLTDGENS-SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ K + + F+TDG + ++ + + + +G Sbjct: 956 NVGKLTGEGVQNVSYFITDGVPTLGQGVNPRLQAQWEDFLTTHHINSVGVGFGGIKTGDI 1015 >gi|170762586|gb|ACB32196.1| MxaC [uncultured bacterium 16A2] Length = 292 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 69/211 (32%), Gaps = 40/211 (18%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM------DKLGVATRSIREMLDI 205 P + V+ + G +++ ++D S SMN+ F K A R ++ ++ Sbjct: 69 PFRVGGEVERLGE---GTEIVFLIDRSGSMNETFAGRTPGGGEESKASAAKRLLQGFVER 125 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAY 263 N+ V G+ FS+ + PL + I+ + T GL A Sbjct: 126 -----GRNDFV--GVAAFSTSPMLVMPLTDHRAATKAAIDAVDRPGLDYTNIGRGLAMAL 178 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + I+ ++DG + ID + E K+ +Y + Sbjct: 179 SMFKADASDRSRA----------ILLVSDG---AGVIDPRIQDDLRAEFKKTNVHLYWL- 224 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 FL+ SP + + A Sbjct: 225 --------FLRTAGSPGIYELPEAGSDTPQA 247 >gi|218528924|ref|YP_002419740.1| hypothetical protein Mchl_0894 [Methylobacterium chloromethanicum CM4] gi|218521227|gb|ACK81812.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 477 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 69/468 (14%), Positives = 140/468 (29%), Gaps = 126/468 (26%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 N +GSI++L A+ + ++GL ++ K +L D + L + +E Sbjct: 18 LASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGV--VTAKEF 75 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 +Q + + I +Q + + ++ + L I+ Q D +S Sbjct: 76 IAANAQQSDVMTAGIKAGEYQALKAFNVNASKVPFATVSL---SQLEIVRSGQTLDATVS 132 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM------- 181 + + F ++ V S+ LD +++DVS SM Sbjct: 133 YTATVQSTFGRLFGLSVTTLTNR-------VNASADIAGYLDFYLMVDVSGSMGLPTTDS 185 Query: 182 --------------------------NDHFGPGMDKL--GVATRSIREMLDIIKSIPDVN 213 + G +L ++ +LD P V Sbjct: 186 DAALLASKSVEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASK-PIVP 244 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQE--------------KINRLIFGST---TKST 256 N R G+ F +++ PL+ + N L GST T + Sbjct: 245 NQYRIGIYPFINRLATLAPLSDTTTSLASLKTTADCGKAWPLAFTNLLDTGSTQLFTNND 304 Query: 257 PG---------LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 P E A ++ + + + K ++ +TDG +S + + Sbjct: 305 PTTGTGSGGTHFETALPQMKSTIRTFGNGS-SSANPKPFVFLITDGMQNSQTYSSWKDKK 363 Query: 308 Y----------------------------CNEAKRRGAIV------------YA------ 321 C + K+ GA + Y Sbjct: 364 TYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIPYNYVKSYTNEGTIV 423 Query: 322 -----IGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + L+ CASP F++ +++ + + + + +K Sbjct: 424 WENNRVNGFSPTLADPLRQCASPGLFFTANSAKDITASLGAMFDQALK 471 >gi|16080727|ref|NP_391555.1| hypothetical protein BSU36740 [Bacillus subtilis subsp. subtilis str. 168] gi|221311634|ref|ZP_03593481.1| hypothetical protein Bsubs1_19866 [Bacillus subtilis subsp. subtilis str. 168] gi|221315962|ref|ZP_03597767.1| hypothetical protein BsubsN3_19787 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221320873|ref|ZP_03602167.1| hypothetical protein BsubsJ_19730 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325158|ref|ZP_03606452.1| hypothetical protein BsubsS_19896 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313224|ref|YP_004205511.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] gi|8928525|sp|P70960|YWMC_BACSU RecName: Full=Uncharacterized protein ywmC; Flags: Precursor gi|1648853|emb|CAB03680.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636199|emb|CAB15691.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019498|gb|ADV94484.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] Length = 227 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 76/249 (30%), Gaps = 34/249 (13%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M F ++ K +++ ++ ++LD S SM G+ K Sbjct: 1 MKKRFSLIMMTGLLFGLTSPAFAAEKTETEAKAPANVAVLLDASGSMAKRID-GVSKFNS 59 Query: 195 ATRSIREMLDIIKSIPDVNNVV--RSGLVTFSSKIVQTFPL-------AWGVQHIQEKIN 245 A + I + + V V G S K+ + ++ Q +N Sbjct: 60 AKKEISKFASSLPEGTQVKMSVFGSEGNNKNSGKVQSCEAIRNVYGFQSFNEQSFLNSLN 119 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + T L A + + AKG +K + LTDGE + Sbjct: 120 TIGPTGWTPIAKALNEAKSSF------DQLDAKG----EKVVYLLTDGEETCGG----NP 165 Query: 306 LFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 + E ++ V IG E L A ++ + + F Sbjct: 166 IKTAKELQKDNITVNVIGFDYKEGYKGQLNAIAKVGGGEYFPAYTQKDVEKIF------- 218 Query: 363 VKQRILYNK 371 +Q ++ +K Sbjct: 219 TQQSLMLSK 227 >gi|288942397|ref|YP_003444637.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897769|gb|ADC63605.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 357 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 73/166 (43%), Gaps = 19/166 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G +++++LDVS SM ++LG A + +++++ + VR GL+ F++ Sbjct: 96 GNNLLVLLDVSRSMLAEDVSP-NRLGRARQELQDLIVQNRQ-------VRLGLIVFATVP 147 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG-HDDYKKY 286 P+ + +N L S ++PGL+ + + A + E + G +D + Sbjct: 148 HVLAPITEDTTSL---LNALPALSADLASPGLQG--SSLTRALMRAETLLAGLPEDSARA 202 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ ++DG+ P + + + +G ++ +GV Sbjct: 203 VLLISDGDFDEPGLREQVARLA-----EQGVRLHVLGVGTPGGATV 243 >gi|254786772|ref|YP_003074201.1| lipoprotein [Teredinibacter turnerae T7901] gi|237685641|gb|ACR12905.1| putative lipoprotein [Teredinibacter turnerae T7901] Length = 1051 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 77/213 (36%), Gaps = 10/213 (4%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + S +++ + +++LD+S S+++ + A I E + + + R Sbjct: 213 SEPVSNANLIIKTVLLLDISTSLSES--DIITLKQAARAVIYEENNHGVKVSRLIPGQRV 270 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEH 275 + TF S++ + + V + + I+ + T + + + Sbjct: 271 AIYTFDSQVERIIGFSSDVGALADAIDSIPEKIVEGGTTLRGNSTNLIGAVQTGVAQWTN 330 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAEAADQFL 333 D Y I +TDGE++ + D + G VYAI V+ A L Sbjct: 331 RFGLDDAETGYAILVTDGEHT--SDDATPASIQGQLVNAAGVRKDVYAIAVRNNADMAAL 388 Query: 334 KNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + ++ Y + L+ + E ++Q Sbjct: 389 SEITGTTNKVYQALDISDLNLRLQEVQAEAIEQ 421 >gi|221506674|gb|EEE32291.1| von willebrand factor type A domain-containing protein, putative [Toxoplasma gondii VEG] Length = 931 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 62/181 (34%), Gaps = 18/181 (9%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + T + ++ +D + VLD S S++ + + + + Sbjct: 238 VPETPEISTTTCHKGRVDAVAVLDGSGSISRADWKKTRDIAKLFSGALNIAEDQSHV--- 294 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNK-IFDAK 270 +VVR + + P++W + + +I+RL T + LE AY + Sbjct: 295 -SVVRFSTTARADWSLVQ-PVSWTEKQLTNRISRLPQPYGGTNTPAALEEAYKIFVTSMN 352 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSS--------PNIDNKESLFYCNEAKRRGAIVYAI 322 + EH + + ++ TDG + P + L + K V I Sbjct: 353 NRDEHD---SEHVHRVLLLATDGCVNQWDRFKFRTPEAHLHDVLERMSSLKNLHIKVLGI 409 Query: 323 G 323 G Sbjct: 410 G 410 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 50/129 (38%), Gaps = 14/129 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D ++VLD S+S+ +L T+ + LD R GLV++S + Sbjct: 20 VDAVVVLDSSMSVGAEHWQ---ELLKLTKQFGDTLDSSAGHS------RLGLVSYSDSVT 70 Query: 229 QTFPLA---WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L G ++++ F T + L+ AY +F + + + Sbjct: 71 VLRKLQKIPSGTAQFEKELGAASFMNGNTFTPKALDSAYE-LFKETIHEDSEGAEDQEKR 129 Query: 285 KYIIFLTDG 293 + ++ TDG Sbjct: 130 RLLLLATDG 138 >gi|237831727|ref|XP_002365161.1| von Willebrand factor type A domain-containing protein [Toxoplasma gondii ME49] gi|211962825|gb|EEA98020.1| von Willebrand factor type A domain-containing protein [Toxoplasma gondii ME49] Length = 929 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 62/181 (34%), Gaps = 18/181 (9%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + T + ++ +D + VLD S S++ + + + + Sbjct: 238 VPETPEISTTTCHKGRVDAVAVLDGSGSISRADWKKTRDIAKLFSGALNIAEDQSHV--- 294 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNK-IFDAK 270 +VVR + + P++W + + +I+RL T + LE AY + Sbjct: 295 -SVVRFSTTARADWSLVQ-PVSWTEKQLTNRISRLPQPYGGTNTPAALEEAYKIFVTSMN 352 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSS--------PNIDNKESLFYCNEAKRRGAIVYAI 322 + EH + + ++ TDG + P + L + K V I Sbjct: 353 NRDEHD---SEHVHRVLLLATDGCVNQWDRFKFRTPEAHLHDVLERMSSLKNLHIKVLGI 409 Query: 323 G 323 G Sbjct: 410 G 410 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 50/129 (38%), Gaps = 14/129 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D ++VLD S+S+ +L T+ + LD R GLV++S + Sbjct: 20 VDAVVVLDSSMSVGAEHWQ---ELLKLTKQFGDTLDSSAGHS------RLGLVSYSDSVT 70 Query: 229 QTFPLA---WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L G ++++ F T + L+ AY +F + + + Sbjct: 71 VLRKLQKIPSGTAQFEKELGAASFMNGNTFTPKALDSAYE-LFKETIHEDSEGAEDQEKR 129 Query: 285 KYIIFLTDG 293 + ++ TDG Sbjct: 130 RLLLLATDG 138 >gi|187927679|ref|YP_001898166.1| hypothetical protein Rpic_0583 [Ralstonia pickettii 12J] gi|187724569|gb|ACD25734.1| conserved hypothetical protein [Ralstonia pickettii 12J] Length = 414 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 34/85 (40%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 +G++ + ++L V+ + GLVI+ S F K +L +D L A ++ + Sbjct: 6 KQRERGAVLPMVGLMLAVLLGMAGLVIDLSGLFVAKTELQSAVDSCALSAAQELDGASDA 65 Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRN 94 + + ++Q + Sbjct: 66 LTRATNAGVTAGNANRVVYQASSAS 90 >gi|158298451|ref|XP_318625.3| AGAP009598-PA [Anopheles gambiae str. PEST] gi|157013887|gb|EAA14534.4| AGAP009598-PA [Anopheles gambiae str. PEST] Length = 1124 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 76/193 (39%), Gaps = 22/193 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREML--DIIKSIPDVNNVVRSGL 220 + D++++LD S SM+ +L VAT +I + L D ++ ++ R + Sbjct: 148 AASSPKDVIILLDSSGSMSGKEY----QLAVATASAILDTLGDDDFFNLISFSDQSRVIV 203 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 F K+V+ P V+ ++ IN + +T + LE A+ + + + + Sbjct: 204 PCFQDKMVRATP--DNVKEVKTAINAVECENTANFSAALETAFELL-----RKYNQSSQG 256 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFLK-NCA 337 + I+ +TDG P+ E + + N ++ IG K C Sbjct: 257 SQCNQAIMLITDG----PSDTFMEVIKHYNHP-HMPVRIFTYLIGTDKSGGKNLYKMACE 311 Query: 338 SPDRFYSVQNSRK 350 + F + + + Sbjct: 312 NKGFFVQINSPEE 324 >gi|115496418|ref|NP_001068821.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Bos taurus] gi|122142424|sp|Q0VCM5|ITIH1_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|111308523|gb|AAI20097.1| Inter-alpha (globulin) inhibitor H1 [Bos taurus] Length = 906 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 74/199 (37%), Gaps = 18/199 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV-TF 223 + +++ V+D+S SM KL ++ ++L ++ + V+ V ++ Sbjct: 282 KKLNKNVVFVIDISSSMEGQ------KLKQTKEALHKILGDMRPGDYFDLVLFGSAVQSW 335 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +VQ P ++ + + + T GL + A++ L ++ Sbjct: 336 KGSLVQASPA--NLEAARNFVQQFSLAGATNLNGGLLRGIEILNKAQQSLPELSNHAS-- 391 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-- 341 +I LTDGE + +D + L + + +Y +G + FL+ + + Sbjct: 392 --ILIMLTDGEPTEGVMDRTQILKNVRDGIKGRFPLYNLGFGHDVDLNFLEVMSLENNGR 449 Query: 342 ---FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 450 VQRIYEDHDATQQLQGFYE 468 >gi|296474790|gb|DAA16905.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Bos taurus] Length = 906 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 74/199 (37%), Gaps = 18/199 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV-TF 223 + +++ V+D+S SM KL ++ ++L ++ + V+ V ++ Sbjct: 282 KKLNKNVVFVIDISSSMEGQ------KLKQTKEALHKILGDMRPGDYFDLVLFGSAVQSW 335 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +VQ P ++ + + + T GL + A++ L ++ Sbjct: 336 KGSLVQASPA--NLEAARNFVQQFSLAGATNLNGGLLRGIEILNKAQQSLPELSNHAS-- 391 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-- 341 +I LTDGE + +D + L + + +Y +G + FL+ + + Sbjct: 392 --ILIMLTDGEPTEGVMDRTQILKNVRDGIKGRFPLYNLGFGHDVDLNFLEVMSLENNGR 449 Query: 342 ---FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 450 VQRIYEDHDATQQLQGFYE 468 >gi|327290735|ref|XP_003230077.1| PREDICTED: collagen alpha-1(XIV) chain-like, partial [Anolis carolinensis] Length = 562 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 73/201 (36%), Gaps = 24/201 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIV 228 D+++++D S S+ R +R L+ + + + R GL +S Sbjct: 52 DIVILVDGSWSIGRF----------NFRLVRLFLENLVAAFNVGSEKTRIGLAQYSGDPR 101 Query: 229 QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + L + + R + + GL A I + K E A+ K Sbjct: 102 IEWHLNTYSTKDAVLDAVRNLPYKGGNTLTGL--ALTFILENNFKSEAGARPGVP--KIG 157 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSV 345 I +TDG++ + + K G ++AIGV+ + + + PD Y+V Sbjct: 158 ILITDGKSQD------DVIPPAKNLKDAGIELFAIGVKNADETELKEIASEPDNTHVYNV 211 Query: 346 QNSRKLHDAFLRIGKEMVKQR 366 + ++ + K + + Sbjct: 212 ADFSFMNSIVEGLTKTVCSRV 232 >gi|297667862|ref|XP_002812182.1| PREDICTED: vitrin-like isoform 3 [Pongo abelii] Length = 657 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 474 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 523 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 524 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 571 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 572 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 625 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 626 HSFFVDEFDNLYQYVPRIIQNI 647 >gi|296127202|ref|YP_003634454.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296019018|gb|ADG72255.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 542 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/161 (22%), Positives = 65/161 (40%), Gaps = 24/161 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 SK + +D+++VLD + SM+ + L R IR + ++ + D N R G + Sbjct: 378 SKYEGSIDLVIVLDTTESMHPY-------LKAVKRDIRGV---VRDLFDNNKGSRIGFLL 427 Query: 223 FSSKIVQTF----PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F L+ + I ++N + Y I +A EK ++I + Sbjct: 428 YRDVKDTYFTKRIELSDNINTINREVNYFYAAGGGDKAEPM---YEAIQEALEKFDYINE 484 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 KK +I +TD + I + AK +G I+ Sbjct: 485 -----KKLLIVVTDA--PAKVIGRADLALNTKTAKEKGIII 518 >gi|183600995|ref|ZP_02962488.1| hypothetical protein PROSTU_04610 [Providencia stuartii ATCC 25827] gi|188019327|gb|EDU57367.1| hypothetical protein PROSTU_04610 [Providencia stuartii ATCC 25827] Length = 212 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 61/171 (35%), Gaps = 12/171 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + I+ ML+ ++ P V ++T+ ++ Sbjct: 3 RLPVYLLIDTSGSMRGE------SIHAVNVGIQAMLNALRQDPYALESVHIAIITYDNEA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL +++ Q + T + LE + + + KG + Sbjct: 57 REFIPLT-ALENFQFTDIVVPSSGGTFTGAALECLIQCVDRDIRRTDDTQKGDWRP--LV 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 +TDG +P+ + KR + A V +A LK S Sbjct: 114 FLMTDG---TPSDSYAYTEAIKEIKKRSFGSIIACAVGPKAKHDHLKQLTS 161 >gi|149036652|gb|EDL91270.1| anthrax toxin receptor 1 [Rattus norvegicus] Length = 457 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 34/145 (23%), Positives = 57/145 (39%), Gaps = 13/145 (8%) Query: 221 VTFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + FS++ L + I++ ++ +++ G T G E A +I+ + A Sbjct: 4 IVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTA 63 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 II LTDGE E N ++ GAIVY +GV+ Q + Sbjct: 64 S-------VIIALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIAD 114 Query: 338 SPDRFYSVQNS-RKLHDAFLRIGKE 361 S D + V + + L I K+ Sbjct: 115 SKDHVFPVNDGFQALQGIIHSILKK 139 >gi|326798073|ref|YP_004315892.1| von Willebrand factor type A [Sphingobacterium sp. 21] gi|326548837|gb|ADZ77222.1| von Willebrand factor type A [Sphingobacterium sp. 21] Length = 622 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 95/258 (36%), Gaps = 43/258 (16%) Query: 132 RYEMPFIFCTFPWCA---------NSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSM 181 +Y +P P N H L I K I + +++ ++DVS SM Sbjct: 203 QYNLPAPQNNEPVAIHTELSQAPWNPHHRLLRIALKAKAIDAAKLPPANLVFLIDVSGSM 262 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-- 239 + G ++L + S++ ++D + R +VT++ W + Sbjct: 263 D-----GPNRLPLVKSSLKMLVD------QLRKEDRVAIVTYAGTARIKLAPVWANEKMR 311 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ I+ L G +T GL+ AY+ EH K ++ II +DG+ + Sbjct: 312 IKNAIDELDAGGSTAGGAGLKMAYDL------AREHFKKDGNNR---IILASDGDFNVGP 362 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQA----EAADQFLKNCASPDRFY-----SVQNS-- 348 N++ + ++ G + +G ++ + L N + Y + + Sbjct: 363 SSNEDMETLIEKERQSGVSLSVLGFGMVNLKDSKMELLANKGHGNYAYIDNLMEAKKAMI 422 Query: 349 RKLHDAFLRIGKEMVKQR 366 + F + K++ Q Sbjct: 423 SEFGATFFTVAKDVKMQV 440 >gi|317488474|ref|ZP_07947025.1| von Willebrand factor type A domain-containing protein [Eggerthella sp. 1_3_56FAA] gi|316912406|gb|EFV33964.1| von Willebrand factor type A domain-containing protein [Eggerthella sp. 1_3_56FAA] Length = 551 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 71/185 (38%), Gaps = 23/185 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + + K S G +++ ++DVS SM+D DKL + S + Sbjct: 162 PWNDQTKLLVMGFATE-KDGDASSAGANLVFLIDVSGSMDDP-----DKLPLVKDSFATL 215 Query: 203 LDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++ + R +VT++S +++ + I ++ L+ +T GLE Sbjct: 216 VEGLTERD------RVSVVTYASGERVLLEGVPGDDKRRIMRAVDGLVAEGSTNGEAGLE 269 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + ++ + + ++ +DG+ + E + + G + Sbjct: 270 QAYR-LAESSFIEGGVNR--------VVMASDGDLNVGISSESELHDFVERKRETGVYLS 320 Query: 321 AIGVQ 325 +G Sbjct: 321 VLGFG 325 >gi|326911040|ref|XP_003201870.1| PREDICTED: collagen alpha-1(XIV) chain-like [Meleagris gallopavo] Length = 438 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 70/194 (36%), Gaps = 18/194 (9%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DV + G G + + I M +++ +R + + K L Sbjct: 51 VADVLFLVGHSQGAGKESSQLLKDFISSMARSFENVVMGKGGIRLAVALYGEKPRMCIEL 110 Query: 234 AW--GVQHIQEKINRLIFGSTT-KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ + I + ++ K L +A + + H +D K ++ + Sbjct: 111 TDYVTIEEMLVAIQEISIKGSSLKVGSALAFAAHAM-------SHPDTLREDAAKVVVLI 163 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNS 348 T G++S D + L G V+A+G++ + K + P + V + Sbjct: 164 TSGKSSDLVEDKAQVLQ------DAGVTVFAVGIKDADKHELNKIASEPTAEHVIYVDDF 217 Query: 349 RKLHDAFLRIGKEM 362 LH+A ++ + + Sbjct: 218 HLLHNAAQKLSRRL 231 >gi|311253435|ref|XP_001924360.2| PREDICTED: collagen alpha-1(XIV) chain [Sus scrofa] Length = 1795 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 24/214 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 VK ++ D+++++D S S+ RS E +++ + + Sbjct: 144 EEVKFFCQTPAIADIVILVDGSWSIGRFNF-------RLVRSFLE--NLVTAFNVGSEKT 194 Query: 217 RSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R GL +S + L A+ + + E + L + T A N IF+ K E Sbjct: 195 RIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKPE 251 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 A+ K I +TDG++ I +L + G ++AIGV+ ++ + Sbjct: 252 AGARTG--VSKIGILITDGKSQDDVIPPSRNL------RESGVELFAIGVKNADENELRE 303 Query: 335 NCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + PD Y+V +H + + + + Sbjct: 304 IASEPDNTHVYNVAEFDLMHTVVESLTRTVCSRV 337 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 75/198 (37%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ D +K+ S LD I + + +V F+ Sbjct: 1031 DLVFMVDGSWSIGDE---NFNKITNFLYSTVGALDKI-----GADGTQVAMVQFTDDPRT 1082 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + I + + G TK+ +++ + +F A E K Sbjct: 1083 EFKLNTYKTKETLLDAIKHISYKGGNTKTGKAIKHVRDNLFTA-ESGIRRGIP-----KV 1136 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I+ +TDG + + E + G ++A+GV + + + P + Sbjct: 1137 IVVITDGRSQD------DVNKISKEMQLDGYSIFAVGVADADYSELVSIGSKPSARHVFF 1190 Query: 345 VQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 VDD----FDAFKKIEDEL 1204 >gi|312072174|ref|XP_003138945.1| hypothetical protein LOAG_03360 [Loa loa] gi|307765891|gb|EFO25125.1| hypothetical protein LOAG_03360 [Loa loa] Length = 1596 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 65/393 (16%), Positives = 138/393 (35%), Gaps = 58/393 (14%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILD--HSLLY 58 +++ N++ Y G + + +PV E + ++ + +LL Sbjct: 722 VNYFNLQR-NYVRVGVMKYGDKVQIPVSLGDYNTQTELLSRISETRRMRGEANLGQALLD 780 Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTD---FRNELRENGFAQDINNIERSTSLS 115 + + L + + + FSY + W+ + R++ + + F D+ N T Sbjct: 781 ASGEFLIFGSK-DIPRIVIIFSYGQPRGEWKENARLLRDDTKAHIFLVDVGNQGDKTQNL 839 Query: 116 IIIDDQHKDYNLS------AVSRYEMPF-------IFCTFPWCANSSHAP-----LLITS 157 I+ + + ++ S PF + + P + + + Sbjct: 840 AIVGESNPHRIITIDEWHGVNSEILSPFMDELCRLLPQRQDKTSRDGTWPTRQTEMRVAT 899 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 V+I ++ D D+M +LD S ++ I ++D I + ++VR Sbjct: 900 PVRICNRVDFQADVMFILDSSDNVTSEEYSN------LKEGISMLIDEIFDLSP--DIVR 951 Query: 218 SGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G V +S K PL + VQ + + N G T GL A + Sbjct: 952 VGFVEYSDKASVPVPLGYYDNKVQLLADISNSEQLGGTPIILRGLRAAKEQFQRHGRD-- 1009 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAADQFL 333 + + ++ +T G N F ++ + ++ + V A Q + Sbjct: 1010 -------NVSRILLLVTSGANRGNVA------FAADDLREHLNVSIFVLVVNASQGAQIM 1056 Query: 334 KNCASPDRFYS-----VQNSRKLHDA-FLRIGK 360 N + D + + ++ KL +A L+IG+ Sbjct: 1057 LNRLTSDEYTQQRVISISSANKLQEAELLQIGQ 1089 >gi|145494949|ref|XP_001433468.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124400586|emb|CAK66071.1| unnamed protein product [Paramecium tetraurelia] Length = 611 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 96/256 (37%), Gaps = 44/256 (17%) Query: 121 QHKDYNLSAVSRYEMPFIFCTF-----PWCANSSHAPLLITSSVKISSKSD--------I 167 Q + L ++Y++ C S++ I + + I +K + I Sbjct: 129 QKQSAKLQNKNKYDLQSSIAFEINSLRTSCKVSNYKSEYIPAMISIKTKENQTEMTERTI 188 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G+D++ ++D S+SM+ D + + +S+ +LD + R ++TF+ Sbjct: 189 GIDLICLIDKSMSMSG------DNINMVKKSLLLLLDFLG------EQDRLQIITFNEHA 236 Query: 228 VQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + PL + ++ I+++ TK + A+ ++ KEK+ Sbjct: 237 QRLTPLKCLTEKNKQYFQAVISQISAEGLTKISSATYIAFKQL---KEKVYRNNVTS--- 290 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 + L+DG + + + + + E + G + Q + + + Sbjct: 291 ---VFLLSDGHDGDALFEISDQIRHVKEV----FTISTFGFGDDHDAQMMTSISNLKNGN 343 Query: 342 FYSVQNSRKLHDAFLR 357 FY V++ L + F Sbjct: 344 FYYVKDITLLDEFFAH 359 >gi|313238340|emb|CBY13422.1| unnamed protein product [Oikopleura dioica] Length = 345 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 65/187 (34%), Gaps = 29/187 (15%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 DI +D+ ++D S S+ S+ ++ + V+ Sbjct: 128 PKTSGVCGDISVDLQFIVDSSSSVTRKNFGFAKNFVANVSSVFDL---------RSGDVQ 178 Query: 218 SGLVTFSSKIVQTFPLAWGVQH----IQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEK 272 G++T+S+ + + G H EK+ + + T + L Y Sbjct: 179 VGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGDTHTGTALRY----------- 227 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + ++ K +IF+TDG I + + +G ++AIGV + Sbjct: 228 ISTNNRWREEVPKILIFVTDGTPQDRAIVPAAARSL----RDKGVRIFAIGVGNAVESEL 283 Query: 333 LKNCASP 339 + + P Sbjct: 284 KEIASEP 290 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/137 (20%), Positives = 49/137 (35%), Gaps = 21/137 (15%) Query: 236 GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 V ++EK+ ++ F T + LE A + + + K I+ +TDG+ Sbjct: 8 DVDELKEKLMKVPFIQGKTNTGGALERAQQMLAEGRP----------SVPKIILLITDGD 57 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 + D + + K+ ++Y IGV L A+ + F V +R Sbjct: 58 AT----DKERLDAQIEKLKKSNILIYTIGVGDLIDRNELNRIATDEDF--VYETRD---- 107 Query: 355 FLRIGKEMVKQRILYNK 371 F I K K Sbjct: 108 FDSISKIKSSLLGRVCK 124 >gi|165975965|ref|YP_001651558.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876066|gb|ABY69114.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 529 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 55/361 (15%), Positives = 113/361 (31%), Gaps = 38/361 (10%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G +++ +L I ++ + +E++ +A+L L+ ++L + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 67 ENGNNGKKQKNDFSYRIIK----------NIWQTDFRNEL----RENGFAQDINNIERST 112 + K ++ S I + L +N + +T Sbjct: 70 RKTTDYKLGGSNPSDESFNISSEVGKRDHAIVTAFVKTFLPQTDEKNMHLTPLCKTINNT 129 Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 S +S ++ F + S + +I +D+M Sbjct: 130 SGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVNVASQSRAIKKNTFNIPIDLM 189 Query: 173 MVLDVSLSMNDHFGP-------GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS- 224 +V D+S SMN K+ + + E+ D D N R + F+ Sbjct: 190 VVADLSGSMNFDLDNNEIKKTGKPSKISILKEVLVELADKTLLSEDANQNNRIYVTPFAL 249 Query: 225 ----SKIVQTFPLAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + P +W Q+I+ +N+ ++ Y Sbjct: 250 GAEINNNNCALPYSWSVESSSRTQNIKNILNKQN-SQYNRADLINNLVYKISTKETLDNI 308 Query: 275 HIAKG-HDDYKKYIIFLTDGENSSPNI----DNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + + + + K L D + S+ D E Y K GA + + GV A Sbjct: 309 NGKQNYNVTFSKNAFCLKDMKTSNKGWYSRSDKLEFTNYVQSIKANGATLASSGVLVAAN 368 Query: 330 D 330 + Sbjct: 369 N 369 >gi|313247257|emb|CBY15545.1| unnamed protein product [Oikopleura dioica] Length = 409 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 42/232 (18%), Positives = 83/232 (35%), Gaps = 35/232 (15%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 E+ F+ + I S+S + +++ ++ F S PLL S Sbjct: 26 EHHFSSEFTVIVDSSSQKETLQIPAENFE-----NVQVLFEENDDEIEEISRKRPLLCES 80 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 LD++++LD S S+++ + S+ +D + R Sbjct: 81 ---------RPLDLVILLDSSRSIDEKSWLLQKESVERMASVLFPIDDFNT--------R 123 Query: 218 SGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFG--STTKSTPGLEYAYNKIFDAKEKL 273 L+ + L+ + I+EK+ RL T GL A+ + Sbjct: 124 ISLIRYGYSAFLAHRLSEEQSYPMIKEKLFRLEHTYEDQTNVHFGLRKAFTEFSTCPRDR 183 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + KK I+ +TDGE + P I ++ AK+ ++ + + Sbjct: 184 R-----NQNSKKAIVIITDGEFTEPKIAYEDLKI----AKKNEVEIFILAIG 226 >gi|47218379|emb|CAG01900.1| unnamed protein product [Tetraodon nigroviridis] Length = 683 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 64/184 (34%), Gaps = 31/184 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++D S S++ L + S+ +ML+ + VN +V F+ K Sbjct: 229 DMLILVDASGSVSGL------TLKLIQISVSKMLETLSDDDYVN------VVYFNDKAKY 276 Query: 230 TFPLAW-------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ + + TT + G E A+ ++ + Sbjct: 277 ASCFENLVQANVRNKRMLKKAVQNITAKGTTNYSGGFELAFEQLAQMNVSRANCN----- 331 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG-VQAEAADQFLKNCASPDR 341 K I+ TDG +E N + +++G + CA+ Sbjct: 332 --KIIMLFTDG----GEEKAEEIFKKYNPNQEVRIFTFSVGQHNYDKGPIQWMACANKGY 385 Query: 342 FYSV 345 +Y + Sbjct: 386 YYEI 389 >gi|225621507|ref|YP_002722766.1| hypothetical protein BHWA1_02609 [Brachyspira hyodysenteriae WA1] gi|225216328|gb|ACN85062.1| putative membrane protein containing von Willebrand factor (vWA) type A domain [Brachyspira hyodysenteriae WA1] Length = 324 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 68/217 (31%), Gaps = 32/217 (14%) Query: 121 QHKDYNLSAVSR-YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 K+Y + R + + F+ + P KI + + + + + LD+S Sbjct: 21 NDKNYKRISNLRIFSIIFMILASASLVFALMQPKWGIIEQKIKTDNYM---VTIALDLSR 77 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 SM+ +L A I + + ++ LV F+ P ++ Sbjct: 78 SMDADDVWP-SRLERAKLEIEKFVKKTDNLS-------VALVGFAGTSFIASPFTQDMET 129 Query: 240 IQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 ++ L S T+ L A N KK II +TDGE+ Sbjct: 130 FTYILDNLTTKSVTLQGTRIADALVTAKNTFNVDA-----------VSKKSIILITDGED 178 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 D+ + VY +GV E Sbjct: 179 HGGYFDD-----ILKQLNEMNVSVYTVGVGTEVGATI 210 >gi|113475584|ref|YP_721645.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166632|gb|ABG51172.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 1204 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 61/163 (37%), Gaps = 15/163 (9%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++++LD S SM+ + + ++ +K + V ++TF+S+I Sbjct: 633 IILLLDTSYSMSGE------AITELNQGVKIFQASVKEDELASLRVEIAVITFNSEIEVV 686 Query: 231 FPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + + I + L T +E A + K+ ++ + +I Sbjct: 687 Q----DFVTVDKFIPKTLEASGVTHMGKAIEKALELLEKRKQDYKNSDIQYYRP--WIFL 740 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +TDG+ + D + + R + +A+GV+ + Sbjct: 741 ITDGQPTDTWQDAAKKIEEAET--NRKLLFFAVGVRDADMETL 781 >gi|224047663|ref|XP_002193801.1| PREDICTED: vitrin [Taeniopygia guttata] Length = 746 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 68/193 (35%), Gaps = 35/193 (18%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D+ V+D S S+ G + + I K + R G V ++ Sbjct: 561 AADIGFVIDGSSSV------GTGNFRTVLQFVAN---ISKEFEISDTDTRIGAVQYT--Y 609 Query: 228 VQTFPLAWGVQHIQEKI-NRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 Q ++ ++ + N + T + + YA ++F K + Sbjct: 610 EQRLEFSFDKYSTKQDVLNAIKRISYWSGGTSTGAAISYASEQLFS---------KSKPN 660 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--- 339 +K +I +TDG + + A + G I Y++GV A D+ L+ AS Sbjct: 661 KRKIMILITDGRSYD------DVSVPAMAAHQNGVIAYSVGVAWAAPDE-LEAIASDPAK 713 Query: 340 DRFYSVQNSRKLH 352 + + V L+ Sbjct: 714 EHSFFVDEFDNLY 726 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 51/159 (32%), Gaps = 16/159 (10%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 +D+S M+ + G + + R + + + G+V + F L Sbjct: 360 VDLSFLMDGSWSIGKRRFQLQKRFLGNVAQALGISSAGPL---MGIVQYGDDPSTEFNLK 416 Query: 235 W--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + ++ I ++ + + A + + + +G ++ L D Sbjct: 417 TYVNSKDLRNAIEKIQQKGGLSN---VGKALSFVNKNFFLDANGNRGGAP--NVVVVLVD 471 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 G + + A+ G ++ + V A A + Sbjct: 472 GWPTDRVEEASRL------ARESGINIFFVTVAAAAQSE 504 >gi|309271626|ref|XP_003085374.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain [Mus musculus] Length = 3284 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 516 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 573 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 574 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 633 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 634 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 681 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 682 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 741 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 742 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 793 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 794 LVYLMDDFRSL 804 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 67/192 (34%), Gaps = 22/192 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S S G D+ + + D+++S+ +N LV Sbjct: 32 KNGAAADIVFLVDSSWS------AGKDRFLLVQEFLS---DVVESLAVGDNDFHFALVRL 82 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I + + + T + + ++ D Sbjct: 83 NGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTG---KGLEYVIHSHLTEASGSRAAD 139 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + II LTDG++ E K V+A+GV+ + + P Sbjct: 140 GVPQVIIVLTDGQSEDGFALPSA------ELKSADVNVFAVGVEGADERALGEVASEPLS 193 Query: 340 DRFYSVQNSRKL 351 ++++N L Sbjct: 194 MHVFNLENVTSL 205 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1470 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1523 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 1524 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 1583 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 1584 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1642 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1694 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1695 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1750 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1751 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 >gi|309264114|ref|XP_003086228.1| PREDICTED: collagen alpha-3(VI) chain isoform 5 [Mus musculus] Length = 2349 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 310 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 367 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 368 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 427 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 428 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 475 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 476 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 535 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 536 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 587 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 588 LVYLMDDFRSL 598 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1264 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1317 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 1318 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 1377 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 1378 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1436 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1437 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1488 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1489 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1544 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1545 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1598 >gi|309264110|ref|XP_003086224.1| PREDICTED: collagen alpha-3(VI) chain isoform 1 [Mus musculus] Length = 3057 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 310 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 367 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 368 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 427 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 428 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 475 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 476 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 535 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 536 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 587 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 588 LVYLMDDFRSL 598 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1264 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1317 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 1318 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 1377 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 1378 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1436 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1437 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1488 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1489 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1544 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1545 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1598 >gi|309264106|ref|XP_003086227.1| PREDICTED: collagen alpha-3(VI) chain isoform 4 [Mus musculus] Length = 3062 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 310 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 367 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 368 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 427 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 428 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 475 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 476 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 535 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 536 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 587 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 588 LVYLMDDFRSL 598 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1264 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1317 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 1318 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 1377 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 1378 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1436 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1437 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1488 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1489 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1544 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1545 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1598 >gi|281348290|gb|EFB23874.1| hypothetical protein PANDA_022043 [Ailuropoda melanoleuca] Length = 426 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 42/111 (37%), Gaps = 20/111 (18%) Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T GL + I + + II LTDGE+ + Sbjct: 53 EASGGTSICSGLRAGFQAIIHSNQSTSGSE---------IILLTDGEDDQ--------IS 95 Query: 308 YC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFYSVQNSRKLHDAF 355 C E K+ GA+++ I + AA + L N RFY+ ++ L DAF Sbjct: 96 SCFEEVKQSGAVIHTIALGPSAARELETLSNMTGGYRFYANKDINGLTDAF 146 >gi|309264108|ref|XP_003086225.1| PREDICTED: collagen alpha-3(VI) chain isoform 2 [Mus musculus] Length = 3263 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 516 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 573 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 574 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 633 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 634 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 681 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 682 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 741 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 742 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 793 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 794 LVYLMDDFRSL 804 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 67/192 (34%), Gaps = 22/192 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S S G D+ + + D+++S+ +N LV Sbjct: 32 KNGAAADIVFLVDSSWS------AGKDRFLLVQEFLS---DVVESLAVGDNDFHFALVRL 82 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I + + + T + + ++ D Sbjct: 83 NGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTG---KGLEYVIHSHLTEASGSRAAD 139 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + II LTDG++ E K V+A+GV+ + + P Sbjct: 140 GVPQVIIVLTDGQSEDGFALPSA------ELKSADVNVFAVGVEGADERALGEVASEPLS 193 Query: 340 DRFYSVQNSRKL 351 ++++N L Sbjct: 194 MHVFNLENVTSL 205 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1470 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1523 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 1524 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 1583 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 1584 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1642 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1694 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1695 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1750 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1751 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 >gi|148708139|gb|EDL40086.1| mCG12867, isoform CRA_e [Mus musculus] Length = 2555 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 516 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 573 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 574 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 633 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 634 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 681 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 682 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 741 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 742 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 793 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 794 LVYLMDDFRSL 804 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 67/192 (34%), Gaps = 22/192 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S S G D+ + + D+++S+ +N LV Sbjct: 32 KNGAAADIVFLVDSSWS------AGKDRFLLVQEFLS---DVVESLAVGDNDFHFALVRL 82 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I + + + T + + ++ D Sbjct: 83 NGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTG---KGLEYVIHSHLTEASGSRAAD 139 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + II LTDG++ E K V+A+GV+ + + P Sbjct: 140 GVPQVIIVLTDGQSEDGFALPSA------ELKSADVNVFAVGVEGADERALGEVASEPLS 193 Query: 340 DRFYSVQNSRKL 351 ++++N L Sbjct: 194 MHVFNLENVTSL 205 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1470 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1523 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 1524 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 1583 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 1584 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1642 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1694 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1695 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1750 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1751 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 >gi|148708136|gb|EDL40083.1| mCG12867, isoform CRA_b [Mus musculus] Length = 3261 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 516 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 573 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 574 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 633 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 634 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 681 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 682 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 741 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 742 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 793 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 794 LVYLMDDFRSL 804 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 67/192 (34%), Gaps = 22/192 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S S G D+ + + D+++S+ +N LV Sbjct: 32 KNGAAADIVFLVDSSWS------AGKDRFLLVQEFLS---DVVESLAVGDNDFHFALVRL 82 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I + + + T + + ++ D Sbjct: 83 NGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTG---KGLEYVIHSHLTEASGSRAAD 139 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + II LTDG++ E K V+A+GV+ + + P Sbjct: 140 GVPQVIIVLTDGQSEDGFALPSA------ELKSADVNVFAVGVEGADERALGEVASEPLS 193 Query: 340 DRFYSVQNSRKL 351 ++++N L Sbjct: 194 MHVFNLENVTSL 205 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1470 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1523 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 1524 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 1583 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 1584 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1642 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1694 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1695 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1750 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1751 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 >gi|148708137|gb|EDL40084.1| mCG12867, isoform CRA_c [Mus musculus] Length = 2207 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 516 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 573 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 574 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 633 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 634 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 681 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 682 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 741 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 742 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 793 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 794 LVYLMDDFRSL 804 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 67/192 (34%), Gaps = 22/192 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S S G D+ + + D+++S+ +N LV Sbjct: 32 KNGAAADIVFLVDSSWS------AGKDRFLLVQEFLS---DVVESLAVGDNDFHFALVRL 82 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I + + + T + + ++ D Sbjct: 83 NGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTG---KGLEYVIHSHLTEASGSRAAD 139 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + II LTDG++ E K V+A+GV+ + + P Sbjct: 140 GVPQVIIVLTDGQSEDGFALPSA------ELKSADVNVFAVGVEGADERALGEVASEPLS 193 Query: 340 DRFYSVQNSRKL 351 ++++N L Sbjct: 194 MHVFNLENVTSL 205 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1470 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1523 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 1524 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 1583 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 1584 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1642 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1694 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1695 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1750 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1751 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 >gi|301781320|ref|XP_002926070.1| PREDICTED: collagen alpha-1(VII) chain-like [Ailuropoda melanoleuca] Length = 2994 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGR------GNFREVRGFLEGLVLPFSGAASA-QGVRFAAVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|281343115|gb|EFB18699.1| hypothetical protein PANDA_015680 [Ailuropoda melanoleuca] Length = 2904 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 7 YAADIVFLLDGSSSIGR------GNFREVRGFLEGLVLPFSGAASA-QGVRFAAVQYSDD 59 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + + +F L +A+ Sbjct: 60 PRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 113 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 114 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 165 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 166 FFFFVNDFSIL 176 >gi|117620923|ref|YP_857205.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117562330|gb|ABK39278.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 4260 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/186 (21%), Positives = 69/186 (37%), Gaps = 18/186 (9%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM----DKLGVATRSIREMLDII 206 P +SV I+ +++MV+D S SM G ++L +A ++ M+D Sbjct: 3184 VPTAKDNSVVITEAGLPPFNLVMVIDTSGSMLWQIGTSTNGSPNRLELAKDALNHMIDSY 3243 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNK 265 ++ ++ F+S V P + I+ L G T L A N+ Sbjct: 3244 VALGVPLVFT---VIDFASGAVL-IPQTSDPDVAKASISGLPTDGGGTNYNAPLVLAQNQ 3299 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + L + A + K Y FL+DG + N+ + F VYA+G+ Sbjct: 3300 LTA---DLANPALAGYETKVY--FLSDGAPNEGNVPAGWTSFV----NSNNVEVYAVGLN 3350 Query: 326 AEAADQ 331 Sbjct: 3351 VSGNAT 3356 >gi|28900585|ref|NP_800240.1| hypothetical protein VPA0730 [Vibrio parahaemolyticus RIMD 2210633] gi|28808965|dbj|BAC62073.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 466 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 47/353 (13%), Positives = 115/353 (32%), Gaps = 75/353 (21%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLY-- 58 M N++ F KG I+ LP++ I + + + +K+ + + L Sbjct: 1 MQGFNMKGFTK-QKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALI 59 Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSII 117 + N+++ + ++ + + I +I + ++G Q N + +++ Sbjct: 60 ASPGKDNKDDQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVV 119 Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 +H +S E+ + + L +D+ +LD Sbjct: 120 ATAEHDS----WISHNEIGVEPKFKVSGDSITRKYLP------------QPVDIYFILDT 163 Query: 178 SLSMN--DHFGPGMDKLGVATRSIR---EMLDIIKSIPDVNNVVRSGLVTFSSK------ 226 S SM+ + ++ V +I + L+ K+ PD + R L+T+++ Sbjct: 164 SQSMSNPWYGERNKTQMQVVKDTITRVVKELENFKTGPDKKS--RVALLTYNAYNAKFDK 221 Query: 227 -------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK-------------- 265 + + ++++ S + P YNK Sbjct: 222 GAGRVKLYDYASEFSHTEASFESIVDKMFDKSVVEQKPHYASDYNKSQDIPLTDKYQEFI 281 Query: 266 ---------------------IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + A ++ + + K + ++ I L+DG ++ Sbjct: 282 DILNSNKVMPARGGGTQSWLGLIAAAKEADKVKKEDRNPEQVFIILSDGADTD 334 >gi|302652520|ref|XP_003018108.1| hypothetical protein TRV_07884 [Trichophyton verrucosum HKI 0517] gi|291181717|gb|EFE37463.1| hypothetical protein TRV_07884 [Trichophyton verrucosum HKI 0517] Length = 240 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 62/165 (37%), Gaps = 20/165 (12%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + N + I +K + + D+++V+D+S SMN + G Sbjct: 39 ILSIHSIPNKDSMIVSIQPPLKPGNDVPHVPCDIVLVIDISGSMNSAAPIPTGEKGGEDT 98 Query: 198 SIREMLDIIK-----SIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLI 248 + +LD+ K I +N R +VTF +++ F L + + +++L Sbjct: 99 GL-SILDLTKHAAKTIIETLNEKDRLAVVTFCTEVNVAFELDYMNKENKSTVLSAVDKLY 157 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 S+T G++ N + + ++ LTDG Sbjct: 158 GKSSTNLWHGIKKGLNVLATN---------PAQGKIQSLLVLTDG 193 >gi|170734866|ref|YP_001773980.1| hypothetical protein Bcenmc03_6370 [Burkholderia cenocepacia MC0-3] gi|169820904|gb|ACA95485.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3] Length = 423 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 16/126 (12%), Positives = 42/126 (33%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + +G+++I+ + L V+ +GL ++ + +++L D L A + + Sbjct: 12 RRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + + + + + N D T ++ K Sbjct: 72 ISLSVAEADGIAAGHLNFVFFQKKSVQMSTNANVTFSDSLTNPFLTKNAVTTPANIKYVQ 131 Query: 127 LSAVSR 132 +A Sbjct: 132 CTATLS 137 >gi|108758240|ref|YP_629592.1| putative lipoprotein [Myxococcus xanthus DK 1622] gi|108462120|gb|ABF87305.1| putative lipoprotein [Myxococcus xanthus DK 1622] Length = 659 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 63/195 (32%), Gaps = 39/195 (20%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPG-------MDKLGVA------------TRSI 199 + S S + +++V+D S SM PG + + R++ Sbjct: 49 TRPPSPSGFPVKVVVVIDESGSMCVSDPPGAQLDNGFCQRREILDIIPEGVTEPARVRAL 108 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG-------VQHIQEKINRLI--FG 250 + ++ + + V+ + F + + +P +I I L G Sbjct: 109 KRLVQQFREVNAQGGNVQVSVAPFETNVRNVWPPTTTGDRFARPDNNIDSYIEGLQSQLG 168 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY-IIFLTDG---------ENSSPNI 300 T L YAY+ I + +Y ++FLTDG +N S Sbjct: 169 KGTDYQGALSYAYSLISSDINAVAQSNPELLPRTRYVVVFLTDGTPYPRCSATDNLSVYA 228 Query: 301 DNKES-LFYCNEAKR 314 D L + + + Sbjct: 229 DPDNPDLTWADSLRD 243 >gi|179762|gb|AAA51903.1| calcium channel alpha-2b subunit [Homo sapiens] Length = 1091 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 65/187 (34%), Gaps = 37/187 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGV---QAEAADQFLKNCAS 338 K I+ TDG + + N+ K + V+ V E C + Sbjct: 356 --KIIMLFTDG-------GEERAQEIFNKYNKDKKVRVFRFSVGQHNYERGPIQWMACEN 406 Query: 339 PDRFYSV 345 +Y + Sbjct: 407 KGYYYEI 413 >gi|291394751|ref|XP_002713732.1| PREDICTED: collagen, type XXVIII [Oryctolagus cuniculus] Length = 1132 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 60/179 (33%), Gaps = 24/179 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ ++D S S + + + D I + +++ +R + FSS Sbjct: 47 IDIVFIVDSSES------SKIVHFDKQKDFVERLSDKIFQVTPGHSLKYDIRLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + W ++ +++ L G T S + K Sbjct: 101 SVQIDPPFSSWKDLKTFKQRAKSLNLIGQGTFSYYAISNVTRLFKREGRKNG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K + +TDG + N D K +A+ G IG + + L+ + Sbjct: 153 -VKVALLMTDGIDHPKNPDVKS---ISEDARTSGISFITIGHSTDVNEAKLRLISGDSS 207 >gi|256084538|ref|XP_002578485.1| hypothetical protein [Schistosoma mansoni] gi|238663860|emb|CAZ34723.1| loss of heterozygosity 11 chromosomal region 2 gene a protein homolog (mast cell surface antigen 1) (masa-1), putative [Schistosoma mansoni] Length = 828 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 70/202 (34%), Gaps = 36/202 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + S D+ + + ++D S SM D + A S+ L KS+P R + Sbjct: 287 VVSSKDMRYEFVFLIDRSGSMEG------DNISYAKTSLLLFL---KSLPMS---CRFQI 334 Query: 221 VTFSSKIVQTFPLAWG-----VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLE 274 + F S FP + L T++ L+ A Sbjct: 335 IGFGSDFAALFPEPTDYSEGSLNTAMNYQKDLNADMGGTEAYNALKAAL----------- 383 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + + K IIFLTDG+ N D L N K R V+ IG+ + + Sbjct: 384 HSTPSGEGWFKQIIFLTDGD--VGNADEVIGLVRMNVDKAR---VFTIGLGQGVSTALIG 438 Query: 335 NCASPDRFYS--VQNSRKLHDA 354 A + V++ +L A Sbjct: 439 GVARAGNGTAEFVRDPSQLQSA 460 >gi|224142781|ref|XP_002324729.1| predicted protein [Populus trichocarpa] gi|222866163|gb|EEF03294.1| predicted protein [Populus trichocarpa] Length = 751 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 75/220 (34%), Gaps = 45/220 (20%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S+ +++ V+D+S SM G ++ +A + LD S ++ Sbjct: 319 SRKVFRKEIVFVVDISGSME---GAPLEGTKIALSAALTNLDSKDSFN---------IIA 366 Query: 223 FSSKIVQ---TFPLA--WGVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F+ + + LA V+ E ++ LI G T L+ A + + Sbjct: 367 FNGETYLFSSSMELASEDTVERAVEWMSMNLIAGGDTNILVPLKQATEMLSKS------- 419 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR---GAIVY----AIGVQAEAA 329 G +I +TDG ++ C+ K G ++ G+ + Sbjct: 420 --GGSIP--FIFLVTDG-------AVEDERHICDIMKSHITGGGSIHPRICTFGIGSYCN 468 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 FL+ A S ++ + + + ++ + I Sbjct: 469 HHFLRMLAMISRGQYDAAYDIDSVESRMQKLLSRISSTII 508 >gi|89896615|ref|YP_520102.1| hypothetical protein DSY3869 [Desulfitobacterium hafniense Y51] gi|89336063|dbj|BAE85658.1| hypothetical protein [Desulfitobacterium hafniense Y51] Length = 641 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 57/160 (35%), Gaps = 21/160 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + G ++ V+D S SM ++G +I +L+ G++ Sbjct: 450 REKRTGATLLFVVDASGSMG-----AKRRMGAVKGAILSLLN-----DAYQKRDSIGMIA 499 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + V Q+ + L G T GL AY + + + Sbjct: 500 FRKDGAEVLLNITRSVDLAQKCLETLPTGGKTPLAAGLAKAYELLKVDR-------IKNP 552 Query: 282 DYKKYIIFLTDGENSSPNIDN---KESLFYCNEAKRRGAI 318 + +YII ++DG+ + P + K++L + + G Sbjct: 553 EALQYIILVSDGKANLPLFSDQALKDALIVGKKIRHEGIR 592 >gi|332832625|ref|XP_520182.3| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Pan troglodytes] Length = 3571 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K I +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVIFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|193788254|dbj|BAG53148.1| unnamed protein product [Homo sapiens] Length = 437 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 73/215 (33%), Gaps = 37/215 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S G+ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGS--------NGIYPWVEVQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FL--- 333 KK +I +TDGE + D+ + +++R YA+ V + FL Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEI 317 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 K AS F++V + L D +G + Sbjct: 318 KYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSL 352 >gi|7766811|pdb|1CK4|A Chain A, Crystal Structure Of Rat A1b1 Integrin I-Domain. gi|7766812|pdb|1CK4|B Chain B, Crystal Structure Of Rat A1b1 Integrin I-Domain Length = 198 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 73/213 (34%), Gaps = 37/213 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++VLD S S + + ++L + P G+V + + Sbjct: 5 LDIVIVLDGSNS--------IYPWESVIAFLNDLLKRMDIGPKQTQ---VGIVQYGENVT 53 Query: 229 QTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + + N++ T + G++ A + F K Sbjct: 54 HEFNLNKYSSTEEVLVAANKIGRQGGLQTMTALGIDTARKEAFTEARGARRG------VK 107 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---------QAEAADQFLKN 335 K ++ +TDGE S N K+ + C ++I + E + +K+ Sbjct: 108 KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKS 163 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 AS F++V + L +G+ + Sbjct: 164 IASEPTEKHFFNVSDELALVTIVKALGERIFAL 196 >gi|304437159|ref|ZP_07397120.1| magnesium chelatase [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369821|gb|EFM23485.1| magnesium chelatase [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 627 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 57/153 (37%), Gaps = 21/153 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++ ++D S SM +++ + +I +L + V GL+ Sbjct: 441 RAKRTAANILFLVDASGSMG-----ARERMRMVKGAILSLLQE---AYQKRDCV--GLIA 490 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ ++++ L G T GL +A + + + + Sbjct: 491 FRRDRAETLLPMTRSVELAEKQLRELPTGGRTPLAEGLAHAMQMLHELERRGGG------ 544 Query: 282 DYKKYIIFLTDGENSSPNID--NKESLFYCNEA 312 K ++ +TDG ++ D + +L E Sbjct: 545 --KNVLVLVTDGRANTKEGDAGVQRALQAAEEV 575 >gi|126653689|ref|ZP_01725608.1| BatA [Bacillus sp. B14905] gi|126589726|gb|EAZ83861.1| BatA [Bacillus sp. B14905] Length = 973 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 76/209 (36%), Gaps = 32/209 (15%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + T+ + + ++ V+D S SM + ++M++ I + N Sbjct: 683 IFTNPYFSKNSCSLATEVAYVVDYSSSMK--------AVDPTNYRGKKMIEFINQLKAKN 734 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 N+V + ++K ++ T G++ A K + + Sbjct: 735 NIV----IETNTKATILGEGTTDAVLKKDLYKASKEKGATDIFAGIDIALTKFSNDTKTA 790 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQ 331 K I+ ++DG+ S + NEAK++G +Y + + ++ D Sbjct: 791 -----------KAIVVVSDGKTSKSKMT-----KVINEAKKQGVKIYTVSMGKKSQINDA 834 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L + + +Y ++ +LH F ++ Sbjct: 835 TLMQVSTETGGAYYYALDNLQLHQVFQKL 863 >gi|89899605|ref|YP_522076.1| hypothetical protein Rfer_0795 [Rhodoferax ferrireducens T118] gi|89344342|gb|ABD68545.1| conserved hypothetical protein [Rhodoferax ferrireducens T118] Length = 424 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 69/168 (41%), Gaps = 8/168 (4%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G+++I+ +++ V+ +GL ++ H + K +L D L + ++ + Sbjct: 18 QRGAVAIVVGLMMAVLVGFIGLALDGGHLYLTKTELQNSADACALAASYELTGAPSIAPA 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELREN-----GFAQDINNIERSTS-LSIIIDDQHKDYN 126 + + + + + + + DF+N + F D++ + ++ Y Sbjct: 78 SFARAEAAGQAVGQMNKVDFQNSAIASSDIVVSFGTDLSAGNAAIKWVNAGAALPSSKYV 137 Query: 127 LSAVSRYE-MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 ++R MP+ ++ L T+++ ++++ G+ M + Sbjct: 138 RCTITRSNIMPWFMQVLMPSLDTLTVSSLATATL-APAQNNCGIPMAI 184 >gi|297290486|ref|XP_001113553.2| PREDICTED: complement factor B isoform 1 [Macaca mulatta] Length = 1266 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 79/224 (35%), Gaps = 36/224 (16%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 ++LD S SM + G A + + +++ + S R LVT+++ Sbjct: 763 IILDPSGSMNIYLVLDGSDSIGAGNFTGAKKCLVNLIEKVASYGVKP---RYALVTYATY 819 Query: 227 I--------VQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ W + + E IN S T + L+ Y+ + ++ Sbjct: 820 PRIWVKVSDQESSNADWVTKKLSE-INYEDHKLKSGTNTKRALQAVYSMMSWPEDIP--- 875 Query: 277 AKGHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAE 327 +G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 876 PEGWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPL 935 Query: 328 AADQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L D F ++ E + Sbjct: 936 VDQVNINALASKKDNEQHVFKVKDMENLEDVFFQMIDESQSLSL 979 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 13/110 (11%) Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +FS + T P Q ++ N G+ T + L Y + + + L Sbjct: 160 TSFSHMLGATNP----TQRTKDHEN----GTGTNTYAALNSVYLMMNNQMQLLGMKTMAW 211 Query: 281 DDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 + + II LTDG+++ D + N+ + +YAIGV Sbjct: 212 QEIRHAIILLTDGKSNMGGSPKTAVDQIREILNINQKRNDYLDIYAIGVG 261 >gi|224024928|ref|ZP_03643294.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] gi|224018164|gb|EEF76162.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] Length = 339 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 76/233 (32%), Gaps = 31/233 (13%) Query: 106 NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS--VKISS 163 +N R L D + + AVS+Y P + W A + +L K+ + Sbjct: 26 SNYRRRKKLRQYGDPVLMAHLMPAVSKYR-PDVKFWLMWSALAMIIFMLARPQFGSKMET 84 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G++ ++ LD+S SM +L + + I +++ N + ++ F Sbjct: 85 VKRQGVETVVALDISNSMLAQDVTP-SRLEKSKKLISRLVETF-------NNDKVAMIVF 136 Query: 224 SSKIVQTFPLAWGVQHIQEKINR----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + P+ + + LI T ++ A Sbjct: 137 AGEAFAQLPITSDYISAKMFLESISPSLIATQGTDIRGAIDLAMKSFT-----------P 185 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ + I+ +TDGEN + V+ +GV + Sbjct: 186 NEGVGRAIVLITDGENHEGGAVEAAKAAAEKGVR-----VFVLGVGSPDGSPI 233 >gi|326670654|ref|XP_002663415.2| PREDICTED: collagen alpha-3(VI) chain [Danio rerio] Length = 3218 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 76/218 (34%), Gaps = 28/218 (12%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 ++P + ++ + T +V + D++ ++D S S+ + Sbjct: 1374 QLPGVEQQLLTKVSTMTRDEISTPAVPRDPLNLGRKDIIFLIDGSDSVGQSGVAHIRD-- 1431 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL--IF 249 I +++D + PD VR LV + + F L Q + I RL + Sbjct: 1432 ----FILKVVDQLDVRPDQ---VRVALVQYGERPKTEFSLNSHDNKQSVISAIKRLRHMG 1484 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G ++Y E ++++ LT G ++S L Sbjct: 1485 GRGADLAEAIKYVIR-----NELQASAGVRLAQASQHLVVLTGGRSTSDVSTYGSIL--- 1536 Query: 310 NEAKRRGAIVYAIGVQAEAAD--QFLKNCASPDRFYSV 345 +G+ V IG+ AE AD Q ++ S D V Sbjct: 1537 -----KGSRVNCIGIGAENADSRQLIQIATSSDDVLQV 1569 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 91/312 (29%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + L Y+ D+ ++ + S+ + + L+E G Sbjct: 282 LYTGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVNAA-----ASSLKELGV 336 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TS Sbjct: 337 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSPTVT 396 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 397 AEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIEANRDRVSVV 446 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 447 QYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNVFTASSGSRR 504 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + + Sbjct: 505 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRELQRISYEA 558 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 559 NYALSVADFSEL 570 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 96/306 (31%), Gaps = 25/306 (8%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L Y+ D+ ++ + S+ + + L+E G Sbjct: 88 LQYLRDNVFTASSGSRRVEGVPQLLILLSGARSFDNVDT-----PASSLKELGVLIFAIG 142 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 S S + Q Y LS ++P + N + T+S I+ Sbjct: 143 SRSSDSQELQRISQEPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQ 202 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ +LD S + F ++ M++ + + + R +V +S + Sbjct: 203 RRDVVFLLDGSDGTRNGFP-------AMKDFVQRMVEKLDVAENRD---RISVVQYSREP 252 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I + + L G A + D ++ + + Sbjct: 253 GANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTG--EALQYVRDNVFTASSGSRRLEGVPQ 310 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ L+ G + SL K G + + IG + + + + P SV Sbjct: 311 ILVLLSGGRSFDSVNAAASSL------KELGVLTFGIGSRGSDSRELQRISYEPSYALSV 364 Query: 346 QNSRKL 351 + +L Sbjct: 365 SDFSEL 370 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 78/219 (35%), Gaps = 21/219 (9%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + P+ T +V + D D++ ++D + + + F D + + LD Sbjct: 1189 AQLRPVYPTETVVPVPRGDKR-DVVFLIDGTSKIRNEFPAIRDMVQRVVEKLDVGLDN-- 1245 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 VR +V +S F L + +++ I RL + G Y Sbjct: 1246 --------VRVSVVQYSDDPKLEFLLNEHSTKEEVRQAIRRLRNKGGNELNTGKALEYVS 1297 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +++I +T G+++ N+ K A+G + Sbjct: 1298 KTIYQRSAGSRVEEGVP--QFLILVTGGKSNDDVSGP------ANQLKLSRVAPLAVGAR 1349 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A++ SP+ Y++++ ++L ++ ++ Sbjct: 1350 DADAEELRLISFSPELAYTIRDFQQLPGVEQQLLTKVST 1388 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 37/286 (12%), Positives = 95/286 (33%), Gaps = 31/286 (10%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + ++ +L L + ++ G+ + +Y I + + EN Sbjct: 1496 YVIRNELQASAGVRLAQASQHLV---VLTGGRSTSDVSTYGSILKGSRVNCIGIGAENAD 1552 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ + I S+ ++ + + F + P+ I + Sbjct: 1553 SRQLIQIATSSD-DVLQVPSFPNL-------PNIQNKFIARLSGSIVVEPPIEIDETTPG 1604 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ D++ ++D S+++ + + + +L++I + ++ GL Sbjct: 1605 LPQAKAA-DIVFLVDGSINLGRNNFKEVMEF---------ILNLIDLFYTERDRLQIGLA 1654 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ + F L I I R + + G A + E ++ Sbjct: 1655 HYATDVTDVFYLNTYNNKDDIINAITRAEYKGGREIRTG--SAIRHVQKTHFVKEKGSRK 1712 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + ++ +T G + + L K G VYA+GV Sbjct: 1713 DEGIPQILMVVTGGRSRDDSKSAALGL------KASGVRVYAVGVG 1752 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 33/276 (11%), Positives = 79/276 (28%), Gaps = 20/276 (7%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 + L+E G S S + Y LS ++ + N++ Sbjct: 927 ASSLKELGVLIFAIGSRSSDSQELQRVSNEPSYALSVREFSDLSNVQQKLLSNINTALVE 986 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + ++ D++ ++D S + F + + ++ Sbjct: 987 VTSLTTTVTVESEGPKKDIIFLIDGSEDVGREFPIIQEFVRRVVSNLNV----------G 1036 Query: 213 NNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 N +R G+V + L + + I L TK G A + + Sbjct: 1037 ENKIRIGVVQYGDIPNADIYLNSHRTKEGVLNGIKELRQLRGTKRNLG--QAIDFVRREV 1094 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + ++++ ++ G+ + K+ G +IG + Sbjct: 1095 LASGRGGRKEEGVPQFVVVVSGGKATD------NIRQSATALKQSGIFPLSIGTRDVGTQ 1148 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + P Y V + L+ + + + Sbjct: 1149 ELQVTSYVPRFAYLVDDLPGLYTIQETLINTLTELS 1184 >gi|220941746|emb|CAX15447.1| novel protein similar to vertebrate collagen, type VI, alpha 3 (COL6A3) [Danio rerio] Length = 2026 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 76/218 (34%), Gaps = 28/218 (12%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 ++P + ++ + T +V + D++ ++D S S+ + Sbjct: 244 QLPGVEQQLLTKVSTMTRDEISTPAVPRDPLNLGRKDIIFLIDGSDSVGQSGVAHIRD-- 301 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL--IF 249 I +++D + PD VR LV + + F L Q + I RL + Sbjct: 302 ----FILKVVDQLDVRPDQ---VRVALVQYGERPKTEFSLNSHDNKQSVISAIKRLRHMG 354 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G ++Y E ++++ LT G ++S L Sbjct: 355 GRGADLAEAIKYVIR-----NELQASAGVRLAQASQHLVVLTGGRSTSDVSTYGSIL--- 406 Query: 310 NEAKRRGAIVYAIGVQAEAAD--QFLKNCASPDRFYSV 345 +G+ V IG+ AE AD Q ++ S D V Sbjct: 407 -----KGSRVNCIGIGAENADSRQLIQIATSSDDVLQV 439 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 78/219 (35%), Gaps = 21/219 (9%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + P+ T +V + D D++ ++D + + + F D + + LD Sbjct: 59 AQLRPVYPTETVVPVPRGDKR-DVVFLIDGTSKIRNEFPAIRDMVQRVVEKLDVGLDN-- 115 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 VR +V +S F L + +++ I RL + G Y Sbjct: 116 --------VRVSVVQYSDDPKLEFLLNEHSTKEEVRQAIRRLRNKGGNELNTGKALEYVS 167 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +++I +T G+++ N+ K A+G + Sbjct: 168 KTIYQRSAGSRVEEGVP--QFLILVTGGKSNDDVSGP------ANQLKLSRVAPLAVGAR 219 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A++ SP+ Y++++ ++L ++ ++ Sbjct: 220 DADAEELRLISFSPELAYTIRDFQQLPGVEQQLLTKVST 258 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 37/286 (12%), Positives = 95/286 (33%), Gaps = 31/286 (10%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + ++ +L L + ++ G+ + +Y I + + EN Sbjct: 366 YVIRNELQASAGVRLAQASQHLV---VLTGGRSTSDVSTYGSILKGSRVNCIGIGAENAD 422 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 ++ + I S+ ++ + + F + P+ I + Sbjct: 423 SRQLIQIATSSD-DVLQVPSFPNL-------PNIQNKFIARLSGSIVVEPPIEIDETTPG 474 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ D++ ++D S+++ + + + +L++I + ++ GL Sbjct: 475 LPQAKAA-DIVFLVDGSINLGRNNFKEVMEF---------ILNLIDLFYTERDRLQIGLA 524 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ + F L I I R + + G A + E ++ Sbjct: 525 HYATDVTDVFYLNTYNNKDDIINAITRAEYKGGREIRTG--SAIRHVQKTHFVKEKGSRK 582 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + ++ +T G + + L K G VYA+GV Sbjct: 583 DEGIPQILMVVTGGRSRDDSKSAALGL------KASGVRVYAVGVG 622 >gi|256084540|ref|XP_002578486.1| hypothetical protein [Schistosoma mansoni] gi|238663861|emb|CAZ34724.1| loss of heterozygosity 11 chromosomal region 2 gene a protein homolog (mast cell surface antigen 1) (masa-1), putative [Schistosoma mansoni] Length = 652 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/202 (21%), Positives = 70/202 (34%), Gaps = 36/202 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + S D+ + + ++D S SM D + A S+ L KS+P R + Sbjct: 287 VVSSKDMRYEFVFLIDRSGSMEG------DNISYAKTSLLLFL---KSLPMS---CRFQI 334 Query: 221 VTFSSKIVQTFPLAWG-----VQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLE 274 + F S FP + L T++ L+ A Sbjct: 335 IGFGSDFAALFPEPTDYSEGSLNTAMNYQKDLNADMGGTEAYNALKAAL----------- 383 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + + K IIFLTDG+ N D L N K R V+ IG+ + + Sbjct: 384 HSTPSGEGWFKQIIFLTDGD--VGNADEVIGLVRMNVDKAR---VFTIGLGQGVSTALIG 438 Query: 335 NCASPDRFYS--VQNSRKLHDA 354 A + V++ +L A Sbjct: 439 GVARAGNGTAEFVRDPSQLQSA 460 >gi|61806576|ref|NP_001013521.1| hypothetical protein LOC541376 [Danio rerio] gi|60649627|gb|AAH90437.1| Zgc:113325 [Danio rerio] Length = 450 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 69/187 (36%), Gaps = 34/187 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + D NV R F+ K Sbjct: 220 DMVIIVDVSGSVSGL------TLKLMKTSVIEMLDTLSD-DDYVNVAR-----FNEKAYA 267 Query: 230 TFPLAW--------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 P + +E + + TT G ++A++++ + + Sbjct: 268 VVPCFTTLVQANIKNKKIFKEAVMNMQAKGTTDYKTGFQFAFDQLLND------TSAPRA 321 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCAS 338 + K I+ TDG D + +F R V+ V D L+ C + Sbjct: 322 NCNKMIMMFTDG-----GEDRAQDIFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWIACFN 376 Query: 339 PDRFYSV 345 ++ + Sbjct: 377 KGYYFEI 383 >gi|327270788|ref|XP_003220170.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 930 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 72/212 (33%), Gaps = 41/212 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S M ++L R+ + L I + G+VTF+SK Sbjct: 312 VCLVLDASAQMGKD-----NRLSRLIRAAKLFLLQI-----IEEGSWVGIVTFNSKANIQ 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + + G+ A+ E Sbjct: 362 AGLQKVFSDKERESLTSHL-PTTASGDSDICEGVTTAFQVFSRKLTSTEGCE-------- 412 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFY 343 I+ LT+GE L C ++ + + I++ I ++A+++ K + + + Sbjct: 413 -IVLLTNGEGLD--------LSPCLSKIQSQEIIIHTIAFGSKASNELEKLADMTGGKTF 463 Query: 344 SVQNSRK---LHDAFLRIGK---EMVKQRILY 369 +S L DAF I + +Q I Sbjct: 464 YATDSLDSNGLIDAFGGISSGSGDASQQSIQL 495 >gi|148697327|gb|EDL29274.1| procollagen, type XIV, alpha 1, isoform CRA_c [Mus musculus] Length = 1802 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 83/223 (37%), Gaps = 26/223 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDV 212 VK ++ D+++++D S S+ R +R L+ + + Sbjct: 148 TSPEEVKFFCETPAIADIVILVDGSWSIGRF----------NFRLVRNFLENLVTAFNVG 197 Query: 213 NNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + R GL +S + L A+ + + + + L + T A N IF+ Sbjct: 198 SEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTG---LALNFIFENS 254 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K E ++ K I +TDG++ I +L + G ++AIGV+ Sbjct: 255 FKPEAGSRSG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADLS 306 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + PD Y+V +H + + + + +K Sbjct: 307 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDK 349 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ D D + + + I + + +V F+ Sbjct: 1038 DLVFMVDGSWSIGD------DNFNKIINFLYSTVGALDKI--GADGTQVAMVQFTDDPRT 1089 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + I + + G TK+ +++ + +F + + K Sbjct: 1090 EFKLDSYKTKETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTS------DSGTRRGIPKV 1143 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I+ +TDG + + E + G ++AIGV + ++ + P + Sbjct: 1144 IVVITDGRSQD------DVNKISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFF 1197 Query: 345 VQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1198 VDD----FDAFKKIEDEL 1211 >gi|226423922|ref|NP_851794.3| collagen alpha-1(XIV) chain [Mus musculus] gi|148697325|gb|EDL29272.1| procollagen, type XIV, alpha 1, isoform CRA_a [Mus musculus] gi|219519338|gb|AAI45245.1| Collagen, type XIV, alpha 1 [Mus musculus] gi|223459902|gb|AAI38346.1| Collagen, type XIV, alpha 1 [Mus musculus] Length = 1794 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 83/223 (37%), Gaps = 26/223 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDV 212 VK ++ D+++++D S S+ R +R L+ + + Sbjct: 143 TSPEEVKFFCETPAIADIVILVDGSWSIGRF----------NFRLVRNFLENLVTAFNVG 192 Query: 213 NNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + R GL +S + L A+ + + + + L + T A N IF+ Sbjct: 193 SEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTG---LALNFIFENS 249 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K E ++ K I +TDG++ I +L + G ++AIGV+ Sbjct: 250 FKPEAGSRSG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADLS 301 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + PD Y+V +H + + + + +K Sbjct: 302 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDK 344 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ D D + + + I + + +V F+ Sbjct: 1030 DLVFMVDGSWSIGD------DNFNKIINFLYSTVGALDKI--GADGTQVAMVQFTDDPRT 1081 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + I + + G TK+ +++ + +F + + K Sbjct: 1082 EFKLDSYKTKETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTS------DSGTRRGIPKV 1135 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I+ +TDG + + E + G ++AIGV + ++ + P + Sbjct: 1136 IVVITDGRSQD------DVNKISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFF 1189 Query: 345 VQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1190 VDD----FDAFKKIEDEL 1203 >gi|148697326|gb|EDL29273.1| procollagen, type XIV, alpha 1, isoform CRA_b [Mus musculus] Length = 1093 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 83/223 (37%), Gaps = 26/223 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDV 212 VK ++ D+++++D S S+ R +R L+ + + Sbjct: 143 TSPEEVKFFCETPAIADIVILVDGSWSIGRF----------NFRLVRNFLENLVTAFNVG 192 Query: 213 NNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + R GL +S + L A+ + + + + L + T A N IF+ Sbjct: 193 SEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTG---LALNFIFENS 249 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K E ++ K I +TDG++ I +L + G ++AIGV+ Sbjct: 250 FKPEAGSRSG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADLS 301 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + PD Y+V +H + + + + +K Sbjct: 302 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDK 344 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 81/216 (37%), Gaps = 30/216 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + +++ K K+ D++ ++D S S+ D D + + + I Sbjct: 364 GVSVSAPGKTLCKAAKA-DLVFMVDGSWSIGD------DNFNKIINFLYSTVGALDKI-- 414 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFD 268 + + +V F+ F L + + + I + + G TK+ +++ + +F Sbjct: 415 GADGTQVAMVQFTDDPRTEFKLDSYKTKETLLDAIRHISYKGGNTKTGKAIKHVRDTLFT 474 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + + K I+ +TDG + + E + G ++AIGV Sbjct: 475 S------DSGTRRGIPKVIVVITDGRSQD------DVNKISREMQADGFNIFAIGVADAD 522 Query: 329 ADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEM 362 + ++ + P + V + DAF +I E+ Sbjct: 523 YSELVQIGSKPSSRHVFFVDD----FDAFKKIEDEL 554 >gi|146345398|sp|Q80X19|COEA1_MOUSE RecName: Full=Collagen alpha-1(XIV) chain; Flags: Precursor Length = 1797 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 83/223 (37%), Gaps = 26/223 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDV 212 VK ++ D+++++D S S+ R +R L+ + + Sbjct: 143 TSPEEVKFFCETPAIADIVILVDGSWSIGRF----------NFRLVRNFLENLVTAFNVG 192 Query: 213 NNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + R GL +S + L A+ + + + + L + T A N IF+ Sbjct: 193 SEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTG---LALNFIFENS 249 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K E ++ K I +TDG++ I +L + G ++AIGV+ Sbjct: 250 FKPEAGSRSG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADLS 301 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + PD Y+V +H + + + + +K Sbjct: 302 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDK 344 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ D D + + + I + + +V F+ Sbjct: 1033 DLVFMVDGSWSIGD------DNFNKIINFLYSTVGALDKI--GADGTQVAMVQFTDDPRT 1084 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + I + + G TK+ +++ + +F + + K Sbjct: 1085 EFKLDSYKTKETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTS------DSGTRRGIPKV 1138 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I+ +TDG + + E + G ++AIGV + ++ + P + Sbjct: 1139 IVVITDGRSQD------DVNKISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFF 1192 Query: 345 VQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1193 VDD----FDAFKKIEDEL 1206 >gi|118372347|ref|XP_001019370.1| TPR Domain containing protein [Tetrahymena thermophila] gi|89301137|gb|EAR99125.1| TPR Domain containing protein [Tetrahymena thermophila SB210] Length = 1208 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 78/196 (39%), Gaps = 25/196 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D+S +M +D + + D R L++F+ + Sbjct: 1016 DVVFLMDISETMCQSKYKAIDSFVSIRENYIKQFD------------RLALISFNHNVNV 1063 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 F L ++ + I++ + + G + YN I+D + ++++ Sbjct: 1064 CFELQVSGKN-DKFIDKYFKDAKNLACTGDKALYNAIYDGIKLF--KKTEPQPNSRWLVA 1120 Query: 290 LTDGENSSPNIDNKE--SLFYCNEAKRRGAIVYAIGVQAEAADQ--FLKNCA-SPDRFYS 344 +TD ++ ID ++ LF+ N K + IG+ ++ + +LK C + F Sbjct: 1121 ITDNVDNYSRIDEEQLKPLFFQNNVK-----LILIGLNLKSNAKEVYLKLCKNTGGTFIE 1175 Query: 345 VQNSRKLHDAFLRIGK 360 +S L+ AF I Sbjct: 1176 NPDSMDLNVAFQSITN 1191 >gi|30420885|gb|AAO64442.1| collagen type XIV [Mus musculus] Length = 1797 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 83/223 (37%), Gaps = 26/223 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDV 212 VK ++ D+++++D S S+ R +R L+ + + Sbjct: 143 TSPEEVKFFCETPAIADIVILVDGSWSIGRF----------NFRLVRNFLENLVTAFNVG 192 Query: 213 NNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + R GL +S + L A+ + + + + L + T A N IF+ Sbjct: 193 SEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTG---LALNFIFENS 249 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K E ++ K I +TDG++ I +L + G ++AIGV+ Sbjct: 250 FKPEAGSRSG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADLS 301 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + PD Y+V +H + + + + +K Sbjct: 302 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDK 344 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ D D + + + I + + +V F+ Sbjct: 1033 DLVFMVDGSWSIGD------DNFNKIINFLYSTVGALDKI--GADGTQVAMVQFTDDPRT 1084 Query: 230 TFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + I + + G TK+ +++ + +F + + K Sbjct: 1085 EFKLDSYKTKETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTS------DSGTRRGIPKV 1138 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I+ +TDG + + E + G ++AIGV + ++ + P + Sbjct: 1139 IVVITDGRSQD------DVNKISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFF 1192 Query: 345 VQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1193 VDD----FDAFKKIEDEL 1206 >gi|107028246|ref|YP_625341.1| hypothetical protein Bcen_5496 [Burkholderia cenocepacia AU 1054] gi|116687157|ref|YP_840404.1| hypothetical protein Bcen2424_6782 [Burkholderia cenocepacia HI2424] gi|105897410|gb|ABF80368.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054] gi|116652872|gb|ABK13511.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424] Length = 423 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 43/126 (34%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + +G+++I+ + L V+ +GL ++ + +++L D L A + + Sbjct: 12 RRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + + + +T + N D T ++ K Sbjct: 72 ISLSVAEADGIAAGHLNFVFFQKTSVQMSTNANVTFSDSLTNPFLTKNAVTTPANIKYVQ 131 Query: 127 LSAVSR 132 +A Sbjct: 132 CTATLS 137 >gi|321460553|gb|EFX71594.1| hypothetical protein DAPPUDRAFT_326977 [Daphnia pulex] Length = 1000 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/193 (22%), Positives = 76/193 (39%), Gaps = 30/193 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++V+DVS SM +DKLG + R+ + DV + G+V FSS Sbjct: 326 FVVVMDVSDSMKQC--NRIDKLGESVRAWIK--------NDVPTGSQLGMVMFSSTAHIV 375 Query: 231 FPLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L ++ QE + ++ S T GL+ A + KG++ Sbjct: 376 SELQVISDMKIRQEMMKKVPKDLYSITCIGCGLDLAVQML---------QEKGNNKTGGI 426 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRFY- 343 I+ +TDG NS+ +D + + + G V + A++ + L + +Y Sbjct: 427 IVLVTDGRNSAGYLDISDVE---EDIVKAGIRVVTVAFGSEADSNIERLADVTGGKSYYI 483 Query: 344 -SVQNSRKLHDAF 355 +S L AF Sbjct: 484 KDGDSSEALQRAF 496 >gi|291388471|ref|XP_002710799.1| PREDICTED: collagen, type XIV, alpha 1 [Oryctolagus cuniculus] Length = 1796 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 76/198 (38%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ D D + + + I + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGD------DNFNKIINFLYSTVGALDKI--GTDGTQVAMVQFTDDPRT 1083 Query: 230 TFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L A+ + + + I R+ + G TK+ +++ + +F A E K Sbjct: 1084 EFKLNAYETKETLLDAIKRISYKGGNTKTGKAIKHVRDTLFTA-ESGTRRGIP-----KV 1137 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I+ +TDG + + E + G ++A+GV + + + P + Sbjct: 1138 IVVITDGRSQD------DVNKISREMQSDGYNIFAVGVADADYSELVNIGSKPSARHVFF 1191 Query: 345 VQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1192 VDD----FDAFKKIEDEL 1205 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 82/215 (38%), Gaps = 26/215 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 VK ++ D+++++D S S+ R +R L+ + + + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFNVGSEK 194 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GL +S + L A+ + + E + L + T A N IF+ K Sbjct: 195 TRVGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKP 251 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E ++ K I +TDG++ I +L + G ++AIGV+ ++ Sbjct: 252 EAGSRTGVP--KIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQ 303 Query: 334 KNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + PD Y+V +H + + + + Sbjct: 304 EIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 >gi|290986713|ref|XP_002676068.1| predicted protein [Naegleria gruberi] gi|284089668|gb|EFC43324.1| predicted protein [Naegleria gruberi] Length = 413 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 86/217 (39%), Gaps = 35/217 (16%) Query: 158 SVKISSKSDIGLDMMMVLDV--SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S K L++++VLD+ S+S + K+ VA + I E++D +K Sbjct: 35 SCKDGKNERKDLNLVIVLDISGSMSSSMSGKGSESKMKVANKVICEIIDNLKDFE----- 89 Query: 216 VRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-AK 270 R G+V F K PL + ++E++ ++ +T G++ + Sbjct: 90 -RLGIVLFDDKAETFLPLTIVQDLEKKSLKERVMKITEKGSTNFEAGMKRGIDLFSTMDS 148 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 L + + II+LTD + D+ + L + G + IG+ + Sbjct: 149 SDLSNSNR--------IIYLTDACPNVGGTDSLDVLTKDANSGPYGILSTFIGIGLDFNS 200 Query: 331 QF------LKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 + ++ C ++SV++S + F +I E Sbjct: 201 EIVEELTKVRGC----NYFSVKSSEE----FKKILNE 229 >gi|163658505|gb|ABY28382.1| complement component factor B/C2 [Branchiostoma belcheri tsingtauense] Length = 752 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 73/209 (34%), Gaps = 33/209 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S+ GLD++ LD S S++ A + R +++ + + + LVT Sbjct: 290 SRGSAGLDLVFALDKSSSIDAV------DFNQAIQFTRSIINEF-GVTNKEGGTQVALVT 342 Query: 223 FSSKIVQTFPLAWGV------QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEH 275 F S+ L W + + + ++ +L G T T L+ N + Sbjct: 343 FGSQAQ----LEWNLGQLDSKRKVFRQLRQLQPEGGGTALTAALQTVLNDVLPVTRAGA- 397 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 K+ + +TDG+++ + E V+A+GV A L + Sbjct: 398 --------KRALFIITDGKSNVGASPGVFARRLREE---EAFEVFAVGVGANIDKNELNS 446 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRIGKE 361 AS + + + I ++ Sbjct: 447 VASQPFTSHVFLINDFSNFDTLVNTIAEK 475 >gi|323138702|ref|ZP_08073768.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322396057|gb|EFX98592.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 333 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 70/200 (35%), Gaps = 28/200 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGM------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 G ++M++D S SMN+ F K A R ++ +D ++V G+ Sbjct: 82 GAQIVMLIDRSGSMNETFAGRNPSGGEESKAAAAKRILKTFVDS-----RGRDLV--GVA 134 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKG 279 FS+ + PL+ + + I+ + T GL A + + Sbjct: 135 AFSTSPMLVTPLSDHIDATKAAIDAMDRPGLDYTNIARGLAMALSMFRASDADHSRA--- 191 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 I+ ++DG + ID K + ++ +Y + ++ E + + + Sbjct: 192 -------ILLISDG---AGVIDPKIQDDLRADFRKNRVNLYWLFLRTEGSRGIYGSSEAE 241 Query: 340 DRFYSVQNSRKLHDAFLRIG 359 R L F +G Sbjct: 242 SESPQAAPERHLDLYFKSLG 261 >gi|149923516|ref|ZP_01911918.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] gi|149815646|gb|EDM75176.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] Length = 716 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/129 (23%), Positives = 45/129 (34%), Gaps = 18/129 (13%) Query: 245 NRLIFGSTTKSTPGLE------YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 N GS T + GL+ AY+ + + Y + I +TDG+ + Sbjct: 440 NGFCSGSGTYTHLGLQLVKDYQQAYSGSTMNNDMAPYPTADETLY--FNILITDGQYNGY 497 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQ----AEAADQFLKNCA-----SPDRFYSVQNSR 349 + N + E G Y IG AA L+N A S + +Y N Sbjct: 498 ST-NAQVQGELEEMYNSGITTYVIGFGDGVDTAAAQAQLQNMAQWGSGSQNNYYDANNQT 556 Query: 350 KLHDAFLRI 358 +L A I Sbjct: 557 ELEQALTTI 565 >gi|149911407|ref|ZP_01900025.1| hypothetical protein PE36_11192 [Moritella sp. PE36] gi|149805515|gb|EDM65520.1| hypothetical protein PE36_11192 [Moritella sp. PE36] Length = 450 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 32/77 (41%), Gaps = 2/77 (2%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 G++ I+ I L + + L ++ H K KL ++D + L+ AT++ E Sbjct: 9 LRSKQNGNVLIVFTIALFALIGMASLALDGGHLLLNKGKLQNLVDSAALHAATEL--DEG 66 Query: 69 GNNGKKQKNDFSYRIIK 85 + + + + + Sbjct: 67 ATHEQARAAVVALIQLN 83 >gi|126306104|ref|XP_001362407.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 870 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 46/209 (22%), Positives = 82/209 (39%), Gaps = 41/209 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D S SM G +++L A++ +L II+ +G+VTF S Sbjct: 307 LVLVIDTSRSM--KVGNRLNRLRQASQFF--LLQIIEKGSW------TGIVTFDSSATIQ 356 Query: 231 FPL-----AWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + + ++ N + G GL A+ + + + Sbjct: 357 SELIQIESDVQRKTLISRLPNVTVAGGGAHICSGLRTAFMVV------KKKFLTDGSE-- 408 Query: 285 KYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAI--GVQAEAADQFLKNCASPDR 341 ++ LTDGE+++ N C E K+ GAI++ I G E + L + Sbjct: 409 --MVLLTDGEDNTTNT--------CFEEVKQSGAIIHTIVLGPSTEKGLEKLSEMTGGMK 458 Query: 342 FYSVQNSRK--LHDAFLRI--GKEMVKQR 366 + N + L DAF + G + QR Sbjct: 459 TTATDNVQNNGLIDAFSALSSGNAAITQR 487 >gi|308472879|ref|XP_003098666.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] gi|308268266|gb|EFP12219.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] Length = 382 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 69/196 (35%), Gaps = 29/196 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ LD+++V D S + + SI + +I+ + R G VT++S Sbjct: 28 DLWLDIVIVADNS-----QRVNQNNSVVDIQNSISNIFEIVPIPIN-----RVGFVTYNS 77 Query: 226 KIVQTFPL----AWG--VQHIQ-EKINRLIFGSTTK-STPGLEYAYNKIFDAKEKLEHIA 277 L +WG Q + N + T GL A + +++ A Sbjct: 78 LATINADLNKFKSWGDLSQGVNDSYNNMNLSSENTSFIGTGLITAGELL-----QVQGSA 132 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---QFLK 334 G Y K II N + +D L N K G + + V + + L Sbjct: 133 IGRVYYPKVIIVYASAFNGTGLLDP---LSVANTLKSAGITIITVAVDTDNNGVIQKQLA 189 Query: 335 NCASPDRFYSVQNSRK 350 + ASP +S+ Sbjct: 190 SIASPGSAFSLDPDDD 205 >gi|331007462|ref|ZP_08330636.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] gi|330418739|gb|EGG93231.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] Length = 699 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 67/209 (32%), Gaps = 36/209 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SS 225 G D + VLD+S SM+ + ++ + + + + R +V F + Sbjct: 294 TGSDWIFVLDISGSMSGKYATLVEGVR-------------QGLEKLRQQDRFKVVLFNNG 340 Query: 226 KIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + T V ++ ++++ G T GL+ I + Sbjct: 341 SVDLTSGFLTVSQANVANVLQQLDNYKVGGGTNLYAGLQKGLTGIDADRPAG-------- 392 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 II +TDG + + K L N ++ + A LK A Sbjct: 393 -----IILVTDGVANVGQTEKKTFLKLLN---DHDVRLFTFIMGNSANRPLLKEMADVSN 444 Query: 342 FYS--VQNSRKLHDAFLRIGKEMVKQRIL 368 ++ V N+ + + ++ Q Sbjct: 445 GFAMSVSNADDIVGHIMLATSKLTHQAFR 473 >gi|317125812|ref|YP_004099924.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589900|gb|ADU49197.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 577 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 78/217 (35%), Gaps = 25/217 (11%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +T+++ + + ++ V DVS SMN+ G ++ + + L+ + Sbjct: 359 VTAAIDVWQAATTSFQLLSVFDVSGSMNEKVGN-TTRVRITQEAAGIALNALPRS----- 412 Query: 215 VVRSGLVTFSS------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + GL FSS + PL + + R + S + ++D Sbjct: 413 -TKLGLWVFSSDKGGGRDYKELVPLGL----LSDDAQRARMAAAAASLSKEVDGWTGLYD 467 Query: 269 AKEKLEHIAKGHDDYKKY--IIFLTDGENSSP--NIDNKESLFYCNEAK--RRGAIVYAI 322 K D ++ ++ LTDG+N P + ++ L +A +R + I Sbjct: 468 TIWAAYSKVKASYDPQRVNAVVILTDGKNEDPGGGLSLEQLLAKIKDATDPKRPIAITTI 527 Query: 323 GVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLR 357 G+ + L+ + +Y +N + + Sbjct: 528 GIGPGVDAESLRKISRSSYSDYYGAENPADMTTVLAK 564 >gi|163850366|ref|YP_001638409.1| hypothetical protein Mext_0933 [Methylobacterium extorquens PA1] gi|163661971|gb|ABY29338.1| conserved hypothetical protein [Methylobacterium extorquens PA1] Length = 473 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 69/468 (14%), Positives = 140/468 (29%), Gaps = 126/468 (26%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 N +GSI++L A+ + ++GL ++ K +L D + L + +E Sbjct: 14 LASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGV--VTAKEF 71 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 +Q + + I +Q + + ++ + L I+ Q D +S Sbjct: 72 IAANAQQSDVMTAGIKAGEYQALKAFNVNASKVPFATVSL---SQLEIVRSGQTLDATVS 128 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM------- 181 + + F ++ V S+ LD +++DVS SM Sbjct: 129 YTATVQSTFGRLFGLSATTLTNR-------VNASADIAGYLDFYLMVDVSGSMGLPTTDS 181 Query: 182 --------------------------NDHFGPGMDKL--GVATRSIREMLDIIKSIPDVN 213 + G +L ++ +LD P V Sbjct: 182 DAALLASKSVEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASK-PIVP 240 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQE--------------KINRLIFGST---TKST 256 N R G+ F +++ PL+ + N L GST T + Sbjct: 241 NQYRIGIYPFINRLATLAPLSDTTTSLASLKTTADCGKAWPLAFTNLLDTGSTQLFTNND 300 Query: 257 PG---------LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 P E A ++ + + + K ++ +TDG +S + + Sbjct: 301 PTTGTGSGGTHFETALPQMKSTIRTFGNGS-SSANPKPFVFLITDGMQNSQTYSSWKDKK 359 Query: 308 Y----------------------------CNEAKRRGAIV------------YA------ 321 C + K+ GA + Y Sbjct: 360 TYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIPYNYVKSYTNEGTIV 419 Query: 322 -----IGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + L+ CASP F++ +++ + + + + +K Sbjct: 420 WENNRVNGFSPTLADPLRQCASPGLFFTANSAKDITASLGAMFDQALK 467 >gi|154252742|ref|YP_001413566.1| vault protein inter-alpha-trypsin subunit [Parvibaculum lavamentivorans DS-1] gi|154156692|gb|ABS63909.1| Vault protein inter-alpha-trypsin domain protein [Parvibaculum lavamentivorans DS-1] Length = 755 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 72/213 (33%), Gaps = 48/213 (22%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + SV+ +K + V+D S SM+ + A S+ LD +K N Sbjct: 337 SGSVQPEAKPREAI---FVIDNSGSMSGP------SMVQAKESLLWALDRLKPGDTFN-- 385 Query: 216 VRSGLVTFSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ F + FP A + ++ + L T+ P L Sbjct: 386 ----VIRFDDTLTVLFPDAVPAHGENLAVAKKFVKSLEANGGTEMLPALRA--------- 432 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAA 329 L + I+FLTDG + E+ + G + ++ +G+ Sbjct: 433 -SLIDRNVNDGTRLRQIVFLTDG------AISNEAELFHEITSNLGRSRLFTVGIG---- 481 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++P+ ++ + S F IGKE Sbjct: 482 -------SAPNSYFMTRASEAGRGTFTHIGKET 507 >gi|156743748|ref|YP_001433877.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235076|gb|ABU59859.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 936 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 79/220 (35%), Gaps = 27/220 (12%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHFGPGMDKLG 193 P F W A L +T + + + +++V+D+S SM + P + Sbjct: 378 PQSFGPGGWRGTPVEAALPVTMDI-PERQRQPPVSIVVVIDISGSMAATEDGIPKLSLAL 436 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ-EKINRLIFGST 252 R I +L + + R G++ PL + + E++N++ G + Sbjct: 437 EGARRIAALLRDEDELTVIPFDDRPGVIV--------GPLPGSRRDVAIEQLNQVRLGGS 488 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + I DA + + ++II +TDG ++ + +L Sbjct: 489 GIN----------IHDALRVAARYTRASERPVRHIITITDGNDT---TQQEGALDIVRSL 535 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRK 350 G + ++ + F+++ A+ R + + + Sbjct: 536 HDEGVTLTSVAIGQGDHVPFIRDMAAVGGGRTFLTERAAD 575 >gi|254491600|ref|ZP_05104779.1| von Willebrand factor type A domain protein [Methylophaga thiooxidans DMS010] gi|224463078|gb|EEF79348.1| von Willebrand factor type A domain protein [Methylophaga thiooxydans DMS010] Length = 338 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 73/207 (35%), Gaps = 33/207 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR--------SIREMLDIIKSIPDVNNVVR 217 G +++++LD S SM+ F V+ + + + +K+ PD R Sbjct: 81 GEGAEIVLLLDRSRSMDAPFAVKTQAAAVSVGGDNSKRSVARDYLTEFVKNRPD----DR 136 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST---TKSTPGLEYAYNKIFDAKEKLE 274 G V FS+K + PL + + I IN G T L + A Sbjct: 137 FGFVFFSTKAINLLPLTYDKESILATINANALGKGLSDTNMAEAL------VSSAGMFEG 190 Query: 275 HIAKGHDDYKKYIIFLTD-GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 +G + ++ ++D G+ S + S Y K +Y I +++ Sbjct: 191 QTYRGS----RIVLLVSDGGQQLSAEAKEQISRLY----KEMNLSIYWIYLKSNQEMSLD 242 Query: 334 KNCASPDRFYSVQ-NSRKLHDAFLRIG 359 + D RKLH F IG Sbjct: 243 EG--EDDNLLWADIPERKLHKFFQTIG 267 >gi|119468931|ref|ZP_01611956.1| hypothetical protein ATW7_04187 [Alteromonadales bacterium TW-7] gi|119447583|gb|EAW28850.1| hypothetical protein ATW7_04187 [Alteromonadales bacterium TW-7] Length = 664 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 69/177 (38%), Gaps = 34/177 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +M+ V+D S SM+ + A +++ L +++S N ++ F + Sbjct: 315 RLPREMVFVVDTSGSMHGQ------SIEQAKKALFYALSLLESDDSFN------IIGFDN 362 Query: 226 KIVQTF--PL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + PL + ++ + I L T+ L Sbjct: 363 NVTAMSDRPLIASDFNLRRAERFIYSLEADGGTEIQGAL------------DAVLDGSTF 410 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 D + + ++FLTDG ++ N+ +LF +AK + ++ +G+ + F++ A Sbjct: 411 DGFVRQVVFLTDG-----SVSNEATLFKNIQAKLGDSRLFTVGIGSAPNSFFMRRAA 462 >gi|73980138|ref|XP_540147.2| PREDICTED: similar to vitrin isoform 1 [Canis familiaris] Length = 649 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 67/199 (33%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 466 DIGFVIDGSSSV------GTGNFRTVLQFVANL---SKEFEISDTDTRIGAVQYTYEQRL 516 Query: 230 TFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F + +N + T + + YA ++F K + +K Sbjct: 517 EFGFD-DYNTKSDILNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKRK 566 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFY 343 +I +TDG + + A +G I YAIGV A D+ P D + Sbjct: 567 LMILITDGRSYD------DVRIPAMVAHHKGVITYAIGVAWAAQDELEVIATHPANDHSF 620 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V L+ ++ + + Sbjct: 621 FVDEFDHLYKFVPKVIQSI 639 >gi|306922629|gb|ADN07507.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 60/175 (34%), Gaps = 24/175 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 36 YAADIVFLLDGSSSIGRS------NFREVRGFLEGLVLPFSGAASA-QGVRFATVQYSDD 88 Query: 227 IVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L G ++ G T++ L + + +F L H+ + Sbjct: 89 PQTEFGLDALGSGGDTVRAIRELSYKGGNTRTGAALRHVSDHVF-----LPHLTRPGIP- 142 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K I +TDG++ + K +G ++A+G++ A + LK AS Sbjct: 143 -KVCILITDGKSQDLVD------PAAQKLKGQGVKLFAVGIK-NADPEELKRVAS 189 >gi|306922621|gb|ADN07500.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 60/175 (34%), Gaps = 24/175 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 36 YAADIVFLLDGSSSIGRS------NFREVRGFLEGLVLPFSGAASA-QGVRFATVQYSDD 88 Query: 227 IVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L G ++ G T++ L + + +F L H+ + Sbjct: 89 PQTEFGLDALGSGGDTVRAIRELSYKGGNTRTGAALRHVSDHVF-----LPHLTRPGIP- 142 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K I +TDG++ + K +G ++A+G++ A + LK AS Sbjct: 143 -KVCILITDGKSQDLVD------PAAQKLKGQGVKLFAVGIK-NADPEELKRVAS 189 >gi|128485808|ref|NP_001076062.1| integrin alpha-M [Ovis aries] gi|124127039|gb|ABM92271.1| CD11b [Ovis aries] Length = 1152 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 46/232 (19%), Positives = 89/232 (38%), Gaps = 30/232 (12%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F + +N P I ++++ + D D+ ++D S S++ M K S + Sbjct: 124 FLFGSNLLQKPRRIPAALRECPQQDS--DIAFLIDGSGSIDPVDFDRMKKFVSTVMSRFQ 181 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 + ++ ++ R+ TF+ + P + + I +L T + G+ Sbjct: 182 KSKTLFALMQYSDDFRT-HFTFNDFKRNSDP-----ELLVRPIGQLF--GRTHTATGIRK 233 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 ++F + + A K +I +TDGE +D E EA R+G I Y Sbjct: 234 VVRELFHSSSGARNHAL------KIMIVITDGE---KYLDPLEYRDVIPEADRKGIIRYV 284 Query: 322 IG----VQAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 IG ++ + + L AS D + V N L I ++ ++ Sbjct: 285 IGVGDAFNSKKSRKELDTIASKPPADHVFQVNNFEAL----KTIQNQLQEKI 332 >gi|148685685|gb|EDL17632.1| mCG133512, isoform CRA_a [Mus musculus] Length = 1164 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 78/214 (36%), Gaps = 24/214 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + +D+ ++D S S++ ++ ++ + S ++++ Sbjct: 145 PATMPECPGQEMDIAFLIDGSGSID------QSDFTQMKDFVKALMGQLASTSTSFSLMQ 198 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + F + Q + + I +L T + G++ ++F +K A Sbjct: 199 YSNILKTHFTFTEFKSSLSPQSLVDAIVQLQ--GLTYTASGIQKVVKELFHSKNGARKSA 256 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFL 333 K K +I +TDG+ D E EA++ G I YAIG + A Q L Sbjct: 257 K------KILIVITDGQKFR---DPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQEL 307 Query: 334 KNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 S D + V N L +I +++ Sbjct: 308 NTIGSAPSQDHVFKVGNFVALRSIQRQIQEKIFA 341 >gi|148685686|gb|EDL17633.1| mCG133512, isoform CRA_b [Mus musculus] Length = 1168 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 78/214 (36%), Gaps = 24/214 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + +D+ ++D S S++ ++ ++ + S ++++ Sbjct: 145 PATMPECPGQEMDIAFLIDGSGSID------QSDFTQMKDFVKALMGQLASTSTSFSLMQ 198 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + F + Q + + I +L T + G++ ++F +K A Sbjct: 199 YSNILKTHFTFTEFKSSLSPQSLVDAIVQLQ--GLTYTASGIQKVVKELFHSKNGARKSA 256 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFL 333 K K +I +TDG+ D E EA++ G I YAIG + A Q L Sbjct: 257 K------KILIVITDGQKFR---DPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQEL 307 Query: 334 KNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 S D + V N L +I +++ Sbjct: 308 NTIGSAPSQDHVFKVGNFVALRSIQRQIQEKIFA 341 >gi|124006669|ref|ZP_01691501.1| domain of unknown function protein [Microscilla marina ATCC 23134] gi|123987824|gb|EAY27515.1| domain of unknown function protein [Microscilla marina ATCC 23134] Length = 3238 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 44/136 (32%), Gaps = 17/136 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII----KSIPDVNNVVRSGLVTF 223 ++++ VLD+S SM + P D G R D + ++ RS L+ F Sbjct: 2207 PIEVIYVLDMSGSMKWEY-PKTDDAGKTISRFRAAQDALIYANSALAQQGMSSRSALIVF 2265 Query: 224 SS-KIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + Q + + L T + G+ A + Sbjct: 2266 NDTTAQVMSGFTNNFQQLNSIVENLGAPNGGTPMSKGMLSAKELLKTRSADK-------- 2317 Query: 282 DYKKYIIFLTDGENSS 297 K ++ +TDG + Sbjct: 2318 --KPVVVLITDGVPTY 2331 >gi|88911344|sp|Q3V0T4|ITAD_MOUSE RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|74215609|dbj|BAE21419.1| unnamed protein product [Mus musculus] Length = 1168 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 78/214 (36%), Gaps = 24/214 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + +D+ ++D S S++ ++ ++ + S ++++ Sbjct: 147 PATMPECPGQEMDIAFLIDGSGSID------QSDFTQMKDFVKALMGQLASTSTSFSLMQ 200 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + F + Q + + I +L T + G++ ++F +K A Sbjct: 201 YSNILKTHFTFTEFKSSLSPQSLVDAIVQLQ--GLTYTASGIQKVVKELFHSKNGARKSA 258 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFL 333 K K +I +TDG+ D E EA++ G I YAIG + A Q L Sbjct: 259 K------KILIVITDGQKFR---DPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQEL 309 Query: 334 KNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 S D + V N L +I +++ Sbjct: 310 NTIGSAPSQDHVFKVGNFVALRSIQRQIQEKIFA 343 >gi|148708138|gb|EDL40085.1| mCG12867, isoform CRA_d [Mus musculus] Length = 2281 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 516 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 573 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 574 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 633 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 634 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 681 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 682 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 741 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 742 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 793 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 794 LVYLMDDFRSL 804 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 67/192 (34%), Gaps = 22/192 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S S G D+ + + D+++S+ +N LV Sbjct: 32 KNGAAADIVFLVDSSWS------AGKDRFLLVQEFLS---DVVESLAVGDNDFHFALVRL 82 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I + + + T + + ++ D Sbjct: 83 NGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTG---KGLEYVIHSHLTEASGSRAAD 139 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + II LTDG++ E K V+A+GV+ + + P Sbjct: 140 GVPQVIIVLTDGQSEDGFALPSA------ELKSADVNVFAVGVEGADERALGEVASEPLS 193 Query: 340 DRFYSVQNSRKL 351 ++++N L Sbjct: 194 MHVFNLENVTSL 205 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 1470 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 1523 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 1524 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 1583 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 1584 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1642 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1643 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1694 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1695 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1750 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1751 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1804 >gi|325831996|ref|ZP_08165093.1| von Willebrand factor type A domain protein [Eggerthella sp. HGA1] gi|325486317|gb|EGC88769.1| von Willebrand factor type A domain protein [Eggerthella sp. HGA1] Length = 548 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 72/185 (38%), Gaps = 23/185 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + + K S G +++ ++DVS SM+D DKL + S + Sbjct: 162 PWNDQTKLLVMGFATE-KDGDASPTGANLVFLIDVSGSMDDP-----DKLPLVKDSFAAL 215 Query: 203 LDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++ + R +VT++S +++ + I ++ L+ +T GLE Sbjct: 216 VEGLTERD------RVSVVTYASGERVLLEGVPGDDKRRIMRAVDSLVAEGSTNGEAGLE 269 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + ++ + + ++ +DG+ + E + + + G + Sbjct: 270 QAYR-LAESSFIEGGVNR--------VVMASDGDLNVGISSESELHDFVEQKRETGVYLS 320 Query: 321 AIGVQ 325 +G Sbjct: 321 VLGFG 325 >gi|291486255|dbj|BAI87330.1| hypothetical protein BSNT_05611 [Bacillus subtilis subsp. natto BEST195] Length = 227 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 73/224 (32%), Gaps = 34/224 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV--R 217 K +++ ++ ++LD S SM G+ K A + I + + V V Sbjct: 26 KTETEAKAPANVAVLLDASGSMAKRID-GVSKFNSAKKEISKFASSLPEGTQVKMSVFGS 84 Query: 218 SGLVTFSSKIVQTFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 G S K+ + ++ Q +N + T L A + Sbjct: 85 EGNNKNSGKVQSCEAIRNVYGFQSFNEQSFLNSLNTIGPTGWTPIAKALNEAKSSF---- 140 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAA 329 + AKG +K + LTDGE + + E ++ V IG E Sbjct: 141 --DQLDAKG----EKVVYLLTDGEETCGG----NPIKTAKELQKENITVNVIGFDYKEGY 190 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 L A ++ V + + F +Q ++ +K Sbjct: 191 KGQLNAIAKVGGGEYFPVYTQKDVEKIF-------TQQSLMLSK 227 >gi|254447588|ref|ZP_05061054.1| conserved hypothetical protein [gamma proteobacterium HTCC5015] gi|198262931|gb|EDY87210.1| conserved hypothetical protein [gamma proteobacterium HTCC5015] Length = 1197 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 79/247 (31%), Gaps = 68/247 (27%) Query: 186 GPGMDKLGVAT-RSIREML-DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA---WGVQHI 240 D+ V ++++ +L D+I + +VN GL+ F K +P++ + Sbjct: 248 ENHNDEKTVTRLKAVQNVLHDVIGGMDNVN----FGLMDFYGKGRVIYPVSLVNDDRVQL 303 Query: 241 QEKINRLIFGSTTKSTPGLEY---------------AYNKIFDAKEKLEHIAKGHDDYKK 285 + ++ L T L + F++ K ++ Sbjct: 304 LKTVDDLEANGGTPLQEALHESGRYLMGDTYYYGERGVSSAFESDGKTYDSPILNECQAN 363 Query: 286 YIIFLTDGENSSPNIDN-----------KESLFYCNEAKRRG------------------ 316 ++ LTDGE + DN K C + G Sbjct: 364 AVVLLTDGEPTDYGTDNVIHDMIKPVHEKNGTRRCTSYRDNGNDYTRYCLPEIAQFMRDF 423 Query: 317 -----------AIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMV 363 A + +G +++ + L++ AS ++ +S +L A I +++ Sbjct: 424 DLSSGLDGDQTAQTFTVGFKSD--QELLQDTASKGQGNYFQANDSVELTQALKDIVRKIK 481 Query: 364 KQRILYN 370 + + Sbjct: 482 SESSTFT 488 >gi|17537919|ref|NP_496258.1| C-type LECtin family member (clec-59) [Caenorhabditis elegans] gi|3881709|emb|CAA88984.1| C. elegans protein ZK666.5, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 396 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 69/213 (32%), Gaps = 17/213 (7%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + +F F +SS+ + + S++ LD++ V+D S M G+ + Sbjct: 4 NICLLFFLFHAYLSSSYTDRICGQDL-----SNLWLDVVAVVDNSAGMTK---GGLTSVA 55 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 SI I + P R LVT++ L Q I + + + T Sbjct: 56 ANIASIFSKNTQIGTNPTSPKTTRLALVTYNVDATTAADL-NKFQSIDDIYSGINSALAT 114 Query: 254 KSTPGLEYAYNKIFDAKE--KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 S+ Y + A++ + Y+K +I ++ + + Sbjct: 115 ISSSEESYLARGLSQAEKVFQAGKHGFNRAHYQKVVIVY---ASTYKGSGDLNPVPVAQR 171 Query: 312 AKRRGAIVYAIGVQAEAADQF---LKNCASPDR 341 K G + + L+ A+P Sbjct: 172 LKTSGVTIITVAYDQNKDGDILVDLEKIATPYH 204 >gi|91773456|ref|YP_566148.1| hypothetical protein Mbur_1491 [Methanococcoides burtonii DSM 6242] gi|91712471|gb|ABE52398.1| hypothetical protein with von Willebrand factor type A domain [Methanococcoides burtonii DSM 6242] Length = 1258 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 53/145 (36%), Gaps = 20/145 (13%) Query: 225 SKIVQTFPLAWG-VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + T L+ + H+ I+ + T GL A N++ Sbjct: 891 WRATVTSSLSNNSLVHLSNSIDTITADGLTAIDEGLYEANNELSAITGNST--------- 941 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAEAADQFLKNCA--SP 339 ++ ++DG +++ + + AK ++Y +G+ + D L A + Sbjct: 942 ---VVLMSDGLDNAGHHSL---IEEALRAKEHNTVIYTVGLGNNEDEVDPILCEIANITG 995 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 ++Y NS L D F+ I E+ Sbjct: 996 GKYYFAPNSTVLEDIFIGIASEITN 1020 >gi|4039081|gb|AAC97485.1| sporozoite surface protein 2 [Plasmodium vivax] Length = 556 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 64/181 (35%), Gaps = 30/181 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVV 216 + +D+ +++D S S+ + + K L S+ D I ++ ++ Sbjct: 40 NESVDLYLLVDGSGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELI 97 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G I + L+ + E TT T L D +K + Sbjct: 98 RLGS---GQSIDKRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLND 142 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + +I +TDG +S +L N+ K+R + IG+ QF + Sbjct: 143 RVNREKAIQLVILMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLI 198 Query: 337 A 337 A Sbjct: 199 A 199 >gi|4039079|gb|AAC97484.1| sporozoite surface protein 2 [Plasmodium vivax] Length = 556 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 64/181 (35%), Gaps = 30/181 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVV 216 + +D+ +++D S S+ + + K L S+ D I ++ ++ Sbjct: 40 NESVDLYLLVDGSGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELI 97 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G I + L+ + E TT T L D +K + Sbjct: 98 RLGS---GQSIDKRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLND 142 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + +I +TDG +S +L N+ K+R + IG+ QF + Sbjct: 143 RVNREKAIQLVILMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLI 198 Query: 337 A 337 A Sbjct: 199 A 199 >gi|254784280|ref|YP_003071708.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237684173|gb|ACR11437.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 593 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 18/207 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 K + L +++LDVS S++ G A S+ + K+ + + + + Sbjct: 229 PKESVQLRTVLLLDVSTSIDS--GEMEKLKTAAKASLISYENDTKTSRLLPGQQQVAIYS 286 Query: 223 FSSKIVQTFPLAWGVQHIQEKINR-----LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F S+I + ++ I+ L G++T +E A + D + + Sbjct: 287 FDSEITLLTDYTSDINLLEAAIDTIPNSVLERGNSTNLLGAMEIAAERWNDQIDLIA--- 343 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC- 336 + Y + LTDGE++ + + VYAI V + L+ Sbjct: 344 ----VERGYAVLLTDGEHNFDSRSPADIEADLTNFFGTRKKVYAIAVGNNVNLENLEAIT 399 Query: 337 ASPDRFYSV---QNSRKLHDAFLRIGK 360 AS ++ +V +++ +L F + Sbjct: 400 ASSEQVLTVNSFESAEELEAVFTEVAT 426 >gi|260787567|ref|XP_002588824.1| hypothetical protein BRAFLDRAFT_89745 [Branchiostoma floridae] gi|229273994|gb|EEN44835.1| hypothetical protein BRAFLDRAFT_89745 [Branchiostoma floridae] Length = 1344 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 69/195 (35%), Gaps = 30/195 (15%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 I ++ + D D++ +DVS S++ HF + I +++ + Sbjct: 1040 GADGRWI-GTLPSCAACDSAADIIFSIDVSGSVSGHF-------DTVSTFINGVVNSLTI 1091 Query: 209 IPDVNNVVRSGLVTFS-SKIVQTFPLA--WGVQHIQEKINRL--IFGSTTKSTPGLEYAY 263 + R GL+ F+ S+ L ++ I+ + + S S GL Sbjct: 1092 ---GRSHARVGLIKFAGSRAETYISLTDFDNKYNLVSAISSVFDHYSSGEFSIAGLSVMG 1148 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ K I LTDG++S + + G ++ +G Sbjct: 1149 HEFSTNGRSAAR---------KIGIVLTDGQDSGSGY----VIPDATALRNDGTTIFCVG 1195 Query: 324 VQAEAADQFLKNCAS 338 V A+ + L N AS Sbjct: 1196 V-ADVRRETLDNMAS 1209 >gi|162450402|ref|YP_001612769.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] gi|161160984|emb|CAN92289.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] Length = 865 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 66/198 (33%), Gaps = 39/198 (19%) Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAW 235 S SM L A R++R L + R G++ FSS + ++ + Sbjct: 300 SGSMQGE------SLDQAKRALRLCL------RHLAEGDRFGVIAFSSDFRALEPSLAPF 347 Query: 236 GVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ ++ L T+ L A + DA+ + ++ LTD Sbjct: 348 TQATLKAADAFVDGLRADGGTEMLNPLLAAVGMLGDAERD------------RVVVLLTD 395 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLH 352 G+ + + + +G +Y G+ +D + + A + + + Sbjct: 396 GQVGNEAQIVDRVVQ-----RGKGVRIYTFGIGTNVSDVLVNDLAR-----RTEGAAEFI 445 Query: 353 DAFLRIGKEMVKQRILYN 370 RI +++ Q Sbjct: 446 HPGERIDEKVTAQFARAT 463 >gi|149641794|ref|XP_001509032.1| PREDICTED: similar to integrin alpha E2 [Ornithorhynchus anatinus] Length = 1679 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 76/207 (36%), Gaps = 26/207 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D G ++ +VLD S S+N A I ++ +V + Sbjct: 153 DEDTGTEIAIVLDGSGSINPE------DFEKAKDFIFNIMTRFYKKCFECEF---AVVQY 203 Query: 224 SSKIVQTFPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S I F L + +K+ + G+ TK+ + + +IF+ ++ Sbjct: 204 GSVIQTEFDLLASRDANSSLQKVKNIKQVGNVTKTASAIYHVLKEIFNVEKGSRRQN--- 260 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE--AADQFLKNCAS 338 K II LTDG+ D K N + +G +AIGV E A+ + L+ AS Sbjct: 261 ---SKIIIVLTDGDVFQ---DPKNLATVMNLPEMKGIERFAIGVGNEFSASKKTLELIAS 314 Query: 339 PDRF---YSVQNSRKLHDAFLRIGKEM 362 + V+ L + +++ Sbjct: 315 EPHHSHKFRVEKFSGLDGLLRGLEQKI 341 >gi|261822923|ref|YP_003261029.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261606936|gb|ACX89422.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 212 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 64/196 (32%), Gaps = 16/196 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM+ + ++ ++ ++ P ++TF S Sbjct: 3 RLPVYLLLDTSGSMHGE------PMEAVKNGVQTLITTLRQDPYALETAYVSVITFDSSA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + L+ TT L I +K KG +I Sbjct: 57 RQVVPLTDLINFKSPD---LVANGTTALGEALSLVAQAIEREVQKTTAETKGDWRPLVFI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG +P D ++ + A+ G V A A + L+ + Sbjct: 114 --MTDG---APTDDWRKGVAEFTSART-GV-VVACAAGQVAETKVLQEITEIVLQLDTAD 166 Query: 348 SRKLHDAFLRIGKEMV 363 S + F + + Sbjct: 167 SNAIKAFFKWVSASIS 182 >gi|149277251|ref|ZP_01883393.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] gi|149232128|gb|EDM37505.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] Length = 629 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 74/196 (37%), Gaps = 25/196 (12%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATR 197 PW N H + I K S + + ++ ++DVS SMND +KL + Sbjct: 221 IAAAPW--NKKHKLVQIGLQGKTISTAKLPSSNLVFLIDVSGSMNDS-----NKLPLLVS 273 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKS 255 S + + D + R +V ++ P G I++ +N+L G +T Sbjct: 274 SFKLLTD------QLRKTDRVAIVVYAGNSGLVLPSTSGDQKTTIKDALNKLSAGGSTAG 327 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 G+ AY ++ KG ++ +I TDG+ + +++ E ++ Sbjct: 328 GAGIRLAYEV------AAKNYIKGGNNR---VILATDGDFNVGASSDEDMEKLIEEKRKS 378 Query: 316 GAIVYAIGVQAEAADQ 331 G + +G Sbjct: 379 GVFLTVLGFGMGNLKD 394 >gi|50085104|ref|YP_046614.1| putative tellurium resistance protein [Acinetobacter sp. ADP1] gi|49531080|emb|CAG68792.1| conserved hypothetical protein; putative tellurium resistance protein [Acinetobacter sp. ADP1] Length = 352 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 65/210 (30%), Gaps = 36/210 (17%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM + + I+ ++ ++ P V ++ F+ + Sbjct: 3 RLPVFFVLDCSESMAGQ------NIQQMQQGIQLIMQKLRQDPYALETVYVSVIAFAGIV 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL V+ +L G T LE+ N+ K KG Y Sbjct: 57 RTLVPL---VEVFAYYSTKLPLGGGTHLGKALEHLMNEFDRHLMKTTEETKGDWKPIAY- 112 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA-----IGVQAEAA------DQFLKNC 336 TDG + C +A R YA I + Q ++C Sbjct: 113 -LFTDGRPTD----------ECKDAIYRWQKKYARRCTLIALGMGKNVDYATLKQLTEHC 161 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + N + F I +V Q Sbjct: 162 IA----FDELNEKDFKKFFQWISASVVAQS 187 >gi|329941682|ref|ZP_08290947.1| von Willebrand factor type A /toxic cation resistance protein [Streptomyces griseoaurantiacus M045] gi|329299399|gb|EGG43299.1| von Willebrand factor type A /toxic cation resistance protein [Streptomyces griseoaurantiacus M045] Length = 248 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/161 (21%), Positives = 57/161 (35%), Gaps = 21/161 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDV 212 SSV+ D+ + +VLD S SM ++ G M L S+ LD ++P Sbjct: 31 TAESSVRGHGLEDVRAAVYLVLDRSGSMRPYYRDGTMQHLAERVLSLSAHLDDDGTVP-- 88 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +V FS+ + L G + I++L A + + D Sbjct: 89 -------VVFFSTDVDGCTDLTLGRH--RGHIDKLHENLGHMGRTNYHCAMDAVID---- 135 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 H + D ++F TDG P + C A+ Sbjct: 136 --HYLESGSDAPALVVFQTDG---GPTSRQAAERYLCKAAR 171 >gi|187934443|ref|YP_001887479.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] gi|187722596|gb|ACD23817.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] Length = 1596 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 49/136 (36%), Gaps = 28/136 (20%) Query: 185 FGPGMDKLGVATRSIREMLDIIKSI------PDVNNVVRSGLVTFSSKIVQTFPLAW--- 235 + K+ ++ + +D + S P+V N+ + G+V++++ L Sbjct: 188 YESYTTKIHELKKAAKNFIDSLTSTKTDGQTPNVKNL-KIGIVSYNNSGYINEGLVQVTD 246 Query: 236 -------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + +++ I L T + GL A + + E K +I Sbjct: 247 SDRKNNGNINELKDTIENLRADGGTNTGDGLRKAAYLLNEENEAN-----------KTVI 295 Query: 289 FLTDGENSSPNIDNKE 304 F+ DGE + + D Sbjct: 296 FMGDGEPTYYSSDRWG 311 >gi|126334857|ref|XP_001374633.1| PREDICTED: similar to leukocyte immune-type receptor TS32.15 L1.1a [Monodelphis domestica] Length = 3609 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 76/219 (34%), Gaps = 40/219 (18%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ + L+++ ++D S S+ +++L + + + P V + R Sbjct: 99 VRKLRERSHSLELVFLVDESSSVG--HANFLNELKFVKKLLSD-------FPVVPSATRV 149 Query: 219 GLVTFSSKIVQ-----TFPLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFD 268 +VTFSSK + QH +NR I G T + + A + Sbjct: 150 AIVTFSSKNNVVPRVDYISSSRAHQHKCSLLNREIPNITYRGGGTYTKGAFQQAAQILRH 209 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 ++E K I +TDG ++ + + G ++ G+ Sbjct: 210 SRENS----------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIFTFGIWQGN 254 Query: 329 ADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + ++P + Y + + + F + + + + Sbjct: 255 IRELNDMASTPKEEHCYLLHSFEE----FEALARRALHE 289 >gi|118399120|ref|XP_001031886.1| hypothetical protein TTHERM_00721540 [Tetrahymena thermophila] gi|89286221|gb|EAR84223.1| hypothetical protein TTHERM_00721540 [Tetrahymena thermophila SB210] Length = 994 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/256 (14%), Positives = 87/256 (33%), Gaps = 32/256 (12%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 ++ A +++++ L + + + + S + +V+D S SM Sbjct: 2 ETQQSVQAKAQFKVE---SFEGNEFIDFTLKLKLDETALVQNNSRQK-NYQIVIDNSGSM 57 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHI 240 + + + + E++ + + L+TF++ I L ++ + Sbjct: 58 DG------TNIQLTKQLCNELVQFVIKTQPHSK---ISLMTFNTSIDHVENLHLKSLKQV 108 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 ++ I+ + T I K + + + + I++LTDG+ S Sbjct: 109 EQFISNINANGGT---------IFHITFDKLRDICQKFTNQNEELVIVYLTDGQVQSGQD 159 Query: 301 DN---KESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCAS----PDRFYSVQNSRKL 351 +F K+ V+A+G+ L S + ++ S ++ Sbjct: 160 STNLKDSFIFLQQVLKKFVNNVEVHALGMGTSHDPVILDKIISLQTTQSTYQFIKESSEI 219 Query: 352 HDAFLRIGKEMVKQRI 367 AF I + + I Sbjct: 220 EGAFKNIVDIIGQNFI 235 >gi|258405287|ref|YP_003198029.1| hypothetical protein Dret_1163 [Desulfohalobium retbaense DSM 5692] gi|257797514|gb|ACV68451.1| Protein of unknown function DUF2134, membrane [Desulfohalobium retbaense DSM 5692] Length = 323 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 56/169 (33%), Gaps = 9/169 (5%) Query: 9 FFYNCKGSISILTAIL-LPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 + GS++I+TA+ L + G+ I+ + K +L LD + L A ++ Sbjct: 10 ILRDECGSVAIITALFVLFSLLATAGIAIDIGRQATAKNELQNTLDAAALAGAIELGQNG 69 Query: 68 NGNNGKKQKNDFSYRIIKN----IWQTDFR--NELRENGFAQDINNIERSTSLSIIIDDQ 121 N + K I N + D + N N F++ N ++ + Sbjct: 70 PANVKSEAKEAAENNSIDNNGLILGDNDIKVGNWTEPNFFSKTPYNSV-KIMVNNHSINS 128 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS-SVKISSKSDIGL 169 L+ P P+ +T S++ G+ Sbjct: 129 FFASALNFQQTVSAEATAVIGPLSGKRHLIPIAVTEDEADTMSEAGEGI 177 >gi|218129580|ref|ZP_03458384.1| hypothetical protein BACEGG_01157 [Bacteroides eggerthii DSM 20697] gi|217988310|gb|EEC54633.1| hypothetical protein BACEGG_01157 [Bacteroides eggerthii DSM 20697] Length = 212 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 65/199 (32%), Gaps = 20/199 (10%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM +G + ++ ++ ++ P ++TF+S Sbjct: 3 RLPVYLLLDTSGSM---YGEPI---EAVKNGVQTLISTLRGDPYALETAYISIITFNSVA 56 Query: 228 VQTFPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 Q PL + + T L K+ K KG Sbjct: 57 QQVTPLT----ELSAFQQPQIEASGCTALGEALTLLAQKVDTEIVKTTQEVKGDWKP--L 110 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGA-IVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +TDGE P D ++ L +E K+R +V A A LK Sbjct: 111 VFLMTDGE---PTDDLQKGL---DEFKKRKFGMVVACAAGQGANTNTLKKITENVVQLDT 164 Query: 346 QNSRKLHDAFLRIGKEMVK 364 +S + F + + Sbjct: 165 ADSATIKAFFKWVSASIST 183 >gi|124127041|gb|ABM92272.1| CD11b [Ovis canadensis] Length = 1152 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 46/233 (19%), Positives = 88/233 (37%), Gaps = 32/233 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F + +N P I ++++ + D D+ ++D S S++ M K S + Sbjct: 124 FLFGSNLLQKPRRIPAALRECPQQDS--DIAFLIDGSGSIDPVDFDRMKKFVSTVMSRFQ 181 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 + ++ ++ R+ TF+ F ++ + I +L T + G+ Sbjct: 182 KSKTLFALMQYSDDFRT-HFTFND-----FKRNSDLELLVRPIGQLF--GRTHTATGIRK 233 Query: 262 AYNKIF-DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 ++F + H K +I +TDGE +D E EA R+G I Y Sbjct: 234 VVRELFHSSNGARNHAL-------KIMIVITDGE---KYLDPLEYRDVIPEADRKGIIRY 283 Query: 321 AIG----VQAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 IG ++ + + L AS D + V N L I ++ ++ Sbjct: 284 VIGVGDAFNSKKSRKELDTIASKPPADHVFQVNNFEAL----KTIQNQLQEKI 332 >gi|269104788|ref|ZP_06157484.1| TPR domain protein in aerotolerance operon [Photobacterium damselae subsp. damselae CIP 102761] gi|268161428|gb|EEZ39925.1| TPR domain protein in aerotolerance operon [Photobacterium damselae subsp. damselae CIP 102761] Length = 691 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 61/201 (30%), Gaps = 30/201 (14%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P W K+ + S G + +V+D+S SM ++L A Sbjct: 54 PLYLLGLAWILAVIALAGPSWEKTKLPAYSLSGARV-LVMDMSKSM-YATDIAPNRLTQA 111 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----S 251 +ML K +GL+T++ PL + I L Sbjct: 112 RFKALDMLPGWKEGS-------TGLITYAGDGYTISPLTDDSSTLANLIPSLSPKIMPIP 164 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 + + G+ A + + A + II +TDG + KES E Sbjct: 165 GSNAASGIRQAIDLLKQAGDTNGD-----------IILITDG------MTEKESQESLAE 207 Query: 312 AKRRGAIVYAIGVQAEAADQF 332 K V + V E Sbjct: 208 LKNTHYRVSILAVGTEQGAPI 228 >gi|194474004|ref|NP_001124020.1| collagen alpha-1(XIV) chain [Rattus norvegicus] gi|149066377|gb|EDM16250.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_a [Rattus norvegicus] Length = 1794 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 74/198 (37%), Gaps = 29/198 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ D D + + + I + + +V F+ Sbjct: 1030 DLVFMVDGSWSIGD------DNFNKIINFLYSTVGALDKI--GADGTQVAMVQFTDDPRT 1081 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L + + + I + + G TK+ +++ + +F A + K Sbjct: 1082 EFKLDAYKTKETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTA------DSGTRRGIPKV 1135 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYS 344 I+ +TDG + + E + G ++AIGV + ++ + P + Sbjct: 1136 IVVITDGRSQD------DVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHVFF 1189 Query: 345 VQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1190 VDD----FDAFKKIEDEL 1203 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 83/223 (37%), Gaps = 26/223 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDV 212 +K ++ D+++++D S S+ R +R L+ + + Sbjct: 143 TSPEEIKFFCETPAIADIVILVDGSWSIGRF----------NFRLVRNFLENLVTAFNVG 192 Query: 213 NNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + R GL +S + L A+ + + + + L + T A N IF+ Sbjct: 193 SEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTG---LALNFIFENS 249 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K E ++ K I +TDG++ I +L + G ++AIGV+ Sbjct: 250 FKPEAGSRSG--VSKIGILITDGKSQDDIIPPSRNL------REAGVELFAIGVKNADLS 301 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + + + PD Y+V +H + + + + +K Sbjct: 302 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDK 344 >gi|118431306|ref|NP_147675.2| hypothetical protein APE_1031.1 [Aeropyrum pernix K1] gi|116062628|dbj|BAA80016.2| conserved hypothetical protein [Aeropyrum pernix K1] Length = 463 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 59/172 (34%), Gaps = 32/172 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++LD S SM G +D ++ + R F S Sbjct: 298 IYVLLDKSGSM---VGAKIDWARAVAVALFRR--------SLAENRRFSARFFDSVTYPA 346 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + + G T T ++ A + I + I+ Sbjct: 347 IHLRPRSKPRDFLELVKYLAAVKAGGGTDITAAIKTAADDISRTPRGEQRISD------- 399 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 I+ +TDGE+ NID E + KR A ++ + +Q + +LK + Sbjct: 400 -IVLITDGED-RLNIDVVE-----DSLKRSDARLHTVIIQ--GHNPYLKRIS 442 >gi|118351474|ref|XP_001009012.1| hypothetical protein TTHERM_00259660 [Tetrahymena thermophila] gi|89290779|gb|EAR88767.1| hypothetical protein TTHERM_00259660 [Tetrahymena thermophila SB210] Length = 2382 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 49/145 (33%), Gaps = 16/145 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S +K + ++V D S SM + K E+LD I + Sbjct: 2223 SIIKKCKSQINPVHFIIVFDESGSMEGDKWMSLRK---------ELLDFIDNRSRYTAQD 2273 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L+ F+ I L + I++K+ T + L+ + K + Sbjct: 2274 FITLIGFNDSIKLYTKLEKLNEQIKQKVPEKNMNGNTNFSAPLQQVLKILSQDNCKTFNK 2333 Query: 277 AKGHDDYKKYIIFLTDGENSSPNID 301 I FL+DGE + P+ D Sbjct: 2334 NN-------VIFFLSDGEANKPDTD 2351 >gi|332227196|ref|XP_003262777.1| PREDICTED: vitrin isoform 1 [Nomascus leucogenys] Length = 694 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 67/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 511 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 560 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 561 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 608 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G YAIGV A ++ P D Sbjct: 609 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVFTYAIGVAWAAQEELEVIATHPARD 662 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 663 HSFFVDEFDNLYQYVPRIIQNI 684 >gi|302142112|emb|CBI19315.3| unnamed protein product [Vitis vinifera] Length = 626 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/138 (22%), Positives = 56/138 (40%), Gaps = 25/138 (18%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +D++ VLDVS SM KL + R++R ++ + R +V Sbjct: 281 PARRAPIDLVTVLDVSASMTGS------KLQMLKRAMRLVISSLGPSD------RLAIVA 328 Query: 223 FSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FS+ + PL A G + + I+RL+ + L A + D +E+ + Sbjct: 329 FSASPRRLLPLRRMTAHGQRSARRIIDRLVCSQGSSVGEALRKATKVLEDRRERNPVAS- 387 Query: 279 GHDDYKKYIIFLTDGENS 296 I+ L+DG++ Sbjct: 388 --------IMLLSDGQDD 397 >gi|225458992|ref|XP_002285586.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 738 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/138 (22%), Positives = 56/138 (40%), Gaps = 25/138 (18%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +D++ VLDVS SM KL + R++R ++ + R +V Sbjct: 331 PARRAPIDLVTVLDVSASMTGS------KLQMLKRAMRLVISSLGPSD------RLAIVA 378 Query: 223 FSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FS+ + PL A G + + I+RL+ + L A + D +E+ + Sbjct: 379 FSASPRRLLPLRRMTAHGQRSARRIIDRLVCSQGSSVGEALRKATKVLEDRRERNPVAS- 437 Query: 279 GHDDYKKYIIFLTDGENS 296 I+ L+DG++ Sbjct: 438 --------IMLLSDGQDD 447 >gi|147816411|emb|CAN77438.1| hypothetical protein VITISV_007401 [Vitis vinifera] Length = 757 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/138 (22%), Positives = 56/138 (40%), Gaps = 25/138 (18%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +D++ VLDVS SM KL + R++R ++ + R +V Sbjct: 350 PARRAPIDLVTVLDVSASMTGS------KLQMLKRAMRLVISSLGPSD------RLAIVA 397 Query: 223 FSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FS+ + PL A G + + I+RL+ + L A + D +E+ + Sbjct: 398 FSASPRRLLPLRRMTAHGQRSARRIIDRLVCSQGSSVGEALRKATKVLEDRRERNPVAS- 456 Query: 279 GHDDYKKYIIFLTDGENS 296 I+ L+DG++ Sbjct: 457 --------IMLLSDGQDD 466 >gi|290976237|ref|XP_002670847.1| vWFA domain-containing protein [Naegleria gruberi] gi|284084410|gb|EFC38103.1| vWFA domain-containing protein [Naegleria gruberi] Length = 389 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 52/147 (35%), Gaps = 21/147 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----S 224 +D+++V+D + SM+ ++ VA R++ ++ + + +R V++ Sbjct: 79 VDLVIVMDCTGSMS-------GEIEVAKRTVTTIISTL--HEKFQSDLRFSAVSYRDHTD 129 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 V+ FP + + IN + L A I + + K Sbjct: 130 DYAVKEFPFTKDLNKAKGYINTMSAQGGGDHPEALASALYVINEMPFNKKG--------K 181 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNE 311 K ++++ D + C + Sbjct: 182 KIVVWVADAPPHGMKTSSDSYPEGCKD 208 >gi|257790817|ref|YP_003181423.1| von Willebrand factor type A [Eggerthella lenta DSM 2243] gi|257474714|gb|ACV55034.1| von Willebrand factor type A [Eggerthella lenta DSM 2243] Length = 555 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 72/185 (38%), Gaps = 23/185 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + + K S G +++ ++DVS SM+D DKL + S + Sbjct: 162 PWNDQTKLLVMGFATE-KDGDASPTGANLVFLIDVSGSMDDP-----DKLPLVKDSFAAL 215 Query: 203 LDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++ + R +VT++S +++ + I ++ L+ +T GLE Sbjct: 216 VEGLTERD------RVSVVTYASGERVLLEGVPGDDKRRIMRAVDSLVAEGSTNGEAGLE 269 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + ++ + + ++ +DG+ + E + + + G + Sbjct: 270 QAYR-LAESSFIEGGVNR--------VVMASDGDLNVGISSESELHDFVEQKRETGVYLS 320 Query: 321 AIGVQ 325 +G Sbjct: 321 VLGFG 325 >gi|82702351|ref|YP_411917.1| von Willebrand factor, type A [Nitrosospira multiformis ATCC 25196] gi|82410416|gb|ABB74525.1| von Willebrand factor, type A [Nitrosospira multiformis ATCC 25196] Length = 888 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 72/230 (31%), Gaps = 45/230 (19%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI---------------SSKSDIGLD 170 +++ S +P + S++ + +V + ++ + Sbjct: 346 SVAVASDPILPVTYARLWLSYTSTNPGDTASGNVTVRCVQTGESWVINIAANTTARPRSA 405 Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 + +VLD S SMN+ G G+ K+ + + ++ GLV F+ + Sbjct: 406 VALVLDRSGSMNEDAGDGISKVQKLREAANVFISAMQPADG------IGLVRFNEAAQRL 459 Query: 230 -------TFPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G E I + T G+ + DA + Sbjct: 460 MEIQEAGAAPGGTGRTIALEHIAGSDIDPAGATSIGDGVVNGKQMLDDA---QATAGTPY 516 Query: 281 DDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 D ++ LTDG N P + + N YA+G+ + Sbjct: 517 DVTA--MVVLTDGMWNRPPPLADVMGSITANT--------YAVGLGLPSN 556 >gi|301625572|ref|XP_002941978.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like, partial [Xenopus (Silurana) tropicalis] Length = 476 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 56/150 (37%), Gaps = 12/150 (8%) Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ-----TFPLAWGVQHIQEKINRL 247 ++ L I+ +P+ ++ G++ F K+ + + + ++ ++++ Sbjct: 4 QKIKQTYEAFLKILADLPEEDHF---GILIFDDKVDKWQNTLVKAVPDNIIKAKQFVSKI 60 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T L A + + I+FL+DGE +S ++ E + Sbjct: 61 SARGGTDINKALLAAVKMLKNTSRNKLLPKISTS----IILFLSDGEPTSGVTNHNEIIN 116 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +A R +Y +G + FL+ A Sbjct: 117 NVKKANERQTTLYCLGFGNDVDFNFLEKMA 146 >gi|295394688|ref|ZP_06804906.1| von Willebrand factor [Brevibacterium mcbrellneri ATCC 49030] gi|294972458|gb|EFG48315.1| von Willebrand factor [Brevibacterium mcbrellneri ATCC 49030] Length = 538 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 66/207 (31%), Gaps = 25/207 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR--SIREMLDIIKSIPDVNNVVRSGL 220 + M +D S SM G +++L A S + + + + L Sbjct: 342 ANGRKPATMFFQIDTSGSMR---GERLEQLKTALGILSGTSAKNDTERFLAIQPREKLKL 398 Query: 221 VTFSSKIVQT--FPLAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 V FS ++ T + L + + KI L T L+ + AK++ Sbjct: 399 VEFSHEVKSTDGYRLTDNGSADKVRKDLDTKIQTLTAEGGTAIYSTLQT---TLESAKKE 455 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ TDG N +Y N + V+A+ +D+ Sbjct: 456 KSDDKITS------VVVFTDGMNEHGISFRAFKDWYSNNQDVQDIPVFAVSFGNADSDEL 509 Query: 333 LKNCA-SPDRFYSVQNSRKLHDAFLRI 358 + + + R + L AF I Sbjct: 510 QELVSLTGGRVFDGN--ADLTAAFKDI 534 >gi|149502255|ref|XP_001506498.1| PREDICTED: similar to anthrax toxin receptor, partial [Ornithorhynchus anatinus] Length = 183 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/149 (24%), Positives = 51/149 (34%), Gaps = 13/149 (8%) Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FSS+ L + I+ +N L + G T G A E++ Sbjct: 28 RMSFIVFSSRGTTVMKLTEDREAIRRGLNILRSEVPGGDTFMHEGF-------IRANEQI 80 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 G II LTDGE E + A+ GAIVY +GV+ Q Sbjct: 81 SFENSGGLRTASVIIALTDGELQRDQFYYAE--KEADRARSLGAIVYCVGVKDFNETQLS 138 Query: 334 KNCASPDRFYSVQNS-RKLHDAFLRIGKE 361 S D + V L I K+ Sbjct: 139 TIADSIDHVFPVTGGFHALRGVIDSILKK 167 >gi|299139930|ref|ZP_07033101.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] gi|298598077|gb|EFI54244.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] Length = 484 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 66/198 (33%), Gaps = 29/198 (14%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 S P T+S S + + M+++DVS SM + G + A ++ L+ Sbjct: 84 GQSTVPFYATASTAQSETTPLPQVTMLLIDVSGSMLGNDMAGETRFDAARQAAAAFLEGF 143 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQT----FPLAWGVQHIQEKINRLIFGS---TTKSTPGL 259 + D R + FS + VQ Q +++ L T + Sbjct: 144 RDGQD-----RVAIAGFSGRNVQAGIDGARFVSSRSEAQAELDALAAPERRNNTALYSAV 198 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS---SPNID------NKESLFYCN 310 A N++ + ++ LTDG N D N+ Sbjct: 199 SIAANRLAKEARDSHSEVR--------LLVLTDGANDVQPQAGDDANLLVGNEGLEQAAK 250 Query: 311 EAKRRGAIVYAIGVQAEA 328 E ++ G V IG+ +E Sbjct: 251 EVEKDGVSVLPIGLGSEN 268 >gi|196007110|ref|XP_002113421.1| hypothetical protein TRIADDRAFT_57572 [Trichoplax adhaerens] gi|190583825|gb|EDV23895.1| hypothetical protein TRIADDRAFT_57572 [Trichoplax adhaerens] Length = 1343 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 66/201 (32%), Gaps = 29/201 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + ++MVLD S SM L ++ ++ + I G++ FS Sbjct: 293 KAMPVRIVMVLDKSGSMRGS------NLQQLIQAATNVILQLGQIDGS-----IGIIIFS 341 Query: 225 SKIVQTFPL--AWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + T PL Q + I L T G+ + + E G Sbjct: 342 TSATVTCPLMAVNNDQDKNKLIGCLPPEASGGTSIGSGILKGIELLLGSVG--EQKPSGG 399 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCAS 338 ++I ++DG+ ++ N + +V +I A+ L Sbjct: 400 -----HLIVMSDGQENA----NPRIKDVMSNITENDVVVTSISFGQSASKVLEDLAKSTG 450 Query: 339 PDRFYSVQNSR-KLHDAFLRI 358 +++ N L +AF I Sbjct: 451 GSSYFASTNGTLTLMNAFTAI 471 >gi|156370955|ref|XP_001628532.1| predicted protein [Nematostella vectensis] gi|156215511|gb|EDO36469.1| predicted protein [Nematostella vectensis] Length = 737 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 77/204 (37%), Gaps = 25/204 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+ ++LD S S+ G + + AT+ + + LDI + V+ L+ FS+ + Sbjct: 87 LDLAILLDASYSIT---TTGWEGIATATKDLIDKLDISPAGTHVS------LMKFSTDVE 137 Query: 229 QTFPLAW--GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + ++ I+++ + +++ L A N IF A K Sbjct: 138 TFYMFDNQATKDRLKSLIDKMEYDGEWSRTDIALNAAKNHIFVPSTGARGDA------PK 191 Query: 286 YIIFLTDGENSSP-----NIDNKES-LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 I+ TDG+ P NID S + + + ++ + + A Sbjct: 192 AIVLFTDGKTDGPTPIDRNIDWVASLIKPLQDLRDANVTIFCVASGNHPDYSQINWLAGD 251 Query: 340 -DRFYSVQNSRKLHDAFLRIGKEM 362 R ++ + L + R K++ Sbjct: 252 RARVFTTSDMDALVGSLGRASKKV 275 >gi|268324441|emb|CBH38029.1| putitive magnesium-chelatase subunit [uncultured archaeon] Length = 705 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 70/194 (36%), Gaps = 34/194 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQT 230 M V+D S SM + ++ A ++ +L + + G+V F + Sbjct: 509 MFVVDASGSMGANR-----RMESAKGAVLSLL-----LDSYQQRDKVGMVAFKGDQADVL 558 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL E++ L G T GLE N + K + E ++ + Sbjct: 559 LPLCSSSDLAVERLRELPTGGRTPLAAGLEQGLNLLMAEKHRDEEAIP-------ILLLI 611 Query: 291 TDGENS----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLK----NCA---- 337 +DG + +E L +A+ +G +Y I + E +D F++ C Sbjct: 612 SDGRANVSAGGSKELEQELLALAEQARAKG--IYVIVIDTEIVSDSFIQMQLGYCRAIAN 669 Query: 338 -SPDRFYSVQNSRK 350 S ++Y + + Sbjct: 670 YSGGKYYPIADLTS 683 >gi|311105413|ref|YP_003978266.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] gi|310760102|gb|ADP15551.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] Length = 2061 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/169 (22%), Positives = 69/169 (40%), Gaps = 19/169 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGM---DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ +VLD+S SM+ ++G G +L A ++++ +L+ + + + L+T Sbjct: 1440 GASYNIALVLDLSGSMDYYWGSGSNQETRLETAKKALKSLLEN--QLATHDGTINVSLIT 1497 Query: 223 FSSK----IVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F+ L V + + L G T A+N+ + Sbjct: 1498 FADSSSKLQKAISGLTPDNVDDMVNILLGLKAGGGT----PYGAAFNETKSWFDGQPTED 1553 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-VYAIGVQ 325 + YK FLTDGE SS N+++ F +K G V+ IG+ Sbjct: 1554 ANGNAYKNLTFFLTDGEPSSEYWYNRDTEF----SKLAGVSDVHGIGIG 1598 >gi|291223809|ref|XP_002731900.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 992 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 67/196 (34%), Gaps = 34/196 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM G + KL A + + G+V FS + Sbjct: 323 LVLVLDRSGSME---GVRLTKLRQAASAFIR--------NTICEGSYLGIVEFSEFAQEL 371 Query: 231 FPLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL G + I RL G T G+ + + E Sbjct: 372 APLTLVNGSDSREGLIRRLPHSVGGWTSIGAGIMKGIEVLSTNGQNPEGG---------L 422 Query: 287 IIFLTD-GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ ++D GEN +P + E+L +E G + I +A + A F+ Sbjct: 423 IMAISDGGENRAPTL--SEALQAVDE---SGVTIDTIAYSEQADENLASLAARTGGMSFF 477 Query: 344 SV--QNSRKLHDAFLR 357 +S L DAF Sbjct: 478 YSGDDDSTVLEDAFAT 493 >gi|260813598|ref|XP_002601504.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] gi|229286801|gb|EEN57516.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] Length = 400 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 69/193 (35%), Gaps = 26/193 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 LD++ ++D S S+ + I++ + P G++ +SS Sbjct: 221 GNSLDIIYLVDGSGSVGAN------NFEKVKLFIKKAVSGFVIGPAATQ---VGVIQYSS 271 Query: 226 KIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 KI Q F + Q + + + T + + YA F + Sbjct: 272 KIRQEFSM-NSFQTVSGLLGAIDAMEYMQGGTLTGRAIRYASKYGFSVFDGARRG----- 325 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 K ++ +TDG +S EA+R+G VYAIGV A+Q K ++ + Sbjct: 326 -VPKVLVVVTDGVSSDEVAIP------ALEAQRQGIFVYAIGVSNYDAEQLQKIASTNES 378 Query: 342 FYSVQNSRKLHDA 354 V N L Sbjct: 379 SAMVDNFNLLDSV 391 >gi|301767172|ref|XP_002919036.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1-like [Ailuropoda melanoleuca] Length = 910 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 77/200 (38%), Gaps = 20/200 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVT 222 +++ +++ V+D+S SM K+ ++ ++L I+ D ++V G + + Sbjct: 286 TNLNKNVVFVIDISTSMEGQ------KVKQTKEALLKILGDIRP-GDYFDLVLFGSEVQS 338 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +VQ P ++ Q+ + R T GL + A+ L ++ Sbjct: 339 WRGSLVQASPA--NLRAAQDFVRRFFLAGATNLNGGLLRGIEILNQAQGSLPELSNHAS- 395 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 +I LTDGE + D + L A R +Y +G + FL+ + + Sbjct: 396 ---ILIMLTDGEPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHDVDLNFLEVMSMENNG 452 Query: 342 ----FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 453 RAQRIYEDHDATQQLQGFYD 472 >gi|291221810|ref|XP_002730913.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 858 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 68/194 (35%), Gaps = 34/194 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM + + + T I ++D ++ V G++ FSS Sbjct: 359 VVLVLDTSGSMAGNRIERLHR--DTTHFILNVIDD-------DSFV--GIIQFSSDATVL 407 Query: 231 FPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + + +I T G+ A ++ +++ L+ + + Sbjct: 408 SEMKKIDYASRPQIAASVPYSANGGTNFGAGIRAALQELKESQLSLKGAS---------L 458 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFYSV 345 + +TDG+ S + +E G V I A D L + +Y Sbjct: 459 LIITDGQFSYTSDVT-------DEVYASGVRVDTIAYTQAAEDSLRVLSDRTGGSYYYVS 511 Query: 346 QN--SRKLHDAFLR 357 + S +L D+ Sbjct: 512 DDETSTELLDSLTS 525 >gi|281338027|gb|EFB13611.1| hypothetical protein PANDA_007566 [Ailuropoda melanoleuca] Length = 868 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 77/200 (38%), Gaps = 20/200 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVT 222 +++ +++ V+D+S SM K+ ++ ++L I+ D ++V G + + Sbjct: 241 TNLNKNVVFVIDISTSMEGQ------KVKQTKEALLKILGDIRP-GDYFDLVLFGSEVQS 293 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +VQ P ++ Q+ + R T GL + A+ L ++ Sbjct: 294 WRGSLVQASPA--NLRAAQDFVRRFFLAGATNLNGGLLRGIEILNQAQGSLPELSNHAS- 350 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 +I LTDGE + D + L A R +Y +G + FL+ + + Sbjct: 351 ---ILIMLTDGEPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHDVDLNFLEVMSMENNG 407 Query: 342 ----FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 408 RAQRIYEDHDATQQLQGFYD 427 >gi|198422516|ref|XP_002123195.1| PREDICTED: similar to EGF-like domain-containing protein [Ciona intestinalis] Length = 2053 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 68/176 (38%), Gaps = 24/176 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+ +LD S S+ + M+D + D VR G++T++ K Sbjct: 600 LDLFFILDSSSSVRAA------NFEKMKNFVDRMVDPLNVGQDR---VRVGVMTYNRKTF 650 Query: 229 QTFPL--AWGVQHIQEKINRLIFGS-TTKSTPGLE-YAYNKIFDAKEKLEHIAKGHDDYK 284 + A EK+ + + TK+ + A N + +++ + ++ Sbjct: 651 KRIDFNEAANNTDFSEKLAAIQYSGRGTKTAQAINFAASNCLHESRGRRPNVPLS----- 705 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 +I +TDG + D ++ ++ + ++IG+ ++ L A+ D Sbjct: 706 --VILMTDGR----SQDWRQLPSAAATMHQKANMFFSIGITNRVNERELLTIANDD 755 >gi|297627243|ref|YP_003689006.1| Von Willebrand factor, type A precursor [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296923008|emb|CBL57590.1| Von Willebrand factor, type A precursor [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 323 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 70/236 (29%), Gaps = 28/236 (11%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGV 194 P CA L + + D G D+++V+D + SM D + G ++ Sbjct: 32 PVPVLRLAICAVVMVIGLHPVFTSAPITGRDHGADVVVVIDRTTSMGADDYQGGQPRMSG 91 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----IF 249 I ++ + R ++ P + L Sbjct: 92 VASDIARLVGDYRGA-------RFSVIAMDDNARLEVPFTTDGDAVAGYAAALGWQRSTQ 144 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 GS + + G+ A ++ ++ ++ +++ DGE + + Sbjct: 145 GSGSDISSGVGLAAQELEASRRDRPQA-------QRLLVYCGDGEQT-IDTAPASFAPLA 196 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-YSVQNSRKLHD--AFLRIGKEM 362 +E +G +A D S + + + D A I ++ Sbjct: 197 DELTDA----LVLGYGTQAGDIMQTYPGSGEYVTFQGAPAVSHIDQAALQAIAEQT 248 >gi|320007113|gb|ADW01963.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 661 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 71/222 (31%), Gaps = 48/222 (21%) Query: 112 TSLSIIIDDQHKDYNLSAVS-RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD 170 TS + I+ + + A R +P T+ + G+D Sbjct: 133 TSPDVWIEASGSLFRVRAALDRGGLPLQGLL---------RVGPETALTLFAPGERSGVD 183 Query: 171 MMMVLDVSLSMN-----------------DHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 ++++ D S SM+ G+ ++ ++ M+D V Sbjct: 184 IVVLADCSGSMSIEDIPAAPSDGGRWFNRGRTSAGISRMNALKDALGAMIDARMRYDGV- 242 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEK--INRLIFG--------STTKSTPGLEYAY 263 R LV F FP WG++ + + + RL S T L A Sbjct: 243 -GTRFALVRFDHDHEPMFPSRWGMEEVSDAHSVQRLREAVSLLNYRQSGTDIGKALHKA- 300 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 E L ++ + ++ ++DG + +P D + Sbjct: 301 ------GELLHRYGVPGNE--RLVVLVSDGAHYAPIPDERSG 334 >gi|220912876|ref|YP_002488185.1| von Willebrand factor A [Arthrobacter chlorophenolicus A6] gi|219859754|gb|ACL40096.1| von Willebrand factor type A [Arthrobacter chlorophenolicus A6] Length = 333 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 68/217 (31%), Gaps = 45/217 (20%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + D+M+ LDVS SM+ +D R R G Sbjct: 81 TTTVPEQRNRDIMLCLDVSGSMSSADAAVVDVFAELAREFDGE--------------RIG 126 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKI-------------NRLIFG-----STTKSTPGLEY 261 L F S Q FPL + ++++ + + G ++ GL Sbjct: 127 LTIFDSTGSQVFPLTDDYGYARDQLLLARDAFDGKPGSSGFLDGTWGGRGSSLIGDGLAS 186 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE-AKRRGAIVY 320 N ++ + ++ TD + + + +L + AK R VY Sbjct: 187 CLNGFPRTQDSDAAADNPAVKRSRSVVLATD---NYVSGNPILTLPQASALAKDRTVRVY 243 Query: 321 AI-------GVQAEAADQFLKNCA--SPDRFYSVQNS 348 A+ G A L+ A + +Y++ N Sbjct: 244 ALNPGDLDYGSDAGQPGAQLRVAAESTGGSYYALDNP 280 >gi|255671683|gb|ACU26442.1| uncharacterized protein [uncultured bacterium HF186_25m_18N5] gi|255671756|gb|ACU26513.1| uncharacterized protein [uncultured bacterium HF186_25m_27D22] Length = 836 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 65/194 (33%), Gaps = 24/194 (12%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + L + +K+ ++ V+D S SM G +DK A R E + Sbjct: 288 TGTFALTLEPPLKVDPDQVTPKELFFVVDTSGSM---MGEPLDKARAAMRYALERMGPDD 344 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + ++ SG+ + + + + P ++ I + T+ G+ A + Sbjct: 345 TFQIID--FASGVASLAPRPLPNTPE--NLRKGLAFIEAMTSQGGTEMLAGIRAALDGPT 400 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + + F+TDG + +A+ +++ GV + Sbjct: 401 PPGRL------------RIVAFMTDGYIGNDGDILDYIDQSVGQAR-----LFSFGVGED 443 Query: 328 AADQFLKNCASPDR 341 L+ A+ R Sbjct: 444 VNRYLLEEMATRGR 457 >gi|228471677|ref|ZP_04056450.1| protein containing von Willebrand factor [Capnocytophaga gingivalis ATCC 33624] gi|228276830|gb|EEK15525.1| protein containing von Willebrand factor [Capnocytophaga gingivalis ATCC 33624] Length = 493 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 77/237 (32%), Gaps = 29/237 (12%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 D Q + + P+ C PL ++ S ++ ++D Sbjct: 87 TPDAQTPSPLRVSYEQAPAPWNAAHQLLCIGLKTKPLDLSQ--TPPSH------LVFLID 138 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 +S SM D+ + K + SI + VR L + L Sbjct: 139 ISGSMIDYNKLPLLKSSLKLLLHNLKAQDKVSIVTYASGVRVAL-----EPTSVREL--- 190 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + I++ ++ L G T G++ AY + A KG ++ II TDG+ + Sbjct: 191 -EKIEKVLDGLEAGGATSGEQGIQLAYEQAHKA------FIKGGNNR---IILATDGDFN 240 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCASP--DRFYSVQNSRK 350 + + + + + G + +G D + A + + N + Sbjct: 241 IGINNPNDLKAFIEKQREGGVYLSVLGFGMGNYRDDMAETLADSGNGNYAYIDNLTE 297 >gi|254504856|ref|ZP_05117007.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] gi|222440927|gb|EEE47606.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] Length = 455 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 75/221 (33%), Gaps = 36/221 (16%) Query: 9 FFYNCKGSISILTAI-LLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 + G+++ILTA+ +P++ I +G ++ +AKL LD + L A+ + N Sbjct: 1 MGADRSGNVAILTALAFVPLMLITIG-SLDVVRMTTAQAKLQSTLDSATLAAAS-LSNTA 58 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + + + ++ N + T + K + Sbjct: 59 DIEDTVDE-------------------YIQANLPDTAPWTTLKLTMGDVTDSLNAKSVEI 99 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 +A EM + + ++ ++ +++ +VLD+S SM Sbjct: 100 TATVDIEMTILKLA--------GIDKTSVLASSVAQQAAQNIEVSVVLDISSSMGGS--- 148 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 K+ + + +D + + ++ F + Sbjct: 149 ---KITSLREAAKGFIDTMLKEDEDKEYTSLSIIPFGGTVN 186 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 49/178 (27%), Gaps = 45/178 (25%) Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK-------KY 286 + ++ I+ + T G + + + K Sbjct: 277 SNNTTDLKALIDDMDLSDGTGMDIGALWGAKVLSGSMRGQLGGDFSDRPADFNDEDTLKV 336 Query: 287 IIFLTDGENSS------------------------PNIDNKESLF---------YCNEAK 313 + +TDG ++ NI+ + C Sbjct: 337 AVIMTDGAITAQFRPRDYTTTGKIKNKTQQTIVSKGNINTASTKADDAVAYFKRVCEYLN 396 Query: 314 RRGAIVYAIGVQAEAA---DQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 VY IG Q + DQ LK CAS +Y V+ + DAF I + R+ Sbjct: 397 DNNVQVYTIGFQINSGSLPDQLLKYCASSLSNYYFVEGLN-IEDAFNAIASAVNNLRV 453 >gi|221131834|ref|XP_002154795.1| PREDICTED: similar to saxiphilin [Hydra magnipapillata] Length = 1094 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 67/209 (32%), Gaps = 29/209 (13%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 FP + + +T+ + + LD+++V+D S S+ + +D Sbjct: 14 TLQVFNFPLATITESSSSFVTTPLCPNP-----LDIVIVIDSSGSVQKEWNDIIDHAQYF 68 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIF---G 250 + R G+V FS+ + +T + + + L Sbjct: 69 ASTFNV----------SEQHTRIGIVDFSAVANVYKTVDNENTEEQVYNALESLRARPQN 118 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L+ A + E+ +K ++ TDG+ + N D + Sbjct: 119 GETWLNLALQRTIELFGSATPQREN-------VRKIMVLYTDGKMT--NKDEESLRDLIK 169 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + Y + V ++ + L AS Sbjct: 170 SHRLVSVESYIVQVNNDSHESTLHEVASS 198 >gi|45384490|ref|NP_990665.1| collagen alpha-1(XIV) chain precursor [Gallus gallus] gi|1705533|sp|P32018|COEA1_CHICK RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|288873|emb|CAA50064.1| collagen XIV [Gallus gallus] Length = 1888 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 78/201 (38%), Gaps = 24/201 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV-NNVVRSGLVTFSSKIV 228 D+++++D S S+ R +R L+ + S +V + R GL +S Sbjct: 158 DIVILVDGSWSIGRF----------NFRLVRLFLENLVSAFNVGSEKTRVGLAQYSGDPR 207 Query: 229 QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + L A+G + R + + GL Y I + K E A+ K Sbjct: 208 IEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTY--ILENSFKPEAGARPG--VSKIG 263 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSV 345 I +TDG++ + + + G ++AIGV+ ++ + + PD Y+V Sbjct: 264 ILITDGKSQD------DVIPPAKNLRDAGIELFAIGVKNADINELKEIASEPDSTHVYNV 317 Query: 346 QNSRKLHDAFLRIGKEMVKQR 366 + ++ + + + + Sbjct: 318 ADFNFMNSIVEGLTRTVCSRV 338 >gi|288875|emb|CAA50063.1| collagen XIV [Gallus gallus] Length = 1857 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 78/201 (38%), Gaps = 24/201 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV-NNVVRSGLVTFSSKIV 228 D+++++D S S+ R +R L+ + S +V + R GL +S Sbjct: 158 DIVILVDGSWSIGRF----------NFRLVRLFLENLVSAFNVGSEKTRVGLAQYSGDPR 207 Query: 229 QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + L A+G + R + + GL Y I + K E A+ K Sbjct: 208 IEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTY--ILENSFKPEAGARPG--VSKIG 263 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSV 345 I +TDG++ + + + G ++AIGV+ ++ + + PD Y+V Sbjct: 264 ILITDGKSQD------DVIPPAKNLRDAGIELFAIGVKNADINELKEIASEPDSTHVYNV 317 Query: 346 QNSRKLHDAFLRIGKEMVKQR 366 + ++ + + + + Sbjct: 318 ADFNFMNSIVEGLTRTVCSRV 338 >gi|330789584|ref|XP_003282879.1| hypothetical protein DICPUDRAFT_51972 [Dictyostelium purpureum] gi|325087163|gb|EGC40543.1| hypothetical protein DICPUDRAFT_51972 [Dictyostelium purpureum] Length = 536 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 68/194 (35%), Gaps = 12/194 (6%) Query: 172 MMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 ++V+D+S SM+ + PG ++ + I + GLV F S Sbjct: 117 VIVIDLSGSMSAPAFMGSYKPGELEMKRIEFAQALFQTFIDKMVSYELPAVCGLVCFGSV 176 Query: 227 IVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 TF + ++ + T + A I D + K + K Sbjct: 177 AKLTFGITKNFDSFSTELGEIQANMGGTSLWEAIVLAAKTIVDFRNNPPSDIKLAEPEKL 236 Query: 286 Y--IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 + + LTDG+++S N ++ Y K+ ++ +I + + + + + Sbjct: 237 FCRVFCLTDGQDTS-NYPLYDAYSY---LKKNRIVLDSIPIGEISQLLLALSTGTGGSCF 292 Query: 344 SVQNSRKLHDAFLR 357 +++ F R Sbjct: 293 VADTAQEGIGLFER 306 >gi|254468097|ref|ZP_05081503.1| von Willebrand factor, type A [beta proteobacterium KB13] gi|207086907|gb|EDZ64190.1| von Willebrand factor, type A [beta proteobacterium KB13] Length = 326 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 47/239 (19%), Positives = 77/239 (32%), Gaps = 47/239 (19%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHF---------GPGMDKLGVATRSIREMLDIIKS 208 + +K +G + +VLD S SM+D F G G K A R I ++ Sbjct: 69 PQEKVTKVGVGSQIGLVLDRSASMDDPFSGSTQFDDEGVGETKSAAAARLIINFVES--- 125 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKI 266 G++TFS+ + PL I +N + T GL Sbjct: 126 ----RKNDMIGVITFSNSAMFVLPLTQNKSAITGAVNATAGNALFQTNIGAGLSSVSELF 181 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGE------NSSPNIDNKE----SLFYCNEAKRRG 316 AK D + +I L+DG D + L++ + G Sbjct: 182 ----------AKVEDSGSRAVILLSDGAGRIDAPTQQKIRDWFDRFDIGLYWIVLRQPGG 231 Query: 317 AIVYAIGVQAEAADQ---------FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 ++ ++ Q + K SP + Y ++ L A I + KQ Sbjct: 232 ISIFDENLKIRDETQPPPQIELFDYFKTFRSPFQAYEAEDPASLEKAIKDINLKEKKQI 290 >gi|7495466|pir||T32949 hypothetical protein C05G6.3 - Caenorhabditis elegans Length = 341 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 58/166 (34%), Gaps = 27/166 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI-----REMLDIIKSIPDVNNVVRSG 219 + LD+++VLD S + + + D + + + VR Sbjct: 109 TGCELDLVLVLDFSTTTDPVYNSYKDLSKRLVSQLKIGPHYTQVAAVTFATVGRTRVRFN 168 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L + ++ + + I+ L G TT G+E A ++ +++ IA Sbjct: 169 LKKYQTQ-----------EEVLRGIDNLKSRGGTTAIGAGIEKALTQLDESEGARPGIAT 217 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K ++ TDG ++ K + +A G +Y + Sbjct: 218 ------KVMVVFTDGWSNKGPDPEKRAR----DAVSSGFEMYTVAY 253 >gi|134118676|ref|XP_771841.1| hypothetical protein CNBN0230 [Cryptococcus neoformans var. neoformans B-3501A] gi|50254445|gb|EAL17194.1| hypothetical protein CNBN0230 [Cryptococcus neoformans var. neoformans B-3501A] Length = 502 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 61/209 (29%), Gaps = 37/209 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN--NVVRS 218 S +D++ +LD + SM + D + + D+I+ +N + +R Sbjct: 47 SGSSHGKCIDLVFILDCTGSMQKYINSVRDHI-------IGICDMIRGEEGLNGPDDLRV 99 Query: 219 GLVTF-------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 +V + S+ + + P + +Q + L + A E Sbjct: 100 AVVNYRDHPPQDSTYVYKFHPFTSDIPEVQNYLKGLTASGGGDGPEAVTAAMAATLTELE 159 Query: 272 KLEHIAKGHDDYKKYIIFLTD------GENSS----PNIDNKESLFYCNEAKRRGAIVYA 321 + + + + D GE + D + L + G + Sbjct: 160 WR-------REAARMAVLVADAPPHGIGEGGDQFKQGDPDGHDPLVVARMMAQNGITM-- 210 Query: 322 IGVQAEAADQFLKNCASPDRFYSVQNSRK 350 D + D F ++ N Sbjct: 211 --FMVACEDTLSGYSHAVDFFQAICNMTS 237 >gi|316935369|ref|YP_004110351.1| von Willebrand factor type A [Rhodopseudomonas palustris DX-1] gi|315603083|gb|ADU45618.1| von Willebrand factor type A [Rhodopseudomonas palustris DX-1] Length = 636 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 68/196 (34%), Gaps = 31/196 (15%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F A L +T V +S +D +D VLDV + L + + Sbjct: 436 FHLAARPQARDLAVTLLVDVSLSTDAWIDNRRVLDVE-------KEALTVLAHGIEACGD 488 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 I+ + VR V PL ++ +I L G T+ L + Sbjct: 489 QHSILTFTSRRRDWVRVETVK--GFDESMSPL------VERRIAALKPGYYTRIGAALRH 540 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A ++ ++ KK ++ LTDG+ + ++S E++R G Sbjct: 541 ASAELARQPQR-----------KKLLLVLTDGKPNDVDHYEGRFALEDSRRAVQESRRSG 589 Query: 317 AIVYAIGVQAEAADQF 332 V+ + + +A F Sbjct: 590 IAVFGVTIDVDAQAYF 605 >gi|73973310|ref|XP_867431.1| PREDICTED: similar to alpha 1 type XII collagen short isoform precursor isoform 2 [Canis familiaris] Length = 1901 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 70/198 (35%), Gaps = 24/198 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ + P V+ L +S Sbjct: 30 TRAEADIVLLVDGSWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYS 80 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I + + + Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--R 135 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ D+ PD Sbjct: 136 ARKIGVLITDGKSQDDVEAPSK------KLKDEGVELFAIGIKNADEDELKMIATDPDDT 189 Query: 341 RFYSVQNSRKLHDAFLRI 358 Y+V + L + Sbjct: 190 HAYNVADFESLSKIVDDL 207 >gi|87306384|ref|ZP_01088531.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] gi|87290563|gb|EAQ82450.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] Length = 1030 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 50/133 (37%), Gaps = 18/133 (13%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + K++R+ G + P L+ A + ++ K++I ++DG+ Sbjct: 506 NRPGMLAKMSRMTPGDMPQFDPSLQMALRAF--------NQLPPNEVAVKHMIIISDGDP 557 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKNCAS--PDRFYSVQNSRKL 351 S N ++ + G V + + A LK AS ++Y V N + L Sbjct: 558 SPANPFTLSAIA------KAGIKVTTVAIGTHGPANSLELKKIASATGGKYYEVTNPKAL 611 Query: 352 HDAFLRIGKEMVK 364 + R + + + Sbjct: 612 PRIYQREARRIAQ 624 >gi|268324906|emb|CBH38494.1| conserved hypothetical protein [uncultured archaeon] Length = 709 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 70/194 (36%), Gaps = 34/194 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQT 230 M V+D S SM + ++ A ++ +L + + G+V F + Sbjct: 513 MFVVDASGSMGANR-----RMESAKGAVLSLL-----LDSYQQRDKVGMVAFKGDQADVL 562 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL E++ L G T GLE N + K + E ++ + Sbjct: 563 LPLCSSSDLAVERLRELPTGGRTPLAAGLEQGLNLLMAEKHRDEEAIP-------ILLLI 615 Query: 291 TDGENS----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLK----NCA---- 337 +DG + +E L +A+ +G +Y I + E +D F++ C Sbjct: 616 SDGRANVSAGGSKELEQELLALAEQARAKG--IYVIVIDTEIVSDSFIQMQLGYCRAIAN 673 Query: 338 -SPDRFYSVQNSRK 350 S ++Y + + Sbjct: 674 YSGGKYYPIADLTS 687 >gi|254167839|ref|ZP_04874688.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623130|gb|EDY35696.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 1953 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 56/162 (34%), Gaps = 27/162 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHF-----------GPGMDKLGVATRSIREMLDIIKSIPDV 212 +D++ V+D S SMN G ++ VA ++ +D +K + Sbjct: 1244 NKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRIDVAIQAA---IDAVKELGPQ 1300 Query: 213 NNVVRSGLVTFSSK--IVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + R + TF+ + + +Q I + + G T L +A + Sbjct: 1301 D---RVAVFTFNGNSHPEEYMGFTYVTADNLQTIISDLKDIQAGGGTPLYDTLSWAVYYM 1357 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 +K +D + I+ LTDG ++ Y Sbjct: 1358 ----DKYSTDNPDREDATRGILVLTDGLSNYDTYGTSNGARY 1395 >gi|255530103|ref|YP_003090475.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255343087|gb|ACU02413.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 613 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 42/189 (22%), Positives = 78/189 (41%), Gaps = 19/189 (10%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW AN + + KI + + +++ ++DVS SMN +KL + S + + Sbjct: 216 PWNANHKLVQIGLQGK-KIPTDNLPASNLVFLIDVSGSMNQP-----NKLPLLIASFKLL 269 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + ++ V VV +G +S +V I+E +N+L G +T G++ A Sbjct: 270 TEQLRPEDKVAIVVYAG----NSGLVLPSTPGNEKTKIKEALNKLSAGGSTAGGAGIQLA 325 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y D + KG ++ II TDG+ + +K+ E ++ G + + Sbjct: 326 YQVATD------NFIKGGNNR---IILATDGDFNVGASSDKDMESLIEEKRKSGVFLTVL 376 Query: 323 GVQAEAADQ 331 G Sbjct: 377 GYGMGNMKD 385 >gi|221127586|ref|XP_002157796.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 449 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 68/198 (34%), Gaps = 21/198 (10%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D+ + D S S+ + M K ++ S R + + S Sbjct: 267 TPVDVALAFDSSSSVGELAYEEMKKFAH---------QVVDSFSISQQNARFAALVYGSN 317 Query: 227 IVQTFPL-AWGVQ-HIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F + I++ I L S T+ LE A + +F L+ + Sbjct: 318 ASVEFNFVRYDSALEIKQAIQSLSYLKSNTRIDKALEVAKSDLFS----LQGKVRSRRPM 373 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 Y+ F DG + D + + K G + AIGV E LK + + + Sbjct: 374 ILYVFF--DGTVTRSMSDLESVVQP---LKDYGVKIIAIGVGPEVNRYQLKKISEDNAIF 428 Query: 344 SVQNSRKLHDAFLRIGKE 361 S ++ ++L I ++ Sbjct: 429 SGKSFKELAPLLYSIVEQ 446 >gi|196233140|ref|ZP_03131987.1| protein of unknown function DUF1355 [Chthoniobacter flavus Ellin428] gi|196222784|gb|EDY17307.1| protein of unknown function DUF1355 [Chthoniobacter flavus Ellin428] Length = 992 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 63/214 (29%), Gaps = 44/214 (20%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L ++ LDVS SM V +++ ++ + P + GLV F Sbjct: 106 LSVVYALDVSDSMGS---------KVRDQALNWIMQTATTKPQKDEA---GLVVFGRDAA 153 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P E IN + T + A + + + I+ Sbjct: 154 VELPPR--SSFPFEAINSRVAKDGTDLGQAMSLAAAMLPEEHQGR-------------IV 198 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAEAADQF----LKNCASPDRF 342 +TDG + + +L +E K RG V + + L Sbjct: 199 LITDGNETEGS-----ALAKVDELKARGVAVDVLPVAFSYDKEVWLERLDLPRVVKAGET 253 Query: 343 YSVQNSRKLHDA------FLRIGKEMVKQRILYN 370 Y A GK++ ++ + YN Sbjct: 254 YEASVLLDSLAAGHGTLRLRENGKQIFEKEVDYN 287 >gi|326675264|ref|XP_002665076.2| PREDICTED: collagen alpha-1(XXVIII) chain-like [Danio rerio] Length = 1046 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 75/232 (32%), Gaps = 32/232 (13%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSK------SDIGLDMMMVLDVSLSMNDHFGPGMD 190 + C W S+ S + + + L++ +LD S S Sbjct: 6 LLVCVSLWLLVSAARCQNRRQSGETPNNLTTKQDNACSLEVAFILDSSESAKGLLFSRQK 65 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-- 248 + RS L ++ + + R L+ +SS + Q + +++L Sbjct: 66 EF---VRSFSRRLMEMQ-VSGWHLRTRLALIYYSSSVHINQHF-NDWQDLDVFLDQLEDA 120 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 G T ST + A + + +TDG + N D + Sbjct: 121 SYIGQGTYSTYAISNATQLFI--------RETSGQSV-RVSLLMTDGSDHPRNPD---IM 168 Query: 307 FYCNEAKRRGAIVYAIGVQ---AEAADQFLKNCASP--DRFYSVQNSRKLHD 353 EAK ++AIG+ ++ L+ AS +++ R L + Sbjct: 169 TVVAEAKSHNIKIFAIGLSMRAMDSNSAKLRAVASSPAQQYFHSLTDRGLEE 220 >gi|163801668|ref|ZP_02195566.1| hypothetical protein 1103602000597_AND4_09447 [Vibrio sp. AND4] gi|159174585|gb|EDP59387.1| hypothetical protein AND4_09447 [Vibrio sp. AND4] Length = 524 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 72/211 (34%), Gaps = 18/211 (8%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 + KG +I A+LL I + +E + ++L + + L + ++ Sbjct: 9 RSLHKQKGVAAIWMALLLVPIMGITFWAVEGTRYIQESSRLRDSAEAAALA----VTIED 64 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 ++ +++I + + E RE G +++ Y + Sbjct: 65 KPGAASVMAENYVRSYVRDIKSINVQAERREPGNSRNEEA------------ADFIQYTV 112 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHF 185 +A + ++ F P + + +SS +D++ V D S SM + Sbjct: 113 NATTTHDSWFANSFIPSFDETQDIAGRSLARKYLSSVGGKNIDIVFVSDFSGSMNFDWMD 172 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 G K+ +IR + + N V Sbjct: 173 PNGNKKIDDLKTAIRAISNKFICQDVRNEFV 203 >gi|73967405|ref|XP_537778.2| PREDICTED: similar to integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Canis familiaris] Length = 1178 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 44/208 (21%), Positives = 79/208 (37%), Gaps = 28/208 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ ++LD S S++ P A I M+ + LV + Sbjct: 203 AGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFECSF---ALVQYGEV 253 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L + + + N G+ TK+ +++ + IF H ++ + Sbjct: 254 IQTEFDLRDSQDVMTSLAKVQNITQVGNVTKTASAMQHVLDNIFTPN----HGSRKNA-- 307 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQF--LKNCAS- 338 K ++ LTDG+ D N K +G +AIGV E + LK AS Sbjct: 308 SKVMVVLTDGDIFG---DPLNLTTVINSPKMQGVERFAIGVGNAFEKNKTYHELKLIASD 364 Query: 339 -PDRF-YSVQNSRKLHDAFLRIGKEMVK 364 DR+ + V N L ++ + +++ Sbjct: 365 PDDRYAFKVTNYTALDGLLSKLQQTIIQ 392 >gi|332227198|ref|XP_003262778.1| PREDICTED: vitrin isoform 2 [Nomascus leucogenys] Length = 679 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 67/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 496 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 545 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 546 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 593 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G YAIGV A ++ P D Sbjct: 594 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVFTYAIGVAWAAQEELEVIATHPARD 647 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 648 HSFFVDEFDNLYQYVPRIIQNI 669 >gi|212703143|ref|ZP_03311271.1| hypothetical protein DESPIG_01182 [Desulfovibrio piger ATCC 29098] gi|212673409|gb|EEB33892.1| hypothetical protein DESPIG_01182 [Desulfovibrio piger ATCC 29098] Length = 512 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 78/216 (36%), Gaps = 37/216 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K +S + + ++LD S SM G + A ++ + L+ + I G Sbjct: 329 KRASVHKLDTAVHILLDSSGSM---HGESIKLAVQACYAVGKALEHLSGIS-------LG 378 Query: 220 LVT---FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + + FPL Q + +++ + T L + ++ +E Sbjct: 379 ITSFPAYRDGKIGVFPLVRHGQKMTDRMQ-MQAHGGTPLAEALWWVMRQMLILRETR--- 434 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 K ++ LTDG D + L ++ G VY IG++ + L + Sbjct: 435 --------KVVLILTDGVPD----DVTQCLQALEALRKTGVEVYGIGMKFDCISSLLPDT 482 Query: 337 ASP--DRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 +S RF +L A + + + + Y+ Sbjct: 483 SSRVISRF------EELSPALFEVLQHALLRETRYD 512 >gi|163745746|ref|ZP_02153106.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] gi|161382564|gb|EDQ06973.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] Length = 554 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 4/69 (5%) Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPD-RFYSVQNSRKLHDAFLRI 358 ++ C A G ++Y+IG+ + + LK+CAS + ++ V+ ++ AF I Sbjct: 485 DRRLRQICGVANAAGVVIYSIGMDVDNTNSLNLLKDCASSESHYFDVEGL-EIQTAFDMI 543 Query: 359 GKEMVKQRI 367 + R+ Sbjct: 544 AASISMLRL 552 >gi|78060312|ref|YP_366887.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] gi|77964862|gb|ABB06243.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] Length = 423 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 12/105 (11%), Positives = 43/105 (40%), Gaps = 2/105 (1%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R + +G+++I+ + L V+ +GL ++ + +++L D L A + + Sbjct: 11 MRRSLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS 70 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 + + + + + ++ + ++ N +++ Sbjct: 71 --AISLSVAEADGIAAGHLNFVFFQNKSVQMSTNANVTFSDSLTD 113 >gi|198426244|ref|XP_002124558.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 430 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 70/206 (33%), Gaps = 32/206 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D ++V+D S S+ P DK+ I ML +R G ++ + Sbjct: 235 ARTDAIIVVDSSSSVKR---PNWDKM---IAFIVNMLRQFTIDQSS---LRIGAFRYNRR 285 Query: 227 IVQTFPL-----AWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + + I L + G T + + + N + A+ Sbjct: 286 VHTDTQILLNQFNNDKDGLLAAIQSLPYNGGGTNTGRAIAHVTNVMLKAE------NGNR 339 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----DQFLKNC 336 D + ++ +TDG ++ L + + GA+V+ + V + Q L+ Sbjct: 340 PDVQDLVVLITDG-----RAQDRVDLVSAD-LRATGAVVFVVAVILPGSTIRLSQMLEIS 393 Query: 337 ASPDRFYSVQNSRK-LHDAFLRIGKE 361 + + V + L AF + + Sbjct: 394 GTNETLLIVDSGFDGLDTAFSSMLTK 419 >gi|194218991|ref|XP_001915421.1| PREDICTED: similar to integrin, alpha D [Equus caballus] Length = 1160 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 83/226 (36%), Gaps = 30/226 (13%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 SH ++ + + +D++ ++D S S+ ++ R++ + + Sbjct: 129 SHLKIIRRVPTALPECPNQEIDIVFLIDGSGSIA---PSEFKQMKDFVRAVMGQFEGTNT 185 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ--EKINRL-IFGSTTKSTPGLEYAYNK 265 + L+ +SS + F + + ++ + T + G+ N+ Sbjct: 186 LFS--------LMQYSSGLKTHFTFSKFRSSLSPLRLVDPIVQLQGLTFTATGILAVVNE 237 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG-- 323 +F +K AK K +I +TDG+ D E +A+R G I YAIG Sbjct: 238 LFHSKNGARRSAK------KILIVITDGQ---KYKDPWEYRDVIPQAERAGIIRYAIGVG 288 Query: 324 --VQAEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 Q A Q L S D + V N L ++ +++ Sbjct: 289 DAFQEPIARQELNTIGSAPSQDHVFKVDNFAALSSIQKQLQEKIFA 334 >gi|153876590|ref|ZP_02003832.1| von Willebrand factor, type A [Beggiatoa sp. PS] gi|152066946|gb|EDN66168.1| von Willebrand factor, type A [Beggiatoa sp. PS] Length = 305 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 74/214 (34%), Gaps = 40/214 (18%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +KI + ++++VLD S M K+ A ++L+I+K + Sbjct: 114 EAIIKIWHQYKKPANIVLVLDTSGGMRGE------KILHARTMALQLLEIVKEADYFS-- 165 Query: 216 VRSGLVTFSS-------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 L++F+ I W ++ + N G T + AY + Sbjct: 166 ----LLSFNHSLNWIAKNIQVKSQQKW----LKRQFNYQFPGGGTALYDAIFNAYTFLQK 217 Query: 269 AKEKLEHIAKGHDDYKK--YIIFLTDGENSSPNIDNKESLFYCN-EAKRRGAIVYAIGVQ 325 + K +I L+DG +S ++ K+ L + ++A+G Sbjct: 218 -----------NSFPDKIAVMIVLSDGGDSHSELNFKDLLSKIPFNSDTSPIRIFAVGYG 266 Query: 326 AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + + L A + +FY + F + Sbjct: 267 SITDKKRLNEIAKMTQGKFYD-GAMVDVDKIFKK 299 >gi|145539690|ref|XP_001455535.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124423343|emb|CAK88138.1| unnamed protein product [Paramecium tetraurelia] Length = 1164 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 71/191 (37%), Gaps = 19/191 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMD--KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +++++D+S SM + F D +LGV D + ++ NV+ L F + Sbjct: 725 AIIVLIDISGSMEEEFYNSEDLTRLGVVKAFFNAFADRTMAY-NLKNVI--SLAYFDDRY 781 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + ++ +N+ T L+ A + + K+K + I Sbjct: 782 ILKCGYTELFMQFKDLVNKAKPQGMTALYVALKNAIDSLLQFKKKYPNCILR-------I 834 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIV--YAIGVQAEAADQFLKNCASPDRFYSV 345 I LTDGE++ + F + I+ + + + + Q K A+ + + Sbjct: 835 IALTDGEDNKGRYSPE---FIAQTILQNQIILDSFVVYDKCDGLKQITK--AAGGQCFCP 889 Query: 346 QNSRKLHDAFL 356 + ++ F Sbjct: 890 KTIQEGLKLFE 900 >gi|224046671|ref|XP_002199336.1| PREDICTED: collagen, type XIV, alpha 1 (undulin) [Taeniopygia guttata] Length = 1883 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 72/182 (39%), Gaps = 24/182 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV-NNVVRSGLVTFSSKIV 228 D+++++D S S+ R +R L+ + S +V + R GL +S Sbjct: 158 DIVILVDGSWSIGRF----------NFRLVRLFLENLVSAFNVGSEKTRVGLAQYSGDPR 207 Query: 229 QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + L A+G + R + + GL Y I + K E A+ K Sbjct: 208 IEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTY--ILENSFKPEAGARPG--VSKIG 263 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSV 345 I +TDG++ + + + G ++AIGV+ ++ + + PD Y+V Sbjct: 264 ILITDGKSQD------DVIPPAKNLRDAGIELFAIGVKNADINELKEIASEPDSTHVYNV 317 Query: 346 QN 347 + Sbjct: 318 AD 319 >gi|8569519|pdb|1QC5|B Chain B, I Domain From Integrin Alpha1-Beta1 Length = 192 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 73/210 (34%), Gaps = 37/210 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++VLD S S + T + ++L+ + P G+V + + Sbjct: 4 LDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ---VGIVQYGENVT 52 Query: 229 QTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + + +++ T + G++ A + F K Sbjct: 53 HEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFTEARGARRGVK------ 106 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---------QFLKN 335 K ++ +TDGE + DN + + ++I + + +K+ Sbjct: 107 KVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKS 162 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRIGKEM 362 AS F++V + L +G+ + Sbjct: 163 IASEPTEKHFFNVSDEIALVTIVKTLGERI 192 >gi|119776240|ref|YP_928980.1| von Willebrand factor type A (vWA) domain-containing protein [Shewanella amazonensis SB2B] gi|119768740|gb|ABM01311.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Shewanella amazonensis SB2B] Length = 713 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/304 (12%), Positives = 92/304 (30%), Gaps = 46/304 (15%) Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 Q I+++ + ++ E+G A + + ++ + D + Sbjct: 237 TSQPESLHTGAIESVSANEDTSQGAESGAAGEGKASHQPRAMGLDKDIVF-----YWRLQ 291 Query: 133 YEMPFIFCTF----PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 +P P + L T + + G D + VLD S SMN + Sbjct: 292 EGLPGRVDMVTYRDPKVSTKGTVKLTFTPGDDLGPVTQ-GRDWVFVLDKSGSMNGKYATL 350 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-----LAWGVQHIQEK 243 ++ + + +P + R ++ F + + + Sbjct: 351 VEGVR----------QGLGKLPAQD---RFRIILFDESTQEFSKGFVPVDSNNINQALAW 397 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + + G+ T GL+ A + + ++ +TDG + + + Sbjct: 398 VEGISPGNGTDLYQGLKRALTPLDADRSTG-------------VVLITDGVANVGVTEKR 444 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKE 361 L +++ ++ + A L S SV N+ + + I + Sbjct: 445 RFLEL---MQQQDVRLFTFIMGNSANTPLLVPMTRLSNGVATSVSNADDIVGHLMNITSK 501 Query: 362 MVKQ 365 + Q Sbjct: 502 LTHQ 505 >gi|309358136|emb|CAP34492.2| CBR-DIG-1 protein [Caenorhabditis briggsae AF16] Length = 13580 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 50/342 (14%), Positives = 114/342 (33%), Gaps = 48/342 (14%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 + H+ +L + L + E +G + D+ + + Sbjct: 12710 DLVHRISESRRLKG---RAQLGAGLREALDELSISGVDGVPQIVLVVKNGKASDDYSSAV 12766 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDY---NLSAVSRYE---------MPFIFCTF-P 143 + +++ + D + + + ++ + C P Sbjct: 12767 KSLKSERNVTIFVVDSGDDESQDQNSELTDADKIVVIPQWRGADSEVLGPIADYICKIVP 12826 Query: 144 WCANSSHAPLLITSSVKIS------SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 ++ P T + ++ S D D+++VLD S ++F P D+ Sbjct: 12827 NVESARTWPTPRTKATTLAGSRRSCSTIDYESDVIIVLDSS----ENFTP--DEFDSMKD 12880 Query: 198 SIREMLDI-IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ-EKINRLIFGSTTKS 255 ++ ++D PDV+ G V +S K+ P+A G + E + +++ Sbjct: 12881 AVASIVDTGFDLAPDVSK---IGFVIYSDKV--AVPVALGHYEDKIELLEKIVDAEKIND 12935 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 + + + G ++ K ++ +T+G+N E L Sbjct: 12936 GVAIAL----YGLNAARQQFQLHGRENATKIVLLITNGKNRGNAAAAAEDLRD-----MY 12986 Query: 316 GAIVYAIGVQAEAAD----QFLKNCASPDRFYSVQNSRKLHD 353 G ++A+ V + + + L A+PD V S ++ D Sbjct: 12987 GVQLFAVAVGSNPDELATIKRLVGNANPDNAIEVAQSTEIDD 13028 >gi|32476038|ref|NP_869032.1| hypothetical protein RB9710 [Rhodopirellula baltica SH 1] gi|32446582|emb|CAD76417.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 1631 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 44/318 (13%), Positives = 94/318 (29%), Gaps = 75/318 (23%) Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN- 126 + Q + + +I + + + + + + L++ +D + Sbjct: 974 KLDTADGQGDLPEGIAVMSIHRDGIKTSDAQQAVSIHATDAPTNVELTLAVDQKRPSLAE 1033 Query: 127 LSAVS-RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 ++ RY+ + + S + + + I D G + V+D S SMND Sbjct: 1034 VNFRGNRYQSSVMSSGTAFGVTSRGQTVQDGAEITIRDAMDAGRAITFVMDCSASMNDPL 1093 Query: 186 GPGM----------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV------- 228 G M K A ++ EM+ ++ P + GLV + ++ Sbjct: 1094 GEEMGRSALGAQRASKFEAARSAVYEMMRRLQPGPS-----QVGLVLYGHRMAIRAGDPA 1148 Query: 229 --------------------QTFPLA---------------WGVQHI---QEKINRLIFG 250 FP + + ++ + + Sbjct: 1149 KDSGDGSGQTTLLQKRYHKRFPFPPTIQPFEDVEVALPTGRFDTAELELARQHFDAAVPW 1208 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-- 308 T + A I ++ D +K ++ ++DG N N + Sbjct: 1209 GQTPLYLSIWKAMEDI----------SRTGDGVRKDVVVISDGRNYQFNPTPEAIFSIGQ 1258 Query: 309 -CNEAKRRGAIVYAIGVQ 325 +AK G V+ IG Sbjct: 1259 LVTQAKTLGVQVHVIGYG 1276 >gi|332846919|ref|XP_003315346.1| PREDICTED: integrin alpha-E [Pan troglodytes] Length = 1241 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/164 (23%), Positives = 57/164 (34%), Gaps = 21/164 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ +VLD S S++ P A I M+ N LV + Sbjct: 261 AGTEIAIVLDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFECNF---ALVQYGGV 311 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L + + N GS TK+ +++ + IF + A Sbjct: 312 IQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------ 365 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K ++ LTDG D N K +G +AIGV E Sbjct: 366 SKVMVVLTDG---GIFEDPLNLTTVINSPKMQGVERFAIGVGEE 406 >gi|302517857|ref|ZP_07270199.1| von Willebrand factor [Streptomyces sp. SPB78] gi|302426752|gb|EFK98567.1| von Willebrand factor [Streptomyces sp. SPB78] Length = 610 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 77/213 (36%), Gaps = 26/213 (12%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++V+D S SM + G G ++ V S+ + L S D GL FS+++ Sbjct: 404 LLVVVDSSPSMAELVPGRGQSRMDVTKASLLQALAQFTSADD------IGLWEFSTRLDG 457 Query: 230 TFP----LAWGVQHIQEKINRLIFGSTTKSTPGLEY---AYNKIFDAKEKLEHIAKGHDD 282 + ++ T + L+ ++D A Sbjct: 458 DRDYRELVPTDRLGARKGEGVTQRDKLTAAFGALQPQTGGATGLYDTTLAAYQQASKGYA 517 Query: 283 YKKY--IIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQAEAADQFLKN 335 K+ ++ LTDG N P + +L + A+ + AI V EAA +K Sbjct: 518 ADKFNAVVLLTDGTNEDPGSLTRGALLTKLRDLADPARP--LPLVAIAVGPEAAGDDVKA 575 Query: 336 C--ASPDRFYSVQNSRKLHDAFLR-IGKEMVKQ 365 A+ + V + ++H+ I + KQ Sbjct: 576 IGSATGGSGFKVDDPAQIHEVINNAIVEAGSKQ 608 >gi|194217500|ref|XP_001502733.2| PREDICTED: integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Equus caballus] Length = 1163 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 70/206 (33%), Gaps = 28/206 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ ++LD S S++ P A I M+ + LV + Sbjct: 186 AGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFECSF---ALVQYGEV 236 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L + + N G+ TK+ +++ + IF A Sbjct: 237 IQTEFDLQDSQDVMASLARVQNITQVGNVTKTASAMQHVLDHIFTPSHSSRKKA------ 290 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFLKNCASP 339 K ++ LTDG+ D N K +G +AIG Q D+ LK AS Sbjct: 291 SKIMVVLTDGDIFG---DPLNLTTVINSPKMQGVERFAIGVGEAFQKAKTDKELKLIASD 347 Query: 340 ---DRFYSVQNSRKLHDAFLRIGKEM 362 + V N L ++ + + Sbjct: 348 PDETHAFKVTNYAALDGLLSKLQQNI 373 >gi|332227202|ref|XP_003262780.1| PREDICTED: vitrin isoform 4 [Nomascus leucogenys] Length = 657 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 67/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 474 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 523 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 524 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 571 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G YAIGV A ++ P D Sbjct: 572 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVFTYAIGVAWAAQEELEVIATHPARD 625 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 626 HSFFVDEFDNLYQYVPRIIQNI 647 >gi|254448205|ref|ZP_05061667.1| TPR repeat-containing protein [gamma proteobacterium HTCC5015] gi|198262072|gb|EDY86355.1| TPR repeat-containing protein [gamma proteobacterium HTCC5015] Length = 630 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 61/167 (36%), Gaps = 20/167 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D ++++LD+S SM R+ ++LDI+K+ PD R+ L+ ++ Sbjct: 100 DQDRALVILLDLSRSMLSEDIRP----NRLHRARLKVLDILKARPDG----RTALIGYAG 151 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL I L + + P ++ D ++ AK D + Sbjct: 152 DAFLVSPLTRD----NRTIALLTPELSPELMPVQGSRLDRAIDKAGEVLRNAKAGDRAQ- 206 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ L+DG I + + + A+ G + IG Sbjct: 207 -LLLLSDG------IPDAQRERAMDRAQALGLPISVIGFGTTEGAPI 246 >gi|153867800|ref|ZP_01998040.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] gi|152144880|gb|EDN71960.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] Length = 276 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 63/195 (32%), Gaps = 19/195 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 ++M+ V+DV + + ++ A R++ L + ++ +R L F S Sbjct: 20 VEMIFVVDV----SGSMSSSLSEMVQAARTVANEL----ATSNMPGQIRFALTVFDSHNN 71 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + +N L + + + A+ K ++ Sbjct: 72 IKTSLNDDINTFYGGLNSLTIDGGSDISMAFAPINQLLSQARPHAA----------KIVV 121 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 F TDG + + + +G ++A+ + A P+R N Sbjct: 122 FYTDGYVFHSGKMDA-IVNDAEALRNQGVQIFAVSPPEDDASAMSLITGYPNRVLRPNNL 180 Query: 349 RKLHDAFLRIGKEMV 363 + + F + +V Sbjct: 181 PDIVNRFRYVADAVV 195 >gi|114579064|ref|XP_515640.2| PREDICTED: hypothetical protein LOC459430 isoform 2 [Pan troglodytes] Length = 1294 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 55/170 (32%), Gaps = 30/170 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYQSKFNFVKFDGQAVAWREQLA 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 562 EVNEDNLKQAQSWIRHIKIGSSTNTLSALKTAFA----DKETRA------------IYLL 605 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P + + E +Y I A++FLK A+ Sbjct: 606 TDGRPDQPPETVIDQVKLFQE-----IPIYTISFNYNDEIANRFLKEVAA 650 >gi|114579066|ref|XP_001156305.1| PREDICTED: hypothetical protein isoform 1 [Pan troglodytes] Length = 1059 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 55/170 (32%), Gaps = 30/170 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYQSKFNFVKFDGQAVAWREQLA 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 562 EVNEDNLKQAQSWIRHIKIGSSTNTLSALKTAFA----DKETRA------------IYLL 605 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P + + E +Y I A++FLK A+ Sbjct: 606 TDGRPDQPPETVIDQVKLFQE-----IPIYTISFNYNDEIANRFLKEVAA 650 >gi|115525137|ref|YP_782048.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] gi|115519084|gb|ABJ07068.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] Length = 332 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 73/220 (33%), Gaps = 39/220 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G +++V+D S SM+D F A++S ++S G+ FS+ Sbjct: 79 GRGAHIVLVIDRSSSMDDSFAGSRPTAQEASKSAEAR-RFLRSFVANAEHDMFGVAIFST 137 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +Q PL + + ++ + T G+ A + D Sbjct: 138 SPLQALPLTSHREAVLAAVDAIARPGLSETDIARGIAMALSMHDDDP----------SSA 187 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------------- 329 + II ++DG + ID ++R +Y + ++ A Sbjct: 188 SRAIILVSDG---AGVIDRLVQEKLRAAFRKRPVHLYWVFLRTANALGIFDPPAAGERDV 244 Query: 330 ---------DQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 +F ++ P R + + + DA IG+ Sbjct: 245 PQVAPERHLHRFFQSLRIPYRAFEAERPEAIGDAIAEIGR 284 >gi|149053317|gb|EDM05134.1| rCG33209 [Rattus norvegicus] Length = 1169 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 42/218 (19%), Positives = 77/218 (35%), Gaps = 28/218 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 +V+ + + G ++ +VLD S S+ A I M+ N Sbjct: 184 QAVEEEDEEEAGTEIAIVLDGSGSIEPS------DFQKAKDFISTMMRNFYEKCFECNF- 236 Query: 217 RSGLVTFSSKIVQTFPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKL 273 LV + I F L + + K+ + TK+ +++ + IF Sbjct: 237 --ALVQYGGVIQTEFDLLDSRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGS 294 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQ 331 A K ++ LTDG+ D + +K +G + +AIGV E + Sbjct: 295 RKKAL------KVMVVLTDGD---IFRDPLNLTTVISSSKMQGVVRFAIGVGNAFENNNT 345 Query: 332 F--LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 + LK AS + V N L ++ + ++ Sbjct: 346 YRELKLIASDPKAAHTFKVTNYSALDGLLSKLQQRIIH 383 >gi|124783268|ref|NP_073725.2| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 precursor [Mus musculus] gi|171769535|sp|A2AVA0|SVEP1_MOUSE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=Polydom; Flags: Precursor gi|123210319|emb|CAM21214.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] gi|123229801|emb|CAM23597.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] Length = 3567 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 80/227 (35%), Gaps = 40/227 (17%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 S V+ + L+++ ++D S S+ +++L +R++L P Sbjct: 65 LGRAFRSRVRRLRELSGSLELVFLVDESSSVGQT--NFLNELK----FVRKLL---SDFP 115 Query: 211 DVNNVVRSGLVTFSSKIVQ-----TFPLAWGVQH----IQEKINRLIF-GSTTKSTPGLE 260 V+ R +VTFSSK + QH + +I + + G T + + Sbjct: 116 VVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQ 175 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + ++E K I +TDG ++ + + G ++ Sbjct: 176 QAAQILRHSRENS----------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIF 220 Query: 321 AIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 G+ + ++P + Y + + + F + + + + Sbjct: 221 TFGIWQGNIRELNDMASTPKEEHCYLLHSFEE----FEALARRALHE 263 >gi|26330612|dbj|BAC29036.1| unnamed protein product [Mus musculus] Length = 440 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 80/227 (35%), Gaps = 40/227 (17%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 S V+ + L+++ ++D S S+ +++L +R++L P Sbjct: 65 LGRAFRSRVRRLRELSGSLELVFLVDESSSVGQT--NFLNELK----FVRKLL---SDFP 115 Query: 211 DVNNVVRSGLVTFSSKIVQ-----TFPLAWGVQH----IQEKINRLIF-GSTTKSTPGLE 260 V+ R +VTFSSK + QH + +I + + G T + + Sbjct: 116 VVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQ 175 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + ++E K I +TDG ++ + + G ++ Sbjct: 176 QAAQILRHSRENS----------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIF 220 Query: 321 AIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 G+ + ++P + Y + + + F + + + + Sbjct: 221 TFGIWQGNIRELNDMASTPKEEHCYLLHSFEE----FEALARRALHE 263 >gi|26342893|dbj|BAC35103.1| unnamed protein product [Mus musculus] Length = 848 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 80/227 (35%), Gaps = 40/227 (17%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 S V+ + L+++ ++D S S+ +++L +R++L P Sbjct: 65 LGRAFRSRVRRLRELSGSLELVFLVDESSSVGQT--NFLNELK----FVRKLL---SDFP 115 Query: 211 DVNNVVRSGLVTFSSKIVQ-----TFPLAWGVQH----IQEKINRLIF-GSTTKSTPGLE 260 V+ R +VTFSSK + QH + +I + + G T + + Sbjct: 116 VVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQ 175 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + ++E K I +TDG ++ + + G ++ Sbjct: 176 QAAQILRHSRENS----------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIF 220 Query: 321 AIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 G+ + ++P + Y + + + F + + + + Sbjct: 221 TFGIWQGNIRELNDMASTPKEEHCYLLHSFEE----FEALARRALHE 263 >gi|11177164|gb|AAG32160.1|AF206329_1 polydom protein [Mus musculus] Length = 3567 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 80/227 (35%), Gaps = 40/227 (17%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 S V+ + L+++ ++D S S+ +++L +R++L P Sbjct: 65 LGRAFRSRVRRLRELSGSLELVFLVDESSSVGQT--NFLNELK----FVRKLL---SDFP 115 Query: 211 DVNNVVRSGLVTFSSKIVQ-----TFPLAWGVQH----IQEKINRLIF-GSTTKSTPGLE 260 V+ R +VTFSSK + QH + +I + + G T + + Sbjct: 116 VVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQ 175 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + ++E K I +TDG ++ + + G ++ Sbjct: 176 QAAQILRHSRENS----------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIF 220 Query: 321 AIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 G+ + ++P + Y + + + F + + + + Sbjct: 221 TFGIWQGNIRELNDMASTPKEEHCYLLHSFEE----FEALARRALHE 263 >gi|221123691|ref|XP_002160228.1| PREDICTED: similar to CnPolydom [Hydra magnipapillata] Length = 954 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 62/193 (32%), Gaps = 21/193 (10%) Query: 170 DMMMVLDVSLSM---NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++++LD S SM G M + I +L + + + G + K Sbjct: 54 DLVILLDSSGSMYSTGSFQGVTMTGFDIGKTFINALLSKVHISFNATRIA-IGTFGTNHK 112 Query: 227 IVQTFPLAWGVQ----HIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 I F L ++ ++ T L+ + N + + K H Sbjct: 113 IDINFILRPDYSMHKCKFKKDFEKIRIYGGMTNLRGALQDSLNIFRELDSNPDTHKKRHK 172 Query: 282 DYKKYIIFLTDGE-------NSSPNIDN----KESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + +I L+DGE N N + + + VY IGV + Sbjct: 173 T-NRVVILLSDGEGNVMDNPNGRGVTHNDGLARNPHDIAHNLRLGLVEVYTIGVTSAPDR 231 Query: 331 QFLKNCASPDRFY 343 L+ A+ + Sbjct: 232 AVLEGLATEKNLF 244 >gi|157375629|ref|YP_001474229.1| vault protein inter-alpha-trypsin subunit [Shewanella sediminis HAW-EB3] gi|157318003|gb|ABV37101.1| vault protein inter-alpha-trypsin domain protein [Shewanella sediminis HAW-EB3] Length = 770 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 78/189 (41%), Gaps = 26/189 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S++ + ++++V+D S SM+ + A ++++ L + S N ++ Sbjct: 368 KSRNRVSRELILVIDTSGSMSGS------AMEQAKKAMKYALAGLGSDDTFN------VI 415 Query: 222 TFSSKI----VQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F+SK+ P + ++ ++ L T+ LE+A + E Sbjct: 416 EFNSKVSSLSKGPIPASTKNIEMANRFVHSLTSDGGTEMALALEHALGQESGGSSWQETG 475 Query: 277 AKGHDDYK----KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +G D+ + ++F+TDG + N+ LF + + + ++ +G+ + F Sbjct: 476 LQGKDEESTSRLRQVLFMTDG-----AVGNEAELFKLIKYRIGKSRLFTLGIGSAPNSHF 530 Query: 333 LKNCASPDR 341 ++ A R Sbjct: 531 MQRAAEFGR 539 >gi|291569126|dbj|BAI91398.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 396 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 77/235 (32%), Gaps = 29/235 (12%) Query: 118 IDDQHKDYNLSAVSRYEMPFIFCT-FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 I++ + L + P I + + P+ I S + +++++D Sbjct: 19 INNDNVTLRLQVTDGRDRPVIQLQKSDFQVITDDEPVGIKSWKSPQESTPPPAWIVVLVD 78 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS----------- 225 +S SMN+ G ++ A + R L+ I D + +V F Sbjct: 79 LSGSMNELDTSGKRRIDGALDATRRFLEQIS---DRGGDTKVAIVPFGEGGRNCPGFEVT 135 Query: 226 --KIVQTFPLAWGVQHIQ--EKINRLIFGSTTKSTPGLEYAYNKIFD--AKEKLEHIAKG 279 I F A ++ + + + T L A + + G Sbjct: 136 QRGINSKFFPANDIKQTNFLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFYVPEDSG 195 Query: 280 HDDYKKYIIFLTDGENSSPN----IDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + +I L+DG ++ PN DN +L + IV+ +G Sbjct: 196 RLEPRLSVILLSDGFHNQPNEQQDFDNLITLLE----RNNNIIVHTLGYGLTPQQ 246 >gi|261414285|gb|ACX83559.1| plant-derived anthrax toxin receptor-Fc fusion protein [synthetic construct] Length = 171 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 68/182 (37%), Gaps = 30/182 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSSKI 227 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS+ Sbjct: 1 DLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSSQA 46 Query: 228 VQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL I E + R+ T GL+ A +I A G Sbjct: 47 TIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA---------GGLKTS 97 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 II LTDG+ + ++ GA VY +GV Q + S ++ + Sbjct: 98 SIIIALTDGKLDGLV--PSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQVFP 155 Query: 345 VQ 346 V+ Sbjct: 156 VK 157 >gi|254450361|ref|ZP_05063798.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|254450938|ref|ZP_05064375.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264767|gb|EDY89037.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198265344|gb|EDY89614.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 75 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 35/75 (46%), Gaps = 4/75 (5%) Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCA-SPDRFYSVQNSRKLH 352 + D + S C A+ +G ++Y + +A + Q L++CA SP + V N + Sbjct: 1 MNGTEADARLS-DICAAARAQGVVIYTVAFEAPSGGQSALQDCASSPSHHFDV-NGTDIS 58 Query: 353 DAFLRIGKEMVKQRI 367 AF I ++ ++ Sbjct: 59 SAFSAIASDIRALKL 73 >gi|78484767|ref|YP_390692.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78363053|gb|ABB41018.1| CbbO-m protein [Thiomicrospira crunogena XCL-2] Length = 757 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 81/208 (38%), Gaps = 33/208 (15%) Query: 169 LDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + +M+++D S S+ + + G L ++ ++ + + +G + + Sbjct: 567 IAVMLLVDTSQSLKERNQETGQTLLELSEEAL--AITAWTIEQLGDKFAIAGFCSDTRHE 624 Query: 228 VQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 V+ + +G ++ +I + +T+ + +A H + Sbjct: 625 VRYQHIKGYSEHYG-DEVKSRIAAMEASYSTRMGAAMRHA-----------AHYLEAQQA 672 Query: 283 YKKYIIFLTDGE-----NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 KK ++ LTDGE P + +++ E K +G Y I + AD++++ Sbjct: 673 EKKLMLILTDGEPADIDTKDPQVLIQDTHKAVEELKSKGIYSYCITLD-PNADEYVETIF 731 Query: 338 SPDRFYSVQN-----SRKLHDAFLRIGK 360 D Y+V + KL F++I + Sbjct: 732 --DNHYTVIDHVDKLPEKLPQVFMKITQ 757 >gi|119585300|gb|EAW64896.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_d [Homo sapiens] Length = 2978 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRSFLEGLVLPFSGAASA-QGVRFATVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|119585298|gb|EAW64894.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_b [Homo sapiens] Length = 2609 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRSFLEGLVLPFSGAASA-QGVRFATVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|119585297|gb|EAW64893.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_a [Homo sapiens] Length = 2944 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRSFLEGLVLPFSGAASA-QGVRFATVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|119585301|gb|EAW64897.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_e [Homo sapiens] Length = 2597 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRSFLEGLVLPFSGAASA-QGVRFATVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|4104232|gb|AAD01978.1| collagen alpha3(VI) [Mus musculus] Length = 1703 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 32/311 (10%) Query: 51 ILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 L+ S LYT + + N + + + K + L E +Q ++R Sbjct: 516 ALNGSALYTGSSLDFVRNNLFTSSAGHRAAEGVPKLLVLITGGKSLDE--VSQPAQELKR 573 Query: 111 ST--SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP------LLITSSVKIS 162 + +L++ +D + FI F + P +T + ++ Sbjct: 574 GSIMALAVGSKAADEDELKEIAFDSSLVFIPAEFRPAPLQNMLPSLMAPLRTLTGTTEVH 633 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 D++ +LD S ++ + P + +++ S+ ++ +R GLV Sbjct: 634 VNKR---DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQ 681 Query: 223 FSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS V F L + + RL + G +Y E + H Sbjct: 682 FSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTREH 741 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ + + P+ D L N R G + + +G + +P Sbjct: 742 -VPQLLLLLM-----AGPSEDAY--LQAANALVRSGVLTFCVGTNRADKAELEHIAFNPS 793 Query: 341 RFYSVQNSRKL 351 Y + + R L Sbjct: 794 LVYLMDDFRSL 804 Score = 50.6 bits (119), Expect = 3e-04, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 67/192 (34%), Gaps = 22/192 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ D++ ++D S S G D+ + + D+++S+ +N LV Sbjct: 32 KNGAAADIVFLVDSSWS------AGKDRFLLVQEFLS---DVVESLAVGDNDFHFALVRL 82 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L Q + I + + + T + + ++ D Sbjct: 83 NGNPHTEFLLNTYHSKQEVLSHIANMSYIGGSNQTG---KGLEYVIHSHLTEASGSRAAD 139 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + I+ LTDG++ E K V+A+GV+ + + P Sbjct: 140 GVPQVIVVLTDGQSEDGFALPSA------ELKSADVNVFAVGVEGADERALGEVASEPLS 193 Query: 340 DRFYSVQNSRKL 351 ++++N L Sbjct: 194 MHVFNLENVTSL 205 >gi|495866|gb|AAA58965.1| collagen type VII [Homo sapiens] Length = 2912 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRSFLEGLVLPFSGAASA-QGVRFATVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|4502961|ref|NP_000085.1| collagen alpha-1(VII) chain precursor [Homo sapiens] gi|1345650|sp|Q02388|CO7A1_HUMAN RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor gi|987125|gb|AAA75438.1| alpha-1 type VII collagen [Homo sapiens] Length = 2944 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 70/191 (36%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRSFLEGLVLPFSGAASA-QGVRFATVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + + +F L +A+ Sbjct: 88 PRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVF-----LPQLARPGVP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|317125804|ref|YP_004099916.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589892|gb|ADU49189.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 568 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 48/320 (15%), Positives = 96/320 (30%), Gaps = 30/320 (9%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNI 108 ++ TA ++L N+ + L ++ + Sbjct: 254 EAATPLGVVVTAAQLLAHNESNDARALAAVAPVDG-----AAFLGYRLVTLTDKAEVAEL 308 Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + + ++ ++ + + P P + P +++K + Sbjct: 309 VADFAAYLTTEEAKTAFSEAGFATPGGPEPQMPSPLYGTVTDRPAPDAAALKAVRAAWAA 368 Query: 169 ----LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ LDVS SM G +L V + L I +P + R GL +S Sbjct: 369 ATPKRQTLLALDVSGSMLRRTDQG-TRLAVMQEA---TLQAIAGMPGSS---RLGLWAYS 421 Query: 225 SKIVQTF----PLAWG--VQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 I + PL V H ++ R G T+S G Y+ I ++ Sbjct: 422 LHIGKQGDDFRPLLNAAPVGHSSHLLDLRKQVGGLTRSVGGGRGLYDTIVATYQRARATY 481 Query: 278 KGHDDYKKYIIFLTDGEN-SSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLK 334 I+ TDG N +L + + +G +E + + Sbjct: 482 TKGQLNSVVIV--TDGLNDDDYGASLSVALSRVKKLVDPRNPIRITIVGFGSEPDAKAMT 539 Query: 335 NCA--SPDRFYSVQNSRKLH 352 A + R+ + + L Sbjct: 540 PFAQLTGGRYVNAAEPKDLL 559 >gi|149027999|gb|EDL83450.1| complement factor B, isoform CRA_c [Rattus norvegicus] Length = 739 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 78/221 (35%), Gaps = 34/221 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 ++LD S SM + G A R + +++ + S R GLVT+++ Sbjct: 236 IILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYGVKP---RYGLVTYATV 292 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + + Sbjct: 293 PKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMSWPGDAP---P 349 Query: 278 KGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRG--------AIVYAIGVQA 326 +G + + II +TDG ++ P ++ + + R V+ +G Sbjct: 350 EGWNRTRHVIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLV 409 Query: 327 EAA--DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + + + V++ L + F ++ E Sbjct: 410 DPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSL 450 >gi|218156285|ref|NP_997631.2| complement factor B [Rattus norvegicus] gi|149027998|gb|EDL83449.1| complement factor B, isoform CRA_b [Rattus norvegicus] Length = 763 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 78/221 (35%), Gaps = 34/221 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 ++LD S SM + G A R + +++ + S R GLVT+++ Sbjct: 260 IILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYGVKP---RYGLVTYATV 316 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + + Sbjct: 317 PKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMSWPGDAP---P 373 Query: 278 KGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRG--------AIVYAIGVQA 326 +G + + II +TDG ++ P ++ + + R V+ +G Sbjct: 374 EGWNRTRHVIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLV 433 Query: 327 EAA--DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + + + V++ L + F ++ E Sbjct: 434 DPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSL 474 >gi|33086684|gb|AAP92654.1| Da1-24 [Rattus norvegicus] Length = 1116 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 78/221 (35%), Gaps = 34/221 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 ++LD S SM + G A R + +++ + S R GLVT+++ Sbjct: 613 IILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYGVKP---RYGLVTYATV 669 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + + Sbjct: 670 PKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMSWPGDAP---P 726 Query: 278 KGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRG--------AIVYAIGVQA 326 +G + + II +TDG ++ P ++ + + R V+ +G Sbjct: 727 EGWNRTRHVIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLV 786 Query: 327 EAA--DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + + + V++ L + F ++ E Sbjct: 787 DPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSL 827 >gi|46237594|emb|CAE83972.1| B-factor, properdin [Rattus norvegicus] gi|56268879|gb|AAH87089.1| Complement factor B [Rattus norvegicus] Length = 761 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 78/221 (35%), Gaps = 34/221 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 ++LD S SM + G A R + +++ + S R GLVT+++ Sbjct: 258 IILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYGVKP---RYGLVTYATV 314 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + + Sbjct: 315 PKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMSWPGDAP---P 371 Query: 278 KGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRG--------AIVYAIGVQA 326 +G + + II +TDG ++ P ++ + + R V+ +G Sbjct: 372 EGWNRTRHVIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLV 431 Query: 327 EAA--DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + + + V++ L + F ++ E Sbjct: 432 DPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSL 472 >gi|330830099|ref|YP_004393051.1| RTX toxin-like protein [Aeromonas veronii B565] gi|328805235|gb|AEB50434.1| RTX toxin-like protein [Aeromonas veronii B565] Length = 1553 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 72/207 (34%), Gaps = 28/207 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + ++ ++D+S S++ + + +I +L I + V G + Sbjct: 1149 TSGSYNLTFMIDMSGSISG------TEFQLMKDAINNLLAKFSGISQLQ--VEIGTFADN 1200 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S +V T+ V Q+ ++ L + N + +H Sbjct: 1201 SNVVGTYS---SVTAAQQAVSNLTRSGGGTNYQAALTTLNTMMTVDPVADH--------- 1248 Query: 285 KYIIFLTDGENSSPNIDNK----ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---- 336 KY+ FLTDGE + + N + N G ++ A+G+ + F N Sbjct: 1249 KYVYFLTDGEPTVGSWTNSTQIANGMAALNALTAPGVVINAVGIGVPSGASFGNNLNAID 1308 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 +PD + +V N L + + Sbjct: 1309 NTPDNYLAVDNFDDLSSGLGSLFTAVS 1335 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 68/172 (39%), Gaps = 27/172 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 I+S ++ +++++VLD S S+ D + V ++ +++ N++V+ Sbjct: 806 TITSGTNGTVNLVLVLDSSGSIGD------TNMQVIKDAVTNLMNSY-----GNSLVKVM 854 Query: 220 LVTFSSKIVQTF--PLAWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 LV F+ W + ++ + G +T L+ ++ Sbjct: 855 LVDFAGSATVKSVGSQVWLTKDQATGQLTTISSGGSTDYDDALQA----------VQDNY 904 Query: 277 AKGHDDYKKYIIFLTDG--ENSSPNIDNKESLFYCNEAKRRGAI-VYAIGVQ 325 ++ F++DG ++ I+ +E + N ++G YA+G+ Sbjct: 905 GTPPSADNTFVFFISDGVPSSTDDAINTEERNEWTNFLTQKGIDGAYAVGIG 956 >gi|293358946|ref|XP_342760.4| PREDICTED: von Willebrand factor [Rattus norvegicus] Length = 2763 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 63/191 (32%), Gaps = 28/191 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++++LD S S+ + + P + ++ + S Sbjct: 1640 PLDVVLLLDGSSSL------PASSFDEMKSFAKAFISKANIGPHLTQ---VSVIQYGSIN 1690 Query: 228 VQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P W V + + L G ++ L +A + H A+ Sbjct: 1691 TIDVP--WNVAQEKAYLQSLVDLMQQEGGPSQIGNALAFAVRYVTS----QIHGARPGAS 1744 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +I + ++D+ ++ + A+ V+ IGV + L+ A P Sbjct: 1745 KAVVMIIM------DTSLDSVDT--AVDAARSNRVAVFPIGVGDRYDEAQLRILAGPGAS 1796 Query: 343 YSVQNSRKLHD 353 +V +++ D Sbjct: 1797 SNVVKLQQVED 1807 >gi|109474242|ref|XP_001066203.1| PREDICTED: von Willebrand factor homolog [Rattus norvegicus] gi|149049388|gb|EDM01842.1| rCG29742 [Rattus norvegicus] Length = 2812 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 63/191 (32%), Gaps = 28/191 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++++LD S S+ + + P + ++ + S Sbjct: 1689 PLDVVLLLDGSSSL------PASSFDEMKSFAKAFISKANIGPHLTQ---VSVIQYGSIN 1739 Query: 228 VQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P W V + + L G ++ L +A + H A+ Sbjct: 1740 TIDVP--WNVAQEKAYLQSLVDLMQQEGGPSQIGNALAFAVRYVTS----QIHGARPGAS 1793 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +I + ++D+ ++ + A+ V+ IGV + L+ A P Sbjct: 1794 KAVVMIIM------DTSLDSVDT--AVDAARSNRVAVFPIGVGDRYDEAQLRILAGPGAS 1845 Query: 343 YSVQNSRKLHD 353 +V +++ D Sbjct: 1846 SNVVKLQQVED 1856 >gi|313238993|emb|CBY13979.1| unnamed protein product [Oikopleura dioica] Length = 676 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/140 (20%), Positives = 54/140 (38%), Gaps = 22/140 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+M V+D S S+ +R L +KS + + R G+V F++ Sbjct: 216 DIMFVVDSSGSVGSAGFAR----------VRNFLKEVKSY--MGSTSRIGVVRFATNSEL 263 Query: 230 TFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G+ + + + T + GL+ AY+ + +G K I+ Sbjct: 264 IWGLDDGLDGFTNAVTNMAWTMGGTYTAKGLDLAYDHML---------LRGRRSATKTIV 314 Query: 289 FLTDGENSSPNIDNKESLFY 308 +TDG S+ + + E Sbjct: 315 LMTDGYTSNQSAYDAEIKKI 334 >gi|118763608|gb|AAI28261.1| VIT protein [Homo sapiens] Length = 657 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 474 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 523 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G + + + T + + +A ++F K + Sbjct: 524 ---LEFGFDKYSSRPDIINAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 571 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 572 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 625 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 626 HSFFVDEFDNLYQYVPRIIQNI 647 >gi|25149045|ref|NP_741200.1| DIsplaced Gonad family member (dig-1) [Caenorhabditis elegans] gi|74961937|sp|Q09165|DIG1_CAEEL RecName: Full=Mesocentin; Flags: Precursor gi|22331940|gb|AAM78593.1| mesocentin [Caenorhabditis elegans] gi|26251538|gb|AAA50715.2| Displaced gonad protein 1, isoform a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 13100 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 54/318 (16%), Positives = 104/318 (32%), Gaps = 32/318 (10%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L LD + + K +D+S + + + + + G + Sbjct: 12251 LREALDELSISGVDGVPQIVLIVKNGKASDDYSSAVKSLKAERNVTVFVVDAGDDESQQQ 12310 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS----- 162 T I S V +I P S P T + S Sbjct: 12311 NSELTEEDKTIVISQWRGADSEVLGPIADYICKIVPNVETSRTWPTPRTKATTTSGTGRS 12370 Query: 163 -SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNVVRSGL 220 S D D+++VLD S ++F P D+ ++ ++D PDV+ G Sbjct: 12371 CSSIDYESDVIIVLDSS----ENFTP--DEFVSMKDAVASIVDTGFDLAPDVSK---IGF 12421 Query: 221 VTFSSKIVQTFPLAWGVQHIQ-EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 V +S K+ P+A G + E + ++ + + + G Sbjct: 12422 VIYSDKV--AVPVALGHYEDKIELLEKITDAEKINDGVAIAL----YGLNAARQQFQLHG 12475 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKN 335 ++ K +I +T+G+N E L G ++A+ V + + + L Sbjct: 12476 RENATKVVILITNGKNRGNAAAAAEDLRD-----MYGVQLFAVAVGSNPEELATIKRLVG 12530 Query: 336 CASPDRFYSVQNSRKLHD 353 ++ + V S ++ D Sbjct: 12531 NSNTENVIEVAQSTEIDD 12548 >gi|198412937|ref|XP_002124654.1| PREDICTED: similar to tumor endothelial marker 8, partial [Ciona intestinalis] Length = 360 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 64/186 (34%), Gaps = 23/186 (12%) Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 S S+ H + +++++D S ++R +TF+ K PL + Sbjct: 6 RSSSLLRHGRNHFRE--ETVSFVKQLVDKFTS-----PLLRISFITFNHKAHLVMPLTYD 58 Query: 237 VQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + EK++ L I T G+ +++ +K +I LTDG Sbjct: 59 RTIVVEKLDELRSSIPDGRTLLGLGMLEVTSQMKRMSQKRAS----------VVIILTDG 108 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-FYSVQNSRKLH 352 S+ N A+ G V A+GV Q ++ + + + L+ Sbjct: 109 VLDRSTQIL--SVKEANTARTLGGTVLAVGVGDFNPSQLIEIVGGSKKLVFKAASFDGLN 166 Query: 353 DAFLRI 358 + Sbjct: 167 RIVKHV 172 >gi|291461066|ref|ZP_06026725.2| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] gi|291379168|gb|EFE86686.1| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] Length = 529 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 79/222 (35%), Gaps = 34/222 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS----- 218 + ++ +++ +VLD S SM G + +A SI+++L + + + VR Sbjct: 172 EENMNVNVEIVLDASGSMVKKIGDK-TMMEIAKESIKQVLSEMPT--NAKVGVRVFGHKG 228 Query: 219 -GLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-AKE 271 + + L + V+ I++ + + T +EY + E Sbjct: 229 DNTASKKDESCGANELIYPIEDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDLKALDGE 288 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAA 329 K +I YII TDG + + + K ++ IG +A Sbjct: 289 KTLNIL--------YII--TDGIETCGG----NPVEIAKQLKGENTNIVLGIIGFNVDAN 334 Query: 330 D-QFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + LK A+ + SV ++ KL RI + Sbjct: 335 QNRLLKQIADAAGGYYSSVNDADKLTGELYRINELAFSDYKW 376 >gi|198423392|ref|XP_002124188.1| PREDICTED: similar to fibropellin Ia [Ciona intestinalis] Length = 1781 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 49/257 (19%), Positives = 95/257 (36%), Gaps = 52/257 (20%) Query: 136 PFIFCTFPWCANSSHAP------LLITSSVKIS-----SKSDIGLDMMMVLDVSLSMNDH 184 P T W +P L V S ++S + +D++ ++D S S+N + Sbjct: 31 PPQLSTLLWQPTRRTSPAVTEDYLWSRGLVTPSWRPIITESSVLVDIIFLVDGSGSVNYN 90 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---------KIVQTFPLAW 235 ++ +++ +I N V G+V +S +I +A Sbjct: 91 APGNWRRVLTWIKAVASGFNI------SNENVNIGVVQYSHWYRTLPMSSQIYLKTEIAL 144 Query: 236 GVQHIQE----KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 Q ++++ T T A NK LE + K +I LT Sbjct: 145 NRCRTQACFQYLVDKIQIMGYTTYTGA---AINKTI-----LEDFSTTPARSNKILILLT 196 Query: 292 DGENSSPNIDNKESLFYCNE-AKRRGAIVYAIGVQAEAADQFLKNCASPD-----RFYSV 345 DG +K+ + Y + A+R+ ++ IGV + + Q L+ A+ + R Y + Sbjct: 197 DG-------ISKDDVNYASAFARRQNITIFCIGVGSYSLSQ-LQIIANGELNNNWRIYQL 248 Query: 346 QNSRKLHDAFLRIGKEM 362 + +L + +E+ Sbjct: 249 NSFNQLPTTVQNLQREI 265 >gi|91228647|ref|ZP_01262563.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] gi|91187798|gb|EAS74114.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] Length = 2510 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 48/308 (15%), Positives = 96/308 (31%), Gaps = 42/308 (13%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L +D L + L + N I + + + + G Sbjct: 1948 LGQTVDSVTLSEPSTALTSNGVSVTWTLSNYDQMLIGSANGEEVIKISVNDTGVV----- 2002 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV-------- 159 ST L ID + + E+P + NS+ +++ Sbjct: 2003 ---STELLGPIDHSNPSGEDTLNI--EVPVLVSNARGLTNSTTVNVIVEDDSPDSASIIH 2057 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +++ ++ +++DVS SM ++L + S ++L+ +SI R Sbjct: 2058 DVVAETKESANVQLIMDVSGSMRTD-----NRLQIMKDSATQLLNQYESIGQ----TRVQ 2108 Query: 220 LVTFSSKIVQTF--PLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++T+SS W V+ + I L G T L A D + Sbjct: 2109 IITYSSTASTYAIGAATWLTVEEAKAYIETLTAGGATNYNNALNEAKQSWDDVGKLTSAS 2168 Query: 277 AKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAK--RRGAIVYAIGVQAEAADQF 332 + FL+DG+ + + I++ + + A G+ + Sbjct: 2169 NVSY--------FLSDGQPNPASSFINDAREQSWIDHLTDPDNQITALAYGMGVNLMPEQ 2220 Query: 333 LKNCASPD 340 L A Sbjct: 2221 LDRVAYDG 2228 >gi|149918791|ref|ZP_01907278.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149820392|gb|EDM79808.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 877 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 71/193 (36%), Gaps = 24/193 (12%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 + L + +++ + + +++ V+D S SM G MD R + + + Sbjct: 355 GYFTLTVQPPEQVADEQAVARELVFVVDNSGSMG---GLPMDTAKGLMRKALKDIRPDDT 411 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + S+K++ ++ + ++ + T+ T G++ A D Sbjct: 412 FTVLR--FSESASGLSNKLLPATQ--DNIEAGVDYVDAMQGMGGTQMTEGIKAALRVPHD 467 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + ++FLTDG I N++++F + A ++++GV Sbjct: 468 PDRL------------RVVMFLTDGY-----IGNEQAIFELIDDNIGDARLFSLGVGGAP 510 Query: 329 ADQFLKNCASPDR 341 L AS R Sbjct: 511 NRYLLDGMASVGR 523 >gi|87308939|ref|ZP_01091077.1| hypothetical protein DSM3645_19318 [Blastopirellula marina DSM 3645] gi|87288282|gb|EAQ80178.1| hypothetical protein DSM3645_19318 [Blastopirellula marina DSM 3645] Length = 346 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 58/187 (31%), Gaps = 37/187 (19%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ D S SM+ + + E+L+ + S+P ++ F S+ Sbjct: 188 AGRRFCIIADCSGSMSGVKLDYVKE---------EILETVSSLPREAQFQ---VIFFQSQ 235 Query: 227 IVQTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 V W + E + + T P E A Sbjct: 236 AVPFPQKGWRHPKRDFNALSEWLKTVGPAGGTNPLPAFEIALKF---------------S 280 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 + F+TDG + + + N+ V+AI +A+ ++ A S Sbjct: 281 PRPDAVFFMTDGL---FDDNVVGEVKRQNDLSEPKVKVHAISFMDRSAEPLMRQIAGESG 337 Query: 340 DRFYSVQ 346 + VQ Sbjct: 338 GEYRHVQ 344 >gi|332227200|ref|XP_003262779.1| PREDICTED: vitrin isoform 3 [Nomascus leucogenys] Length = 658 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 67/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 475 DIGFVIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRIGAVQYTYEQR- 524 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 525 ---LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 572 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G YAIGV A ++ P D Sbjct: 573 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVFTYAIGVAWAAQEELEVIATHPARD 626 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 627 HSFFVDEFDNLYQYVPRIIQNI 648 >gi|48425220|pdb|1PT6|A Chain A, I Domain From Human Integrin Alpha1-Beta1 gi|48425221|pdb|1PT6|B Chain B, I Domain From Human Integrin Alpha1-Beta1 Length = 213 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 67/218 (30%), Gaps = 33/218 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S+ LD+++VLD S S + T + ++L + P G+V Sbjct: 1 GSECSTQLDIVIVLDGSNS--------IYPWDSVTAFLNDLLKRMDIGPKQTQ---VGIV 49 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + F + L+ G + D K Sbjct: 50 QYGENVTHEF----NLNKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGA 105 Query: 282 D--YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--------- 330 KK ++ +TDGE + DN + + ++I + Sbjct: 106 RRGVKKVMVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFV 161 Query: 331 QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 162 EEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFAL 199 >gi|225873376|ref|YP_002754835.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] gi|225792625|gb|ACO32715.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] Length = 363 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 78/230 (33%), Gaps = 41/230 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ ++ L + ++LD S S D + A++ +R++L ++ Sbjct: 105 TANANQPLTLGILLDTSGSQQDVLPL---EKQSASKFLRDVLRPKDQAF---------VL 152 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK-------------IFD 268 TF + + + + + + G + ++D Sbjct: 153 TFDVDVNLAQDFTNDIPLLDHALQQAQINTAGGGGSGGIPGLGQGPVPTVGDPKGTVLYD 212 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI------ 322 A + + +K +I LTDGE+ +++ +A++ IVY I Sbjct: 213 AVAQAANDKLREQTGRKALILLTDGEDLGSATKPLQAIA---DAQKANTIVYVILIADRG 269 Query: 323 -----GVQAEAADQFLKNC-ASPDRFYSV-QNSRKLHDAFLRIGKEMVKQ 365 Q + A+ R +V N KL AF I +E+ Q Sbjct: 270 FYGGYTFGYTGDAQMRRLAEATGGRMINVGNNGAKLTAAFKEIARELRTQ 319 >gi|296269297|ref|YP_003651929.1| vault protein inter-alpha-trypsin domain-containing protein [Thermobispora bispora DSM 43833] gi|296092084|gb|ADG88036.1| Vault protein inter-alpha-trypsin domain protein [Thermobispora bispora DSM 43833] Length = 796 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 78/209 (37%), Gaps = 34/209 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + D+++VLD S SM K+ A R+ ++D + R ++ F Sbjct: 295 AARRPRDVVLVLDRSGSMAGW------KMVAARRAAARIVDTLTERD------RLAVLAF 342 Query: 224 SSKIVQTFPL------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + I + FP E + RL T+ + +A L A Sbjct: 343 DNVIERAFPDGLTAATDRARYRAVEFLARLEARGGTEMLAP-------LEEALTALAAAA 395 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +G D ++ +TDG+ + +++ + ++ G V+A+G+ FL A Sbjct: 396 EGGRDA--VLVLVTDGQ-----VGDEDRILERMASRIGGVRVHAVGIDRAVNAAFLGRLA 448 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 R V++ +L +A I + + Sbjct: 449 VLGAGRCELVESEDRLDEAMEHIHRRIGA 477 >gi|156383823|ref|XP_001633032.1| predicted protein [Nematostella vectensis] gi|156220096|gb|EDO40969.1| predicted protein [Nematostella vectensis] Length = 182 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 65/188 (34%), Gaps = 30/188 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 ++++ ++D S S+ND G I+++ + K + V FS+ Sbjct: 1 MNLVFLVDSSGSVND------TDFGKFQMFIKDLAEEFKDAISEGDT-EVAAVLFSTIPK 53 Query: 229 QTFPLAWG----VQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + I+ ++ G T++ L + ++F Sbjct: 54 TKIEFDLDDYDHINDIKAAVDAFSHQHGGQTRTGEALTFTLEEVFKKA--------PRPS 105 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---P 339 K ++ LTDG+ + + G V+AIGV + + LK+ AS Sbjct: 106 VKNVLVVLTDGKAQGNVTGP------AQDVRDHGVEVFAIGVGPHSNEAQLKDIASDPDD 159 Query: 340 DRFYSVQN 347 + V + Sbjct: 160 KHVFHVTD 167 >gi|294667358|ref|ZP_06732577.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292602910|gb|EFF46342.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 1223 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 62/176 (35%), Gaps = 20/176 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 +I+ K+ + LD+S SM++ G +L +I +LD + Sbjct: 138 WYPEKAEIAMKAPASASLYFALDLSGSMDEVASNGQTRLTNMKTAINAVLDQVGEAVARG 197 Query: 214 NVVRSGLVTFSSKIVQTFPL------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + +V F + + A G+ ++ ++ TT T G+ Sbjct: 198 AAIDCMIVGFGTYPSSRQSILRRNLTAGGIAELKSWVSGRSSSFTTYFTAGV-------- 249 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + G ++ F+TDGE +D + A GAIV ++ Sbjct: 250 --MDMPDFFGGGEAFARRLAFFITDGEP----VDAGSGMTSAQIAAEAGAIVSSVA 299 >gi|159039503|ref|YP_001538756.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157918338|gb|ABV99765.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 583 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 87/223 (39%), Gaps = 27/223 (12%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ + +V S + M+ V+DVS SM G A+R + + + Sbjct: 367 PVAVDRAVASWSIATQSGRMLCVIDVSGSMKGSVAGA----GGASRQQVTLDAARRGLSL 422 Query: 212 VNNVVRSGLVTFSS---------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ + GL FS+ ++V+ PL+ +++ + ++ T+ GL Sbjct: 423 FDDSWQIGLWEFSTNLGSGRDYRRLVEIGPLSNQRSRLEQALTQIQP---TRGDTGL--- 476 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEAKRRG--AIV 319 ++ + A E ++ I+ TDG+N N I ++ L K V Sbjct: 477 FDTVLAAYEAVQEEWDPGQVNS--IVLFTDGKNDDDNGISQQQLLAELERIKDAERPVQV 534 Query: 320 YAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF-LRIG 359 IG+ A+ + L++ + + ++ K+ D F I Sbjct: 535 VLIGIGADVSKAELESITKVTGGGSFVTEDPTKIGDIFLKAIA 577 >gi|3236344|gb|AAC23663.1| integrin alpha E2 [Rattus norvegicus] Length = 1167 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/218 (19%), Positives = 77/218 (35%), Gaps = 28/218 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 +V+ + + G ++ +VLD S S+ A I M+ N Sbjct: 182 QAVEEEDEEEAGTEIAIVLDGSGSIEPS------DFQKAKDFISTMMRNFYEKCFECNF- 234 Query: 217 RSGLVTFSSKIVQTFPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKL 273 LV + I F L + + K+ + TK+ +++ + IF Sbjct: 235 --ALVQYGGVIQTEFDLLDSRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGS 292 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQ 331 A K ++ LTDG+ D + +K +G + +AIGV E + Sbjct: 293 RKKAL------KVMVVLTDGD---IFRDPLNLTTVISSSKMQGVVRFAIGVGNAFENNNT 343 Query: 332 F--LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 + LK AS + V N L ++ + ++ Sbjct: 344 YRELKLIASDPKAAHTFKVTNYSALDGLLSKLQQRIIH 381 >gi|315127492|ref|YP_004069495.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] gi|315016006|gb|ADT69344.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] Length = 666 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 44/264 (16%), Positives = 94/264 (35%), Gaps = 42/264 (15%) Query: 87 IWQTDFRNELRENGFAQDINNIERST--SLSIIIDDQ---HKDYNLSAVSRYEMPFIFCT 141 D EL + NIE S SI++ Q +KD+ L + Sbjct: 233 TINMDIGLELVDINAKYHQVNIENSAFGQYSIVLKPQQAINKDFVLEFKPLQKAQAQAAL 292 Query: 142 FPWCANSS---HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 F A + +L+ S + + + +M+ V+D S SM+ + A ++ Sbjct: 293 FTQQAKNGEQFALAMLMPPSDQFTQSERLPREMVFVVDTSGSMHGQ------SMEQAKKA 346 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPL---AWGVQHIQEKINRLIFGSTT 253 + L ++ S N ++ F++++ + PL + ++ + I L T Sbjct: 347 LFYALSLLDSDDSFN------IIGFNNQVTAMSDTPLVASDFNLRRARRFIYNLQADGGT 400 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + L+ N + + ++FLTDG S + +++ Sbjct: 401 EIQGALDAVLNGAQFEG------------FVRQVVFLTDG---SVSNEDELFKSIARTLG 445 Query: 314 RRGAIVYAIGVQAEAADQFLKNCA 337 ++ +G+ + F++ A Sbjct: 446 DS--RLFTVGIGSAPNRFFMRRAA 467 >gi|170291084|ref|YP_001737900.1| von Willebrand factor type A [Candidatus Korarchaeum cryptofilum OPF8] gi|170175164|gb|ACB08217.1| von Willebrand factor type A [Candidatus Korarchaeum cryptofilum OPF8] Length = 328 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 75/242 (30%), Gaps = 35/242 (14%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 K+Y+++ E + + S K ++M+LD S S Sbjct: 116 NSKEYSVAYSPGMEFDLEKTIERMIEKCKKVDEMRYEDIVASDKRKRDKSLIMILDSSGS 175 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 M + + M+ I R G+V F+S A + Sbjct: 176 M------------TGKKILIAMM-IAAIASHKLRSGRYGVVGFNSTAFVIKSPAENKDSV 222 Query: 241 Q--EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + E+I L+ T + GL+ E H+ KY+ +TDGE + Sbjct: 223 KVIEEILDLVPIGYTNISDGLKKGL-------EISYHLKNP-----KYL-LITDGEYNVG 269 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 ++ K ++ G + K A +++ + + +++ Sbjct: 270 ----EDPRKVARRFKNLCV-IHTRGKRDSRGSVLCKEIARIGGSKYFVIDDIKQIQRVMK 324 Query: 357 RI 358 I Sbjct: 325 SI 326 >gi|115749084|ref|XP_001197592.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115959785|ref|XP_001193076.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 966 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 40/181 (22%), Positives = 66/181 (36%), Gaps = 29/181 (16%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 TS I + L +++VLD S SM+ G DK+ IR + I+SI N+ Sbjct: 301 TSPNFIVVQPSGSLRIVLVLDTSGSMD---GERFDKM------IRGAKNFIQSIVPNNSY 351 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKE 271 V +V F+ + + + I K + + T G+ A Sbjct: 352 V--AIVEFNYESIVDSYMTELTSVISRKDLASLLPTLADGATCIGCGIVTAIQVAQYNDM 409 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + Y+I L+DGE + ++ + G IV++I EA Q Sbjct: 410 DSRGV---------YLILLSDGEENHGTPIADTM----DDIEGSGVIVHSIAF-YEADTQ 455 Query: 332 F 332 Sbjct: 456 L 456 >gi|297473702|ref|XP_002686779.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Bos taurus] gi|296488587|gb|DAA30700.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Bos taurus] Length = 1085 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 58/163 (35%), Gaps = 32/163 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EML+ + VN + +F+S Sbjct: 253 DMLILVDVSGSVSGL------TLKLIRTSVSEMLETLSDDDFVN------VASFNSNAQD 300 Query: 230 TFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +++ +N + T G +A+ ++ + + Sbjct: 301 VSCFQHLVQANVRNKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRANCN----- 355 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 K I+ TDG + + + K + V+ V Sbjct: 356 --KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVG 390 >gi|157375507|ref|YP_001474107.1| hypothetical protein Ssed_2370 [Shewanella sediminis HAW-EB3] gi|157317881|gb|ABV36979.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 461 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 43/108 (39%), Gaps = 1/108 (0%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +NIR + G+I ++ I L + V L ++ H K +L +D S LY A ++ Sbjct: 16 MNIRPYRK-QGGAILVMFTIGLFSLIAVAALALDGGHLLLNKGRLQNAVDASALYAAKEL 74 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 + + ++ + ++ + + + + + + + Sbjct: 75 QDGASLYEAREAATTLLLQNLQYQENGELNSSIDLSAPDYNSTQVAAN 122 >gi|332298718|ref|YP_004440640.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181821|gb|AEE17509.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 566 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 57/196 (29%), Gaps = 19/196 (9%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 WC + + + I + G + V DVS SM G A Sbjct: 60 LAWCFLITAYAGVSWGTELIPVR-RTGSSVSFVFDVSRSMTAKDVS--SGAGAAVSKPVS 116 Query: 202 MLDIIKSIP----DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 LD +K D V V PL + ++ + L T + Sbjct: 117 RLDAVKLYAAALLDRMEGTEVSAVIAKGSGVVAVPLTSDLNAVRSLLPSLSPALLTSTGS 176 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 I E + + I+ TDGE + D + S + K G Sbjct: 177 -------SIGSGIEAAVNSFPPLSAAARTIVVFTDGEET----DGRMSGAVADALK-FGI 224 Query: 318 IVYAIGVQAEAADQFL 333 V +G ++ + L Sbjct: 225 PVVFVGFGSDTESEIL 240 >gi|313681553|ref|YP_004059291.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154413|gb|ADR33091.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 507 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 63/175 (36%), Gaps = 33/175 (18%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 K + + G D++ +D+S SM +L VA + + +++ + R Sbjct: 50 PQKPVTAEEAGSDVIFAVDLSYSMRGTDLAP-SRLEVAKKLLYDVVRS-------DQKDR 101 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKL 273 G++ F++ + PL + ++ + L T LE K+ Sbjct: 102 FGVIGFTTSAIVLSPLTKDTEMLEHLFSSLDESQIITKGTNVMSALEL--------SRKM 153 Query: 274 EHIAKGHDDYKKYIIFLTDG-------ENSSPNIDNKESLFYCNEAKRRGAIVYA 321 H A+ +I LTDG + SS DN +L A G+ + Sbjct: 154 SHSARP------IVILLTDGGDEASYEKESSFVRDNNLALSVVMLATNNGSTLPT 202 >gi|198424353|ref|XP_002120419.1| PREDICTED: similar to mCG120740 [Ciona intestinalis] Length = 1650 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 77/204 (37%), Gaps = 33/204 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++VLD+S SM ++ ++G+ +++ +D + +N V G+V F+ Sbjct: 369 KGFDRRIVLVLDISTSMENY-----GRMGLMRQAVSNFIDTV----PMNTWV--GIVVFA 417 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S+ L + I + + T G+ + + + + G Sbjct: 418 SRANTLARLTEITSYDARNILKTRLVNTTVVGTSIGSGIMKGLEVLETSGPRSLRGSGGS 477 Query: 281 DDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCAS 338 II LTDG E+++P I++ + G V I + + A S Sbjct: 478 ------IIILTDGLEHNNPKINDT-----IERVREFGVRVSTIALGSNVAKDLEWLASVS 526 Query: 339 PDRFYSVQNSR-----KLHDAFLR 357 R ++ + + +L +AF Sbjct: 527 NGRTHAASSGQFGIDAELQEAFAS 550 >gi|254443409|ref|ZP_05056885.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198257717|gb|EDY82025.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 257 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 66/197 (33%), Gaps = 21/197 (10%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++ D S SMN+ + K+ +++ + ++ GL+TF + + Sbjct: 74 YVIFDASGSMNELVAQQLPKIEAGKQALVTFAN------NLPEDANLGLLTF-DPVRELL 126 Query: 232 PLAWG-VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL G Q ++++ T + Y + + ++ + ++ + Sbjct: 127 PLGRGNRQAFIGSVSQIRAKGRTPLVESIVTGYRVLTEQAQRQSGYGRY------VLVIV 180 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 TDG +S N + V IG L ++ + + + Sbjct: 181 TDGASSDGNPAGVAM----EVTRESPIEVQTIGFGVADHALNLPGVT---QYVTASSPKA 233 Query: 351 LHDAFLRIGKEMVKQRI 367 L DA ++ + I Sbjct: 234 LIDALNQVIASESESFI 250 >gi|156367148|ref|XP_001627281.1| predicted protein [Nematostella vectensis] gi|156214186|gb|EDO35181.1| predicted protein [Nematostella vectensis] Length = 166 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 60/177 (33%), Gaps = 24/177 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 LD++ ++D S S+ G+ IR D+ VR GLV + + Sbjct: 2 KAPLDVVFLVDGSRSVERQ---GVGNFRRELSMIR---DMSAGFIISRTNVRFGLVVYGT 55 Query: 226 KIVQTFPLA--WGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + +NR + ++ L AY ++ + Sbjct: 56 RPRVVFGLNGFRNNGGLFNALNRPIKNPQTGSRIGLALRAAYTRVLARSPRRG------- 108 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K I+ L DG + +L RG ++AIG+ + L AS Sbjct: 109 -ATKIIVVLADGRSEDDVRRPSNALQ------ARGVKIFAIGIGRYINGRQLDQLAS 158 >gi|326433599|gb|EGD79169.1| hypothetical protein PTSG_09900 [Salpingoeca sp. ATCC 50818] Length = 490 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 67/219 (30%), Gaps = 29/219 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + ++ V D S SM H L A + + + V L Sbjct: 38 PERRQARMHVLFVADNSGSMCYHMQSVNTGLANAITAC------------MQHGVHPNLC 85 Query: 222 TFSSKIV-QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 F+ I P G + I L T + +++ L +A+ Sbjct: 86 NFNEVIDEHHLPYNQGAAAVASTIQNLESTGGTD----FDIVVDRLVAEMNMLLGVARAQ 141 Query: 281 DDYKK--YIIFLTDGENSSPNIDNKESL------FYCNEAKRRGAIVYAIGVQAEAADQF 332 D + +++ +TDG+ S P+ D L F V A+GV + +F Sbjct: 142 QDRQHRVFLVVMTDGQASMPSEDKFAHLQRLIEEFTALSVHSNEVNVLALGVGGDHQGEF 201 Query: 333 LKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L + +RF+ + I M Sbjct: 202 LDRLSKVVPNSNRFFQCAAGETTDELTNSITDAMGHLTT 240 >gi|297265788|ref|XP_001107747.2| PREDICTED: vitrin-like isoform 3 [Macaca mulatta] Length = 693 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ G + + + K + R G V ++ + Sbjct: 510 DIGFIIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRVGAVQYTYEQR- 559 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 560 ---LEFGFDQYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 607 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 608 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 661 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 662 HSFFVDEFDNLYQYVPRIIQNI 683 >gi|333028467|ref|ZP_08456531.1| putative von Willebrand factor [Streptomyces sp. Tu6071] gi|332748319|gb|EGJ78760.1| putative von Willebrand factor [Streptomyces sp. Tu6071] Length = 588 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 77/213 (36%), Gaps = 26/213 (12%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++V+D S SM + G G ++ V S+ + L S D GL FS+++ Sbjct: 382 LLVVVDSSPSMAELVPGRGQSRMDVTKASLLQALAQFTSADD------IGLWEFSTRLDG 435 Query: 230 TFP----LAWGVQHIQEKINRLIFGSTTKSTPGLEY---AYNKIFDAKEKLEHIAKGHDD 282 + ++ T + L+ ++D A Sbjct: 436 DRDYRELVPTDRLGARKGEGVTQRDKLTAAFGALQPQTGGATGLYDTTLAAYQQASKGYA 495 Query: 283 YKKY--IIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQAEAADQFLKN 335 K+ ++ LTDG N P + +L + A+ + AI V EAA +K Sbjct: 496 ADKFNAVVLLTDGTNEDPGSLTRGALLTKLRDLADPARP--LPLVAIAVGPEAAGDDVKA 553 Query: 336 C--ASPDRFYSVQNSRKLHDAFLR-IGKEMVKQ 365 A+ + V + ++H+ I + KQ Sbjct: 554 IGSATGGSGFKVDDPAQIHEVINNAIVEAGSKQ 586 >gi|296482552|gb|DAA24667.1| vitrin precursor [Bos taurus] Length = 652 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 67/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + + R G + ++ + Sbjct: 469 DIGFVIDGSSSV------GTSNFRTVLQFVANLSREFEISD---TDTRIGAMQYTYEQR- 518 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + YA ++F K + Sbjct: 519 ---LEFGFDEYSTKSDVLNAIKRVGYWSGGTSTGAAIHYALEQLF---------KKSKPN 566 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A +G I YAIGV A D+ P D Sbjct: 567 KRKLMILITDGRSYD------DIRIPAMLAHHKGVITYAIGVAWAAQDELDIIATHPARD 620 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ ++ + + Sbjct: 621 HAFFVDEFDNLYKVVPKVIQNI 642 >gi|170729849|ref|YP_001775282.1| hypothetical protein Xfasm12_0653 [Xylella fastidiosa M12] gi|167964642|gb|ACA11652.1| conserved hypothetical protein [Xylella fastidiosa M12] Length = 941 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 54/189 (28%), Gaps = 16/189 (8%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 K A++ Y P + H I + + + +D+S SM+ Sbjct: 107 KGGKYGAMNPYPQPASYKIRRILKGWDHDACWYPEKAAIGMQMAPSVAVYFAIDLSGSMH 166 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWG 236 G G +L ++ LD + V L F L A G Sbjct: 167 YVGGNGRSRLDNMKTALNAALDQLGQSIASGTAVDIMLAGFGDAPDHRQTLLRRNCTAQG 226 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + ++ + T Y + A + + F+TDGE Sbjct: 227 IAELKSWVAARQALYGT---------YFPAGTMDMPSFYAAAPSNAV-RVAFFMTDGEPD 276 Query: 297 SPNIDNKES 305 P+ ++ Sbjct: 277 PPSATLAQA 285 >gi|254226123|ref|ZP_04919720.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae V51] gi|125621358|gb|EAZ49695.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae V51] Length = 1637 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 70/233 (30%), Gaps = 34/233 (14%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKI--------SSKSDIGLDMMMVLDVSLSMNDHF 185 +P + ++I + + G ++ ++LD+S SM+ Sbjct: 983 NIPLEAKNAAGAIGTGKVTVVIEDDAPVAKEVFHVAEPELRQGANVQLILDISGSMDTSA 1042 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP--LAW-GVQHIQE 242 G G +L V S +++L+ +++ LV F S+ W V+ Sbjct: 1043 GNGKSRLDVMKESAKQLLEQYQAMGQTK----VQLVVFHSEAEVKSQGNSVWMTVEQAIN 1098 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN------- 295 IN L TT +E A + + Y FL+DGE Sbjct: 1099 YINGLSTKGTTDYDHAIELAEDNWSGLNTGGLLTGATNVSY-----FLSDGEPYDGDYVR 1153 Query: 296 -------SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + I+ E + + A G+ L A Sbjct: 1154 SNGNKVWNPNTIEPNELSSWITHLQANQITALAYGMGNNVPQGELDKVAYDGH 1206 >gi|70606980|ref|YP_255850.1| hypothetical protein Saci_1211 [Sulfolobus acidocaldarius DSM 639] gi|68567628|gb|AAY80557.1| conserved protein [Sulfolobus acidocaldarius DSM 639] Length = 380 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 48/202 (23%), Positives = 84/202 (41%), Gaps = 38/202 (18%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G +++LD S SM K+ A + ++++ IP+ N +TFS+ + Sbjct: 37 GFHYIILLDTSGSMYGV------KIETAK---QGAMELLSRIPEGNK---ISFLTFSNNV 84 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 A + ++I ++ G T ++ A E+ IAK HD YI Sbjct: 85 NILSEYA-DAPSLVQQIKQIRSGGQT-----------VLYRALERAIEIAKKHDLPG-YI 131 Query: 288 IFLTDGENSS-PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYS 344 I LTDG+ + P D E L Y K V A G+ + ++ LK + Y Sbjct: 132 ILLTDGQPTDVPETDAYEKLNYPEAYK-----VIAFGIGDDYNERLLKVITDKTAGILYH 186 Query: 345 VQNSRKLHD-----AFLRIGKE 361 V++++++ + A IG + Sbjct: 187 VEDAKEIAEMLPQSAVTEIGAK 208 >gi|71274892|ref|ZP_00651180.1| conserved hypothetical protein [Xylella fastidiosa Dixon] gi|71902167|ref|ZP_00684189.1| conserved hypothetical protein [Xylella fastidiosa Ann-1] gi|71164624|gb|EAO14338.1| conserved hypothetical protein [Xylella fastidiosa Dixon] gi|71728071|gb|EAO30276.1| conserved hypothetical protein [Xylella fastidiosa Ann-1] Length = 941 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 54/189 (28%), Gaps = 16/189 (8%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 K A++ Y P + H I + + + +D+S SM+ Sbjct: 107 KGGKYGAMNPYPQPASYKIRRILKGWDHDACWYPEKAAIGMQMAPSVAVYFAIDLSGSMH 166 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWG 236 G G +L ++ LD + V L F L A G Sbjct: 167 YVGGNGRSRLDNMKTALNAALDQLGQSIASGTAVDIMLAGFGDAPDHRQTLLRRNCTAQG 226 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + ++ + T Y + A + + F+TDGE Sbjct: 227 IAELKSWVAARQALYGT---------YFPAGTMDMPSFYAAAPSNAV-RVAFFMTDGEPD 276 Query: 297 SPNIDNKES 305 P+ ++ Sbjct: 277 PPSATLAQA 285 >gi|332884780|gb|EGK05036.1| hypothetical protein HMPREF9456_03189 [Dysgonomonas mossii DSM 22836] Length = 342 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 57/176 (32%), Gaps = 28/176 (15%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ G+++++ +DVS SM +L A I+ I D + Sbjct: 82 TKVEKVDKKGIELVIAIDVSNSMMAEDISP-SRLVKAK-------QILTRIIDERKNDKV 133 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQ---EKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 +V F+ + PL Q + E IN L+ T + + Sbjct: 134 AIVVFAGEAFIQLPLTPDNQSAKLFLETINPSLVPVQGTAIGSAI-----------DMSM 182 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 D K I+ +TDGE N + + +G V +G+ Sbjct: 183 SCFSNDADIDKAIVLITDGEGHEGNAEEAAARAA-----SKGVHVNVVGIGTAEGA 233 >gi|297182077|gb|ADI18250.1| hypothetical protein [uncultured Chromatiales bacterium HF0200_41F04] Length = 365 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 45/139 (32%), Gaps = 22/139 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK- 226 GLD++ V+D + SM D++ DI+ ++ + R G V F Sbjct: 149 GLDVVFVVDATGSMGWAIDEIKDRI----------YDIVSTVRTLVPAARFGFVAFRDHN 198 Query: 227 ----IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V++ PL + + ++ L + D Sbjct: 199 DPEFLVRSEPLTFSTAKLHRFLDPLQAAGGGDIWEEVNAGIAAGIDDSGWRVGA------ 252 Query: 283 YKKYIIFLTDGENSSPNID 301 ++ II + D + D Sbjct: 253 -RRIIILVGDAPPREESFD 270 >gi|39934532|ref|NP_946808.1| dinitrification protein NorD [Rhodopseudomonas palustris CGA009] gi|39648381|emb|CAE26901.1| dinitrification protein NorD [Rhodopseudomonas palustris CGA009] Length = 636 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 69/196 (35%), Gaps = 31/196 (15%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F A L +T V +S +D +D VLD+ + L + + Sbjct: 436 FYLAARPQARDLAVTLLVDVSLSTDAWIDNRRVLDIE-------KEALTVLAHGIEACGD 488 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 I+ + VR V + PL ++ +I L G T+ L + Sbjct: 489 QHSILTFTSRRRDWVRLETVK--GFGERMSPL------VERRIAALKPGYYTRIGAALRH 540 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A ++ ++ KK ++ LTDG+ + ++S EA+R G Sbjct: 541 ASAELARQPQR-----------KKLLLVLTDGKPNDVDHYEGRFALEDSRRAVQEARRSG 589 Query: 317 AIVYAIGVQAEAADQF 332 V+ + + +A F Sbjct: 590 IAVFGVTIDVDAQSYF 605 >gi|308472877|ref|XP_003098665.1| hypothetical protein CRE_04175 [Caenorhabditis remanei] gi|308268265|gb|EFP12218.1| hypothetical protein CRE_04175 [Caenorhabditis remanei] Length = 396 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 80/217 (36%), Gaps = 19/217 (8%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +P + + A+S ++PL + S++ LD++ V+D S M G++++ Sbjct: 5 LPILLFSVSVYADS-YSPLSYVDRPCGTDLSNLWLDVIAVVDNSRGMT---VAGLNEVAA 60 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWGVQHIQEKINRLI 248 S+ I R GLVT++S Q L I + ++ ++ Sbjct: 61 NIASVFGSGTRIGLNASEPRTTRLGLVTYNSVATQKADLNQYQSIGDVFHGIFDALSNIV 120 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 + + GLE A + D + + Y+K +I + + +D L Sbjct: 121 DTNESYLATGLELAERMLID-----QSVNSTRAHYQKVVIVYASEYDGNGELDP---LPI 172 Query: 309 CNEAKRRGAIVYAIGV-QAEAADQFLKNCASPDRFYS 344 K G + + A + N ASP +S Sbjct: 173 AERLKLSGVKIITVAYGNAYGLTKSFSNIASPGFAFS 209 >gi|290990289|ref|XP_002677769.1| predicted protein [Naegleria gruberi] gi|284091378|gb|EFC45025.1| predicted protein [Naegleria gruberi] Length = 754 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 75/205 (36%), Gaps = 30/205 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++ LDVS SM G G+D+ A +I + + + IPDV L+ + + Sbjct: 40 QIVIALDVSGSMR---GQGIDQ---AKIAISNLFEQVVDIPDVV------LIAYDTSAEL 87 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + Q + ++ G T T E + + + + I+F Sbjct: 88 YDLRKKPAETRQSTLEQIQAGGGTDFTCVF-----------EAISKLDMFNSQSEVAILF 136 Query: 290 LTDGENSSPNIDNK--ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC----ASPDRFY 343 TDG++ S + K E + E K + + IG + L + F Sbjct: 137 FTDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQGTFQ 196 Query: 344 SVQNSRKLHDAFLR-IGKEMVKQRI 367 V+++ +++ + IG + Sbjct: 197 YVKDANEINQSMENLIGLLTSNSSV 221 >gi|260577971|ref|ZP_05845896.1| secreted Mg-chelatase subunit [Corynebacterium jeikeium ATCC 43734] gi|258603897|gb|EEW17149.1| secreted Mg-chelatase subunit [Corynebacterium jeikeium ATCC 43734] Length = 551 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 67/201 (33%), Gaps = 36/201 (17%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVR---SGLVTFSSKI-V 228 +LD S SM + +L + ++D + R L+ FSSK+ Sbjct: 367 LLDTSGSMRGN------RLADLKGILNRLIDGTAGEAGNPKGFGRRETITLMPFSSKVAD 420 Query: 229 QTFPLAWGV------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + +++ +N L T + AY+++ + L Sbjct: 421 GYTQEHYDPDSAEQSRGLRDYVNGLQPRGETAIYDAVLRAYDRVGEGGGSLNS------- 473 Query: 283 YKKYIIFLTDGENSSPN----IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCA 337 I+ +TDGE++S K R V+ I + D+ Sbjct: 474 ----IVLMTDGESNSGTNRQEFITKMKRKMAET--DRKIPVFVILYGEASEDEMNFLADF 527 Query: 338 SPDRFYSVQNSRKLHDAFLRI 358 + + ++ + S L AF I Sbjct: 528 TGGKVFNAR-SGDLSKAFEEI 547 >gi|318059857|ref|ZP_07978580.1| hypothetical protein SSA3_18051 [Streptomyces sp. SA3_actG] gi|318076736|ref|ZP_07984068.1| hypothetical protein SSA3_08427 [Streptomyces sp. SA3_actF] Length = 588 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 77/213 (36%), Gaps = 26/213 (12%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++V+D S SM + G G ++ V S+ + L S D GL FS+++ Sbjct: 382 LLVVVDSSPSMAELVPGRGQSRMDVTKASLLQALAQFTSADD------IGLWEFSTRLDG 435 Query: 230 TFP----LAWGVQHIQEKINRLIFGSTTKSTPGLEY---AYNKIFDAKEKLEHIAKGHDD 282 + ++ T + L+ ++D A Sbjct: 436 DRDYRELVPTDRLGARKGEGVTQRDKLTAAFGALQPQTGGATGLYDTTLAAYQQASKGYA 495 Query: 283 YKKY--IIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQAEAADQFLKN 335 K+ ++ LTDG N P + +L + A+ + AI V EAA +K Sbjct: 496 ADKFNAVVLLTDGTNEDPGSLTRGALLTKLRDLADPARP--LPLVAIAVGPEAAGDDVKA 553 Query: 336 C--ASPDRFYSVQNSRKLHDAFLR-IGKEMVKQ 365 A+ + V + ++H+ I + KQ Sbjct: 554 IGSATGGSGFKVDDPAQIHEVINNAIVEAGSKQ 586 >gi|295698899|ref|YP_003606792.1| von Willebrand factor A [Burkholderia sp. CCGE1002] gi|295438112|gb|ADG17281.1| von Willebrand factor type A [Burkholderia sp. CCGE1002] Length = 328 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 69/219 (31%), Gaps = 35/219 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + G +++++D S SM++ G + + + R + F Sbjct: 80 TGSGAQILILMDRSQSMDEPMGSKGVESPRGDSKNHVAREALTRFVGQRPNDRLAFMMFG 139 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGST---TKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + V P + + I+ + G T+ G+ A + FD + A Sbjct: 140 TNPVLAMPFTYNHRVIEAAVAATAIGRGMPDTELDRGMLAAIAQ-FDGRLSSGRRA---- 194 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI----------------GVQ 325 I+ ++DG +D + +R +Y I Sbjct: 195 -----IVLVSDG---GALLDERMQRRIEEGLRRDRIALYFIYLRSSVFSPDLNATQPASG 246 Query: 326 AEAADQFLK---NCASPDRFYSVQNSRKLHDAFLRIGKE 361 + A Q + +P R + ++ + + A I ++ Sbjct: 247 SSAEAQLHRFFLTLKTPYRLFQAEDPKAMMAAIAEINRQ 285 >gi|225686099|ref|YP_002734071.1| protein norD [Brucella melitensis ATCC 23457] gi|256262776|ref|ZP_05465308.1| protein norD [Brucella melitensis bv. 2 str. 63/9] gi|225642204|gb|ACO02117.1| Protein norD [Brucella melitensis ATCC 23457] gi|263092583|gb|EEZ16818.1| protein norD [Brucella melitensis bv. 2 str. 63/9] gi|326410427|gb|ADZ67491.1| protein NorD [Brucella melitensis M28] gi|326553720|gb|ADZ88359.1| protein NorD [Brucella melitensis M5-90] Length = 633 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 76/205 (37%), Gaps = 34/205 (16%) Query: 168 GLDMMMVLDVSLSMN---------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 L + +++DVSLS + D + L + + I+ + VR Sbjct: 443 DLAVTLLVDVSLSTDAWVDNRRVLDVEKEALLVLANGIAACGDRCSILTFTSRRRSWVRV 502 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + V+ F ++G ++ +I L G T+ + +A K+ + Sbjct: 503 -------ETVKDFDESFGP-TVEHRIAALKPGFYTRMGAAMRHATAKLAEQP-------- 546 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK ++ LTDG+ + ++ E + +G V+A+ V EA+ +L Sbjct: 547 ---NRKKLLLLLTDGKPNDVDHYEGRFALEDCRRAAGEVRAKGVNVFAVTVDREAS-AYL 602 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRI 358 + V N KL A I Sbjct: 603 PALFGRGGYALVANLAKLPVALPAI 627 >gi|145295537|ref|YP_001138358.1| hypothetical protein cgR_1465 [Corynebacterium glutamicum R] gi|140845457|dbj|BAF54456.1| hypothetical protein [Corynebacterium glutamicum R] Length = 354 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 37/254 (14%), Positives = 76/254 (29%), Gaps = 33/254 (12%) Query: 119 DDQHKDYNLSAVSRYEMP--FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 D Y + +P T + +++ + + VLD Sbjct: 116 DALTDTYRRPTTANATLPAELSSQTIIEAPFPGSKTVTDALIDAYTNQFRVPGETTFVLD 175 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR----SGLVTFSSKIVQTFP 232 VS SM ++ + ++ +++ + N +R ++ FS + Sbjct: 176 VSGSMLGQ------RITLLKDTMSDLISGGATTDLANVSLRGREKVSIIPFSFGPHEVIS 229 Query: 233 LAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 G +Q+++ L T + AY E Sbjct: 230 ETLGAVGSPSRIDLQQRVEALQADGGTGIYDAVLAAY------AESAGGDYIPS------ 277 Query: 287 IIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYS 344 I+ +TDGE ++ D + + + R V+ I + A + + + Sbjct: 278 IVLMTDGELTAGRTYDQFLTEWNALPSNIRSIPVFVILYGEANVADMEQLAATTGGKTFD 337 Query: 345 VQNSRKLHDAFLRI 358 N L +AF I Sbjct: 338 AIN-GDLDEAFKEI 350 >gi|15837128|ref|NP_297816.1| hypothetical protein XF0526 [Xylella fastidiosa 9a5c] gi|9105381|gb|AAF83336.1|AE003900_15 hypothetical protein XF_0526 [Xylella fastidiosa 9a5c] Length = 941 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 60/209 (28%), Gaps = 24/209 (11%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 K A++ Y P + H I + + + +D+S SM+ Sbjct: 107 KGGKYGAMNPYPQPASYKIRRILKGWDHDACWYPEKAAIGMQMAPCVAVYFAIDLSGSMD 166 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWG 236 G G +L ++ +LD + V LV F L A G Sbjct: 167 YVGGNGRSRLENMKTALNAVLDQLGQTIASGAAVDILLVGFGDAPDHRQTLLRRNCTAQG 226 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK--KYIIFLTDGE 294 + ++ ++ + Y F A + F+TDG Sbjct: 227 IAELKSWVS------------ARQALYGTYFPAGTMDMPSFYAAAPPNAVRVAFFVTDGV 274 Query: 295 NSSPNIDNKES----LFYCNEAKRRGAIV 319 P+ N ++ + + G + Sbjct: 275 PDPPSATNAQAARADVDQVAHLRCYGITI 303 >gi|295106190|emb|CBL03733.1| hypothetical protein [Gordonibacter pamelaeae 7-10-1-b] Length = 929 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 54/171 (31%), Gaps = 49/171 (28%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 G + I +L T GL+ A N +F A + + + + LTDG Sbjct: 343 GFDNALASIGQLSADGGTLVDDGLDMA-NGVFSANPLV-----TGELRNRVTVVLTDGAP 396 Query: 296 SSPNID---NKESLFYCNEAKRRGAIVYAIGV--QAEAADQFLKNCA------------- 337 D E++ +E K G+ V++IG+ A+A+ A Sbjct: 397 GLYGNDRGVANEAISQASELKTAGSTVFSIGIFPGADASGDLPDQSAMGWGDNDSNRFMH 456 Query: 338 -------------------------SPDRFYSVQNSRKLHDAFLRIGKEMV 363 SPD + S +S L+ F I + Sbjct: 457 LLSSNYPDASSMGSPGARFVDEEGSSPDYYLSASDSAGLNSIFQSISQSTG 507 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 60/217 (27%), Gaps = 46/217 (21%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + I D+++VLD S SM+D FG + + L + N VR G ++ Sbjct: 156 EQGIPCDIVLVLDQSGSMDDRFGS-QGSYHALSGYSNKRLG--DLAENGNLYVRGGDGSY 212 Query: 224 -------SSKIVQTFPLAW----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 S I + W G + + + T T + Sbjct: 213 VAVDVDVSGFISLKYSYTWEGLEAPLTSEGRYTVPQFEGVTFYSFQTDQTV---TRIAAL 269 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 DA + + + DG + IG + Sbjct: 270 KDAANSFVQSVRSNSLGE-------DG--------------IAGTVDDVPHRIAVIGFAS 308 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 + AS S Q+ + AF + EM Sbjct: 309 GDNTELFVGSASYPYGQSAQS--QYGSAFQDMTTEMG 343 >gi|307102442|gb|EFN50717.1| hypothetical protein CHLNCDRAFT_142575 [Chlorella variabilis] Length = 575 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 78/204 (38%), Gaps = 14/204 (6%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S K S L+++++LDVS SM + F + + +S DV V Sbjct: 114 SGKASDFRRPRLNLLVLLDVSGSMGESFSSYYYDQLGQQVAQPSGSEEQRSKMDVAKEVL 173 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +G+V ++GV ++ + S T GL+ A ++ + +E Sbjct: 174 AGVVG-----KLGPDDSFGVVLFRQMGRDIADTSGTNMQAGLDAATGEMRACRTCMEADR 228 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 ++ I+ +TD + + +I ++ L G IGV + + ++ + Sbjct: 229 AATENR---IVLITDAQPNQGDISDEGLLARLKANAADGIHTTIIGVGLDFNTELVEGIS 285 Query: 338 S--PDRFYSVQNSRK----LHDAF 355 ++SV + + L D F Sbjct: 286 KVRGANYFSVHSPGEFRRRLTDEF 309 >gi|225387829|ref|ZP_03757593.1| hypothetical protein CLOSTASPAR_01599 [Clostridium asparagiforme DSM 15981] gi|225046072|gb|EEG56318.1| hypothetical protein CLOSTASPAR_01599 [Clostridium asparagiforme DSM 15981] Length = 547 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 80/235 (34%), Gaps = 39/235 (16%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 F P+L S+ + +S + + ++DVS SM ++ Sbjct: 57 NFQAMLGEQSLPVLSVSTAE---QSGLPKTIYCLVDVSGSMKGRME-----------QVK 102 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS-TTKSTPGL 259 E L I + N+ + G + + G + I+ +I+ L + T GL Sbjct: 103 ETLTAISGGLNENDNLVIGKM---GNQITDSAFLSGQEEIKAQIDSLQYTGEDTDLYSGL 159 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + + E A ++ L+DG + + + + ++ V Sbjct: 160 IHGLKFLQQEPEVKTLRA---------LVVLSDGCDDQGAGSTWKEAY--DAVEKADIPV 208 Query: 320 YAIGV--------QAEAADQFLKNCASPDRFYSVQNSRK--LHDAFLRIGKEMVK 364 Y + V QA+ F +N A F ++ L +GKE++K Sbjct: 209 YTVAVILSEKDYEQAKELGSFARNSAGGLHFPKSDDNSSKPLAMTGQEMGKEILK 263 >gi|156405834|ref|XP_001640936.1| predicted protein [Nematostella vectensis] gi|156228073|gb|EDO48873.1| predicted protein [Nematostella vectensis] Length = 250 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 71/212 (33%), Gaps = 25/212 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S S+ P + + A I S G++ +S+K Sbjct: 30 DVALLIDASGSIGRRRWPKVVEFTQAI---------INSFNVSEEGSHVGIILYSTKTEL 80 Query: 230 TFPLAW--GVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 G + + IN + G T L+ A ++F E + Sbjct: 81 LVKFNTFQGSELTADNINAKVAAVNYRDWGGLTYIDRALKLANEQLFSP----EGGMRAS 136 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCA 337 D K + TDG+ + E K + VY +G+ E + + Sbjct: 137 KDILKVAVVFTDGKQTKDKGPFTELQIASQPLKDKDVQVYGLGIGDETTIDVQEMQEMAN 196 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 P+ + + +L + +I + + + + Y Sbjct: 197 KPENVLTAKTFEELKNLAAQITQGVCEIKYQY 228 >gi|156394499|ref|XP_001636863.1| predicted protein [Nematostella vectensis] gi|156223970|gb|EDO44800.1| predicted protein [Nematostella vectensis] Length = 175 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 58/164 (35%), Gaps = 17/164 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D + + D++ M+ G+ + ++ + DI P + R+ L+ +S Sbjct: 7 ADKHISITTKADIAFLMDSSGSIGVRDYKKEKQFVQGLSDIFDISPGQS---RASLIIYS 63 Query: 225 SKIVQTFPLAWGV--QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F L GV Q+I + L T+ L A DA+ Sbjct: 64 DFPKLIFDLEDGVTNQNITSVLKNLEYLRGRTRIDKALMMAEEVFADARP---------- 113 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + LTDG+ + + D K+ G +Y IGV Sbjct: 114 TVPRIAFILTDGKQTQ-DYDAIPLDVSSQRLKKMGVKIYVIGVG 156 >gi|148645283|gb|ABR01165.1| complement factor B [Ovis aries] Length = 761 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 42/215 (19%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +R+ ++ + S GLVT++++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSVGAHNFTGAKNCLRDFIEKVASYGVKPKY---GLVTYATE 317 Query: 227 IV--------QTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKL 273 ++ W + E++NR+ + T + L YN + + Sbjct: 318 PKVLIKVFDPKSSEADW----VTEQLNRINYADHKLKAGTNTKRALLEVYNMM---SRDI 370 Query: 274 EHIAKGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRG------AIVYAIGV 324 ++ + + + II +TDG ++ P + + + + R +Y GV Sbjct: 371 NNLKETWNRTRHVIIIMTDGLHNMGGDPVTVIHDIRYLLDIGRNRKNPREDYLDIYVFGV 430 Query: 325 QAEAADQFLKNCAS----PDRFYSVQNSRKLHDAF 355 + + AS + +Q L D F Sbjct: 431 GPLVNQENINALASKKDKEQHVFKLQGMENLEDVF 465 >gi|113476846|ref|YP_722907.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167894|gb|ABG52434.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 379 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 64/172 (37%), Gaps = 20/172 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------- 223 ++++LD S SM + G KL A ++ RE L+ + + R + F Sbjct: 56 IIVLLDFSGSMKEKDSSGTTKLEGAIKATREFLETTSA---RGSNTRVAIFPFGEGGGRC 112 Query: 224 --------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + K + +++ + + + ++T L+ A + D ++ + Sbjct: 113 NSYKVRRENIKSRFFPADDFKHKNLLDNLAKKTPCASTNIYDPLKEAIRLLSDQEDTDFY 172 Query: 276 IAKGHDDYK--KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + + + +I L+DG ++ + + +V+ +G Sbjct: 173 VPEDSIEPEPRLSVILLSDGYHNKKYENRDFRRLIALLERHDHIVVHTLGYG 224 >gi|327266004|ref|XP_003217797.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Anolis carolinensis] Length = 1098 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 263 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 310 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A+N + D + Sbjct: 311 VEPCLNGTLVQADRANKEHFREHLDKLFAKGIGMLDIALVEAFNMLSDFNHTGQ-----G 365 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +D +++F R ++ IG +A AD CA Sbjct: 366 SICSQAIMLVTDG-----AVDTYDAVFEKYNWPDRKVRIFTYLIGREAAFADNLKWMACA 420 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 421 NKGFFTQISTLAD 433 >gi|291166457|gb|EFE28503.1| hypothetical protein HMPREF0389_00418 [Filifactor alocis ATCC 35896] Length = 637 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 48/253 (18%), Positives = 98/253 (38%), Gaps = 32/253 (12%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 F+ ++E G +++ N++ + + + + D ++ + S Sbjct: 170 FFKPTIKEGGGSRENNSL--NYEMKLTLSDGTSQTIDYGTIESKLSSLNALQVVDRGSPS 227 Query: 151 APLLI----TSSVKISSKSDIGLDMMMVLDVSLSM-----NDHFGPGMDKLGVAT---RS 198 P T +V + + G+ + +VLD S SM D G ++ + Sbjct: 228 DPATAIAYRTDNVYLKKEGAHGV-ISLVLDNSGSMHTRDLKDSHGNKESRINILKVETGK 286 Query: 199 IREMLDIIKSIP------DVNNVVRS----GLVTFS----SKIVQTFPLAWGVQ--HIQE 242 + ++L K+ D N +VRS G + + SK + + V ++E Sbjct: 287 LLKLLSTNKAADVELVPFDNNVLVRSDRKGGYIKPTFYSASKEYREKIIGSNVYEGKLKE 346 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 ++ L S T + GL YA+ I + L + + ++ Y+I L DGE+++ I Sbjct: 347 SMDSLGAYSGTNTGEGLRYAFYSIDEKNNDLLR-ERPEEHFRDYLIILVDGESNAATIIP 405 Query: 303 KESLFYCNEAKRR 315 Y ++ K Sbjct: 406 TLEGNYISKNKSD 418 >gi|261416578|ref|YP_003250261.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261373034|gb|ACX75779.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325972|gb|ADL25173.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 236 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 79/216 (36%), Gaps = 25/216 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVR 217 + S L ++++ DVS SMN+ + KL ++ M+ K + + Sbjct: 6 TVVSIHSRPLPVIILADVSGSMNE-----IGKLDSLKHALNNMISSFKDASSSSLEAEIY 60 Query: 218 SGLVTFSSKI-------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++TF ++ +A + IN++ T L A + D Sbjct: 61 VSIITFGNQAANIILEPQSASEIANDPSKM-NVINKMQAIGNT----PLGKALTSLVDLL 115 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E E Y+ +I+ +DG + + L N + + A A+ + A+A + Sbjct: 116 ENRE--IYPSRAYRPFIVLASDGMPNDLWQQPLDRLL--NSERSKKANRLALAIGADADE 171 Query: 331 QFLKNCASPDRF--YSVQNSRKLHDAFLRIGKEMVK 364 LK + + + N+ ++ F + +K Sbjct: 172 SMLKKFVNNEEMPIFKANNAIEIQKFFKCVTMSAIK 207 >gi|326430083|gb|EGD75653.1| NOTCH2 protein [Salpingoeca sp. ATCC 50818] Length = 4350 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 74/192 (38%), Gaps = 22/192 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++VLD S S++ + A +++ P R +V FS++ Sbjct: 2634 DVVLVLDSSASLHQSGWADVTDFASA---------FLEAFPSDTTHARVAVVVFSTRASL 2684 Query: 230 TFPLAW---GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ + + L F T + L + + + + G Sbjct: 2685 VADFDDYLGDLRGLASVVETLPFENGATATDRALRFVRQNLISSVDAGRRSNVGS----- 2739 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +I +TDGE S+ D ++ + +E GA +YA+G +D L N PD + V Sbjct: 2740 VVITVTDGEPSAFQEDVQQRV---DELVAVGAQLYAVGAGDLVSDSTL-NLLGPDGVFRV 2795 Query: 346 QNSRKLHDAFLR 357 +++R L D Sbjct: 2796 ESTRWLFDLLAD 2807 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 82/220 (37%), Gaps = 22/220 (10%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + P+ + + ++ S + D+++VLD S S+ + + + G Sbjct: 2831 SVSSLPFTTEAITSSAVVPLCASDPSADVLYQDVVLVLDSSASLLEEGWAAVAEFGAL-- 2888 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIF-GSTT 253 +++ P R +V FS++ ++ + + L F T Sbjct: 2889 -------FLEAFPSDTTHARVAVVVFSTRASLVADFDDYLGDLRGLASVVETLPFENGAT 2941 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + L++ ++ A G +I +TDGE S+ D ++ + +E Sbjct: 2942 ATDRALKFVRERLVTATGVGRRSNVGS-----VVITVTDGEPSALQEDVQQRV---DELV 2993 Query: 314 RRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 GA +YA+G +D L N PD + V + L D Sbjct: 2994 AVGAQLYAVGAGDLVSDSTL-NLLGPDGVHHVDDVNHLQD 3032 >gi|218463216|ref|ZP_03503307.1| hypothetical protein RetlK5_28967 [Rhizobium etli Kim 5] Length = 227 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 60/189 (31%), Gaps = 19/189 (10%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 + LD S SM G D+L A R + + K + + + ++ F ++ TF Sbjct: 50 ALCLDFSGSMQGD---GEDQLQKAMRFLLTPDEASKVLVQWSPADQIIVIPFDGRVRNTF 106 Query: 232 PLAWGV---QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + + +I+R T E A +I ++ I+ Sbjct: 107 MASGNPLEQEGLLNEISRQKANGGTNMYACAERALQQIAGTDRLSTYLPA--------IV 158 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 +TDG + D + A ++ I +A L + A Sbjct: 159 IMTDGR----SDDQSRAFMSEWNAIEPRVPIFGITFG-DADKTQLDSLAKQTSARVFDGG 213 Query: 349 RKLHDAFLR 357 L AF Sbjct: 214 SDLATAFRT 222 >gi|226498336|ref|NP_001143188.1| hypothetical protein LOC100275688 [Zea mays] gi|195615532|gb|ACG29596.1| hypothetical protein [Zea mays] Length = 599 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 44/249 (17%), Positives = 92/249 (36%), Gaps = 40/249 (16%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-- 183 +S + + I + N + S K+ +D++ +++++ SM+ Sbjct: 7 RVSIATNPHILLIDSAHTFTLNGKAV---VRVEAPSSMKNHAPIDLVTLININQSMSWPA 63 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----- 234 +D L A + I L + R +V F+ K+++ Sbjct: 64 ASQTEIPSRLDLLKNAMKFIIRQL---------GDDDRLAIVAFNDKVIKENTTGILEIS 114 Query: 235 -WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 G I++K++ L+ T P LE+A K+ D ++ + +I+ ++DG Sbjct: 115 GSGRMAIEKKVDGLVAMGDTAFKPSLEHAV-KLLDDRDDKKRAG--------FIVLISDG 165 Query: 294 ENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV--QN 347 + ES+ + + R V+ G+ + L A S + S+ N Sbjct: 166 LDGQSKW-GDESITPTDPIRGLLRKYPVHTFGLGKAHDPKALHYIADISYGIYSSIVTDN 224 Query: 348 SRKLHDAFL 356 K+ +AF Sbjct: 225 LDKIIEAFA 233 >gi|187607706|ref|NP_001120436.1| hypothetical protein LOC100145523 [Xenopus (Silurana) tropicalis] gi|170284610|gb|AAI61198.1| LOC100145523 protein [Xenopus (Silurana) tropicalis] Length = 911 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 70/207 (33%), Gaps = 18/207 (8%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 + F A+ S + T S K +++ ++D S S+ P L Sbjct: 5 LQLFLVLFALFASLSAQDVAGTRPC--SDKIKCPINVFFIIDTSESIILQTAPIEILLDN 62 Query: 195 ATRSIREMLDIIKSIPDVNNVVRS---GLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FG 250 I LD ++ ++ V + G + +S +++ + Q + K+N + G Sbjct: 63 MKVFIPRFLDKLEDAAYLDQVTLNWLYGGLHYSDEVIIFSDITTSKQEYKSKLNAVNYIG 122 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + L I + + + +TDG + + + Sbjct: 123 RGTFTDCALSNMTALIQKHGGDAIN----------FAVVITDGHVTGSPCGGM--MHQAD 170 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCA 337 A+ G ++++ + + L+ A Sbjct: 171 RARNAGIKLFSVAASHDVYESGLREIA 197 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 54/143 (37%), Gaps = 17/143 (11%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ ++D S S+ ++ L I P + R G+V +S + Sbjct: 605 GALDIVFIIDSSESIG---YTNFSLEKNFVINVVSRLGSIAKDPKSDTGARVGVVQYSHE 661 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I P + +E + RL T + L++AYNK+ + + Sbjct: 662 GTFEAIQLDDPRIDSLSSFKEAVRRLEWIAGGTWTPSALQFAYNKLIKETRRDK------ 715 Query: 281 DDYKKYIIFLTDGENSSPNIDNK 303 K + + +TDG + + D + Sbjct: 716 --AKVFAVVITDGRHDPRDPDER 736 >gi|327263661|ref|XP_003216636.1| PREDICTED: cochlin-like [Anolis carolinensis] Length = 527 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/263 (15%), Positives = 85/263 (32%), Gaps = 47/263 (17%) Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 I + + E G QD+ IE++ L+ + Y +P F T Sbjct: 271 INVFIVSIAKPAPEELGMVQDLGFIEKAVCLN------------NGFFSYNIPSWFGTTK 318 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 L + S +++ ++D S S+ D + R ML Sbjct: 319 -YVKPLVQKLCAHEHMLCSKTCYNSVNVAFLIDGSSSVGDS-------------NFRLML 364 Query: 204 DII----KSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKST 256 + I KS + + V F+ F +++ + + T + Sbjct: 365 EFISNVAKSFEITDIGAKIAAVQFTYDQRTEFSFTDYITKENVLAALRGIRYMSGGTATG 424 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + + +F + K +++ LTDG+ + D+ A++ G Sbjct: 425 EAISHTTRNVFGPVRDGGN--------KNFLVILTDGQ----SYDDVRGPAVA--AQQAG 470 Query: 317 AIVYAIGVQAEAADQFLKNCASP 339 +Y+IG+ D ++P Sbjct: 471 ITIYSIGIAWAPLDDLKDMASAP 493 >gi|290987786|ref|XP_002676603.1| predicted protein [Naegleria gruberi] gi|284090206|gb|EFC43859.1| predicted protein [Naegleria gruberi] Length = 755 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 37/183 (20%), Positives = 65/183 (35%), Gaps = 33/183 (18%) Query: 171 MMMVLDVSLSMN--------DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ +LDVS SM + G +L + S+R +++++ ++ L+ Sbjct: 133 LVCILDVSGSMGSSAEDLSSSNENTGFSRLDLVKHSVRTLIELMNEKDQIS------LIP 186 Query: 223 FSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FS PL G + EK+ L +T GL + AK Sbjct: 187 FSDSARMELPLTKMDAVGKKKAIEKLEHLGPEGSTNVWDGLRLGMESSLNNP----LCAK 242 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC--NEAKR--RGAIVYAIGVQAEAADQFLK 334 + +I TDGE PNI+ + K + +++ G LK Sbjct: 243 TNTC----LILFTDGE---PNINPPRGIVPTLEKYIKEHPLNSTIHSFGFGYSLDSALLK 295 Query: 335 NCA 337 + A Sbjct: 296 DIA 298 >gi|332222722|ref|XP_003260519.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Nomascus leucogenys] Length = 3535 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|193783708|dbj|BAG53619.1| unnamed protein product [Homo sapiens] Length = 868 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|186896947|ref|YP_001874059.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|186699973|gb|ACC90602.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] Length = 233 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 72/205 (35%), Gaps = 24/205 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 MP + L S + ++ L + +++D S SM + Sbjct: 1 MPLVSLLI----------LTPLSLILRTTSDMRRLPVYLLIDTSGSMRGE------SIHA 44 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 I+ M+ ++ P V ++T+ ++ + PL +++ Q + T Sbjct: 45 VNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIPLT-ALENFQFTDITVPSAGGTF 103 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + LE + + ++ + KG + +TDG S D E K+ Sbjct: 104 TGAALECLIHCVERDIQRSDGDQKGDWRP--LVFLMTDGTPS----DVYAYGEAIKEVKK 157 Query: 315 RGA-IVYAIGVQAEAADQFLKNCAS 338 R + A V A+A + LK S Sbjct: 158 RAFGSIIACAVGAKAKHEHLKQLTS 182 >gi|55662683|emb|CAH74138.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665761|emb|CAH73557.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 845 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 58 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 108 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 109 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 164 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 165 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 213 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 214 TPKEEHCYLLHSFEE----FEALARRALHE 239 >gi|55662684|emb|CAH74139.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665762|emb|CAH73558.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55957947|emb|CAI14068.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 3548 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 58 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 108 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 109 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 164 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 165 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 213 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 214 TPKEEHCYLLHSFEE----FEALARRALHE 239 >gi|148886654|ref|NP_699197.3| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Homo sapiens] gi|296452942|sp|Q4LDE5|SVEP1_HUMAN RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=CCP module-containing protein 22; AltName: Full=Polydom; AltName: Full=Selectin-like osteoblast-derived protein; Short=SEL-OB; AltName: Full=Serologically defined breast cancer antigen NY-BR-38; Flags: Precursor Length = 3571 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|119579467|gb|EAW59063.1| hCG1794476, isoform CRA_b [Homo sapiens] Length = 1196 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|12803331|gb|AAH02484.1| COL6A2 protein [Homo sapiens] gi|30582665|gb|AAP35559.1| collagen, type VI, alpha 2 [Homo sapiens] Length = 425 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 18 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 74 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AY+++ + + Sbjct: 75 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 131 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 132 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 166 >gi|112180424|gb|AAH30816.1| SVEP1 protein [Homo sapiens] Length = 868 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|30584073|gb|AAP36285.1| Homo sapiens collagen, type VI, alpha 2 [synthetic construct] Length = 426 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 18 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 74 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AY+++ + + Sbjct: 75 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 131 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 132 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 166 >gi|179711|gb|AAA35620.1| alpha-2 collagen type VI-a' [Homo sapiens] Length = 429 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 22 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 78 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AY+++ + + Sbjct: 79 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 135 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 136 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 170 >gi|179710|gb|AAA35619.1| alpha-2 collagen type VI-a [Homo sapiens] Length = 328 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 22 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 78 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AY+++ + + Sbjct: 79 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 135 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 136 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 170 >gi|37222213|gb|AAQ89957.1| selectin-like protein [Homo sapiens] gi|68655017|emb|CAF04067.1| SEL-OB protein [Homo sapiens] Length = 3574 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|105706|pir||C35243 collagen alpha 2(VI) chain precursor, short splice form - human (fragment) gi|179709|gb|AAA35618.1| alpha-2 collagen type VI [Homo sapiens] Length = 238 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 22/165 (13%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S+ ++ L I P R G+V +S + Sbjct: 22 GALDVVFVIDSSESIG---YTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 78 Query: 227 -----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I + +E + L T + L++AY+++ + + Sbjct: 79 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRV--- 135 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + P D+ C+ R V AIG+ Sbjct: 136 -----FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIGIG 170 >gi|312136517|ref|YP_004003854.1| magnesium chelatase [Methanothermus fervidus DSM 2088] gi|311224236|gb|ADP77092.1| Magnesium chelatase [Methanothermus fervidus DSM 2088] Length = 269 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 66/192 (34%), Gaps = 24/192 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 K +++V+D+S SM+ DK ++ ++ + + + ++ Sbjct: 67 RKHGSKALIVLVVDISGSMSSE-----DKAEKVKGVLKRIM-----LDAQRHKDKLAIIG 116 Query: 223 FSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P + ++KI + G TT GL+ A + + K + Sbjct: 117 FKGREARVILPSTRRIFSFKDKIENISVGGTTPMAHGLKKAIEIL-------KREKKKSN 169 Query: 282 DYKKYIIFLTDGENSSP--NIDNKESLFYCNEAKRRGAIVYAIGV----QAEAADQFLKN 335 +Y ++ L+DG + N ++ L E R I F Sbjct: 170 EYVPILVLLSDGMPNIALKNSPTRDVLELAKELNRSDIHTVIINFEKRIMRGRNFNFELA 229 Query: 336 CASPDRFYSVQN 347 S ++Y V N Sbjct: 230 LTSGGKYYEVGN 241 >gi|297265790|ref|XP_001107629.2| PREDICTED: vitrin-like isoform 1 [Macaca mulatta] Length = 678 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ G + + + K + R G V ++ + Sbjct: 495 DIGFIIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRVGAVQYTYEQR- 544 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 545 ---LEFGFDQYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 592 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 593 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 646 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 647 HSFFVDEFDNLYQYVPRIIQNI 668 >gi|290971865|ref|XP_002668693.1| predicted protein [Naegleria gruberi] gi|284082192|gb|EFC35949.1| predicted protein [Naegleria gruberi] Length = 454 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 76/205 (37%), Gaps = 30/205 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++ LDVS SM G G+D+ A +I + + + PDV L+T+ + Sbjct: 40 QIVIALDVSGSMR---GQGIDQ---AKIAISNLFEQVVDTPDVV------LITYDTSAEL 87 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + Q + ++ G T T E + ++ + + I+F Sbjct: 88 YDLRKKPAETRQSTLEQIQAGGGTDFTCVF-----------EAISNLDMFNRQSEVAILF 136 Query: 290 LTDGENSSPNIDNK--ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC----ASPDRFY 343 TDG++ S + K E + E K + + IG + L + F Sbjct: 137 FTDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQGTFQ 196 Query: 344 SVQNSRKLHDAFLR-IGKEMVKQRI 367 V+++ +++ + IG + Sbjct: 197 YVKDANEINQSMENLIGLLTSNSSV 221 >gi|145596106|ref|YP_001160403.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145305443|gb|ABP56025.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 576 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 85/226 (37%), Gaps = 33/226 (14%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMND--HFGPGMDKLGVATRSIREMLDIIKSI 209 PL + +V S + M+ V+DVS SM + G+ + V + L + Sbjct: 360 PLAVERAVSSWSIATQSGRMLCVIDVSGSMREPVASANGVSRQQVTLDAAGRGLHLFDDS 419 Query: 210 PDVNNVVRSGLVTFSS---------KIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGL 259 GL FS+ ++V+ PL+ +++ + ++ T + Sbjct: 420 WQ------IGLWEFSTNLGSGRDYRRLVEIGPLSSQRSELEQALAQIQPTRGDTGLFDTV 473 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYCNEAKR--RG 316 AY + + ++ + + I+ TDG+N N I ++ + K R Sbjct: 474 LAAYEAVQEDWDEGQVNS---------IVLFTDGKNDDDNGISQQQLIAELERIKDPERP 524 Query: 317 AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF-LRIG 359 V IG+ A+ + L++ + + ++ K+ D F I Sbjct: 525 VQVVLIGIGADVSKAELESITEVTGGGSFITEDPTKIGDIFLKAIA 570 >gi|167626844|ref|YP_001677344.1| hypothetical protein Fphi_0624 [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596845|gb|ABZ86843.1| conserved hypothetical membrane protein with von Willebrand factor type A domain [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 332 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 66/202 (32%), Gaps = 33/202 (16%) Query: 134 EMPFIF-CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +PF+F C + + P V + ++ + ++ LDVS SM+ +L Sbjct: 58 LVPFLFLCLWIVSVVALAGPTWKYKDVPVYQEN---VSRVIALDVSQSMDTTDVSP-TRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLI 248 A ++L IK G++ FSS+ PL ++ + + ++ Sbjct: 114 ERAKYKTLDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTVENLVPVINSDIV 166 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 +E + I A K II +TD + + +++ Sbjct: 167 PVQGNNIYKAIEKSAQLITQAGAKKGQ-----------IILITD------STPSADAIAK 209 Query: 309 CNEAKRRGAIVYAIGVQAEAAD 330 + +G + Sbjct: 210 AKQLAEQGIDTDVYAIGTPKGG 231 >gi|73985485|ref|XP_533794.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H1 precursor (ITI heavy chain H1) (Inter-alpha-inhibitor heavy chain 1) (Inter-alpha-trypsin inhibitor complex component III) (Serum-derived hyaluronan-associated protein) (SHAP) [Canis familiaris] Length = 910 Score = 50.6 bits (119), Expect = 3e-04, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 75/200 (37%), Gaps = 20/200 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVT 222 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G + + Sbjct: 286 TNMNKNVVFVIDISTSMEGQ------KVKQTKEALLKILGDMRP-GDYFDLVLFGSEVQS 338 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + +VQ P ++ Q+ + T GL + A++ L ++ Sbjct: 339 WKGSLVQASPA--NLRAAQDFVKHFFLAGATNLNGGLLRGIEILNQAQKNLPKLSNHAS- 395 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 +I LTDGE + D + L A R +Y +G FL + + Sbjct: 396 ---VLIMLTDGEPTEGVTDRSQILKNVRNAIRGKFPLYNLGFGDNVDFNFLDVMSMENNG 452 Query: 342 ----FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 453 RAQRIYEDHDAAQQLQGFYD 472 >gi|219848228|ref|YP_002462661.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542487|gb|ACL24225.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 1017 Score = 50.6 bits (119), Expect = 3e-04, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 56/154 (36%), Gaps = 24/154 (15%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + S + LD+++++D S SM + D + + R ++D++ Sbjct: 17 ISICTVEPSIAQSGVE-PLDLVLIIDHSGSMENPKYGRSDPHSMRFLAARMLIDLL---- 71 Query: 211 DVNNVVRSGLVTFSSKIVQTFP----LAWGVQHIQEKINRLIFGSTTKST---PGLEYAY 263 N+ R GL+ FS + G ++E I ++ ST T LE A Sbjct: 72 --NDEDRVGLILFSDNAEDYSDGLQLVQTGRGRLKENIAKMESQSTGDFTRYKDALELAG 129 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + +IFLTDG + Sbjct: 130 ELLGETPANRRAA----------VIFLTDGAPTD 153 >gi|169831370|ref|YP_001717352.1| magnesium chelatase [Candidatus Desulforudis audaxviator MP104C] gi|169638214|gb|ACA59720.1| Magnesium chelatase [Candidatus Desulforudis audaxviator MP104C] Length = 670 Score = 50.6 bits (119), Expect = 3e-04, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 66/206 (32%), Gaps = 27/206 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + IG ++ V+D S SM ++ A ++ +L + R G+V Sbjct: 478 REKRIGNFLVFVVDASGSMG-----AQQRMVAAKGAVLSLL-----LDAYQKRDRVGMVA 527 Query: 223 FSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P V+ + ++ L G T GL AY H+ K + Sbjct: 528 FKGEHAEVLLPPTNSVELAERRLAELPTGGRTPLAAGLLKAYEV------ARAHLFKDPN 581 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK------- 334 I+ N +E L A + + V E FL Sbjct: 582 LSPLLIVISDGRGNVGLGGGPREDLRRVAALVHEEARIKTLVVDVEKDG-FLSFGLARGL 640 Query: 335 NCASPDRFYSVQN--SRKLHDAFLRI 358 A +Y +++ + L +A I Sbjct: 641 AAALDAEYYKIEDLKADTLVEAVRTI 666 >gi|332534874|ref|ZP_08410696.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] gi|332035673|gb|EGI72162.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] Length = 676 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 72/177 (40%), Gaps = 34/177 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +M+ V+D S SM+ + A +++ L ++ S N ++ F + Sbjct: 322 RLPREMVFVVDTSGSMHGQ------SMEQAKKALFYALSLLDSDDSFN------IIGFDN 369 Query: 226 --KIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + PL + ++ + I L T+ L N + D E Sbjct: 370 IVTPMSDKPLIASDFNLRRAERFIYSLEADGGTEIQGAL----NAVLDGSEFDG------ 419 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + + ++FLTDG ++ N+++LF ++K + ++ +G+ + F++ A Sbjct: 420 --FVRQVVFLTDG-----SVSNEDALFKNIQSKLGDSRLFTVGIGSAPNSFFMRRAA 469 >gi|282863104|ref|ZP_06272164.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282562086|gb|EFB67628.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 248 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 53/150 (35%), Gaps = 21/150 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +VLD S SM ++ M L S+ LD +P +V F Sbjct: 42 EGVRAAVYLVLDRSGSMRPYYRDGSMQHLAEQVLSLSAHLDDDGIVP---------VVFF 92 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S+ + + L G + ++++L +A +++ D H + Sbjct: 93 STDVDGSTDLTLGGH--RRRVDKLHANLGHMGRTNYHWAMDEVID------HYLESGSSA 144 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++F TDG P + C A+ Sbjct: 145 PALVVFQTDG---GPTSRFAAERYLCKAAR 171 >gi|332307031|ref|YP_004434882.1| Tetratricopeptide TPR_1 repeat-containing protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332174360|gb|AEE23614.1| Tetratricopeptide TPR_1 repeat-containing protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 660 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/178 (21%), Positives = 67/178 (37%), Gaps = 28/178 (15%) Query: 136 PFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 PF T W S P + + + + +VLD+SLSM Sbjct: 64 PFYILTTAWLLASVALAGPTWLRLPQPVYQLNSGKV---VVLDMSLSMRATDVSP----N 116 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG--- 250 TR+ + +D++K+I + +GLV ++ PL+ Q++ I L Sbjct: 117 RLTRAKYKAIDLVKAIAEGE----TGLVAYAGDAFTISPLSSDAQNLTTLIPSLSPEIMP 172 Query: 251 -STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 ++ GLE A N + +A + I ++TDG +S + + L Sbjct: 173 VEGSEPFLGLESAINLLHNAGYQQGE-----------IFWITDGIENSQVAEVSKLLE 219 >gi|297265794|ref|XP_001107688.2| PREDICTED: vitrin-like isoform 2 [Macaca mulatta] Length = 656 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ G + + + K + R G V ++ + Sbjct: 473 DIGFIIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRVGAVQYTYEQR- 522 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 523 ---LEFGFDQYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 570 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 571 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 624 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 625 HSFFVDEFDNLYQYVPRIIQNI 646 >gi|153836414|ref|ZP_01989081.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|260365465|ref|ZP_05778002.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] gi|260877530|ref|ZP_05889885.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|260897529|ref|ZP_05906025.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|260901731|ref|ZP_05910126.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|149750316|gb|EDM61061.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|308087122|gb|EFO36817.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|308090607|gb|EFO40302.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|308108829|gb|EFO46369.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|308114384|gb|EFO51924.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] Length = 461 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 45/344 (13%), Positives = 111/344 (32%), Gaps = 74/344 (21%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLY--TATKILNQE 67 F KG I+ LP++ I + + + +K+ + + L + N++ Sbjct: 4 FTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDNKD 63 Query: 68 NGNNGKKQKNDFSYRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + ++ + + I +I + ++G Q N + +++ +H Sbjct: 64 DQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHDS-- 121 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DH 184 +S E+ + + L +D+ +LD S SM+ + Sbjct: 122 --WISHNEIGVEPKFKVSGDSITRKYLP------------QPVDIYFILDTSQSMSNPWY 167 Query: 185 FGPGMDKLGVATRSIR---EMLDIIKSIPDVNNVVRSGLVTFSSK-------------IV 228 ++ V +I + L+ K+ PD + R L+T+++ Sbjct: 168 GERNKTQMQVVKDTITRVVKELENFKTGPDKKS--RVALLTYNAYNAKFDKGAGRVKLYD 225 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK----------------------- 265 + + ++++ S + P YNK Sbjct: 226 YASEFSHTEASFESIVDKMFDKSVVEQKPHYASDYNKSQDIPLTDKYQEFIDILNSNKVM 285 Query: 266 ------------IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + A ++ + + K + ++ I L+DG ++ Sbjct: 286 PARGGGTQSWLGLIAAAKEADKVKKEDRNPEQVFIILSDGADTD 329 >gi|51597756|ref|YP_071947.1| TerY-like tellurite resistance protein. [Yersinia pseudotuberculosis IP 32953] gi|170022819|ref|YP_001719324.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|51591038|emb|CAH22702.1| Putative TerY-like tellurite resistance protein [Yersinia pseudotuberculosis IP 32953] gi|169749353|gb|ACA66871.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] Length = 233 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 72/205 (35%), Gaps = 24/205 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 MP + L S + ++ L + +++D S SM + Sbjct: 1 MPLVSLLI----------LTPLSLILRTTSDMRRLPVYLLIDTSGSMRGE------SIHA 44 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 I+ M+ ++ P V ++T+ ++ + PL +++ Q + T Sbjct: 45 VNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIPLT-ALENFQFTDITVPSAGGTF 103 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + LE + + ++ + KG + +TDG S D E K+ Sbjct: 104 TGAALECLIHCVDRDIQRSDGDQKGDWRP--LVFLMTDGTPS----DVYAYGEAIKEVKK 157 Query: 315 RGA-IVYAIGVQAEAADQFLKNCAS 338 R + A V A+A + LK S Sbjct: 158 RAFGSIIACAVGAKAKHEHLKQLTS 182 >gi|258645421|ref|ZP_05732890.1| magnesium-chelatase, subunit D/I family [Dialister invisus DSM 15470] gi|260402772|gb|EEW96319.1| magnesium-chelatase, subunit D/I family [Dialister invisus DSM 15470] Length = 640 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 51/146 (34%), Gaps = 18/146 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + G + ++D S SM +++ + +ML R G++ Sbjct: 442 EREKRTGNIFLFLVDASGSMG-----ARERMKAVKGVVFKML-----ADAYQKRDRVGMI 491 Query: 222 TFS-SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 F + P+ ++ Q+K+ L G T GL A + + + Sbjct: 492 AFRRDRAEVLLPITRSIEFAQKKLAALPTGGKTPLAQGLIKAEDML-----DRLYKQDPL 546 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESL 306 D +I +TDG ++ N + + Sbjct: 547 QDP--VLILITDGRATNSLNKNTDPV 570 >gi|224029935|gb|ACN34043.1| unknown [Zea mays] Length = 598 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 44/249 (17%), Positives = 91/249 (36%), Gaps = 40/249 (16%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND-- 183 +S + + I + N + S K+ +D++ +++++ SM+ Sbjct: 7 RVSIATNPHILLIDSAHTFTLNGKAV---VRVEAPSSMKNHAPIDLVTLININQSMSWPA 63 Query: 184 ----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----- 234 +D L A + I L + R +V F+ K+++ Sbjct: 64 ASQTEIPSRLDLLKNAMKFIIRQL---------GDDDRLAIVAFNDKVIKENTTGILEIS 114 Query: 235 -WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 G I++K++ L+ T P LE+A K+ D + + +I+ ++DG Sbjct: 115 GSGRMAIEKKVDGLVAMGDTAFKPSLEHAV-KLLDDRADKKRAG--------FIVLISDG 165 Query: 294 ENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV--QN 347 + ES+ + + R V+ G+ + L A S + S+ N Sbjct: 166 LDGQSKW-GDESITPTDPIRGLLRKYPVHTFGLGKAHDPKALHYIADISYGIYSSIVTDN 224 Query: 348 SRKLHDAFL 356 K+ +AF Sbjct: 225 LDKIIEAFA 233 >gi|156975610|ref|YP_001446517.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156527204|gb|ABU72290.1| hypothetical protein VIBHAR_03343 [Vibrio harveyi ATCC BAA-1116] Length = 502 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 77/224 (34%), Gaps = 21/224 (9%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 + KG ++ A+ L +F + +E + ++L + L I + N Sbjct: 15 RSQKGIAAVWFALSLVPVFGMTFFAVEGTRYIQETSRLRDAAQTAALA----ITIDDKSN 70 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK-DYNLSA 129 + +++I DI + + D+ K Y++ A Sbjct: 71 QADALATMYINDYVRDI-------------SHVDIQTVRTYEEPTEDNDNTEKIQYSVQA 117 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGP 187 V+ + F + P ++ D +D+++V D S SM + Sbjct: 118 VTTHNSWFASNSIPSFETQEKLAGQAVAAKYPFYLGDKIIDLVLVTDFSGSMNNSWDGEI 177 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +D L A + I + ++ + + R ++ F+ ++ + Sbjct: 178 KIDLLKDAVKQISNRI-LVPREGESEVLNRIAIIPFNLRVQEKI 220 >gi|153836806|ref|ZP_01989473.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149749952|gb|EDM60697.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|328474272|gb|EGF45077.1| hypothetical protein VP10329_16235 [Vibrio parahaemolyticus 10329] Length = 418 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 38/111 (34%), Gaps = 10/111 (9%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 KG +L +++L ++ V I+ +H+ K +L +D + L A + Sbjct: 7 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVA----DKT 62 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENG------FAQDINNIERSTSLS 115 Q + +I EL F+ D+ + S + Sbjct: 63 EDVDQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFT 113 >gi|85374101|ref|YP_458163.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] gi|84787184|gb|ABC63366.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] Length = 435 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 16/113 (14%), Positives = 42/113 (37%), Gaps = 1/113 (0%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 + + G+ ++ A+ LP + G ++ + + +K +L Y +D + + A + Sbjct: 9 KRLRQSNTGNAMMILALGLPALVGGAGYGLDMAQWYMLKRELQYAVDQAAVAGAYSLSYN 68 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 + + Y ++I G + + S ++S +D Sbjct: 69 GTAGDWSARAEQ-EYDANRSITTGYATANDSTKGVTDYGSFTQNSVTVSATMD 120 >gi|325914146|ref|ZP_08176499.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] gi|325539649|gb|EGD11292.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] Length = 525 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 46/238 (19%), Positives = 82/238 (34%), Gaps = 34/238 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 PW +S + IT +S +++ ++DVS SM DKL + S++ Sbjct: 131 TPWNHDSVLLRIGITGRAVAASAMPAA-NLVFLVDVSGSMG-----APDKLPLLQSSLKL 184 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGL 259 + +++ R LVT++ P G Q I E I+ L G T G+ Sbjct: 185 LTRQLRAQD------RITLVTYAGNTAVVLPPTPGNQQARIVEAIDSLQSGGGTAGASGI 238 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E AY + + I+ TDG+ + D E +R G + Sbjct: 239 ELAYKAAQQSYLRDGINR---------ILLATDGDFNVGVTDFDTLKGMVAEKRRSGVAL 289 Query: 320 YAIGVQAEA-ADQFLKNC--ASPDRFYSVQNS--------RKLHDAFLRIGKEMVKQR 366 +G D ++ A + + + +L I +++ Q Sbjct: 290 STLGFGTGNYNDTLMEQLADAGDGAYAYIDSPLEARKVLTHELGATLETIARDVKIQV 347 >gi|225420217|ref|ZP_03762520.1| hypothetical protein CLOSTASPAR_06560 [Clostridium asparagiforme DSM 15981] gi|225041138|gb|EEG51384.1| hypothetical protein CLOSTASPAR_06560 [Clostridium asparagiforme DSM 15981] Length = 231 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 66/175 (37%), Gaps = 15/175 (8%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + ++++D S SM G +++L + + E + + V +++F+S + Sbjct: 23 IACVLLVDTSGSMA---GASINELN---QGLLEFGNALDQDEHARGVADVCVISFNSNVE 76 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK--GHDDYKKY 286 P + L G T + A DA E+ + + + G Y+ + Sbjct: 77 TVVPFCPAANYSAPT---LSAGGLT----SMNEAVIAGLDAIEERKQLYRQLGCSYYRPW 129 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + LTDGE + N++ + + + +G+ + A LK+ Sbjct: 130 MFLLTDGEPTDQNMEGEAKNRLQQALNDKKVNFFPMGIGSGANYAHLKSYTKGGN 184 >gi|118096709|ref|XP_001233876.1| PREDICTED: similar to tumor suppressor candidate 4 [Gallus gallus] Length = 1208 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 66/179 (36%), Gaps = 42/179 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++DVS S++ L + S+ EMLD + ++ V F Sbjct: 87 DMVIIVDVSGSVSGL------TLKLMKTSVCEMLDTLSD----DDYVNVASKVF------ 130 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 +E + ++ TT G EYA++++ ++ + K I+ Sbjct: 131 -----------KEDVQGMVVKGTTDYKAGFEYAFDQLQNSNITRANCN-------KMIMM 172 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLK--NCASPDRFYSV 345 TDG D + +F + V+ V D L+ CA+ ++ + Sbjct: 173 FTDG-----GEDRVQDVFEKYNWPNKTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEI 226 >gi|90409064|ref|ZP_01217189.1| hypothetical protein PCNPT3_10646 [Psychromonas sp. CNPT3] gi|90309821|gb|EAS37981.1| hypothetical protein PCNPT3_10646 [Psychromonas sp. CNPT3] Length = 641 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 57/200 (28%), Gaps = 32/200 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F++ + P S I + +++D+S SM D+L Sbjct: 61 FLYPLAILTLIALAGPSWQKISTPIYDIKKAQV---LIMDMSYSM-YATDIKPDRLSQEK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----ST 252 +++ LV ++ PL I I L Sbjct: 117 YKAMDLIKAWDEGEK-------ALVAYAGDAFTLSPLTRDSNAILNHIPNLTPEIMPVMG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + + L+ A + +A H I+F+TDG I ++ + Sbjct: 170 SNADAALQKAITLLTNAGYTKGH-----------IVFMTDG------ISPTQADSMLDRL 212 Query: 313 KRRGAIVYAIGVQAEAADQF 332 K +V + + E Sbjct: 213 KGTAWVVSVLALATEKGAPI 232 >gi|28899191|ref|NP_798796.1| hypothetical protein VP2417 [Vibrio parahaemolyticus RIMD 2210633] gi|28807415|dbj|BAC60680.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 431 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 38/111 (34%), Gaps = 10/111 (9%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 KG +L +++L ++ V I+ +H+ K +L +D + L A + Sbjct: 20 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVA----DKT 75 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENG------FAQDINNIERSTSLS 115 Q + +I EL F+ D+ + S + Sbjct: 76 EDVDQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFT 126 >gi|116618630|ref|YP_819001.1| von Willebrand factor domain-containing protein [Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293] gi|116097477|gb|ABJ62628.1| von Willebrand factor (vWF) domain containing protein [Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293] Length = 920 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 43/255 (16%), Positives = 77/255 (30%), Gaps = 34/255 (13%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTS 113 + + + G + N D N +N ++ + Sbjct: 39 SAASDITPQYTSNSTGVFPTNSWTIPGQNTVINHQGGDASNGWDKNSSWNGDSSDTSKSY 98 Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 L D + DY + + + + N+ + K+ +D+++ Sbjct: 99 LKFGTDTSNPDYQIRKYA--KETSTPGLYDVYLNAKGNEV----------KNIKPIDIVL 146 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS--------S 225 V+D+S SM P ATR + ++ V GLV FS S Sbjct: 147 VVDMSGSMEPANNPSGSNRAQATRDGVKQFLQAIKDAGISQYVNVGLVGFSSPGNYVTGS 206 Query: 226 KIVQTFPL-AWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + E+IN+ F T + G+ + D K Sbjct: 207 NGYLEVGMQSLSTTGQTEQINKTLSPTFSGGTFTQLGIRRGQKMLDDDKNDH-------- 258 Query: 282 DYKKYIIFLTDGENS 296 KK +I LTDG + Sbjct: 259 --KKMMILLTDGVPT 271 >gi|328470527|gb|EGF41438.1| hypothetical protein VP10329_07002 [Vibrio parahaemolyticus 10329] Length = 461 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 48/347 (13%), Positives = 111/347 (31%), Gaps = 80/347 (23%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLY--TATKILNQE 67 F KG I+ LP++ I + + + +K+ + + L + N++ Sbjct: 4 FTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDNKD 63 Query: 68 NGNNGKKQKNDFSYRIIKNI-WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + ++ + + I +I + ++G Q N + +++ +H Sbjct: 64 DQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHDS-- 121 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DH 184 +S E+ + + L +D+ +LD S SM+ + Sbjct: 122 --WISHNEIGVEPKFKVSGDSITRKYLP------------QPVDIYFILDTSQSMSNPWY 167 Query: 185 FGPGMDKLGVATRSIR---EMLDIIKSIPDVNNVVRSGLVTFSS---------------- 225 ++ V +I + L+ K+ PD + R L+T+++ Sbjct: 168 GERNKTQMQVVKDTITRVVKELENFKTGPDKKS--RVALLTYNAYNAKFDKGAGRVKLYD 225 Query: 226 -----------------------------------KIVQTFPLAWGVQHIQEKINRLIFG 250 Q PL QE I+ L Sbjct: 226 YASEFSHTEASFESIVDKMFDESVVEQKPHYASDYNKSQDIPLT---DKYQEFIDILNSN 282 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + G ++ + A ++ + + K + ++ I L+DG ++ Sbjct: 283 KVMPARGGGTQSWLGLIAAAKEADKVKKEDRNPEQVFIILSDGADTD 329 >gi|75812639|ref|YP_320257.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] gi|75705395|gb|ABA25068.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] Length = 405 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 64/174 (36%), Gaps = 14/174 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV-----RSGLVTFSS 225 ++++LD S SMN G K+ A ++IR++ ++K + V +G Sbjct: 82 IIVLLDFSGSMNKLDSRGTKKIEGAIKAIRQLTSVLKDRGENTQVAIVPFGEAGANCPQG 141 Query: 226 KIVQTFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 V L + +Q+ + + L ++T L+ A + + + ++ K Sbjct: 142 YPVNKDTLDKFFAANDFKLQNNLDYLASLTPCASTNLYEPLKKAVKFLANTSDSRFYLPK 201 Query: 279 --GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + II L+DG ++ N + IV+ +G Sbjct: 202 DSPQTPPRLSIILLSDGYHNFANEAQDFQSLTTLLKRNTNIIVHTLGYGLTPEQ 255 >gi|296190554|ref|XP_002806559.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Callithrix jacchus] Length = 3582 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 70/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDDSSSVGEV------NFRSELLFVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + ++ L+ G T + + A + A+E Sbjct: 132 YVVPRVDYISTSRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVLFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|254881903|ref|ZP_05254613.1| von Willebrand factor [Bacteroides sp. 4_3_47FAA] gi|319641094|ref|ZP_07995798.1| von Willebrand factor [Bacteroides sp. 3_1_40A] gi|254834696|gb|EET15005.1| von Willebrand factor [Bacteroides sp. 4_3_47FAA] gi|317387338|gb|EFV68213.1| von Willebrand factor [Bacteroides sp. 3_1_40A] Length = 212 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/130 (20%), Positives = 46/130 (35%), Gaps = 11/130 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM +G + ++ ++ ++S P ++TF+S Sbjct: 3 RLPVYLLLDTSGSM---YGEPI---EAVKNGVQTLISTLRSDPYALETAYISIITFNSSA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + + + T LE KI K KG I Sbjct: 57 QQVTPLT---ELAAFQQPNIDASGCTALGGALELLSQKIDSEITKTTAEVKGDWRP--LI 111 Query: 288 IFLTDGENSS 297 +TDG + Sbjct: 112 FIMTDGVPTD 121 >gi|195051568|ref|XP_001993124.1| GH13254 [Drosophila grimshawi] gi|193900183|gb|EDV99049.1| GH13254 [Drosophila grimshawi] Length = 1237 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 78/195 (40%), Gaps = 26/195 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM++ +AT + +LD + VN +VV++ Sbjct: 254 AASSPKDIMILLDASSSMSEK------SFDLATSTAFNILDTLGEDDYVNLITFSDVVKT 307 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P VQ I+ + + T T GLEYA++ + + Sbjct: 308 PVPCFKDRMVRATP--DNVQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGA---- 361 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LK--N 335 + I+ +T+ + S K+ + ++ + +++ + L Sbjct: 362 -GSQCNQAIMLITESTSESHKEIIKQYNWP-----HMPVRIFTYLIGSDSGSRSNLHEMA 415 Query: 336 CASPDRFYSVQNSRK 350 C++ F + N + Sbjct: 416 CSNKGFFVQINNYEE 430 >gi|148253599|ref|YP_001238184.1| NorD protein required for nitric oxide reductase (Nor) activity [Bradyrhizobium sp. BTAi1] gi|146405772|gb|ABQ34278.1| NorD protein required for nitric oxide reductase (Nor) activity [Bradyrhizobium sp. BTAi1] Length = 637 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 69/171 (40%), Gaps = 17/171 (9%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++DVSLS D + G+ L V ++ + + + D +++ +TF+S+ Sbjct: 447 DLAVTLLVDVSLS-TDAWVDGVRVLDVEKEALLVLAHGLSACGDSHSI-----LTFTSRR 500 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY-NKIFDAKEKLEHIAKGHDDYKKY 286 +W + + G + L Y +I A H + K+ Sbjct: 501 N-----SWVRLETVKAFGEPMSGQVERRIGALRPGYYTRIGTAVRHAAHELADRPERKRL 555 Query: 287 IIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ LTDG+ + +++ EA+R G V+ + + A A F Sbjct: 556 LLVLTDGKPNDVDHYEGRFAIEDTRKAVQEARRAGVAVFGVTIDAAAQSYF 606 >gi|308473944|ref|XP_003099195.1| CRE-CLEC-63 protein [Caenorhabditis remanei] gi|308267668|gb|EFP11621.1| CRE-CLEC-63 protein [Caenorhabditis remanei] Length = 414 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 75/214 (35%), Gaps = 17/214 (7%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIG---LDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 P S+ P S+V D+ LD+++V+D S M G+ ++ ++ Sbjct: 32 PTYGPSTPRPTPGGSNVDRECGGDLANLWLDIVVVVDNSKGMT---NAGITQVAANIATV 88 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 I + R GLVT++ + L + + ++F + T + Sbjct: 89 FGNGTRIGNQYTDPRSTRVGLVTYNKVATEVADL--NHIQSIDDLYSVVFSTLTSVSSED 146 Query: 260 EYAYNKIFDAKEKLEHIAKGH---DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 + A EK+ + +YK+ ++ D + + K G Sbjct: 147 DSYLATGIGAAEKVFQNGRNGNVRSNYKRLVLVYASAYKGDGQNDP---IPVSDRLKSSG 203 Query: 317 AIVYAIGVQAEAADQF---LKNCASPDRFYSVQN 347 ++ I + + L ASP+ ++ ++ Sbjct: 204 VVISTIAFDQDGDEALLAGLAQIASPNYAFTSED 237 >gi|305680383|ref|ZP_07403191.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] gi|305659914|gb|EFM49413.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] Length = 880 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 40/252 (15%), Positives = 81/252 (32%), Gaps = 54/252 (21%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLV 221 + D++ V+D S S+ H G P D + ++++++ + +N + L Sbjct: 54 DNKKKADLIFVIDESASLKGHGGKPATDPNNIRVPAMQDLVTQLGKFAQESNADINVKLS 113 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGST----------TKSTPGLEYAYNKIFDAKE 271 F + G ++++ L T L A + Sbjct: 114 GFGQGYRSQPDVYGGWVNVRDHAGDLTPPIQGFDQRNNDVFTDYGTALNGAMADLAS--- 170 Query: 272 KLEHIAKGHDDYKKYIIFLTDGE---NSSPNIDNKESLFYC------NEAKRRGAIVYAI 322 + + K I+F TDG+ D C + + ++ + Sbjct: 171 ------RPDPESCKAILFFTDGKLTVQGDQKADIVAQKAICSADGQVKKLRDANIQLFTV 224 Query: 323 GV---QAEAADQFLKN------CASP----DRFYSVQ-NSRKLHDAFLRI---------- 358 G+ E+ +Q L++ CA F++ + N+ L AF I Sbjct: 225 GLIPSGEESPEQILRSMSEGNDCAIDTVPNGAFFNAESNAASLFSAFRSILPNNAVVEHH 284 Query: 359 GKEMVKQRILYN 370 G K R + + Sbjct: 285 GNTASKMRFMLD 296 >gi|225022540|ref|ZP_03711732.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] gi|224944663|gb|EEG25872.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] Length = 880 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 40/252 (15%), Positives = 81/252 (32%), Gaps = 54/252 (21%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLV 221 + D++ V+D S S+ H G P D + ++++++ + +N + L Sbjct: 54 DNKKKADLIFVIDESASLKGHGGKPATDPNNIRVPAMQDLVTQLGKFAQESNADINVKLS 113 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGST----------TKSTPGLEYAYNKIFDAKE 271 F + G ++++ L T L A + Sbjct: 114 GFGQGYRSQPDVYGGWVNVRDHAGDLTPPIQGFDQRNNDVFTDYGTALNGAMADLAS--- 170 Query: 272 KLEHIAKGHDDYKKYIIFLTDGE---NSSPNIDNKESLFYC------NEAKRRGAIVYAI 322 + + K I+F TDG+ D C + + ++ + Sbjct: 171 ------RPDPESCKAILFFTDGKLTVQGDQKADIVAQKAICSADGQVKKLRDANIQLFTV 224 Query: 323 GV---QAEAADQFLKN------CASP----DRFYSVQ-NSRKLHDAFLRI---------- 358 G+ E+ +Q L++ CA F++ + N+ L AF I Sbjct: 225 GLIPSGEESPEQILRSMSEGNDCAIDTVPNGAFFNAESNAASLFSAFRSILPNNAVVEHH 284 Query: 359 GKEMVKQRILYN 370 G K R + + Sbjct: 285 GNTASKMRFMLD 296 >gi|119385724|ref|YP_916779.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119376319|gb|ABL71083.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 335 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 44/228 (19%), Positives = 78/228 (34%), Gaps = 45/228 (19%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ IG ++++++D S SM+D F P A + R +LD I PD R Sbjct: 71 STAYRGIGTNLVLLIDRSSSMDDTFAGRSPQGGDESKAAAARRILLDFIARRPD----DR 126 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEH 275 G+ FS+ + P+ I + L T GL A +A Sbjct: 127 IGIAAFSTAPMLVLPMTESRTAIAAAVAALAEPGLSQTDVGRGLTLAMGMAHEASASDSR 186 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ---- 331 ++ ++DG + I + N A RR +Y + ++ + A Sbjct: 187 A----------VVLVSDG---AAVIAPEVQTALRNLAARRQVNIYWLYLRTKGAKGIFEV 233 Query: 332 -------------------FLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 FL++ P + + + DA IG+ Sbjct: 234 PEPGQADTPHLRPERHLHIFLQSLGLPYHAFEADSPGAVEDAVTEIGR 281 >gi|315647020|ref|ZP_07900133.1| D-amino acid dehydrogenase, large subunit [Paenibacillus vortex V453] gi|315277222|gb|EFU40551.1| D-amino acid dehydrogenase, large subunit [Paenibacillus vortex V453] Length = 471 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 47/323 (14%), Positives = 98/323 (30%), Gaps = 34/323 (10%) Query: 53 DHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST 112 D L ++I N + N+ + Q + I N Q + Sbjct: 54 DSDLNEPHSEIQNPPDSNHNETQTSLTDEE----ILLKPPGRFAGSNYDEQKVQAALDQL 109 Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 ++ D + L Y P++ + S + + ++ + + Sbjct: 110 PSNLTTDQYKDELLLLLAEDYR-PYVTTFMNFDTEISVNNERPEGKITLPVNRNLHISI- 167 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +LD S SM G K+ A +I+ D + + +V+ L + K Sbjct: 168 -LLDASGSMKAQIN-GKSKMDSAKEAIQTFADKLPNNAEVS------LRVYGHKGTGDQK 219 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK-------EKLEHIAKGHDDYKK 285 + + IF T ++ A K+ A + + Sbjct: 220 D----KRVSCDSTEEIFHGQGDQTNQIKTALQKVEPAGWTPIANALQSVKKDINPETTDS 275 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCASP--DRF 342 + ++DG + + +++K + V +G + Q L+ A+ F Sbjct: 276 VVYVVSDGIETCGGK-PAQVAKELHQSKVKTI-VNIVGFDVDNEGQKLLRQIAASGGGEF 333 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQ 365 SV N L + + +Q Sbjct: 334 MSVNNDEAL----KNVLNKAYEQ 352 >gi|315150339|gb|EFT94355.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0012] Length = 1103 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SSITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|321475774|gb|EFX86736.1| hypothetical protein DAPPUDRAFT_221972 [Daphnia pulex] Length = 891 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 72/238 (30%), Gaps = 63/238 (26%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR--SGLVTFS---- 224 ++ VLD S SM ++ +++ +LD ++ D+ +VV SG+ + Sbjct: 307 VIFVLDTSGSMAG------TRIEQTKQAMNSILDQLRKDEDIFSVVEFSSGVTEWDLRKP 360 Query: 225 ----SKIVQTFPLAWGVQ--------------------------------HIQEKINRLI 248 P + +E + + Sbjct: 361 YKGPDHYYFNSPPEETTEDATAVPQNNESEVKFGPYDDILAYPVTEQSVKRAKEFVAAMD 420 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 S+T L A K + IIFLTDGE ++ D E L Sbjct: 421 VTSSTNINDALLLAL--------KNSQSVQSRVRLTPIIIFLTDGEPTASVTDTTEILKN 472 Query: 309 CNEAKRRGAI-VYAIGVQAEAADQFLKNCASPDR------FYSVQNSRKLHDAFLRIG 359 + + ++ + QFL +S +R + + + +L F + Sbjct: 473 VRKGNSDDVVSIFCLAFGTGTDYQFLTKISSQNRGFARKIYEAADATLQLKGFFDEVA 530 >gi|260793444|ref|XP_002591722.1| hypothetical protein BRAFLDRAFT_80817 [Branchiostoma floridae] gi|229276931|gb|EEN47733.1| hypothetical protein BRAFLDRAFT_80817 [Branchiostoma floridae] Length = 987 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 42/278 (15%), Positives = 92/278 (33%), Gaps = 35/278 (12%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 D + ++ +N + + + D + +++ ++ P ++ Sbjct: 231 DAVDHFCDDSDGNHAHNADAPNKQNRVCDGRSTWDVVTSTEDFKAP------DGTQINTA 284 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKS 208 + T K+ ++V D+S SM+ D + + +++ +++ + Sbjct: 285 RAVASTEPTFRVVKARKPR-FVLVFDISGSMDSIDDVSINTPRRSLLHQTVYKLVR--EG 341 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 IPD ++ G+V F + L Q I + + T GL A Sbjct: 342 IPDGSH---VGMVKFHQWATRLLDLTEIATEEDRQEIADAV-PNEASGGTCIGCGLTEAL 397 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + +I L+DG+ S N+ ++ G V+++ Sbjct: 398 EVLSMNGADPAGG---------IVIILSDGDES-LNVSPNLTVAT-QHLVAAGVTVHSVT 446 Query: 324 VQAEAADQFLKNCASP---DRFYS-VQNSRKLHDAFLR 357 + A + + AS FYS NS L + Sbjct: 447 YSSSADTRMEEVAASTHGRAFFYSGAANSNSLEEGLRE 484 >gi|66775067|gb|AAY56127.1| complement component Bf/C2-B [Ginglymostoma cirratum] Length = 757 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 64/221 (28%), Gaps = 42/221 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----- 223 L + ++LDVS S+ A ++ + +IK N GLV F Sbjct: 261 LHIYILLDVSGSIQKE------DFKKAINALTTFVTMIKQFEVGVNY---GLVMFGSRSC 311 Query: 224 -----------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 S V + I + T T L+ + + K Sbjct: 312 VEVNIAHDDVSDSDSVLQILPTLKYEDITRYSDA-----GTNMTGALKTIFEMMVLKKAS 366 Query: 273 LEHIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 ++ + + I+ TDG N+ P +D + + VY G+ + Sbjct: 367 MKDKQAEWREVRHAIMIFTDGRNNMGGNPKPMMDRIRNFLDIEKVGEEFLDVYVFGLGND 426 Query: 328 AADQFLKNCASPDR-----FYSVQNSRKLHDAFLRIGKEMV 363 + + AS F+ L F + Sbjct: 427 VDTDEMNSVASKKNNEKHMFF--IQMEDLTTVFNSMLDLTS 465 >gi|153947567|ref|YP_001399499.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|152959062|gb|ABS46523.1| putative tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] Length = 212 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 64/172 (37%), Gaps = 14/172 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + I+ M+ ++ P V ++T+ ++ Sbjct: 3 RLPVYLLIDTSGSMRGE------SIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL +++ Q + T + LE + + ++ + KG + Sbjct: 57 REYIPLT-ALENFQFTDITVPSAGGTFTGAALECLIHCVDRDIQRSDGDQKGDWRP--LV 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGA-IVYAIGVQAEAADQFLKNCAS 338 +TDG S D E K+R + A V A+A + LK S Sbjct: 114 FLMTDGTPS----DVYAYGEAIKEVKKRAFGSIIACAVGAKAKHEHLKQLTS 161 >gi|297559736|ref|YP_003678710.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296844184|gb|ADH66204.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 547 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 52/200 (26%), Gaps = 30/200 (15%) Query: 174 VLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV------------RSGL 220 LDVS SM LG T + L VV R+ + Sbjct: 359 ALDVSGSMEGGRLAELQSALGALTGADGGSLARSTQAFQEREVVTLLPFSTWPADPRTFV 418 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V S L+ + L T + L AY + Sbjct: 419 VEPGSVDEVNADLS-------AAVEGLEAEGDTAAYDALVRAYELLESDTGSDG------ 465 Query: 281 DDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-S 338 D ++ +TDGE N ++ V+ + + + + Sbjct: 466 -DPLMSVVLMTDGEVNRGVGLEGFRESLAARSEPVARVPVFTVLFGESDVPEMTELAELT 524 Query: 339 PDRFYSVQNSRKLHDAFLRI 358 R + + + L F I Sbjct: 525 GGRVFDARE-QDLEQVFREI 543 >gi|262199490|ref|YP_003270699.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082837|gb|ACY18806.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 808 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 42/237 (17%), Positives = 78/237 (32%), Gaps = 21/237 (8%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKIS---SKSDIGLDMMMVLDVSLSMNDHFGPG 188 R+ + + P L+ ++V + + + ++D+S SM + + P Sbjct: 423 RFRVSMGMVSRPSLTQDGAVDYLLGANVTVPNLTREERPHAVVTFLVDISGSMAE-YSPT 481 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF------PLAWGVQHIQE 242 +D G TR + K++ + +V+F P A + Sbjct: 482 VDAGGAPTRMDIVREGLWKAVSALKPGDIVNVVSFDDAAQIELERGEIRPGAATPRPYLR 541 Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 + RL+ T + G+E AY + II LTD + +ID Sbjct: 542 SVLRLLPRGGTNLSAGIEVAYRV--------ARRNYDPYRINRVII-LTDAYANRGSIDP 592 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLR 357 + G +GV + + FL R ++S+ R AF Sbjct: 593 SLIGDHVLIGDDEGIHFSGLGVGYDFNEDFLNTLTDVGRGTYFSLITERDAARAFGE 649 >gi|193734230|gb|ACF19883.1| PilC1 [Kingella kingae] Length = 1362 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 53/149 (35%), Gaps = 16/149 (10%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + ++M++LD S SM G G + + S+ ++ + N Sbjct: 73 TTVRGMQGAKPNIMLLLDDSGSMGAQVPGSGRTRQQILQSSLS---QVVAKYGNQINW-- 127 Query: 218 SGLVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 GL+ F+ + PL + I R T + A + + + Sbjct: 128 -GLIAFNDQSSAFNLPLGTNYSTVVNSIKRFPANGLTPTITSYIKAVDTLNKGIQYRCQ- 185 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKES 305 K Y++ L+DG+++ P +E+ Sbjct: 186 -------KSYLVMLSDGDSNYPMYFEREA 207 >gi|298294144|ref|YP_003696083.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296930655|gb|ADH91464.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 356 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 78/223 (34%), Gaps = 43/223 (19%) Query: 165 SDIGLDMMMVLDVSLSMNDHFG---PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 G ++++ D S SM++ F P D+ + + R + + + P R G+ Sbjct: 90 EGTGAHLVLLFDRSSSMDNSFADRAPTGDQESKSAAAKRLLAEFVARRPH----DRIGVA 145 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKG 279 FS+ + PL + + I+ + T GL A++ D Sbjct: 146 AFSTSPMPVLPLTDHHEVVDAAIDAIDRPGLAFTDVGRGLALAFSYFADDT--------- 196 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG---------VQAEAAD 330 D+ + ++ ++DG + ID + + A R +Y + + + Sbjct: 197 -DETSRAVLLVSDG---AALIDRRVQDALRDAAARTPVHLYWLFLRSRGSPGIFEVPPGE 252 Query: 331 Q------------FLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 FL+ P R + + + + DA I ++ Sbjct: 253 DTPQANPERHLHLFLQGLGLPYRAFEATSPQAVADAIAEIDRQ 295 >gi|241667422|ref|ZP_04755000.1| hypothetical protein FphipA2_01540 [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254875971|ref|ZP_05248681.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841992|gb|EET20406.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 332 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 66/202 (32%), Gaps = 33/202 (16%) Query: 134 EMPFIF-CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +PF+F C + + P V + ++ + ++ LDVS SM+ +L Sbjct: 58 LVPFLFLCLWIVSVVALAGPTWKYKDVPVYQEN---VSRVIALDVSQSMDTTDVSP-TRL 113 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLI 248 A ++L IK G++ FSS+ PL ++ + + ++ Sbjct: 114 ERAKYKTLDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSDANTVENLVPVINSDIV 166 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 +E + I A K II +TD + + +++ Sbjct: 167 PVQGNNIYKAIEKSAQLITQAGAKKGQ-----------IILITD------STPSADAIAK 209 Query: 309 CNEAKRRGAIVYAIGVQAEAAD 330 + +G + Sbjct: 210 AKQLAEQGIDTDVYAIGTPKGG 231 >gi|113474645|ref|YP_720706.1| hypothetical protein Tery_0817 [Trichodesmium erythraeum IMS101] gi|110165693|gb|ABG50233.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101] Length = 460 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 61/187 (32%), Gaps = 30/187 (16%) Query: 171 MMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS----- 224 ++++LD S SM G KL A +IR+ + R +V F Sbjct: 117 VIVLLDFSGSMKQLDSSGEKTKLEGAIAAIRKFNQDLAK---KGENTRISIVPFGKGGKN 173 Query: 225 --SKIVQTFPLAW----GVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKL 273 V+ L G + +++ +N+L + T L A + ++ Sbjct: 174 CPGNQVRKKQLDNFVLAGKKKVEQTLNKLESKLDNLCAATDIYEPLRQAVQFFGNPEDTR 233 Query: 274 EHIAKGHDDYK--KYIIFLTDGENSSPNIDNKESLFYCNE-------AKRR-GAIVYAIG 323 ++ + + + II L+DG +S E + K V+ +G Sbjct: 234 FNLPINSNLPQPRRSIILLSDGYHSIYGNRENEPELEAQDFEKMMFMLKSYPNITVHTLG 293 Query: 324 VQAEAAD 330 Sbjct: 294 YGLTPQQ 300 >gi|187466178|emb|CAQ51888.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 196 IAIVLDGSGSIEPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 246 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 247 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 300 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 301 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEA 357 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 358 HTFKVTNYSALDGLLSKLQQRIVH 381 >gi|148680762|gb|EDL12709.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 196 IAIVLDGSGSIEPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 246 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 247 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 300 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 301 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEA 357 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 358 HTFKVTNYSALDGLLSKLQQRIVH 381 >gi|88702489|gb|ABD49099.1| integrin alpha E [Mus musculus] Length = 1167 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 196 IAIVLDGSGSIEPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 246 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 247 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 300 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 301 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEA 357 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 358 HTFKVTNYSALDGLLSKLQQRIVH 381 >gi|119485135|ref|ZP_01619520.1| hypothetical protein L8106_06794 [Lyngbya sp. PCC 8106] gi|119457363|gb|EAW38488.1| hypothetical protein L8106_06794 [Lyngbya sp. PCC 8106] Length = 564 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 53/350 (15%), Positives = 102/350 (29%), Gaps = 56/350 (16%) Query: 33 GLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRII--KNIWQT 90 L + A + + + S++ T KQ N Y I ++ + + Sbjct: 243 SLASDMVKNGSFWASIASVYESSVIAANT-----------NKQSNLVQYEAIYPRSTFTS 291 Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC------TFPW 144 + R L + + + + + R MP + F Sbjct: 292 NMRAVLPNAPWVSADEQAAAEQIIEYLRSPEAQKIATEFGLRPGMPGVALGAKFSPQFGV 351 Query: 145 CANSSHAPLLITSSVKISSK-------SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 N+ + L + + + +++V+D S SM KL Sbjct: 352 DPNARYDSLRSPKPEVVEAMLKSWQEFAKKPSKVVIVVDSSGSMVG------RKLSGVQN 405 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF---PLAWGVQHIQEKINRLIFGSTTK 254 ++R ++ + R L+ F S+I + G IN L TK Sbjct: 406 TLRYYIESLGVKE------RIALIDFDSEIRPPVVVDGTSEGRNQGMLFINNLKAEGGTK 459 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-- 312 A + ++ A +I LTDGE+S+ I + + Sbjct: 460 LYDATLSAQTWLQQNQQPDAINA---------VIILTDGEDSASQISLDKLQQELQKTGF 510 Query: 313 -KRRGAIVYAIGVQAEA--ADQFLKNCASPDR-FYSVQNSRKLHDAFLRI 358 + + IG E LK A + +Y + + + Sbjct: 511 NTDKRIAFFTIGYGKEGEFNSTVLKKIADLNSGYYRQGDPETISTLMADL 560 >gi|25742632|ref|NP_113956.1| integrin, alpha E [Rattus norvegicus] gi|3236342|gb|AAC23662.1| integrin alpha E1 [Rattus norvegicus] gi|187466179|emb|CAQ51889.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 196 IAIVLDGSGSIEPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 246 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 247 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 300 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 301 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEA 357 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 358 HTFKVTNYSALDGLLSKLQQRIVH 381 >gi|4809045|gb|AAD30063.1| Itgae protein [Mus sp.] Length = 895 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 183 IAIVLDGSGSIEPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 233 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 234 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 287 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 288 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEA 344 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 345 HTFKVTNYSALDGLLSKLQQRIVH 368 >gi|74150598|dbj|BAE32321.1| unnamed protein product [Mus musculus] Length = 1166 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 195 IAIVLDGSGSIEPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 245 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 246 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 299 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 300 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEA 356 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 357 HTFKVTNYSALDGLLSKLQQRIVH 380 >gi|87303814|ref|ZP_01086536.1| hypothetical protein WH5701_00200 [Synechococcus sp. WH 5701] gi|87281670|gb|EAQ73660.1| hypothetical protein WH5701_00200 [Synechococcus sp. WH 5701] Length = 273 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 56/155 (36%), Gaps = 23/155 (14%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + G +++ LD+SLSM D+L A R IR++L+ + +GL Sbjct: 86 PTPLKQPGDSLVIALDLSLSMLATDVEP-DRLTRAKRKIRDILE-------LREGSLTGL 137 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + FS PL + I+ +N L + + ++ + A + Sbjct: 138 LVFSGDAHVVTPLTDDGRTIEGMLNVLDPVIMPATGNRADLAVARAKALLEQGALGEGR- 196 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 I+ +TD N +++L + Sbjct: 197 ----------ILLITDSINDDYEGTIRDTLSGTDT 221 >gi|290971985|ref|XP_002668746.1| vWFA domain-containing protein [Naegleria gruberi] gi|284082259|gb|EFC36002.1| vWFA domain-containing protein [Naegleria gruberi] Length = 198 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 49/128 (38%), Gaps = 21/128 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----S 224 +D+++V+D + SM+ ++ VA R++ ++ + + +R V++ Sbjct: 79 VDLVIVMDCTGSMS-------GEIEVAKRTVTTIISTL--HEKFQSDLRFSAVSYRDHTD 129 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 V+ FP + + IN + L A I + + K Sbjct: 130 DYAVKEFPFTKDLNKAKGYINTMSAQGGGDHPEALASALYVINEMPFNKKG--------K 181 Query: 285 KYIIFLTD 292 K ++++ D Sbjct: 182 KIVVWVAD 189 >gi|238927210|ref|ZP_04658970.1| magnesium chelatase [Selenomonas flueggei ATCC 43531] gi|238884992|gb|EEQ48630.1| magnesium chelatase [Selenomonas flueggei ATCC 43531] Length = 636 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 58/169 (34%), Gaps = 20/169 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++ ++D S SM +++ + +I +L + V GL+ Sbjct: 445 RAKRAAANILFLVDASGSMG-----ARERMRMVKGAILALLQE---AYQKRDCV--GLIA 494 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ ++++ L G T GL A + + + + Sbjct: 495 FRRDRAETLLPMTRSVELAEKQLRDLPTGGRTPLAEGLACAVQTLRELERRGSE------ 548 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAA 329 K +I +TDG ++ + A+ G + + E Sbjct: 549 --KTVLILITDGRTNTARDGDDGVQRALRAAEEIAGTQALTLVLDTERG 595 >gi|2497428|sp|Q60677|ITAE_MOUSE RecName: Full=Integrin alpha-E; AltName: Full=Integrin alpha M290; AltName: CD_antigen=CD103; Contains: RecName: Full=Integrin alpha-E light chain; Contains: RecName: Full=Integrin alpha-E heavy chain; Flags: Precursor gi|535477|gb|AAC52142.1| alpha M290 integrin [Mus musculus] Length = 1167 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 196 IAIVLDGSGSIGPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 246 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 247 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 300 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 301 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDRFKNNNTYRELKLIASDPKEA 357 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 358 HTFKVTNYSALDGLLSKLQQRIVH 381 >gi|227518343|ref|ZP_03948392.1| pilus subunit protein [Enterococcus faecalis TX0104] gi|227074216|gb|EEI12179.1| pilus subunit protein [Enterococcus faecalis TX0104] Length = 1103 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 SYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|116625363|ref|YP_827519.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228525|gb|ABJ87234.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 317 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 77/219 (35%), Gaps = 26/219 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + ++D+ L + M++DVS S + + A++ RE+L + + L+ Sbjct: 66 TKETDLPLTIGMLVDVSGSQRNLIDI---ERSAASQFFREVLRKKDLAFLIMFGEETELL 122 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L G+ H + ++ + G ++DA + Sbjct: 123 Q--DYTGSPRLLTEGLNHLEVSSGVSGIHPGPVPTMGGPRGT---VLYDAVYLAANEKLK 177 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG------------VQAE 327 + +K I+ +TDG + + +++ A++ ++Y+I V Sbjct: 178 GEVGRKVIVVITDGVDQGSRMSRNQAI---EAAQKSDCVIYSIDYSDPRAYGPFNMVGGG 234 Query: 328 AADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + K + R Y V L F + +EM Q Sbjct: 235 GEGELRKMSDETGGRVYKVDRRHTLDQVFKELQEEMRSQ 273 >gi|110832907|ref|YP_691766.1| von Willebrand factor type A domain-containing protein [Alcanivorax borkumensis SK2] gi|110646018|emb|CAL15494.1| protein containing a von Willebrand factor type A domain [Alcanivorax borkumensis SK2] Length = 698 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 49/293 (16%), Positives = 91/293 (31%), Gaps = 43/293 (14%) Query: 82 RIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT 141 R + R+E +N FA N ++ ++ D + + ++P Sbjct: 234 RNASTTIKGKGRSEKGDNNFAPPATNGHPPSAFTLDQD-----IVVYWRHQQDLPGSVDL 288 Query: 142 FPWCANSSHAP--LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 + A +L + G D + VLD+S SMN KL + Sbjct: 289 VAYKAPGKDRGTFMLSITPGDDLPPITTGSDWVFVLDISGSMN-------AKLATLGDGV 341 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTK 254 R+ L ++ R +V F + + ++ +KI +L T Sbjct: 342 RQALGKLRGND------RFRIVLFDDRAEELTSGFVDATPNNIRQYTQKIMQLQSRGGTN 395 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 GL A + + I+ +TDG + K+ + + + Sbjct: 396 LFGGLSLALTPLDADRPTG-------------IVLVTDGVANVGKTRQKDFI---DLLEN 439 Query: 315 RGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 ++ + A L AS SV NS + L ++ Q Sbjct: 440 HDVRLFTFVMGNSANRPMLTAMTDASNGFAISVSNSDDIAGQILNATSKVTHQ 492 >gi|297265792|ref|XP_002799252.1| PREDICTED: vitrin-like [Macaca mulatta] Length = 657 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 68/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++D S S+ G + + + K + R G V ++ + Sbjct: 474 DIGFIIDGSSSV------GTGNFRTVLQFVTNL---TKEFEISDTDTRVGAVQYTYEQR- 523 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + +A ++F K + Sbjct: 524 ---LEFGFDQYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------KKSKPN 571 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + D+ K G I YAIGV A ++ P D Sbjct: 572 KRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPARD 625 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ RI + + Sbjct: 626 HSFFVDEFDNLYQYVPRIIQNI 647 >gi|301785912|ref|XP_002928373.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-E-like [Ailuropoda melanoleuca] Length = 1188 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 78/211 (36%), Gaps = 28/211 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G ++ ++LD S S++ P A I M+ + LV + Sbjct: 206 EEAAGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFQCSF---ALVQY 256 Query: 224 SSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I F L + + N G TK+ +++ + IF H ++ + Sbjct: 257 GEVIQTEFDLRDSQDAMASLARVQNITQVGKVTKTASAMQHVLDNIFTPS----HGSRKN 312 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQF--LKNC 336 K ++ LTDG+ D + K +G +AIGV E + LK Sbjct: 313 A--SKVMVVLTDGD---IFEDPLNLTTVISSPKMQGVERFAIGVGKAFENNKTYNELKLI 367 Query: 337 AS--PDRF-YSVQNSRKLHDAFLRIGKEMVK 364 AS DR+ + V N L ++ + +++ Sbjct: 368 ASDPDDRYAFKVTNYTALDGLLSKLQQNIIQ 398 >gi|281346139|gb|EFB21723.1| hypothetical protein PANDA_018300 [Ailuropoda melanoleuca] Length = 1151 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 78/211 (36%), Gaps = 28/211 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G ++ ++LD S S++ P A I M+ + LV + Sbjct: 172 EEAAGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFQCSF---ALVQY 222 Query: 224 SSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I F L + + N G TK+ +++ + IF H ++ + Sbjct: 223 GEVIQTEFDLRDSQDAMASLARVQNITQVGKVTKTASAMQHVLDNIFTPS----HGSRKN 278 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQF--LKNC 336 K ++ LTDG+ D + K +G +AIGV E + LK Sbjct: 279 A--SKVMVVLTDGD---IFEDPLNLTTVISSPKMQGVERFAIGVGKAFENNKTYNELKLI 333 Query: 337 AS--PDRF-YSVQNSRKLHDAFLRIGKEMVK 364 AS DR+ + V N L ++ + +++ Sbjct: 334 ASDPDDRYAFKVTNYTALDGLLSKLQQNIIQ 364 >gi|170743045|ref|YP_001771700.1| hypothetical protein M446_4937 [Methylobacterium sp. 4-46] gi|168197319|gb|ACA19266.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 440 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 40/107 (37%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 IR F + G+++++ + LP++ G +E + +A+L +D + L A ++ Sbjct: 5 IRRFLADRTGAVALIIGLSLPLLVAGSGAAVEYARIHKRRAELQKAVDVAALGAAGELSV 64 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST 112 + + + ++ + R G + T Sbjct: 65 AGSDVSVEAMARRLAFDSARATDPGITRVSAAVVGRGTSVTVAINET 111 >gi|114569600|ref|YP_756280.1| von Willebrand factor, type A [Maricaulis maris MCS10] gi|114340062|gb|ABI65342.1| von Willebrand factor, type A [Maricaulis maris MCS10] Length = 555 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 89/247 (36%), Gaps = 28/247 (11%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPF----IFCTFPWCANSSHAPLLITS 157 + + + + ++ I + DY+ + + PF PW A++ + I Sbjct: 117 SLEAGRLPPTDAVRIEEMVNYFDYDYALPPGPDEPFATHVTVTPTPWNADTQLMHIGIQG 176 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +I +++ ++DVS SM DKL +A +++ ++D +++ Sbjct: 177 -YEIIPDERPRANLVFLIDVSGSM-----NSPDKLPLAVQAMHLLVD------ELHPDDT 224 Query: 218 SGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LV ++S P + I ++ L G +T GL AY L Sbjct: 225 VALVVYASASGVVLPPTEARNAREIHRALDSLSAGGSTAGGAGLALAY--------DLAE 276 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLK 334 D + + LTDG+ + ++ + + G + +G DQ ++ Sbjct: 277 QNFDEDAVNRVM-LLTDGDFNVGVTQDERLEDFVARKRDSGIYLSVMGFGRGNYNDQMMQ 335 Query: 335 NCASPDR 341 A Sbjct: 336 TIAQAGN 342 >gi|239622779|ref|ZP_04665810.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239514776|gb|EEQ54643.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 401 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 47/334 (14%), Positives = 93/334 (27%), Gaps = 51/334 (15%) Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR----ENGFAQDINNIERSTSL 114 TAT + + N + + L QD + + ++S+ Sbjct: 77 TATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSM 136 Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA--------PLLITSSVKISSKSD 166 I + D K A S P +F A P+ + S Sbjct: 137 WISMGD-TKHIVKDAASTSTTPIVFGIAKSKAVKLGWADDTGAAKPVSTADILAAVSDGK 195 Query: 167 IGLDMM-----------------------MVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + M V+D S SM+ G+ K + + Sbjct: 196 LTFSMTSATVIDSALNVYQTALRKPSWTIWVVDYSGSMSGEGKNGVVK---GLNAALDPD 252 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYA 262 KS + + + L+ F ++ + + + + T GL A Sbjct: 253 QAKKSYIEPASGDVNILIPFETEAHRPVKATGTSTSDLLHEADATDASGGTDIYEGLLSA 312 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 +++ E ++ I+ +TDG N D+++ +++ R +++I Sbjct: 313 LDELPSESEASQYTTA--------IVLMTDG---RSNSDHQDEFESAYKSRGRDLPIFSI 361 Query: 323 GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL 356 Q + S L F Sbjct: 362 MFGDADPSQLKSLATLSNAKVFDGRSGDLAAVFR 395 >gi|330839047|ref|YP_004413627.1| Magnesium chelatase [Selenomonas sputigena ATCC 35185] gi|329746811|gb|AEC00168.1| Magnesium chelatase [Selenomonas sputigena ATCC 35185] Length = 648 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 47/135 (34%), Gaps = 19/135 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++ ++D S SM +++ + ++ +L R GL+ Sbjct: 458 RARRASANILFLVDASGSMG-----AKERMKMVKGAVLALLREA-----YQKRDRVGLIA 507 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ ++ + L G T GL A + + K Sbjct: 508 FRRTSAETLLPMTRSVELAEKALRSLPTGGKTPLAEGLAAALKMMDELSRKEG------- 560 Query: 282 DYKKYIIFLTDGENS 296 + ++ +TDG + Sbjct: 561 -AETVLVLVTDGRTN 574 >gi|260887186|ref|ZP_05898449.1| magnesium-chelatase, subunit D/I family [Selenomonas sputigena ATCC 35185] gi|260863248|gb|EEX77748.1| magnesium-chelatase, subunit D/I family [Selenomonas sputigena ATCC 35185] Length = 657 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 47/135 (34%), Gaps = 19/135 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++ ++D S SM +++ + ++ +L R GL+ Sbjct: 467 RARRASANILFLVDASGSMG-----AKERMKMVKGAVLALLREA-----YQKRDRVGLIA 516 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ ++ + L G T GL A + + K Sbjct: 517 FRRTSAETLLPMTRSVELAEKALRSLPTGGKTPLAEGLAAALKMMDELSRKEG------- 569 Query: 282 DYKKYIIFLTDGENS 296 + ++ +TDG + Sbjct: 570 -AETVLVLVTDGRTN 583 >gi|168998769|ref|YP_001688037.1| TerY3 [Klebsiella pneumoniae NTUH-K2044] gi|238549790|dbj|BAH66141.1| hypothetical protein KP1_p252 [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] Length = 346 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 53/168 (31%), Gaps = 14/168 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L +++++ ++ P V ++ F+ Sbjct: 3 RLPVFFVLDCSESMIGE------NLKRMNDGLQKIVSDLRKDPHALETVWVSVIAFAGIA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + RL G T L +I K KG + Sbjct: 57 RTIVPL---HDIVSFYPPRLPVGGGTNLATALRELTTQIDSQVRKTTLEEKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 LTDG P D + NE + + A+G+ L+ Sbjct: 112 YLLTDG---RPTDDISAEVKRWNEFYAKKVNMIAVGIGPSVDLSVLRQ 156 >gi|38639542|ref|NP_943311.1| hypothetical protein LV053 [Klebsiella pneumoniae] gi|38016640|gb|AAR07661.1| hypothetical protein LV053 [Klebsiella pneumoniae] Length = 351 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 53/168 (31%), Gaps = 14/168 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L +++++ ++ P V ++ F+ Sbjct: 8 RLPVFFVLDCSESMIGE------NLKRMNDGLQKIVSDLRKDPHALETVWVSVIAFAGIA 61 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + RL G T L +I K KG + Sbjct: 62 RTIVPL---HDIVSFYPPRLPVGGGTNLATALRELTTQIDSQVRKTTLEEKGDWKP--VV 116 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 LTDG P D + NE + + A+G+ L+ Sbjct: 117 YLLTDG---RPTDDISAEVKRWNEFYAKKVNMIAVGIGPSVDLSVLRQ 161 >gi|302348849|ref|YP_003816487.1| von Willebrand factor, type A [Acidilobus saccharovorans 345-15] gi|302329261|gb|ADL19456.1| von Willebrand factor, type A [Acidilobus saccharovorans 345-15] Length = 456 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 66/169 (39%), Gaps = 25/169 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRS-GLVTFSSKIV 228 + ++LD S SM G +D ++ + KS+ + VR + +SS +V Sbjct: 290 IYVLLDKSGSM---VGSKIDWARAVAVALFK-----KSMDEGRTFIVRFFDSIPYSSMVV 341 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 ++ V + + R+ G T T + A + I K KL + D II Sbjct: 342 RSNSKPNDVLRLLSYLARVKAGGGTDITRAVSSAVDDI--DKMKLGGKDRPSD-----II 394 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDGE D K+ A ++++ + + FL+ + Sbjct: 395 LITDGE------DRLSPEVLQRMLKKVNAHLHSV--MIQGHNTFLQQVS 435 >gi|301617755|ref|XP_002938301.1| PREDICTED: integrin alpha-E-like [Xenopus (Silurana) tropicalis] Length = 1148 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 48/209 (22%), Positives = 83/209 (39%), Gaps = 31/209 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII--KSIPDVNNVVRSGLVT 222 SD G ++ +VLD S S+++ A I M+ + K + VV+ G + Sbjct: 182 SDTGTEIAIVLDGSGSISEE------DFQKAKDFISNMIGMFWEKCLQCEFAVVQYGAI- 234 Query: 223 FSSKIVQTFPLAWGVQH--IQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 I F L + I +K+ + G+ TK+ L++ +F EH Sbjct: 235 ----IQTEFDLLESRKGRFILQKVQDIKQVGNVTKTASALDHVLESVF----TEEHG--S 284 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFLKN 335 + K I+ LTDG+ +D + N +K + + IG Q E A + LK Sbjct: 285 SETATKIILVLTDGD---IFMDPMDINDVMNNSKMKKIERFVIGVGEAFQKEKALKTLKT 341 Query: 336 CASPD--RFYSVQNSRKLHDAFLRIGKEM 362 AS +V + KL + +++ Sbjct: 342 IASQGEEHLLTVDDYSKLEGLLTSLQQKI 370 >gi|326426493|gb|EGD72063.1| hypothetical protein PTSG_00082 [Salpingoeca sp. ATCC 50818] Length = 571 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 64/188 (34%), Gaps = 27/188 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS---- 218 + + D+++VLD S SM + ++ A + +L+ + V VV Sbjct: 177 AATSGPKDIVIVLDRSGSMATN-----NRWETAMDAAETVLETLTIADFVAIVVFDTSAS 231 Query: 219 ---GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 G +VQ A V ++ + +T + A++ + E+ + Sbjct: 232 QVCGTTIPCGSLVQAT--ADNVGTLRTLLANFNPDGSTNFESAFQVAFSVLKQTGERTSN 289 Query: 276 IAKGHDDYKKYIIFLTDG------ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 I+F+TDG E ++ + A + A+++ A Sbjct: 290 CHTA-------ILFMTDGMITAGLEGNAFLDFVDDEQDALEAAVGKRAVLFTFSFGTGAD 342 Query: 330 DQFLKNCA 337 + K A Sbjct: 343 ETIPKALA 350 >gi|284173928|ref|ZP_06387897.1| hypothetical protein Ssol98_04615 [Sulfolobus solfataricus 98/2] gi|261602665|gb|ACX92268.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 356 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 46/213 (21%), Positives = 82/213 (38%), Gaps = 39/213 (18%) Query: 151 APLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 P + V I + + ++++D S SM KL A +S +++L + Sbjct: 18 RPTEVGFIVYIVPQQGAITSSIHYIIMIDNSPSMRGE------KLNTAVQSAQKLLYSL- 70 Query: 208 SIPDVNNVVRSGLVTFSSKIV-QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 N L+ FS+ + A G+ I + + G TT+ + +A N Sbjct: 71 -----NEGDYVTLILFSNHPEIKYQGPAKGI--ITFDVGK---GYTTRLHEAVNFALN-- 118 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +AK K II LTDG+ + D + Y + IG+ + Sbjct: 119 ---------LAKQSQVPNK-IIMLTDGKPT----DKRNVKDYEKFDIPPNTQIITIGIGS 164 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + ++ LK A S +FY +++ +L D F Sbjct: 165 DYNERILKKLADKSSGKFYHLKDISELPDVFES 197 >gi|213962900|ref|ZP_03391160.1| von Willebrand factor type A [Capnocytophaga sputigena Capno] gi|213954557|gb|EEB65879.1| von Willebrand factor type A [Capnocytophaga sputigena Capno] Length = 347 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 68/195 (34%), Gaps = 14/195 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +LDVS SM D + + ++ +K+ P V ++ F+ K Sbjct: 3 RLPIYFLLDVSESMVG------DPIEHVQDGMATIIKELKADPFALETVWLSIIGFAGKS 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL I ++ G T GL N I +E ++ + D+K + Sbjct: 57 KVITPLQ---DIITFYPPKIPIGGGTSLASGLNELMNAI--DREVVKTTLERKGDWKPLV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 TDG P D +++ N RR + AI + L ++ N Sbjct: 112 FLFTDGI---PTDDPAQAIERWNAHYRRKVNLVAISLGENTNYNLLGQLTDQVLQFNNTN 168 Query: 348 SRKLHDAFLRIGKEM 362 + + F I + Sbjct: 169 AAAYKEFFKWITASI 183 >gi|25244377|gb|AAN72414.1| complement component C2 [Rattus norvegicus] Length = 758 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 80/209 (38%), Gaps = 28/209 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L++ ++LD S S+++ + S M+D I S V ++TF+S+ Sbjct: 259 LNLYLLLDASQSVSEK------DFNIFKESAFLMVDRIFSFEIK---VSVAIITFASRPK 309 Query: 229 QTFPLAWGV-QHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + Q++ E ++ L + T + L Y + + ++L Sbjct: 310 IIMSVLNERSQNVMEVMDSLDSVCYKDHENATGTNTYEALNSVYLMMNNQMDRLGMETSA 369 Query: 280 HDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---- 330 + + II LTDG+++ P +DN + + + +YAIGV D Sbjct: 370 WQEIRHAIILLTDGKSNMGGSPKPAVDNIREILGISRNRNDYLDIYAIGVGKLDVDWKEL 429 Query: 331 -QFLKNCASPDRFYSVQNSRKLHDAFLRI 358 + + +Q+++ + F I Sbjct: 430 NELGSKKDGERHAFILQDAKAVQQVFEHI 458 >gi|146162766|ref|XP_001010021.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|146146274|gb|EAR89776.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 783 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 52/118 (44%), Gaps = 16/118 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ V+D+S+SM K+ ++I +++ + N R L+ F++ Sbjct: 84 PLDLIFVIDLSISMRGK------KMNQLKKTICNLINFL------NENDRMALIGFNNSA 131 Query: 228 VQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FPL+ + + + +N ++ T T G+ A ++ + + +A+ + Sbjct: 132 QNLFPLSHLTQQNKKKVTQILNSILPMGLTNITAGMMEAIKQLESSLINISTVAETGE 189 >gi|120406920|ref|YP_956749.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] gi|119959738|gb|ABM16743.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 248 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/149 (20%), Positives = 56/149 (37%), Gaps = 12/149 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +V DVS SM H +G +S+R+ D + + P + + V+ G++ FS + P Sbjct: 23 LVCDVSASMGPH-------IGTLNQSLRDFRDSLATNPVLADKVQFGVIDFSDTATEVIP 75 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L + ++L T I + + FLTD Sbjct: 76 LG-DFSSADLERHQLRTRGGTSYGQAFTTVQQIIERDLAAGADRFRYFRPA---VFFLTD 131 Query: 293 GENSSPNI-DNKESLFYCNEAKRRGAIVY 320 G+ + + + L + ++A +G Y Sbjct: 132 GQPTDRHWREAFRDLTFFDQASGQGFRSY 160 >gi|47229694|emb|CAG06890.1| unnamed protein product [Tetraodon nigroviridis] Length = 707 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 57/170 (33%), Gaps = 26/170 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ- 229 ++ ++D S SM+ K+ ++ +L+ D+ GL+TF S I Q Sbjct: 269 VVFIIDQSGSMHG------RKIEQTRTALIHILN------DLAEDDFFGLLTFDSNIFQW 316 Query: 230 ----TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ + + T + + H +G Sbjct: 317 KRELVQATKANLESAKTFARNIRANGATNINAA------VLKGSSMLNAHPREGSAS--- 367 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +I LTDG+ ++ + + A +Y +G + +FL+ Sbjct: 368 ILILLTDGDPTTGETNPEAIQSNVRNAIAEKFPLYCLGFGFDVNFEFLEK 417 >gi|237747941|ref|ZP_04578421.1| predicted protein [Oxalobacter formigenes OXCC13] gi|229379303|gb|EEO29394.1| predicted protein [Oxalobacter formigenes OXCC13] Length = 1843 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 78/215 (36%), Gaps = 20/215 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG--MDKLGVATRSIREMLDIIKSIP-D 211 + + + D+ ++ +V+DVS SM D PG + +L V ++ ++ + + D Sbjct: 1171 FAGNTTVGAVQDVVNNIYLVVDVSASMKDGMDPGSDVSRLKVVADALDKLFEQMNVADAD 1230 Query: 212 VNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + V L FS W + + L + L + Sbjct: 1231 PHTTVNVTLSIFSGGGDTPKEYNLSVSDWAGKTADDIYKMLDDFKHGDTNFEL-----PL 1285 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +A E + AKG Y K I F+TD N + ++ ++ Y ++ G V AI + Sbjct: 1286 GNASEWADSQAKGDGIYNKVI-FVTD-ANGTEILNTGKAQDYVDKILGSGVDVEAIAIIN 1343 Query: 327 EAADQFLKNCASPDRFYSVQN-----SRKLHDAFL 356 Q+ Y + +++L F Sbjct: 1344 SYQKQYKDILEELGNKYPLDPSKQPFAQELISLFS 1378 >gi|299535615|ref|ZP_07048936.1| BatA [Lysinibacillus fusiformis ZC1] gi|298728815|gb|EFI69369.1| BatA [Lysinibacillus fusiformis ZC1] Length = 972 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 77/209 (36%), Gaps = 32/209 (15%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + T+ + + ++ V+D S SM + ++M+++I + N Sbjct: 682 IFTNPYFSKNSCSLATEIAYVVDYSSSMK--------AVDPTNYRGKKMIELINQLKAKN 733 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 N+V + ++K ++ T G++ A K + + Sbjct: 734 NIV----IETNTKATVLGEGTTENVLKKDLYKASKEKGATDIFAGIDIALTKFSNDSKT- 788 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQ 331 K I+ ++DG+ S + NEAK++G VY + + ++ D Sbjct: 789 ----------SKAIVVVSDGKTSKSKMT-----KVINEAKKQGVKVYTVSMGKKSQVNDA 833 Query: 332 FLKNCAS--PDRFYSVQNSRKLHDAFLRI 358 L +S ++ ++ +LH F ++ Sbjct: 834 TLMQLSSETSGAYFHAIDNMQLHQVFQKL 862 >gi|262073024|ref|NP_001159971.1| integrin alpha-2 [Bos taurus] gi|296475809|gb|DAA17924.1| integrin alpha-2 [Bos taurus] Length = 1179 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 72/210 (34%), Gaps = 33/210 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 171 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---MGLIQYANNPR 219 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFDAKEKLEHIAKGHDD-YKKY 286 + + + K + S T G L + I A++ A G K Sbjct: 220 VV----FNLNTFKSKDEMIKATSQTFQYGGDLTNTFKAIQYARDTAYSTAAGGRPGATKV 275 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---LKNCA 337 ++ +TDGE + D + ++ + + + I V A +K A Sbjct: 276 MVVVTDGE----SHDGSKLKAVIDQCNKDNILRFGIAVLGYLNRNALDTKNLIKEIKAIA 331 Query: 338 S---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 S F++V + L + IG+++ Sbjct: 332 SIPTERHFFNVSDEADLLEKAGTIGEQIFS 361 >gi|118388916|ref|XP_001027552.1| Ubiquitin-conjugating enzyme family protein [Tetrahymena thermophila] gi|89309322|gb|EAS07310.1| Ubiquitin-conjugating enzyme family protein [Tetrahymena thermophila SB210] Length = 601 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 57/143 (39%), Gaps = 12/143 (8%) Query: 164 KSDIGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + D ++++ DVS SM F P + ++G D + + N++V+ L Sbjct: 131 EEDPAEAIVVIYDVSGSMKSGFFNDPLISRIGAVNAFFSAFADKTLAY-EYNHIVQ--LY 187 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F +I + + H + ++ +T+ L N + K+K +I Sbjct: 188 WFDDRIEKKCDFTKDMNHFIKLVDDANPRGSTRLYDALMEGINSLLLIKKKYPNIILR-- 245 Query: 282 DYKKYIIFLTDGENSSPNIDNKE 304 +I +TDGE++ +E Sbjct: 246 -----LIAMTDGEDNQSKYKPEE 263 >gi|83643773|ref|YP_432208.1| hypothetical protein HCH_00891 [Hahella chejuensis KCTC 2396] gi|83631816|gb|ABC27783.1| uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Hahella chejuensis KCTC 2396] Length = 223 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/156 (23%), Positives = 62/156 (39%), Gaps = 14/156 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQ 229 ++VLD S SM FG + +L + + L D + +R L+ + Q Sbjct: 21 CVLVLDGSSSM---FGEPIRQLNEGLKLLERALK-----EDASTAMRVQLLVIRAGNHDQ 72 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V + + TT + A +KI D +K + A G + +II Sbjct: 73 AEVLTDWVDAMDFNAPEVFANGTTPLGGAMNLALDKIED--QKAAYDANGISSTRPWIIL 130 Query: 290 LTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGV 324 ++DG + N + C A + R +++ IGV Sbjct: 131 ISDGAPTDFNWEAVADR--CRHAEQNRKVVIFPIGV 164 >gi|325679262|ref|ZP_08158849.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109048|gb|EGC03277.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 290 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/189 (21%), Positives = 67/189 (35%), Gaps = 35/189 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM + F GM ++ E L + D + + TFS + Sbjct: 38 VAVVLDVSGSMAEAFRSGM------VQATLERLLPLAMAFDDDGTMEV--WTFSHGFQRH 89 Query: 231 FPLAWGVQHIQEKI--NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL G + I N L +G T +P + N + Y+I Sbjct: 90 KPLTRG--NFYNYIKDNGLSYGGGTNYSPVIRDVGNYFIKEEPAC---------LPNYVI 138 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 F+TDG+N D +E+ + I Q SP+ F ++ Sbjct: 139 FITDGDN----FDERETDRVIKH-----ISHFPIFFQFVGIGD-----TSPNAFKYLRKL 184 Query: 349 RKLHDAFLR 357 + + F+ Sbjct: 185 DDMDERFVD 193 >gi|296453244|ref|YP_003660387.1| von Willebrand factor type A (vWA) domain-containing protein [Bifidobacterium longum subsp. longum JDM301] gi|296182675|gb|ADG99556.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. longum JDM301] Length = 565 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 62/187 (33%), Gaps = 15/187 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 V+D S SM+ G+ K + + KS + + + L+ F ++ + Sbjct: 389 WVVDYSGSMSGEGKNGVVK---GLNAALDPDQAKKSYIEPASGDVNILIPFETEAHRPVK 445 Query: 233 LA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + T GL A +++ E ++ I+ +T Sbjct: 446 ATGTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 497 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DG N D+++ +++ R +++I Q + S L Sbjct: 498 DG---RSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDL 554 Query: 352 HDAFLRI 358 F ++ Sbjct: 555 AAVFRQV 561 >gi|145632220|ref|ZP_01787955.1| hypothetical protein CGSHi3655_07194 [Haemophilus influenzae 3655] gi|229844730|ref|ZP_04464869.1| hypothetical protein CGSHi6P18H1_03949 [Haemophilus influenzae 6P18H1] gi|144987127|gb|EDJ93657.1| hypothetical protein CGSHi3655_07194 [Haemophilus influenzae 3655] gi|229812444|gb|EEP48134.1| hypothetical protein CGSHi6P18H1_03949 [Haemophilus influenzae 6P18H1] Length = 212 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 66/206 (32%), Gaps = 18/206 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + ++ ++ ++ P ++TF S+ Sbjct: 3 RLPVYLLVDTSGSMMGE------AIESVRNGLQMLVSALRQDPYALETAYLSVITFDSQA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + + ++ + T L + I +K KG + Sbjct: 57 KQVTPLT---ELMSFQLPTIEASGLTSMGGALSLLTDCINREVQKGSAEVKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 L+DG P D ++ + K A A A LK + Sbjct: 112 FLLSDGV---PTDDLQKGINALRTVKTG--TFVACAAGAGADTNVLKQITESVVSLDTTD 166 Query: 348 SRKLHDAFLRIGKEM--VKQRILYNK 371 + + F + + Q++ NK Sbjct: 167 ANSIKAFFKWVSASISVSSQKVDLNK 192 >gi|126306098|ref|XP_001362158.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1, partial [Monodelphis domestica] Length = 873 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 51/209 (24%), Positives = 82/209 (39%), Gaps = 41/209 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D S SM G +++L A + +L II+ +G+VTF S Sbjct: 307 LVLVIDTSRSM--KVGNRLNRLRQALQFF--LLQIIEKGSW------TGIVTFDSSATIQ 356 Query: 231 FPLAWGVQHIQEK--INRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L +Q K I+RL G T GL A+ + + + Sbjct: 357 SELMQIESDVQRKTLISRLPTVTVAGGGTHICSGLRTAFMVV------KKKFLTDGSE-- 408 Query: 285 KYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAI--GVQAEAADQFLKNCASPDR 341 + LTDGE+++ N C E K+ GAI++ I G E + L + Sbjct: 409 --MALLTDGEDNTTNT--------CFEEVKQSGAIIHTIVLGPSTEKGLEKLSEMTGGMK 458 Query: 342 FYSVQNSRK--LHDAFLRI--GKEMVKQR 366 + N + L DAF + G + QR Sbjct: 459 TTATDNVQNNGLIDAFSALSSGNAAITQR 487 >gi|113931618|ref|NP_001039260.1| inter-alpha (globulin) inhibitor H2 [Xenopus (Silurana) tropicalis] gi|72679293|gb|AAI00666.1| hypothetical protein MGC107982 [Xenopus (Silurana) tropicalis] Length = 942 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 78/216 (36%), Gaps = 31/216 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S + +++ V+DVS SM K+ +++ +L+ ++ + ++ F+ Sbjct: 300 SPLPKNILFVIDVSGSMWGL------KMKQTVEAMKSILEDLRPDDQFS------IIDFN 347 Query: 225 SKI-----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 I + + Q + + + T L A + +A KG Sbjct: 348 HNIRCWKDELVYASSVEKQDASKYVQSIQPNGGTNINEALLRAIFILKEAS------NKG 401 Query: 280 HDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + I+ ++DG+ + I + R ++++G+ + FL+ A Sbjct: 402 MLEPNSVSLIVLVSDGDPTVGEIKLSKIQKNVRTNIRDDFALHSLGIGFDVDYDFLERLA 461 Query: 338 SPDR------FYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + + +L + + ++ ++K I Sbjct: 462 QENHGMAQRIYGKQDTAAQLKEFYKKVSTPLLKNII 497 >gi|327272010|ref|XP_003220779.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Anolis carolinensis] Length = 919 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 73/209 (34%), Gaps = 33/209 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV--------RSGLVT 222 ++ V+DVS SM K+ +++ +LD ++S + + R LV Sbjct: 290 ILFVIDVSGSMWGL------KMRQTVEAMKTILDDLRSNDQFSVLDFNHNVRCWRDSLVQ 343 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 S+ + ++ I + T L A + +A Sbjct: 344 ASNAQ---------TEAAKKYIEGIHPNGGTNINDALLRAIFILKEASNMGMLDPSSTSM 394 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 I+ ++DG+ + + + + ++ +G+ + FLK A + Sbjct: 395 ----IVLVSDGDPTVGELKLPTIQKNVKKNIQDDISLFCLGIGFDVDYDFLKRLAQENNG 450 Query: 342 -----FYSVQNSRKLHDAFLRIGKEMVKQ 365 F + + S ++ F ++ ++K+ Sbjct: 451 MAHRVFGNQETSSQMRKFFNQVSTPLLKK 479 >gi|297678516|ref|XP_002817116.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 2 [Pongo abelii] Length = 1899 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 67/191 (35%), Gaps = 24/191 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ P V+ L +S Sbjct: 30 TRAEADIVLLVDGSWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQVALAQYS 80 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I + + + Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--R 135 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ + PD Sbjct: 136 ARKIGVLITDGKSQDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 341 RFYSVQNSRKL 351 Y+V + L Sbjct: 190 HAYNVADFESL 200 >gi|262199538|ref|YP_003270747.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082885|gb|ACY18854.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 523 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 62/167 (37%), Gaps = 14/167 (8%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + ++S++ ++ L ++ +DVS SM + G TR++ L + + Sbjct: 118 VGMSSTIDPATFERPSLTIVATVDVSGSMGWGYADDQVSAGSLTRNLLGAL-----VDQL 172 Query: 213 NNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 R +VT+ S++ L A I I++L +T GL+ AY +A Sbjct: 173 GPEDRIAIVTYGSRVDTALTLRSAGQKDEIHTAIDKLSEAGSTNMEAGLQRAYAIASEAA 232 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 E + I+ TD + + + +E G Sbjct: 233 ADGETDSTR-------IMLFTDVQPNVGATGASQFEAMASEGADSGV 272 >gi|218670347|ref|ZP_03520018.1| hypothetical protein RetlG_01180 [Rhizobium etli GR56] Length = 125 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 24/70 (34%), Gaps = 3/70 (4%) Query: 20 LTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDF 79 +TAIL PV+ G+ + K +L + + NG Q F Sbjct: 1 MTAILAPVLLGAAGMAVHVGDMLLSKQQLQ---EAADSAALATATALANGTIQTSQAEAF 57 Query: 80 SYRIIKNIWQ 89 + + W+ Sbjct: 58 ARNFVAGKWR 67 >gi|33600172|ref|NP_887732.1| putative hemolysin [Bordetella bronchiseptica RB50] gi|33567770|emb|CAE31684.1| putative hemolysin [Bordetella bronchiseptica RB50] Length = 3346 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 43/192 (22%), Positives = 84/192 (43%), Gaps = 11/192 (5%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK---LGVATRSIREMLDIIKSIPDVN 213 +K + + ++ +VLD+S SMND +G G +K L A +++ +L+ ++ D Sbjct: 2715 GGIKQNVTAGTSYNIALVLDLSDSMNDKWGSGSNKPTRLQTAKDALKALLENQLAVHDGE 2774 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + L+TF+ G+ +++ E ++ ++ G ST A+++ E Sbjct: 2775 --INVSLITFNGSSSALKKSITGLTPENVDEMVD-ILMGLKASSTTPYGAAFDRTTQWFE 2831 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + YK FLTDGE S+ N+++ F A V+ IG+ + + Sbjct: 2832 GQPTVDSEGKPYKNLTFFLTDGEPSTEWSYNRDNEFAELAAISD---VHGIGIGSGVSTS 2888 Query: 332 FLKNCASPDRFY 343 L + +Y Sbjct: 2889 TLNKYDNTGGYY 2900 >gi|45332244|gb|AAS58046.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 655 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 71/194 (36%), Gaps = 18/194 (9%) Query: 134 EMPFIFCTFPWCANSSH-APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P + F A I S K LD +V+D S S++D G Sbjct: 10 SVPLLSLAFLATTGIHAFADKGIGSPKGKQCKKQ--LDFSIVVDESASISDDQWEGQ--- 64 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS 251 +R ++ + N +R L T+S+ Q F L + + +L + + Sbjct: 65 --MIPFLRNLIHTVDL---DNTDIRLSLTTYSTPTRQIFTFLDAAASSTRLALTKLDWMN 119 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TK+ G+ Y + ++ + G + K ++ +TDG +S + + Sbjct: 120 GTKARYGMTYTGRALNYVRK--AILPYGRKNVPKALLLITDGVSSDGSY----TAQVAAM 173 Query: 312 AKRRGAIVYAIGVQ 325 + G V IGV Sbjct: 174 LRDEGVNVMVIGVG 187 >gi|297460574|ref|XP_597723.5| PREDICTED: collagen, type XX, alpha 1 [Bos taurus] Length = 1364 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 35/266 (13%), Positives = 78/266 (29%), Gaps = 24/266 (9%) Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + K F N + + + + + Sbjct: 74 TTKTPKATVGGLSPSKGYTLQIFELTGSGNILLARREFVIEDLKSNSVSRSSQRPLGATL 133 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 + P A + + + +DM+ ++D S S+ Sbjct: 134 EPTPSLVGSPDLEPPGALAPSQDPPTLGRPQFRCTPPMPVDMIFLVDGSWSIGHSH---- 189 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINRL 247 + +++ + P V+ GL +S + L A+G + + ++ L Sbjct: 190 --FQQVKDFLASVIEPFEIGPGK---VQVGLTQYSGAPQTEWDLNAFGTKEEVLNAVHNL 244 Query: 248 IF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + G T + L + + L+ A + K +I +TDG++ L Sbjct: 245 HYRGGNTFTGLALTHVLE------QNLKPRAGLRPEAAKLVILVTDGKSQDDAHTAGHVL 298 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 K G ++A+GV+ + Sbjct: 299 ------KGLGVDIFAVGVKNADETEL 318 >gi|297481786|ref|XP_002692297.1| PREDICTED: Collagen, type XX, alpha 1-like [Bos taurus] gi|296481265|gb|DAA23380.1| Collagen, type XX, alpha 1-like [Bos taurus] Length = 1342 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 35/266 (13%), Positives = 78/266 (29%), Gaps = 24/266 (9%) Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + K F N + + + + + Sbjct: 74 TTKTPKATVGGLSPSKGYTLQIFELTGSGNILLARREFVIEDLKSNSVSRSSQRPLGATL 133 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM 189 + P A + + + +DM+ ++D S S+ Sbjct: 134 EPTPSLVGSPDLEPPGALAPSQDPPTLGRPQFRCTPPMPVDMIFLVDGSWSIGHSH---- 189 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINRL 247 + +++ + P V+ GL +S + L A+G + + ++ L Sbjct: 190 --FQQVKDFLASVIEPFEIGPGK---VQVGLTQYSGAPQTEWDLNAFGTKEEVLNAVHNL 244 Query: 248 IF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + G T + L + + L+ A + K +I +TDG++ L Sbjct: 245 HYRGGNTFTGLALTHVLE------QNLKPRAGLRPEAAKLVILVTDGKSQDDAHTAGHVL 298 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 K G ++A+GV+ + Sbjct: 299 ------KGLGVDIFAVGVKNADETEL 318 >gi|83745131|ref|NP_001032954.1| collagen alpha-1(XXVIII) chain precursor [Mus musculus] gi|123789585|sp|Q2UY11|COSA1_MOUSE RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor gi|83423286|emb|CAI67593.1| collagen, type XXVIII [Mus musculus] gi|189442117|gb|AAI67245.1| Collagen, type XXVIII, alpha 1 [synthetic construct] Length = 1141 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 63/177 (35%), Gaps = 24/177 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV---VRSGLVTFSS 225 +D++ +LD S S + + + + I + ++ ++ + FSS Sbjct: 47 IDVVFILDSSES------SKIVLFDNQKDFVDSLSEKIFQLTPGRSLKYDIKLAALQFSS 100 Query: 226 KIVQTFPLA-W-GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL+ W ++ ++++ L G T S + A + K Sbjct: 101 SVQIDPPLSSWKDLRTFKQRVKSLNLIGQGTFSYYAISNATRLLKREGRKDG-------- 152 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K + +TDG + + D + +A+ G +G+ + L+ + Sbjct: 153 -VKVALLMTDGIDHPKSPDVQS---ISEDARILGISFITVGLSTVVNEAKLRLISGD 205 >gi|254445696|ref|ZP_05059172.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198260004|gb|EDY84312.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 923 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 35/206 (16%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + S L + + +D S SM +D + +++ L Sbjct: 567 SLQTAAHGRASSQPLHLTLAIDTSGSM--SRPDRVDIVNSLATALQSNLTEKD------- 617 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN------RLIFGSTTKSTPGLEYAYNKIFD 268 R +V+F + L Q + + N +L T L+ +Y Sbjct: 618 --RLSIVSF----DRQPRLVLDGQSVTAETNLATLATQLNPQGGTDLESALQLSYQT--- 668 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV--YAIGVQA 326 A+ + A +I +TDG + N + ++ E + RG + + IG Sbjct: 669 AQRHFQENAINR------VILITDGAANLGNTNAEQLRTTVTENRIRGIALDCFGIGFDG 722 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRK 350 D FL++ + R+ +++ Sbjct: 723 HD-DTFLESLSRNGDGRYRFLRSPED 747 >gi|308497646|ref|XP_003111010.1| CRE-CUTL-23 protein [Caenorhabditis remanei] gi|308242890|gb|EFO86842.1| CRE-CUTL-23 protein [Caenorhabditis remanei] Length = 801 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 62/184 (33%), Gaps = 28/184 (15%) Query: 135 MPFIFCTFPWCANSSHA-------PLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDH 184 +P + T + A + P ++ + V + + LD++ +LD S S+ D Sbjct: 7 IPLLILTVTFQAEAVKIIDNGLAPPEIVHTPVSTKPRCKVFAPPLDLIFILDSSGSLRDK 66 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQE 242 F +D + I+K + + R L+ FS F + + Sbjct: 67 FQDEIDIIRR----------ILKHVTIGKSATRVMLIQFSGTQHLEFNFEKFTDREELLA 116 Query: 243 KINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++ L T+ E+A ++ + K + L+DG D Sbjct: 117 ALDVLRHVSGITRIGGAFEFALQQLKSPG----SGLRDGSVP-KIVYLLSDGRTHDYPKD 171 Query: 302 NKES 305 + S Sbjct: 172 WQMS 175 >gi|297270401|ref|XP_002800091.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Macaca mulatta] Length = 3386 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDDSSSVGEV------NFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 APKEEHCYLLHSFEE----FEALARRALHE 262 >gi|146338842|ref|YP_001203890.1| NorD protein required for nitric oxide reductase (Nor) activity [Bradyrhizobium sp. ORS278] gi|146191648|emb|CAL75653.1| NorD protein required for nitric oxide reductase (Nor) activity [Bradyrhizobium sp. ORS278] Length = 637 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 82/203 (40%), Gaps = 18/203 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++DVSLS D + G+ L V ++ + + + D +++V TF+S+ Sbjct: 447 DLAVTLLVDVSLS-TDAWIDGVRVLDVEKEALLVLAHGLSACGDSHSIV-----TFTSRR 500 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY-NKIFDAKEKLEHIAKGHDDYKKY 286 +W + + G + L Y +I A + K+ Sbjct: 501 N-----SWVRLETLKAFGETMSGQVERRIGALRPGYYTRIGTAVRHAAGELAARPERKRL 555 Query: 287 IIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ LTDG+ + +++ EA+R G V+ + V ++A ++ P Sbjct: 556 LVVLTDGKPNDVDHYEGRFAIEDTRKAVQEARRAGVAVFGVTVD-KSAQSYVPTLFGPAG 614 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 + V N R+L A + +++ Sbjct: 615 YAIVGNIRRLPAALPALYRQLAH 637 >gi|296815358|ref|XP_002848016.1| U-box domain containing protein [Arthroderma otae CBS 113480] gi|238841041|gb|EEQ30703.1| U-box domain containing protein [Arthroderma otae CBS 113480] Length = 748 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 80/235 (34%), Gaps = 28/235 (11%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + N + + I +K + D+++V+D+S SMN + Sbjct: 39 ILSVHSIPNKNSMVVSIQPPLKPKDDVPHVPCDIVLVIDISASMNSAAPIPTGE-SGGED 97 Query: 198 SIREMLDIIK-----SIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLI 248 + +LD+ K I +N R +VTF ++I F L + + I+ L Sbjct: 98 TGLSILDLTKHAAKTIIQTLNENDRLAVVTFCTEIRVAFELEFMSEENKSKVLAAIDCLH 157 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS------PNIDN 302 S+T G++ + + A ++ LTDG + Sbjct: 158 GISSTNLWHGIKEGLKVLATNSTQGNVQA---------LLVLTDGAPNHMCPAQGYVPKL 208 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 +++L + +++ G L++ A F + ++ + F Sbjct: 209 RQTLLDHRDLTGSLPLIHTFGFGYYLRSPLLQSIAEIGGGTFAFIPDAGMIGTVF 263 >gi|8569518|pdb|1QC5|A Chain A, I Domain From Integrin Alpha1-Beta1 Length = 192 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 66/208 (31%), Gaps = 33/208 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++VLD S S + T + ++L+ + P G+V + + Sbjct: 4 LDIVIVLDGSNS--------IYPWDSVTAFLNDLLERMDIGPKQTQ---VGIVQYGENVT 52 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD--YKKY 286 F + L+ G + D K KK Sbjct: 53 HEF----NLNKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKV 108 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---------QFLKNCA 337 ++ +TDGE + DN + + ++I + + +K+ A Sbjct: 109 MVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIA 164 Query: 338 S---PDRFYSVQNSRKLHDAFLRIGKEM 362 S F++V + L +G+ + Sbjct: 165 SEPTEKHFFNVSDEIALVTIVKTLGERI 192 >gi|254304217|ref|ZP_04971575.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148324409|gb|EDK89659.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 529 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 74/216 (34%), Gaps = 22/216 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + ++ +++ +VLD S SM G + +A SI+++L + + + G+ F Sbjct: 172 EENMNVNVEIVLDASGSMVKKIGDK-TMMEIAKESIKQVLSEMPANA------KVGIRVF 224 Query: 224 ---SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE-YAYNKIFDAKEKLEHIAK- 278 + G + I L K+ ++ + I + E K Sbjct: 225 GHKGDNTASKKDESCGANELIYPIGDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDLKA 284 Query: 279 -GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAAD-QFLK 334 + + +TDG + + + K ++ IG +A + LK Sbjct: 285 LDGEKTLNILYIITDGIETCGG----NPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLK 340 Query: 335 NC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A+ + SV ++ KL RI + Sbjct: 341 QIADAAGGYYSSVNDADKLTGELYRINELAFSDYKW 376 >gi|91776389|ref|YP_546145.1| von Willebrand factor, type A [Methylobacillus flagellatus KT] gi|91710376|gb|ABE50304.1| MxaC, protein involved in Ca2+ insertion into methanol dehydrogenase [Methylobacillus flagellatus KT] Length = 333 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 69/198 (34%), Gaps = 19/198 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K G +M+VLD S SMND F + + + ++S + G+VTF Sbjct: 84 KVGRGAHVMIVLDRSASMNDSFADSAKHDSESKMAAARRV--LQSFVRQSREDLLGMVTF 141 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ + PL + + + G T GL A + Sbjct: 142 STSPILAAPLGGDREAVLAALRATEAGGMGFTAVARGLGMALDYFEGRPVTGARA----- 196 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I+ ++DG ++D K R+GA +Y + +++ A D Sbjct: 197 -----ILLVSDG---GAHLDVKTQDLLREMFHRQGASLYWVYLRSANGVSIKN--APEDE 246 Query: 342 FYSVQNSRKLHDAFLRIG 359 +LHD F +G Sbjct: 247 DLDAYPEHQLHDYFNSLG 264 >gi|1708567|sp|P53710|ITA2_BOVIN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|439696|gb|AAB59255.1| integrin alpha 2 subunit [Bos taurus] Length = 1170 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 72/210 (34%), Gaps = 33/210 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 162 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---MGLIQYANNPR 210 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFDAKEKLEHIAKGHDD-YKKY 286 + + + K + S T G L + I A++ A G K Sbjct: 211 VV----FNLNTFKSKDEMIKATSQTFQYGGDLTNTFKAIQYARDTAYSTAAGGRPGATKV 266 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---LKNCA 337 ++ +TDGE + D + ++ + + + I V A +K A Sbjct: 267 MVVVTDGE----SHDGSKLKAVIDQCNKDNILRFGIAVLGYLNRNALDTKNLIKEIKAIA 322 Query: 338 S---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 S F++V + L + IG+++ Sbjct: 323 SIPTERHFFNVSDEADLLEKAGTIGEQIFS 352 >gi|332295408|ref|YP_004437331.1| von Willebrand factor type A [Thermodesulfobium narugense DSM 14796] gi|332178511|gb|AEE14200.1| von Willebrand factor type A [Thermodesulfobium narugense DSM 14796] Length = 438 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 63/191 (32%), Gaps = 24/191 (12%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + +L++ V + L + VLD S SM++ K+ + S++++L+ Sbjct: 17 TSGQKLFVLLSIEVSKEVEERGKLFVSFVLDTSGSMSETVNDK-SKIEIVIESLKKILES 75 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH---IQEKINRLIFGSTTKSTPGLEYA 262 + + +VTF ++ P + T G++ + Sbjct: 76 N----ILKDDDEISIVTFDDEVKIVLPFTAATEKEKIFSSFEQIRTGTVGTNLGAGMKVS 131 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + + D K ++ LTDG D +E V ++ Sbjct: 132 LDLLKDKAGI------------KKMVVLTDG----NVFDLDLVEKVLDELVFSNISVISV 175 Query: 323 GVQAEAADQFL 333 GV E + L Sbjct: 176 GVGDEWNEDLL 186 >gi|72007460|ref|XP_780292.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] gi|115939674|ref|XP_001195885.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 1500 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 59/181 (32%), Gaps = 32/181 (17%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---- 225 ++ +LD S S+ F ++ L A+ K I + R ++++SS Sbjct: 44 LVFILDSSGSVAQSDFIISVEFLKFAS----------KIISVSASTTRVAVISYSSCNQI 93 Query: 226 ---KIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + P + + + T + LE A + Sbjct: 94 HIRVNYISSPENKNKCTFDNDLTSVNYHPGGTCTAGALEAAGRDVLSHGRPGA------- 146 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ ++ LTDG ++ + + K G ++ IG+ + + S D Sbjct: 147 --QRVVMLLTDGASNDGGPPHAN----AQKLKSEGVKIFTIGIGSIKLSELNAIATSVDE 200 Query: 342 F 342 + Sbjct: 201 Y 201 >gi|323693762|ref|ZP_08107958.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] gi|323502183|gb|EGB18049.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] Length = 1560 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 38/279 (13%), Positives = 89/279 (31%), Gaps = 62/279 (22%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIK-- 207 L +T ++ D++ V+D S SM + G ++ + ++ ++ +K Sbjct: 585 VTLEVTGDSIQTTIGGGTADIVFVIDKSSSMNQWDYDLGGNRWTILKETVDRFINKLKIT 644 Query: 208 -----------SIPDVNNVVRSG--------LVTFSSKIVQT---FPLAWGVQ----HIQ 241 D+ N R+G + + + + L + Sbjct: 645 SPNSKISFIEYQSSDLTNYTRTGTYDEIRNVAMANTENADKNLEGYSLKYFRTLQRWTAI 704 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK---KYIIFLTDGE---- 294 ++ +GS G A + + ++Y KY+I+L DG Sbjct: 705 SEVGSKPYGSAPGGNQGTHSAGGYLGAERALDRLKKYNPEEYNSNVKYVIYLADGTAGFY 764 Query: 295 -----------NSSPNIDNKESLFYCNEAKRR--GAIVYAIGVQAEAADQF--LKNCASP 339 + + ++ E K++ A +Y + ++++ +K A Sbjct: 765 VDSYGYRDGAGSGGNTNARRAAITQSGELKKKHPDATIYTVAFGSDSSANMNWMKPGAYN 824 Query: 340 DR-----------FYSVQNSRKLHDAFLRIGKEMVKQRI 367 FYS N+ +L F + +++ + Sbjct: 825 GNITNPYNPNVTAFYSATNTEELEKTFDSLAEQVGSSAV 863 >gi|223462031|gb|AAI46869.1| Collagen, type XII, alpha 1 [Homo sapiens] Length = 1899 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 67/191 (35%), Gaps = 24/191 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ P V+ L +S Sbjct: 30 TRAEADIVLLVDGSWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYS 80 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I + + + Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--R 135 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ + PD Sbjct: 136 ARKIGVLITDGKSQDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 341 RFYSVQNSRKL 351 Y+V + L Sbjct: 190 HAYNVADFESL 200 >gi|260823774|ref|XP_002606843.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] gi|229292188|gb|EEN62853.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] Length = 1317 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 76/204 (37%), Gaps = 41/204 (20%) Query: 165 SDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + ++++ ++D S S+ ++F + R ++++L N R +VTF Sbjct: 70 ENQTVELVFLVDSSASVGNENFNSEL-------RFVKKLLADFTLAE---NAARVAIVTF 119 Query: 224 SSK------IVQTFPLAWGVQH---IQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKL 273 SS+ + ++ ++E++ R+ G T + + A + A+ Sbjct: 120 SSRNKVVNHVDHLSKPSYHKHKCSLLEEELPRIKYAGGGTYTKGAMIKAQEVLRHARPNA 179 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 K + +TDG ++ + L + K+ ++ G++ + L Sbjct: 180 ----------TKAVFLMTDGYSNGG-----DPLPEARKLKQNDVQIFTFGIR-SGNVKEL 223 Query: 334 KNCASPDRFYSVQNSRKLHDAFLR 357 +N A+ + D+F Sbjct: 224 QNMATD----PAEEHSYFLDSFAE 243 >gi|119569134|gb|EAW48749.1| collagen, type XII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1899 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 67/191 (35%), Gaps = 24/191 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ P V+ L +S Sbjct: 30 TRAEADIVLLVDGSWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYS 80 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I + + + Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--R 135 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ + PD Sbjct: 136 ARKIGVLITDGKSQDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 341 RFYSVQNSRKL 351 Y+V + L Sbjct: 190 HAYNVADFESL 200 >gi|114608142|ref|XP_001142833.1| PREDICTED: collagen alpha-1(XII) chain isoform 1 [Pan troglodytes] Length = 1899 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 67/191 (35%), Gaps = 24/191 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ P V+ L +S Sbjct: 30 TRAEADIVLLVDGSWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYS 80 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I + + + Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--R 135 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ + PD Sbjct: 136 ARKIGVLITDGKSQDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 341 RFYSVQNSRKL 351 Y+V + L Sbjct: 190 HAYNVADFESL 200 >gi|93141049|ref|NP_542376.2| collagen alpha-1(XII) chain short isoform precursor [Homo sapiens] Length = 1899 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 67/191 (35%), Gaps = 24/191 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ P V+ L +S Sbjct: 30 TRAEADIVLLVDGSWSIGRA------NFRTVRSFISRIVEVFDIGPKR---VQIALAQYS 80 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I + + + Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQNFRTQAGMRP--R 135 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ + PD Sbjct: 136 ARKIGVLITDGKSQDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 341 RFYSVQNSRKL 351 Y+V + L Sbjct: 190 HAYNVADFESL 200 >gi|73997822|ref|XP_534930.2| PREDICTED: similar to voltage-gated calcium channel alpha(2)delta-4 subunit isoform 1 [Canis familiaris] Length = 1121 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 70/193 (36%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SM ++ +A +I +LD + VN ++ +S I Sbjct: 274 DIVIVVDTSGSMKGL------RMTIAKHTISTILDTLGENDFVN------IIAYSDYIHY 321 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ P L A+ + +E + Sbjct: 322 VEPCFKGTLVQADRDNREHFKQLVDELMVKGVGIVNPALTEAFQILKQFQEARQ-----G 376 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ + +F R V+ IG + AD+ C Sbjct: 377 SLCNQAIMLITDG-----AVEDYKPVFEKYNWPDRKIRVFTYLIGREVTFADRMKWIACN 431 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 432 NKGYYTQISTLAD 444 >gi|313239872|emb|CBY14719.1| unnamed protein product [Oikopleura dioica] Length = 982 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 56/183 (30%), Gaps = 29/183 (15%) Query: 178 SLSMNDHFGP---GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 S SM P ++ + I +++D D R G+V +S L Sbjct: 2 SCSMLTAPRPIFNPLEDFEKVRQWIGKLVDTFDIEEDG-GGTRVGVVIYSDAPRMEISLG 60 Query: 235 WGVQH-------IQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G+ + +N + F T + + YA F Sbjct: 61 NGLGKTDLIKAVLVIYLNLIDFLFAQSLMYERGNTLTGESIRYASEVAFSE---TSGARA 117 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +I LTDG A+ G ++YA+GV D+ + + Sbjct: 118 LSEGINRIMIVLTDGRAQDNVAGPAV------IAQEDGIVIYAVGVGHAIKDELDEIASK 171 Query: 339 PDR 341 P Sbjct: 172 PTH 174 >gi|313212957|emb|CBY36854.1| unnamed protein product [Oikopleura dioica] Length = 1117 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 56/183 (30%), Gaps = 29/183 (15%) Query: 178 SLSMNDHFGP---GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 S SM P ++ + I +++D D R G+V +S L Sbjct: 2 SCSMLTAPRPIFNPLEDFEKVRQWIGKLVDTFDIEEDG-GGTRVGVVIYSDAPRMEISLG 60 Query: 235 WGVQH-------IQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G+ + +N + F T + + YA F Sbjct: 61 NGLGKTDLIKAVLVIYLNLIDFLFAQSLMYERGNTLTGESIRYASEVAFSE---TSGARA 117 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + + +I LTDG A+ G ++YA+GV D+ + + Sbjct: 118 LSEGINRIMIVLTDGRAQDNVAGPAV------IAQEDGIVIYAVGVGHAIKDELDEIASK 171 Query: 339 PDR 341 P Sbjct: 172 PTH 174 >gi|288919483|ref|ZP_06413814.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349086|gb|EFC83332.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 533 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 40/254 (15%), Positives = 79/254 (31%), Gaps = 46/254 (18%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--------DIGLDMMMVLDVSLSMND 183 R P + + A +H L + ++ + + VLD+S SM Sbjct: 295 RPATPTVPLDTRFTAGDTHVELPFPGTAAVADELILAYLNHFRAPTHAIFVLDLSGSMEG 354 Query: 184 HFGPGMDK----LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ--------TF 231 + L A S+ ++ + LV F S + + Sbjct: 355 DRISDLRSALIGLTGADSSLTARFTSFRAREKIT------LVPFDSGVNRISDFAVTDPS 408 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 P + ++ ++ + G T Y+ + A ++ Y ++ LT Sbjct: 409 PDSPELKELRRAVEGFNAGGDTAI-------YSALRAAYDRAAADLARDGSYYTSVVLLT 461 Query: 292 DGENSSPNIDNK-----ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYS 344 DGEN++ + SL A + + +A L+ A + + Sbjct: 462 DGENTTGASADDFLAHYRSLSPAARA----VPTFTVLFG-DADPDALRQIADVTGGTVFD 516 Query: 345 VQNSRKLHDAFLRI 358 ++ L D F I Sbjct: 517 AGSTS-LPDVFKDI 529 >gi|124262931|ref|YP_001023401.1| hypothetical protein Mpe_B0391 [Methylibium petroleiphilum PM1] gi|124262177|gb|ABM97166.1| hypothetical protein Mpe_B0391 [Methylibium petroleiphilum PM1] Length = 601 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 73/195 (37%), Gaps = 25/195 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S + ++ +M++ D S SMN+ FG M + A+R +++ + V Sbjct: 422 KRSLQEELDTCVMVLDDESGSMNEPFGD-MRREDAASRVCVGAGEVLNNAE-----VPFA 475 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 LV +++ + Q + S+T + + +A ++ + KE+ Sbjct: 476 LVGYNTSLHQYKGFDDSWAETLKDFGP-HSASSTNTHLAVVWALRELINRKERR------ 528 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K + +TDG+ D EAK G + ++ + K + Sbjct: 529 -----KILKVVTDGDPG----DQTVLAAAIEEAKAFGVE---VRFVLISSREEYKYRSMG 576 Query: 340 DRFYSVQNSRKLHDA 354 + ++ +L +A Sbjct: 577 VPYGVANDAPELANA 591 >gi|290996510|ref|XP_002680825.1| vWFA domain-containing protein [Naegleria gruberi] gi|284094447|gb|EFC48081.1| vWFA domain-containing protein [Naegleria gruberi] Length = 395 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 52/147 (35%), Gaps = 21/147 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----S 224 +D+++V+D + SM+ ++ VA R++ ++ + + +R V++ Sbjct: 79 VDLVIVMDCTGSMS-------GEIEVAKRTVTTIISTL--HEKFQSDLRFSAVSYRDHTD 129 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 V+ FP + + IN + L A I + + K Sbjct: 130 DYAVKEFPFTKDLNKAKGYINTMSAQGGGDHPEALASALYVINEMPFNKKG--------K 181 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNE 311 K ++++ D + C + Sbjct: 182 KIVVWVADAPPHGMKTSSDSYPEGCKD 208 >gi|269960459|ref|ZP_06174831.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] gi|269834536|gb|EEZ88623.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] Length = 420 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 44/129 (34%), Gaps = 2/129 (1%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 KG +L +++L ++ + I+ +H+ K +L +D + L A + N Sbjct: 10 KRTQKGITLVLISMVLLILLGMAAFGIDLNHQVLNKTRLQNAVDSAALAGAVVVDENGNV 69 Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRENG--FAQDINNIERSTSLSIIIDDQHKDYNL 127 + + I + + F+ D + S + ++ Y Sbjct: 70 SAAETAAKATLSSISASDGNAELVFTDSNTAVTFSTDRATFVSAASFTPPASGEYDIYVR 129 Query: 128 SAVSRYEMP 136 AV+ + Sbjct: 130 VAVTEIGLT 138 >gi|253996911|ref|YP_003048975.1| von Willebrand factor type A [Methylotenera mobilis JLW8] gi|253983590|gb|ACT48448.1| von Willebrand factor type A [Methylotenera mobilis JLW8] Length = 329 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 71/222 (31%), Gaps = 44/222 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHFG-----PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 +G + +VLD S SM+D F G +G L I G+ Sbjct: 77 GVGAQIALVLDRSASMDDPFSGTADHNGNTTVGETKSVAAARL--ITEFVKSRQQDMFGM 134 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +TFS+ + PL+ + + + + T GL + +FD A Sbjct: 135 ITFSNSAMYVLPLSENKKAVIAAVQATAGNALFQTNIGSGL-TSSAALFDKVTDSGSRA- 192 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------- 329 +I L+DG + +D + +R +Y I ++ Sbjct: 193 --------VILLSDG---AGRVDANTQQKIRDWFQRLNISLYWIVLRQPGGLSIFDPNFV 241 Query: 330 -------------DQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 ++ + +P + Y ++ R L A I Sbjct: 242 PVEDQPLPAQIELHEYFQTFKTPFKAYEAEDPRSLQLAMNDI 283 >gi|59711129|ref|YP_203905.1| TadG-like protein [Vibrio fischeri ES114] gi|59479230|gb|AAW85017.1| TadG-like protein [Vibrio fischeri ES114] Length = 465 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 67/215 (31%), Gaps = 15/215 (6%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 I G +IL + +PV+F V L + + KA+L + ++L + Sbjct: 2 IMKLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVSAYGEE 61 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 E K + N+ + L + + + + + + K Sbjct: 62 DEVSTQTGKDYVAHYMHDMSNLVDIEVEK-LECSELPECTADDNDRPFVEYQVSGRTK-- 118 Query: 126 NLSAVSRYEMPFIFC-TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + S ++ F +F S + +D+ +LD S SMN Sbjct: 119 HKSWFPGNDVTVGFGESFDVTGMSKARKFQSSQP----------MDITFILDFSGSMNYD 168 Query: 185 FGPGM-DKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + + + D+ +VV+ Sbjct: 169 WEGHAPSYMEEEVPKVPGRYSPPSRLSDLKDVVQM 203 >gi|325981245|ref|YP_004293647.1| von Willebrand factor type A [Nitrosomonas sp. AL212] gi|325530764|gb|ADZ25485.1| von Willebrand factor type A [Nitrosomonas sp. AL212] Length = 651 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 77/217 (35%), Gaps = 36/217 (16%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFG---------PGMDKLGVATRSIREMLDIIKSIPDVN 213 + L + +++DVSLS + G + L + R+ I Sbjct: 455 RQHARDLSVAILMDVSLSTDSWIGGRRILDIEKEALITLATGLAACRDTFAIYTFTSRKR 514 Query: 214 NVVR-SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + VR +G+ F+ ++ Q ++ I L G T+ L + Sbjct: 515 DYVRVTGIKDFNE--------SFNTQVLRR-ITALRPGYYTRMGAALRH----------- 554 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 +H+ + + I+ LTDG+ + +++ EA+R G V+ I + + Sbjct: 555 TQHLLSQRSERHRLILLLTDGKPNDLDYYEGRYGVEDTRQAIIEARRAGLSVFGITIDHK 614 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A D F + V +L I ++++ Sbjct: 615 AQDYF-PYLFGRGGYAIVTKPERLSHLLPVIYQQLIN 650 >gi|218662717|ref|ZP_03518647.1| hypothetical protein RetlI_26604 [Rhizobium etli IE4771] Length = 295 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/284 (13%), Positives = 81/284 (28%), Gaps = 82/284 (28%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-----------------GMDK--- 191 + I+ + ++ +D ++LD + SM P MDK Sbjct: 5 SITISGTATAEYQTAAFMDFYILLDNTPSMGVGATPDDVSKLEAKAGCAFACHQMDKTIN 64 Query: 192 -------------LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-----QTFPL 233 + V ++ + + D K+ ++ R G+ TF +K L Sbjct: 65 NYTIAKSLGVAMRIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTTISGL 124 Query: 234 AWGVQHIQEKINRLIF----------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + ++ + + T + I + Sbjct: 125 TSDLTKVKNYTDAVDLMTIPYQNYNNDQITNFDSAMTQMNTII-----DQAGDGTSNISA 179 Query: 284 KKYIIFLTDG------------ENSSPNIDNKESLFYCNEAKRRGAIV---YAIGVQAEA 328 +K + F++DG + + +C K RG + Y + + Sbjct: 180 EKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPS 239 Query: 329 ADQF--------------LKNCASPDRFYSVQNSRKLHDAFLRI 358 + ++ CASP ++ V + + DA + Sbjct: 240 NSWYNTWIKPFQGEIPTKMQACASPGFYFEVSPTEGITDAMKAL 283 >gi|254172494|ref|ZP_04879169.1| PKD domain protein [Thermococcus sp. AM4] gi|214033423|gb|EEB74250.1| PKD domain protein [Thermococcus sp. AM4] Length = 1418 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 48/136 (35%), Gaps = 18/136 (13%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 +++ S S D++ V D + SM+ D++G R++ E+++ ++ Sbjct: 71 ATIIQKSGSISPADIVFVFDDTGSMD-------DEIGTMKRNVNELVESLEGYGIRA--- 120 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R LVTF P V + +++L + A Sbjct: 121 RYALVTFKDSPSLRLPFTTNVSLFTQTVSKLYASGGGDTPEDDLDAIAM--------ALR 172 Query: 277 AKGHDDYKKYIIFLTD 292 +K +I +TD Sbjct: 173 LNYSRLSQKILILITD 188 >gi|149037634|gb|EDL92065.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_e [Rattus norvegicus] Length = 2254 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 126/359 (35%), Gaps = 57/359 (15%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K +S+L ++ + + +F +N Sbjct: 863 VRIGVVQFSNDVFPEFYLKTHKSQ---NSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 916 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL---------SIIIDDQHKDYNLSA 129 ++ + + + + +++ R + I + + + Sbjct: 917 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTI 976 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-----------SDIGLDMMMVLDVS 178 + + F F N +L + + D++ +LD Sbjct: 977 TNDPRLVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEKKKADIVFLLD-- 1034 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--G 236 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1035 GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFST 1087 Query: 237 VQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + I + IN++I+ + G+E+ + E ++ + + +T G+ Sbjct: 1088 KRQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDERVPQIAFVITGGK 1143 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 + D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1144 SVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1196 >gi|148704833|gb|EDL36780.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_a [Mus musculus] Length = 608 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 388 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 446 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG- 250 +I K+ + + V F+ Q ++ + +E + ++ Sbjct: 447 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYNTKENVLAVLANI 495 Query: 251 ----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 496 RYMSGGTATGDAIAFTVRNVFGPIRD--------SPNKNFLVIVTDGQ----SYDDVRG- 542 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 543 -PAAAAHDAGITIFSVGVAWAPLDDLRDMASKP 574 >gi|56477526|ref|YP_159115.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] gi|56313569|emb|CAI08214.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] Length = 441 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 15/75 (20%), Positives = 31/75 (41%), Gaps = 3/75 (4%) Query: 6 IRNFFYNC---KGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 +RN + +G ++I+TA+ L V+ GL ++ H + K +L D L + + Sbjct: 1 MRNRHADRDSQRGVVAIITALSLVVLVGFAGLALDGGHLYLTKTELQNGADACALAASYE 60 Query: 63 ILNQENGNNGKKQKN 77 + + Sbjct: 61 LTGSPISPENFTRAE 75 >gi|317483399|ref|ZP_07942390.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915154|gb|EFV36585.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 401 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 49/334 (14%), Positives = 96/334 (28%), Gaps = 51/334 (15%) Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELR----ENGFAQDINNIERSTSL 114 TAT + + N + + L QD + + ++S+ Sbjct: 77 TATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSM 136 Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA--------PLLITSSVKISSKSD 166 I + D K A S P +F A P+ + S Sbjct: 137 WISMGD-TKHIVKDAASTSTTPIVFGIAKSKAVKLGWADDTGAAKPVSTADILAAVSDGK 195 Query: 167 IGLDMM-----------------------MVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + M V+D S SM+ G+ K + + Sbjct: 196 LTFSMTSATVIDSALNVYQTALRKPSWTIWVVDYSGSMSGEGKNGVVK---GLNAALDPD 252 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYA 262 KS + + + L+ F ++ + + + + T GL A Sbjct: 253 QAKKSYIEPASGDVNILIPFETEAHRPVKATGTSTSDLLHEADATDASGGTDIYEGLLSA 312 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 +++ E ++ I+ +TDG ++S + D ES + +++ R +++I Sbjct: 313 LDELPSESEASQYTTA--------IVLMTDGRSNSDHQDEFESSY---KSRGRDLPIFSI 361 Query: 323 GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL 356 Q + S L F Sbjct: 362 MFGDADPSQLKSLATLSNAKVFDGRSGDLAAVFR 395 >gi|290958909|ref|YP_003490091.1| hypothetical protein SCAB_44831 [Streptomyces scabiei 87.22] gi|260648435|emb|CBG71546.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 708 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 69/197 (35%), Gaps = 27/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +D+ + +VLD S SM ++ G LG T ++ LD K+ V V F Sbjct: 520 TDVRAKVYLVLDRSASMRGYYKDGSAQALGEQTLALAAHLDPEKTTVPV--------VFF 571 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S+++ T L K++ L G A ++ K A Sbjct: 572 STELDGTGELTLD--AFDNKVDDLHAGLGRMGRTSYHVAVEEVLAQHRKAAAGA------ 623 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQF-----LKNCA 337 ++F TDG + N+ + A+ G + F LK A Sbjct: 624 PALVVFQTDGAPDAKTPANQ---ALADAAENHPGVHFAFVAFGDPENKAFDYLRKLKT-A 679 Query: 338 SPDRFYSVQNSRKLHDA 354 + F + + R+L DA Sbjct: 680 TTSHFLAGETPRELTDA 696 >gi|237748397|ref|ZP_04578877.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] gi|229379759|gb|EEO29850.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] Length = 736 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/141 (20%), Positives = 55/141 (39%), Gaps = 9/141 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-KSIPDVNNVVRSGLVTFSSKIVQ 229 + +VLDVS SM G +L + ++ ++D I V LV F S + Sbjct: 151 LSLVLDVSNSMYAGLDNGNTRLASSVEALNNLIDSIAAQTAQGGITVNIQLVGFHSSAFE 210 Query: 230 TFPLA---WGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFD--AKEKLEHIAKGHDD 282 L+ V ++ +N L S + A+ ++ A++ +G D Sbjct: 211 NSWLSVTQDNVNELKSYVNELANHIRDGGASNTNYQAAFEEVCQWFAQQSAVPPVQGTDV 270 Query: 283 YKKYIIFLTDGENSSPNIDNK 303 K F++DG ++ ++ Sbjct: 271 VNKVF-FISDGVPNAITVNGA 290 >gi|254452693|ref|ZP_05066130.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198267099|gb|EDY91369.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 173 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 67/177 (37%), Gaps = 24/177 (13%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F + G++ I + +++ +I G+ ++ +AKL LD + L A Sbjct: 11 FQRFRNDEDGALIIFSLMMMIMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA----- 65 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + + K + E + ++++ Sbjct: 66 --DLDQIMPPADVVRDYLDKAGMLHFLQGEPTVSQGINYRVVSAQASAP----------- 112 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 A+ Y++P IF T P+ + + S+ + + +++ +VLDVS SMN Sbjct: 113 --MALFFYDLPRIF-TSPFSPGMTAINVSGASTAE---ERVTDVEVSLVLDVSSSMN 163 >gi|167524114|ref|XP_001746393.1| hypothetical protein [Monosiga brevicollis MX1] gi|163775155|gb|EDQ88780.1| predicted protein [Monosiga brevicollis MX1] Length = 1705 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 65/163 (39%), Gaps = 21/163 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++VLD+S S++ M ++ A I S+P +VR+ LV Sbjct: 762 DVVLVLDMSGSVDVADYNRMLQVARA---------TINSLPINEGLVRAALVLLKGTPSS 812 Query: 230 TFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 LA G + + + ++ L++ ST S AY +E + A G+ + + Sbjct: 813 PVSLAQGTSREALLDGVDNLVYSSTRPSAAATTLAYV-----RETVLTAANGYRGGQATV 867 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRR--GAIVYAIGVQAEA 328 + TDG++ P + + + G + +G + Sbjct: 868 LLFTDGDSQEP---FTQVEAEAQQLRALPGGVKIATVGFPVDP 907 >gi|313230659|emb|CBY18875.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 80/202 (39%), Gaps = 24/202 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D S S+ + + I + D I PD N R ++ +SS + + Sbjct: 220 DLLILVDESTSIGAENFEHVKR--TLGLMIDNLCDGIS--PDTN---RVAMLRYSSDVKE 272 Query: 230 TFPLAWGVQ--HIQEKINRLIFG-------STTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 G + I RL + +T + ++ A IF ++ + Sbjct: 273 DLNFIEGSNEPTVMRNIQRLKYKPITDDRHGSTYTAHAMDKALKTIFTSEAGWRNGTTED 332 Query: 281 DDYKKY-IIFLTDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAADQFLK-NC 336 + ++ +TDGE++ P D ++ + K G VYA+GV D+ + Sbjct: 333 GIKVRTEVVIITDGESNDP--DETFTIQG-QKVKYDEYGIKVYALGVGDIKKDEIRQLTS 389 Query: 337 ASPDRFYSVQNSRKLHDAFLRI 358 + + + + + L AF RI Sbjct: 390 MDDESIFYLMSWKDL-AAFNRI 410 >gi|223462563|gb|AAI50654.1| Von Willebrand factor A domain containing 3B [Homo sapiens] Length = 1294 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 57/171 (33%), Gaps = 32/171 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 562 EVNEDNLEQAQSWIRDIKIGSSTNTLSALKTAFA----DKETQA------------IYLL 605 Query: 291 TDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P ++ KR + +Y I A++FLK A+ Sbjct: 606 TDGRPDQP------PETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|221040994|dbj|BAH12174.1| unnamed protein product [Homo sapiens] Length = 951 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 57/171 (33%), Gaps = 32/171 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 166 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 218 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 219 EVNEDNLEQAQSWIRDIKIGSSTNTLSALKTAFA----DKETQA------------IYLL 262 Query: 291 TDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P ++ KR + +Y I A++FLK A+ Sbjct: 263 TDGRPDQP------PETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 307 >gi|119622317|gb|EAX01912.1| hypothetical protein MGC26733, isoform CRA_a [Homo sapiens] Length = 1080 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 57/171 (33%), Gaps = 32/171 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 562 EVNEDNLEQAQSWIRDIKIGSSTNTLSALKTAFA----DKETQA------------IYLL 605 Query: 291 TDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P ++ KR + +Y I A++FLK A+ Sbjct: 606 TDGRPDQP------PETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|118918435|ref|NP_659429.4| von Willebrand factor A domain-containing protein 3B [Homo sapiens] gi|296439299|sp|Q502W6|VWA3B_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 3B Length = 1294 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 57/171 (33%), Gaps = 32/171 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 562 EVNEDNLEQAQSWIRDIKIGSSTNTLSALKTAFA----DKETQA------------IYLL 605 Query: 291 TDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P ++ KR + +Y I A++FLK A+ Sbjct: 606 TDGRPDQP------PETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|30268323|emb|CAD89964.1| hypothetical protein [Homo sapiens] Length = 1060 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 57/171 (33%), Gaps = 32/171 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 562 EVNEDNLEQAQSWIRDIKIGSSTNTLSALKTAFA----DKETQA------------IYLL 605 Query: 291 TDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P ++ KR + +Y I A++FLK A+ Sbjct: 606 TDGRPDQP------PETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|158312232|ref|YP_001504740.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158107637|gb|ABW09834.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 428 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 73/256 (28%), Gaps = 41/256 (16%) Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + +P + + ++++LD S SM Sbjct: 4 FTAKVYQNEFLPVGGTQVHAVITVTSTGAPAAPPIAGRPTGRPEQALVILLDCSGSMA-- 61 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VR---SGLVTFSSKIVQTFPLAWGVQH 239 K+ A R++R LD S+PD VR S + + A Sbjct: 62 --NPPAKVTQARRAVRAALD---SLPDGAWFAVVRGTGSAAMAYPRSPELVPASAATRAA 116 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-- 297 ++ L T L A + + + + H + LTDG+N Sbjct: 117 ACHVVDALEPHGGTAMGRWLRLANDLLATRPDAIGHA-----------LLLTDGQNGEME 165 Query: 298 ----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKL 351 +D + F C+ GV + + L+ A+ +V L Sbjct: 166 SELLGAVDACQGRFQCDCR----------GVGTDWRVEELRAIATGMLGTVDAVPEPAGL 215 Query: 352 HDAFLRIGKEMVKQRI 367 F RI + + Sbjct: 216 AAEFERIVATALDRAT 231 >gi|326433400|gb|EGD78970.1| hypothetical protein PTSG_11807 [Salpingoeca sp. ATCC 50818] Length = 2673 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 75/209 (35%), Gaps = 27/209 (12%) Query: 161 ISSKSDIGLDMMMVLDVSLS-----MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 I+ + LD++ +LD S S + G D++ + I + Sbjct: 649 IADQCRQNLDLVFLLDGSGSIESTALGGAPGTFQDRVLAFVSQVTTYFTI------GEHD 702 Query: 216 VRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEK 272 R + TF+S L + +++ I + T ++ GL I Sbjct: 703 TRVAVATFASGATVNIRLNDHFDGDALRDAIADIPYPQGQTYTSLGLRAVRQDIL----T 758 Query: 273 LEHIAKGHDD-YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + + + ++ LTDG NS P+ D + + V+AIGV + + Sbjct: 759 EANGMRPASEGVPRVLVVLTDG-NSQPSYDPA---TEASILHDQNVNVFAIGVGSSISQS 814 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 L++ AS V N +F IG Sbjct: 815 QLEDIASDPDARHVFN----LRSFSLIGD 839 >gi|301627727|ref|XP_002943021.1| PREDICTED: complement factor B-like, partial [Xenopus (Silurana) tropicalis] Length = 705 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 82/223 (36%), Gaps = 37/223 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+ D +++ +VLD S S+ G DK A + ++ + S Sbjct: 231 KVQILKDGLMNIFIVLDTSKSV------GKDKFNEAKEASILFIEKVSSYDIKPQYC--- 281 Query: 220 LVTFSSKIVQTFPL----AWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAK 270 +++++S+ + L + + E + + T + L Y ++ + + Sbjct: 282 IISYASEAIPVVSLRDQDSKNADAVIEHLENFVYDSHADKQGTNTRAALHSIYQQLIE-Q 340 Query: 271 EKLEHIAKGHDDYKK---YIIFLTDGENSSPNIDNKESLFYC----------NEAKRRGA 317 E + + + K I+ +TDG+ + D +E + ++ + Sbjct: 341 ELVYKNNNNKESFMKIHNVILLMTDGKFNMGG-DPREEMKVIRRFLNVGTSKDDLREEYL 399 Query: 318 IVYAIGVQAEAADQFLKNCASPD----RFYSVQNSRKLHDAFL 356 VY G+ ++ + + AS + ++N K+ + F Sbjct: 400 DVYVFGLGSDIDQPEINDLASKKDKEVHTFHLENVDKMKEFFE 442 >gi|209516840|ref|ZP_03265690.1| von Willebrand factor type A [Burkholderia sp. H160] gi|209502656|gb|EEA02662.1| von Willebrand factor type A [Burkholderia sp. H160] Length = 328 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 71/219 (32%), Gaps = 35/219 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + G +++++D S SM++ G + + + + R L+ F Sbjct: 80 TGSGAQILILMDRSASMDEPMGSKGVEAPRGDSKNQVARAALTRFVEQRPNDRLALMMFG 139 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGST---TKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + V P + + I+ + G T+ G+ A + FD + A Sbjct: 140 TNPVLAMPFTYNHRVIEAAVAATAIGRGMPDTELDRGMLAAIAQ-FDGRLSSGRRA---- 194 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI----------------GVQ 325 I+ ++DG +D +R +Y I + Sbjct: 195 -----IVLVSDG---GALLDEPMQHRIEAGLRRDQIALYFIYLRSSIYSPDLNATLPASE 246 Query: 326 AEAADQFLK---NCASPDRFYSVQNSRKLHDAFLRIGKE 361 A A Q + +P R + ++ + + A I ++ Sbjct: 247 ASAEAQLHRFFLTLRTPYRLFQAEDPKAMMAAIAEINRQ 285 >gi|148694797|gb|EDL26744.1| complement factor B, isoform CRA_g [Mus musculus] Length = 541 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 41/225 (18%), Positives = 80/225 (35%), Gaps = 34/225 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +VLD S S+ A R + +++ + S R GL+T++ Sbjct: 42 KGGSMNIYLVLDGSDSIGSS------NFTGAKRCLTNLIEKVASYGVRP---RYGLLTYA 92 Query: 225 SKIVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + ++ + EK+N++ S T + L+ Y+ + A + Sbjct: 93 TVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAP-- 150 Query: 276 IAKGHDDYKKYIIFLTDG-ENSSPN-----IDNKESLFYCNEAKR---RGAIVYAIGVQA 326 +G + + II +TDG N N D + L + K VY GV Sbjct: 151 -PEGWNRTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGP 209 Query: 327 EAADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS + V++ L + F ++ E + Sbjct: 210 LVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 254 >gi|19703911|ref|NP_603473.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] gi|19714079|gb|AAL94772.1| hypothetical protein FN0576 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] Length = 369 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 74/216 (34%), Gaps = 22/216 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + ++ +++ +VLD S SM G + +A SI+++L ++ + G+ F Sbjct: 11 EENMNVNVEIVLDASGSMVKKIGDK-TMMEIAKESIKKVL------SEMPANAKVGIRVF 63 Query: 224 ---SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE-YAYNKIFDAKEKLEHIAK- 278 + G + I L K+ ++ + I + E K Sbjct: 64 GHKGDNTASKKDESCGANELIYPIGDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDLKA 123 Query: 279 -GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAAD-QFLK 334 + + +TDG + + + K ++ IG +A + LK Sbjct: 124 LDGEKTLNILYIITDGIETCGG----NPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLK 179 Query: 335 NC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A+ + SV ++ KL RI + Sbjct: 180 QIADAAGGYYSSVNDANKLTGELYRINELAFSDYKW 215 >gi|313240178|emb|CBY32528.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 80/202 (39%), Gaps = 24/202 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D S S+ + + I + D I PD N R ++ +SS + + Sbjct: 220 DLLILVDESTSIGAENFEHVKR--TLGLMIDNLCDGIS--PDTN---RVAMLRYSSDVKE 272 Query: 230 TFPLAWGVQ--HIQEKINRLIFG-------STTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 G + I RL + +T + ++ A IF ++ + Sbjct: 273 DLNFIEGSNEPKVMRNIQRLKYKPITDDRHGSTYTAHAMDKALKTIFTSEAGWRNGTTED 332 Query: 281 DDYKKY-IIFLTDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAADQFLK-NC 336 + ++ +TDGE++ P D ++ + K G VYA+GV D+ + Sbjct: 333 GIKVRTEVVIITDGESNDP--DETFTIQG-QKVKYDEYGIKVYALGVGDIKKDEIRQLTS 389 Query: 337 ASPDRFYSVQNSRKLHDAFLRI 358 + + + + + L AF RI Sbjct: 390 MDDESIFYLMSWKDL-AAFNRI 410 >gi|221121786|ref|XP_002165500.1| PREDICTED: similar to procollagen, type XIV, alpha 1 [Hydra magnipapillata] Length = 3126 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 45/297 (15%), Positives = 92/297 (30%), Gaps = 38/297 (12%) Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENG--FAQDINNIERSTSLSI 116 + K++ + N + K F++ + E + ++ T ++ Sbjct: 648 ASRKLMPKNNTADKIADKAAAKAVDSLKAVTELFKSAISEKAPPGKYEKPVVKNDTVVNF 707 Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 + DD+ + P S + KI S LD+ V+D Sbjct: 708 VKDDEPTKSEVGQTHSRFFPL------GALVSKGSKPPCICPDKICSSK---LDLAFVID 758 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--- 233 S G DK+ R + K + ++ GLVT+++ Sbjct: 759 AS---AGSEQNGKDKMAETMEFGRRVASAFKVDQENSH---LGLVTYATDAQIMLNFHHF 812 Query: 234 -AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 R+ + + L A +FD + D +I +TD Sbjct: 813 NDPDTLTEARDAVRVKPHTGKYTGQALSLAKEGLFDKGHR--------SDALDVLILMTD 864 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFYSVQ 346 G +S + +L + G + A+G+ + + L + AS + ++ Sbjct: 865 GPSSDDVTEPSRAL------RDMGVKIIAVGIGNQIDRKQLNDIASDPDDEHVFTAD 915 >gi|260808845|ref|XP_002599217.1| hypothetical protein BRAFLDRAFT_64431 [Branchiostoma floridae] gi|229284494|gb|EEN55229.1| hypothetical protein BRAFLDRAFT_64431 [Branchiostoma floridae] Length = 600 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 62/185 (33%), Gaps = 23/185 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ VLD S S+ D + A ++ + G++ ++ ++ Sbjct: 222 DIVFVLDYSGSIPDSEFVKIKNFVAA---------LVDRFQVGVLDAQIGVIRYNHAVIH 272 Query: 230 TFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + ++ + +T + Y A L D + Sbjct: 273 EFHLNTHDNKADVLSDVSAMPTATTGGTNTAAALTYV----ASTMLLPGNGNRPDAPDVV 328 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-PDRFYSVQ 346 I LTDG +S + + G +AIGV A A L AS PD Y + Sbjct: 329 IVLTDGYSSG-------VVGPASVLHGMGVQTFAIGVGACANSAQLTQIASCPDYIYRLP 381 Query: 347 NSRKL 351 + L Sbjct: 382 DFSAL 386 >gi|84387243|ref|ZP_00990264.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] gi|84377890|gb|EAP94752.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] Length = 421 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 43/122 (35%), Gaps = 13/122 (10%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G ++++ L V V L ++ +H K +L +D + L AT + N ++ + Sbjct: 10 KKQQGLVAVMITAALLVFLAVSALAVDINHMVVNKTRLQNAVDSATLAAATILDNSKDKD 69 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 + N + + Q+I+ S S+ D + + Sbjct: 70 AVDAE-------------VGTALNAMAASTGNQEIDFSTASISIDYSNDPKDFTGTATFD 116 Query: 131 SR 132 S Sbjct: 117 ST 118 >gi|327266508|ref|XP_003218047.1| PREDICTED: complement factor B-like [Anolis carolinensis] Length = 767 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 77/242 (31%), Gaps = 37/242 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + T KI + D L++ +VLD S S D+ A ++++ I S Sbjct: 241 VSTTGKRKIKIEKDGSLNIYIVLDASRS------IKKDQFKHAQNMSIKLIEKISSYDIS 294 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS---------TTKSTPGLEYAY 263 R ++TF++++ + + + G T GL Y Sbjct: 295 P---RYAVITFATEVKELVRTTDDQSTDASWVIEKLEGMKYTEHKQKPGTNIQKGLSSVY 351 Query: 264 NKIFDAKEKLEHIAKGHDDYKK----YIIFLTDGENSSPNIDNKESLFYCNEAKRRG--- 316 + + + + I+ L+DG+ + D + E G Sbjct: 352 SMMITQQAAERRRGLNPPPVSEKTRHVIVLLSDGDYNMGG-DPIRVIRQIREFLNIGRNR 410 Query: 317 ---------AIVYAIG--VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 V+A+G V E ++ + + +++ L AF + E Sbjct: 411 THPREDFLDVYVFAVGGTVVMENVNKIASQKSGERHAFKIKDYSDLQLAFEEMIDESETL 470 Query: 366 RI 367 + Sbjct: 471 SM 472 >gi|322688246|ref|YP_004207980.1| hypothetical protein BLIF_0055 [Bifidobacterium longum subsp. infantis 157F] gi|320459582|dbj|BAJ70202.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 362 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 62/187 (33%), Gaps = 15/187 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 V+D S SM+ G+ K + + KS + + + L+ F ++ + Sbjct: 186 WVVDYSGSMSGEGKNGVVK---GLNAALDPDQAKKSYIEPASGDVNILIPFETEAHRPVK 242 Query: 233 LA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + T GL A +++ E ++ I+ +T Sbjct: 243 ATGTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 294 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DG N D+++ +++ R +++I Q + S L Sbjct: 295 DG---RSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDL 351 Query: 352 HDAFLRI 358 F ++ Sbjct: 352 AAVFRQV 358 >gi|27806781|ref|NP_776396.1| vitrin precursor [Bos taurus] gi|75054534|sp|Q95LI2|VITRN_BOVIN RecName: Full=Vitrin; Flags: Precursor gi|17941422|gb|AAL18262.2| vitrin [Bos taurus] Length = 652 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 67/202 (33%), Gaps = 37/202 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + + R G + ++ + Sbjct: 469 DIGFVIDGSSSV------GTSNFRTVLQFVANLSREFEISD---MDTRIGAMQYTYEQR- 518 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + + YA ++F K + Sbjct: 519 ---LEFGFDEYSTKSDVLNAIKRVGYWSGGTSTGAAIHYALEQLF---------KKSKPN 566 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--D 340 +K +I +TDG + + A +G I YAIGV A D+ P D Sbjct: 567 KRKLMILITDGRSYD------DIRIPAMLAHHKGVITYAIGVAWAAQDELDIIATHPARD 620 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 + V L+ ++ + + Sbjct: 621 HAFFVDEFDNLYKVVPKVIQNI 642 >gi|227547429|ref|ZP_03977478.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212076|gb|EEI79972.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 362 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 62/187 (33%), Gaps = 15/187 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 V+D S SM+ G+ K + + KS + + + L+ F ++ + Sbjct: 186 WVVDYSGSMSGEGKNGVVK---GLNAALDPDQAKKSYIEPASGDVNILIPFETEAHRPVK 242 Query: 233 LA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + T GL A +++ E ++ I+ +T Sbjct: 243 ATGTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 294 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DG N D+++ +++ R +++I Q + S L Sbjct: 295 DG---RSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDL 351 Query: 352 HDAFLRI 358 F ++ Sbjct: 352 AAVFRQV 358 >gi|257415703|ref|ZP_05592697.1| von Willebrand factor [Enterococcus faecalis AR01/DG] gi|257157531|gb|EEU87491.1| von Willebrand factor [Enterococcus faecalis ARO1/DG] Length = 1154 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|256965511|ref|ZP_05569682.1| von Willebrand factor [Enterococcus faecalis HIP11704] gi|256956007|gb|EEU72639.1| von Willebrand factor [Enterococcus faecalis HIP11704] Length = 1154 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|260796039|ref|XP_002593012.1| hypothetical protein BRAFLDRAFT_201563 [Branchiostoma floridae] gi|229278236|gb|EEN49023.1| hypothetical protein BRAFLDRAFT_201563 [Branchiostoma floridae] Length = 862 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 50/143 (34%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D S SMN G L VA ++ L ++ + R LV+F Sbjct: 4 ILFVIDTSASMNQRTYMGTTLLDVAKGAVETFL-KLRQRDPGSRADRYMLVSFEDPP-AA 61 Query: 231 FPLAWGVQHIQ--EKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W H+ ++ L T L+ A+ N++ + Sbjct: 62 IKAGWKENHVAFMNELKNLQATGMTTMGQALKQAFDLLNLNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 ++ +TDG + + +E L Sbjct: 122 PAMVVTITDGSKLTSSSGVQEEL 144 >gi|297201072|ref|ZP_06918469.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] gi|197712143|gb|EDY56177.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] Length = 516 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 70/188 (37%), Gaps = 24/188 (12%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +VLD S SM ++ + +++ E + ++ + +V FS+++ T Sbjct: 336 YLVLDRSASMRPYY------KDGSAQALGE--QTLALAAHLDPESKVHVVFFSTELDGTG 387 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 L G+ + KI+ L G A E L H K D ++F T Sbjct: 388 EL--GLTDHENKIDELHAGLGRMGRTSYHAAVE------EVLAHHGKNAPDTPALVVFQT 439 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGVQAEAADQF--LK--NCASPDRFYSVQ 346 DG +P+ + AK + ++ + F L+ A+ F + + Sbjct: 440 DG---APDAKTPATQALTEAAKTHPNVFFSFVAFGDPENKAFDYLRKLKLANTSHFLAGE 496 Query: 347 NSRKLHDA 354 ++L DA Sbjct: 497 TPKELTDA 504 >gi|182676519|sp|P0C6B8|SVEP1_RAT RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; Flags: Precursor Length = 3564 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 80/227 (35%), Gaps = 40/227 (17%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 S V+ + L+++ ++D S S+ +++L +R++L P Sbjct: 65 LGRTFRSRVRRLRELSDRLELVFLVDESSSVGQT--NFLNELK----FVRKLL---SDFP 115 Query: 211 DVNNVVRSGLVTFSSKIVQ-----TFPLAWGVQH----IQEKINRLIF-GSTTKSTPGLE 260 V+ R +VTFSSK + QH + +I + + G T + + Sbjct: 116 VVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTMGAFQ 175 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + ++E K I +TDG ++ + + G ++ Sbjct: 176 QAAQILRHSRENS----------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIF 220 Query: 321 AIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 G+ + ++P + Y + + + F + + + + Sbjct: 221 TFGIWQGNIRELNDMASTPKEEHCYLLHSFEE----FEALARRALHE 263 >gi|293347389|ref|XP_002726583.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Rattus norvegicus] Length = 3578 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 80/227 (35%), Gaps = 40/227 (17%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 S V+ + L+++ ++D S S+ +++L +R++L P Sbjct: 65 LGRTFRSRVRRLRELSDRLELVFLVDESSSVGQT--NFLNELK----FVRKLL---SDFP 115 Query: 211 DVNNVVRSGLVTFSSKIVQ-----TFPLAWGVQH----IQEKINRLIF-GSTTKSTPGLE 260 V+ R +VTFSSK + QH + +I + + G T + + Sbjct: 116 VVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTMGAFQ 175 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + ++E K I +TDG ++ + + G ++ Sbjct: 176 QAAQILRHSRENS----------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIF 220 Query: 321 AIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 G+ + ++P + Y + + + F + + + + Sbjct: 221 TFGIWQGNIRELNDMASTPKEEHCYLLHSFEE----FEALARRALHE 263 >gi|109474969|ref|XP_001065678.1| PREDICTED: polydom [Rattus norvegicus] Length = 3583 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 80/227 (35%), Gaps = 40/227 (17%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 S V+ + L+++ ++D S S+ +++L +R++L P Sbjct: 65 LGRTFRSRVRRLRELSDRLELVFLVDESSSVGQT--NFLNELK----FVRKLL---SDFP 115 Query: 211 DVNNVVRSGLVTFSSKIVQ-----TFPLAWGVQH----IQEKINRLIF-GSTTKSTPGLE 260 V+ R +VTFSSK + QH + +I + + G T + + Sbjct: 116 VVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTMGAFQ 175 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + ++E K I +TDG ++ + + G ++ Sbjct: 176 QAAQILRHSRENS----------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIF 220 Query: 321 AIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 G+ + ++P + Y + + + F + + + + Sbjct: 221 TFGIWQGNIRELNDMASTPKEEHCYLLHSFEE----FEALARRALHE 263 >gi|315172113|gb|EFU16130.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1342] Length = 1103 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|288940556|ref|YP_003442796.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288895928|gb|ADC61764.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 610 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 70/210 (33%), Gaps = 37/210 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ +++DVS SM + L + L +I G+ F+ + Sbjct: 27 APADVRLLIDVSGSMRQNDP---RNLRAPALQLVNELIPAGAIA--------GVWLFAEQ 75 Query: 227 IVQTFPL-----AWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P AW + + ++ R+ T + A K Sbjct: 76 TEVLIPPAPVDDAW-KKRLAGRLARIHSRGLFTDIERAIRTATEDWT----------KTP 124 Query: 281 DDYKKYIIFLTDG-------ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + ++++I TDG E S + K G V+AIG+ + + + Sbjct: 125 PEGERHLILFTDGLVDVSKDEAESAASRERILSEQIESLKSEGVKVHAIGLSDQIDEPLM 184 Query: 334 KNCA--SPDRFYSVQNSRKLHDAFLRIGKE 361 + A + Q++ L FLR+ ++ Sbjct: 185 RLLATQTDGWLEVAQDAETLQRLFLRVLEQ 214 >gi|261194779|ref|XP_002623794.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] gi|239588332|gb|EEQ70975.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] Length = 756 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 73/200 (36%), Gaps = 22/200 (11%) Query: 170 DMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTF 223 D+++ +D+S SM+ P D G + +LD+ K I +N+ R G+V F Sbjct: 75 DIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDNDRLGVVAF 134 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S+ + ++ + + + L S+T GL+ + + E Sbjct: 135 STDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEAL------EEVTPIP 188 Query: 280 HDDYKKYIIFLTDGENSS--PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + YI LTDG + P L + K R +++ G L+ + Sbjct: 189 QNVQALYI--LTDGMPNHMCPRQGYVPKLRSILQQKDRLPMIHTFGFGYYIRSGLLQAIS 246 Query: 338 --SPDRFYSVQNSRKLHDAF 355 + + ++ + F Sbjct: 247 EVGGGTYSFIPDAGMIGTVF 266 >gi|108758859|ref|YP_629631.1| hypothetical protein MXAN_1374 [Myxococcus xanthus DK 1622] gi|108462739|gb|ABF87924.1| hypothetical protein MXAN_1374 [Myxococcus xanthus DK 1622] Length = 424 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 66/177 (37%), Gaps = 26/177 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + + ++++ VLD + SM+ K+ SI + + P + GLV Sbjct: 75 AAARPEIEVVFVLDTTGSMSGLLEGAKRKI----YSIASRIAQGRPTPHLK----VGLVA 126 Query: 223 F----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS----TPGLEYAYNKIFDAKEKLE 274 + + + F L+ + + + + G + GL A +K+ +K Sbjct: 127 YRDVGDDYVTKRFDLSDDLDTVFANLRKFEAGGGGDTPEHVGRGLGEAVSKLSWSK---- 182 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + K I + D + N D+ + + +A+ + +V + +A + Sbjct: 183 -----NREVMKAIFLVGDAPPAQRN-DDWDFKHWAKKAREKHIVVNTVRCGGDAETE 233 >gi|115930656|ref|XP_001179084.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 245 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 50/171 (29%), Gaps = 26/171 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + +++++D S SM H L D K N+VR + Sbjct: 2 SSIQVIVLVDTSGSMVTHMEDLKKDLVALI------WDQFKRENISFNIVRFSADIEPWR 55 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P ++ + T + L A+ + Sbjct: 56 PHIVEPTDANCNDAVRWVSSFVPAGNTCTLEALSEAFREKDVDA---------------- 99 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKN 335 I LTDG+ S + N + G V+ I E+A+ FL+ Sbjct: 100 IYLLTDGKPDSSTSKVFREIAQVNTVR--GVKVHTISFNCNDESANTFLRQ 148 >gi|312899566|ref|ZP_07758892.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] gi|311293245|gb|EFQ71801.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] Length = 1103 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|257092459|ref|YP_003166100.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257044983|gb|ACV34171.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 769 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 68/186 (36%), Gaps = 41/186 (22%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPG-----------MDKLGVATRSIREMLDIIKSIPDVNN 214 L ++++LD+S S+ D G G + L + + + L I D + Sbjct: 576 GRNLAVLVLLDLSQSLGDKAGDGQQTVLELSQEAVSLLAWSIEQLGDALAIAGFHSDTRH 635 Query: 215 VVR-SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 VR L +S + WG ++ ++ + G +T+ + +A Sbjct: 636 DVRYLHLKGYSER--------WG-DSVKSRLAAMQAGYSTRIGGAIRHA----------- 675 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-------RGAIVYAIGVQA 326 H KK ++ LTDG S +D ++ +A++ RG Y I + Sbjct: 676 AHYLAAQKAEKKLLLILTDGRPSD--VDVQDERLLVEDARKAVGEVGRRGIFTYCISLDR 733 Query: 327 EAADQF 332 +A Sbjct: 734 QADAYV 739 >gi|257086444|ref|ZP_05580805.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] gi|256994474|gb|EEU81776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] Length = 1154 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|257085650|ref|ZP_05580011.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] gi|256993680|gb|EEU80982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] Length = 1154 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|288556553|ref|YP_003428488.1| hypothetical protein BpOF4_17775 [Bacillus pseudofirmus OF4] gi|288547713|gb|ADC51596.1| hypothetical protein BpOF4_17775 [Bacillus pseudofirmus OF4] Length = 459 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 72/202 (35%), Gaps = 31/202 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + L++ ++LD S SM D G+DK+ +A +I+ ++ + +V V G V S Sbjct: 157 EEKELNVEILLDASGSMRDEVD-GVDKMTLAREAIQGFVEELPDQANVALRVY-GHVGES 214 Query: 225 --------SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 ++ P + Q+ I+ ++ T L A D Sbjct: 215 PEKSCEGIDRVYDLQP--YDESSFQDAIDGVMANGWT----PLAKAIEVTSDDYRNASKD 268 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAEAA-DQFL 333 A ++ ++DG ++ + + + + IG A Q L Sbjct: 269 ATN------MVLVVSDGMDTCGG----DPVQAVKDLSELDVTPLISIIGFDVPANEQQQL 318 Query: 334 KNC--ASPDRFYSVQNSRKLHD 353 + AS F +V + +L Sbjct: 319 REMAQASGSAFATVNDQAQLSA 340 >gi|238060728|ref|ZP_04605437.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237882539|gb|EEP71367.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 580 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 42/228 (18%), Positives = 80/228 (35%), Gaps = 37/228 (16%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM--DKLGVATRSIREMLDIIKSI 209 P+ I +V S + M+ V+DVS SM + V + L++ Sbjct: 363 PVAIDRAVSSWSIATQSGRMLCVIDVSGSMKQPVPSANNATREQVTVAAASRGLNLFDDS 422 Query: 210 PDVNNVVRSGLVTFSSKIVQTFP---------LAWGVQHIQEKINRLIFGST-TKSTPGL 259 GL TFS+++V T L+ +++ + + S T + Sbjct: 423 WS------IGLWTFSTELVGTLDYRELVPINLLSSNRSRLEQGLATIRPSSGDTGLYDTM 476 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG--- 316 AY + + E + ++ TDG+N N +++ L E K+ Sbjct: 477 LAAYKTVQEDWEPGRVNS---------VVLFTDGKNEDANGISQQKLLA--ELKQAADPE 525 Query: 317 --AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF-LRIG 359 V IG+ + + L + + + ++ K+ D F I Sbjct: 526 RPVQVVIIGIGNDVSKSELDSITKVTGGGSFITEDPTKIGDIFLKAIA 573 >gi|118346233|ref|XP_977011.1| hypothetical protein TTHERM_00035110 [Tetrahymena thermophila] gi|89288362|gb|EAR86350.1| hypothetical protein TTHERM_00035110 [Tetrahymena thermophila SB210] Length = 603 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 37/276 (13%), Positives = 84/276 (30%), Gaps = 63/276 (22%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + + + + L S S S+ G+ +++LD S SM + K+ Sbjct: 43 NIENEYLVNMMISIRGQSKLASVQSKIQSEASNKGISYLILLDRSESMQVN-----QKIQ 97 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIF 249 A +S+ I+ I ++ R L+ F P Q E I ++ Sbjct: 98 NAKKSV------IELIQNLTPYDRFCLIPFGGSNGVAIPFTDSNSINKQETFEIIQNIVC 151 Query: 250 GSTTKSTPGLEYAYN--------------------------------------------K 265 T ++ A N + Sbjct: 152 KGKTDIVSVIQTAINTIKQEQIHQNTIKQEFEKTTKKSLQQSINSSLTRVSRNINVDELE 211 Query: 266 IFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + +K ++ + + + L+DGE++ + + C + + + G Sbjct: 212 LLNMSKKHQNKNSSNQIVDRTYCFVLLSDGEDNIHQNYALQRIRECIKNETLNYSINCFG 271 Query: 324 VQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 E L + A + ++Y ++ + ++D Sbjct: 272 FGIEHDGNLLSSIAQLTGGQYYYIKENESIYDYLKE 307 >gi|319649593|ref|ZP_08003749.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] gi|317398755|gb|EFV79437.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] Length = 461 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/106 (21%), Positives = 42/106 (39%), Gaps = 17/106 (16%) Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + G T+ LE AY+++ + +K+II LTDG+ + N + Sbjct: 6 SITPGGGTEIFTSLEQAYSELEEL-----------QLQRKHIILLTDGQ----SATNGDY 50 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSR 349 K + + + + +A L++ A RFY V +S Sbjct: 51 ELLIEGGKEKNITLSTVALGQDADRGLLEDLAGMGLGRFYDVTDSS 96 >gi|291382819|ref|XP_002708118.1| PREDICTED: polydom [Oryctolagus cuniculus] Length = 3569 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 73/210 (34%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +L +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDESSSVG--HANFLSELK----FVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAWGVQH---------IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + +I + + G T + + A + ++E Sbjct: 132 NVVPRVDYISSRRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQILRHSRENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K I +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|184199785|ref|YP_001853992.1| hypothetical protein KRH_01390 [Kocuria rhizophila DC2201] gi|183580015|dbj|BAG28486.1| hypothetical protein [Kocuria rhizophila DC2201] Length = 455 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 45/343 (13%), Positives = 105/343 (30%), Gaps = 34/343 (9%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTAT-KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNE 95 +T + + L D AT + + + G + R ++ + + + Sbjct: 120 DTGTRELISRALGEARDGQAERAATEQAMKRRARTEGDAAPVQDAARRLQQVAEDPAPST 179 Query: 96 LRENGFAQDINNI-ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL 154 + + + +++ + +DY + P + Sbjct: 180 APAADPSAPAYAAGAQHPAATVVTRGEWEDYRSRHADTTLVASTPGDRPASEAPTADDAA 239 Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-GMDKLGVATRSIREMLDIIKSIPDVN 213 +T ++ ++ +DVS SM P G ++ + + + P+ + Sbjct: 240 LTQALGQWQHLAEPFHALVAIDVSGSMGTKALPDGSTRMDLTKAAATT---AVGLFPEHD 296 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQ--EKINRLIFGST------------TKSTPGL 259 GL TF + + + +++ + G T T S G Sbjct: 297 A---LGLWTFERHLDGDK----DYRSVTPVRELSASVDGGTQRDQLSQDVQSLTFSPDGY 349 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI---DNKESLFYCNEAKRRG 316 Y+ A ++ H + +I L+DG N P+ D S + Sbjct: 350 TGLYDTTLAAYRQVLHDDAPGHL--RTVIVLSDGMNHDPDSIALDELLSTLKAEQDAENP 407 Query: 317 AIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 + +GV +A L+ A+ + + + + F+ Sbjct: 408 VRIITVGVSKDADATVLRQIAEATGGSSHVARTPQDIQKVFVD 450 >gi|257082947|ref|ZP_05577308.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] gi|256990977|gb|EEU78279.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] Length = 1148 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 311 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 364 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 365 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 414 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 415 -KKVIVLLTDGVPT 427 >gi|257078107|ref|ZP_05572468.1| von Willebrand factor [Enterococcus faecalis JH1] gi|256986137|gb|EEU73439.1| von Willebrand factor [Enterococcus faecalis JH1] Length = 1154 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|197295155|ref|YP_002153696.1| hypothetical protein BCAS0306 [Burkholderia cenocepacia J2315] gi|195944634|emb|CAR57238.1| putative membrane protein [Burkholderia cenocepacia J2315] Length = 423 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 12/104 (11%), Positives = 41/104 (39%), Gaps = 2/104 (1%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + +G+++I+ + L V+ +GL ++ + +++L D L A + + Sbjct: 12 RRGLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS- 70 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 + + + + + ++ ++ N +++ Sbjct: 71 -AISLSVAEADGIAAGHLNFVFFQKTSVQMSTNANVTFSDSLTN 113 >gi|33595651|ref|NP_883294.1| putative hemolysin [Bordetella parapertussis 12822] gi|33565730|emb|CAE36274.1| putative hemolysin [Bordetella parapertussis] Length = 2215 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 42/192 (21%), Positives = 83/192 (43%), Gaps = 11/192 (5%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK---LGVATRSIREMLDIIKSIPDVN 213 +K + + ++ +VLD+S SMND +G G +K L A +++ +L+ ++ D Sbjct: 1620 GGIKQNVTAGTSYNIALVLDLSDSMNDKWGSGSNKPTRLQTAKDALKALLENQLAVHDGE 1679 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + L+TF+ G+ +++ E ++ ++ G S A+++ E Sbjct: 1680 --INVSLITFNGSSSALKKSITGLTPENVDEMVD-ILMGLKASSATPYGAAFDRTTQWFE 1736 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + YK FLTDGE S+ N+++ F A V+ IG+ + + Sbjct: 1737 GQPTVDSEGKPYKNLTFFLTDGEPSTEWSYNRDNEFAELAAISD---VHGIGIGSGVSTS 1793 Query: 332 FLKNCASPDRFY 343 L + +Y Sbjct: 1794 TLNKYDNTGGYY 1805 >gi|21740064|emb|CAD39048.1| hypothetical protein [Homo sapiens] Length = 803 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 57/171 (33%), Gaps = 32/171 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 18 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 70 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 71 EVNEDNLEQAQSWIRDIKIGSSTNTLSALKTAFA----DKETQA------------IYLL 114 Query: 291 TDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P ++ KR + +Y I A++FLK A+ Sbjct: 115 TDGRPDQP------PETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 159 >gi|322436659|ref|YP_004218871.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164386|gb|ADW70091.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 316 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 46/224 (20%), Positives = 88/224 (39%), Gaps = 33/224 (14%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + + L + ++LD S S + ++ +L L+ Sbjct: 67 TQEKKLPLTIGILLDTSGSQQNVLPLEQ---QSGAEFLKTVLTPKDEAF---------LI 114 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAY-NKIFDAKEKLEHIAK 278 +F + + I+ I++ + G+ T S G ++DA H Sbjct: 115 SFDINVDLLSDYTNSPREIKRSIDKATINTGAGTGSVTGNSTPKGTLLYDAVYLAAHDKL 174 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE-AKRRGAIVYAIGVQ----------AE 327 + +K ++ LTDG + ++E+L E A++ AIVY I + + Sbjct: 175 RQEAGRKILVMLTDG----GDQGSQETLKTATEAAQKANAIVYVILIADRGFYSGGGFSF 230 Query: 328 AADQFLKNCA--SPDRFYSV-QNSRKLHDAFLRIGKEMVKQRIL 368 D+ +++ A + R +V N RKL DAF +I E+ Q +L Sbjct: 231 GGDRDMESLAHDTGGRVINVGNNGRKLEDAFDQIQDELRTQYLL 274 >gi|225310539|emb|CAQ19230.1| collagen type XXVIII alpha 1 b precursor [Danio rerio] Length = 491 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 82/223 (36%), Gaps = 21/223 (9%) Query: 146 ANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 N ITS+V +K + L++ ++D S S D+ G + + Sbjct: 16 RNGRRKSKPITSNVIPKNKDENCNLELAFLVDSSESAKDNHGQE----KSFVTDLVNHIP 71 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-W-GVQHIQEKINRLIFGS-TTKSTPGLEY 261 I+ R+ L+ +SS ++ W GV Q ++ + F T +T + Sbjct: 72 NIRLQTGQGLNFRTALLQYSSHVITEQSFKDWRGVPSFQSRVASIPFIGHGTYTTYAI-- 129 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 ++ + K I + G + N D +L +AK +G + Sbjct: 130 -------TNLTRIYLEESGPGTVKVAILMYGGASHPKNPDIFSALA---DAKNQGIKFFI 179 Query: 322 IGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +G+ + A + L+ AS V N + +I +E+ K Sbjct: 180 VGLTSAANMEKLQLLASAPASRYVHNIQD-KGVVDKIIREITK 221 >gi|194334882|ref|YP_002016742.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194312700|gb|ACF47095.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 336 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 59/196 (30%), Gaps = 29/196 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F + P + + +S D VLDVS SM D+L A Sbjct: 60 LVFVAISLLLFAFSGPRWCSGERLVKRES---FDAAFVLDVSNSMRAQDVRP-DRLTRAK 115 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 R + E+ + LV F+ PL + ++ L+ Sbjct: 116 RELVEVSRRVGRGRRS-------LVVFAGSAALQCPLTADQAVFETMLDIASPELVELQG 168 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T L A + K + + I+ +DGE+ + Sbjct: 169 TDLGGALRLAGKTLDSGKGRSSL---------QVIVMASDGEDHVGAGAAVAAELAV--- 216 Query: 313 KRRGAIVYAIGVQAEA 328 RG V+ IGV + Sbjct: 217 --RGTNVFVIGVGGQN 230 >gi|307271265|ref|ZP_07552544.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] gi|306512014|gb|EFM81005.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] Length = 1103 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|256762099|ref|ZP_05502679.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] gi|256683350|gb|EEU23045.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] Length = 1154 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|187956243|gb|AAI50691.1| Integrin alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 196 IAIVLDGSGSIEPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 246 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 247 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 300 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 301 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEA 357 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 358 HTFKVTNYSALDGLLSKLQQHIVH 381 >gi|226823202|ref|NP_032425.2| integrin alpha-E isoform 1 [Mus musculus] gi|56206400|emb|CAI24788.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 196 IAIVLDGSGSIEPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 246 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 247 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 300 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 301 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEA 357 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 358 HTFKVTNYSALDGLLSKLQQHIVH 381 >gi|27370456|ref|NP_766532.1| integrin alpha-E isoform 2 [Mus musculus] gi|26334103|dbj|BAC30769.1| unnamed protein product [Mus musculus] gi|56206399|emb|CAI24787.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S S+ A I M+ N LV + + I Sbjct: 196 IAIVLDGSGSIEPS------DFQKAKNFISTMMRNFYEKCFECNF---ALVQYGAVIQTE 246 Query: 231 FPL--AWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + K+ + TK+ +++ + IF A K + Sbjct: 247 FDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKKAL------KVM 300 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QFLKNCASP---D 340 + LTDG+ D N K +G + +AIGV + + LK AS Sbjct: 301 VVLTDGDIFG---DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEA 357 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 + V N L ++ + +V Sbjct: 358 HTFKVTNYSALDGLLSKLQQHIVH 381 >gi|89095762|ref|ZP_01168656.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89089508|gb|EAR68615.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 432 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 73/216 (33%), Gaps = 25/216 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII--KSIPDVNNVVRSG-- 219 + ++ + +D S SMN G+ K+ +A ++ +I + G Sbjct: 129 EEQQTKNISIQIDSSGSMNGQVSGGV-KMNLAKEAVENFAAGFPEDTIMTLRTYGHKGTG 187 Query: 220 -----LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ +S V + + + + T ++ Y + Sbjct: 188 DDKDKAMSCASTEVMYDANTYDQAAFKAALEKFKPSGWTPLAASIKAGYEDL-------- 239 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE--AADQF 332 K +D + + ++DG + KE+ + V+ IG + DQ Sbjct: 240 -KKKAGEDTENILYIVSDGIETCEGDPVKEAKALADS--DLNMKVHIIGFDVDDAGQDQL 296 Query: 333 LKNC-ASPDRFYSVQNSRKLHDAFLRI-GKEMVKQR 366 K A ++Y+V + +L + + G+ + R Sbjct: 297 KKTAEAGNGKYYTVNSKLELTNTLNELMGEAISSIR 332 >gi|330834066|ref|YP_004408794.1| protoporphyrin IX magnesium-chelatase [Metallosphaera cuprina Ar-4] gi|329566205|gb|AEB94310.1| protoporphyrin IX magnesium-chelatase [Metallosphaera cuprina Ar-4] Length = 607 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 76/221 (34%), Gaps = 28/221 (12%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL--SAVSRYE-----MPF 137 W+ + N + + + N+ E S S + L S + +Y + Sbjct: 345 NTQWKEELSNRVSKERDETEPNHSEISIPKSKVKGTGFGKGGLFDSILGKYRGEGLHLDL 404 Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + S+K +S L ++++LD S SM ++ +A Sbjct: 405 YASLINMALHKRSWLEPQDLSIK-GVESTGALPILLLLDSSKSMEFS-----KRISLAKS 458 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT-FPLAWGVQHIQEKINRLIFGSTTKST 256 ++ +L IK+ + GLVTFS + P+ + ++ I + T + Sbjct: 459 ILKGLL--IKAYQIRSK---VGLVTFSGFSSEYVVPITKNFKKVESSIEAVRPSGKTPIS 513 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL-TDGENS 296 L A I + I+FL +DG+ + Sbjct: 514 SALALAIQIINRETRSRRGVLP--------IVFLISDGKAN 546 >gi|327189769|gb|EGE56913.1| hypothetical protein RHECNPAF_550036 [Rhizobium etli CNPAF512] Length = 533 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 27/58 (46%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +R F+ + +G + LT I +P++ LVI+ + L +D L A ++ Sbjct: 6 VRRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAREL 63 >gi|308472999|ref|XP_003098726.1| hypothetical protein CRE_04168 [Caenorhabditis remanei] gi|308268326|gb|EFP12279.1| hypothetical protein CRE_04168 [Caenorhabditis remanei] Length = 380 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 70/209 (33%), Gaps = 25/209 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 I +++PL + S++ LD+++V+D S M + + Sbjct: 5 IILLSLVAYADTYSPLSYVDRPCGTDLSNLWLDVVLVVDNSQEMGSQ---RLHDVTSNIL 61 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWGVQHIQEKINRLIFGS 251 S+ I S R GLVT++S L + + + + + Sbjct: 62 SVFGADTRIGSNSVEPRTTRVGLVTYNSAATLNADLNQFQSFSDLRNGVISFLK--VAAN 119 Query: 252 TTKSTPGLEYAYNK-IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T S A + + + +H Y+K II + ++D + + Sbjct: 120 TKDSYLATGLAMAAQVLNVQGLRDH-------YQKVIIVYASKYSGYGDLDPQP---IAD 169 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K G + + + L++ +SP Sbjct: 170 RLKGSGVKIITVAY---GDETVLESLSSP 195 >gi|289622533|emb|CBI50802.1| unnamed protein product [Sordaria macrospora] Length = 803 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMD------KLGVATRSIREMLDIIK-----SIPDVNNVVRS 218 D+++ +DVS SM+ + +LD++K +N R Sbjct: 74 DIVLAIDVSGSMSADAPVPTTTSDDDPDQQHPEHNGLSVLDLVKHAARTIASTLNESDRL 133 Query: 219 GLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G+VTFS++ PL A + + + + S T G+ Sbjct: 134 GIVTFSTEAKVLQPLMPMTALNKKKTERNLGGMQPTSATNLWGGIVEGLKLFGKDNGGSG 193 Query: 275 HIAKGHDDYKKYIIFLTDGENSS 297 + ++ LTDG + Sbjct: 194 RVPA--------LMVLTDGMPNH 208 >gi|198437668|ref|XP_002125059.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 983 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 71/213 (33%), Gaps = 27/213 (12%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 ++ ++ +++P P P T + K +++LD S SM Sbjct: 169 NEQGVMTVYPSHKIPNCTSIDP-----RFRP-WYTETAWPKPKR-----FLILLDSSRSM 217 Query: 182 NDHFG--PGMDKLGVATRSIREML---DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 + F P +D + E L D I +I + +RS + F Sbjct: 218 ENTFNSKPMIDIARELIDILLETLRPNDKISAIGFRHEALRSQGCFRN---QLAFASETN 274 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL-TDGEN 295 + ++ + + + T + A+ + E+ K D +KY+I L +DG+ Sbjct: 275 KEKLRSFLRNITPMGESSYTVAFQSAFQLL----EQDYIKYKNKSDTEKYVILLISDGQP 330 Query: 296 SSP---NIDNKESLFYCNEAKRRGAIVYAIGVQ 325 D + N +++ + Sbjct: 331 KEAYGRMQDVYSIIEQQNLKINNSVSIFSYAIG 363 >gi|148728188|ref|NP_002199.3| integrin alpha-E precursor [Homo sapiens] gi|226694184|sp|P38570|ITAE_HUMAN RecName: Full=Integrin alpha-E; AltName: Full=HML-1 antigen; AltName: Full=Integrin alpha-IEL; AltName: Full=Mucosal lymphocyte 1 antigen; AltName: CD_antigen=CD103; Contains: RecName: Full=Integrin alpha-E light chain; Contains: RecName: Full=Integrin alpha-E heavy chain; Flags: Precursor Length = 1179 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 57/164 (34%), Gaps = 21/164 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ ++LD S S++ P A I M+ N LV + Sbjct: 199 AGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFECNF---ALVQYGGV 249 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L + + N GS TK+ +++ + IF + A Sbjct: 250 IQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------ 303 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K ++ LTDG D N K +G +AIGV E Sbjct: 304 SKVMVVLTDG---GIFEDPLNLTTVINSPKMQGVERFAIGVGEE 344 >gi|119610886|gb|EAW90480.1| integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] Length = 1196 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 57/164 (34%), Gaps = 21/164 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ ++LD S S++ P A I M+ N LV + Sbjct: 216 AGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFECNF---ALVQYGGV 266 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L + + N GS TK+ +++ + IF + A Sbjct: 267 IQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------ 320 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K ++ LTDG D N K +G +AIGV E Sbjct: 321 SKVMVVLTDG---GIFEDPLNLTTVINSPKMQGVERFAIGVGEE 361 >gi|1575519|gb|AAC47463.1| thrombospondin-related anonymous protein [Plasmodium vivax] Length = 510 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 62/181 (34%), Gaps = 30/181 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVV 216 + +D+ +++D S S+ + + K L S+ D I ++ ++ Sbjct: 1 NESVDLYLLVDGSGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELI 58 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G I + L+ + E TT T LE + D Sbjct: 59 RLGS---GQSIDKRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND-------- 103 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + + +I +TDG +S +L + K+R + IG+ QF + Sbjct: 104 RVNREKAIQLVILMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLI 159 Query: 337 A 337 A Sbjct: 160 A 160 >gi|4406708|gb|AAB59359.2| integrin alpha E precursor [Homo sapiens] gi|109659254|gb|AAI17208.1| Integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] gi|109730475|gb|AAI13437.1| Integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] gi|313883908|gb|ADR83440.1| integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1 [synthetic construct] Length = 1179 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 57/164 (34%), Gaps = 21/164 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ ++LD S S++ P A I M+ N LV + Sbjct: 199 AGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFECNF---ALVQYGGV 249 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L + + N GS TK+ +++ + IF + A Sbjct: 250 IQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------ 303 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K ++ LTDG D N K +G +AIGV E Sbjct: 304 SKVMVVLTDG---GIFEDPLNLTTVINSPKMQGVERFAIGVGEE 344 >gi|7239181|gb|AAF43107.1| HUMINAE [Homo sapiens] Length = 1127 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 57/164 (34%), Gaps = 21/164 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ ++LD S S++ P A I M+ N LV + Sbjct: 147 AGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFECNF---ALVQYGGV 197 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L + + N GS TK+ +++ + IF + A Sbjct: 198 IQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------ 251 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K ++ LTDG D N K +G +AIGV E Sbjct: 252 SKVMVVLTDG---GIFEDPLNLTTVINSPKMQGVERFAIGVGEE 292 >gi|329573764|gb|EGG55354.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1467] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|327534722|gb|AEA93556.1| von Willebrand factor type A domain protein [Enterococcus faecalis OG1RF] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|315028044|gb|EFT39976.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2137] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|307291074|ref|ZP_07570959.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] gi|306497728|gb|EFM67260.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|297685094|ref|XP_002820135.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Pongo abelii] Length = 3553 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 71/210 (33%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ G +R++L +P R +VTFSSK Sbjct: 66 RLELVFLVDDSSSV------GQINFRSELMFVRKLLSDFPVVP---TATRVAIVTFSSKN 116 Query: 228 VQTFPLAW-GVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ L+ G T + + A + A+E Sbjct: 117 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHARENS---- 172 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + +TDG ++ + + G ++ G+ + + Sbjct: 173 ------TKVVFLITDGYSNGG-----DPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAS 221 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 222 TPKEEHCYLLHSFEE----FEALARRALHE 247 >gi|262198733|ref|YP_003269942.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082080|gb|ACY18049.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 684 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 43/284 (15%), Positives = 82/284 (28%), Gaps = 88/284 (30%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-----MDKLGVATRSIREMLDIIK 207 + +T+ GLD ++VLD S S+ ++ P +D + A + Sbjct: 56 VTVTAVPNPPLDPRCGLDAVIVLDASSSVRNYNNPPDANGAVDLIAGAGNAFLGAFADTN 115 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS---------------- 251 S R +V++++ L + L G Sbjct: 116 S--------RVAVVSYNADPRLQLDLT------AVTTDSLAAGGAHGIAMGDPGGPQGPM 161 Query: 252 --------------TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG---- 293 T GL YA N + + D K +I +TDG Sbjct: 162 SPTTGYSEHARNGSGTNWEAGLVYAQNVLENNGR---------ADVPKLVIHVTDGRPTR 212 Query: 294 ---------ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDRFY 343 + + E+ ++ K G ++A+GV A + L+ + PD F Sbjct: 213 HLTPDGTVTDEGGMAVHVAEAAEVADQLKASGVHIFAVGVGRAPQFSEELQATSGPDVFD 272 Query: 344 S----------------VQNSRKLHDAFLRIGKEMVKQRILYNK 371 + +L + + ++ + K Sbjct: 273 QTQPGDAFDVVNDDVILAADFDQLEELLRGVADQICGASLTITK 316 >gi|257089500|ref|ZP_05583861.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] gi|256998312|gb|EEU84832.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] Length = 1154 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|256962322|ref|ZP_05566493.1| von Willebrand factor [Enterococcus faecalis Merz96] gi|256952818|gb|EEU69450.1| von Willebrand factor [Enterococcus faecalis Merz96] Length = 1154 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|255976231|ref|ZP_05426817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] gi|255969103|gb|EET99725.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] Length = 1154 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|332970145|gb|EGK09139.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 485 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 81/200 (40%), Gaps = 24/200 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS--GLVTFSSK 226 +++M++LD S SM D G+ K+ +A ++++E + +V+ VV G + K Sbjct: 175 VNIMILLDSSGSMADKVKGGV-KMDLAKKAVKEFASNMPEGANVSLVVYGHKGSNAGADK 233 Query: 227 ------IVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 I + +PL A+ + Q +++ T ++ A K+ Sbjct: 234 KVSCESIEEIYPLAAYDGKTFQGSLDKFQPTGWTPLAGSMKLAQEKLAS---------HT 284 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNC-- 336 + + + ++DG + D ++ NE+ A+V IG + Q LK Sbjct: 285 GSNVQNIVYVVSDGVETCDG-DPVKAAEELNES-NMKAVVNIIGFDVDNEGQKALKEVAD 342 Query: 337 ASPDRFYSVQNSRKLHDAFL 356 A + +V + L + F Sbjct: 343 AGGGTYKTVGSKVGLQEYFE 362 >gi|329894134|ref|ZP_08270119.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium IMCC3088] gi|328923306|gb|EGG30626.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium IMCC3088] Length = 460 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 77/211 (36%), Gaps = 48/211 (22%) Query: 158 SVKISSKSDIG---LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 SV S+ D +++ V+D S SM + R +++ +KS+ + Sbjct: 90 SVLPPSQQDWAAPPREVVFVIDTSGSMAG---------QSIVAARRALVESLKSLRPEDA 140 Query: 215 VVRSGLVTFSSKI----VQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 +V F+ + Q +P + + H I L T+ + DA Sbjct: 141 F---NIVEFNHEASALFAQPYPAENYALAHAIRFIRSLEADGGTEIEAAFDLTLALPTDA 197 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 ++ + IIF+TDG S + +++ E + R ++ +G+ Sbjct: 198 QKLRQ------------IIFITDG---SVSNESELLAKINRELEDR--RLFTVGIG---- 236 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 +SP+R++ + +R F I Sbjct: 237 -------SSPNRYFMEEAARAGRGTFSYIAN 260 >gi|315168359|gb|EFU12376.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1341] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|315148427|gb|EFT92443.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4244] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|307354367|ref|YP_003895418.1| Magnesium chelatase [Methanoplanus petrolearius DSM 11571] gi|307157600|gb|ADN36980.1| Magnesium chelatase [Methanoplanus petrolearius DSM 11571] Length = 642 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 42/260 (16%), Positives = 82/260 (31%), Gaps = 45/260 (17%) Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 D D + A R PF N + + + G ++ ++D S Sbjct: 403 DRNKNDIAIDATLRAAAPF---QTARDRNGLAIKVDREDLREKVREKRTGDIILFLVDAS 459 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQTFPLAWGV 237 SM ++ ++ +L+ + V GL+ F + P Sbjct: 460 GSMG-----VKKRMVAVKGAVLSLLND---AYQKRDTV--GLMIFRRKEATLLLPPTRST 509 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + + G T + G+ A + + +K I+ LTDG + Sbjct: 510 DLAHKLLKEIPTGGRTPLSEGVASAVRLLSQGRYSKSTDSKT-------IVILTDGRANY 562 Query: 298 PNI------DNKESLFYCNEAKRRGAIV----------YAIGVQAEAADQFLKNCASPDR 341 + + + +E K R +V +AI + +E +L+ Sbjct: 563 SGSGRNPYEEMRMTALAASEKKIRFVVVDTEEGFPRLDFAIALASELGATYLR------- 615 Query: 342 FYSVQNSRKLHDAFLRIGKE 361 +SRKL + I K+ Sbjct: 616 -LDELDSRKLAQSIETIVKK 634 >gi|307278884|ref|ZP_07559945.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] gi|306504433|gb|EFM73642.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|229546230|ref|ZP_04434955.1| pilus subunit protein [Enterococcus faecalis TX1322] gi|229308754|gb|EEN74741.1| pilus subunit protein [Enterococcus faecalis TX1322] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|327351891|gb|EGE80748.1| U-box domain-containing protein [Ajellomyces dermatitidis ATCC 18188] Length = 756 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 73/200 (36%), Gaps = 22/200 (11%) Query: 170 DMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTF 223 D+++ +D+S SM+ P D G + +LD+ K I +N+ R G+V F Sbjct: 75 DIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSILDLTKHAARTIIETLNDNDRLGVVAF 134 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S+ + ++ + + + L S+T GL+ + + E Sbjct: 135 STDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEAL------EEVTPIP 188 Query: 280 HDDYKKYIIFLTDGENSS--PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + YI LTDG + P L + K R +++ G L+ + Sbjct: 189 QNVQALYI--LTDGMPNHMCPRQGYVPKLRSILQQKDRLPMIHTFGFGYYIRSGLLQAIS 246 Query: 338 --SPDRFYSVQNSRKLHDAF 355 + + ++ + F Sbjct: 247 EVGGGTYSFIPDAGMIGTVF 266 >gi|297699669|ref|XP_002826908.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-E-like [Pongo abelii] Length = 1273 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 58/164 (35%), Gaps = 21/164 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ ++LD S S++ P A I M+ N LV + Sbjct: 291 AGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFECNF---ALVQYGGV 341 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L + + N GS TK+ +++ + IF + A Sbjct: 342 IQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKA------ 395 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K ++ LTDG D + N K +G +AIGV E Sbjct: 396 SKVMVVLTDG---GIFEDPLDLTTVINSPKMQGVERFAIGVGEE 436 >gi|294780961|ref|ZP_06746313.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|307268799|ref|ZP_07550167.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|307286933|ref|ZP_07567011.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|294451907|gb|EFG20357.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|306501991|gb|EFM71279.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|306514927|gb|EFM83474.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|315031654|gb|EFT43586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0017] gi|315034905|gb|EFT46837.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0027] gi|315165584|gb|EFU09601.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1302] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|256618671|ref|ZP_05475517.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] gi|256598198|gb|EEU17374.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] Length = 1154 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|71274669|ref|ZP_00650957.1| conserved hypothetical protein [Xylella fastidiosa Dixon] gi|71164401|gb|EAO14115.1| conserved hypothetical protein [Xylella fastidiosa Dixon] Length = 828 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 50/179 (27%), Gaps = 16/179 (8%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 Y P + H I + + + +D+S SM+ G G +L Sbjct: 4 YPQPASYKIRRILKGWDHDACWYPEKAAIGMQMAPSVAVYFAIDLSGSMHYVGGNGRSRL 63 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWGVQHIQEKINR 246 ++ LD + V L F L A G+ ++ + Sbjct: 64 DNMKTALNAALDQLGQSIASGTAVDIMLAGFGDAPDHRQTLLRRNCTAQGIAELKSWVAA 123 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 T Y + A + + F+TDGE P+ ++ Sbjct: 124 RQALYGT---------YFPAGTMDMPSFYAAAPSNAV-RVAFFMTDGEPDPPSATLAQA 172 >gi|156974653|ref|YP_001445560.1| hypothetical protein VIBHAR_02371 [Vibrio harveyi ATCC BAA-1116] gi|156526247|gb|ABU71333.1| hypothetical protein VIBHAR_02371 [Vibrio harveyi ATCC BAA-1116] Length = 2127 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 61/169 (36%), Gaps = 22/169 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++ ++ +++DVS SM ++L + S ++L+ +SI R ++ Sbjct: 1678 AETKESANVQLIMDVSGSMRTG-----NRLQIMKDSATQLLNQYESIGQ----TRVQIIK 1728 Query: 223 FSSKI--VQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S W V + I+ L G T + A + DA + Sbjct: 1729 YSGSATTYAIAGATWLTVDEAKAYIDTLTAGGATNYNRAINEAKDSWDDAGKLPSASNVS 1788 Query: 280 HDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQA 326 + FL+DG+ + + I++ + + + A+ Sbjct: 1789 Y--------FLSDGQPNPASSFINDARENSWIDHLTDSDNQITALAYGM 1829 >gi|324508797|gb|ADY43711.1| Vitrin [Ascaris suum] Length = 658 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 49/269 (18%), Positives = 89/269 (33%), Gaps = 50/269 (18%) Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNI--ERSTSLSIIIDDQHKDY 125 + N + + I+ + D EL E+ +I ER I D + Sbjct: 376 DLNRCGPEGDIKKNDEIQQLSVRDVLKELNESSSHTLNEDIGVERKPDAGIDNDPDINNL 435 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + E+ F P S+ +D+M ++D S S+ + F Sbjct: 436 IAETSTPEEITTNFTRIPL-----------------PIDSNCQVDLMFIIDRSESVENEF 478 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 +L A ++ M S D + VR ++F L + +E+ Sbjct: 479 ---QKQLQFAVDLVKRM-----STSDFASRVRVAAISF----YSKAKLEFSFDEFKEQSK 526 Query: 246 RLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 L G +T + G+ A ++I A ++ ++DG NS Sbjct: 527 VLEALLQIEHIGGSTSAVSGVNLAVDEIKKAGRSNARHM---------VVLISDG-NSQD 576 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 D + L + + A VYA+ V + Sbjct: 577 TWD--KVLEAADRLRAIDADVYAVTVSHD 603 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 19/132 (14%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I V S + +D++++LD S SM D F + +S D Sbjct: 187 AIAREVDEVSSCNDAIDLILLLDKSASMVDDFDSAKKFIEELVKSA--------PPGDYK 238 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKI 266 + +R LVTF+ +G + + + G T + GL A ++ Sbjct: 239 SRIRVALVTFNDNAHVD----FGFEKYSSRDDISFALERAENDGGETSAVSGLNAAMEEM 294 Query: 267 FDAKEKLEHIAK 278 + + + Sbjct: 295 KANRRNASRLVR 306 >gi|327405283|ref|YP_004346121.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327320791|gb|AEA45283.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 451 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 72/205 (35%), Gaps = 34/205 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-G-----LVTFS 224 ++ +LD S SMN ++G ++ A + + +D + ++ +R G T+ Sbjct: 26 ILFILDASNSMNANWG-AQTRIEAAKELLVKTIDSLDGSANLQIALRVYGHQSPITATYQ 84 Query: 225 SKIVQTFPLAWGVQ---HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + +G ++ KI +I TT LE A + D + Sbjct: 85 DCNDTKLEVPFGPDNFLKVRNKIRTIIAKGTTPIARSLEAAASDFPDTNARN-------- 136 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV--YAIGVQAE--AADQFLKNCA 337 II +TDG + N + + +G V + IG+ + DQF C Sbjct: 137 ----IIILITDGLEACDN----DPCVIAKKLHDKGVKVTPFVIGLGLDLTYLDQF--KCI 186 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEM 362 + + ++ + + Sbjct: 187 --GSYSEAETKEAFNNVLKTVISKA 209 >gi|313204752|ref|YP_004043409.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312444068|gb|ADQ80424.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 626 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 74/200 (37%), Gaps = 20/200 (10%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW A + + + +I S++ + + +LDVS SM+ +KL + SI+ + Sbjct: 236 PWDATHRLVKIGVKAR-EIPSENLPASNFVFLLDVSGSMD-----VPNKLELVKSSIKLL 289 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + ++ V VV +G +V Q I E ++ L G +T G++ A Sbjct: 290 TNNLRKTDRVAIVVYAGAAG----VVLESTEGTDKQKIMEAVDGLHAGGSTAGGAGIQLA 345 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y + + I+ TDG+ + N E ++ G + + Sbjct: 346 YKIAEKNFIENGNNR---------IVLCTDGDFNVGVSSNNELESLIESKRKTGVYLTVL 396 Query: 323 GV-QAEAADQFLKNCASPDR 341 G D L+ A Sbjct: 397 GYGMGNYKDNKLQILAEKGN 416 >gi|296224100|ref|XP_002757907.1| PREDICTED: vitrin [Callithrix jacchus] Length = 656 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 66/194 (34%), Gaps = 39/194 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-KSIPDVNNVVRSGLVTFSSKIV 228 D+ V+D S S+ R++ + + I K + R G V ++ + Sbjct: 473 DIGFVIDGSSSVG----------TGNFRTVLQFVTNISKEFEISDTDTRIGAVQYTYEQR 522 Query: 229 QTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 L +G K + L T + + +A ++F K Sbjct: 523 ----LEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF---------MKSKP 569 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-- 339 + +K +I +TDG + D+ K G I YAIGV A ++ P Sbjct: 570 NKRKLMILITDGR----SYDDVRIPAMAAHLK--GVITYAIGVAWAAQEELEVIATHPAR 623 Query: 340 DRFYSVQNSRKLHD 353 D + V L+ Sbjct: 624 DHSFFVDEFDNLYQ 637 >gi|269795767|ref|YP_003315222.1| hypothetical protein Sked_24760 [Sanguibacter keddieii DSM 10542] gi|269097952|gb|ACZ22388.1| conserved repeat protein [Sanguibacter keddieii DSM 10542] Length = 1516 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 63/225 (28%), Gaps = 36/225 (16%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + +S GL + +V+D+S S+ + + + +D + P Sbjct: 319 GIWQSSLANPGLPPQCGLRVALVVDLSGSVGRY-------IAAMRSAASGFVDSLTGTPS 371 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 L TF+ G+ + + + Y FD Sbjct: 372 S-----VALFTFADNAPAATGANLGLTPVSTT-------AGADTVKNRIATYTAGFDTNW 419 Query: 272 KLE-HIAKGHDDYKKYIIFLTDGENSSPNI-----------DNKESLFYCNEAKRRGAIV 319 + + LTDG + + + +F N K G V Sbjct: 420 DRGLYQVAASATPFDVAVVLTDGNPTVYAAHEGPGDLTRFREVENGIFSANAVKAEGTRV 479 Query: 320 YAIGV-----QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 A+GV A + + + +Y + + +A + Sbjct: 480 IAVGVGDGIGGAPDNLRAISGPSGGSDYYQTDDYAEAGEALRALA 524 >gi|255973237|ref|ZP_05423823.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257418922|ref|ZP_05595916.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] gi|255964255|gb|EET96731.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257160750|gb|EEU90710.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] Length = 1154 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|322690259|ref|YP_004219829.1| hypothetical protein BLLJ_0067 [Bifidobacterium longum subsp. longum JCM 1217] gi|320455115|dbj|BAJ65737.1| conserved hypothetical protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 380 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 62/187 (33%), Gaps = 15/187 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 V+D S SM+ G+ K + + KS + + + L+ F ++ + Sbjct: 204 WVVDYSGSMSGEGKNGVVK---GLNAALDPDQAKKSYIEPASGDVNILIPFETEAHRPVK 260 Query: 233 LA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + T GL A +++ E ++ I+ +T Sbjct: 261 ATGTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 312 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DG N D+++ +++ R +++I Q + S L Sbjct: 313 DG---RSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDL 369 Query: 352 HDAFLRI 358 F ++ Sbjct: 370 AAVFRQV 376 >gi|196006922|ref|XP_002113327.1| hypothetical protein TRIADDRAFT_57367 [Trichoplax adhaerens] gi|190583731|gb|EDV23801.1| hypothetical protein TRIADDRAFT_57367 [Trichoplax adhaerens] Length = 933 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 69/199 (34%), Gaps = 33/199 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++VLDVS SM G M++L A + + S G++TFSS Sbjct: 316 VLVLDVSGSMR---GKPMEQLQQAATNFLLNVAQNGSF--------VGIITFSSAASIRS 364 Query: 232 PL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L Q + + +T G++ + + Sbjct: 365 SLVQINDDADRQRLILLL-PSGASGSTSIGAGIQAGVKILKASVGNKSPSGGT------- 416 Query: 287 IIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 +I L+DG EN SP I + + + V I + A+ + C F+ Sbjct: 417 LIVLSDGRENRSPTIADVK-----KQVLDNKITVQGISFGSLASVKLQSLCYETGGLSFF 471 Query: 344 SVQ-NSRKLHDAFLRIGKE 361 ++ +L +AF I ++ Sbjct: 472 VPNDDTARLVNAFTTIAEK 490 >gi|293383944|ref|ZP_06629845.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387429|ref|ZP_06631983.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312908057|ref|ZP_07767039.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312910783|ref|ZP_07769620.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078704|gb|EFE16068.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083151|gb|EFE20114.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310625945|gb|EFQ09228.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311288927|gb|EFQ67483.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|260581678|ref|ZP_05849475.1| tellurium resistance protein [Haemophilus influenzae NT127] gi|260095271|gb|EEW79162.1| tellurium resistance protein [Haemophilus influenzae NT127] Length = 212 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 67/206 (32%), Gaps = 18/206 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + ++ ++ ++ P ++TF SK Sbjct: 3 RLPVYLLVDTSGSMMGE------AIESVRNGLQMLVSALRQDPYALETAYLSVITFDSKA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + + ++ + T L + I +K KG+ + Sbjct: 57 KQVMPLT---ELMSFQLPTIEASGLTSMGEALSLLTDCINREVQKGSAEVKGNWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 L+DG P D ++ + K A A A LK + Sbjct: 112 FLLSDGV---PTDDLQKGINALRTVKTG--TFVACAAGAGADTNVLKQITESVVSLDTAD 166 Query: 348 SRKLHDAFLRIGKEM--VKQRILYNK 371 + + F + + Q++ NK Sbjct: 167 ANSIKAFFKWVSASISVSSQKVDLNK 192 >gi|300860089|ref|ZP_07106177.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|295112663|emb|CBL31300.1| Cna protein B-type domain./von Willebrand factor type A domain. [Enterococcus sp. 7L76] gi|300850907|gb|EFK78656.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|315144603|gb|EFT88619.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2141] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|168186710|ref|ZP_02621345.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295292|gb|EDS77425.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 693 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 68/239 (28%), Gaps = 64/239 (26%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 P +D+L A ++ + + + GLV++ + + L + I I+ Sbjct: 182 EPKIDELQKAAKNFVNKFE-------IKANTKIGLVSYGNNGKEVHSLTNELDRINSSID 234 Query: 246 R-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA------------------KGHDDYKKY 286 L G T G+ A + + + ++I K + Y KY Sbjct: 235 SGLSIGGGTNVGDGIRMANGILNNGSDADKYIVLMTDGMPTAATCYNDIYYKNNRFYSKY 294 Query: 287 ------------------------------------IIFLTDGENSSPNIDNKESLFYCN 310 I + G+N NI S Sbjct: 295 GEILNGNGNPLSYFNYIGNYKYKFEYNPNDYTHEDEKIIMNYGDNDYGNIALNYSKEALK 354 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A G + IG + L A+ + + +L D + RI E+ + Sbjct: 355 RASESGVNNFVIGFSNGINREKLNGIATEGNGYYREAMHGDELTDVYKRIADEINNPVV 413 >gi|58616382|ref|YP_195512.1| tellurium resistance protein [Azoarcus sp. EbN1] gi|56315844|emb|CAI10488.1| tellurium resistance protein [Aromatoleum aromaticum EbN1] Length = 212 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 65/189 (34%), Gaps = 16/189 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM G ++ + V ++ M +++ P V ++TFS + Sbjct: 3 RLPVYLLIDCSGSM---MGEPIEAVKV---GLQTMTSALRTDPYALETVHLSVITFSQQA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q+ PL + +Q ++ + TT L I +K +KG Sbjct: 57 TQSVPLT---ELVQFQVPDIHASGTTALGEALALLAQCIERDVKKTTPDSKGDWKP--VC 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 LTDG+ + D + K + A A L+ + Sbjct: 112 FLLTDGQATD---DLNRGIAALRNVKMG--TLVACAAGPSANTAELRKITESVVSLDTAD 166 Query: 348 SRKLHDAFL 356 S L F Sbjct: 167 SNTLKAFFQ 175 >gi|46190338|ref|ZP_00121620.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440236|ref|YP_001955317.1| hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] gi|189428671|gb|ACD98819.1| Hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] Length = 380 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 62/187 (33%), Gaps = 15/187 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 V+D S SM+ G+ K + + KS + + + L+ F ++ + Sbjct: 204 WVVDYSGSMSGEGKNGVVK---GLNAALDPDQAKKSYIEPASGDVNILIPFETEAHRPVK 260 Query: 233 LA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + T GL A +++ E ++ I+ +T Sbjct: 261 ATGTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 312 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DG N D+++ +++ R +++I Q + S L Sbjct: 313 DG---RSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDL 369 Query: 352 HDAFLRI 358 F ++ Sbjct: 370 AAVFRQV 376 >gi|315160966|gb|EFU04983.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0645] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|312903619|ref|ZP_07762795.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|310632972|gb|EFQ16255.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|315577195|gb|EFU89386.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0630] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|227552873|ref|ZP_03982922.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229550416|ref|ZP_04439141.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257422992|ref|ZP_05599982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|312952910|ref|ZP_07771770.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|227178004|gb|EEI58976.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229304444|gb|EEN70440.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257164816|gb|EEU94776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|310629158|gb|EFQ12441.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|315154335|gb|EFT98351.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0031] gi|315156569|gb|EFU00586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0043] gi|315573683|gb|EFU85874.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309B] gi|315582702|gb|EFU94893.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309A] gi|323480330|gb|ADX79769.1| Endocarditis and Biofilm-Associated Pilus subunitA [Enterococcus faecalis 62] Length = 1103 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|198436180|ref|XP_002124514.1| PREDICTED: similar to von Willebrand factor A domain containing 3A [Ciona intestinalis] Length = 1107 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 88/245 (35%), Gaps = 44/245 (17%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 + DI+ R + +++ + ++ + RY + S + Sbjct: 813 SVDIDGQVRHIDMQESEMVEYEKHVVTLIRRYL---------------TRMQWLLSGSRK 857 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + ++++++DVS SM+ + ++ I E L+ K+ ++ + Sbjct: 858 MFGSILESNVVILIDVSGSMSYNMDELKKEITSL---IWEQLNGNKTAFNIVAFSNT-ST 913 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + I ++ A + ++ L + + ++ A +E Sbjct: 914 KWQDSITESNQSA--CHDAVQWVSALTAHGGSATLKAIQVA----LADEEAEA------- 960 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA-- 337 I LTDG+ P+ K +L + ++ ++ I + A+ FLK+ + Sbjct: 961 -----IYLLTDGK---PDSSIKLTLSEASNLNKKNIPIHTISFNCDNREANDFLKSLSSN 1012 Query: 338 SPDRF 342 S RF Sbjct: 1013 SGGRF 1017 >gi|90577283|ref|ZP_01233094.1| hypothetical protein VAS14_09569 [Vibrio angustum S14] gi|90440369|gb|EAS65549.1| hypothetical protein VAS14_09569 [Vibrio angustum S14] Length = 674 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 55/162 (33%), Gaps = 24/162 (14%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P T W VK+ + ++ ++V+D+S SM K Sbjct: 54 PIAILTLGWLIAVIAMAGPSWQKVKLPA-YNLSGARVLVMDMSRSM----YATDIKPNRL 108 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----S 251 T++ + LD++ + + +GLVT++ + PL + I L Sbjct: 109 TQARFKALDMLPGWKEGS----TGLVTYAGDGYEVSPLTADSHTLANLIPSLSPKIMPIP 164 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + G+ A + A II +TDG Sbjct: 165 GSNAAAGIAEAIKLLKQAGNSTGD-----------IILVTDG 195 >gi|315158311|gb|EFU02328.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0312] Length = 1103 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|307274607|ref|ZP_07555787.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] gi|306508759|gb|EFM77849.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] Length = 1103 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|148704834|gb|EDL36781.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_b [Mus musculus] Length = 574 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 354 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 412 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG- 250 +I K+ + + V F+ Q ++ + +E + ++ Sbjct: 413 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYNTKENVLAVLANI 461 Query: 251 ----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 462 RYMSGGTATGDAIAFTVRNVFGPIRD--------SPNKNFLVIVTDGQ----SYDDVRG- 508 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 509 -PAAAAHDAGITIFSVGVAWAPLDDLRDMASKP 540 >gi|29375667|ref|NP_814821.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis V583] gi|29343128|gb|AAO80891.1| von Willebrand factor type A domain protein [Enterococcus faecalis V583] Length = 1103 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 319 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 369 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 370 -KKVIVLLTDGVPT 382 >gi|325964121|ref|YP_004242027.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] gi|323470208|gb|ADX73893.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] Length = 345 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 38/103 (36%), Gaps = 6/103 (5%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 G+IS++ AIL+ + + + ++ + +A+L D S + A K G + Sbjct: 11 EGGAISVIVAILMVALLGFVAIAVDIGVIYSERAQLQNGADASAIAVAQKCARDATGVDC 70 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 S +N + D++ R S++ Sbjct: 71 STTSALASGLANRNALDGMSKVH------TIDLDKTTRKVSVT 107 >gi|197335948|ref|YP_002155278.1| hypothetical protein VFMJ11_0524 [Vibrio fischeri MJ11] gi|197317438|gb|ACH66885.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 463 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 45/313 (14%), Positives = 96/313 (30%), Gaps = 37/313 (11%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 G +IL + +PV+F V L + + KA+L + ++L + E Sbjct: 2 KLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVSAYGEEDE 61 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 K + ++ L + + + + + + K ++ Sbjct: 62 VSTQTGKDYVAHYLHDMSSLVDIKVEK-LECSELPECTADDNDRPFVEYQVSGRTK--HI 118 Query: 128 SAVSRYEMPFIFC-TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 S ++ F +F +S + +D+ +LD S SMN + Sbjct: 119 SWFPGNDVTVGFGESFDVTGSSKARKFQSSQP----------MDITFILDFSGSMNYDWE 168 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 + M + I +P G + S++ L + VQ + +++ Sbjct: 169 GH---------APSYMEEEIPKVP--------GRYSPPSRL---SDLKYVVQMVTDELQV 208 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + YN+ + KY +DG ++ K+++ Sbjct: 209 YNNSTAGPKHRVAMTGYNRRTVNESSNGKFVIRDQRITKY---NSDGYDAGDTFYPKKTI 265 Query: 307 FYCNEAKRRGAIV 319 K A V Sbjct: 266 NKQFMVKGAAARV 278 >gi|146284567|ref|YP_001165520.1| von Willebrand factor, type A [Enterobacter sp. 638] gi|145320700|gb|ABP62846.1| von Willebrand factor, type A [Enterobacter sp. 638] Length = 401 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 71/225 (31%), Gaps = 41/225 (18%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA-------------TRS 198 PL I +M+++ D S SM+ D+L R Sbjct: 184 PLPIKQICPAKRTKQQAPEMVLIFDASGSMSISMDITPDELRRLMQDRPVKNFDREPRRI 243 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWG-VQHIQEKINRLIFGSTTKS 255 + I +V + LV+ ++ ++ T L++ ++ I+ + T Sbjct: 244 SLAHRSAKQLIDEVPKDMDISLVSAATCQQVSVTPALSFAQRDELKYAIDNIQPVGKTAL 303 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 LE A D I+ +TDGE + + + K++ Sbjct: 304 AEALEKA------------GKLVDGVDRDAIIVLITDGEETCGG----DPCVVAQQLKQQ 347 Query: 316 G--AIVYAIGVQAEAADQFLKNCASP---DRFYSVQNSRKLHDAF 355 V + + A NC + Y+V N+ + ++ Sbjct: 348 KPRLQVNVVDIMNTGAG----NCIASQTGGSVYAVNNTHEFNEMM 388 >gi|296108821|ref|YP_003615770.1| Magnesium chelatase [Methanocaldococcus infernus ME] gi|295433635|gb|ADG12806.1| Magnesium chelatase [Methanocaldococcus infernus ME] Length = 283 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 72/207 (34%), Gaps = 33/207 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + I ++ V+D S SM + ++ +A +IR +L + + N R G++ Sbjct: 99 REKKISSYILFVVDTSGSMG-----ALRRMELAKGAIRSLL--VDAYQKRN---RVGMIV 148 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F P V+ ++ + + G T + AY K +I Sbjct: 149 FRKDSADLILPFTSSVELAEKSLRDVPTGGRTPLSKAFLKAYETFEKELRKNPNIIP--- 205 Query: 282 DYKKYIIFLTDGENSSP--NIDNKESLFYCNEAKRRGAIVYAIGVQAEA--------ADQ 331 ++F++D + + N KE C + +G I + E ++ Sbjct: 206 ----IMVFISDFKPNVAIKNDFIKEIYEICEKIHEKGIN--TIFIDTEPKTFIKLGIGEE 259 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRI 358 K ++Y + KL D I Sbjct: 260 LAKK--FGFKYYKIDEI-KLDDLLKEI 283 >gi|165924896|ref|ZP_02220728.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|167418907|ref|ZP_02310660.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167425152|ref|ZP_02316905.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270487722|ref|ZP_06204796.1| conserved hypothetical protein [Yersinia pestis KIM D27] gi|165923096|gb|EDR40247.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|166962901|gb|EDR58922.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167055915|gb|EDR65696.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|262360782|gb|ACY57503.1| membrane protein [Yersinia pestis D106004] gi|270336226|gb|EFA47003.1| conserved hypothetical protein [Yersinia pestis KIM D27] Length = 492 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 78/226 (34%), Gaps = 35/226 (15%) Query: 23 ILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYR 82 L+PV ++ L E SH +AKL L+ + L +T+ +ND + Sbjct: 5 ALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE----------NNYRNDRASN 54 Query: 83 IIKNIWQTDFR-NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT 141 N T + + L F+Q + + +YN S Y++ + Sbjct: 55 NRNNYLVTSYAQSYLPSERFSQ------PRVVNTYNESLGYTEYNASLQMNYQLALLNSY 108 Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG-----PGMDKLGVAT 196 + + ++ K S +D++ V D S SM+ FG + KL Sbjct: 109 LKQTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELK 168 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 ++ + I S +N + P +WG + I Sbjct: 169 AIFVKLNNRIFSNDGIN-------------TIGFVPFSWGTKRISA 201 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 28/133 (21%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + +I + T ++ G+ + ++ + K +I L+DG++ Sbjct: 350 NSKGDINEILNMKAEGGTLASSGILVGNKMLTES-----------QNNNKLMIILSDGDD 398 Query: 296 S----SPNIDNKESLF----------YCNEAKRRGAIVYAIGVQAEAADQFL---KNCAS 338 + S D K + C + K G + IG+ + + K+C Sbjct: 399 NTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVG 458 Query: 339 PDRFYSVQNSRKL 351 FY +N+ +L Sbjct: 459 TGNFYLAKNAHEL 471 >gi|159044607|ref|YP_001533401.1| hypothetical protein Dshi_2063 [Dinoroseobacter shibae DFL 12] gi|157912367|gb|ABV93800.1| conserved hypothetical protein [Dinoroseobacter shibae DFL 12] Length = 251 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 81/247 (32%), Gaps = 52/247 (21%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD--KLGVA 195 CA + P+ + + S+ D M+V D S SM++ MD ++ A Sbjct: 12 SLSLIAACAVAGLMPVPHAGAAQGCSE-----DAMVVFDGSGSMSEMGFNLMDEPRIFAA 66 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFS-------SKIVQTF-PLAWGVQHIQEKINRL 247 R+IR+++ + + + GL+ + I F P+ + +I+RL Sbjct: 67 RRAIRQVMPQVAPVRN------LGLLVYGPGPREACDNIDLRFSPIPDAAPRMIAEIDRL 120 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + T T + A + + ++ +TDG+ + + + Sbjct: 121 MPSGNTPLTASVARAAEAL------------DYRTRPGVVVLVTDGKETCGGAPCQLAAE 168 Query: 308 YCNEAKRRGAIVYAIGVQAEAA--------------DQFLKNC---ASPDRFYSVQNSRK 350 A V+ IG + Q + C + + + + + Sbjct: 169 L--AADAPALTVHVIGFKLRGEHFSWDSENQHDYRQGQTVARCLADQTGGLYLTTETVDE 226 Query: 351 LHDAFLR 357 L A Sbjct: 227 LVAALRT 233 >gi|260832994|ref|XP_002611442.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] gi|229296813|gb|EEN67452.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] Length = 1121 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 72/195 (36%), Gaps = 31/195 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + D D++ +DVS S++ F IR +++ + + R G++ Sbjct: 934 AACDSAADVIFSIDVSGSVSGQF-------DTVRNFIRGVVNCLTI---GGSHARVGVIK 983 Query: 223 FS-SKIVQTFPLA--WGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F+ S + L + +I L GS S GL N+ + Sbjct: 984 FAGSNANRQISLTDYNNKNDLLVRIRNLDRSLGSAVGSVAGLSVMRNEFSTSGRPTAR-- 1041 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K I LTDG ++S + + + G ++++GV A A + L+N A Sbjct: 1042 -------KIGIVLTDGRDTSSSDAV---IPDAETLRNEGTTIFSVGV-ANARRETLENMA 1090 Query: 338 S---PDRFYSVQNSR 349 S + ++ + Sbjct: 1091 SRPVNENVFTATFAS 1105 >gi|157738379|ref|YP_001491063.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] gi|157700233|gb|ABV68393.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] Length = 1866 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 57/154 (37%), Gaps = 17/154 (11%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG------MDKLGVATRSIREMLDIIKSI 209 T +++ ++ ++ +V+D S SM + G + ++ + +++ + D +K Sbjct: 1367 TGGTQLNVQAGKNYNIALVVDTSGSMKEASGSKTAWGTTISRIDLLKDALKNLADSLKGH 1426 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 N ++ F + + + + + KI+ L T E A+ K Sbjct: 1427 DGKIN---VSIIDFDTNAKEPITFNDLTSKNISDLITKIDALKAEGGTN----YEDAFLK 1479 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + Y+ FLTDG+ + N Sbjct: 1480 TTSWFDTQSVTYGKAQGYENLTYFLTDGDPTFSN 1513 >gi|90020471|ref|YP_526298.1| arginine biosynthesis bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase [Saccharophagus degradans 2-40] gi|89950071|gb|ABD80086.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 708 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 47/277 (16%), Positives = 98/277 (35%), Gaps = 22/277 (7%) Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 +EN + ++ I + F +G+ + + I ++ + Sbjct: 244 EENSVKSVAEAPVSTFSIDVDTASYSFVRRQLNSGYLPEKDAIRAEELINYFDYNYPLPS 303 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + +A + + I PW + + I+ +++ +LDVS SM Sbjct: 304 DSTAPFKPNITVIDS--PWAKGKKLVHIGLKG-YDIAPDQKPRTNLVFLLDVSGSM---- 356 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 DKL + +S+ +L + V VV +G V A Q I + Sbjct: 357 -NSQDKLPLVKQSMEMLLSTLNPDDTVAIVVYAGAAG----TVLEPTPAKDKQKILSAMQ 411 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 RL G +T G+ AY+ + +A + + + +I TDG+ + + +N+ Sbjct: 412 RLQAGGSTAGGAGIALAYD-LAEANFDKKAVNR--------VILATDGDFNVGSTNNETL 462 Query: 306 LFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDR 341 + + +G + +G D ++ A Sbjct: 463 QGFVERKREKGIFLSVLGFGQGNYNDHLMQTLAQNGN 499 >gi|321475776|gb|EFX86738.1| hypothetical protein DAPPUDRAFT_307863 [Daphnia pulex] Length = 829 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 72/238 (30%), Gaps = 63/238 (26%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR--SGLVTFS---- 224 ++ VLD S SM ++ +++ +LD ++ D+ +VV SG+ + Sbjct: 240 VIFVLDTSGSMAG------TRIEQTKQAMNSILDQLRKDEDIFSVVEFSSGVTEWDLRKP 293 Query: 225 ----SKIVQTFPLAWGVQ--------------------------------HIQEKINRLI 248 P + +E + + Sbjct: 294 YKGPDHYYFNSPPEETTEDATAVPQNNESEVKFGPYDDILAYPVTEQSVKRAKEFVAAMD 353 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 S+T L A K + IIFLTDGE ++ D E L Sbjct: 354 VTSSTNINDALLLAL--------KNSQSVQSRVRLTPIIIFLTDGEPTASVTDTTEILKN 405 Query: 309 CNEAKRRGAI-VYAIGVQAEAADQFLKNCASPDR------FYSVQNSRKLHDAFLRIG 359 + + ++ + QFL +S +R + + + +L F + Sbjct: 406 VRKGNSDDVVSIFCLAFGTGTDYQFLTKISSQNRGFARKIYEAADATLQLKGFFDEVA 463 >gi|304394408|ref|ZP_07376331.1| von Willebrand factor, type A [Ahrensia sp. R2A130] gi|303293848|gb|EFL88225.1| von Willebrand factor, type A [Ahrensia sp. R2A130] Length = 689 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 73/230 (31%), Gaps = 41/230 (17%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + + A L +T ++ ++M+V D S SM G K+ +A + +L Sbjct: 5 FALILAAASLTLTQPANAQ-QTQSSRNVMVVFDGSGSM-WGQIEGRAKIEIARDVLSSVL 62 Query: 204 DIIKSIPDVNNVVRSGLVTFSSK----------IVQTFPLAWGVQHIQEKINRLIFGSTT 253 S G++ + + +V P A V + + N + T Sbjct: 63 GETTSNMT------IGMIAYGHRKKGQCSDIETVVAPGPAASTVPEMIARANAIKPKGKT 116 Query: 254 KSTPGLEYAYNKI-FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + + A + + E ++ +TDG + N + E Sbjct: 117 PLSDAVRKAAESLRYTENEAT-------------VVLVTDGIETC----NADPCALATEL 159 Query: 313 KRRGA--IVYAIGVQAEAADQFLKNCA---SPDRFYSVQNSRKLHDAFLR 357 + G + +G + C + +F S ++ +L A Sbjct: 160 EESGVDFTTHVVGFGLSKDEGRQVACLAANTGGKFISADDADELKAALDE 209 >gi|303240541|ref|ZP_07327057.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] gi|302591943|gb|EFL61675.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] Length = 305 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 19/60 (31%), Positives = 31/60 (51%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +NI + F + KG+ + AI+L VI L ++ F KAKL +D + L A ++ Sbjct: 1 MNIPSLFKSKKGTTFVFFAIILTVIVAFAALSVDVGVIAFEKAKLSNTVDAAALAGAQEL 60 >gi|149028000|gb|EDL83451.1| complement factor B, isoform CRA_d [Rattus norvegicus] Length = 543 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ MVLD S S G A R + +++ + S R GLVT++ Sbjct: 44 KGGSMNIYMVLDGSDS------IGASNFTGAKRCLANLIEKVASYGVKP---RYGLVTYA 94 Query: 225 SKIVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + ++ + EK+N++ S T + L+ Y+ + + Sbjct: 95 TVPKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMSWPGDAP-- 152 Query: 276 IAKGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRG--------AIVYAIGV 324 +G + + II +TDG ++ P ++ + + R V+ +G Sbjct: 153 -PEGWNRTRHVIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGP 211 Query: 325 QAEAA--DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + + + + V++ L + F ++ E Sbjct: 212 LVDPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSL 254 >gi|123718336|emb|CAJ77151.1| collagen type VI alpha 4 [Mus musculus] Length = 229 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 58/162 (35%), Gaps = 21/162 (12%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 S P S + +S D+ ++D S S+ + + Sbjct: 86 MLSGMPPLMSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPN------DFIEMKDFM 139 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRL-IFGSTTKST 256 +E++ + PD VR G+V +S KI+ F L + + I+ + G T + Sbjct: 140 KEVIKMFHIGPDR---VRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTG 196 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 L + D +Y+I +TDG+++ P Sbjct: 197 KALSKMVPVFQNTAR---------IDVARYLIVITDGQSTDP 229 >gi|47191295|emb|CAF87304.1| unnamed protein product [Tetraodon nigroviridis] Length = 202 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 60/178 (33%), Gaps = 20/178 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S + + + I ++ + PD VR +V +S + + Sbjct: 8 DIVFLVDGSDTTGETGIAYIRD------FIISVVQQLDVQPDR---VRVAVVQYSDNVQR 58 Query: 230 TFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L Q + + RL S G + Y E D +++ Sbjct: 59 EFALNSHNNKQAVISAVKRLRLMGGRSSDLGEAFKYVT---ENELKPSSGSRPSDASQHL 115 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 + +T G++ +++ Y K +GV Q + +P+ V Sbjct: 116 VVITGGQS------PQDASLYGPWLKSSRVSCIGVGVGGTNTRQLTQIATTPEDVLQV 167 >gi|145597693|ref|YP_001161769.1| hypothetical protein YPDSF_0381 [Yersinia pestis Pestoides F] gi|145209389|gb|ABP38796.1| conserved hypothetical protein [Yersinia pestis Pestoides F] Length = 126 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 42/126 (33%), Gaps = 11/126 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L ++ +++ +K P ++ F+ Sbjct: 3 RLPIFFVLDCSESMIGE------NLKKMNDGLQMIINDLKKDPHALETAWISVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL V+ + RL G T L+ +I K KG + Sbjct: 57 KTIVPL---VEVVSFYPPRLPIGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKP--VV 111 Query: 288 IFLTDG 293 LTDG Sbjct: 112 YLLTDG 117 >gi|19552620|ref|NP_600622.1| hypothetical protein NCgl1349 [Corynebacterium glutamicum ATCC 13032] gi|62390288|ref|YP_225690.1| Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] gi|21324171|dbj|BAB98796.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] gi|41325625|emb|CAF21414.1| secreted Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] Length = 525 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 37/254 (14%), Positives = 75/254 (29%), Gaps = 33/254 (12%) Query: 119 DDQHKDYNLSAVSRYEMP--FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 D Y + +P T + +++ + + VLD Sbjct: 287 DALTDTYRRPTTANATLPAELSSQTIIEAPFPGSKTVTDALIDAYTNQFRVPGETTFVLD 346 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR----SGLVTFSSKIVQTFP 232 VS SM ++ + ++ +++ + N +R ++ FS + Sbjct: 347 VSGSMLGQ------RITLLKDTMSDLISGGATTDLANVSLRDREKVSIIPFSFGPHEVIS 400 Query: 233 LAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 G +Q+++ L T + AY E Sbjct: 401 ETLGAVGSPSRTDLQQRVEALQADGGTGIYDAVLAAY------AESAGGDYIPS------ 448 Query: 287 IIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYS 344 I+ +TDGE ++ D + + + R V+ I + A + + Sbjct: 449 IVLMTDGELTAGRTYDQFLTEWNALPSNIRSIPVFVILYGEANVADMEQLAATTGGETFD 508 Query: 345 VQNSRKLHDAFLRI 358 N L +AF I Sbjct: 509 AIN-GDLDEAFKEI 521 >gi|328784200|ref|XP_003250409.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Apis mellifera] Length = 2258 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 65/186 (34%), Gaps = 34/186 (18%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----- 223 ++++ ++D S S+ +L ++ P + + R +VTF Sbjct: 101 VELVFLVDSSGSVGSK--NFHSELNFVKH-------VLSDFPVIPSATRIAIVTFGGRRH 151 Query: 224 ---SSKIVQTFPLAWGVQHI--QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + + ++ ++ N G T + L AY + A+ Sbjct: 152 IRRNVDQISRTNENDNKCYLLNKQLTNISYTGGGTYTRGALLEAYRILEKAR-------- 203 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + KK + +TDG ++ + N K GA ++ G++ D+ +S Sbjct: 204 --SNAKKAVFLITDGFSNGG-----DPRPAANLLKGAGATIFTFGIRTGNVDELHDIASS 256 Query: 339 PDRFYS 344 P YS Sbjct: 257 PKDTYS 262 >gi|309264112|ref|XP_003086226.1| PREDICTED: collagen alpha-3(VI) chain isoform 3 [Mus musculus] Length = 2656 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 863 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 916 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 917 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 976 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 977 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1035 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1036 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1087 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1088 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1143 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1144 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1197 >gi|148708135|gb|EDL40082.1| mCG12867, isoform CRA_a [Mus musculus] Length = 2656 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 123/360 (34%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 863 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 916 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 917 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 976 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 977 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1035 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ +I+ ++ + + +R GLV ++S F L Sbjct: 1036 -GSINFRRDSFQEVLRFAS-------EIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFS 1087 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + + +T G Sbjct: 1088 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDERVPQIAFVITGG 1143 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1144 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1197 >gi|148656025|ref|YP_001276230.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568135|gb|ABQ90280.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 932 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/324 (14%), Positives = 104/324 (32%), Gaps = 37/324 (11%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNI 108 D + L A + + + + N R + ++ Sbjct: 303 QSPEDAAGLERALRAAQIDVAVVAPA-AMPDTLLAMSQYDAIALVNVPRRAFSESTLQHL 361 Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + D+ + P F W A L +T + I + Sbjct: 362 AT------YVHDRGGGLIM-----VGGPRSFGPGGWRGTPVEAALPVTMDIPIYR-TMPP 409 Query: 169 LDMMMVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + +++V+D+S SM + P + R I +L + + R G+V Sbjct: 410 VSVVIVIDISGSMAMTEDGIPKLSLALDGARRIASLLRDEDELTILPFDDRPGVV----- 464 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 V P + + I++ + GS L A + D ++ Sbjct: 465 -VGPLPGSQRDKAIEQMSQVRLGGSGINIHDALVAA-----------ARYVRASDRPIRH 512 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYS 344 II +TDG ++ + + +L + + +I V + F+++ A+ R + Sbjct: 513 IITITDGNDT---VQQEGALDIVRALRDERVTLTSIAVGQGSHVPFIRDMAAVGGGRTFL 569 Query: 345 VQNSRKLHDAFLRIGKEMVKQRIL 368 + + L D L + +++ I+ Sbjct: 570 TERAADLPDLLLDEAEMIIQPSII 593 >gi|256852741|ref|ZP_05558111.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|256711200|gb|EEU26238.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|315030743|gb|EFT42675.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4000] Length = 1105 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 53/135 (39%), Gaps = 21/135 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSS 225 LD+++V+D S SMN++ +++G + + +D + NN+ + G V +SS Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLADSGITNNININMGYVGYSS 320 Query: 226 KIVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + G ++ I + T + L A + + Sbjct: 321 DGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH--------- 371 Query: 282 DYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 372 --KKVIVLLTDGVPT 384 >gi|145297003|ref|YP_001139824.1| hypothetical protein cgR_2902 [Corynebacterium glutamicum R] gi|140846923|dbj|BAF55922.1| hypothetical protein [Corynebacterium glutamicum R] Length = 230 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 60/211 (28%), Gaps = 46/211 (21%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK----- 226 M+VLD S SM G + A + I E+ GLVT+ Sbjct: 1 MLVLDSSGSMVTPDAGGQSRSDAANQFIDELAGTFD----------LGLVTYGGNTGETP 50 Query: 227 ---------IVQTFPLAWG-VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 I G + +++ I+ L T L A ++ + Sbjct: 51 EDYEAGCQDITVVRGPTNGQAEQLKQHIDGLQPRGYTPIGESLRKAVAELPEGGSGT--- 107 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLK 334 I+ ++DG + E +G ++ +G + + + + Sbjct: 108 ----------ILLVSDGIATCT---PPPVCEVAAELADQGVDLVINTVGFTLDESARAVL 154 Query: 335 NC---ASPDRFYSVQNSRKLHDAFLRIGKEM 362 C A + ++ L + Sbjct: 155 ECIAQAGNGTYADASDADSLVAELKQAATRT 185 >gi|327270794|ref|XP_003220173.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 917 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 71/212 (33%), Gaps = 41/212 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S M ++L ++ + L I + G+V F+SK Sbjct: 315 VCLVLDASAQMGKD-----NRLSRLIQAAKLFLLQI-----IEEGSWVGIVAFNSKGNIQ 364 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + G+ A+ E Sbjct: 365 AGLQKVFSDIERESLTSHL-PTTAAGDCNICEGVNAAFQVFSQKLTSTEGCE-------- 415 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFY 343 I+ LT+GE S L C ++ + + I++ I ++A+++ K + + + Sbjct: 416 -IVLLTNGEGSD--------LSPCLSKIQSQEIIIHTIAFGSKASNELEKLADMTGGKTF 466 Query: 344 SVQNSRK---LHDAFLRIGK---EMVKQRILY 369 +S L DAF I + +Q I Sbjct: 467 YATDSLDSNGLIDAFGGISSGSGDASQQSIQL 498 >gi|255009188|ref|ZP_05281314.1| hypothetical protein Bfra3_08601 [Bacteroides fragilis 3_1_12] gi|313146936|ref|ZP_07809129.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313135703|gb|EFR53063.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 456 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 71/198 (35%), Gaps = 29/198 (14%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 + + H + K ++ G ++ +D S SM+ + +S Sbjct: 280 MIDYESKEQHRIKDVKVQGKDIAEEQSG-PFIICVDTSGSMSGEREEFV-------KSAI 331 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + + D L++FS I ++ L+ +Q + + + F T TP Sbjct: 332 LAIAELTEQQDRKCY----LISFSDDIACIEIERLSQNIQELADFL-CQSFHGGTDLTPA 386 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L +A + + + ++ ++D E S N + E + + K+ Sbjct: 387 LLHAIHILRTKSYRNAD-----------LVMMSDFEMSPLNDELSEEIK---KIKQNNTY 432 Query: 319 VYAIGVQAEAADQFLKNC 336 +YA+ VQ + +L C Sbjct: 433 IYALSVQKQCEKTYLDIC 450 >gi|219683166|ref|YP_002469549.1| FctX [Bifidobacterium animalis subsp. lactis AD011] gi|219620816|gb|ACL28973.1| FctX [Bifidobacterium animalis subsp. lactis AD011] Length = 879 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 51/409 (12%), Positives = 118/409 (28%), Gaps = 83/409 (20%) Query: 19 ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKND 78 L ++L + + M L + ++ + AT + + Sbjct: 24 PLRSVLASLCAVAMSLGMASASVAAFADDRQPAA-TADPQAAT---ASAGNVDAPQHTKR 79 Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH----------KDYNLS 128 S + + Q + ++ + L + + +++ Sbjct: 80 ISKNDDGTYTLSMDVTGKSDESTEQQVVPLDIALVLDVSGSMNELSGKLVYNEVELLSMN 139 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 +S Y + ++ T + S+ + + S S++ Sbjct: 140 PISTYYVEKDGSYQAVRCSAISWGRCTTWQDQDSAGQKYTVTYNWIGGPSASVSPDVQFY 199 Query: 189 MDK------LGVATRSIREMLDIIKS----IPDVNNVVRSGLVTFSS------------- 225 K L ++ LD ++ I D V+ L+ ++ Sbjct: 200 KSKQSEETRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNE 259 Query: 226 ------KIVQTFPLAWGVQHIQEK---INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 LAW + +Q++ +N L G T++ GL++A ++ + Sbjct: 260 DGYNYNYSQTVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA--- 316 Query: 277 AKGHDDYKKYIIFLTDGENSSP----NIDNKESLFYCNEAKRRGAIVYAIGVQ------- 325 +K +F +DG +S ++ + K + V +IG Sbjct: 317 -------QKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSG 369 Query: 326 AEAADQFLKNCAS----------------PDRFYSVQNSRKLHDAFLRI 358 + A++F+ +S +Y+V L F I Sbjct: 370 TDNANKFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEI 418 >gi|170746808|ref|YP_001753068.1| hypothetical protein Mrad2831_0362 [Methylobacterium radiotolerans JCM 2831] gi|170653330|gb|ACB22385.1| conserved hypothetical protein; putative vWFA domain protein [Methylobacterium radiotolerans JCM 2831] Length = 437 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 50/434 (11%), Positives = 126/434 (29%), Gaps = 111/434 (25%) Query: 9 FFYNCKGSISILTAIL-LPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 FF G +++ A++ LPV+F ++ + K +L LD ++L ++ N Sbjct: 19 FFRARSGQVAVTFALVTLPVMFATAA-AVDYGRRNAAKTQLDAALDGAVLAVMSQKTNTI 77 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID-DQHKDYN 126 + + F K + + ++ Y Sbjct: 78 PTTTLQNMETQFRTEAAK--------------VPGVTVTSFTPGAPVNTSKTLSLTASYT 123 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN---- 182 + + + + ++ + + + ++ ++LD S SM Sbjct: 124 ATVKTS-----------LASMMQIPAMPVSGTSSATRNTSQYINYYLLLDNSPSMGLAAT 172 Query: 183 -----------------------------------DHFGPGMD-----KLGVATRSIREM 202 D++ + ++ V ++ + Sbjct: 173 DADVQNMKIATNGCAFACHQHTFDKKGNITGDDQNDNYHIALRNNIKLRIQVLREAVSAL 232 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQT-----FPLAWGVQHIQEKIN-RLIFGSTTKST 256 +D + + + TF+ + QT P +++ I+ + + + + Sbjct: 233 VDQANVSMLLPQQFQMEMWTFNDSVTQTKLQAMTPTLNNIKNAAPNIDIAYAYYNQSDNQ 292 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKE----------- 304 E A ++ +++ +TDG E++ ++ N+ Sbjct: 293 TDFERAIARMNTTIPASGDGLTPDKPI-RFLFLVTDGVEDTGGSVTNQSAGFQIQSNRFI 351 Query: 305 ---SLFYCNEAKRRGAIVYAI--------------GVQAEAADQF---LKNCASPDRFYS 344 S C+ K + + I Q L+ CAS ++ Sbjct: 352 GPLSPSTCSALKNKNVKIGIIYTQYLPIYDNDFYNRYVRPYESQIGPSLQACASDGMYFP 411 Query: 345 VQNSRKLHDAFLRI 358 V + + A L++ Sbjct: 412 VTTNGDITAAMLKL 425 >gi|55743096|ref|NP_066933.1| collagen alpha-1(XIV) chain precursor [Homo sapiens] gi|125987815|sp|Q05707|COEA1_HUMAN RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|187954653|gb|AAI40894.1| Collagen, type XIV, alpha 1 [Homo sapiens] Length = 1796 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 79/214 (36%), Gaps = 24/214 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 VK ++ D+++++D S S+ + + ++ + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF------NFRLVRHFLENLVTAFDV---GSEKT 195 Query: 217 RSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R GL +S + L A+ + + E + L + T A N IF+ K E Sbjct: 196 RIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKPE 252 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ K I +TDG++ I +L + G ++AIGV+ ++ + Sbjct: 253 AGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQE 304 Query: 335 NCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + PD Y+V +H + + + + Sbjct: 305 IASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 78/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F + L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDENFNKIISFLYSTVGALNKI---------GTDGTQVAMVQFTDDPR 1082 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 1083 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 >gi|54303503|ref|YP_133496.1| hypothetical protein PBPRB1846 [Photobacterium profundum SS9] gi|46916933|emb|CAG23696.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 648 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 67/205 (32%), Gaps = 34/205 (16%) Query: 136 PFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 PF W + P +++ S S + +V+D+S SM ++L Sbjct: 54 PFAILALGWLLSVLALAGPSWEKNTLPAYSLSGARV---LVIDMSRSM-YATDIAPNRLT 109 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 A +ML K +GLV +++ PL ++ I ++ Sbjct: 110 QARFKALDMLPGWKEGS-------TGLVAYAADGYVVSPLTEDSSTLKNLIPNLSPEIMP 162 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 + + G++ A + A + II +TDG + D +L Sbjct: 163 IQGSNAAAGVQEAITLLKQAGHQAGD-----------IIIITDG-MTQQESDQTMAL--- 207 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLK 334 K + + + V + + Sbjct: 208 --VKDQDYRLSILAVGTQQGAPIKQ 230 >gi|294674674|ref|YP_003575290.1| tellurium resistance protein [Prevotella ruminicola 23] gi|294471870|gb|ADE81259.1| putative tellurium resistance protein [Prevotella ruminicola 23] Length = 212 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 67/198 (33%), Gaps = 18/198 (9%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM +G + ++ ++ ++S P ++TF+S Sbjct: 3 RLPVYLLLDTSGSM---YGEPI---EAVKNGVQTLVSTLRSDPYALETAYISIITFNSSA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + + + T L K+ K KG I Sbjct: 57 QQIAPLT---ELASFQPPVIDASGCTALGEALNLLAQKVDTEIVKTTAEVKGDWKP---I 110 Query: 288 IF-LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +F +TDGE P D ++ L KR+ +V A A LK Sbjct: 111 VFIMTDGE---PTDDLQKGLN--EFRKRKFGMVVACAAGQGANTNTLKQITENVVQLDTA 165 Query: 347 NSRKLHDAFLRIGKEMVK 364 +S + F + + Sbjct: 166 DSATIKAFFKWVSASIST 183 >gi|291528739|emb|CBK94325.1| von Willebrand factor type A domain [Eubacterium rectale M104/1] Length = 410 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 23/112 (20%), Positives = 40/112 (35%), Gaps = 16/112 (14%) Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G+ A + + D + +I +TDG N+ + + N A Sbjct: 2 TALYSGINSATTEFKN----------YSTDASRIMIVVTDGYNNQSGASSATVI---NNA 48 Query: 313 KRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEM 362 I+Y +GV LKN ++ +Y + +L+ F I E Sbjct: 49 IEENVIIYCVGVG-SVNSTVLKNISESTGGCYYYINQFSQLNGIFENIISET 99 >gi|153008592|ref|YP_001369807.1| hypothetical protein Oant_1261 [Ochrobactrum anthropi ATCC 49188] gi|151560480|gb|ABS13978.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 576 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 14/97 (14%), Positives = 37/97 (38%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F G+++ + A++ P+ V ++TS F + +L + D + + A + Sbjct: 5 VSRFLRARGGNLATMAALVSPIFLAVAAFSVDTSSLFLERRQLQSMADFAAVAGAASLSQ 64 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFA 102 + + + N ++ + N +N Sbjct: 65 ANDAVLRQLRANGLDPVLMTGAYDPSVVNGKTDNKTR 101 >gi|15838706|ref|NP_299394.1| hypothetical protein XF2115 [Xylella fastidiosa 9a5c] gi|9107243|gb|AAF84914.1|AE004026_15 hypothetical protein XF_2115 [Xylella fastidiosa 9a5c] Length = 941 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 61/209 (29%), Gaps = 24/209 (11%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 K A++ Y P + H I + + + +D+S SM+ Sbjct: 107 KGGKYGAMNPYPQPASYKIRRILEGWDHDACWYPEKAAIGMQMAPCVAVYFAIDLSGSMD 166 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWG 236 G G +L ++ +LD + V L F + Q L A G Sbjct: 167 YVGGNGRSRLDNMKTALNAVLDQLGQSIASGTAVDIMLAGFGNAPDQRQTLLRRNCTAQG 226 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK--KYIIFLTDGE 294 + ++ ++ + Y F A + F+TDG Sbjct: 227 IAELKSWVS------------ARQALYGTYFPAGTMDMPSFYAAAPPNAVRVAFFVTDGV 274 Query: 295 NSSPNIDNKES----LFYCNEAKRRGAIV 319 P+ N ++ + + G + Sbjct: 275 PDPPSATNAQAARADVDQVAHLRCYGITI 303 >gi|332290884|ref|YP_004429493.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332168970|gb|AEE18225.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 267 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 55/180 (30%), Gaps = 22/180 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 ++ +D++ +D + SM D++ ++ ++ I + + R L + Sbjct: 54 QASNAIDILFAVDATGSMG-------DEISYLKSELKNIMSRIDASVEQK---RVALTVY 103 Query: 224 ----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + + V +++ ++ +E A + A Sbjct: 104 RDHGDDYVTRAIDFSSDVNEVKDFLSAQHAAGGGDYEEAVEEALKVSLSQSWNEKAKA-- 161 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY-AIGVQAEAADQFLKNCAS 338 + + + D + +AK +G + + A +FL S Sbjct: 162 -----RILFLMLDAPPHFNQENVAMIKSQIAKAKEQGIKIIPVVASGANKEVEFLMRSFS 216 >gi|272938026|gb|ACZ96962.1| vwa2 protein [Danio rerio] Length = 260 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 59/188 (31%), Gaps = 22/188 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 LD++ VLD S + +T +V + GLV + Sbjct: 1 GQALDLVFVLDASSGVGKENFIHFQDFVRSTSV---------QFDINRDVAQVGLVVYGR 51 Query: 226 KIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + V F L + + F S + A + + A+ Sbjct: 52 RPVTVFDLDKYNSGSAVLRAVGDAAFLGGKAS---VGSALLHVLSQSLTVGKGARPG--V 106 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 K ++ LTDG +++ + + G ++ IGV + L+ S D Sbjct: 107 NKAVVVLTDG------TGVEDAAVPAQKIRDSGVSIFLIGVGDIQQELLLRISGSEDHMI 160 Query: 344 SVQNSRKL 351 +V + L Sbjct: 161 TVPSYDDL 168 >gi|158337332|ref|YP_001518507.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158307573|gb|ABW29190.1| von Willebrand factor, type A domain protein [Acaryochloris marina MBIC11017] Length = 686 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 80/216 (37%), Gaps = 33/216 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +++ ++DVS SM +KL + +S+ ++ +K R LV + Sbjct: 305 EKEQPSNLVFLIDVSGSMKRP-----NKLALVKKSLCLLVHQLKPED------RVSLVVY 353 Query: 224 SSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + P G Q I I+RL G +T G++ AY+ H K + Sbjct: 354 AGRAGIVLPSTPGTQKATIMNAIDRLEAGGSTAGAAGIKMAYDM------AERHFLKNGN 407 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA----EAADQFLKNCA 337 + +I TDG+ + + E + + RG + +G + + L N Sbjct: 408 NR---VILATDGDFNVGQSSDAELERLIEQKRDRGVFLTVLGYGTGNYKDNKMELLANKG 464 Query: 338 SPDRFY-----SVQN--SRKLHDAFLRIGKEMVKQR 366 + + Y Q L I K++ Q Sbjct: 465 NGNYAYIDTLLEAQKVLVNDLRGTLFTIAKDVKIQV 500 >gi|119961201|ref|YP_948618.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] gi|119948060|gb|ABM06971.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] Length = 354 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 13/113 (11%), Positives = 39/113 (34%), Gaps = 5/113 (4%) Query: 6 IRNFFYN--CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL---YTA 60 +R ++ +G+++ +TA+L+ + + ++ + + A+L D S + Sbjct: 1 MRRLKHDDRERGAVAPMTAMLMVALLGMTAFAVDVAMMYSEHAQLQNGADSSAIGIAQAC 60 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTS 113 + + + + + + D+ R+TS Sbjct: 61 AQNAASADCAAPTSAATSLAGLNALDGVSNAPQASVNLGTGTVDVTTQSRNTS 113 >gi|156350127|ref|XP_001622153.1| hypothetical protein NEMVEDRAFT_v1g828 [Nematostella vectensis] gi|156208600|gb|EDO30053.1| predicted protein [Nematostella vectensis] Length = 339 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 62/168 (36%), Gaps = 19/168 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LD S S+ + DI+ + R G+V +S + + Sbjct: 1 LIFLLDSSGSVG--LQNYQKEKNFIK-------DIVSKQNIGRELTRVGVVVYSHEAIVA 51 Query: 231 FPLAW--GVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY-KKY 286 L + + + I+ + + TT+ GL A D +++ +A G Y + Sbjct: 52 IKLTDYSSTEDLNKAIDGIDYYGKTTRIDKGLMTA-----DRIDQVFTVAAGSRPYLPRV 106 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ LTDG + C K++ A+GV + L+ Sbjct: 107 LVLLTDGRQTRAPGYKPLKTAVC-PLKKKQVKTLAVGVGRGIDARELR 153 >gi|119510964|ref|ZP_01630086.1| hypothetical protein N9414_01285 [Nodularia spumigena CCY9414] gi|119464403|gb|EAW45318.1| hypothetical protein N9414_01285 [Nodularia spumigena CCY9414] Length = 412 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 65/173 (37%), Gaps = 13/173 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD---------IIKSIPDVNNVVRSGLV 221 ++++LD S SMN G K+ A ++R+ + I +P + R Sbjct: 90 ILVMLDFSGSMNQLDSGGTKKITGAINAVRQFIQVLSQRGGNTQISIVPFGESGSRCQGY 149 Query: 222 TFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + + F A +Q+ + + ++T L A + ++E+ ++A+ Sbjct: 150 PVNKETLDKFFAANDFKLQNHLNYLAKSTPCASTNLYEPLNQAVRFLATSEEERFYVAEN 209 Query: 280 HDDYK--KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 D + II L+DG ++ N + + V+ +G Sbjct: 210 SDQPQPRLSIILLSDGYHNKANEQQDFTALTNLFRRNPLITVHTLGYGLTPQQ 262 >gi|326433447|gb|EGD79017.1| NOTCH2 protein [Salpingoeca sp. ATCC 50818] Length = 1763 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/135 (22%), Positives = 53/135 (39%), Gaps = 12/135 (8%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 S + Q F + + ++ L+F T + GLE + IF + + + Sbjct: 360 SYTVDQLFSMTQDHDDLAATLDSLVFPDGATHMSAGLEQIRDTIF---QLRNGMREYEQA 416 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---P 339 + +I LTDG+++ E + + RG I+YAIGV + L+ AS Sbjct: 417 IPRVLIVLTDGKSNPG----FEPHEVAEQLRNRGIIIYAIGVG-DYYLPELEAMASEPMD 471 Query: 340 DRFYSVQNSRKLHDA 354 Y + + L Sbjct: 472 RHVYELADPSSLFTI 486 >gi|156390493|ref|XP_001635305.1| predicted protein [Nematostella vectensis] gi|156222397|gb|EDO43242.1| predicted protein [Nematostella vectensis] Length = 229 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 57/165 (34%), Gaps = 21/165 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+ ++D + SM ++ + A + + + + I VR LV + Sbjct: 16 LDLAFIVDCTGSMGEY-------IRQAQKHVISISETISRTAYN---VRLALVEYRDHPP 65 Query: 229 QTFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAY---NKIFDAKEKLEHIAK 278 Q V+ ++ ++++ + A K+ ++ + Sbjct: 66 QDKSFVTRVHDFTSDVKEMKVWVDKMSASGGGDCPESVADAIFKACKLGYREDATKMCVL 125 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 D + F DG + DN + L C+ +G +Y IG Sbjct: 126 IADAPPHGLGFAHDGFPN-GCPDNHDPLASCHVMAEKGITLYTIG 169 >gi|313225210|emb|CBY21004.1| unnamed protein product [Oikopleura dioica] Length = 1138 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 62/173 (35%), Gaps = 19/173 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV--T 222 S G++ + ++D S SM ++ + + M++ +K N+V V Sbjct: 576 SSFGVNAIFLIDSSGSMMGE------RMEQTREAFKFMIEGLKPGDTF-NIVSFESVNKV 628 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 FS + ++ ++++ G T + L A + K Sbjct: 629 FSDNRMVPVSDRSTFAALK-FMDQIQAGGATDAYAALVKASLLLSQNKRTSSQEN----- 682 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLK 334 I FLTDG ++ + L + A++ ++ I EA D +K Sbjct: 683 ---IIYFLTDGAPTAGVTNLNSILDMVDFIAQKSEIVMNTIAYGEEANDDKMK 732 >gi|295093844|emb|CBK82935.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 343 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 74/235 (31%), Gaps = 49/235 (20%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D+++ +D+S S++ +DKL ++ R G+V F Sbjct: 91 DEKYCRDIILCIDISTSVDYLNENLLDKLKKTVDELQGE--------------RFGIVIF 136 Query: 224 SSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ V PL ++++++++ L + Y+ I+ G Sbjct: 137 NTSPVLLTPLTDDYEYVKDQLDLIAQCLKSRNEVNLDDAFSSGYDWIYYQAYISSGTLIG 196 Query: 280 HD-----------------------DYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRR 315 ++ + K +IF TD + +P E+ C Sbjct: 197 NEQRGSSLIGDGLAAAAIDFSDADKERTKVVIFSTDNDIQGTPVATLDEAADIC---VSN 253 Query: 316 GAIVYAIGV--QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 VY +G + +KN + +FY + S + I K Sbjct: 254 NVTVYGVGTKEMTPENKESMKNAVEKTGGKFYLEEESGSFGEIVSSIEKLSKNLV 308 >gi|73976417|ref|XP_539432.2| PREDICTED: similar to Matrilin-2 precursor [Canis familiaris] Length = 524 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 59/164 (35%), Gaps = 26/164 (15%) Query: 165 SDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L++M V+D S S+ ++F + + + L R G++ + Sbjct: 174 KETPLELMFVIDSSESVGLENFEIIKSLVKTLSDQVALDLAT----------ARIGIINY 223 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S K+ + L + ++ + G T + L A N +F+A Sbjct: 224 SHKVEKVAHLTQFSNKDDFKLAVDNMQYLGEGTYTATALHEA-NHMFEAARPG------- 275 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 KK + +TDG+ + D K A ++ IGV Sbjct: 276 --VKKVALVITDGQTD--SRDEKNLTEVVKRASDINVEIFVIGV 315 >gi|332260454|ref|XP_003279304.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Nomascus leucogenys] Length = 1160 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 55/170 (32%), Gaps = 30/170 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 423 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 475 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 476 EVNEDNLEQAQSWIRDMKIGSSTNTLSALKTAFA----DKETQA------------IYLL 519 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P + + E +Y I A++FLK A+ Sbjct: 520 TDGRPDQPPEMVIDQVKVFQE-----IPIYTISFNYNDEIANRFLKEVAA 564 >gi|260796397|ref|XP_002593191.1| hypothetical protein BRAFLDRAFT_72730 [Branchiostoma floridae] gi|229278415|gb|EEN49202.1| hypothetical protein BRAFLDRAFT_72730 [Branchiostoma floridae] Length = 994 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 60/197 (30%), Gaps = 36/197 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M++VLD S SM + +L A + S G+VTFS+ Sbjct: 332 MVLVLDTSGSMRGD---PIRRLNQAATHFIRSTVLDDSW--------LGIVTFSTTANTY 380 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL A + ++ G TT L + D Sbjct: 381 HPLLQITSAADRTSLINRV-PSTVGGTTCIGCALLEGVKVL----------EAQGDPSGG 429 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAEAADQFLKNCASPDRFY 343 + ++DG+ + + +G I+ + A+ + L ++ Sbjct: 430 ILFLMSDGQENEAPDIATVTPQVL----AKGIIIDTLAYKRSADPQIESLALLTGGKSYF 485 Query: 344 ---SVQNSRKLHDAFLR 357 +S L+DAF Sbjct: 486 YSGEQGDSTALNDAFTA 502 >gi|254427565|ref|ZP_05041272.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196193734|gb|EDX88693.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 684 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 66/206 (32%), Gaps = 36/206 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D + VLD+S SM+ KL + + L ++ R +V F + Sbjct: 302 TGSDWVFVLDISGSMS-------AKLATLGDGVSQALGKLRGGD------RFRIVLFDDR 348 Query: 227 IVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + ++ +K+ +L T GL A N + + Sbjct: 349 AEELTSGFVDATPNNIRQYTKKVMQLQSRGGTNLFGGLSLALNPLDADRPTG-------- 400 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I+ +TDG + K+ + + + ++ + A L + Sbjct: 401 -----IVLVTDGVANVGKTQQKDFI---DLLENHDVRLFTFVMGNSANRPMLTAMTNASN 452 Query: 342 FYS--VQNSRKLHDAFLRIGKEMVKQ 365 ++ V NS + L ++ Q Sbjct: 453 GFAISVSNSDDIAGQILNATAKLTHQ 478 >gi|156355315|ref|XP_001623615.1| predicted protein [Nematostella vectensis] gi|156210333|gb|EDO31515.1| predicted protein [Nematostella vectensis] Length = 1190 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 48/140 (34%), Gaps = 18/140 (12%) Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAY 263 K + N + R G + FS + + PL ++ + L + T++ GLE AY Sbjct: 612 KKLLQNNPMSRVGAIDFSEQANEAIPLTSDQAALETGVQALKYQYQNGMTRTDKGLELAY 671 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC----NEAKRRGAIV 319 + + +I +TDG + D K + + + G Sbjct: 672 DLFKTNVRADAG---------QMLIVVTDGHTTP--WDGKTGIELIKPPSDAIRSLGVHT 720 Query: 320 YAIGVQAEAADQFLKNCASP 339 G+ ++ L AS Sbjct: 721 IVAGIDERVNEEELYYMASD 740 >gi|146343040|ref|YP_001208088.1| hypothetical protein BRADO6230 [Bradyrhizobium sp. ORS278] gi|146195846|emb|CAL79873.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 519 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 54/217 (24%), Gaps = 58/217 (26%) Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH----------- 239 +L ++ ++ + V N R GL F + +PL + Sbjct: 292 RLDAVGYAVNQLFTTANTTKKVANQFRIGLYPFIRYLYSYYPLTTNISGSTSDSSTINYA 351 Query: 240 -------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + N + T L + I + Y+ +TD Sbjct: 352 AANLATLLDTNTNASLGSGGTHIDTALSSVNSLITSVGDGSATTNTLP-----YVFLVTD 406 Query: 293 GE-------------NSSPNIDNKESLFYCNEAKRRGAIVYAIGV--------------- 324 G + S + C K RG I+ + + Sbjct: 407 GAQDPQVKGVPNGSWSGSNHATTINPTTSCTPLKNRGIIISVLYIPYQTINPVNASFAGD 466 Query: 325 -------QAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 L+ CASP FY+ + A Sbjct: 467 EDDYANNNIPNIPPSLQACASPGFFYTANTPADITSA 503 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 59/185 (31%), Gaps = 34/185 (18%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHY----------ILDHSLL 57 F + K +I+ A+ L I +G + S +K KL ++ + Sbjct: 16 RFVGDSKANIATTFALALLPILTAIGCGTDYSMAMRLKVKLQSAADAASIASISVNSAGY 75 Query: 58 YTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSII 117 A + + + G + + NI++ + ++ + ++L Sbjct: 76 AAAMAMTSDGSVTAGVNEAD--------NIFKGNASTFGGYTLTSETSTVTKTRSTL--- 124 Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 + + F+ + + ++ S S + LD + LDV Sbjct: 125 ------SSQVQFTAAVPTTFL-------TVIGYQSITVSGSSSSSVTLPLYLDFYLTLDV 171 Query: 178 SLSMN 182 S SM Sbjct: 172 SGSMG 176 >gi|73962941|ref|XP_547762.2| PREDICTED: similar to coagulation factor C homolog, cochlin precursor [Canis familiaris] Length = 847 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 66/213 (30%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 627 SYHMPNWFGTTK-YVKPLVQKLCSHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFHLMLE 685 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 686 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 734 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 735 RYMSGGTATGDAISFTVRNVFGPVRD--------SPNKNFLVIVTDGQ----SYDDVRG- 781 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 782 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 813 >gi|326680240|ref|XP_001920592.3| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4 [Danio rerio] Length = 1087 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 59/156 (37%), Gaps = 21/156 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---- 225 D+++V+DVS SM K+ +A +I +LD + VN + + V++ Sbjct: 249 DLVIVVDVSGSMKGL------KMTIAKHTINTILDTLGENDFVNVIAYTDYVSYVEPCFK 302 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + L +H + ++ L K ++ ++ + DA+ + + Sbjct: 303 GTLVQADLD-NREHFKLLVDELHVKGEAKVKKAMKESFRILADARANGQ-----GSLCNQ 356 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 I+ +TDG +E + + V+ Sbjct: 357 AIMLITDGAMEDFQSVFEEFNWP-----DKKVRVFT 387 >gi|170700850|ref|ZP_02891839.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|170134258|gb|EDT02597.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] Length = 423 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 16/163 (9%), Positives = 59/163 (36%), Gaps = 9/163 (5%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + +G+++I+ + L ++ +GL ++ + +++L D L A + + Sbjct: 12 RRGLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSCALSAARDLTS- 70 Query: 67 ENGNNGKKQKNDFSYRIIK------NIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 + + + + N Q + + + + + + + + Sbjct: 71 -AISLQVAEADGIAAGHANFAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVANPANVKY 129 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 LS ++ + + + T P ++ + + ++ + + Sbjct: 130 VKCTAQLSNIAHWFIEVL-NTIPGVQVANASQVAASAIATVGA 171 >gi|58262250|ref|XP_568535.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] gi|57230709|gb|AAW47018.1| expressed protein [Cryptococcus neoformans var. neoformans JEC21] Length = 430 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 62/193 (32%), Gaps = 34/193 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN--NVVRS 218 S +D++ +LD + SM + D + + D+I+ +N + +R Sbjct: 47 SGSSHGKCIDLVFILDCTGSMQKYINSVRDHI-------IGICDMIRGEEGLNGPDDLRV 99 Query: 219 GLVTF-------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 +V + S+ + + P + +Q + L + A E Sbjct: 100 AVVNYRDHPPQDSTYVYKFHPFTSDIPEVQNYLKGLTASGGGDGPEAVTAAMAATLTELE 159 Query: 272 KLEHIAKGHDDYKKYIIFLTD------GENSS----PNIDNKESLFYCNEAKRRGAIVYA 321 + + + + D GE + D + L + G ++ Sbjct: 160 WR-------REAARMAVLVADAPPHGIGEGGDQFKQGDPDGHDPLVVARMMAQNGITMF- 211 Query: 322 IGVQAEAADQFLK 334 + V+ + D+ L Sbjct: 212 LRVRIDDCDRELH 224 >gi|300812790|ref|ZP_07093186.1| von Willebrand factor type A domain protein [Lactobacillus delbrueckii subsp. bulgaricus PB2003/044-T3-4] gi|300496224|gb|EFK31350.1| von Willebrand factor type A domain protein [Lactobacillus delbrueckii subsp. bulgaricus PB2003/044-T3-4] Length = 893 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 46/295 (15%), Positives = 93/295 (31%), Gaps = 34/295 (11%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT---KILNQENGNNGK 73 ++ +TA ++ + L+I + L + A NG Sbjct: 1 MTKMTAKVVRTGHLFAFLLI----LMSMLTGLVTSGSSVVTAAANIRPTYQTDANGTYPT 56 Query: 74 KQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY 133 + + N D N + + + + S D + DY + + Sbjct: 57 NSWQVTGQQNVINQRGGDQVLGWDNN-TIWNGDATDTTNSYLKFGDPNNPDYQIRKYA-- 113 Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 + + N V D+++V+D+S SM + G ++ Sbjct: 114 KETNTPGLYDVYLNVKGNTQQNVKPV----------DIVLVVDMSGSMESN-SRGTNRAD 162 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGST 252 ++ L I++ + + V GL+ FSS G +I+ + ++ Sbjct: 163 AVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIRVGLGKV---GN 218 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY--------KKYIIFLTDGENSSPN 299 TK + A + +F + + KK +I LTDG + N Sbjct: 219 TKQQQAINDALSPMFQGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDGVPTFSN 273 >gi|328469248|gb|EGF40194.1| hypothetical protein VP10329_10206 [Vibrio parahaemolyticus 10329] Length = 617 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 58/200 (29%), Gaps = 30/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F+ W S S ++ + M+VLD+S SM ++L Sbjct: 54 FMIWGLAWAIACVALAGPSWQSNTRPS-FELSQNRMLVLDMSRSMYAS-DIKPNRLTQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GL+ ++ PL + I L+ Sbjct: 112 YKALDLLPKWKEGA-------TGLIAYAGDAYSLSPLTTDASTLAGIIENLSPELMPFQG 164 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + +E A ++ A I+ L D N D++ + N Sbjct: 165 SNLPAAIELALSQFSQAGANQGD-----------IVVLADDLN-----DSELARSL-NLV 207 Query: 313 KRRGAIVYAIGVQAEAADQF 332 K + V + + Sbjct: 208 KGKNIRVSVLAIGTANGAPI 227 >gi|330802397|ref|XP_003289204.1| hypothetical protein DICPUDRAFT_79965 [Dictyostelium purpureum] gi|325080732|gb|EGC34275.1| hypothetical protein DICPUDRAFT_79965 [Dictyostelium purpureum] Length = 810 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 48/138 (34%), Gaps = 8/138 (5%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + V+D S SM+ GM L +I + I + N R L+T + Sbjct: 3 ITFVVDTSGSMSQKTTNGMTLLDCCKAAIEHFIKIRSKDASMRN-DRFFLITSEENSLTA 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF-----DAKEKLEHIAKGHDDY 283 + W +++ L+ + L+ A++ + + + Sbjct: 62 VKIGWKDNFNTFIQEVKNLVCKDMSNLGFSLQKAFDNLNLFRIQSSIDNYGQGRNPWFIE 121 Query: 284 KKYIIFLTDGENSSPNID 301 I+ LTDG + + N+ Sbjct: 122 PAIIVLLTDGSSLTNNMS 139 >gi|209546481|ref|YP_002278399.1| hypothetical protein Rleg2_4401 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537725|gb|ACI57659.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 534 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 27/58 (46%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +R F+ + +G + LT I +P++ LVI+ + L +D L A ++ Sbjct: 6 VRRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAREL 63 >gi|302495833|ref|XP_003009930.1| hypothetical protein ARB_03856 [Arthroderma benhamiae CBS 112371] gi|291173452|gb|EFE29285.1| hypothetical protein ARB_03856 [Arthroderma benhamiae CBS 112371] Length = 705 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 64/173 (36%), Gaps = 28/173 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + N + I +K + + D+++V+D+S SMN + Sbjct: 39 ILSIHSIPNKDSMIVSIQPPLKPENDVPHVPCDIVLVIDISGSMNSAAPIPTGE-RGGED 97 Query: 198 SIREMLDIIK-----SIPDVNNVVRSGLVTF----SSKIVQTFPLAW--------GVQHI 240 + +LD+ K I +N R +VTF + + +++ P+A+ + Sbjct: 98 TGLSILDLTKHAAKTIIETLNEKDRLAVVTFCTEVNVRTIESSPVAFELDYMNKENKSTV 157 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 I++L S+T G++ N + + ++ LTDG Sbjct: 158 LSAIDKLYGKSSTNLWHGIKKGLNVLTTN---------PAQGKIQSLLVLTDG 201 >gi|154496349|ref|ZP_02035045.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] gi|150274432|gb|EDN01509.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] Length = 1896 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 66/189 (34%), Gaps = 23/189 (12%) Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH 239 SM G +L +A +++ M+ ++ + V FSS+ + Sbjct: 201 SMVQS-GKAQSRLKIAVSAVKNMVGTLREQLGGKLTAKF--VVFSSEGYKNGVDKRASAK 257 Query: 240 IQEK--INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + +++L T + G+ ++ + + + ++ + DG++ Sbjct: 258 VITEAQLDQLTAVGGTDLSAGVALGVDQFKSSSAR------------QVLVVVADGDS-- 303 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 D+ N K + I+Y +G ++D F D + +L +A Sbjct: 304 ---DDGYPNRTANNFKNKDGIIYTVGF-TFSSDSFNNLATDADHALLANSDTELGEAMED 359 Query: 358 IGKEMVKQR 366 I ++ Sbjct: 360 ISTDITAMI 368 >gi|299132141|ref|ZP_07025336.1| von Willebrand factor type A [Afipia sp. 1NLS2] gi|298592278|gb|EFI52478.1| von Willebrand factor type A [Afipia sp. 1NLS2] Length = 634 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 78/207 (37%), Gaps = 26/207 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV-----RSGLVT 222 L + +++DVSLS + L V ++ + + + D + + R VT Sbjct: 444 DLSLAVLMDVSLSTDAWMQDR-RVLDVEKGALLALTHGLTACGDEHAIYTFTSRRRTSVT 502 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 S+ PL + I +I L G T+ + + ++ ++ Sbjct: 503 VSTIKEFDEPLD---RRIIRRIEALTPGQYTRIGAAVRHVTTELAQRPQRH--------- 550 Query: 283 YKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K ++ LTDG+ + +++ EA++ G V+ + + A D F + Sbjct: 551 --KLLLLLTDGKPNDIDHYEGRYGIEDTRMAIREARKAGLRVFGVTIDENARDYF-PHIF 607 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + V++ +L A I +++ Sbjct: 608 GRGAYSIVRDIARLPAALPAIYRQITT 634 >gi|67968521|dbj|BAE00622.1| unnamed protein product [Macaca fascicularis] Length = 480 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 26/103 (25%), Positives = 42/103 (40%), Gaps = 10/103 (9%) Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++ + K E K II +TDGE + +++L +A++ GA Sbjct: 1 MQAGFRKAIQQIETFNSGNK----VPSMIIAMTDGELVAHAF--QDTLREAQKARKLGAN 54 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQN----SRKLHDAFLR 357 VY +GV DQ SP+ ++V+N R DA Sbjct: 55 VYTVGVADYKLDQITAIADSPEHVFAVENGFKAMRDTVDALTS 97 >gi|327538530|gb|EGF25193.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 1460 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 44/318 (13%), Positives = 93/318 (29%), Gaps = 75/318 (23%) Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN- 126 + Q + + +I + + + + + + L++ +D + Sbjct: 803 KLDTADGQGDLPEGIAVMSIHRDGIKTSDAQQAVSIQATDAPTNVELTLAVDQKRPSLAE 862 Query: 127 LSAVS-RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 ++ RY+ + + S + + + I D G + V+D S SMND Sbjct: 863 VNFRGNRYQSSVMSSGTAFGVTSRGQTVQKGAGITIRDAMDAGRAITFVMDCSASMNDPL 922 Query: 186 GPGM----------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV------- 228 G M K A ++ EM+ ++ P + GLV + ++ Sbjct: 923 GEEMGRSALGAQRASKFEAARSAVYEMMRRLQPGPS-----QIGLVLYGHRMAIRAGDPA 977 Query: 229 --------------------QTFPLA---------------WGVQHI---QEKINRLIFG 250 FP + + ++ + + Sbjct: 978 KDSGDGSGQTTLLQKRYHKRFPFPPTIQPFEDVEVALPTGRFDTAELELARQHFDAAVPW 1037 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-- 308 T + A I ++ D +K ++ ++DG N N + Sbjct: 1038 GQTPLYLSIWKAMEDI----------SRTGDGVRKDVVVISDGRNYQFNPTPEAIFSIGQ 1087 Query: 309 -CNEAKRRGAIVYAIGVQ 325 AK G V+ IG Sbjct: 1088 LVTRAKTLGVQVHVIGYG 1105 >gi|71051532|gb|AAH36192.1| COL14A1 protein [Homo sapiens] Length = 534 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 79/214 (36%), Gaps = 24/214 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 VK ++ D+++++D S S+ + + ++ + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF------NFRLVRHFLENLVTAFDV---GSEKT 195 Query: 217 RSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R GL +S + L A+ + + E + L + T A N IF+ K E Sbjct: 196 RIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKPE 252 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ K I +TDG++ I +L + G ++AIGV+ ++ + Sbjct: 253 AGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQE 304 Query: 335 NCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + PD Y+V +H + + + + Sbjct: 305 IASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|326670666|ref|XP_003199265.1| PREDICTED: collagen alpha-3(VI) chain [Danio rerio] Length = 1455 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 66/191 (34%), Gaps = 20/191 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S +D L IR M+ + D+ VR +V +S + Sbjct: 823 DIVFLLDGS---DDSRNT----LLTIREFIRRMVLDLDIDQDI---VRVAVVQYSEDPLI 872 Query: 230 TFPL-AWGVQHIQEK-INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + IN L G A + D ++ H + + Sbjct: 873 HFLLNTYNSKKAVLYAINGLTAKGGRNINTG--AALQYVRDHVFTTASGSRHHLGVPQVL 930 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I +T G + D E L K G + AIG++ + SP +++ Sbjct: 931 IVMTGGRSIDQVADPAEDL------KNFGVLSIAIGIKNALESELQTIAFSPRFIFNLPV 984 Query: 348 SRKLHDAFLRI 358 S +L I Sbjct: 985 SGELLHIQPDI 995 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 68/218 (31%), Gaps = 26/218 (11%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 P + + S + D+ ++D S + F D + Sbjct: 601 MTPLITVVGETDTIEGAPTPGPSHGE--RDVAFLIDGSDDVRGDFPYIRDFISRV----- 653 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTT--KST 256 I+ + N VR +V S + F L + +N L + Sbjct: 654 -----IEPLDIGINKVRVSVVQHSDRPSPNFFLDTYQTKDEVLRAVNGLTLAGGRGLNTG 708 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L + N + ++ + +++I LT G + + +L K G Sbjct: 709 AALTFMKNTVLS----TARGSRAAQNVPQFLIVLTAGRSRDSVREPAVAL------KTEG 758 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 + + +GV+ + +P ++V+ +L+ Sbjct: 759 VVPFGVGVKNADPKEIEAISHNPSFAFNVKEFSQLNTV 796 >gi|332215868|ref|XP_003257064.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VII) chain-like [Nomascus leucogenys] Length = 2944 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 71/191 (37%), Gaps = 27/191 (14%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREVRSFLEGLVLPFSGAVSA-QGVRFATVQYSDD 87 Query: 227 IVQTFPL-AWGVQH-IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F L A G + I L + G T++ + + +++F L +A+ Sbjct: 88 PRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADRVF-----LPQLARPGIP- 141 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PD 340 K I +TDG++ + L K +G ++A+G++ A + LK AS D Sbjct: 142 -KVCILITDGKSQDLVDTAAQRL------KGQGVKLFAVGIK-NADPEELKRVASQPTSD 193 Query: 341 RFYSVQNSRKL 351 F+ V + L Sbjct: 194 FFFFVNDFSIL 204 >gi|258507439|ref|YP_003170190.1| pilus specific protein [Lactobacillus rhamnosus GG] gi|257147366|emb|CAR86339.1| Pilus specific protein, ancillary protein involved in mucus-adhesion, contains von Willebrand factor (VWF) domain [Lactobacillus rhamnosus GG] gi|259648793|dbj|BAI40955.1| putative cell surface protein [Lactobacillus rhamnosus GG] Length = 895 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 47/290 (16%), Positives = 87/290 (30%), Gaps = 43/290 (14%) Query: 21 TAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFS 80 T L V+ I+M + + + + D+ NG Sbjct: 8 TGHLFAVLLILMSM---LTGLVTSGSSVVTATDN----IRPTYQTDANGTYPTNSWQVTG 60 Query: 81 YRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC 140 + + N D + N + + + + S D + DY + + + Sbjct: 61 QQNVINQRGGDQVSGWDNN-TIWNGDATDTTNSYLKFGDPNNPDYQIRKYA--KETNTPG 117 Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 + N V D+++V+D+S SM + G ++ G ++ Sbjct: 118 LYDVYLNVKGNKQQNVKPV----------DIVLVVDMSGSMESNRW-GTNRAGAVRTGVK 166 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIV-----QTFPLAWGVQHIQEKINRLI------F 249 L I++ + N V GL+ FSS + G + + F Sbjct: 167 NFLTSIQNA-GLGNYVNVGLIGFSSPGYIGGKSGYISVKLGKAGNASQQQAINGALSPRF 225 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 T + GL + KK +I LTDG + N Sbjct: 226 QGGTYTQIGLRQGSAMLNADTSGN----------KKMMILLTDGVPTFSN 265 >gi|149051203|gb|EDM03376.1| coagulation factor C homolog (Limulus polyphemus) (predicted), isoform CRA_a [Rattus norvegicus] gi|169642483|gb|AAI60874.1| Coch protein [Rattus norvegicus] Length = 552 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 332 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 390 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG- 250 +I K+ + + V F+ Q ++ + +E + ++ Sbjct: 391 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYNTKENVLAVLANI 439 Query: 251 ----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 440 RYMSGGTATGDAISFTVRNVFGPIRD--------SPNKNFLVIVTDGQ----SYDDVRG- 486 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 487 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 518 >gi|86138567|ref|ZP_01057140.1| von Willebrand factor type A domain protein [Roseobacter sp. MED193] gi|85824627|gb|EAQ44829.1| von Willebrand factor type A domain protein [Roseobacter sp. MED193] Length = 472 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 66/188 (35%), Gaps = 34/188 (18%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 +L+ + D+ +VLD S SM G+ K+ +A I+++L+ + + + Sbjct: 3 TVLLAGMAAPLAAQDLSRS-TLVLDASGSM-WGQVDGVAKITIAQTVIQQLLETLPATQE 60 Query: 212 VNNVVRSGLVTF--------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 GL+ + S P A I + ++ T + + A Sbjct: 61 Q------GLMAYGHRRKGDCSDIEQLIAPAADTRDAIAAAVAKISPKGKTPISAAVRQAA 114 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYA 321 + + ++EK +I ++DGE + + + + G ++A Sbjct: 115 DALRHSEEKAT------------VIPISDGEETCG----LDPCAVGADLEASGVDFTLHA 158 Query: 322 IGVQAEAA 329 IG Sbjct: 159 IGFGIADD 166 >gi|134291855|ref|YP_001115624.1| hypothetical protein Bcep1808_6472 [Burkholderia vietnamiensis G4] gi|134135044|gb|ABO59369.1| conserved hypothetical protein [Burkholderia vietnamiensis G4] Length = 423 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 16/126 (12%), Positives = 43/126 (34%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + +G+++I+ + L V+ +GL ++ + +++L D L A + + Sbjct: 12 RRSLHRQRGAVAIVVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + + + ++ + N D T ++ K Sbjct: 72 ISLSVAEADGIAAGHVNFAFFQKSAVQMLTDSNVTFSDALTNPFLTKTAVSTPANVKYVK 131 Query: 127 LSAVSR 132 +A Sbjct: 132 CTATLS 137 >gi|118150796|ref|NP_001071310.1| cochlin precursor [Bos taurus] gi|75057908|sp|Q5EA64|COCH_BOVIN RecName: Full=Cochlin; Flags: Precursor gi|59857775|gb|AAX08722.1| coagulation factor C homolog, cochlin precursor [Bos taurus] Length = 550 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ + M K Sbjct: 330 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGESNFRLMLK 388 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-----NR 246 +I K+ + + V F+ Q ++ +E + N Sbjct: 389 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 437 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 438 SYMSGGTATGDAISFTVRNVFGPVRD--------SPNKNFLVIVTDGQ----SYDDVRG- 484 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 485 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 516 >gi|325108192|ref|YP_004269260.1| hypothetical protein Plabr_1627 [Planctomyces brasiliensis DSM 5305] gi|324968460|gb|ADY59238.1| protein of unknown function DUF1355 [Planctomyces brasiliensis DSM 5305] Length = 938 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 64/199 (32%), Gaps = 27/199 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 L + + LD S SM G K+ +A + I+ + + ++ S Sbjct: 395 KNRLALAIALDRSGSMTAPVSGGKTKMDLAN---LGTAECIRLLSPSDE---VAVIAVDS 448 Query: 226 KIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL I +++ + G L A N++ + D Sbjct: 449 TPHTIVPLTNVSNPDDIAQQVLGIQSMGGGIFVYEALVAAGNELMKS-----------DL 497 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCAS-- 338 K+II +D +S + + + G V IG+ A+ + L+ A+ Sbjct: 498 ATKHIILFSDAADSE---EPGAYRSLIKDYENAGITVSVIGLGTTADVDAKLLQEIATLG 554 Query: 339 PDRFYSVQNSRKLHDAFLR 357 Q+ +L F Sbjct: 555 SGNIMFTQDVAELPRLFTE 573 >gi|289177626|gb|ADC84872.1| Collagen adhesion protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 905 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 51/409 (12%), Positives = 118/409 (28%), Gaps = 83/409 (20%) Query: 19 ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKND 78 L ++L + + M L + ++ + AT + + Sbjct: 50 PLRSVLASLCAVAMSLGMASASVAAFADDRQPAA-TADPQAAT---ASAGNVDAPQHTKR 105 Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH----------KDYNLS 128 S + + Q + ++ + L + + +++ Sbjct: 106 ISKNDDGTYTLSMDVTGKSDESTEQQVVPLDIALVLDVSGSMNELSGKLVYNEVELLSMN 165 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 +S Y + ++ T + S+ + + S S++ Sbjct: 166 PISTYYVEKDGSYQAVRCSAISWGRCTTWQDQDSAGQKYTVTYNWIGGPSASVSPDVQFY 225 Query: 189 MDK------LGVATRSIREMLDIIKS----IPDVNNVVRSGLVTFSS------------- 225 K L ++ LD ++ I D V+ L+ ++ Sbjct: 226 KSKQSEETRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNE 285 Query: 226 ------KIVQTFPLAWGVQHIQEK---INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 LAW + +Q++ +N L G T++ GL++A ++ + Sbjct: 286 DGYNYNYSQTVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA--- 342 Query: 277 AKGHDDYKKYIIFLTDGENSSP----NIDNKESLFYCNEAKRRGAIVYAIGVQ------- 325 +K +F +DG +S ++ + K + V +IG Sbjct: 343 -------QKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSG 395 Query: 326 AEAADQFLKNCAS----------------PDRFYSVQNSRKLHDAFLRI 358 + A++F+ +S +Y+V L F I Sbjct: 396 TDNANKFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEI 444 >gi|237728578|ref|ZP_04559059.1| TerY3 [Citrobacter sp. 30_2] gi|226910056|gb|EEH95974.1| TerY3 [Citrobacter sp. 30_2] Length = 346 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 54/168 (32%), Gaps = 14/168 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L T ++ ++ ++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESMIGE------NLKKMTDGLQMIVGDLRKDPHALETAWVSVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + RL G T L +I K H AKG + Sbjct: 57 RTIVPL---HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 LTDG P D + + R + A+G+ A L+ Sbjct: 112 YLLTDG---RPTDDTTAEVKRWKDHYARKVNLIAVGLGPSADLNILRQ 156 >gi|239814531|ref|YP_002943441.1| hypothetical protein Vapar_1524 [Variovorax paradoxus S110] gi|239801108|gb|ACS18175.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 409 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 29/73 (39%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 G++ I A++L + MG+ ++ F VK +L LD L A ++ + Sbjct: 10 QSGAVIITVALVLLFLLGFMGIALDFGRLFIVKTELQTALDSCALSAAQELDGAGDALTR 69 Query: 73 KKQKNDFSYRIIK 85 + + K Sbjct: 70 ATSAGKTAADLNK 82 >gi|194228584|ref|XP_001915384.1| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Equus caballus] Length = 1313 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 57/167 (34%), Gaps = 12/167 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP--DVNNVVRSGLVTFSSKIV 228 ++ V++V SM FG M + A I L ++ V S + Sbjct: 283 VVFVIEVGGSM---FGIKMKQTKKAMNVILGDLQANDYFNIISFSDTVSVWKAGCSIQAT 339 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 V ++ + + G T L A + ++ + + II Sbjct: 340 IQ-----NVYDAKDYLGHMEAGGWTDINTALLAAASVLYPSNXEPGRGPSVGRIS--LII 392 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 FLTDGE ++ + L +A ++++ +A L++ Sbjct: 393 FLTDGEPTADMMTPSVILSNILQALGNRVNLFSLVFWDDADFPLLRH 439 >gi|172065275|ref|YP_001815987.1| hypothetical protein BamMC406_5998 [Burkholderia ambifaria MC40-6] gi|171997517|gb|ACB68434.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 423 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 36/99 (36%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + +G+++I+ + L V+ +GL ++ + +++L D L A + + Sbjct: 12 RRGLHRQQGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI 105 + + + Q + + N D Sbjct: 72 ISLQVAEADGIAAGHANFAFFQQNAVQMQTDSNVTFSDS 110 >gi|6680956|ref|NP_031754.1| cochlin precursor [Mus musculus] gi|311771523|ref|NP_001185764.1| cochlin precursor [Mus musculus] gi|12644458|sp|Q62507|COCH_MOUSE RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801415|gb|AAC39949.1| Coch-5B2 gene product [Mus musculus] gi|26324626|dbj|BAC26067.1| unnamed protein product [Mus musculus] gi|28277390|gb|AAH45137.1| Coagulation factor C homolog (Limulus polyphemus) [Mus musculus] gi|74178965|dbj|BAE42713.1| unnamed protein product [Mus musculus] gi|74209551|dbj|BAE23310.1| unnamed protein product [Mus musculus] Length = 552 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 332 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 390 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG- 250 +I K+ + + V F+ Q ++ + +E + ++ Sbjct: 391 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYNTKENVLAVLANI 439 Query: 251 ----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 440 RYMSGGTATGDAIAFTVRNVFGPIRD--------SPNKNFLVIVTDGQ----SYDDVRG- 486 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 487 -PAAAAHDAGITIFSVGVAWAPLDDLRDMASKP 518 >gi|327270790|ref|XP_003220171.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 866 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 72/212 (33%), Gaps = 41/212 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S M ++L ++ + L I + G+VTF+SK Sbjct: 315 VCLVLDASAQMGKD-----NRLNRLIQAAKLFLLHI-----IEKGSWVGIVTFNSKGNIQ 364 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L Q + + G+ A+ E Sbjct: 365 AGLQKIFSDIERQSLTSHL-PTTAAGDCNICEGVNAAFQVFSQKLTSTEGCE-------- 415 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFY 343 I+ LT+GE S L C ++ + + I++ I ++A+++ K + + + Sbjct: 416 -IVLLTNGEGSD--------LSPCLSKIQSQEIIIHTIAFGSKASNELEKLADMTGGKTF 466 Query: 344 SVQNSRK---LHDAFLRIGK---EMVKQRILY 369 +S L DAF I + +Q I Sbjct: 467 YATDSLDSNGLIDAFGGISSGSGDASQQSIQL 498 >gi|72168566|ref|XP_796840.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115961659|ref|XP_001187264.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 958 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 65/194 (33%), Gaps = 28/194 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM + D+L ++ + L +++ G+ FS Sbjct: 320 VALVLDVSGSMGGN-----DRLTKLNQAATQYLRYT-----IDDGSFVGIAHFSDYSRII 369 Query: 231 FPLAWGVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L + +E + I T G+ + D YI Sbjct: 370 ENLTEITDNSREDLVMGLPSIANGPTCIGCGVLDGIKILKGETGME-------DPAGGYI 422 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSV 345 + ++DG+ + ++ +E + G IV I A L+ + F+ Sbjct: 423 LLISDGQQNRQPYIDEVF----DEVEEAGVIVDTIAFSDAADPNLLELSVRTNGLGFFYP 478 Query: 346 QNSRK--LHDAFLR 357 + L+DAF Sbjct: 479 DTATSTALNDAFTA 492 >gi|114707528|ref|ZP_01440424.1| von Willebrand factor, type A [Fulvimarina pelagi HTCC2506] gi|114537087|gb|EAU40215.1| von Willebrand factor, type A [Fulvimarina pelagi HTCC2506] Length = 584 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 67/221 (30%), Gaps = 30/221 (13%) Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS--KSDIGLDMMMVLD 176 D +H ++SA +P+ + P+++T S ++ + V+D Sbjct: 345 DPRHSSLDISATLTAALPWQRFRRQRFPRLTDRPVILTPSDIRIRRLQAKRETATIFVVD 404 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQTFPLAW 235 S S + +L A +I +L R +V F P Sbjct: 405 ASGS------AALARLAEAKGAIERIL-----AECYRRRDRVAMVAFRGTSAETLLPETK 453 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + L G T G+ A D + E + I+FLTDG+ Sbjct: 454 SLTRARRALAGLSAGGGTPLASGIALA----GDLARQCERQERTPL-----IVFLTDGKA 504 Query: 296 S-------SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + + ++ + +G I Sbjct: 505 NITLDGMAGRSSAREDVNTQATVLRSQGYASLVIDFSLRPG 545 >gi|47228041|emb|CAF97670.1| unnamed protein product [Tetraodon nigroviridis] Length = 1009 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 70/194 (36%), Gaps = 41/194 (21%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI---PDVNNVVRSGLVTF 223 LD++ V+D S S+ G+ + + ++ + S+ P R G+V + Sbjct: 706 GALDIVFVIDSSESV------GLTNFTLEKNFVINTINRLGSLAKDPKSETGTRVGVVQY 759 Query: 224 SSKIVQTF-----PLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 S P + +E + ++ T + L+YAY+ + + + Sbjct: 760 SHSGTFQAIRPDDPKIDSLTSFKEAVKQMEWIAGGTWTPSALKYAYDNLIRDSRRAKASV 819 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 ++ +TDG P D+ + C++ K V AIG+ Sbjct: 820 S--------VVVITDGR-FDPRDDDSLLTYLCSDPK---VDVNAIGIG------------ 855 Query: 338 SPDRFYSVQNSRKL 351 D FY V+ + L Sbjct: 856 --DMFYQVEENEIL 867 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 69/229 (30%), Gaps = 27/229 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 L + + + +++ +D S ++ P + E + + + Sbjct: 35 LADGQTQCVVQDECNIEVYFTIDTSETIALQESPPGSLVESIKDFTIEFVKRLAD-EEYR 93 Query: 214 NVVR----SGLVTFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNK 265 VR G + FS + L G + + IN + G T L + Sbjct: 94 GAVRLSWKMGGLHFSQEQRVFSRL--GTK--AQFINGISGIRYLGKGTYIDCALTNMTQE 149 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-CNEAKRRGAIVYAIGV 324 + + ++ + +TDG + + + A+ G ++A+ Sbjct: 150 MTQSP--------SPFKPLRFAVVITDGHVTG---NPCGGIKVSAERARDAGIRIFAVAA 198 Query: 325 QAEAADQFLKNCA-SPDRFYSVQNSR-KLHDAFLRIGKEMVKQRILYNK 371 + ++ A SP Y L I + + + I K Sbjct: 199 SRNIDETGMREIANSPAMVYRDDFMAVDLSQGRPIIHSQTIDRIIQTMK 247 >gi|297265178|ref|XP_002799142.1| PREDICTED: collagen alpha-3(VI) chain [Macaca mulatta] Length = 2568 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 46/295 (15%), Positives = 103/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 924 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRAELQTITNDP 983 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + T + ++ AP + + + D++ +LD S Sbjct: 984 RLVFTVREFRELPNIEERIMTSFGTSAATPAPPGVATPSPSRPEKKKA-DIVFLLDGS-- 1040 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1041 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1093 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1094 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1152 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|223938464|ref|ZP_03630357.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] gi|223892883|gb|EEF59351.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] Length = 723 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 70/197 (35%), Gaps = 35/197 (17%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 + +++ ++ ++M+ VLD S SM+ G + + A R + L S Sbjct: 378 GYFTMMLYPPKELGQLGRAPMEMVFVLDCSGSMS---GEPIAQAKAAIRHALKQLQPGDS 434 Query: 209 IPDVNNVVRSGLVTFSSKIVQ--TFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAY 263 ++ FS Q PL ++I++ + L T+ G++ A Sbjct: 435 FQ---------IINFSEHASQLGAKPLEATPENIRKGLAYVEALNSDGPTEMIEGIKAAL 485 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + D + +++ FLTDG I N+ + + + +++ G Sbjct: 486 DFPHDPERL------------RFVCFLTDGF-----IGNEAEILAAVHERIGASRIFSFG 528 Query: 324 VQAEAADQFLKNCASPD 340 V L + A Sbjct: 529 VG-SCNRYLLDHLAKMG 544 >gi|297300028|ref|XP_002805526.1| PREDICTED: collagen alpha-1(XIV) chain-like [Macaca mulatta] Length = 1717 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 82/215 (38%), Gaps = 26/215 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 VK ++ D+++++D S S+ R +R L+ + + + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFDVGSEK 194 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GL +S + L A+ + + E + L + T A N IF+ K Sbjct: 195 TRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKP 251 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E ++ K I +TDG++ I +L + G ++AIGV+ ++ Sbjct: 252 EAGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQ 303 Query: 334 KNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + PD Y+V +H + + + + Sbjct: 304 EIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 79/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ +D+F + L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDDNFNKIISFLYSTVGALNKI---------GTDGTQVAMVQFTDDPR 1082 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 1083 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDSLFTA-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 >gi|291540134|emb|CBL13245.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Roseburia intestinalis XB6B4] Length = 250 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 57/171 (33%), Gaps = 24/171 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR-EMLDIIKSIPDVNNVVRSGLVTFSSK 226 + V+D S SM + M+ + A EM + P+ ++R + FSS Sbjct: 15 PIHFFWVVDCSGSM--SYDGKMEVVNTAIEECIPEMASAADNNPNAQLLIR--ALQFSSG 70 Query: 227 IVQTFPLAWGVQH---IQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +W ++E N L T+ E ++ Sbjct: 71 A------SWLTAKPVPVEEYSWNPLEANGVTEMGKAFELLAAQLSIP-------PMPTRA 117 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 I+ L+DG+ + D L K+ + AI + ++A D L Sbjct: 118 LPPVIVLLSDGQPTDSYKDGLNKLKALPWFKKA-VKI-AISIGSDADDDVL 166 >gi|221116226|ref|XP_002166419.1| PREDICTED: similar to collagen, type XXVIII [Hydra magnipapillata] Length = 680 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 81/252 (32%), Gaps = 37/252 (14%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT- 156 ENG + + +E + + + + P +F N+S Sbjct: 41 ENGTTKSSSKVEPKPPVK-NNAGANSSSSEDYDTTQGYPILFPETHMKRNTSANISTSEP 99 Query: 157 -----SSVKISSKSDIGLD----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 +S ++ S + L+ +D+ ++ + + ++ + + K Sbjct: 100 IDDDVTSEEVYSNGSVLLEERDKCSKTVDLLFLLDSSESVKYSNWKIVIQFVKSLCNRFK 159 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYN 264 + R G++ ++S L I+ + + T++ L+ A Sbjct: 160 L-----STTRVGIIRYASDAEIALHLTRFNDTTSRDTAIDNIFYKTGGTRTDIALKKA-- 212 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-------YCNEAKRRGA 317 + + + + +I +TDG +S I+ + + K G Sbjct: 213 ---------ADVFQFSEQKNQVLILVTDGPTNSLEINKDHFVEGKDLVAGPVDRLKDAGV 263 Query: 318 IVYAIGVQAEAA 329 ++ IG+ ++ Sbjct: 264 AIFCIGIVPDSE 275 >gi|118347390|ref|XP_001007172.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288939|gb|EAR86927.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 669 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/135 (22%), Positives = 48/135 (35%), Gaps = 19/135 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTFSS 225 ++ ++D SLSM K G S LD++K +N R LV FS+ Sbjct: 35 IVCLVDGSLSMGSKLVIHQ-KNGGKKESDMTTLDLVKHTVKTIASSLNPQDRLALVGFST 93 Query: 226 KIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F L G +I+++ G T GL+ + I +++ Sbjct: 94 HSKIYFELTEMDDQGKNVAFTEIDKMWAGGQTNIWGGLQDSLEVIKKGFRPNQNVC---- 149 Query: 282 DYKKYIIFLTDGENS 296 I TDG + Sbjct: 150 -----IFLFTDGRPT 159 >gi|307196324|gb|EFN77937.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Harpegnathos saltator] Length = 2255 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 62/209 (29%), Gaps = 47/209 (22%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++M ++D S S+ + ++L R ++T Sbjct: 62 RNKTNQIELMFLVDASGSVGAE------NFRSELNFVTKLLSDFTV---DATAARVAIIT 112 Query: 223 FSSKIVQTFPLAWGVQHIQEK---------INRLI-----FGSTTKSTPGLEYAYNKIFD 268 F S + V I +N+ G T + L A + Sbjct: 113 FGS----PRNVTRNVDQISRHGGNDHKCYLLNKQFNEIAYSGGGTYTRGALLEALAILEK 168 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 ++E + K + +TDG ++ + N K GA V+ G++ Sbjct: 169 SRE----------EASKVVFLITDGFSNGG-----DPRPAANLLKNAGATVFTFGIRTGN 213 Query: 329 ADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 ++ + P L D+F Sbjct: 214 VEELHDIASFPGY-----THSYLLDSFAE 237 >gi|301622626|ref|XP_002940637.1| PREDICTED: integrin alpha-L-like [Xenopus (Silurana) tropicalis] Length = 1031 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 80/205 (39%), Gaps = 29/205 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+ ++D S SM + + + M ++IKS+ + + V + FS+ Sbjct: 158 KAEVDLCFMVDGSSSMGEIEVNIVKEF---------MKNVIKSLENETS-VHFAAIQFST 207 Query: 226 KIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F A N + T + ++Y ++IF K Sbjct: 208 HPKTEFTFADFQKDRNPDVLLANYRLLKGFTNTYKAIQYTLDRIFTEK------YGSRPS 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---P 339 KK ++ L DGE++ D +++ ++A+ Y IGV + L+ S Sbjct: 262 AKKVLVILADGESTD--DDTTKAIEKADKAR---VSRYIIGVGQNFKTEDLEAFVSWPAK 316 Query: 340 DRFYSVQ--NSRKLHDAFLRIGKEM 362 + +++ ++++L F + +++ Sbjct: 317 EHTRTIEKFDAQQLTILFAELQRKI 341 >gi|301113718|ref|XP_002998629.1| alpha kinase [Phytophthora infestans T30-4] gi|262111930|gb|EEY69982.1| alpha kinase [Phytophthora infestans T30-4] Length = 748 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/245 (16%), Positives = 87/245 (35%), Gaps = 31/245 (12%) Query: 45 KAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQ----TDFRNELREN- 99 + K+ + + +AT+ ++ + K + + + + R + + Sbjct: 3 REKIQTM--KAAGASATETTESDDVHKTKSASSIRADAAEAELAKLREQMSKRKFFKFSC 60 Query: 100 --GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 G+ + ++ + + A E+ I + Sbjct: 61 KIGWCSHGGDCRSGFEENVELLKRR-----LAAVEREIALIEHFSVIRFEMERRIQKVVK 115 Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +++ LD+++V+D + SM GP +D++ A SI + ++ K P N VR Sbjct: 116 EIQLKYAKANALDLVIVMDCTGSM----GPWIDEVKSAIVSIID--NVKKDHPSAN--VR 167 Query: 218 SGLVTFSS-----KIVQTFPLAWGVQHIQEKINRLIFGSTT----KSTPGLEYAYNKIFD 268 G V + K +Q + L V +++ I RL GLE A F+ Sbjct: 168 VGFVAYRDFCDGVKRLQVYHLTSDVAAVRKFIFRLAAFGGGDGPEDIPGGLEAALAMPFN 227 Query: 269 AKEKL 273 A+ + Sbjct: 228 AQARR 232 >gi|74224199|dbj|BAE33710.1| unnamed protein product [Mus musculus] Length = 552 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 332 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 390 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG- 250 +I K+ + + V F+ Q ++ + +E + ++ Sbjct: 391 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYNTKENVLAVLANI 439 Query: 251 ----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 440 RYMSGGTATGDAIAFTVRNVFGPIRD--------SPNKNFLVIVTDGQ----SYDDVRG- 486 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 487 -PAAAAHDAGITIFSVGVAWAPLDDLRDMASKP 518 >gi|146276888|ref|YP_001167047.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] gi|145555129|gb|ABP69742.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] Length = 563 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 13/75 (17%), Positives = 33/75 (44%), Gaps = 4/75 (5%) Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCAS--PDRFYSVQNSRKLH 352 + + + + CN AK ++ I +A A + ++NC++ ++ + ++ Sbjct: 488 TPISTMDAQLQALCNLAKSNNVTIFGIAFEAPANGKTQIQNCSTSRSSHYFDA-SGLEIQ 546 Query: 353 DAFLRIGKEMVKQRI 367 AF I ++ R+ Sbjct: 547 TAFRAIASQISYLRL 561 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 82/227 (36%), Gaps = 36/227 (15%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F + +GSI + +L ++ ++ G ++ + L +D L Sbjct: 28 RFARSEEGSILVFGLMLFILMLMLGGFAVDVMSFEAKRTDLQQAVDRCA-------LTAA 80 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + + ++K ++ +++I D+ +Y Sbjct: 81 ALAQTRDPEEVVEDCMLK----------------------AGKADYVTLIDHDEGLNYR- 117 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFG 186 V + P T P A+ L + + + ++++MVLDVS SM D + Sbjct: 118 EVVVTAQQP----TKPLFAHMLGIDSLTAPAATKAEQKVTNVEIVMVLDVSGSMVRDSYS 173 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 DKL + +E +D + D+N+ + +V ++ ++ L Sbjct: 174 RPTDKLKNLKAAAKEFVDTM-LAKDLNHRISIAIVPYNGQVNLGKSL 219 >gi|290981305|ref|XP_002673371.1| predicted protein [Naegleria gruberi] gi|284086954|gb|EFC40627.1| predicted protein [Naegleria gruberi] Length = 353 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 61/168 (36%), Gaps = 31/168 (18%) Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + I ++ R G+V F K PL + ++E + ++ +T G++ Sbjct: 18 EIIDNLREFERLGIVLFDDKAETFLPLTIVQDLDKKSLKETVLKIKEKGSTNFEAGMQRG 77 Query: 263 YNKIFD-AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + L + + II+LTD + D + L +A +++ Sbjct: 78 IDLFSTLDSSDLSNSNR--------IIYLTDACPNVGGTDTLDILT--KDANSGPYNIFS 127 Query: 322 --IGVQAEAADQF------LKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 IG+ + ++ C ++SV+++ F +I + Sbjct: 128 TFIGIGLDFNSDIVEELTQVRGC----NYFSVKSTED----FTKILNQ 167 >gi|89072370|ref|ZP_01158949.1| hypothetical protein SKA34_06340 [Photobacterium sp. SKA34] gi|89051902|gb|EAR57354.1| hypothetical protein SKA34_06340 [Photobacterium sp. SKA34] Length = 683 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 55/162 (33%), Gaps = 24/162 (14%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P T W VK+ + ++ ++V+D+S SM K Sbjct: 54 PIAILTLGWLIAVVAMAGPSWKKVKLPA-YNLSGARVLVMDMSRSM----YATDIKPNRL 108 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----S 251 T++ + LD++ + + +GLVT++ + PL + I L Sbjct: 109 TQARFKALDMLPGWKEGS----TGLVTYAGDGYEVSPLTEDSHTLANLIPSLSPKIMPIP 164 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + G+ A + A II +TDG Sbjct: 165 GSNAAAGIAEAIKLLKQAGNSTGD-----------IILVTDG 195 >gi|292491521|ref|YP_003526960.1| hypothetical protein Nhal_1422 [Nitrosococcus halophilus Nc4] gi|291580116|gb|ADE14573.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4] Length = 398 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 25/64 (39%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G +L I + I + GL ++ H + K +L LD + L A + + + Sbjct: 11 QRGVTMVLFTIGMVAIIGMAGLALDMGHAYLNKTRLQNALDAAALSGAKVLNDMHDVGQA 70 Query: 73 KKQK 76 Sbjct: 71 TAAA 74 >gi|220924567|ref|YP_002499869.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] gi|219949174|gb|ACL59566.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] Length = 407 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 10/58 (17%), Positives = 28/58 (48%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 + F + G+++++ I+LPV+ +V++ ++ + +L D + L + Sbjct: 5 LARFRADADGTMTMIVGIVLPVLLGTAAVVLDGANLHLSQLRLQNAADSAALGAVQVL 62 >gi|146162207|ref|XP_001008983.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|146146475|gb|EAR88738.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 790 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 51/141 (36%), Gaps = 25/141 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---- 225 D+M+VLD + SM + ++I +++ +K + + R G+V + Sbjct: 42 DLMIVLDCTGSM-------QSWISACKQNILKIIQNVKQQFQGSEL-RVGIVAYRDHCDT 93 Query: 226 -KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 +I + +++++ I+ L L A + E Sbjct: 94 QRIEMYPFINQDYENLEKFISSLKATGGGDEPEDLAGALETAINLFEWKSQA-------- 145 Query: 285 KYIIFLTD----GENSSPNID 301 K +I +TD G+ + D Sbjct: 146 KTMIIITDAPCHGDTYHNSND 166 >gi|125829706|ref|XP_696164.2| PREDICTED: collagen alpha-2(VI) chain [Danio rerio] Length = 1015 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 64/184 (34%), Gaps = 31/184 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD---IIKSIPDVNNVVRSGLVTF 223 LD++ V+D S S+ G+ + + ++ I P R G+V + Sbjct: 606 GALDIVFVIDSSESV------GLTNFTLEKNFVINTINRLGSIAKDPSSETGTRVGVVQY 659 Query: 224 SSK-IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 S Q L ++ + +L T + L++AY+ + + + Sbjct: 660 SHNGTFQAIRLNDSKIDSMSAFKDAVKKLEWIAGGTWTPSALKFAYDNLIRDSRRSKANV 719 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFL 333 ++ +TDG P D+K + C V AIG ++ L Sbjct: 720 T--------VVVITDGR-YDPRDDDKLLNYLCT---DTSIDVNAIGIGDMFDQPEENESL 767 Query: 334 KNCA 337 K+ A Sbjct: 768 KSIA 771 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 69/194 (35%), Gaps = 18/194 (9%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D++ +LD S M + ++ L + + D N R L+ + Sbjct: 824 QRPVDLVFMLDGSERMGQENFRYAREF---VENVANRLTLAQGDDDERNA-RVALLQYGD 879 Query: 226 KIVQT--FPLAWGVQHIQEKINRLIFGSTT-KSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + F L I + + + + +T G+ YA N I ++ Sbjct: 880 ENQHQLAFKLTNNFTVIADGLANMRYLDSTSNVGSGIIYAINNIVTSRGTRLARRNAELS 939 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR- 341 + +F+TDG S+ N++ S + G I + + + L + D+ Sbjct: 940 F----VFITDGVTSNKNLEEGISAMR----RAEGVPT-VIAMSNDVDKEILAKISLGDQT 990 Query: 342 -FYSVQNSRKLHDA 354 + ++ +L+ A Sbjct: 991 AVFRGEDFARLNKA 1004 >gi|15897956|ref|NP_342561.1| hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|284173931|ref|ZP_06387900.1| hypothetical protein Ssol98_04630 [Sulfolobus solfataricus 98/2] gi|13814281|gb|AAK41351.1| Hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|261602668|gb|ACX92271.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 380 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 75/190 (39%), Gaps = 33/190 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G ++ LD S SM + K+ +A + +++ K IP+ N +TFS Sbjct: 34 SATGFHYIVALDTSGSMTGY------KIELAK---QGAIELFKRIPNGNK---VSFITFS 81 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S + + +I ++ G T + A +AK + Sbjct: 82 SNVNVIKEFV-DPLDLTNEILQITAGGQTALYTAILTA-----------NSLAKKYQMPT 129 Query: 285 KYIIFLTDGENSS-PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 Y++ LTDG + NI N L Y + VY+ G+ + +Q L++ + + Sbjct: 130 -YLLLLTDGNPTDETNIGNYLKLPYYEK-----IQVYSFGIGDDYNEQLLQSVSDKTGGV 183 Query: 342 FYSVQNSRKL 351 Y + ++ ++ Sbjct: 184 MYHISDANEI 193 >gi|290975425|ref|XP_002670443.1| vWFA domain-containing protein [Naegleria gruberi] gi|284084002|gb|EFC37699.1| vWFA domain-containing protein [Naegleria gruberi] Length = 348 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 60/188 (31%), Gaps = 35/188 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF- 223 S+ +D+++V+D + SM+ D + ++ + +R V++ Sbjct: 32 SEKLVDLVIVMDNTSSMSGEIKIAKDTVQSIITTLHDHFQS---------DLRFSAVSYR 82 Query: 224 ---SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +V+ +P V+ + +N + L A + + + Sbjct: 83 DHTDDYVVKEYPFTKNVEKAKGYVNEMFAKGGGDMPEALASALKVVNEIPFNKKGR---- 138 Query: 281 DDYKKYIIFLTD------GENSSPNIDN--------KESLFYCNEAKRRGAIVYAIGVQA 326 K I++ D G + D + + + + + + Y+I Sbjct: 139 ----KICIWIADAPPHGMGASGDYFPDGCKDEQGEIIDWIKLASHLQEKNVVFYSIICGR 194 Query: 327 EAADQFLK 334 DQ L Sbjct: 195 SKNDQQLS 202 >gi|320170832|gb|EFW47731.1| predicted protein [Capsaspora owczarzaki ATCC 30864] Length = 1062 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 62/186 (33%), Gaps = 44/186 (23%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ + K+ SK ++V+D S SM+ G + A S+ + + P Sbjct: 96 PVNTIGAPKVQSK------CIIVIDKSGSMS---GSPI----KAVASVLAYVAKQGADPT 142 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 V +TF ++ + + L G T L N K Sbjct: 143 V--------ITFDTRAE--------TSTLSNVLPNLQAGGGTSFCAALNAISNACMPHKG 186 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK---RRGAIVYAIGVQAEA 328 + ++F+TDG+++ + + K R +V +G EA Sbjct: 187 TIS------------VVFMTDGQDTDMSSLPAAKANFQRMVKSSPDRSVVVSTVGFGGEA 234 Query: 329 ADQFLK 334 FL+ Sbjct: 235 NLAFLR 240 >gi|312886234|ref|ZP_07745848.1| protein of unknown function DUF58 [Mucilaginibacter paludis DSM 18603] gi|311301259|gb|EFQ78314.1| protein of unknown function DUF58 [Mucilaginibacter paludis DSM 18603] Length = 291 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 68/189 (35%), Gaps = 21/189 (11%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 K + + + E++ G + + + E ++ + S V Y++ T W Sbjct: 4 KELLKKVRKIEIKTRGLSNHLFSGEYHSAFK------GRGMAFSEVREYQIGDEIRTIDW 57 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + L +M+++DVS S ++FG + + + L Sbjct: 58 NVTAR-----FNHPYVKVFDEERELTVMILMDVSGS--ENFGT----INQQKQDVATELC 106 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLE 260 + + + N + G++ FS KI + P G H I+E IN T L+ Sbjct: 107 AVLAFSAIQNNDKVGVIFFSDKIEKFIPPKKGRSHILMIIRELINFTPENKGTNVAEALK 166 Query: 261 YAYNKIFDA 269 Y I Sbjct: 167 YFTGAIKKK 175 >gi|296331311|ref|ZP_06873783.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305676297|ref|YP_003867969.1| hypothetical protein BSUW23_18125 [Bacillus subtilis subsp. spizizenii str. W23] gi|296151426|gb|EFG92303.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305414541|gb|ADM39660.1| putative exported protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 227 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 62/213 (29%), Gaps = 34/213 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV--RSGLVTFSSKIV 228 + ++LD S SM G+ K A + I + + V V G S K+ Sbjct: 37 VAVLLDASGSMAKRID-GVSKFNSAKKEISKFASSLPEGTQVKMSVFGSEGNNKNSGKVQ 95 Query: 229 QTFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + ++ Q +N + T L A + Sbjct: 96 SCESIRNVYGFQSFNEQSFLNSLNAIGPTGWTPIAKALNEA----------KSSFDQLDT 145 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCA--S 338 +K + LTDGE + + E + V IG E L A Sbjct: 146 KGEKVVYLLTDGEETCGG----NPIKTAKELHKDNITVNVIGFDYKEGYKGQLNAIAKVG 201 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 ++ + + F +Q ++ +K Sbjct: 202 GGEYFPAYTQKDVEKIF-------TQQSLMLSK 227 >gi|332214177|ref|XP_003256207.1| PREDICTED: collagen alpha-1(XIV) chain [Nomascus leucogenys] Length = 1796 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 79/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ +D+F + L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDDNFNKIISFLYSTVGALNKI---------GTDGTQVAMVQFTDDPR 1082 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 1083 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 82/215 (38%), Gaps = 26/215 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 VK ++ D+++++D S S+ R +R L+ + + + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFDVGSEK 194 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GL +S + L A+ + + E + L + T A N IF+ K Sbjct: 195 TRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKP 251 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E ++ K I +TDG++ I +L + G ++AIGV+ ++ Sbjct: 252 EAGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQ 303 Query: 334 KNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + PD Y+V +H + + + + Sbjct: 304 EIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|309358766|emb|CAP33788.2| hypothetical protein CBG_15420 [Caenorhabditis briggsae AF16] Length = 876 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 66/186 (35%), Gaps = 32/186 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++++ D S + F + + I++ +P + VR G+V +S + Sbjct: 33 LDIIILFDTSGGNDTVFE----------QQKNWTIKIVRDLPVHEDAVRVGIVQYSERND 82 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + + G T++ L A ++IFD A + II Sbjct: 83 IITHLET---------LKFMPGEDTRTGVALSKADDEIFDYDGGARLKAT------RLII 127 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDRFYSVQN 347 TDG + + +R+G +Y I V + + L D + + Sbjct: 128 VFTDGLSMD------KPTLAAKALRRKGVKIYTISVNSIGFVPEMLGIVGDADNVFGPTD 181 Query: 348 SRKLHD 353 ++ + Sbjct: 182 ENRIEE 187 Score = 37.5 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 45/322 (13%), Positives = 99/322 (30%), Gaps = 50/322 (15%) Query: 57 LYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 L T +++ + +K + I N ++ + + I S S Sbjct: 570 LQTFSEVTAFPTLLSMEKAVSRVVTLPINNAVTALGVSKPIKKKIRRIIKRRRNILSSSK 629 Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 K + V P I T + ++SV+ +D++ V+D Sbjct: 630 AKKVTKKVLEIKRVPTPTTPPIRSTLRPLIIEEKEEEIFSASVQCP------MDILFVVD 683 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 S S+ + D L +IK + + R GL+ F+ +Q ++ Sbjct: 684 SSGSIARTYDTQKDYLT----------QLIKKVEPSRSH-RVGLIQFAGPHIQKMEWSFD 732 Query: 237 VQHIQEKI---------------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++ + TT LE + + ++ E Sbjct: 733 THSKNSQLLSAIRSVRHLTDFASSNFSCPGTTYIGAALELSLILLDSRRKHTETT----- 787 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR--------GAIVYAIGVQAEAADQFL 333 +I ++DG I K F +++ ++ +YAI + ++L Sbjct: 788 -----VILISDGSTEEDFIILKSFRFSQDDSTQQAKLLRQLPNVKMYAISLNKLTNTKYL 842 Query: 334 KNCASPDRFYSVQNSRKLHDAF 355 + + + + + + F Sbjct: 843 TDIVGDRKNLFINDESQWFEEF 864 >gi|256833662|ref|YP_003162389.1| von Willebrand factor type A [Jonesia denitrificans DSM 20603] gi|256687193|gb|ACV10086.1| von Willebrand factor type A [Jonesia denitrificans DSM 20603] Length = 332 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 46/130 (35%), Gaps = 12/130 (9%) Query: 169 LDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD+ V+D + SM + + +L + ++ R +++F S Sbjct: 75 LDVYFVVDRTGSMAAEDYDGNKPRLTGVRNDMTTLMADFSGA-------RFSIISFDSTA 127 Query: 228 VQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + PL + + I L T + L+ ++ A + + + Sbjct: 128 SRQMPLTTDTRALSGWITNLNQEITYYSAGSSLDRVREELSMALRQGATRNPDNQ---RI 184 Query: 287 IIFLTDGENS 296 + TDGEN+ Sbjct: 185 VYLFTDGENT 194 >gi|198415004|ref|XP_002120336.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 652 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 47/232 (20%), Positives = 82/232 (35%), Gaps = 32/232 (13%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLD 204 +++ K ++ I ++++DVS SM+ G L + I +D Sbjct: 29 PTLKEIEVVMELEAKGKPEAGIFNRFVLLIDVSGSMDHTADGQSCTLLDRMKKFIELFID 88 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLE 260 + ++ G+VTFSS L + + L S T + GL Sbjct: 89 ------NASDTSWIGIVTFSSTANVIMELKQMTAEAKVFAKTTVLSLTTESRTNISAGLF 142 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN----KESLFYCNEAKRRG 316 A I K D II TDG + +D+ +E +++ Sbjct: 143 MALEVI--------QKLKPSRDC---IIVFTDGVANEGIVDSGTLIQEYKRISLQSRETN 191 Query: 317 AIVYAIGVQAEAADQFLKNCAS---PDRFY--SVQNSRKLHDAFLRIGKEMV 363 + AI V+ FL A+ D FY ++ ++ + +I KEM Sbjct: 192 IPISAITVEG-YKPDFLYAIATELGSDAFYCLNINSAFEADMMIPQITKEMT 242 >gi|77359583|ref|YP_339158.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas haloplanktis TAC125] gi|76874494|emb|CAI85715.1| conserved protein of unknown function ; putative Inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 664 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 71/180 (39%), Gaps = 34/180 (18%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + + + + V+D S SM+ + A ++ L ++ S N ++ Sbjct: 316 ATQRLARETVFVVDTSGSMHGQ------SMEQAKNALFYALSLLDSNDSFN------IIG 363 Query: 223 FSSKIVQTF--PL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F + + PL + ++ + I L T+ L + + D + Sbjct: 364 FDNVVTLMSDKPLVASGFNLRRAERFIYGLQADGGTEIQGAL----DAVLDGSQFDG--- 416 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + + +IFLTDG ++ N+++LF +AK + ++ +G+ + F++ A Sbjct: 417 -----FVRQVIFLTDG-----SVSNEDALFKSIQAKLGDSRLFTVGIGSAPNSFFMRRAA 466 >gi|262091908|gb|ACY25457.1| putative von Willebrand factor type A domain-containing protein [uncultured microorganism] Length = 667 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/248 (15%), Positives = 78/248 (31%), Gaps = 29/248 (11%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT------SSVKISSKSDIGLDMMMV 174 + +++ + RY++P + +F + P+ + + L++ +V Sbjct: 245 RAEEWINAFNYRYDLPDVDDSFAVTTDIVEHPIHSDLHLVRIGTQAPEFVDNTPLNVTLV 304 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM + R + + R +V FS+ + Sbjct: 305 LDASGSMREG-----------NRVEIARAAAEAIRRGLRDQDRLAVVHFSTDVFAVVDHR 353 Query: 235 W-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + I L ++T GL + D + A YII ++DG Sbjct: 354 RPNARQLSNSIGSLRPRNSTNVQAGLNRGVE-LADEVRRERPNAYN------YIILMSDG 406 Query: 294 ENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEA-ADQFLKNCASPDR--FYSVQNSR 349 + D L + + IGV + D L+ A + + + Sbjct: 407 VANVDATDPFAILASAEDRDSSNPLRLITIGVGIQNYNDVLLEQLAQYGNGWYRYLSDVA 466 Query: 350 KLHDAFLR 357 + F R Sbjct: 467 QARSTFSR 474 >gi|195999034|ref|XP_002109385.1| hypothetical protein TRIADDRAFT_53363 [Trichoplax adhaerens] gi|190587509|gb|EDV27551.1| hypothetical protein TRIADDRAFT_53363 [Trichoplax adhaerens] Length = 356 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 58/180 (32%), Gaps = 34/180 (18%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + ++ SSK +++VLD S SM+ D+ + +L Sbjct: 198 NYTTETKFKQWYVNAASPSSKR-----LVLVLDRSGSMSG------DRFLKVKEAATAVL 246 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH---------IQEKINR--LIFGST 252 D + + G++ F +I +++ IN + Sbjct: 247 DSLGPNDE------IGVIAFDDEIRIHGGCKVTTVSPATPQSIIFLKDFINNKIQPEFGS 300 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T P L++A++ + I+FLTDG P + + NEA Sbjct: 301 TGYVPALKHAFDMLSTNMTSKAKTKTN------LIVFLTDGHPDEPESQILDVIKNRNEA 354 >gi|301604824|ref|XP_002932067.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2881 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 54/382 (14%), Positives = 127/382 (33%), Gaps = 55/382 (14%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 +IR+F + I V ++ E +FF+K L +LL ++ Sbjct: 1234 HIRDFILEVVQNFQIGPN---KVRVGLVQFSNEPVSEFFLKTNLQK---QALLNNIRRLR 1287 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN----------GFAQDI--------- 105 + + ++ KN + + + E G ++D Sbjct: 1288 LKGGAPLNIGKAVEYVA---KNQFVKSAGSRIEEGVPQYLVLLTGGKSEDDVSRSARLLH 1344 Query: 106 NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK- 164 N +S +++ D+ + +R + I + ++++ Sbjct: 1345 NAKVQSVAVASGTTDRKDIETIVTDTR-LIFNIKEFRELPSVERKIYRSFEGEIEVTPTP 1403 Query: 165 ------SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 S D++ ++D S+++ D + + ++D + D + ++ Sbjct: 1404 AITTTISKKEADIVFLVDGSINLGR------DNFKEVLQFVSGIVDAV---FDEEDAIQI 1454 Query: 219 GLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L ++S + F L I + + + + ++ G A + D E Sbjct: 1455 ALAQYNSDVTDEFFLKDFTDRDQIMDAVTKAEYKGGRVASLG--AAIRHLQDKHFVKEAG 1512 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 ++ + +T G++ +L +G V+AIGV A D+ K Sbjct: 1513 SRVSSGVPQIAFVITGGKSVDDGQSAAMALS------NKGVKVFAIGVGAIDGDEVAKIA 1566 Query: 337 ASPDRFYSVQNSRKLHDAFLRI 358 + + V N ++L + +I Sbjct: 1567 SDAPSAFRVPNVQELSELNEQI 1588 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 56/146 (38%), Gaps = 10/146 (6%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV ++ G D + I E++ + P+ VR GLV FS++ V F L Sbjct: 1216 DVVFLIDGSSKTGQDGMAHIRDFILEVVQNFQIGPNK---VRVGLVQFSNEPVSEFFLKT 1272 Query: 236 GVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 +Q + I RL + + A + + ++ + +Y++ LT G Sbjct: 1273 NLQKQALLNNIRRLRLKGG--APLNIGKAVEYVAKNQFVKSAGSRIEEGVPQYLVLLTGG 1330 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIV 319 ++ D S + AK + V Sbjct: 1331 KSED---DVSRSARLLHNAKVQSVAV 1353 >gi|289596620|ref|YP_003483316.1| von Willebrand factor type A [Aciduliprofundum boonei T469] gi|289534407|gb|ADD08754.1| von Willebrand factor type A [Aciduliprofundum boonei T469] Length = 2166 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 55/161 (34%), Gaps = 27/161 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHF-----------GPGMDKLGVATRSIREMLDIIKSIPDV 212 +D++ V+D S SMN G ++ VA ++ +D +K + Sbjct: 1246 NKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRIDVAIQAA---IDAVKELGPQ 1302 Query: 213 NNVVRSGLVTF--SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + R + TF +S + + + I + + T L +A + Sbjct: 1303 D---RVAVFTFDGNSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLYDTLSWAVYYM 1359 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + +D + I+ LTDG ++S N Sbjct: 1360 ----DTQSADNPDREDATRGILVLTDGLSNSDNYGPNNVRN 1396 >gi|254167891|ref|ZP_04874740.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623182|gb|EDY35748.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 2164 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 55/161 (34%), Gaps = 27/161 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHF-----------GPGMDKLGVATRSIREMLDIIKSIPDV 212 +D++ V+D S SMN G ++ VA ++ +D +K + Sbjct: 1244 NKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRIDVAIQAA---IDAVKELGPQ 1300 Query: 213 NNVVRSGLVTF--SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + R + TF +S + + + I + + T L +A + Sbjct: 1301 D---RVAVFTFDGNSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLYDTLSWAVYYM 1357 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + +D + I+ LTDG ++S N Sbjct: 1358 ----DTQSADNPDREDATRGILVLTDGLSNSDNYGPNNVRN 1394 >gi|158259621|dbj|BAF85769.1| unnamed protein product [Homo sapiens] Length = 1105 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 79/214 (36%), Gaps = 24/214 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 VK ++ D+++++D S S+ + + ++ + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF------NFRLVRHFLENLVTAFDV---GSEKT 195 Query: 217 RSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R GL +S + L A+ + + E + L + T A N IF+ K E Sbjct: 196 RIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKPE 252 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ K I +TDG++ I +L + G ++AIGV+ ++ + Sbjct: 253 AGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQE 304 Query: 335 NCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + PD Y+V +H + + + + Sbjct: 305 IASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|156399648|ref|XP_001638613.1| predicted protein [Nematostella vectensis] gi|156225735|gb|EDO46550.1| predicted protein [Nematostella vectensis] Length = 1841 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 42/237 (17%), Positives = 87/237 (36%), Gaps = 22/237 (9%) Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 +N+ R +I Q +++ + E C++++ +S ++ Sbjct: 1586 GDNVVRKELETIASSPQSDHVIMTSYNMLEATLEQIKQKVCSSAAKKENTHRASNRMPHD 1645 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D D+ ++D S ++ G G ++ + K P + G V + Sbjct: 1646 CDQAADIGFLIDGSRTIE---VMGKGNFGTIIEFVKNV---TKMFPLSRDAFSVGAVVYG 1699 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ PL + + + ++++ + T TK+ LEYA +F ++ + Sbjct: 1700 SEPSLEIPLGSHNTSKGLLKALSKIKYPGTATKTGKALEYARLNLFGSRNARRKVP---- 1755 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 K ++ LT G + + E L R G V ++G+ L N AS Sbjct: 1756 ---KILVVLTKGSSRDDIREASEDLH------RDGVHVVSVGMGPVNDRMELANMAS 1803 >gi|307245403|ref|ZP_07527491.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853744|gb|EFM85961.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 538 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 78/247 (31%), Gaps = 21/247 (8%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G +++ +L I ++ + +E++ +A+L L+ ++L + N Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 67 ENGNNGKKQKNDFSYR-----------IIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 N+ K + + T + L + + T + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKTVNN 129 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS----DIGLDM 171 ++ + P + + K + +I +D+ Sbjct: 130 TSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIPIDL 189 Query: 172 MMVLDVSLSMNDHF------GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 M+V D+S SM D G K+ + ++E+ D + N R G+ F+ Sbjct: 190 MVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITAFAM 249 Query: 226 KIVQTFP 232 Sbjct: 250 GAEHPKE 256 >gi|220912875|ref|YP_002488184.1| von Willebrand factor A [Arthrobacter chlorophenolicus A6] gi|219859753|gb|ACL40095.1| von Willebrand factor type A [Arthrobacter chlorophenolicus A6] Length = 317 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 24/201 (11%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREM 202 + + P L + + + L++ V+D + SM + +G ++ +++ Sbjct: 21 FLVAALCRPALPGGTAQ---AATTDLNVFFVVDTTTSMVAEDYGTSAPRMDGVR---QDI 74 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 +DI K +P R L+TF +K PL +Q + L T + A Sbjct: 75 MDIAKELP----GARFSLITFDTKAFVRMPLTTDTLALQTMVTVLEPQVTAFAKGSSITA 130 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIV 319 ++ + + + + +L DGE +S + + A Sbjct: 131 ARQVLSERLDAARESHPERP--RLVYYLGDGEQTSGKEPERFNLDGSLVAGGA------- 181 Query: 320 YAIGVQAEAADQFLKNCASPD 340 +G A + +N P Sbjct: 182 -VLGYGTAAGGRMKENSGDPG 201 >gi|74001590|ref|XP_848825.1| PREDICTED: similar to Collagen alpha 1(VI) chain precursor [Canis familiaris] Length = 206 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 44/119 (36%), Gaps = 13/119 (10%) Query: 225 SKIVQTFPLA---WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 ++ PL ++ K++ + FG T + ++ ++ H Sbjct: 96 DEVEIISPLTPMPADRDALKAKVDAVKYFGKGTYTDCAIKKGLEELLVGG--------SH 147 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 KY+I +TDG + L NEAK G V+++ + + + L A+ Sbjct: 148 LKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIAT 206 >gi|23465165|ref|NP_695768.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] gi|23325787|gb|AAN24404.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] Length = 383 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 60/185 (32%), Gaps = 15/185 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 V+D S SM+ G+ K + + KS + + + L+ F ++ + Sbjct: 207 WVVDYSGSMSGEGKNGVVK---GLNAALDPDQAKKSYIEPASGDVNILIPFETEAHRPVK 263 Query: 233 LA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + T GL A +++ E ++ I+ +T Sbjct: 264 ATGTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 315 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 DG N D+++ +++ R +++I Q + S L Sbjct: 316 DG---RSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDL 372 Query: 352 HDAFL 356 F Sbjct: 373 AAVFR 377 >gi|327278400|ref|XP_003223950.1| PREDICTED: integrin alpha-M-like [Anolis carolinensis] Length = 1160 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 71/212 (33%), Gaps = 36/212 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+++++D S S+ ++ I + + ++ R Sbjct: 150 KQPTDIVLLIDGSGSIQ---PNQFSEMKTFISMIMKRFQNTNTQFALSQYAR-------W 199 Query: 226 KIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + L + + + +L + T + ++ + E ++ Sbjct: 200 YREEFTFLDFQRVRNPDELLRPVTQLRGATLTATY------IQRVVREQFVTEKGSRPGA 253 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFLKNCA 337 K +I +TDGE S D + EA+R G I +AIG A Q L + A Sbjct: 254 --SKVLIVITDGEKSG---DPLQYSDVIPEAERAGIIRFAIGVGKAFSGGTAKQELISIA 308 Query: 338 S---PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 S D + V N DA I ++ + Sbjct: 309 SQPEDDHVFPVDN----FDALKDIQNKLQDKI 336 >gi|324983881|gb|ADY68777.1| complement factor B/C2 [Oncorhynchus mykiss] Length = 749 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 81/222 (36%), Gaps = 36/222 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 I + L + + LD S S++ +++ + + IP R G Sbjct: 244 SIQMRQGGKLHIYIALDNSGSISRD---NFTVAKNCVKALINQISFFEVIP------RYG 294 Query: 220 LVTFSSKIVQTFPLAWGVQH---IQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 ++TF+S++ + + H + + +N + G+ + + + K+ KE+ Sbjct: 295 ILTFASEVNEIVDIXSPSSHSDMVLQNLNNDTYGKLNGTGSNLSRVFKKILEKMAVFKER 354 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA---------IVYAIG 323 E +D ++ II TDG + + N G +Y G Sbjct: 355 NEL-----EDTQQAIIMFTDGNTNMGGSAEPTVIRIRNLM--EGIHGKDWQKYLDIYVFG 407 Query: 324 VQAEAADQ----FLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 V A+ ++ + +S FY +++ KL F I E Sbjct: 408 VGAKMNEEEINKLVSKKSSEKHFYKLRDDTKLTSLFKMIIDE 449 >gi|154302354|ref|XP_001551587.1| hypothetical protein BC1G_09961 [Botryotinia fuckeliana B05.10] gi|150855449|gb|EDN30641.1| hypothetical protein BC1G_09961 [Botryotinia fuckeliana B05.10] Length = 254 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 73/204 (35%), Gaps = 20/204 (9%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGP------GMDKLGVATRSIREMLDIIKSIPDVNNV 215 + + +++ LD S+SM G + L + + +++ + + Sbjct: 2 NPNNGPDEAIVICLDKSISMRALLGNNWIGNGQVSTLNRFDEAKQVFRNVVSRLSAYHLN 61 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 V GLVT SK Q ++ + + K+ N T L AY+ + +++ Sbjct: 62 VHVGLVTLGSKAEQEAHISPIGKEFRNKLENSQATDHRTSLFDPLCVAYSMLISHQKRH- 120 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-- 332 K II LTDGE++S +++ C ++ +I + + Sbjct: 121 ----PESKMKLRIIALTDGEDNSSVFTAEDT---CKMLYYDDIVLDSIVIGGSSYGDLFK 173 Query: 333 -LKNCASPDRFYSVQNSRKLHDAF 355 LK + + + L F Sbjct: 174 ILKY--TGGYVFKPTSQMLLFQTF 195 >gi|126459566|ref|YP_001055844.1| von Willebrand factor, type A [Pyrobaculum calidifontis JCM 11548] gi|126249287|gb|ABO08378.1| von Willebrand factor, type A [Pyrobaculum calidifontis JCM 11548] Length = 356 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 61/183 (33%), Gaps = 47/183 (25%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + + LDVS SM ++ + KL VA ++ L + S+ R L+ F+ Sbjct: 200 VYIALDVSGSMKEYMWGDV-KLRVAKNAVARYLRQMASLRG-----RVSLLLFNVDADFM 253 Query: 231 FPLAWGVQHIQEKINRLI-------FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 W + + + ++ G T+ LE Y+ Sbjct: 254 ----WTPYEVHKYLREMLEILEYVYAGGGTELASALEVLYSYGVREA------------- 296 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDR 341 + +TDG D +++ + K ++A+ V LK A+ + Sbjct: 297 ----VLITDGR----TADVEKTWSLVKKFK----RLHAVAV---EKSDLLKQIAKATGGK 341 Query: 342 FYS 344 + Sbjct: 342 YQE 344 >gi|6671158|gb|AAF23117.1|AF191035_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 881 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/167 (20%), Positives = 68/167 (40%), Gaps = 11/167 (6%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V ++ K D G D+M +LDVS M+ D F DK+ ++ + + Sbjct: 205 TIDVTVTVTPKEIDEGADVMALLDVSKKMSEDDFNNAKDKIKKLVTTLTSKSTDSQPNHN 264 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 N VR L+TF K+ + L +++ K+ + + G++ I A+E Sbjct: 265 ARNSVR--LMTFYRKVNEPIELT--AENVDAKLKEVWEQAKKDWDWGVDL-QGAIHKARE 319 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + + +++I+ + GE + N + N K+ Sbjct: 320 IFNKEKEKNSGKRQHIVLFSQGEATFSYDINDKD----NTVKKNRIT 362 >gi|297618211|ref|YP_003703370.1| von Willebrand factor A [Syntrophothermus lipocalidus DSM 12680] gi|297146048|gb|ADI02805.1| von Willebrand factor type A [Syntrophothermus lipocalidus DSM 12680] Length = 587 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 63/179 (35%), Gaps = 31/179 (17%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 KS + +D+ +++D S SM DK A +L + + +VT Sbjct: 401 RKSYVPIDVCLLIDASASMAG------DKRQAACYLAEHLLLTGRE--------KVAVVT 446 Query: 223 FSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F P + + + + R+ G T G+ + I ++ H Sbjct: 447 FQEMSATVAVPFTRNQKVLSKGLARIRPGGLTPMADGIFTSVELIRSSR---VHNP---- 499 Query: 282 DYKKYIIFLTDGENSSPNID---NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +I +TDG + P ++L + G IG+ E+ FL++ A Sbjct: 500 ----LLILITDGMPNFPLWSFDAKSDALEAARKVAEAGIKFVCIGL--ESNKVFLRDVA 552 >gi|146304149|ref|YP_001191465.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145702399|gb|ABP95541.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 363 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 74/201 (36%), Gaps = 36/201 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++++D S SM KL +A R ++D ++ R L+ F+ K+ Sbjct: 20 VILIDRSYSMKGE------KLEMAKEGARLLVD------NLPKDSRFSLLAFNEKVSIIK 67 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + +++ L GS T L+ A+N E Y+I LT Sbjct: 68 EHE-HPSEMGKELESLKVGSGTAMYKALQEAFNLARKYGEPT------------YVILLT 114 Query: 292 DGENSS----PNIDNKESLFYCNEAKR-----RGAIVYAIGVQAEAADQFLKNCASPDR- 341 DG S P + K L C + + + G+ + +++ L + R Sbjct: 115 DGVPSDMGCMPGLSRKFDLNRCLPVYQGLSVPENVQIISFGIGDDYSEEILTEVSEKGRG 174 Query: 342 -FYSVQNSRKLHDAFLRIGKE 361 FY V + ++ + ++ K Sbjct: 175 FFYHVTDPAQIPEKMPKLVKS 195 >gi|4154309|gb|AAD04919.1| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 71/187 (37%), Gaps = 28/187 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ + LD++++LD S S+ + D++ T++ +I + + Sbjct: 1679 TLSPTPDCSQPLDVVLLLDGSSSIPASY---FDEMKSFTKAFISRANI------GPRLTQ 1729 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEK 272 ++ + S P W V + + + L G ++ L +A + Sbjct: 1730 VSVLQYGSITTIDVP--WNVAYEKVHLLSLVDLMQQEGGPSQIGDALSFAVRYVTS---- 1783 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H A+ K +I +TD ++D+ ++ A+ V+ IG+ ++ Sbjct: 1784 EVHGARPGAS-KAVVILVTD-----VSVDSVDA--AAEAARSNRVTVFPIGIGDRYSEAQ 1835 Query: 333 LKNCASP 339 L + A P Sbjct: 1836 LSSLAGP 1842 >gi|1478046|gb|AAB05549.1| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 71/187 (37%), Gaps = 28/187 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ + LD++++LD S S+ + D++ T++ +I + + Sbjct: 1679 TLSPTPDCSQPLDVVLLLDGSSSIPASY---FDEMKSFTKAFISRANI------GPRLTQ 1729 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEK 272 ++ + S P W V + + + L G ++ L +A + Sbjct: 1730 VSVLQYGSITTIDVP--WNVAYEKVHLLSLVDLMQQEGGPSQIGDALSFAVRYVTS---- 1783 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H A+ K +I +TD ++D+ ++ A+ V+ IG+ ++ Sbjct: 1784 EVHGARPGAS-KAVVILVTD-----VSVDSVDA--AAEAARSNRVTVFPIGIGDRYSEAQ 1835 Query: 333 LKNCASP 339 L + A P Sbjct: 1836 LSSLAGP 1842 >gi|50950127|ref|NP_001002932.1| von Willebrand factor precursor [Canis lupus familiaris] gi|40786760|gb|AAB93766.2| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 71/187 (37%), Gaps = 28/187 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ + LD++++LD S S+ + D++ T++ +I + + Sbjct: 1679 TLSPTPDCSQPLDVVLLLDGSSSIPASY---FDEMKSFTKAFISRANI------GPRLTQ 1729 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEK 272 ++ + S P W V + + + L G ++ L +A + Sbjct: 1730 VSVLQYGSITTIDVP--WNVAYEKVHLLSLVDLMQQEGGPSQIGDALSFAVRYVTS---- 1783 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H A+ K +I +TD ++D+ ++ A+ V+ IG+ ++ Sbjct: 1784 EVHGARPGAS-KAVVILVTD-----VSVDSVDA--AAEAARSNRVTVFPIGIGDRYSEAQ 1835 Query: 333 LKNCASP 339 L + A P Sbjct: 1836 LSSLAGP 1842 >gi|12644030|sp|Q28295|VWF_CANFA RecName: Full=von Willebrand factor; Short=vWF; Flags: Precursor Length = 2813 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 71/187 (37%), Gaps = 28/187 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ + LD++++LD S S+ + D++ T++ +I + + Sbjct: 1679 TLSPTPDCSQPLDVVLLLDGSSSIPASY---FDEMKSFTKAFISRANI------GPRLTQ 1729 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEK 272 ++ + S P W V + + + L G ++ L +A + Sbjct: 1730 VSVLQYGSITTIDVP--WNVAYEKVHLLSLVDLMQQEGGPSQIGDALSFAVRYVTS---- 1783 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H A+ K +I +TD ++D+ ++ A+ V+ IG+ ++ Sbjct: 1784 EVHGARPGAS-KAVVILVTD-----VSVDSVDA--AAEAARSNRVTVFPIGIGDRYSEAQ 1835 Query: 333 LKNCASP 339 L + A P Sbjct: 1836 LSSLAGP 1842 >gi|156933957|ref|YP_001437873.1| hypothetical protein ESA_01783 [Cronobacter sakazakii ATCC BAA-894] gi|156532211|gb|ABU77037.1| hypothetical protein ESA_01783 [Cronobacter sakazakii ATCC BAA-894] Length = 346 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 53/168 (31%), Gaps = 14/168 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L T ++ ++ ++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESMIGE------NLKKMTDGLQMIVGDLRKDPHALETAWVSVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + RL G T L +I K H AKG + Sbjct: 57 RTIVPL---HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 LTDG P D + + + A+G+ A L+ Sbjct: 112 YLLTDG---RPTDDTTAEVKRWKDHYASKVNLIAVGLGPSADLNILRQ 156 >gi|38347889|ref|NP_941138.1| putative tellurium resistance protein [Serratia marcescens] gi|190410231|ref|YP_001965732.1| terY3 [Klebsiella pneumoniae] gi|226807624|ref|YP_002791318.1| TerY3 [Enterobacter cloacae] gi|226809934|ref|YP_002791628.1| TerY3 [Enterobacter cloacae] gi|38259366|emb|CAE51591.1| putative tellurium resistance protein [Serratia marcescens] gi|146151024|gb|ABQ02790.1| terY3 [Klebsiella pneumoniae] gi|226425849|gb|ACO53942.1| TerY3 [Enterobacter cloacae] gi|226426160|gb|ACO54252.1| TerY3 [Enterobacter cloacae] Length = 346 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 53/168 (31%), Gaps = 14/168 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L T ++ ++ ++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESMIGE------NLKKMTDGLQMIVGDLRKDPHALETAWVSVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + RL G T L +I K H AKG + Sbjct: 57 RTIVPL---HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 LTDG P D + + + A+G+ A L+ Sbjct: 112 YLLTDG---RPTDDTTAEVKRWKDHYASKVNLIAVGLGPSADLNILRQ 156 >gi|326920703|ref|XP_003206608.1| PREDICTED: cochlin-like, partial [Meleagris gallopavo] Length = 760 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 69/209 (33%), Gaps = 23/209 (11%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y+MP F T L + S +++ ++D S S+ + M + Sbjct: 540 SYQMPSWFGTTK-YVKPLVQKLCSHEQMLCSKTCYNSVNIGFLIDGSSSVGESNFRLMLE 598 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FG 250 E+ DI I V T+ + +F + + I + Sbjct: 599 FISNVAKAFEISDIGSKIATVQ-------FTYDQRTEFSFTDYTTKEKVLSAIRNIRYMS 651 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + + + +F + + K +++ LTDG+ + D+ Sbjct: 652 GGTATGDAISFTTRNVFGPVKDGAN--------KNFLVILTDGQ----SYDDVRGPAVA- 698 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A++ G V+++GV D + P Sbjct: 699 -AQKAGITVFSVGVAWAPLDDLKDMASEP 726 >gi|308472925|ref|XP_003098689.1| hypothetical protein CRE_04171 [Caenorhabditis remanei] gi|308268289|gb|EFP12242.1| hypothetical protein CRE_04171 [Caenorhabditis remanei] Length = 396 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 66/185 (35%), Gaps = 12/185 (6%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S++ LD++ V+D S M + G++ + ++ I S R GLVT++ Sbjct: 34 SNLWLDVVAVVDNSQGMTND---GLNNVASDIFTVFSSGTRIGSDSSEPRTTRLGLVTYN 90 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH--DD 282 S Q L Q I + + +T Y + A++ + + + Sbjct: 91 SNATQQADL-NKYQSIDDAFYGIFDSLSTVVYTADSYLATGLILAEKMFNEQSVNNVRSN 149 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNCASP 339 YK+ +I N +D L N K + + + + L ASP Sbjct: 150 YKRVVIVYASEYNEDGELDP---LPVANRLKLSNVKIITVAYEQPGSVGLENGLSQIASP 206 Query: 340 DRFYS 344 +S Sbjct: 207 GFSFS 211 >gi|297683547|ref|XP_002819437.1| PREDICTED: collagen alpha-1(XIV) chain-like [Pongo abelii] Length = 1761 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 82/215 (38%), Gaps = 26/215 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 VK ++ D+++++D S S+ R +R L+ + + + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFDVGSEK 194 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GL +S + L A+ + + E + L + T A N IF+ K Sbjct: 195 TRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKP 251 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E ++ K I +TDG++ I +L + G ++AIGV+ ++ Sbjct: 252 EAGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQ 303 Query: 334 KNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + PD Y+V +H + + + + Sbjct: 304 EIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|297567411|ref|YP_003686383.1| hypothetical protein Mesil_3037 [Meiothermus silvanus DSM 9946] gi|296851860|gb|ADH64875.1| conserved hypothetical protein [Meiothermus silvanus DSM 9946] Length = 351 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 77/225 (34%), Gaps = 39/225 (17%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 V +++ LD S SM P ++L A R+I + + ++ P Sbjct: 75 PVAAPPLPTNKAAVVIALDASKSMLAGDLNP--NRLEAA-RAIAK--EFVRLAP---ATT 126 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE----- 271 + GL+TFS +QE ++ + T + + KE Sbjct: 127 QIGLITFSDSASVVVAPTTDRAVLQEALDNVKPVQNTSLPSAIVTGVRLLPGRKEVQPPK 186 Query: 272 -----------------KLEHIAKGHDDYKKYIIFLTDGE---NSSPNIDNKESLFYCNE 311 + + + ++ ++DG NS+P + N+ +L + Sbjct: 187 ELQPQNPQNPQPQNPLVQPDTPPIPREFPPGSLLVISDGATNVNSNPRLPNQTALEAAAK 246 Query: 312 -AKRRGAIVYAIGVQAEAADQFLKNCASPDRF--YSVQNSRKLHD 353 A+ G +YA V E + D F + ++ ++L + Sbjct: 247 FAQDNGVKIYAFAVGKEGGA--VVRIEGQDYFVPFEPRSLQQLAE 289 >gi|255583954|ref|XP_002532724.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] gi|223527532|gb|EEF29655.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] Length = 752 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 65/225 (28%), Gaps = 45/225 (20%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S+ D++ ++D+S SM L A ++ L + S N Sbjct: 313 PGNKQSRKAFRKDVIFIIDISGSMKG------GPLENAKNALMSSLSKLNSEDSFN---- 362 Query: 218 SGLVTFSSKIVQTFPL-----AWGVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKE 271 ++ F+ + L + +N L G T L+ A + + Sbjct: 363 --IIAFNDETYLFSSLMEPATKEALSKASLWLNDNLTAGGGTNIMVPLKQAMKLLAQTTD 420 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-------GAIVYAIGV 324 + I +TDG ++ CN K + + G+ Sbjct: 421 SIP-----------LIFLITDG-------AVQDEREICNFVKGSLTSGGPISPRICSFGI 462 Query: 325 QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 A FL+ A F S ++ + R+ + Sbjct: 463 GAYCNHYFLQMLAQIGRGYFDSAYDADSVDFRMQRLFTTASSVIL 507 >gi|114621482|ref|XP_001144037.1| PREDICTED: collagen, type XIV, alpha 1 isoform 3 [Pan troglodytes] Length = 1780 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 82/215 (38%), Gaps = 26/215 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 VK ++ D+++++D S S+ R +R L+ + + + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFDVGSEK 194 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GL +S + L A+ + + E + L + T A N IF+ K Sbjct: 195 TRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKP 251 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E ++ K I +TDG++ I +L + G ++AIGV+ ++ Sbjct: 252 EAGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQ 303 Query: 334 KNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + PD Y+V +H + + + + Sbjct: 304 EIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 78/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F + L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDENFNKIISFLYSTVGALHKI---------GTDGTQVAMVQFTDDPR 1082 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 1083 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 >gi|114621484|ref|XP_001143906.1| PREDICTED: collagen, type XIV, alpha 1 isoform 1 [Pan troglodytes] Length = 1800 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 82/215 (38%), Gaps = 26/215 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 VK ++ D+++++D S S+ R +R L+ + + + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFDVGSEK 194 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GL +S + L A+ + + E + L + T A N IF+ K Sbjct: 195 TRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKP 251 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E ++ K I +TDG++ I +L + G ++AIGV+ ++ Sbjct: 252 EAGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQ 303 Query: 334 KNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + PD Y+V +H + + + + Sbjct: 304 EIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 78/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F + L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDENFNKIISFLYSTVGALHKI---------GTDGTQVAMVQFTDDPR 1082 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 1083 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 >gi|114621480|ref|XP_519927.2| PREDICTED: collagen alpha-1(XIV) chain isoform 4 [Pan troglodytes] Length = 1796 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 82/215 (38%), Gaps = 26/215 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNV 215 VK ++ D+++++D S S+ R +R L+ + + + Sbjct: 145 EEVKFVCQTPAIADIVILVDGSWSIGRF----------NFRLVRLFLENLVTAFDVGSEK 194 Query: 216 VRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R GL +S + L A+ + + E + L + T A N IF+ K Sbjct: 195 TRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYIFENSFKP 251 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 E ++ K I +TDG++ I +L + G ++AIGV+ ++ Sbjct: 252 EAGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKNADVNELQ 303 Query: 334 KNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + PD Y+V +H + + + + Sbjct: 304 EIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 78/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F + L ++ ++ + + +V F+ Sbjct: 1032 DLVFMVDGSWSIGDENFNKIISFLYSTVGALHKI---------GTDGTQVAMVQFTDDPR 1082 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 1083 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 1136 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 1137 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 1190 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1191 FVDD----FDAFKKIEDEL 1205 >gi|116619318|ref|YP_821474.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116222480|gb|ABJ81189.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 296 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 77/227 (33%), Gaps = 41/227 (18%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 ++ + D L + +VLD S SM + L + + Sbjct: 55 EQPIRSFAHDDAPLSVAIVLDSSGSMGRKWNRACAMLARLCEQLGPEDEFF--------- 105 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LVT + +Q ++ T + A + K H Sbjct: 106 ----LVTVQQRARLLLDYTSNCGTMQNRLVMAKPHGMTALLDAIPLAVEHL----RKAAH 157 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY--AIGVQAE------ 327 + I+ ++DG ++ + + +A+ A VY +G++AE Sbjct: 158 -------PRHAILIISDGGENASRVRLHDVRR---QAREANAPVYAATLGLEAEFDQGPY 207 Query: 328 ----AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + L+ A + R +S+Q SR++ +A I +E+ Q ++ Sbjct: 208 LDARRGPELLREIAQITGGRAFSIQESRRIEEAAAGIARELHDQYVI 254 >gi|303251581|ref|ZP_07337755.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252106|ref|ZP_07534005.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302649579|gb|EFL79761.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306860406|gb|EFM92420.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 538 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 78/247 (31%), Gaps = 21/247 (8%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G +++ +L I ++ + +E++ +A+L L+ ++L + N Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 67 ENGNNGKKQKNDFSYR-----------IIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 N+ K + + T + L + + T + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKTVNN 129 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS----DIGLDM 171 ++ + P + + K + +I +D+ Sbjct: 130 TSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIPIDL 189 Query: 172 MMVLDVSLSMNDHF------GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 M+V D+S SM D G K+ + ++E+ D + N R G+ F+ Sbjct: 190 MVVADLSGSMKDGIKGEKLNGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITAFAM 249 Query: 226 KIVQTFP 232 Sbjct: 250 GAEHPKE 256 >gi|148694465|gb|EDL26412.1| procollagen, type XII, alpha 1, isoform CRA_a [Mus musculus] Length = 1722 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 69/198 (34%), Gaps = 24/198 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ + P V+ L +S Sbjct: 30 TRAEADIVLLVDGSWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYS 80 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I K + + Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--R 135 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ + PD Sbjct: 136 ARKIGVLITDGKSQDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 341 RFYSVQNSRKLHDAFLRI 358 Y+V + L + Sbjct: 190 HAYNVADFESLSKIVDDL 207 >gi|126662673|ref|ZP_01733672.1| hypothetical protein FBBAL38_04940 [Flavobacteria bacterium BAL38] gi|126626052|gb|EAZ96741.1| hypothetical protein FBBAL38_04940 [Flavobacteria bacterium BAL38] Length = 288 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 58/304 (19%), Positives = 102/304 (33%), Gaps = 56/304 (18%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 K I + + E++ + I + E TS + S V +Y+ W Sbjct: 4 KEILKKVRKIEIKTRRLSDHIFSGEYHTSFK------GRGMTFSEVRQYQYGDDVRAIDW 57 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + + L MM+V+D+S S + FG + R + + Sbjct: 58 NVTARY-----NEPFVKVFEEERELTMMLVVDISGS--ESFGTKNQQ----KRDMVTEIA 106 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + N + GL+ FS +I P G H+ I LI + L A Sbjct: 107 ATLAFSATQNNDKIGLLLFSDQIELFIPPKKGKSHVLRIIRELIEFQPKSNKTDLAQALK 166 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR---RGAIVY- 320 + KK I+F+ + D +++L AK+ G VY Sbjct: 167 YL------------SGVLKKKAIVFM---ISDFMVKDYQQTLKIA--AKKHDVTGIRVYD 209 Query: 321 -------AIGV----QAEAADQFLKNCASP------DRFYSVQNSRKLHDAFLRIGKEMV 363 IG+ AE + L + S +++Y +N D F + G + Sbjct: 210 QREESLPNIGIVNMMDAETGETLLVDTNSKQVRTNYEKYYQ-ENVTYFKDIFSKCGAGTI 268 Query: 364 KQRI 367 R+ Sbjct: 269 SSRV 272 >gi|83643820|ref|YP_432255.1| von Willebrand factor type A domain-containing protein [Hahella chejuensis KCTC 2396] gi|83631863|gb|ABC27830.1| protein containing VWFA domain [Hahella chejuensis KCTC 2396] Length = 407 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/242 (16%), Positives = 79/242 (32%), Gaps = 33/242 (13%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 N + P+ TS + S+ ++++ LD + SM+ + A +I + Sbjct: 33 AVNQTTHPITTTSPIATVSEKRAKIEVVFALDTTSSMS-------GLIQAAKENIWSIAS 85 Query: 205 IIKSIPDVNNVVRSGLVTFSSK----IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 + S ++ GLV F + I + L+ + + + + Sbjct: 86 TMASAQPAPE-IKMGLVAFRDRGDSYITRVTDLSPDLDSMYATLMDYQAEGGGDGPESVN 144 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR---RGA 317 A H D Y + G+ + P++D + Y K +G Sbjct: 145 QALF-------DAVHKISWSQDKDSYRVIFLVGD-APPHMDYQNEQQYPQTLKDALSKGI 196 Query: 318 IVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRKLHDA-------FLRIGKEMVKQRI 367 +V I + + A + F+ V+ S + + KEM K R+ Sbjct: 197 VVNTIQAGDDPFTQTEWRRIAQLNQGSFFQVEQSGQAVAVATPFDERLASLSKEMDKTRM 256 Query: 368 LY 369 Y Sbjct: 257 FY 258 >gi|224922718|dbj|BAH28825.1| NorD protein [Ochrobactrum anthropi] Length = 633 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 78/227 (34%), Gaps = 32/227 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + L +T V +S +D +D VLDV + L + + Sbjct: 433 VHLMSRPKANDLAVTILVDVSLSTDAWIDNRRVLDVE-------KEALLVLANGIAACGD 485 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 I + VR + V+ F +G ++ +I L G T+ + + Sbjct: 486 RCSIQTFTSRRRSWVRV-------ETVKDFDETFG-SAVEHRIAALKPGFYTRMGAAIRH 537 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A K+ + + KK ++ LTDG+ + ++S EA+ G Sbjct: 538 ATAKLAEQP-----------NRKKLLLVLTDGKPNDVDHYEGRFALEDSRRAVAEARTTG 586 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 V+A+ V EA +L + V KL A I + M Sbjct: 587 VNVFAVTVDREAN-AYLPTLFGRRNYALVAKLSKLPVALPAIYRMMT 632 >gi|327278404|ref|XP_003223952.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Anolis carolinensis] Length = 1162 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 76/190 (40%), Gaps = 31/190 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+N +++ + L + R L+ FS + ++ Sbjct: 147 DIVFLIDGSGSINK---ADFERMKQFVSETTKRLSGRDT--------RFALIQFSDRYLE 195 Query: 230 TFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 F + + + I+ G T++ + ++F +++ + A + +I Sbjct: 196 HFNFNSEDPEQLVLHIH--QVGGWTETATAIRRVVRELFTSQKGSRNGAT------RILI 247 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQFLKNCAS---PDR 341 +TDG + D + EAK+ G I YAIG + A + L + AS + Sbjct: 248 VITDGVKT----DRLQYSQVIPEAKQAGIIRYAIGVGDAFSSPDAKRELDDIASEPKAEH 303 Query: 342 FYSVQNSRKL 351 ++V N L Sbjct: 304 IFTVYNFNAL 313 >gi|241760758|ref|ZP_04758849.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] gi|241318655|gb|EER55207.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] Length = 1065 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 32/78 (41%), Gaps = 5/78 (6%) Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQFLKNCAS-PDRFYS 344 F TDG +++ N + + +K+ + +G ++L+ AS P+ +Y Sbjct: 260 FKTDGTDAAGKSWNGDPKDPADYSKQL-VQTFTVGFGQGITPTGKRYLQLAASRPEYYYE 318 Query: 345 VQNSRKLHDAFLRIGKEM 362 L F I +++ Sbjct: 319 ADKPESLSKVFNDIVEQI 336 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 68/173 (39%), Gaps = 23/173 (13%) Query: 133 YEMPFIFCTFPWCANSSH-----APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 Y + AN++ PL + + KI + ++ ++M+ +D S SM+++ G Sbjct: 17 YNIASALALMSGVANAAAGEFAKVPLYLQTESKIDKQPEVKHNIMLFIDDSGSMDEYIGS 76 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV----QHIQEK 243 ++ + ++ ++L+ S + L T ++ P A G I + Sbjct: 77 K-TRIQITKDALSKVLEEHGSTFNW------ALQTLNNSGGSDTPDAKGFTIPATEIANR 129 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + R+ T +T ++I + K Y++ ++DG+ + Sbjct: 130 VKRIKAKGGTPTTSRYFELVSQIVMPN-------VKYRCQKSYVVLMSDGDAN 175 >gi|198434104|ref|XP_002126242.1| PREDICTED: similar to calcium activated chloride channel [Ciona intestinalis] Length = 624 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/131 (20%), Positives = 42/131 (32%), Gaps = 22/131 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+DVS SM D G + R ++ + G+V FS+ Sbjct: 53 VLVIDVSGSM-DSVAGGQTLMQRMKTFARLFINKAATYSW------LGIVAFSNDAHVVL 105 Query: 232 PLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L G + + L T + GL A + I D + I Sbjct: 106 RLTQMNKQGKEKATRAVQTLRTEGLTNISAGLFLALDLIKDRSHSRDS-----------I 154 Query: 288 IFLTDGENSSP 298 I TDG + Sbjct: 155 ILFTDGAANCG 165 >gi|170594383|ref|XP_001901943.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158590887|gb|EDP29502.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 415 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/239 (16%), Positives = 84/239 (35%), Gaps = 36/239 (15%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVK----ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 P F + ++ + + S+ ++ K D++ ++D S ++D +D Sbjct: 197 PIHFVSASSVRTANVSSSAVMSTTSTKDVVNIKPGCLFDVVFLMDFSGGVSDKRDIYIDY 256 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPL---AWGVQHIQEKINR 246 + + RS+ V + +S + F L + I++ Sbjct: 257 VSILIRSL----------DLNRTAVHVAAIYYSGPKRARTLFHLRKHSRAENAIKDLQRA 306 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 G TT++ + YA N+ + + D +K II TDG + + + Sbjct: 307 PSNGGTTRTGEAIYYATNEFNEKFGARK-------DARKMIIIFTDGHSQDNPTEASRT- 358 Query: 307 FYCNEAKRRGAIVYAIGVQ----AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 A+ +G + A+ ++ +Q + P YS +N KL F + Sbjct: 359 -----ARNKGIELKAVSIEDENIPPDTNQIIAITGDPSDAYSSKNFNKLQSFFDEYSRR 412 >gi|149709409|ref|XP_001496268.1| PREDICTED: chloride channel, calcium activated, family member 2 [Equus caballus] Length = 943 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 75/212 (35%), Gaps = 39/212 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F+SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLRLQQAAEFYLMQI---VEIHTFV--GIASFNSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + + + + T GL+ + + K Sbjct: 362 AQLHQINNDDDRKLLVSYLPATVSAEAETSICSGLKKGFEVVEKLNGKAHGS-------- 413 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 ++ +T G++ L + G+ ++ I + + A + L + +F Sbjct: 414 -VMVLVTSGDDEH----VANCLL---TVQSSGSTIHTIALGSSAVENLEELSHLTGGLKF 465 Query: 343 YSVQ--NSRKLHDAFLRIGK---EMVKQRILY 369 + NS + DAF RI ++ +Q I Sbjct: 466 FVPDKSNSNSMIDAFSRISSGTGDIFQQHIQL 497 >gi|125625081|ref|YP_001033564.1| hypothetical protein llmg_2320 [Lactococcus lactis subsp. cremoris MG1363] gi|124493889|emb|CAL98883.1| hypothetical protein predicted by Glimmer/Critica [Lactococcus lactis subsp. cremoris MG1363] Length = 1444 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/270 (14%), Positives = 72/270 (26%), Gaps = 96/270 (35%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 +D+++V+D+S SM G + +S ++ V GLV +SS Sbjct: 302 KPVDIVLVVDMSGSME---GAREGAIKQGVKSFLSSIENTA----YAQYVNVGLVGYSSP 354 Query: 226 -----KIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 T P+ G K F T + G+ + + E Sbjct: 355 GYISNSGYITVPMESLATDGHVSAMNKALERQFVGGTFTQLGIRQGAQMLKEDASGNE-- 412 Query: 277 AKGHDDYKKYIIFLTDGENSSP-------------------------------------- 298 K II +TDG + Sbjct: 413 --------KMIILMTDGVPTFSNKVSSAQLEGGVLYGTDFDSNSLDEPSFTSQLWMMNGN 464 Query: 299 --------------NIDNKESLFYCNEAKRRGAIVYAIGVQA--------EAADQFLKN- 335 N +L AK GA ++ +G+Q ++++ +L Sbjct: 465 NRTPAPYTVSGETINDTWAATLGEAKIAKDDGAEIHTLGIQLGKDSGYTNDSSNTYLSQE 524 Query: 336 --------CASPDRFYSVQNSRKLHDAFLR 357 AS + ++ + D Sbjct: 525 EVRKRTSLIASSGLYQDADSAENITDYLKN 554 >gi|302754788|ref|XP_002960818.1| hypothetical protein SELMODRAFT_402202 [Selaginella moellendorffii] gi|300171757|gb|EFJ38357.1| hypothetical protein SELMODRAFT_402202 [Selaginella moellendorffii] Length = 545 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 15/172 (8%) Query: 170 DMMMVLDVSLSMNDHFGP--GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + +VLD S SM+ GP +L VA + E+LD + + V +G F K Sbjct: 148 SLYIVLDTSSSMSISIGPLSSQSRLAVAKGILDELLDTLTNGDQVIVSDMNGGKPFGGKP 207 Query: 228 VQTF----PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V ++ I N + K ++ A+ D ++ Sbjct: 208 VSVSLEGLETSFDHAGISALKNAISNARADKLQTDIKKAFVGALDFFNSSSNLN------ 261 Query: 284 KKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 I+ LTDG+ + N+ + + +F K VY IG + F + Sbjct: 262 --VILLLTDGQFANHVNLTDLDPIFKQLNEKNVVVFVYRIGFYISNDETFQR 311 >gi|77456864|ref|YP_346369.1| hypothetical protein Pfl01_0636 [Pseudomonas fluorescens Pf0-1] gi|77380867|gb|ABA72380.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 659 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 60/175 (34%), Gaps = 13/175 (7%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G+I ++ A+ L + + + +V+++ + + L I D + L AT+ N G Sbjct: 13 QRGAIGLMAALTLGMALVFILVVVDSGRLYLERRHLQQIADVAALEAATRGGNCGAGATA 72 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L +++ +++L + + + V Sbjct: 73 NAYAQ--ASVVRNNFPIPSAGRTLAVACGTLNLD----ASNLRVFAVNAASTEAIRVVVS 126 Query: 133 YEMPFIF-----CTFPWCANSSHAPLLITSSVKISS--KSDIGLDMMMVLDVSLS 180 + +P F F ++ L T+ + S + +D S Sbjct: 127 HTVPQSFAGAIGGLFGGAGRNATINLSATAVAAVPPPLASLTIRSTALSVDTGRS 181 >gi|47209360|emb|CAF94677.1| unnamed protein product [Tetraodon nigroviridis] Length = 580 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/130 (20%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LD S SMN G L VA ++ + +++ + R LVTF Sbjct: 4 LLFLLDTSASMNQRTYLGTTFLDVAKGAVEIFM-KLRARDPASRGDRYMLVTFDDSPYGV 62 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 63 KA-GWKENHATFMCELKNLQASGLTTLGYALRAAFDLLNLNRLVSGIDNYGQGRNPFYLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAVIITITDG 131 >gi|332710564|ref|ZP_08430509.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] gi|332350619|gb|EGJ30214.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] Length = 579 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 71/199 (35%), Gaps = 19/199 (9%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + PW + I K++ + +++ +LDVS SM +KL Sbjct: 180 FSITTEISQAPWNPTHQLVHIGIQGE-KMAIEDLPPSNLVFLLDVSGSM-----NTPNKL 233 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + + R +++ ++ V+ VV +G +V I I L G + Sbjct: 234 PLLKDAFRMLVNELREEDQVSIVVYAGAAG----VVLPPTPGNEKDKILTAIENLNAGGS 289 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G++ AY D K + +I TDG+ + + E + + Sbjct: 290 TAGGAGIKLAYKLAQDNFIKSGNNR---------VILATDGDFNVGVSSDTELVKLIEQK 340 Query: 313 KRRGAIVYAIGVQAEAADQ 331 + +G + +G + Sbjct: 341 RNKGVFLTVLGFGSGNLQD 359 >gi|315647075|ref|ZP_07900188.1| magnesium chelatase [Paenibacillus vortex V453] gi|315277277|gb|EFU40606.1| magnesium chelatase [Paenibacillus vortex V453] Length = 622 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 48/140 (34%), Gaps = 18/140 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +S +G ++ V+D S SM ++ +I +L + G Sbjct: 426 QKKKESRVGATLLFVVDASGSMA-----ARKRMKAVKGAILSLLQ-----DAYVKRDQIG 475 Query: 220 LVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 ++ F + P+ V +++ + G T GL Y + K K + Sbjct: 476 MIAFRKGQAELILPVTRSVDAASKQLKGIPTGGKTPLAAGLVKGYETLLSEKRKNKGTWP 535 Query: 279 GHDDYKKYIIFLTDGENSSP 298 +I ++DG + Sbjct: 536 -------VMIVVSDGRANES 548 >gi|311694769|gb|ADP97642.1| von Willebrand factor, type A [marine bacterium HP15] Length = 581 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 46/210 (21%), Positives = 70/210 (33%), Gaps = 35/210 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D+S SM D + A R + +L S G+ TF + Sbjct: 42 DVRIIVDISGSMKDTDPDNLR--QPAVRLLARLLPEGASA---------GVWTFGQYVNM 90 Query: 230 TFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P AW IQ T +E A + F + Sbjct: 91 LVPHREVNNAWREMAIQRSAQINSVALRTNLGAAIETASDDYFTDGDLSRT--------- 141 Query: 285 KYIIFLTDGE---NSSPNIDNKESLFYCNE----AKRRGAIVYAIGVQAEAADQFLKNCA 337 + I LTDG+ + P + E + RGA + + + A FLK A Sbjct: 142 -HFILLTDGKVDISDDPAKNTAEETRILDTIVADLIERGATFHPVALSEAADTDFLKALA 200 Query: 338 --SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 S RF + L+ AFL+ V Q Sbjct: 201 TDSGGRFQVADTADALNLAFLQALNTAVPQ 230 >gi|188592039|ref|YP_001796637.1| hypothetical protein RALTA_B0200 [Cupriavidus taiwanensis LMG 19424] gi|170938413|emb|CAP63400.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424] Length = 562 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 15/84 (17%), Positives = 36/84 (42%), Gaps = 4/84 (4%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 N +G++S++ A+L+ + I + I+ H F + +L ++D + + A ++ Sbjct: 10 RTRRNTRGAVSVMAAVLIATVAIAALVSIDVGHVFMRQRQLQNMVDLAAMSAAQQL---- 65 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTD 91 + N + NI + Sbjct: 66 KRADSPANLNAAVLGTVSNIGAKN 89 >gi|34810098|pdb|1QCY|A Chain A, The Crystal Structure Of The I-Domain Of Human Integrin Alpha1beta1 Length = 193 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 65/210 (30%), Gaps = 33/210 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++VLD S S + T + ++L + P G+V + + Sbjct: 3 LDIVIVLDGSNS--------IYPWDSVTAFLNDLLKRMDIGPKQTQ---VGIVQYGENVT 51 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD--YKKY 286 F + L+ G + D K KK Sbjct: 52 HEF----NLNKYSSTEEVLVAAKKIVQRGGAQTMTALGTDTARKEAFTEARGARRGVKKV 107 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---------QFLKNCA 337 ++ +TDGE + DN + + ++I + + +K+ A Sbjct: 108 MVIVTDGE----SHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIA 163 Query: 338 S---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 S F++V + L +G+ + Sbjct: 164 SEPTEKHFFNVSDELALVTIVKTLGERIFA 193 >gi|149922555|ref|ZP_01910985.1| putative lipoprotein [Plesiocystis pacifica SIR-1] gi|149816582|gb|EDM76077.1| putative lipoprotein [Plesiocystis pacifica SIR-1] Length = 486 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 57/169 (33%), Gaps = 26/169 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + S + S+ + +++ +VLD S SM G M + R+I L I V Sbjct: 141 IGVASEIVSPSE-RLPMNITLVLDESTSMT---GAPMYAMKATARAIAGSLREGDVISLV 196 Query: 213 ----NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 +N VR + + + I+ + G T GLE Y + Sbjct: 197 SWSNSNNVRLASHAVAGS---------NDATLLDTIDAIEPGGGTDLHAGLEQGY-ALAQ 246 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 A + I + ++ ++DG + D + G Sbjct: 247 ANFSADRINR--------VVLVSDGGANLGFTDAELIAQMAELEDGEGI 287 >gi|332711435|ref|ZP_08431366.1| Mg-chelatase subunit ChlD [Lyngbya majuscula 3L] gi|332349413|gb|EGJ29022.1| Mg-chelatase subunit ChlD [Lyngbya majuscula 3L] Length = 579 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 69/199 (34%), Gaps = 30/199 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + + +V+DVS SM KL ++ + + S + ++ Sbjct: 391 QSYAKKPSQVALVVDVSGSMRGE------KLSGVQNTLLNYVQNLGSRE------KIAII 438 Query: 222 TFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 FS++I Q F + G + I L G T+ +A N + + A Sbjct: 439 PFSNEIKQPFMVEGTPQGKAEGIKFIGSLKAGGGTRLYDSAIFARNWLKQNFKTDAINA- 497 Query: 279 GHDDYKKYIIFLTDGENSSPNI---DNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFL 333 ++ LTDG++S I + + L N + +G + + + L Sbjct: 498 --------VLILTDGQDSGSEITLDNLSKQLQSSNFEAEESIAFFTVGYGKQGEFSPKVL 549 Query: 334 KNCAS-PDRFYSVQNSRKL 351 + A +Y N + Sbjct: 550 QTIAELNGGYYRQGNPETI 568 >gi|313812314|gb|EFS50028.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] Length = 323 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+M+ LDVS SM G+D+ ++++ + D + R G V F S V Sbjct: 95 DVMLCLDVSGSME-----GVDR---------QVINTYIQLADHISDDRIGFVMFDSSAVT 140 Query: 230 TFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKL---EHIAKGHDDY 283 FPL ++ + G P + Y + Sbjct: 141 VFPLTHDRDSVKAGLK--QAGERLGRADLDPAVRYGPGGSLVGDGLASCVSRFDQLDQPR 198 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 + I+ TD N + A +R +V+ I A D A + Sbjct: 199 SRSIVLATD--NMVAGPSVYTVPQAVDLAVKRHIMVFGI---VPANDDPDYRAARDELHQ 253 Query: 344 SVQNSR 349 VQ + Sbjct: 254 QVQRTH 259 >gi|53804181|ref|YP_113983.1| MxaC protein [Methylococcus capsulatus str. Bath] gi|53757942|gb|AAU92233.1| putative MxaC protein [Methylococcus capsulatus str. Bath] Length = 325 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 72/225 (32%), Gaps = 34/225 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGM--DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G +++++D S SMN++F G + ++ L + +V F Sbjct: 79 GTGAHIVLLMDRSSSMNENFSGRYLGGTAGESKNAVARRL--LADFVRRRGDDLFAMVAF 136 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ PL + + I+ + T PGL A + Sbjct: 137 SAAPRYVMPLTQDREAVLAAIDSVGDRGHGITNIAPGLAMALDFFNGRPATGA------- 189 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---AS 338 + I+ ++DG + +++ R +Y + D+ AS Sbjct: 190 ---RIILLVSDGATRIEEESQDMIRQWFQDSQSRLYWIYLRSTNSAPLDRPPTGASDDAS 246 Query: 339 PDRF--------------YSVQNSRKLHDAFLRIGKEMVKQRILY 369 P+ Y N R + +A IG+ + Q + Y Sbjct: 247 PEYLLHRYFQSLGVPYTAYEATNPRAVEEAIADIGR-LANQPLRY 290 >gi|256958585|ref|ZP_05562756.1| von Willebrand factor [Enterococcus faecalis DS5] gi|256949081|gb|EEU65713.1| von Willebrand factor [Enterococcus faecalis DS5] Length = 666 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 21/134 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ +++G + + +D + + N + G V +SS Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLAD-SGITNNINMGYVGYSSD 370 Query: 227 IVQTFPLAWG-VQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + G ++ I + T + L A + + Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGH---------- 420 Query: 283 YKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 421 -KKVIVLLTDGVPT 433 >gi|312068806|ref|XP_003137386.1| CUTiclin-Like family member [Loa loa] gi|307767445|gb|EFO26679.1| CUTiclin-Like family member [Loa loa] Length = 691 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 44/137 (32%), Gaps = 18/137 (13%) Query: 218 SGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEK 272 L+T+S + F + +N L TT + L AY + D Sbjct: 4 LALITYSGQAYVHFKFNDPQIGNNTAVIGHLNALKSIKGTTSTHIALHQAYKLLMD--TD 61 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAA 329 + A+ KK II TDG + D L K ++AI + Sbjct: 62 SGNGAREG--VKKMIIIFTDGHSQQSPQDMALRL------KNESVEIFAITLTPAPYADE 113 Query: 330 DQFLKNCASPDRFYSVQ 346 + L + D ++ Sbjct: 114 GELLSITQNTDHIFTPD 130 >gi|113953124|ref|YP_730426.1| structural toxin protein RtxA [Synechococcus sp. CC9311] gi|113880475|gb|ABI45433.1| structural toxin protein RtxA [Synechococcus sp. CC9311] Length = 2154 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 87/265 (32%), Gaps = 26/265 (9%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 D + I + +N ++ D + + F + + Sbjct: 918 ADPNAADYVFTITLDPSQSNYTVKEYATIDGTTDTTLNYDVKLTDEDLDFVDGDFEVTWA 977 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPF-------IFCTFPWCANSSHAPLLITSSVKISSK 164 S +++ ++ S+ + Y++ + I T ++ ++ V S Sbjct: 978 PSPAVLKVGENISDVASSTTPYKVDYDLVSKSDIIPTGSGTIEGTNGEDILIGDVGGGSL 1037 Query: 165 SDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDII---------KSIPD 211 + +++ +VLDVS SM + + + + +++L I + + Sbjct: 1038 VNQSINLSLVLDVSRSMILSNINFNNASVTRFSALQTATKDLLSEIAQSGATAKVQIVKY 1097 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 G F+S QT + + I+ L G T GL A N I Sbjct: 1098 STEGSDVGYYNFTSGDDQTV-----LNQAFQDIDDLQAGGGTNYEAGLVTALNWISGGIT 1152 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENS 296 + D K I F++DGE S Sbjct: 1153 SNTPLNVNQTDKDKVI-FISDGEPS 1176 >gi|68535931|ref|YP_250636.1| putative secreted protein [Corynebacterium jeikeium K411] gi|68263530|emb|CAI37018.1| putative secreted protein [Corynebacterium jeikeium K411] Length = 550 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 73/200 (36%), Gaps = 34/200 (17%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVR---SGLVTFSSKI-V 228 +LD S SM + +LG + +++D + R L+ FSSK+ Sbjct: 366 LLDTSGSMRGN------RLGDLKSILNKLIDGTAGEGNNPKGFARRETITLMPFSSKVAD 419 Query: 229 QTFPLAWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + Q ++ +N L T + AY+++ + L Sbjct: 420 GYTQENYDPDNAQQKQDLRGYVNGLQPRGETAIYDAVLRAYDRVGEGGGSLNS------- 472 Query: 283 YKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--S 338 I+ +TDG ++S D + + V+ I EA+++ ++ A + Sbjct: 473 ----IVLMTDGASNSGTSRKDFITKMTRMMDETNHKIPVFVILYG-EASEEEMRFLANFT 527 Query: 339 PDRFYSVQNSRKLHDAFLRI 358 + ++V+ L AF I Sbjct: 528 GGKVFNVRG-GDLAKAFEEI 546 >gi|119612406|gb|EAW92000.1| collagen, type XIV, alpha 1 (undulin), isoform CRA_c [Homo sapiens] Length = 849 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 78/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F + L ++ ++ + + +V F+ Sbjct: 101 DLVFMVDGSWSIGDENFNKIISFLYSTVGALNKI---------GTDGTQVAMVQFTDDPR 151 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 152 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 205 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 206 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 259 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 260 FVDD----FDAFKKIEDEL 274 >gi|119612404|gb|EAW91998.1| collagen, type XIV, alpha 1 (undulin), isoform CRA_a [Homo sapiens] Length = 865 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 78/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F + L ++ ++ + + +V F+ Sbjct: 101 DLVFMVDGSWSIGDENFNKIISFLYSTVGALNKI---------GTDGTQVAMVQFTDDPR 151 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 152 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 205 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 206 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 259 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 260 FVDD----FDAFKKIEDEL 274 >gi|2065167|emb|CAA72402.1| collagen type XIV [Homo sapiens] Length = 755 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 78/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F + L ++ ++ + + +V F+ Sbjct: 7 DLVFMVDGSWSIGDENFNKIISFLYSTVGALNKI---------GTDGTQVAMVQFTDDPR 57 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 58 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 111 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 112 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 165 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 166 FVDD----FDAFKKIEDEL 180 >gi|332305539|ref|YP_004433390.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332172868|gb|AEE22122.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 1359 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 55/186 (29%), Gaps = 42/186 (22%) Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINR----------------LIFGSTTKSTPGLEYAYN 264 V + + + L G + + R L G + Sbjct: 501 VLYYGGLDVDYGLTRGNNSVSNTVRRNTRVSHRLSYTGQDATLPSGCEEDNLSSSNCITQ 560 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYC-------------- 309 +I L I +I+ L+DGE N++ ++D E+L Sbjct: 561 QIVQGARYLSPITDRQCQVNNHIVLLSDGEANNNHSVDEIETLLSASCTGSGGEKCGLSL 620 Query: 310 --------NEAKRRGAIVYAIGVQAE-AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 I + IG A A+ FL A FY NS++L AF I Sbjct: 621 VRNIADTEESVIDSRIITHTIGFAANTQANSFLNQIALQGGGGFYQADNSQELLGAFQSI 680 Query: 359 GKEMVK 364 K + Sbjct: 681 LKTVKD 686 >gi|300071886|gb|ADJ61286.1| hypothetical protein LLNZ_11975 [Lactococcus lactis subsp. cremoris NZ9000] Length = 1438 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/270 (14%), Positives = 72/270 (26%), Gaps = 96/270 (35%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 +D+++V+D+S SM G + +S ++ V GLV +SS Sbjct: 296 KPVDIVLVVDMSGSME---GAREGAIKQGVKSFLSSIENTA----YAQYVNVGLVGYSSP 348 Query: 226 -----KIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 T P+ G K F T + G+ + + E Sbjct: 349 GYISNSGYITVPMESLATDGHVSAMNKALERQFVGGTFTQLGIRQGAQMLKEDASGNE-- 406 Query: 277 AKGHDDYKKYIIFLTDGENSSP-------------------------------------- 298 K II +TDG + Sbjct: 407 --------KMIILMTDGVPTFSNKVSSAQLEGGVLYGTDFDSNSLDEPSFTSQLWMMNGN 458 Query: 299 --------------NIDNKESLFYCNEAKRRGAIVYAIGVQA--------EAADQFLKN- 335 N +L AK GA ++ +G+Q ++++ +L Sbjct: 459 NRTPAPYTVSGETINDTWAATLGEAKIAKDDGAEIHTLGIQLGKDSGYTNDSSNTYLSQE 518 Query: 336 --------CASPDRFYSVQNSRKLHDAFLR 357 AS + ++ + D Sbjct: 519 EVRKRTSLIASSGLYQDADSAENITDYLKN 548 >gi|290543406|ref|NP_001166514.1| cochlin [Cavia porcellus] gi|195970365|gb|ACG60666.1| coagulation factor C [Cavia porcellus] Length = 553 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 64/211 (30%), Gaps = 27/211 (12%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 333 SYHMPNWFGTTK-YVKPLVQKLCSHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 391 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLI- 248 +I K+ + + V F+ F +++ I + Sbjct: 392 FVS---------NIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRSIRY 442 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T + + + +F K +++ +TDG+ + D+ Sbjct: 443 MSGGTATGDAISFTVRNVFGPVRD--------SPNKNFLVIITDGQ----SYDDVRG--P 488 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 489 AAAAHDAGITIFSVGVAWAPLDDLKDMASKP 519 >gi|149921119|ref|ZP_01909577.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] gi|149818006|gb|EDM77465.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] Length = 428 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/268 (14%), Positives = 76/268 (28%), Gaps = 80/268 (29%) Query: 170 DMMMVLDVSLSM---NDHFG-----PGMDKLGVATRSIREMLDII--------------- 206 +M+V+D S SM + P + + + ++D Sbjct: 99 QVMLVVDASGSMVNNSWDHDLDPNTPQVTRWNTLHGVVSTVMDNFGPAMYAGIQRFPSEE 158 Query: 207 ---KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE----KINRLIFGSTTKSTPGL 259 + P +N SG ++ L G I T +T G+ Sbjct: 159 ACPDATPMSSNCYNSGSCIVGTQPEVGVSLDNGASVIAAIPGPTAGNTEIVGGTPATKGM 218 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN--------KESL-FYCN 310 A H+ + + + +Y++ +TDG + + E+L Sbjct: 219 NSAV----------SHLEQQPEAFPRYVLLITDGAANCDQALSFPDYIEQYDETLPTTVQ 268 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASPD----------------------------RF 342 A G + +G+ E L+ + +F Sbjct: 269 AAFDGGITTFVVGIDIE---DMLQGVGTDGSPEANPFERLNDVAIAGGAPKNEGMDLEKF 325 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRILYN 370 Y+ N ++L DA I E+ I Sbjct: 326 YNTTNQQELLDAIQAILGEVTDCTIDLT 353 >gi|45384390|ref|NP_990268.1| cochlin precursor [Gallus gallus] gi|7387581|sp|O42163|COCH_CHICK RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2293562|gb|AAC62253.1| Coch-5B2 [Gallus gallus] Length = 547 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 69/209 (33%), Gaps = 23/209 (11%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y+MP F T L + S +++ ++D S S+ + M + Sbjct: 327 SYQMPSWFGTTK-YVKPLVQKLCSHEQMLCSKTCYNSVNIGFLIDGSSSVGESNFRLMLE 385 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FG 250 E+ DI I V T+ + +F + + I + Sbjct: 386 FISNVAKAFEISDIGSKIATVQ-------FTYDQRTEFSFTDYTTKEKVLSAIRNIRYMS 438 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + + + +F + + K +++ LTDG+ + D+ Sbjct: 439 GGTATGDAISFTTRNVFGPVKDGAN--------KNFLVILTDGQ----SYDDVRGPAVA- 485 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A++ G V+++GV D + P Sbjct: 486 -AQKAGITVFSVGVAWAPLDDLKDMASEP 513 >gi|222635792|gb|EEE65924.1| hypothetical protein OsJ_21784 [Oryza sativa Japonica Group] Length = 578 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 77/211 (36%), Gaps = 33/211 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMND------HFGPGMDKLGVATRSIREMLDII 206 L + + + +D++ VLDVS SMND +L V S++ ++ + Sbjct: 30 LRVEAPPAADLNGHVPIDVVAVLDVSGSMNDPVAASPESNLQATRLDVLKASMKFIIRKL 89 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFP------LAWGVQHIQEKINRLIFGSTTKST--PG 258 + R +V F+ V+ + G +KI+RL P Sbjct: 90 D------DGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQAVVAVALRLCPE 143 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L+ A + + + + +I+ LTDG++++ +++ + Sbjct: 144 LQEAVKILDERQGNSRNRVG-------FILLLTDGDDTTGFRWSRDVIHGA----VGKYP 192 Query: 319 VYAIGVQAEAADQFLKNCA--SPDRFYSVQN 347 V+ + A + L + A S + V + Sbjct: 193 VHTFALGAAHDPEALLHIAQESRGTYSFVDD 223 >gi|158321348|ref|YP_001513855.1| magnesium chelatase [Alkaliphilus oremlandii OhILAs] gi|158141547|gb|ABW19859.1| Magnesium chelatase [Alkaliphilus oremlandii OhILAs] Length = 629 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 54/151 (35%), Gaps = 18/151 (11%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 + + + G ++ V+D S SM ++G ++ +L+ + Sbjct: 424 LSLVIRSGDIREKVREKHTGATILFVVDASGSMG-----AKRRMGAVKGAVLSLLN--DA 476 Query: 209 IPDVNNVVRSGLVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 +N G++ F + V Q+ + L G T GL AY + Sbjct: 477 YQKRDN---VGIIAFRKDGADTLLNITRSVDLAQKCLTNLPTGGKTPLASGLYKAYELLK 533 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + + D +YI+ ++DG+ + P Sbjct: 534 IDR-------IKNADALQYIVLVSDGKGNVP 557 >gi|309362046|emb|CAP28695.2| hypothetical protein CBG_09096 [Caenorhabditis briggsae AF16] Length = 516 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 59/184 (32%), Gaps = 28/184 (15%) Query: 135 MPFIFCTFPWCANSSHA-------PLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDH 184 +P + + + P ++ + V + + LD++ +LD S S+ D Sbjct: 8 IPLLILAVTFQTEAVKIIDNGLAPPEIVHTPVSTKPRCKVFAPPLDLIFILDSSGSLRDK 67 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQE 242 F +D + I+K + R L+ FS F + + Sbjct: 68 FQDEIDIIRR----------ILKHVTIGKTATRVMLIQFSGTQHLEFNFEKFTDREELLA 117 Query: 243 KINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++ L T+ E+A ++ + K + L+DG D Sbjct: 118 ALDVLRHVSGITRIGGAFEFALQQLKTPG----SGLRDGTVP-KIVYLLSDGRTHDYPKD 172 Query: 302 NKES 305 + S Sbjct: 173 WQMS 176 >gi|309362033|emb|CAP28676.2| hypothetical protein CBG_09117 [Caenorhabditis briggsae AF16] gi|309362045|emb|CAP28694.2| hypothetical protein CBG_09097 [Caenorhabditis briggsae AF16] Length = 787 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 59/184 (32%), Gaps = 28/184 (15%) Query: 135 MPFIFCTFPWCANSSHA-------PLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDH 184 +P + + + P ++ + V + + LD++ +LD S S+ D Sbjct: 8 IPLLILAVTFQTEAVKIIDNGLAPPEIVHTPVSTKPRCKVFAPPLDLIFILDSSGSLRDK 67 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQE 242 F +D + I+K + R L+ FS F + + Sbjct: 68 FQDEIDIIRR----------ILKHVTIGKTATRVMLIQFSGTQHLEFNFEKFTDREELLA 117 Query: 243 KINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++ L T+ E+A ++ + K + L+DG D Sbjct: 118 ALDVLRHVSGITRIGGAFEFALQQLKTPG----SGLRDGTVP-KIVYLLSDGRTHDYPKD 172 Query: 302 NKES 305 + S Sbjct: 173 WQMS 176 >gi|268572089|ref|XP_002641231.1| Hypothetical protein CBG09097 [Caenorhabditis briggsae] gi|268572157|ref|XP_002641249.1| Hypothetical protein CBG09117 [Caenorhabditis briggsae] Length = 772 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 59/184 (32%), Gaps = 28/184 (15%) Query: 135 MPFIFCTFPWCANSSHA-------PLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDH 184 +P + + + P ++ + V + + LD++ +LD S S+ D Sbjct: 8 IPLLILAVTFQTEAVKIIDNGLAPPEIVHTPVSTKPRCKVFAPPLDLIFILDSSGSLRDK 67 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQE 242 F +D + I+K + R L+ FS F + + Sbjct: 68 FQDEIDIIRR----------ILKHVTIGKTATRVMLIQFSGTQHLEFNFEKFTDREELLA 117 Query: 243 KINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++ L T+ E+A ++ + K + L+DG D Sbjct: 118 ALDVLRHVSGITRIGGAFEFALQQLKTPG----SGLRDGTVP-KIVYLLSDGRTHDYPKD 172 Query: 302 NKES 305 + S Sbjct: 173 WQMS 176 >gi|268572085|ref|XP_002641230.1| Hypothetical protein CBG09096 [Caenorhabditis briggsae] Length = 564 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 59/184 (32%), Gaps = 28/184 (15%) Query: 135 MPFIFCTFPWCANSSHA-------PLLITSSVKISSKSDI---GLDMMMVLDVSLSMNDH 184 +P + + + P ++ + V + + LD++ +LD S S+ D Sbjct: 8 IPLLILAVTFQTEAVKIIDNGLAPPEIVHTPVSTKPRCKVFAPPLDLIFILDSSGSLRDK 67 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQE 242 F +D + I+K + R L+ FS F + + Sbjct: 68 FQDEIDIIRR----------ILKHVTIGKTATRVMLIQFSGTQHLEFNFEKFTDREELLA 117 Query: 243 KINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++ L T+ E+A ++ + K + L+DG D Sbjct: 118 ALDVLRHVSGITRIGGAFEFALQQLKTPG----SGLRDGTVP-KIVYLLSDGRTHDYPKD 172 Query: 302 NKES 305 + S Sbjct: 173 WQMS 176 >gi|290991702|ref|XP_002678474.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284092086|gb|EFC45730.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 467 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 77/211 (36%), Gaps = 36/211 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 L++++VLD+ + G M + + I + I ++ R G+V F Sbjct: 88 NERKDLNLVIVLDI----SGSMGSAMSSKEKTKMQVANEV-ICEIIENLREFERLGIVLF 142 Query: 224 SSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 K PL + ++E + + +T G++ + + Sbjct: 143 DDKAETLLPLTIVQDLDKKSLKETVLNIKEKGSTNFEAGMQRGIDLFSSLDSSDLSNSNR 202 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYA--IGVQAEAADQF---- 332 II+LTD + PN+ +L + G +++ +G+ + + Sbjct: 203 -------IIYLTD---ACPNVGGTATLDILTKDANSGPYSIFSTFVGIGLDFNSKIVDEL 252 Query: 333 --LKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 ++ C ++SV+++ + F +I E Sbjct: 253 TRVRGC----NYFSVKSTEE----FKKILNE 275 >gi|126316412|ref|XP_001380737.1| PREDICTED: similar to integrin, alpha 1 [Monodelphis domestica] Length = 1183 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/267 (14%), Positives = 89/267 (33%), Gaps = 37/267 (13%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 +++ + + + Y + T C++ ++ S + LD+++V Sbjct: 122 TLVTNPKGGFLACGPLYAYRCGHTYYTTGICSDVDSKFQVVNSIAPSVQGCNTQLDIVIV 181 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S S + T + +L + P G+V + + F L Sbjct: 182 LDGSNS--------IYPWTSVTDFLNSLLGKMDIGPKQTQ---VGIVQYGENVTHEFNLN 230 Query: 235 --WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + N+++ T + G++ A + F KK ++ + Sbjct: 231 KYTTTEEVLIAANQIVQRQGRQTMTALGIDTARKEAFTKARGAR------SGVKKVMVIV 284 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---------QAEAADQFLKNCAS--- 338 TDGE + DN + + ++I + E + +K+ AS Sbjct: 285 TDGE----SHDNHRLNEVIQDCEDEDIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPT 340 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 F++V + L +G+ + Sbjct: 341 EKHFFNVSDELALVTIAEVLGERIFAL 367 >gi|260778728|ref|ZP_05887620.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] gi|260604892|gb|EEX31187.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] Length = 463 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 48/293 (16%), Positives = 103/293 (35%), Gaps = 54/293 (18%) Query: 14 KGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGK 73 KG +++++ I P + + G I++ F VKAKL +D + + A + E+ Sbjct: 14 KGLVALISVIAAPFLILATGTAIDSGRAFLVKAKLFAAVDAAGIAAARAVAEGEDA---- 69 Query: 74 KQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY 133 D + D ++ ++ D + ++ + Sbjct: 70 ---------------ARDAAIKFYNANLPTDYHDSTTASPTVTFGYDSFGNISIDLSASA 114 Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKL 192 E+ F H+ L I+++ + + +D+++V+D + S+ G D + Sbjct: 115 EVSTTFL-----GVFGHSSLEISATAQTVRR---PVDLVLVVDNTTSLRLGSIGDVTDDV 166 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL--AWG--VQHIQEKINRLI 248 ++ + + R LV ++ + G I +I+ Sbjct: 167 VARSKDFITNFNE--------SFDRIALVKYAYGAEVPVAFQSSRGHSRSDITTEIDAFD 218 Query: 249 FGST-----TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 FGS T S+ G+ A + + + + K I+F TDG + Sbjct: 219 FGSLSSLQYTNSSEGIYLALDALRNVTDPANL---------KVIVFFTDGAPN 262 >gi|109112823|ref|XP_001117651.1| PREDICTED: integrin alpha-E-like [Macaca mulatta] Length = 956 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 58/164 (35%), Gaps = 21/164 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G ++ ++LD S S++ P A I M+ N LV + Sbjct: 363 AGTEIAIILDGSGSID----PP--DFQRAKDFISNMMRNFYEKCFECNF---ALVQYGGV 413 Query: 227 IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I F L + + + N GS TK+ +++ + IF + A Sbjct: 414 IQTEFDLRDSQDVMASLAKVQNITQVGSVTKTASAMQHVLDNIFTSSHGSRRKA------ 467 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K ++ LTDG D + N K G +AIGV E Sbjct: 468 SKVMVVLTDG---GIFEDPLDLTTVINSPKMHGVERFAIGVGEE 508 >gi|194225621|ref|XP_001916184.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Equus caballus] Length = 3570 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 74/210 (35%), Gaps = 40/210 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +L +R++L +P R +VTFSSK Sbjct: 81 RLELVFLVDESSSVGQ--ANFLSELK----FVRKLLSDFPVVP---TATRVAIVTFSSKN 131 Query: 228 VQTFPLAWGVQH---------IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++I + + G T + + A + ++E Sbjct: 132 NVVPRVDYISSRRAHQHKCALLSQEIPAITYRGGGTYTKGAFQQAAQILRHSRENS---- 187 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K I +TDG ++ + + G ++ G+ + + Sbjct: 188 ------TKVIFLITDGYSNGG-----DPRPVAASLRDFGVEIFTFGIWQGNIRELNDMAS 236 Query: 338 SP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +P + Y + + + F + + + + Sbjct: 237 TPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|121583396|ref|YP_973827.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596650|gb|ABM40085.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 212 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 68/197 (34%), Gaps = 16/197 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM+ + ++ ++ ++ P ++TF S+ Sbjct: 3 RLPVYLLVDTSGSMSGE------PIEAVKNGVQVLVSTLRQDPYALETAFLSIITFDSEA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + +I ++ TT L +KI + KG +I Sbjct: 57 RQVVPLT---ELANFQIPAIVATGTTALGSALSLLADKIEMEVGRTTAEVKGDWKPIVFI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG SP D K+ L + G ++ A A LK + + Sbjct: 114 --MTDG---SPTDDWKKGLERLKTVRT-GMVI-ACAAGPGADTTVLKQITEIVVQLNTAD 166 Query: 348 SRKLHDAFLRIGKEMVK 364 + + F + + Sbjct: 167 ASTIKAFFKWVSASVSS 183 >gi|302804196|ref|XP_002983850.1| hypothetical protein SELMODRAFT_423098 [Selaginella moellendorffii] gi|300148202|gb|EFJ14862.1| hypothetical protein SELMODRAFT_423098 [Selaginella moellendorffii] Length = 544 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 15/172 (8%) Query: 170 DMMMVLDVSLSMNDHFGP--GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + +VLD S SM+ GP +L VA + E+LD + + V +G F K Sbjct: 147 SLYIVLDTSSSMSISIGPLSSQSRLAVAKGILDELLDTLTNGDQVIVSDMNGGKPFGGKP 206 Query: 228 VQTF----PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V ++ I N + K ++ A+ D ++ Sbjct: 207 VSVSLEGLETSFDHAGISALKNAISNARADKLQTDIKKAFVGALDFFNSSSNLN------ 260 Query: 284 KKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 I+ LTDG+ + N+ + + +F K VY IG + F + Sbjct: 261 --VILLLTDGQFANHVNLTDLDPIFKQLNEKNVVVFVYRIGFYISNDETFQR 310 >gi|297292637|ref|XP_001090691.2| PREDICTED: anthrax toxin receptor 2 [Macaca mulatta] Length = 488 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 73/204 (35%), Gaps = 33/204 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD--VNNVVRSGLVTFSS 225 D+ VLD S S+ +++ E+ + ++ + + V+ +R + FSS Sbjct: 42 AFDLYFVLDKSGSVANNW--------------IEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 226 KIVQTFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + PL I E + R+ T GL+ KI + ++ Sbjct: 88 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLVIEKIIYSGDEKXXXXX---- 143 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 I LTDG+ + + ++ GA VY +GV Q + S ++ Sbjct: 144 -----IALTDGKLDG--LVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQV 196 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V+ A I ++ Q Sbjct: 197 FPVKGG---FQALKGIINSILAQS 217 >gi|90414550|ref|ZP_01222524.1| hypothetical protein P3TCK_02211 [Photobacterium profundum 3TCK] gi|90324357|gb|EAS40923.1| hypothetical protein P3TCK_02211 [Photobacterium profundum 3TCK] Length = 668 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 67/203 (33%), Gaps = 30/203 (14%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 PF+ W + + + S G + +V+D+S SM ++L A Sbjct: 54 PFVILALGWLLSVLALTGPSWEKNTLPAYSLSGARV-LVMDMSRSM-YATDIAPNRLTQA 111 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIFGS 251 +ML K +GLV +++ PL ++ I ++ Sbjct: 112 RFKALDMLPGWKEGS-------TGLVAYAADGYVVSPLTEDSSTLKNLIPNLSPEIMPIQ 164 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 + + G++ A + + A + II +TDG + D +L Sbjct: 165 GSNAAAGVQEAISLLKQAGHQAGD-----------IIIITDG-MTQQESDQTMAL----- 207 Query: 312 AKRRGAIVYAIGVQAEAADQFLK 334 K + + + + + + Sbjct: 208 VKDQDYRLSILAIGTQQGAPIKQ 230 >gi|218659662|ref|ZP_03515592.1| hypothetical protein RetlI_08405 [Rhizobium etli IE4771] Length = 81 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 27/58 (46%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +R F+ + +G + LT I +P++ L+I+ + L +D L A ++ Sbjct: 6 VRRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGAREL 63 >gi|226326039|ref|ZP_03801557.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] gi|225205581|gb|EEG87935.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] Length = 823 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 61/163 (37%), Gaps = 23/163 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D+++++D S SM+++ +K + ++ +++D + S V+ R LVTF + Sbjct: 450 FDIVLIMDTSTSMSNN-----NKWRNSKTAVNKLIDTLSSQTTVDVNYR--LVTFGTTAQ 502 Query: 229 QTFPLAWGVQHIQEKIN--RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 G + ++ ++ + T GL + +K Sbjct: 503 IQTNWTTG-ETVKSTLSNYSIKEDQGTNYEDGLVKTKEALSSGTRADA---------EKI 552 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 I+FLTDG+ + K Y VY + A A Sbjct: 553 IVFLTDGQPTFY----KSGTSYAGPGSSTSYRVYQHALDAAAK 591 >gi|220678711|emb|CAX12780.1| novel protein similar to H.sapiens ANTXR1, anthrax toxin receptor 1 (ANTXR1) [Danio rerio] Length = 270 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/132 (24%), Positives = 46/132 (34%), Gaps = 15/132 (11%) Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FS++ L I +N L I G T GLE A +I+ Sbjct: 3 RMSFIVFSTRGTTIMRLTENRDDITRGLNTLKREIPGGDTYMNLGLEEANVQIYHGNYGA 62 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 II LTDGE + + A+ GAIVY +GV+ Q Sbjct: 63 AS----------VIIALTDGELNDHQFVTAQ--QEAQRARSMGAIVYCVGVKDFNETQLA 110 Query: 334 KNCASPDRFYSV 345 + + + V Sbjct: 111 TIADTIEHVFPV 122 >gi|163797398|ref|ZP_02191350.1| hypothetical protein BAL199_28750 [alpha proteobacterium BAL199] gi|159177317|gb|EDP61874.1| hypothetical protein BAL199_28750 [alpha proteobacterium BAL199] Length = 683 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 73/203 (35%), Gaps = 30/203 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ +++ V+DVS SM G + + S E L + +V + Sbjct: 324 GNMTREVIFVIDVSGSMK---GEPLRAAKASLTSGIEGLGRNDTF-NVVAFNNKAAAFYD 379 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + + + I+ L G T+ E A D + Sbjct: 380 APVRASGKF---HRAALKVIDGLKAGGGTEMAAAFELALQMPGDPDRLQQ---------- 426 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAK-RRGA-IVYAIGVQAEAADQFLKNCASPDR- 341 ++F+TDG + N+ +LF N+ K GA ++ +G+ + F++ A R Sbjct: 427 --VVFITDG-----AVSNEAALF--NQIKGELGARRLFTVGIGSAPNTFFMEEAARFGRG 477 Query: 342 -FYSVQNSRKLHDAFLRIGKEMV 363 + + ++ + ++ Sbjct: 478 TYTYIGDTSSAERVMRDLFTKIS 500 >gi|296223008|ref|XP_002757441.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Callithrix jacchus] Length = 1289 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 55/170 (32%), Gaps = 30/170 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 VYILIDTSHSMK-------SKLDLVKDKIIKFIQEQLKYKSRFNFVKFDGQAVAWREQLA 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 562 EVNEENLEQAQSWIRDMKIGSSTNTLSALKTAFA----DKETQA------------IYLL 605 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P + + E +Y I A++FLK A+ Sbjct: 606 TDGRPDQPPEMVIDQVKVFQE-----IPIYTISFNYNDEIANRFLKEIAA 650 >gi|148974876|ref|ZP_01811856.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] gi|145965385|gb|EDK30634.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] Length = 418 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 36/265 (13%), Positives = 79/265 (29%), Gaps = 31/265 (11%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 ++ I M ++ S + +L + + L E N + Y + Sbjct: 1 MMVIFMAFSMQMSQQMLAHTRLLEAAEVASLALIASPREDEENNVKYARYLVDRYVVDNT 60 Query: 87 IWQTDFRNEL---RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 ++G Q + + + ++ + A + F Sbjct: 61 DDVDVAVYTSICEYKDGCVQASGELAPFSDFVVRATAKYTSWI--AYEDVNLKPEFSVSG 118 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + P +D+ + D S SM + + G KL V +I+ ++ Sbjct: 119 RAVTRKYLPQP--------------VDVYFIGDFSGSMGNPWKNGKMKLDVVKETIKRVV 164 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D I+ + R L+ ++ PL +I RL +++ ++AY Sbjct: 165 DDIEEF-NSEEKSRVALLGYN-------PL---HVKQSNEIVRLNAYGY-RASWRKKHAY 212 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYII 288 + ++ II Sbjct: 213 DYARNSPATTVRRMFDEPTLYNEII 237 >gi|58395788|gb|AAW72737.1| thrombospondin related adhesive protein [Plasmodium reichenowi] Length = 565 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 69/223 (30%), Gaps = 32/223 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPG 188 +Y + F + I +K + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH---- 61 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--------I 240 ++ + +I+ + + + FS+ + L I Sbjct: 62 ----NWVKHAVPLAIKLIQQLNLNESAIHLYASIFSNNAREIISLHSDASKNKEKALIII 117 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 + +N + T + L + D + + ++ LTDG +S Sbjct: 118 KSLLNTNLPYGKTNLSDALLQVRKHLND--------RINRQNANQLVVILTDGIPNSIQD 169 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 SLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLAGCHPSD 208 >gi|304404942|ref|ZP_07386602.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304345821|gb|EFM11655.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 433 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 59/175 (33%), Gaps = 19/175 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----S 224 +D+M+V+D + SM+ D+L +++++ + + +R + Sbjct: 225 VDLMLVVDTTGSMS-------DELRYLEAELKDVVTRVSEQNNGQLDIRVSSNFYRDEHD 277 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 +V+ FP V ++I +E A + + E + + Sbjct: 278 DYVVRPFPFTRDVDKAVDQIADQEAFGGGDFPEAVELALSNAIEEHEWSK------EALA 331 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKNCA 337 + + + D K+ +A +G + I A + ++ A Sbjct: 332 RLMFVVLDAPPHHEPQIMKKLQKLTAKAAEQGIRIIPIASSGVDIATEHLMRYIA 386 >gi|291456048|ref|ZP_06595438.1| putative von Willebrand factor type A domain protein [Bifidobacterium breve DSM 20213] gi|291382457|gb|EFE89975.1| putative von Willebrand factor type A domain protein [Bifidobacterium breve DSM 20213] Length = 1238 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 38/167 (22%), Positives = 59/167 (35%), Gaps = 25/167 (14%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM--LDIIKSIPDVN 213 T S +S +D+ VLD S SMND G +L +I + L + Sbjct: 493 TVSGTTTSGEKAKIDVAFVLDTSGSMNDKVGNS-TRLKNMQNAITDNGGLSSVLFNSPDK 551 Query: 214 NVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + ++TF+S + T L+ + E +N L T GLE N Sbjct: 552 IDAQVHVITFASGLGLDGTSVLST-KADLDEVVNGLTANGATHWEKGLERVSNISTRPG- 609 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE-------SLFYCNE 311 KY++FLTDG+ + L C++ Sbjct: 610 -----------ATKYVVFLTDGDPGNKGWKETNVYSCGVLGLQTCDD 645 >gi|324996174|gb|EGC28084.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK678] Length = 458 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 50/343 (14%), Positives = 119/343 (34%), Gaps = 58/343 (16%) Query: 19 ILTAILLPVIFIVMGLVI------------ETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 ++ +L+ +I +++G++ E S + ++ + Y +D ++ + + Sbjct: 15 VMAIMLMSMIALIIGIIFKTMFSSRELIEREASIQAEMRTSMQY-VDRTVSKATSIFVLD 73 Query: 67 ENGNNGKKQK----------NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 ++ G KQ + +++ +W + ++ + + N++ Sbjct: 74 DSKFKGSKQGLTREWSYIGLSADGKKVLNYVWNKEKQDWDVSELGTKSLYNMKLDLEFKT 133 Query: 117 ---IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI-----------S 162 D++ YNL+ +Y ++ + + + K Sbjct: 134 EGAYQDNRLISYNLT--GKYPDTNSKLGIDTAISALNTKQVFSKVAKGKKGIAIAYRTDP 191 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGM------DKLGVATRSIREMLDIIKSIPDVNNVV 216 + + + + V D+S SM ++ + M++ ++S+ +V+ + Sbjct: 192 IQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINELQSVGNVSVNL 251 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEH 275 + T S K L I+E I L T GL Y + +L Sbjct: 252 TTFSTTGSYKQAAFSQLDREAGTIKESIKNLKSDGGVTNPGDGLRYGMVSLQKQHAQL-- 309 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 KY++ LTDG ++ + N++ E KR G Sbjct: 310 ---------KYVVLLTDGVPNAY-LVNQQGQAGGLEMKREGIQ 342 >gi|237836353|ref|XP_002367474.1| microneme protein 2 [Toxoplasma gondii ME49] gi|211965138|gb|EEB00334.1| microneme protein 2 [Toxoplasma gondii ME49] Length = 723 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 70/191 (36%), Gaps = 40/191 (20%) Query: 151 APLLITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 P + ++ S LD+ ++D S S G+ + + + L Sbjct: 5 VPEGVEDVIQSESAIGAAEGCTNQLDICFLIDSSGS------IGIQNFRLVKQFLHTFLM 58 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----------STTK 254 ++ P+ V + +VT+S+ + L W +Q +++ + +T Sbjct: 59 VLPIGPEE---VNNAVVTYSTDVH----LQWDLQS-PNAVDKQLAAHAVLDMPYKKGSTN 110 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 ++ GL+ +F G + K +I +TDGE + + ++ E + Sbjct: 111 TSDGLKACKQILFTGSR------PGREHVPKLVIGMTDGE----SDSDFRTVRAAKEIRE 160 Query: 315 RGAIVYAIGVQ 325 G IV + V Sbjct: 161 LGGIVTVLAVG 171 >gi|17231904|ref|NP_488452.1| hypothetical protein alr4412 [Nostoc sp. PCC 7120] gi|17133548|dbj|BAB76111.1| alr4412 [Nostoc sp. PCC 7120] Length = 820 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 57/177 (32%), Gaps = 32/177 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KI 227 D++ ++D S S G + + R L+ + +V FS + Sbjct: 300 DVVFLIDTSGS---QMGAPLMQCQELMRRFINGLNPDDTFS---------IVDFSDTTRQ 347 Query: 228 VQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + PLA IN+L T+ G+ N +L Sbjct: 348 LSPVPLANNAQNRTRAINYINQLSANGGTEMLRGIRAVLNFPVTDPGRL----------- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + I+ LTDG + N + L + + G +Y+ G + L A R Sbjct: 397 RSIVLLTDGYIGNEN----QILAEVQQHLKSGNRLYSFGAGSSVNRFLLNRIAELGR 449 >gi|330719552|gb|EGG98147.1| BatA [gamma proteobacterium IMCC2047] Length = 166 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 23/93 (24%) Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------------ 325 + +I LTDG N++ I+ L A+++ ++ IGV Sbjct: 13 SRLLILLTDGANTAGEIEP---LKAAELAQQQQIKIHTIGVGANEMLVPGLFSSRRVNPS 69 Query: 326 AEAADQFLKNCAS--PDRFYSVQNSRKLHDAFL 356 A+ + LK AS +++ Q++ +L + Sbjct: 70 ADLDEDTLKKIASQTGGQYFRAQDTEQLQQIYA 102 >gi|169642421|gb|AAI60698.1| Unknown (protein for IMAGE:4964804) [Xenopus laevis] Length = 519 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 78/217 (35%), Gaps = 35/217 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D +++ +VLD S S+ G ++ A + ++ + + R +++++ Sbjct: 236 KDGLMNIFIVLDTSKSV------GQNRFDEAKSASILFIEKMSNYDIKP---RYCIISYA 286 Query: 225 SKIVQTFPL----AWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 SK + L + + E + +R T + L Y + + + E Sbjct: 287 SKAISVVSLRDPDSNNADAVMEHLEEFQYDRHEDKQGTNTRAALHAIYEHLIEQELAYER 346 Query: 276 IAKGHDDYK--KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI----------VYAIG 323 K D K I+ +TDG+ + D +E + G VY G Sbjct: 347 EGKKEDFMKIHNVILLMTDGKFNMGG-DPREEMKLIKRFLDVGIRKDNPREEYLDVYVFG 405 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAFL 356 + ++ + + AS + +QN K+ + F Sbjct: 406 LGSDIDQPEINDLASKKEKEVHTFHLQNVEKMKEFFE 442 >gi|296475339|gb|DAA17454.1| cochlin precursor [Bos taurus] Length = 550 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 65/213 (30%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ + M + Sbjct: 330 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGESNFRLMLE 388 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-----NR 246 +I K+ + + V F+ Q ++ +E + N Sbjct: 389 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 437 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 438 SYMSGGTATGDAISFTVRNVFGPVRD--------SPNKNFLVIVTDGQ----SYDDVRG- 484 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 485 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 516 >gi|115305395|gb|AAI23842.1| COCH protein [Bos taurus] Length = 550 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 65/213 (30%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ + M + Sbjct: 330 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGESNFRLMLE 388 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-----NR 246 +I K+ + + V F+ Q ++ +E + N Sbjct: 389 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 437 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 438 SYMSGGTATGDAISFTVRNVFGPVRD--------SPNKNFLVIVTDGQ----SYDDVRG- 484 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 485 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 516 >gi|148230547|ref|NP_001081234.1| complement factor B [Xenopus laevis] gi|602779|dbj|BAA06179.1| complement factor B [Xenopus laevis] Length = 747 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 78/217 (35%), Gaps = 35/217 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D +++ +VLD S S+ G ++ A + ++ + + R +++++ Sbjct: 236 KDGLMNIFIVLDTSKSV------GQNRFDEAKSASILFIEKMSNYDIKP---RYCIISYA 286 Query: 225 SKIVQTFPL----AWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 SK + L + + E + +R T + L Y + + + E Sbjct: 287 SKAISVVSLRDPDSNNADAVMEHLEEFQYDRHEDKQGTNTRAALHAIYEHLIEQELAYER 346 Query: 276 IAKGHDDYK--KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI----------VYAIG 323 K D K I+ +TDG+ + D +E + G VY G Sbjct: 347 EGKKEDFMKIHNVILLMTDGKFNMGG-DPREEMKLIKRFLDVGIRKDNPREEYLDVYVFG 405 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAFL 356 + ++ + + AS + +QN K+ + F Sbjct: 406 LGSDIDQPEINDLASKKEKEVHTFHLQNVEKMKEFFE 442 >gi|320533438|ref|ZP_08034121.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 171 str. F0337] gi|320134346|gb|EFW26611.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 171 str. F0337] Length = 369 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 44/252 (17%), Positives = 81/252 (32%), Gaps = 60/252 (23%) Query: 139 FCTFPWCANSSHAPLLITSSVKIS--SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 ++ A + V+++ S + D+++ LDVS SM Sbjct: 83 LLAVMLVSSLLSAAAIAGRPVRVTERSDALANRDIVLCLDVSTSM--------------V 128 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--------- 247 R +L I + + R GLV ++S PL +++++ L Sbjct: 129 RIDSSVLTTFSEILEDFDGERVGLVAWNSAAQTIVPLTDDYDLLRDQLQELGDVLDIDPK 188 Query: 248 -----------IFGSTTKSTP--GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 T++T G A + + + ++ D + II TD + Sbjct: 189 NVTLKQQRDYEEAFGGTQTTGVNGSSLAGDGLASCAQAFDNQ---GLDRSRSIILATDNQ 245 Query: 295 NSSPNIDNKESL-FYCNEAKRRGAIVYAIGVQAEAADQF--------------LKNCASP 339 P+ + L N R +++I A+ F LK Sbjct: 246 VIDPDDEQIYPLPDAVNLLAERKIRLFSI-YGADDDQDFQNLLDKSPEESREELKTVTEG 304 Query: 340 ---DRFYSVQNS 348 RFY V++S Sbjct: 305 PGKGRFYDVEDS 316 >gi|307720885|ref|YP_003892025.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978978|gb|ADN09013.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 515 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 41/273 (15%), Positives = 84/273 (30%), Gaps = 43/273 (15%) Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 F + N++E S ++I L +RY + + P+ + + Sbjct: 22 SFKKTKNSMENIFSQAVIKQLALNTQLLKTSARYR--YFLLVMALMIIALARPVYLKQNA 79 Query: 160 KISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + ++ LDVS SM P + ++L I+ +++ Sbjct: 80 VT---KQMSSSAVIALDVSKSMHASDIYPS-----RLGLARLKLLKFIQKANNLH----V 127 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G++ F+ +PL+ Q + + + T LE + K Sbjct: 128 GILIFAKNSYMLYPLSEDTQALAYMLKNADIKQKLEPNTNLFGVLESGKKMLEKEK---- 183 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 K II LTDG D + + +YAI + + Sbjct: 184 ---------TKNIILLTDGGEDVSRADEISYIL------KNHLKLYAIDFGPKPNNSLKN 228 Query: 335 NCASPDRFYSVQ-----NSRKLHDAFLRIGKEM 362 + +Y + + DA + +++ Sbjct: 229 MTQKSEGYYMKYQWTQSDIEGILDAIQKSSQKI 261 >gi|281420097|ref|ZP_06251096.1| von Willebrand factor, type A [Prevotella copri DSM 18205] gi|281405897|gb|EFB36577.1| von Willebrand factor, type A [Prevotella copri DSM 18205] Length = 289 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 22/102 (21%), Positives = 43/102 (42%), Gaps = 10/102 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + R + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDF------GTMKQMKRDLATEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEY 261 S +I + P G +H I+E +N T LEY Sbjct: 126 SDRIEKYIPPKKGRKHILYIIREMLNFQPQSQRTDIGCALEY 167 >gi|156084610|ref|XP_001609788.1| thrombospondin-related anonymous protein [Babesia bovis] gi|154797040|gb|EDO06220.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 660 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 70/193 (36%), Gaps = 16/193 (8%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 +P + F HA + LD +V+D S S+++ G Sbjct: 10 SVPLLSLAF-LATTGLHAFADKGVGSPKGKQCKKQLDFSIVVDESASISNDQWEGQ---- 64 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGST 252 +R ++ + N +R L T+S+ Q F L + + +L + + Sbjct: 65 -MIPFLRNLIHTVDL---DNTDIRLSLTTYSTPTRQIFTFLDAAASSTRLALTKLDWMAG 120 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 TK+ G+ Y + ++ + G + K ++ +TDG +S + + Sbjct: 121 TKARSGMTYTGRALNYVRK--AILPYGRKNVPKALLLITDGVSSDGSY----TAQVAAML 174 Query: 313 KRRGAIVYAIGVQ 325 + G V IGV Sbjct: 175 RDEGVNVMVIGVG 187 >gi|330506881|ref|YP_004383309.1| magnesium-chelatase subunit ChlD [Methanosaeta concilii GP-6] gi|328927689|gb|AEB67491.1| magnesium-chelatase subunit ChlD [Methanosaeta concilii GP-6] Length = 686 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 41/263 (15%), Positives = 82/263 (31%), Gaps = 38/263 (14%) Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 + + KD + A R P+ + + + + ++ V+D Sbjct: 445 MPKEGKDIAIDATIRAAAPYQKAR----SGPNAIKVKSEDIREKERARKTSAMLLFVVDG 500 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQTFPLAWG 236 S SM M ++ A ++ +L + + G+V F + P + Sbjct: 501 SGSMG-----AMQRMESAKGAVLSLL-----MESYQKRDKIGMVAFRGTEAELILPPSSS 550 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 V ++ L G T + GL + K + K ++ ++DG + Sbjct: 551 VDLALSRLKELPTGGKTPLSAGLSRGLQLLQGEMRKDA-------ETKLMMVLVSDGRAN 603 Query: 297 --SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-----FLKNC-----ASPDRFYS 344 E + K+ G V+ I + E D L C + ++Y Sbjct: 604 VGMGGKIKDELMEISERTKQLG--VHTIVIDTEVVDSSFMEMRLGYCREIAEMTGGKYYP 661 Query: 345 VQNSRKLHDAFLRIGKEMVKQRI 367 + +A I E K + Sbjct: 662 ISGLSS--EALYSIVDEEQKLLL 682 >gi|325089698|gb|EGC43008.1| U-box domain-containing protein [Ajellomyces capsulatus H88] Length = 759 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 66/174 (37%), Gaps = 27/174 (15%) Query: 143 PWCANSSHAPLLITSSVKISSKSDI---------GLDMMMVLDVSLSMNDHFG-PGMDKL 192 P PL T+S+ +S S + D+++ +DVS SM P D+ Sbjct: 39 PNEVGVQLHPLPDTNSMILSVHSPLHPEKEMPHVPCDIVLCIDVSYSMQSSAPLPTTDES 98 Query: 193 GVATRSIREMLDIIK-----SIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEK 243 G + +LD+ K I +N R G+V FS++ + ++ + + Sbjct: 99 GEREETGLSVLDLTKHAARTIIETLNENDRLGIVAFSTEAEVVYKISKMNESNKKAALKA 158 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + L S+T GL+ + + + + + + LTDG + Sbjct: 159 VEALKPLSSTNLWHGLKLGLKAFENERRTPQSV--------QALYVLTDGMPNH 204 >gi|307305135|ref|ZP_07584884.1| von Willebrand factor type A [Sinorhizobium meliloti BL225C] gi|306902475|gb|EFN33070.1| von Willebrand factor type A [Sinorhizobium meliloti BL225C] Length = 631 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 75/222 (33%), Gaps = 32/222 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + L +T V +S +D D + VLDV + L + + Sbjct: 431 IHMMSRPQAHDLAVTILVDVSLSTDAWFDDLRVLDVE-------KQALQVLAHGLSACGD 483 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +I+ + VR V + + I+ +I L G T+ + + Sbjct: 484 AHEILTFTSRRRDWVRIETVKAFDEAMSAT--------IEARIAALKPGYYTRIGAAIRH 535 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A + + + K +I LTDG+ + ++S EA+R G Sbjct: 536 AAAGLVERPNRR-----------KLLIVLTDGKPNDVDHYEGRFALEDSRRAVGEARRSG 584 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 V+ + V EA +L + + V N +L A I Sbjct: 585 ISVFGVTVDREA-KSYLPVIFGQNGYAVVSNIGRLPAALPAI 625 >gi|149410435|ref|XP_001512838.1| PREDICTED: similar to Coch-5B2 gene product [Ornithorhynchus anatinus] Length = 692 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 57/177 (32%), Gaps = 30/177 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +++ ++D S S+ D + + +++ + V F+ Sbjct: 507 SVNIAFLIDGSSSVGDS------NFRLMLDFVARVVETFEISDIGTK---VAAVQFT--Y 555 Query: 228 VQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 Q ++ +E + +I T + + + +F Sbjct: 556 DQRTEFSFTDYTTKENVLAVIRQIRYMSGGTATGDAVAFTVRNVFGPLRD--------SP 607 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K +++ LTDG+ + D+ A + G V++IG+ D + P Sbjct: 608 NKNFLVVLTDGQ----SYDDVRG--PAAAAHKAGITVFSIGMAWAPLDDLKDMASEP 658 >gi|134101426|ref|YP_001107087.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] gi|133914049|emb|CAM04162.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 501 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 75/207 (36%), Gaps = 26/207 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA--------TRSIREMLDIIKSIPDVNN 214 +S ++ VLD S SM+ GP + L A ++ + + K ++ Sbjct: 306 PQSRDPSHVIFVLDFSASMS---GPRIAALRSAFAGFSGADPSAVGKFVRFYKG--EMVT 360 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++R G + A ++ +Q+ I F TT ++ + A K Sbjct: 361 IMRFGGHVLDERDFTITGQA-DLKAVQDYIAADRFDQTTGV-------WSALEAAYAKAA 412 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + H + I+ +TDGEN++ D + + A + Y + + Sbjct: 413 AATRDHPEQPVTIMLMTDGENNAGISLQDFLRNHQARDPAAKA-VHTYTVRFGEANPGEL 471 Query: 333 LKNC-ASPDRFYSVQNSRKLHDAFLRI 358 + A+ R N+ L +AF I Sbjct: 472 DQAARATGGRMVDA-NATSLSEAFKEI 497 >gi|46403702|gb|AAS92908.1| putative TrkA-like nucleotide-binding protein [Sinorhizobium meliloti] Length = 631 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 75/222 (33%), Gaps = 32/222 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + L +T V +S +D D + VLDV + L + + Sbjct: 431 IHMMSRPQAHDLAVTILVDVSLSTDAWFDDLRVLDVE-------KQALQVLAHGLSACGD 483 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +I+ + VR V + + I+ +I L G T+ + + Sbjct: 484 AHEILTFTSRRRDWVRIETVKAFDEAMSAT--------IEARIAALKPGYYTRIGAAIRH 535 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A + + + K +I LTDG+ + ++S EA+R G Sbjct: 536 AAAGLVERPNRR-----------KLLIVLTDGKPNDVDHYEGRFALEDSRRAVGEARRSG 584 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 V+ + V EA +L + + V N +L A I Sbjct: 585 ISVFGVTVDREA-KSYLPVIFGQNGYAVVSNIGRLPAALPAI 625 >gi|16263145|ref|NP_435938.1| NorD accessory protein for nitric oxide reductase [Sinorhizobium meliloti 1021] gi|14523809|gb|AAK65350.1| NorD accessory protein for nitric oxide reductase [Sinorhizobium meliloti 1021] Length = 631 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 75/222 (33%), Gaps = 32/222 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + L +T V +S +D D + VLDV + L + + Sbjct: 431 IHMMSRPQAHDLAVTILVDVSLSTDAWFDDLRVLDVE-------KQALQVLAHGLSACGD 483 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +I+ + VR V + + I+ +I L G T+ + + Sbjct: 484 AHEILTFTSRRRDWVRIETVKAFDEAMSAT--------IEARIAALKPGYYTRIGAAIRH 535 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A + + + K +I LTDG+ + ++S EA+R G Sbjct: 536 AAAGLVERPNRR-----------KLLIVLTDGKPNDVDHYEGRFALEDSRRAVGEARRSG 584 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 V+ + V EA +L + + V N +L A I Sbjct: 585 ISVFGVTVDREA-KSYLPVIFGQNGYAVVSNIGRLPAALPAI 625 >gi|326789665|ref|YP_004307486.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540429|gb|ADZ82288.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 564 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 57/172 (33%), Gaps = 29/172 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + V D+S SM+ L +S+ + I G Sbjct: 381 KEKKNGTKKISAVFVADISGSMDGE------PLNNLKKSLITGAEYIGKDNS------IG 428 Query: 220 LVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LVT+S + P+ + L T + G+ A + + K Sbjct: 429 LVTYSDDVNINLPVGKFDLNHRSLFTGAVMDLEASGGTATFDGMIVALKMLMEEKAA--- 485 Query: 276 IAKGHDDYKKYIIFLTDGENSSP-NIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + D K + L+DGE + ++++ E + + VY IG A Sbjct: 486 ----NPDAKLMLFVLSDGETNRGHSLNDIEGIL-----RTLKIPVYTIGYNA 528 >gi|291008392|ref|ZP_06566365.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 538 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 75/207 (36%), Gaps = 26/207 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA--------TRSIREMLDIIKSIPDVNN 214 +S ++ VLD S SM+ GP + L A ++ + + K ++ Sbjct: 343 PQSRDPSHVIFVLDFSASMS---GPRIAALRSAFAGFSGADPSAVGKFVRFYKG--EMVT 397 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++R G + A ++ +Q+ I F TT ++ + A K Sbjct: 398 IMRFGGHVLDERDFTITGQA-DLKAVQDYIAADRFDQTTGV-------WSALEAAYAKAA 449 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + H + I+ +TDGEN++ D + + A + Y + + Sbjct: 450 AATRDHPEQPVTIMLMTDGENNAGISLQDFLRNHQARDPAAKA-VHTYTVRFGEANPGEL 508 Query: 333 LKNC-ASPDRFYSVQNSRKLHDAFLRI 358 + A+ R N+ L +AF I Sbjct: 509 DQAARATGGRMVDA-NATSLSEAFKEI 534 >gi|164425557|ref|XP_960517.2| hypothetical protein NCU05555 [Neurospora crassa OR74A] gi|157070974|gb|EAA31281.2| predicted protein [Neurospora crassa OR74A] Length = 766 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 72/202 (35%), Gaps = 28/202 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATR-----SIREMLDIIK-----SIPDVNNVVRSG 219 D+++ +DVS SM+ T + +LD++K + +N+ R G Sbjct: 70 DIVLAIDVSGSMSADAPVPTTASADYTNEQPEHNGLSVLDLVKHAARTIVSTLNSSDRLG 129 Query: 220 LVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +VTFS++ PL A + + + + S T G+ K+FD + Sbjct: 130 IVTFSTEAKVLQPLMPMTALNKKKTERNLGGMQPFSATNLWGGIVEGL-KLFDGQSGRMP 188 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 ++ LTDG + + + + A ++ G LK+ Sbjct: 189 A----------LMVLTDGMPNHM-CPAQGYVAKLRAMETLPAAIHTFGFGYSLRSGLLKS 237 Query: 336 CA--SPDRFYSVQNSRKLHDAF 355 A + + ++ + F Sbjct: 238 VAEIGGGGYSFIPDAGMIGTVF 259 >gi|268561228|ref|XP_002646395.1| Hypothetical protein CBG15364 [Caenorhabditis briggsae] gi|187027191|emb|CAP33691.1| hypothetical protein CBG_15364 [Caenorhabditis briggsae AF16] Length = 315 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 52/176 (29%), Gaps = 42/176 (23%) Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 M D G+D + S+ I + P R GLVT++S+ Q L Sbjct: 1 MTDE---GLDSVAANLASVFSAGTRIGTSPSEPRTTRLGLVTYNSRAQQNADL------- 50 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 L+ Y +F ++K ++ +S Sbjct: 51 -------------NKFQSLDDLYEGVF------------GSHFQKVVVIY---ASSYRGT 82 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK---NCASPD-RFYSVQNSRKLH 352 ++ L N K G + + L+ ASP F S +N + Sbjct: 83 GEQDPLPVANRLKTSGVRIITVAYDQGGDGTLLRQLSQVASPGFNFSSAENEGNII 138 >gi|260827154|ref|XP_002608530.1| hypothetical protein BRAFLDRAFT_92388 [Branchiostoma floridae] gi|229293881|gb|EEN64540.1| hypothetical protein BRAFLDRAFT_92388 [Branchiostoma floridae] Length = 306 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 67/165 (40%), Gaps = 15/165 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++++LD+S S+ + E+L I ++ R +VTFSS +V Sbjct: 105 DLVLLLDMSGSIGS------TDFTSLKTYVAELLSYICPENEMGTFHRVAVVTFSSSVVL 158 Query: 230 TFPL--AWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F A + IQ I+ L G +T++ + + ++F + D Sbjct: 159 NFNFHEATSLGQIQASIHSLPYEGGSTRTADAINFVRTQVFQTG-----NYRDEPDVDLE 213 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 ++ +TDG + ++ + EA A ++A+G + + Sbjct: 214 VLLITDGHPNGAGNSPQD-VELAAEALGERANIFALGYGSAYSSA 257 >gi|260818868|ref|XP_002604604.1| hypothetical protein BRAFLDRAFT_126772 [Branchiostoma floridae] gi|229289932|gb|EEN60615.1| hypothetical protein BRAFLDRAFT_126772 [Branchiostoma floridae] Length = 2219 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 81/208 (38%), Gaps = 41/208 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +D++ VLD S S+ + G L + E S+ V+ R ++T+S+ Sbjct: 69 NGQVDLIFVLDRSASIGWYNGAWDSMLQFVENLLYEF-----SVNSVHT--RVSIITYST 121 Query: 226 KIVQTFP-LAWGVQH-----------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + L+ G I++++ T +T LE A + ++ Sbjct: 122 TVSVDVDYLSDGSASSRLTKCKLNDDIEKRLRNKALHGWTATTTALERAKQVLLTSRP-- 179 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN--------EAKRRGAIVYAIGVQ 325 KK I LTDG ++ + + N A+ R ++A+G++ Sbjct: 180 --------TAKKAIFLLTDGRSNIGQPPSIPAREISNLRWSGWDMAAQGRQVEIFALGIE 231 Query: 326 AEAADQFLKNCASP---DRFYSVQNSRK 350 +A + L++ A P D ++ + N + Sbjct: 232 -DAVEAELRSIAYPSQQDHYFLLDNFQD 258 >gi|225620067|ref|YP_002721324.1| hypothetical protein BHWA1_01140 [Brachyspira hyodysenteriae WA1] gi|225214886|gb|ACN83620.1| hypothetical protein BHWA1_01140 [Brachyspira hyodysenteriae WA1] Length = 557 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 78/218 (35%), Gaps = 27/218 (12%) Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL--ITSSVKISSKS 165 + + + + + +YNL A++ P ++ L S ++ Sbjct: 337 MIEKPEVEKNMAENNFNYNLEAMNNLNNASKSFNTPNNYVNNTDELSDKFRSIIERYKNE 396 Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD + SM+ + L R IR M + + D + R G + + Sbjct: 397 G-AIDLVIVLDTTESMHPY-------LKTIKRDIRGM---VTELFDNHKYSRVGFLLYRD 445 Query: 226 ----KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + + + I ++N + Y + +A E ++I + Sbjct: 446 VKDTYLTKKIDFSDNINFINREVNYFYAAGGGDKAEPM---YEALQEALETFDYINQ--- 499 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 K+ +I LTD + I + N AK + V Sbjct: 500 --KRLVIVLTDA--PAKVIGRADLDLNLNTAKMKNVTV 533 >gi|58176651|pdb|1RRK|A Chain A, Crystal Structure Analysis Of The Bb Segment Of Factor B gi|58176654|pdb|1RS0|A Chain A, Crystal Structure Analysis Of The Bb Segment Of Factor B Complexed With Di-Isopropyl-Phosphate (Dip) gi|58176655|pdb|1RTK|A Chain A, Crystal Structure Analysis Of The Bb Segment Of Factor B Complexed With 4-Guanidinobenzoic Acid Length = 497 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 80/222 (36%), Gaps = 34/222 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK- 226 +++ +VLD S S G A + + +++ + S R GLVT+++ Sbjct: 1 SMNIYLVLDGSDS------IGASNFTGAKKVLVNLIEKVASYGVKP---RYGLVTYATYP 51 Query: 227 ---IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + + + + +++N + S T + L+ Y+ + + + Sbjct: 52 KIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVP---PE 108 Query: 279 GHDDYKKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRG----AIVYAIGVQAEAA 329 G + + II +TDG ++ ID L Y + ++ VY GV Sbjct: 109 GWNRTRHVIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVN 168 Query: 330 DQFLKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + AS V++ L D F ++ E + Sbjct: 169 QVNINALASKKDNEQHVCKVKDMECLEDVFYQMIDESQSLSL 210 >gi|222101614|gb|ACM44012.1| thrombospondin-related anonymous protein [Babesia bovis] gi|222101618|gb|ACM44014.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 71/194 (36%), Gaps = 18/194 (9%) Query: 134 EMPFIFCTFPWCANSSH-APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P + F A I S K LD +V+D S S+++ G Sbjct: 10 SVPLLSLAFLATTGIHAFADKGIGSPKGKQCKKQ--LDFSIVVDESASISNDQWEGQ--- 64 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS 251 +R ++ + N +R L T+S+ Q F L + + +L + + Sbjct: 65 --MIPFLRNLIHTVDL---DNTDIRLSLTTYSTPTRQIFTFLDAAASSTRLALTKLDWMA 119 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TK+ G+ Y + ++ + G + K ++ +TDG +S + + Sbjct: 120 GTKARSGMTYTGRALNYVRK--AILPYGRKNVPKALLLITDGVSSDGSY----TAQVAAM 173 Query: 312 AKRRGAIVYAIGVQ 325 + G V IGV Sbjct: 174 LRDEGVNVMVIGVG 187 >gi|304393605|ref|ZP_07375533.1| conserved hypothetical protein [Ahrensia sp. R2A130] gi|303294612|gb|EFL88984.1| conserved hypothetical protein [Ahrensia sp. R2A130] Length = 170 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 16/71 (22%), Positives = 30/71 (42%), Gaps = 2/71 (2%) Query: 6 IRNFFYNCKGSISILTAIL-LPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 +R F N +G++ I+ A LP+ V G +E S +K++L D + L + Sbjct: 18 LRRFANNERGNVMIIFAAAALPMAIGVAG-ALEISQYSQLKSQLQEASDRAALSAMAALR 76 Query: 65 NQENGNNGKKQ 75 + + Sbjct: 77 EGPRAMRQQAR 87 >gi|119613594|gb|EAW93188.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_c [Homo sapiens] Length = 1350 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 70/226 (30%), Gaps = 39/226 (17%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSI--REMLDIIKSIPDVNNVVRSGLVTFSSKI 227 ++ V+DVS SM + V + + +II VN G + + + Sbjct: 284 VVFVIDVSSSMFGTKMEQTKTAMNVILSDLQANDYFNIISFSDTVNVWKAGGSIQATIQN 343 Query: 228 VQTFPL------AWGVQHIQEKINRLIFGST----------------------TKSTPGL 259 V + A G + ++ + L + T L Sbjct: 344 VHSAKDYLHCMEADGSRRLEGGVLALGACQSGSGMRHLPPRPCTTKETDKEQWTDVNSAL 403 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + + + ++ IIFLTDGE ++ L +A + Sbjct: 404 LAAASVLNHSNQEPGRGPSVGRIP--LIIFLTDGEPTAGVTTPSVILSNVRQALGHRVSL 461 Query: 320 YAIGVQAEAADQFLKNCASPDR------FYSVQNSRKLHDAFLRIG 359 +++ +A L+ + +R + + +L + I Sbjct: 462 FSLAFGDDADFTLLRRLSLENRGIARRIYEDTDAALQLKGLYEEIS 507 >gi|115374040|ref|ZP_01461329.1| thrombospondin type 3 repeat family [Stigmatella aurantiaca DW4/3-1] gi|310819324|ref|YP_003951682.1| MtsD protein involved in cell-cell cohesion [Stigmatella aurantiaca DW4/3-1] gi|115368930|gb|EAU67876.1| thrombospondin type 3 repeat family [Stigmatella aurantiaca DW4/3-1] gi|309392396|gb|ADO69855.1| MtsD protein involved in cell-cell cohesion [Stigmatella aurantiaca DW4/3-1] Length = 655 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 59/163 (36%), Gaps = 32/163 (19%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPG----------MDKL--------GVATRSIR 200 S+ S + +++V+D S SM PG +D + R+++ Sbjct: 48 TAPSNPSGFPVKVVLVIDQSGSMCVSDPPGSQGVDGFCEQVDDILLPPGVLEPARVRALK 107 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ-------HIQEKINRLI--FGS 251 +++ + P+ V+ +V F + + +P A + I L G Sbjct: 108 RLVNQFRQQPN----VQISIVPFETNVKNVWPPATTGNRFARPDASLDTYIRGLQNQLGK 163 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY-IIFLTDG 293 T + YAY+ I + +Y ++FLTDG Sbjct: 164 GTDYQGAVGYAYSLIASDINAVSASNPEVLPRTRYVVVFLTDG 206 >gi|297622708|ref|YP_003704142.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297163888|gb|ADI13599.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 329 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/269 (14%), Positives = 94/269 (34%), Gaps = 32/269 (11%) Query: 99 NGFAQDIN---NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT-FPWCANSSHAPLL 154 NG+ + + + + +L++ D + +S T P A + Sbjct: 37 NGYTVERDANGDPTGNVALNVSALDSAGEPIAGRLSNPRATVTSVTPLPGLAAAQRYTAT 96 Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD----IIKSIP 210 T +V I+ + ++ ++ +D S SM + + ++ A + + + P Sbjct: 97 ATITVDITVQE--VINAVLNMDRSGSMRLNDPERL-RVDAAKSFLERVTPEDRIAVMEFP 153 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 ++ R+ ++ ++R+ T ++ + D Sbjct: 154 GQSSGFRA--------STLLQGFTSDKALLEAALDRVGQRGNTPI-------WDSLLDTL 198 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + +G + ++ TDGE + E+L A V+ IG+ ++ Sbjct: 199 DLHA-ADEGGQGASRVVLLFTDGEREGGQVAFGEALA---AALESDVRVFTIGLGSDIDT 254 Query: 331 QFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 L+ A + F +V ++ +L + F R Sbjct: 255 AELQELAAETGGTFANVASAAELEELFQR 283 >gi|239624983|ref|ZP_04668014.1| predicted protein [Clostridiales bacterium 1_7_47_FAA] gi|239521369|gb|EEQ61235.1| predicted protein [Clostridiales bacterium 1_7_47FAA] Length = 878 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 77/249 (30%), Gaps = 49/249 (19%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F T ++ + +V+ + SD G D++ +D S SM + V Sbjct: 21 MMFGTLMAAWGEENSSVAAMDAVEAVNVSDTGYDVVFCIDNSRSMWKQQDIRDQAVRVLA 80 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-----GVQHIQEKINRLIFGS 251 + +R G V F+ + Q L + + +N Sbjct: 81 NLAVG------------SDIRIGGVYFADHVYQRCSLTSLTGEEDTKKVMSFLNFTDKDD 128 Query: 252 T---TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN-------SSPNID 301 T L A + K+ I+ +DG N S Sbjct: 129 GNRDTNIGSALSEALKLFENQ----------DISRKRIIVLFSDGINEDYEGTGSYTARA 178 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCAS---------PDRFYSV--QNSR 349 N + E +Y + ++ + AD+ +L+N + +RF+ V Sbjct: 179 NNMTSQAAGEINEAQIALYCVFLEKDRADEAYLRNLVNYFKEDGQYDQERFFPVAENEID 238 Query: 350 KLHDAFLRI 358 +L D F + Sbjct: 239 RLADKFSDV 247 >gi|158335198|ref|YP_001516370.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305439|gb|ABW27056.1| von Willebrand factor type A domain protein, putative [Acaryochloris marina MBIC11017] Length = 573 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 65/209 (31%), Gaps = 36/209 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + +++V+D S SM KL +++ L K + V L+ Sbjct: 385 QTAAKKPSQVVIVVDSSGSMTG------TKLAAVQSTLQTYL---KGLGPKEKVT---LI 432 Query: 222 TFSSKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 F S + + + G E + L TK + A + A Sbjct: 433 DFDSVVRKPVSVDGSPEGQSKGLEFVVALKADGNTKLYDSILAAQTWLTQNLRPNAINA- 491 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG------AIVYAIGVQAEA--AD 330 +I LTDGE+S + L K+ G ++ +G A Sbjct: 492 --------VIVLTDGEDSGSGQQLPQLLS---ALKKSGFEGEQRIAIFTVGYGNAGDFAP 540 Query: 331 QFLKNCA-SPDRFYSVQNSRKLHDAFLRI 358 LK A + +Y + + + Sbjct: 541 DVLKQIAEANGGYYRQGDPASIAQLMADL 569 >gi|330719551|gb|EGG98146.1| TPR domain protein in aerotolerance operon [gamma proteobacterium IMCC2047] Length = 351 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 49/164 (29%), Gaps = 28/164 (17%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLG 193 + C + P + D +++VLD SLSM P + Sbjct: 67 LALFIAALAVCLVALAGPAWKKIPQPVHQSEDA---LVIVLDQSLSMLATDLKP--SRQV 121 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IF 249 + ++L K + LV ++ PL + IQ + L + Sbjct: 122 TVKHKLIDLLKTRKEGQ-------TALVVYAGDAHIVSPLTDDTRTIQSMVPALSPLIMP 174 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 ++ G+ A + D I+ +TDG Sbjct: 175 ALGSRVEHGIALARQALKDGGINNGR-----------ILLVTDG 207 >gi|310824705|ref|YP_003957063.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309397777|gb|ADO75236.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 881 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 56/173 (32%), Gaps = 40/173 (23%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+D S SM + L + R + R ++ F + Sbjct: 287 VVFVVDTSGSMEGESLPQAQGALRLCLR-------------HLREGDRFNIIAFDTSFQS 333 Query: 230 --TFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P + + +++ + L T+ + A + Sbjct: 334 FAPQPAVFTQKTLEQADRWVAALRANGGTELLQPMLAAVQA----------------APE 377 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 ++ LTDG+ + E L A++ A +Y+ G+ +D LK+ A Sbjct: 378 GVVVLLTDGQ----VGNEAEILQAVLRARKT-ARIYSFGIGTNVSDALLKDMA 425 >gi|195115455|ref|XP_002002272.1| GI17296 [Drosophila mojavensis] gi|193912847|gb|EDW11714.1| GI17296 [Drosophila mojavensis] Length = 1099 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 77/195 (39%), Gaps = 26/195 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM++ +AT + +LD + VN VV++ Sbjct: 149 AASSPKDIMILLDASSSMSEK------SFDLATATAFNILDTLGEDDYVNLITFSEVVKT 202 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 203 PVPCFKDRMVRATP--DNIQEIKSAVKAVKLQDTANFTAGLEYAFSLLHKYNQSGA---- 256 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LK--N 335 + I+ +T+ + S K+ + ++ + +++ + L Sbjct: 257 -GSQCNQAIMLITESTSESHKEIIKQYNWP-----HMPVRIFTYLIGSDSGSRSNLHEMA 310 Query: 336 CASPDRFYSVQNSRK 350 C++ F + + + Sbjct: 311 CSNKGFFVQINDYEE 325 >gi|115372024|ref|ZP_01459336.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Stigmatella aurantiaca DW4/3-1] gi|115370989|gb|EAU69912.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Stigmatella aurantiaca DW4/3-1] Length = 843 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 56/173 (32%), Gaps = 40/173 (23%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+D S SM + L + R + R ++ F + Sbjct: 249 VVFVVDTSGSMEGESLPQAQGALRLCLR-------------HLREGDRFNIIAFDTSFQS 295 Query: 230 --TFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P + + +++ + L T+ + A + Sbjct: 296 FAPQPAVFTQKTLEQADRWVAALRANGGTELLQPMLAAVQA----------------APE 339 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 ++ LTDG+ + E L A++ A +Y+ G+ +D LK+ A Sbjct: 340 GVVVLLTDGQ----VGNEAEILQAVLRARKT-ARIYSFGIGTNVSDALLKDMA 387 >gi|268573930|ref|XP_002641942.1| Hypothetical protein CBG16647 [Caenorhabditis briggsae] gi|229553908|sp|A8XP79|CL160_CAEBR RecName: Full=C-type lectin protein 160; Flags: Precursor gi|187026423|emb|CAP34559.1| CBR-CLEC-160 protein [Caenorhabditis briggsae AF16] Length = 635 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 63/201 (31%), Gaps = 31/201 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ V+DVS M G+ L + I ++ + P++ V+ GL+ +S++ Sbjct: 286 IDIVFVVDVSEGM------GLGGLMMVKAEINTLVGQMSLDPEIQKHVQVGLIKYSNESE 339 Query: 229 QTFPLA--WGVQHIQEKI-------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F + E + + L+ A I ++ Sbjct: 340 IVFKPSEYTNEDEFTEDLWTDPRLEDVDEKTDEVNLHLALQQAAKMIGSMRKG------- 392 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--- 336 KK ++ N + D ++ + G + + + + Sbjct: 393 ---VKKVVVIYAASYNDEGDDDARQ---IAANIRESGYEIITVAFVEPESSSLVMKIGEL 446 Query: 337 ASPDRFYSVQNSRKLHDAFLR 357 ASP ++ L D Sbjct: 447 ASPRMNFTSFRDELLVDELED 467 >gi|260828859|ref|XP_002609380.1| hypothetical protein BRAFLDRAFT_124610 [Branchiostoma floridae] gi|229294736|gb|EEN65390.1| hypothetical protein BRAFLDRAFT_124610 [Branchiostoma floridae] Length = 321 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 58/152 (38%), Gaps = 22/152 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--- 225 +DM+ V D S S+ + A +I+++++ P + R V+FS Sbjct: 169 IDMLFVFDKSGSVREV------NFNDAKENIKDLIENF-PAPVGPSDTRVAAVSFSDVDK 221 Query: 226 -KIVQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 ++ F + ++ + + + T + L A + +F + +A + Sbjct: 222 TRVEFDFNASGDRDSVKTSLGNIAYEGGWTATATALSLARDDVFQS------VAGSRPNS 275 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 K + +TDG+++ + ++ K Sbjct: 276 AKILFLITDGKSNRGGA----PIPVADQLKGN 303 >gi|46359648|dbj|BAD15328.1| CbbOm [Hydrogenovibrio marinus] Length = 754 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 73/206 (35%), Gaps = 25/206 (12%) Query: 169 LDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT---FS 224 + +M+++D S S+N+ + G L ++ ++ + + +G + Sbjct: 564 IAVMLLVDTSQSLNERNQETGQTLLELSEEAL--AITSWTVEQLGDKFAIAGFCSDTRHE 621 Query: 225 SKIVQTFPLAWG-VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + G ++ +I + +T+ + +A H Sbjct: 622 VRYQHIKGYSEGYTSDVKARIAAMEASYSTRMGAAMRHA-----------AHYLSAQQAE 670 Query: 284 KKYIIFLTDGENSSPNIDNKESL-----FYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 KK ++ LTDGE + + + ++L E K G +Y+ + + Sbjct: 671 KKLMLILTDGEPADIDSKDPQTLIHDTHKAVEELKSEG--IYSYCITLDPNADEYVETIF 728 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVK 364 +++ + KL + ++ ++ Sbjct: 729 GNQYTVIDQVEKLPEQLPQVFMKLTH 754 >gi|58616385|ref|YP_195515.1| tellurium resistance protein [Azoarcus sp. EbN1] gi|56315847|emb|CAI10491.1| tellurium resistance protein [Aromatoleum aromaticum EbN1] Length = 349 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/130 (20%), Positives = 50/130 (38%), Gaps = 11/130 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++DVS SM D L + ++ +++ P V ++ F+ K Sbjct: 3 RLPIFFLVDVSESMAG------DNLRQLQEGLERLVRSLRADPYALETVFISVIAFAGKP 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL V+ Q RL GS T + + +++ ++ KG + Sbjct: 57 KTLTPL---VELYQFYAPRLPLGSGTSLGSAMAHLMDEMERTVQRSTPEKKGDWRP--VV 111 Query: 288 IFLTDGENSS 297 LTDG+ + Sbjct: 112 YLLTDGKPTD 121 >gi|325686522|gb|EGD28550.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK72] Length = 458 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 52/335 (15%), Positives = 114/335 (34%), Gaps = 51/335 (15%) Query: 20 LTAILLPVIFIVMGLVIET-----SHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 + A+++ VIF M E S + ++ + Y +D ++ + + ++ G K Sbjct: 23 MIALIIGVIFNTMFSSRELIEREASIQAEMRTSMQY-VDRTVGKATSIFILDDSKFKGSK 81 Query: 75 QK----------NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI---IIDDQ 121 Q + +++ +W ++ + + N++ D++ Sbjct: 82 QGLTREWSYIGLSADGKKVMNYVWNKQKQDWDVSGLGTKSLYNMKLDLEFKTEGAYQDNR 141 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI-----------SSKSDIGLD 170 YNL+ +Y ++ + + + K + + + Sbjct: 142 LISYNLT--GKYPDTNNKLGIDTAISALNTKQVFSKVAKGKKGIAIAYRTDPIQGQMNIA 199 Query: 171 MMMVLDVSLSMNDHFGPGM------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + V D+S SM ++ + M++ ++S+ +V+ + + T S Sbjct: 200 VSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINELQSVGNVSVNLTTFSTTGS 259 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 K L I+E I L T GL Y + +L Sbjct: 260 YKQAAFSQLDREAGTIKESIKNLKSDGGVTNPGDGLRYGMVSLQKQHAQL---------- 309 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 KY++ LTDG ++ + N++ E KR G Sbjct: 310 -KYVVLLTDGVPNAY-LVNQQGQAGGLEMKREGIQ 342 >gi|198426626|ref|XP_002122822.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1823 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 58/172 (33%), Gaps = 27/172 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D +++LD S S+ + + G +I+++ R + ++ + Sbjct: 650 AAMDFVLILDSSSSVKRRNWNIIKRFGK---------NILQAFDLGEETGRMAVFRYNKR 700 Query: 227 IVQTFPLAW-----GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + + + +R+ + G T + L +A N I L Sbjct: 701 VDTNTQILFKDHLKNRTSFFKDYDRIPYNGGGTLTGQALRHAKNVI------LAEENGNR 754 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ +TDG + E + GA+ Y IGVQ + Sbjct: 755 PSVVDVVLTITDGRAQDDVGNISR------ELRANGALTYVIGVQPGNKKKL 800 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 77/213 (36%), Gaps = 34/213 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D +MV+D S S+ + R +L+ D R + ++ K Sbjct: 1626 ASMDFVMVVDSSSSIGRKNW------RIMKEFGRNVLNTFVLAEDAA---RMAVFRYNRK 1676 Query: 227 IVQTFPLAWG-----VQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + +R+ + T + L++A + + A+ Sbjct: 1677 VDNVTQVLLNDHIGDKDAFLAAYDRINYNGRGTWTGHALQHAKDVMLAAE------NGNR 1730 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-----QAEAADQFLKN 335 D K ++ +TDG +D + L + G + YA+G+ +A +Q + Sbjct: 1731 ADIKDVVLTITDGRAQDNVVDISKQL------RDMGVLTYAVGIVPGNKKAIDRNQLMAI 1784 Query: 336 CASPDRFYSVQNSRK-LHDAF-LRIGKEMVKQR 366 S + + V++ + L + F ++ + + R Sbjct: 1785 AGSEENLFMVESGFQGLDEQFASKLARSICTNR 1817 >gi|149546336|ref|XP_001514218.1| PREDICTED: similar to Procollagen, type VI, alpha 2 [Ornithorhynchus anatinus] Length = 1023 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 57/167 (34%), Gaps = 15/167 (8%) Query: 162 SSKSDIGLDMMMVLDVSLS--MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K+D +++ V+D S S M +D + L+ V G Sbjct: 43 PEKTDCPINVYFVIDTSESVAMQPPIESLVDHIKDFVIQFTSQLENEFYQNQVAISWHYG 102 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + FS + PL + + K++ + FG T + + ++ ++ + Sbjct: 103 GLHFSDLVEIFSPLPSNKEAFRNKLSGVKYFGRGTFTDCAIANMTEQVTQSQVAGVN--- 159 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TDG + + A+ G ++++ V Sbjct: 160 -------FAVVITDGHVTGSPCGGMK--LQAERARDAGIKLFSVAVN 197 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 70/209 (33%), Gaps = 33/209 (15%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D++ V+D S S+ + S L I P R G+V +S + Sbjct: 613 GAVDIVFVIDSSESIGFTNFSLEKNFVINVVS---RLGAIAKDPKSETGARVGVVQYSHE 669 Query: 227 -IVQTFPLAWGV----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L +E + L T + L++AYNK+ + + Sbjct: 670 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALQFAYNKLIKESRRKKTKV--- 726 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ----AEAADQFLKNC 336 + + +TDG + P ++ C+ V AIG+ D+ LK+ Sbjct: 727 -----FAVVITDGRH-DPRDNDANLRALCHV----DVTVNAIGIGDMFHKRQEDETLKSI 776 Query: 337 ASPD-------RFYSVQNSRKLHDAFLRI 358 A + +S + + D + Sbjct: 777 ACGNVSQVHGMNLFSELVAEEFIDKMEDV 805 >gi|159897014|ref|YP_001543261.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159890053|gb|ABX03133.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 222 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 70/172 (40%), Gaps = 14/172 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++V+D S SM G +D+L + ++ I + + LV F+S+ Sbjct: 17 CILVVDTSGSMQ---GRPIDELN---QGLQVFHQDISNSFSTAQRLEICLVEFNSQADCI 70 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK-YIIF 289 + Q + L TTK G+ A +K+ +E+ Y + +II Sbjct: 71 VEPSLVDQF---HMPILAVAGTTKLVDGVRLAIHKV---QERKSWYRSTGQPYYRPWIIL 124 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +TDGE S + + + + + IGVQ A + L+ ++PDR Sbjct: 125 MTDGEPDSDQDVAGLAREIQHGVNNKQFVFFPIGVQG-ADMRMLQQISTPDR 175 >gi|75907575|ref|YP_321871.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701300|gb|ABA20976.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 819 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 57/177 (32%), Gaps = 32/177 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KI 227 D++ ++D S S G + + R L+ + ++ FS + Sbjct: 300 DVVFLIDTSGS---QMGAPLMQCQELMRRFINGLNPDDTFS---------IIDFSDTTRQ 347 Query: 228 VQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + PLA INRL T+ G+ N +L Sbjct: 348 LSPVPLANNSQNRTRAINYINRLTANGGTEMLRGIRAVLNFPVTDSGRL----------- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + I+ LTDG + N + L + + G +Y+ G + L A R Sbjct: 397 RSIVLLTDGYIGNEN----QILAEVQQHLQAGNRLYSFGAGSSVNRFLLNRIAELGR 449 >gi|320103720|ref|YP_004179311.1| hypothetical protein Isop_2184 [Isosphaera pallida ATCC 43644] gi|319751002|gb|ADV62762.1| hypothetical protein Isop_2184 [Isosphaera pallida ATCC 43644] Length = 738 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 56/160 (35%), Gaps = 17/160 (10%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 L ++ + D S SM D D+ + E++D S + + +V F Sbjct: 289 RTKLTVVWMFDESNSMKDDQQAIRDRFERISS---ELIDNTSSEQRRSAALLHAVVGFGQ 345 Query: 226 KIVQTFPLAWGVQHIQEKIN--RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +I V I+E I R+ T + ++ ++ E Sbjct: 346 QIHFDLEPTPDVTRIKEAITNLRIDMTGTENTMQSVQAVIARLGSKYITKERRM------ 399 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 I+ +TD E+ +E++ C AK VY IG Sbjct: 400 --LIVLVTD-ESGDDGARVEETIQACLAAK---VSVYVIG 433 >gi|158316708|ref|YP_001509216.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158112113|gb|ABW14310.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 431 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 64/210 (30%), Gaps = 28/210 (13%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +++LD S SM+ ++ A +I + D + + + ++ Sbjct: 54 VILLDCSGSMDYPHSKIIEARRAAQAAIDTLHDGVAFAVVAGTG--QAEMVYPTRQELVE 111 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + RL T L A + + + + H I LT Sbjct: 112 ASPRTREAAKAAVKRLQPHGGTAMGRWLLLARDLMATRPDAIHHA-----------ILLT 160 Query: 292 DGENSSPNIDNKESLFYCN-----EAKRRG-----AIVYAIGVQAEAADQFLKNCASPDR 341 DG+N +L C + + G A + + L+ A Sbjct: 161 DGQNGESEAVFAAALAACEGRFQCDCRGVGADWKVAELRRVASTLLGGVALLREPAE--- 217 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 ++ R L + G + V R+ K Sbjct: 218 --MAEDFRSLIERAQARGIDRVGLRVWTPK 245 >gi|256420216|ref|YP_003120869.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035124|gb|ACU58668.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 211 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 67/198 (33%), Gaps = 14/198 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM ++ ++ ++ ++ P V ++TF ++ Sbjct: 3 RLPVYLLLDTSGSMRGE------RIEAVKNGLQVLVSKLRQDPFALESVWISIITFDREV 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL ++ +Q T LE +K+ K KG + Sbjct: 57 KQLLPLT-ALESLQLPEITTPESGPTNMGAALEMLCSKLDAEVAKGSDTQKGDWRP--LL 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGA-IVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +TDG+ S D + K + + A AEA D FLK Sbjct: 114 FLMTDGKPS----DLAAFREVVPKVKSKNLAALVACAAGAEAQDSFLKELTDNVVHLDTA 169 Query: 347 NSRKLHDAFLRIGKEMVK 364 +S L F + + Sbjct: 170 DSSTLMSFFKWVSASIGT 187 >gi|227819319|ref|YP_002823290.1| hypothetical protein NGR_b10840 [Sinorhizobium fredii NGR234] gi|227338318|gb|ACP22537.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 533 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 26/58 (44%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 I F+ + +G + LT I +P++ LVI+ + L +D L A ++ Sbjct: 7 INRFWDDNRGYVIALTLISMPLLLGFSLLVIDVGRTGNLHTDLQNAVDAMALAGAREL 64 >gi|13537509|dbj|BAB40687.1| ORF3 [Sulfolobus tokodaii] Length = 224 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 62/194 (31%), Gaps = 44/194 (22%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++LD S SM+ + +A S + + +R F I Sbjct: 62 IYLLLDKSGSMDGEKIIWAKAVALALYSRARR-------ENRDFYIR-----FFDNI--P 107 Query: 231 FPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +PL + V + E I ++ G T + + A I + KG + Sbjct: 108 YPLIKVPKNAKSKDVVKMIEYIGKIRGGGGTDISRSVISACEDIKEGH------VKGVSE 161 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDGE+ + SL N I V + L+ D++ Sbjct: 162 ----IILLTDGEDKIAETTVRRSLKEANST--------LISVMIRGDNADLRRV--SDQY 207 Query: 343 YSVQ--NSRKLHDA 354 V + L Sbjct: 208 LVVYKLDHEDLLKV 221 >gi|332828717|gb|EGK01409.1| hypothetical protein HMPREF9455_02242 [Dysgonomonas gadei ATCC BAA-286] Length = 340 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 50/158 (31%), Gaps = 18/158 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 I + P T K+ K G+++++ +DVS SM +L A Sbjct: 61 LILVAIGFGIVVLARPQFGTKVEKVDKK---GIELVIAIDVSNSMMAE-DIKPSRLAKAK 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 ++ I D + +V F+ + PL Q + + + Sbjct: 117 -------QMLTRIIDERRDDKVAIVVFAGEAFIQLPLTPDNQSAKLFLETIDPSLVPVQG 169 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + A + D K I+ +TDGE Sbjct: 170 TAIGSAIDI-------SMSCFSNDTDIDKAIVLITDGE 200 >gi|331006793|ref|ZP_08330059.1| TPR domain protein in aerotolerance operon [gamma proteobacterium IMCC1989] gi|330419411|gb|EGG93811.1| TPR domain protein in aerotolerance operon [gamma proteobacterium IMCC1989] Length = 668 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/253 (13%), Positives = 75/253 (29%), Gaps = 31/253 (12%) Query: 84 IKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 + D L + A + ++ S + K ++R + + Sbjct: 38 FEQWIDADLLRHLVTDDNADNRSDKNSSHKNDEDGTHKAKKAISHLLARPIYTLLAGAWL 97 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + P + M+++LD+S SM +R +R L Sbjct: 98 IACIALAGPTWEKLPQPLQQSEQA---MVIILDLSPSMRATDNKP-------SRIVRARL 147 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGL 259 I + + + + LV ++ + PL + I ++ L G + + Sbjct: 148 KIKDLLSQRKDGL-TALVVYAGEAHTVTPLTDDTKTIANLLSTLTPGLLPIPGSNIEMAM 206 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 A + D+ + + LTDG + + +L K+ + Sbjct: 207 TMATTLVADSGIQHASY-----------VVLTDGIDPKAINSIERTLE-----KQAHQSL 250 Query: 320 YAIGVQAEAADQF 332 + +G+ Sbjct: 251 FIMGIGTPEGAPI 263 >gi|291403690|ref|XP_002718170.1| PREDICTED: cochlin [Oryctolagus cuniculus] Length = 551 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 66/213 (30%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ + M + Sbjct: 331 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGESNFRLMLE 389 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 390 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 438 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 439 RYMSGGTATGDAISFTVRNVFGPVRD--------SPNKNFLVIITDGQ----SYDDVRG- 485 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 486 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 517 >gi|220922748|ref|YP_002498050.1| hypothetical protein Mnod_2796 [Methylobacterium nodulans ORS 2060] gi|219947355|gb|ACL57747.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 135 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/103 (23%), Positives = 32/103 (31%), Gaps = 23/103 (22%) Query: 288 IFLTDGENSSPNIDNKE----------------SLFYCNEAKRRGAIVYAIGVQAE---- 327 I TDG N S + AK G VY IG Sbjct: 31 IVNTDGSNPSSRFPPTNQDLTTPINIRNALDALTTQAYTNAKAAGISVYTIGFSTPSDSI 90 Query: 328 --AADQFLKNCA-SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L NCA S + + ++ L AF +I K + R+ Sbjct: 91 DDKGLSLLSNCASSSSQAFVANDANTLISAFNQIAKSVGSLRL 133 >gi|149566038|ref|XP_001520798.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 4 [Ornithorhynchus anatinus] Length = 808 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 59/161 (36%), Gaps = 31/161 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+DVS SM ++ +A ++ +LD + VN ++ +S + Sbjct: 246 DIVIVVDVSGSMKGL------QMTIAKHTVATILDTLGENDFVN------IIAYSDYVHY 293 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L + L A+ + + A Sbjct: 294 LEPCFQGILVQADRDNREHFKQLLDELQAKGVGSVSKALREAFTVL-----QQVRDAGQG 348 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + I+ +TDG +D ES+F R V+ Sbjct: 349 ALCNQAIMLITDG-----AVDGYESIFEKYNWPGRKVRVFT 384 >gi|50355937|ref|NP_571316.1| complement component bfb [Danio rerio] gi|49904460|gb|AAH76051.1| Complement component bfb [Danio rerio] Length = 456 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 77/222 (34%), Gaps = 31/222 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI LD+ + +DVS S ++ L A + I+ +L+ I + Sbjct: 245 KIYLNKGGKLDIYIAVDVSDS--------INDLKKAKQIIKTLLEKISYYEVSPSYE--- 293 Query: 220 LVTFSSKIVQTFPLAW--------GVQHIQEKINRLIFGSTTKS-TPGLEYAYNKIFDAK 270 ++ F++ + Q + + + E +++ F + + Y I D+ Sbjct: 294 ILMFATDVYQIVKMRDFKTEEVAGSLSKVFEDLDKFDFDKKLDQKSSNIAKLYQTILDSM 353 Query: 271 EKLEHIAKGHDDYKK-YIIFLTDGE-NSSPNIDN-----KESLFYCNEAKRRGAIVYAIG 323 + K K +I TDG+ N N + + + + +Y G Sbjct: 354 SNEQIRNKEDFLQTKHVVIVFTDGQANMGGNPKPKVHLIRNLVLKNDANRENKLDLYVFG 413 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKE 361 V + + L S F+ +Q+ ++ F + + Sbjct: 414 VGKDVRTEDLNGLVSEKENERHFFKLQDLDEVQKTFDSLMND 455 >gi|297475370|ref|XP_002707867.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 4-like [Bos taurus] gi|296487070|gb|DAA29183.1| calcium channel, voltage-dependent, alpha 2/delta subunit 4-like [Bos taurus] Length = 1111 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 68/193 (35%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SM ++ +A ++ +LD + VN ++ ++ I Sbjct: 269 DIVIVVDTSGSMKGL------RMTIAKHTVSTILDTLGENDFVN------IIAYNDYIHY 316 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ L A+ + +E + Sbjct: 317 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVDRALREAFQILQQFQEAGQ-----G 371 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E + R V+ IG + AD+ C Sbjct: 372 SLCNQAIMLITDG-----AVEDYEPVLEKYNWPDRKVRVFTYLIGREVSFADRLKWIACN 426 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 427 NKGYYTQISTLAD 439 >gi|194667489|ref|XP_001787557.1| PREDICTED: voltage-gated calcium channel alpha(2)delta-4 subunit [Bos taurus] Length = 1111 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 68/193 (35%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SM ++ +A ++ +LD + VN ++ ++ I Sbjct: 269 DIVIVVDTSGSMKGL------RMTIAKHTVSTILDTLGENDFVN------IIAYNDYIHY 316 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ L A+ + +E + Sbjct: 317 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVDRALREAFQILQQFQEAGQ-----G 371 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E + R V+ IG + AD+ C Sbjct: 372 SLCNQAIMLITDG-----AVEDYEPVLEKYNWPDRKVRVFTYLIGREVSFADRLKWIACN 426 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 427 NKGYYTQISTLAD 439 >gi|83312058|ref|YP_422322.1| hypothetical protein amb2959 [Magnetospirillum magneticum AMB-1] gi|82946899|dbj|BAE51763.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 1196 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 63/173 (36%), Gaps = 26/173 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++++DVS SM + A S+ + L + N +R LV F V Sbjct: 85 LDIVVLVDVSGSMKTSLPH----IAAAVNSLADQL-----LESGNGNIRLSLVRFDVSAV 135 Query: 229 QTFPLAW--GVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 W + + + + +L + + + +++ +K Sbjct: 136 IDTTPHWTADREVVAQGLRKLAMIDGANDTREAFRRLRELLDESRPGA----------RK 185 Query: 286 YIIFLTDGE-NSSPNIDNKESLFYCNEAKRRG---AIVYAIGVQAEAADQFLK 334 +F TDGE PN+ + +A R G ++AIG+ ++ Sbjct: 186 MAVFFTDGELVVEPNVMPWPEMESEAKALRTGPHALRLFAIGLPGSLNSNMMR 238 >gi|332288899|ref|YP_004419751.1| hypothetical protein UMN179_00824 [Gallibacterium anatis UMN179] gi|330431795|gb|AEC16854.1| conserved hypothetical protein [Gallibacterium anatis UMN179] Length = 212 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 66/206 (32%), Gaps = 18/206 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + ++ ++ ++ P ++TF + Sbjct: 3 RLPVYLLVDTSGSMMGE------AIEAVRNGLQMLVSALRQDPYALETAYLSVITFDTDA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + + ++ + T L + I +K KG + Sbjct: 57 KQVTPLT---ELMSFQMPDIQASGVTAMGEALSLLVDCINREVQKGSAEVKGDWKP--VV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 L+DG P D ++ + + K A A A LK + Sbjct: 112 FLLSDGL---PTDDLQKGINAIRQVKTG--TFVACAAGAGADTNVLKQITETVVSLDTAD 166 Query: 348 SRKLHDAFLRIGKEM--VKQRILYNK 371 + + F + + Q++ NK Sbjct: 167 ANSIKAFFKWVSASISVSSQKVDLNK 192 >gi|118357010|ref|XP_001011757.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89293524|gb|EAR91512.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 842 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 47/118 (39%), Gaps = 17/118 (14%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K + ++ D+ VLD + SM + I +L+ +K VN+ R+ Sbjct: 60 TKQETNTEDSCDLAFVLDCTGSMG-------SWIQRCKTQIINILNTLKQ-QFVNSTFRA 111 Query: 219 GLVTF-----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS----TPGLEYAYNKIF 267 G V + +++I + ++ + ++ I +L+ GL A +++ Sbjct: 112 GFVAYRDHCDTNRIEKFSFISSQYEALENFIGKLVATGGGDGPEDLAGGLYTAIHELS 169 >gi|146296332|ref|YP_001180103.1| Ig domain-containing protein [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409908|gb|ABP66912.1| Ig domain protein, group 2 domain protein [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 1831 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 66/182 (36%), Gaps = 21/182 (11%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + S +S D++ V+D + SM+ D++ ++I +D +K Sbjct: 824 KKYLDITGLKSGTVSPSGQA--DIVFVIDTTGSMS-------DEIDAVKQNINNFVDKLK 874 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + + V GLVT+ G + + K G + + Sbjct: 875 T---KDISVNLGLVTYKDITCDGPNSTVGHGFFSSADDFKNALGSIKVDGGGDTP-ETLI 930 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----AKRRGAIVYAIG 323 DA E + ++ K+I+ LTD ++ ++N+ + +E K IV + Sbjct: 931 DALE-TARLLGFRENSTKFIVVLTD---ANYKLENRFGIKSADEIIERLKSDNIIVSVVS 986 Query: 324 VQ 325 Sbjct: 987 TM 988 >gi|330447846|ref|ZP_08311494.1| tetratricopeptide repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492037|dbj|GAA05991.1| tetratricopeptide repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 699 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 51/162 (31%), Gaps = 24/162 (14%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P W +K+ + ++ ++V+D+S SM ++L A Sbjct: 54 PIAILAIGWLIAVVAMAGPSWQKMKLPA-YNLSGARVLVMDMSRSM-YATDIKPNRLTQA 111 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----S 251 +ML K +GLVT++ + PL + I L Sbjct: 112 RFKALDMLPGWKEGT-------TGLVTYAGDGYEVSPLTEDSHTLANLIPSLSPKIMPIP 164 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + G+ A + A II +TDG Sbjct: 165 GSNAAAGIAEAIKLLKQAGNSTGD-----------IILITDG 195 >gi|332266371|ref|XP_003282183.1| PREDICTED: collagen alpha-4(VI) chain-like, partial [Nomascus leucogenys] Length = 1020 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 51/145 (35%), Gaps = 22/145 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + L V + ++ + D VR GL ++ I Sbjct: 235 DIVFLVDSSTSIG-----PQNFLKV-KNFLYSVVLGLDISSDR---VRVGLAQYNDNIYP 285 Query: 230 TFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYA-YNKIFDAKEKLEHIAKGHDDY 283 F L ++ + I T + LE+ N + + Sbjct: 286 AFQL--NQHPLKSMVLEQIQNLPYRPGGTNTGSALEFIRTNYLTEESGSRAKDRVP---- 339 Query: 284 KKYIIFLTDGENSSPNIDNKESLFY 308 + +I +TDGE++ + + L Sbjct: 340 -QIVILVTDGESNDEVQEVADRLKE 363 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 62/174 (35%), Gaps = 17/174 (9%) Query: 196 TRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGST 252 RS+R L I + S + +R GL +S F L+ + + I + F Sbjct: 48 ARSVRNFLHILVNSFNVSSETIRVGLAQYSDVPHSEFLLSTYHRKGDVLRHIRQFQFKPG 107 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 K+ + A I D + ++ + + ++ G E+L Sbjct: 108 GKN---MGLALKFILDHHFQEASGSRASQGVPQIAVVISSGPAEDHVHGPAEAL------ 158 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 +R G ++Y +GV+ + ++ +SP +N F + K R Sbjct: 159 RRAGILLYVVGVRDAVWAELMEIASSPQ-----ENFTSFLPNFSGLSNLAQKLR 207 >gi|302343943|ref|YP_003808472.1| Magnesium chelatase [Desulfarculus baarsii DSM 2075] gi|301640556|gb|ADK85878.1| Magnesium chelatase [Desulfarculus baarsii DSM 2075] Length = 697 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 72/207 (34%), Gaps = 29/207 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + +G ++ ++D S SM ++ ++ +L + R L+T Sbjct: 507 REKRVGNFLLFLVDASGSMG-----AQARMSATKGAVLSLL-----LDAYQKRDRVALIT 556 Query: 223 FSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P + + + L G T GL A+ ++ H+ K D Sbjct: 557 FRGREAQLALPPTSSIDLAAKLLAELPVGGRTPLAAGLLRAHEQLT------RHLRKDPD 610 Query: 282 DYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLKNCAS 338 ++ LTDG N+ D A+R G V+ + V EA N A+ Sbjct: 611 GRPIVLV-LTDGRANAGLGSDAPPHEEAITMAERMGQDERVHYVVVDTEAPGIVRLNIAA 669 Query: 339 ------PDRFYSVQN--SRKLHDAFLR 357 ++ + + ++ L D + Sbjct: 670 RLATALGGDYFKIDDLKAQDLVDIIKK 696 >gi|227552253|ref|ZP_03982302.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895164|ref|ZP_05674817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293378024|ref|ZP_06624201.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227178583|gb|EEI59555.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257831729|gb|EEV58150.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292643342|gb|EFF61475.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 498 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 51/136 (37%), Gaps = 23/136 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +S + Sbjct: 310 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSIE 363 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A N + Sbjct: 364 GYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNGH-------- 415 Query: 281 DDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 416 ---KKVIVLLTDGVPT 428 >gi|120437732|ref|YP_863418.1| hypothetical protein GFO_3411 [Gramella forsetii KT0803] gi|117579882|emb|CAL68351.1| conserved hypothetical protein [Gramella forsetii KT0803] Length = 288 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 10/125 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L MM+++DVS S + FG + + + + Sbjct: 58 NVTARYNEPFIKVFEEERELTMMLMVDVSGS--EMFGTQ----QQFKKDVITEIAATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNK 265 N + GL+ F+ +I P G H+ I L G +T L+Y N Sbjct: 112 SATKNNDKIGLMLFTDQIELYIPPKKGRSHVLRIIRELLEFKPVGKSTDIKGALKYLTNV 171 Query: 266 IFDAK 270 + Sbjct: 172 MKKKA 176 >gi|3182939|sp|Q91145|COCA1_NOTVI RecName: Full=Collagen alpha-1(XII) chain gi|632648|gb|AAA80217.1| type XII collagen alpha-1 chain [Notophthalmus viridescens] Length = 929 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 68/190 (35%), Gaps = 28/190 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ + I ++++ D V+ + +S Sbjct: 628 TKAQADIVLLVDGSWSIGRP------NFKIVRNFISRVVEVFDIGSDR---VQIAVSQYS 678 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIF-----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + L ++ + + G T + L++ F + A+ Sbjct: 679 GDPRTEWQL--NTHKTKKSLMDAVANLPYKGGNTNTGSALKFILENNFRPGVGMREKAR- 735 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K I LTDG++ + S Y +E G +YA+G++ ++ + + P Sbjct: 736 -----KIAILLTDGKSQDDIVAP--SKRYADE----GIELYAVGIKNADENELKEIASDP 784 Query: 340 DRFYSVQNSR 349 D Y + Sbjct: 785 DELYMYNVAD 794 >gi|115361035|ref|YP_778172.1| hypothetical protein Bamb_6294 [Burkholderia ambifaria AMMD] gi|115286363|gb|ABI91838.1| conserved hypothetical protein [Burkholderia ambifaria AMMD] Length = 423 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/163 (9%), Positives = 60/163 (36%), Gaps = 9/163 (5%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R+ + +G+++I+ + L ++ +GL ++ + +++L D L A + + Sbjct: 12 RHSLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSCALSAARDLTS- 70 Query: 67 ENGNNGKKQKNDFSYRIIK------NIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 + + + + N Q + + + + + + + + Sbjct: 71 -AISLQVAEADGIAAGHANYAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVATPANVKY 129 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 LS ++ + + + T P ++ + + ++ + + Sbjct: 130 VKCTAQLSNIAHWFIEVL-NTIPGVQVANASQVAASAIATVGA 171 >gi|156402481|ref|XP_001639619.1| predicted protein [Nematostella vectensis] gi|156226748|gb|EDO47556.1| predicted protein [Nematostella vectensis] Length = 154 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 60/172 (34%), Gaps = 23/172 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ ++D S S+ + G + I+K G++ + K Sbjct: 3 VDLGFIIDSSGSIEFY---GKGNFKKVLDFVGG---IVKEFEVSKQGTHVGIIRYDHKAE 56 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTKSTP-GLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P Q + +KI+++ F T L A + ++ + Sbjct: 57 ILKPFGQVTDKQGVLDKISKITFTGGGTKTGQALTLAMDGLYQ--------IDNRKEVPD 108 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +I LTDG + +D N+ K G V+A+G+ E + L A Sbjct: 109 VLIVLTDGASKDSVVDP------ANKLKNSGVTVFAVGIGKEFKEDELNLIA 154 >gi|153830633|ref|ZP_01983300.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] gi|148873874|gb|EDL72009.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] Length = 1426 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 52/170 (30%), Gaps = 26/170 (15%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSM-------------NDHFGPGMDKLGVATRSIREML 203 +K S + ++ +++D S SM + A ++ + Sbjct: 807 GGIKTSVEPGKNYNIALIVDTSGSMRYDLAGNQNATYDGWNDSWSQTPEQYAASRMKLTI 866 Query: 204 DIIK----SIPDVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKS 255 D +K + D + V L+ F+ P + + KIN LI T Sbjct: 867 DALKVLATQLADHDGTVNITLIGFNGTAADALPFNNLSAANLSDLIGKINLLIADGGTNY 926 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 A + + Y FLTDG+ + N D+ Sbjct: 927 EDAFIEATKWFNTQPGSETDLKFENLTY-----FLTDGDPTVHNGDSSTG 971 >gi|113478293|ref|YP_724354.1| protoporphyrin IX magnesium-chelatase [Trichodesmium erythraeum IMS101] gi|110169341|gb|ABG53881.1| protoporphyrin IX magnesium-chelatase [Trichodesmium erythraeum IMS101] Length = 672 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 69/207 (33%), Gaps = 40/207 (19%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ ++L + + L+ F Sbjct: 470 ARKAGALVVFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQSRDQVSLIPF 518 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ ++ G + GL A +AK+ D Sbjct: 519 RGEQAEVLLPPTRSITSARRRLEKMPCGGGSPLAHGLTQAVRVGVNAKQ-------SGDV 571 Query: 283 YKKYIIFLTDGENSSP--------------NIDNKESLFYCNEAKRRGAIVYAI-----G 323 + I+ +TDG + P +E L + + G + I Sbjct: 572 GQVVIVAITDGRGNIPLARSLGEPMLDAEKPNIKEELLEIAGKIRGLGIQLLMIDTENKF 631 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRK 350 V A + K S ++Y + + Sbjct: 632 VSTGFAKELAKQ--SGGKYYHLPKASD 656 >gi|308483657|ref|XP_003104030.1| hypothetical protein CRE_02342 [Caenorhabditis remanei] gi|308258687|gb|EFP02640.1| hypothetical protein CRE_02342 [Caenorhabditis remanei] Length = 1471 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 43/292 (14%), Positives = 84/292 (28%), Gaps = 40/292 (13%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 E+ +N K+ + N + +H Sbjct: 782 SESEDYSNPTTPKSVIREDEDYDYTNEFAAFPPTVTTTIASTINSTNADDKVTTSKPKHV 841 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + + +P C P N D++ +LD S D Sbjct: 842 SSSEEIQPNFVIPSGQCPTPNQPNI----------------DRNRTDILFLLDSS----D 881 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQ 241 +F + A + I E + + + V+ LV ++ + F L +H+ Sbjct: 882 NFNEQ--RFHRAIKLIGETVSKFNNF--GADGVQVSLVQYNDEPYLEFSLRKHNCKKHLL 937 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + I F + L A K+ + + D + +I +TDG+ + + Sbjct: 938 DDIADTEFMTG---GSQLNKALEKVSQFAFTKKRGDRP--DAENILIIVTDGQYNGRIQE 992 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRKL 351 AK V I EA QF+ + + + + + L Sbjct: 993 PTRL------AKDNNVTVLVIST-IEADKQFVNELSGNRAENLFGLDANEDL 1037 >gi|305663060|ref|YP_003859348.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377629|gb|ADM27468.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 411 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 59/168 (35%), Gaps = 15/168 (8%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++ +D S SM+ K+ A +I M+ ++ PD L +F K+ + Sbjct: 41 LIAIDSSWSMDGE------KIFFAKDAILNMVRMLD--PD----DYISLYSFCGKVHKVL 88 Query: 232 PLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + I + + + G T + LE Y I + + +I Sbjct: 89 DFIRIRDMDRIVKAVAGIKLGGGTNTYGVLEQIYMDI-PSVLDRVKKEESDKIPSIRMIM 147 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG + D + + +I IGV + ++ L A Sbjct: 148 VTDGNPTVGIRDEDRIIDIAERLGKYLSISLIIGVGDDYNERLLAKIA 195 >gi|183602734|ref|ZP_02964097.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] gi|183217972|gb|EDT88620.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] Length = 839 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 72/222 (32%), Gaps = 63/222 (28%) Query: 190 DKLGVATRSIREMLDIIKS----IPDVNNVVRSGLVTFSS-------------------K 226 +L ++ LD ++ I D V+ L+ ++ Sbjct: 167 TRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNY 226 Query: 227 IVQTFPLAWGVQHIQEK---INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 LAW + +Q++ +N L G T++ GL++A ++ + Sbjct: 227 SQTVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA---------- 276 Query: 284 KKYIIFLTDGENSSP----NIDNKESLFYCNEAKRRGAIVYAIGVQ-------AEAADQF 332 +K +F +DG +S ++ + K + V +IG + A++F Sbjct: 277 QKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTDNANKF 336 Query: 333 LKNCAS----------------PDRFYSVQNSRKLHDAFLRI 358 + +S +Y+V L F I Sbjct: 337 MNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEI 378 >gi|167516596|ref|XP_001742639.1| hypothetical protein [Monosiga brevicollis MX1] gi|163779263|gb|EDQ92877.1| predicted protein [Monosiga brevicollis MX1] Length = 7261 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 38/122 (31%), Gaps = 8/122 (6%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIKSIPDVNNVVRSGLV 221 S ++ +D+ + +D + SM G +L + ++ + + + +R G V Sbjct: 7034 STFNVDVDLALAMDCTGSMGSWIAAGKSQLAQIVQHVQNQVRERFGELASPR--IRVGFV 7091 Query: 222 TFSSKIVQT-----FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + L VQ + KIN + A I + Sbjct: 7092 AYRDYSDGPNLVDSIDLTTNVQQVIAKINAQVATGGGDGPEEPATALELIAETFSWRAES 7151 Query: 277 AK 278 K Sbjct: 7152 MK 7153 >gi|29828406|ref|NP_823040.1| hypothetical protein SAV_1864 [Streptomyces avermitilis MA-4680] gi|29605509|dbj|BAC69575.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 462 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 54/163 (33%), Gaps = 25/163 (15%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + ++ + V+D S SM KL +++ + ++ Sbjct: 33 PEDAPATEPLPVNFVFVVDTSGSMTG------TKLDTVKSALQTIYRELRPADC------ 80 Query: 218 SGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 G++TF + P + E ++ L T G++Y ++I Sbjct: 81 LGIITFDHNVRTVLPAVAKQDLPPERFAEVVSALTTQGGTDIDLGVQYGIDEI------S 134 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 H G Y +DG+ +S D + AK RG Sbjct: 135 RHSVSGRTVNCLY--LFSDGDPTSGERDWIKVRANV-AAKLRG 174 >gi|15920717|ref|NP_376386.1| hypothetical protein ST0501 [Sulfolobus tokodaii str. 7] gi|15621500|dbj|BAB65495.1| 453aa long conserved hypothetical protein [Sulfolobus tokodaii str. 7] Length = 453 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 62/194 (31%), Gaps = 44/194 (22%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++LD S SM+ + +A S + + +R F I Sbjct: 291 IYLLLDKSGSMDGEKIIWAKAVALALYSRARR-------ENRDFYIR-----FFDNI--P 336 Query: 231 FPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +PL + V + E I ++ G T + + A I + KG + Sbjct: 337 YPLIKVPKNAKSKDVVKMIEYIGKIRGGGGTDISRSVISACEDIKEGH------VKGVSE 390 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II LTDGE+ + SL N I V + L+ D++ Sbjct: 391 ----IILLTDGEDKIAETTVRRSLKEANST--------LISVMIRGDNADLRRV--SDQY 436 Query: 343 YSVQ--NSRKLHDA 354 V + L Sbjct: 437 LVVYKLDHEDLLKV 450 >gi|114566000|ref|YP_753154.1| Mg-chelatase subunit ChlD-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114336935|gb|ABI67783.1| Mg-chelatase subunit ChlD-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 592 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 67/193 (34%), Gaps = 33/193 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 KS + +D+ +++D S SM DK A + +L K + +VTF Sbjct: 405 KSYVPIDVCLLIDASGSMAG------DKRQAACFLAQNLLLSGKE--------KVAVVTF 450 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ + T G+ A N I K + Sbjct: 451 QERSSEVVVPFTRNQNILNKGLSTISPAGLTPMADGIMTAVNLI-----------KNNRV 499 Query: 283 YKKYIIFLTDGENSSPNID---NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 ++ ++DG + P ++L + ++ I + E+ +L+ + Sbjct: 500 RNPLLVLISDGIPNIPLWTLDAQADALEAATHIREN--KIHFICIGLESNRFYLEKLSAN 557 Query: 338 SPDRFYSVQNSRK 350 + Y V + K Sbjct: 558 AGGALYLVDDLNK 570 >gi|226487566|emb|CAX74653.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 832 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 67/202 (33%), Gaps = 36/202 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG- 219 + S D+ + + ++D S SM D + A S+ L KS+P G Sbjct: 287 VVSSKDMRNEFVFLIDRSGSMEG------DNISYAKTSLLLFL---KSLPVNCRFQIIGF 337 Query: 220 ----LVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLE 274 FS + + L T++ L+ A Sbjct: 338 GSNFAALFSEPTDYS---EDSLNAAMNYQKDLNADMGGTEAYNALKSAL----------- 383 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + + + K IIFLTDG+ N D L N K R V+ IG+ + + Sbjct: 384 HSSPSGEGWFKQIIFLTDGD--VGNADEVIGLVRMNVDKAR---VFTIGLGQGVSTALIG 438 Query: 335 NCASPDRFYSVQ--NSRKLHDA 354 A + ++ +L A Sbjct: 439 GVARVGNGTAAYVRDASQLQSA 460 >gi|223461391|gb|AAI41092.1| Cacna2d4 protein [Mus musculus] Length = 1091 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 72/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D+S SM ++ +A +I +LD + VN ++ ++ + Sbjct: 270 DIVILVDISGSMKGL------RMAIAKHTITTILDTLGENDFVN------IIAYNDYVHY 317 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ + L A+ + +E +K Sbjct: 318 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVSQALIEAFEILKQFQE-----SKQG 372 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E +F R V+ IG + AD+ C Sbjct: 373 SLCNQAIMLITDG-----AVEDYEPVFETYNWPDRKVRVFTYLIGREVTFADRMKWIACN 427 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 428 NKGYYTQISTLAD 440 >gi|222101622|gb|ACM44016.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 71/194 (36%), Gaps = 18/194 (9%) Query: 134 EMPFIFCTFPWCANSSH-APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P + F A I S K LD +V+D S S+++ G Sbjct: 10 SVPLLSLAFLATTGIHAFADKGIGSPKGKQCKKQ--LDFSIVVDESASISNDQWEGQ--- 64 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS 251 +R ++ + N +R L T+S+ Q F L + + +L + + Sbjct: 65 --MIPFLRNLIHTVDL---DNTDIRLSLTTYSTPTRQIFTFLDAAASSTRLALTKLDWMA 119 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TK+ G+ Y + ++ + G + K ++ +TDG +S + + Sbjct: 120 GTKARSGMTYTGRALNYVRK--AILPYGRKNVPKALLLITDGVSSDGSY----TAQVAAM 173 Query: 312 AKRRGAIVYAIGVQ 325 + G V IGV Sbjct: 174 LRDEGVNVMVIGVG 187 >gi|148667191|gb|EDK99607.1| mCG142165 [Mus musculus] Length = 1025 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 72/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D+S SM ++ +A +I +LD + VN ++ ++ + Sbjct: 270 DIVILVDISGSMKGL------RMAIAKHTITTILDTLGENDFVN------IIAYNDYVHY 317 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ + L A+ + +E +K Sbjct: 318 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVSQALIEAFEILKQFQE-----SKQG 372 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E +F R V+ IG + AD+ C Sbjct: 373 SLCNQAIMLITDG-----AVEDYEPVFETYNWPDRKVRVFTYLIGREVTFADRMKWIACN 427 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 428 NKGYYTQISTLAD 440 >gi|124517718|ref|NP_001028554.2| voltage-dependent calcium channel subunit alpha-2/delta-4 [Mus musculus] gi|162319140|gb|AAI56407.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 4 [synthetic construct] Length = 1144 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 72/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D+S SM ++ +A +I +LD + VN ++ ++ + Sbjct: 298 DIVILVDISGSMKGL------RMAIAKHTITTILDTLGENDFVN------IIAYNDYVHY 345 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ + L A+ + +E +K Sbjct: 346 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVSQALIEAFEILKQFQE-----SKQG 400 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E +F R V+ IG + AD+ C Sbjct: 401 SLCNQAIMLITDG-----AVEDYEPVFETYNWPDRKVRVFTYLIGREVTFADRMKWIACN 455 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 456 NKGYYTQISTLAD 468 >gi|115893455|ref|XP_001199184.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] Length = 240 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 49/167 (29%), Gaps = 26/167 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++++D S SM H L D K N+VR + Sbjct: 1 VIVLVDTSGSMVTHMEDLKKDLVALI------WDQFKRENISFNIVRFSADIEPWRPHIV 54 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P ++ + T + L A+ + I L Sbjct: 55 EPTDANCNDAVRWVSSFVPAGNTCTLEALSEAFREKDVDA----------------IYLL 98 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKN 335 TDG+ S + N + G V+ I E+A+ FL+ Sbjct: 99 TDGKPDSSTSKVFREIAQVNTVR--GVKVHTISFNCNDESANTFLRQ 143 >gi|81871840|sp|Q5RJF7|CA2D4_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-4; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-4; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-4; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-4; Flags: Precursor gi|55832800|tpg|DAA05529.1| TPA_exp: putative voltage-gated calcium channel alpha(2)delta-4 subunit [Mus musculus] Length = 1116 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 72/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D+S SM ++ +A +I +LD + VN ++ ++ + Sbjct: 270 DIVILVDISGSMKGL------RMAIAKHTITTILDTLGENDFVN------IIAYNDYVHY 317 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ + L A+ + +E +K Sbjct: 318 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVSQALIEAFEILKQFQE-----SKQG 372 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E +F R V+ IG + AD+ C Sbjct: 373 SLCNQAIMLITDG-----AVEDYEPVFETYNWPDRKVRVFTYLIGREVTFADRMKWIACN 427 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 428 NKGYYTQISTLAD 440 >gi|186686512|ref|YP_001869708.1| vault protein inter-alpha-trypsin subunit [Nostoc punctiforme PCC 73102] gi|186468964|gb|ACC84765.1| Vault protein inter-alpha-trypsin domain protein [Nostoc punctiforme PCC 73102] Length = 818 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 45/282 (15%), Positives = 84/282 (29%), Gaps = 37/282 (13%) Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRE-NGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N S I + + E+++ N + I I +++ + N Sbjct: 194 NAPILPAGMRSRHDINVTIEINAGIEIQDINSPSHQIQIIREGQLVNVKLGGGDTIPNKD 253 Query: 129 AVSRYEMPFIFCTFPWCANS----SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + RY++ + H L + + + + DM+ ++D S S Sbjct: 254 LILRYQVASNNIQTTTLTQADERGGHFALYLIPAFEYRPDEIVAKDMVFLIDSSGS---Q 310 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQH 239 G + + R L+ + ++ FS Q P+ + Sbjct: 311 SGEPLMQCQELMRRFINGLNPDDTFN---------IIDFSDTTQQLSPVPLPNTSANRLL 361 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 INRL G T+ G+ N +L + I+ LTDG + N Sbjct: 362 AINYINRLNAGGGTEMLGGIRTVLNLKATNPGRLRN-----------IVLLTDGYIGNEN 410 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 E G +++ G + L A R Sbjct: 411 QILAEVKQRLQP----GTRLHSFGAGSSVNRFLLNRIAELGR 448 >gi|313238339|emb|CBY13421.1| unnamed protein product [Oikopleura dioica] Length = 501 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 61/194 (31%), Gaps = 32/194 (16%) Query: 167 IGLDMMMVL--DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G+ M MV D S S++ M K D + R +V F+ Sbjct: 32 AGVQMEMVFLADGSSSVSQVSFDLMKKWIA---------DFVDKFDVSQTATRVSVVQFT 82 Query: 225 SKIVQT----FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + FPL+ ++ + L T + L Y N + K Sbjct: 83 DNVSDRNGFGFPLSGQAARVKSDLANLRYMTGNTHAGNALRYVKNNVLSRARKSA----- 137 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 K +I LTDG + E ++ +V+++G+ L+ A Sbjct: 138 ----SKVLIVLTDGVSQD------EIEKAASDLIGDKVLVFSVGIGNSVDANELEKIAGL 187 Query: 339 PDRFYSVQNSRKLH 352 P+ + N L Sbjct: 188 PEYVFKTANYNALT 201 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 63/178 (35%), Gaps = 35/178 (19%) Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFP------LAWGVQHIQEKI-----NRLIFGSTT 253 II+ + +N R ++ FS + + L + +E + T Sbjct: 253 IIEKLEIGDNASRVSILQFSGQSARPQGRWINPVLTFDRSTSKEAVIGAIDGMKKLNGDT 312 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 L+Y Y +F ++ A D ++ +I +TDG+ + P E + Sbjct: 313 CIGEALDYFYRNMFTSQ------AGQRSDVEQRVIVMTDGKRNCP----AEIAKPAELIR 362 Query: 314 RRGAIVYAIGVQAEAA---------DQFLKNCAS---PDRFYSVQNSRKLHDAFLRIG 359 + A +YAIG+ + Q L AS + + N +L RIG Sbjct: 363 AQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFEINNFDQLI--LKRIG 418 >gi|297567245|ref|YP_003686217.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851694|gb|ADH64709.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 467 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 69/227 (30%), Gaps = 39/227 (17%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 L A + ++ P + ++ S L + V+D S SM + Sbjct: 8 LEASVQPHREYLQANQPGQKLFLALKIRPSAEATR---SRPQLVVAFVVDTSGSMREVVT 64 Query: 187 PGMDKLGVATRS-----------------IREMLDIIKSIPDVNNVVRSGLVTFSS--KI 227 ++ G + R + E L + S P + R +V F ++ Sbjct: 65 EPTERTGQSVRVDGKDYEVVRGAKSKIDLVIEALQNLLSSPQLQPSDRLAIVKFDDVAEV 124 Query: 228 VQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 VQ F A + RL + T+ G+ + + Sbjct: 125 VQPFTPANEKARLVAAAERLTQYSGGTQMGAGMREGMRLLEREAG------------SRR 172 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 +I LTDG+ D + + V AIGV E D L Sbjct: 173 LILLTDGQ----TFDEPLVETVAAQLAQARIPVTAIGVGDEWNDDLL 215 >gi|320108233|ref|YP_004183823.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319926754|gb|ADV83829.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 342 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 78/220 (35%), Gaps = 33/220 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + + L + ++LD S S + + ++++L L+ Sbjct: 96 TQEKTLPLTIGILLDTSGS---QMNVLPLEKDAGAQFLKDVLQPKDEAF---------LI 143 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY---NKIFDAKEKLEHIAK 278 F + + ++ +++ + + ++DA H Sbjct: 144 KFDINVDMLADYTNSPRELKRALDKAEINTGAGTGSVTGNGTPRGTLLYDAVYLGAHDKL 203 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------------A 326 + +K ++ LTDG + + ++++ +A AI+Y I + + Sbjct: 204 RQEAGRKVLVVLTDGGDQGSQLKLQDAIEAAQKA---NAIIYVILIADRGGFGNFSFGGS 260 Query: 327 EAADQFLKNCASPDRFYSV-QNSRKLHDAFLRIGKEMVKQ 365 D+ + + R +V N +KL DAF +I E+ Q Sbjct: 261 GDMDRLSRE--TGGRMINVGNNGKKLEDAFAQISDELRTQ 298 >gi|307254358|ref|ZP_07536196.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307258816|ref|ZP_07540548.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306862657|gb|EFM94613.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306867167|gb|EFM99023.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 538 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 78/247 (31%), Gaps = 21/247 (8%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G +++ +L I ++ + +E++ +A+L L+ ++L + N Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 67 ENGNNGKKQKNDFSYR-----------IIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 N+ K + + T + L + + T + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKTVNN 129 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS----DIGLDM 171 ++ + P + + K + +I +D+ Sbjct: 130 TSGKGHTSSSEVTCTVSGTIKHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIPIDL 189 Query: 172 MMVLDVSLSMNDHF------GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 M+V D+S SM D G K+ + ++E+ D + N R G+ F+ Sbjct: 190 MVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITAFAM 249 Query: 226 KIVQTFP 232 Sbjct: 250 GAEHPKE 256 >gi|226487570|emb|CAX74655.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 837 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 67/202 (33%), Gaps = 36/202 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG- 219 + S D+ + + ++D S SM D + A S+ L KS+P G Sbjct: 292 VVSSKDMRNEFVFLIDRSGSMEG------DNISYAKTSLLLFL---KSLPVNCRFQIIGF 342 Query: 220 ----LVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLE 274 FS + + L T++ L+ A Sbjct: 343 GSNFAALFSEPTDYS---EDSLNAAMNYQKDLNADMGGTEAYNALKSAL----------- 388 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + + + K IIFLTDG+ N D L N K R V+ IG+ + + Sbjct: 389 HSSPSGEGWFKQIIFLTDGD--VGNADEVIGLVRMNVDKAR---VFTIGLGQGVSTALIG 443 Query: 335 NCASPDRFYSVQ--NSRKLHDA 354 A + ++ +L A Sbjct: 444 GVARVGNGTAAYVRDASQLQSA 465 >gi|183983524|ref|YP_001851815.1| hypothetical protein MMAR_3543 [Mycobacterium marinum M] gi|183176850|gb|ACC41960.1| conserved membrane protein [Mycobacterium marinum M] Length = 983 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 66/194 (34%), Gaps = 35/194 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM K+ A R+ ++D + S R ++TF I Sbjct: 301 LVLVLDRSRSMAGW------KMTAARRAASRIVDALTSDD------RFAVLTFDDGIEYP 348 Query: 231 FPLAWGVQHIQ--------EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L G+ E + R+ T+ L A + + A Sbjct: 349 VGLPAGLTEASDRHRYRAVEHLARVEARGDTEMLAPLRRALALLGREQVADTDDA----- 403 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 +I ++DG+ N D + + ++ IGV FL+ A Sbjct: 404 ---VLILISDGQ--VGNEDQLLQELSGDLGR---VRLHTIGVDEAVNAGFLRRLAGVGGG 455 Query: 341 RFYSVQNSRKLHDA 354 R V N +L +A Sbjct: 456 RCVLVDNEDRLDEA 469 >gi|118618215|ref|YP_906547.1| hypothetical protein MUL_2774 [Mycobacterium ulcerans Agy99] gi|118570325|gb|ABL05076.1| conserved membrane protein [Mycobacterium ulcerans Agy99] Length = 981 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 66/194 (34%), Gaps = 35/194 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM K+ A R+ ++D + S R ++TF I Sbjct: 299 LVLVLDRSRSMAGW------KMTAARRAASRIVDALTSDD------RFAVLTFDDGIEYP 346 Query: 231 FPLAWGVQHIQ--------EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L G+ E + R+ T+ L A + + A Sbjct: 347 VGLPAGLTEASDRHRYRAVEHLARVEARGDTEMLAPLRRALALLGREQVADTDDA----- 401 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 +I ++DG+ N D + + ++ IGV FL+ A Sbjct: 402 ---VLILISDGQ--VGNEDQLLQELSGDLGR---VRLHTIGVDEAVNAGFLRRLAGVGGG 453 Query: 341 RFYSVQNSRKLHDA 354 R V N +L +A Sbjct: 454 RCVLVDNEDRLDEA 467 >gi|115704003|ref|XP_001203229.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115754998|ref|XP_797312.2| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 1262 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 52/169 (30%), Gaps = 26/169 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +++++D S SM H L D K N+VR + Sbjct: 1021 LAVIVLVDTSGSMVTHMEDLKKDLVALI------WDQFKRENISFNIVRFSADIEPWRSH 1074 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P ++ + T + L A+ + KE I Sbjct: 1075 IVEPTDADCNDAVRWVSSFVPAGNTCTLEALSEAFRE----KEVDA------------IY 1118 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKN 335 LTDG+ S + N + G V+ I E+A+ FL+ Sbjct: 1119 LLTDGKPDSSTSRVFREIAQVNTVR--GVKVHTISFNCNDESANTFLRQ 1165 >gi|157412070|ref|YP_001481410.1| TerY3 [Escherichia coli APEC O1] gi|99867095|gb|ABF67740.1| TerY3 [Escherichia coli APEC O1] Length = 346 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/133 (21%), Positives = 42/133 (31%), Gaps = 11/133 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L T ++ ++ ++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESMIGE------NLKKMTDGLQMIIGDLRKDPHALETAWVSVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + RL G T L +I K H AKG + Sbjct: 57 RTIVPL---HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--VV 111 Query: 288 IFLTDGENSSPNI 300 LTDG + Sbjct: 112 YLLTDGRPTDDTT 124 >gi|73971950|ref|XP_532030.2| PREDICTED: similar to sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Canis familiaris] Length = 3569 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 75/211 (35%), Gaps = 40/211 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 L+++ ++D S S+ + +L R +R++L +P R +VTFSSK Sbjct: 80 GRLELVFLVDESSSVGQ--ANFLSEL----RFVRKLLSDFPVVP---TATRVAIVTFSSK 130 Query: 227 IVQTFPLAWGVQH---------IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + ++ +I + + G T + + A + ++E Sbjct: 131 NNVVPRVDYISHRRAHQHKCALLRREIPAIAYRGGGTYTKGAFQQAAQILRHSRENS--- 187 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 K I +TDG ++ + + G ++ G+ + Sbjct: 188 -------TKVIFLITDGYSNGG-----DPRPVAASLRDFGVEIFTFGIWQGNIRELNDMA 235 Query: 337 ASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 ++P + Y + + + F + + + + Sbjct: 236 STPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|328858486|gb|EGG07598.1| hypothetical protein MELLADRAFT_77518 [Melampsora larici-populina 98AG31] Length = 503 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 67/227 (29%), Gaps = 55/227 (24%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRSGLVT 222 S+ LD+ +LD + SM + AT++I + D I + + + +R GL+ Sbjct: 26 SEKMLDLCFILDTTGSMG-------SYITAATQNIELICDEIINSERLASPECLRIGLIA 78 Query: 223 FSS-------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + I F + +QE + L + A ++ + + Sbjct: 79 YRDHPPQDRSYITLKFDFTSNPKIVQEHLKSLWASGGGDGPEAVTAAMHEALTLDWRPQ- 137 Query: 276 IAKGHDDYKKYIIFLTD------GENSS----PNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + +TD GE + + L + G ++ + Sbjct: 138 -------ASRMAVLITDAPPHGIGEYGDGFSKGDPSGHDPLKLARTMAQNGISLFVVA-- 188 Query: 326 AEAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 C A YS +D F I + Sbjct: 189 ----------CEPAFSGYMYS-------NDFFRAITNITSAMMLPLT 218 >gi|312129894|ref|YP_003997234.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906440|gb|ADQ16881.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 329 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 49/319 (15%), Positives = 106/319 (33%), Gaps = 46/319 (14%) Query: 53 DHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST 112 D S T L+ N + + K I + + I + Sbjct: 21 DPSAGSTFNIYLDYWNTTGRNPEIRFNEQKSSKEIKIDFNQTFEGQAEGTLYSKVIIDNF 80 Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 + +D + +Y + + YE + + V+ + +D+G +M+ Sbjct: 81 KI---LDRDNNNYTIQNIKAYE----------YRGDNVWKEDVEFKVQYTQTTDVG-EMV 126 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +VLD S S+ F + + + S P+ V+ G+V FS+ + P Sbjct: 127 LVLDRSESLGTDFE------RIKQYAAEFVEQTFASHPE----VKIGVVDFSAYP-SSLP 175 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + I I L + T ++ + + + + + ++ TD Sbjct: 176 ITNNKEVILNYIKNLEMENFTALYDAMDMGVDMLLRSASQ-----------SRILVTFTD 224 Query: 293 GENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAA-----DQFLKNCASPDRFYSV- 345 G ++ + L N K + IG+ + L +S SV Sbjct: 225 GTDNFSRATLNDVLSKINSDKNLNKIRGFFIGLAGKGDLDTSVPTLL---SSKGWIVSVP 281 Query: 346 QNSRKLHDAFLRIGKEMVK 364 Q++ ++ + F + G+ + Sbjct: 282 QSATQVKEVFNKFGRLISN 300 >gi|162138242|gb|ABX82825.1| complement factor B [Sus scrofa] Length = 765 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 74/211 (35%), Gaps = 34/211 (16%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +++ ++ + S GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGARNFTGAKNCLKDFIEKVASYGVKPKY---GLVTYATD 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 I + P + + E+++++ + T + L YN + + + Sbjct: 318 PKVLIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYNMMSWG---VNNFP 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN------IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 + + I+ LTDG ++ D ++ L K +Y GV Sbjct: 375 DNWNRTRHVIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNPREDYLDIYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAF 355 + + AS + +++ L D F Sbjct: 435 NQENINALASKKDKEQHVFKLKDVDNLEDVF 465 >gi|156120152|ref|NP_001095294.1| complement factor B [Sus scrofa] gi|148724909|emb|CAN87697.1| B-factor, properdin [Sus scrofa] Length = 765 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 74/211 (35%), Gaps = 34/211 (16%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +++ ++ + S GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGARNFTGAKNCLKDFIEKVASYGVKPKY---GLVTYATD 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 I + P + + E+++++ + T + L YN + + + Sbjct: 318 PKVLIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYNMMSWG---VNNFP 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN------IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 + + I+ LTDG ++ D ++ L K +Y GV Sbjct: 375 DNWNRTRHVIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNPREDYLDIYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAF 355 + + AS + +++ L D F Sbjct: 435 NQENINALASKKDKEQHVFKLKDVDNLEDVF 465 >gi|148724910|emb|CAN87698.1| B-factor, properdin [Sus scrofa] Length = 549 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 74/211 (35%), Gaps = 34/211 (16%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +++ ++ + S GLVT+++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGARNFTGAKNCLKDFIEKVASYGVKPKY---GLVTYATD 317 Query: 227 ----IVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 I + P + + E+++++ + T + L YN + + + Sbjct: 318 PKVLIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYNMMSWG---VNNFP 374 Query: 278 KGHDDYKKYIIFLTDGENSSPN------IDNKESLFYCNEAKR---RGAIVYAIGVQAEA 328 + + I+ LTDG ++ D ++ L K +Y GV Sbjct: 375 DNWNRTRHVIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNPREDYLDIYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAF 355 + + AS + +++ L D F Sbjct: 435 NQENINALASKKDKEQHVFKLKDVDNLEDVF 465 >gi|119511137|ref|ZP_01630255.1| Mg chelatase subunit [Nodularia spumigena CCY9414] gi|119464232|gb|EAW45151.1| Mg chelatase subunit [Nodularia spumigena CCY9414] Length = 680 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 67/211 (31%), Gaps = 37/211 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ V+D S SM ++++ A ++ ++L N + L+ F Sbjct: 479 KAGALVVFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQNRDQVALIPFRG 527 Query: 226 K-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + + ++ RL G + GL A +A+ D + Sbjct: 528 EQAEVLLPPTRSIALARNRLERLPCGGGSPLAHGLTQAVRVGVNAQ-------MSGDIGQ 580 Query: 285 KYIIFLTDG--------------ENSSPNIDNKESLFYCNEAKRRGAIVYAI----GVQA 326 I+ +TDG E E L + G + I + Sbjct: 581 VVIVAITDGRGNIPLARSLGEPLEPGEKPDIKAELLEIAARIRALGMQLLVIDTESKFVS 640 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + L A+ ++ + S K A + Sbjct: 641 TGFAKELSQTAAGKYYHLPKASDKTIAAMTK 671 >gi|118401451|ref|XP_001033046.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89287392|gb|EAR85383.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 680 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 82/231 (35%), Gaps = 44/231 (19%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG----------MDKLGVAT 196 N L+ +S+K + S+ + V+DVS SM + L + Sbjct: 22 NKLRISLIPPASIKRRTNSN----ICCVVDVSGSMGTEANSNSSVSSSENYCLSILDIVQ 77 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP----LAWGVQHIQEKINRLIFGST 252 +++ +++ + D++ LV FSS ++ F +KI +L + Sbjct: 78 HALKMIVNTLTPDDDLS------LVVFSSMAIEVFDTLRMDDANKILAIDKIEKLEASGS 131 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G++ N + +K + + + LTDG+ D++ + + + Sbjct: 132 TNLQHGIQVGLNILSKSK---------SQNRNQAMYVLTDGQP-----DDRNVMQFLKKY 177 Query: 313 KRRG----AIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLR 357 K+ + G + + L A + + ++ + AF Sbjct: 178 KKDNPQLRCTISTFGFGSSCDSELLDEIAREYNGMYSFIPDATLIATAFAN 228 >gi|330835244|ref|YP_004409972.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] gi|329567383|gb|AEB95488.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] Length = 450 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 98/309 (31%), Gaps = 64/309 (20%) Query: 72 GKKQKNDFSYRIIKNIWQTD---FRNELRENGFAQDINNIERSTSLSIIID-----DQHK 123 K ++ S + ++ I + + + G D+ + R+T + I++ + Sbjct: 167 AKATEDSNSVKSMQRIVGGNGAGTGSMMNFEGDIHDVLRLARNTEIKKILEFLSGIPRLG 226 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS----SKSDIGL---------- 169 + +RY +F P + ++ ++S + L Sbjct: 227 SFTKKKTARYSKGELFGYEEGSDIERLVPSELALPEEVFDVKLAESQLLLYQKQIKETLG 286 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM+ + +A S + + +R F I Sbjct: 287 PIYLLLDKSGSMDGEKILWAKAVALALYSRARR-------ENRDFYLR-----FFDNI-- 332 Query: 230 TFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +PL + V + E I ++ G T + + A + I D + Sbjct: 333 PYPLIKVIKNAKSKDVIKMVEYIGKIRGGGGTDISRSVMSACDDIKDGHVRGVSE----- 387 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +I LTDGE+ + SL N I V + LK D Sbjct: 388 -----VIILTDGEDKIAETTVRRSLKEANAT--------LISVMIRGDNADLKRV--SDN 432 Query: 342 FYSVQNSRK 350 ++ V + Sbjct: 433 YFVVYRLDQ 441 >gi|166366825|ref|YP_001659098.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089198|dbj|BAG03906.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 460 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 72/223 (32%), Gaps = 46/223 (20%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM----N 182 L+ F+ P L T V +++ + ++D S SM Sbjct: 2 LNVSITPHREFLPADTPDQRLFIMLKLRPTQEV---AQTRPSTTFVFLIDTSGSMYEIVA 58 Query: 183 DHFGP-----------------GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 F P G+ K+ + S+R +++ + P+ R L+ F Sbjct: 59 GDFQPTGETYMQDGKEYTRVVGGIAKIDMVIESLRTLINSGRFTPE----DRIALIQFDD 114 Query: 226 KIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + +++ I +L F T G+ A + + H Sbjct: 115 QASTLIGLTPVTQTRQLEDAIAKLRNFSGGTCMGRGINQALALLANQSMTSRHT------ 168 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 +IF TDG+ D + + +G + A+GV Sbjct: 169 ----MIF-TDGD----TFDEDDCQNLAQQFASQGISITALGVG 202 >gi|219127467|ref|XP_002183956.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217404679|gb|EEC44625.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 694 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 57/142 (40%), Gaps = 25/142 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI++ S ++++VLD S SM ++ ++LG+ + +++ + R Sbjct: 191 KIAADSG-PKNVVLVLDTSSSMGNY-----NRLGLLQDAAIRIVETLSVGD------RIA 238 Query: 220 LVTFSSKIV----QTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 +V FSS+ + W + ++ + L T + + + D+ + Sbjct: 239 IVQFSSQAKPFESKGQTFFWATKENKIALKTYVEDLELNEGTNTLDAFNKTFAVLDDSID 298 Query: 272 KLEHIAKGHDDYKKYIIFLTDG 293 + H ++FLTDG Sbjct: 299 QELHNECITA-----VLFLTDG 315 >gi|167752255|ref|ZP_02424382.1| hypothetical protein ALIPUT_00498 [Alistipes putredinis DSM 17216] gi|167660496|gb|EDS04626.1| hypothetical protein ALIPUT_00498 [Alistipes putredinis DSM 17216] Length = 294 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 66/185 (35%), Gaps = 21/185 (11%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 +I + + E++ G + +I + T+ + + + S V Y W Sbjct: 6 NDILKRVRKIEIKTRGLSNEIFAGKYHTAF------RGRGMSFSEVREYRAGDDVRDIDW 59 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + L MM+++DVS S D+L ++++ + Sbjct: 60 NVTARSRT-----PHIKVYEEERELTMMLLVDVSGS---RMFGTTDRL---KKNLQTEIA 108 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLE 260 + + N + G + FS +I + P G HI I L+ T+ + L Sbjct: 109 AVLAFSASENNDKVGCIFFSDRIEKFIPPKKGRSHILAIIRELVGFEPQSRGTRISEALR 168 Query: 261 YAYNK 265 + N Sbjct: 169 FLTNV 173 >gi|311070192|ref|YP_003975115.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] gi|310870709|gb|ADP34184.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] Length = 227 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 67/235 (28%), Gaps = 27/235 (11%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M F ++ + + ++ ++LD S SM G+ K + Sbjct: 1 MKKRFSLLMMVGLVLSVASPAFAAEQKVPAAKPDTNVAVLLDASGSMAKRID-GVSKFNL 59 Query: 195 ATRSIREMLDIIKSIPDVNNVV--RSGLVTFSSKIVQTFPLA-------WGVQHIQEKIN 245 A + I + + + V V G S K+ + + Q + +N Sbjct: 60 AKKEIFQFAKSLPTDSQVKMSVFGSEGNNKNSGKVQSCEAIRNVYGFQGFDEQSFRNSLN 119 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + T L A + +K E+ + LTDGE + Sbjct: 120 TIGPTGWTPIAKALNEAKSSFDQLDKKGEN----------VVYLLTDGEETCGG----NP 165 Query: 306 LFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + E + V IG E L A ++ + + F Sbjct: 166 IKTAKELHKHNITVNVIGFDFKEGYKGQLNAIAKVGGGEYFPASSQSDIKQIFKA 220 >gi|256823199|ref|YP_003147162.1| TPR repeat-containing protein [Kangiella koreensis DSM 16069] gi|256796738|gb|ACV27394.1| TPR repeat-containing protein [Kangiella koreensis DSM 16069] Length = 578 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 51/161 (31%), Gaps = 26/161 (16%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + + P + S + +VLD+S SM D +L A Sbjct: 70 LLLAFWILAVAALSGPSWKKLPQPVFSNLASRV---LVLDLSASM-DSPDVKPSRLARAK 125 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----T 252 + ++L IK +G V ++ PL I + L Sbjct: 126 FKLTDILKAIKDGQ-------TGFVVYAGDGFVLSPLTTDTDTIDNMVGVLATNLMPLVG 178 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 ++ + G+E A + + II++TDG Sbjct: 179 SQPSKGIEKAIELLDNGAAGEGD-----------IIWITDG 208 >gi|156358451|ref|XP_001624532.1| predicted protein [Nematostella vectensis] gi|156211319|gb|EDO32432.1| predicted protein [Nematostella vectensis] Length = 180 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 69/183 (37%), Gaps = 29/183 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L++ V+D S S+N+ + G +++M + S P + V +S + Sbjct: 6 LNLAFVVDGSGSINNA------RFGRFREFVKKMAE---SFPVSATNTQVATVVYSEEPE 56 Query: 229 QTFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F + I+ ++ + + G TT + L++ ++F K + K Sbjct: 57 LIFNFGKYNDINEIKTAVDNMPYHGKTTHTGKALKFTLEEVFKKARK---------NVKN 107 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFY 343 +I LTDG + + G V+A+GV + + + P D + Sbjct: 108 VLIALTDG------HASDLVKKPAQAVRDYGIEVFAVGVGSPDIAELEEIATDPDKDHVF 161 Query: 344 SVQ 346 +V Sbjct: 162 NVD 164 >gi|41054964|ref|NP_956728.1| integrator complex subunit 6 [Danio rerio] gi|82187780|sp|Q7SYD9|INT6_DANRE RecName: Full=Integrator complex subunit 6; Short=Int6; AltName: Full=Protein DDX26B gi|32766576|gb|AAH54905.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Danio rerio] Length = 892 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRTYLGTTYLDIAKGAVEIFM-KLRARDPASRGDRYMLVTFDDPPYGV 62 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 63 KA-GWKENHATFMSELKNLQASGLTTLGHALRAAFDLLNLNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PSVIITITDG 131 >gi|73669282|ref|YP_305297.1| protoporphyrin IX magnesium-chelatase [Methanosarcina barkeri str. Fusaro] gi|72396444|gb|AAZ70717.1| protoporphyrin IX magnesium-chelatase [Methanosarcina barkeri str. Fusaro] Length = 688 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 47/135 (34%), Gaps = 18/135 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + IG ++ V+D S SM ++ + ++ ML + + GL+ Sbjct: 487 REKKIGNLVLFVVDASGSMG-----AQQRMVASKGAVLSML-----MDAYQKRDKVGLIA 536 Query: 223 FSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F P V+ Q+ + L G T + GL Y I + Sbjct: 537 FKGTGAELLLPPTSSVEMAQKYLEELPTGGKTPLSHGLMKGYETIHAELRRD-----PDT 591 Query: 282 DYKKYIIFLTDGENS 296 +++ ++DG + Sbjct: 592 CP--FMVLISDGRAN 604 >gi|37523026|ref|NP_926403.1| hypothetical protein gll3457 [Gloeobacter violaceus PCC 7421] gi|35214029|dbj|BAC91398.1| gll3457 [Gloeobacter violaceus PCC 7421] Length = 596 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 44/317 (13%), Positives = 94/317 (29%), Gaps = 46/317 (14%) Query: 55 SLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL 114 + + + + + + F K + ++ R L + L Sbjct: 288 ASVGSVYEASVIAANGDLGSDEPRFEAVYPKATFTSNMRAILPSGPWVSPQEQAAAEQIL 347 Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFC------TFPWCANSSHAPLLITSSVKISSKSDIG 168 + + + + R +P + + N+ + L +++ D Sbjct: 348 AYLRTPESQKLAAEHGLRPGVPGVPLGEKFTAQYGVDPNARYDSLRSPKPEVVAAMLDSW 407 Query: 169 LD-------MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +++V+D S SM DKL ++++ +D + L+ Sbjct: 408 RSYTKKPSLVVLVIDSSGSMKG------DKLPAVQQTLQAYIDGLGPKET------IALI 455 Query: 222 TFSSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F S I LA G + + I L T+ YA + + + A Sbjct: 456 DFDSDIRDPM-LADASPAGRERAERFIAGLEAEGGTRLYDAALYARDWLVKHRRAGAINA 514 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKE---SLFYCNEAKRRGAIVYAIGVQAEA--ADQF 332 ++ LTDG++ ID L + + +G E + Sbjct: 515 ---------VVVLTDGKDDGSTIDLNRLGAELQKSGFSSDERVAFFTVGYGGEGQFNPEA 565 Query: 333 LKNCA--SPDRFYSVQN 347 LK A + + + Sbjct: 566 LKAIAELNGGYYIEGEP 582 >gi|290996921|ref|XP_002681030.1| predicted protein [Naegleria gruberi] gi|284094653|gb|EFC48286.1| predicted protein [Naegleria gruberi] Length = 353 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 65/186 (34%), Gaps = 32/186 (17%) Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKI 244 M + + I + I ++ R G+V F K PL + ++E + Sbjct: 1 MSSKEKTKMQVANQV-ICEIIDNLREFERLGIVLFDHKAETLLPLTIVQDLDKKSLKETV 59 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFD-AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 ++ +T G++ + L + + II+LTD + D Sbjct: 60 LKIEEQGSTNFEAGMQRGIDLFSTLDSSDLSNSNR--------IIYLTDACPNVGGTDTL 111 Query: 304 ESLFYCNEAKRRGAIVYA--IGVQAEAADQF------LKNCASPDRFYSVQNSRKLHDAF 355 + L +A +++ IG+ + ++ C ++SV+++ F Sbjct: 112 DILT--KDANSGPYNIFSTFIGIGLDFNSDIVEELTRVRGC----NYFSVRSTED----F 161 Query: 356 LRIGKE 361 +I + Sbjct: 162 TKILNQ 167 >gi|260786377|ref|XP_002588234.1| hypothetical protein BRAFLDRAFT_124701 [Branchiostoma floridae] gi|229273394|gb|EEN44245.1| hypothetical protein BRAFLDRAFT_124701 [Branchiostoma floridae] Length = 1044 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 65/206 (31%), Gaps = 38/206 (18%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS-IPDVNNVVRSGLVTFSSKIVQT 230 ++VLD S SM + + + + S + ++ G+VTF+S+ Sbjct: 274 VLVLDTSGSMG-------------KKLLFNLRQSLTSHVYNLPIGSSLGIVTFNSEATIN 320 Query: 231 FPLA-WGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ G + ++ + + G T GL+ A + + + Sbjct: 321 APMTVIGNETTRDALVGALPMTTGGKTSIGSGLQEALGLLGNDLGR-------------- 366 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFYS 344 II ++DG+ + G V+ + + A+ L FY Sbjct: 367 IILISDGQEDELPHIAD----VLPALRVAGHTVHTVAIGADGDPMLEQLSRDTGGKSFYH 422 Query: 345 VQNSRKLHDAFLRIGKEMVKQRILYN 370 + S I E + L Sbjct: 423 TRWSTNFPGILRTIEAEDTTRVPLKT 448 >gi|145592113|ref|YP_001154115.1| VWA containing CoxE family protein [Pyrobaculum arsenaticum DSM 13514] gi|145283881|gb|ABP51463.1| VWA containing CoxE family protein [Pyrobaculum arsenaticum DSM 13514] Length = 357 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 65/192 (33%), Gaps = 46/192 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + + LDVS SM ++ G G KL +A +I IK + ++ V L F++ Sbjct: 200 VYVALDVSGSMKEYMG-GATKLKIAKDAIARY---IKQMAELRGNV--SLTLFNADADYM 253 Query: 231 FPLAWGVQHIQEKINRLI-------FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 W + ++ T+ LE + A H Sbjct: 254 ----WTPHPAHRYLKEMLEILRYVYSMGGTEIASALE-----LLHADAARSH-------- 296 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 ++ ++DG + D ++ L + + ++ + + LK A + + Sbjct: 297 ---VVIISDGRTN----DPEKVLQLAKKFR----RIHTVA---AERSRLLKQIAKITGGK 342 Query: 342 FYSVQNSRKLHD 353 + + + L Sbjct: 343 YRELNPTLDLLS 354 >gi|291414463|ref|XP_002723479.1| PREDICTED: collagen, type VI, alpha 1 [Oryctolagus cuniculus] Length = 868 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 T + SS D+ ++LD S S+ H + A R L ++ P + Sbjct: 658 YTCPITFSS----PTDITILLDGSASVGSHNFDTTKRF--AKRLAERFLSAGRTDPAHD- 710 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY--NKIFDAKEK 272 VR +V +S + Q +Q N + T + A N + Sbjct: 711 -VRVAVVQYSGRGQQQPE----RAALQFLQNYTVLAGTVDTMGFFNDATDVNDALSYVTR 765 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 A KK ++F +DG + + E EA+R G ++ + V Sbjct: 766 FYREASPATAKKKVLLF-SDGNSQGATAEAIE--RAVQEAQRAGIEIFVVVVG 815 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 51/131 (38%), Gaps = 15/131 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG---MDKLGVATRSI 199 P A L + +D +D+ VLD S S+ P +DK+ T++ Sbjct: 59 PGEAVRGSRALPLG--------ADCPVDLFFVLDTSESVALRLKPYGALVDKVKAFTKNF 110 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLI-FGSTTKS 255 + L+ D N V +G + +S ++ L G ++ ++ + FG T + Sbjct: 111 IDNLNDRYYRCDRNLVWNAGALHYSDEVEVISGLTRMPAGRDALKASVDAVKYFGKGTYT 170 Query: 256 TPGLEYAYNKI 266 ++ ++ Sbjct: 171 DCAIKRGLEEL 181 >gi|254414923|ref|ZP_05028687.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196178412|gb|EDX73412.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 928 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/269 (14%), Positives = 88/269 (32%), Gaps = 37/269 (13%) Query: 83 IIKNIWQTDFRNELRE-NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCT 141 I + E++E + + I + + + + + N + RY++ Sbjct: 335 NINVTVDIEAGVEIKEVHSPSHQIQIERQDQGMRVTLSRRDTIPNKDLILRYQVAGDRTQ 394 Query: 142 FPWCANS----SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + + H + + +++ + + D++ ++D S S G ++K R Sbjct: 395 TTVLSQADTRGGHFAVYLIPAIEYNPHQLVPKDVVFLIDTSGS---QSGEPLNKCQELMR 451 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAW---GVQHIQEKINRLIFGST 252 L+ + ++ FS + + PLA IN+L Sbjct: 452 RFINGLNPHDTFT---------IIDFSDTTRQLSPVPLANTVQNRNSAMNYINQLNASGG 502 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T+ G++ N +L + I+ LTDG + N + L Sbjct: 503 TQLRRGIQAVLNFPEVDPGRL-----------RSIVLLTDGYIGNEN----QILAEVQRH 547 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + G +++ G + L A R Sbjct: 548 LKLGNRLHSFGAGSSVNRFLLNRIAEIGR 576 >gi|17555734|ref|NP_499381.1| C-type LECtin family member (clec-163) [Caenorhabditis elegans] gi|3880798|emb|CAA16338.1| C. elegans protein Y39A1B.1, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 385 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 77/235 (32%), Gaps = 15/235 (6%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGP 187 + + + +I F C S L + + K + LD++ V+D S M D Sbjct: 2 TAMFLLNYILLGFALCTGSPIVSQLSSYVDRPCGKDPKHLWLDIVAVVDNSKEMTDD--- 58 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 G+ + S+ + + P+ R GLVT++ + L + E N L Sbjct: 59 GVLSIAGELSSLFSHAEQLGIDPNQPRTTRIGLVTYNEEATVVADL-NNITTADELSNTL 117 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD--YKKYIIFLTDGENSSPNIDNKES 305 T S+ Y + A + L + Y+K +I + ++ Sbjct: 118 FAALTWTSSVEHSYLQTGLKAADDLLAKQSFNTSRGHYQKLVIVY---ASEYRVSGTQDP 174 Query: 306 LFYCNEAKRRGAIVYAIGVQAE---AADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 L K + + + + + L ASP ++ N + Sbjct: 175 LPLATRMKTY-LTIATVAYRQDIVVGFNDALTKIASPGYNFTNFNGNNVVSELKT 228 >gi|328882499|emb|CCA55738.1| conserved hypothetical protein SC6D10.11 [Streptomyces venezuelae ATCC 10712] Length = 442 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 57/165 (34%), Gaps = 24/165 (14%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 TS ++S+ +D G +++++D S SM+ K+ A + +D ++ V Sbjct: 42 TSGRQLSAPADGGAGVVIMVDCSGSMDY----PATKMRGAREATAAAVDTLRDGTAFAVV 97 Query: 216 VRSGLV--TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + L + A +E + RL G T L A + A + Sbjct: 98 AGTHLAKEVYPGNGGLAIADARTRAEAKESLRRLSAGGGTAIGTWLRLADRLLSSAALTI 157 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPN-------IDNKESLFYCNE 311 H I LTDG N + +D F C+ Sbjct: 158 RHG-----------ILLTDGRNEHESPEELRAALDACAGRFTCDA 191 >gi|323492793|ref|ZP_08097935.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio brasiliensis LMG 20546] gi|323312864|gb|EGA65986.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio brasiliensis LMG 20546] Length = 694 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 43/296 (14%), Positives = 87/296 (29%), Gaps = 57/296 (19%) Query: 86 NIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF--- 142 IW + N + + + ++ S+ + D + +P Sbjct: 232 TIWTANIGNYIDKTEREGESQQTGQA-SMRLDQDVVF-----YWRLQDGLPGRVDMVAYR 285 Query: 143 -PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-------GMDKLGV 194 P + L T ++ + G D + VLD S SM+ + G+ KL Sbjct: 286 DPETSKRGTVKLTFTPGDDLTRVTQ-GRDWVFVLDKSGSMSGKYSTLVEGVRQGLGKLPA 344 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSG---LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 R M D N +G V + V + + ++ + Sbjct: 345 EDRFRVVMFDN-------NTYDLTGGFVAVNPT-----------NVSKALQSVEQVEPSN 386 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T G+ A ++ D + I+ +TDG + + + Sbjct: 387 GTNLYEGMSAAIRRLDDDRPTG-------------IVLVTDGVANVGVTEKRRFFEL--- 430 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 ++ ++ + A L S SV N+ + + I ++ Q Sbjct: 431 MEKHDVRLFTFIMGNSANTPLLVPMTKLSNGVATSVSNADDIIGHLMNITSKLTYQ 486 >gi|329894015|ref|ZP_08270023.1| TPR domain protein in aerotolerance operon [gamma proteobacterium IMCC3088] gi|328923358|gb|EGG30677.1| TPR domain protein in aerotolerance operon [gamma proteobacterium IMCC3088] Length = 621 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 67/208 (32%), Gaps = 34/208 (16%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPG 188 +++ + + + P + V +S + + +VLD+S SM + P Sbjct: 57 KTQHTAKLLGLAWLIAIVGAAGPSWEQTPVPVSQQKAAYV---VVLDLSYSMYAEDTQP- 112 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 ++ A + IR++L + + LV ++ + PL + I+ I L Sbjct: 113 -SRIVKAKQKIRDLL-------SLPQDTQVALVAYAGEAHVVTPLTDDLGTIENLIPALN 164 Query: 249 FGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 G + A + + I++LTD + + + Sbjct: 165 PGMMPVPGSDPIDAFVRAKALLSSSG-----------VPGGSILWLTDDIETHQVSELSD 213 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQF 332 L G + + + +A Sbjct: 214 WLETA------GIRLATMAIGTQAGAPI 235 >gi|308472979|ref|XP_003098716.1| hypothetical protein CRE_04176 [Caenorhabditis remanei] gi|308268316|gb|EFP12269.1| hypothetical protein CRE_04176 [Caenorhabditis remanei] Length = 356 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 61/186 (32%), Gaps = 26/186 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ LD++ V+D S M G++++ S+ I R GLVT++ Sbjct: 33 TNLWLDVIAVVDNSRGMTVD---GLNEIASNIASVFGFGTRIGLNASEPRTTRLGLVTYN 89 Query: 225 SKIVQTFPL----AWG------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 S Q L + G + ++ T GLE A D Sbjct: 90 SVATQKADLNQYQSIGDVFHGIFYALSNTVDTTESYLAT----GLELAEKMFNDQSVNSI 145 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFL 333 Y+K +I + +D + + K G + + A + L Sbjct: 146 RAH-----YQKVVIVYAATYQTKGEMDPES---IADRLKMSGVKIITVAYGDAYGLMKSL 197 Query: 334 KNCASP 339 ASP Sbjct: 198 SVIASP 203 >gi|296106233|ref|YP_003617933.1| hypothetical protein lpa_01009 [Legionella pneumophila 2300/99 Alcoy] gi|295648134|gb|ADG23981.1| hypothetical protein lpa_01009 [Legionella pneumophila 2300/99 Alcoy] Length = 4669 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 50/145 (34%), Gaps = 12/145 (8%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ + + ++M++LD S SM + S E+L+ +++ + Sbjct: 3404 PVASNITRSGLANEGADTNLMLILDTSGSMAGS------GIQTLINSTLELLERYEALGN 3457 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 V VR VTF++ + V + + L G T L A N Sbjct: 3458 VK--VRI--VTFNTSATAIGSVWMTVDAAKNALLGLTAGGNTNFDAALITAMNAFNSGTV 3513 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENS 296 G Y F++DG + Sbjct: 3514 GGADGRIGGAQNVSY--FISDGNPT 3536 >gi|224534313|ref|ZP_03674891.1| conserved hypothetical protein [Borrelia spielmanii A14S] gi|224514415|gb|EEF84731.1| conserved hypothetical protein [Borrelia spielmanii A14S] Length = 370 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 21/174 (12%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 I +K S S LD+++V+DV+ SM + + + + +++ + + Sbjct: 215 IKKILKNSEDSVYDLDLVLVIDVTDSMKSN-------IEILKEHLFSIIEP--QLQKFKS 265 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R GLV + + A+ I + + G Y + Sbjct: 266 Y-RIGLVFYKDYLEDFLTKAFDFNTIPYLNNILKYVNVGGGGD--------YPEAVFEGI 316 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 +++II + D + N AK + +Y I Q Sbjct: 317 DAAVTQFDWRAERRFIIVIGDAPPHEYPRGSIVYKDVINSAKEKDITIYGIIFQ 370 >gi|239832733|ref|ZP_04681062.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] gi|239825000|gb|EEQ96568.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] Length = 579 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/97 (15%), Positives = 37/97 (38%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 + F G+++ + A++ P+ V ++TS F + +L + D + + A + Sbjct: 5 VSRFLGARGGNLATMAALVSPLFLAVAAFCVDTSSLFLERRQLQNMADLAAVAGAASLSQ 64 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFA 102 + Q N ++ + + N +N Sbjct: 65 ANEAVLRQLQANGVDPVLMTDGYDPSIVNGKADNKTR 101 >gi|158512138|gb|ABW69099.1| structural toxin protein [Legionella pneumophila] Length = 4669 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 50/145 (34%), Gaps = 12/145 (8%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ + + ++M++LD S SM + S E+L+ +++ + Sbjct: 3404 PVASNITRSGLANEGADTNLMLILDTSGSMAGS------GIQTLINSTLELLERYEALGN 3457 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 V VR VTF++ + V + + L G T L A N Sbjct: 3458 VK--VRI--VTFNTSATAIGSVWMTVDAAKNALLGLTAGGNTNFDAALITAMNAFNSGTV 3513 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENS 296 G Y F++DG + Sbjct: 3514 GGADGRIGGAQNVSY--FISDGNPT 3536 >gi|24374614|ref|NP_718657.1| TPR domain-containing protein [Shewanella oneidensis MR-1] gi|24349234|gb|AAN56101.1|AE015746_5 TPR domain protein [Shewanella oneidensis MR-1] Length = 679 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 52/164 (31%), Gaps = 28/164 (17%) Query: 136 PFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P L S+ + + + +V+D+S+SM ++L Sbjct: 57 PLHILAFSWFIATLALAGPALNKQSLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 A ++L +K +GL+ F+ PL + + ++ Sbjct: 113 QAKFRATDLLRSLKEGE-------TGLIAFAGDAFTISPLTRDTGTLLNLLPTLSPEIMP 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + L A N + II +TDG Sbjct: 166 VLGSNLAAALTQAKNLLAQGGHLRGD-----------IIVMTDG 198 >gi|285808484|gb|ADC36008.1| putative von Willebrand factor type A domain protein [uncultured bacterium 259] Length = 297 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 64/186 (34%), Gaps = 21/186 (11%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + S +D+ +VLD S SM +L R++ +LD Sbjct: 64 QQITSLSIESTPIDLTLVLDTSSSMTGMME----RLKGDVRAVEGLLD---------ESD 110 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R+GL+TFSS + + P+ + L +T + A H+ Sbjct: 111 RAGLITFSSSVREVSPMH--ERGDPAPAAALAPAGSTAFYQAVVAALLSATTPGR--PHL 166 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC---NEAKRRGAIVYAIGVQAEAADQFL 333 A D I L DG + S E++ Y + G+ V +G + L Sbjct: 167 ALVMSDGDDNISLL-DGADVSDLARRSETVLYVVLRGTIRASGSRVGWLGFRGPGDLDVL 225 Query: 334 KNCASP 339 K A+ Sbjct: 226 KEAAAA 231 >gi|192359569|ref|YP_001982631.1| TPR domain-containing protein [Cellvibrio japonicus Ueda107] gi|190685734|gb|ACE83412.1| TPR domain protein [Cellvibrio japonicus Ueda107] Length = 664 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 54/157 (34%), Gaps = 28/157 (17%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDI 205 + P V + ++++LD+S SM ++ P R+ ++ DI Sbjct: 76 TALAGPTWEKRPVSLEKNQQA---LVLLLDLSPSMLSEDLKPS-----RLLRARLKIADI 127 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEY 261 +K D +GLV ++ PL+ + I + L S + + + Sbjct: 128 LKRRQDG----YTGLVVYAGDAHVVTPLSDDSKTIINLLPDLTPYIMPLSGSNTEAAIGR 183 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 A + D+ ++ LTDG + Sbjct: 184 ALQMMHDSGISQGD-----------LLLLTDGVTADA 209 >gi|145502983|ref|XP_001437469.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404619|emb|CAK70072.1| unnamed protein product [Paramecium tetraurelia] Length = 562 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 77/207 (37%), Gaps = 19/207 (9%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPD 211 +T S +I + D ++++ D+S SM+ F + ++G D + + Sbjct: 102 NVTESPQIIQQQDPAEAIVVLYDISGSMSSQFFGDKELSRMGAVNAFFSAFADKTLAF-E 160 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 N++V+ LV F S + + ++ G +TK ++YA NK+ + K+ Sbjct: 161 FNHIVK--LVWFGSTLFDKCEFTSDFNKFIKLVDDANPGGSTKCYDAIDYAINKLLEVKQ 218 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV--YAIGVQAEAA 329 K I I+ LTDGE+ N + I+ + +G Sbjct: 219 KYPDIVLR-------ILALTDGED---NASASKPNTLVQRIFDHKIIIDSFVVGDNCVGL 268 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAFL 356 AS R Y ++ + F Sbjct: 269 KTLTH--ASNGRCYCPRDLGQGMSLFE 293 >gi|295698033|ref|YP_003602690.1| putative tellurium resistance protein [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295060145|gb|ADF64882.1| putative tellurium resistance protein [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 346 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/133 (21%), Positives = 42/133 (31%), Gaps = 11/133 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + VLD S SM L T ++ ++ ++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESMIGE------NLKKMTDGLQMIVGDLRKDPHALETAWVSVIAFAGVA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + RL G T L +I K H AKG + Sbjct: 57 RTIVPL---HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--VV 111 Query: 288 IFLTDGENSSPNI 300 LTDG + Sbjct: 112 YLLTDGRPTDDTT 124 >gi|303246179|ref|ZP_07332460.1| Ssl1 domain protein [Desulfovibrio fructosovorans JJ] gi|302492575|gb|EFL52446.1| Ssl1 domain protein [Desulfovibrio fructosovorans JJ] Length = 587 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 34/162 (20%) Query: 169 LDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L +++ +D S SM P D+L A +R +L +P V +G+V FS + Sbjct: 82 LRLLVAVDCSRSMLARDMAP--DRLSAAKGLVRAVL---AGLPH----VAAGVVGFSGRA 132 Query: 228 VQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P+ + ++ L T T LE + A+ Sbjct: 133 WLACPVTADRPALALFLDALSPAEAPLGGTSVTAALEACRLALTGARSGA---------- 182 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 I+ L+DGE++ P D S + V+ + V Sbjct: 183 ---ILVLSDGEDTVPVRDATGS-------RPDDPPVFTVAVG 214 >gi|221505369|gb|EEE31023.1| microneme protein, putative [Toxoplasma gondii VEG] Length = 723 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 70/191 (36%), Gaps = 40/191 (20%) Query: 151 APLLITSSVKISSKSDIG------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 P + ++ S LD+ ++D S S G+ + + + L Sbjct: 5 VPEGVEDVIQSDSAIGAAEGCTNQLDICFLIDSSGS------IGIQNFRLVKQFLHTFLM 58 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----------STTK 254 ++ P+ V + +VT+S+ + L W +Q +++ + +T Sbjct: 59 VLPIGPEE---VNNAVVTYSTDVH----LQWDLQS-PNAVDKQLAAHAVLEMPYKKGSTN 110 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 ++ GL+ +F G + K +I +TDGE + + ++ E + Sbjct: 111 TSDGLKACKQILFTGSR------PGREHVPKLVIGMTDGE----SDSDFRTVRAAKEIRE 160 Query: 315 RGAIVYAIGVQ 325 G IV + V Sbjct: 161 LGGIVTVLAVG 171 >gi|91775043|ref|YP_544799.1| von Willebrand factor, type A [Methylobacillus flagellatus KT] gi|91709030|gb|ABE48958.1| MxaC family protein, potentially involved in Ca2+ insertion into quinoproteins [Methylobacillus flagellatus KT] Length = 326 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 69/224 (30%), Gaps = 35/224 (15%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S K+ ++ G +MV+D S+SM+ F +I D Sbjct: 68 GSDKVVNRVGKGAQTVMVIDRSVSMDHPFAGDATSGRAGEIKSGAARRLITQFIDSRPDD 127 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLE 274 G+V F++ + + I IN T G+ A +FD+ + Sbjct: 128 MMGVVAFTNSALYGVKITANRDAIHAAINAATSAGINQTNIGAGITQA-ASLFDSIQSSG 186 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----- 329 A II L+DG + + + E + + +Y I ++ Sbjct: 187 SRA---------IILLSDG---AGKLSPRVKARIREELRDKDIKLYWIVLREPDDVSIFG 234 Query: 330 ---------------DQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 D+F K+ + Y N L A I Sbjct: 235 DRVFEEDRGPAAIQLDRFFKSLNITYKAYEADNPVALQMAIQDI 278 >gi|301617432|ref|XP_002938150.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Xenopus (Silurana) tropicalis] Length = 985 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D+S SM ++ +A +I +LD + VN ++ ++ + Sbjct: 222 DIVIVVDISGSMKGL------RMTIAKHTISTLLDTLGENDFVN------IIAYNDYVHY 269 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L L A+ + + +E + Sbjct: 270 IEPCFKGILVQADRDNREHFKQLVDELHAKGVGTVNKALIEAFKILKEFREAGQGGLCNQ 329 Query: 281 DDYKKYIIFLTDG 293 I+ +TDG Sbjct: 330 A-----IMLITDG 337 >gi|222101616|gb|ACM44013.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 71/194 (36%), Gaps = 18/194 (9%) Query: 134 EMPFIFCTFPWCANSSH-APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P + F A I S K LD +V+D S S+++ G Sbjct: 10 SVPLLSLAFLATTGIHAFADKGIGSPKGKQCKKQ--LDFSIVVDESASISNDQWEGQ--- 64 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS 251 +R ++ + N +R L T+S+ Q F L + + +L + + Sbjct: 65 --MIPFLRNLIHTVDL---DNTDIRLSLTTYSTPTRQIFTFLDAAASSTRLALTKLDWMA 119 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TK+ G+ Y + ++ + G + K ++ +TDG +S + + Sbjct: 120 GTKARSGMTYTGRALNYVRK--AILPYGRKNVPKALLLITDGVSSDGSY----TAQVAAM 173 Query: 312 AKRRGAIVYAIGVQ 325 + G V IGV Sbjct: 174 LRDEGVNVMVIGVG 187 >gi|254481586|ref|ZP_05094830.1| tetratricopeptide repeat domain protein [marine gamma proteobacterium HTCC2148] gi|214038214|gb|EEB78877.1| tetratricopeptide repeat domain protein [marine gamma proteobacterium HTCC2148] Length = 603 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 57/168 (33%), Gaps = 31/168 (18%) Query: 170 DMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +++VLD+S SM P R+ +++LD++ + ++GL+ ++ Sbjct: 86 ALVLVLDLSYSMMAADLAPS-----RNDRARQKLLDLLSQRKEG----QTGLIAYAGDAH 136 Query: 229 QTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL I + L G + ++ L A ++ A + Sbjct: 137 IVTPLTDDNPTIANLLPALNPGMMPLAGSEPAAALSQAVELMYSAGVQRGR--------- 187 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 I+ +TDG + E L G + +G+ Sbjct: 188 --IMLVTDGITEQDREEISELLQ------GSGMGLVIMGIGTATGAPL 227 >gi|119896366|ref|YP_931579.1| hypothetical protein azo0074 [Azoarcus sp. BH72] gi|119668779|emb|CAL92692.1| conseved hypothetical exported protein [Azoarcus sp. BH72] Length = 563 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 81/236 (34%), Gaps = 35/236 (14%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 PW S + I + + ++ +++ ++DVS SM DKL + + Sbjct: 166 IAPTPWNPRSLLLRVGI-QAADPAKQALPPANLVFLVDVSGSM-----NSPDKLPLLQNA 219 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKST 256 ++ + + R LVT++S G I I+ L+ G T Sbjct: 220 LKLF------VAQLRPQDRVALVTYASGTRVVLEPTAGDRKAAITAAIDGLVPGGATAGA 273 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G++ AY + +EH I+ TDG+ + + E ++ G Sbjct: 274 AGIDLAYRMA--EQGFVEHGINR-------ILLATDGDFNVGITRFETLKDRVAERRKSG 324 Query: 317 AIVYAIGVQAEA-ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + +G DQ ++ A A+ I Q++L ++ Sbjct: 325 IALSTLGFGGGNYNDQLMEQLADAG-----------DGAYRYIDSLAEAQKVLVDE 369 >gi|86131266|ref|ZP_01049865.1| conserved hypothetical protein [Dokdonia donghaensis MED134] gi|85818677|gb|EAQ39837.1| conserved hypothetical protein [Dokdonia donghaensis MED134] Length = 288 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 66/200 (33%), Gaps = 25/200 (12%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + S V +Y+ W + + + + + L MM+V DVS S Sbjct: 34 KGRGMTFSEVRQYQFGDDVRNIDWNVTARY-----SEPYIKVFEEERELTMMLVADVSGS 88 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 + FG I + + + N + GL+ F+ +I P G H+ Sbjct: 89 --EFFGTD----KQFKNEIVTEIAATLAFSAMQNNDKIGLILFTDEIELFIPPKKGKSHV 142 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 I L+ + L A + + +K + D F+TDG + I Sbjct: 143 LRIIRELLEFKPSSKKTDLAQAIKYLSNVMKKKAIVFVLSD-------FITDGYEQTMKI 195 Query: 301 DNKESLFYCNEAKRRGAIVY 320 N+ G +Y Sbjct: 196 -------AANKHDITGIRIY 208 >gi|308472927|ref|XP_003098690.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] gi|308268290|gb|EFP12243.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] Length = 411 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 70/189 (37%), Gaps = 22/189 (11%) Query: 165 SDIGLDMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 S++ LD+++V+D S M D +D G TR + I + R G Sbjct: 39 SNLWLDVVLVVDNSEEMGSQRLFDVAANIIDVFGANTRIGSNSSEPITT--------RVG 90 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 L+T++ L+ Q + N + + T ST + + + ++ Sbjct: 91 LITYNFNATLNANLS-QFQSYDDLSNGVFHSLSNVTNSTDSFIGTGLAMAEQLLRRQNFN 149 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFY-CNEAKRRGAIVYAIGVQAEAA-DQFLKN 335 D YKK II S+ + E+ + + K G + +G + L N Sbjct: 150 TTRDHYKKVIIV----YASAFQRNEDETPEWIADRLKGSGVKIITVGYGNSHGLIKSLSN 205 Query: 336 CASPDRFYS 344 ASP ++ Sbjct: 206 IASPGLSFN 214 >gi|153840568|ref|ZP_01993235.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149745769|gb|EDM56899.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] Length = 187 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 38/111 (34%), Gaps = 10/111 (9%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 KG +L +++L ++ V I+ +H+ K +L +D + L A + Sbjct: 7 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVA----DKT 62 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENG------FAQDINNIERSTSLS 115 Q + +I EL F+ D+ + S + Sbjct: 63 EDVDQAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFT 113 >gi|297666856|ref|XP_002811720.1| PREDICTED: von Willebrand factor A domain-containing protein 3B-like [Pongo abelii] Length = 968 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 55/170 (32%), Gaps = 30/170 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 369 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 421 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 422 EVNEDSLEQAQSWIRDMKIGSSTNTLSALKTAFA----DKETQA------------IYLL 465 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P + + E +Y I A++FLK A+ Sbjct: 466 TDGRPDQPPETVIDQVKLFQE-----IPIYTISFNYNDEIANRFLKEVAA 510 >gi|293570439|ref|ZP_06681494.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291609385|gb|EFF38652.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1042 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 53/139 (38%), Gaps = 29/139 (20%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ D++G + +D + + + + G V +SS Sbjct: 289 TPLDLVLVVDWSGSMNDN-----DRIGEVKIGVDRFVDTLSD-SGITDKINMGYVGYSSD 342 Query: 227 IVQTFPLAWGV------QHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + G ++ ++ + T + GL A + + Sbjct: 343 GYNY---SNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNGH----- 394 Query: 278 KGHDDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 395 ------KKVIVLLTDGVPT 407 >gi|307727464|ref|YP_003910677.1| von Willebrand factor type A [Burkholderia sp. CCGE1003] gi|307587989|gb|ADN61386.1| von Willebrand factor type A [Burkholderia sp. CCGE1003] Length = 326 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 73/252 (28%), Gaps = 38/252 (15%) Query: 145 CANSSHAPLLITSSVKIS---SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + + S++ G +++++D S SM++ ++ + Sbjct: 55 VLAMLAIVSGLAGPGRSPREVSRTGSGTQIILLMDRSASMDEPMESKGVQVSAGESKNKV 114 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST---TKSTPG 258 + R + F + + P + + I I G T+ G Sbjct: 115 ARASLTEFVAQRPNDRLAFMMFGTSPLLAMPFTYDHRAIDAAIAGTAVGRGMPDTQLDLG 174 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L A + + H ++ I+ ++DG + + R Sbjct: 175 LLTAIGEFNGS----------HSSSRRAIVLVSDG---GAKLSARVRQLIEEGLLRNQIA 221 Query: 319 VYAI----------------GVQAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIG 359 +Y I + A + + S P R + N++ + DA I Sbjct: 222 LYFIYLRSSVYSPDLNAALPANDSSAEAELHRYFLSLKTPYRLFQTGNAKAMRDAMAEIN 281 Query: 360 KEMVKQRILYNK 371 ++ Q + Sbjct: 282 RQQSVQTTFVER 293 >gi|226487568|emb|CAX74654.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 832 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 67/202 (33%), Gaps = 36/202 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG- 219 + S D+ + + ++D S SM D + A S+ L KS+P G Sbjct: 287 VVSSKDMRNEFVFLIDRSGSMEG------DNISYAKTSLLLFL---KSLPVNCRFQIIGF 337 Query: 220 ----LVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLE 274 FS + + L T++ L+ A Sbjct: 338 GSNFAALFSEPTDYS---EDSLNAAMNYQKDLNADMGGTEAYNALKSAL----------- 383 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + + + K IIFLTDG+ N D L N K R V+ IG+ + + Sbjct: 384 HSSPSGEGWFKQIIFLTDGD--VGNADEVIGLVRMNVDKAR---VFTIGLGQGVSTALIG 438 Query: 335 NCASPDRFYSVQ--NSRKLHDA 354 A + ++ +L A Sbjct: 439 GVARVGNGTAAYVRDASQLQSA 460 >gi|21703186|gb|AAM76090.1| Vwa1 protein [Boltenia villosa] Length = 599 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 68/184 (36%), Gaps = 31/184 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +D ++LD S S+ D M IR +L D + +V ++ Sbjct: 405 NARMDAFVILDSSSSIGDENWLIM------KAFIRNILGSFTISDDTTHF---AIVRYNG 455 Query: 226 KIVQTFPLAWG-----VQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + + + ++L + TK+ + + + + + Sbjct: 456 LVDTSTQVLLNDFPNSKAGLLAAFDKLPYNGSGTKTGQAIAHVRDNMMSSA------NGN 509 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----DQFLKN 335 + + ++ +TDG++ + L N+ ++ GA+++AIG+ Q L+ Sbjct: 510 REGIQDLVMVITDGKSQD------DVLKPSNDLRKMGALIFAIGITPPRGALDEAQLLEI 563 Query: 336 CASP 339 SP Sbjct: 564 AGSP 567 >gi|257063141|ref|YP_003142813.1| hypothetical protein Shel_04030 [Slackia heliotrinireducens DSM 20476] gi|256790794|gb|ACV21464.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 268 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 68/185 (36%), Gaps = 13/185 (7%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 SK+ L ++ VLD S SM + + + A R ++L + + + + G+ Sbjct: 7 TQSKARKLLPIIYVLDTSGSM--NVEGRISAVNEAMRETMDVLKDVAAKNPTAEL-KIGV 63 Query: 221 VTFSSKIV-QTFPLAWGVQHIQE----KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + FSS T + G + + N + G T + N++ + + Sbjct: 64 LAFSSGASWVTKDPSTGAPALLDLDDFYWNDMTAGGVTDLGAAMTELDNQLNRSAMLVS- 122 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 G +IF++DG + + ++ + N + A AI V A L Sbjct: 123 -DTGFKVP--VLIFMSDGGPTD-DWESAYNKAVANNRWVKSATKIAIAVGDGADRGVLTR 178 Query: 336 CASPD 340 A + Sbjct: 179 VADGN 183 >gi|213625177|gb|AAI69984.1| Complement factor B [Xenopus laevis] Length = 747 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 78/217 (35%), Gaps = 35/217 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D +++ +VLD S S+ G ++ A + ++ + + R +++++ Sbjct: 236 KDGLMNIFIVLDTSKSV------GQNRFDEAKSASILFIEKMSNYDIKP---RYCIISYA 286 Query: 225 SKIVQTFPL----AWGVQHIQEKI-----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 SK + L + + E + +R T + L Y + + + E Sbjct: 287 SKAISVVSLRDPDSNNADAVMEHLEEFQYDRHEDKQGTNTRAALHAIYEHLIEQELAYER 346 Query: 276 IAKGHDDYK--KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI----------VYAIG 323 K D K I+ +TDG+ + D +E + G VY G Sbjct: 347 EGKKEDFMKIHNVILLMTDGKFNMGG-DPREEMKLIKRFLDVGIRKDNPREEYLDVYVFG 405 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAFL 356 + ++ + + AS + +QN K+ + F Sbjct: 406 LGSDIDQPEINDLASKKEKEVHTFHLQNVDKMKEFFE 442 >gi|170591600|ref|XP_001900558.1| Immunoglobulin I-set domain containing protein [Brugia malayi] gi|158592170|gb|EDP30772.1| Immunoglobulin I-set domain containing protein [Brugia malayi] Length = 6163 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 66/184 (35%), Gaps = 25/184 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + I + ++I ++ D D++ VLD S D+ + I ++D I + Sbjct: 5420 MRIATPIRICNRVDFQADIIFVLDSS----DNVTSK--EYVNLKEDISMLIDDIFDLSP- 5472 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 ++VR G + +S K PL + VQ + + N G T GL A + Sbjct: 5473 -DIVRIGFIEYSDKASVPVPLGYYDNKVQLLADISNSEQLGGTPVIVRGLHAAKEQFKRH 5531 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + ++ +T G N + L KR V+A+ V Sbjct: 5532 GRNG---------VSRILLLVTSGANRGNVATAADDLR-----KRLKVSVFALVVNTSRG 5577 Query: 330 DQFL 333 Q + Sbjct: 5578 AQMM 5581 >gi|158318893|ref|YP_001511401.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158114298|gb|ABW16495.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 238 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 55/148 (37%), Gaps = 10/148 (6%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +++D S SM+ ++++ + E++ I+ P + +VVR G + F+ Sbjct: 15 LAFYILVDASYSMSGAPMLAVNEI------LPEVISTIEQSPTLGDVVRLGALDFADDAR 68 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + + T G +I + G+ Y+ + Sbjct: 69 VVLRLDDLRN--IGGVPQFAARGGTSYAAGFRQLRKEIESD--LAQLKGDGYKVYRPAVF 124 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRG 316 F+TDGE + D + +A RG Sbjct: 125 FITDGEPTDDQKDLDAAFAELTDANFRG 152 >gi|282897675|ref|ZP_06305674.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281197354|gb|EFA72251.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 464 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/246 (16%), Positives = 75/246 (30%), Gaps = 45/246 (18%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND---- 183 +A+ P A +L K + + V+D S SM + Sbjct: 3 NAIKTAIAPNREFMLADKAGQKLFVMLKLRPTKDIATNLPPTSFTFVIDTSGSMYEVVAG 62 Query: 184 -----------------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G K+ + S+ +++ K + R +V F Sbjct: 63 DVEDTGVTYQQDGKEYKQVTGGKSKIDIVIESLLRLVNSGK----LKQQDRVSIVQFDDS 118 Query: 227 IVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 Q L + I+ I +L F T+ GL A++ + + + Sbjct: 119 ASQIIGLTSATETKQIETAIKKLRDFSGGTRMGLGLRRAFDILSE-----------QEMT 167 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 K + TDG+ D + N R + A+GV E + L + + + + Sbjct: 168 VKRALLFTDGQ----TFDEDQCQSIANHFATRNIPITALGVGEEFNEDLLTHLSDYTGGK 223 Query: 342 FYSVQN 347 + V Sbjct: 224 LFYVVP 229 >gi|149919074|ref|ZP_01907558.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149820004|gb|EDM79425.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 520 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 57/177 (32%), Gaps = 22/177 (12%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 LD+ +V+D + SM D ++ +I ++ R LV + Sbjct: 281 SRLDVALVIDATGSMGDELEYLKVEIRDIAEAINHHFPGVEQ--------RFALVVYRDK 332 Query: 226 ---KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + ++F + Q ++ G P + A K ++ D Sbjct: 333 GDSYVTRSFDFTTNLDSFQRDLSEQSAGGGGD-YP------EAMDAAMAKAAKLSWSKQD 385 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG-VQAEAADQFLKNCAS 338 + + D D ++L ++ + +G VY + +F+ A+ Sbjct: 386 AARVTFLVADAPPHQDKAD--DTLAAVDKLRAKGVAVYPVASSGVAGEAEFVMRSAA 440 >gi|301766292|ref|XP_002918563.1| PREDICTED: cochlin-like [Ailuropoda melanoleuca] Length = 550 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 66/213 (30%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 330 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 388 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 389 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 437 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 438 RYMSGGTATGDAISFTVRNVFGPVRD--------SPNKNFLVIVTDGQ----SYDDVRG- 484 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 485 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 516 >gi|1923217|gb|AAB63303.1| micronemal protein MIC2 [Toxoplasma gondii] gi|221484107|gb|EEE22411.1| microneme protein, putative [Toxoplasma gondii GT1] Length = 769 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 65/167 (38%), Gaps = 34/167 (20%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+ ++D S S G+ + + + L ++ P+ V + +VT+S+ + Sbjct: 75 LDICFLIDSSGS------IGIQNFRLVKQFLHTFLMVLPIGPEE---VNNAVVTYSTDVH 125 Query: 229 QTFPLAWGVQHIQEKINRLIFG----------STTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L W +Q +++ + +T ++ GL+ +F Sbjct: 126 ----LQWDLQS-PNAVDKQLAAHAVLEMPYKKGSTNTSDGLKACKQILFTGSR------P 174 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 G + K +I +TDGE + + ++ E + G IV + V Sbjct: 175 GREHVPKLVIGMTDGE----SDSDFRTVRAAKEIRELGGIVTVLAVG 217 >gi|285808482|gb|ADC36006.1| von Willebrand factor type A domain protein [uncultured bacterium 259] Length = 311 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/144 (19%), Positives = 58/144 (40%), Gaps = 27/144 (18%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + ++++ + +D+ +V+D S SM +D L ++I ML Sbjct: 60 VRQTIEMVDAETLPIDLTVVVDTSGSMRRS----VDDLKADAQAIAAMLRPND------- 108 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R L+TF+ +I +T+ L + ++RL +T + A + Sbjct: 109 --RIRLLTFAGQIRETYALQAPSADLA--LDRLAATGSTSLFDAVAVALAGVT------- 157 Query: 275 HIAKGHDDYKKYIIFLTDGENSSP 298 D + ++ LTDG+++S Sbjct: 158 -----GTDRRHLVVVLTDGQDTSS 176 >gi|284053489|ref|ZP_06383699.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] Length = 396 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 76/235 (32%), Gaps = 29/235 (12%) Query: 118 IDDQHKDYNLSAVSRYEMPFIFCT-FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 I++ + L P I + + P+ I S + +++++D Sbjct: 19 INNDNVTLRLQVTDERGRPVIQLQKQDFQVITDDEPVGIKSWKSPQESTPPPAWIVVLVD 78 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS----------- 225 ++ SMN+ G ++ A + R L+ I D + +V F Sbjct: 79 LTGSMNELDTSGKRRIDGALDATRRFLEQIS---DRGGDTKVAIVPFGKGGANCPGFEVT 135 Query: 226 --KIVQTFPLAWGVQHIQ--EKINRLIFGSTTKSTPGLEYAYNKIFD--AKEKLEHIAKG 279 I F A ++ + + + T L A + + G Sbjct: 136 QRGINSKFFPANDIKQTNFLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFYVPEDSG 195 Query: 280 HDDYKKYIIFLTDGENSSPN----IDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 + + +I L+DG ++ PN DN +L + IV+ +G Sbjct: 196 RLEPRLSVILLSDGFHNQPNEQQDFDNLITLLE----RNNNIIVHTLGYGLTPQQ 246 >gi|218458530|ref|ZP_03498621.1| hypothetical protein RetlK5_03343 [Rhizobium etli Kim 5] Length = 185 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 66/198 (33%), Gaps = 32/198 (16%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F+ + +G + LT I +P++ L+I+ + L +D L A ++ Sbjct: 6 VRRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGAREL-- 63 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 +G + + + I N F+ + + +S+ D + Sbjct: 64 --DGRDDAITRAQTAIEKISN-----------SAAFSAGGTGMSLGSHISVTYDAGNDAG 110 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + V + P+ + ++S++ V+ +M F Sbjct: 111 STVTV---------LFLKDIPANDDTPIPSSMETTVASEA----SYAWVIAKPQAMQTIF 157 Query: 186 GPGM----DKLGVATRSI 199 + D + +A ++ Sbjct: 158 PIPVGFTRDTINIAADAV 175 >gi|198421146|ref|XP_002121270.1| PREDICTED: similar to cubilin [Ciona intestinalis] Length = 728 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 70/212 (33%), Gaps = 40/212 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+M +LD S S+ +A +++ + I + G++ +S + Sbjct: 536 DLMFLLDSSGSVTSS------DFQLAANFVKDFITGIDLTS-----FQVGVMQYSHYL-L 583 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK----------- 278 L +I +IN G T++ P + A + I Sbjct: 584 NRELD-DQPYITTEIN---IGEYTEADP-FKTAMDTIQPHGYTTYTAHAVLKAIRVDFPR 638 Query: 279 ----GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + K I+ +TDG + D+ EAK G +YAIGV + + Sbjct: 639 STRFNNSCTSKIIVLITDG----SSSDSSMLRDAALEAKNLGVDIYAIGVGDANTQELVV 694 Query: 335 ----NCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + D+ + + L + ++ Sbjct: 695 LTRPESGTKDKIFQIDQYSSLPSILQGLRTKI 726 >gi|332223236|ref|XP_003260773.1| PREDICTED: cochlin [Nomascus leucogenys] Length = 550 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 330 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 388 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 389 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 437 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 438 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 484 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 485 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 516 >gi|114621486|ref|XP_001143977.1| PREDICTED: collagen, type XIV, alpha 1 isoform 2 [Pan troglodytes] Length = 1685 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 78/199 (39%), Gaps = 31/199 (15%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++D S S+ D +F + L ++ ++ + + +V F+ Sbjct: 937 DLVFMVDGSWSIGDENFNKIISFLYSTVGALHKI---------GTDGTQVAMVQFTDDPR 987 Query: 229 QTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L A+ + + + I + + G TK+ ++Y + +F A E K Sbjct: 988 TEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA-ESGTRRGIP-----K 1041 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFY 343 I+ +TDG + + E + G ++AIGV + + + P + Sbjct: 1042 VIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVF 1095 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V + DAF +I E+ Sbjct: 1096 FVDD----FDAFKKIEDEL 1110 >gi|297841999|ref|XP_002888881.1| hypothetical protein ARALYDRAFT_476383 [Arabidopsis lyrata subsp. lyrata] gi|297334722|gb|EFH65140.1| hypothetical protein ARALYDRAFT_476383 [Arabidopsis lyrata subsp. lyrata] Length = 757 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/229 (14%), Positives = 67/229 (29%), Gaps = 45/229 (19%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 I PW ++ L K ++ V+D+S SM L Sbjct: 293 LIKSPSPWDSDDRGIFCLYLFPGTTKHKKLFKRRVVFVIDISASMKWK------PLEDVK 346 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINR-----LIFG 250 +++ E L +++ N ++ F+ +I + + + + LI Sbjct: 347 KALLECLAKLQAEDVFN------IIAFNDEILEFSTSMEFATDETISAVTEWLDTNLIAN 400 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L+ A + +G + + +TDG + CN Sbjct: 401 GGTNMLLPLKQAIKLL-----------EGSNIGVPLVYLVTDG-------SVENEREICN 442 Query: 311 EAKRR--------GAIVYAIGVQAEAADQFLKNCASPDR-FYSVQNSRK 350 K + G+ + FL+ A +Y N+ Sbjct: 443 AMKESCSRNGKSISPRISTFGIGSFCNHYFLQMLARIGNGYYDGTNNTD 491 >gi|153011704|ref|YP_001372918.1| von Willebrand factor type A [Ochrobactrum anthropi ATCC 49188] gi|151563592|gb|ABS17089.1| von Willebrand factor type A [Ochrobactrum anthropi ATCC 49188] Length = 633 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 78/227 (34%), Gaps = 32/227 (14%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + L +T V +S +D +D VLDV + L + + Sbjct: 433 VHLMSRPKANDLAVTILVDVSLSTDAWIDNRRVLDVE-------KEALLVLANGIAACGD 485 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 I + VR + V+ F +G ++ +I L G T+ + + Sbjct: 486 RCSIQTFTSRRRSWVRV-------ETVKDFDETFGPA-VEHRIAALKPGFYTRMGAAIRH 537 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A K+ + + KK ++ LTDG+ + ++S EA+ G Sbjct: 538 ATAKLAEQP-----------NRKKLLLVLTDGKPNDVDHYEGRFALEDSRRAVAEARTTG 586 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 V+A+ V EA +L + V KL A I + M Sbjct: 587 VNVFAVTVDREAN-AYLPTLFGRRNYALVAKLSKLPVALPAIYRMMT 632 >gi|149921504|ref|ZP_01909956.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] gi|149817707|gb|EDM77174.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] Length = 560 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 63/171 (36%), Gaps = 19/171 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + +++ +VLD S SM + + A + ++ D + + + L Sbjct: 212 TPEERPPMNVTLVLDTSGSMAGTPIELLRETSRAIAAQLKLGDTVSICEWDTSNDWT-LA 270 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++ T P + + EKIN ++ G T GLE Y L + D Sbjct: 271 GYA----VTGP---NDELLLEKINDVVHGGGTNLYGGLESGYE--------LAQMVYDPD 315 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ ++DG ++ D G +Y +GV + D + Sbjct: 316 AINR-LVLISDGGANAGITDLDLIAENAAYGGSDG--IYLVGVGVDDPDDY 363 >gi|328954172|ref|YP_004371506.1| cobaltochelatase subunit [Desulfobacca acetoxidans DSM 11109] gi|328454496|gb|AEB10325.1| cobaltochelatase subunit [Desulfobacca acetoxidans DSM 11109] Length = 678 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 73/211 (34%), Gaps = 28/211 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + IG ++ ++D S SM ++ + +I +L + R +V+ Sbjct: 487 REKRIGNFLLFLVDASGSMG-----ARGRMTASKGAIMSLL-----LDAYQKRDRIAMVS 536 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + V P ++ + + G T + GL Y I + Sbjct: 537 FRKQEAVLNLPPTSSIETAAHLLKEMPVGGRTPLSAGLAKTYEVIRN-------YLLRDP 589 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN------ 335 + +I +TDG+++ + K + A R G + + Q L Sbjct: 590 TARPIVIMITDGKSNVALGEKKPMTEAFDLASRLGLDERVRFIVVDTESQGLVRFGLARE 649 Query: 336 --CASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A ++ +++ + D + I KE V+ Sbjct: 650 LAIAMQAEYFKIEDLKA--DTLVNIAKEKVE 678 >gi|302540662|ref|ZP_07293004.1| von Willebrand factor, type A [Streptomyces hygroscopicus ATCC 53653] gi|302458280|gb|EFL21373.1| von Willebrand factor, type A [Streptomyces himastatinicus ATCC 53653] Length = 340 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 74/205 (36%), Gaps = 32/205 (15%) Query: 174 VLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 V+D S SM+D PG ++ V S+R+ L + + GL FS+++ Sbjct: 136 VVDASASMSDPVPGRPGESRMDVTKASLRQALSRFNAGDE------IGLWEFSTELDGDR 189 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE--------KLEHIAKGHDDY 283 + + + T L A++ + E L K + + Sbjct: 190 DY---RELVATRRLGARTPDGTGQRAELAAAFDALKPLPEGSTGLYDTTLAAYKKAQETF 246 Query: 284 KK----YIIFLTDGENSSPNIDNKESLFYCNEAK-----RRGAIVYAIGVQAEAADQFLK 334 + ++ LTDG N P ++ +L E K R + AI V +A + Sbjct: 247 VRGKFNAVVMLTDGANQDPGSISRGAL--VKELKRLVDPDRPVPLIAIAVGPDADQAACR 304 Query: 335 NC--ASPDRFYSVQNSRKLHDAFLR 357 A+ V + +++ A L+ Sbjct: 305 EIAQATGGSAQQVNDPAQINTAMLK 329 >gi|194207263|ref|XP_001489838.2| PREDICTED: coagulation factor C homolog, cochlin (Limulus polyphemus) [Equus caballus] Length = 549 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 58/177 (32%), Gaps = 30/177 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +++ ++D S S+ D M + +I K+ + + V F+ Sbjct: 364 SVNIAFLIDGSSSVGDSNFRLMLEFVS---------NIAKTFEISDIGAKIAAVQFT--Y 412 Query: 228 VQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 Q ++ +E + +I T + + + +F Sbjct: 413 DQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPMRD--------SP 464 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K +++ +TDG+ + D+ A G ++++GV D + P Sbjct: 465 NKNFLVIVTDGQ----SYDDVRG--PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 515 >gi|90021388|ref|YP_527215.1| TPR domain-containing protein [Saccharophagus degradans 2-40] gi|89950988|gb|ABD81003.1| TPR repeat [Saccharophagus degradans 2-40] Length = 658 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 55/163 (33%), Gaps = 26/163 (15%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGV 194 P I W S + K D+ ++++ D+S SM + P Sbjct: 62 PLIALIPLWVIASIALAGPTWKKINQPLKQDLS-AVVILWDLSPSMLAEDLKPS-----R 115 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---- 250 A R+ +++D+ SGL+ F+ + PL + ++ +N L Sbjct: 116 AARAKYKLIDLFAK----RQTGLSGLIAFAGEAHIVTPLTDDARTVKNLLNGLSPDMMPV 171 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + LE A + + IIF+TDG Sbjct: 172 QGSNPEMALELAVKLLKEGGVARGD-----------IIFVTDG 203 >gi|70606762|ref|YP_255632.1| hypothetical protein Saci_0977 [Sulfolobus acidocaldarius DSM 639] gi|68567410|gb|AAY80339.1| conserved Archaeal protein [Sulfolobus acidocaldarius DSM 639] Length = 451 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 59/175 (33%), Gaps = 40/175 (22%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++LD S SM+ K+ A + + + + + +R F I Sbjct: 289 IYLLLDKSGSMDGE------KIIWAKAVALSLYNRARR-ENRDFYIR-----FFDNI--P 334 Query: 231 FPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +PL + V + E I ++ G T + + A I + KG + Sbjct: 335 YPLIKVMKNAKSKDVIKMIEYIGKIRGGGGTDISRSIISACEDIKEGH------VKGVSE 388 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +I LTDGE+ + SL N I V + L+ A Sbjct: 389 ----VILLTDGEDKIAETTVRRSLREANSV--------LISVMIRGDNADLRRIA 431 >gi|327270796|ref|XP_003220174.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 948 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 76/212 (35%), Gaps = 39/212 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +VLD S M + +L A + +L II+S V G+VTF+++ Sbjct: 294 SVCLVLDTSGKMGKD--NRLGRLNQAAKLF--LLQIIESGSWV------GIVTFNNEAAT 343 Query: 230 TFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L GV+ +K G+ + + E Sbjct: 344 KTLLQKIVNDGVRQTLTSYLPTTAAGESKICDGVLAGFQVFLNKYPSSEGCE-------- 395 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFY 343 I+ LT GE+ ++ C + + G+I++ I + +++ K + + Sbjct: 396 -IVLLTHGED--------PAIRSCFPQIQNSGSIIHTIAFGSGTSNELEKLADMTGGLAF 446 Query: 344 SVQNSRK---LHDAFLRI---GKEMVKQRILY 369 +S L DAF I G ++ +Q I Sbjct: 447 YATDSLDSNGLMDAFSGISSGGGDISQQSIQL 478 >gi|328881544|emb|CCA54783.1| ChlI component of cobalt chelatase involved in B12 biosynthesis or ChlD component of cobalt chelatase involved in B12 biosynthesis [Streptomyces venezuelae ATCC 10712] Length = 689 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 23/144 (15%), Positives = 51/144 (35%), Gaps = 18/144 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++ ++ +L + + G Sbjct: 488 QATREGREGNLVLFVVDASGSMA-----ARQRMSAVKGAVLSLL-----LDAYQRRDKVG 537 Query: 220 LVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L+TF + P V ++ +L G T + GL A++ + + + Sbjct: 538 LITFRGRDAEVALPPTSSVDAAAARLEKLPTGGRTPLSAGLLKAHDVLRVERLRDASRRP 597 Query: 279 GHDDYKKYIIFLTDGENSSPNIDN 302 ++ +TDG + D Sbjct: 598 -------LLVVVTDGRATGGGADP 614 >gi|296454916|ref|YP_003662060.1| putative von Willebrand factor type A domain-containing protein [Bifidobacterium longum subsp. longum JDM301] gi|296184348|gb|ADH01230.1| putative von Willebrand factor type A domain protein [Bifidobacterium longum subsp. longum JDM301] Length = 1242 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/167 (22%), Positives = 60/167 (35%), Gaps = 25/167 (14%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM--LDIIKSIPDVN 213 T S +S +D+ VLD S SMND G +L +I + L + Sbjct: 497 TVSGTTTSGEKAKIDVAFVLDTSGSMNDKVGNS-TRLKNMQNAITDNGGLSSVLFNSPDK 555 Query: 214 NVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 ++ ++TF+S + T L+ + E +N L T GLE N Sbjct: 556 IDAQAHVITFASGLGLDGTSVLST-KADLDEVVNGLTANGATHWEKGLERVSNISTRPG- 613 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE-------SLFYCNE 311 KY++FLTDG+ + L C++ Sbjct: 614 -----------ATKYVVFLTDGDPGNKGWKETNVYSCGVLGLQTCDD 649 >gi|261855692|ref|YP_003262975.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261836161|gb|ACX95928.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 756 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 66/178 (37%), Gaps = 35/178 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +M++LD+S S+N+ G L ++ ++ + I + D + F Sbjct: 559 GRSIAVMLLLDLSESLNEKVAGSEQSILELSQEAVSLLAWAIDKLGDP-----FAIAGFH 613 Query: 225 SKIVQT--------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 S + W ++ ++ ++ G +T+ + +A + + K Sbjct: 614 SNTRHEVRYQHIKGYSERWD-DEVKARLAKIEAGYSTRMGAAMRHAAHYLSAQKAD---- 668 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-------GAIVYAIGVQAE 327 KK ++ LTDG+ S +D + +A++ G Y I + A Sbjct: 669 -------KKLMLILTDGKPSD--VDTSDDRLLIEDARQAVKELDQLGIFPYCISLDAS 717 >gi|332828898|gb|EGK01581.1| hypothetical protein HMPREF9455_02113 [Dysgonomonas gadei ATCC BAA-286] Length = 609 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 82/213 (38%), Gaps = 26/213 (12%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDK 191 + + PW NS + + + K + ++ + ++DVS SM G + Sbjct: 206 VRITTEVGSCPW--NSQNRLVKVGLKAKSLASDNLPASNFVFLIDVSGSM-----SGPTR 258 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIF 249 L + S++ +++ ++ R +V ++S + P G Q I+E +N L Sbjct: 259 LDLVKSSLKLLVN------NLRKKDRVAIVVYASSTGEVLPSTSGENKQKIKEALNNLSA 312 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 G +T G++ AY ++ KG ++ II TDG+ + N+ Sbjct: 313 GGSTAGGAGIQLAYKI------AKQNFIKGGNNR---IILCTDGDFNVGVSSNEGLQRLI 363 Query: 310 NEAKRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 ++ G + +G D ++ A Sbjct: 364 ENERKTGVFLSILGYGMGNYKDSKMQTLAQAGN 396 >gi|309356625|emb|CAP36866.2| hypothetical protein CBG_19659 [Caenorhabditis briggsae AF16] Length = 346 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 69/226 (30%), Gaps = 20/226 (8%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + F + S V + ++ LD++ V+D S M + Sbjct: 7 LLCLLFGFEICPSKCKDSYVDRVCGEDQKNLWLDIVCVVDNSAGM--------TNVAALI 58 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--IFGSTTK 254 S+ + P+ R G+VT++ L + E I++ + + Sbjct: 59 SSLFVDGQQLGIQPNNPRTTRVGIVTYNQGAHVIADL-NNFTSVDELIDKTFRVLNQVST 117 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 S+ +A + + + + Y+K +I T + +++ L K+ Sbjct: 118 SSDSYLHAGLEAANDLLEQQSFNTARGHYQKLVIVYT---SEYKGTGSQDPLPLATRMKQ 174 Query: 315 RGAIVYAIGVQAEAADQFLK---NCASPDRFYSVQNSRKLHDAFLR 357 + + E L A+P + N + Sbjct: 175 T-VSIATVAYNREDDLGVLSELTKIATPGYNF--TNDEDIVAELRS 217 >gi|298247107|ref|ZP_06970912.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297549766|gb|EFH83632.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 550 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 58/176 (32%), Gaps = 22/176 (12%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--------IQ 241 D+L ++ + + + + + + ++ F+S I +W V+ + Sbjct: 384 DQLKESSELLFDQTKARQYLLQTHPQDLTSVMVFNSDIAAGPDGSWTVEGNDPQKMRGLY 443 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + I T L+ + + +++ K+ II +TDG++ N Sbjct: 444 DNIQAREPDGGTNMYACLQRSVELF---------KQQPNENRKRLIIVMTDGQSEKGN-- 492 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 G V ++ ++A L ++ SV + DA Sbjct: 493 --GVDQIIQSVASLGVPVISVAFGSDADVTQLNEISTATH-GSVTKKDNMVDAMRE 545 >gi|23016188|ref|ZP_00055947.1| hypothetical protein Magn03010637 [Magnetospirillum magnetotacticum MS-1] Length = 408 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 54/171 (31%), Gaps = 23/171 (13%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 IR +G+++I+ I + + + L ET + K L + D + L A ++ + Sbjct: 7 IRRLCTGTEGAVAIMVGIGMTAMIGFLALGTETGLWYAAKRNLQSVADAAALGGAFELGS 66 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 N + + RN + G A I T + + Sbjct: 67 GSNSSVISAAAIQDAG-----------RNGFQATGGAT----IAVHTPPASGKYAGNPQM 111 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 +VS+ F + + T SV G ++ LD Sbjct: 112 VEVSVSQPTTLLFSALFLKSLQVNARAVAKTGSV--------GDACILALD 154 >gi|282897345|ref|ZP_06305347.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] gi|281197997|gb|EFA72891.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] Length = 449 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 61/186 (32%), Gaps = 29/186 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++++LD S SMN G K+ A +IR + K D + +V F Sbjct: 128 IIVLLDFSGSMNQIDSGGSKKIAGAINAIR---EFTKVSSDRGGDTQISVVPFGEAGKNC 184 Query: 231 FPLAWGVQHI--------------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + E ++ L +T L+ A + + ++ + Sbjct: 185 PEYTVNKDTLDKFLSASDFKLQNSLEYLSGLNPCGSTNLYQPLKKALEFLGNPEDPRFTL 244 Query: 277 AKGHDDYK--KYIIFLTDGENSSPNIDNKESLFYCNEAKR-----RGAIVYAIGVQAEAA 329 + + II L+DG +++ N NE K V+ +G + Sbjct: 245 PENSSEPNPRLSIILLSDGYHNAMN-----EFQDFNELKSLLQSYENITVHTLGYGLTPS 299 Query: 330 DQFLKN 335 +K Sbjct: 300 QLGIKY 305 >gi|257892784|ref|ZP_05672437.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] gi|257829163|gb|EEV55770.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] Length = 677 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 44/253 (17%), Positives = 88/253 (34%), Gaps = 56/253 (22%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + +D++MV+D S SM KL A + ++E + + + N +R G+V + Sbjct: 107 QLKKPIDLVMVIDYSSSMTGE------KLSNALKGLQEFGEELDDSLESGN-IRIGIVAY 159 Query: 224 SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTT---------------KSTPGLEYAYNKIF 267 + + T + ++ + N + T KS P E I Sbjct: 160 NRFVYSTDDFLTDINQLEYFLRNTAESHTGTFMQKGLLEGQSLLEEKSRPEAEKMLVHIG 219 Query: 268 DAKEKLEHIAKGHDDY---------------KKYII-FLTD-------GENSSPNIDNKE 304 D ++ K + +Y+ F TD G ++ PN + Sbjct: 220 DDSANRSYLPKENAQVFHNSGEIVDYNGYHTDQYVTEFQTDSEKYQTSGSSTDPNAVSVS 279 Query: 305 SLFYCNE-------AKRRGAIVYAIGVQAEAADQFLKN--CASPDRFYSV-QNSRKLHDA 354 S + K G Y++ + +++ +SP+ + S+ +N L +A Sbjct: 280 SSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLSIDENLTGLGNA 339 Query: 355 FLRIGKEMVKQRI 367 I + K + Sbjct: 340 LKEIANGIDKTIV 352 >gi|260834079|ref|XP_002612039.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] gi|229297412|gb|EEN68048.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] Length = 794 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 63/163 (38%), Gaps = 18/163 (11%) Query: 194 VATRSIREMLDIIKSIPDVNNVV-RSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFG 250 V+ ++ L + + +V+ R ++ ++S + F L + + + IN + Sbjct: 208 VSNADVKTFLKSVVAALNVSQTAARVAVIEYTSVMHSHFDLPTHLTNAQVTSAINSIPAW 267 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T +Y K A + L D K ++ +TD ++ +S Sbjct: 268 GST--------SYRKTGSAIKYLTDYLSWRDGIPKVLVVITDATSNDYVSGPAQS----- 314 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-YSVQNSRKLH 352 AK G I+ ++GV + L A+ + Y+V + + Sbjct: 315 -AKNAGLILSSVGVGTSISSTELNTIATNSSYRYTVSSYTDIV 356 >gi|242016552|ref|XP_002428850.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212513586|gb|EEB16112.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 1945 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 73/182 (40%), Gaps = 30/182 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+ +++++++D S S+ + ++ I++++ ++ P R G++ + Sbjct: 28 KTHKEVELIILVDGSYSVGPK--NFLSEMK----YIQKVISDVEVGP---KAFRLGVIIY 78 Query: 224 SSKIVQTFPLAWGV-----QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 S++ V + + + +I ++ G T + LE A + A++ Sbjct: 79 STQAVDHLSVTDELDLEKCSLLHVQIPKIKYPGKNTNTKEALEKAEKILTRARKGAV--- 135 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K I +TDG ++ N L K + I+Y G+ A + LK + Sbjct: 136 -------KIIFLITDGFSNMGN-----PLPMAQILKDQDTIIYTFGIINGNARELLKISS 183 Query: 338 SP 339 P Sbjct: 184 QP 185 >gi|330995096|ref|ZP_08319013.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|332879554|ref|ZP_08447249.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|329576672|gb|EGG58175.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|332682520|gb|EGJ55422.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 289 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+V+DVS S++ + R + + + + N + G++ F Sbjct: 72 EEERELTVMLVIDVSGSLSF------GTVSQTKREMVAEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I L +T +EY I Sbjct: 126 SDRIEKFIPPKKGRKHILRIIRELLDFTPQSKSTDIGQAIEYLTQAIK 173 >gi|297473022|ref|XP_002686353.1| PREDICTED: chloride channel accessory 2 [Bos taurus] gi|296489247|gb|DAA31360.1| chloride channel accessory 2 [Bos taurus] Length = 908 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 69/204 (33%), Gaps = 36/204 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQAAEFYLMQI---VEIHTFV--GIASFHSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + + + + + T GL+ + + + Sbjct: 362 AQLHQINNDDDRKLLVSYLPVAVSAEAETSVCSGLKKGFEVV---------EKLNGKAFG 412 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 +I +T G++ + +L G+ ++ I + + L + RF Sbjct: 413 SVMILVTSGDDEHISNCFLTAL-------SSGSTIHTIALGSSTVKNLEELSHLTGGLRF 465 Query: 343 YSVQ--NSRKLHDAFLRIGKEMVK 364 + N+ + DAF RI Sbjct: 466 FVPDKSNANSMIDAFSRISSGTGD 489 >gi|57956|emb|CAA79152.1| collagen alpha 1 chain type VI [Mus musculus] Length = 583 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 78/208 (37%), Gaps = 27/208 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H A R L ++ P + VR +V +S + Q Sbjct: 385 DITILLDSSASVGSH--NFETTKVFAKRLAERFLSAGRADP--SQDVRVAVVQYSGQGQQ 440 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY--NKIFDAKEKLEHIAKGHDDYKKYI 287 G +Q N + S+ S + A N + A K+ + Sbjct: 441 QP----GRAALQFLQNYTVLASSVDSMDFINDATDVNDALSYVTRFYREASSGATKKRVL 496 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK----------NCA 337 +F +DG + ++++ EA+R G ++ + V + + ++ + A Sbjct: 497 LF-SDGSQGATAEAIEKAV---QEARRAGIEIFVVVVGPQVNEPHIRVLVTGKTAEYDVA 552 Query: 338 SPDRF-YSVQNSRKLHDA--FLRIGKEM 362 +R + V N + L + + +++ Sbjct: 553 FGERHLFRVPNYQALLRGVLYQTVSRKV 580 >gi|307353172|ref|YP_003894223.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307156405|gb|ADN35785.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 231 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 38/173 (21%), Positives = 67/173 (38%), Gaps = 15/173 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 ++LD S SM + DK+ I D I V +++F K+ Sbjct: 21 LILDTSGSMMVN-----DKIAKLNEGIALFKDEIGKDELARKRVDLAVLSFGQKVNVIQD 75 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ + + L+ T ++ A + K E+ +G D Y+ +I +TD Sbjct: 76 FT-SIEEFEPE--ELVADGLTPMGEAIKKAVEMLGS--RKDEYKKEGIDYYRPWIFLITD 130 Query: 293 GENSS---PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDR 341 GE + + KE +E ++ G ++ V E AD + L A P R Sbjct: 131 GEPTDMYEGDEMWKEVTNLVHEGEKAGKFLF-FAVGVEDADLETLAKIAPPTR 182 >gi|303324778|pdb|2XGG|A Chain A, Structure Of Toxoplasma Gondii Micronemal Protein 2 A_i Domain gi|303324779|pdb|2XGG|B Chain B, Structure Of Toxoplasma Gondii Micronemal Protein 2 A_i Domain Length = 178 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 65/167 (38%), Gaps = 34/167 (20%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+ ++D S S G+ + + + L ++ P+ V + +VT+S+ + Sbjct: 18 LDICFLIDSSGS------IGIQNFRLVKQFLHTFLMVLPIGPEE---VNNAVVTYSTDVH 68 Query: 229 QTFPLAWGVQHIQEKINRLIFG----------STTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L W +Q +++ + +T ++ GL+ +F Sbjct: 69 ----LQWDLQS-PNAVDKQLAAHAVLDMPYKKGSTNTSDGLKACKQILFTGSR------P 117 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 G + K +I +TDGE + + ++ E + G IV + V Sbjct: 118 GREHVPKLVIGMTDGE----SDSDFRTVRAAKEIRELGGIVTVLAVG 160 >gi|257897779|ref|ZP_05677432.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835691|gb|EEV60765.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 819 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 51/136 (37%), Gaps = 23/136 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +S + Sbjct: 310 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSIE 363 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A N + Sbjct: 364 GYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNGH-------- 415 Query: 281 DDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 416 ---KKVIVLLTDGVPT 428 >gi|301609300|ref|XP_002934201.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 919 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 60/199 (30%), Gaps = 35/199 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM ++G ++ + I V G+V+FS+ Sbjct: 300 VTLVLDVSGSMASD-----GRIGRLYQAAEVFVMQI-----VEEGSHVGIVSFSTSTTVL 349 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L H++ + T G+ + Y Sbjct: 350 SKLVQVIDDTQRNHLK-FLLPKTAVGGTNICAGIREGIKV---------NNQHDGSSYGT 399 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYS 344 I+ LTDGE++ + G IV+ I + + + Sbjct: 400 EIVLLTDGEDNYNTSLCFPDIS------NSGIIVHFIALGPNPNPNLETIVDMTGGLRFL 453 Query: 345 VQ---NSRKLHDAFLRIGK 360 +++ L DAF + Sbjct: 454 ATDKVDAQGLIDAFSSLTA 472 >gi|300795696|ref|NP_001178680.1| voltage-dependent calcium channel subunit alpha-2/delta-4 [Rattus norvegicus] Length = 1145 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 71/193 (36%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D+S SM ++ +A ++ +LD + VN ++ ++ + Sbjct: 298 DIVILVDMSGSMKGL------RMAIAKHTVTTILDTLGENDFVN------IIAYNDYVHY 345 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ + L A+ + +E + Sbjct: 346 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGIVSQALIEAFQILKQFQESRQ-----G 400 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E +F R V+ IG + AD+ C Sbjct: 401 SLCNQAIMLVTDG-----AVEDYEPVFETYNWPDRKVRVFTYLIGREVTFADRMKWIACN 455 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 456 NKGYYTQISTLAD 468 >gi|293347024|ref|XP_001056965.2| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 4-like [Rattus norvegicus] Length = 1179 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 71/193 (36%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D+S SM ++ +A ++ +LD + VN ++ ++ + Sbjct: 298 DIVILVDMSGSMKGL------RMAIAKHTVTTILDTLGENDFVN------IIAYNDYVHY 345 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ + L A+ + +E + Sbjct: 346 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGIVSQALIEAFQILKQFQESRQ-----G 400 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E +F R V+ IG + AD+ C Sbjct: 401 SLCNQAIMLVTDG-----AVEDYEPVFETYNWPDRKVRVFTYLIGREVTFADRMKWIACN 455 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 456 NKGYYTQISTLAD 468 >gi|300776964|ref|ZP_07086822.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] gi|300502474|gb|EFK33614.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] Length = 800 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 83/237 (35%), Gaps = 34/237 (14%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + + +++ ++DVS SM+D +KL + S + + Sbjct: 413 PWNPKHKLLKIGLQGK-NLPMDKLPASNLVFLIDVSGSMSDE-----NKLPLLKSSFKVL 466 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLE 260 L+ ++ + G+V ++ P I E ++RL G +T G+E Sbjct: 467 LNQLRPKD------KVGIVVYAGSAGMVLPPTSAGEKDKIIEALDRLQAGGSTAGGAGIE 520 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY + K + +I TDG+ + + + ++ G + Sbjct: 521 LAYKLAQENFVKEGNNR---------VIIATDGDFNVGTSSISDLKTLIEDRRKSGVFLT 571 Query: 321 AIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHD--------AFLRIGKEMVKQR 366 +G D L+ A + + N ++ + + I K+M Q Sbjct: 572 CLGFGMGNYKDNTLETLADKGNGNYAYIDNMQEANKFLGKEFAGSMYAIAKDMKIQI 628 >gi|149019071|gb|EDL77712.1| procollagen, type XII, alpha 1, isoform CRA_c [Rattus norvegicus] Length = 1721 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 69/198 (34%), Gaps = 24/198 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + D+++++D S S+ I ++++ + P V+ L +S Sbjct: 30 TRAEADIVLLVDGSWSIGRA------NFRTVRSFISRIVEVFEIGPKR---VQIALAQYS 80 Query: 225 SKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + L + + + + L + + G+ A N I K + + Sbjct: 81 GDPRTEWHLNAHRDKKSLLQAVANLPYKGG-NTLTGM--ALNFIRQQSFKTQAGMRP--R 135 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 +K + +TDG++ + + K G ++AIG++ + PD Sbjct: 136 ARKIGVLITDGKSQDDVEAPSK------KLKDEGVELFAIGIKNADEVELKMIATDPDDI 189 Query: 341 RFYSVQNSRKLHDAFLRI 358 Y+V + L + Sbjct: 190 HAYNVADFESLSKIVDDL 207 >gi|148360702|ref|YP_001251909.1| hypothetical protein LPC_2649 [Legionella pneumophila str. Corby] gi|148282475|gb|ABQ56563.1| hypothetical protein LPC_2649 [Legionella pneumophila str. Corby] Length = 6289 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 50/145 (34%), Gaps = 12/145 (8%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ + + ++M++LD S SM + S E+L+ +++ + Sbjct: 5024 PVASNITRSGLANEGADTNLMLILDTSGSMAGS------GIQTLINSTLELLERYEALGN 5077 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 V VR VTF++ + V + + L G T L A N Sbjct: 5078 VK--VRI--VTFNTSATAIGSVWMTVDAAKNALLGLTAGGNTNFDAALITAMNAFNSGTV 5133 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENS 296 G Y F++DG + Sbjct: 5134 GGADGRIGGAQNVSY--FISDGNPT 5156 >gi|301166645|emb|CBW26221.1| putative membrane protein [Bacteriovorax marinus SJ] Length = 287 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 40/192 (20%), Positives = 67/192 (34%), Gaps = 34/192 (17%) Query: 167 IGLDMMMVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G +++ ++DVS SM D G L + S+ I ++ + G +S Sbjct: 113 KGKNIVFLIDVSGSMKTLDKMGQVKAGLKMLITSMPSDYQ----IDVIHFPGKRGARYYS 168 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + +NRL T + L+YA K D + Sbjct: 169 LWSYTQKLGERQKKDVYRFLNRLNPKGATPTRSALKYALTKYPDLTD------------- 215 Query: 285 KYIIFLTDGENSSPNIDNKESLF-YCNEAKR---RGAIVYAIGVQAE-------AADQFL 333 ++ L+DG + N + + +E K+ + + IGV A A FL Sbjct: 216 --VVLLSDGAPTKMNSSEYDDIKDILSEVKKDNFKNIQINTIGVGAAFSLQSTTPASVFL 273 Query: 334 KNCA--SPDRFY 343 K A S FY Sbjct: 274 KELAKQSGGFFY 285 >gi|229822438|ref|YP_002883964.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229568351|gb|ACQ82202.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 399 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 52/136 (38%), Gaps = 12/136 (8%) Query: 171 MMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + V+D + SM + + G +L + + + + R ++ + S+ + Sbjct: 71 LFFVVDRTGSMAAEDWNGGAPRLDGVRNDLVALTEAMAGA-------RYSIIGWDSQATR 123 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTP-GLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL + ++ + L + S ++ + DA E G+ + + Sbjct: 124 QLPLTTDARAVRSWADTLRQEVSAYSAGTAVDRPLEALRDALEGAAERNPGNV---RLVF 180 Query: 289 FLTDGENSSPNIDNKE 304 FL+DGEN++ + Sbjct: 181 FLSDGENTNGDDSAAG 196 >gi|126306102|ref|XP_001362319.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 911 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 50/209 (23%), Positives = 82/209 (39%), Gaps = 41/209 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D S SM G +++L A++ +L II+ +G+VTF S Sbjct: 307 LVLVIDTSRSM--KVGNRLNRLRQASQFF--LLQIIEKGSW------TGVVTFDSSATIQ 356 Query: 231 FPLAWGVQHIQEK--INRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L +Q K I+RL G GL A+ + + + Sbjct: 357 SELIQIESDVQRKTLISRLPTVTVAGGGAHICSGLRTAFMVV------KKKFLTDGSE-- 408 Query: 285 KYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAI--GVQAEAADQFLKNCASPDR 341 + LTDGE+++ N C E K+ GAI++ I G E + L + Sbjct: 409 --MALLTDGEDNTTNT--------CFEEVKQSGAIIHTIVLGPSTEKGLEKLSEMTGGMK 458 Query: 342 FYSVQNSRK--LHDAFLRI--GKEMVKQR 366 + N + L DAF + G + QR Sbjct: 459 TTATDNVQNNGLIDAFSALSSGNAAITQR 487 >gi|116623319|ref|YP_825475.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226481|gb|ABJ85190.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 306 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 75/228 (32%), Gaps = 42/228 (18%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S S +SD+ L + +++D S S P ++ LD + + D + Sbjct: 63 SITYFSKESDLPLTIGLLVDTSRSQRGVLEPE-------RKASFTFLDQV--LRDGKDFA 113 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +V F + + P + + RL + T + IF A Sbjct: 114 --CVVAFDTDVRLLQPFTSSHAELAGALERLRIPNQTAT---------VIFGAIRDTAEN 162 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI-------------- 322 +K I L+DG + ++ Y A I+Y+I Sbjct: 163 QMRPRKGRKAFIILSDGVSVRDTTTIGTAIEYAQRADT---IIYSILFADHRGLRRPARK 219 Query: 323 ---GVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 G++A + ++ A + F+ V S + + I + + Q Sbjct: 220 AAMGMRALQGKKAMQRLAQETGGEFFEVSASNPITRTYAAIEETLRNQ 267 >gi|17555634|ref|NP_498247.1| CUTiclin-Like family member (cutl-23) [Caenorhabditis elegans] gi|13592472|gb|AAK31565.1| Hypothetical protein Y37B11A.1 [Caenorhabditis elegans] Length = 789 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 59/184 (32%), Gaps = 30/184 (16%) Query: 135 MPFIFCTFPWCANSSHA---------PLLITSSVKISSKSDI---GLDMMMVLDVSLSMN 182 +P + + ++ P ++ + V + + LD++ +LD S S+ Sbjct: 7 LPLLILAVTFLRDTKAVKIIDNGLAPPEIVHTPVSTKPRCKVFAPPLDLVFILDSSGSLR 66 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHI 240 D F +D + I+K + R L+ FS F + + Sbjct: 67 DKFQDEIDIIRR----------ILKHVTIGKTATRVMLIQFSGTQHLEFNFEKFTDREEL 116 Query: 241 QEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 ++ L T+ E+A ++ + K + L+DG Sbjct: 117 LAALDVLRHVSGITRIGGAFEFALQQLKTPG----SGLRDGTVP-KIVYLLSDGRTHDFP 171 Query: 300 IDNK 303 D + Sbjct: 172 KDWQ 175 >gi|95147674|ref|NP_001035616.1| complement factor B precursor [Bos taurus] gi|146345391|sp|P81187|CFAB_BOVIN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=EC-VMFB; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|86438491|gb|AAI12505.1| Complement factor B [Bos taurus] gi|296474252|gb|DAA16367.1| complement factor B precursor [Bos taurus] Length = 761 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 75/211 (35%), Gaps = 34/211 (16%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A +R+ ++ + S GLVT++++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSVGAHNFTGAKNCLRDFIEKVASYGVKPKY---GLVTYATE 317 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + +++N++ + T + L YN + ++E + Sbjct: 318 PKVLIRVSDPKSSEADWVTDQLNQINYADHKLKAGTNTKRALLEVYNMM--SREVNQFKE 375 Query: 278 KGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRG------AIVYAIGVQAEA 328 + + II +TDG ++ P + + + + R +Y GV Sbjct: 376 TWNRT-RHVIIIMTDGLHNMGGDPVTVIHDIRYLLDIGRNRKNPREDYLDIYVFGVGPLV 434 Query: 329 ADQFLKNCAS----PDRFYSVQNSRKLHDAF 355 + + AS + +Q L D F Sbjct: 435 NQENINALASKKDKEKHVFKLQGMENLEDVF 465 >gi|327270792|ref|XP_003220172.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 921 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 72/212 (33%), Gaps = 41/212 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S M ++L ++ + L I + G+VTF+SK Sbjct: 315 VCLVLDASAQMGKD-----NRLSRLIQAAKLFLLHI-----IEKGSWVGIVTFNSKGNIQ 364 Query: 231 FPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + G+ A+ E Sbjct: 365 AGLQRIFSDIEREGLTSHL-PTTAAGDCNICEGVNAAFQVFSQKLTSTEGCE-------- 415 Query: 286 YIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFY 343 I+ LT+GE S L C ++ + + I++ I ++A+++ K + + + Sbjct: 416 -IVLLTNGEGSD--------LSPCLSKNQSQEIIIHTIAFGSKASNELEKLADMTGGKTF 466 Query: 344 SVQNSRK---LHDAFLRIGK---EMVKQRILY 369 +S L DAF I + +Q I Sbjct: 467 YATDSLDSNGLIDAFGGISSGSGDASQQSIQL 498 >gi|257884610|ref|ZP_05664263.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] gi|257820448|gb|EEV47596.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] Length = 1107 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 52/136 (38%), Gaps = 23/136 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +SS+ Sbjct: 267 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 320 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A N + Sbjct: 321 GYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGNMLSVPNGH-------- 372 Query: 281 DDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 373 ---KKVIVLLTDGVPT 385 >gi|291240911|ref|XP_002740358.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii] Length = 1126 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 59/184 (32%), Gaps = 28/184 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++++D S SM H + +L D + N++R + Sbjct: 873 VVVLVDTSGSMVSHMEELVRELVSLI------WDQFQRENIKFNIIRFSGNVEKWRHQIV 926 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL ++ + T + L A+N + L Sbjct: 927 DPLEENCHDAVRWVSTFVASGNTCTLEALYEAFNDRNIDG----------------VYLL 970 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKNCA--SPDRFYSVQ 346 TDG+ S + + N + G ++ I E+A+ FL+ + S RF+ Sbjct: 971 TDGKPDSSTSLVLKEIARLNTTR--GVKIHTISFNCQDESANIFLRQLSAMSRGRFHRCN 1028 Query: 347 NSRK 350 R Sbjct: 1029 AERD 1032 >gi|150260369|ref|ZP_01917097.1| hypothetical protein YPE_2670 [Yersinia pestis CA88-4125] gi|218927792|ref|YP_002345667.1| hypothetical protein YPO0595 [Yersinia pestis CO92] gi|229837271|ref|ZP_04457434.1| hypothetical protein YPS_1185 [Yersinia pestis Pestoides A] gi|229840485|ref|ZP_04460644.1| hypothetical protein YPH_2827 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842971|ref|ZP_04463122.1| hypothetical protein YPF_1319 [Yersinia pestis biovar Orientalis str. India 195] gi|229900821|ref|ZP_04515945.1| hypothetical protein YP516_0480 [Yersinia pestis Nepal516] gi|115346403|emb|CAL19275.1| conserved hypothetical protein [Yersinia pestis CO92] gi|149289777|gb|EDM39854.1| hypothetical protein YPE_2670 [Yersinia pestis CA88-4125] gi|229682160|gb|EEO78252.1| hypothetical protein YP516_0480 [Yersinia pestis Nepal516] gi|229690037|gb|EEO82095.1| hypothetical protein YPF_1319 [Yersinia pestis biovar Orientalis str. India 195] gi|229696851|gb|EEO86898.1| hypothetical protein YPH_2827 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705394|gb|EEO91404.1| hypothetical protein YPS_1185 [Yersinia pestis Pestoides A] gi|320016845|gb|ADW00417.1| hypothetical protein YPC_3985 [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 212 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 63/171 (36%), Gaps = 12/171 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + I+ M+ ++ P V ++T+ ++ Sbjct: 3 RLPVYLLIDTSGSMRGE------SIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL +++ Q + T + LE + + ++ + KG + Sbjct: 57 REYIPLT-ALENFQFTDITVPSAGGTFTGAALECLIHCVDRDIQRSDGDQKGDWRP--LV 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 +TD S+P+ KR + A V A+A + LK S Sbjct: 114 FLMTD---STPSDVYAYGEAIKEVKKRAFGSIIACAVGAKAKHEHLKQLTS 161 >gi|327189219|gb|EGE56398.1| hypothetical protein RHECNPAF_7008 [Rhizobium etli CNPAF512] Length = 524 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 60/192 (31%), Gaps = 25/192 (13%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 + LD S SM G G D+L A R + + + + + + ++ F S + F Sbjct: 347 ALCLDFSGSMQ---GNGEDQLQKAMRFLLTPDEASRVLVQWSPSDQIIVIPFDSSVRNMF 403 Query: 232 PLAWGV---QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + + +++R T E A +I ++ I+ Sbjct: 404 TASGNPLEQEGLLNEVSRQKADGGTNMYACAERALQQIARTGRLSTYLPA--------IV 455 Query: 289 FLTDG---ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +TDG + S + ++ ++ I +A L A Sbjct: 456 IMTDGRSDDQSQAFMSEWNTIEP-------RVPIFGITFG-DADKTQLDTLAKQTSARVF 507 Query: 346 QNSRKLHDAFLR 357 L AF Sbjct: 508 DGGSDLATAFRT 519 >gi|149922178|ref|ZP_01910616.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] gi|149816918|gb|EDM76403.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] Length = 689 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 43/126 (34%), Gaps = 13/126 (10%) Query: 245 NRLIFGSTTKSTPGLEYAYN---KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 N GS T + GL+ + + +D+ + I +TDG+ + + Sbjct: 414 NTFCSGSGTYTHLGLQLIKDNQVQYQADGLMDGAEFPTNDETIYFNILITDGQYNGYST- 472 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQ----AEAADQFLKNCAS-----PDRFYSVQNSRKLH 352 N + E G Y IG AA L+N A + +Y N +L Sbjct: 473 NAQVQGELEEMYNDGITTYVIGFGDGVDTPAAMAQLQNMAQWGSGDSENYYDANNQAELE 532 Query: 353 DAFLRI 358 A I Sbjct: 533 AALTSI 538 >gi|327542239|gb|EGF28728.1| protein containing von Willebrand factor [Rhodopirellula baltica WH47] Length = 291 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 23/97 (23%), Positives = 41/97 (42%), Gaps = 10/97 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +M+++D S S++ R + L ++ + N R GL FS + Sbjct: 77 LAVMLMVDCSASLDF------GTQTQTKRELVTELGATLAMSAIKNNDRVGLTLFSEDVE 130 Query: 229 QTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEY 261 ++FP G +H+ I ++ GS T LE+ Sbjct: 131 KSFPPRQGSRHVLRLIREMLTHPCSGSGTDVGAALEH 167 >gi|315231887|ref|YP_004072323.1| hypothetical protein TERMP_02126 [Thermococcus barophilus MP] gi|315184915|gb|ADT85100.1| hypothetical protein TERMP_02126 [Thermococcus barophilus MP] Length = 1614 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 51/134 (38%), Gaps = 24/134 (17%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS--S 225 +D++ ++D S SM+ ++ + + L+ I VR LVTF+ Sbjct: 90 PIDVVFIIDRSDSMD----SYIEAIKNSAYQFSYDLERIGG-----ENVRFALVTFANYD 140 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTK-STPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL V E +N + T+ S G+ A + F+ + Sbjct: 141 DARIDLPLTNNVSEFVEALNSIYTAGGTEWSFGGILKALDLEFNPN------------AQ 188 Query: 285 KYIIFLTDGENSSP 298 K I +TD ++ SP Sbjct: 189 KVFIVVTDEDDQSP 202 >gi|297281590|ref|XP_001082306.2| PREDICTED: calcium-activated chloride channel regulator 4-like, partial [Macaca mulatta] Length = 166 Score = 47.9 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 37/168 (22%), Positives = 61/168 (36%), Gaps = 32/168 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM D+L ++ + L I V N G+V F+S Sbjct: 20 VCLVLDKSESMA-----SYDRLNQMNKAAKYFLLQI-----VENGSWVGMVHFNSTATIV 69 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + K+ T G++ A+ I E H+ Sbjct: 70 NKPIQIISSDERNTLLAKL-PTYAWGGTSICSGIKSAFQVI---GELSSHLDGSE----- 120 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ LTDGE+ + + +E KR GAIV+ I + A + Sbjct: 121 -VVLLTDGEDYTAS-------SCIDEVKRSGAIVHFIALGTAADKAVI 160 >gi|225444686|ref|XP_002277641.1| PREDICTED: hypothetical protein [Vitis vinifera] gi|297738541|emb|CBI27786.3| unnamed protein product [Vitis vinifera] Length = 756 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 57/185 (30%), Gaps = 23/185 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIVQ 229 ++ V+D+S SM L ++ L + S + + +G + FSS + Sbjct: 326 VVFVVDISGSMRGKL------LEDTKNALSAALSKLDSKDSFSIIAFNGEIFIFSSSVQL 379 Query: 230 TFPLAWGVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 A +++ + I+ I G T + A + I Sbjct: 380 ATKEA--IENAIQWISMNFIAGGDTNILLPMNKAMELFSHS---------PGSIP--IIF 426 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQ 346 +TDG Y + +Y G+ FLK A + + Sbjct: 427 LITDGSVEDERHICDVMTSYLTNEESIHPRIYTFGIGLYCNHYFLKMLAMIGRGHYDAAY 486 Query: 347 NSRKL 351 ++ + Sbjct: 487 DANSI 491 >gi|158523143|ref|YP_001531013.1| von Willebrand factor type A [Desulfococcus oleovorans Hxd3] gi|158511969|gb|ABW68936.1| von Willebrand factor type A [Desulfococcus oleovorans Hxd3] Length = 1006 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 80/228 (35%), Gaps = 50/228 (21%) Query: 169 LDMMMVLDVSLSMND-----HFGPGMDK------------LGVATRSIREMLDIIKSIPD 211 + ++++D+S S + DK + + S+ M + + ++ D Sbjct: 795 MSTLLLIDMSASTEETAPEVSAEDSQDKKGGKSSRDDKRVIDIEKESLIVMSEALDALGD 854 Query: 212 VNNVVRSGLVTFS-------SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + FS V + ++ +I + +T+ + +A + Sbjct: 855 QY-----AMYGFSGHGREHVDYYVIKSFDESNTEKVKMRICGIEPRQSTRMGTAIRHAVS 909 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDG---------ENSSPNIDNKESLFYCNEAKRR 315 K+ + + + +I L+DG + +S +++ EAKR Sbjct: 910 KLSNREADH-----------RLLILLSDGFPQDLDYGEDRNSREYGLNDTMMAFIEAKRL 958 Query: 316 GAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 G + I + D K CA P+ + +++ L + I + ++ Sbjct: 959 GIKPFCITIDQSGNDYLKKMCA-PEEYLIIKDIAMLPELLPGIVESLM 1005 >gi|119493487|ref|ZP_01624154.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452670|gb|EAW33850.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 843 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 55/177 (31%), Gaps = 32/177 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV- 228 D++ ++D S S G + K R + L+ + ++ FS Sbjct: 343 DVVFLIDTSGS---QRGEPLAKSKQLMRRFIQSLNPDDTFS---------IIDFSDTTTA 390 Query: 229 -QTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL Q IN+L T+ + + + + Sbjct: 391 LSATPLTNTVTNQQKAIAYINQLEANGGTE-----------LLNGIQTVMDFPSPPVKRL 439 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + I+ +TDG + N E L + G +Y+ GV + L A R Sbjct: 440 RSIVLITDGYIGNEN----EVLSVVKNQLKSGNRLYSFGVGSSVNRFLLNRLAEIGR 492 >gi|109083237|ref|XP_001114797.1| PREDICTED: cochlin-like isoform 4 [Macaca mulatta] Length = 550 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 330 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 388 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 389 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKEDVLAVIRNI 437 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 438 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 484 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 485 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 516 >gi|326433564|gb|EGD79134.1| hypothetical protein PTSG_09864 [Salpingoeca sp. ATCC 50818] Length = 490 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 71/219 (32%), Gaps = 29/219 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + ++ V D S SM+DH L A + + + V L Sbjct: 38 PERRQARMHVLFVADNSGSMSDHMQSVNTGLANAITAC------------MQHGVHPNLC 85 Query: 222 TFSSKIVQT-FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FS + + P G + I L + T + +++ L +A+ Sbjct: 86 NFSDVLNERHLPYDQGAAAVASTIQNLGCDTLTD----FDIVVDRLVAEMNMLLGVARAQ 141 Query: 281 DDYKK--YIIFLTDGENSSPNIDNKESL------FYCNEAKRRGAIVYAIGVQAEAADQF 332 D + +++ +TDG+ S P+ D L F V A+GV + +F Sbjct: 142 QDRQHRVFLVVMTDGQASMPSEDKFAHLQRLIEEFTALSVHSNEVNVLALGVGGDHQGEF 201 Query: 333 LKNCAS----PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L + +RF+ + I M Sbjct: 202 LDRLSKVVPNSNRFFQCAAGETTDELTNSITDAMGHLTT 240 >gi|323498500|ref|ZP_08103493.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio sinaloensis DSM 21326] gi|323316389|gb|EGA69407.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio sinaloensis DSM 21326] Length = 697 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 66/205 (32%), Gaps = 36/205 (17%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G D + VLD S SM+ + ++ + + +P + R +V F S Sbjct: 311 GRDWVFVLDKSGSMSGKYSTLVEGVR----------QGLGKLPSED---RFRVVMFDSNT 357 Query: 228 VQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 A V + + ++ + T G+ A K+ D + Sbjct: 358 YDLTGGFVAVNAANVTKALQSVEQVEPSNGTNLYEGMAAATRKLDDDRPTG--------- 408 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 I+ +TDG + + + + ++ ++ + A L S Sbjct: 409 ----IVLVTDGVANVGVTEKRRFF---DLMEKHDVRLFTFIMGNSANTPLLVPMTKLSNG 461 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQ 365 SV N+ + + I ++ Q Sbjct: 462 IATSVSNADDIIGHLMSITSKLTYQ 486 >gi|301619004|ref|XP_002938896.1| PREDICTED: von Willebrand factor A domain-containing protein 1-like [Xenopus (Silurana) tropicalis] Length = 496 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 73/208 (35%), Gaps = 33/208 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S S++ + + I +L P V++ +V S+ V Sbjct: 36 DLIFLLDSSGSVSYY------EFAKVREFIGNLLRPFTFGP---QDVQASIVHISTNPVL 86 Query: 230 TFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 FP + ++I R I T + L Y +FD + + + Sbjct: 87 EFP--FNQYGSSQEIQRAIQNIKQRMGDTNTGKALSYIKENLFDER------SGSRAEVP 138 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD---- 340 K ++++TDG ++ + L K G V+ + + ++P Sbjct: 139 KVMVWVTDGLSTDDISQPMQLL------KDMGVTVFIVSTGRGNYLELSAAASTPSDTHL 192 Query: 341 RFYSVQNSRKLHDAFLR-IGKEMVKQRI 367 F V + + I + + +R+ Sbjct: 193 HFVDVDDLHIITKELRDSIIELIRARRL 220 >gi|300688886|gb|ADK31557.1| conserved hypothetical protein [Brachyspira pilosicoli 95/1000] Length = 509 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 54/168 (32%), Gaps = 24/168 (14%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T I +K + +D+++VLD + SM P + + +SI + + Sbjct: 342 TQVYNIINKYNSAVDLVLVLDTTESM----HPYLTSIKEEIKSISKQVFKKDINS----- 392 Query: 216 VRSGLVTFSS----KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R G + + + + + I +N + Y I A E Sbjct: 393 -RIGFLLYRDVKDTYLTKKIDFDNNINKIYRDVNYFYASGGGDKAEPM---YEAIQKALE 448 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 ++ K +I +TD N ++ AK + + Sbjct: 449 DFDYKNDN-----KVVIVITDAPAKVIGKANADTNK--KTAKEKNIKI 489 >gi|239613390|gb|EEQ90377.1| U-box domain-containing protein [Ajellomyces dermatitidis ER-3] Length = 766 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 76/208 (36%), Gaps = 30/208 (14%) Query: 170 DMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTF 223 D+++ +D+S SM+ P D G + +LD+ K I +N+ R G+V F Sbjct: 75 DIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDNDRLGVVAF 134 Query: 224 SSKIVQTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S+ + ++ + + + L S+T GL+ + + E Sbjct: 135 STDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEAL------EEVTPIP 188 Query: 280 HDDYKKYIIFLTDG--------ENSSPNIDNKESLFYCNEA--KRRGAIVYAIGVQAEAA 329 + YI LTDG + + + Y ++A K R +++ G Sbjct: 189 QNVQALYI--LTDGMYRIVRSRVPHANASKFRHAKSYVSKAGQKDRLPMIHTFGFGYYIR 246 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAF 355 L+ + + + ++ + F Sbjct: 247 SGLLQAISEVGGGTYSFIPDAGMIGTVF 274 >gi|114570700|ref|YP_757380.1| vault protein inter-alpha-trypsin subunit [Maricaulis maris MCS10] gi|114341162|gb|ABI66442.1| Vault protein inter-alpha-trypsin domain protein [Maricaulis maris MCS10] Length = 740 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 56/173 (32%), Gaps = 32/173 (18%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V+D S SM + A ++ L ++ R ++ F + + Q FP Sbjct: 349 VIDNSGSMGGA------SMRQARAALITALQRLEPGD------RFNVIRFDNTMEQVFPQ 396 Query: 234 A-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 A V RL T P L A + I+ Sbjct: 397 AVDASPDNVATALTFARRLEAQGGTVMLPALNAALRDTSPDDDSRVRQ----------IV 446 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 FLTDG I N+ LF EA + ++ +G+ + F+ A R Sbjct: 447 FLTDG-----AIGNEAELFAAIEAGLGRSRLFPVGIGSAPNGYFMSRAARLGR 494 >gi|90411204|ref|ZP_01219217.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] gi|90328050|gb|EAS44371.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] Length = 436 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 56/169 (33%), Gaps = 8/169 (4%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 + KG ++I + + V+ L ++ + K KL I+D + L A I + Sbjct: 11 YRAQKGVVAIFATLAMVVLIGAGALALDVGNLVLSKGKLQNIVDSAALSAAKAIDLGGDQ 70 Query: 70 NNGKKQKNDFSYRIIK-------NIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 N+ + I TD E ++ N S + + I+D Sbjct: 71 AEAIVAGNEAINNNLTLDGFGSMTIDNTDIHYEFSDSLPFDSSTNTATSPYVRVRIEDVD 130 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS-SVKISSKSDIGLD 170 L A+ +M ++S + S+ S+S L Sbjct: 131 VADYLVAIFNIDMSARSSAVAGPSSSITTTCNVVPLSICEGSESSTTLS 179 >gi|300788143|ref|YP_003768434.1| hypothetical protein AMED_6298 [Amycolatopsis mediterranei U32] gi|299797657|gb|ADJ48032.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 534 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 81/205 (39%), Gaps = 27/205 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ + +++DVS SMN G ++++ V + + + + K + ++ FS Sbjct: 332 NLSARVQVLIDVSGSMNAQVPGTNLNRMQVTMEAAAKAMHLFKPATQLR------MLAFS 385 Query: 225 SKIVQTF------PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +++ P+A QH+ L K+TP ++D+ A+ Sbjct: 386 TRLDGDKDYRELLPMASVAQHLAS--GALEKLGQVKATPDGGTG---LYDSVLDTYRTAR 440 Query: 279 GHDDYKKY--IIFLTDGENSSPNIDNKESLFY----CNEAKRRGAIVYAIGVQAEAADQF 332 + + +I +TDG N P ++ L +A+ R + +G+ +A Sbjct: 441 REWEPGRLNLVIVMTDGRNEDPRGISRADLLTELAGLQDAR-RPIPLIGVGIGPDADKAE 499 Query: 333 LKN--CASPDRFYSVQNSRKLHDAF 355 L A+ + + + K+ D F Sbjct: 500 LDQLTAATGGQAFLAPDPAKITDVF 524 >gi|284997525|ref|YP_003419292.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|284445420|gb|ADB86922.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] Length = 380 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 75/190 (39%), Gaps = 33/190 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G ++ LD S SM+ + K+ +A + +++ K IP N +TFS Sbjct: 34 SATGFHYIVALDTSGSMSGY------KIELAK---QGAIELFKRIPKGNK---VSFITFS 81 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S + + +I ++ G T + A +AK + Sbjct: 82 SNVNVIKEFV-DPLDLTNEILQIAAGGQTALYTAILTA-----------NSLAKKYQMPT 129 Query: 285 KYIIFLTDGENSS-PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 Y++ LTDG + N+ N L Y + VY+ G+ + +Q L+N + + Sbjct: 130 -YLLLLTDGNPTDETNVGNYLKLPYFEKM-----QVYSFGIGDDYNEQLLQNISDKTSGV 183 Query: 342 FYSVQNSRKL 351 Y + ++ ++ Sbjct: 184 MYHISDANEI 193 >gi|197101601|ref|NP_001126282.1| inter-alpha-trypsin inhibitor heavy chain H1 [Pongo abelii] gi|55730948|emb|CAH92192.1| hypothetical protein [Pongo abelii] Length = 911 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 71/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLQGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y +++ + F Sbjct: 456 QRIYEDRDATQQLQGFYS 473 >gi|332708216|ref|ZP_08428205.1| hypothetical protein LYNGBM3L_12190 [Lyngbya majuscula 3L] gi|332353010|gb|EGJ32561.1| hypothetical protein LYNGBM3L_12190 [Lyngbya majuscula 3L] Length = 453 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 82/251 (32%), Gaps = 42/251 (16%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCT---FPWCANSSHAPLLITSSVKISSKSDIGLD 170 L+ ++D+ + + Y+ P T F N+ + Sbjct: 43 LNATVEDEQVKIRIQVKNEYDQPVTNLTDENFQVYVNNDKVTFKPKHWKNPTKAEQPSAW 102 Query: 171 MMMVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--- 225 ++ +LD+S SM D G KL AT ++R + IK+ D ++ + +V F Sbjct: 103 IIFLLDMSGSMAKPDKPGSSQSKLEGATAAMR---EFIKNTADRSSHTQVAIVPFGEPHP 159 Query: 226 -------------KIVQTFPLAWGVQHIQEKINRLIFGS---TTKSTPGLEYAYNKIFDA 269 + + FP+ G +Q + L T L A N + + Sbjct: 160 KNCPKGGYPVNQATLNKFFPV--GDSELQNYLGELASKKPCAATNVYKPLTEAVNFLSEQ 217 Query: 270 KEKLEHIAKGH-------DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-----RRGA 317 + K + II L+DG +S+ E + K Sbjct: 218 RGDPRFYPKKEKFWQPQPQQPRLSIILLSDGYHSTSATKEAEEQEF-ETLKRLIKVNDNI 276 Query: 318 IVYAIGVQAEA 328 IV+ +G + Sbjct: 277 IVHTLGYGLKP 287 >gi|156364713|ref|XP_001626490.1| predicted protein [Nematostella vectensis] gi|156213368|gb|EDO34390.1| predicted protein [Nematostella vectensis] Length = 1043 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D+ ++LD S +++ + K D++ + R + TFS+ Sbjct: 90 AAIDLAILLDASEAISPQEWSKLLKFTA---------DLMDYYGISEDGTRISVATFSTD 140 Query: 227 IV------QTFPLAWGVQHIQEKI--NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + + + ++ I + G + L+ A +F+ Sbjct: 141 VDIVLSFNEFSGVEMNAASVKRGILGAKQSRGPGLRIDKALKAADKDLFN------RRFG 194 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 +D KK + +T G + + + RG +YA+GV L+N AS Sbjct: 195 MREDQKKVCLLVTSGAQTKDQGPSTQLGTVTALLSARGVDIYAVGVGDGVDSSELRNIAS 254 Query: 339 PDRF-YSVQNSRKLHDAFLRIGK 360 + F Y+ + +++ G Sbjct: 255 TEDFIYTASSFEEINKVLEPFGT 277 >gi|71984286|ref|NP_498819.2| C-type LECtin family member (clec-160) [Caenorhabditis elegans] gi|47117847|sp|P34393|CL160_CAEEL RecName: Full=C-type lectin domain-containing protein 160; Flags: Precursor gi|28894818|gb|AAK84522.2|L11247_8 C-type lectin protein 160, confirmed by transcript evidence [Caenorhabditis elegans] Length = 639 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 65/198 (32%), Gaps = 25/198 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ V+DVS M G L + I ++ + P++ V+ GL+ +S K Sbjct: 288 IDIIFVIDVSEGM------GQGGLMMVKAEINTLVGQMSLDPNIQKHVQVGLIKYSDKAE 341 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK-----IFDAKEKLEHIAKGHDD- 282 F + + N F S P LE K + ++ + + Sbjct: 342 VVFKPS-------DYTNEDEFTEDLWSDPRLEDVDEKSDEVNLHLGLQQAAKMTASMRNG 394 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN---CASP 339 +K I+ N N D ++ + G + + + + ASP Sbjct: 395 VRKVIVVYAASYNDEGNDDARQ---IAANIRETGYAIITVAFVEPESSNLVMKIGEIASP 451 Query: 340 DRFYSVQNSRKLHDAFLR 357 ++ L + Sbjct: 452 RMNFTSFRDDLLVEQMED 469 >gi|296274053|ref|YP_003656684.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296098227|gb|ADG94177.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 2811 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 43/266 (16%), Positives = 85/266 (31%), Gaps = 23/266 (8%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERS 111 LD S T T +N + N W + ++ + + + + Sbjct: 1721 LDGSESLTVTITNVPDNATLTTNNSSYTLTNNHNNTWTVNLPEGAKDVSDSITMTVPKGT 1780 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM 171 ++ + I + + N + + L +S + I ++ Sbjct: 1781 ENIDLGITARATEANDNIDGDNYAETTDSDAVVYSEDETQTLNFDG----ASSAAIATNV 1836 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++ LDVS SM + +++L +A ++ +M++ +S VN LVTF+ Sbjct: 1837 VITLDVSGSMTSN-DEHVNRLALAKEALAKMINEYESQGSVN----VKLVTFNDDGHAVN 1891 Query: 232 PLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 W + IN L G T + YN + F+ Sbjct: 1892 --TWMSAKDAISAINNLSSGGKTNYEDAVYETYNNYTEPSADRT-----------VAYFI 1938 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRG 316 +DGE + N + + G Sbjct: 1939 SDGEPTKENNEGCDPCNNIGTDSENG 1964 >gi|14030589|gb|AAK52969.1|AF367015_1 serum opacity factor VT37.1 [Streptococcus pyogenes] Length = 1015 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 73/173 (42%), Gaps = 11/173 (6%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + +T V + D G D+M +LDVS M D F DK+ ++ Sbjct: 212 TIDVTVPVT-PKEIDKGADVMALLDVSKKMTEDDFNNAKDKIKKLVTTLTSK--SADGQQ 268 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 ++NN R L+TF +I ++ + + + ++ L + G++ I A Sbjct: 269 NLNNRNRVRLMTFYREISDPIDISGKTETQLDQLLDELREKAKANYDWGVDL-QGAIHKA 327 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 +E + + +++I+ + GE++ + D K N++K + A + + Sbjct: 328 REIFKRDQEKKSGKRQHIVLFSQGEST-FSYDIKNK----NDSKLKKARLTTV 375 >gi|260904334|ref|ZP_05912656.1| von Willebrand factor type A [Brevibacterium linens BL2] Length = 358 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 61/157 (38%), Gaps = 20/157 (12%) Query: 156 TSSVKISSKS-DIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 +K S++ + D+ ++D + SM + + +L + +MLD+ K +P Sbjct: 55 GIPIKSSTEEYEAAADVYFLVDTTTSMAAEDYDGDKTRLEGVKK---DMLDLAKQLP--- 108 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R +++F+S PL ++ L + S I +A +L Sbjct: 109 -GTRLSIISFASTASTVMPLTTDHAAFASAVDVLSPEMSLNSNGS------SITEAGAEL 161 Query: 274 EHIAKGHDD----YKKYIIFLTDGENS-SPNIDNKES 305 + K + + + + DGE + ++D+ S Sbjct: 162 DKRMKSNQEDRPDNNSLVFYFGDGEQTAETSVDSWSS 198 >gi|332974518|gb|EGK11438.1| PilC protein [Kingella kingae ATCC 23330] Length = 1328 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/144 (19%), Positives = 55/144 (38%), Gaps = 19/144 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGP--GMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + ++M++LD S SM G + G+ S+ +++D S + Sbjct: 42 TTVRGMQGAKPNIMLLLDDSGSMRAEVPGSYGQTRQGILRNSLSKIVDKYGSRINW---- 97 Query: 217 RSGLVTFSSKIVQTFPLAWGVQ--HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 GLV+F+ + L+ G + I TT + A N + + + Sbjct: 98 --GLVSFNDSS-SRYNLSLGTSYLTVANAIRNFPASGTTPTITSYLKAVNMLNEGIKYRC 154 Query: 275 HIAKGHDDYKKYIIFLTDGENSSP 298 K Y++ L+DG+++ P Sbjct: 155 Q--------KSYVVLLSDGDSNWP 170 >gi|282896738|ref|ZP_06304746.1| Magnesium chelatase ATPase subunit D [Raphidiopsis brookii D9] gi|281198456|gb|EFA73344.1| Magnesium chelatase ATPase subunit D [Raphidiopsis brookii D9] Length = 669 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 66/205 (32%), Gaps = 40/205 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ V+D S SM ++++ A ++ ++L N + L+ F Sbjct: 469 KAGALVVFVVDASGSMA------LNRMQSAKGAVMQLLT-----ESYQNRDQIALIPFRG 517 Query: 226 K-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + + ++ +L G + GL A +A+ G D + Sbjct: 518 EQAEVLLPPTRSIALAKNRLEKLPCGGGSPLAHGLTQAVRVGVNAQ-------MGGDIGQ 570 Query: 285 KYIIFLTDG--------------ENSSPNIDNKESLFYCNEAKRRGAIVYAI-----GVQ 325 I+ +TDG E E L + G + I V Sbjct: 571 VVIVAITDGRGNIPLSRSLGESPEPGEKPDIKGELLDIAGRIRASGMQLLVIDTESKFVS 630 Query: 326 AEAADQFLKNCASPDRFYSVQNSRK 350 A + K + ++Y + + Sbjct: 631 TGFAKELAKT--AGGKYYQLPKATD 653 >gi|160858159|emb|CAP19999.1| collagen type VI alpha 5 [Homo sapiens] Length = 527 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/105 (23%), Positives = 47/105 (44%), Gaps = 9/105 (8%) Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 L++A N +F EH ++ + K+ +I +TDGE + D+ + E + +G Sbjct: 3 ALKHA-NALF----TEEHGSRIKQNVKQMLIVITDGE----SHDHDQLNDTALELRNKGI 53 Query: 318 IVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 ++A+GV + + + V N KL D F + + M Sbjct: 54 TIFAVGVGKANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERM 98 >gi|281416613|ref|ZP_06247633.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281408015|gb|EFB38273.1| von Willebrand factor type A [Clostridium thermocellum JW20] Length = 1363 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/268 (13%), Positives = 80/268 (29%), Gaps = 37/268 (13%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + + S++I + + +T N L + ++ + I+ D+ + + Sbjct: 36 KTDEDFESAQISFKISEEVLKTTDINNLVIFYYDEENDTIK----FLETEVDEETNTIKT 91 Query: 129 AVSRYEMPFIFCTFPWCANSSHAP-----LLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 V + + + + + L D++ V+D + SM Sbjct: 92 TVDHFSIYGVIDIVRFAQSWGIKDILDKLLNPGGETIPPVAEIGQADIVFVIDTTGSMGS 151 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----SSKIVQTFPLAW--GV 237 + + + ++ + N VR GL+ + + T L W V Sbjct: 152 VINNVKNNIT----------NFANTLMENNVDVRLGLIDYKDLEEDGMDSTKNLGWFDNV 201 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD---GE 294 +N + + A E + K+I+ TD E Sbjct: 202 SDFIASVNNMRATGGGDAPESTVDALE------EARRMDFRPG--VNKFIMLFTDVSYKE 253 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAI 322 ++ D + + K +V AI Sbjct: 254 STRFE-DVQSMKTVIEKLKEDKIVVSAI 280 >gi|254413248|ref|ZP_05027019.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196179868|gb|EDX74861.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 1037 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 81/241 (33%), Gaps = 26/241 (10%) Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS----SHAPLLITSSVK 160 + + S+++ + + DQ N + RY++ + H + +++ Sbjct: 597 VTTQQTSSTVRVELADQETIPNKDLILRYQVAGADTQATVLTQADERGGHFATYLIPAIE 656 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + D++ ++D S S + +L D I N+ + Sbjct: 657 YQQNEIVPKDVVFLVDTSGSQSGSPIVQSKELMRQFIQGLNPQDTFTIIDFANSTTQL-- 714 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S K + P + INRL T+ G++ N +L Sbjct: 715 ---SDKPLANTPQ--NRKKALNYINRLDANGGTELMNGIDTVLNFPAAPAGRL------- 762 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 + ++ LTDG D+++ + + + G +Y+ GV + ++ A Sbjct: 763 ----RSVVLLTDGLIG----DDEQIIAEIRDRLKPGNRLYSFGVGSSTNRFLIERLAELG 814 Query: 341 R 341 R Sbjct: 815 R 815 >gi|118356595|ref|XP_001011553.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89293320|gb|EAR91308.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 899 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 43/333 (12%), Positives = 108/333 (32%), Gaps = 69/333 (20%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 ++ N N+ Q N N + + +NE ++ + + + T +++I + Sbjct: 163 KSEINNNSDISQNNQ--KSRSTNPKKREIKNE-QQEEIEETKQDYVKQTPKTVVIKKELP 219 Query: 124 DYNLSAVSRY-----EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 + +Y E F + + L I S+ + LD+M ++D + Sbjct: 220 AQVKQFIDQYYSKNEEARNFITNFHFLKDDMTYFLPIFQSLFPKME----LDLMFIVDCT 275 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS----KIVQTFPLA 234 SM+ ++ +I+ + +R V + + F + Sbjct: 276 GSMSSWIDAVKLEITGIVAAIKN--------QHHGSQIRVSFVGYRDYGDSERYSIFNFS 327 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD-- 292 ++ Q+ I+++ + + + + K + KY + L D Sbjct: 328 EDLEKFQDFISKVQACGGNDAAEDVAGGFKQANSQNWKSQ---------AKYAVLLADAP 378 Query: 293 -------GENSS-------PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G+ + + D + ++G +YA+ + Sbjct: 379 AHGIQYHGDKADFYDRYPKGDPDGIDLKKEFQNLIKKGVKLYAVEIM------------- 425 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 +++ ++D F + KE+ Q++ K Sbjct: 426 -------NSTKMMYDIFQQYNKEVNGQQLDITK 451 >gi|149914292|ref|ZP_01902823.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] gi|149811811|gb|EDM71644.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] Length = 597 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 5/75 (6%) Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQA---EAADQFLKNCA-SPDRFYSVQNSRKLH 352 +P + C K +G +VY I + ++A L+NCA SP + ++ + Sbjct: 522 APETKDDRMADICGATKSKGIVVYTIAFEMGEFDSAADRLENCASSPSQHFNATTLN-IS 580 Query: 353 DAFLRIGKEMVKQRI 367 AF I + K R+ Sbjct: 581 QAFGSIAANVQKLRL 595 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 74/221 (33%), Gaps = 34/221 (15%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F G+I+IL+ L V+ V GL I+T + L LD ++L A + Sbjct: 9 RFVTRDDGAITILSLFLFVVMLAVAGLGIDTMRHEMARTHLQATLDSAVLAGAGAPADAT 68 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + ++ F D L DI + S+S ++ + + + Sbjct: 69 AADVKLIVEDYFD--------AADLSQYLNTIDPETDIVASLNAKSVSASVELEMDTFLM 120 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 L T+ ++ + +++++ LDVS SM Sbjct: 121 RLSG-------------------VDTLTTAGGATAAIAAPRMEIVLALDVSGSMAGE--- 158 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +L + ++ + + S D +V +S + Sbjct: 159 ---RLTKMKSAAKQFVTDVMSASDQGTTT-ISIVPYSWSVT 195 >gi|32475537|ref|NP_868531.1| hypothetical protein RB8767 [Rhodopirellula baltica SH 1] gi|32446079|emb|CAD75908.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 291 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/97 (23%), Positives = 41/97 (42%), Gaps = 10/97 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +M+++D S S++ R + L ++ + N R GL FS + Sbjct: 77 LAVMLMVDCSASLDFGTQSQ------TKRELVTELGATLAMSAIKNNDRVGLTLFSEDVE 130 Query: 229 QTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEY 261 ++FP G +H+ I ++ GS T LE+ Sbjct: 131 KSFPPRQGSRHVLRLIREMLTHPSSGSGTDVGAALEH 167 >gi|297467492|ref|XP_002705103.1| PREDICTED: polydom [Bos taurus] Length = 3396 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 59/167 (35%), Gaps = 34/167 (20%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L+++ ++D S S+ + +L +R++L +P R +VTFSSK Sbjct: 83 RLELVFLVDESSSVGQ--ANFLSELK----FVRKLLSDFPVVP---TATRVAIVTFSSKN 133 Query: 228 VQTFPLAWGVQH---------IQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + +I + + G T + + A + ++E Sbjct: 134 NVVPRVDYISSRRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQILRHSRENS---- 189 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K I +TDG ++ + + G ++ G+ Sbjct: 190 ------TKVIFLITDGYSNGG-----DPRPIAASLRDFGVEIFTFGI 225 >gi|326791577|ref|YP_004309398.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326542341|gb|ADZ84200.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 235 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 64/193 (33%), Gaps = 27/193 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVA-TRSIREMLDIIKSIPDVNNVVRSGLVTFS-- 224 L + ++D S SM G + L A ++I +M + P+ +R +TFS Sbjct: 16 PLHFIWLVDCSGSM---MGEKIQSLNYAIKQTIPDMRAAAEENPNAQLYIR--AITFSEG 70 Query: 225 --SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 I P+ E I+ + G T LE +LE Sbjct: 71 AKWHIASPTPVQD-----FEWID-VEAGGLTDLGKALELV-------AGQLEMPPMPERA 117 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-KNCASPDR 341 ++ L+DG+ + L K+ + AI + +A + L + P+ Sbjct: 118 LPPVLVLLSDGQPTDTYEAALNQLLSLPWGKKA-VKL-AIAIGNDADKEVLARFVDQPEM 175 Query: 342 -FYSVQNSRKLHD 353 N L Sbjct: 176 PVLEANNPSALIK 188 >gi|149049604|gb|EDM02058.1| similar to putative voltage-gated calcium channel alpha(2)delta-4 subunit (predicted) [Rattus norvegicus] Length = 700 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 71/193 (36%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D+S SM ++ +A ++ +LD + VN ++ ++ + Sbjct: 298 DIVILVDMSGSMKGL------RMAIAKHTVTTILDTLGENDFVN------IIAYNDYVHY 345 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ + L A+ + +E + Sbjct: 346 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGIVSQALIEAFQILKQFQESRQ-----G 400 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E +F R V+ IG + AD+ C Sbjct: 401 SLCNQAIMLVTDG-----AVEDYEPVFETYNWPDRKVRVFTYLIGREVTFADRMKWIACN 455 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 456 NKGYYTQISTLAD 468 >gi|22127458|ref|NP_670881.1| hypothetical protein y3584 [Yersinia pestis KIM 10] gi|108809185|ref|YP_653101.1| hypothetical protein YPA_3194 [Yersinia pestis Antiqua] gi|108810628|ref|YP_646395.1| hypothetical protein YPN_0463 [Yersinia pestis Nepal516] gi|145597694|ref|YP_001161770.1| hypothetical protein YPDSF_0382 [Yersinia pestis Pestoides F] gi|167467475|ref|ZP_02332179.1| putative tellurium resistance protein [Yersinia pestis FV-1] gi|21960552|gb|AAM87132.1|AE013961_4 hypothetical [Yersinia pestis KIM 10] gi|108774276|gb|ABG16795.1| hypothetical protein YPN_0463 [Yersinia pestis Nepal516] gi|108781098|gb|ABG15156.1| hypothetical protein YPA_3194 [Yersinia pestis Antiqua] gi|145209390|gb|ABP38797.1| hypothetical protein YPDSF_0382 [Yersinia pestis Pestoides F] Length = 233 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 71/204 (34%), Gaps = 22/204 (10%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 MP + L S + ++ L + +++D S SM + Sbjct: 1 MPLVSLLI----------LTPLSLILRTTSDMRRLPVYLLIDTSGSMRGE------SIHA 44 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 I+ M+ ++ P V ++T+ ++ + PL +++ Q + T Sbjct: 45 VNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIPLT-ALENFQFTDITVPSAGGTF 103 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + LE + + ++ + KG + +TD S+P+ KR Sbjct: 104 TGAALECLIHCVDRDIQRSDGDQKGDWRP--LVFLMTD---STPSDVYAYGEAIKEVKKR 158 Query: 315 RGAIVYAIGVQAEAADQFLKNCAS 338 + A V A+A + LK S Sbjct: 159 AFGSIIACAVGAKAKHEHLKQLTS 182 >gi|45442680|ref|NP_994219.1| hypothetical protein YP_2915 [Yersinia pestis biovar Microtus str. 91001] gi|45437546|gb|AAS63096.1| conserved hypothetical protein [Yersinia pestis biovar Microtus str. 91001] Length = 233 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 71/204 (34%), Gaps = 22/204 (10%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 MP + L S + ++ L + +++D S SM + Sbjct: 1 MPLVSLLI----------LTPLSLILRTTSDMRRLPVYLLIDTSGSMRGE------SIHA 44 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 I+ M+ ++ P V ++T+ ++ + PL +++ Q + T Sbjct: 45 VNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIPLT-ALENFQFTDITVPSAGGTF 103 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 + LE + + ++ + KG + +TD S+P+ KR Sbjct: 104 TGAALECLIHCVDRDIQRSDGDQKGDWRP--LVFLMTD---STPSDVYAYGEAIKEVKKR 158 Query: 315 RGAIVYAIGVQAEAADQFLKNCAS 338 + A V A+A + LK S Sbjct: 159 AFGSIIACAVGAKAKHEHLKQLTS 182 >gi|225559690|gb|EEH07972.1| U-box domain-containing protein [Ajellomyces capsulatus G186AR] Length = 759 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 53/138 (38%), Gaps = 18/138 (13%) Query: 170 DMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTF 223 D+++ +DVS SM P D+ G + +LD+ K I +N R G+V F Sbjct: 75 DIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLNENDRLGIVAF 134 Query: 224 SSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S++ + ++ + + + L S+T GL+ E H + Sbjct: 135 STEAEVVYKISKMNESNKKAALKAVEALKPLSSTNLWHGLKLGLKAF----ENERHTLQS 190 Query: 280 HDDYKKYIIFLTDGENSS 297 + LTDG + Sbjct: 191 VQA----LYVLTDGMPNH 204 >gi|170031185|ref|XP_001843467.1| dihydropyridine-sensitive l-type calcium channel [Culex quinquefasciatus] gi|167869243|gb|EDS32626.1| dihydropyridine-sensitive l-type calcium channel [Culex quinquefasciatus] Length = 1165 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 55/132 (41%), Gaps = 12/132 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVA----TRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D++++LD S SM + + +L V T S + ++I K DV+ +V FS Sbjct: 221 DIVILLDNSGSMTG-YRNYIAQLTVKSVLDTFSNNDFINIYKYSNDVDPLV----PCFSD 275 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 +VQ P ++ + E + L A+ + + +E + + Sbjct: 276 ILVQATPE--NIRFLNEYVKELQPEGYANVGKAFVKAFELLQNYREIR-RCNESISGCNQ 332 Query: 286 YIIFLTDGENSS 297 I+ +TDG S+ Sbjct: 333 AIMLITDGVPSN 344 >gi|157120592|ref|XP_001659677.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] gi|108874883|gb|EAT39108.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] Length = 1122 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 70/199 (35%), Gaps = 34/199 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + D+M++LD S SM+ + +A + ++D + N LV+F Sbjct: 148 AASSPKDVMILLDSSGSMSGK------EYQLAVATASAIMDTLGDDDFFN------LVSF 195 Query: 224 SSKIVQTFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 S + P ++ ++ I + +T + LE A+ + + Sbjct: 196 SDQPKVIVPCFQDKMVRATPDNIKEVKTAIQSVECENTANFSAALESAFELL-----RRY 250 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQF 332 + + + I+ +TDG P+ + + + N ++ IG + Sbjct: 251 NQSSQGSQCNQAIMLITDG----PSDTFADVIKHYNHP-HMPVRIFTYLIGTDKSSGKNL 305 Query: 333 LK-NCASPDRFYSVQNSRK 350 + C + + + + + Sbjct: 306 YQMACDNKGFYVQINSVEE 324 >gi|297537911|ref|YP_003673680.1| type 1 secretion target domain-containing protein [Methylotenera sp. 301] gi|297257258|gb|ADI29103.1| type 1 secretion target domain-containing protein [Methylotenera sp. 301] Length = 1187 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/177 (22%), Positives = 59/177 (33%), Gaps = 34/177 (19%) Query: 171 MMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++ LD S SMND G +L A +SI ++DI + +R LVTFS+ Sbjct: 681 LLITLDTSNSMNDVSGINAETRLQSAVKSIERLMDIYDGFGE----IRVRLVTFSNNAET 736 Query: 230 TFPLAWGVQHIQEKI--NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 W + I L G TT + A D + + Sbjct: 737 QGT-EWVTLDTAKTILDGILTIGGTTNYDGAIANAMTAFADPGKISGAQNISY------- 788 Query: 288 IFLTDG------------ENSSPNIDNKESLFYCNEAKRRG------AIVYAIGVQA 326 F +DG N ++ + EA +G YAIG+ Sbjct: 789 -FFSDGNPNRGDGSNTTLSNVGSSVGPDNGIQVAEEAIWKGFLNDNQIKSYAIGMGT 844 >gi|291229809|ref|XP_002734863.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 2065 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 71/205 (34%), Gaps = 35/205 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS----- 225 ++ ++D S S G IRE+ I PD R +VT+S Sbjct: 57 IVFLVDSSGS------IGASNFHFEINFIREISTIFSMSPDEA---RVSVVTYSDSSKIV 107 Query: 226 KIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + + G + L T + LE A + A+ Sbjct: 108 RQIDYIGSSVGKNKC-TFLGELSLIRYEAGWTDTKGALEEADRVLQHAR----------S 156 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + ++ LTDG+++ + + + +G + AIGV D+ L + A+ Sbjct: 157 GANRLVVLLTDGQSTEG-----DPVGIATRIRNKGIRIVAIGVGNVNMDE-LTSIATAQY 210 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQR 366 + + + D RI ++ ++ Sbjct: 211 VFILDRLSYVVDLATRIKNDVKEKS 235 >gi|226327211|ref|ZP_03802729.1| hypothetical protein PROPEN_01077 [Proteus penneri ATCC 35198] gi|225204429|gb|EEG86783.1| hypothetical protein PROPEN_01077 [Proteus penneri ATCC 35198] Length = 178 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 56/167 (33%), Gaps = 12/167 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + I+ ML+ ++ P V ++T+ ++ Sbjct: 3 RLPVYLLIDTSGSMRGE------SIHAVNVGIQTMLNALRQDPYALESVHISIITYDNEA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL I + T + LE + ++ + KG + Sbjct: 57 REFIPLTALEDFQFSDIT-VPSSGGTFTGAALECLIQCVDRDIKRSDGDQKGDWRP--LV 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 +TDG S + KR + A V +A LK Sbjct: 114 FLMTDGTPSDAYAYGEAIKEV---KKRSFGSIIACAVGPKAKHDHLK 157 >gi|3236370|gb|AAC23667.1| type VI collagen alpha 3 subunit [Mus musculus] Length = 2657 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 49/360 (13%), Positives = 119/360 (33%), Gaps = 58/360 (16%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K S+L ++ + + +F +N Sbjct: 863 VRIGVVQFSNDVFPEFYLKTHKSQ---SSVLEAIRRLRFKGGSPLNTGRALEFVA---RN 916 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 917 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTI 976 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + S+ D++ +LD Sbjct: 977 TNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPEKKKADIVFLLD- 1035 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L A+ + + + SI R GLV ++S F L Sbjct: 1036 -GSINFRRDSFQEVLRFASVIVDTVYEDGDSI-------RVGLVQYNSDPTDEFFLRDFS 1087 Query: 236 GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + I + IN++++ + G+E+ + E ++ + +T G Sbjct: 1088 TKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDVRVPQIAFVITGG 1143 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++ D +L ++G V+A+GV+ +++ K ++ + V + ++L + Sbjct: 1144 KSVEDAQDVSLALT------QKGVKVFAVGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1197 Score = 40.2 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 74/204 (36%), Gaps = 24/204 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S + F + ++ +++ + PD VR LV +S + Sbjct: 419 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPDR---VRVALVQYSDRTRP 468 Query: 230 TFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L Q + I RL T TP A + ++ + + + Sbjct: 469 EFYLNSHMDQQGVISAIRRLTLLGGT--TPNTGAALEFVLRNILTSSTGSRIAEGVPQLL 526 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I LT + P+ D+ K+ GA+ IG+ + PD ++ Sbjct: 527 IVLT----AEPSGDDVRGPSVV--LKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIPT 580 Query: 348 SRKLHDAFLRIGKEMVKQRILYNK 371 R+L I + + ++ I N+ Sbjct: 581 FREL----GTIQRVISERVIQLNR 600 >gi|254491098|ref|ZP_05104279.1| von Willebrand factor type A domain protein [Methylophaga thiooxidans DMS010] gi|224463611|gb|EEF79879.1| von Willebrand factor type A domain protein [Methylophaga thiooxydans DMS010] Length = 326 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 69/215 (32%), Gaps = 39/215 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHFG---PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + G ++ VLD S SMN+ FG P D+ A + R + + + + P G+ Sbjct: 78 TGSGAHIVFVLDRSASMNETFGGETPDEDEQSKAKAARRILSNFVTNRPH----DLFGVA 133 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKG 279 FS++ PL IQ IN + T GL + Sbjct: 134 GFSTQPFYISPLTEHKTAIQAAINSMETPGLAFTNVAKGLGMGLSYF----------KAQ 183 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK----------RRGAIVYA-------- 321 + I+ ++DG + + K + + G +Y+ Sbjct: 184 PHTGSRVIVLVSDGAATLDHRAQKTLREWFERYRVSLYWFFLRTENGQGIYSEPESNRDD 243 Query: 322 -IGVQAEA-ADQFLKNCASPDRFYSVQNSRKLHDA 354 V E +F + + P Y V L A Sbjct: 244 NPRVMPERYLHKFFNSLSVPYHAYEVDTPESLQAA 278 >gi|198436258|ref|XP_002122777.1| PREDICTED: similar to HyTSR1 protein [Ciona intestinalis] Length = 4258 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/161 (20%), Positives = 55/161 (34%), Gaps = 18/161 (11%) Query: 214 NVVRSGLVTFSSKIVQ--TFPLAWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAK 270 N R + F+ ++ +F + + + E+I+R+ + T + L YA N++F Sbjct: 2119 NSTRIAALRFNRDVIPLWSFAQSTSSEDLIERIDRVTYDGSGTHTGKALTYAANRLFTEA 2178 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E D + LTDG D ++L K +G V AI V Sbjct: 2179 EGDR------PDVPDLAVVLTDGRAQDNPGDTVQAL------KNKGVKVIAIAVTNRVDI 2226 Query: 331 QFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 + AS + V + L I Q Sbjct: 2227 NEIYAIASDPDEQNAFFVNDFEGLFSVVENIALFACGQPTW 2267 >gi|295838849|ref|ZP_06825782.1| von Willebrand factor type A domain-containing protein [Streptomyces sp. SPB74] gi|197695399|gb|EDY42332.1| von Willebrand factor type A domain-containing protein [Streptomyces sp. SPB74] Length = 440 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 57/207 (27%), Gaps = 38/207 (18%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 + + + +P + + S D + ++++D S Sbjct: 7 PEGPRFEVEVYQNPYLPEGSGEVHAVVTVTATGGGTGALSATGSGQDAAV--VLMVDCSG 64 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP------- 232 SM KL A + LD + + R ++ + + +P Sbjct: 65 SMQY----PPSKLHHAKEATGAALDTL------RDGTRFAVIEGTHVAREVYPRGGALAV 114 Query: 233 -LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 ++ + L T L+ A + A ++ I LT Sbjct: 115 ADDRTRAEAKQALRALRASGGTAVGRWLQLAGRLLSQAP-----------VPIRHGILLT 163 Query: 292 DGENSS-------PNIDNKESLFYCNE 311 DG N +D+ F C+ Sbjct: 164 DGRNEHETPEELRAALDDCAGRFTCDA 190 >gi|114652511|ref|XP_509886.2| PREDICTED: coagulation factor C homolog, cochlin isoform 7 [Pan troglodytes] Length = 540 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 320 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 378 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 379 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 427 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 428 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 474 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 475 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 506 >gi|325066442|ref|ZP_08125115.1| von Willebrand factor type A [Actinomyces oris K20] Length = 370 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 81/251 (32%), Gaps = 58/251 (23%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDI--GLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + A + V+++ ++D D+++ LDVS SM Sbjct: 84 LLAVMLVSGLVSAAAIAGRPVRVTERTDALANRDIVLCLDVSTSM--------------V 129 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTT 253 R +L I + + R G+V ++S PL + +++++ L + Sbjct: 130 RIDSSVLTTFSEILEDFDGERVGIVAWNSAAQTIVPLTDDYELLRDQLTELGDVLDIDPE 189 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKG----------------HDDYKKYIIFLTDGENSS 297 T + AY + F + + II TD N Sbjct: 190 NVTYKQQLAYQEAFGGTVNTSINGSSLAGDGLASCAQAFDNQGLERSRSIILATD--NQV 247 Query: 298 PNIDNKESLFYCNEAK---RRGAIVYAIGVQAEAADQF--------------LKNCA--- 337 + DN++ + AK R +++I A+ + LK Sbjct: 248 IDPDNEQIYPLPDAAKLLAERKIRLFSI-YGADEDQPYQNLLDKTPEESREELKTVTEEQ 306 Query: 338 SPDRFYSVQNS 348 RFY V++S Sbjct: 307 GKGRFYDVEDS 317 >gi|213961715|ref|ZP_03389981.1| protein containing von Willebrand factor [Capnocytophaga sputigena Capno] gi|213955504|gb|EEB66820.1| protein containing von Willebrand factor [Capnocytophaga sputigena Capno] Length = 550 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 46/251 (18%), Positives = 93/251 (37%), Gaps = 28/251 (11%) Query: 106 NNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS 165 + R + D + A S + PW N H L I K + Sbjct: 128 KDAIRIEEMINYFDYDYPAPTKEATSPLRVTPELAPTPW--NPEHLLLRIGLQAKKLDLA 185 Query: 166 DIGLD-MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ ++DVS SM++ +KL + S + +L +K R +VT++ Sbjct: 186 QAPPSNIVFLIDVSGSMDEP-----NKLPLLKSSFKLLLTQLKPTD------RVAIVTYA 234 Query: 225 S--KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 S K+ + Q I++ ++ L +T + G++ AY KE ++ K ++ Sbjct: 235 SGTKVALSSTPVKERQKIEKVLDNLYASGSTSGSSGIQLAY------KEAQKNFIKNGNN 288 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--SP 339 II TDG+ + + +E + + + G + +G D + A Sbjct: 289 R---IILATDGDFNVGISNPRELEKFIEKQRESGIYMSVLGFGMGNYRDDMAETIADKGN 345 Query: 340 DRFYSVQNSRK 350 + + + + Sbjct: 346 GNYAYIDDLTE 356 >gi|149199576|ref|ZP_01876610.1| von Willebrand factor, type A [Lentisphaera araneosa HTCC2155] gi|149137372|gb|EDM25791.1| von Willebrand factor, type A [Lentisphaera araneosa HTCC2155] Length = 512 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 67/195 (34%), Gaps = 30/195 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + + +++D S SM G A+ ++ + L + I + S Sbjct: 338 GLNTSVHLLMDASGSMQGRMSLG----SQASYALLKALRSVPGIKSAMTCFPGSSLQPSV 393 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 V + Q + KI+ + TT P L + + + +K Sbjct: 394 HQVLKYD-----QPLHNKIS-VRPDGTTPLAPALWWLMQQ-----------DALRTETRK 436 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 II LTDG+ + S +A G +Y IG+Q + Q L PD + Sbjct: 437 IIIILTDGDPDC----VEMSKKAIAKATSLGFEIYGIGIQCSSIKQLL-----PDAHKII 487 Query: 346 QNSRKLHDAFLRIGK 360 + + L A + + Sbjct: 488 HDLKDLAPALFSLLQ 502 >gi|114652507|ref|XP_001170996.1| PREDICTED: coagulation factor C homolog, cochlin isoform 3 [Pan troglodytes] Length = 534 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 314 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 372 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 373 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 421 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 422 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 468 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 469 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 500 >gi|321265474|ref|XP_003197453.1| hypothetical protein CGB_N0210C [Cryptococcus gattii WM276] gi|317463933|gb|ADV25666.1| Hypothetical Protein CGB_N0210C [Cryptococcus gattii WM276] Length = 445 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 55/179 (30%), Gaps = 33/179 (18%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN--NVVRS 218 S +D++ +LD + SM + D + + D+I+ +N + +R Sbjct: 48 SGSSRGKCIDLVFILDCTGSMQKYINSVRDHI-------IGICDMIRGEEGLNGPDDLRV 100 Query: 219 GLVTF-------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 +V + S+ + + P + +Q + L + A E Sbjct: 101 AVVNYRDHPPQDSTYVYKFHPFTSDIPDVQNYLKGLTASGGGDGPEAVTAAMAATLTELE 160 Query: 272 KLEHIAKGHDDYKKYIIFLTD------GENSS----PNIDNKESLFYCNEAKRRGAIVY 320 + + + + D GE + D + L + G ++ Sbjct: 161 WR-------REAARMAVLVADAPPHGIGEGGDQFKQGDPDGHDPLVIARMMAQNGITMF 212 >gi|298484177|ref|ZP_07002343.1| conserved hypothetical protein [Bacteroides sp. D22] gi|298269681|gb|EFI11276.1| conserved hypothetical protein [Bacteroides sp. D22] Length = 289 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTNIRLALEYLTNVMK 173 >gi|297587450|ref|ZP_06946095.1| conserved hypothetical protein [Finegoldia magna ATCC 53516] gi|297575431|gb|EFH94150.1| conserved hypothetical protein [Finegoldia magna ATCC 53516] Length = 249 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 60/167 (35%), Gaps = 11/167 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + V+D S SM G + ++ A I L I + V+ +++F+S+I Sbjct: 17 LFFVIDTSGSMK---GTKIGEVNSAIEEILPELSDISNSNPDAE-VKMAILSFNSEIQWI 72 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P V L TT+ + A+ ++ + + Y I + Sbjct: 73 TPKTGPVDPGVYLWRDLNANGTTR----MGAAFEELESKLHGDKFMKSATSSYAPVIFLM 128 Query: 291 TDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAADQFLKN 335 +DG + + L K + G V A+G+ +A L+ Sbjct: 129 SDGMPTETEEQFQSGLNKLKANKWFKSGIKV-ALGIGQDADLDVLEA 174 >gi|145502106|ref|XP_001437032.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404179|emb|CAK69635.1| unnamed protein product [Paramecium tetraurelia] Length = 556 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/239 (16%), Positives = 87/239 (36%), Gaps = 16/239 (6%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 ++ +KQ ND +I KN + + + + I + I + K ++ Sbjct: 22 IDDPQKQINDLKKQIFKNFLENGINIDENQVLLQDEEGFILNESE---TIGNLLKTNDIV 78 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVL-DVSLSMNDHF-- 185 V + E + ++++ ++ ++VL D+S SM + Sbjct: 79 KVVQQEQKQNIVNQQQQQQIVAQEEQQEQLQQDLNQNNDPVEAIVVLFDISGSMGGMYFK 138 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 + ++G D + + N++V+ LV F S I + + ++ Sbjct: 139 EEELSRIGAVNAFFSAFADKTLAF-EFNHIVK--LVWFESFITDKCDFTNDFNNFIKLVD 195 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 TK + YA ++ + K+K +I II LTDG+++ + + Sbjct: 196 DASPRGGTKCYDAIAYAIEQLKEIKKKYPNIILR-------IIALTDGDDNQSKENPQS 247 >gi|115767162|ref|XP_791423.2| PREDICTED: similar to calcium-activated chloride channel-2 [Strongylocentrotus purpuratus] gi|115976268|ref|XP_001180011.1| PREDICTED: similar to calcium-activated chloride channel-2 [Strongylocentrotus purpuratus] Length = 1175 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 65/189 (34%), Gaps = 32/189 (16%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + P+ T+ V + +++VLD+S SM+ + + +S + + + Sbjct: 219 AGVQPVPDTTPVFEVFQLSSVRSVVLVLDISGSMSGN------RFDRMIQSSADYIMNVI 272 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAY 263 + G++ F S L + + ++ L G T G+ Sbjct: 273 PLDSK-----LGIIGFESTSHIRTLLTDITDIASRERLVDALPPSAGGGTCIECGILSGI 327 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + Y++ L+DG+ S+ N+ + N+ G I+ I Sbjct: 328 QVLGSYAQGG------------YLLLLSDGQGSAQNLRDTY-----NDIDNAGVIIDTIT 370 Query: 324 VQAEAADQF 332 + A + Sbjct: 371 ISNSADQEM 379 >gi|326772466|ref|ZP_08231750.1| von Willebrand factor type A domain-containing protein [Actinomyces viscosus C505] gi|326637098|gb|EGE38000.1| von Willebrand factor type A domain-containing protein [Actinomyces viscosus C505] Length = 370 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 75/229 (32%), Gaps = 56/229 (24%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V S + D+++ LDVS SM R +L I + + R Sbjct: 106 VTERSDALANRDIVLCLDVSTSM--------------VRIDSSVLTTFSEILEDFDGERV 151 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 G+V ++S PL + ++++++ L + T + AY + F Sbjct: 152 GIVAWNSAAQTIVPLTDDYELLRDQLSELGDVLDIDPENVTYKQQLAYQEAFGGTVNSSI 211 Query: 276 IAKG----------------HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK---RRG 316 + + II TD N + DN++ + AK R Sbjct: 212 NGSSLAGDGLASCAQAFDNQGLERSRSIILATD--NQVIDPDNEQIYPLPDAAKLLAERK 269 Query: 317 AIVYAIGVQAEAADQF--------------LKNCA---SPDRFYSVQNS 348 +++I A+ + LK RFY V++S Sbjct: 270 IRLFSI-YGADEDQPYQDLLDKTPEESREELKTVTEEQGKGRFYDVEDS 317 >gi|281337632|gb|EFB13216.1| hypothetical protein PANDA_007041 [Ailuropoda melanoleuca] Length = 524 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 66/213 (30%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 304 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 362 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 363 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 411 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ Sbjct: 412 RYMSGGTATGDAISFTVRNVFGPVRD--------SPNKNFLVIVTDGQ----SYDDVRG- 458 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 459 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 490 >gi|156383259|ref|XP_001632752.1| predicted protein [Nematostella vectensis] gi|156219812|gb|EDO40689.1| predicted protein [Nematostella vectensis] Length = 157 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 65/173 (37%), Gaps = 23/173 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-- 226 +D+ ++D S S+ D G ++ + + G+VTF++ Sbjct: 3 VDLAFLIDGSKSIED---AGKGNFKRCLDFVKRIALSFDISASGTH---IGIVTFATDPT 56 Query: 227 IVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + F ++ I I+ + + T + LE +F+ ++ + + Sbjct: 57 VELEFDQSFDNTSIATIIDNIRNPDALTFTGKALETVKKDLFEKSQRA--------NVHR 108 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 +I LTDG + + K G +YA+GV + + LK+ AS Sbjct: 109 MLIVLTDGRS------WDAVQEPAKQLKESGVTLYAVGVGQDYDLEQLKDIAS 155 >gi|309358783|emb|CAP33694.2| hypothetical protein CBG_15368 [Caenorhabditis briggsae AF16] Length = 391 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/169 (21%), Positives = 65/169 (38%), Gaps = 8/169 (4%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S++ LD++ V+D S M + G+ + S+ I + + R GLVT+S Sbjct: 31 SNLWLDVVAVVDNSQGMTND---GLTAIAANIASVFSEGTKIGTNSNDPRTTRLGLVTYS 87 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 SK + L Q I + + + T S + A+E LE + Sbjct: 88 SKATKNAYLD-KFQSIDDLYDNIFTDLATVSQTDDSNLETGLEAAEEILEAGKNEKRKFY 146 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 K +I + S+ N + E++ N K G + + D+ L Sbjct: 147 KKLILI---YASTFNRNG-EAISIANRLKSAGTKLVTVAYDQGGGDEQL 191 >gi|282900088|ref|ZP_06308045.1| Mg chelatase subunit [Cylindrospermopsis raciborskii CS-505] gi|281194970|gb|EFA69910.1| Mg chelatase subunit [Cylindrospermopsis raciborskii CS-505] Length = 671 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 66/205 (32%), Gaps = 40/205 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ V+D S SM ++++ A ++ ++L N + L+ F Sbjct: 471 KAGALVVFVVDASGSMA------LNRMQSAKGAVMQLLT-----ESYQNRDQIALIPFRG 519 Query: 226 K-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + + ++ +L G + GL A +A+ G D + Sbjct: 520 EQAEVLLPPTRSIALAKTRLEKLPCGGGSPLAHGLTQAVRVGVNAQ-------MGGDIGQ 572 Query: 285 KYIIFLTDG--------------ENSSPNIDNKESLFYCNEAKRRGAIVYAI-----GVQ 325 I+ +TDG E E L + G + I V Sbjct: 573 VVIVAITDGRGNIPLSRSLGESQEPGEKPDIKGELLDIAGRIRASGMQLLVIDTESKFVS 632 Query: 326 AEAADQFLKNCASPDRFYSVQNSRK 350 A + K + ++Y + + Sbjct: 633 TGFAKELAKT--AGGKYYQLPKATD 655 >gi|150377243|ref|YP_001313838.1| hypothetical protein Smed_5128 [Sinorhizobium medicae WSM419] gi|150031790|gb|ABR63905.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 436 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 59/175 (33%), Gaps = 4/175 (2%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F G+++++ A+ PV+ MGL ET + + K KL + D S A + Sbjct: 21 LRRFLKAEGGAVAVIAAVAFPVLVGAMGLGAETGYWYLEKRKLQHAADVSAYAAAVRHRA 80 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + + + G + ++ T + Sbjct: 81 GDQQSALEAAARRVAGGSGFSPGDLTLSTASAAAGGSNNVTVELTETHPRLF-SSVFGTG 139 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 ++ +R T A + +V ++ +++ L V+ S + Sbjct: 140 TITIKAR---AVAQVTGGSKACVLALSNSASGAVTVTGSTEVQLSGCSVVSNSSA 191 >gi|316969306|gb|EFV53424.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 412 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 55/163 (33%), Gaps = 25/163 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D +D+++VLD S S+ F R I ++ +P + ++ +S Sbjct: 41 DAKVDLVLVLDSSGSVERTFENY----KAVARHI------VQLLPIGYDRTLMSILQYSK 90 Query: 226 KIVQTFPLAWG--VQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P + + + E + ++ F + T + ++ + Sbjct: 91 DAHVLLPFSADQRPEQLNEIVEQIQFLGSITATAEAVQMGLAQFGCGTRSDA-------- 142 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 K I +TDG + N + + N + GA V + Sbjct: 143 -SKVFILITDG---NSNNKWPDVVNAANALQSSGATVAVVAFG 181 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 49/136 (36%), Gaps = 23/136 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S S+ + F + L ++S+ N +V + Sbjct: 233 DIVIVVDSSQSVEEQFEQYKAE----------ALQFVRSLDVGINSTLCSIVQYGRHATV 282 Query: 230 TFPLAWGVQH---IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P + I N G TT + ++ A +++ + KK Sbjct: 283 ILPFSEHQSKEIIIASLQNLKHLGGTTHTADAIQLALHQLKLNGRRGS---------KKL 333 Query: 287 IIFLTDGENSSPNIDN 302 + +TDG NS+ + D Sbjct: 334 FLLMTDG-NSADSWDT 348 >gi|268324283|emb|CBH37871.1| hypothetical secreted protein, CARDB family [uncultured archaeon] Length = 1149 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 76/250 (30%), Gaps = 33/250 (13%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL----------DMMMVLDVSLSMN 182 ++P P + P + V I ++ + D++ V D++ SM Sbjct: 42 PKLPETIKQIPLTPVKT--PNFVIPIVTIPVCKELQMHLDCVTTKNADIVFVFDITGSMG 99 Query: 183 DHFGPGM-------DKLGVATRSIREMLDIIKSIPDVNNVVRSG-LVTFSSKIVQTFPLA 234 + D L A R L + P + G F+ K+ L+ Sbjct: 100 EEISEMKDISKNFADGLAAAGIDYRFGLTEFRDFPVTCDGTVCGDANDFAYKVYNGGVLS 159 Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG- 293 + I+ L A ++ D K I+ ++D Sbjct: 160 NSSSMFKSWIDSLNPSGGGDLPESTLAALMHTVKDQKWR------GGDASKIIVLISDAY 213 Query: 294 ---ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR- 349 + N + + R G VY +G EA+ + + N ++ +F+ ++ Sbjct: 214 PHSDEHCCNQEKNTFDGVISALTRSGMTVYVVGPD-EASMEMIAN-STGGKFFHIRAEGV 271 Query: 350 KLHDAFLRIG 359 L I Sbjct: 272 SLKPVLEEIA 281 >gi|4758022|ref|NP_004077.1| cochlin precursor [Homo sapiens] gi|205277471|ref|NP_001128530.1| cochlin precursor [Homo sapiens] gi|114652503|ref|XP_001171057.1| PREDICTED: cochlin isoform 6 [Pan troglodytes] gi|7387582|sp|O43405|COCH_HUMAN RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801413|gb|AAC39545.1| Coch-5B2 gene product [Homo sapiens] gi|37182918|gb|AAQ89259.1| COCH [Homo sapiens] gi|58802453|gb|AAW82432.1| coagulation factor C homolog, cochlin (Limulus polyphemus) [Homo sapiens] gi|119586367|gb|EAW65963.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|119586369|gb|EAW65965.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|158258885|dbj|BAF85413.1| unnamed protein product [Homo sapiens] Length = 550 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 330 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 388 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 389 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 437 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 438 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 484 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 485 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 516 >gi|308472975|ref|XP_003098714.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] gi|308268314|gb|EFP12267.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] Length = 412 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 65/197 (32%), Gaps = 21/197 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMD--------------KLGVATRSIREMLDIIKSIP 210 +++ LD++ V+D S M + + ++ S+ I Sbjct: 34 TNLWLDVIAVVDNSRGMTVNGLNYVSIFSLTMPSSLFTKFQIASNIASVFGFGTRIGLNA 93 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFD 268 R GLVT++S Q L Q + + NR+ +T +T + + Sbjct: 94 SEPRTTRLGLVTYNSVATQMADL-NQYQSLHDAFNRIFDDLSNTVDTTESYLSTGLTLAE 152 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAE 327 + + Y+K +I ++ + + + K G + + A Sbjct: 153 KMFNDQSVNSTRAHYQKVVIVYASKYQTNGESNPES---IADRLKLSGVKIITVAYGNAY 209 Query: 328 AADQFLKNCASPDRFYS 344 + L ASP +S Sbjct: 210 GLMKSLSIIASPGFAFS 226 >gi|160882772|ref|ZP_02063775.1| hypothetical protein BACOVA_00733 [Bacteroides ovatus ATCC 8483] gi|237720678|ref|ZP_04551159.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|156111796|gb|EDO13541.1| hypothetical protein BACOVA_00733 [Bacteroides ovatus ATCC 8483] gi|229449513|gb|EEO55304.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] Length = 289 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTNIRLALEYLTNVMK 173 >gi|291220858|ref|XP_002730442.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 937 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 73/198 (36%), Gaps = 31/198 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + L +++VLD+S SM+ + D+ + ++ + + V G+V F + Sbjct: 308 ETELRIVLVLDISGSMDRN-----DRFELMIQASTKYIGYT-----VPRGTWIGIVEFDN 357 Query: 226 KIVQTFPL--AWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 L ++ QE I+ L GS T GLE + + E Sbjct: 358 TASILSYLIQIDDMETRQELIDLLPDGTGSGTSIGSGLEAGIEVLEEGWETPAGG----- 412 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD- 340 + +TDGE ++P + +E + +V + + EA + A Sbjct: 413 ----ILFLITDGEENTPPFIDD----VIDELVEKEIVVDTLALSDEADPGLAELSAETGG 464 Query: 341 ---RFYSVQNSRKLHDAF 355 + +S LHDAF Sbjct: 465 TAYWYSESDDSTALHDAF 482 >gi|305665953|ref|YP_003862240.1| hypothetical protein FB2170_06715 [Maribacter sp. HTCC2170] gi|88710728|gb|EAR02960.1| hypothetical protein FB2170_06715 [Maribacter sp. HTCC2170] Length = 288 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 43/111 (38%), Gaps = 10/111 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L MM+++DVS S + FG + + + + + N + GL+ F Sbjct: 72 EEERELTMMLMVDVSGS--ELFGTT----NQFKKGVITEISATLAFSALQNNDKVGLILF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAK 270 S ++ P G H+ I L+ + T L+Y N + Sbjct: 126 SDEVELFIPPKKGKTHVLRIIRELLEFKPKSNKTDIAEALKYLTNVMKKKA 176 >gi|296141040|ref|YP_003648283.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296029174|gb|ADG79944.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 527 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 43/222 (19%), Positives = 78/222 (35%), Gaps = 47/222 (21%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L ++V+D S SMN+ G ++G+ +++ I PD N GL TFS I Sbjct: 325 PLKALVVVDTSGSMNESAGD-TTRIGMLASGFTKVVTQI---PDANA---VGLWTFS--I 375 Query: 228 VQTFPLAWG-----------------VQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFD 268 W Q + + +N L G T AY + + Sbjct: 376 GSATRPDWTEVVPTARLDARRGDKSQRQALLDGVNALPRKVGGATGLYDTTLAAYRRAVE 435 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSP---NIDN--KESLFYCNEAKRRGAIVYAIG 323 + + +I LTDG + P ++D+ + + A+ ++ +G Sbjct: 436 NFDPAYSNS---------LILLTDGSDEKPGGMSLDDLVAQLRTLVDPARP--VNIHTVG 484 Query: 324 VQAEAADQFLKNC--ASPDRFYSVQNSRKLHDAF-LRIGKEM 362 + +A LK A+ + +++ F I K Sbjct: 485 ISKDADLPALKRIADATGGTAQEADSEQQMLTDFVTAIAKRA 526 >gi|170751541|ref|YP_001757801.1| vault protein inter-alpha-trypsin subunit [Methylobacterium radiotolerans JCM 2831] gi|170658063|gb|ACB27118.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium radiotolerans JCM 2831] Length = 729 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 80/238 (33%), Gaps = 31/238 (13%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRY-EMPFIFCTFPWCANSSHAPLLITSS-VKISSKSD 166 R+ +L+ + +D L+ MP + A ++ ++ ++ + + Sbjct: 270 ARTVTLAEGVVPADRDLELTWAPVPSRMPGLGLFRETVAGRTYLLAAVSPPAIEADAAAR 329 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D+ V+D S SM+ G M + + L + + Sbjct: 330 PARDVTFVIDNSGSMS---GASMRQAKAGLLAGLGRLSPRDRFNVIRFDD-------TWD 379 Query: 227 IVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P+ + E + L T+ L+ A + D + + Sbjct: 380 ALHPEPVPATRAALAEAEAFVAALEARGGTEMLAPLKAA---LADPHPEDGRV------- 429 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + ++FLTDG + ++E +F A ++ +G+ + +++ A R Sbjct: 430 -RQVVFLTDG-----AVGDEERIFAAIHADLGRTRLFMVGIGSAPNGHLMRHAAEIGR 481 >gi|293373988|ref|ZP_06620329.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] gi|292631064|gb|EFF49701.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] Length = 270 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 53 EEERELTVMLMVDVSGSLEF------GTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 106 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 107 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTNIRLALEYLTNVMK 154 >gi|296086006|emb|CBI31447.3| unnamed protein product [Vitis vinifera] Length = 478 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 64/214 (29%), Gaps = 62/214 (28%) Query: 93 RNELRENGFAQD-INNIERSTSLSIIIDDQHKDY-NLSAVSRYEMPFIFCTFPWCANSSH 150 RN N + + +E T + + K Y N + + + N S+ Sbjct: 123 RNSSNGNAAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNIQRNMSN 182 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 +PL + +D++ VLD+S SM KL + R++ L Sbjct: 183 SPLNSHNP-------RAPVDLVTVLDISGSMAG------TKLALLKRAMGFALQA----- 224 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 +N L+ T GL + D K Sbjct: 225 ---------------------------------VNSLVANGGTNIAEGLRKGAKVMEDRK 251 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 E+ + II L+DG+++ + Sbjct: 252 ER---------NPVSSIILLSDGQDTYTTESVIQ 276 >gi|254491469|ref|ZP_05104648.1| von Willebrand factor type A domain protein [Methylophaga thiooxidans DMS010] gi|224462947|gb|EEF79217.1| von Willebrand factor type A domain protein [Methylophaga thiooxydans DMS010] Length = 341 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 68/208 (32%), Gaps = 32/208 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG-------PGM 189 C + P +V+ + G + +++LD S SM+D F P Sbjct: 53 LASCVIMALLMTLAGPFTPEKTVERYRQ---GAEFIVLLDRSRSMDDIFARRPLNSLPVE 109 Query: 190 DKLGVATR--SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 +L + R S +++ +K PD R G V FS K + L + + + Sbjct: 110 KELIRSKRKVSRDYLVEFVKRRPD----DRFGYVLFSDKPTEILRLTYNKAAVLATVEAG 165 Query: 248 IFGST---TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 G T L+ A + + + I+ ++DG +++ Sbjct: 166 GLGKGLSKTNIFSALKLAADM----------YQREDYRGSRNILLISDG---GQVFTDEQ 212 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +Y I +++ Sbjct: 213 KRYLARTYPDMKLSLYWIYLRSMRGMTL 240 >gi|260837260|ref|XP_002613623.1| hypothetical protein BRAFLDRAFT_93664 [Branchiostoma floridae] gi|229299009|gb|EEN69632.1| hypothetical protein BRAFLDRAFT_93664 [Branchiostoma floridae] Length = 655 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 66/182 (36%), Gaps = 27/182 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM+++LD S S+ G + + T + + L ++ V +G F++ Sbjct: 301 DMLILLDTSGSVE---GRSLSLMKHTTWFLLDRLTE-------DDYVATG--YFNAYAQA 348 Query: 230 TFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L + I + ++ L GLEYA+ KIF+ E + + Sbjct: 349 VSCLSSFVQATTHNKEVIHKSLDNLEAADQANYYAGLEYAF-KIFNNFEMEDRFENQGAE 407 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-QFLKNCASPDR 341 K I+ +T+ P ++ R V+ I V D L+ A +R Sbjct: 408 CNKVIVLVTENAELYPEAVFQKYNP------DRNIRVFVIVVGEPIHDWSVLQKMACDNR 461 Query: 342 FY 343 Y Sbjct: 462 GY 463 >gi|114652501|ref|XP_001171019.1| PREDICTED: coagulation factor C homolog, cochlin isoform 4 [Pan troglodytes] Length = 569 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 349 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 407 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 408 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 456 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 457 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 503 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 504 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 535 >gi|298249212|ref|ZP_06973016.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297547216|gb|EFH81083.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 420 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 64/183 (34%), Gaps = 21/183 (11%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D S SM+ K+ A +++ I + +VTF+ F A Sbjct: 45 DRSGSMDGA------KMRAARDGAVKVVQAIDASMQFM------VVTFNDNARIIFGPAA 92 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 G++ + + I S + A N I D + A I+FLTDG+N Sbjct: 93 GIEENKNRAIAAIQTVYAASGTRMSTALNTIVDKFGNNQSRATR-------ILFLTDGKN 145 Query: 296 -SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 P + ++ C+ A + +G +AA+ A+ + ++ A Sbjct: 146 EGEPRVALDRAVARCSAA-NISISAWGVGTDWDAAELLHMAEATRGSADIIPTPNQVEAA 204 Query: 355 FLR 357 F Sbjct: 205 FSS 207 >gi|222101620|gb|ACM44015.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 71/194 (36%), Gaps = 18/194 (9%) Query: 134 EMPFIFCTFPWCANSSH-APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 +P + F A I S K LD +V+D S S+++ G Sbjct: 10 SVPLLSLAFLATTGIHAFADKGIGSPKGKQCKKQ--LDFSIVVDESASISNDQWGGQ--- 64 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS 251 +R ++ + N +R L T+S+ Q F L + + +L + + Sbjct: 65 --MIPFLRNLIHTVDL---DNTDIRLSLTTYSTPTRQIFTFLDAAASSTRLALTKLDWMA 119 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TK+ G+ Y + ++ + G + K ++ +TDG +S + + Sbjct: 120 GTKARSGMTYTGRALNYVRK--AILPYGRKNVPKALLLITDGVSSDGSY----TAQVAAM 173 Query: 312 AKRRGAIVYAIGVQ 325 + G V IGV Sbjct: 174 LRDEGVNVMVIGVG 187 >gi|120603653|ref|YP_968053.1| von Willebrand factor type A [Desulfovibrio vulgaris DP4] gi|120563882|gb|ABM29626.1| von Willebrand factor, type A [Desulfovibrio vulgaris DP4] Length = 533 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 61/198 (30%), Gaps = 40/198 (20%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT-- 230 ++LD S SM+ ++ V GL F + Sbjct: 365 ILLDSSGSMSGCMTLASTACHATASAL------------AACGVNVGLTAFPGHYLTAQA 412 Query: 231 -----FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL Q + K+N + G T L + + E K Sbjct: 413 NWASVSPLIRHGQRVHPKVN-VTSGGGTPLAESLWWTMQTMLPLPESR-----------K 460 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 I+ +TDG+ S + + A R G VY IG+ + A L P + Sbjct: 461 LILIITDGDPDSGVQAEEALVG----AARAGFEVYGIGIISTAILSLL-----PGNSLVI 511 Query: 346 QNSRKLHDAFLRIGKEMV 363 + +L A + ++ + Sbjct: 512 SSMGELAPAMFTLLQKAM 529 >gi|73970088|ref|XP_531792.2| PREDICTED: hypothetical protein XP_531792 [Canis familiaris] Length = 1465 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 54/171 (31%), Gaps = 32/171 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 710 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKRKFNFVQFDAQAVAWQEKLV 762 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ D + I L Sbjct: 763 EINEDNLRGAQSWIRDIQIGSSTNTLHALQIAFA----------------DKETQVIYLL 806 Query: 291 TDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P + K + +Y I A+ FLK AS Sbjct: 807 TDGRPDQP------PEMVIEQVKVFQKIPIYTISFNYNDEIANGFLKELAS 851 >gi|46580450|ref|YP_011258.1| hypothetical protein DVU2043 [Desulfovibrio vulgaris str. Hildenborough] gi|46449869|gb|AAS96518.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311234196|gb|ADP87050.1| von Willebrand factor type A [Desulfovibrio vulgaris RCH1] Length = 533 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 61/198 (30%), Gaps = 40/198 (20%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT-- 230 ++LD S SM+ ++ V GL F + Sbjct: 365 ILLDSSGSMSGCMTLASTACHATASAL------------AACGVNVGLTAFPGHYLTAQA 412 Query: 231 -----FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL Q + K+N + G T L + + E K Sbjct: 413 NWASVSPLIRHGQRVHPKVN-VTSGGGTPLAESLWWTMQTMLPLPESR-----------K 460 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 I+ +TDG+ S + + A R G VY IG+ + A L P + Sbjct: 461 LILIITDGDPDSGVQAEEALVG----AARAGFEVYGIGIISTAILSLL-----PGNSLVI 511 Query: 346 QNSRKLHDAFLRIGKEMV 363 + +L A + ++ + Sbjct: 512 SSMGELAPAMFTLLQKAM 529 >gi|303327225|ref|ZP_07357667.1| hemolysin-type calcium-binding region [Desulfovibrio sp. 3_1_syn3] gi|302863213|gb|EFL86145.1| hemolysin-type calcium-binding region [Desulfovibrio sp. 3_1_syn3] Length = 1149 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 48/282 (17%), Positives = 84/282 (29%), Gaps = 44/282 (15%) Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL 154 EL+ D + S + + + +Q ++S + + + Sbjct: 402 ELKFETRDSDGDIASTSAKVPLEVVEQTTTEGGDSISNSD--DVINIAGGDGVAGTLVAG 459 Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 T V + ++ VLD S SM+ +LGVAT+SI + D Sbjct: 460 DTGGVTEGQQVGSNYNVCFVLDTSGSMDGAVSGHETRLGVATQSIENFIKNSIHEGDFVG 519 Query: 215 VVRSGLVTFSSKIVQTFPL-----------AWGVQHIQEK----------INRLIFGSTT 253 V +V F+S+ + +G + + L T Sbjct: 520 TVNLAVVPFASEAGSVIKVSITKTAQGERYTFGEEVYDNYADFSKAFETSLGNLNANGGT 579 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC--NE 311 G A + + A G+ Y FL+DG + + Y ++ Sbjct: 580 NYEAGFSNAADWFNGLEGTS--NATGNITY-----FLSDGVPTYHGTSSYGGGNYATLDD 632 Query: 312 AK------------RRGAIVYAIGVQAEAADQFLKNCASPDR 341 K V AIG + D+ +K A D Sbjct: 633 VKGAWDGYQELLGSAANMQVNAIGFGKDLDDKAMKTLAMLDN 674 >gi|257886572|ref|ZP_05666225.1| predicted protein [Enterococcus faecium 1,141,733] gi|257822626|gb|EEV49558.1| predicted protein [Enterococcus faecium 1,141,733] Length = 689 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 88/253 (34%), Gaps = 56/253 (22%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + +D++MV+D S SM KL A + ++E + + + N +R G+V + Sbjct: 107 QLKKPIDLVMVIDYSSSMTGE------KLSNALKGLQEFGEELDDSLESGN-IRIGIVAY 159 Query: 224 SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTT---------------KSTPGLEYAYNKIF 267 + + T + ++ + N + T KS P E I Sbjct: 160 NRFVYSTDDFLTDINQLEYFLRNTAESHTGTFMQKGLLEGQSLLEEKSRPEAEKMLVHIG 219 Query: 268 DAKEKLEHIAKGHDDY---------------KKYII-FLTD-------GENSSPNIDNKE 304 D ++ K + +Y+ F TD G ++ PN + Sbjct: 220 DDSANRSYLPKENAQVFHNSGEIVDYNGYHTDQYVTEFQTDSEKYQTSGSSTDPNAVSVS 279 Query: 305 SLFYCNE-------AKRRGAIVYAIGVQAEAADQFLKN--CASPDRFYSV-QNSRKLHDA 354 S + K G Y++ + +++ +SP+ + S+ +N L +A Sbjct: 280 SSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLSIDENLTGLGNA 339 Query: 355 FLRIGKEMVKQRI 367 I M K + Sbjct: 340 LKEIANGMDKTIV 352 >gi|149921114|ref|ZP_01909572.1| hypothetical protein PPSIR1_24789 [Plesiocystis pacifica SIR-1] gi|149818001|gb|EDM77460.1| hypothetical protein PPSIR1_24789 [Plesiocystis pacifica SIR-1] Length = 389 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 76/254 (29%), Gaps = 49/254 (19%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIK 207 + + + +I ++++VLD S SM D G + S+ + +D + Sbjct: 71 GMPTCETTTATVN-NIPPNVVLVLDKSRSMVVNAWDDDGNPDTEDVTRWHSLHDTVDTVG 129 Query: 208 SIPDVNNVVRSGLVTFSSK-----------IVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 + + GL F S + + + G+ + + + + T Sbjct: 130 H--QYQDGMSLGLTLFPSVDAESSFDGACPVNEVPEVGVGLGNAEALLAAIPAADDTD-- 185 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP----------NIDNKESL 306 L A L H+ D +I +TDG + D L Sbjct: 186 --LHGATPAAAGIATALAHLEALEDGRPAAMILVTDGAANCSAGANDITKFSQYDEDLPL 243 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF--------LKNCASPDR---------FYSVQNSR 349 + R G Y +G+ + + + L A FY +++ Sbjct: 244 VVADAWDRAGIPTYVVGIDIQESSEHPFTNPREKLHEVAEAGGVARSDGEVGFYDAGDAQ 303 Query: 350 KLHDAFLRIGKEMV 363 L A I + Sbjct: 304 ALTAALDEIAASVS 317 >gi|149920555|ref|ZP_01909022.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149818599|gb|EDM78045.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 820 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 67/202 (33%), Gaps = 35/202 (17%) Query: 145 CANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 ANS +L ++ + S + D++++LD S SM G + T ++ L Sbjct: 277 AANSYGRLVLTPPPIEPGREVSAVPRDLIVLLDTSGSMR---GEPLAHAQAVTEALIRSL 333 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKI----VQTFPLAWG-VQHIQEKINRLIFGSTTKSTPG 258 + R LV FSS++ ++ + + L T G Sbjct: 334 ---------RDRDRLELVEFSSRVRRWSQAPASMSAAKREEALRWVGALRASGGTHMRDG 384 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + A + + ++ I+ +TDG E + + + G Sbjct: 385 ILAALASLRP-------------EAQRQILLITDGL----IAFESEIVQAARQHRPPGCR 427 Query: 319 VYAIGVQAEAADQFLKNCASPD 340 V+ +G+ + + A Sbjct: 428 VHTLGIGSSVNRSLTRPVALAG 449 >gi|75909292|ref|YP_323588.1| protoporphyrin IX magnesium-chelatase [Anabaena variabilis ATCC 29413] gi|75703017|gb|ABA22693.1| protoporphyrin IX magnesium-chelatase [Anabaena variabilis ATCC 29413] Length = 678 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 67/205 (32%), Gaps = 40/205 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ V+D S SM ++++ A ++ ++L N + L+ F Sbjct: 477 KAGALVVFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQNRDQVALIPFRG 525 Query: 226 K-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + + ++ RL G + GL A +A+ G D + Sbjct: 526 EQAEVLLPPTRSIALARNRLERLPCGGGSPLAHGLTQAVRVGVNAQ-------MGGDVGQ 578 Query: 285 KYIIFLTDG--------------ENSSPNIDNKESLFYCNEAKRRGAIVYAI-----GVQ 325 I+ +TDG E+ E L + G + I V Sbjct: 579 VVIVAITDGRGNIPLSRSLGEPQESGEKPDIKAELLDIAARIRALGMQLLVIDTESKFVS 638 Query: 326 AEAADQFLKNCASPDRFYSVQNSRK 350 A + K S ++Y + + Sbjct: 639 TGFAKELAKT--SGGKYYHLPKATD 661 >gi|260900623|ref|ZP_05909018.1| tetratricopeptide repeat protein [Vibrio parahaemolyticus AQ4037] gi|308107063|gb|EFO44603.1| tetratricopeptide repeat protein [Vibrio parahaemolyticus AQ4037] Length = 665 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 57/200 (28%), Gaps = 30/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F W S S ++ + M+VLD+S SM ++L Sbjct: 54 FTIWGLAWTIACVALAGPSWQSNTRPS-FELSQNRMLVLDMSRSMYAS-DIKPNRLAQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GL+ ++ PL + I L+ Sbjct: 112 YKALDLLPKWKEGA-------TGLIVYAGDAYSLSPLTTDASTLAGIIENLSPELMPFQG 164 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + +E + ++ A I+ L D ++D+ E + Sbjct: 165 SNLPAAIELSLSQFSQAGANQGD-----------IVVLAD------DLDDSELARSLDLV 207 Query: 313 KRRGAIVYAIGVQAEAADQF 332 K + V + + Sbjct: 208 KGKNIRVSVLAIGTANGAPI 227 >gi|260895272|ref|ZP_05903768.1| tetratricopeptide repeat protein [Vibrio parahaemolyticus Peru-466] gi|308085683|gb|EFO35378.1| tetratricopeptide repeat protein [Vibrio parahaemolyticus Peru-466] Length = 631 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 57/200 (28%), Gaps = 30/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F W S S ++ + M+VLD+S SM ++L Sbjct: 54 FTIWGLAWTIACVALAGPSWQSNTRPS-FELSQNRMLVLDMSRSMYAS-DIKPNRLAQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GL+ ++ PL + I L+ Sbjct: 112 YKALDLLPKWKEGA-------TGLIVYAGDAYSLSPLTTDASTLAGIIENLSPELMPFQG 164 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + +E + ++ A I+ L D ++D+ E + Sbjct: 165 SNLPAAIELSLSQFSQAGANQGD-----------IVVLAD------DLDDSELARSLDLV 207 Query: 313 KRRGAIVYAIGVQAEAADQF 332 K + V + + Sbjct: 208 KGKNIRVSVLAIGTANGAPI 227 >gi|260880155|ref|ZP_05892510.1| tetratricopeptide repeat protein [Vibrio parahaemolyticus AN-5034] gi|308092400|gb|EFO42095.1| tetratricopeptide repeat protein [Vibrio parahaemolyticus AN-5034] Length = 621 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 57/200 (28%), Gaps = 30/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F W S S ++ + M+VLD+S SM ++L Sbjct: 54 FTIWGLAWTIACVALAGPSWQSNTRPS-FELSQNRMLVLDMSRSMYAS-DIKPNRLAQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GL+ ++ PL + I L+ Sbjct: 112 YKALDLLPKWKEGA-------TGLIVYAGDAYSLSPLTTDASTLAGIIENLSPELMPFQG 164 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + +E + ++ A I+ L D ++D+ E + Sbjct: 165 SNLPAAIELSLSQFSQAGANQGD-----------IVVLAD------DLDDSELARSLDLV 207 Query: 313 KRRGAIVYAIGVQAEAADQF 332 K + V + + Sbjct: 208 KGKNIRVSVLAIGTANGAPI 227 >gi|260362867|ref|ZP_05775736.1| tetratricopeptide repeat protein [Vibrio parahaemolyticus K5030] gi|308112067|gb|EFO49607.1| tetratricopeptide repeat protein [Vibrio parahaemolyticus K5030] Length = 613 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 57/200 (28%), Gaps = 30/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F W S S ++ + M+VLD+S SM ++L Sbjct: 54 FTIWGLAWTIACVALAGPSWQSNTRPS-FELSQNRMLVLDMSRSMYAS-DIKPNRLAQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GL+ ++ PL + I L+ Sbjct: 112 YKALDLLPKWKEGA-------TGLIVYAGDAYSLSPLTTDASTLAGIIENLSPELMPFQG 164 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + +E + ++ A I+ L D ++D+ E + Sbjct: 165 SNLPAAIELSLSQFSQAGANQGD-----------IVVLAD------DLDDSELARSLDLV 207 Query: 313 KRRGAIVYAIGVQAEAADQF 332 K + V + + Sbjct: 208 KGKNIRVSVLAIGTANGAPI 227 >gi|162449863|ref|YP_001612230.1| hypothetical protein sce1592 [Sorangium cellulosum 'So ce 56'] gi|161160445|emb|CAN91750.1| hypothetical protein sce1592 [Sorangium cellulosum 'So ce 56'] Length = 368 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 68/197 (34%), Gaps = 23/197 (11%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 M+++D+S S+ + + + + D + + G+ F ++ Sbjct: 96 MLLVDMSGSITES-----GQADALVDAAQSFSDRVGKSQ------KVGVYAFDGEVKIHS 144 Query: 232 --PLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P +Q + L ++T G+ ++ +K K Sbjct: 145 VVPFTEAQGSVQGGLEGLRSYKPKDTSTNLHGGVVEGIRELKKQLDKDRRPLKFGT---- 200 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ +DG + + + ++ L + + ++ +GV AE L + Sbjct: 201 -LVVFSDGTDRANRVSREDMLNELKKEEYENYQIFVVGVGAEIEKARLDEIGRDGTELAA 259 Query: 346 QNSRKLHDAFLRIGKEM 362 + K+ ++F +I ++ Sbjct: 260 DQA-KVKESFDKIAAKI 275 >gi|109077204|ref|XP_001095246.1| PREDICTED: integrin alpha-2 [Macaca mulatta] Length = 1180 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 74/214 (34%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 173 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 221 Query: 229 QTFPLAWGVQHIQEKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L +E++ G T + ++YA + A + G Sbjct: 222 VVFNL--NTYKTKEEMIVATSQTSQHGGDLTNTFGAIQYARKYAYSAA------SGGRRS 273 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---L 333 K ++ +TDGE+ ++ K + CN + + I V A + Sbjct: 274 ATKVMVVVTDGESHDGSM-LKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEI 329 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 K AS F++V + L + +G+++ Sbjct: 330 KAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 363 >gi|91791025|ref|YP_551976.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91700905|gb|ABE47078.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 753 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 65/206 (31%), Gaps = 40/206 (19%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS---IREMLDIIKSIPDVNNVVR 217 SS I ++LD S SM+ +++ I ++ I+ P V Sbjct: 569 TSSVCGIDAAATILLDRSGSMSRCIVEAAGAALSCSQALERISKVKTSIEMFPGYAKCV- 627 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 V + + ++N + T L+ ++ Sbjct: 628 -------GNTVALQAFGQSARQVARRVNEVDAEGGTPLAEALQEVMPRLL---------- 670 Query: 278 KGHDDYKKYIIFL-TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA----ADQF 332 KK I+FL TDG + + +L +A++ G IG+ Sbjct: 671 --AQRVKKRIVFLVTDGIPN----NRPGALEEIGKAEKLGVEFVGIGIGVHGRAIEGLTP 724 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRI 358 C + ++ +L DAF ++ Sbjct: 725 FSIC--------INDASELPDAFEKL 742 >gi|28901310|ref|NP_800965.1| hypothetical protein VPA1455 [Vibrio parahaemolyticus RIMD 2210633] gi|28809857|dbj|BAC62798.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 618 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 57/200 (28%), Gaps = 30/200 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F W S S ++ + M+VLD+S SM ++L Sbjct: 54 FTIWGLAWTIACVALAGPSWQSNTRPS-FELSQNRMLVLDMSRSMYAS-DIKPNRLAQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GL+ ++ PL + I L+ Sbjct: 112 YKALDLLPKWKEGA-------TGLIVYAGDAYSLSPLTTDASTLAGIIENLSPELMPFQG 164 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + +E + ++ A I+ L D ++D+ E + Sbjct: 165 SNLPAAIELSLSQFSQAGANQGD-----------IVVLAD------DLDDSELARSLDLV 207 Query: 313 KRRGAIVYAIGVQAEAADQF 332 K + V + + Sbjct: 208 KGKNIRVSVLAIGTANGAPI 227 >gi|194367003|ref|YP_002029613.1| TPR repeat-containing protein [Stenotrophomonas maltophilia R551-3] gi|194349807|gb|ACF52930.1| TPR repeat-containing protein [Stenotrophomonas maltophilia R551-3] Length = 612 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 65/214 (30%), Gaps = 39/214 (18%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +++VLD+S + P L V + + + + GLV ++ Sbjct: 97 QASAPLLVVLDLSSRITATDLPPSRLLQVRAK--------VGELLRARQGGQVGLVVYAD 148 Query: 226 KIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 PL ++ ++ L + ++ G+++A + Sbjct: 149 DAYTVAPLTDDGSNVALYLDALSPEVMPRDGQRADRGIDWATRLMRQIGALRGQ------ 202 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I+ +TD D + L +A+ G V +G+ A + S Sbjct: 203 -----ILLVTD------QADGEAGLAAA-QARSLGLQVSVLGLGTPAGAAYRDG--SGQI 248 Query: 342 FYSVQNSRKLHDA-------FLRIGKEMVKQRIL 368 + + L + RI + R L Sbjct: 249 RQAALDEASLRAVVTAGGGRYARIAADDSDLRAL 282 >gi|186681556|ref|YP_001864752.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186464008|gb|ACC79809.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 224 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 61/181 (33%), Gaps = 21/181 (11%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 H L + V+ + + +++LD S SM D + + + + D + Sbjct: 2 HDTLRLDEVVEFAENPEPRCPCVLLLDTSGSMQG------DPIEALNQGLLSLKDELVKN 55 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIF 267 V +VTF S + ++ N L T G+ A + I Sbjct: 56 SLAARRVEVAIVTFDSNVNVVQDF-----VTADQFNPPILTAQGLTTMGAGIHKALDIIQ 110 Query: 268 DAKEKLEHIAKGHDDYKKYIIFL-TDGENSS---PNIDNKESLFYCNEAKRRGAIVYAIG 323 D + Y + +F+ TDGE ++ +EA +R + +G Sbjct: 111 D---RKSQYRTNGIAYYRPWVFMITDGEPQGELENVVEQASVRLQGDEANKR-VAFFTVG 166 Query: 324 V 324 V Sbjct: 167 V 167 >gi|323350757|ref|ZP_08086417.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] gi|322123037|gb|EFX94736.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] Length = 458 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 51/343 (14%), Positives = 119/343 (34%), Gaps = 58/343 (16%) Query: 19 ILTAILLPVIFIVMGLVI------------ETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 I+ +L+ +I +++G++ E S + ++ + Y +D ++ + + Sbjct: 15 IMAIMLMSMIALIIGVIFNTMFSSRELIEREASIQAEMRTSMQY-VDRTVGKATSIFILD 73 Query: 67 ENGNNGKKQK----------NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 ++ G KQ + +++ +W + ++ + + N++ Sbjct: 74 DSKFKGSKQGLTREWSYIGLSADGKKVLNYVWNKEKQDWDVSELGTKSLYNMKLDLEFKT 133 Query: 117 ---IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI-----------S 162 D++ YNL+ +Y ++ + + + K Sbjct: 134 EGAYQDNRLISYNLT--GKYPDTNSKLGIDTAISALNTKQVFSKVAKGKKGIAIAYRTDP 191 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGM------DKLGVATRSIREMLDIIKSIPDVNNVV 216 + + + + V D+S SM ++ + M++ ++S+ +V+ + Sbjct: 192 IQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINELQSVGNVSVNL 251 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEH 275 + T S K L I+E I L T GL Y + +L Sbjct: 252 TTFSTTGSYKQAAFSQLDREAGTIKESIKNLKSDGGVTNPGDGLRYGMVSLQKQHAQL-- 309 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 KY++ LTDG ++ + N++ E KR G Sbjct: 310 ---------KYVVLLTDGVPNAY-LVNQQGQAGGLEMKREGIQ 342 >gi|303325524|ref|ZP_07355967.1| OmpA family protein [Desulfovibrio sp. 3_1_syn3] gi|302863440|gb|EFL86371.1| OmpA family protein [Desulfovibrio sp. 3_1_syn3] Length = 332 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 64/176 (36%), Gaps = 26/176 (14%) Query: 170 DMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 V+D S SM + DK+ VA + + ++ D N + + ++ + I+ Sbjct: 34 SFDFVVDYSGSMMMQNKQLKQDKIVVAKN-VLQRVNAAIPALDYNGGLHT--ISPNGMII 90 Query: 229 QTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P W + I++L IFG T GL+ I K Sbjct: 91 AQGP--WDRNAMSVGIDKLRSGFQIFGRMTSMGNGLQKYEPFISSMKRDAA--------- 139 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCAS 338 +I +TDG+N+ + + A +R +++ I + +K A+ Sbjct: 140 ---LILVTDGDNNRGTDIVEVARQL--YASQRNMVIHIISFADTPHGEAVIKEIAA 190 >gi|301607027|ref|XP_002933125.1| PREDICTED: collagen alpha-1(VII) chain-like [Xenopus (Silurana) tropicalis] Length = 2671 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/130 (20%), Positives = 55/130 (42%), Gaps = 15/130 (11%) Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 G + +Q N G T++ GL YA + F I + + K I +TDG+ Sbjct: 77 NGTELVQAIRNLGYKGGNTRTGTGLRYAADNFFGP-----TIIRPNVP--KVAILITDGK 129 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLH 352 + + L K +G ++A+G++ + + + ++P D F+ V + R L Sbjct: 130 SQDDVDPPTQRL------KSQGIKMFAVGIKNADSRELTRVASTPTEDFFFYVNDFRILG 183 Query: 353 DAFLRIGKEM 362 + +++ Sbjct: 184 SLLPVVTRKV 193 >gi|90577458|ref|ZP_01233269.1| putative hemagglutinin/hemolysin-related protein [Vibrio angustum S14] gi|90440544|gb|EAS65724.1| putative hemagglutinin/hemolysin-related protein [Vibrio angustum S14] Length = 1679 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 66/201 (32%), Gaps = 41/201 (20%) Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG----- 236 +D+F + +A +S + + I + + + +VTFSS + + Sbjct: 1244 SDNFNSLPSLIDMAKKSYQTLTSSIIDSVEDKSKITFNMVTFSSDVKGNTSFHYDETSKT 1303 Query: 237 -----VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 Q I I+ L+ G T+ L I D + I FL+ Sbjct: 1304 FVNDQHQTINNYIDSLVAGGGTQFEGALSDISRHITDPSMRN------------VIYFLS 1351 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA------------SP 339 DG++ + F +G + +I V A + A +P Sbjct: 1352 DGKDED-KFHPQGIHFL------KGTEIVSIAVGPSADATQINQIAQMGTGYDHNNPNAP 1404 Query: 340 DRFYSVQNSRKLHDAFLRIGK 360 + N+ +L F IG+ Sbjct: 1405 SYSKIITNANELDGVFHNIGQ 1425 >gi|312133570|ref|YP_004000909.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311772822|gb|ADQ02310.1| Hypothetical protein BBMN68_1309 [Bifidobacterium longum subsp. longum BBMN68] Length = 362 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 58/188 (30%), Gaps = 21/188 (11%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG---LVTFSSKIVQ 229 V+D S SM+ G+ + + LD ++ G L+ F ++ + Sbjct: 186 WVVDYSGSMSGEGKNGV------VKGLNAALDPDQAKKSYIEPASGGVNILIPFETEAHR 239 Query: 230 TFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + + T L A +++ E ++ I+ Sbjct: 240 PVKATGTSTSDLLHEADATDASGGTDIYEVLLSALDELPSESEASQYTTA--------IV 291 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 +TDG N D+++ +++ R +++I Q + S Sbjct: 292 LMTDG---RSNSDHQDEFESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRS 348 Query: 349 RKLHDAFL 356 L F Sbjct: 349 GDLAAVFR 356 >gi|299145606|ref|ZP_07038674.1| conserved hypothetical protein [Bacteroides sp. 3_1_23] gi|298516097|gb|EFI39978.1| conserved hypothetical protein [Bacteroides sp. 3_1_23] Length = 289 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTNIRLALEYLTNVMK 173 >gi|260170241|ref|ZP_05756653.1| hypothetical protein BacD2_00075 [Bacteroides sp. D2] gi|315918604|ref|ZP_07914844.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313692479|gb|EFS29314.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 289 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTNIRLALEYLTNVMK 173 >gi|237716503|ref|ZP_04546984.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262408101|ref|ZP_06084649.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645095|ref|ZP_06722821.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a] gi|294809496|ref|ZP_06768199.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b] gi|229444150|gb|EEO49941.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262354909|gb|EEZ04001.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639601|gb|EFF57893.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a] gi|294443314|gb|EFG12078.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b] gi|295084187|emb|CBK65710.1| Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [Bacteroides xylanisolvens XB1A] Length = 289 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTNIRLALEYLTNVMK 173 >gi|149918752|ref|ZP_01907239.1| hypothetical protein PPSIR1_31758 [Plesiocystis pacifica SIR-1] gi|149820353|gb|EDM79769.1| hypothetical protein PPSIR1_31758 [Plesiocystis pacifica SIR-1] Length = 325 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 42/116 (36%), Gaps = 10/116 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + L +M+++D+S S++ + + R++ L + Sbjct: 74 NVTARTGEPHVKLFSEERDLTVMLLVDMSASLD------LGSTVASKRNLVARLAATFAF 127 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEY 261 + N R GL+ F+ ++ P G +H+ + ++ T L+ Sbjct: 128 SAIRNNDRVGLIGFTDRVEVFVPPRSGRKHVLSVVQQILTHRPSSRRTDVGVALQT 183 >gi|326666584|ref|XP_687953.4| PREDICTED: collagen alpha-1(VII) chain [Danio rerio] Length = 2001 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 59/174 (33%), Gaps = 24/174 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S GP I +++ + +R G+ +S Sbjct: 39 DIVFLVDDS----WSVGP--TSFQQIKEFIADIIRAFQGNVFGQEGIRFGVTVYSDLPRM 92 Query: 230 TFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + E + + G ++++ LE+ +F + K Sbjct: 93 RIALT-DYSTLDEVLRAVEDVPYEGGSSRTGLALEFLEESVFSPS-------IIRESAPK 144 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +T+G++ D +++ G ++A+GV+ + K P Sbjct: 145 IAVLITNGQSDDQVDDPAKAVA------DSGISLFAVGVRNADQSELKKIVTEP 192 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 61/188 (32%), Gaps = 27/188 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G + G + + SI + +V +S + Sbjct: 960 DIVFLVDESWS------IGTNNFGKLKDFLFRTVTYFPSIGP--KGTQIAVVHYSDQPRI 1011 Query: 230 TFPLAW--GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + + +G TK+ G+ Y ++F + Sbjct: 1012 EFNFNTHKDRNSVLRALREVRYGGGNTKTGRGISYVLREMFQES------LGMRQEAPHV 1065 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF---Y 343 ++ LTDG A G V IG+ A A + ++ ASP + + Sbjct: 1066 LVLLTDGRAQDDVEPPSR------IAHALGVSVLVIGI-AHADMEEVRTIASPTTYKNIF 1118 Query: 344 SVQNSRKL 351 + L Sbjct: 1119 YASDFDDL 1126 >gi|261822921|ref|YP_003261027.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261606934|gb|ACX89420.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 212 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 58/168 (34%), Gaps = 12/168 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + I+ ML ++ P V ++T+ + Sbjct: 3 RLPVYLLIDTSGSMRGE------SIHAVNVGIQAMLSALRQDPYALESVHISIITYDNDA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL ++ Q + T + LE + ++ + KG + Sbjct: 57 REFIPLT-PLEDFQFTDIVVPSAGGTFTGAALECLIQSVDRDIKRSDGDQKGDWRP--LV 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +TDG S ++ KR + A V +A + LK Sbjct: 114 FLMTDGSPSDAYAYDEAVTEV---KKRAFGSIIACAVGPKAKHEHLKK 158 >gi|167535296|ref|XP_001749322.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772188|gb|EDQ85843.1| predicted protein [Monosiga brevicollis MX1] Length = 415 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 55/156 (35%), Gaps = 23/156 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + + +D S SM GP M T S+ L ++ L+TF Sbjct: 18 QHEGRTHLTICMDCSGSM---MGPKMTHAREGTLSLYANLHPGDTVE---------LITF 65 Query: 224 SSKIVQTFP--LAWGVQ--HIQEKINRLIFGSTTKSTPGLEYAYNKI-------FDAKEK 272 SS + P L + R+ +T + + + ++ Sbjct: 66 SSMVATAIPRVLKDDSTDDRFAAAVQRMCARGSTAFYDAILKGLESLSRADALRGNDQKA 125 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 A+ K+ ++ +TDGE+++ + +++ Sbjct: 126 KADQAERTVSTKRVLVVVTDGEDTASHRALSDAVHA 161 >gi|94732541|emb|CAK05117.1| novel protein similar to vertebrate collagen family [Danio rerio] Length = 1721 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 59/174 (33%), Gaps = 24/174 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S GP I +++ + +R G+ +S Sbjct: 8 DIVFLVDDS----WSVGP--TSFQQIKEFIADIIRAFQGNVFGQEGIRFGVTVYSDLPRM 61 Query: 230 TFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + E + + G ++++ LE+ +F + K Sbjct: 62 RIALT-DYSTLDEVLRAVEDVPYEGGSSRTGLALEFLEESVFSPS-------IIRESAPK 113 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +T+G++ D +++ G ++A+GV+ + K P Sbjct: 114 IAVLITNGQSDDQVDDPAKAVA------DSGISLFAVGVRNADQSELKKIVTEP 161 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 61/188 (32%), Gaps = 27/188 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G + G + + SI + +V +S + Sbjct: 849 DIVFLVDESWS------IGTNNFGKLKDFLFRTVTYFPSIGP--KGTQIAVVHYSDQPRI 900 Query: 230 TFPLAW--GVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F + + + +G TK+ G+ Y ++F + Sbjct: 901 EFNFNTHKDRNSVLRALREVRYGGGNTKTGRGISYVLREMFQES------LGMRQEAPHV 954 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF---Y 343 ++ LTDG A G V IG+ A A + ++ ASP + + Sbjct: 955 LVLLTDGRAQDDVEPPSR------IAHALGVSVLVIGI-AHADMEEVRTIASPTTYKNIF 1007 Query: 344 SVQNSRKL 351 + L Sbjct: 1008 YASDFDDL 1015 >gi|301762312|ref|XP_002916580.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Ailuropoda melanoleuca] Length = 3529 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 80/246 (32%), Gaps = 42/246 (17%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGPGMDK 191 +PF SS + +K+ ++ ++D S S+ + + Sbjct: 3 SLPFNKIQSAANEACLTTSADFASSSQKLAKTPGKEAQRLVFLVDESSSVGQ--ANFLSE 60 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH---------IQE 242 L +R++L +P R +VTFSSK + + + Sbjct: 61 LK----FVRKLLSDFPVVP---TATRVAIVTFSSKNNVVPRVDYISHRRAHQHKCALLSR 113 Query: 243 KINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 +I + + G T + + A + ++E K I +TDG ++ Sbjct: 114 EIPAITYRGGGTYTKGAFQQAAQILRHSRENS----------TKVIFLITDGYSNGG--- 160 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYSVQNSRKLHDAFLRIG 359 + + G ++ G+ + ++P + Y + + + F + Sbjct: 161 --DPRPVAASLRDFGVEIFTFGIWQGNIRELNDMASTPKEEHCYLLHSFEE----FEALA 214 Query: 360 KEMVKQ 365 + + + Sbjct: 215 RRALHE 220 >gi|162420405|ref|YP_001605638.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165928280|ref|ZP_02224112.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165937836|ref|ZP_02226397.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|166009072|ref|ZP_02229970.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|167398987|ref|ZP_02304511.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167421939|ref|ZP_02313692.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167426433|ref|ZP_02318186.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270487811|ref|ZP_06204885.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|162353220|gb|ABX87168.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165914248|gb|EDR32864.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|165919722|gb|EDR37055.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165992411|gb|EDR44712.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166960076|gb|EDR56097.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167051491|gb|EDR62899.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167054656|gb|EDR64463.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270336315|gb|EFA47092.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] Length = 222 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 63/171 (36%), Gaps = 12/171 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + I+ M+ ++ P V ++T+ ++ Sbjct: 13 RLPVYLLIDTSGSMRGE------SIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQA 66 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL +++ Q + T + LE + + ++ + KG + Sbjct: 67 REYIPLT-ALENFQFTDITVPSAGGTFTGAALECLIHCVDRDIQRSDGDQKGDWRP--LV 123 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 +TD S+P+ KR + A V A+A + LK S Sbjct: 124 FLMTD---STPSDVYAYGEAIKEVKKRAFGSIIACAVGAKAKHEHLKQLTS 171 >gi|268530440|ref|XP_002630346.1| C. briggsae CBR-CLEC-62 protein [Caenorhabditis briggsae] Length = 386 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 48/146 (32%), Gaps = 17/146 (11%) Query: 213 NNVVRSGLVTFSSKIVQTFPL------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 VR GLVT++++ L + + + +L GL+ A + + Sbjct: 72 KRTVRVGLVTYNNQATVQADLNRFQSADDLFNSVFQILPKLSASDEVYLAKGLDAAESVL 131 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ + +I+ +D D + + K G + + Sbjct: 132 SAGRKNATRSNYK----QLVLIYASD-YRDDGEEDPRPT---AERMKSSGVSIATVAFDQ 183 Query: 327 EAADQFLKN---CASPDRFYSVQNSR 349 + +K ASP ++ +++ Sbjct: 184 TGNEGVVKAIGEIASPGFNFTNEDAD 209 >gi|332291976|ref|YP_004430585.1| protein of unknown function DUF58 [Krokinobacter diaphorus 4H-3-7-5] gi|332170062|gb|AEE19317.1| protein of unknown function DUF58 [Krokinobacter diaphorus 4H-3-7-5] Length = 288 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 65/200 (32%), Gaps = 25/200 (12%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + S V +Y+ W + + + + + L MM+V DVS S Sbjct: 34 KGRGMTFSEVRQYQFGDDVRNIDWNVTARY-----SEPYIKVFEEERELTMMLVADVSGS 88 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 + FG I + + + N + GL+ F+ +I P G H+ Sbjct: 89 --EFFGTD----KQFKSEIVTEVAATLAFSAMQNNDKIGLILFTDEIELFIPPKKGKSHV 142 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 I L+ L A + + +K + D F+TDG + I Sbjct: 143 LRIIRELLEFKPKSKKTDLSQAIKYLSNVMKKKAIVFVLSD-------FITDGYEQTMKI 195 Query: 301 DNKESLFYCNEAKRRGAIVY 320 N+ G +Y Sbjct: 196 -------AANKHDITGIRIY 208 >gi|308502682|ref|XP_003113525.1| hypothetical protein CRE_26515 [Caenorhabditis remanei] gi|308263484|gb|EFP07437.1| hypothetical protein CRE_26515 [Caenorhabditis remanei] Length = 861 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 70/199 (35%), Gaps = 28/199 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++ D+S S++ P + + D N+ R G++TF+ + Sbjct: 370 LDIIIAFDISESLSRIILPKYVAFAKRIVAQYKYKD--------NDFTRVGVLTFNDIVT 421 Query: 229 QTFPLAWGV--QHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + L GV I I+ + G T T L+ A +F + H K Sbjct: 422 EKLTLQKGVDLATINAAIDSVEYLGGLTDVTAALKAA-KDLFSKESDNAH--------SK 472 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 +I L+D + ++ + G + +G + D L +P + Sbjct: 473 VLIVLSDAVPTVDTYADE--IAAGQALSAAGVATFFVGYNHYSDDVLKQLGQVTNPAYVF 530 Query: 344 SVQNSRKLHDAFLRIGKEM 362 + +F I +++ Sbjct: 531 --GDMSD--ASFNGITQQI 545 >gi|47226573|emb|CAG08589.1| unnamed protein product [Tetraodon nigroviridis] Length = 960 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 110/340 (32%), Gaps = 50/340 (14%) Query: 47 KLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDIN 106 ++ +H +++ + +K D + + + L+++ D+ Sbjct: 22 QVQQATNH---EGKWLLVSAPWSGYSRNRKGDLYKCPVSESRTSCDKLNLQDSLSIPDVK 78 Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS- 165 N+ +TSL + + L + F + +PL + Sbjct: 79 NVNDNTSLGLTLAQNDGGDLLMCGPLWGQQCSSQRFYPGICARLSPLFQPQPAFSPAVQV 138 Query: 166 -DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +D+++VLD S S + T I++++ + P ++ + Sbjct: 139 CGGPMDVVIVLDGSNS--------IYPWEPMTAFIQKLIPSLDIGPQATQ---LSIIQYG 187 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F L + K + L G +T + ++YA F Sbjct: 188 VDPKFEFRL----NQYRTKEDALAAASRITQMYGHSTNTFQAIQYASQWGFHQN------ 237 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---------QAE 327 G + K ++ +TDGE+ + L C+ ++G + I V E Sbjct: 238 NGGRPEAAKVMVVVTDGESHDVAF-RESVLDECD---KKGITRFGIAVLGYYIRNSIDTE 293 Query: 328 AADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 +K+ AS + F++V L + +G + Sbjct: 294 NLIAEIKSIASKPTANYFFNVSEEAALSNIAGTLGDRIFN 333 >gi|94309590|ref|YP_582800.1| hypothetical protein Rmet_0645 [Cupriavidus metallidurans CH34] gi|93353442|gb|ABF07531.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] Length = 434 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 15/141 (10%), Positives = 43/141 (30%), Gaps = 4/141 (2%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +++ +G+++I+ +++ V+ +GL ++ + K++L D L A + Sbjct: 9 MSLPRIHERQRGAVAIIVGLMIVVLVGFIGLALDLGKLYVSKSELQNRADSCALAAARDL 68 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 G N ++ + + + +D Sbjct: 69 ----TGATPLTVSEAAGLTAAARNLVLFQGNLEQQPNITSAESVTYSDSLANPFLDKNSV 124 Query: 124 DYNLSAVSRYEMPFIFCTFPW 144 Y L+ + + Sbjct: 125 TYALNTIKYVKCDVSRGNIAN 145 >gi|195438080|ref|XP_002066965.1| GK24276 [Drosophila willistoni] gi|194163050|gb|EDW77951.1| GK24276 [Drosophila willistoni] Length = 1252 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/133 (22%), Positives = 55/133 (41%), Gaps = 18/133 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM++ +A + +LD + VN VV+S Sbjct: 266 AASSPKDIMILLDASSSMSEK------SFDLAMATAFNILDTLGEDDYVNLITFSEVVKS 319 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P VQ I+ + + T T GLEYA++ + + Sbjct: 320 PVPCFKDRMVRATP--DNVQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGA---- 373 Query: 279 GHDDYKKYIIFLT 291 + I+ +T Sbjct: 374 -GSQCNQAIMLIT 385 >gi|154496734|ref|ZP_02035430.1| hypothetical protein BACCAP_01027 [Bacteroides capillosus ATCC 29799] gi|150273986|gb|EDN01086.1| hypothetical protein BACCAP_01027 [Bacteroides capillosus ATCC 29799] Length = 786 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 62/176 (35%), Gaps = 29/176 (16%) Query: 171 MMMVLDVSLSMN--DHFGPGMDKLGVATRSIRE--MLDIIKSIPDVNNVVRSGLVTFSSK 226 +++++D S SMN D + G+ S+ ++D + R G+++FS+ Sbjct: 125 IVLLIDKSGSMNATDPERLAVSAAGMFVNSLYNESLMDQATGAGGPRS--RVGVISFSAD 182 Query: 227 IVQTF---PLAW--GVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 L V + +I+ + + T + + A++ Sbjct: 183 AQTETIPVELTSEAEVSFVAGEIDAITYDKVNTGATDLGRAVLSGTEMLRGAQDG----- 237 Query: 278 KGHDDYKKYIIFLTDGENSS--PNIDNKESLFYC---NEAKRRGAIVYAIGVQAEA 328 K II TDG + P + S A++ G +Y +G+ + Sbjct: 238 ----VRKDMIILFTDGYTDALTPEGMERSSAMMAEGLEAARQLGCEIYVVGLNYQG 289 >gi|149197491|ref|ZP_01874542.1| hypothetical protein LNTAR_00880 [Lentisphaera araneosa HTCC2155] gi|149139509|gb|EDM27911.1| hypothetical protein LNTAR_00880 [Lentisphaera araneosa HTCC2155] Length = 890 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 73/214 (34%), Gaps = 26/214 (12%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + ++ +V+D S SM G K+ +A + ++++ ++ V+ + T + Sbjct: 383 KLRSNLSIVMDRSGSMGMTVKGGKTKMELANEGAAQTIELLGAMDSVSV---IAVDTEAH 439 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 IV L + + G GLE ++ ++ + +K Sbjct: 440 AIVPQTVLKDAPEIASQARRVKSQGGGIYVYTGLEESWRQL------------EGREGQK 487 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCA--SPDR 341 ++I +D S+ + + + K G V I + + FL + A R Sbjct: 488 HVILFSD---SNDSEEPGRYKELLADMKDEGMTVSVIALGERTDVDSPFLIDIANRGRGR 544 Query: 342 FYSVQNSRKLHDAFL----RIGKEMVKQRILYNK 371 + + L F + + + + K Sbjct: 545 IFFTDDPLSLPSIFAQETVTVARSAFLKEVTATK 578 >gi|148253479|ref|YP_001238064.1| hypothetical protein BBta_1965 [Bradyrhizobium sp. BTAi1] gi|146405652|gb|ABQ34158.1| hypothetical protein BBta_1965 [Bradyrhizobium sp. BTAi1] Length = 769 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/167 (22%), Positives = 67/167 (40%), Gaps = 27/167 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK- 226 L ++++D S S D G L V ++ + + ++ + D GL+ FSS Sbjct: 577 DLSALLLIDTSESTRDRLASGATVLDVERLAVALLAEAMEELGDT-----FGLLAFSSDG 631 Query: 227 -------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ F A+ + ++ L G +T+ L +A + A Sbjct: 632 RDDVRMTSIKRFSEAYDRD-CRARLAGLSAGLSTRLGTALRHAGGVLGKA---------- 680 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +K +I LTDGE S ID + +A+R ++A G+ A Sbjct: 681 -STSRKLLIVLTDGEPSD--IDVPDPFDLIEDARRAAIGLHAQGIDA 724 >gi|71899173|ref|ZP_00681336.1| conserved hypothetical protein [Xylella fastidiosa Ann-1] gi|71731031|gb|EAO33099.1| conserved hypothetical protein [Xylella fastidiosa Ann-1] Length = 941 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 54/189 (28%), Gaps = 16/189 (8%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 K A++ Y P + H I + + + +D+S SM+ Sbjct: 107 KGGKYGAMNPYPQPASYKIRRILKGWDHDACCYPEKAAIGMQIAPSVAVYFAIDLSGSMH 166 Query: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL------AWG 236 G G +L ++ LD + V L F L A G Sbjct: 167 YVGGNGRSRLDNMKTALNAALDQLGQSIASGTAVDIMLAGFGDAPDHRQTLLRRNCTAQG 226 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + ++ + T Y + A + + F+TDGE Sbjct: 227 IAELKSWVAARQALYGT---------YFPAGTMDMPSFYAAAPSNAV-RVAFFMTDGEPD 276 Query: 297 SPNIDNKES 305 P+ ++ Sbjct: 277 PPSATLAQA 285 >gi|148686422|gb|EDL18369.1| mCG2843 [Mus musculus] Length = 1122 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 63/178 (35%), Gaps = 26/178 (14%) Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGL 259 D+ + VN+ G+V + + + F L + + N++ T + G+ Sbjct: 136 DVSPTFQVVNSFAPVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGI 195 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + A + F K K ++ +TDGE S N K+ + C Sbjct: 196 DTARKEAFTEARGARRGVK------KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQR 245 Query: 320 YAIGV---------QAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 ++I + E + +K+ AS F++V + L +G+ + Sbjct: 246 FSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFAL 303 >gi|17227774|ref|NP_484322.1| Mg chelatase subunit [Nostoc sp. PCC 7120] gi|17135256|dbj|BAB77802.1| Mg chelatase subunit [Nostoc sp. PCC 7120] Length = 678 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 67/205 (32%), Gaps = 40/205 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ V+D S SM ++++ A ++ ++L N + L+ F Sbjct: 477 KAGALVVFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQNRDQVALIPFRG 525 Query: 226 K-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + + ++ RL G + GL A +A+ G D + Sbjct: 526 EQAEVLLPPTRSIALARNRLERLPCGGGSPLAHGLTQAVRVGVNAQ-------MGGDIGQ 578 Query: 285 KYIIFLTDG--------------ENSSPNIDNKESLFYCNEAKRRGAIVYAI-----GVQ 325 I+ +TDG E+ E L + G + I V Sbjct: 579 VVIVAITDGRGNIPLSRSLGEPQESGEKPDIKAELLDIAARIRALGMQLLVIDTESKFVS 638 Query: 326 AEAADQFLKNCASPDRFYSVQNSRK 350 A + K S ++Y + + Sbjct: 639 TGFAKELAKT--SGGKYYHLPKATD 661 >gi|12850100|dbj|BAB28591.1| unnamed protein product [Mus musculus] Length = 431 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 52/135 (38%), Gaps = 13/135 (9%) Query: 231 FPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L + I++ ++ +++ G T G E A +I+ + A I Sbjct: 1 MKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS-------VI 53 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 I LTDGE E N ++ GAIVY +GV+ Q + S D + V + Sbjct: 54 IALTDGELHEDLFFYSE--REANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVFPVND 111 Query: 348 S-RKLHDAFLRIGKE 361 + L I K+ Sbjct: 112 GFQALQGIIHSILKK 126 >gi|301609302|ref|XP_002934202.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 904 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 67/198 (33%), Gaps = 32/198 (16%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + P S++ + S S + + +VLDVS SM G +++L A Sbjct: 270 IMNSTDIKSTPPQADSNIPVPSFSLLQSSDRVVTLVLDVSGSMAS--GGRIERLYQAAEV 327 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTK 254 + V G++TFS+ I L +++ Q K + T Sbjct: 328 FLMQI--------VEEGSYVGILTFSTSISLLSNLVQVIENTQRKQLKSLLPTNAFGGTD 379 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 G+ + Y ++ LTDGE DN ++ + Sbjct: 380 ICLGIREGIKI---------NRQYDGSSYGTELVLLTDGE------DNYDTSKCFPDITN 424 Query: 315 RGAIVYAIGVQAEAADQF 332 GAI++ I + AA Sbjct: 425 SGAIIHVIALGPNAAKAL 442 >gi|226357799|ref|YP_002787539.1| Von Willebrand factor type A domain protein, precursor [Deinococcus deserti VCD115] gi|226320042|gb|ACO48035.1| putative Von Willebrand factor type A domain protein, precursor [Deinococcus deserti VCD115] Length = 669 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 76/216 (35%), Gaps = 29/216 (13%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 L +++S ++S + + ++LD S SM G ++ A + + + + + Sbjct: 13 LSLALFLSASGAGQAQSGPTM-IQLILDSSGSMFSRLPGGDTRMATAQAVMTDFIGRLPN 71 Query: 209 IPDVNNVVRS--GLVTFSS-KIVQTFPL-----AWGVQHIQEKINRLIFGSTTKSTPGLE 260 P++N +R + + Q L + + T L+ Sbjct: 72 DPNLNVGLRLYGAAINAADPGACQDSKLVLPMRGLDRSALLAAVRGARPKGATPIVYSLQ 131 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-- 318 A ++ ++ +TDG+ S D K +L + RG Sbjct: 132 QAAQDF------------PTTAGRRIVVLVTDGQESCQG-DVKGAL---EAFRSRGLEVD 175 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 V IG+ +A Q + A F + +++ +L A Sbjct: 176 VRVIGIDLDARAQ--ASFAGVGTFVNTRSAGELASA 209 >gi|257456194|ref|ZP_05621391.1| putative BatB protein [Treponema vincentii ATCC 35580] gi|257446280|gb|EEV21326.1| putative BatB protein [Treponema vincentii ATCC 35580] Length = 320 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 63/185 (34%), Gaps = 23/185 (12%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 ++ PL T + + ++ +D+S SM ++L A + + +++ + Sbjct: 64 AAAVPLWGTKQTTVIKHGNA---VIFAVDISRSMTVADVAP-NRLEFAKQYVAFLIEHLP 119 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 GLVT + PL++ Q + L + T + LE+ Sbjct: 120 ETA-------CGLVTIKGQGTLAVPLSFNHQSVLTAAETLSPFNATSAGSNLEHGLRIAL 172 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 ++ + K ++ TDG ++ ++ F ++ + +G E Sbjct: 173 ES-------FPENRLTGKTVVLCTDGGETAGSVSRILPRF-----RQDNVQLIIVGFGTE 220 Query: 328 AADQF 332 Sbjct: 221 TGGTL 225 >gi|119623953|gb|EAX03548.1| complement component 2, isoform CRA_a [Homo sapiens] Length = 437 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 68/176 (38%), Gaps = 20/176 (11%) Query: 136 PFIFCTFPWCANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + +F +++ S+ KI + L++ ++LD S S++++ Sbjct: 218 PALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSEN------DFL 271 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--- 249 + S M+D I S V ++TF+S+ V L + + E I+ L Sbjct: 272 IFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANY 328 Query: 250 -----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 G+ T + L Y + + L + + II LTDG+++ Sbjct: 329 KDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGS 384 >gi|46362531|gb|AAH68979.1| Slc35c2 protein [Danio rerio] Length = 1816 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 67/192 (34%), Gaps = 34/192 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ ++++++D S S+ ++ + P+ V+ L +S Sbjct: 571 AEAQAELVLLVDGSWSIGR------TNFRKVREFLQGLAVPFHIGPEG---VQIALSQYS 621 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 W + + K L G T + L +A K E A Sbjct: 622 GDPRTE----WHLNNFTSKEPLLEAVRNFRYKGGNTFTGQALIHALE----NNLKEEVGA 673 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + + ++++ LTDG++ +++ N K G + AIGV+ + + + Sbjct: 674 RPNTP--QFLLLLTDGKSQD------DAIAAANRLKNAGVEIIAIGVKNADEAELRQVAS 725 Query: 338 SP--DRFYSVQN 347 P Y V + Sbjct: 726 EPLELNVYKVND 737 >gi|186472639|ref|YP_001859981.1| von Willebrand factor type A [Burkholderia phymatum STM815] gi|184194971|gb|ACC72935.1| von Willebrand factor type A [Burkholderia phymatum STM815] Length = 332 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 75/229 (32%), Gaps = 39/229 (17%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE----MLDIIKSIPDVN 213 S + + + G ++++++D S SMN G + A + D I + Sbjct: 73 SGEQTRITGTGAEILILMDGSGSMNQAISSGSMNVADAPTAGETKNQMARDAITAFVAQR 132 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAK 270 R + F + + P I I G T T G++ A +FD + Sbjct: 133 ANDRLAFMLFGTHPMLAVPFTRNRTVIDAAIAATGVGRGTPDTLLDRGIQSAVE-LFDGR 191 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA---- 326 + A I+ ++DG +D+ R G +Y I +++ Sbjct: 192 PRTSSRA---------IVLVSDG---GARLDDVAREHIRAGLSRNGVALYFIYLRSGIYS 239 Query: 327 -EAADQFLKNCASPD--------------RFYSVQNSRKLHDAFLRIGK 360 + + SP+ R Y + +++ A I + Sbjct: 240 PDLHVRLADADHSPEAELHRFFLSLPTAYRLYQADSPQQVARAMSDIAR 288 >gi|295395240|ref|ZP_06805448.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] gi|294972002|gb|EFG47869.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] Length = 328 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 66/154 (42%), Gaps = 14/154 (9%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLD 204 A + P + T V + K + + +V+DV+ SM + + +L + + E++D Sbjct: 48 ATAIARPGIPTEVVVL--KQEASAQVYLVVDVTASMIAEDWDGSEPRLEGLKKDLTELVD 105 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE-YAY 263 + + L+TF S+ PL ++ I+ ++ T+S+ G +A Sbjct: 106 AMPGA-------KFSLITFGSESHVRVPLTTDDAAVKSAIS-ILAPEITRSSAGTSPFAP 157 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + ++ A +D +Y+ + DGE +S Sbjct: 158 AETVSSRLAKGQEAHPGED--QYVFYFGDGEKTS 189 >gi|239941198|ref|ZP_04693135.1| hypothetical protein SrosN15_09409 [Streptomyces roseosporus NRRL 15998] gi|239987673|ref|ZP_04708337.1| hypothetical protein SrosN1_10243 [Streptomyces roseosporus NRRL 11379] gi|291444639|ref|ZP_06584029.1| conserved hypothetical protein [Streptomyces roseosporus NRRL 15998] gi|291347586|gb|EFE74490.1| conserved hypothetical protein [Streptomyces roseosporus NRRL 15998] Length = 453 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 52/177 (29%), Gaps = 37/177 (20%) Query: 151 APLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 PL S G +++++D S SM+ K+ A + +D + Sbjct: 46 IPLADASPSSAPGAGQGPGAAVVLMVDCSGSMDY----PPTKMRNARDATAAAIDTL--- 98 Query: 210 PDVNNVVRSGLVT--------FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 R +V + A +E + +L G T L Sbjct: 99 ---REGTRFAVVAGTHVAKDVYPGNGRLAVADARTKAQAKEALRKLSAGGGTAIGTWLRL 155 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-------IDNKESLFYCNE 311 A + A + H I LTDG N + +D+ F C+ Sbjct: 156 ADRLLNSADVAIRHG-----------ILLTDGRNEHESPEGLRAALDSCAGRFTCDA 201 >gi|302669471|ref|YP_003829431.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302393944|gb|ADL32849.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 561 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 56/167 (33%), Gaps = 28/167 (16%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V D+S SMN ++ S+ + I S GLV++S K+ P+ Sbjct: 389 VTDISGSMNG------TRIKSLKNSLLSTMQYIDSSS------YIGLVSYSDKVYINLPI 436 Query: 234 AW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 A + + L G T + + + + +K D + Sbjct: 437 AQFDNKQRAYFSGAVKDLDVGGQTATYDAVLVGMQMLME-------KSKEVPDANMMLFV 489 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKN 335 L+DG ++ + + G VY IG ++ + LK Sbjct: 490 LSDGAQNAGFELKRITPIV----GGLGISVYTIGYEMTDSDKEDLKA 532 >gi|300022838|ref|YP_003755449.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299524659|gb|ADJ23128.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 334 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 47/131 (35%), Gaps = 15/131 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G +++++D S SM+D F G + + + L +K R G+ FS Sbjct: 77 GQGAHIVLLIDRSSSMDDTFAGRPPSGGQESKSAAAKRL--LKDFVTERAHDRFGVAAFS 134 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + P+ + I I+ + T GL A + + + Sbjct: 135 TSPIHVLPITDHKKLILGAIDAIDRPGLAFTDVGRGLAMALDMTQEDISQASRA------ 188 Query: 283 YKKYIIFLTDG 293 I+ ++DG Sbjct: 189 ----IVLVSDG 195 >gi|260803822|ref|XP_002596788.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] gi|229282048|gb|EEN52800.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] Length = 547 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 74/217 (34%), Gaps = 41/217 (18%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 S + P+ S D++ ++D S S+ GP + G+A + I + L Sbjct: 19 TTSQYLPVPQPEPQPPGPSSGCAADIVFIVDDSSSI---LGP---RFGLALQFIIDFLQC 72 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + + G++ ++ PL G T S PGL +A + Sbjct: 73 FT-----DQDIGIGVILYNCVPRTGIPL----------------GMYTISNPGLPFAISN 111 Query: 266 IFDAKE--KLEHIAKGHDDYKKY-------IIFLTDG--ENSSPNIDNKESLFYCNEAKR 314 + + H D K+ I L+DG ++ + + A+ Sbjct: 112 LTQEGGLSRTGHALSFMTDTSKFRTGIPRTAILLSDGFPQSDANAQAMDDYEAQAEAARD 171 Query: 315 RGAIVYAIGVQAEA--ADQFLKNCA-SPDRFYSVQNS 348 G +YA+GV A L+ S DR + N Sbjct: 172 AGIDLYAVGVGAAGLVNWDVLETITGSSDRVFRSDNP 208 >gi|193213113|ref|YP_001999066.1| hypothetical protein Cpar_1468 [Chlorobaculum parvum NCIB 8327] gi|193086590|gb|ACF11866.1| conserved hypothetical protein [Chlorobaculum parvum NCIB 8327] Length = 352 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 54/183 (29%), Gaps = 6/183 (3%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 M L +G+++I+ AI L V+ L ++ K +L D + L A Sbjct: 1 MILLRQNKLPNGQRGAVTIMFAIFLVVLLGFAALALDLGRMNLTKVQLQSAADAAALGGA 60 Query: 61 TKILNQENGNN--GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII 118 ++N ++ + + E G+ + + + + Sbjct: 61 GSLVNSSLSTYDWDAAEQKGLVLAQHNIVNGEQIQQATIEAGYWNSSDGFRHHGTSGVPV 120 Query: 119 DDQHKDYNLSAV---SRYEMPFIFCTFP-WCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 + ++ P P N S+ P +++ + + Sbjct: 121 TGDVPAVRATVALTSTQNNGPLKLFFAPFLGINESNIPASAIAAIYPPAGGVGMFPFTLG 180 Query: 175 LDV 177 DV Sbjct: 181 KDV 183 >gi|326670664|ref|XP_003199264.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 2265 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 97/306 (31%), Gaps = 25/306 (8%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L Y+ D+ ++ + S+ + + L+E G + Sbjct: 1547 LQYLRDNVFTASSGSRRVEGVPQLLILLSGARSFDNVDT-----PASSLKELGVLIFVIG 1601 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 S S + Q Y LS ++P + N + T+S I+ Sbjct: 1602 SRSSDSQELQRISQEPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQ 1661 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ +LD S + F ++ M++ + + + R +V +S + Sbjct: 1662 RRDVVFLLDGSDGTRNGFP-------AMKDFVQRMVEKLDVAENRD---RISVVQYSREP 1711 Query: 228 VQTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I + + L G A + D ++ + + Sbjct: 1712 EANFYLNTYTTNEEIVDAVRGLRHKGGRPLYTG--EALQYVRDNVFTASSGSRRLEGVPQ 1769 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ L+ G + SL K G + + IG + + + + P SV Sbjct: 1770 ILVLLSGGRSFDSVDAAASSL------KELGVLTFGIGSRGSDSRELQRISYEPSYALSV 1823 Query: 346 QNSRKL 351 + +L Sbjct: 1824 SDFSEL 1829 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 97/306 (31%), Gaps = 25/306 (8%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L Y+ D+ ++ + S+ + + L+E G + Sbjct: 732 LQYLRDNVFTASSGSRRVEGVPQLLILLSGARSFDNVDT-----PASSLKELGVLIFVIG 786 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 S S + Q Y LS ++P + N + T+S I+ Sbjct: 787 SRSSDSQELQRISQEPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQ 846 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ +LD S + F ++ M++ + + + R +V +S + Sbjct: 847 RRDVVFLLDGSDGTRNGFP-------AMKDFVQRMVEKLDVAENRD---RISVVQYSREP 896 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I + + L G A + D ++ + + Sbjct: 897 EANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTG--EALQFVRDNVFTASSGSRRLEGVPQ 954 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ L+ G + SL K G + + IG + + + + P SV Sbjct: 955 ILVLLSGGRSFDSVDAAASSL------KELGVLTFGIGSRGSDSRELQRISYEPSYALSV 1008 Query: 346 QNSRKL 351 + +L Sbjct: 1009 SDFSEL 1014 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 91/312 (29%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + L Y+ D+ ++ + S+ + + L+E G Sbjct: 1741 LYTGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSV-----DAAASSLKELGV 1795 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TS Sbjct: 1796 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSPTVT 1855 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 1856 AEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIEANRDRVSVV 1905 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 1906 QYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNVFTASSGSRR 1963 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + + Sbjct: 1964 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRELQRISYEA 2017 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 2018 NYALSVADFSEL 2029 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 79/261 (30%), Gaps = 20/261 (7%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP 152 + L+E G S S + Y LS E+P + + P Sbjct: 972 ASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIP 1031 Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + TS + S D++ +LD S F D + ++ Sbjct: 1032 VTPTSPTVTAEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIE 1081 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 N R +V +S F L + I +++ L G A + D Sbjct: 1082 ANRDRVSVVQYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNV 1139 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 ++ + + +I L+ G + SL K G + IG + + Sbjct: 1140 FTASSGSRRLEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSR 1193 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 + + + SV + +L Sbjct: 1194 ELQRISYEANYALSVADFSEL 1214 Score = 36.3 bits (82), Expect = 7.2, Method: Composition-based stats. Identities = 48/312 (15%), Positives = 93/312 (29%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + A L Y+ D+ ++ + S+ + + L+E G Sbjct: 127 LYTGAALQYVKDNVFTASSGSRRLEGVPQILVLLSGGKSFDSV-----DAAASSLKELGV 181 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TS+ Sbjct: 182 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSTTVT 241 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 242 AEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIEANRDRVSVV 291 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 292 QYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNVFTASSGSRR 349 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + + Sbjct: 350 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRELQRISYEA 403 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 404 NYALSVADFSEL 415 >gi|325297737|ref|YP_004257654.1| hypothetical protein Bacsa_0585 [Bacteroides salanitronis DSM 18170] gi|324317290|gb|ADY35181.1| protein of unknown function DUF58 [Bacteroides salanitronis DSM 18170] Length = 289 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 45/105 (42%), Gaps = 6/105 (5%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSNSLDF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S +I + P G +HI I L+ T + ++ A + + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFQPTSTRTDIQCAIEYLTN 170 >gi|307825232|ref|ZP_07655452.1| Magnesium chelatase [Methylobacter tundripaludum SV96] gi|307733688|gb|EFO04545.1| Magnesium chelatase [Methylobacter tundripaludum SV96] Length = 629 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/267 (12%), Positives = 80/267 (29%), Gaps = 37/267 (13%) Query: 43 FVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSY---RIIKNIWQTDFRNELREN 99 ++ L D T+ N ++ + Q+ F+ ++ I N + Sbjct: 332 LMQQALQPP-DEGESDADTEQENNQDNGESQDQQQVFATASAGNVRRIEVDTVANR-YAS 389 Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 G + R + ++ D + A R + + V Sbjct: 390 GKRSTAQDAPRGRVVRVVQDQNPSSLAVGATLR------SAALRGADDFQVTKSDLHQQV 443 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++ +++ ++ V+D S SM ++ V ++ +L + + Sbjct: 444 RVGKNANL---ILFVVDASGSMA-----AQRRMEVVKGAVLSLLT--DAYQQRDE---VA 490 Query: 220 LVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +++F + V ++ + L G T L A + K Sbjct: 491 VISFRGESAQLMLSPTRSVDLAEQNLRELPTGGRTPLPHALSVALETL----------EK 540 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKES 305 ++ LTDG+ + D + Sbjct: 541 TSMPP--LLVLLTDGKANVALTDGNDP 565 >gi|218549490|ref|YP_002383281.1| hypothetical protein EFER_2160 [Escherichia fergusonii ATCC 35469] gi|218357031|emb|CAQ89663.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469] gi|325497894|gb|EGC95753.1| hypothetical protein ECD227_1991 [Escherichia fergusonii ECD227] Length = 219 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 62/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A N + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKAMNMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ + IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFTIGVQGADMKTLAQ 171 >gi|309792347|ref|ZP_07686816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225613|gb|EFO79372.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 845 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 66/209 (31%), Gaps = 54/209 (25%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFG--------------------------------- 186 +I S + ++VLD S SM+ +F Sbjct: 394 QIIPTSQKPIQYVVVLDASGSMSANFDGQCNNSGGVKQCANGPSGFPDVQVSNTGYDYWW 453 Query: 187 --PGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTFSSKIV--QTFPLAWGV 237 ++ VA +++ ++ + + + +V F+ + QT Sbjct: 454 TTESQRRIYVAKKALERLVTLSNMPGNPGYTNTRPSDQMAVVWFNDGVSSSQTQAFTNNP 513 Query: 238 QHIQEKINRLI-------FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ I L T GL A + ++ K + +YK+ ++F+ Sbjct: 514 TTLKNYITTLNNVNGNYRSAGGTNGAGGLYRA-SLLYQNAPKTVSFNGTNVEYKRVVLFV 572 Query: 291 TDGENSS----PNIDNKESLFYCNEAKRR 315 TDG ++ D K L + K+ Sbjct: 573 TDGVSNYFLNTSASDLKGPLSSYDTFKKN 601 >gi|301756599|ref|XP_002914168.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Ailuropoda melanoleuca] Length = 1127 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 69/193 (35%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SM ++ +A +I +LD + VN ++ +S I Sbjct: 274 DIVIVVDTSGSMKGL------RMTIAKHTISTILDTLGENDFVN------IIAYSDYIHY 321 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ L A+ + +E + Sbjct: 322 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVNQALTEAFQILKQFQEARQ-----G 376 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E +F R V+ IG + AD+ C Sbjct: 377 SLCNQAIMLITDG-----AVEDYEPVFEKYNWPDRKVRVFTYLIGREVTFADRMKWIACN 431 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 432 NKGYYTQISTLAD 444 >gi|281349604|gb|EFB25188.1| hypothetical protein PANDA_001994 [Ailuropoda melanoleuca] Length = 805 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 69/193 (35%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S SM ++ +A +I +LD + VN ++ +S I Sbjct: 274 DIVIVVDTSGSMKGL------RMTIAKHTISTILDTLGENDFVN------IIAYSDYIHY 321 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L+ L A+ + +E + Sbjct: 322 IEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVNQALTEAFQILKQFQEARQ-----G 376 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAADQFL-KNCA 337 + I+ +TDG +++ E +F R V+ IG + AD+ C Sbjct: 377 SLCNQAIMLITDG-----AVEDYEPVFEKYNWPDRKVRVFTYLIGREVTFADRMKWIACN 431 Query: 338 SPDRFYSVQNSRK 350 + + + Sbjct: 432 NKGYYTQISTLAD 444 >gi|114652499|ref|XP_001171038.1| PREDICTED: coagulation factor C homolog, cochlin isoform 5 [Pan troglodytes] Length = 594 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 374 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 432 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 433 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 481 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 482 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 528 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 529 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 560 >gi|258624850|ref|ZP_05719778.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582848|gb|EEW07669.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 128 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 39/94 (41%), Gaps = 23/94 (24%) Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA------------------EA 328 +I L+DG N++ +D L N AK+ +Y +GV A + Sbjct: 1 MILLSDGSNTAGVLDP---LEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDL 57 Query: 329 ADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGK 360 ++ L+ AS +++ +N + L + I + Sbjct: 58 DEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 91 >gi|209527388|ref|ZP_03275895.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492179|gb|EDZ92527.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 396 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 62/181 (34%), Gaps = 28/181 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS----- 225 +++++D++ SMN+ G ++ A + R L+ + D + +V F Sbjct: 73 IVVLVDLTGSMNELDTSGKRRIDGALEATRRFLEQ---MSDRGGDTKVAIVPFGKGGANC 129 Query: 226 --------KIVQTFPLAWGVQHIQ--EKINRLIFGSTTKSTPGLEYAYNKIFD--AKEKL 273 I F A ++ + + + T L A + + Sbjct: 130 PGFEVTQRGIDSRFFPANDIKQTNFLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFY 189 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPN----IDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 G + + +I L+DG ++ PN DN +L + IV+ +G Sbjct: 190 VPEDSGRPEPRLSVILLSDGFHNQPNEQQDFDNLITLLE----RNNNIIVHTLGYGLTPQ 245 Query: 330 D 330 Sbjct: 246 Q 246 >gi|149176293|ref|ZP_01854908.1| hypothetical protein PM8797T_23389 [Planctomyces maris DSM 8797] gi|148844895|gb|EDL59243.1| hypothetical protein PM8797T_23389 [Planctomyces maris DSM 8797] Length = 695 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 69/173 (39%), Gaps = 16/173 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +S+ GL + ++LD S SM F + A + +D + S+P++ V + Sbjct: 504 ESESGLALCLLLDESGSM---FQGDPSRFDRARQVAALFVDALGSVPNIELEVYTHSSCG 560 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + +G + NR + + GL Y + I A E Sbjct: 561 LEERDCLVRYCFGRR------NRDLACIG-DAVEGLNYDHQAIRTAGELFRSNTSSQRP- 612 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEA----KRRGAIVYAIGVQAEAADQF 332 +++I ++DG + + + + ++ +A ++ G V + ++ A++Q Sbjct: 613 -RWMIVVSDGSPNGVDYEGEPAIKATRDAVIQLRKSGIRVLNVAIEDYASEQI 664 >gi|118443040|ref|YP_877686.1| von Willebrand factor type A domain-containing protein [Clostridium novyi NT] gi|118133496|gb|ABK60540.1| von Willebrand factor type A domain protein [Clostridium novyi NT] Length = 708 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 40/111 (36%), Gaps = 16/111 (14%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN-RLI 248 K+ ++ + ++ ++ + GLV++ +K L + I I+ L Sbjct: 182 SKIDELQKAAKNFVNKFETKSNTK----IGLVSYGNKGEVVHSLTNELDRINSSIDYGLS 237 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 T G+ A + + + KYI+ +TDG ++ Sbjct: 238 VYGATNIGDGIRKANGLLNN-----------GSNADKYIVLMTDGIPTAAT 277 Score = 37.5 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 32/97 (32%), Gaps = 7/97 (7%) Query: 278 KGHDDYKKYI-----IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K + K YI I + G+N NI + A G + +G + Sbjct: 326 KFEYNPKDYIYSDENIIINRGDNDYGNIALNYAKESLKRASENGVNNFVVGFSNGISRDK 385 Query: 333 LKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 L A + + +L D + RI E+ + Sbjct: 386 LTQIADAGNGYYREAMHGDELEDVYNRIADEIKNPVV 422 >gi|48477361|ref|YP_023067.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] gi|48430009|gb|AAT42874.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] Length = 379 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 67/209 (32%), Gaps = 32/209 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G ++ +DVS SM KL +A +++ I +N+V L+ F Sbjct: 34 KASGFHYIIAIDVSNSM------RKGKLDLAKEGAMNLIEKI----PRDNIV--SLIAFG 81 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + E I L T Y + A + + Sbjct: 82 DTAKVIVEGKEPTFAL-EAIPSLKVAGNTAM-------YTALLTATKLADKYNMPGR--- 130 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--F 342 II LTDG + +++ E G + IG+ D LK A F Sbjct: 131 --IILLTDGMPTDVSMNESYENLQVPE----GFTIDCIGIGDNYRDDLLKLLADKGNSIF 184 Query: 343 YSVQNSRKLHDAF-LRIGKEMVKQRILYN 370 Y ++N +L + ++ + + + Sbjct: 185 YHLENPEELPKVMESTVSSDISAKNVQVD 213 >gi|323650214|gb|ADX97193.1| inter-alpha-trypsin inhibitor heavy chain h2 [Perca flavescens] Length = 550 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 61/178 (34%), Gaps = 21/178 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S + +++ V+DVS SM K+ +++ +LD + + ++ F+ Sbjct: 258 SPLPKNIVFVIDVSGSMWGV------KMKQTVEAMQAILDDLTIDDHFS------IIDFN 305 Query: 225 SKI-----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + + ++ I + T L A + A + + Sbjct: 306 HNVRCWSEELVAGSSIQIADAKKYIQNIKPNGGTNINEALMRAVQMLLKASNQGLIDPRS 365 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 II ++DG+ + I R ++++G+ + FL+ A Sbjct: 366 VSM----IILVSDGDPTVGEIKLSTIQKNVKRVMREEFSLFSLGIGFDVDYDFLERIA 419 >gi|307260995|ref|ZP_07542677.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306869297|gb|EFN01092.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 539 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 43/246 (17%), Positives = 90/246 (36%), Gaps = 20/246 (8%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G +++ +L I ++ + +E++ +A+L L+ ++L + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 67 ENGNNGK---KQKNDFSYRIIKNIWQTD---FRNELRENGFAQDINNIERSTSLSIIIDD 120 N+ K K + S+ I + + D ++ D N+ + I ++ Sbjct: 70 RKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTDEKNMHLTPLCKTINNN 129 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI--------GLDMM 172 K + S+ + W +I V ++SKS +D+M Sbjct: 130 SGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVNVASKSKALKKNTFNIPIDLM 189 Query: 173 MVLDVSLSMNDHF------GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +V D+S SM D G K+ + ++E+ D + N R G+ F+ Sbjct: 190 VVADLSGSMKDGIKGEKLEGGTNSKIYILREVLKELADKSLFTQESNEYNRIGITAFAMG 249 Query: 227 IVQTFP 232 Sbjct: 250 AEHPKE 255 >gi|298492778|ref|YP_003722955.1| magnesium chelatase ATPase subunit D ['Nostoc azollae' 0708] gi|298234696|gb|ADI65832.1| magnesium chelatase ATPase subunit D ['Nostoc azollae' 0708] Length = 668 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 68/212 (32%), Gaps = 39/212 (18%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ V+D S SM ++++ A ++ ++L N + L+ F Sbjct: 467 KAGALVVFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQNRDQVSLIPFRG 515 Query: 226 K-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + + ++ RL G + GL A +A+ G D + Sbjct: 516 EQAEVLLPPTRSIALAKNRLERLPCGGGSPLAHGLTQAVRVGINAQ-------MGGDIGQ 568 Query: 285 KYIIFLTDG--------------ENSSPNIDNKESLFYCNEAKRRGAIVYAI-----GVQ 325 I+ +TDG E E L + G + I V Sbjct: 569 VVIVAITDGRGNIPLARSLGEPQEPGEKPDIKAELLDIAGRIRAAGMQLLVIDTESKFVS 628 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 A + K A ++ + + K A R Sbjct: 629 TGFAKELAKT-AGGKYYHLPKATDKAIAAMTR 659 >gi|240137440|ref|YP_002961911.1| hypothetical protein MexAM1_META1p0705 [Methylobacterium extorquens AM1] gi|240007408|gb|ACS38634.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 473 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 75/464 (16%), Positives = 132/464 (28%), Gaps = 122/464 (26%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 N +GSI++L A+ + ++GL ++ K +L D + L Sbjct: 14 LASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIA 73 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N + + ++ F + FA + L I+ Q D +S Sbjct: 74 ANAQQSDVTASGIKAGESQALKAFNANASKVPFAT-----VSLSQLEIVRTGQTLDATVS 128 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN------ 182 + + F ++ V S LD +++DVS SM Sbjct: 129 YTATVQSTFGRIFGLSATTLTNR-------VNASVDLASYLDFYLMVDVSGSMGLPTKDS 181 Query: 183 -----------------------DHFG--PGMDKLGVATRSIRE-MLDIIKSI--PDVNN 214 D G K+ + + ++ + +++K P V N Sbjct: 182 DAEVLAMQSKEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNNAVCELLKRASTPVVPN 241 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQ--------------EKINRLIFGST---TKSTP 257 R G+ F +++ PL + N L GST T + P Sbjct: 242 QYRIGIYPFINQLATLAPLTDTTTSLAALRTAAQCDKVWPLAFTNLLDTGSTQLFTNNDP 301 Query: 258 --GLEYAYNKIFDAKEKLEHIAKG------HDDYKKYIIFLTDGENS------------- 296 G A K++ K + + ++ +TDG + Sbjct: 302 KTGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDGMQNSQSYSAWKDTKTF 361 Query: 297 ---------SPNIDNKESL------FYCNEAKRRGAIV------YAIGVQAEAADQF--- 332 PN D S C + K GA + Y I Sbjct: 362 SGNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIPYNIVKNYNNDSYIVWE 421 Query: 333 --------------LKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 L+ CASP FY+ + + + K+ Sbjct: 422 NGRVNQFSPTLADPLRKCASPGFFYTANTQDDITASLGAMFKQA 465 >gi|269124668|ref|YP_003298038.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268309626|gb|ACY96000.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 476 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 72/234 (30%), Gaps = 27/234 (11%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 P + A L +T+ ++V+D S SM + AT + Sbjct: 16 FLPVDDDEIQAVLTVTARGTAPPAGPPRHAEVIVVDCSGSMGVP-ATKISAARRATVAAL 74 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 L V R+ +V + K + + L T L Sbjct: 75 RDLPDGTLFAVVQGAERARMVYPAHKRLAEASPRT-RAEAIAAVQHLDSSGGTAMHTWLA 133 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + + + H +I LTDG N + +E + C V+ Sbjct: 134 RARRLLAGSTADIRH-----------VILLTDGHNRAAQTALEEEVERC-------VGVF 175 Query: 321 A-----IGVQAEAADQFL--KNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + IG E D + + R ++ + +L F + + ++ +R+ Sbjct: 176 SCDPTGIGEDWEPRDLLMIARRLGGRARAHADGGAAELEADFTAVTRAVMAKRV 229 >gi|74185538|dbj|BAE30236.1| unnamed protein product [Mus musculus] Length = 411 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 30/143 (20%), Positives = 56/143 (39%), Gaps = 22/143 (15%) Query: 173 MVLDVSLSM------NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +VLD S SM + G A R + +++ + S R GL+T+++ Sbjct: 260 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRP---RYGLLTYATV 316 Query: 227 IVQTFPLAWGVQH----IQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ + EK+N++ S T + L+ Y+ + A + Sbjct: 317 PKVLVRVSDERSSDVDWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAP---P 373 Query: 278 KGHDDYKKYIIFLTDG-ENSSPN 299 +G + + II +TDG N N Sbjct: 374 EGWNRTRHVIIIMTDGLHNMGGN 396 >gi|309364363|emb|CAP25017.2| CBR-CLEC-62 protein [Caenorhabditis briggsae AF16] Length = 393 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 48/146 (32%), Gaps = 17/146 (11%) Query: 213 NNVVRSGLVTFSSKIVQTFPL------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 VR GLVT++++ L + + + +L GL+ A + + Sbjct: 79 KRTVRVGLVTYNNQATVQADLNRFQSADDLFNSVFQILPKLSASDEVYLAKGLDAAESVL 138 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ + +I+ +D D + + K G + + Sbjct: 139 SAGRKNATRSNYK----QLVLIYASD-YRDDGEEDPRPT---AERMKSSGVSIATVAFDQ 190 Query: 327 EAADQFLKN---CASPDRFYSVQNSR 349 + +K ASP ++ +++ Sbjct: 191 TGNEGVVKAIGEIASPGFNFTNEDAD 216 >gi|117919903|ref|YP_869095.1| TPR repeat-containing protein [Shewanella sp. ANA-3] gi|117612235|gb|ABK47689.1| Tetratricopeptide TPR_2 repeat protein [Shewanella sp. ANA-3] Length = 690 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 50/164 (30%), Gaps = 28/164 (17%) Query: 136 PFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P L ++ + + + +V+D+S+SM ++L Sbjct: 57 PLHILAFTWVIATLALAGPALNKQTLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG--- 250 ++L +K +GL+ F+ PL + + L Sbjct: 113 QTKFRATDLLRGLKEGE-------TGLIAFAGDAFTISPLTRDTGTLLNLLPTLSPDIMP 165 Query: 251 -STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + L A N + II +TDG Sbjct: 166 VLGSNLAAALTQAKNLLAQGGHLRGD-----------IIVMTDG 198 >gi|325963534|ref|YP_004241440.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] gi|323469621|gb|ADX73306.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] Length = 347 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 41/251 (16%), Positives = 73/251 (29%), Gaps = 57/251 (22%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + CT + + A + + + + D+M+ LD S SM+ +D Sbjct: 65 VACTSLLLSTAVAAARPVEVTTVRPEQHN--RDIMLCLDASGSMSSADAAVVDVFATLAS 122 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-----------------WGVQHI 240 R GL F S +Q FPL GV Sbjct: 123 EFEGE--------------RIGLTIFDSTAIQVFPLTDDYGYAQEQLKLARDAFTGVPGS 168 Query: 241 QEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD----------YKKYIIF 289 ++ G ++ GL N K + D + ++ Sbjct: 169 SGFLDGTWSGRGSSLIGDGLASCLNSFPHGKFEHRKPESSGDSGTGRNLTEHQRSRTVVL 228 Query: 290 LTDGENSSPNIDNKESL-FYCNEAKRRGAIVYAIG-----VQAEAADQ--FLKNCA--SP 339 TD + + + +L + A+ R VYA+ A L+ A + Sbjct: 229 ATD---NFLSGEPILTLDEAASLARDRAVRVYALNPGDLDYGAGPGQPGGHLRAAAESTG 285 Query: 340 DRFYSVQNSRK 350 +Y++ N Sbjct: 286 GAYYALDNPEA 296 >gi|254423938|ref|ZP_05037656.1| magnesium chelatase ATPase subunit D [Synechococcus sp. PCC 7335] gi|196191427|gb|EDX86391.1| magnesium chelatase ATPase subunit D [Synechococcus sp. PCC 7335] Length = 680 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 69/223 (30%), Gaps = 43/223 (19%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 477 ARKAGSLIIFVVDASGSMA------LNRMQNAKGAVLSLLT-----DAYQNRDQVALIPF 525 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ + G + GL A +A+ K D Sbjct: 526 RGEQADVLLPPTRSITAAKRRLETMACGGGSPLAHGLMQALRVGMNAQ-------KSGDV 578 Query: 283 YKKYIIFLTDGENSSPNIDN---------------KESLFYCNEAKRRGAIVYAI----- 322 I+ +TDG + P + E L + G + I Sbjct: 579 GSVVIVAITDGRGNVPLARSLGDELDEPEEKPDLKAELLEIAGNIRTAGYQMLVIDTERK 638 Query: 323 GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 V + K + ++Y + + A + + + + Sbjct: 639 FVSTGFGKELAKT--AGGKYYQLPKATD--QAIASMARSAMSE 677 >gi|196007642|ref|XP_002113687.1| hypothetical protein TRIADDRAFT_57368 [Trichoplax adhaerens] gi|190584091|gb|EDV24161.1| hypothetical protein TRIADDRAFT_57368 [Trichoplax adhaerens] Length = 921 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 47/288 (16%), Positives = 94/288 (32%), Gaps = 49/288 (17%) Query: 82 RIIKNIWQTDFRNEL-RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC 140 + TD N L N FA + N+ + + +H+D+ + ++ Sbjct: 234 DSVNTFCDTDNSNPLTYHNAFALNNQNLYCLFQSTWTVIMRHQDFTNNNNPARQIDITVP 293 Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 + +VLD S SM G + ++ +AT + Sbjct: 294 NIRIVRRKDIRTV-------------------LVLDTSGSMQ---GVRLQQMRLATTNFI 331 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV-QHIQEKINRLIFG---STTKST 256 + G+V+F+S+ L V Q+++ + I T Sbjct: 332 --------LNSAVEGEFLGIVSFNSRTTIMSSLTKIVDQNVKNNLIAQIPSAAVGLTSVG 383 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 GL A N + + + +I L+DGE + + N+ Sbjct: 384 GGLLSALNMLKSSVNQSFPCGGR-------MIVLSDGEENVGPYISS----VINDLVSNQ 432 Query: 317 AIVYAIGVQAEAADQFLKNCASPDR---FYSVQNSRKLHDAFLRIGKE 361 IV+ + + + A+++ S + + L+ AFL I ++ Sbjct: 433 IIVHTVSLGSSASERLQNVSYSTGGKAIYAPSGDIATLNSAFLSISQQ 480 >gi|74146354|dbj|BAE28944.1| unnamed protein product [Mus musculus] Length = 452 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 49/131 (37%), Gaps = 21/131 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 ++ V+DVS SM K+ +++ +LD +++ + +V F+ + Sbjct: 315 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDDQFS------VVDFNHNVRTW 362 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + I ++ T L A + +A ++ + D Sbjct: 363 RNDLVSATKTQIADAKRYIEKIQPSGGTNINEALLRAIFILNEA----SNMGLLNPDSVS 418 Query: 286 YIIFLTDGENS 296 II ++DG+ + Sbjct: 419 LIILVSDGDPT 429 >gi|268324627|emb|CBH38215.1| hypothetical protein containing von Willebrand factor type A domain [uncultured archaeon] Length = 523 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 70/175 (40%), Gaps = 19/175 (10%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + + ++D+S S + + + L +I + R +V Sbjct: 344 TENRTRDIAVAFLVDMSGSTVGSTIRCEKEALILMSEALKELGDAFAIYGFSGYGRDNVV 403 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F +++ F ++ Q +Q KI+ + +T+ P + +A K+ +E+ Sbjct: 404 FF---LIKDFEDSYD-QRVQCKISTMTNKQSTRIAPAIRHATTKLRRREERT-------- 451 Query: 282 DYKKYIIFLTDGE----NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +I L+DG+ + N +++ EA+R G + I V EAA+ Sbjct: 452 ---RMLILLSDGKPLDRDYYGNYAIEDTRMALKEAQRYGVKSFCITVDREAAEYL 503 >gi|260592522|ref|ZP_05857980.1| von Willebrand factor, type A [Prevotella veroralis F0319] gi|260535568|gb|EEX18185.1| von Willebrand factor, type A [Prevotella veroralis F0319] Length = 290 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 10/102 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+V+DVS S++ G R + + + N + G++ F Sbjct: 72 EEERELTVMLVIDVSGSLDF------GTAGQLKRECVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEY 261 S + + P G +HI I ++ T GLEY Sbjct: 126 SDHVEKYIPPKKGRKHILYLIREMLTFTPNSKKTDVGVGLEY 167 >gi|325689280|gb|EGD31286.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK115] Length = 471 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 44/297 (14%), Positives = 93/297 (31%), Gaps = 52/297 (17%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 ++ + Y+ TA IL+ N K +SY + + + Sbjct: 57 MRTSMQYVDRTISKSTAIFILDDSKYGNKSKFTKGWSYVGLSEDGKKILNYIWDKANNDW 116 Query: 104 DINNIERSTSLSIIID---DQHKDYNLSAVSRYEMPFIF------CTFPWCANSSHAPLL 154 I + + + ++ ++ DY + + YE+ + + ++ + + Sbjct: 117 IIRELGTKSLYDLKMELDFKKNDDYKDNRLVSYELKGKYAGSQNQLSIHTAMSALNTKQV 176 Query: 155 ITSSVKISS-----------KSDIGLDMMMVLDVSLSMNDHFG-------------PGMD 190 + K + + + V D S SM Sbjct: 177 FSKVAKGKRGIALAYRDDPIEGQANVAISFVFDASGSMEFSLDGTEKVNPYSNNPLKNRS 236 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI----VQTFPLAWGVQHIQEKINR 246 ++ + ++M+ ++ I +V+ LV F+S L G+ I I+ Sbjct: 237 RIDILREKTKKMMADLQPIGNVS----VNLVQFNSHASFVQQNFIELDKGLTSINSAIDN 292 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 L T GL Y + +L KY++ LTDG +S + + Sbjct: 293 LNPEHATNPGDGLRYGMVSLQSNAAQL-----------KYVVLLTDGVPNSYMVGPQ 338 >gi|323444971|gb|EGB01833.1| hypothetical protein AURANDRAFT_69450 [Aureococcus anophagefferens] Length = 376 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 44/130 (33%), Gaps = 17/130 (13%) Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGV----QHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 +++ R GLV + ++ LA + + +L G++T GLE +++ Sbjct: 35 SLDDTCRLGLVVYDAQARVVVGLARVTPAHVAKVHAALEKLAPGTSTNLWGGLELGVDEL 94 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG---AIVYAIG 323 ++ LTDG ++ + + + + G V+A G Sbjct: 95 VGGAGDNARA----------VLLLTDGVPNNSPPEGEVAALRAKRLTKDGSETVAVFAAG 144 Query: 324 VQAEAADQFL 333 L Sbjct: 145 FGYALKSDLL 154 >gi|260893743|ref|YP_003239840.1| Magnesium chelatase [Ammonifex degensii KC4] gi|260865884|gb|ACX52990.1| Magnesium chelatase [Ammonifex degensii KC4] Length = 660 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 66/210 (31%), Gaps = 28/210 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + IG ++ V+D S SM ++ A +I +L + + G Sbjct: 464 EKEREKRIGNFLLFVVDASGSMG-----AQQRMVAAKGAIFSLL-----LDAYQKRDKVG 513 Query: 220 LVTFSS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +V F + P V+ ++ L G T GL AY E + Sbjct: 514 MVVFKGERAEVVLPPTNSVELAHVQLKELPTGGRTPLAAGLLKAY-------EVARSYLR 566 Query: 279 GHDDYKKYIIFLTDGENS--SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD----QF 332 D +I ++DG + + E + R + + + V E Sbjct: 567 RDPDLAPLLIVVSDGRANVSMGGGNPWEEVERVASLIREESRIKTLVVDVEQGGFLRFGL 626 Query: 333 LKNCASP-DRFY---SVQNSRKLHDAFLRI 358 + A +Y + L A +I Sbjct: 627 ARRLADALGAYYCPLEELKAESLLAAVRKI 656 >gi|163755380|ref|ZP_02162500.1| von Willebrand factor type A like domain [Kordia algicida OT-1] gi|161324800|gb|EDP96129.1| von Willebrand factor type A like domain [Kordia algicida OT-1] Length = 718 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 47/308 (15%), Positives = 108/308 (35%), Gaps = 46/308 (14%) Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELREN----GFAQDINNIE------RSTSLS 115 EN + ++ + + + + + ++ + + +NI+ R +S Sbjct: 172 GENTSGETVFNQPYTEKGVGDTFNYNLDVQINAGIPIADVSSNTHNIKVHYPNTRKAEIS 231 Query: 116 IIIDDQHKDYNLSAVSRYEM---PFIFCTFPWCANSSHAPLLITSSVKISSKSDI-GLDM 171 + ++++ + +Y M + N + K S + + Sbjct: 232 LTSENKNPSNR-DFILKYSMRGNEIQSGLLLYEENGEKFFAYMMEPPKASVNIKLTAKEY 290 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 + V+DVS SMN + + V+ + +R +L + N ++ +G + ++ Sbjct: 291 LFVVDVSGSMNGY------PMEVSKKLLRNLLVNLPETDHYNILLFAGGSS----VLAPE 340 Query: 232 PLAWGVQHIQEKINRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PLA ++IQ+ IN L G T+ L+ AY + + + ++ Sbjct: 341 PLACTKENIQKGINFLTNERGGGGTRLLNALKTAYA-----------LPRMDKTSARSMV 389 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQ 346 +TDG S + +A V+ G+ + L+ A S + Sbjct: 390 VITDGYVSVERKAFEMIEQNLGQA-----NVFTFGIGSGVNRYLLEGMAKISNSETFIAT 444 Query: 347 NSRKLHDA 354 + +D Sbjct: 445 EMNEANDV 452 >gi|261878614|ref|NP_001159906.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform b [Homo sapiens] Length = 769 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 145 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 197 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 198 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 254 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 255 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 313 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 314 QRIYEDHDATQQLQGFYS 331 >gi|221042234|dbj|BAH12794.1| unnamed protein product [Homo sapiens] Length = 769 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 145 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 197 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 198 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 254 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 255 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 313 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 314 QRIYEDHDATQQLQGFYS 331 >gi|206575542|ref|YP_002235853.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] gi|206570386|gb|ACI12032.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] Length = 212 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 56/167 (33%), Gaps = 12/167 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + ++ ML ++ P V ++T+ ++ Sbjct: 3 RLPVYLLIDTSGSMRGE------SIHSVNVGVQAMLSALRQDPYALESVHISIITYDNEA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL I + T + LE + + + KG + Sbjct: 57 REYVPLTALADFQFSDI-EVPSAGGTFTGAALECLIQCVDRDIRRSDGEQKGDWRP--LV 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 +TDG +P+ +R + A V +A + LK Sbjct: 114 FLMTDG---TPSDAWAYGEAVKEVQRRSFGSIIACAVGPKAGHEHLK 157 >gi|189054356|dbj|BAG36876.1| unnamed protein product [Homo sapiens] Length = 911 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 456 QRIYEDHDATQQLQGFYS 473 >gi|123473701|ref|XP_001320037.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] gi|121902834|gb|EAY07814.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] Length = 967 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 70/197 (35%), Gaps = 17/197 (8%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 + +D S SM K+ +A + +++ + + GL TF+S + Sbjct: 461 ICIDTSGSMGG------TKIQIAKTCFKVIVNRAYEVGPSSLW---GLYTFNSTPERKLK 511 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L+ ++++L T + A N+I + + + + K II LTD Sbjct: 512 LSPIPADFHTQVDKLGVSGCTALYYCIIAAMNEINEKVKSDS----SYKNALKRIIALTD 567 Query: 293 GENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 G +++ + + + + G + I + + +N A + + Sbjct: 568 GGDNTYSHNTARGIADLTKQLIDNGIYLDYIELGNVGDMKLPRNMA---YYTGGDYLKFT 624 Query: 352 HDAFLRIGKEMVKQRIL 368 D F + ++I Sbjct: 625 SDIFQSSSSRVDIRKIR 641 >gi|119585665|gb|EAW65261.1| inter-alpha (globulin) inhibitor H1, isoform CRA_c [Homo sapiens] Length = 911 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 456 QRIYEDHDATQQLQGFYS 473 >gi|119585664|gb|EAW65260.1| inter-alpha (globulin) inhibitor H1, isoform CRA_b [Homo sapiens] Length = 893 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 456 QRIYEDHDATQQLQGFYS 473 >gi|169624118|ref|XP_001805465.1| hypothetical protein SNOG_15311 [Phaeosphaeria nodorum SN15] gi|111056124|gb|EAT77244.1| hypothetical protein SNOG_15311 [Phaeosphaeria nodorum SN15] Length = 1587 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 25/118 (21%), Positives = 49/118 (41%), Gaps = 10/118 (8%) Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 P + +L V + ++ + + N GLV F SK P+ V++ + K+N Sbjct: 1189 PHLSRLDVLKQMFDAFINRLLAY---NFQTHIGLVAFGSKASVAQPITNAVENFRHKLNN 1245 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 ++ T + A +++ EK + + II ++DGE++ N D Sbjct: 1246 MLASGDTAIWDSIALAQDQLQTYAEK-------YPTARLRIICISDGEDTKSNQDKVN 1296 >gi|38639543|ref|NP_943312.1| hypothetical protein LV052 [Klebsiella pneumoniae] gi|168998770|ref|YP_001688038.1| TerY2 [Klebsiella pneumoniae NTUH-K2044] gi|38016641|gb|AAR07662.1| hypothetical protein LV052 [Klebsiella pneumoniae] gi|238549791|dbj|BAH66142.1| hypothetical protein KP1_p253 [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] Length = 212 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 56/167 (33%), Gaps = 12/167 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + ++ ML ++ P V ++T+ ++ Sbjct: 3 RLPVYLLIDTSGSMRGE------SIHSVNVGVQAMLSALRQDPYALESVHISIITYDNEA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL I + T + LE + + + KG + Sbjct: 57 REYVPLTALADFQFSDI-EVPSAGGTFTGAALECLIQCVDRDIRRSDGEQKGDWRP--LV 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 +TDG +P+ +R + A V +A + LK Sbjct: 114 FLMTDG---TPSDAWAYGEAVKEVQRRSFGSIIACAVGPKAGHEHLK 157 >gi|34527040|dbj|BAC85316.1| unnamed protein product [Homo sapiens] Length = 401 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 181 SYHMPNWFGTTK-YVKPLVQKLCSHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 239 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ +E + +I Sbjct: 240 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAVIRNI 288 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 289 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 335 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 336 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 367 >gi|156119625|ref|NP_002206.2| inter-alpha-trypsin inhibitor heavy chain H1 isoform a [Homo sapiens] gi|2851501|sp|P19827|ITIH1_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; AltName: Full=Inter-alpha-trypsin inhibitor complex component III; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|825630|emb|CAA49279.1| inter-alpha-trypsin inhibitor heavy chain H1 [Homo sapiens] gi|62550746|gb|AAH69464.1| Inter-alpha (globulin) inhibitor H1 [Homo sapiens] gi|119585663|gb|EAW65259.1| inter-alpha (globulin) inhibitor H1, isoform CRA_a [Homo sapiens] gi|158258937|dbj|BAF85439.1| unnamed protein product [Homo sapiens] Length = 911 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 456 QRIYEDHDATQQLQGFYS 473 >gi|825681|emb|CAA34346.1| inter-alpha-trypsin inhibitor C-terminal [Homo sapiens] Length = 837 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 213 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 265 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 266 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 322 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 323 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 381 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 382 QRIYEDHDATQQLQGFYS 399 >gi|33989|emb|CAA45188.1| inter-alpha-trypsin inhibitor heavy chain ITIH1 [Homo sapiens] Length = 911 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 456 QRIYEDHDATQQLQGFYS 473 >gi|323474472|gb|ADX85078.1| von Willebrand factor type A [Sulfolobus islandicus REY15A] gi|323477209|gb|ADX82447.1| von Willebrand factor type A [Sulfolobus islandicus HVE10/4] Length = 356 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 72/198 (36%), Gaps = 34/198 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + ++++D S SM KL A +S +++L ++N L+ Sbjct: 32 QSSVTSSIHYIIMIDNSPSMRGE------KLNTAVQSAQKLL------YNLNEGNYVTLI 79 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FS+ +I + + ++ +A ++AK Sbjct: 80 LFSNHP---------------EIKYQGPAKGIITFDVGKGYTTRLHEAVSFTINLAKQSQ 124 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K II LTDG+ + D + Y + IG+ + ++ LK A S Sbjct: 125 VPTK-IIMLTDGKPT----DKRNVKDYEKLDIPPNTQIITIGIGNDYNERILKKLADRSS 179 Query: 340 DRFYSVQNSRKLHDAFLR 357 +FY +++ +L + F Sbjct: 180 GKFYHIKDISELPNIFES 197 >gi|121535594|ref|ZP_01667401.1| von Willebrand factor, type A [Thermosinus carboxydivorans Nor1] gi|121305834|gb|EAX46769.1| von Willebrand factor, type A [Thermosinus carboxydivorans Nor1] Length = 586 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 32/246 (13%), Positives = 70/246 (28%), Gaps = 39/246 (15%) Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS--------KSDIGL 169 + Q + + + + A + I S K Sbjct: 351 MHGQEQRHAKRWTVGGSLGQLAVAETVIAAAQRCAAGPGGPFTIGSQDIHHFIRKKKSKT 410 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S SM+ ++ ++ R ++ F + Sbjct: 411 DICLIIDASASMSGQR--------------VGAAKLLAKHLLLSTSDRVAVIVFQENQAR 456 Query: 230 -TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 PL + + + +T GL+ + +++ K +I Sbjct: 457 VQVPLTRDFAQAESSLAHIESFGSTPLALGLKVGIEYLKESRAKNP-----------LVI 505 Query: 289 FLTDGENSSPNIDN---KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYS 344 +TDG + +I ++L K G IG++ D + A+ Y Sbjct: 506 LITDGVPTVGDITGDPLADALTAAASIKSHGYGFTCIGLK-PHRDYLTQVAQAAGGNIYV 564 Query: 345 VQNSRK 350 + K Sbjct: 565 LDELEK 570 >gi|332817002|ref|XP_003309880.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Pan troglodytes] Length = 769 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 71/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 145 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 197 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 198 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 254 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 255 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 313 Query: 342 --FYSVQNSRKLHDAFLR 357 Y +++ + F Sbjct: 314 QRIYEDRDATQQLQGFYS 331 >gi|328945098|gb|EGG39253.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1087] Length = 471 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 44/297 (14%), Positives = 93/297 (31%), Gaps = 52/297 (17%) Query: 44 VKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ 103 ++ + Y+ TA IL+ N K +SY + + + Sbjct: 57 MRTSMQYVDRTISKSTAIFILDDSKYGNKSKFTKGWSYVGLSEDGKKILNYIWDKANNDW 116 Query: 104 DINNIERSTSLSIIID---DQHKDYNLSAVSRYEMPFIF------CTFPWCANSSHAPLL 154 I + + + ++ ++ DY + + YE+ + + ++ + + Sbjct: 117 IIRELGTKSLYDLKMELDFKKNDDYKDNRLVSYELKGKYAGSQNQLSIHTAMSALNTKQV 176 Query: 155 ITSSVKISS-----------KSDIGLDMMMVLDVSLSMNDHFG-------------PGMD 190 + K + + + V D S SM Sbjct: 177 FSKVAKGKRGIALAYRDDPIEGQANVAISFVFDASGSMEFSLDGTEKVNPYSNNPLKNRS 236 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI----VQTFPLAWGVQHIQEKINR 246 ++ + ++M+ ++ I +V+ LV F+S L G+ I I+ Sbjct: 237 RIDILREKTKKMMADLQPIGNVS----VNLVQFNSHASFVQQNFIELDKGLTSINSAIDN 292 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 L T GL Y + +L KY++ LTDG +S + + Sbjct: 293 LNPEHATNPGDGLRYGMVSLQSNAAQL-----------KYVVLLTDGVPNSYMVGPQ 338 >gi|291223815|ref|XP_002731903.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 996 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 70/202 (34%), Gaps = 35/202 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VLD S SM + ++ ++ +D I D + G+V FSS V Sbjct: 350 VVLVLDTSGSMAGN------RIQRLYQTATYFIDT--KIEDGSF---VGIVGFSSTAVIL 398 Query: 231 FPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + I + + T GLE A + + E + Sbjct: 399 AGMTEIKYGFQRDDIASNV-PQVVDGFTSIGAGLELALQVLENGNVASEGAS-------- 449 Query: 286 YIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 ++ +TDG EN SP I N Y G V E+A L+N + + + Sbjct: 450 -LLLITDGAENRSPFIANVLPDIY-----DSGVRVDTFAY-TESAQLILQNLSDTTGGLY 502 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 + V ++ + + Sbjct: 503 FYVPDNDNSTAFIDSLAATITD 524 >gi|256599878|pdb|3IBS|A Chain A, Crystal Structure Of Conserved Hypothetical Protein Batb From Bacteroides Thetaiotaomicron gi|256599879|pdb|3IBS|B Chain B, Crystal Structure Of Conserved Hypothetical Protein Batb From Bacteroides Thetaiotaomicron Length = 218 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 50/136 (36%), Gaps = 17/136 (12%) Query: 161 ISSKSDIGLDMMMVLDVSLS-MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++ G+++++ LD+S S + P +L A R I ++D + + + G Sbjct: 4 SNAVKRKGVEVIIALDISNSXLAQDVQP--SRLEKAKRLISRLVDELDN-------DKVG 54 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + F+ P+ + + + +K + A N Sbjct: 55 XIVFAGDAFTQLPITSDYISAKXFLESISPSLISKQGTAIGEAIN-------LATRSFTP 107 Query: 280 HDDYKKYIIFLTDGEN 295 + + II +TDGEN Sbjct: 108 QEGVGRAIIVITDGEN 123 >gi|168705263|ref|ZP_02737540.1| hypothetical protein GobsU_37375 [Gemmata obscuriglobus UQM 2246] Length = 987 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 51/181 (28%), Gaps = 7/181 (3%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN-- 68 +G+I L + L +F + L ++ + + D + L + N++ Sbjct: 10 RGRRGTILPLLGVCLIGLFGFVALAVDLGMLAVSRTQSQNGADVAALVGTRTLNNRDGVA 69 Query: 69 -----GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 Q + S + + + +++ + D + + Sbjct: 70 YNNLPAAVTAAQASVTSNPHLSTNFVSGEVSKMEVGQYLYDPTSQTFQVQNWTQVTGGGA 129 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 S M + S +++ D+ VLD++ SM Sbjct: 130 MSAPGGNSWTAMRVTLGVSQPTYFMRVFGVNSMPSGAVATAVYRPRDIAFVLDMTGSMAF 189 Query: 184 H 184 Sbjct: 190 S 190 >gi|114587328|ref|XP_001172432.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 2 [Pan troglodytes] Length = 893 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 71/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y +++ + F Sbjct: 456 QRIYEDRDATQQLQGFYS 473 >gi|114587324|ref|XP_001172464.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 5 [Pan troglodytes] Length = 911 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 71/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y +++ + F Sbjct: 456 QRIYEDRDATQQLQGFYS 473 >gi|114587330|ref|XP_001172443.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 3 [Pan troglodytes] Length = 898 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 71/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y +++ + F Sbjct: 456 QRIYEDRDATQQLQGFYS 473 >gi|114587326|ref|XP_001172455.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 4 [Pan troglodytes] Length = 911 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 71/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y +++ + F Sbjct: 456 QRIYEDRDATQQLQGFYS 473 >gi|17560426|ref|NP_504602.1| hypothetical protein F31F7.2 [Caenorhabditis elegans] gi|15617822|gb|AAB52325.2| Hypothetical protein F31F7.2 [Caenorhabditis elegans] Length = 689 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 73/198 (36%), Gaps = 23/198 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ + +D+M ++D S S G++ + I E+L + P + R ++ Sbjct: 476 PARKLLPIDLMFLVDTSSS------IGINNFDIQKNFICEILKDVDIAPGRS---RIAMI 526 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYA------YNKIFDAKEK 272 +S F + + ++ + RL G T + L +A + +K Sbjct: 527 QYSQDPSVVFGFDQYYSYESVRRGVMRLSYTGGATMLSKALAFAGGIMYHEQNLKKTTKK 586 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +++ D + + ++DG + +KES+ + ++A+ ++ D+ Sbjct: 587 HQYLPTPKHDRLQVLCLVSDGYSDDN--ADKESVNLHDHLH---VKIFAVVTRSFNKDKL 641 Query: 333 LKNCASPDRFYSVQNSRK 350 ++V Sbjct: 642 APITRFDGSVFTVHQRES 659 >gi|326670660|ref|XP_003199262.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1404 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 96/306 (31%), Gaps = 25/306 (8%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L Y+ D+ ++ + S+ + + L+E G Sbjct: 693 LQYLRDNVFTASSGSRRVEGVPQLLILLSGARSFDNVDT-----PASSLKELGVLIFAIG 747 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 S S + Q Y LS ++P + N + T+S I+ Sbjct: 748 SRSSDSQELQRISQEPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQ 807 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ +LD S + F ++ M++ + + + R +V +S + Sbjct: 808 RRDVVFLLDGSDGTRNGFP-------AMKDFVQRMVEKLDVAENRD---RISVVQYSREP 857 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I + + L G A + D ++ + + Sbjct: 858 EANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTG--EALQYVRDNVFTASSGSRRLEGVPQ 915 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ L+ G + SL K G + + IG + + + + P SV Sbjct: 916 ILVLLSGGRSFDSVDAAASSL------KELGVLTFGIGSRGSDSRELQRISYEPSYALSV 969 Query: 346 QNSRKL 351 + +L Sbjct: 970 SDFSEL 975 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 91/312 (29%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + L Y+ D+ ++ + S+ + + L+E G Sbjct: 88 LYTGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSV-----DAAASSLKELGV 142 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TS Sbjct: 143 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSPTVT 202 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 203 AEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIEANRDRVSVV 252 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 253 QYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNVFTASSGSRR 310 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + + Sbjct: 311 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRELQRISYEA 364 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 365 NYALSVADFSEL 376 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 91/312 (29%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + L Y+ D+ ++ + S+ + + L+E G Sbjct: 887 LYTGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSV-----DAAASSLKELGV 941 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TS Sbjct: 942 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSPTVT 1001 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 1002 AEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIEANRDRVSVV 1051 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 1052 QYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNVFTASSGSRR 1109 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + + Sbjct: 1110 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRELQRISYEA 1163 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 1164 NYALSVADFSEL 1175 >gi|326670658|ref|XP_003199261.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1823 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 96/306 (31%), Gaps = 25/306 (8%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L Y+ D+ ++ + S+ + + L+E G Sbjct: 88 LQYLRDNVFTASSGSRRVEGVPQLLILLSGARSFDNVDT-----PASSLKELGVLIFAIG 142 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 S S + Q Y LS ++P + N + T+S I+ Sbjct: 143 SRSSDSQELQRISQEPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQ 202 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ +LD S + F ++ M++ + + + R +V +S + Sbjct: 203 RRDVVFLLDGSDGTRNGFP-------AMKDFVQRMVEKLDVAENRD---RISVVQYSREP 252 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I + + L G A + D ++ + + Sbjct: 253 EANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTG--EALQYVRDNVFTASSGSRRLEGVPQ 310 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ L+ G + SL K G + + IG + + + + P SV Sbjct: 311 ILVLLSGGRSFDSVDAAASSL------KELGVLTFGIGSRGSDSRELQRISYEPSYALSV 364 Query: 346 QNSRKL 351 + +L Sbjct: 365 SDFSEL 370 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 96/306 (31%), Gaps = 25/306 (8%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L Y+ D+ ++ + S+ + + L+E G Sbjct: 887 LQYLRDNVFTASSGSRRVEGVPQLLILLSGARSFDNVDT-----PASSLKELGVLIFAIG 941 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 S S + Q Y LS ++P + N + T+S I+ Sbjct: 942 SRSSDSQELQRISQEPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQ 1001 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ +LD S + F ++ M++ + + + R +V +S + Sbjct: 1002 RRDVVFLLDGSDGTRNGFP-------AMKDFVQRMVEKLDVAENRD---RISVVQYSREP 1051 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I + + L G A + D ++ + + Sbjct: 1052 EANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTG--EALQYVRDNVFTASSGSRRLEGVPQ 1109 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ L+ G + SL K G + + IG + + + + P SV Sbjct: 1110 ILVLLSGGRSFDSVDAAASSL------KELGVLTFGIGSRGSDSRELQRISYEPSYALSV 1163 Query: 346 QNSRKL 351 + +L Sbjct: 1164 SDFSEL 1169 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 91/312 (29%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + L Y+ D+ ++ + S+ + + L+E G Sbjct: 282 LYTGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSV-----DAAASSLKELGV 336 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TS Sbjct: 337 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSPTVT 396 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 397 AEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIEANRDRVSVV 446 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 447 QYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNVFTASSGSRR 504 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + + Sbjct: 505 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRELQRISYEA 558 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 559 NYALSVADFSEL 570 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 90/312 (28%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + L Y+ D+ ++ + S+ + + L+E G Sbjct: 1081 LYTGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSV-----DAAASSLKELGV 1135 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TS Sbjct: 1136 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSPTVT 1195 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 1196 AEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIEANRDRVSVV 1245 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 1246 QYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNVFTASSGSRR 1303 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + Sbjct: 1304 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRDLQRISYEA 1357 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 1358 NYALSVADFSEL 1369 >gi|296161494|ref|ZP_06844300.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295888309|gb|EFG68121.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 328 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 37/259 (14%), Positives = 80/259 (30%), Gaps = 52/259 (20%) Query: 145 CANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGP-------GMDKLGV 194 + + + + + G +++++D S SM++ G K V Sbjct: 57 VLAMAAIVFGLAGPGRSQRQVLRTGSGAQILILMDRSASMDETMNSKGVESPAGESKNKV 116 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST-- 252 A S+ + R + F + V P + ++ I+ I G Sbjct: 117 ARASLTNFV-------AQRPNDRLAFMMFGTSPVLAMPFTYDLRAIEAAIAGTAVGRGMP 169 Query: 253 -TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T+ GL A + ++ I+ ++DG +D + + Sbjct: 170 DTQLDRGLLAAIGEFNGRA----------SSGRRAIVLVSDG---GARLDTQVRRLIQDG 216 Query: 312 AKRRGAIVYAIGVQ----------------AEAADQFLKNCAS---PDRFYSVQNSRKLH 352 R +Y I ++ + A + + S P R + N+R + Sbjct: 217 LMRNQIALYFIYLRSGTYSPDLNAAVPANESSAEAELHRYFLSLKTPYRLFQAGNARAMK 276 Query: 353 DAFLRIGKEMVKQRILYNK 371 DA I ++ + + Sbjct: 277 DAMAEINRQQNARTSFVER 295 >gi|226941831|ref|YP_002796905.1| VCBS [Laribacter hongkongensis HLHK9] gi|226716758|gb|ACO75896.1| VCBS [Laribacter hongkongensis HLHK9] Length = 1087 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 74/222 (33%), Gaps = 37/222 (16%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 D N S+ + L+ +MP + + +V + S Sbjct: 214 DENTRAFDVSIKVSDGQTTTTAPLNITVEDDMPVVA--------------NTSQTVSLPS 259 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPG------MDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + ++++ LDVS SMND G + +L +A ++I ++LD ++ D V+ Sbjct: 260 Q---DTNLLLTLDVSGSMNDPSGVKDANGKDLSRLALAKQAISQLLDQYDALGD----VK 312 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHI 276 LV FS V + + + G T L+ A + Sbjct: 313 VQLVKFSEGGSVQSNNWMTVAEAKAALAGITKGDGGTNYDEALDLARQAFAKPGQLDGAK 372 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNK-ESLFYCNEAKRRGA 317 + F +DG+ + N K S + K G Sbjct: 373 NVSY--------FFSDGDPTLSNSGQKNNSGATVDPDKGDGI 406 >gi|220923701|ref|YP_002499003.1| hypothetical protein Mnod_3796 [Methylobacterium nodulans ORS 2060] gi|219948308|gb|ACL58700.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 439 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 35/95 (36%), Gaps = 1/95 (1%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + G+ +++ A+ PV+ MGL ET + + + KL + D S + Sbjct: 16 RALKRDVSGTAAVIAALAFPVVIGGMGLGAETGYWYLTQRKLQHAADLSAHAAGVRKRAG 75 Query: 67 ENGNNGKKQKNDFSYRI-IKNIWQTDFRNELRENG 100 + + + + + + N +G Sbjct: 76 DPKSQIDAAALNIALNSGMSSSLGNMLANSPPTSG 110 >gi|296214738|ref|XP_002807270.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Callithrix jacchus] Length = 594 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 68/213 (31%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 374 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 432 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--- 248 +I K+ + + V F+ Q ++ + +E + +I Sbjct: 433 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRMEFSFTDYNTKENVLAVIRNI 481 Query: 249 --FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 482 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 528 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G ++++GV D + P Sbjct: 529 -PAAAAHDAGITIFSVGVAWAPLDDLKDMASKP 560 >gi|119504634|ref|ZP_01626713.1| TPR domain protein [marine gamma proteobacterium HTCC2080] gi|119459656|gb|EAW40752.1| TPR domain protein [marine gamma proteobacterium HTCC2080] Length = 611 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 25/128 (19%), Positives = 48/128 (37%), Gaps = 23/128 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++VLD+S SM+ ++ A + I ++L + V +GLV ++ Sbjct: 94 ALVIVLDLSASMS-AADIQPSRIQRAKQKILDLL------ARRDEGV-TGLVVYAGDAHV 145 Query: 230 TFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL + I+ ++ L + T LE A + A Sbjct: 146 VAPLTDDHRTIENLLSALTPDIMPLPGSNVTAALEVATGLLQTAGVANGQ---------- 195 Query: 286 YIIFLTDG 293 ++ +TDG Sbjct: 196 -VLLITDG 202 >gi|302519121|ref|ZP_07271463.1| von Willebrand factor type A [Streptomyces sp. SPB78] gi|302428016|gb|EFK99831.1| von Willebrand factor type A [Streptomyces sp. SPB78] Length = 453 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 55/207 (26%), Gaps = 38/207 (18%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 Q + + +P + + D + ++++D S Sbjct: 20 PQGPRFEVEVYQNPYLPEGSGEVHAVVTVTATGGGTGALTAAGPGQDAAV--VLMVDCSG 77 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP------- 232 SM KL A + LD + + R +V + + +P Sbjct: 78 SMQY----PPSKLHHAKEATGAALDTL------RDGTRFAVVEGTHVAREVYPRGGALAV 127 Query: 233 -LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 +E + L T L A + A + H I LT Sbjct: 128 ADDRTRAEAKEALRALRASGGTAVGRWLRLAERLLSQAPVTIRHG-----------ILLT 176 Query: 292 DGENSS-------PNIDNKESLFYCNE 311 DG N +D+ F C+ Sbjct: 177 DGRNEHETPEELRAALDDCAGRFTCDA 203 >gi|269126092|ref|YP_003299462.1| type II secretion system protein [Thermomonospora curvata DSM 43183] gi|268311050|gb|ACY97424.1| type II secretion system protein [Thermomonospora curvata DSM 43183] Length = 660 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 69/206 (33%), Gaps = 30/206 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +M+VLD S SM+ A LD++ + V +G T + Sbjct: 92 ARAVMLVLDTSGSMSAADL------AAAKAGAVAFLDLLPA------DVPAGFTTTGTPT 139 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + +++ + R+ G T + A +++ A + Sbjct: 140 RPVIDPTTDRRSLRKALGRMRTGGETALYDAMSAAVDRLARANAAEGR-----------L 188 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-FLKNCA--SPDRFYS 344 + L+DG++S+ + L KR + + A + L+ A S R S Sbjct: 189 VVLSDGKDSASTSTLAQVLA---RLKRTRIAADVVAFKTAATSEGTLRQLAADSGGRLLS 245 Query: 345 VQNSRKLHDAFLRIGKEMVKQRILYN 370 + R+L+ A +Q + Sbjct: 246 SPDPRRLNAA-FADAAASFRQSMWIT 270 >gi|323650180|gb|ADX97176.1| inter-alpha (globulin) inhibitor h4 isoform 1 [Perca flavescens] Length = 354 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 61/180 (33%), Gaps = 26/180 (14%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 SS S I +++ V+D S SM+ K+ ++ +L+ + GL Sbjct: 72 PSSLSRIPKNVVFVIDQSGSMSG------RKMQQTRIALIHILNDLAEDDHF------GL 119 Query: 221 VTFSSKI-----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +TF S I ++ + + TT + A+ H Sbjct: 120 ITFDSSIFHWKRELVQANKKNLESAKTFARNIQDRGTTDINAA------VLEGARMLNAH 173 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +G II LTDG+ +S + + +Y +G + +FL+ Sbjct: 174 PREGSAS---IIILLTDGDPTSGETNLERIQSNVRRDIADKFPLYCLGFGHDVNFEFLEK 230 >gi|311900457|dbj|BAJ32865.1| hypothetical protein KSE_71090 [Kitasatospora setae KM-6054] Length = 717 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 41/280 (14%), Positives = 85/280 (30%), Gaps = 44/280 (15%) Query: 92 FRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHA 151 F N + G +++ + + Y +A + + + P + Sbjct: 118 FTNPVLRTGPGSGSDSVASPYAFTTPALAAGNTYRSTA--DFMISDDYRNLPTASG---- 171 Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + S + GLD+ +VLD+S S+ +L + D + P Sbjct: 172 GVWQDSRDNPPLPARCGLDVALVLDLSASVG-------SELPFLKTAADRFTDALTGTPS 224 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R + +F A V +++ + + L + Sbjct: 225 -----RLAVFSFD-----QASPATSVSANHPELHPVSTPAGAAEFKALYAGWTLGKGTNW 274 Query: 272 KLEHIAKGHDDYKKY--IIFLTDGENSSPNI------------DNKESLFYCNEAKRRGA 317 + + +Y ++ LTDG + D + +F N K G+ Sbjct: 275 DTALWSVANAAP-RYDAVVVLTDGNPTRFADDAQGDGSRTHFRDVENGIFSANAVKAEGS 333 Query: 318 IVYAIGVQ---AEAADQFLKNCASPDRFYSVQNSRKLHDA 354 + A+GV A + L+ + P + + L A Sbjct: 334 RLIALGVGKGVAGDSGLNLRAVSGPTAY---ADGGDLTAA 370 >gi|218460899|ref|ZP_03500990.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 373 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 54/134 (40%), Gaps = 14/134 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ PW ++ + I + + +++ ++DVS SM++ DKL Sbjct: 224 FKATVTVMPTPWNHDTELMHVAIKGYDIAPATAPHA-NLVFLIDVSGSMDEP-----DKL 277 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFG 250 + S R +++ +K+ V +VT++ I I+RL G Sbjct: 278 PLLKSSFRLLVNRLKADDTV------AIVTYAGNAGTVLEPTRVSEKSKILSAIDRLEAG 331 Query: 251 STTKSTPGLEYAYN 264 +T G+E AY+ Sbjct: 332 GSTGGAEGIEAAYD 345 >gi|239617868|ref|YP_002941190.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239506699|gb|ACR80186.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 612 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 70/210 (33%), Gaps = 34/210 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +D+++VLD S SM + G D + A IK + + +R LVTF + Sbjct: 68 SDMDIVIVLDSSGSMRNVIGIMDDLMEKA----------IKKLKEEGLRLRYALVTFGDE 117 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTK----STPGLEYAYNKIFDAKEKLEHIAKGHDD 282 I + + ++ S L A N FD K Sbjct: 118 IRELKGFTSRDDVFISWLRDVVPFGGGDDPEISLDALSMASNLPFDVK------------ 165 Query: 283 YKKYIIFLTDG----ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 KK I+ +T+ N +E +G + + +++K S Sbjct: 166 AKKVIVLITNAPAHFVEDGTAYSNVSVNGLIDELNGKGVELL---LLVPPEPEYVKISES 222 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +F+++ + AF + + R+ Sbjct: 223 LGGKFFNIFKIAGPNKAFEELANLNFRTRL 252 >gi|167045641|gb|ABZ10290.1| putative magnesium chelatase, subunit ChlI [uncultured marine microorganism HF4000_APKG10K24] Length = 709 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 48/137 (35%), Gaps = 18/137 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ G ++ V+D S SM ++ ++ +L + R GL++ Sbjct: 498 RETKTGTLILFVVDASGSMA-----AQRRMVAVKGAVHSLL-----MDAYQRRDRVGLIS 547 Query: 223 FSS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F P V+ Q + + G T + GL A + + + + Sbjct: 548 FRGTDAKLLLPPTNSVELAQTHLADMPTGGRTPLSQGLYVALQ-LIETERLKDRDVVP-- 604 Query: 282 DYKKYIIFLTDGENSSP 298 +I L+DG + P Sbjct: 605 ----LVILLSDGRANVP 617 >gi|190891604|ref|YP_001978146.1| hypothetical protein RHECIAT_CH0002007 [Rhizobium etli CIAT 652] gi|190696883|gb|ACE90968.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 524 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 60/192 (31%), Gaps = 25/192 (13%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 + LD S SM G G D+L A R + + + + + + ++ F S + F Sbjct: 347 ALCLDFSGSMQ---GNGEDQLQKAMRFLLTPDEASRVLVQWSPSDQIIVIPFDSSVRNMF 403 Query: 232 PLAWGV---QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + + +++R T E A +I ++ I+ Sbjct: 404 TASGNPLEQEGLLNEVSRQKADGGTNMYACAERALQQIARTDRLSTYLPA--------IV 455 Query: 289 FLTDG---ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +TDG + S + ++ ++ I +A L A Sbjct: 456 IMTDGRSDDQSQAFMSEWNTIEP-------RVPIFGITFG-DADKTQLDTLAKQTSARVF 507 Query: 346 QNSRKLHDAFLR 357 L AF Sbjct: 508 DGGSDLATAFRT 519 >gi|189190514|ref|XP_001931596.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187973202|gb|EDU40701.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 1331 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 62/152 (40%), Gaps = 14/152 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + L ++LD LS + + +L V + ++ + + + GL Sbjct: 1041 APINASQPLVFKLLLDT-LSTPSNETNHLTRLDVLKQMFDAYINRVLAYSFQPH---IGL 1096 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 VTF++K + V++ + K+N L T + A ++I + AK + Sbjct: 1097 VTFNTKTQVAQKITNAVENSRHKLNNLAAYGDTAIWDSVALAQDQI-------QQHAKQY 1149 Query: 281 DDYKKYIIFLTDGENS---SPNIDNKESLFYC 309 + K II ++DGE++ + D + L C Sbjct: 1150 PNAKLRIICISDGEDNTSLNTVEDVAKRLTRC 1181 >gi|261250852|ref|ZP_05943426.1| TPR domain protein in aerotolerance operon [Vibrio orientalis CIP 102891] gi|260937725|gb|EEX93713.1| TPR domain protein in aerotolerance operon [Vibrio orientalis CIP 102891] Length = 714 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 55/151 (36%), Gaps = 26/151 (17%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + P T ++ + S + +V+D+S+SM + ++L A ++L+ Sbjct: 66 IIALAGPSFKTQELQSYANSSARV---LVMDMSMSMYAN-DIKPNRLTQARYKAIDLLEN 121 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIFGSTTKSTPGLEY 261 + +GLV ++ Q P+ + I + L+ ++ G+E Sbjct: 122 WQEGS-------TGLVAYAGDAYQVSPMTSDSKTIANLLPNLSPELMPYPGADASKGIEL 174 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 A N + + I+ +TD Sbjct: 175 AINMMTNTGLATGD-----------IVLITD 194 >gi|326670656|ref|XP_003199260.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1401 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 96/306 (31%), Gaps = 25/306 (8%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L Y+ D+ ++ + S+ + + L+E G Sbjct: 693 LQYLRDNVFTASSGSRRVEGVPQLLILLSGARSFDNVDT-----PASSLKELGVLIFAIG 747 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 S S + Q Y LS ++P + N + T+S I+ Sbjct: 748 SRSSDSQELQRISQEPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQ 807 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ +LD S + F ++ M++ + + + R +V +S + Sbjct: 808 RRDVVFLLDGSDGTRNGFP-------AMKDFVQRMVEKLDVAENRD---RISVVQYSREP 857 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I + + L G A + D ++ + + Sbjct: 858 EANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTG--EALQYVRDNVFTASSGSRRLEGVPQ 915 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ L+ G + SL K G + + IG + + + + P SV Sbjct: 916 ILVLLSGGRSFDSVDAAASSL------KELGVLTFGIGSRGSDSRELQRISYEPSYALSV 969 Query: 346 QNSRKL 351 + +L Sbjct: 970 SDFSEL 975 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 91/312 (29%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + L Y+ D+ ++ + S+ + + L+E G Sbjct: 887 LYTGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSV-----DAAASSLKELGV 941 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TS Sbjct: 942 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSPTVT 1001 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 1002 AEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIEANRDRVSVV 1051 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 1052 QYSRDAEVHFYLNSYTKKEDILDRVKGLRHKGGRPLYTG--AALQYVRDNVFTASSGSRR 1109 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + + Sbjct: 1110 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRELQRISYEA 1163 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 1164 NYALSVADFSEL 1175 Score = 42.9 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 92/312 (29%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + L ++ D+ ++ + S+ + + L+E G Sbjct: 88 LYTGEALQFVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSV-----DAAASSLKELGV 142 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TSS Sbjct: 143 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSSTVT 202 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 203 AEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRV----------VEKFNIEANRDRVSVV 252 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 253 QYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNVFTASSGSRR 310 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + + Sbjct: 311 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRELQRISYEA 364 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 365 NYALSVADFSEL 376 >gi|331239018|ref|XP_003332163.1| hypothetical protein PGTG_13530 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309311153|gb|EFP87744.1| hypothetical protein PGTG_13530 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 501 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 65/198 (32%), Gaps = 39/198 (19%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRSGLVT 222 S+ LD+ +LD + SM + AT++I + D I + + + +R GL+ Sbjct: 26 SEKMLDLCFILDTTGSMG-------SYITAATQNIELICDEIINSERLASPECLRIGLIA 78 Query: 223 FSS-------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + + F + ++E + L + A ++ + + Sbjct: 79 YRDHPPQDMSYVTLKFAFTSNPKAVKENLKTLWASGGGDGPEAVTAAMHEALTLDWRPQ- 137 Query: 276 IAKGHDDYKKYIIFLTD------GENSS----PNIDNKESLFYCNEAKRRGAIVYAIGV- 324 K + +TD GE + + L + + G ++ + Sbjct: 138 -------ASKMAVLITDAPPHGIGEYGDGFSRGDPSGHDPLQLARKMAQTGISLFVVACE 190 Query: 325 ----QAEAADQFLKNCAS 338 ++ F + AS Sbjct: 191 PAFSGYSYSNDFFRAIAS 208 >gi|217970127|ref|YP_002355361.1| von Willebrand factor type A [Thauera sp. MZ1T] gi|217507454|gb|ACK54465.1| von Willebrand factor type A [Thauera sp. MZ1T] Length = 840 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 59/181 (32%), Gaps = 28/181 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + L + +++D S SM D + A R+++ I + R L Sbjct: 261 PAAAHPLAVKILVDCSGSMQG------DSIAAARRALQ------AIIAGLREGERFSLSR 308 Query: 223 FSSKIVQTFPLAWGVQHIQEKIN-----RLIFG-STTKSTPGLEYAYNKIFDAKEKLEHI 276 F S + W + +L T+ L + E Sbjct: 309 FGSTVEHRSRALWRTSAATRQAGQRWAMQLQADLGGTEMENALAS-TLALAGDAEPSPGT 367 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 +G ++ +TDG+ + + K A+ G ++ +G+ + A+ L+ Sbjct: 368 EEGAAAVD--LLLITDGQIHAIDRTVKR-------ARALGNRIFVVGIGSAPAEGVLRRL 418 Query: 337 A 337 A Sbjct: 419 A 419 >gi|12805443|gb|AAH02194.1| Col6a1 protein [Mus musculus] Length = 406 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 76/208 (36%), Gaps = 26/208 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H A R L ++ P + VR +V +S + Q Sbjct: 207 DITILLDSSASVGSH--NFETTKVFAKRLAERFLSAGRADP--SQDVRVAVVQYSGQGQQ 262 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY--NKIFDAKEKLEHIAKGHDDYKKYI 287 G +Q N + S+ S + A N + A K+ + Sbjct: 263 QP----GRAALQFLQNYTVLASSVDSMDFINDATDVNDALSYVTRFYREASSGATKKRVL 318 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK----------NCA 337 +F +DG + + E EA+R G ++ + V + + ++ + A Sbjct: 319 LF-SDGNSQGATAEAIE--KAVQEAQRAGIEIFVVVVGPQVNEPHIRVLVTGKTAEYDVA 375 Query: 338 SPDRF-YSVQNSRKLHDA--FLRIGKEM 362 +R + V N + L + + +++ Sbjct: 376 FGERHLFRVPNYQALLRGVLYQTVSRKV 403 >gi|307720337|ref|YP_003891477.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978430|gb|ADN08465.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 599 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 71/204 (34%), Gaps = 15/204 (7%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + +++ D+SLS + + V S+ + ++ + D Sbjct: 400 TTYEKKTRDMSTLLLADISLSTEGGITQELRIIDVIKDSLMVFSEALEKLEDK-----FA 454 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + TFSS + I+ ++ + Y ++ A + I Sbjct: 455 IYTFSSLQNKKVYF----NIIKNFKDKYDALIRGRIESIKPQYYTRMGAAIRESAKILDK 510 Query: 280 HDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 K ++ ++DG+ + +++ E K++G + I + +A ++L Sbjct: 511 QQSANKLLLIISDGKPNDEDRYDGRYGIEDTKKALQEIKKKGITPFCITIDLDA-KEYLN 569 Query: 335 NCASPDRFYSVQNSRKLHDAFLRI 358 + + V++ +KL + Sbjct: 570 YLFGQNGYAIVRDGQKLPKVLTEV 593 >gi|126334038|ref|XP_001370553.1| PREDICTED: similar to Integrin alpha-X precursor (Leukocyte adhesion glycoprotein p150,95 alpha chain) (Leukocyte adhesion receptor p150,95) (Leu M5) (CD11c antigen) [Monodelphis domestica] Length = 1224 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 61/167 (36%), Gaps = 18/167 (10%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 ++ M+ + ++++ + F + + + ++I +L Sbjct: 224 FQKMKNFVKAMISQFEKPSTQFSLMQFASNFKIHFTFEKFKNSHDPRRLVDEITQL--SG 281 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TK+ G++ N++F K +I +TDGE D E Sbjct: 282 VTKTASGIKKVINELFQKTRGAR------QYATKILIVITDGEKYD---DPLEYSQVIPT 332 Query: 312 AKRRGAIVYAIGVQA----EAADQFLKNCASP---DRFYSVQNSRKL 351 A++ G I YAIGV ++ Q L+ AS D + V N L Sbjct: 333 AEKAGIIRYAIGVGEAFERPSSRQELEEIASEPSKDHIFWVDNFGAL 379 >gi|114799760|ref|YP_759488.1| hypothetical protein HNE_0760 [Hyphomonas neptunium ATCC 15444] gi|114739934|gb|ABI78059.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 576 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 14/81 (17%), Positives = 33/81 (40%), Gaps = 1/81 (1%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 +S +R G+++IL A++ P+ ++M + I+ + + + D + + A Sbjct: 7 LSLAALRR-AREQGGNVAILFALIAPIATLMMAMAIDLGMVNLQRRNMQSMTDLAAITAA 65 Query: 61 TKILNQENGNNGKKQKNDFSY 81 + E +N F Sbjct: 66 GDLHKAETRVLTLLSENGFGD 86 >gi|74318654|ref|YP_316394.1| rubisco activation protein cbbO [Thiobacillus denitrificans ATCC 25259] gi|74058149|gb|AAZ98589.1| rubisco activation protein cbbO [Thiobacillus denitrificans ATCC 25259] Length = 773 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 69/174 (39%), Gaps = 27/174 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPD------VNNVVRS 218 + +M++LD+S S+N+ G G L ++ ++ + I+ + D ++ R Sbjct: 576 GRDIAVMLLLDLSESLNEKAAGAGQTILELSQEAVSLLAWSIEKLGDPFAIAGFHSNTRH 635 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F K W ++ ++ + G +T+ + +A + + Sbjct: 636 DVRYFHIKGYSE---RWN-DDVKARLAAMEAGYSTRMGAAMRHAAHYLSARPAD------ 685 Query: 279 GHDDYKKYIIFLTDGENSSPNIDN-----KESLFYCNEAKRRGAIVYAIGVQAE 327 KK ++ LTDG S + + +++ E R+G Y I + A+ Sbjct: 686 -----KKLMLILTDGRPSDVDAADERLLVEDARQAVKELDRQGIFAYCISLDAQ 734 >gi|261878616|ref|NP_001159907.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform c [Homo sapiens] gi|261878618|ref|NP_001159908.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform c [Homo sapiens] Length = 623 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 66/192 (34%), Gaps = 16/192 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D+S SM K+ ++ ++L ++ D ++V G S K Sbjct: 5 VVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQSWKGSLV 57 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 +Q Q+ + T GL + +E L ++ +I L Sbjct: 58 QASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS----ILIML 113 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-----FYSV 345 TDG+ + D + L A R +Y +G FL+ + + Y Sbjct: 114 TDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 173 Query: 346 QNSRKLHDAFLR 357 ++ + F Sbjct: 174 HDATQQLQGFYS 185 >gi|225028938|ref|ZP_03718130.1| hypothetical protein EUBHAL_03228 [Eubacterium hallii DSM 3353] gi|224953713|gb|EEG34922.1| hypothetical protein EUBHAL_03228 [Eubacterium hallii DSM 3353] Length = 267 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 46/141 (32%), Gaps = 11/141 (7%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ + + + ++D S SM KL R + ++L + + + V+ Sbjct: 14 PLETMPPAKKSMVIFFLVDTSKSMEGS------KLESLNRVMGDILPELIGVGEAGTDVK 67 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +++FSS P ++ Q N L T E K+ Sbjct: 68 VAVMSFSSGCEWITPEPVLIEEYQRWEN-LRADGVTDLGDACEELCQKLSRNSFLRA--- 123 Query: 278 KGHDDYKKYIIFLTDGENSSP 298 Y I +TDG + Sbjct: 124 -PSLSYAPVIFLMTDGYPTDN 143 >gi|221044458|dbj|BAH13906.1| unnamed protein product [Homo sapiens] Length = 623 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 66/192 (34%), Gaps = 16/192 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D+S SM K+ ++ ++L ++ D ++V G S K Sbjct: 5 VVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQSWKGSLV 57 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 +Q Q+ + T GL + +E L ++ +I L Sbjct: 58 QASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS----ILIML 113 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-----FYSV 345 TDG+ + D + L A R +Y +G FL+ + + Y Sbjct: 114 TDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 173 Query: 346 QNSRKLHDAFLR 357 ++ + F Sbjct: 174 HDATQQLQGFYS 185 >gi|318060223|ref|ZP_07978946.1| hypothetical protein SSA3_19931 [Streptomyces sp. SA3_actG] gi|318080223|ref|ZP_07987555.1| hypothetical protein SSA3_26915 [Streptomyces sp. SA3_actF] Length = 440 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 55/207 (26%), Gaps = 38/207 (18%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 Q + + +P + + D + ++++D S Sbjct: 7 PQGPRFEVEVYQNPYLPEGSGEVHAVVTVTATGGGTGALTAAGPGQDAAV--VLMVDCSG 64 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP------- 232 SM KL A + LD + + R ++ + + +P Sbjct: 65 SMQY----PPSKLHHAKEATGAALDTL------RDGTRFAVIEGTHVAREVYPRGGALAV 114 Query: 233 -LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 +E + L T L A + A + H I LT Sbjct: 115 ADDRTRAEAKEALRALRASGGTAVGRWLRLAERLLSQAPVTIRHG-----------ILLT 163 Query: 292 DGENSS-------PNIDNKESLFYCNE 311 DG N +D+ F C+ Sbjct: 164 DGRNEHETPEELRAALDDCAGRFTCDA 190 >gi|227830108|ref|YP_002831887.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|229578921|ref|YP_002837319.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|229582327|ref|YP_002840726.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|227456555|gb|ACP35242.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] gi|228009635|gb|ACP45397.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|228013043|gb|ACP48804.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 380 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 74/190 (38%), Gaps = 33/190 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G ++ LD S SM + K+ +A + +++ K IP N +TFS Sbjct: 34 SATGFHYIVALDTSGSMTGY------KIELAK---QGAIELFKRIPKGNK---VSFITFS 81 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S + + +I ++ G T + A +AK + Sbjct: 82 SNVNVIKEFV-DPLDLTNEILQIAAGGQTALYTAILTA-----------NSLAKKYQMPT 129 Query: 285 KYIIFLTDGENSS-PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 Y++ LTDG + N+ N L Y + VY+ G+ + +Q L+N + + Sbjct: 130 -YLLLLTDGNPTDETNVGNYLKLPYFEKM-----QVYSFGIGDDYNEQLLQNISDKTSGV 183 Query: 342 FYSVQNSRKL 351 Y + ++ ++ Sbjct: 184 MYHISDANEI 193 >gi|149197907|ref|ZP_01874956.1| BatB [Lentisphaera araneosa HTCC2155] gi|149139128|gb|EDM27532.1| BatB [Lentisphaera araneosa HTCC2155] Length = 718 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 44/252 (17%), Positives = 81/252 (32%), Gaps = 44/252 (17%) Query: 137 FIFCTFPWC-ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLG 193 F+ C C + P T + G ++ +LD+S SM D + ++K Sbjct: 57 FLLCFSVACLIIALMRPSWGTEDKLLRKD---GHSVVFILDISNSMRAEDVYPNRLEKSK 113 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IF 249 L+ R GLV F+ PL + ++ + + Sbjct: 114 NLIAECVSSLEEH----------RVGLVVFAGSASIKCPLTLDYDFFLKMLDTVNYDSVA 163 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T+ L A +K+F G K II ++DG + +D Sbjct: 164 HGGTRIEDALMKACDKLFS----------GDTQQHKDIILISDGGDQGELLDKAIETVNE 213 Query: 310 NEAK---------RRGAIVYAI---GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 +A+ + GA + I G + L S Y + + + +L Sbjct: 214 KQARLMLIGMGDEKNGAPIPTIDGNGYMMYEGREVLTKLESETMLYLSEQCK--NAVYLP 271 Query: 358 IGKEMVKQRILY 369 +G + + +Y Sbjct: 272 LGTKQMNLARIY 283 >gi|15807991|ref|NP_285654.1| hypothetical protein DR_A0331 [Deinococcus radiodurans R1] gi|6460745|gb|AAF12450.1|AE001863_75 conserved hypothetical protein [Deinococcus radiodurans R1] Length = 494 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 68/217 (31%), Gaps = 39/217 (17%) Query: 144 WCANSSHAPLLITSSVKISSKSDIG---LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 A S+ L +T +VK ++++ L ++ V+D S SM + ++ G T+ Sbjct: 47 LLAQSAGQKLFLTLTVKPTAEARQARPDLSVVFVVDTSGSMREVVTEPTERTGRTTQVDG 106 Query: 201 EMLDIIKSIPDVNNV-----------------VRSGLVTFSSKIVQTFPL--AWGVQHIQ 241 ++ +++K + + R LV F P A + Sbjct: 107 QVYEVVKGGKNKMQLMIEALRGIVTSTLIRPGDRLALVKFDDTAEVLVPFITASNQAQLA 166 Query: 242 EKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 +++L + T G+ + + + ++ ++DG+ Sbjct: 167 AAVDKLDWYSGGTHMGAGMRAGAGLLTGERG------------SRRMVLISDGQ----TF 210 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 D V + V E L A Sbjct: 211 DAPLVEDQIGALAGLQVPVTVVAVGDEVNADLLTAIA 247 >gi|332817004|ref|XP_003339088.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Pan troglodytes] gi|332817006|ref|XP_516520.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 6 [Pan troglodytes] Length = 623 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 67/192 (34%), Gaps = 16/192 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D+S SM K+ ++ ++L ++ D ++V G S K Sbjct: 5 VVFVIDISGSMRGQ------KVKQTKEALLKILGDMQP-GDYFDLVLFGTRVQSWKGSLV 57 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 +Q Q+ + T GL + +E L ++ +I L Sbjct: 58 QASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS----ILIML 113 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-----FYSV 345 TDG+ + D + L A R +Y +G FL+ + + Y Sbjct: 114 TDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 173 Query: 346 QNSRKLHDAFLR 357 +++ + F Sbjct: 174 RDATQQLQGFYS 185 >gi|312878214|ref|ZP_07738140.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor lactoaceticus 6A] gi|311795008|gb|EFR11411.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor lactoaceticus 6A] Length = 1831 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 66/180 (36%), Gaps = 21/180 (11%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + S +S D++ V+D + SM+ D++ ++I +D +K Sbjct: 824 KKYLDITGLKSGTVSPSGQA--DIVFVIDTTGSMS-------DEIDAVKQNINNFVDKLK 874 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + + V GLVT+ G + + K G + + Sbjct: 875 T---KDISVNLGLVTYKDITCDGPNSTVGHGFFSSADDFKNALGSIKVDGGGDTP-ETLI 930 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----AKRRGAIVYAIG 323 DA E + ++ K+I+ LTD ++ ++N+ + +E K IV + Sbjct: 931 DALE-TARLLGFRENSTKFIVVLTD---ANYKLENRFGIKSADEIIERLKSDNIIVSVVS 986 >gi|297669807|ref|XP_002813080.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 3 [Pongo abelii] Length = 2575 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 48/297 (16%), Positives = 105/297 (35%), Gaps = 27/297 (9%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 924 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 983 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 984 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1040 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1041 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1093 Query: 239 HIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 I + IN++++ + GLE+ + E ++ + +T G++ Sbjct: 1094 QIIDAINKVVYKGGRHANTRVGLEH----LRVNHFVPEAGSRLDQRVPQIAFVITGGKSV 1149 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1150 EDAQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|293569888|ref|ZP_06680975.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] gi|291587636|gb|EFF19513.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] Length = 1277 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 52/136 (38%), Gaps = 23/136 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +SS+ Sbjct: 289 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 342 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A N + Sbjct: 343 GYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNGH-------- 394 Query: 281 DDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 395 ---KKVIVLLTDGVPT 407 >gi|146342693|ref|YP_001207741.1| hypothetical protein BRADO5860 [Bradyrhizobium sp. ORS278] gi|146195499|emb|CAL79524.1| Conserved hypothetical protein; putative Von Willebrand factor type A domain [Bradyrhizobium sp. ORS278] Length = 760 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 36/166 (21%), Positives = 68/166 (40%), Gaps = 25/166 (15%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L ++++DVS S D G L + ++ + + + + D GL+ F+S Sbjct: 568 DLSALLLIDVSESTRDRLASGATVLDLERLAVALLAEAMDQLGDT-----FGLLAFASDG 622 Query: 228 VQTFPLA----WGVQH---IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + +G + + ++ L+ G +T+ L +A + A Sbjct: 623 RDDIRMTSIKNFGEAYDRDCRARLAGLVSGLSTRLGAALRHAGAVLGVA----------- 671 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +K +I LTDGE S ID + L +A+R ++A G+ A Sbjct: 672 TSSRKLLIVLTDGEPSD--IDVSDPLDLIEDARRAALGLHAQGIDA 715 >gi|284031056|ref|YP_003380987.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283810349|gb|ADB32188.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 837 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 49/159 (30%), Gaps = 18/159 (11%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQTFPL 233 +D S SM ++ +I +L + + GLVTF+ S+ P Sbjct: 661 VDASGSMGTKR-----RMSEVKTAIVSLL-----LDAYQRRDKVGLVTFARSQATVALPP 710 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 V+ ++ L G T GL A + + A + ++ +TDG Sbjct: 711 TGSVETAVRRLESLPTGGRTPLAEGLVRAADVLRIAAIRDPRRRP-------LLVLVTDG 763 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + +G + + + Sbjct: 764 RATHGESAFSRARQAAEWIGHQGIAAVVVDCEPRRGVRL 802 >gi|298207019|ref|YP_003715198.1| hypothetical protein CA2559_02160 [Croceibacter atlanticus HTCC2559] gi|83849653|gb|EAP87521.1| hypothetical protein CA2559_02160 [Croceibacter atlanticus HTCC2559] Length = 288 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/166 (22%), Positives = 58/166 (34%), Gaps = 38/166 (22%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L M++++D+S S + FG + I + + N + GL+ F Sbjct: 72 EEERELTMVLMVDISGS--EFFGTN----QAFKKDIIIEISATLAFSATQNNDKIGLLLF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S +I P G H+ I LI T T L+Y N + Sbjct: 126 SDEIELFIPPKKGRLHVLRIIRELIEFKPQSKKTDITKALKYLSNMLK------------ 173 Query: 280 HDDYKKYII-----FLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 KK I+ F+TDG + I K+ G VY Sbjct: 174 ----KKAIVFVLSDFMTDGYEQTMKIVGKKHDVT-------GIRVY 208 >gi|291059650|gb|ADD72385.1| putative von Willebrand factor type A domain protein [Treponema pallidum subsp. pallidum str. Chicago] Length = 650 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 62/184 (33%), Gaps = 27/184 (14%) Query: 167 IGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +++ V+D S SM + + L V+ S+++ + + R S Sbjct: 428 AQIEVSFVVDNSGSMNKEKIASAREALAVSMLSLKDFGEYSDMLAAGR---RERTTIHSE 484 Query: 226 KIVQTFPL----AWGVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEK 272 ++G ++ + + T L++ I E+ Sbjct: 485 VYYFGSSFIKVKSFGKSKSKDFNSAQLIKASVNLDGRFGGTNDAEVLKH----ILADVER 540 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV--YAIGVQAEAAD 330 + K ++ +TDG +S P+ ES E +RRG ++ + IG+ + Sbjct: 541 RRARVSSDTSFVKVVLVITDGCSSYPH----ESRRTIEELRRRGVMIFGFQIGLMSPEET 596 Query: 331 QFLK 334 Sbjct: 597 ALFH 600 >gi|15639238|ref|NP_218686.1| hypothetical protein TP0246 [Treponema pallidum subsp. pallidum str. Nichols] gi|189025479|ref|YP_001933251.1| hypothetical protein TPASS_0246 [Treponema pallidum subsp. pallidum SS14] gi|14285852|sp|O83274|Y246_TREPA RecName: Full=Uncharacterized protein TP_0246 gi|3322523|gb|AAC65240.1| predicted coding region TP0246 [Treponema pallidum subsp. pallidum str. Nichols] gi|189018054|gb|ACD70672.1| hypothetical protein TPASS_0246 [Treponema pallidum subsp. pallidum SS14] Length = 597 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 62/184 (33%), Gaps = 27/184 (14%) Query: 167 IGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +++ V+D S SM + + L V+ S+++ + + R S Sbjct: 375 AQIEVSFVVDNSGSMNKEKIASAREALAVSMLSLKDFGEYSDMLAAGR---RERTTIHSE 431 Query: 226 KIVQTFPL----AWGVQHIQEKINRLIF---------GSTTKSTPGLEYAYNKIFDAKEK 272 ++G ++ + + T L++ I E+ Sbjct: 432 VYYFGSSFIKVKSFGKSKSKDFNSAQLIKASVNLDGRFGGTNDAEVLKH----ILADVER 487 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV--YAIGVQAEAAD 330 + K ++ +TDG +S P+ ES E +RRG ++ + IG+ + Sbjct: 488 RRARVSSDTSFVKVVLVITDGCSSYPH----ESRRTIEELRRRGVMIFGFQIGLMSPEET 543 Query: 331 QFLK 334 Sbjct: 544 ALFH 547 >gi|241191500|ref|YP_002968894.1| hypothetical protein Balac_1485 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241196905|ref|YP_002970460.1| hypothetical protein Balat_1485 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|240249892|gb|ACS46832.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis Bl-04] gi|240251459|gb|ACS48398.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295794492|gb|ADG34027.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis V9] Length = 671 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 71/220 (32%), Gaps = 63/220 (28%) Query: 192 LGVATRSIREMLDIIKS----IPDVNNVVRSGLVTFSS-------------------KIV 228 + ++ LD ++ I D V+ L+ ++ Sbjct: 1 MDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNYSQ 60 Query: 229 QTFPLAWGVQHIQEK---INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 LAW + +Q++ +N L G T++ GL++A ++ + +K Sbjct: 61 TVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA----------QK 110 Query: 286 YIIFLTDGENSSP----NIDNKESLFYCNEAKRRGAIVYAIGVQ-------AEAADQFLK 334 +F +DG +S ++ + K + V +IG + A++F+ Sbjct: 111 LTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTDNANKFMN 170 Query: 335 NCAS----------------PDRFYSVQNSRKLHDAFLRI 358 +S +Y+V L F I Sbjct: 171 YVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEI 210 >gi|238762416|ref|ZP_04623387.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] gi|238699401|gb|EEP92147.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] Length = 459 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 77/217 (35%), Gaps = 21/217 (9%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F + F N G+I + I+ P + +++E S KAKL ++ + L A Sbjct: 10 FNRLTQFKKNEHGAILVSFIIIFPFFIALTFIILEVSIFLQKKAKLSDAIEQATL--ALT 67 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 + N N ++ KN + + + L GF+ I NI+ +T+ Sbjct: 68 VENDGIPNAAQQTKN-------RELVLSYANAYLPSEGFSDPIINIDDNTNYLGYNAAVT 120 Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Y + + R + N + + D++ V D S SM Sbjct: 121 MTYPVEFLGRSPLTNSISNIQTTDNGEAIKNKTIEVSEPT-------DVVFVADYSGSML 173 Query: 183 DHFGPGMD-----KLGVATRSIREMLDIIKSIPDVNN 214 F + ++ + R + + IK+ +VN Sbjct: 174 LSFSDDVSIKNGERINALRSAFRILHNTIKNNSNVNT 210 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 26/127 (20%), Positives = 41/127 (32%), Gaps = 26/127 (20%) Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 E I + T + G+ A N IF H KK +I L+DG +S + Sbjct: 319 ENIIEMAPLGGTLISSGILSA-NNIFKETADNGH--------KKLMIILSDGMDSYNSTM 369 Query: 302 NKESLFY----------CNEAKRRGAIVYAIGVQAEAADQF-------LKNCASPDRFYS 344 F+ C + G + I + + K C D +Y Sbjct: 370 LPNKGFFISKTLIDEGMCEMIIKNGIQMAFIAIAYSPENNVNAPEYINWKQCVGEDNYYE 429 Query: 345 VQNSRKL 351 N+ +L Sbjct: 430 AHNAHEL 436 >gi|114587332|ref|XP_001172399.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 1 [Pan troglodytes] Length = 774 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 71/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQPG-DYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y +++ + F Sbjct: 456 QRIYEDRDATQQLQGFYS 473 >gi|254508395|ref|ZP_05120516.1| protein contAining a von Willebrand factor type A domain [Vibrio parahaemolyticus 16] gi|219548708|gb|EED25712.1| protein contAining a von Willebrand factor type A domain [Vibrio parahaemolyticus 16] Length = 696 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 43/293 (14%), Positives = 88/293 (30%), Gaps = 51/293 (17%) Query: 86 NIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF--- 142 W+ + N + + A ++ S+ +D Y + +P Sbjct: 232 TTWKANVSNHIAD---AVQEGESMQTAQASMRLDQDVVFY---WRLQDGLPGRVDMVAYR 285 Query: 143 -PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-------GMDKLGV 194 P + L T ++ + G D + VLD S SM+ + G+ KL Sbjct: 286 DPESSKRGTVKLTFTPGDDLTRVTQ-GRDWVFVLDKSGSMSGKYSTLVEGVRQGLGKLPS 344 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 R M D N +G F + + + + ++ + T Sbjct: 345 EDRFRVVMFDS-------NTYDLTG--GFVAVNQS------NISKALQAVEQVEPSNGTN 389 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 G+ A K+ D + I+ +TDG + + + ++ Sbjct: 390 LYEGMAAAIRKLDDDRPTG-------------IVLVTDGVANVGVTEKRRFFEL---MEK 433 Query: 315 RGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 ++ + A L S SV N+ + + + ++ Q Sbjct: 434 HDVRLFTFIMGNSANTPLLVPMTKLSNGVATSVSNADDIIGHLMSMTSKLTHQ 486 >gi|163782258|ref|ZP_02177256.1| hypothetical protein HG1285_05710 [Hydrogenivirga sp. 128-5-R1-1] gi|159882291|gb|EDP75797.1| hypothetical protein HG1285_05710 [Hydrogenivirga sp. 128-5-R1-1] Length = 624 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 77/216 (35%), Gaps = 41/216 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR---EMLDIIKSIPDVNNVVRSGLV 221 L +++D+S SM +K A RS+ E+L+ +K + Sbjct: 440 ERKDLAFELLIDISTSMKKE-----EKFVNALRSLLLVSEVLNKLKMPFSIK-------- 486 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 F+ + + + KI LI G T + + + Sbjct: 487 VFNENVYDLKDFEEDYRLAKAKIMELISSVGGGTDLGKAINIGLESL--------ELYIK 538 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K +I TDGE + + +E + + K++ I +GV +A Q +K Sbjct: 539 STHRKGILILFTDGEPTKG-MRGEELKSFILQMKQKFPI---VGVGVGSATQLVK----- 589 Query: 340 DRFYS-----VQNSRKLHDAFLRIGKEMVKQRILYN 370 ++ V++ KL AF I + +K+ + N Sbjct: 590 -DYFDRTGVSVEDVSKLPAAFSFIVENQLKRLLSVN 624 >gi|156370019|ref|XP_001628270.1| predicted protein [Nematostella vectensis] gi|156215242|gb|EDO36207.1| predicted protein [Nematostella vectensis] Length = 1552 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 8/124 (6%) Query: 239 HIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 +I+ I+ L T L+ + K+F K + KK + LTDG + Sbjct: 15 NIKRDIDELRLERGLTFIDKALKISAEKLFTEK------NGMRLNRKKVALVLTDGIQTK 68 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF-YSVQNSRKLHDAFL 356 K +G VY++G+ ++ L AS +++ ++ ++ +L Sbjct: 69 DKGPFTPLQKASQPMKDKGVEVYSLGIGSDIDVSELITFASGEKYVFNAKSFDELQLQVE 128 Query: 357 RIGK 360 I + Sbjct: 129 NITQ 132 >gi|171914252|ref|ZP_02929722.1| von Willebrand factor, type A [Verrucomicrobium spinosum DSM 4136] Length = 212 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 52/139 (37%), Gaps = 14/139 (10%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM FG ++++ ++ ++ ++ P ++TF S Sbjct: 3 RLPVFLLLDTSGSM---FGEPIEQV---KNGVQMLVSALRQDPYALETAFLSVITFDSNA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL + ++ L T L + K KG +I Sbjct: 57 TEKVPLT---ELTAFQVPALTAQGATSLGDALTLVADVASKQLAKTTATTKGDWKPMVFI 113 Query: 288 IFLTDGENSSPNIDNKESL 306 +TDG +P D K+ L Sbjct: 114 --MTDG---APTDDWKKGL 127 >gi|332815788|ref|XP_003309587.1| PREDICTED: collagen alpha-3(VI) chain [Pan troglodytes] Length = 2570 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 924 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 983 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 984 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1040 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1041 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1093 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1094 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1152 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|328947151|ref|YP_004364488.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328447475|gb|AEB13191.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 538 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 62/200 (31%), Gaps = 29/200 (14%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-GMDKLGVAT 196 F + + + + + G + +V D+S SM P G+ +L + Sbjct: 58 FFRFICGICVVLALAGISFGTDSVPVQKN-GKAVSLVFDISYSMEAKDAPGGISRLQAVS 116 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GST 252 E+LD + V + PL Q ++ I L Sbjct: 117 SYASELLDRMNGCA-------VSAVLAKGDGIIAVPLTEDFQSVRSLIENLSPLLMTSEG 169 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G++ A + + +I TD E + ++ + SL C Sbjct: 170 TSLGNGIKSALSSFPEQSSAAS-----------FIWLFTDCEETDNSL--QSSLSEC--- 213 Query: 313 KRRGAIVYAIGVQAEAADQF 332 + G V IG +E + Sbjct: 214 LKSGIPVVIIGFGSERESEI 233 >gi|307180277|gb|EFN68310.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Camponotus floridanus] Length = 2214 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 23/117 (19%), Positives = 42/117 (35%), Gaps = 20/117 (17%) Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 ++ N G T + L A + ++EK K + +TDG ++ Sbjct: 169 KQFGNITYSGGGTYTRGALLEALTILEKSREKA----------NKVVFLITDGFSNGG-- 216 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + K GA V+ G++ + L + ASP + L D+F Sbjct: 217 ---DPRPAADLLKNTGATVFTFGIRT-GNVEELHDIASPPGY----THSYLLDSFAE 265 >gi|240255535|ref|NP_476507.3| collagen alpha-3(VI) chain isoform 4 precursor [Homo sapiens] Length = 2570 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 924 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 983 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 984 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1040 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1041 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1093 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1094 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1152 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|219841772|gb|AAI44596.1| COL6A3 protein [Homo sapiens] gi|223462379|gb|AAI50626.1| COL6A3 protein [Homo sapiens] Length = 2570 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 924 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 983 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 984 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1040 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1041 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1093 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1094 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1152 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|219521324|gb|AAI71790.1| COL6A3 protein [Homo sapiens] Length = 2570 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 924 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 983 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 984 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 1040 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 1041 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 1093 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 1094 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 1152 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|170093379|ref|XP_001877911.1| predicted protein [Laccaria bicolor S238N-H82] gi|164647770|gb|EDR12014.1| predicted protein [Laccaria bicolor S238N-H82] Length = 360 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 64/179 (35%), Gaps = 32/179 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN-VVRSGL 220 S + +D++ + D + S + + A ++I+++ + + +++ +R GL Sbjct: 8 KSTAAKSIDIVFLQDATGSQGPY-------IRAARQAIQQICSKVSASAELSQGAIRFGL 60 Query: 221 VTFSSK-------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + F + + F +++ ++ LI A + Sbjct: 61 IAFRDHPPQDMSFVTKNFGFTAEQSVMKKNLDGLIASGGGDGPEAQTAALADALN----- 115 Query: 274 EHIAKGHDDYKKYIIFLTD------GENSSP---NIDNKESLFYCNEAKRRGAIVYAIG 323 + + K ++ +TD GE++ + D + L + RG ++ I Sbjct: 116 ---LEWAEGAAKMVVLITDAPPHGIGEDNDGFAESPDQNDPLQLARQMAERGITLFVIA 171 >gi|62088852|dbj|BAD92873.1| alpha 3 type VI collagen isoform 5 precursor variant [Homo sapiens] Length = 1702 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 102/295 (34%), Gaps = 23/295 (7%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI---NNIERSTSLSIIIDD 120 +G Q + F +R +G NI+R+ +I D Sbjct: 56 SAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDP 115 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + ++ AP + + + D++ +LD S Sbjct: 116 RLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKA-DIVFLLDGS-- 172 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQ 238 D R + E++D + D ++ ++ GLV ++S F L + Sbjct: 173 ----INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSDPTDEFFLKDFSTKR 225 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I + IN++++ + + + E ++ + +T G++ Sbjct: 226 QIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 283 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 D +L +RG V+A+GV+ +++ K ++ + V N ++L + Sbjct: 284 AQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 332 >gi|227830111|ref|YP_002831890.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|227456558|gb|ACP35245.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] Length = 356 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 72/197 (36%), Gaps = 34/197 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + ++++D S SM KL A +S +++L ++N L+ Sbjct: 32 QSSVTSSIHYIIMIDNSPSMRGE------KLNTAVQSAQKLL------YNLNEGNYVTLI 79 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FS+ +I + + ++ +A ++AK Sbjct: 80 LFSNHP---------------EIKYQGPAKGIITFDVGKGYTTRLHEAVSFTINLAKQSQ 124 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K II LTDG+ + D + Y + IG+ + ++ LK A S Sbjct: 125 VPTK-IIMLTDGKPT----DKRNVKDYEKLDIPPNTQIITIGIGNDYNERILKKLADRSS 179 Query: 340 DRFYSVQNSRKLHDAFL 356 +FY +++ +L + F Sbjct: 180 GKFYHIKDISELPNIFE 196 >gi|322433016|ref|YP_004210265.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX9] gi|321165243|gb|ADW70947.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX9] Length = 213 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 55/170 (32%), Gaps = 13/170 (7%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + D + +LD S SM G +D L RS + V ++T Sbjct: 8 TNPDPRCACIFLLDTSGSMK---GAPIDALNEGLRSFEHDIQDDALA---RRRVEIAIIT 61 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F Q P W + L G T + I K E+ A G Sbjct: 62 FGGITRQIQP--WVSAGAFQA-PVLTTGGGTPMGEAMYEGVRMI--NIRKAEYKAAGLSY 116 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 Y+ ++ +TDG + + + A+ G +A+GV D Sbjct: 117 YQPWVFLITDGTPTDEWLQAAAVVRRETAAR--GLTFFAVGVGDADMDTL 164 >gi|323528254|ref|YP_004230406.1| von Willebrand factor type A [Burkholderia sp. CCGE1001] gi|323385256|gb|ADX57346.1| von Willebrand factor type A [Burkholderia sp. CCGE1001] Length = 329 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 73/252 (28%), Gaps = 38/252 (15%) Query: 145 CANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + + + + + G +++++D S SM++ ++ + Sbjct: 58 VLAMLAIVVGLAGPGRSHREVLRTGSGAQILILMDRSASMDEPINSRGVEVSAGESKNKV 117 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST---TKSTPG 258 + R + F + + P + + I I G T+ G Sbjct: 118 ARASLTEFVTERPNDRLAFMMFGTSPLLAMPFTYDHRAIDAAIAGTAVGRGMPDTQLDLG 177 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L A + D H ++ I+ ++DG +D + R Sbjct: 178 LLTAIGEFNDQ----------HSSGRRAIVLVSDG---GAKLDARVRQLVEEGLLRNQIA 224 Query: 319 VYAI----------------GVQAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIG 359 +Y I + A + + S P R + N++ + DA I Sbjct: 225 LYFIYLRSSIYSPDLNARAPASETSAEAELHRYFLSLKTPYRLFQTGNAKAMRDAMAEIN 284 Query: 360 KEMVKQRILYNK 371 ++ + Sbjct: 285 RQQNALTTFVER 296 >gi|15890597|ref|NP_356269.1| nitric oxide reductase NorD protein [Agrobacterium tumefaciens str. C58] gi|15158850|gb|AAK89054.1| nitric oxide reductase NorD protein [Agrobacterium tumefaciens str. C58] Length = 632 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 72/171 (42%), Gaps = 25/171 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV-----RSGLVT 222 L + +++DVSLS + F L V ++ + + + D+++++ R V Sbjct: 442 DLAVTILVDVSLSTDAWFDNR-RVLDVEKEALLVLAHGLSACGDIHSILTFTSRRRSWVR 500 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P+ I+ +I L G T+ P + +A K+ + E+ Sbjct: 501 VETVKDFDEPMG---HTIERRIAALKPGFYTRIGPAIRHASAKLHERPERR--------- 548 Query: 283 YKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 K ++ LTDG+ + ++S +EA+R G V+ + V ++A Sbjct: 549 --KLLLLLTDGKPNDVDHYEGRFAIEDSRRAVSEARRSGVSVFGVTVDSKA 597 >gi|227827418|ref|YP_002829197.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|229584633|ref|YP_002843134.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|238619574|ref|YP_002914399.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|227459213|gb|ACP37899.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228019682|gb|ACP55089.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|238380643|gb|ACR41731.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 356 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 72/197 (36%), Gaps = 34/197 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + ++++D S SM KL A +S +++L ++N L+ Sbjct: 32 QSSVTSSIHYIIMIDNSPSMRGE------KLNTAVQSAQKLL------YNLNEGNYVTLI 79 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FS+ +I + + ++ +A ++AK Sbjct: 80 LFSNHP---------------EIKYQGPAKGIITFDVGKGYTTRLHEAVNFTINLAKQSQ 124 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K II LTDG+ + D + Y + IG+ + ++ LK A S Sbjct: 125 VPTK-IIMLTDGKPT----DKRNVKDYEKLDIPPNTQIITIGIGNDYNERILKKLADRSS 179 Query: 340 DRFYSVQNSRKLHDAFL 356 +FY +++ +L + F Sbjct: 180 GKFYHIKDISELPNIFE 196 >gi|153806289|ref|ZP_01958957.1| hypothetical protein BACCAC_00545 [Bacteroides caccae ATCC 43185] gi|149130966|gb|EDM22172.1| hypothetical protein BACCAC_00545 [Bacteroides caccae ATCC 43185] Length = 289 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTVKQLKKDMITEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESHRTNIRLALEYLTNVMK 173 >gi|126733209|ref|ZP_01748956.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] gi|126716075|gb|EBA12939.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] Length = 632 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 43/264 (16%), Positives = 85/264 (32%), Gaps = 56/264 (21%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 IR F + G + +LT +LL + +V G+ ++ + KL + D ++L A Sbjct: 58 IRQFRNDEDGGLIVLTLLLLISMLVVGGMAVDFMRFESERTKLQSVADRAVLAAA----- 112 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 N N ++ + + GF I + SI + Sbjct: 113 --NLNQEREAADVITD-------------FFTAEGFGGSIVG-----TPSIQKNRNGSTI 152 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 L ++ + F ++ AP T+ + +++ +VLD+S SM Sbjct: 153 RLESIVDVD---TFYLRLVGIDTLSAPANATAI-----EGTGNVEVSLVLDISGSMGSRM 204 Query: 186 ------------------GPGMDKLGVATRSIREMLDIIKSI-----PDVNNVVRSGLVT 222 G + + R + + K I + + V LV Sbjct: 205 TGDAYLYDSDGEIRLDPDGNPLTERRTEDRMFFLIQEANKFIGDLLKDEYRDRVSINLVA 264 Query: 223 FSSKIVQTFPLAWGVQHIQEKINR 246 +S + L + + I+ Sbjct: 265 YSQHVRLGDDLYTALNTTPDSIDE 288 >gi|126661250|ref|ZP_01732323.1| Mg chelatase subunit; ChlD [Cyanothece sp. CCY0110] gi|126617479|gb|EAZ88275.1| Mg chelatase subunit; ChlD [Cyanothece sp. CCY0110] Length = 672 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 68/219 (31%), Gaps = 45/219 (20%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 471 ARKAGALIVFVVDASGSMA------LNRMQSAKGAVMRLLTEA-----YENRDQVALIPF 519 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + ++++ L G + GL A + +AK D Sbjct: 520 RGEQADVLLPPTRSISLARKRLETLPCGGGSPLAHGLTQAVHVGMNAK-------MSGDI 572 Query: 283 YKKYIIFLTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI-----G 323 + I+ +TDG + P +E L + + G + I Sbjct: 573 GQVVIVAITDGRGNIPLAKSLGEPIPEGEKPDIKQELLDIAKKIRGLGIKLLMIDTEKKF 632 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 V + + + +Y + + I + Sbjct: 633 VSTGFGKELAQT--AGGTYYQLPRATD-----QAIAQMA 664 >gi|34525892|emb|CAE46626.1| trombospondin-related protein [Plasmodium falciparum] Length = 331 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 69/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + L FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNENAIHLYLNIFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + I+ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENASQLIVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + +G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVVGIGQGINVAFNRFLVGCHPSD 209 >gi|226327519|ref|ZP_03803037.1| hypothetical protein PROPEN_01390 [Proteus penneri ATCC 35198] gi|225204045|gb|EEG86399.1| hypothetical protein PROPEN_01390 [Proteus penneri ATCC 35198] Length = 505 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 30/226 (13%), Positives = 66/226 (29%), Gaps = 40/226 (17%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 C N + + + + ++++ D S SM L S ++ Sbjct: 282 CENRLANKIKQMCPAERPPE--LAPQVILIFDASGSMALSMNLTESDLDY-IASTGQLFP 338 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLA------------------WGVQHIQEKINR 246 + P R+ + + I + + I R Sbjct: 339 GYDAEPRRITTARNAAIKIINNIPSDMKITTVVASDCGVVKSSPAYGGNERSKLLNYIKR 398 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + S T ++ A N I I+ L+DG S +++ Sbjct: 399 IEPDSGTPLAESIKRASNLIKGNNRDT------------IIVLLSDGLESC----DQDPC 442 Query: 307 FYCNEAKRR--GAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 KR A++ + + A + N A+ + ++ +N+ + Sbjct: 443 SAARTLKRAHPRAVINVVDILGTGAGNCVAN-ATGGKVFTARNANE 487 >gi|59714087|ref|YP_206862.1| transporter [Vibrio fischeri ES114] gi|59482335|gb|AAW87974.1| transporter [Vibrio fischeri ES114] Length = 591 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 55/169 (32%), Gaps = 18/169 (10%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD---MMMVLDVSLSMNDHFGPGMD 190 ++P N L+ S G D +++VLD S SM + Sbjct: 71 QLPLKLLFIVIFLNI----LICAGPTWQKQASPFGEDKAPLLIVLDTSNSMLEKDVLPNR 126 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 + + I+ + + + ++GL+ +S PL + + Sbjct: 127 LIRAKQK--------IQDLIALRDGGKTGLIVYSGTAHIAMPLTQDSAVFSPYLAAIEPE 178 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + AY + K++ +++ +I +TDG + N Sbjct: 179 IMPVEG---KSAYKTLPLIKQQFSALSQTTPSLGGTVILITDGVTTLDN 224 >gi|307329476|ref|ZP_07608637.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306884885|gb|EFN15910.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 733 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 51/160 (31%), Gaps = 21/160 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDV 212 +V+ + + +VLD S SM +F G + L ++ D ++P Sbjct: 517 AAGEAVQAHGLAGQRAVVYLVLDRSGSMRGYFKDGTVQHLAEQALALSAQFDATGTVP-- 574 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +V FS+ I T L +I L A N + Sbjct: 575 -------VVFFSTDIDGTADLELDQHT--GRIEELHGSLGHMGRTNYHTAVNAVI----- 620 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 EH ++IF TDG ++ K C A Sbjct: 621 -EHYKASGTTAPAFVIFQTDGAPTAKTAAEK---ALCEAA 656 >gi|12957162|dbj|BAB32650.1| complement factor B/C2-A3 [Cyprinus carpio] Length = 754 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 79/222 (35%), Gaps = 29/222 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI LD+ + LDVS S+++ A I+ +++ I N Sbjct: 241 KIRVGKGGKLDIYIALDVSDSIDEE------DFEKAKGVIKTLIEKISYYEVSPNYE--- 291 Query: 220 LVTFSSKIVQTFPL-------AWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFDAKE 271 ++ F++ + + + + I +++ + S T + AY I ++ + Sbjct: 292 ILIFATDVARIVSMRDFKSAQKNNLLEILKRLKDYEYNSKGDRTGTNIAQAYRSILESMQ 351 Query: 272 KLEHIAKGHDDYKKYIIFL-TDGENSSPNIDNKESLFYCNEAKRR-------GAIVYAIG 323 + K ++I+ + TDG+ + + K+ +Y G Sbjct: 352 IEQMTNKEEFKTTQHIVIMFTDGQANMGGNPRPWVDQIKDLVKKNSPSEEEENLDLYVFG 411 Query: 324 ----VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 V AE + + + F+ ++N L + F + E Sbjct: 412 MGDDVNAEDINDLKTDRGNEKFFFKLKNLEDLQETFDSMIDE 453 >gi|291411005|ref|XP_002721795.1| PREDICTED: integrin alpha M [Oryctolagus cuniculus] Length = 1155 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 90/230 (39%), Gaps = 26/230 (11%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F + +N +P + +++ + + D+ ++D S S++ M + Sbjct: 124 FLFGSNLLQSPQRVPETLRGCPQQES--DIAFLIDGSGSIDSTDFQRMKEFVSTVMEQFT 181 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 + + ++ + R+ TFS P + + + I +L+ T + G+ Sbjct: 182 KSNSLFALMQYSEEFRT-HFTFSDFKRNPNP-----RALVKPIRQLL--GRTHTATGILK 233 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 ++F + A+ K ++ +TDGE ++ ++ + A+R G I Y Sbjct: 234 VVTELFHSSSGARANAR------KVLVVITDGEKFGDTLEYEDVIP---RAEREGVIRYV 284 Query: 322 IG----VQAEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 +G +E + Q L AS + + V N L+ ++ +++ Sbjct: 285 VGVGDAFNSEQSRQELNTIASKPSREHVFRVNNFEALNTIRNQLQEKIFA 334 >gi|221042196|dbj|BAH12775.1| unnamed protein product [Homo sapiens] Length = 645 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQPG-DYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 456 QRIYEDHDATQQLQGFYS 473 >gi|156346879|ref|XP_001621559.1| hypothetical protein NEMVEDRAFT_v1g221836 [Nematostella vectensis] gi|156207635|gb|EDO29459.1| predicted protein [Nematostella vectensis] Length = 377 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 50/134 (37%), Gaps = 21/134 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 LD+ ++LDVS+SM + + S+++++ GL+TF Sbjct: 189 DCQEQLDIAILLDVSMSMEWGLSQAQNFTSLVIGSLKDIISE--------GGTHVGLITF 240 Query: 224 SSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +++ L + + I++++ T + L+ A F + Sbjct: 241 ANEAEIVIGLDDIRSRDWTAV---IHKVMLSGNTYANKALKLAETTFF----TEAKGMRP 293 Query: 280 HDDYKKYIIFLTDG 293 K +I L+DG Sbjct: 294 GSA--KVVITLSDG 305 >gi|84499825|ref|ZP_00998113.1| hypothetical protein OB2597_07840 [Oceanicola batsensis HTCC2597] gi|84392969|gb|EAQ05180.1| hypothetical protein OB2597_07840 [Oceanicola batsensis HTCC2597] Length = 244 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 71/223 (31%), Gaps = 47/223 (21%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPG--MDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++ S D M+V D S SM + + ++ A +IR ++P V V R G Sbjct: 24 NAGSGCATDAMLVFDGSGSMVEFGYDPRQVTRIREAREAIRH------AMPLVAPVRRIG 77 Query: 220 LVTFS-------SKIVQTFPLAWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 L+ + S I FP + ++ L G T + A Sbjct: 78 LLIYGPNDGDSCSGIDLRFPPRPDAADPVIRAVDALSPGGLTPLA-------RSVGVAAR 130 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ------ 325 L+H K I+ +TDG + EA+ ++ IG + Sbjct: 131 VLDHREKAG-----IIVVVTDGNETCGGRPCATGAALAAEARD--LTIHVIGFRALVDYW 183 Query: 326 --------AEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLR 357 A + + C + + + +L +A Sbjct: 184 TWDNPEQEAHVGEDTVARCLAEKTGGMYVRTETVGELVEALQA 226 >gi|113969744|ref|YP_733537.1| TPR repeat-containing protein [Shewanella sp. MR-4] gi|113884428|gb|ABI38480.1| Tetratricopeptide TPR_2 repeat protein [Shewanella sp. MR-4] Length = 681 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 50/164 (30%), Gaps = 28/164 (17%) Query: 136 PFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P L ++ + + + +V+D+S+SM ++L Sbjct: 57 PLHILAFTWVIATLALAGPALNKQTLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG--- 250 ++L +K +GL+ F+ PL + + L Sbjct: 113 QTKFRATDLLRGLKEGE-------TGLIAFAGDAFTISPLTRDTGTLLNLLPTLSPDIMP 165 Query: 251 -STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + L A N + II +TDG Sbjct: 166 VLGSNLAAALTQAKNLLAQGGHLRGD-----------IIVMTDG 198 >gi|114046973|ref|YP_737523.1| TPR repeat-containing protein [Shewanella sp. MR-7] gi|113888415|gb|ABI42466.1| Tetratricopeptide TPR_2 repeat protein [Shewanella sp. MR-7] Length = 687 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 50/164 (30%), Gaps = 28/164 (17%) Query: 136 PFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P L ++ + + + +V+D+S+SM ++L Sbjct: 57 PLHILAFTWVIATLALAGPALNKQTLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG--- 250 ++L +K +GL+ F+ PL + + L Sbjct: 113 QTKFRATDLLRGLKEGE-------TGLIAFAGDAFTISPLTRDTGTLLNLLPTLSPDIMP 165 Query: 251 -STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + L A N + II +TDG Sbjct: 166 VLGSNLAAALTQAKNLLAQGGHLRGD-----------IIVMTDG 198 >gi|109899476|ref|YP_662731.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Pseudoalteromonas atlantica T6c] gi|109701757|gb|ABG41677.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Pseudoalteromonas atlantica T6c] Length = 1364 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 60/186 (32%), Gaps = 42/186 (22%) Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINR----------------LIFGSTTKSTPGLEYAYN 264 V + + + LA G + + R L G + Sbjct: 506 VMYYGGLDVNYGLARGNNSVSNTVRRNTRVSHRLSYSGQDATLPSGCEEDNLSSSNCITQ 565 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYC-------------- 309 +I + I+ +I+ L+DGE N++ ++D ESL Sbjct: 566 QIPQGARYISPISDRQCQVNNHIVLLSDGEANNNHSVDEIESLLSASCTGSGGEKCGLSL 625 Query: 310 --NEAKRRG------AIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 N A G I + IG A A+ FL A FY NS++L AF I Sbjct: 626 VRNVADSEGSVIDSRIITHTIGFAANTEANSFLNQIALQGGGGFYQADNSQELLGAFQSI 685 Query: 359 GKEMVK 364 K + Sbjct: 686 LKTVKD 691 >gi|239931228|ref|ZP_04688181.1| hypothetical protein SghaA1_23611 [Streptomyces ghanaensis ATCC 14672] Length = 448 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 50/170 (29%), Gaps = 22/170 (12%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND---HFGPGMDKLGVATRSIRE 201 + + S +++++D S SM+ D A ++R+ Sbjct: 37 ATGGGTIGSAVAAPHLYSPGRGPSAAVVLMVDCSGSMDYPPTKMRNARDATAAAIDALRD 96 Query: 202 MLD-IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 + + V V G A ++ + RL G T L Sbjct: 97 GVHFAVVGGTHVAKEVYPGA------GRLAVADATTRDQAKQALRRLSAGGGTAIGTWLR 150 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYC 309 A + A + H I LTDG N + D + +L C Sbjct: 151 LADRLLASADVAIRHG-----------ILLTDGRNEHESPEDLRAALDAC 189 >gi|225012024|ref|ZP_03702461.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003579|gb|EEG41552.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 288 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 37/175 (21%), Positives = 59/175 (33%), Gaps = 28/175 (16%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L +M+V+DVS S + FG R I + + Sbjct: 58 NVTARYNEPFVKVFEEERELTLMLVVDVSGS--EKFGTK----TQFKREILTEIAATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + N + GLV FS +I P G HI I L+ + + A + Sbjct: 112 SALQNNDKVGLVLFSDQIELFIPPKKGRSHILRIIRELLEFKPKSTETNISAALEFL--- 168 Query: 270 KEKLEHIAKGHDDYKKYIIF-LTDGENSSPNIDNKESLFYCNEAKR---RGAIVY 320 KK I+F L+D +S + + AK+ G VY Sbjct: 169 ---------SGVLKKKAIVFILSDFMDSGYEKTLRIT------AKKHDLTGIRVY 208 >gi|291439602|ref|ZP_06578992.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] gi|291342497|gb|EFE69453.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] Length = 455 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 50/170 (29%), Gaps = 22/170 (12%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND---HFGPGMDKLGVATRSIRE 201 + + S +++++D S SM+ D A ++R+ Sbjct: 44 ATGGGTIGSAVAAPHLYSPGRGPSAAVVLMVDCSGSMDYPPTKMRNARDATAAAIDALRD 103 Query: 202 MLD-IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 + + V V G A ++ + RL G T L Sbjct: 104 GVHFAVVGGTHVAKEVYPGA------GRLAVADATTRDQAKQALRRLSAGGGTAIGTWLR 157 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYC 309 A + A + H I LTDG N + D + +L C Sbjct: 158 LADRLLASADVAIRHG-----------ILLTDGRNEHESPEDLRAALDAC 196 >gi|54302287|ref|YP_132280.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] gi|46915709|emb|CAG22480.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] Length = 436 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 28/67 (41%), Gaps = 4/67 (5%) Query: 6 IRNF----FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 IR + KG ++I + + V+ L ++ + K KL ++D + L A Sbjct: 3 IRRINASPYRAQKGVVAIFATLAMVVLIGAGALALDVGNLILSKGKLQNLVDSAALSAAK 62 Query: 62 KILNQEN 68 I + + Sbjct: 63 AIDSGSD 69 >gi|327285304|ref|XP_003227374.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Anolis carolinensis] Length = 3587 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 61/186 (32%), Gaps = 44/186 (23%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V+ + L+++ ++D S S+ G R + ++L +P R Sbjct: 66 VRRLREGSARLELVFLVDESSSV------GQANFRSELRFVTKLLSDFPVVP---TATRV 116 Query: 219 GLVTFSS------KIVQTFPLAWGVQH-------------IQEKINRLIF-GSTTKSTPG 258 +VTFSS ++ P + G + +I + + G T + Sbjct: 117 AIVTFSSKNHVLSRVDYISPPSPGRTQPQQQRQQQHKCALLGREIPGIGYRGGGTYTKGA 176 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + A + ++ K I +TDG ++ + + G Sbjct: 177 FQQAAQILLHSRANA----------TKVIFLITDGYSNGG-----DPRPIAASLREFGVE 221 Query: 319 VYAIGV 324 ++ G+ Sbjct: 222 IFTFGI 227 >gi|156408868|ref|XP_001642078.1| predicted protein [Nematostella vectensis] gi|156229219|gb|EDO50015.1| predicted protein [Nematostella vectensis] Length = 257 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 53/142 (37%), Gaps = 14/142 (9%) Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +TFS++ +F A EK+N L T + L A F+ + Sbjct: 100 ITFSTEAEISFRYA-NRSQAIEKLNDLPYMACKTNTQLALNLAEMIFFN------NTLGP 152 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCA 337 ++ I+ TDG++ N+ + +L+ K+RG +Y + V + + Sbjct: 153 LRPGRRRILIFTDGQS---NVKEQMTLYRAFRLKKRGVEIYVVAVGKYLYGMHEIIGLAT 209 Query: 338 SPDRF-YSVQNSRKLHDAFLRI 358 S Y V++ + I Sbjct: 210 SSSHHLYRVRSMKDFVKIVQLI 231 >gi|156363275|ref|XP_001625971.1| predicted protein [Nematostella vectensis] gi|156212829|gb|EDO33871.1| predicted protein [Nematostella vectensis] Length = 331 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 50/129 (38%), Gaps = 21/129 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+ ++LDVS+SM + + S+++++ GL+TF+++ Sbjct: 148 LDIAILLDVSMSMEWGLSQAQNFTSLVIGSLKDIISE--------GGTHVGLITFANEAE 199 Query: 229 QTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + + I++++ T + L+ A F + Sbjct: 200 IVIGLDDIRSRDWTAV---IHKVMLSGNTYANKALKLAETTFF----TEAKGMRPGSA-- 250 Query: 285 KYIIFLTDG 293 K +I L+DG Sbjct: 251 KVVITLSDG 259 >gi|86360582|ref|YP_472470.1| hypothetical protein RHE_PE00308 [Rhizobium etli CFN 42] gi|86284684|gb|ABC93743.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 533 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 58/196 (29%), Gaps = 32/196 (16%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 F+ + +G + LT I +P++ LVI+ + L +D L A ++ ++ Sbjct: 8 RFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGARELDGRD 67 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + N +G ++ ++S++ + + Sbjct: 68 DAITR------------AQTAIEKIANSAAFSGGGT-GMSLGSNSSVTYEAGNDAGN--- 111 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 P+ + S++ V+ +M F Sbjct: 112 --------TVTVLFLKSIPADDDTPIPASMETTEPSEA----SYAWVIAKPQAMQTIFPI 159 Query: 188 GM----DKLGVATRSI 199 + D + +A ++ Sbjct: 160 PVGFNRDTINIAADAV 175 >gi|332978757|gb|EGK15449.1| hypothetical protein HMPREF9373_0075 [Psychrobacter sp. 1501(2011)] Length = 2843 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 33/151 (21%), Positives = 59/151 (39%), Gaps = 18/151 (11%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH--FGPGMDKLGVATRSIREMLDIIKS 208 P S V I++ + ++ ++LD S SM+ G G +L +A +I ++LD Sbjct: 2567 RPAQSESDVDIAAATSDS-NITIILDTSGSMSVEVASGGGETRLDLAKAAIVKLLDGYDV 2625 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 D V+ L+TF+ + W + +++ + + T L+ A Sbjct: 2626 AGD----VKVQLITFAKEAQAQPQ--WMTIDELKDLLPGINARGGTNYDAALDVAMTSYD 2679 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 +A Y FLTDGE + Sbjct: 2680 NAGGIFGANN--------YSYFLTDGEPTFG 2702 >gi|261212660|ref|ZP_05926944.1| TPR domain protein in aerotolerance operon [Vibrio sp. RC341] gi|260837725|gb|EEX64402.1| TPR domain protein in aerotolerance operon [Vibrio sp. RC341] Length = 628 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 51/159 (32%), Gaps = 26/159 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W S + S + + +++D+S SM P T+ Sbjct: 55 LLGVGWMIAVLALAGPSWQSAERPSVQNSAARV-LIMDMSQSMYATDLAP-----NRLTQ 108 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTT 253 + + LD++K + + +GLV +++ PL + I L + Sbjct: 109 ARYKALDLLKGWHEGS----TGLVAYAADAYVVSPLTRDSATLGNLIPSLSPEIMPYQGA 164 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + A + +I LTD Sbjct: 165 NAANAVGLAISMLQQAGHQKGD-----------LILLTD 192 >gi|255693882|ref|ZP_05417557.1| von Willebrand factor, type A [Bacteroides finegoldii DSM 17565] gi|260620311|gb|EEX43182.1| von Willebrand factor, type A [Bacteroides finegoldii DSM 17565] Length = 289 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTNIRLALEYLTNVMK 173 >gi|167526046|ref|XP_001747357.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774192|gb|EDQ87824.1| predicted protein [Monosiga brevicollis MX1] Length = 889 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 65/191 (34%), Gaps = 26/191 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LD S S+ G + S+ ++L + V++ G +F ++ Sbjct: 557 IVLALDDSTSVG---YAGFQRQVTWASSLAQLLTF------DSGVIQMGAFSFDDRVQPI 607 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ ++ L T L A + + D+++ ++ L Sbjct: 608 SRFT----SVEVLVDTLQSAKWTGGASSLAEA---LSFTGARYFQDLSLPADHRRILVIL 660 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-----AEAADQFLKNCASPDRFY-- 343 DG + ++ G ++AI VQ + Q + +S ++ Sbjct: 661 LDGAADDTVAQVAR---QATQLRQMGVTIFAIAVQDSNDVQQQEAQLMAAVSSTVEYHLM 717 Query: 344 SVQNSRKLHDA 354 V N L DA Sbjct: 718 RVPNMEALMDA 728 >gi|148250139|gb|ABQ53165.1| MtsD [Myxococcus fulvus HW-1] Length = 659 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 56/164 (34%), Gaps = 29/164 (17%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPG-------MDKLGVA------------TRSI 199 + + S + +++V+D S SM PG + + R++ Sbjct: 49 TRPPNPSGFPVKVVVVIDESGSMCVSDPPGAQLDSGFCQRREILDIIPEGVTEPARVRAL 108 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG-------VQHIQEKINRLI--FG 250 + ++ + + V+ + F + + +P A +I I L G Sbjct: 109 KRLVQQFREVNAQGGNVQVSVAPFETNVRNVWPPATTGDRFARPDNNIDSYIEGLQSQLG 168 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY-IIFLTDG 293 T L YAY+ I + +Y ++FLTDG Sbjct: 169 KGTDYQGALSYAYSFISLDINAVAQSNPELLPRTRYVVVFLTDG 212 >gi|308509926|ref|XP_003117146.1| hypothetical protein CRE_01642 [Caenorhabditis remanei] gi|308242060|gb|EFO86012.1| hypothetical protein CRE_01642 [Caenorhabditis remanei] Length = 713 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 51/388 (13%), Positives = 120/388 (30%), Gaps = 56/388 (14%) Query: 8 NFFYNCKGSISILTAI-LLPVIFIVMGLV--------------IETSHKFFV------KA 46 F N + + A ++ + + I + + F + K Sbjct: 195 KFVGNYNMNFHMQDAAPIVQELQGATAMANCFCRDGWSQYEWPINSGNLFGICVGNLNKN 254 Query: 47 KLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDIN 106 + ++ T N + K F+Y +I N + +Q+ + Sbjct: 255 EYQM---NAQKSCHTAAKNGYLVSEFDVNKRSFNYDVITNKGAKTVASY-YNGLISQNGS 310 Query: 107 NI--ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS-S 163 + + +D + + ++ + + W S L + S S Sbjct: 311 WYWDQPDGKPLLPLDPSSGSVSAKSGCVADVKYSDGSIAWTPISCGNRLPYLLPLSQSVS 370 Query: 164 KSDIG-------LDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 G LD+++V+D S +M +D+ + + I +S Sbjct: 371 DRQCGNDTNFIWLDIILVIDNSPTMISDNVYESISSIFGPNSQIGTGYTDPRSS------ 424 Query: 216 VRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R +VT+++ ++ ++ ++ L G + ++ + + + Sbjct: 425 -RVAIVTYNNYATTVADFRTIKSLKQLKTQLTALDQAGN----SGNQSYLDQGLISAQSI 479 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQF 332 ++YKK ++ T SS + + K G V + + ++ Sbjct: 480 LTKFDDRENYKKVVLIFT----SSFDFIDSRPDRLAETMKSNGVTVITVNTGKDKTVEKQ 535 Query: 333 LKNCASPDRFY--SVQNSRKLHDAFLRI 358 L+ AS + S + L +A I Sbjct: 536 LEGVASAGFAFKMSTNTTETLQNALTTI 563 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 56/167 (33%), Gaps = 12/167 (7%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 ++ LD+++V+D + M MD + +++ I + R +VT++ Sbjct: 32 NLWLDIVLVIDNTRIMQ------MDGVYDTIQAMFGQSVRIGAGHPDPRSTRVAIVTYNE 85 Query: 226 KIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + +++ +L + S + + A Y Sbjct: 86 VAKVEAGFDEFKSLTALNQELEKLNATQKSDSFDAFMDLGLSAANNLITAANRANDRKQY 145 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 KK I+ T S+ + N+ ++ G + + A + Sbjct: 146 KKLIVLFT----SNYSFKNQRPDVLALSIRQSGIDISTVYTGAGTSG 188 >gi|88601588|ref|YP_501766.1| magnesium chelatase, ChlI subunit [Methanospirillum hungatei JF-1] gi|88187050|gb|ABD40047.1| protoporphyrin IX magnesium-chelatase [Methanospirillum hungatei JF-1] Length = 619 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 68/226 (30%), Gaps = 20/226 (8%) Query: 78 DFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPF 137 F+ +I +F + + + G I S + K Y+++ + + Sbjct: 346 QFTKDLIVQKKNKNF-HSMVKTGKKGSHQKISDSGRYFRSKNPSGKIYDIAFDATFR-AA 403 Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + + + K G ++ V+D S SM ++ Sbjct: 404 APHQITRSNGTLALNISVQDIRVKERKRKSGRTIIFVVDSSGSMG-----AAKRMSAVKG 458 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 ++ +L N + L++F ++ L G T + Sbjct: 459 AVLSLLK-----DAYINRDQVALISFRGPGAEVLLKPTRSGMTAYHQLAHLPTGGQTPLS 513 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 G+ + I + K H ++I ++DG + DN Sbjct: 514 SGIYTTVSLIRTIRRKNSHDEP-------FVIIISDGRANHARSDN 552 >gi|156400007|ref|XP_001638792.1| predicted protein [Nematostella vectensis] gi|156225915|gb|EDO46729.1| predicted protein [Nematostella vectensis] Length = 974 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 51/142 (35%), Gaps = 19/142 (13%) Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTK----STPGLE 260 I++++ R G V +S + + F I ++N + + + LE Sbjct: 100 IVRNLDMPKTGARVGTVIYSKRSIVLFDFKDNKTDIILQLNTIKYREKPNRALSTGQALE 159 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A ++F + KK I +T GE S + + K G I+Y Sbjct: 160 LAQKRLFQNARRNS---------KKIAILVT-GEKSQ-----DDVIIPSKLMKDSGVIIY 204 Query: 321 AIGVQAEAADQFLKNCASPDRF 342 +GV L++ AS + Sbjct: 205 TVGVGEGYFLPQLESIASSPSY 226 >gi|149026148|gb|EDL82391.1| chloride channel, calcium activated, family member 2 (predicted), isoform CRA_b [Rattus norvegicus] Length = 674 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 71/205 (34%), Gaps = 38/205 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +V+DVS M + D+L ++ L + + + V G+VTF SK Sbjct: 312 VCLVIDVSRKMAEG-----DRLLRLQQAAELFLMQV---VEAHTFV--GIVTFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL + + + + T G++ + + + + Sbjct: 362 APLQQINSDDDRKLLVSYLPTTVSTEAETNICAGVKKGFEVVEKQNGRADGS-------- 413 Query: 285 KYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDR 341 +I +T G + + C A G+ ++++ + + AA + L + Sbjct: 414 -VMILVTSGADEH--------ITNCLLTAMSSGSTIHSMALGSSAARRLGELSHVTGGLE 464 Query: 342 FYSVQN--SRKLHDAFLRIGKEMVK 364 F+ S ++ +AF I Sbjct: 465 FFIPDPFTSSRMTEAFSGISSGAGD 489 >gi|157818603|ref|NP_001100920.1| calcium-activated chloride channel regulator 2 [Rattus norvegicus] gi|149026147|gb|EDL82390.1| chloride channel, calcium activated, family member 2 (predicted), isoform CRA_a [Rattus norvegicus] Length = 942 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 71/205 (34%), Gaps = 38/205 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +V+DVS M + D+L ++ L + + + V G+VTF SK Sbjct: 312 VCLVIDVSRKMAEG-----DRLLRLQQAAELFLMQV---VEAHTFV--GIVTFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PL + + + + T G++ + + + + Sbjct: 362 APLQQINSDDDRKLLVSYLPTTVSTEAETNICAGVKKGFEVVEKQNGRADGS-------- 413 Query: 285 KYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDR 341 +I +T G + + C A G+ ++++ + + AA + L + Sbjct: 414 -VMILVTSGADEH--------ITNCLLTAMSSGSTIHSMALGSSAARRLGELSHVTGGLE 464 Query: 342 FYSVQN--SRKLHDAFLRIGKEMVK 364 F+ S ++ +AF I Sbjct: 465 FFIPDPFTSSRMTEAFSGISSGAGD 489 >gi|319952787|ref|YP_004164054.1| hypothetical protein Celal_1241 [Cellulophaga algicola DSM 14237] gi|319421447|gb|ADV48556.1| protein of unknown function DUF58 [Cellulophaga algicola DSM 14237] Length = 288 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/125 (20%), Positives = 42/125 (33%), Gaps = 10/125 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L MM+V+DVS S + FG I + + Sbjct: 58 NVTARYNEPYIKVFEEERELTMMLVVDVSGS--ELFGSS----NQFKNEIITEISATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNK 265 + N + GL+ FS +I P G H+ I LI + T L+Y Sbjct: 112 SALQNNDKIGLILFSDQIELFIPPKKGKSHVLRIIRELIEFTPKSNKTDLAVALKYLTRV 171 Query: 266 IFDAK 270 + Sbjct: 172 MKKKA 176 >gi|298291152|ref|YP_003693091.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296927663|gb|ADH88472.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 636 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 84/208 (40%), Gaps = 16/208 (7%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + L + +++DVSLS + G L V ++ + + + D + Sbjct: 438 TAARALERDLSLAVLMDVSLSTDAWIGAH-RVLDVEKGALLALTHGLTACGDEH-----A 491 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + TF+S+ + ++ V+ +E +N I +PG Y ++ A + Sbjct: 492 IYTFTSRRRTSVTVST-VKGFEEPLNPAILRRIMALSPG---QYTRMGAAVRHVAKALAE 547 Query: 280 HDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + ++ LTDG+ + +++ EA+++G V+ + V EA D F Sbjct: 548 RPQRHRLLLLLTDGKPNDIDYYEGRYGIEDTRAAIREARKQGLTVFGVTVDGEARDYF-P 606 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + V ++ +L A I +++ Sbjct: 607 YLFGKGGYAIVPDAERLPTALPAIYRQL 634 >gi|218261909|ref|ZP_03476582.1| hypothetical protein PRABACTJOHN_02253 [Parabacteroides johnsonii DSM 18315] gi|218223703|gb|EEC96353.1| hypothetical protein PRABACTJOHN_02253 [Parabacteroides johnsonii DSM 18315] Length = 285 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 45/109 (41%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S + + V + + + + + N + G+V F Sbjct: 72 EEERELTVMLLIDVSGSRDF------GSVNVMKKEVITEIAATLAFSAIQNNDKIGVVFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S KI + P G +H I+E I+ T L+Y N I Sbjct: 126 SDKIEKFIPPQKGKKHILYIIRELIDFQPDKKQTNIAQALKYLTNAIKK 174 >gi|149202861|ref|ZP_01879832.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] gi|149143407|gb|EDM31443.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] Length = 212 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 72/212 (33%), Gaps = 46/212 (21%) Query: 172 MMVLDVSLSMND--HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF------ 223 M+V D S SM + H ++ A ++R + +PDV R GL+++ Sbjct: 1 MLVFDGSGSMAEVGHDPTAATRIIEARAALR------RVMPDVAPYRRIGLLSYGAGGSH 54 Query: 224 --SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S P+ + ++ L+ G T + A E LE+ + Sbjct: 55 PCSGITRHFAPMPDAAAAVVVAVDALMPGGLTPIAA-------SVAAAAEVLEYRTQPG- 106 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-------------EA 328 ++ +TDG + +L A+ R V+ IG + Sbjct: 107 ----IVVLVTDGNETCGG--TPCALGAALTAEARDLTVHVIGFRVVHDPFSWNSPEAQGY 160 Query: 329 ADQFLKNC---ASPDRFYSVQNSRKLHDAFLR 357 Q + C A+ F S + +L +A Sbjct: 161 DGQTVAKCLADATGGVFVSTETVDELVEALRE 192 >gi|159900457|ref|YP_001546704.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893496|gb|ABX06576.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 831 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 53/184 (28%), Gaps = 50/184 (27%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGP----GMDK-------LGVATRSIREMLDII---- 206 S + D+ +V+D+S SM + +D + +MLD+ Sbjct: 370 PSINTTSYDISLVVDISGSMQWCYDSQRTCSVDANARWYRVKDFLAKFSYKMLDVWNAPA 429 Query: 207 ------KSIPDVNNVV------RSGLVTFSSKIVQTFP---------------------- 232 S+ +V R V FS V + P Sbjct: 430 GQNMNNASLFPGEALVGKGGDNRIAAVRFSGNAVTSSPSFGFVTSPAGSDQASVSARTTT 489 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + + I + +T GL FD + + K ++ LTD Sbjct: 490 MRSNMNSLISWITKANMSGSTSGGRGLREGIRY-FDNVSAHTRVDRFGRPIKLVMVMLTD 548 Query: 293 GENS 296 G + Sbjct: 549 GLTN 552 Score = 43.3 bits (100), Expect = 0.055, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 30/89 (33%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R G + +AIL V+ L I+T F + + + + + ++ Sbjct: 8 RRVAQPVAGQSLVFSAILFFVMIAFAALAIDTGEAFSRQRQQQAASTAASIAGLESMNSE 67 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNE 95 +G +G Q+ I + N Sbjct: 68 IDGTDGAVQQAIRDALAANGITNAVYING 96 >gi|313667479|ref|YP_004047763.1| hypothetical protein NLA_1240 [Neisseria lactamica ST-640] gi|313004941|emb|CBN86368.1| conserved hypothetical protein [Neisseria lactamica 020-06] Length = 219 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 60/163 (36%), Gaps = 14/163 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++V+D S SM +G +++L + + L + + V G++ + + Sbjct: 19 CLLVVDTSGSM---YGEPIEQLNQGVQQFIQALQEDEIA---SYSVEVGILAAGGHVEEI 72 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + +T +E + D K E+ G Y+ +++ + Sbjct: 73 IPFT--TAEQLDYTSTFTAQGSTPLGSAVEQGLKMLED--RKREYQKNGVAYYQPWLVVI 128 Query: 291 TDGENSSPNIDNKESLFYCNE-AKRRGAIVYAIGVQAEAADQF 332 +DG SP + + A+ R + +GV D+ Sbjct: 129 SDG---SPTDSWQNAAQEARTLAENRKLVSLMVGVNDADMDKL 168 >gi|221117277|ref|XP_002154725.1| PREDICTED: similar to HyTSR1 protein, partial [Hydra magnipapillata] Length = 3382 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 76/207 (36%), Gaps = 28/207 (13%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + +D+ ++D+S S+ + P DK I ++ P + V Sbjct: 2982 PIECKTQVDLAFIVDMSESVT-NVEP--DKQEAIINFIIKL---ALMFPISKDQVHFAYF 3035 Query: 222 TFSSKIVQTFPLAW-----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + ++ P + Q +E + ++I + G Y ++ A+ Sbjct: 3036 PYGTEPKLDIPDQFFDGAMNNWTLSDKQKQEEFLRKMIILE----SRGFSYTWSAAITAR 3091 Query: 271 EKLEHIAKG-HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 E + KG + K+ I TDG + D ++ Y K +G V AIG+ + Sbjct: 3092 ETIFTTQKGMRSNVKRVAILFTDGVYN-GKHDTQKEWQY---VKDQGIQVVAIGIGSPIN 3147 Query: 330 DQFLKNCASPDRF-YSVQNSRKLHDAF 355 L+ AS F ++ + DAF Sbjct: 3148 ISNLELWASSKNFVFNATTYEE-ADAF 3173 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 79/216 (36%), Gaps = 27/216 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S++ +D+ ++D+S S+ + P DK I ++ P + V Sbjct: 2723 STECKTQVDLAFIVDMSESVT-NVEP--DKQEAIINFIIKL---ALMFPISKDQVHFAYF 2776 Query: 222 TFSSKIVQTFPLAW-----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + ++ P + Q +E + ++I + G Y ++ A+ Sbjct: 2777 PYGTEPKLDIPDQFFDGAMNNWTLSDKQKQEEFLRKMIILE----SRGFSYTWSAAITAR 2832 Query: 271 EKLEHIAKG-HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 E + KG + K+ I TDG + D ++ Y K +G V A+G+ + Sbjct: 2833 ETIFTTQKGMRSNVKRVAILFTDGVYN-GKHDTQKEWQY---VKDQGIQVIAVGIGSPIN 2888 Query: 330 DQFLKNCASPDR-FYSVQNSRKLHDAFLRIGKEMVK 364 L+ AS +S ++ IG + K Sbjct: 2889 TDNLELWASNKNSVFSATTYQEADAFISTIGDNVCK 2924 >gi|88857994|ref|ZP_01132636.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] gi|88819611|gb|EAR29424.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] Length = 974 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 77/264 (29%), Gaps = 72/264 (27%) Query: 171 MMMVLDVSLSMNDH------FGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVTF 223 +M+V D S SM S ++ +K++ + N + GL+ F Sbjct: 57 VMIVFDTSGSMESSVATGEYCYNRKGNQVYCADSRIKVAQAAMKALVEQNTDIEFGLMRF 116 Query: 224 SSKI--VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK----EKLEHIA 277 + L + I KI L G +T L +Y + + + Sbjct: 117 NGGSGGYILAGLGSSKEAILTKIESLRAGGSTPLAETLYESYLYLTGGGVNYAKNIADRD 176 Query: 278 KGHDDYKKY-----------------------IIFLTDGENS-----SPNIDNKESLFYC 309 K +D Y +I +TDG+ + + NI + + Y Sbjct: 177 KDIEDNSSYDSPFKPKKDDSGLDILRCDNSINMIIMTDGDPTEDGGQNGNIKSLYNSKYG 236 Query: 310 NEAKRRG--------------------------AIVYAIGV---QAEAADQFLKNCASP- 339 K + A + IG ++ LK AS Sbjct: 237 AYPKSKSGSYLNSLAKYMLNVDLFPTTSGVTDIARTFTIGFGSGMSDDGLDLLKQTASDG 296 Query: 340 -DRFYSVQNSRKLHDAFLRIGKEM 362 + + +L +A + ++ Sbjct: 297 GGEYLLASTAEQLTEALKKTITKI 320 >gi|254228715|ref|ZP_04922138.1| TPR repeat containing protein [Vibrio sp. Ex25] gi|262396563|ref|YP_003288416.1| TPR domain protein in aerotolerance operon [Vibrio sp. Ex25] gi|151938662|gb|EDN57497.1| TPR repeat containing protein [Vibrio sp. Ex25] gi|262340157|gb|ACY53951.1| TPR domain protein in aerotolerance operon [Vibrio sp. Ex25] Length = 599 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 48/164 (29%), Gaps = 29/164 (17%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +VLD+S SM ++L ++L K +GL+ ++ P Sbjct: 89 LVLDMSRSM-FATDIKPNRLAQTRYKALDLLPKWKEGA-------TGLIAYAGDAYNLSP 140 Query: 233 LAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + I L+ +E A N+ A + II Sbjct: 141 LTTDSSTLAGIIENLSPELMPFQGANLPSAIELALNQFSQAGVQQGD-----------II 189 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 L D ++D E N + + + + V Sbjct: 190 VLAD------DLDESELSRALNLVEGKNIRISVLAVGTANGAPI 227 >gi|290976446|ref|XP_002670951.1| predicted protein [Naegleria gruberi] gi|284084515|gb|EFC38207.1| predicted protein [Naegleria gruberi] Length = 1082 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 55/165 (33%), Gaps = 21/165 (12%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DVS SM P ++ + I R V + S + + Sbjct: 136 DVSGSMQGR--PWEQVCTSLKHFAQQSFNNPAII------CRM--VAYESSA-KEIDMKG 184 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF-DAKEKLEHIAKGHDDYKKYIIFLTDGE 294 +Q I I G T + A I ++ + E++ G+ I FLTDGE Sbjct: 185 TLQSIIRNIETAFTGGGTDFASAFQLACTIITRESGQDRENLPFGNV----VITFLTDGE 240 Query: 295 NSSPNIDNKESLFYCNEAKRR----GAIVYAIGVQAEAADQFLKN 335 + + L Y +E R ++ +G + + L N Sbjct: 241 D-FSKVGKPGGLQYLSEEINRVYRGDITIHTVGFGSHHNLELLDN 284 >gi|257889535|ref|ZP_05669188.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|257825895|gb|EEV52521.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] Length = 857 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 52/136 (38%), Gaps = 23/136 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +SS+ Sbjct: 17 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 70 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A + + Sbjct: 71 GYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGH-------- 122 Query: 281 DDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 123 ---KKVIVLLTDGVPT 135 >gi|154492263|ref|ZP_02031889.1| hypothetical protein PARMER_01897 [Parabacteroides merdae ATCC 43184] gi|154087488|gb|EDN86533.1| hypothetical protein PARMER_01897 [Parabacteroides merdae ATCC 43184] Length = 289 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 45/109 (41%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S + + V + + + + + N + G+V F Sbjct: 72 EEERELTVMLLIDVSGSRDF------GSVNVMKKEVITEIAATLAFSAIQNNDKIGVVFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S KI + P G +H I+E I+ T L+Y N I Sbjct: 126 SDKIEKFIPPQKGKKHILYVIRELIDFQPDKKQTNIAQALKYLTNAIKK 174 >gi|39933646|ref|NP_945922.1| hypothetical protein RPA0569 [Rhodopseudomonas palustris CGA009] gi|39647492|emb|CAE26013.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 372 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 44/113 (38%), Gaps = 16/113 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++ +++ VLD + SM+ K+ +I + + + +R GLV Sbjct: 25 AQARPSVEVAFVLDTTGSMSGLIEGAKRKIWSIATTILD--------DNPDADIRMGLVA 76 Query: 223 F----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTK----STPGLEYAYNKIF 267 + +V++ L +Q + ++ +L L+ A NK+ Sbjct: 77 YRDIGDDYVVRSVDLTTDIQDLYGQLLQLQARGGGDWPESVNEALDTAINKLH 129 >gi|291569186|dbj|BAI91458.1| magnesium chelatase subunit D [Arthrospira platensis NIES-39] Length = 663 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 68/205 (33%), Gaps = 39/205 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQ 229 ++ V+D S SM ++++ A ++ ++L + + L+ F + Sbjct: 468 VVFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQSRDQVSLIPFRGEQAEV 516 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + + ++ R+ G + GL A +AK+ D + I+ Sbjct: 517 LLPPTRSIAAAKRRLERMPCGGGSPLAHGLTQAVRVGMNAKQ-------SGDIGQVVIVA 569 Query: 290 LTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI-----GVQAEAAD 330 +TDG + P E L + + G + I V A Sbjct: 570 ITDGRGNIPLARSLGEPILEGEKPDIKGELLEIAAKIRGLGIQLLVIDTENKFVSTGFAK 629 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAF 355 + K A ++ + + + A Sbjct: 630 ELAKT-AGGKYYHLPKATDQAIAAM 653 >gi|254559618|ref|YP_003066713.1| hypothetical protein METDI1076 [Methylobacterium extorquens DM4] gi|254266896|emb|CAX22695.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 473 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 75/464 (16%), Positives = 132/464 (28%), Gaps = 122/464 (26%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 N +GSI++L A+ + ++GL ++ K +L D + L Sbjct: 14 LASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIA 73 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 N + + ++ F + FA + L I+ Q D +S Sbjct: 74 ANAQQSDVTASGIKAGESQALKAFNANASKVPFAT-----VSLSQLEIVRSGQTLDATVS 128 Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN------ 182 + + F ++ V S LD +++DVS SM Sbjct: 129 YTATVQSTFGRTFGLSATTLTNR-------VNASVDLASYLDFYLMVDVSGSMGLPTKDS 181 Query: 183 -----------------------DHFG--PGMDKLGVATRSIRE-MLDIIKSI--PDVNN 214 D G K+ + + ++ + +++K P V N Sbjct: 182 DAEALAMQSKEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNNAVCELLKRASTPVVPN 241 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQ--------------EKINRLIFGST---TKSTP 257 R G+ F +++ PL + N L GST T + P Sbjct: 242 QYRIGIYPFINQLATLAPLTDTTTSLAALRTAAQCDKIWPLAFTNLLDTGSTQLFTNNDP 301 Query: 258 --GLEYAYNKIFDAKEKLEHIAKG------HDDYKKYIIFLTDGENS------------- 296 G A K++ K + + ++ +TDG + Sbjct: 302 KTGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDGMQNSQSYSAWKDTKTF 361 Query: 297 ---------SPNIDNKESL------FYCNEAKRRGAIV------YAIGVQAEAADQF--- 332 PN D S C + K GA + Y I Sbjct: 362 SGNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIPYNIVKNYNNDSYIVWE 421 Query: 333 --------------LKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 L+ CASP FY+ + + + K+ Sbjct: 422 NGRVNQFSPTLADPLRKCASPGFFYTANTQDDITASLGAMFKQA 465 >gi|308094406|ref|ZP_05889083.2| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308095541|ref|ZP_05906689.2| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308125336|ref|ZP_05774598.2| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|308126258|ref|ZP_05908794.2| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308089078|gb|EFO38773.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308091462|gb|EFO41157.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308109129|gb|EFO46669.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308114017|gb|EFO51557.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 400 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 16/99 (16%), Positives = 33/99 (33%), Gaps = 10/99 (10%) Query: 23 ILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYR 82 ++L ++ V I+ +H+ K +L +D + L A + Q Sbjct: 1 MVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVA----DKTEDVDQAEAAVIA 56 Query: 83 IIKNIWQTDFRNELRENG------FAQDINNIERSTSLS 115 + +I EL F+ D+ + S + Sbjct: 57 TLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFT 95 >gi|222530086|ref|YP_002573968.1| Ig domain-containing protein group 2 domain-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222456933|gb|ACM61195.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor bescii DSM 6725] Length = 1831 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 66/180 (36%), Gaps = 21/180 (11%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + S +S D++ V+D + SM+ D++ ++I +D +K Sbjct: 824 KKYLDITGLKSGTVSPSGQA--DIVFVIDTTGSMS-------DEIDAVKQNINNFVDKLK 874 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + + V GLVT+ G + + K G + + Sbjct: 875 T---KDISVNLGLVTYKDITCDGLNSTVGHGFFSSADDFKNALGSIKVDGGGDTP-ETLI 930 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----AKRRGAIVYAIG 323 DA E + ++ K+I+ LTD ++ ++N+ + +E K IV + Sbjct: 931 DALE-TARLLGFRENSTKFIVVLTD---ANYKLENRFGIKSADEIIERLKSDNIIVSVVS 986 >gi|254410148|ref|ZP_05023928.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196183184|gb|EDX78168.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 615 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 56/177 (31%), Gaps = 33/177 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP--DVNNVVRSGLVTFSSKI 227 D++ ++D S S G + K R + L+ + D +N VR Sbjct: 288 DVVFLMDTSGSQQ---GAPLHKCQELMRRFIQGLNPDDTFTILDFSNAVR---------H 335 Query: 228 VQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + PLA + IN+L T+ G+ A N Sbjct: 336 LSQTPLANTPENRELAINYINQLHASGATEMLSGIRAAINVPAPEGRLRS---------- 385 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ L+DG + N L + + G +Y+ G + L A R Sbjct: 386 --VVLLSDGYIGNEN----RILAEVQQELKPGNRLYSFGAGSSVNRFLLNRIAEIGR 436 >gi|114586163|ref|XP_526141.2| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Pan troglodytes] Length = 891 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 63/168 (37%), Gaps = 18/168 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 D+ + + ++ ++ + PD VR GLV +S + F L Sbjct: 651 DLIFLIEEFSRVRQPNFQQVVNFLKTIVSSLSIRPD---TVRFGLVFYSEEPRLEFSLDA 707 Query: 235 -WGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 I E +++L + TK+ L++ N++F E + + ++ + + Sbjct: 708 FQNPAKILEHLDKLTYRERKGRTKTGAALDFLRNEVF----IQEKGSWSNHGVQQIAVVI 763 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 T+G + +R G +YA+G + + L+ AS Sbjct: 764 TEG------FSQDRVSRPASRLRRAGVTIYAVGTHNVSESKDLEKIAS 805 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 47/145 (32%), Gaps = 22/145 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + ++ + D VR GL ++ I Sbjct: 201 DIVFLVDNSTSIGPQ------NFQKVKNFLYSVILGLDISSDR---VRVGLAQYNDNIYP 251 Query: 230 TFPLAWGVQHIQEKINRLIFG-----STTKSTPGLEYA-YNKIFDAKEKLEHIAKGHDDY 283 F L ++ I I T + LE+ N + + Sbjct: 252 AFQL--NQHPLKSMILEQIQNLPYRTGGTNTGSALEFIRTNYLTEESGSRAKDRVP---- 305 Query: 284 KKYIIFLTDGENSSPNIDNKESLFY 308 + +I +TD E++ + + L Sbjct: 306 -QIVILVTDRESNDEVQEVADRLKE 329 >gi|2935363|gb|AAC05096.1| complement component BfB [Danio rerio] Length = 359 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 74/216 (34%), Gaps = 31/216 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI LD+ + +DVS S ++ L A + I+ +L+ I + Sbjct: 150 KIYLNKGGKLDIYIAVDVSDS--------INDLKKAKQIIKTLLEKISYYEVSPSYE--- 198 Query: 220 LVTFSSKIVQTFPLAW--------GVQHIQEKINRLIFGSTTKS-TPGLEYAYNKIFDAK 270 ++ F++ + Q + + + E +++ F + Y I D+ Sbjct: 199 ILMFATDVYQIVKMRDFKTEEVAGSLSKVFEDLDKFDFDKKLDQKGSNIAKLYQTILDSM 258 Query: 271 EKLEHIAKGHDDYKK-YIIFLTDGE-NSSPNIDN-----KESLFYCNEAKRRGAIVYAIG 323 + K K +I TDG+ N N + + + + +Y G Sbjct: 259 SNEQIRNKEDFLQTKHVVIVFTDGQANMGGNPKPKVHLIRNLVLKNDANRENKLDLYVFG 318 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAF 355 V + + L S F+ +Q+ ++ F Sbjct: 319 VGKDVRTEDLNGLVSEKENERHFFKLQDLDEVQKTF 354 >gi|125975601|ref|YP_001039511.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715826|gb|ABN54318.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 1300 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 52/179 (29%), Gaps = 28/179 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L D++ V+D + SM + + + ++ + Sbjct: 58 LNPGGETIPPVAEIGQADIVFVIDTTGSMGSVINNVKNNIT----------NFANTLMEN 107 Query: 213 NNVVRSGLVTF----SSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 N VR GL+ + + T L W V +N + + A Sbjct: 108 NVDVRLGLIDYKDLEEDGMDSTKNLGWFDNVSDFIASVNNMRATGGGDAPESTVDALE-- 165 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTD---GENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 E + K+I+ TD E++ D + + K +V AI Sbjct: 166 ----EARRMDFRPG--VNKFIMLFTDVSYKESTRFE-DVQSMKTVIEKLKEDKIVVSAI 217 >gi|328865997|gb|EGG14383.1| type A von Willebrand factor domain-containing protein [Dictyostelium fasciculatum] Length = 923 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 51/143 (35%), Gaps = 10/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + V+D S SM GM L A +I L I + R L+T + + Q Sbjct: 3 ITFVVDTSGSMGQKTSNGMTLLDCAKAAIEHFLKIRSKDTQIFRNDRYFLIT-TEECPQA 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK-----EKLEHIAKGHDDY 283 + W +++ L + L A++ + + + Sbjct: 62 VKIGWRDNFNSFLQELKNLQPKDLSNIPYSLVKAFDHLNQFRIQSLIDNYGLGRNPWFIE 121 Query: 284 KKYIIFLTDG--ENSSPNIDNKE 304 ++ LTDG +NS+ +DN + Sbjct: 122 PAVVVLLTDGGAQNSNGLLDNIQ 144 >gi|221042216|dbj|BAH12785.1| unnamed protein product [Homo sapiens] Length = 677 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 16/198 (8%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKILGDMQPG-DYFDLVLFGTRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K +Q Q+ + T GL + +E L ++ Sbjct: 340 WKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--- 341 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 342 --FYSVQNSRKLHDAFLR 357 Y ++ + F Sbjct: 456 QRIYEDHDATQQLQGFYS 473 >gi|146338399|ref|YP_001203447.1| hypothetical protein BRADO1313 [Bradyrhizobium sp. ORS278] gi|146191205|emb|CAL75210.1| hypothetical protein; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 526 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 66/192 (34%), Gaps = 22/192 (11%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 + LD+S SM H G +L A R + + + + ++ F+ ++ Sbjct: 348 ALCLDLSGSMQGH---GETQLLEAMRFLLTPARTREMLVQWSKQDEILVLPFNDHVLWVA 404 Query: 232 PLAWGVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + Q + +L G T A + + H+A I+ Sbjct: 405 SASGDEQEQAGLLKQALQLHAGGGTDFYQCGARALAAMKPTLDGGAHLAA--------IV 456 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQ 346 +TDG+ + ++ + A V+ + EA + L A + R + Sbjct: 457 IMTDGK----SYGDRATFEEPWRADGGRVPVFGVTFGDEADRKQLDALAKLTGGRVF--D 510 Query: 347 NSRKLHDAFLRI 358 ++ L DAF + Sbjct: 511 GTKNLTDAFRAV 522 >gi|156744078|ref|YP_001434207.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235406|gb|ABU60189.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 429 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 64/204 (31%), Gaps = 36/204 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---- 226 + ++LD S SM + +L +++ +I I + L T+ + Sbjct: 172 IAVILDASGSM-------LARLDGTPKTVIARQALIALINRLPETTNVALRTYGHRRADD 224 Query: 227 ------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 I PL + +IN + + ++ + D + L I Sbjct: 225 CSDTELIQALAPLQRD--ALIARINAIRPVNGGRTPIA-----QSLADMAQDLAGIEGNV 277 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK----NC 336 I+ ++DG+ + + A + IG E + + Sbjct: 278 L-----IVLVSDGDETCGGDPVATASML--RAANSQLRISVIGFDVEQEEWRRRLEGIAV 330 Query: 337 ASPDRFYSVQNSRKLHDAF-LRIG 359 A ++ N+ +L DA I Sbjct: 331 AGGGAYFDASNAEQLADALDQAIA 354 >gi|221044450|dbj|BAH13902.1| unnamed protein product [Homo sapiens] Length = 623 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 66/192 (34%), Gaps = 16/192 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D+S SM K+ ++ ++L ++ D ++V G S K Sbjct: 5 VVFVIDISGSMRGQ------KVKQIKEALLKILGDMQP-GDYFDLVLFGTRVQSWKGSLV 57 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 +Q Q+ + T GL + +E L ++ +I L Sbjct: 58 QASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHAS----ILIML 113 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-----FYSV 345 TDG+ + D + L A R +Y +G FL+ + + Y Sbjct: 114 TDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRAQRIYED 173 Query: 346 QNSRKLHDAFLR 357 ++ + F Sbjct: 174 HDATQQLQGFYS 185 >gi|194381994|dbj|BAG64366.1| unnamed protein product [Homo sapiens] Length = 1207 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 93/249 (37%), Gaps = 20/249 (8%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 NI+R+ +I D + + + ++ AP + + + Sbjct: 970 NIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKK 1029 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S D R + E++D + D ++ ++ GLV ++S Sbjct: 1030 KA-DIVFLLDGS------INFRRDSFQEVLRFVSEIVDTV--YEDGDS-IQVGLVQYNSD 1079 Query: 227 IVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + I + IN++++ + + + E ++ Sbjct: 1080 PTDEFFLKDFSTKRQIIDAINKVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVP 1137 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 + +T G++ D +L +RG V+A+GV+ +++ K ++ + Sbjct: 1138 QIAFVITGGKSVEDAQDVSLALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFR 1191 Query: 345 VQNSRKLHD 353 V N ++L + Sbjct: 1192 VGNVQELSE 1200 >gi|172035989|ref|YP_001802490.1| magnesium chelatase, ATPase subunit D [Cyanothece sp. ATCC 51142] gi|171697443|gb|ACB50424.1| magnesium chelatase, ATPase subunit D [Cyanothece sp. ATCC 51142] Length = 682 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 68/219 (31%), Gaps = 45/219 (20%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 481 ARKAGALIVFVVDASGSMA------LNRMQSAKGAVMRLLTEA-----YENRDQVALIPF 529 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + ++++ L G + GL A + +AK D Sbjct: 530 RGEQADVLLPPTRSISLARKRLETLPCGGGSPLAHGLTQAVHVGMNAK-------MSGDI 582 Query: 283 YKKYIIFLTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI-----G 323 + I+ +TDG + P +E L + + G + I Sbjct: 583 GQVVIVAITDGRGNIPLAKSLGEPIPEGEKPDIKQELLDIAGKIRGLGIKLLMIDTEKKF 642 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 V + + + +Y + + I + Sbjct: 643 VSTGFGKELAQT--AGGTYYQLPRATD-----QAIAQMA 674 >gi|153874614|ref|ZP_02002765.1| conserved hypothetical protein [Beggiatoa sp. PS] gi|152068921|gb|EDN67234.1| conserved hypothetical protein [Beggiatoa sp. PS] Length = 232 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/155 (19%), Positives = 55/155 (35%), Gaps = 28/155 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +M++LD+S SM P +L A L I+ + D + GL+ F+ Sbjct: 99 LMILLDLSESMRVKDLPH-SRLEQA-------LQDIEELLDREEDIYVGLMVFAGIPHLV 150 Query: 231 FPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL ++ + L ++ TP L+ A + + + Sbjct: 151 TPLTDDYNTLRHLLYELDIDLLPVQGSQLTPALDSATRWLKGQAATMVP----------H 200 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 I+ ++DGE D + SL + +Y Sbjct: 201 ILVISDGE--FEEDDFQNSLALIQKT----IYIYT 229 >gi|291452853|ref|ZP_06592243.1| predicted protein [Streptomyces albus J1074] gi|291355802|gb|EFE82704.1| predicted protein [Streptomyces albus J1074] Length = 923 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/259 (13%), Positives = 77/259 (29%), Gaps = 42/259 (16%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFI-FCTFPWCANSSHAP----LLITSSVKISSKSDIGL 169 ++ + + L+A + Y F + S+ +S V + GL Sbjct: 126 TVTTPYRFQTPRLAAATTYSSTATGPGGFMVGSGGSNRTASTGTWQSSRVNPALPERCGL 185 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +++D+S SM+ + +D ++ P ++V R ++S Sbjct: 186 RVALIMDLSGSMSGSVP-------ALKTAADTFVDALQGTP--SSVARFTFSSYS----- 231 Query: 230 TFPLAWGVQHIQEKINRLIFGSTT---KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P G + ++ ++ A + ++ + Sbjct: 232 --PATRGGANAPGLVSVSTLADAAAFKRTYASWTNATAEGSTNWDRALYEPASATSQYDV 289 Query: 287 IIFLTDGENSSPNI----------------DNKESLFYCNEAKRRGAIVYAIGV--QAEA 328 + +TDG ++ +I + + + N K G V A+GV Sbjct: 290 AVVITDGMPTNYSIPGGPSGGASGSVTRFRELESGIASANALKNEGTRVLAVGVGEGTSG 349 Query: 329 ADQFLKNCASPDRFYSVQN 347 S Y N Sbjct: 350 NAALNLASISGTEKYDGDN 368 >gi|170077221|ref|YP_001733859.1| magnesium chelatase ATPase subunit D (Mg-protoporphyrin IX chelatase subunit D) [Synechococcus sp. PCC 7002] gi|169884890|gb|ACA98603.1| magnesium chelatase ATPase subunit D (Mg-protoporphyrin IX chelatase subunit D) [Synechococcus sp. PCC 7002] Length = 682 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 68/214 (31%), Gaps = 38/214 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G M+ ++D S SM ++++ A ++ +L N + L+ F Sbjct: 481 ARKAGALMIFLVDASGSMA------LNRMQAAKGAVMRLLTEA-----YENRDQIALIPF 529 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ L G + + GL A + +A+ D Sbjct: 530 RGEMAEVLLPPTRSITLARNRLETLPCGGGSPLSHGLTQAVHIGVNAQ-------MSGDV 582 Query: 283 YKKYIIFLTDGENSSP---------------NIDNKESLFYCNEAKRRGAIVYAI----G 323 + I+ +TDG + P +E L + + G + I Sbjct: 583 GEVVIVAITDGRGNIPLSRSLGEKPEEGEEKPDIKEELLDIAGKIRASGMKLLIIDTEKK 642 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + L A + + + + A + Sbjct: 643 FISSGFGKELAAKAGGKYYRLPKATDQAIAAMAK 676 >gi|166212064|ref|ZP_02238099.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166206810|gb|EDR51290.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] Length = 222 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 63/171 (36%), Gaps = 12/171 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM + I+ M+ ++ P V ++T+ ++ Sbjct: 13 RLPVYLLIDTSGSMRGE------SIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQA 66 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL +++ Q + T + LE + + ++ + KG + Sbjct: 67 REYIPLT-ALENFQFTDITVPSAGGTFTGAALECLIHCVDRDIQRSDGDQKGDWRP--LV 123 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 +TD S+P+ KR + A V A+A + LK S Sbjct: 124 FLMTD---STPSDVYAYGEAIKEVKKRAFGSIIACAVSAKAKHEHLKQLTS 171 >gi|119575268|gb|EAW54873.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_b [Homo sapiens] Length = 1179 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 74/214 (34%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 173 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 221 Query: 229 QTFPLAWGVQHIQEKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L +E++ G T + ++YA + A + G Sbjct: 222 VVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAA------SGGRRS 273 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---L 333 K ++ +TDGE+ ++ K + CN + + I V A + Sbjct: 274 ATKVMVVVTDGESHDGSM-LKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEI 329 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 K AS F++V + L + +G+++ Sbjct: 330 KAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 363 >gi|115522645|ref|YP_779556.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] gi|115516592|gb|ABJ04576.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] Length = 638 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 75/183 (40%), Gaps = 29/183 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + L + +++DVSLS D + L V ++ + + + D +++ +T Sbjct: 443 RQQGYDLAVTLLVDVSLS-TDSWIDNRRVLDVEKEALLVLAHGLSACGDHHSI-----LT 496 Query: 223 FSSK----IVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 F+S+ + A+G + ++ +I L G T+ + +A ++ ++ Sbjct: 497 FTSRRRDWVRVESVKAFGEPMSERVERRIGALKPGYYTRIGAAIRHAAAELALQPQRR-- 554 Query: 276 IAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K ++ LTDG+ + ++S EA+R G V+ + + +A Sbjct: 555 ---------KLLLVLTDGKPNDVDYYEGRFAIEDSRKAVQEARRAGTCVFGVTIDTDAQA 605 Query: 331 QFL 333 FL Sbjct: 606 YFL 608 >gi|258650425|ref|YP_003199581.1| Magnesium chelatase [Nakamurella multipartita DSM 44233] gi|258553650|gb|ACV76592.1| Magnesium chelatase [Nakamurella multipartita DSM 44233] Length = 705 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 44/144 (30%), Gaps = 18/144 (12%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQTFPL 233 +D S SM ++ ++ +L + GLVTF P Sbjct: 528 VDASGSMA-----ARARMEAVKAAVLSLLT-----DAYQRRDKVGLVTFRGGAADLALPP 577 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 V+ ++ L G T GL A + + ++E I ++ +TDG Sbjct: 578 TSSVEAAARRLEMLPAGGRTPLAEGLLCAAHTL-----RVERIRDPRRRP--LLVVVTDG 630 Query: 294 ENSSPNIDNKESLFYCNEAKRRGA 317 +S S G Sbjct: 631 RATSGPDAVARSRRAATALAAEGI 654 >gi|220906001|ref|YP_002481312.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219862612|gb|ACL42951.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 230 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 63/172 (36%), Gaps = 15/172 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++++LD S SM G ++ L + D + V +VTF Sbjct: 27 VILLLDTSGSMA---GAPIEALN---NGLLAFKDAVIQDEQAALRVDLAIVTF-----GP 75 Query: 231 FPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L I + I L T + YA + + K+ + G Y+ ++ Sbjct: 76 VQLVQDFITIDQFIPPQLKAQGLTPMGEAINYALDLL--ENRKVAYRNNGIQYYRPWLFL 133 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +TDG + + + + N + R +A+GVQ + L A P+R Sbjct: 134 ITDGAPNRDSPWIQSADRIRNAERERKLSFFAVGVQGADMNT-LSQIAPPER 184 >gi|228982551|ref|ZP_04142810.1| hypothetical protein bthur0002_56840 [Bacillus thuringiensis Bt407] gi|228776734|gb|EEM25042.1| hypothetical protein bthur0002_56840 [Bacillus thuringiensis Bt407] Length = 690 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 54/164 (32%), Gaps = 17/164 (10%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI-PDVNNVVRSGLVTFSSKIVQT 230 M++D S SM D +K+ + ++ + + IK P + + R + Sbjct: 504 YMLVDGSGSMMDRVDENNNKVSCSQKACAVIEEGIKGFIPFKSTIFR--ASGRNVMHTTV 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + + + A ++ E+ + ++ L Sbjct: 562 SEFSNDSKFNESWNAYFEANGGNMDGFSIRIAIKELLQRPERN-----------RLLVIL 610 Query: 291 TDGENSSP---NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 +DG S+ + + EA++ G V +I ++ + Sbjct: 611 SDGLPSAYPSQKLGQADVKDAVKEARKNGIKVVSICFGSKRHRE 654 >gi|198283923|ref|YP_002220244.1| von Willebrand factor type A [Acidithiobacillus ferrooxidans ATCC 53993] gi|218667403|ref|YP_002426556.1| von Willebrand factor type A domain protein [Acidithiobacillus ferrooxidans ATCC 23270] gi|198248444|gb|ACH84037.1| von Willebrand factor type A [Acidithiobacillus ferrooxidans ATCC 53993] gi|218519616|gb|ACK80202.1| von Willebrand factor type A domain protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 759 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 69/178 (38%), Gaps = 34/178 (19%) Query: 165 SDIGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + + + ++LD+S S+N+ G G L ++ ++ + I+ + D + Sbjct: 562 AGRDIAVTLLLDLSESLNESVKTGGGDGQTVLQLSQEAVSLLAWSIEQLGDP-----LAI 616 Query: 221 VTFSSKIVQT--------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 F+S F WG ++ ++ L G +T+ + +A + + K Sbjct: 617 AGFNSNTRHEVRYQHIKGFSEPWG-DVVKGRLAALQAGYSTRMGAAMRHAGHYLATRKAD 675 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDN-----KESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ LTDG S ++ + +++ NE R G Y I + Sbjct: 676 -----------KKLMLVLTDGRPSDVDVQDDRLLIEDARQAVNELDRDGIFTYCISLD 722 >gi|120556027|ref|YP_960378.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120325876|gb|ABM20191.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 607 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 70/198 (35%), Gaps = 25/198 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV-----RS 218 ++ L +++ D+SLS + + + +A ++ + + + + D + R Sbjct: 412 QNQRDLACLVLTDISLSTDTYINNRQRVIDIARDGLQLLSEALTASRDPFALFAFSSRRR 471 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F P + +I L G T+ + + + E Sbjct: 472 DHIRFHHIKSFDEPYNDTTRG---RIQALEPGYYTRMGAAIRQSIKLLQTRPEH------ 522 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 +K ++ LTDG+ + +++ EA + G + + + EA ++L Sbjct: 523 -----QKILLLLTDGKPNDLDLYEGRYGVEDTRMAVQEAHKAGLTPFCVTIDEEAN-EYL 576 Query: 334 KNCASPDRFYSVQNSRKL 351 F +++ +L Sbjct: 577 PYVFGSSNFVVIKDPTQL 594 >gi|149608697|ref|XP_001516114.1| PREDICTED: similar to hCG2002731, partial [Ornithorhynchus anatinus] Length = 180 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 56/164 (34%), Gaps = 26/164 (15%) Query: 218 SGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKL 273 G+V + + F L + + N +I T + G++ A + F Sbjct: 1 VGIVQYGENVTHEFNLNKYTTTEEVLTASNEIIQRGGRQTMTALGIDTARKEAFTEPRGA 60 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--- 330 KK ++ +TDGE S N D ++ + C +AI + Sbjct: 61 RRG------VKKVMVIVTDGE-SHDNHDLEKVIQDCE---NENIQRFAIAILGSYNRGNL 110 Query: 331 ------QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +K+ AS F++V + L +G+ + Sbjct: 111 STEKFVEEIKSIASEPTEKHFFNVSDELALLTIVEALGERIFAL 154 >gi|222150614|ref|YP_002559767.1| hypothetical protein MCCL_0364 [Macrococcus caseolyticus JCSC5402] gi|222119736|dbj|BAH17071.1| hypothetical protein [Macrococcus caseolyticus JCSC5402] Length = 1102 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 59/166 (35%), Gaps = 23/166 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLV 221 S + +D+++V+D S SM G G + + + +D + N +R +V Sbjct: 499 SVTQETVDIVLVVDNSASMGTIIGNGKTRWQSMKDDVFQFIDEVTQANTAANTKIRIDVV 558 Query: 222 TFSS--KIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEH 275 F+S + + I+ K G T + GL NK+ ++ Sbjct: 559 NFASALRPEINSGFSGDPAVIKSKFYPSYVPSGQGGGTFTQQGLISGSNKLDTSRAS--- 615 Query: 276 IAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRG 316 KK ++ LTDG + ++ + N K G Sbjct: 616 --------KKVMVVLTDGAPTLSYKGVSATGSESITSFSNTIKGNG 653 >gi|47228607|emb|CAG07339.1| unnamed protein product [Tetraodon nigroviridis] Length = 766 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 74/193 (38%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 196 DVVILVDVSGSMKGL------RLTIARQTVSSILDTLGDDDFFN------IIAYNEELHY 243 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G H +E +++L L A++ + D E Sbjct: 244 VEPCLNGTLVQADVTNKDHFREHLDKLFAQGIGMLDVALTEAFSLLRDFNETGR-----G 298 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLK--NCA 337 D + I+ +TDG +D +++F R ++ + E+A + LK CA Sbjct: 299 SDCSQAIMLVTDG-----AVDTYDTIFAKYNWPERKVRIFPYLIGRESAFAENLKWMACA 353 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 354 NKGYFTQISTLAD 366 >gi|327272219|ref|XP_003220883.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Anolis carolinensis] Length = 1068 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 61/195 (31%), Gaps = 38/195 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-- 227 D+++V+D+S SM L +A +I +LD + VN ++ ++ + Sbjct: 225 DIVIVVDISGSMKGLL------LTIAKHTIVTILDTLGENDFVN------IIAYNDYVHY 272 Query: 228 -------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + +H ++ ++ L L A+ + + A Sbjct: 273 VESCFKGILVQADRDNREHFKQLVDELHAKGVGTVNKALTEAFRIL-----REFRNAGQG 327 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN----- 335 + I+ +TDG ++ + R V+ F +N Sbjct: 328 GLCNQAIMLITDGAMEDYEYVFEKFNWP-----DRKVRVFT--YLIGREVSFAQNVKWIA 380 Query: 336 CASPDRFYSVQNSRK 350 C + + + Sbjct: 381 CNNKGYYTQISTLAD 395 >gi|114600323|ref|XP_526928.2| PREDICTED: integrin alpha-2 [Pan troglodytes] Length = 1181 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 74/214 (34%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 173 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 221 Query: 229 QTFPLAWGVQHIQEKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L +E++ G T + ++YA + A + G Sbjct: 222 VVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAA------SGGRRS 273 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---L 333 K ++ +TDGE+ ++ K + CN + + I V A + Sbjct: 274 ATKVMVVVTDGESHDGSM-LKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEI 329 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 K AS F++V + L + +G+++ Sbjct: 330 KAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 363 >gi|116622485|ref|YP_824641.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225647|gb|ABJ84356.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 67/208 (32%), Gaps = 34/208 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 G + ++ D S M + + D + R L+ + S + TFS + Sbjct: 89 GTAVFVLFDTSNRMYNSYPYVRDAIAEFIR----RLEPVDSA---------AIYTFSRNL 135 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL ++ + G + + DA E +K I Sbjct: 136 YRAAPLTRDHVRAGAGLDNISAG---DDSALFNALLLTLRDAAE---------VPGRKAI 183 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI----GVQAEAADQFLKN--CASPDR 341 + + NSS N A G +Y I GVQ + L A+ + Sbjct: 184 VVFS---NSSDNASVLSPYDVGRLAVNEGVPIYVISTRDGVQDPVTNSALHYLTAATGGK 240 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRILY 369 YS + +K AF I +++ Y Sbjct: 241 LYSARTWQKQAGAFQAIREDIRNSYTAY 268 >gi|220909014|ref|YP_002484325.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219865625|gb|ACL45964.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 589 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 68/206 (33%), Gaps = 37/206 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D S SM KL ++ ++ + V L+ FSS + Sbjct: 410 VVIVVDTSGSMAGE------KLANVQNTLNTYINGLSPQDQV------ALMRFSSDVGTP 457 Query: 231 FPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + G + I+ L T A N + A + Sbjct: 458 VVVDGTPAGRDRGLQFISSLRANGNTHLYDATLAARNWLTQNLRSDAINA---------V 508 Query: 288 IFLTDGENSSPNIDNKESLFYCNEA---KRRGAIVYAIGVQAEAA--DQFLKNCAS-PDR 341 + LTDGE++ I ++ ++ + + +G E Q L+ A+ Sbjct: 509 LVLTDGEDTGSAISLEQLGPELQKSGFNSDQRISFFTVGYGEEGEFDPQALQQIANVNGG 568 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQRI 367 +YS + IG+ M ++ Sbjct: 569 YYSKGDP-------ASIGRLMADLQL 587 >gi|108758819|ref|YP_634592.1| hypothetical protein MXAN_6470 [Myxococcus xanthus DK 1622] gi|108462699|gb|ABF87884.1| conserved hypothetical protein [Myxococcus xanthus DK 1622] Length = 914 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 72/212 (33%), Gaps = 28/212 (13%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 PLL S + L + +++D S SM G K+ +A + L ++ + Sbjct: 378 PLLPVSLEMREEQRRASLALSVLMDASCSMGMTVPDGRTKMELAAEGVVAALTLLNPKDE 437 Query: 212 VN-NVVRSGL--VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 V+ ++V + + S + PL + + G L +I Sbjct: 438 VSVHMVDTAAHEIFPLSPVEAGLPLD----AVARGFSG---GGGIYVGEALRAGRTEILR 490 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE- 327 +++ ++++ +D +S D + +L + + V IG+ Sbjct: 491 SEK-----------PTRHVLLFSDAADSEEPDDYQRTLA---HLREQEVTVSVIGLGVPS 536 Query: 328 -AADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 L+ A R Y +++ L F Sbjct: 537 DPDADLLREVAHRGGGRVYFAEDAMSLPRIFS 568 >gi|116295258|ref|NP_002194.2| integrin alpha-2 precursor [Homo sapiens] gi|21105795|gb|AAM34795.1|AF512556_1 integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [Homo sapiens] gi|119575267|gb|EAW54872.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_a [Homo sapiens] gi|151556518|gb|AAI48597.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|162319056|gb|AAI56716.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|168278403|dbj|BAG11081.1| integrin alpha-2 precursor [synthetic construct] Length = 1181 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 74/214 (34%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 173 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 221 Query: 229 QTFPLAWGVQHIQEKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L +E++ G T + ++YA + A + G Sbjct: 222 VVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAA------SGGRRS 273 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---L 333 K ++ +TDGE+ ++ K + CN + + I V A + Sbjct: 274 ATKVMVVVTDGESHDGSM-LKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEI 329 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 K AS F++V + L + +G+++ Sbjct: 330 KAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 363 >gi|304382528|ref|ZP_07365023.1| von Willebrand factor [Prevotella marshii DSM 16973] gi|304336359|gb|EFM02600.1| von Willebrand factor [Prevotella marshii DSM 16973] Length = 289 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 61/193 (31%), Gaps = 22/193 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + +M+++DVS S++ + + S + N + G++ F Sbjct: 72 EEEREMTVMLLIDVSGSLDF------GTTQRTKAELVTEIAATLSFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S +I + P G +HI I ++ + A + ++ D Y Sbjct: 126 SDQIEKYIPPQKGRKHILYIIREMLDFHAESRRTDIGMAVEFLTRVMKRRCTAFVLSDFY 185 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAADQFLKNC-ASPDR 341 + D + L CN VY I + LK C A Sbjct: 186 TR--------------QDFERQLQICNSKHDVVAIQVYDIRAKELPDVGLLKVCDAETGH 231 Query: 342 FYSVQNSRKLHDA 354 + S K A Sbjct: 232 EMYIDTSSKKLRA 244 >gi|156408321|ref|XP_001641805.1| predicted protein [Nematostella vectensis] gi|156228945|gb|EDO49742.1| predicted protein [Nematostella vectensis] Length = 981 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 48/290 (16%), Positives = 94/290 (32%), Gaps = 54/290 (18%) Query: 54 HSLLYTATKILNQENGNNG-------KKQKNDFSYRIIKNIWQTDFR---NELRENGFAQ 103 ++LL T + N N + + T +++G Q Sbjct: 93 NALLNAKTAVTNTGASNTRIIECCAMPAIARANYSYRFRAMLNTSVACKTRNPKDSGIIQ 152 Query: 104 DINNIERSTSLSIIIDDQHK-DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS 162 N++ + +++ Y ++ Y F N S + S + Sbjct: 153 GENSLVNTFRNNLVDSSVISWQYFGTSTGNY---LQFPASGKVCNGSSSFDPRFQSWYVE 209 Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + + + ++++V+D S SM ++ +A ++ +LD + + G+V Sbjct: 210 AVTRMRTNIVVVIDRSSSM-----STAGRMALARQAAVTVLDTLGPND------KVGVVA 258 Query: 223 FSSKIVQTFP---------LAWGVQHIQEKINRLI----------------FGSTTKSTP 257 FS I++ L + I+ + L F TK P Sbjct: 259 FSHFIIKPPGCFGGNVAEALPKNINRIKAWVEALTPRGKVSLQKTNLRYVSFPGATKYVP 318 Query: 258 GLEYAYNKIFDA--KEKLEHIAKG--HDDYKKYIIFLTDGENSSPNIDNK 303 LE A+ + + L H + I+FLTDG+ N D Sbjct: 319 ALEAAFEMLGGDFNIKILHHPLIALIKRSAENMILFLTDGDPFDRNPDVS 368 >gi|333030671|ref|ZP_08458732.1| protein of unknown function DUF58 [Bacteroides coprosuis DSM 18011] gi|332741268|gb|EGJ71750.1| protein of unknown function DUF58 [Bacteroides coprosuis DSM 18011] Length = 289 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 53/132 (40%), Gaps = 12/132 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + ++N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEF------GTKKQTKKDMLTEIAATLAFSAIHNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S +I + P G +H I+E +N T G+EY N + + I Sbjct: 126 SDQIEKFIPPKKGRKHILFIIRELLNFNAQSRKTDIKLGIEYLTNVM--KRRCTAFILSD 183 Query: 280 HDDYKKYIIFLT 291 D K YI LT Sbjct: 184 FIDQKDYINALT 195 >gi|302338923|ref|YP_003804129.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301636108|gb|ADK81535.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 390 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 83/227 (36%), Gaps = 46/227 (20%) Query: 155 ITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML---DIIKSIP 210 + S + + + G +++ ++LD SLSM + K VA + +L D+ I Sbjct: 14 LGVSGILYADTRKGNIELFVLLDKSLSMVEEIDSV--KEYVADELVGRLLIPGDLFVLIN 71 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDA 269 R FS ++ ++ +++ + + T L+ Sbjct: 72 FYGKTDR----FFSGEVESRA----DIEALKDSLTSITADGRFTDIGSALDTL------- 116 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN---------KESLFYCNEAKRRGAIVY 320 EK K+Y++ LTDG+ +P L + E ++G ++ Sbjct: 117 -EKTVEAIPVRQGRKRYLLLLTDGKQEAPPDSPYYSPDGSFNHRLLEHTKEIAKKGWKIH 175 Query: 321 AIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +G+ E A + L +++L AF + ++ ++ I Sbjct: 176 ILGIGTETAAEEL--------------AKELSAAFSSVREKPEEREI 208 >gi|295132196|ref|YP_003582872.1| hypothetical protein ZPR_0316 [Zunongwangia profunda SM-A87] gi|294980211|gb|ADF50676.1| conserved hypothetical protein [Zunongwangia profunda SM-A87] Length = 288 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 52/154 (33%), Gaps = 15/154 (9%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + S V +Y+ W + + + + L MM+V+DVS S Sbjct: 34 KGRGMTFSEVRQYQFGDDVRNIDWNVTARY-----NEPFVKVFEEERELTMMLVVDVSGS 88 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 + FG + + + + N + GL+ FS +I P G H+ Sbjct: 89 --EFFGTQ----SQFKKEVITEIAATLAFSATQNNDKIGLLMFSDQIENYIPPKKGKSHV 142 Query: 241 QEKINRLIF----GSTTKSTPGLEYAYNKIFDAK 270 I L+ T GL+Y N + Sbjct: 143 LRIIRELLEFQPKSKKTDIGLGLKYLSNVMKKKA 176 >gi|262117974|dbj|BAI48012.1| collagen type VI alpha 3 subunit [Mesocricetus auratus] Length = 2675 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 43/358 (12%), Positives = 117/358 (32%), Gaps = 54/358 (15%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 V V+ + +F++K L +L ++ + + +F +N Sbjct: 863 VRIGVVQFSNDVFPEFYLKTHKSQSL---VLDAIRRLRFKGGSPLNTGKALEFVA---RN 916 Query: 87 IWQTDFRNELREN--------GFAQDINNIERSTSL-------------------SIIID 119 ++ + + + + +++ R + + Sbjct: 917 LFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIMSLGIGDRNIDRADLQTI 976 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCAN-SSHAPLLITSSVKISSK-SDIGLDMMMVLDV 177 + E+P I S P + + D++ +LD Sbjct: 977 TNDPKMVFTVREFRELPNIEERVMLSFGPSGSTPRPPGVDIFTPPQPEKKKADIVFLLD- 1035 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-- 235 S+N + L + +I+ ++ + + +R GLV ++S F L Sbjct: 1036 -GSINFQRESFGEVLRFVS-------EIVDTVYEEGDSIRVGLVQYNSDPTDEFFLKDYS 1087 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + I + IN++++ + + + ++ + + +T G+ Sbjct: 1088 TKRQIIDAINKVVYKGGRHANT--KVGIEHLRLNHFVPGAGSRLDERVPQIAFVITGGK- 1144 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHD 353 ++++ + K G V+AIGV+ +++ K ++ + V + ++L + Sbjct: 1145 ---SVEDAQGASLALTQK--GVKVFAIGVRNIDSEEVGKIASNSATAFRVGSVQELSE 1197 >gi|192289002|ref|YP_001989607.1| von Willebrand factor type A [Rhodopseudomonas palustris TIE-1] gi|192282751|gb|ACE99131.1| von Willebrand factor type A [Rhodopseudomonas palustris TIE-1] Length = 372 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 44/113 (38%), Gaps = 16/113 (14%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++ +++ VLD + SM+ K+ +I + + + +R GLV Sbjct: 25 AQARPSVEVAFVLDTTGSMSGLIEGAKRKIWSIATTILD--------DNPDADIRMGLVA 76 Query: 223 F----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTK----STPGLEYAYNKIF 267 + +V++ L +Q + ++ +L L+ A NK+ Sbjct: 77 YRDIGDDYVVRSVDLTTDIQDLYGQLLQLQARGGGDWPESVNEALDTAINKLH 129 >gi|289449442|ref|YP_003474780.1| von Willebrand factor type A domain-containing protein [Clostridiales genomosp. BVAB3 str. UPII9-5] gi|289183989|gb|ADC90414.1| von Willebrand factor type A domain protein [Clostridiales genomosp. BVAB3 str. UPII9-5] Length = 572 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 51/152 (33%), Gaps = 21/152 (13%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSG 219 S+ + D++ VLD S + R+ LD IK D + ++ G Sbjct: 61 PGSEDVLAQDIVFVLDKSGA---------SDQKGIDSQARQFLDDIKQQADEKGLNIKIG 111 Query: 220 LVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +V F L IQ K+ + G T GL A + D Sbjct: 112 IVNFYYAGKVRQELTDVVKNYNDIQNKLKSSVLGFGTNMHAGLLAAKKMLDDDTAVAAKN 171 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 K+II ++DG + +N + Y Sbjct: 172 --------KHIILISDGATYLYSKNNDFTTAY 195 >gi|307151097|ref|YP_003886481.1| magnesium chelatase ATPase subunit D [Cyanothece sp. PCC 7822] gi|306981325|gb|ADN13206.1| magnesium chelatase ATPase subunit D [Cyanothece sp. PCC 7822] Length = 672 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 68/213 (31%), Gaps = 37/213 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 471 ARKAGALIVFVVDASGSMA------LNRMQSAKGAVMRLLTEA-----YENRDQVALIPF 519 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ L G + + GL A + +AK D Sbjct: 520 RGEKADVLLPPTRSIALAKRRLESLPCGGGSPLSHGLTQAVHVGMNAK-------MSGDI 572 Query: 283 YKKYIIFLTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI----GV 324 + I+ +TDG + P E L + + G + I Sbjct: 573 GQVVIVAITDGRGNIPLAKSLGEPPAEGEKPDIKAELLEIAAKIRGLGMKLLVIDTEKKF 632 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + L A ++ + + + + + Sbjct: 633 VSTGFAKELAAKAGGKYYHLPRATDQAIASMAK 665 >gi|327281097|ref|XP_003225286.1| PREDICTED: integrin alpha-M-like [Anolis carolinensis] Length = 292 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 78/209 (37%), Gaps = 36/209 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + D++ +LD S S+ M ++K P+ + L+ FSS Sbjct: 47 NEARDIVFLLDGSTSVRPSDFIEMKAFVAL---------MMKRFPENTHF---ALLQFSS 94 Query: 226 KIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + F H+ ++N+L S T + G+ A K Sbjct: 95 HFQEHFDFRHFQRNRDPDHLMREVNQLRGSSYTAT--GIRKATELFTTQKGARA------ 146 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA----EAADQFLKNC 336 K++++ +TDGE ++ E + N AK +AIGV A + L + Sbjct: 147 -TAKRFLVVVTDGEKFGDMLEYAEVIEEANRAK---ITRFAIGVGIVFTSRVAQRELHSI 202 Query: 337 AS---PDRFYSVQNSRKLHDAFLRIGKEM 362 S PD + V++ L D ++ +++ Sbjct: 203 GSHPVPDHVFVVRHFTGLRDIQTQLKEKI 231 >gi|256071156|ref|XP_002571907.1| dihydropyridine-sensitive l-type calcium channel [Schistosoma mansoni] gi|238657056|emb|CAZ28137.1| dihydropyridine-sensitive l-type calcium channel, putative [Schistosoma mansoni] Length = 421 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 76/213 (35%), Gaps = 26/213 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-----IIKSIPDVNNVVRSG 219 S + D+ ++LD S SM G + ++ + + E LD + P + V Sbjct: 217 SSVPKDLFILLDTSGSMT---GQSLKLANLSAQKLIEALDVDDYFTVAHFPGAKDHVAPM 273 Query: 220 LVTFSSKIV------QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +VT +++ + ++ L + L++AY + E Sbjct: 274 IVTANNESEPICFNSFVQATRRNKLRLFYDLSTLKARGYSDFPASLKFAYEMFRNLTE-S 332 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-VYAIG--VQAEAAD 330 +G + K ++ LTD ++ D + L + K +Y++G V A Sbjct: 333 ARGDRGKELRNKILVLLTD---NAFVFD-ESVLSQLKQQKSNITTFIYSLGEPVGAAYEH 388 Query: 331 QFLKNCASPDRFYS---VQNSRKLHDAFLRIGK 360 + K CA+ D + V L I K Sbjct: 389 KM-KACATNDYYQYLPTVGAVSNLMKYNKNITK 420 >gi|53803884|ref|YP_114279.1| hypothetical protein MCA1842 [Methylococcus capsulatus str. Bath] gi|53757645|gb|AAU91936.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath] Length = 653 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 44/251 (17%), Positives = 81/251 (32%), Gaps = 49/251 (19%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK------ 164 S +D K Y L + E+ S A +S + + Sbjct: 156 SVEPQEYVDPAKKFYLLPILQAEELTSPRNVRVLEVASVTAKRGESSPAETRREVPDAPS 215 Query: 165 --SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + ++ V+D ++SM GP +D+ A R + ++ VR GLV Sbjct: 216 VLRNFSAAVVFVIDSTISM----GPYIDRTREAVRRVYTRIEKAGLADQ----VRFGLVA 267 Query: 223 FSSKIVQTFPLAW------GVQHIQE------KINRLIF---GSTT---KSTPGLEYAYN 264 F S L + ++ K+ L S+T S G+ A Sbjct: 268 FRSSTQAVPGLEYVSKVYADPSEVKTGKDFLAKVASLSPAKVSSSTFDEDSYAGIMTALQ 327 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN----EAKRRGAIVY 320 KI K +Y++ +TD N ++ + EA++ G ++ Sbjct: 328 KI-----------KWSGFGGRYVVLITDAGAIDGNDPLSQTKLGADQVRIEAEQLGVALF 376 Query: 321 AIGVQAEAADQ 331 + ++ A Sbjct: 377 GLHLKTPAGKA 387 >gi|124942|sp|P17301|ITA2_HUMAN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|33907|emb|CAA34894.1| unnamed protein product [Homo sapiens] Length = 1181 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 74/214 (34%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 173 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 221 Query: 229 QTFPLAWGVQHIQEKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L +E++ G T + ++YA + A + G Sbjct: 222 VVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAA------SGGRRS 273 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---L 333 K ++ +TDGE+ ++ K + CN + + I V A + Sbjct: 274 ATKVMVVVTDGESHDGSM-LKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEI 329 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 K AS F++V + L + +G+++ Sbjct: 330 KAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 363 >gi|86131933|ref|ZP_01050530.1| conserved hypothetical protein [Dokdonia donghaensis MED134] gi|85817755|gb|EAQ38929.1| conserved hypothetical protein [Dokdonia donghaensis MED134] Length = 483 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 21/190 (11%), Positives = 59/190 (31%), Gaps = 22/190 (11%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + ++ S + +D++ +D + SM D++ ++ +++ I + Sbjct: 260 AVEITIASSVAATNTVDVLFAVDATGSMG-------DEIAYLKSELKNIMNRIDAQIAQK 312 Query: 214 NVVRSGLVTFSS----KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R L + + ++ V +++ +++ +E A Sbjct: 313 ---RVALTVYRDHGDSYVTRSIDFKENVNEVKDFLSQQDANGGGDYEEAVEEALKVSLSQ 369 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY-AIGVQAEA 328 ++ A K + + D + +A+ G + + A Sbjct: 370 SWNVDAKA-------KLLFLMLDAPPHYTEQNVALIKSQIKKAQEMGIRIIPVVASGANK 422 Query: 329 ADQFLKNCAS 338 +FL S Sbjct: 423 EVEFLMRSFS 432 >gi|167522505|ref|XP_001745590.1| hypothetical protein [Monosiga brevicollis MX1] gi|163775939|gb|EDQ89561.1| predicted protein [Monosiga brevicollis MX1] Length = 1927 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 54/188 (28%), Gaps = 33/188 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++V+D S +M+ ++ A + +L + + GL+ FS + Sbjct: 1135 DIVIVIDASGAMS------INDFTAAKTTALAILRRLALAQP---DISVGLIFFSQQAQV 1185 Query: 230 TFPL--AWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL Q + L G T L A + + + Sbjct: 1186 ALPLLDINDETEFQLLLLVLQAGQYQGQATNLGSALSSAADLLETSNNGA---------- 1235 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG-VQAEAADQFLKNCASPDRF 342 + I +DG + + + G + + V L + D Sbjct: 1236 -RQFILFSDGSSDD------QGTLVAQNIRATGIQILTVANVMNANVYNLLLIAGAADNI 1288 Query: 343 YSVQNSRK 350 + S Sbjct: 1289 FLTVQSDA 1296 >gi|14030583|gb|AAK52966.1|AF367012_1 serum opacity factor VT3.1 [Streptococcus pyogenes] Length = 1023 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 65/175 (37%), Gaps = 23/175 (13%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V ++ K D G D+M +LDVS M ++F +++ ++ K + Sbjct: 207 TIDVTVTVTPKEIDKGADVMALLDVSQKMTKENFDKAKEQIKKMVTTLTGEPTDGKENRN 266 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAY 263 N VR L+TF KI + L+ I +K+ + + + A Sbjct: 267 RRNSVR--LMTFYRKISEPIDLSGKTSEEVEKELDNIWDKVKKEDWDWGVDLQGAIHKAR 324 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + KE + ++I+ + GE++ N + N ++ Sbjct: 325 DIFKKEKESKKR---------QHIVLFSQGESTFSYDINDKDKN--NTVRKNRIT 368 >gi|29346319|ref|NP_809822.1| hypothetical protein BT_0909 [Bacteroides thetaiotaomicron VPI-5482] gi|253568260|ref|ZP_04845671.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298385673|ref|ZP_06995231.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] gi|29338214|gb|AAO76016.1| von Willebrand factor-like, type A [Bacteroides thetaiotaomicron VPI-5482] gi|251842333|gb|EES70413.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298261814|gb|EFI04680.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] Length = 289 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTNIRLALEYLTNVMK 173 >gi|282898869|ref|ZP_06306855.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] gi|281196242|gb|EFA71153.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] Length = 1499 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 79/207 (38%), Gaps = 35/207 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREML---DIIKSIPDVNNVVRSG-------- 219 ++ V+DVS S F G+ V I+ + +I I N++VR G Sbjct: 852 IVFVIDVSGSTRGPFQ-GIPVGDVNKDGIQNTILDAEIAGFIALNNSLVRKGFGSRAKVS 910 Query: 220 LVTFSSKIVQ-------TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +V+F+S T G + ++EK+ L G T L+ A + D Sbjct: 911 IVSFASDAKTLLTTNPETDSNKNGTKDVEEKLISLKSGGETNFEIALQEAAKTLRDIGTT 970 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + +IF++DG+ + N ++ + ++ G + A GV A+ Sbjct: 971 AGNGN---------VIFMSDGQPNQGNYTDEVL-----DLQKAGVKLSAFGVGTGASIDS 1016 Query: 333 LKNCASPDRFYSVQNSRKLHDAFLRIG 359 LK + ++ +L F +G Sbjct: 1017 LKLINPNASIF--TSTDQLLGVFDGLG 1041 >gi|15674025|ref|NP_268200.1| hypothetical protein L107379 [Lactococcus lactis subsp. lactis Il1403] gi|12725093|gb|AAK06141.1|AE006434_3 unknown protein [Lactococcus lactis subsp. lactis Il1403] Length = 1450 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 48/142 (33%), Gaps = 29/142 (20%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 +D+++V+D+S SM K + + + L I++ + V G+V +SS Sbjct: 324 KPVDIVLVIDMSGSMQGA------KETAVRQGVSDFLSTIQNTA-YADYVNVGIVGYSSP 376 Query: 226 ---------KIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLE 274 I H++ L F T + GL + Sbjct: 377 GNYVTGASGYITVPIDKVSSESHVKSINQALAPQFSGGTFTQLGLRKGTEMLEQDSSDN- 435 Query: 275 HIAKGHDDYKKYIIFLTDGENS 296 +K +I +TDG + Sbjct: 436 ---------QKMMILMTDGVPT 448 >gi|237728320|ref|ZP_04558801.1| conserved hypothetical protein [Citrobacter sp. 30_2] gi|226909798|gb|EEH95716.1| conserved hypothetical protein [Citrobacter sp. 30_2] Length = 637 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 59/176 (33%), Gaps = 26/176 (14%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S D+ ++LD S SM+ + + A S+ LD I V + Sbjct: 457 SKAVDVSSAFTILLDSSGSMSRS----VKEAEAAVVSMLYALDGI-------QGVTTSAY 505 Query: 222 TFSSKIVQTFPLAWGVQ-HIQEKINRLIFGSTTKS-TPGLEYAYNKIFDAKEKLEHIAKG 279 F + L G + ++ I FG T+ TP E + + D Sbjct: 506 HFPHAAHNSVGLLKGREQTLRTAIGTHQFGIGTQGCTPLCESLWPALADLTSAKADRH-- 563 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 ++ TDG+ D + AK IV IG +A D +K+ Sbjct: 564 ------VLVIATDGQPD----DMASARAMIQSAKDDDIIVIGIGFG-DANDSMMKS 608 >gi|221043432|dbj|BAH13393.1| unnamed protein product [Homo sapiens] Length = 367 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 71/196 (36%), Gaps = 24/196 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 ++ T +D++ ++D S S++ + +++ +++ + ++ Sbjct: 131 WEIIQTVPDATPECPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLF 187 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDA 269 + + F+ +T P Q ++ + T + G+ ++F Sbjct: 188 ALMQYSNLLKIHFTFTQFRTSP------SQQSLVDPIVQLKGLTFTATGILTVVTQLFH- 240 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---- 325 H KK +I +TDG+ D E +A++ G I YAIGV+ Sbjct: 241 -----HKNGARKSAKKILIVITDGQ---KYKDPLEYSDVIPQAEKAGIIRYAIGVRLFFT 292 Query: 326 -AEAADQFLKNCASPD 340 A + C++ Sbjct: 293 PAPGSACISFRCSAAG 308 >gi|160940244|ref|ZP_02087589.1| hypothetical protein CLOBOL_05133 [Clostridium bolteae ATCC BAA-613] gi|158436824|gb|EDP14591.1| hypothetical protein CLOBOL_05133 [Clostridium bolteae ATCC BAA-613] Length = 683 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 74/236 (31%), Gaps = 46/236 (19%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAP----LLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + Y + N+ L++ ++ ++ + + VLDVS SM Sbjct: 266 FILDYRLTGQEINCGLMLNTGEKENFFMLMVQPPERVRAEEIPPREYIFVLDVSGSM--- 322 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-----LAWGVQH 239 FG +D A I ++ ++ N L+ FS V P A ++ Sbjct: 323 FGYPLD---TAKELIGNLVGNLRDSDQFN------LILFSDTAVSMAPKSVPATAENIRQ 373 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + I R G T+ P LE A + D + I+ +TDG S + Sbjct: 374 AIDLIERQDGGGGTELAPALEQAVSLPRDPRMARS------------IVTITDGYMSDES 421 Query: 300 ID---NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSVQNSRK 350 +L + ++ G+ + A + V + Sbjct: 422 SIFSLINRNLKTAD--------FFSFGIGTSVNRYLIDGIAKAGSGEAFVVTEPSQ 469 >gi|325918668|ref|ZP_08180769.1| hypothetical protein XVE_4800 [Xanthomonas vesicatoria ATCC 35937] gi|325535118|gb|EGD07013.1| hypothetical protein XVE_4800 [Xanthomonas vesicatoria ATCC 35937] Length = 646 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 55/322 (17%), Positives = 104/322 (32%), Gaps = 51/322 (15%) Query: 53 DHSLLYTATKILNQENGNNGKKQKND---FSYRIIKNIWQTDFRNELRENGFAQDINNIE 109 D+++L IL+ N + +++D + + N R EL+ + A +N Sbjct: 343 DNTVLAALQSILDPSNTDKRAGERSDAIKDALAQVVNQVAPRSRGELKIDSPAMHVNVSN 402 Query: 110 RSTSLSIIIDD---------------QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLL 154 +TS++ + + N+ A+S E+ + H + Sbjct: 403 VTTSINTNANPDCDMGAALAVTSPLRRSFRRNMEAISAAEVSIHRRGRRMSTSHIHRVAV 462 Query: 155 ITSSVKISSKSDIGLDMMMVL--DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 S V S +I LD +VL D S SM+ G + A + L+ I Sbjct: 463 GDSRVFKSVSEEIALDTAVVLGLDASSSMS---GSKIKLTCEAVYASAVALEGI------ 513 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 V G TF P +H+ L T G+ + ++ Sbjct: 514 -EGVTCGAFTFPRN-NIILPFGRKAKHVPGHFQ-LHASGGTPMDDGVYLGIRMLQAQRK- 569 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +K +I TDGE + + + + A G + V Sbjct: 570 ----------PRKVMILTTDGEPN--SDRVESTKAAVEYALSLGIE---VAVMGTYGA-- 612 Query: 333 LKNCASPDRFYSVQNSRKLHDA 354 +N + + + + +L + Sbjct: 613 -RNVCGFENWVDLTDISQLPEV 633 >gi|302877281|ref|YP_003845845.1| von Willebrand factor type A [Gallionella capsiferriformans ES-2] gi|302580070|gb|ADL54081.1| von Willebrand factor type A [Gallionella capsiferriformans ES-2] Length = 757 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 71/188 (37%), Gaps = 36/188 (19%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + +VLD+S S+N+ G L ++ ++ + I+ + D + Sbjct: 561 RNDGRNIAVSVVLDLSESLNEKAAGSTQTILELSQEAVSLLAWAIEKLGDP-----FAIS 615 Query: 222 TFSSKIVQT--------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 F S F W ++ ++ + +T+ + +A Sbjct: 616 GFHSNTRHDVRFMHIKGFSEKWD-DEVKGRLAAMQASYSTRMGAAMRHA----------- 663 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-------RGAIVYAIGVQA 326 H + KK ++ LTDGE + ID+++ +A++ +G Y I + Sbjct: 664 SHYLEQQQADKKLMLILTDGEPAD--IDSRDGRILIEDARKAVTELDQKGIYAYCINLD- 720 Query: 327 EAADQFLK 334 AD+++K Sbjct: 721 PKADEYVK 728 >gi|145503661|ref|XP_001437805.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404961|emb|CAK70408.1| unnamed protein product [Paramecium tetraurelia] Length = 265 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 84/214 (39%), Gaps = 31/214 (14%) Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWC-ANSSHAPL 153 +L + +Q+I + + ++ + + Y+L +E+ + ++ P Sbjct: 69 QLPQTAISQEIFDDDDQVQTNL-VQAKPNMYDLEKELIFEIKTLQKMIKLSKISTQQLPG 127 Query: 154 LITSSVKISSKSD----IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 +I+ K +G+D++ ++D S SMN K+ +S++ +L + Sbjct: 128 IISIKTKDQLNDQDLNRVGVDLICLIDKSSSMNGS------KIETVKQSLKVLLTFL--- 178 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 +N R L+ F++ + PL + + I+++ T+ + + A ++ Sbjct: 179 ---SNQDRLQLIIFNTHAKRLTPLKRITEDNKLYFTQMIDQIKSDGGTQISSATQIAISQ 235 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + + + + L+DG+++ Sbjct: 236 L------KGRKYRNNVSS---VFLLSDGQDNDAT 260 >gi|146298479|ref|YP_001193070.1| hypothetical protein Fjoh_0716 [Flavobacterium johnsoniae UW101] gi|146152897|gb|ABQ03751.1| conserved hypothetical protein [Flavobacterium johnsoniae UW101] Length = 287 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 54/161 (33%), Gaps = 22/161 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L M++++D+S S G + I + + N + GL+ F Sbjct: 72 EEERELTMVLMVDISGS------EGFGSKSQFKKDIVTEIAATMAFSATQNNDKIGLILF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S + P G H+ I LI + A + Sbjct: 126 SDNVELYIPPKKGRSHVLRIIRELIEFEPKSQKTDVAQALKFL------------SGTQK 173 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIG 323 KK I+F+ + + + +++L ++ G VY I Sbjct: 174 KKAIVFM---ISDFMSENYEQTLKIASKKHDLTGVRVYDIR 211 >gi|118347184|ref|XP_001007069.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288836|gb|EAR86824.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 821 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 76/206 (36%), Gaps = 37/206 (17%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 ++++ ++D S SM F ++ +++ ++ +PD + Sbjct: 325 EKTVLNAEKHSKAQFFFLIDRSGSMCTIF----------QKARDTLIEFLQRLPDDSYF- 373 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQ-----EKINRLIFG-STTKSTPGLEYAYNKIFDAK 270 +++F S F A E+I++ T+ LE + Sbjct: 374 --NVISFGSGYQFLFEEAKKKNKQSMKSALEQISKFSADMGGTEIYQPLEKIFQ------ 425 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 +D Y+ I LTDG+ S P++ + ++A+ V+ IG+ + Sbjct: 426 -----CKNVNDLYQMQIFLLTDGQVSQPDMVVQLIRNNSHKAR-----VHCIGLGSGVDK 475 Query: 331 QFLKNCASPDR--FYSVQNSRKLHDA 354 Q L+ C+ R V N+ +L + Sbjct: 476 QLLRRCSESGRGANRQVDNASELKEV 501 >gi|4995870|emb|CAB44273.1| Serum opacity factor 13 [Streptococcus pyogenes] Length = 409 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 44/254 (17%), Positives = 98/254 (38%), Gaps = 28/254 (11%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNI 108 + A + N G + + + + + + +++ ++ +++ Sbjct: 69 QAATISTSSTPAAGTSSNSNQVTGTEAEPQTMDVERYTVDKENSKLNIKDGDKPKNRSSV 128 Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS-DI 167 ++ T L D + +D + + ++ I +V + K D Sbjct: 129 DKDTKLIRNRDGKQRD-IVDVTRTVK--------------TNEDGTIDVTVTVKPKQIDE 173 Query: 168 GLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 G D+M +LDVS M+ D F DK+ ++ + + N VR L+TF K Sbjct: 174 GADVMALLDVSKKMSEDDFNNAKDKIKKLVTTLTSKSANGQQNLNNRNTVR--LMTFYRK 231 Query: 227 IVQTFPLAWGV-QHIQEKINRLIFGSTT---KSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 I L+ + ++E++N++ T L+ A +K D +K + K Sbjct: 232 ISDPIDLSGKTSEEVEEELNKIWDKVKTKDWDWGVDLQGAIHKARDIFKKEKESKK---- 287 Query: 283 YKKYIIFLTDGENS 296 +++I+ + GE++ Sbjct: 288 -RQHIVLFSQGEST 300 >gi|84387242|ref|ZP_00990263.1| hypothetical protein V12B01_22471 [Vibrio splendidus 12B01] gi|84377889|gb|EAP94751.1| hypothetical protein V12B01_22471 [Vibrio splendidus 12B01] Length = 142 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 55/151 (36%), Gaps = 15/151 (9%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 + +G ++ AI PV+ +GLVIE + + + + Y +++ + + G Sbjct: 6 KHQRGFAAVEMAIATPVLLFFLGLVIELGNVLIHYNVISKSVQNGARYAVSEVYDTKGGT 65 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 + +N + + + T+ + + D + Sbjct: 66 IAPTLE---------------IQNVVVYGQSSVGTAVLSTLTTADVTVTPPSIDSYVRVS 110 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 Y+ +F + P A S PL +TS +++ Sbjct: 111 VTYDYVPLFLSIPLSATSFSIPLSVTSVMRV 141 >gi|311693026|gb|ADP95899.1| von Willebrand factor, type A-like protein [marine bacterium HP15] Length = 590 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 71/201 (35%), Gaps = 41/201 (20%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-----GLVTFS 224 + ++LD S SM+ + +A ++ + I +IP + V G V+ Sbjct: 424 SVHVLLDTSGSMSQ-------RQEIANQATVSLALAISTIPKCDIAVSMFPGCGGSVSPM 476 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q G + G T + YA ++ + + + Sbjct: 477 IHRGQPVRPNLGRFLVSS-------GGGTPLAEAMLYAARELSASHK-----------PR 518 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 + +I +TDG ++ + N + + K + YAIG+ + A + + Sbjct: 519 QVLIVITDGSPNNGHAVN----YLLDLMKHQ-IDTYAIGIGSNAVKSYF------GNWTV 567 Query: 345 VQNSRKLHDAFLRIGKEMVKQ 365 + + R+L A RI ++ Sbjct: 568 INDVRELQSALFRIAGNVLDL 588 >gi|222080976|ref|YP_002540339.1| hypothetical protein Arad_7191 [Agrobacterium radiobacter K84] gi|221725655|gb|ACM28744.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 649 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 40/115 (34%), Gaps = 6/115 (5%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F + KG I+I+T LPV+ L IE + V+ + D + A N Sbjct: 12 LRLFCDDRKGGIAIMTVFCLPVVIGFAALSIEYGYGLLVRDQNQRTADLASYAGALAYSN 71 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 N + + + + L A ++ ++ + + + Sbjct: 72 -ANSEDQMTDA-ALRVAKLNGVDAANVVVSL----TASPKDSRVQAVHVDVTTTN 120 >gi|197337244|ref|YP_002158801.1| iron-regulated protein FrpC [Vibrio fischeri MJ11] gi|197314496|gb|ACH63945.1| iron-regulated protein FrpC [Vibrio fischeri MJ11] Length = 3927 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 50/143 (34%), Gaps = 20/143 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF---SSKI 227 + + LD S SM D +++ + S ++L+ I++ P+ + V+ L+ F S Sbjct: 3451 ITLALDSSGSMGDVEVDDKERMQLVLESSIKLLEDIQNQPN-SGTVQVQLIDFDNRHSDE 3509 Query: 228 VQTFPLAWGVQHIQEKI-------------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + G + + I + G T + + Y + Sbjct: 3510 TNDTATSLGWYSVADAITHLQDAIEIEMISDPYHIGGGTDYSEAI---YAILDGYSNDKV 3566 Query: 275 HIAKGHDDYKKYIIFLTDGENSS 297 + I F++DG N+ Sbjct: 3567 PNDVDLSNTNDVIYFISDGHNNE 3589 >gi|257880953|ref|ZP_05660606.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257892525|ref|ZP_05672178.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257816611|gb|EEV43939.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257828904|gb|EEV55511.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1107 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 52/136 (38%), Gaps = 23/136 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +SS+ Sbjct: 267 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 320 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A + + Sbjct: 321 GYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGH-------- 372 Query: 281 DDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 373 ---KKVIVLLTDGVPT 385 >gi|253569758|ref|ZP_04847167.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251840139|gb|EES68221.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 621 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 76/195 (38%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSV-KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N+ H + I +I + +++ ++DVS SM G + Sbjct: 217 VKITMEAGVCPW--NADHRLVRIGLKAREIPTDKLPESNLVFLIDVSGSM-----WGPTR 269 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V VV +G ++ + Q I++ I+ L G Sbjct: 270 LDLVKSSLKLLVNNLREKDKVAIVVYAG----NASVKLESTPGSDKQKIRDAIDELTSGG 325 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G++ AY +H + + II +DG+ + + + Sbjct: 326 STAGGAGIQLAYKV-------AKHNFLPKGNNR--IILCSDGDFNVGVSSVEGLEQLIEK 376 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 377 ERKSGVFLSVLGYGM 391 >gi|134299282|ref|YP_001112778.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] gi|134051982|gb|ABO49953.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] Length = 599 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 75/199 (37%), Gaps = 22/199 (11%) Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 ++++++P +S++ + + I++++ LD+ +++D S SM G Sbjct: 381 QVINQFQLPSGELDEDSLYQASYSNRIFRQTEIINTRTRK-LDICLLVDTSASMVYPAGK 439 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRS----GLVTFSSKIVQTFPLAWGVQHIQEK 243 + ++ +A ++ ++ + + V G V + P + + Sbjct: 440 EISRVELARNLAALFVEALEPVDSIKTWVFGFNLKGAV---NMYELYSPALPN----KAR 492 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 I TT ++YA L +A+G +K +I + DG N +P + K Sbjct: 493 IGVTGAEGTTPEGTAVKYA---------ALRMMAEGRPFVQKVLIVIADG-NPNPGPETK 542 Query: 304 ESLFYCNEAKRRGAIVYAI 322 K G +I Sbjct: 543 LVKEQVKRLKALGCKTISI 561 >gi|313203642|ref|YP_004042299.1| hypothetical protein Palpr_1167 [Paludibacter propionicigenes WB4] gi|312442958|gb|ADQ79314.1| protein of unknown function DUF58 [Paludibacter propionicigenes WB4] Length = 288 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 43/111 (38%), Gaps = 10/111 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +++++DVS S + + I + + + N + G++ F Sbjct: 72 EEERELTVVLLIDVSGSREF------GTVSQLKKDIFTEVAATLAFSTIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAK 270 S KI + P G +H+ + I L + T L Y N I + Sbjct: 126 SDKIEKFIPPKKGKKHVLQIIRELIDFEPESNKTDIAGALRYLTNAIKKSS 176 >gi|59714345|ref|YP_207120.1| RTX repeat-containing calcium-binding cytotoxin RtxA2 [Vibrio fischeri ES114] gi|59482593|gb|AAW88232.1| RTX (repeats in toxin) calcium-binding cytotoxin RtxA2 [Vibrio fischeri ES114] Length = 3933 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 50/143 (34%), Gaps = 20/143 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF---SSKI 227 + + LD S SM D +++ + S ++L+ I++ P+ + V+ L+ F S Sbjct: 3457 ITLALDSSGSMGDVEVDDKERMQLVLESSIKLLEDIQNQPN-SGTVQVQLIDFDNRHSDE 3515 Query: 228 VQTFPLAWGVQHIQEKINRLI-------------FGSTTKSTPGLEYAYNKIFDAKEKLE 274 + G + + I L G T + + Y + Sbjct: 3516 TNDTATSLGWYSVADAITHLQDAIEIEMISDPYYIGGGTDYSEAI---YAILDGYSNDKV 3572 Query: 275 HIAKGHDDYKKYIIFLTDGENSS 297 + I F++DG N+ Sbjct: 3573 PNDVDLSNTNDVIYFISDGHNNE 3595 >gi|307249749|ref|ZP_07531728.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306858257|gb|EFM90334.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 530 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 45/265 (16%), Positives = 97/265 (36%), Gaps = 27/265 (10%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G +++ +L I ++ + +E++ +A+L L+ ++L + N Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 67 ENGNN----GKKQKNDFSYRIIKNIWQTD---FRNELRENGFAQDINNIERSTSLSIIID 119 N+ G K + S+ I + + D ++ + + + D Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNEEKMHLIPTCKTKTD 129 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI--------GLDM 171 K + S+ + W + +I V ++SKS +D+ Sbjct: 130 TNKKGHTSSSEVTCTVSGTIEHKSWFPLKVGSVEVIPHEVNVASKSKAFKKNTFNIPIDL 189 Query: 172 MMVLDVSLSMNDHF-------GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 M+V D+S SMN G M KLG+ + E+ + + N+ R + F+ Sbjct: 190 MVVADLSGSMNYDLSNKNEIVGSPMSKLGILQDVLSELAEKTLLSEEANHNNRIYVTPFA 249 Query: 225 SKIVQTF-----PLAWGVQHIQEKI 244 + P +W + +++ Sbjct: 250 LGAEISSSNCAIPYSWDMNKNNQEL 274 >gi|170690099|ref|ZP_02881266.1| von Willebrand factor type A [Burkholderia graminis C4D1M] gi|170144534|gb|EDT12695.1| von Willebrand factor type A [Burkholderia graminis C4D1M] Length = 329 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 76/252 (30%), Gaps = 38/252 (15%) Query: 145 CANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + + + + + G +++++D S SM++ ++ + Sbjct: 58 VLAMLAIVVGLAGPGRSHREVLRTGSGAQILILMDRSASMDEPINSKGVEVSAGESKNKV 117 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST---TKSTPG 258 + R + F + + P + + I I G T+ G Sbjct: 118 ARASLTEFVAQRPNDRLAFMMFGTSPLLAMPFTYDHRAIDAAIAGTSVGRGMPDTQLDLG 177 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 L A + + H ++ I+ ++DG ++ + + R Sbjct: 178 LLTAIGEF----------NQQHSSSRRAIVLVSDG---GAKLNARVRQLIEDGLLRNDIA 224 Query: 319 VYAI----------------GVQAEAADQFLKNCAS---PDRFYSVQNSRKLHDAFLRIG 359 +Y I ++ A + + S P R + N++ + DA I Sbjct: 225 LYFIYLRSSIYSPDLNAKVPASESSAEAELHRYFLSLKTPYRLFQTGNAKAMRDAMAEIN 284 Query: 360 KEMVKQRILYNK 371 ++ Q + Sbjct: 285 RQQNAQTTFVER 296 >gi|118359381|ref|XP_001012930.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89294697|gb|EAR92685.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 713 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 67/198 (33%), Gaps = 29/198 (14%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-----SIPD 211 + KS + + V+DVS SM G + +LDI+K + + Sbjct: 52 QILSPKGKSKVSNSICCVVDVSGSMGSRAVTKQS--GGNSELGYSVLDIVKHSLNTIVQN 109 Query: 212 VNNVVRSGLVTFSSKI-----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 ++ +VTFS Q + ++ + IN+ ++T G+E ++ Sbjct: 110 LDEGDEFSMVTFSDNSKLVCNYQQMTES-NIKSSVDLINQCQPDASTNIWAGIEQGLEQM 168 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI-----VYA 321 + K ++ +I LTDG+ + + L N + I + Sbjct: 169 QNDSNKNKNQQ---------LIVLTDGQPN--VNPPRGILTTLNNFYNKNIISPKPSINT 217 Query: 322 IGVQAEAADQFLKNCASP 339 G L N A Sbjct: 218 FGFGYYLDSHLLFNIAQD 235 >gi|83645400|ref|YP_433835.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83633443|gb|ABC29410.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 749 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 72/200 (36%), Gaps = 35/200 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++DVS SM + R ++ + +P+ ++ G+ TF + + Sbjct: 33 DVRVLIDVSGSMKKNDP------KNLRRPALNLVTEL--LPEGDSA---GVWTFGQYVNE 81 Query: 230 TFP---LAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P + G + + + R I + T LE A + K+ Sbjct: 82 LAPHQVVDPGWRRLAKDKAREISSTALYTNIGAALEKASEDFVEGKDYSNT--------- 132 Query: 285 KYIIFLTDG---ENSSPNIDNKESLFYCNEAKRR----GAIVYAIGVQAEAADQFLKNCA 337 + I LTDG + P + E + +R GA ++ I + A L+ + Sbjct: 133 -HFILLTDGVVDISQKPGENVAERDRVLTQVLKRVAGFGAKIHTIALSRNADQMLLQRLS 191 Query: 338 SPDRFYS--VQNSRKLHDAF 355 + +NS +L F Sbjct: 192 IGSNGINAIAENSEQLSRVF 211 >gi|324991258|gb|EGC23192.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK353] Length = 451 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 59/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM G + + +++I R GL TFS Sbjct: 174 KAGSADIVFVVDRSGSMGGTIGIVRANIN----------EFVRNITKEGITARFGLATFS 223 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 224 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTTALNQIIS 283 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 284 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 337 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 338 G--IYKNFATEGRVLDIEN 354 >gi|260801245|ref|XP_002595506.1| hypothetical protein BRAFLDRAFT_69088 [Branchiostoma floridae] gi|229280753|gb|EEN51518.1| hypothetical protein BRAFLDRAFT_69088 [Branchiostoma floridae] Length = 1641 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 68/188 (36%), Gaps = 28/188 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++++D S S+ P + K +L + N + G+ + S + Sbjct: 587 IILLVDGSKSVTYLNFPNVLKF---------ILKLAAGFEIGPNAAKLGVYQYGSDVRTE 637 Query: 231 FPLA-WGVQH--IQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 FP+ + + + +N T + LE Y DD KK Sbjct: 638 FPIGQYNTREDVLNAVLNIQYMNQWGTFTGKALEEVYKTF-----------PAGDDAKKV 686 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +I +TDG+ +D + + K GA++ A+GV + +S D ++ Sbjct: 687 VIIITDGK----AMDEEVLRKASQDVKADGAMICAVGVGGFRLKELSLLASSQDLVFTAT 742 Query: 347 NSRKLHDA 354 + K+ Sbjct: 743 DFDKMDAI 750 >gi|29349873|ref|NP_813376.1| putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] gi|29341784|gb|AAO79570.1| conserved hypothetical protein, putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] Length = 621 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 77/195 (39%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSV-KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N+ H + I +I + +++ ++DVS SM G + Sbjct: 217 VKITMEAGVCPW--NADHRLVRIGLKAREIPTDKLPESNLVFLIDVSGSM-----WGPTR 269 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V VV +G ++ + Q I++ I+ L G Sbjct: 270 LDLVKSSLKLLVNNLREKDKVAIVVYAG----NASVKLESTPGSDKQKIRDAIDELTSGG 325 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G++ AY + KG++ II +DG+ + + + Sbjct: 326 STAGGAGIQLAYKVAKQN-----FLPKGNNR----IILCSDGDFNVGVSSVEGLEQLIEK 376 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 377 ERKSGVFLSVLGYGM 391 >gi|308510204|ref|XP_003117285.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] gi|308242199|gb|EFO86151.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] Length = 409 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 47/153 (30%), Gaps = 14/153 (9%) Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG------VQHIQ 241 G+ ++ ++ I R GLV++++ L + Sbjct: 69 GLTQVDGNIATVFGYDTRIGVRDYEPKTTRVGLVSYNADAKILAGLDTYQSYDDLANGVF 128 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + +N + + GL A K+F+ + + YKK +I +S Sbjct: 129 DSLNSVSATDESYLAKGLSAA-EKVFEEGKSTANR----TQYKKVVIVY---ASSYKGTG 180 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + + K G + + D LK Sbjct: 181 ELNPVPVADRMKTAGVKIITVAFSQNNDDGLLK 213 >gi|300936621|ref|ZP_07151525.1| von Willebrand factor type A domain protein [Escherichia coli MS 21-1] gi|300458202|gb|EFK21695.1| von Willebrand factor type A domain protein [Escherichia coli MS 21-1] Length = 219 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A N + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAHGDTPMGAAITKALNMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|239980987|ref|ZP_04703511.1| putative surface-anchored fimbrial subunit [Streptomyces albus J1074] Length = 893 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/259 (13%), Positives = 77/259 (29%), Gaps = 42/259 (16%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFI-FCTFPWCANSSHAP----LLITSSVKISSKSDIGL 169 ++ + + L+A + Y F + S+ +S V + GL Sbjct: 96 TVTTPYRFQTPRLAAATTYSSTATGPGGFMVGSGGSNRTASTGTWQSSRVNPALPERCGL 155 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +++D+S SM+ + +D ++ P ++V R ++S Sbjct: 156 RVALIMDLSGSMSGSVP-------ALKTAADTFVDALQGTP--SSVARFTFSSYS----- 201 Query: 230 TFPLAWGVQHIQEKINRLIFGSTT---KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P G + ++ ++ A + ++ + Sbjct: 202 --PATRGGANAPGLVSVSTLADAAAFKRTYASWTNATAEGSTNWDRALYEPASATSQYDV 259 Query: 287 IIFLTDGENSSPNI----------------DNKESLFYCNEAKRRGAIVYAIGV--QAEA 328 + +TDG ++ +I + + + N K G V A+GV Sbjct: 260 AVVITDGMPTNYSIPGGPSGGASGSVTRFRELESGIASANALKNEGTRVLAVGVGEGTSG 319 Query: 329 ADQFLKNCASPDRFYSVQN 347 S Y N Sbjct: 320 NAALNLASISGTEKYDGDN 338 >gi|209521403|ref|ZP_03270114.1| von Willebrand factor type A [Burkholderia sp. H160] gi|209498150|gb|EDZ98294.1| von Willebrand factor type A [Burkholderia sp. H160] Length = 275 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 40/268 (14%), Positives = 72/268 (26%), Gaps = 43/268 (16%) Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSR-----YEMPFIFCTFPWCANSSHAPLLITSSV 159 + + SL + Q + ++ R P + + Sbjct: 14 VTATACAASLQAVAVSQPEPGTVTVTVRAPGAAPSADAFTLQLPGADTTPTRVPAQSVEA 73 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 D+ +++ +D S SM+ L + +R Sbjct: 74 ANELSPDLATAVLLCVDRSGSMHSAVPAIKAALK-------------DVLARPRPDLRIA 120 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEH 275 L++F S P + E ++ + T+ L + + + Sbjct: 121 LMSFGSDTPAPTPFYSESAPVIEAVDAIRAETGRDGKTRLYDALNIGMSMLAN------- 173 Query: 276 IAKGHDDYKKYIIFLTDG--ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQ 331 K+ I+ +TDG E S D L RG + AI A+ Sbjct: 174 --VPLRGPKRLIV-ITDGKDEGSQTRFDVLSVLLQ-----GRGQPMDAIAFGQSAQKTSS 225 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLR 357 L A S F N L +A Sbjct: 226 GLATLANKSSGAFVLATNPSSLVEALRN 253 >gi|145482457|ref|XP_001427251.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394331|emb|CAK59853.1| unnamed protein product [Paramecium tetraurelia] Length = 568 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 66/179 (36%), Gaps = 29/179 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ V+DVS SM G + + + R I+++L R LVTF Sbjct: 127 AKANIDLVCVVDVSGSME---GEKISLVKDSLRYIQKILSPND---------RIALVTF- 173 Query: 225 SKIVQTFPLAW------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L W Q I++ I + +T G++ I + K+K Sbjct: 174 -GTYSGINLPWTINKPENKQKIKDAIIGMKIRDSTNIADGVKLGLRMIKERKQK------ 226 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + + LTDG++ + D + + ++ + G + + + N + Sbjct: 227 ---NPVTCMFVLTDGQDDNKGADERCQQAINEYQIQDTFVINSFGYGQDHDAKVMNNIS 282 >gi|47218505|emb|CAF97239.1| unnamed protein product [Tetraodon nigroviridis] Length = 1060 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 56/173 (32%), Gaps = 22/173 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + + I ++ K VR G+ F Sbjct: 462 DIVFLVDESWSVGQNSFSHV------KDFISAIITSFKDSVVGTEGVRFGVTVFGDVPKM 515 Query: 230 TFPLAW--GVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + + I L + + + L + +F + +H K Sbjct: 516 RIALTDYSSQEEVLRAIRDLPYEGRSRRIGDALTFLVQHVFSPVIRRDHGP-------KI 568 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + +T+G + P + G ++A+GV A + + + P Sbjct: 569 AVLITNGRSDDPVDAAARLVA------DSGISLFAVGVGGADASELRRMVSEP 615 >gi|292619294|ref|XP_692164.4| PREDICTED: integrin alpha-M-like [Danio rerio] Length = 806 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 81/264 (30%), Gaps = 40/264 (15%) Query: 121 QHKDYNLSAVSRYEMPFIFCTF----PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 QH + A+ +P T C + +D+ +LD Sbjct: 88 QHPQSSKLAICGPTIPKNCTTATNYRGMCFIGNSGDFGPPIPKSKYRDCLGQIDIAFLLD 147 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI---VQTFPL 233 S S G+ V + ++ + + + +S+ L Sbjct: 148 GSGS------IGIYDFTVMKGFVTNVIRRF-----IERDAQFAIAQYSNDCDIHYNFNDL 196 Query: 234 AWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + K+ + + T + ++ N +F AK K ++ +TD Sbjct: 197 KLDDGTWESKVANIPYHEGGTFTASAIQKLVNYLFTPNGGTRPSAK------KILVVITD 250 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE----AADQFLKNCASP---DRFYSV 345 GE + D ++A++ + +AIGV A + L AS D + V Sbjct: 251 GE----SHDRNLLKDAASQAEKNSIVRFAIGVGKAFDYYNAREELNTIASDPDTDYVFKV 306 Query: 346 QNSRKLHDAFLRIGKEMVKQRILY 369 + L I +++ I Sbjct: 307 TDFNAL----KNILQKLEGNIIAI 326 >gi|114775649|ref|ZP_01451217.1| rubisco activation protein cbbO [Mariprofundus ferrooxydans PV-1] gi|114553760|gb|EAU56141.1| rubisco activation protein cbbO [Mariprofundus ferrooxydans PV-1] Length = 745 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 39/93 (41%), Gaps = 17/93 (18%) Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 WG ++ ++ + G +T+ + +A H K KK ++ LTDGE Sbjct: 625 WG-DEVKSRLAEMEAGFSTRMGGAMRHA-----------AHYLKAQQADKKLMLILTDGE 672 Query: 295 NSSPNIDNKE-----SLFYCNEAKRRGAIVYAI 322 S ++D+++ + NE R G Y I Sbjct: 673 PSDIDVDDEQLLIADARQAVNELDRDGIYSYCI 705 >gi|253996156|ref|YP_003048220.1| von Willebrand factor type A [Methylotenera mobilis JLW8] gi|253982835|gb|ACT47693.1| von Willebrand factor type A [Methylotenera mobilis JLW8] Length = 2114 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 72/190 (37%), Gaps = 27/190 (14%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDH----------FGPGMDKLGVATRSIREML 203 IT +++ +S + + ++++VLD S SM + +++ +A +I +++ Sbjct: 1344 NITQTLQAASAA-LTYNVVIVLDRSGSMAQDANGLWSNQSGYDASTNRMEIAKEAIAQLI 1402 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + +VN VTFSS V++ V ++ + G T+ + L Sbjct: 1403 ARYDGLGNVN----VKFVTFSSDAVESEWYIDNVTGAVRYVDNVQAGGGTQYSTALNETM 1458 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + K F+TDG+ NS +D + N G I + I Sbjct: 1459 SGFTQPVAD-----------KTLFYFITDGQPNSGYEVDATLQTQWQNFVAANGNISFGI 1507 Query: 323 GVQAEAADQF 332 G+ + Sbjct: 1508 GIGTASLSSL 1517 >gi|114567581|ref|YP_754735.1| hypothetical protein Swol_2070 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338516|gb|ABI69364.1| hypothetical protein Swol_2070 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 776 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 69/205 (33%), Gaps = 42/205 (20%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK--IVQTF 231 ++D+S SM K+ A +I+ L + N L+ F S+ Sbjct: 282 LIDISRSMEGK------KIEHAADAIQICLRNLDEGDSFN------LLAFESENHAFAPK 329 Query: 232 PLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + +++ + L T P ++ A + D +K +I Sbjct: 330 SLPYNQENLDKASAWVKNLHAMGGTNILPAVQLALKEAGDQ--------------QKVVI 375 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR-----FY 343 TDG+ + N E + Y + + + ++++G+ F+ A Y Sbjct: 376 LATDGQVGNEN----EIINYVRK-RNQNLCLFSLGIDTAVNSYFINQIAEAGNGCAEFSY 430 Query: 344 SVQNSRK-LHDAFLRIGKEMVKQRI 367 ++ + + F RI + Sbjct: 431 PGESLEEKMLRHFARINATSMDNVT 455 >gi|326407579|gb|ADZ64650.1| conserved hypothetical protein [Lactococcus lactis subsp. lactis CV56] Length = 1433 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 48/142 (33%), Gaps = 29/142 (20%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 +D+++V+D+S SM K + + + L I++ + V G+V +SS Sbjct: 307 KPVDIVLVIDMSGSMQGA------KETAVRQGVSDFLSTIQNTA-YADYVNVGIVGYSSP 359 Query: 226 ---------KIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLE 274 I H++ L F T + GL + Sbjct: 360 GNYVTGASGYITVPIDKVSSESHVKSINQALAPQFSGGTFTQLGLRKGTEMLEQDSSDN- 418 Query: 275 HIAKGHDDYKKYIIFLTDGENS 296 +K +I +TDG + Sbjct: 419 ---------QKMMILMTDGVPT 431 >gi|224171915|ref|XP_002198669.1| PREDICTED: similar to collagen, type XX, alpha 1, partial [Taeniopygia guttata] Length = 152 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 48/123 (39%), Gaps = 15/123 (12%) Query: 220 LVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L +SS + L+ + + E + L + G T + L + + L+ Sbjct: 1 LSQYSSDPRTEWELSTYSTREQVLEAVRNLRYKGGNTFTGLALTHVLE------QNLKPD 54 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 A + +K +I LTDG++ ++ K G ++AIGV+ + + Sbjct: 55 AGARLEAEKLVILLTDGKSQD------DANLAAQTLKNLGIEIFAIGVKNADEAELRQVA 108 Query: 337 ASP 339 + P Sbjct: 109 SEP 111 >gi|158337607|ref|YP_001518782.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158307848|gb|ABW29465.1| von Willebrand factor type A domain protein, putative [Acaryochloris marina MBIC11017] Length = 708 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 60/187 (32%), Gaps = 32/187 (17%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S + D++ ++D S S GP + + + L+ + Sbjct: 332 KYKSNQIVPKDVVFLIDTSGS---QSGPPIVQSRKLMTQFLDKLNPNDTFS--------- 379 Query: 220 LVTFSSKIVQ--TFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ FS+ + PLA + E I +L T+ G+ + Sbjct: 380 IINFSNTTSKLSPKPLANTPANRKKALEYIKKLDANGGTELMNGINT-----------VA 428 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 D + ++ LTDG D++ + + + G +Y GV L Sbjct: 429 AFPPAPDGRLRSVVLLTDGLIG----DDETIIAAVRDRLKPGNRIYPFGVGFSTNRFLLD 484 Query: 335 NCASPDR 341 A R Sbjct: 485 RLAEVGR 491 >gi|228931399|ref|ZP_04094324.1| hypothetical protein bthur0010_60390 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|228828205|gb|EEM73915.1| hypothetical protein bthur0010_60390 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] Length = 448 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 80/217 (36%), Gaps = 32/217 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + L++ ++LD S SM G K+ A ++I LD I +V V + + Sbjct: 144 KEKSLNVEILLDASGSMAGKVN-GQVKMEAAKKAIYNYLDKIPDNSNVMLRVYGHKGSNN 202 Query: 225 SKIVQTFPLAWGVQHI--------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L+ G + +E+ N + K L A + D ++ Sbjct: 203 EN---DKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDFKEYTGE 259 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQ 331 + YI+ +DGE + N AK A+V IG + ++Q Sbjct: 260 ENLNVV---YIV--SDGEETCGG-------EPVNAAKNLNQSSTHAVVNIIGFDVKNSEQ 307 Query: 332 F-LKNCASP--DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 LKN A + +V + +L+ + +++ KQ Sbjct: 308 LQLKNTAEAGKGNYATVSTADELYQTLNKEYEKLYKQ 344 >gi|145516893|ref|XP_001444335.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124411746|emb|CAK76938.1| unnamed protein product [Paramecium tetraurelia] Length = 1460 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 71/199 (35%), Gaps = 38/199 (19%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK----IV 228 ++LD S SM F A + + L I+ P+ R ++ F+ + + Sbjct: 1278 LILDDSGSMEGAF------FEAAKKGLVAFLQEIQKNPES----RVTIILFNHQARCVVD 1327 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P Q Q++I G T L+ A++KI + + + I Sbjct: 1328 YEIP---DAQVQQKEIQ--FRGGGTDFDEPLKLAFDKIANNPDFDNFSSHS-------IF 1375 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 F TDG+ P ++ + ++ KR + A + ++ ++ +++ Sbjct: 1376 FYTDGQAQYPTKAMEKVKQFPSD-KREKIELVACSFE-DSPTTLVRVVEFGKQYF----- 1428 Query: 349 RKLHDAFLRIGKEMVKQRI 367 F +I M Q I Sbjct: 1429 -----GFAKIQASMEPQMI 1442 >gi|307591429|ref|YP_003900228.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986283|gb|ADN18162.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 426 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 60/181 (33%), Gaps = 26/181 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++++LD+S SM G K+ A ++IRE +I K D + +V F Sbjct: 108 IIVLLDMSGSMAKEDSRGTTKIEGAIKAIREFTEIAK---DRGGNTQVSIVPFGDPGKNC 164 Query: 231 FPLAWGVQHIQEK--------------INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + L ++T L A + + + Sbjct: 165 AGYPIDSNTLDNFSRVDDAKLQIFLDNLASLSPCASTNLYEPLSKAVRFLGKKNDSRFYP 224 Query: 277 AKGHDD-----YKKYIIFLTDGENSSPNIDNKESLFYCNEA--KRRGAIVYAIGVQAEAA 329 + + II L+DG ++ PN + N+ K IV+ +G A Sbjct: 225 LDSSGNPIEPQPRLSIILLSDGYHNKPN--EAQDFQSLNQLLKKNNQIIVHTLGYGLTAQ 282 Query: 330 D 330 Sbjct: 283 Q 283 >gi|296133928|ref|YP_003641175.1| hypothetical protein TherJR_2435 [Thermincola sp. JR] gi|296032506|gb|ADG83274.1| hypothetical protein TherJR_2435 [Thermincola potens JR] Length = 621 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 41/110 (37%), Gaps = 11/110 (10%) Query: 1 MSFLN--IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLY 58 M +N + F N KG++++ I+ ++ I +GL I+ + + +L + + + Sbjct: 1 MYIINSVLERFLKNEKGTVTVYLVIVFMIMVIFIGLFIDLARIKTAQNQLRRVANAAACS 60 Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNI-WQTDFRNELRENGFAQDINN 107 + K DF K + DF ++ N N Sbjct: 61 VLA--------DYHTSTKQDFGLFTYKGANYDQDFAKYVKANLTFSADQN 102 >gi|302381356|ref|YP_003817179.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] gi|302191984|gb|ADK99555.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] Length = 416 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 40/102 (39%), Gaps = 1/102 (0%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 F + +G+I+++ A+ P + ++ +E +AKL I D + L A ++ + Sbjct: 14 FGRDERGNIALIFALSTPAVVLISVGAVELGSVQSNRAKLQDIADTAALAGANELALAID 73 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIER 110 ++ F + W++ +D + + Sbjct: 74 DAAAIERAKVFIDGHVSE-WKSAPAVTPEIAVILRDKQRVIQ 114 >gi|146455221|dbj|BAF62177.1| complement factor B [Triakis scyllium] Length = 765 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 60/178 (33%), Gaps = 25/178 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 L + ++LDVS S+ D + T S +M+ + + G+VTF S Sbjct: 260 GQKLHIYIILDVSGSIKK--VEFCDAISALT-SFIDMMSRFEVVIHY------GVVTFGS 310 Query: 226 KIVQTFPLAWGVQHIQEKINRLIF-----------GSTTKSTPGLEYAYNKIFDAKEKLE 274 + + + + + T T L+ Y + K ++ Sbjct: 311 RSQIIVNIGHAESGFPDYVADVPKDLKYEDVVMGNNKGTNMTGALKTVYEMMSFQKASMK 370 Query: 275 HIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + + + I+ TDG + P +DN E ++ VY G+ + Sbjct: 371 NRQLAWIEVRHVIMIFTDGRTNMGGSPKPMMDNIEGFLEVKNSREDFLDVYVFGLGDD 428 >gi|74189919|dbj|BAE24585.1| unnamed protein product [Mus musculus] Length = 319 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 79/231 (34%), Gaps = 37/231 (16%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKIS---- 162 ++ ++ ++ + + V+ + F+ C + + H I S V + Sbjct: 99 VNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVV 158 Query: 163 ------SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + LD+++VLD S S + T + ++L + P Sbjct: 159 NSFAPVQECSTQLDIVIVLDGSNS--------IYPWESVTAFLNDLLKRMDIGPKQTQ-- 208 Query: 217 RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEK 272 G+V + + + F L + + N++ T + G++ A + F Sbjct: 209 -VGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEAFTEARG 267 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 K K ++ +TDGE S N K+ + C ++I Sbjct: 268 ARRGVK------KVMVIVTDGE-SHDNYRLKQVIQDCE---DENIQRFSIA 308 >gi|291231094|ref|XP_002735502.1| PREDICTED: ubiquitin-conjugating enzyme E2D 1-like [Saccoglossus kowalevskii] Length = 1053 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 73/209 (34%), Gaps = 20/209 (9%) Query: 170 DMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 ++++LD+S SM + M K+ D + + GL TF + IV Sbjct: 531 AIVVLLDISASMAELCLKQSMKKIEAVKHLFHAFADRSIAYDFWHV---IGLTTFCTDIV 587 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + +EK++++ +T + A +++ + E + K+ I+ Sbjct: 588 VVDECTEALNSFKEKVDQVFPRDSTAMYDAVINAVSQLNEIGETYPNC-------KRRIL 640 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKN----CASPDRF 342 LTDG + N+ K + +R + ++ V + K+ C P Sbjct: 641 CLTDGYD---NVSKKSIVEAATSLQRSNVCLDSVLVGGANDNLKSLTKSTGGCCFHPTSL 697 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 ++ ++ V + K Sbjct: 698 KEALKLFEMETVLSVRARKTVTKAKTITK 726 >gi|294502680|ref|YP_003566742.1| hypothetical protein YPZ3_0570 [Yersinia pestis Z176003] gi|262360710|gb|ACY57431.1| hypothetical protein YPD4_0522 [Yersinia pestis D106004] gi|262364657|gb|ACY61214.1| hypothetical protein YPD8_0524 [Yersinia pestis D182038] gi|294353139|gb|ADE63480.1| hypothetical protein YPZ3_0570 [Yersinia pestis Z176003] Length = 205 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 61/166 (36%), Gaps = 12/166 (7%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 M++D S SM + I+ M+ ++ P V ++T+ ++ + P Sbjct: 1 MLIDTSGSMRGE------SIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIP 54 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L +++ Q + T + LE + + ++ + KG + +TD Sbjct: 55 LT-ALENFQFTDITVPSAGGTFTGAALECLIHCVDRDIQRSDGDQKGDWRP--LVFLMTD 111 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 S+P+ KR + A V A+A + LK S Sbjct: 112 ---STPSDVYAYGEAIKEVKKRAFGSIIACAVGAKAKHEHLKQLTS 154 >gi|227827415|ref|YP_002829194.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|229584630|ref|YP_002843131.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|238619571|ref|YP_002914396.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|227459210|gb|ACP37896.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228019679|gb|ACP55086.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|238380640|gb|ACR41728.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 380 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 74/190 (38%), Gaps = 33/190 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G ++ LD S SM + K+ +A + +++ K IP N +TFS Sbjct: 34 SATGFHYIVALDTSGSMTGY------KIELAK---QGAIELFKRIPKGNK---VSFITFS 81 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S + + +I ++ G T + A +AK + Sbjct: 82 SNVNVIKEFV-DPLDLTNEILQIAAGGQTALYTAILTA-----------NSLAKKYQMPT 129 Query: 285 KYIIFLTDGENSS-PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 Y++ LTDG + N+ N L Y + VY+ G+ + +Q L++ + + Sbjct: 130 -YLLLLTDGNPTDETNVGNYLKLPYFEKM-----QVYSFGIGDDYNEQLLQSISDKTSGV 183 Query: 342 FYSVQNSRKL 351 Y + ++ ++ Sbjct: 184 MYHISDANEI 193 >gi|150024247|ref|YP_001295073.1| hypothetical protein FP0135 [Flavobacterium psychrophilum JIP02/86] gi|149770788|emb|CAL42253.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86] Length = 288 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 56/165 (33%), Gaps = 29/165 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L MM+++D+S S + FG I + + N + GL+ F Sbjct: 72 EEERELTMMLMVDISGS--ESFGTK----NQLKSEIVTEIAATMAFSATQNNDKIGLILF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + +I P G H+ I LI + T + L+ Sbjct: 126 TDEIELYIPPKKGKPHVLRIIRELIEFKPKSNKTDISQALK----------------FLS 169 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIG 323 KK I+F+ + D +++L + G VY I Sbjct: 170 GTQKKKAIVFMI--SDFITEDDYEKTLKIAGKKHDITGIRVYDIR 212 >gi|312138512|ref|YP_004005848.1| hypothetical protein REQ_10630 [Rhodococcus equi 103S] gi|325676202|ref|ZP_08155882.1| hypothetical protein HMPREF0724_13665 [Rhodococcus equi ATCC 33707] gi|311887851|emb|CBH47163.1| putative secreted protein [Rhodococcus equi 103S] gi|325552986|gb|EGD22668.1| hypothetical protein HMPREF0724_13665 [Rhodococcus equi ATCC 33707] Length = 545 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 63/193 (32%), Gaps = 7/193 (3%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V DVS SM + GP S + GL S + Sbjct: 345 LVVEDVSGSMAEQAGPETRIALTVQASETGARLFPDNAQLGLWAFSIGLGGGSQDYKELA 404 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY--IIF 289 P+ + + +R ++ P L ++D K D +I Sbjct: 405 PIRRLDETVDGVSHRQRLTDAVRTLPSLVKGGTGLYDTTLAAFRKVKEGYDPAAINSVIL 464 Query: 290 LTDGENSSPNIDNKESLFYC---NEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYS 344 LTDG N P+ + + L + R I+ IG+ +A L+ A+ + Sbjct: 465 LTDGANEDPSTISLDELLATLKREQDPARPVIIVTIGITEDADAAVLQKISAATGGTSHV 524 Query: 345 VQNSRKLHDAFLR 357 + ++ F+ Sbjct: 525 ARTPAEIPGVFVD 537 >gi|51245384|ref|YP_065268.1| hypothetical protein DP1532 [Desulfotalea psychrophila LSv54] gi|50876421|emb|CAG36261.1| unknown protein [Desulfotalea psychrophila LSv54] Length = 420 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 45/118 (38%), Gaps = 2/118 (1%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 +G++++LTAIL+ V+ + L ++ + + V+ +L D L A ++L+ ENG Sbjct: 11 KKGEEGAVAVLTAILMAVLIMFAALAVDLGYLYGVRNELQNGADAGALAGAHELLDVENG 70 Query: 70 --NNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + + E G + + Q K + Sbjct: 71 ILTRDDAIAEAERVVSLNSTGNDAVQFKPIETGHWSFTTSTFSPNPTDTQGEWQEKSF 128 >gi|126316414|ref|XP_001380743.1| PREDICTED: similar to integrin alpha 2 subunit [Monodelphis domestica] Length = 1214 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 72/210 (34%), Gaps = 33/210 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + PD GL+ + + Sbjct: 207 IDVVVVCDESNS--------IYPWSAVKNFLVKFVQGLDIGPDKTQ---VGLIQYGNYPR 255 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFDAKEKLEHIAKGHDD-YKKY 286 F + + K + S T G L + I A+E A G K Sbjct: 256 VVF----NMSTFKTKEEMVKATSQTIQHGGDLTNTFKAIQFAREFAYSEASGGRPSATKV 311 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---LKNCA 337 ++ +TDGE+ + KE + CN + + I V A +K A Sbjct: 312 MVVVTDGESHDGSF-LKEVIGQCN---DDNILRFGIAVLGYLNRNALDTKNLIKEIKAIA 367 Query: 338 SP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 S F++V + L + +G+ + Sbjct: 368 STPTERYFFNVSDEDALLEKAGTLGERIFS 397 >gi|123390689|ref|XP_001299929.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121880878|gb|EAX86999.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 661 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 48/248 (19%), Positives = 91/248 (36%), Gaps = 43/248 (17%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNL--SAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 ++I+ IE + S +I D H + + + S ++ S+ Sbjct: 165 TDKNISKIEANNSATINQFDNHNAFINLDKFEPAIFVQTLISDQDKSTAVSSDDYIVVST 224 Query: 159 VKISSKS----DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 K S + D V+D S SM D++ A + +R ML + + Sbjct: 225 YKEFSSKSNCYECKADYFFVIDRSASMEG------DRIEKAVKCMRLMLQSLPMM----- 273 Query: 215 VVRSGLVTFSSKIVQTFPL-AWGVQHIQ---EKINRLIFG-STTKSTPGLEYAYNKIFDA 269 R +V F S+ P+ + +++ I + T LEY +++ Sbjct: 274 -CRFSIVCFGSEFQSLLPIVEYNNENVLLAMNLIKNINANMGGTDIYHPLEYIFSQNG-- 330 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 K I LTDGE+S ++++ + E K+ G +Y +G+ + A Sbjct: 331 -------------MTKKIFLLTDGEDS----NSEDIIRLVQENKQFG-NIYTVGIGSGAD 372 Query: 330 DQFLKNCA 337 ++N A Sbjct: 373 SGLIRNLA 380 >gi|37520439|ref|NP_923816.1| Mg chelatase subunit [Gloeobacter violaceus PCC 7421] gi|35211433|dbj|BAC88811.1| Mg chelatase subunit [Gloeobacter violaceus PCC 7421] Length = 669 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 65/208 (31%), Gaps = 36/208 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G + V+D S SM ++++ A ++ ++L N + LV F Sbjct: 468 ARKAGALITFVVDASGSMA------LNRMRSAKGAVLKLLTEA-----YQNRDKVALVPF 516 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ L G L +A + A + D Sbjct: 517 RGEKADVLLPPTRSIAQARRRLESLPCGGG----SPLAHA---LSQAIRLGVNAQSAGDV 569 Query: 283 YKKYIIFLTDGENS--------SPNIDNK-----ESLFYCNEAKRRGAIVYAI----GVQ 325 + I+ +TDG + P D+K E L + G + I Sbjct: 570 GQVIIVAITDGRGNIPLARSLGEPPGDDKPDIKGELLALAGRIRTLGFKLLVIDTENRFV 629 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKLHD 353 + + L A FY + S + Sbjct: 630 STGFAKELTTQAGGRYFYLPRASEQTLA 657 >gi|69244153|ref|ZP_00602689.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|293560613|ref|ZP_06677101.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|294621687|ref|ZP_06700851.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|314940439|ref|ZP_07847593.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] gi|314943384|ref|ZP_07850154.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|314953415|ref|ZP_07856334.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|314993087|ref|ZP_07858476.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|314997388|ref|ZP_07862342.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|68196610|gb|EAN11036.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|209491032|gb|ACI49667.1| putative pilus tip protein [Enterococcus faecium] gi|291598696|gb|EFF29749.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|291605436|gb|EFF34882.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|313588546|gb|EFR67391.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|313592412|gb|EFR71257.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|313594552|gb|EFR73397.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|313597919|gb|EFR76764.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|313640359|gb|EFS04940.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] Length = 1129 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 52/136 (38%), Gaps = 23/136 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +SS+ Sbjct: 289 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 342 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A + + Sbjct: 343 GYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGH-------- 394 Query: 281 DDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 395 ---KKVIVLLTDGVPT 407 >gi|325286049|ref|YP_004261839.1| hypothetical protein Celly_1140 [Cellulophaga lytica DSM 7489] gi|324321503|gb|ADY28968.1| protein of unknown function DUF58 [Cellulophaga lytica DSM 7489] Length = 288 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 45/125 (36%), Gaps = 10/125 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L MM+++DVS S ++FG + I + + Sbjct: 58 NVTARYNEPYIKVFEEERELTMMLMVDVSGS--ENFGTT----NQFKKDIITEISATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNK 265 + N + GL+ FS ++ P G H I L+ + T + L+Y N Sbjct: 112 SALQNNDKVGLILFSDEVELFIPPKKGKSHALRIIRELLEFTPKSTKTNLSEALKYLTNV 171 Query: 266 IFDAK 270 + Sbjct: 172 MKKKA 176 >gi|170730092|ref|YP_001775525.1| hypothetical protein Xfasm12_0914 [Xylella fastidiosa M12] gi|167964885|gb|ACA11895.1| conserved hypothetical protein [Xylella fastidiosa M12] Length = 795 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 44/147 (29%), Gaps = 22/147 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + + +D+S SM+ G G +L ++ LD + V L F Sbjct: 6 SVAVYFAIDLSGSMHYVGGNGRSRLDNMKTALNAALDQLGQSIASGTAVDIMLAGFGDAP 65 Query: 228 VQTFPL------AWGVQHIQEKINRLIFGSTT---KSTPGLEYAYNKIFDAKEKLEHIAK 278 L A G+ ++ + T T + Y Sbjct: 66 DHRQTLLRRNCTAQGIAELKSWVAARQALYGTYFPAGTMDMPSFYAAA------------ 113 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKES 305 + + F+TDGE P+ ++ Sbjct: 114 -PSNAVRVAFFMTDGEPDPPSATLAQA 139 >gi|158319036|ref|YP_001511544.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158114441|gb|ABW16638.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 608 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 70/212 (33%), Gaps = 35/212 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS-IPDVNNVVRSGLVTFSSKIVQT 230 + VLD S SMN+ + ++ LD KS IP GL FS+++ Sbjct: 409 LAVLDTSGSMNEEVPGSAGRSRLSVA-----LDAAKSAIPLFAEDSDLGLWQFSTRLRGD 463 Query: 231 FP-------------LAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 L G Q + + +NR+ T Y+ A + Sbjct: 464 QDWEELVPLGPMGERLGAGTRSQAVMDAVNRIEPRGDTGL-------YDTALAAFRYMNQ 516 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNI---DNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + ++ LTDG+NS P D + + +R V IG A+ Sbjct: 517 HYVPGRPNQ--VVLLTDGKNSDPGSIALDELVRILRREYSPQRPVQVITIGYGADTDLAA 574 Query: 333 LKNC--ASPDRFYSVQNSRKLHDAFLRIGKEM 362 L A+ Y + + + + E+ Sbjct: 575 LSRISAATGAETYPALDPNTIFEVLVDALTEV 606 >gi|323498921|ref|ZP_08103904.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] gi|323316033|gb|EGA69061.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] Length = 418 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 55/153 (35%), Gaps = 2/153 (1%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G ++ + + VI V L I+ +H K +L LD + L AT + N+ Sbjct: 11 KAQQGLTLVVMTVSMAVIVGVAALSIDANHLMVSKNRLQNALDTAALAGAT-VANRTYEE 69 Query: 71 NGKKQKNDFSYRIIKNIWQTD-FRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + K+ +Y + + D ++G + IE S + + Sbjct: 70 DDAKEAIVEAYNKVTSAAGNDELVLAASDDGTSLKSLTIEYSDNANSGFSSNFPSSADYI 129 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS 162 R ++ + + ++ + +S+V Sbjct: 130 YVRLQVSNVELSEYLAGLLGYSKSINSSTVAGP 162 >gi|291222847|ref|XP_002731426.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 973 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 56/266 (21%), Positives = 93/266 (34%), Gaps = 40/266 (15%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 +D ++ R S + S +++ F AN L + I Sbjct: 253 EEDGDSKSRHNSF----APNKHNAQCVWKSAWDVMLTTPDFTNNANPPKDGLDTSPIFSI 308 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + ++V+D+S SM D + +L ATR I L N G+V Sbjct: 309 V--KETPFRTVLVMDLSGSM-DSYNRVDLQLQAATRYIGNTL---------PNDTWVGIV 356 Query: 222 TFSSKIVQTFPLA-WGVQHIQ-EKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 F + + L + + + I++L G +T GLE + ++ Sbjct: 357 RFDTDALVVADLTKLDSEETRMDLISKLPSNTGGSTCIGCGLELGIEVLENSPFGDAGGG 416 Query: 278 KGHDDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 I TDG ENSSP I++ E V + + AD+ L+N Sbjct: 417 --------VIFLTTDGEENSSPYIEDVLPELLVKE-----VRVDCLAFG-DDADETLQNL 462 Query: 337 A--SPDR---FYSVQNSRKLHDAFLR 357 A + R + + S LHD+F Sbjct: 463 AEETGGRFAWYSELVTSTALHDSFTA 488 >gi|260800507|ref|XP_002595171.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] gi|229280414|gb|EEN51182.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] Length = 352 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 59/171 (34%), Gaps = 22/171 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ VLD + S+ G +++M+ P R G+V +S++ Sbjct: 201 VDLVFVLDGTGSV------GATNFEKMKTFVQKMISDFDLGP---EATRIGVVVYSNRAS 251 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + +Q+ + + + T A + + A+ +K Sbjct: 252 LEISLDAYDDQEALQDAVADIAYPGGYTLTGA---AIDYTTTFAFSTRNGARDG--VRKV 306 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + LTDG + + ++ I YA+G+ + L A Sbjct: 307 AVILTDG------VSYDDPAEPAQSMRKAAIITYAVGIGSNLDRDQLDVIA 351 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 62/174 (35%), Gaps = 24/174 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD++ +LD S S++ + + + +++ + + + GLV +S + Sbjct: 3 LDLVFILDSSDSVS------VQDFEMMRQFLKK---TVGDFNIGYDATQIGLVQYSDEAE 53 Query: 229 QTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L +++ I + G T + L+Y +F D K Sbjct: 54 TIFALDSFSSPTSLRDAIETIQYTGGATNTGNALDYMVQYMF------ASRNGARQDSTK 107 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 I LT G +S + ++ I YAIG+ +E L A Sbjct: 108 IAIVLTGGASSD------DIKAAAQRMRKSSVITYAIGIGSELDYDQLDYIAGA 155 >gi|154280917|ref|XP_001541271.1| predicted protein [Ajellomyces capsulatus NAm1] gi|150411450|gb|EDN06838.1| predicted protein [Ajellomyces capsulatus NAm1] Length = 759 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 53/138 (38%), Gaps = 18/138 (13%) Query: 170 DMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTF 223 D+++ +DVS SM P D+ G + +LD+ K I +N R G+V F Sbjct: 75 DIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLNENDRLGIVAF 134 Query: 224 SSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S++ + ++ + + + L S+T GL+ E H + Sbjct: 135 STEAEVVYEISKMNESSKKAALKAVEALKPLSSTNLWHGLKLGLKAF----ENERHTPQS 190 Query: 280 HDDYKKYIIFLTDGENSS 297 + LTDG + Sbjct: 191 VQA----LYVLTDGMPNH 204 >gi|295698036|ref|YP_003602693.1| putative tellurium resistance protein TerY [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295060148|gb|ADF64885.1| putative tellurium resistance protein TerY [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 197 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 56/172 (32%), Gaps = 10/172 (5%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + ++ +L +K P ++TF S Q PL + ++ L Sbjct: 6 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLT---DLLSFQMPALTASG 62 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT L + I +K KG + +TDG SPN D ++ L Sbjct: 63 TTSLGEALTLTASSIAKEVQKTTADTKGDWRP--LVFLMTDG---SPNDDWRKGLNDFKA 117 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 A+ +V A +A LK +S + F + + Sbjct: 118 ARTG--VVVACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFKWVSASIS 167 >gi|293410436|ref|ZP_06654012.1| predicted protein [Escherichia coli B354] gi|291470904|gb|EFF13388.1| predicted protein [Escherichia coli B354] Length = 219 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 62/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ + IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFTIGVQGADMKTLAQ 171 >gi|260785822|ref|XP_002587959.1| hypothetical protein BRAFLDRAFT_87354 [Branchiostoma floridae] gi|229273114|gb|EEN43970.1| hypothetical protein BRAFLDRAFT_87354 [Branchiostoma floridae] Length = 602 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 28/204 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+++ LD+S S D+ +A + +D + + + +R ++ ++ + Sbjct: 397 AIDIVLALDLSSS------IPQDQFELARDFMVAFVDC-EVFQEKD--IRIAVLNYTCEA 447 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F LA + +I +L+ G + G + ++ H Sbjct: 448 DTYFDLAPIAYGMSYEIGQLMRGDGGITRTGHAINHMRLTSKFGADAHHTA--------- 498 Query: 288 IFLTDGENSSPNIDNKESLFYCNE----------AKRRGAIVYAIGVQAEAADQFLKNCA 337 + LTDG+ L + A+ G +YA+ L Sbjct: 499 VILTDGQTMICYTVLLSGLPFPQSEDDQQAAAADARNAGIELYAVEFGYLVNGMALATMT 558 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKE 361 V ++ + DA +I + Sbjct: 559 GDPSGSRVFDTSQACDAAQKIVAD 582 >gi|110681632|dbj|BAE98271.1| circumsporozoite protein/thrombospondin-related anonymous protein-related protein [Plasmodium vivax] Length = 2106 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 75/204 (36%), Gaps = 21/204 (10%) Query: 170 DMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ +++D S S+ H P DK+ ++ I +N +R + Sbjct: 108 DLTLIIDESGSIGIKNWEKHVIPFTDKI--IKDLHIGENEVHAGILLFSNFIRDYVTFDE 165 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + L ++ + + + G+ TK L+YA EK H KG + Sbjct: 166 DESYKKDKL---LKKVGQLKKKYAAGAGTKIVSALDYAL-------EKYTHHKKGRPNAP 215 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRF 342 K I TDG N + + + + L +++ + +GV A A D L+ A Sbjct: 216 KVTILFTDG-NDTSSSSSTKLLDMGLTYRKKNVKLLVLGV-AAAKDVNLRAIAGCGDKNV 273 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + D I K++ + Sbjct: 274 PCPYAMKAEWDTINDITKKLTNKI 297 >gi|91783675|ref|YP_558881.1| hypothetical protein Bxe_A2139 [Burkholderia xenovorans LB400] gi|91687629|gb|ABE30829.1| Hypothetical TPR domain protein [Burkholderia xenovorans LB400] Length = 530 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 59/166 (35%), Gaps = 30/166 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++VL+++ SM+ R+ +++LD+ ++ R+GLV F++ Sbjct: 97 LVVVLELAHSMDATDIAP----TRLERARQKVLDLARA----RKGARTGLVVFAATAHLV 148 Query: 231 FPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P ++ + L + GL+ A + + Sbjct: 149 VPPTEDPAMLELYVPALSPALMPRDGKNAAAGLDVAEQLLANDPAAGT------------ 196 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 I+F++DG D E+ + +AK + + V E Sbjct: 197 IVFMSDG------FDTNEADAFVQKAKSLRHQLLWLAVGTEHGGPI 236 >gi|110633696|ref|YP_673904.1| hypothetical protein Meso_1343 [Mesorhizobium sp. BNC1] gi|110284680|gb|ABG62739.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 571 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 6/55 (10%), Positives = 23/55 (41%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 + G+++++ +L PV+ + ++ + + + + D + + I Sbjct: 13 LRDRSGNVAVMAGLLFPVMLLGAVFGVDQGSLYLERREAQALTDLAAVTAVANIS 67 >gi|332254890|ref|XP_003276566.1| PREDICTED: integrin alpha-2 [Nomascus leucogenys] Length = 1181 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 74/214 (34%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 173 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDVGPTKTQ---VGLIQYANNPR 221 Query: 229 QTFPLAWGVQHIQEKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L +E++ G T + ++YA + A + G Sbjct: 222 VVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAA------SGGRRS 273 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---L 333 K ++ +TDGE+ ++ K + CN + + I V A + Sbjct: 274 ATKVMVVVTDGESHDGSM-LKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEI 329 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 K AS F++V + L + +G+++ Sbjct: 330 KAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 363 >gi|291302016|ref|YP_003513294.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290571236|gb|ADD44201.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 425 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 58/180 (32%), Gaps = 39/180 (21%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 S +++ + ++++D S SM ++ A R+ I Sbjct: 23 AIVTVTSAGNGSGTVTADA-ADFAQVIMVDCSGSMTGS------RIAEAKRA------TI 69 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE--------KINRLIFGSTTKSTPG 258 +I ++ R +V + + +P ++ ++ L G T Sbjct: 70 AAIESLDEGCRFAIVKGTDEAQMVYPDDETTAVVKSSTRSAAVKRVQILRAGGGTAMGTW 129 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-------IDNKESLFYCNE 311 L K I G D KY + LTDG N +D+ E +F C+ Sbjct: 130 LA-----------KTSRILSGTDAAVKYGLLLTDGRNQHETEEELREHLDDCEGVFTCDA 178 >gi|123454693|ref|XP_001315098.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121897764|gb|EAY02875.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 665 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 49/250 (19%), Positives = 90/250 (36%), Gaps = 47/250 (18%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNL--SAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 ++I+ IE + S +I D H + + + S + S+ Sbjct: 165 TDKNISKIEANKSATINQFDNHNAFINLDKFEPAIFVQTLISDQDKSTAVSSDDYIAVST 224 Query: 159 VKISSKSDIGL----DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 K S G D V+D S SM D++ A + +R +L + Sbjct: 225 YKEFSSKSNGYECKADYFFVIDCSGSMKG------DRIEKAVKCMRLILQSLPMK----- 273 Query: 215 VVRSGLVTFSSKIVQTFPL-AWGVQHI------QEKINRLIFGSTTKSTPGLEYAYNKIF 267 R +V F S+ P+ + +++ + I ++ T LEY +++ Sbjct: 274 -CRFSIVCFGSEFQTLLPIVEYNNENVLLAMNLIKYIQAIM--GGTDIYHPLEYIFSQNG 330 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 K I LTDGE+S +++E + E K+ G +Y +G+ Sbjct: 331 ---------------MTKKIFLLTDGEDS----NSEEIIRLVQENKQFG-NIYTVGIGIG 370 Query: 328 AADQFLKNCA 337 A ++N A Sbjct: 371 ADSGLIRNLA 380 >gi|269217609|ref|ZP_06161463.1| putative von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 848 str. F0332] gi|269212544|gb|EEZ78884.1| putative von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 848 str. F0332] Length = 327 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 71/206 (34%), Gaps = 11/206 (5%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S K+ G ++ V+D + SM + + +L IR I+ IP+ Sbjct: 57 SVALPREKTVRGANVFFVVDTTGSMNAEDYDGKKPRLEGVRADIR---QIVSRIPNA--- 110 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 R +++F+S + P+ V ++ + + + S Y + K +LE Sbjct: 111 -RYSIISFNSGATRELPITTDVAAVESWVQTVKPEQSGYSNGS--SVYRPVKVLKRELER 167 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + H + + +DGE K N G + Y + Sbjct: 168 SRRSHPQSVQLVYVFSDGEPRGDKGQQKTYDQIRNLVDDGGVLGYGTAAGGPMKETTFSG 227 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKE 361 S Y +++ + A +I ++ Sbjct: 228 TGSGTGSY-IEDPQTRQTAISKIDEK 252 >gi|113460773|ref|YP_718840.1| hypothetical protein HS_0628 [Haemophilus somnus 129PT] gi|112822816|gb|ABI24905.1| conserved hypothetical protein, with von Willebrand factor (vWF) domain [Haemophilus somnus 129PT] Length = 212 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 65/189 (34%), Gaps = 16/189 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM G ++ + ++ ++ ++ P ++TF S Sbjct: 3 RLPVYLLVDTSGSM---MGEPIESVR---SGLQTLVSALRQDPYALETAYLSVITFDSSA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + ++ + T L + + +E + A+ D+K + Sbjct: 57 RQVTPLT---DLMSFQLPSIEASGLTAMGEALGLLADCV--NREVNKGSAEVKGDWKPVV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 LTDG P D + +F K A A A + LK + Sbjct: 112 FLLTDGI---PTDDLQSGIFALKNVKTG--TFVACAAGAGADTEELKKITETVVSLDTAD 166 Query: 348 SRKLHDAFL 356 + + F Sbjct: 167 ANSIKAFFK 175 >gi|311273688|ref|XP_003133992.1| PREDICTED: integrin alpha-2 [Sus scrofa] Length = 1186 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 70/210 (33%), Gaps = 33/210 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 178 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 226 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFDAKEKLEHIAKGHDD-YKKY 286 + + + K + S T G L + I A++ A G K Sbjct: 227 VV----FNLNTFKTKAEMVEATSHTTQYGGDLTNTFKAIQYARDSAYSAAAGGRPGATKV 282 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---LKNCA 337 ++ +TDGE + D ++ + + I V A +K A Sbjct: 283 MVVVTDGE----SHDGSMLKAVIDQCNNDNILRFGIAVLGYLNRNALDTKNLIKEIKAIA 338 Query: 338 S---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 S F++V + L + +G+++ Sbjct: 339 SIPTERYFFNVSDEADLLEKAGTLGEQIFS 368 >gi|258616219|ref|ZP_05713989.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium DO] Length = 1095 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 52/136 (38%), Gaps = 23/136 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +SS+ Sbjct: 289 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 342 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A + + Sbjct: 343 GYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGH-------- 394 Query: 281 DDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 395 ---KKVIVLLTDGVPT 407 >gi|149186759|ref|ZP_01865070.1| hypothetical protein ED21_29711 [Erythrobacter sp. SD-21] gi|148829667|gb|EDL48107.1| hypothetical protein ED21_29711 [Erythrobacter sp. SD-21] Length = 526 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 23/114 (20%), Positives = 42/114 (36%), Gaps = 16/114 (14%) Query: 157 SSVKISSKSDI----GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 SV I S LD+ +V+DV+ SM D+L +R ++D ++ Sbjct: 269 QSVTIRSDRAAEAVRKLDLALVVDVTGSMG-------DELRFLKEELRSIIDQLE-ARHR 320 Query: 213 NNVVRSGLVTFSSK----IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + +R + + + QTF V Q ++ G ++ A Sbjct: 321 DIDIRVAFSFYRDEGDDFVTQTFDFDSDVARAQTRLAAQHAGGGGDYEEAMQDA 374 >gi|160880107|ref|YP_001559075.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] gi|160428773|gb|ABX42336.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] Length = 513 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 34/166 (20%), Positives = 57/166 (34%), Gaps = 36/166 (21%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM FG G ++L A I + + + + + + ++ FSS+ L Sbjct: 347 LDYSGSM---FGEGNEQLVAAMEKILDHKLASEDMIQFSKMDKIFVIPFSSE------LK 397 Query: 235 W--------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 W ++ +I T +E+A + D D Y K Sbjct: 398 WVDSAISGIDTANLISRIKDTEAHGKTNIYAPVEHAIEILKD---------FDADVYTKS 448 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 I+ +TDGE S+ N + V++I Q Sbjct: 449 IVLMTDGE-SAGNF---------RKGTSYDIPVFSIMFGEANPKQL 484 >gi|327402315|ref|YP_004343153.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327317823|gb|AEA42315.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 639 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 51/168 (30%), Gaps = 18/168 (10%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KIS + +D+ V+D + SM D++ + +++ + I + N R Sbjct: 303 KISCQEVENVDVAFVVDATGSMG-------DEINFLKAEMNDVIFESRKISNKLNF-RFA 354 Query: 220 LVTFSSKIVQTFPLAWGVQHI----QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 V + + I I+ G ++ + + EH Sbjct: 355 NVFYRDMTDSYITKSMNFNRILTESTAFIDEQNAGGGGDFPEAMDVGLDSAINNLSWSEH 414 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + + D N+ +A +G + IG Sbjct: 415 ARA------RILFLILDAPPHQSPQVNERLQKLILQAAEKGIRIVPIG 456 >gi|189913184|ref|YP_001964413.1| Hypothetical BatB protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167781252|gb|ABZ99549.1| Hypothetical BatB protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 547 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 65/189 (34%), Gaps = 29/189 (15%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS--SHAPLLITSSVKISSKSDI-GLDMM 172 I D+ + ++ I T + + L S+ I S + D++ Sbjct: 32 IQFSKNKLDFVTRLSTSNQLLLILRTISFLVAMCFAIVSLYKIKSIDIESTKEFQSTDIL 91 Query: 173 MVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 V+DVSLSMN D + + ++ +PD+ R G++ F+ + Sbjct: 92 FVVDVSLSMNAIDIRPNRLKRFQDL---------SLRLLPDLKGN-RVGIIVFAGQSFSF 141 Query: 231 FPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ + + + I L T LE A + K + Sbjct: 142 CPMTSDLSAVSDYIQSLSVEMVGSKGTNLAVALERA----------EKIRKKSQNIQSNI 191 Query: 287 IIFLTDGEN 295 + +TDGE+ Sbjct: 192 TVIVTDGED 200 >gi|189912859|ref|YP_001964748.1| BatBC [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777535|gb|ABZ95835.1| BatBC [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] Length = 547 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 65/189 (34%), Gaps = 29/189 (15%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS--SHAPLLITSSVKISSKSDI-GLDMM 172 I D+ + ++ I T + + L S+ I S + D++ Sbjct: 32 IQFSKNKLDFVTRLSTSNQLLLILRTISFLVAMCFAIVSLYKIKSIDIESTKEFQSTDIL 91 Query: 173 MVLDVSLSMN--DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 V+DVSLSMN D + + ++ +PD+ R G++ F+ + Sbjct: 92 FVVDVSLSMNAIDIRPNRLKRFQDL---------SLRLLPDLKGN-RVGIIVFAGQSFSF 141 Query: 231 FPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ + + + I L T LE A + K + Sbjct: 142 CPMTSDLSAVSDYIQSLSVEMVGSKGTNLAVALERA----------EKIRKKSQNIQSNI 191 Query: 287 IIFLTDGEN 295 + +TDGE+ Sbjct: 192 TVIVTDGED 200 >gi|123490500|ref|XP_001325627.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121908529|gb|EAY13404.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 67/216 (31%), Gaps = 44/216 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 W K+ S S+ ++D S SM+ + A + + Sbjct: 213 WSDGYIAISTFTYFETKVHSNSE----FYFIIDCSGSMSGSC------IQNAKLCLNIFM 262 Query: 204 DIIKSIPDVNNVVRSGLVTF-SSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPG 258 + R ++ F S V P + +++ E + +L T Sbjct: 263 HSLPI------GCRFSIIKFGSDYEVALHPCDYTDENVSEAMKQLNNIDAEMGGTDILSP 316 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR--G 316 L+Y + K + LTDG++S+ N C A+ Sbjct: 317 LKYVME------------LTPKQGFIKQVFLLTDGQDSNTN-------ELCALAQENRTN 357 Query: 317 AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRK 350 +++IG+ + A + N + S + V + Sbjct: 358 NRIFSIGIGSGADKDLIINVSQKSGGNYVFVDDDES 393 >gi|296225410|ref|XP_002758285.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Callithrix jacchus] Length = 844 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 60/161 (37%), Gaps = 11/161 (6%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L ++ D ++V G S Sbjct: 287 TNMNKNVVFVIDISGSMKGQ------KVKQTKEALLQILGDMRP-GDYFDLVLFGSRVQS 339 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + +Q ++ + T GL + +E L ++ Sbjct: 340 WRGSLVQASQANLQAARDFVRGFSLDEATNLNGGLLRGIETLNKVRESLPELSNHAS--- 396 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 +I LTDG+ + D + L A R +Y +G Sbjct: 397 -ILIMLTDGDPTEGVTDRSQILKNVRSAIRGRFPLYNLGFG 436 >gi|220913390|ref|YP_002488699.1| hypothetical protein Achl_2645 [Arthrobacter chlorophenolicus A6] gi|219860268|gb|ACL40610.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 319 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 60/171 (35%), Gaps = 21/171 (12%) Query: 6 IRNFFYN--CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLY----- 58 +R + KG++S++ AILL + + + ++ + +A+L D S + Sbjct: 1 MRRLGADNTEKGAVSVIVAILLVTLLGFVAIAVDVGAIYSERAQLQSGADASAIALAQKC 60 Query: 59 -----------TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 T+T + N N+ N +S ++ K +E G + + Sbjct: 61 ARDTANADCSTTSTLAGSLANQNSLDGMSNVYSIQLDKTARTVSVTTSAKETGSPDNSVS 120 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 + + ++ I + + + + + P I P + + Sbjct: 121 LFFAKAIGIPSKEVGAKASATWGNPSKGPVI---LPLAIAYCKLNIPAGGT 168 >gi|260061453|ref|YP_003194533.1| hypothetical protein RB2501_07630 [Robiginitalea biformata HTCC2501] gi|88785585|gb|EAR16754.1| hypothetical protein RB2501_07630 [Robiginitalea biformata HTCC2501] Length = 288 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 46/125 (36%), Gaps = 10/125 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L +M+++D+S S + FG +G R + + + Sbjct: 58 NVTARYNEPFVKVFEEERELTLMLLVDISGS--EFFGT----VGQFKREVVTEVSATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLEYAYNK 265 + N + GL+ FS ++ P G H+ I L+ T + L++ N Sbjct: 112 SALQNNDKVGLILFSDEVELYIPPKKGKSHVLRIIRELLEFEPKSHRTDLSEALKFLSNV 171 Query: 266 IFDAK 270 I Sbjct: 172 IKKKA 176 >gi|194334244|ref|YP_002016104.1| magnesium chelatase ATPase subunit D [Prosthecochloris aestuarii DSM 271] gi|194312062|gb|ACF46457.1| magnesium chelatase ATPase subunit D [Prosthecochloris aestuarii DSM 271] Length = 619 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 57/179 (31%), Gaps = 33/179 (18%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISS-------------KSDIGLDMMMVLDVSLS 180 +P + PW + L T V + + G + ++D S S Sbjct: 382 LIPTLISAAPWQESRRLERLRKTGKVSTTGLIINKEDVKVKKFRDKSGTLFIFIVDASGS 441 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQTFPLAWGVQH 239 M ++++ A ++ +L D + L++F + P + V Sbjct: 442 MA------LNRMRQAKGAVSHLLQNAYVHRD-----QVALISFRGKEAQLLLPPSQSVDR 490 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + +++ L G T + A+ A+ + + +TDG + Sbjct: 491 AKRELDVLPTGGGTPLASAIYLAWE--------TAKQARTKGVSQIMFVLITDGRGNIG 541 >gi|313672125|ref|YP_004050236.1| von willebrand factor type a [Calditerrivibrio nitroreducens DSM 19672] gi|312938881|gb|ADR18073.1| von Willebrand factor type A [Calditerrivibrio nitroreducens DSM 19672] Length = 1174 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 59/157 (37%), Gaps = 10/157 (6%) Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +L +++ + + R G V FSS + ++ + + IN G +T + + Sbjct: 215 GAVLGVLQKLSNEPRNPRVGAVLFSSNNISVIKPSYDYISLIKAINDTKAGGSTNTKGAI 274 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI------DNKESLFYCNEAK 313 + N + + E + + K + I ++DGE + D ++ N Sbjct: 275 DTISNY-YKSTEAYQFDSNVVPCAKNFAIVVSDGEWNVGGDPLPTIRDMWKTDLMGNLTG 333 Query: 314 RRGAIVYAIGVQAEA---ADQFLKNCASPDRFYSVQN 347 ++ Y + + ++ + LK+ A + + Sbjct: 334 KQNVKTYTLAMFMDSSSNGTRALKHMAVFGGYNDIDK 370 >gi|119476218|ref|ZP_01616569.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] gi|119450082|gb|EAW31317.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] Length = 750 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 75/233 (32%), Gaps = 42/233 (18%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F F W + + +++ D+ +V+D+S SM + Sbjct: 15 IFFTMFAWGQQPENDVAPVLG-------NELTPDVRVVIDISGSMKKNDPNY------LR 61 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP-----LAWGVQHIQEKINRLIFGS 251 R E+L I+ P+ + G+ TF + P AW + Sbjct: 62 RPALELL--IQLFPEGSAA---GVWTFGQWVNNLVPSNTVTTAWRANASAQAEKISSVAL 116 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG--ENSSPNIDNKESL--- 306 T LE A + DY ++I LTDG + S +IDN+ + Sbjct: 117 RTNIPAALEKAMADVKSK----------GTDYSIHLILLTDGMVDVSLSSIDNETARQRI 166 Query: 307 --FYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 K G ++ + + A + ++ A + + + L F Sbjct: 167 VAEILPALKGAGVTIHTVALSQNADLELMELLAAETGGLSAVAETAEDLSRVF 219 >gi|12655866|gb|AAK00631.1| complement factor B [Halocynthia roretzi] Length = 1084 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 79/216 (36%), Gaps = 21/216 (9%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S +S+ +I ++ + DVS S+ + + A R ++D +K+ V Sbjct: 517 SRTISASEVNIFHYVIFIFDVSKSVTKKYQDFSSGIEFAKR----LIDRLKNFGGVLKYS 572 Query: 217 RSGLVTFSSKIVQTFP-LAWGVQHIQEKINRL---------IFGSTTKSTPGLEYAYNKI 266 + + ++ + V+ + ++++ L T+S A + Sbjct: 573 IIAYASSNKTQLEITDRFSTNVKEVIKRLDNLDSQVKEAVSELIEETRSGTATAKALKSL 632 Query: 267 FDAKEKLEHIAKGHDDYKK-YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 D +EH + K ++ TDG ++ + E + Y+I Q Sbjct: 633 RDMMLFMEHDIRNDQTNDKCHVFLFTDGMHNEGK-NPVEVRKEMQKIFGSNIEFYSISAQ 691 Query: 326 AEAA----DQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + ++ + + P+ + +++ L ++L Sbjct: 692 EDPSPEAFEELIGLASEPENYIYIEDIH-LLSSYLD 726 >gi|332708512|ref|ZP_08428486.1| protoporphyrin IX magnesium-chelatase [Lyngbya majuscula 3L] gi|332352609|gb|EGJ32175.1| protoporphyrin IX magnesium-chelatase [Lyngbya majuscula 3L] Length = 669 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 69/214 (32%), Gaps = 39/214 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ ++L N + L+ F Sbjct: 466 ARKAGSLVVFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YENRDQVSLIPF 514 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ RL G + GL A + +A+ D Sbjct: 515 RGEQADVLLPPTRSIALAKRRLERLPCGGGSPLAHGLTQAVHVGMNAQ-------LSGDI 567 Query: 283 YKKYIIFLTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI-----G 323 + I+ +TDG + P E L + G + I Sbjct: 568 GQVVIVAITDGRGNIPLARSLGEPLPEGEKPDIKGELLEIAARIRANGVQLLVIDTESKF 627 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 V A + K A ++ + + K A + Sbjct: 628 VSTGFAKELAKQ-AGGKYYHLPKATDKAIAAMTK 660 >gi|315126123|ref|YP_004068126.1| hypothetical protein PSM_A1030 [Pseudoalteromonas sp. SM9913] gi|315014637|gb|ADT67975.1| conserved hypothetical protein [Pseudoalteromonas sp. SM9913] Length = 631 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 45/137 (32%), Gaps = 23/137 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+D+S SM + S + LD+++ + + + LV ++ Sbjct: 91 VIVMDMSYSMYSTDIAPNRLMQ----SRFKALDMVELFKEGD----TALVAYAGTAYTIS 142 Query: 232 PLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + I L + + GL+ A + A I Sbjct: 143 PLTNDAKTLANLIPSLSPEIMPDKGSNVLAGLDMAQELLSQAGYLDGD-----------I 191 Query: 288 IFLTDGENSSPNIDNKE 304 I +TDG D Sbjct: 192 ILITDGIEQQQQSDVSR 208 >gi|294997269|ref|NP_001025043.3| integrin alpha-D isoform 1 [Mus musculus] Length = 1202 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 49/245 (20%), Positives = 81/245 (33%), Gaps = 50/245 (20%) Query: 158 SVKISSKSDIGLDMMMVLDVSLS--------MNDHFGPGMDKLGVATRSIREM------- 202 + +D+ ++D S S M D M +L + S M Sbjct: 145 PATMPECPGQEMDIAFLIDGSGSIDQSDFTQMKDFVKALMGQLASTSTSFSLMQYSNILK 204 Query: 203 ----LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN-----------RL 247 KS ++V IVQ L + IQ+ ++ L Sbjct: 205 THFTFTEFKSSLSPQSLV--------DAIVQLQGLTYTASGIQKVVDSNRLLLLRVWSAL 256 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFLTDGENSSPNIDNKESL 306 G + + + ++ + ++L H G KK +I +TDG+ D E Sbjct: 257 KPGCHYNTHMPW-WVWQRVRFSWKELFHSKNGARKSAKKILIVITDGQKFR---DPLEYR 312 Query: 307 FYCNEAKRRGAIVYAIG----VQAEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIG 359 EA++ G I YAIG + A Q L S D + V N L +I Sbjct: 313 HVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQIQ 372 Query: 360 KEMVK 364 +++ Sbjct: 373 EKIFA 377 >gi|269965332|ref|ZP_06179452.1| hypothetical protein VMC_08820 [Vibrio alginolyticus 40B] gi|269829978|gb|EEZ84207.1| hypothetical protein VMC_08820 [Vibrio alginolyticus 40B] Length = 589 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 47/164 (28%), Gaps = 29/164 (17%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +VLD+S SM ++L ++L K +GL+ ++ P Sbjct: 89 LVLDMSRSM-FATDIKPNRLAQTRYKALDLLPKWKEGA-------TGLIAYAGDAYNLSP 140 Query: 233 LAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + I L+ +E A N+ A + II Sbjct: 141 LTTDSSTLAGIIENLSPELMPFQGANLPAAIELAINQFSQAGTQQGD-----------II 189 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 L D ++D E + + + + V Sbjct: 190 VLAD------DLDTSELSRALDLVQGTKFRISVLAVGTPNGAPI 227 >gi|194758427|ref|XP_001961463.1| GF14980 [Drosophila ananassae] gi|190615160|gb|EDV30684.1| GF14980 [Drosophila ananassae] Length = 1138 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 77/200 (38%), Gaps = 36/200 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM++ + + +LD + VN VV++ Sbjct: 150 AASSPKDIMILLDASSSMSEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKT 203 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 204 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGV---- 257 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQF- 332 + I+ +T E++S + + K+ ++ + +++ + Sbjct: 258 -GSQCNQAIMLIT--ESTSESHK--------DVIKQYNWPHMPVRIFTYLIGSDSGSRSN 306 Query: 333 LK--NCASPDRFYSVQNSRK 350 L C++ F + + + Sbjct: 307 LHDMACSNKGFFVQINDYEE 326 >gi|74220882|dbj|BAE33629.1| unnamed protein product [Mus musculus] Length = 1202 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 49/245 (20%), Positives = 81/245 (33%), Gaps = 50/245 (20%) Query: 158 SVKISSKSDIGLDMMMVLDVSLS--------MNDHFGPGMDKLGVATRSIREM------- 202 + +D+ ++D S S M D M +L + S M Sbjct: 145 PATMPECPGQEMDIAFLIDGSGSIDQSDFTQMKDFVKALMGQLASTSTSFSLMQYSNILK 204 Query: 203 ----LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN-----------RL 247 KS ++V IVQ L + IQ+ ++ L Sbjct: 205 THFTFTEFKSSLSPQSLV--------DAIVQLQGLTYTASGIQKVVDSNRLLLLRVWSAL 256 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD-DYKKYIIFLTDGENSSPNIDNKESL 306 G + + + ++ + ++L H G KK +I +TDG+ D E Sbjct: 257 KPGCHYNTHMPW-WVWQRVRFSWKELFHSKNGARKSAKKILIVITDGQKFR---DPLEYR 312 Query: 307 FYCNEAKRRGAIVYAIG----VQAEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIG 359 EA++ G I YAIG + A Q L S D + V N L +I Sbjct: 313 HVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQIQ 372 Query: 360 KEMVK 364 +++ Sbjct: 373 EKIFA 377 >gi|326671057|ref|XP_003199352.1| PREDICTED: integrin alpha-2-like [Danio rerio] Length = 1181 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 48/274 (17%), Positives = 97/274 (35%), Gaps = 28/274 (10%) Query: 103 QDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS 162 + NNI S L++ ++ + ++ +P + S+ + Sbjct: 102 SNGNNINMSLGLTLTPTTKNNGFMTCGPLWAQLCGSQYFYPGVCADVSPQFTLQSAFSPA 161 Query: 163 SKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ +D+ +VLD S S+ + P +D + ++L+ + P ++ Sbjct: 162 VQTCGSLMDIAIVLDGSNSI-YPWEPIVD-------FLVKLLENLNIGPQSTQ---VSVM 210 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L + K + + S + GLE K D + + Sbjct: 211 QYGVDTTFQFYL----NSYKTKESMIKAASNMQQKQGLETNTFKAIDFARTNAFLPQNGG 266 Query: 282 DY--KKYIIFLTDGENSSPNIDNKESLFYCN--EAKRRGAIV--YAIGVQAEAADQF--L 333 K ++ +TDGE+ NI NK + C+ R G V Y I + + + Sbjct: 267 RPGATKVMVVVTDGESHDANIRNK-VIAECDSQNITRFGIAVLGYYIRNDIDTSKLIAEI 325 Query: 334 KNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 K+ AS F++V L + +G + Sbjct: 326 KSIASKPTEKFFFNVSEEAALIEIVGTLGDRIFN 359 >gi|168700937|ref|ZP_02733214.1| hypothetical protein GobsU_15533 [Gemmata obscuriglobus UQM 2246] Length = 350 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 56/160 (35%), Gaps = 18/160 (11%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLD------MMMVLDVSLSMNDHFGPGMDKLGVATR 197 + L + +V+ S + +++VLDVS SM G K + R Sbjct: 72 LALATVAWGLTVLLTVEPKSHKPGEIKESEYRHLVLVLDVSRSMEVEDAGGGGKQKRSER 131 Query: 198 SIREMLDIIKSIPDVNNVVR--SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 + D+I+S + R + ++ +S+ + I+ + L Sbjct: 132 AA----DLIQSFFERVQAERYKTTVIAVASEAKPVVLDTTDREVIRNILTELPM------ 181 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + +F E+ +AK ++ +TDG+ Sbjct: 182 RHAFKPGETNMFAGLEEAARVAKPWPPGSAVLMVVTDGDT 221 >gi|15899118|ref|NP_343723.1| hypothetical protein SSO2360 [Sulfolobus solfataricus P2] gi|284173767|ref|ZP_06387736.1| hypothetical protein Ssol98_03803 [Sulfolobus solfataricus 98/2] gi|13815663|gb|AAK42513.1| Conserved hypothetical protein [Sulfolobus solfataricus P2] gi|261600867|gb|ACX90470.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 452 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 57/168 (33%), Gaps = 26/168 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-GLVTFSSKIVQ 229 + ++LD S SM+ + +A S + + + +R + + VQ Sbjct: 290 IYLLLDKSGSMDGEKILWAKAVALALYSRAKR-------ENRDFYLRFFDNIPYPLIKVQ 342 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + V + E I ++ G T + + A I + KG + II Sbjct: 343 KNAKSKDVIKMIEYIGKIRGGGGTDISRSIISACEDIKEGH------VKGVSE----IIL 392 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 LTDGE+ + SL N I V + L+ + Sbjct: 393 LTDGEDKIAETTVRRSLKEANSQ--------LISVMIRGDNADLRRVS 432 >gi|300022115|ref|YP_003754726.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299523936|gb|ADJ22405.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 635 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 79/209 (37%), Gaps = 30/209 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++DVSLS + L V ++ + + + D + + TF+S+ Sbjct: 445 DLSVAVLMDVSLSTDAWMQER-RVLDVEKGALLALTHGLTACGDEH-----AIYTFTSRR 498 Query: 228 VQTFPLAWGV-------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + Q I +I+ L G T+ + + +++ ++ Sbjct: 499 RTNVTVTTLKGYDEALDQRIARRISALKPGQYTRIGAAVRHVTSELVKRPQRH------- 551 Query: 281 DDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + ++ LTDG+ + +++ EA++ G V+ + + A D F + Sbjct: 552 ----RLLLLLTDGKPNDVDHYEGRFGIEDTRMAIREARKAGLRVFGVTIDENARDYF-PH 606 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 F V++ +L A I +++ Sbjct: 607 IFGRGAFAIVRDIARLPAALPAIYRQLTT 635 >gi|209527953|ref|ZP_03276438.1| magnesium chelatase ATPase subunit D [Arthrospira maxima CS-328] gi|209491612|gb|EDZ91982.1| magnesium chelatase ATPase subunit D [Arthrospira maxima CS-328] Length = 662 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 68/205 (33%), Gaps = 39/205 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQ 229 ++ V+D S SM ++++ A ++ ++L D+ L+ F + Sbjct: 467 VVFVVDASGSMA------LNRMQSAKGAVMQLLTEAYQSRDM-----VSLIPFRGEQAEV 515 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + + ++ R+ G + GL A +AK+ D + I+ Sbjct: 516 LLPPTRSIAAAKRRLERMPCGGGSPLAHGLTQAVRVGMNAKQ-------SGDIGQVVIVA 568 Query: 290 LTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI-----GVQAEAAD 330 +TDG + P E L + + G + I V A Sbjct: 569 ITDGRGNIPLARSLGEPILEGEKPDIKGELLEIAAKIRGLGIQLLVIDTENKFVSTGFAK 628 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAF 355 + K A ++ + + + A Sbjct: 629 ELAKT-AGGKYYHLPKATDQAIAAM 652 >gi|217978821|ref|YP_002362968.1| hypothetical protein Msil_2684 [Methylocella silvestris BL2] gi|217504197|gb|ACK51606.1| conserved hypothetical protein [Methylocella silvestris BL2] Length = 429 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 25/48 (52%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDH 54 R F Y+ G ++++ + LPVI ++ L E S + K ++ + D Sbjct: 20 RRFRYDRGGGVALMIGLALPVIIGMIALGTEISFLLYKKFQMQSVADS 67 >gi|172060502|ref|YP_001808154.1| hypothetical protein BamMC406_1450 [Burkholderia ambifaria MC40-6] gi|171993019|gb|ACB63938.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 648 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 15/66 (22%), Positives = 30/66 (45%), Gaps = 1/66 (1%) Query: 20 LTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDF 79 + AI + V +V+G VI+ ++ + K L ++D + L + + +G + N Sbjct: 1 MAAIWVMVAIVVLG-VIDIANLYLQKRDLQRVVDLAALAAVQPMTSDPSGCLSDAKNNVT 59 Query: 80 SYRIIK 85 S I Sbjct: 60 SSANIN 65 >gi|171185854|ref|YP_001794773.1| phage head-tail adaptor [Thermoproteus neutrophilus V24Sta] gi|170935066|gb|ACB40327.1| phage head-tail adaptor [Thermoproteus neutrophilus V24Sta] Length = 357 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 65/192 (33%), Gaps = 46/192 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + + LDVS SM ++ GP KL VA ++ L + + LV F+ Sbjct: 200 IYVALDVSGSMKEYLGPS-TKLRVAKNAVARYLRQVAQLRGS-----VSLVLFNVGADFM 253 Query: 231 FPLAWGVQHIQEKINRLI-------FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 W + + ++ T+ LE +N H Sbjct: 254 ----WTPHPVHRYLGDMLEILRYIYAMGGTELASALELLHN----------HGVVND--- 296 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 ++ +TDG S D ++ L +R ++ I + FLK A + R Sbjct: 297 ---VVVITDGRTS----DVEKVLDLA----KRFRRLHVIAT---EKNPFLKQIAKITGGR 342 Query: 342 FYSVQNSRKLHD 353 + + + + + Sbjct: 343 YRELAPTLDILE 354 >gi|119486743|ref|ZP_01620718.1| Mg chelatase subunit [Lyngbya sp. PCC 8106] gi|119456036|gb|EAW37169.1| Mg chelatase subunit [Lyngbya sp. PCC 8106] Length = 664 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 75/217 (34%), Gaps = 40/217 (18%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K+ ++ L + +V D S SM ++++ A ++ ++L D + Sbjct: 458 AKLLARKSGALVVFLV-DASGSMA------LNRMQSAKGAVMQLLTEAYQGRD-----QV 505 Query: 219 GLVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 L+ F + P + + ++ R+ G + GL A +AK+ Sbjct: 506 ALIPFRGEQAEVLLPPTRSIAAAKRRLERMPCGGGSPLAHGLTQAVRVGMNAKQ------ 559 Query: 278 KGHDDYKKYIIFLTDGENS--------SPNIDNKES------LFYCNEAKRRGAIVYAI- 322 D + I+ +TDG + P +D K+ L + + G + + Sbjct: 560 -SGDIGQVVIVAITDGRGNIPLARSLGEPILDEKKPDIKGELLEIAAKIRGLGIQLLVVD 618 Query: 323 ----GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAF 355 V A + K A ++ + + + A Sbjct: 619 TENKFVSTGFAKELAKT-AGGKYYHLPKATDQAIAAM 654 >gi|5762340|gb|AAD51108.1|AF174430_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 923 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 32/159 (20%), Positives = 68/159 (42%), Gaps = 10/159 (6%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V ++ K D G D+M +LDVS M D F DK+ ++ D + + Sbjct: 208 TIDVTVTVTPKEIDEGADVMALLDVSKKMTDADFKNAKDKIKKLVTTLTSNSDNAEHKHN 267 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN---KIFD 268 N VR L+TF +I ++ ++++++ K+ ++ + I Sbjct: 268 SRNSVR--LMTFYREISDPIDIS---GKTDAELDKILNDLREKAKANYDWGVDLQGAIHK 322 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 A+E + + +++I+ + GE++ N ++ Sbjct: 323 AREIFKKDQEKKSGKRQHIVLFSQGESTFSYDINDKTKL 361 >gi|221116649|ref|XP_002154434.1| PREDICTED: similar to Ints6 protein [Hydra magnipapillata] Length = 854 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 49/131 (37%), Gaps = 10/131 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L I + + R LVTF + ++ Sbjct: 4 LVFLVDTSASMNQKTCLGTTYLDLAKGAVESFLKIRARDINASRGDRYMLVTF-DEHNKS 62 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY-- 286 + W + +++ L T L+ A++ + + I Y Sbjct: 63 VKVGWRENLAQFLKELKNLEAYGLTDLGGALKQAFD-LLNMTRLQSGIDNYGLGRNPYYL 121 Query: 287 ----IIFLTDG 293 ++ +DG Sbjct: 122 EPALVMLFSDG 132 >gi|323474469|gb|ADX85075.1| von Willebrand factor type A [Sulfolobus islandicus REY15A] gi|323477206|gb|ADX82444.1| von Willebrand factor type A [Sulfolobus islandicus HVE10/4] Length = 380 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 74/190 (38%), Gaps = 33/190 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G ++ LD S SM + K+ +A + +++ K IP N +TFS Sbjct: 34 SATGFHYIVALDTSGSMTGY------KIELAK---QGAIELFKRIPKGNK---VSFITFS 81 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S + + +I ++ G T + A +AK + Sbjct: 82 SNVTVIKEFV-DPLDLTNEILQIAAGGQTALYTAILTA-----------NSLAKKYQMPT 129 Query: 285 KYIIFLTDGENSS-PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 Y++ LTDG + N+ N L Y + VY+ G+ + +Q L++ + + Sbjct: 130 -YLLLLTDGNPTDETNVGNYLKLPYFEKM-----QVYSFGIGDDYNEQLLQSISDKTSGV 183 Query: 342 FYSVQNSRKL 351 Y + ++ ++ Sbjct: 184 MYHISDANEI 193 >gi|229116005|ref|ZP_04245400.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] gi|228667499|gb|EEL22946.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] Length = 452 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 72/204 (35%), Gaps = 22/204 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + L++ ++LD S SM G K+ A ++I LD I + +V V + + Sbjct: 148 KEKSLNVEILLDASGSMAGKVN-GQVKMEAAKKAIYNYLDKIPNNANVMLRVYGHKGSNN 206 Query: 225 SKIVQTFPLAWGVQHI--------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L+ G + +E+ N + K L A + + ++ Sbjct: 207 EN---DKSLSCGSSEVMYPLQPYKKEQFNAALSNFGPKGWTPLASAIESVNNDFKEYTGE 263 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKN 335 + YI+ +DGE + + + + IG + Q LKN Sbjct: 264 ENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSSTHAVVNI--IGFDVKNSEQQQLKN 316 Query: 336 CASP--DRFYSVQNSRKLHDAFLR 357 A + +V N+ +L+ + Sbjct: 317 TAEAGKGNYATVSNAEELYQTLNK 340 >gi|45659178|ref|YP_003264.1| hypothetical protein LIC13359 [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602424|gb|AAS71901.1| conserved hypothetical protein [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 377 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 85/225 (37%), Gaps = 42/225 (18%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 +SS++ + + +D S SMN++ G K+ +A + + I ++P Sbjct: 54 TNVSSENQTPSQLFI-IDASGSMNEYLGIYQ-KIHLAKKHVSHY---ISTLPQETE---I 105 Query: 219 GLVTF--------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 G + + SS++ Q + + ++ L T + A I K Sbjct: 106 GFLAYGNRLPGCSSSRLYQPLEMG-NRDTFKNRLFSLTPSGATPLAESIRIAGTLISQRK 164 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEA 328 ++ E II +TDG S D K+ L K++G + +G+ ++ Sbjct: 165 KETE------------IILVTDGIESCYG-DPKKEL---QALKQKGIPFQFHVLGLGLKS 208 Query: 329 ADQFLKNCAS---PDRFYSVQNSRKLHDAFLRI----GKEMVKQR 366 ++ + +++S+++ + A + K+ ++ Sbjct: 209 HEELQMKILTEEGNGKYFSIEDDSSFYTALDSLRNSPAKKTSQKS 253 >gi|41615188|ref|NP_963686.1| hypothetical protein NEQ403 [Nanoarchaeum equitans Kin4-M] gi|40068912|gb|AAR39247.1| NEQ403 [Nanoarchaeum equitans Kin4-M] Length = 216 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 69/225 (30%), Gaps = 43/225 (19%) Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 Y +++ T+ + L+ + ++ + + +LD S SM Sbjct: 30 GYTQGLKPIWDLDLRKTTYMLAITKNLLYSLVFRVYRPLAEKEYIV----LLDCSGSMKG 85 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 + ++ + R GL+ F++KI+++ P + Sbjct: 86 EKFEKALAIALSI---------------IYKYKRVGLILFNNKIIKSIPPTENKTLL--- 127 Query: 244 INRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 +N L I T + LE A E I +TD + +D Sbjct: 128 VNSLFVIPREKTNISIALEEAMKYAKPKSE---------------IFIITDAVPTDETVD 172 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYS 344 E + + + V+ IG+ + + A S Y Sbjct: 173 --ELIETVRKLALKNIKVHVIGINLKEGKIVAETIAKISSSLLYE 215 >gi|84996511|ref|XP_952977.1| thrombospondin-related protein [Theileria annulata strain Ankara] gi|71532874|emb|CAJ20069.1| thrombospondin-related protein, putative [Theileria annulata] Length = 606 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 45/309 (14%), Positives = 107/309 (34%), Gaps = 28/309 (9%) Query: 27 VIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN 86 ++ M + + S+ F +++ +L L ++ + + N + + ++ Sbjct: 76 LLLTGMTRLKDGSNSFNKNSRISSVLPMETLDRLSEAITK-NSEHPVTFEGLDGGSVVVT 134 Query: 87 IWQTDFRNELRENGFAQDIN-NIERSTSLS--IIIDDQHKDYNLSAVSRYEMPFIFCTFP 143 F +L + ++ + +T + I H ++ L Sbjct: 135 NNSDTFSIKLYPSLPGLNLTPGMLPTTKPNSHINFTGNHNEHAL---------VKHALHD 185 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 +++ L K SS LD+ +++D S S +++ ++ ++ Sbjct: 186 LSSSNYDKGLYPDGIKKPSSYCHRELDLTILVDESSS------IYIEEWNKLIPFLKSLV 239 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 I P N V +VTFS+ I E++ + S P Y + Sbjct: 240 RSINISP---NYVHLSMVTFSTSIRWLISF-LDPASKDEQLALAVLDKLKNSKPVFGYTF 295 Query: 264 N-KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + + + ++ + K II +TDG ++ N+ ++ S + G + + Sbjct: 296 TGQALNFISEAVYMFGARRNSPKGIIIITDGSSTQTNVTSQASALL----RDAGVTILVV 351 Query: 323 GVQAEAADQ 331 GV + Sbjct: 352 GVGKAKESE 360 >gi|189465621|ref|ZP_03014406.1| hypothetical protein BACINT_01979 [Bacteroides intestinalis DSM 17393] gi|224540001|ref|ZP_03680540.1| hypothetical protein BACCELL_04913 [Bacteroides cellulosilyticus DSM 14838] gi|189437895|gb|EDV06880.1| hypothetical protein BACINT_01979 [Bacteroides intestinalis DSM 17393] gi|224518391|gb|EEF87496.1| hypothetical protein BACCELL_04913 [Bacteroides cellulosilyticus DSM 14838] Length = 289 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 45/108 (41%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEF------GTVKQMKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T GLEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKAESRRTDIRLGLEYLTNVMK 173 >gi|159129667|gb|EDP54781.1| von Willebrand domain protein [Aspergillus fumigatus A1163] Length = 946 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 68/188 (36%), Gaps = 33/188 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSI-REMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+D S SM D L V +S+ + I S ++ + + ++++ Sbjct: 295 IIFVIDRSGSMMDKIDTLKSALRVFLKSLPVGVCFNICSFGSRHSFLWKQSLFYTAES-- 352 Query: 230 TFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 +Q ++ + T+ +E KE ++ Sbjct: 353 -------LQEALSFVDGVRANMGGTEMQEAVEATVRSRMKDKELE-------------VL 392 Query: 289 FLTDGENSSPNIDNKESL--FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY--S 344 LTDG+ I N+++L F A GA +++G+ A+ ++ A + Sbjct: 393 ILTDGQ-----IWNQQTLFGFIRETAADNGARFFSLGIGNGASHSLVEGIARAGNGFSQM 447 Query: 345 VQNSRKLH 352 V N +L Sbjct: 448 VVNYEELD 455 >gi|38347892|ref|NP_941141.1| putative tellurium resistance protein [Serratia marcescens] gi|226807627|ref|YP_002791321.1| TerY1 [Enterobacter cloacae] gi|226809937|ref|YP_002791631.1| TerY1 [Enterobacter cloacae] gi|1354148|gb|AAC44737.1| terY [Plasmid R478] gi|38259369|emb|CAE51594.1| putative tellurium resistance protein [Serratia marcescens] gi|226425852|gb|ACO53945.1| TerY1 [Enterobacter cloacae] gi|226426163|gb|ACO54255.1| TerY1 [Enterobacter cloacae] Length = 197 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 56/172 (32%), Gaps = 10/172 (5%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + ++ +L +K P ++TF S Q PL + ++ L Sbjct: 6 IEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSARQAVPLT---DLLSFQMPALTASG 62 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT L + I +K KG + +TDG SPN D ++ L Sbjct: 63 TTSLGEALSLTASSIAKEVQKTTADTKGDWRP--LVFLMTDG---SPNDDWRKGLNDFKA 117 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 A+ +V A +A LK +S + F + + Sbjct: 118 ARTG--VVVACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFKWVSASIS 167 >gi|71001856|ref|XP_755609.1| von Willebrand domain protein [Aspergillus fumigatus Af293] gi|66853247|gb|EAL93571.1| von Willebrand domain protein [Aspergillus fumigatus Af293] Length = 946 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 68/188 (36%), Gaps = 33/188 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSI-REMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+D S SM D L V +S+ + I S ++ + + ++++ Sbjct: 295 IIFVIDRSGSMMDKIDTLKSALRVFLKSLPVGVCFNICSFGSRHSFLWKQSLFYTAES-- 352 Query: 230 TFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 +Q ++ + T+ +E KE ++ Sbjct: 353 -------LQEALSFVDGVRANMGGTEMQEAVEATVRSRMKDKELE-------------VL 392 Query: 289 FLTDGENSSPNIDNKESL--FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY--S 344 LTDG+ I N+++L F A GA +++G+ A+ ++ A + Sbjct: 393 ILTDGQ-----IWNQQTLFGFIRETAADNGARFFSLGIGNGASHSLVEGIARAGNGFSQM 447 Query: 345 VQNSRKLH 352 V N +L Sbjct: 448 VVNYEELD 455 >gi|314948817|ref|ZP_07852188.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] gi|313644760|gb|EFS09340.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] Length = 1129 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 53/136 (38%), Gaps = 23/136 (16%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +SS+ Sbjct: 289 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 342 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A +++ Sbjct: 343 GYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSRLSVPNGH-------- 394 Query: 281 DDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 395 ---KKVIVLLTDGVPT 407 >gi|294011131|ref|YP_003544591.1| hypothetical protein SJA_C1-11450 [Sphingobium japonicum UT26S] gi|292674461|dbj|BAI95979.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 418 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 14/101 (13%), Positives = 34/101 (33%), Gaps = 1/101 (0%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 + G++ ++ A +P++ GL +T K ++ D + L A + Sbjct: 8 RLLRDRTGNVLMMAAASMPLLVGAAGLATDTVQWTLWKRQIQRQADSAALAGAYAVAQGF 67 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNI 108 N ++ + ++ N FA + + Sbjct: 68 NASD-SATADISRMALVALTQTPTIENAPTSGPFAGNAQAV 107 >gi|268610164|ref|ZP_06143891.1| von Willebrand factor type A [Ruminococcus flavefaciens FD-1] Length = 285 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 77/210 (36%), Gaps = 19/210 (9%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + + S L + ++D S SM K+G + E++ I+ + + + V+ Sbjct: 40 PLSATGVSRKSLVIFFLIDTSGSMKGK------KMGELNTVMEELIPEIRRVGEADTEVK 93 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++TFS+ + + ++ + RL T + A+ ++ + + Sbjct: 94 VAVLTFSTDVRWMYSTPIPIEDF--EWARLRANGVT----SMGAAFKELSLRMSRNSFLN 147 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFY--CNEAKRRGAIVYAIGVQAEAADQFL-K 334 + I +TDG P+ D +E L N + G A+G+ EA D L + Sbjct: 148 SPSLSFAPVIFLMTDG---YPSDDYREGLKELQSNSWYKFGLKA-ALGIGNEANDDVLAE 203 Query: 335 NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 S D + +L I + Sbjct: 204 FTGSKDTVVHAYSGGQLAQMIKIIAVTSSQ 233 >gi|311898833|dbj|BAJ31241.1| hypothetical protein KSE_54660 [Kitasatospora setae KM-6054] Length = 442 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 47/145 (32%), Gaps = 13/145 (8%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G +++V+D S SM +H M AT + + L + V ++G V + Sbjct: 52 GPGAAVVLVIDCSGSM-EHPAGKMRHAREATAAAIDELRDGVAFAVVAGTHQAGDV-YPG 109 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + +E + RL T + A E + Sbjct: 110 NNRLAVADRATREQAKEALRRLNPSGGTAIGTWIAKAQQLFASHPEASI----------R 159 Query: 286 YIIFLTDGENSSPN-IDNKESLFYC 309 + I LTDG N D + SL Sbjct: 160 HAILLTDGRNEHEKPADLQRSLDAA 184 >gi|206890389|ref|YP_002248900.1| magnesium-chelatase subunit ChlD [Thermodesulfovibrio yellowstonii DSM 11347] gi|206742327|gb|ACI21384.1| magnesium-chelatase subunit ChlD [Thermodesulfovibrio yellowstonii DSM 11347] Length = 614 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 65/175 (37%), Gaps = 22/175 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +++ V+D S SM ++ ++ +L I + ++ F Sbjct: 423 ERRMSHNVIFVVDGSGSMGVE-----QRMKATKGAVLSLL-----IDCYKKRDKVAMIVF 472 Query: 224 -SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 K PL V+ +++ + G T + GL AY + K+ H + Sbjct: 473 RKDKAEILLPLTSSVELALKRLREIPTGGKTPLSAGLMEAYKLM-----KITHFKYPENR 527 Query: 283 YKKYIIFLTDGENSSPNIDN---KESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 I+ +TDG+ + D +E C K + I + E D+F+K Sbjct: 528 L--LILIITDGKPNVSLSDKPVLEELKSVCFMLKDFPLTDF-IVIDTEKKDKFMK 579 >gi|319952927|ref|YP_004164194.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421587|gb|ADV48696.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 212 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 61/196 (31%), Gaps = 16/196 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM + ++ M+ ++ P ++TF S Sbjct: 3 RLPVYILLDTSGSMMGE------PIEAVKNGVQVMISSLRQNPQAIESAFLSIITFDSSA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL ++ + TT LE I + +KG +I Sbjct: 57 RQIVPLT---DLASFQMPDIQATGTTSLGEALELVSTCIDNEVASTTSESKGDWKPLVFI 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG + L KRR A + A A LK + Sbjct: 114 --MTDGIPTDDMQKGLSELK-----KRRTAYIVACAAGNGADSTLLKQITENVVSLDTAD 166 Query: 348 SRKLHDAFLRIGKEMV 363 S+ + F + + Sbjct: 167 SQSISKFFAWVTASIG 182 >gi|296194626|ref|XP_002745014.1| PREDICTED: integrin alpha-2 [Callithrix jacchus] Length = 1208 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 40/284 (14%), Positives = 86/284 (30%), Gaps = 48/284 (16%) Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 ++ I+ + SL + + F+ C W + ISS Sbjct: 132 NVTEIKTNMSLGLTLTRNMGTGG----------FLTCGPLWAQQCGNQYYTTGVCSDISS 181 Query: 164 KSDIGLDMMMVLDVSLSMND-----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + S+ D + + + + + P Sbjct: 182 DFQLSSSFSPAVQTCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGLDIGPTKTQ---V 238 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKIN------RLIFGSTTKSTPGLEYAYNKIFDAKEK 272 L+ +++ F L +E++ G T + ++YA + A Sbjct: 239 ALIQYANNPRVVFNL--NTYRTKEEMTLATSQTSQYGGDLTNTFGAIQYARKYAYSAA-- 294 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QA 326 + G K ++ +TDGE+ ++ K + CN + + I V A Sbjct: 295 ----SGGRPSATKVMVVVTDGESHDGSM-LKAVIEQCNH---DNILRFGIAVLGYLNRNA 346 Query: 327 EAADQF---LKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 +K AS F++V + L + +G+++ Sbjct: 347 LDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 390 >gi|297527215|ref|YP_003669239.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256131|gb|ADI32340.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 190 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 49/151 (32%), Gaps = 16/151 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM+ F + A + + + + D VR G V F SK Sbjct: 5 IVFIIDTSYSMSRKFNDFVPNKIRAVKEVLAY--ALTRLFDKYKDVRVGAVVFFSKAYPI 62 Query: 231 FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L + + I L G + GL A + E I Sbjct: 63 LSLTISREGVLMSIRELEILGEGSAPGDGLIEAVKMMRRKNGLKET------------IM 110 Query: 290 LTDGENSSPNIDNKESLFYCNE-AKRRGAIV 319 +TDG + N +++ N K + Sbjct: 111 ITDGGFNEGIPLNIAAIYAANSGVKTSFIAI 141 >gi|308472813|ref|XP_003098633.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] gi|308268233|gb|EFP12186.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] Length = 395 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 80/220 (36%), Gaps = 25/220 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 Y + F+ + + + ++ S SDI LD+++V+D S ++K Sbjct: 5 YALLFLIGASGKQIVTIYPLTVCSAEQCGGSISDIWLDVVVVVDNS--------QRVNKR 56 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLI 248 + + + +I + R G VT++S+ L ++G + Sbjct: 57 SFVSSTRDTINNIFREASIPRT--RVGFVTYNSQATTNADLNKFKSYGDLQQGVYNSYND 114 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLE--HIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + + TP + + A E L+ A GH ++ K II N + +D L Sbjct: 115 MNLSPEKTPYIGTG---LIAAGELLQIQGSADGHVNHPKVIIAYATALNGTGLLDP---L 168 Query: 307 FYCNEAKRRGAIVYAIGVQAEAAD---QFLKNCASPDRFY 343 N K G + I V + + L ASP + Sbjct: 169 SVANTLKSAGITIITIAVDTDDNGVIEKQLAPLASPGAAF 208 >gi|284052435|ref|ZP_06382645.1| protoporphyrin IX magnesium-chelatase [Arthrospira platensis str. Paraca] Length = 374 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 37/204 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQ 229 ++ V+D S SM ++++ A ++ ++L + + L+ F + Sbjct: 179 VVFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQSRDQVSLIPFRGEQAEV 227 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + + ++ R+ G + GL A +AK+ D + I+ Sbjct: 228 LLPPTRSIAAAKRRLERMPCGGGSPLAHGLTQAVRVGMNAKQ-------SGDIGQVVIVA 280 Query: 290 LTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI----GVQAEAADQ 331 +TDG + P E L + + G + I + + Sbjct: 281 ITDGRGNIPLARSLGEPILEGEKPDIKGELLEIAAKIRGLGIQLLVIDTENKFVSTGFAK 340 Query: 332 FLKNCASPDRFYSVQNSRKLHDAF 355 L A ++ + + + A Sbjct: 341 ELAKTAGGKYYHLPKATDQAIAAM 364 >gi|157694069|ref|YP_001488531.1| hypothetical protein BPUM_3318 [Bacillus pumilus SAFR-032] gi|157682827|gb|ABV63971.1| hypothetical protein YwmD [Bacillus pumilus SAFR-032] Length = 225 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 70/214 (32%), Gaps = 29/214 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +K ++++LD+S SM G K+ +A RSI+ I+ D ++R Sbjct: 20 PAHAVTKKQEPAHVVILLDLSGSMAQSVE-GEKKIDIAKRSIQSFASILS--DDTQVLLR 76 Query: 218 S-GLVTFSSKIVQTFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 G + + A +G L T + + A Sbjct: 77 VFGHEGTNKNAGKAISCASSEAVYGFGSYESSTFQQALNVYKPT--------GWTPLAKA 128 Query: 270 KEKLEHIAKGHDDYKKYIIF-LTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQA 326 + + H K I++ ++DG+ + E G IV IG Sbjct: 129 LTDTKQDFEDHQAEGKNIVYVVSDGKETCGG----SPSQAAKELHEDGIDTIVNIIGFDV 184 Query: 327 -EAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 E + LK+ A ++ N+ +L+ Sbjct: 185 NEKEAKSLKSVAKAGGGQYQPAANAEELNHILQN 218 >gi|296236487|ref|XP_002763348.1| PREDICTED: protein DDX26B [Callithrix jacchus] Length = 861 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 44/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLVDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 + W ++ L T L ++ N++ + Sbjct: 62 IKVGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|91223291|ref|ZP_01258557.1| hypothetical protein V12G01_05591 [Vibrio alginolyticus 12G01] gi|91192104|gb|EAS78367.1| hypothetical protein V12G01_05591 [Vibrio alginolyticus 12G01] Length = 611 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 47/164 (28%), Gaps = 29/164 (17%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +VLD+S SM ++L ++L K +GL+ ++ P Sbjct: 89 LVLDMSRSM-FATDIKPNRLAQTRYKALDLLPKWKEGA-------TGLIAYAGDAYNLSP 140 Query: 233 LAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + I L+ +E A N+ A + II Sbjct: 141 LTTDSSTLAGIIENLSPELMPFQGANLPAAIELAINQFSQAGTQQGD-----------II 189 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 L D ++D E + + + + V Sbjct: 190 VLAD------DLDTSELSRALDLVQGTKFRISVLAVGTPNGAPI 227 >gi|324997281|ref|ZP_08118393.1| von Willebrand factor type A [Pseudonocardia sp. P1] Length = 184 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 50/142 (35%), Gaps = 24/142 (16%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + + + ++ + +V+DVS SM ++ AT + S Sbjct: 1 MPEDTDIVVRERTGSPRAITLVVDVSGSMRGEKV----RIAAATVAAL-------SGDAG 49 Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 R LV F S PL + +++ R+ T GL A+ ++ A+ Sbjct: 50 ATGDRLALVAFWSDAAVLSPLDEPATPGVLLDRLVRIPARGLTDVGFGLSVAHAELAGAR 109 Query: 271 EKLEHIAKGHDDYKKYIIFLTD 292 E+ + + L+D Sbjct: 110 ERR-----------RVAVLLSD 120 >gi|308272641|emb|CBX29245.1| hypothetical protein N47_J02260 [uncultured Desulfobacterium sp.] Length = 676 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 80/225 (35%), Gaps = 28/225 (12%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 A+ L + + IG ++ ++D S SM ++ + +I +L Sbjct: 468 ASGLAVVLRSEDIREKIRERRIGNFLLFLVDASGSMG-----ARSRMIASKGAIMSLL-- 520 Query: 206 IKSIPDVNNVVRSGLVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + R +++F S+ V P ++ + + G T + G+ Y Sbjct: 521 ---LDAYQKRDRVAMISFRKSEAVVNLPPTSSIELAAGLLREMPVGGRTPLSAGIAKTYE 577 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRG-AIVY 320 + + + +I +TDG ++ + K E++ + G + Sbjct: 578 VLRN-------YLLREPTARPIVIIITDGRSNVSLGEKKPMDEAVDFAARLSDDGRIRII 630 Query: 321 AIGVQAEAADQ--FLKNCASPDR--FYSVQNSRKLHDAFLRIGKE 361 + +++ + + A +Y +++ + ++ L I KE Sbjct: 631 VVDTESQGLVKFGLAQKLAGAMNAQYYKIEDLKS--ESLLNIAKE 673 >gi|297694868|ref|XP_002824689.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Pongo abelii] Length = 482 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 66/213 (30%), Gaps = 31/213 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y MP F T L + S +++ ++D S S+ D M + Sbjct: 262 SYHMPNWFGTTK-YVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLE 320 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-----NR 246 +I K+ + + V F+ Q ++ +E + N Sbjct: 321 FVS---------NIAKTFEISDIGAKIAAVQFT--YDQRTEFSFTDYSTKENVLAAIRNI 369 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + + + +F K +++ +TDG+ + D+ + Sbjct: 370 RYMSGGTATGDAISFTVRNVFGPIR--------ESPNKNFLVIVTDGQ----SYDDVQG- 416 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 A G V+++GV D + P Sbjct: 417 -PAAAAHDAGITVFSVGVAWAPLDDLKDMASKP 448 >gi|297199739|ref|ZP_06917136.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] gi|297147478|gb|EFH28643.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] Length = 448 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 53/178 (29%), Gaps = 28/178 (15%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + ++ + +++D S SM+ K+ A + +D Sbjct: 37 STGGGTIGSAVGAPHLWTAGQGPSAAVAIMVDCSGSMDY----PPTKMRNARDATAAAID 92 Query: 205 IIKSIPDVNNVVRSGLV----TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++ V+ V G + A ++ + RL G T L Sbjct: 93 TLR--DGVHFAVIGGTHVAKEVYPGGGSLAVADATTRDQAKQALRRLSAGGGTAIGTWLR 150 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-------IDNKESLFYCNE 311 A + A + H I LTDG N + +D+ F C+ Sbjct: 151 LADRLLSSADVAIRHG-----------ILLTDGRNEHESPEDLKAALDSCAGRFTCDA 197 >gi|170575265|ref|XP_001893167.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158600964|gb|EDP37999.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 448 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 71/192 (36%), Gaps = 29/192 (15%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAP--LLITSSVKISSKSDIGLDMMMVLDVSL 179 +++ +P T P +S+ P I+S S D + V+D + Sbjct: 214 TQNHTRETSRSTRLP---STVPVQPDSTKRPENRPISSLSGRLDCSLAPFDTLFVVDSTS 270 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW---- 235 S+ F I E++++I +P+ +N R G++ +SS + + L + Sbjct: 271 SVRQFFEDH-------RTYIIEIINLI--LPEFDNDTRIGIIEYSSSLRRQVKLRFIAHK 321 Query: 236 GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 I E + +L F T + L+ A + D + + ++ LTDG Sbjct: 322 NRTEIVETVKKLPFFAGITATGAALKLALEILQDRRSNVLTN----------VVVLTDGF 371 Query: 295 NSSPNIDNKESL 306 + + L Sbjct: 372 SYDLVDEPSSML 383 >gi|126327494|ref|XP_001368527.1| PREDICTED: similar to candidate tumor suppressor protein DICE1 isoform 1 [Monodelphis domestica] Length = 889 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 46/143 (32%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 II +TDG + +E L Sbjct: 122 PAIIITITDGSKLTTTSGVQEEL 144 >gi|126327496|ref|XP_001368564.1| PREDICTED: similar to candidate tumor suppressor protein DICE1 isoform 2 [Monodelphis domestica] Length = 852 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 46/143 (32%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 II +TDG + +E L Sbjct: 122 PAIIITITDGSKLTTTSGVQEEL 144 >gi|110833328|ref|YP_692187.1| pilin biogenesis-like protein [Alcanivorax borkumensis SK2] gi|110646439|emb|CAL15915.1| pilin biogenesis related protein [Alcanivorax borkumensis SK2] Length = 1009 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/112 (20%), Positives = 41/112 (36%), Gaps = 9/112 (8%) Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY-CNEA------KRRG 316 KI D + ++ +I+ TDGE + N+ Y C A + Sbjct: 269 KKIVDNGNYVSPRNTANECESNHIVLFTDGEANDVNLPCGGGSSYDCQRAISDYLDRDFE 328 Query: 317 AIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRKLHDAFLRIGKEMVKQR 366 Y +G+ E ++ +S Y+ ++ L AFL I + K+ Sbjct: 329 IKTYNVGLHMEDNRADMETVSSDGADGTYTASDAESLASAFLDIFDLIEKES 380 >gi|297559546|ref|YP_003678520.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296843994|gb|ADH66014.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 587 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 79/217 (36%), Gaps = 22/217 (10%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 IT ++ ++ + ++ ++D+S SM + G GM ++ V + + + L++ + Sbjct: 360 ITGLIRTWNQLKMDSRVLAIVDISGSMLAEVPGTGMTRMQVTSAAATQGLEMFTPSSE-- 417 Query: 214 NVVRSGLVTFSSKI-----VQTFPLAWGVQHIQE--KINRLIFGSTTKSTPGLEYAYNKI 266 GL FS+ + Q +Q + +R + S L + Sbjct: 418 ----LGLWEFSTNVNNELHYQEIAPIRELQAAADDGTAHRDVLAGALASLQPLPQGDTAL 473 Query: 267 FDAKEKLEHIAKGHDDYKK--YIIFLTDGENSSP---NIDNKESLFYCNEAKRRGAIVYA 321 ++ + I+ LTDG+N +P +D S + R + Sbjct: 474 YETYLAAYQEMSRTYQPDRTNVILMLTDGDNDNPGGLGLDELMSQIESLASPSRPIPIIT 533 Query: 322 IGVQAE-AADQFLK--NCASPDRFYSVQNSRKLHDAF 355 I + + L+ A+ Y ++ ++ + F Sbjct: 534 IAFGPDVQNLEPLQEIAAATGGAAYMTEDPTEIGEIF 570 >gi|254428069|ref|ZP_05041776.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196194238|gb|EDX89197.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 657 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 73/221 (33%), Gaps = 30/221 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 F + H LL+ K + + + + ++D S SM + A S+ Sbjct: 272 FHEEIDGEHYALLMVVPPKTGQVTALPRETLFIIDSSGSMGGA------PMRQAKASLHL 325 Query: 202 MLDIIKSIP--DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 L +K ++ + + F + + + Q Q+ ++ L T P L Sbjct: 326 ALQRLKPGDRFNITDFDSQHTLLFETPVTVS---DNSRQQAQDFVDGLQASGGTHMLPAL 382 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 ++ Y + +IF+TDG + + + EA+ + Sbjct: 383 SATLSQPASDG------------YLRQVIFITDGAVGNESGIFRALHQQLGEAR-----L 425 Query: 320 YAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 + +G+ + F+ A F + + ++ + Sbjct: 426 FTVGIGSAPNSHFMTRAAQFGRGSFTYINDQNQVQQGMDTL 466 >gi|304411848|ref|ZP_07393459.1| Tetratricopeptide TPR_1 repeat-containing protein [Shewanella baltica OS183] gi|307303384|ref|ZP_07583139.1| TPR repeat-containing protein [Shewanella baltica BA175] gi|304349708|gb|EFM14115.1| Tetratricopeptide TPR_1 repeat-containing protein [Shewanella baltica OS183] gi|306913744|gb|EFN44166.1| TPR repeat-containing protein [Shewanella baltica BA175] Length = 694 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 57/178 (32%), Gaps = 28/178 (15%) Query: 136 PFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P + S+ + + + +V+D+S+SM ++L Sbjct: 57 PLHLLAFTWLIATFALAGPAVNKQSLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 A ++L +K +GLV F+ PL + + ++ Sbjct: 113 QAKFRATDLLRNLKEGE-------TGLVAFAGDAFTISPLTRDTGTLLNLLPTLSPEIMP 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + GL A + II +TDG ++ D +L Sbjct: 166 VRGSNLAAGLTQAKKLLAQGGHIRGD-----------IIVMTDGITAAQFDDANSALS 212 >gi|300868543|ref|ZP_07113161.1| protoporphyrin IX magnesium-chelatase (fragment) [Oscillatoria sp. PCC 6506] gi|300333456|emb|CBN58351.1| protoporphyrin IX magnesium-chelatase (fragment) [Oscillatoria sp. PCC 6506] Length = 195 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 66/199 (33%), Gaps = 40/199 (20%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQT 230 M V+D S SM ++++ A ++ ++L + + L+ F + Sbjct: 1 MFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQSRDQVALIPFRGEQAEVL 49 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + ++ RL G + GL A +A+ K D + I+ + Sbjct: 50 LPPTRSIALARNRLERLPCGGGSPLAHGLTQAVRVGVNAQ-------KSGDVGQVAIVAI 102 Query: 291 TDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI-----GVQAEAADQ 331 TDG + P E L + + G + I V A + Sbjct: 103 TDGRGNIPLARSLGEPILDGEKPDIKAELLEIAGKIRGLGMQLLVIDTENKFVSTGFAKE 162 Query: 332 FLKNCASPDRFYSVQNSRK 350 K S ++Y + + Sbjct: 163 LAK--VSGGKYYHLPKATD 179 >gi|115687249|ref|XP_792282.2| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 2422 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 79/219 (36%), Gaps = 42/219 (19%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S+ ++++ ++D S S+ G++ + ++++L PD R ++TFS Sbjct: 196 SNGQVELVFLVDSSASV------GIENFFNELKFVKKLLADFTVAPDA---TRVAIITFS 246 Query: 225 --SKIVQTF-PLAWGVQH--------IQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEK 272 ++ L + + E + ++ G T + E A + A+ Sbjct: 247 SKHRVELNVNQLDNSIAGKHHHKCALLNEDLPKITYVGGGTYTKGAFELAKKVLQGARAN 306 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + LTDG ++ PN + K G V++ G++ + Sbjct: 307 STQA----------VFLLTDGLSNGPN-----PVPVAVSLKDDGVEVFSFGIRDGYIPEL 351 Query: 333 LKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 L+ + + Y + + + F + + + + + Sbjct: 352 LQMASEKKDEHCYILDSFAE----FEALARRALHEDLRM 386 >gi|1915900|emb|CAA67559.1| collagen VI-alpha-1 chain [Homo sapiens] Length = 125 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 36/88 (40%), Gaps = 4/88 (4%) Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCA 337 H KY+I +TDG + L NEAK G V+++ + + + L A Sbjct: 2 SHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIA 61 Query: 338 SPDRF---YSVQNSRKLHDAFLRIGKEM 362 + + ++ + + DA I + + Sbjct: 62 TDHTYRRNFTAADWGQSRDAEEAISQTI 89 >gi|323484869|ref|ZP_08090225.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] gi|323401865|gb|EGA94207.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] Length = 2032 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 51/373 (13%), Positives = 110/373 (29%), Gaps = 66/373 (17%) Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 + K+ F +++ + E+ +G T + + Sbjct: 964 SVKLAGVSETTVDIPVGKYFYVDEVQSQGAKAVKWEVSSSGTGTYPQVSGSKTEI-YEVT 1022 Query: 120 DQHKDYNLSAVSRY-EMPFIFCTFPW----CANSSHAPLLITSSVKISSKSDIGLDMMMV 174 + + Y L + Y E+ W + L +T ++ D++ V Sbjct: 1023 KESQGYLLQCTNIYGELNPQVAKRAWKDYTADDEYDVTLEVTGDSIQTTIGGGTADIVFV 1082 Query: 175 LDVSLSMND-----------HFGPGMDKL---------GVATRSIREMLDIIKSIPDVNN 214 +D S SMND +D+ I + + + + Sbjct: 1083 IDKSSSMNDWDDSLDDYRWNKLESTVDRFINKLKITSPNSKISFIEYQSSDLGTYQETYD 1142 Query: 215 VVRSGLVTFSSKIVQTFP---LAWGVQ----HIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 +R V + + L + ++ +GS G A + Sbjct: 1143 DIRKVAVADTESADKNLEGNKLKYFRTLQKWTAISEVGSKPYGSAPGYNQGTHSAGGYLG 1202 Query: 268 DAKEKLEHIAKGHDDY---KKYIIFLTDGENSSPNIDN---------------KESLFYC 309 + D+Y KYII+L DG +N + ++ Sbjct: 1203 AERALDRLKTYNPDEYNSNSKYIIYLADGTAGYYVNNNGTQAGSGSGGNANARRAAITQS 1262 Query: 310 NEAKRR--GAIVYAIGVQAEAADQF--LKNCASPDR-----------FYSVQNSRKLHDA 354 E K++ A +Y + ++++ +K A FYS N+++L + Sbjct: 1263 GELKKKHPDATIYTVAFGSDSSANMNWMKPGAYNGNSDNPYNPNVTAFYSAANTKELEET 1322 Query: 355 FLRIGKEMVKQRI 367 F + ++ + Sbjct: 1323 FDNLAAQVGSSAV 1335 >gi|229489135|ref|ZP_04383001.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229324639|gb|EEN90394.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 551 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 66/206 (32%), Gaps = 38/206 (18%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS------ 225 + V+DVS SM D+ G+ ++ ++ I+ P GL FS Sbjct: 353 LAVVDVSGSM-DYMQDGVTRMAATAQAGDI---AIRMFPANAQ---LGLWAFSIDLGEGT 405 Query: 226 ---------KIVQTFPLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLE 274 ++ T + +I+ L G T + AY + + Sbjct: 406 DYRELEPVARMDATEGDTDHRSKLLSRIDSLSSIVGGGTGLYDSVLAAYRSMQQTYDPAS 465 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC---NEAKRRGAIVYAIGVQAEAADQ 331 + +I LTDG N P+ + + L + R + IGV +A Sbjct: 466 INS---------VILLTDGANDDPSSISLQELLDTLTREQDPTRPVPIITIGVTDDADTD 516 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAF 355 L+ + + + + F Sbjct: 517 VLEQISALTGGNSHFAPTPADIPKVF 542 >gi|257060420|ref|YP_003138308.1| magnesium chelatase ATPase D [Cyanothece sp. PCC 8802] gi|256590586|gb|ACV01473.1| magnesium chelatase ATPase subunit D [Cyanothece sp. PCC 8802] Length = 673 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 66/213 (30%), Gaps = 37/213 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 472 ARKAGALIVFVVDASGSMA------LNRMQSAKGAVMRLLTEA-----YENRDQVALIPF 520 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ L G + GL A + +AK D Sbjct: 521 RGEQADVLLPPTRSIALARRRLETLPCGGGSPLAHGLTQAVHVGMNAK-------MSGDI 573 Query: 283 YKKYIIFLTDGENSSPNIDNK--------------ESLFYCNEAKRRGAIVYAI----GV 324 + I+ +TDG + P + E L + + G + I Sbjct: 574 GQVVIVAITDGRGNIPLAKSLGEPLPEGEKPDIKGELLEIAGKIRALGMKLLVIDTEKKF 633 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + L A + + + + + Sbjct: 634 VSTGFGKELATTAGGTYYQLPRATDQAIAQMAK 666 >gi|218248255|ref|YP_002373626.1| magnesium chelatase ATPase subunit D [Cyanothece sp. PCC 8801] gi|218168733|gb|ACK67470.1| magnesium chelatase ATPase subunit D [Cyanothece sp. PCC 8801] Length = 673 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 66/213 (30%), Gaps = 37/213 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 472 ARKAGALIVFVVDASGSMA------LNRMQSAKGAVMRLLTEA-----YENRDQVALIPF 520 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ L G + GL A + +AK D Sbjct: 521 RGEQADVLLPPTRSIALARRRLETLPCGGGSPLAHGLTQAVHVGMNAK-------MSGDI 573 Query: 283 YKKYIIFLTDGENSSPNIDNK--------------ESLFYCNEAKRRGAIVYAI----GV 324 + I+ +TDG + P + E L + + G + I Sbjct: 574 GQVVIVAITDGRGNIPLAKSLGEPLPEGEKPDIKGELLEIAGKIRALGMKLLVIDTEKKF 633 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + L A + + + + + Sbjct: 634 VSTGFGKELATTAGGTYYQLPRATDQAIAQMAK 666 >gi|38197044|gb|AAH05159.2| COL6A1 protein [Homo sapiens] Length = 445 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 21/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H + A R L ++ P + VR +V +S Q Sbjct: 246 DITILLDGSASVGSHNFDTTKRF--AKRLAERFLTAGRTDPAHD--VRVAVVQYSGTGQQ 301 Query: 230 TFP---LAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + ++ + F T L Y +A Sbjct: 302 RPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYVTRFYREASSGAA--------- 352 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG +S EA+R G ++ + V Sbjct: 353 KKRLLLFSDG--NSQGATPAAIEKAVQEAQRAGIEIFVVVVG 392 >gi|1915902|emb|CAA67576.1| collagen (VI) alpha-1 chain [Homo sapiens] Length = 436 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 21/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H + A R L ++ P + VR +V +S Q Sbjct: 237 DITILLDGSASVGSHNFDTTKRF--AKRLAERFLTAGRTDPAHD--VRVAVVQYSGTGQQ 292 Query: 230 TFP---LAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + ++ + F T L Y +A Sbjct: 293 RPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYVTRFYREASSGAA--------- 343 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG +S EA+R G ++ + V Sbjct: 344 KKRLLLFSDG--NSQGATPAAIEKAVQEAQRAGIEIFVVVVG 383 >gi|333027046|ref|ZP_08455110.1| putative von Willebrand factor type A [Streptomyces sp. Tu6071] gi|332746898|gb|EGJ77339.1| putative von Willebrand factor type A [Streptomyces sp. Tu6071] Length = 453 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 55/207 (26%), Gaps = 38/207 (18%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 Q + + +P + + D + ++++D S Sbjct: 20 PQGPRFEVEVYQNPYLPEGSGEVHAVVTVTATGGGTGALTAAGPGQDAAV--VLMVDCSG 77 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP------- 232 SM KL A + LD + + R +V + + +P Sbjct: 78 SMQY----PPSKLHHAKEATGAALDTL------RDGTRFAVVEGTHVAREVYPRGGALAV 127 Query: 233 -LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 +E + L T L A + A + H I LT Sbjct: 128 ADDRTRAEAKEALRTLRASGGTAVGRWLRLAERLLSQAPVAIRHG-----------ILLT 176 Query: 292 DGENSS-------PNIDNKESLFYCNE 311 DG N +D+ F C+ Sbjct: 177 DGRNEHETPEELRAALDDCAGRFTCDA 203 >gi|319639064|ref|ZP_07993821.1| PilC protein [Neisseria mucosa C102] gi|317399642|gb|EFV80306.1| PilC protein [Neisseria mucosa C102] Length = 1123 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 62/184 (33%), Gaps = 14/184 (7%) Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 +H+ + S S + F + P + + I+ + + ++M ++D S Sbjct: 12 SGRHRLIHASVASALAL-ISFSVQANTQQFAKIPFYLQNETSINGQPKVKHNIMFLIDDS 70 Query: 179 LSMNDHFGPGMDKLGVATRSIR--EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP---- 232 SM + + R E L + + GL T + P Sbjct: 71 GSMQWNVQGKETSIWADKRITITKEALKSVLKEYGEKQRFQWGLQTLHNNGRTDTPDEEG 130 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + +Q +++ + G T T + ++ + + K Y+I ++D Sbjct: 131 FTDDWKDVQRRVDGIDPGHATPIT-------RRYYEVVKNFVMPNIKYRCQKSYVIVMSD 183 Query: 293 GENS 296 G+ + Sbjct: 184 GDAN 187 Score = 37.1 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 49/134 (36%), Gaps = 32/134 (23%) Query: 235 WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 W + +++ L F S T + ++ A + + DA + DG+ Sbjct: 311 WDRNY-KDEKRGLRFFSRTLAEKDIKTAKDGLDDAGKSW------------------DGD 351 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQFLKNCAS-PDRFYSVQNSRK 350 S P + +K+ + +G +E ++L+ AS PD +++ Sbjct: 352 PSDPKG--------VDYSKQL-VQTFTVGFGEGISEVGREYLEKGASRPDWYFNAAKKED 402 Query: 351 LHDAFLRIGKEMVK 364 L +AF I + Sbjct: 403 LLEAFKTIVDNIEN 416 >gi|219848048|ref|YP_002462481.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542307|gb|ACL24045.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 446 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 60/197 (30%), Gaps = 33/197 (16%) Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 +S F +D A S+ E L R L+ +S + P G Sbjct: 87 LSSEARSQFSSPIDYTVRALHSVVERLTP---------DDRMALIACASDALVLAPSTPG 137 Query: 237 VQ-----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + ++ L G +T GL+ A L + + ++ LT Sbjct: 138 HRRTDLIGAIARLPVLRLGESTNLAQGLQLA----------LAQFVVTDEPAVRRVVLLT 187 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSR 349 DG + D EA R + IG+ + L A S R VQ + Sbjct: 188 DGFTT----DTTMCTALAREAADRSITISTIGLGNTFEETLLTQIADLSGGRASFVQEAG 243 Query: 350 KLHDAFLRIGKEMVKQR 366 + I E+ R Sbjct: 244 HIPTI---ISAELEHAR 257 >gi|327262910|ref|XP_003216266.1| PREDICTED: integrin alpha-2-like [Anolis carolinensis] Length = 1302 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 74/217 (34%), Gaps = 37/217 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K +D+++V D S S + +++ + + P GL+ + Sbjct: 289 KCSSAVDVVVVCDESNS--------IYPWDAVKAFLKKFVQGLDIGPTKTQ---VGLIQY 337 Query: 224 SSKIVQTFPLAW--GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + F L + + ++ T + +EYA F + G Sbjct: 338 GNNPREVFNLNTYTNKEDAVQAMSETYQNGGEYTNTFRAIEYARRYAFS------KESGG 391 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---------QAEAAD 330 K ++ +TDGE+ + +E + CNE + I V + Sbjct: 392 RPSASKVMVVVTDGESHDGSK-LQEVIAKCNE---DNITRFGIAVLGYLIRNELDTKNLI 447 Query: 331 QFLKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + +K AS F++V + L + +G+ + Sbjct: 448 KEIKGIASLPTSKYFFNVSSEAALLEEAGTLGERLFS 484 >gi|316968556|gb|EFV52821.1| integrator complex subunit 6 [Trichinella spiralis] Length = 702 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 32/159 (20%), Positives = 66/159 (41%), Gaps = 13/159 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S+SMN + LG A +I +L P+ R L+TF + + Sbjct: 4 ILFLVDNSVSMNQVSYQRISFLGYAQHAIETILKYRSRDPNARGFDRYMLLTF-DRPPKN 62 Query: 231 FPLAWGVQHIQ--EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK--- 285 W E++ L +++ T L A++ + + + G+ Y + Sbjct: 63 IKAGWKESQTVFMEELRNLKAENSSSITAPLSSAFHLLNVNRLQSGIDNFGYGRYPQWLE 122 Query: 286 --YIIFLTDGENSSPNIDNKESLFYCNEAK--RRGAIVY 320 ++I TDG +D+ +LF ++ + G+ +Y Sbjct: 123 QAFVILFTDG---HSFVDDDGTLFDIDKIRLSSYGSELY 158 >gi|297465782|ref|XP_609132.4| PREDICTED: collagen, type VI, alpha 3-like isoform 1, partial [Bos taurus] Length = 1803 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 91/249 (36%), Gaps = 20/249 (8%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 NI+R+ +I D + + + + AP + + + Sbjct: 148 NIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAPPGVDTPSPSRPEKK 207 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S D R + E++D + + + ++ GLV ++S Sbjct: 208 KA-DIVFLLDGS------INFRRDSFQEVLRFVSEIVDTV---YEGGDSIQVGLVQYNSD 257 Query: 227 IVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L Q I + IN++++ + + + E ++ Sbjct: 258 PTDEFFLKDFPTKQQIIDAINKVVYKGGRHANT--KVGLEHLRRNHFVPEAGSRLDQRVP 315 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 + +T G++ + +L +RG V+A+GV+ +++ K ++ + Sbjct: 316 QIAFVITGGKSVEDAQEASMALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFR 369 Query: 345 VQNSRKLHD 353 V N ++L + Sbjct: 370 VGNVQELSE 378 >gi|297473452|ref|XP_002686619.1| PREDICTED: collagen, type VI, alpha 3-like isoform 4 [Bos taurus] gi|296488814|gb|DAA30927.1| collagen, type VI, alpha 3-like isoform 4 [Bos taurus] Length = 2555 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 91/249 (36%), Gaps = 20/249 (8%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 NI+R+ +I D + + + + AP + + + Sbjct: 969 NIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAPPGVDTPSPSRPEKK 1028 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S D R + E++D + + + ++ GLV ++S Sbjct: 1029 KA-DIVFLLDGS------INFRRDSFQEVLRFVSEIVDTV---YEGGDSIQVGLVQYNSD 1078 Query: 227 IVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L Q I + IN++++ + + + E ++ Sbjct: 1079 PTDEFFLKDFPTKQQIIDAINKVVYKGGRHANT--KVGLEHLRRNHFVPEAGSRLDQRVP 1136 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 + +T G++ + +L +RG V+A+GV+ +++ K ++ + Sbjct: 1137 QIAFVITGGKSVEDAQEASMALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFR 1190 Query: 345 VQNSRKLHD 353 V N ++L + Sbjct: 1191 VGNVQELSE 1199 >gi|262272104|gb|ACY40027.1| MIP14152p [Drosophila melanogaster] Length = 603 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 76/200 (38%), Gaps = 36/200 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM + + + +LD + VN VV++ Sbjct: 48 AASSPKDIMILLDASSSMTEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKT 101 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 102 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGA---- 155 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQF- 332 + I+ +T E++S + + K+ ++ + +++ + Sbjct: 156 -GSQCNQAIMLIT--ESTSESHK--------DVIKQYNWPHMPVRIFTYLIGSDSGSRSN 204 Query: 333 LK--NCASPDRFYSVQNSRK 350 L C++ F + + + Sbjct: 205 LHDMACSNKGFFVQINDYDE 224 >gi|195579432|ref|XP_002079566.1| GD21946 [Drosophila simulans] gi|194191575|gb|EDX05151.1| GD21946 [Drosophila simulans] Length = 1100 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 76/200 (38%), Gaps = 36/200 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM + + + +LD + VN VV++ Sbjct: 149 AASSPKDIMILLDASSSMTEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKT 202 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 203 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGA---- 256 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQF- 332 + I+ +T E++S + + K+ ++ + +++ + Sbjct: 257 -GSQCNQAIMLIT--ESTSESHK--------DVIKQYNWPHMPVRIFTYLIGSDSGSRSN 305 Query: 333 LK--NCASPDRFYSVQNSRK 350 L C++ F + + + Sbjct: 306 LHDMACSNKGFFVQINDYDE 325 >gi|195475490|ref|XP_002090017.1| GE19394 [Drosophila yakuba] gi|194176118|gb|EDW89729.1| GE19394 [Drosophila yakuba] Length = 1136 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 76/200 (38%), Gaps = 36/200 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM + + + +LD + VN VV++ Sbjct: 149 AASSPKDIMILLDASSSMTEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKT 202 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 203 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGA---- 256 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQF- 332 + I+ +T E++S + + K+ ++ + +++ + Sbjct: 257 -GSQCNQAIMLIT--ESTSESHK--------DVIKQYNWPHMPVRIFTYLIGSDSGSRSN 305 Query: 333 LK--NCASPDRFYSVQNSRK 350 L C++ F + + + Sbjct: 306 LHDMACSNKGFFVQINDYDE 325 >gi|195338633|ref|XP_002035929.1| GM14254 [Drosophila sechellia] gi|194129809|gb|EDW51852.1| GM14254 [Drosophila sechellia] Length = 1119 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 76/200 (38%), Gaps = 36/200 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM + + + +LD + VN VV++ Sbjct: 149 AASSPKDIMILLDASSSMTEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKT 202 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 203 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGA---- 256 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQF- 332 + I+ +T E++S + + K+ ++ + +++ + Sbjct: 257 -GSQCNQAIMLIT--ESTSESHK--------DVIKQYNWPHMPVRIFTYLIGSDSGSRSN 305 Query: 333 LK--NCASPDRFYSVQNSRK 350 L C++ F + + + Sbjct: 306 LHDMACSNKGFFVQINDYDE 325 >gi|194857574|ref|XP_001968984.1| GG24200 [Drosophila erecta] gi|190660851|gb|EDV58043.1| GG24200 [Drosophila erecta] Length = 1136 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 76/200 (38%), Gaps = 36/200 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM + + + +LD + VN VV++ Sbjct: 149 AASSPKDIMILLDASSSMTEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKT 202 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 203 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGA---- 256 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQF- 332 + I+ +T E++S + + K+ ++ + +++ + Sbjct: 257 -GSQCNQAIMLIT--ESTSESHK--------DVIKQYNWPHMPVRIFTYLIGSDSGSRSN 305 Query: 333 LK--NCASPDRFYSVQNSRK 350 L C++ F + + + Sbjct: 306 LHDMACSNKGFFVQINDYDE 325 >gi|161076922|ref|NP_001097164.1| CG4587, isoform C [Drosophila melanogaster] gi|320545115|ref|NP_001188817.1| CG4587, isoform D [Drosophila melanogaster] gi|157400167|gb|ABV53684.1| CG4587, isoform C [Drosophila melanogaster] gi|318068460|gb|AAF53476.3| CG4587, isoform D [Drosophila melanogaster] Length = 1243 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 76/200 (38%), Gaps = 36/200 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM + + + +LD + VN VV++ Sbjct: 253 AASSPKDIMILLDASSSMTEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKT 306 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 307 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGA---- 360 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQF- 332 + I+ +T E++S + + K+ ++ + +++ + Sbjct: 361 -GSQCNQAIMLIT--ESTSESHK--------DVIKQYNWPHMPVRIFTYLIGSDSGSRSN 409 Query: 333 LK--NCASPDRFYSVQNSRK 350 L C++ F + + + Sbjct: 410 LHDMACSNKGFFVQINDYDE 429 >gi|161076920|ref|NP_001097163.1| CG4587, isoform B [Drosophila melanogaster] gi|157400166|gb|ABV53683.1| CG4587, isoform B [Drosophila melanogaster] Length = 1209 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 76/200 (38%), Gaps = 36/200 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM + + + +LD + VN VV++ Sbjct: 202 AASSPKDIMILLDASSSMTEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKT 255 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 256 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGA---- 309 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQF- 332 + I+ +T E++S + + K+ ++ + +++ + Sbjct: 310 -GSQCNQAIMLIT--ESTSESHK--------DVIKQYNWPHMPVRIFTYLIGSDSGSRSN 358 Query: 333 LK--NCASPDRFYSVQNSRK 350 L C++ F + + + Sbjct: 359 LHDMACSNKGFFVQINDYDE 378 >gi|149176863|ref|ZP_01855473.1| hypothetical protein PM8797T_13972 [Planctomyces maris DSM 8797] gi|148844300|gb|EDL58653.1| hypothetical protein PM8797T_13972 [Planctomyces maris DSM 8797] Length = 291 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 42/116 (36%), Gaps = 10/116 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L +M+++D+S S FG + L ++ Sbjct: 58 NVTARSGEPYVKLFREERELSVMLLIDLSGS--QSFGTNQQTKREVVTEVGATL----AM 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEY 261 + N + GL F+ I ++ P G +H+ I + G+ T LE+ Sbjct: 112 SAIKNNDKVGLTLFTDHIEKSVPARKGSRHVLRLIREMLYCEPMGTGTDIRQALEH 167 >gi|123468942|ref|XP_001317686.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121900426|gb|EAY05463.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 50/316 (15%), Positives = 105/316 (33%), Gaps = 52/316 (16%) Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK-----DYNLSAVSRY 133 S + K I ++ ++ F+ +N + + + +D + Sbjct: 143 LSTKYQKGIMTNEYSDKPDSFHFSLKVNTQKELSDFKVSVDGTKNVIDSHNATFETNEAP 202 Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKS------DIGLDMMMVLDVSLSMNDHFGP 187 + IF P + + IS+ + + V+D S SM+ Sbjct: 203 KKDAIFIETPIKDEDKSIAVSSDGYIAISTNPSFSGKIESNSEFYFVVDCSGSMSG---- 258 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQTFPLAWGVQHIQEKINR 246 +++ M I+S+P R ++ F +S P + +++ + +N Sbjct: 259 -----ARIINAVKCMRLFIQSLPLG---CRFSIIKFGTSFETVLQPCDYSDENVDKALNL 310 Query: 247 LIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 L T L++ K + K I LTDGE ++P+I Sbjct: 311 LKSVNAKMGGTDILSPLQH------------IAGLKPQPGFVKQIFLLTDGEVNNPDITC 358 Query: 303 KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF----- 355 +L NE + +++IG+ + A +K A S + + + +++ Sbjct: 359 ATALKNRNENR-----IFSIGLGSGADPGLIKGLAKKSGGNYIMIADEDNMNEKVITLLS 413 Query: 356 LRIGKEMVKQRILYNK 371 I I +K Sbjct: 414 SAIAPAATNISIQTDK 429 >gi|297473450|ref|XP_002686618.1| PREDICTED: collagen, type VI, alpha 3-like isoform 3 [Bos taurus] gi|296488813|gb|DAA30926.1| collagen, type VI, alpha 3-like isoform 3 [Bos taurus] Length = 2962 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 91/249 (36%), Gaps = 20/249 (8%) Query: 107 NIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSD 166 NI+R+ +I D + + + + AP + + + Sbjct: 1376 NIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAPPGVDTPSPSRPEKK 1435 Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S D R + E++D + + + ++ GLV ++S Sbjct: 1436 KA-DIVFLLDGS------INFRRDSFQEVLRFVSEIVDTV---YEGGDSIQVGLVQYNSD 1485 Query: 227 IVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L Q I + IN++++ + + + E ++ Sbjct: 1486 PTDEFFLKDFPTKQQIIDAINKVVYKGGRHANT--KVGLEHLRRNHFVPEAGSRLDQRVP 1543 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 + +T G++ + +L +RG V+A+GV+ +++ K ++ + Sbjct: 1544 QIAFVITGGKSVEDAQEASMALT------QRGVKVFAVGVRNIDSEEVGKIASNSATAFR 1597 Query: 345 VQNSRKLHD 353 V N ++L + Sbjct: 1598 VGNVQELSE 1606 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 85/223 (38%), Gaps = 26/223 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 I C SS AP +I + D++ ++D S G V Sbjct: 208 IVGNLVACVRSSMAPERAGG-TEIPKDITAQDSADIIFLIDGSN------NTGSVNFAVI 260 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQH-IQEKINRLIFGSTT 253 + +L+ + +R G+V +S + F L ++ + + + + L F Sbjct: 261 LDFLVNLLERLSI---GTQQIRVGVVQYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGE 317 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + GL A + + + ++ + + ++ ++ G +S D +L + Sbjct: 318 LANVGL--ALDFVVENHFTRAGGSRAEEGVPQVLVLISAGPSSDEIRDGVIALKQAS--- 372 Query: 314 RRGAIVYAIGVQAEAADQF-LKNCASPDR-FYSVQNSRKLHDA 354 V++ G+ A+AA + L++ A+ D ++V R L D Sbjct: 373 -----VFSFGLGAQAASKAELQHIATNDNLVFTVPEFRSLGDV 410 >gi|53713712|ref|YP_099704.1| hypothetical protein BF2421 [Bacteroides fragilis YCH46] gi|60681983|ref|YP_212127.1| hypothetical protein BF2503 [Bacteroides fragilis NCTC 9343] gi|253565660|ref|ZP_04843115.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] gi|52216577|dbj|BAD49170.1| conserved hypothetical protein [Bacteroides fragilis YCH46] gi|60493417|emb|CAH08203.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343] gi|251945939|gb|EES86346.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 289 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKPDSRRTNIRLALEYLTNVMK 173 >gi|291395335|ref|XP_002714014.1| PREDICTED: integrin alpha 2 [Oryctolagus cuniculus] Length = 1207 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 70/210 (33%), Gaps = 33/210 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 196 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 244 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFDAKEKLEHIAKGHDD-YKKY 286 F + + K + S T G L + I A++ A G K Sbjct: 245 VVF----NLNTFKTKDGMITATSQTYQYGGDLTNTFKAIQFARDFAYSAASGGRPGATKV 300 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---LKNCA 337 ++ +TDGE + D ++ + + I V A +K A Sbjct: 301 MVVVTDGE----SHDGSMLRAVIDQCNNDNILRFGIAVLGYLNRNALDTKNLIKEIKAIA 356 Query: 338 S---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 S F++V + L + +G+++ Sbjct: 357 SIPTERYFFNVSDEAALLEKAGTLGEQIFS 386 >gi|37654264|gb|AAQ96237.1| LRRGT00024 [Rattus norvegicus] Length = 344 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 46/145 (31%), Gaps = 9/145 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 153 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 210 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 211 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 270 Query: 284 KKYIIFLTDGENSSPNIDNKESLFY 308 II +TDG + ++ L Sbjct: 271 PAIIITITDGSKLTTTSGVQDELKE 295 >gi|78186669|ref|YP_374712.1| hypothetical protein Plut_0797 [Chlorobium luteolum DSM 273] gi|78166571|gb|ABB23669.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 349 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 48/168 (28%), Gaps = 8/168 (4%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 + +G +IL A+ LPV+ L ++ + + +L D + L A + + Sbjct: 13 LQSERGGAAILFALTLPVLLGFAALAVDLARIHLTRVELQNAADAAALGGARSLSDSGGN 72 Query: 70 NNGKKQKNDFSYRIIKNIWQT--DFRNELRENGF---AQDINNIERSTSLSIIIDDQHKD 124 + I + ++ L E G+ + + + Sbjct: 73 PYNWSAAGSAALDIARRNVANGAGIQDALIETGYWNIQDPSEGLRAPGTPGVPAAGDVAA 132 Query: 125 YNLSAVSRYEM---PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 ++ + P P + + +V GL Sbjct: 133 VQVTITISRTLNNGPLRLFFAPVLGIAEQDVQGSSVAVIAPPAGGTGL 180 >gi|332970884|gb|EGK09861.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 440 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 63/207 (30%), Gaps = 40/207 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS----- 225 + ++LD S SM G K+ VA ++ + +V+ L+ + Sbjct: 130 VSILLDASGSMAAQVSGG-RKMDVAKDAVETFVSTFPEEANVS------LIAYGHKGSNS 182 Query: 226 ---------KIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 +I + +PL + + + T ++ + + +K Sbjct: 183 QKDKVLSCSEIEEIYPLSTYDQSTFSNALKTVDATGWTPLADAIKKSGKTLKANADKNS- 241 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQA-EAADQF 332 K + ++DG + KE ++ G V IG A Q Sbjct: 242 --------KNVVYIVSDGLETCGGHPAKE----AEALQKDGISATVNIIGFDVNNAEQQA 289 Query: 333 LKNC--ASPDRFYSVQNSRKLHDAFLR 357 LK A F S + L F Sbjct: 290 LKEVAEAGGGTFTSANSKSDLESYFES 316 >gi|309774616|ref|ZP_07669641.1| cell wall surface anchor family protein [Erysipelotrichaceae bacterium 3_1_53] gi|308917647|gb|EFP63362.1| cell wall surface anchor family protein [Erysipelotrichaceae bacterium 3_1_53] Length = 613 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 72/234 (30%), Gaps = 50/234 (21%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV---- 228 +V+D S SM D + ++ A +++ + + ++ +V F++ Sbjct: 49 LVIDTSGSMGD-YQRLINAKNTAKEFVKKY-----AGDEPDSGRYLAIVNFATNTNIILN 102 Query: 229 -QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN----KIFDAKEKLEHIAKGHDDY 283 A G IN+L T G++ A E I Sbjct: 103 WTDVSSADGKASADNAINQLYADGGTNLHAGIKQASQLFDDTAIQDIESKNTIVLTDGAP 162 Query: 284 KKYI---------IFLT-------------DGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 Y+ IF T DG+ S I++ + K + VY Sbjct: 163 TYYLKNCTSDINCIFYTHVTISNTRYHVGGDGDKGSETINDA-TAAEAKTLKGKS-TVYT 220 Query: 322 IGVQAEAA---------DQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVK 364 I A +LKN + + Y +S L DAF I + + Sbjct: 221 ICYGASREMTYQGGPTVSAYLKNNIASEMKNAYDADDSDDLVDAFKAITETITS 274 >gi|260807874|ref|XP_002598733.1| hypothetical protein BRAFLDRAFT_230653 [Branchiostoma floridae] gi|229284007|gb|EEN54745.1| hypothetical protein BRAFLDRAFT_230653 [Branchiostoma floridae] Length = 794 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 60/206 (29%), Gaps = 37/206 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M++VLD S SM ++ VA IR V + G+V FS++ Sbjct: 298 MVLVLDSSGSMRGERIQKLN--QVAQHFIRNT---------VADGSWLGIVDFSTRATTA 346 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + T GL + Sbjct: 347 HALIQVSDDSARDQLAAAVPNSTL-GWTCIGCGLLEGIQVLEANGRNAAGG--------- 396 Query: 286 YIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD---- 340 ++ ++DG EN PNI +G IV I A A Sbjct: 397 ILLVISDGEENQHPNITEAMPTVL-----DKGVIVDTIAYSDAADANLESLAARTGGMAF 451 Query: 341 RFYSVQNSRKLHDAF-LRIGKEMVKQ 365 + NS L+DAF + ++ Sbjct: 452 FYPEGDNSTALNDAFTATVAARASEE 477 >gi|113866742|ref|YP_725231.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] gi|113525518|emb|CAJ91863.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] Length = 417 Score = 45.6 bits (106), Expect = 0.011, Method: Composition-based stats. Identities = 14/87 (16%), Positives = 32/87 (36%), Gaps = 2/87 (2%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +G+++I+ + L V+ +GL ++ + K++L +D L A + Sbjct: 17 QRGAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVT--GATPLL 74 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELREN 99 + + E+ EN Sbjct: 75 VSEAAGLATGTSNAALFQGKAVEMFEN 101 >gi|301764623|ref|XP_002917727.1| PREDICTED: integrator complex subunit 6-like isoform 2 [Ailuropoda melanoleuca] Length = 850 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|297694110|ref|XP_002824337.1| PREDICTED: integrator complex subunit 6-like isoform 2 [Pongo abelii] Length = 887 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|297274484|ref|XP_001105975.2| PREDICTED: integrator complex subunit 6-like isoform 4 [Macaca mulatta] Length = 883 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|301764621|ref|XP_002917726.1| PREDICTED: integrator complex subunit 6-like isoform 1 [Ailuropoda melanoleuca] gi|281346272|gb|EFB21856.1| hypothetical protein PANDA_006082 [Ailuropoda melanoleuca] Length = 887 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|254517645|ref|ZP_05129701.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226911394|gb|EEH96595.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 979 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 52/367 (14%), Positives = 100/367 (27%), Gaps = 143/367 (38%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK--------SDIGLDMMMVLDV 177 +L+ + E+P F ++S P ++ + + K + ++++VLD+ Sbjct: 27 SLNVKATGEIPDKP-DFDLEISASPNPAMVGEDITVGGKIIPKPFETAIPAKEIVLVLDI 85 Query: 178 SLSMNDHFGPGMD----------------------------------------------- 190 S SM++ Sbjct: 86 SGSMDEEIENPCTNKRVRYCTRHSSSDPNHEEWFLSWHRWINDYCVEHNTSGEHNITANN 145 Query: 191 -KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK------ 243 K+ R+ ++ +K +P++ G+V +SS G + ++ Sbjct: 146 KKIDELKRAANGFIERMKDVPNLK----IGIVAYSSIATINPNSKSGTKKVKSLDSNSSH 201 Query: 244 -----------------------INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 IN L T G+ A + G Sbjct: 202 DVTNYNSLGANFLQSNDSRLTSVINNLEALGGTNIGEGMRKAVYML----------DSGD 251 Query: 281 DDYKKYIIFLTDG-----------ENSSPNIDNKE-----------------SLFYCNEA 312 K I+ +TDG +N+ IDN E S Sbjct: 252 KSASKTIVLMTDGLPTFYSVTGSNKNNYMTIDNTEPKIAGIGTGLDTKSINYSRAVGEII 311 Query: 313 KRRGAIVYAIGVQ--AEAADQFL---------KNCASPDRF----YSVQNSRKLHDAFLR 357 K RG ++IG + D L PD + + ++ + F R Sbjct: 312 KSRGYNSFSIGYGLDTDGNDTLLSIHEAMTGVSIKGKPDLYESSGFFPTSTNAIQAVFNR 371 Query: 358 IGKEMVK 364 I +++ Sbjct: 372 IATQILD 378 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 35/158 (22%), Positives = 68/158 (43%), Gaps = 28/158 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++VLDVS M D L + S+ L ++ + + GL+TFS+++ Q Sbjct: 520 DIVIVLDVSQEMKDS-------LTIVKNSLFNKLLNKDALKISKS--QYGLITFSNQVKQ 570 Query: 230 TFPLAWGVQHIQ-EKINRLIFGSTTKS-TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + ++ I L T S TP + ++ I + G D KK + Sbjct: 571 EIPLTDNITNLNDNYIKSL----ATDSLTPNISKTFDSITKV------LNSGRADAKKNV 620 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 IF++ G+ S + + + + +G + ++ + Sbjct: 621 IFISTGQASYTDSELS-------KLRDKGYNIVSLSMN 651 >gi|194384430|dbj|BAG59375.1| unnamed protein product [Homo sapiens] Length = 289 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|194384068|dbj|BAG64807.1| unnamed protein product [Homo sapiens] Length = 170 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|148704134|gb|EDL36081.1| integrator complex subunit 6 [Mus musculus] Length = 884 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|118431122|ref|NP_147355.2| hypothetical protein APE_0605.1 [Aeropyrum pernix K1] gi|116062446|dbj|BAA79575.2| hypothetical protein [Aeropyrum pernix K1] Length = 195 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 62/171 (36%), Gaps = 20/171 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + G D++ +D+S SM G G+ + S + + + R G+V F Sbjct: 7 NRGYDVVWAIDLSKSMARTAG-GLGASKLKVSSGIIAMASSRILS--RPGSRVGIVGFHD 63 Query: 226 KIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + + + + L G + G+ + + + + Sbjct: 64 RAFPILPSTDNYRRVLDSLTLLRAVGEGSAGGDGIVESVKMLRGSGR------------E 111 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 ++++ ++DG ++ +++ A G ++ I V + D ++ Sbjct: 112 RHVVMVSDGGFNTGIPIPLATIY----ALNMGVRLHFIIVGGQPGDVVKRS 158 >gi|157412071|ref|YP_001481411.1| TerY2 [Escherichia coli APEC O1] gi|226807625|ref|YP_002791319.1| TerY2 [Enterobacter cloacae] gi|226809935|ref|YP_002791629.1| TerY2 [Enterobacter cloacae] gi|99867096|gb|ABF67741.1| TerY2 [Escherichia coli APEC O1] gi|226425850|gb|ACO53943.1| TerY2 [Enterobacter cloacae] gi|226426161|gb|ACO54253.1| TerY2 [Enterobacter cloacae] Length = 213 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 58/172 (33%), Gaps = 20/172 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +V+D S SM + I+ ML ++ P V ++T+ ++ Sbjct: 4 RLPVYLVIDTSGSMRGE------SIHSVNVGIQAMLSALRQDPYALESVHISIITYDNEA 57 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL I + T + LE + + + KG + Sbjct: 58 REFIPLTPLEDFQFSDI-VVPSAGGTFTGAALECLMQCVERDVRRSDGDTKGDWRP--LV 114 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKR---RGA-IVYAIGVQAEAADQFLKN 335 +TDG ++L Y K RG + A V +A + LK Sbjct: 115 FLMTDG-------TPSDALAYGEAVKAIRGRGFGSIIACAVGPKAGHEHLKQ 159 >gi|33872145|gb|AAH13358.1| INTS6 protein [Homo sapiens] Length = 827 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|35193175|gb|AAH58637.1| Ints6 protein [Mus musculus] Length = 874 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|51467749|ref|NP_032741.2| integrator complex subunit 6 [Mus musculus] gi|81885556|sp|Q6PCM2|INT6_MOUSE RecName: Full=Integrator complex subunit 6; Short=Int6; AltName: Full=DBI-1; AltName: Full=Protein DDX26 gi|37589266|gb|AAH59263.1| Integrator complex subunit 6 [Mus musculus] Length = 883 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|38347890|ref|NP_941139.1| putative tellurium resistance protein [Serratia marcescens] gi|156933958|ref|YP_001437874.1| hypothetical protein ESA_01784 [Cronobacter sakazakii ATCC BAA-894] gi|190410232|ref|YP_001965733.1| terY2 [Klebsiella pneumoniae] gi|237728579|ref|ZP_04559060.1| TerY2 [Citrobacter sp. 30_2] gi|38259367|emb|CAE51592.1| putative tellurium resistance protein [Serratia marcescens] gi|146151025|gb|ABQ02791.1| terY2 [Klebsiella pneumoniae] gi|156532212|gb|ABU77038.1| hypothetical protein ESA_01784 [Cronobacter sakazakii ATCC BAA-894] gi|226910057|gb|EEH95975.1| TerY2 [Citrobacter sp. 30_2] Length = 212 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 58/172 (33%), Gaps = 20/172 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +V+D S SM + I+ ML ++ P V ++T+ ++ Sbjct: 3 RLPVYLVIDTSGSMRGE------SIHSVNVGIQAMLSALRQDPYALESVHISIITYDNEA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL I + T + LE + + + KG + Sbjct: 57 REFIPLTPLEDFQFSDI-VVPSAGGTFTGAALECLMQCVERDVRRSDGDTKGDWRP--LV 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKR---RGA-IVYAIGVQAEAADQFLKN 335 +TDG ++L Y K RG + A V +A + LK Sbjct: 114 FLMTDG-------TPSDALAYGEAVKAIRGRGFGSIIACAVGPKAGHEHLKQ 158 >gi|11024694|ref|NP_036273.1| integrator complex subunit 6 isoform a [Homo sapiens] gi|74753376|sp|Q9UL03|INT6_HUMAN RecName: Full=Integrator complex subunit 6; Short=Int6; AltName: Full=DBI-1; AltName: Full=Protein DDX26; AltName: Full=Protein deleted in cancer 1; Short=DICE1 gi|6062874|gb|AAF03046.1| candidate tumor suppressor protein DICE1 [Homo sapiens] gi|24980821|gb|AAH39829.1| Integrator complex subunit 6 [Homo sapiens] gi|55957338|emb|CAI12905.1| integrator complex subunit 6 [Homo sapiens] gi|55958605|emb|CAI15586.1| integrator complex subunit 6 [Homo sapiens] gi|78100165|tpg|DAA05730.1| TPA_exp: integrator complex subunit 6 [Homo sapiens] gi|119629281|gb|EAX08876.1| integrator complex subunit 6, isoform CRA_d [Homo sapiens] gi|119629282|gb|EAX08877.1| integrator complex subunit 6, isoform CRA_d [Homo sapiens] Length = 887 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|308472829|ref|XP_003098641.1| hypothetical protein CRE_04228 [Caenorhabditis remanei] gi|308268241|gb|EFP12194.1| hypothetical protein CRE_04228 [Caenorhabditis remanei] Length = 395 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 66/190 (34%), Gaps = 12/190 (6%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S++ LD++ V+D S MN ++++ S+ I S + R GLVT++ Sbjct: 35 SNLWLDVVAVVDNSQGMN---NGKLNEVTSNILSVFMSGTRIGSDANEPRTTRLGLVTYN 91 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE--KLEHIAKGHDD 282 + Q L Q I + N + ++ Y + A+ + + Sbjct: 92 NVASQKADL-NQYQSIADAANGIFVALSSTVGTSESYLATGLEMAERMFNEQSVNTTRAH 150 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNCASP 339 Y+K +I +D L N K + + Q D L ASP Sbjct: 151 YRKVVIVYASEYKGDGELDP---LPVSNRLKLSNVDIITVAYQQSGDDGLFESLSQIASP 207 Query: 340 DRFYSVQNSR 349 + N + Sbjct: 208 GFSFINDNGQ 217 >gi|291408281|ref|XP_002720452.1| PREDICTED: DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Oryctolagus cuniculus] Length = 862 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 48/143 (33%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLD 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 +I +TDG+ + +E L Sbjct: 122 PSILITITDGKKLTSTAGVQEEL 144 >gi|229125374|ref|ZP_04254461.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] gi|228658081|gb|EEL13834.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] Length = 452 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 71/204 (34%), Gaps = 22/204 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + L++ ++LD S SM G K+ A ++I LD I +V V + + Sbjct: 148 KEKSLNVEILLDASGSMAGKVN-GQVKMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNN 206 Query: 225 SKIVQTFPLAWGVQHI--------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L+ G + +E+ N + K L A + D ++ Sbjct: 207 EN---DKSLSCGSSEVMYPLQPYKKEQFNAALSNFGPKGWTPLASAIESVNDDFKEYTGE 263 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKN 335 + YI+ +DGE + + + + IG + Q LKN Sbjct: 264 ENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSSTHAVVNI--IGFDVKNSEQQQLKN 316 Query: 336 CASP--DRFYSVQNSRKLHDAFLR 357 A + +V N+ +L+ + Sbjct: 317 TAEAGKGNYATVSNADELYQTLNK 340 >gi|229588184|ref|YP_002870303.1| hypothetical protein PFLU0636 [Pseudomonas fluorescens SBW25] gi|229360050|emb|CAY46904.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 651 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 47/143 (32%), Gaps = 4/143 (2%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 +G+I ++ A L V + M LV+++ + K KL I D S L A + Sbjct: 4 RLRSRQRGAIGLMAAGTLAVALVFMLLVVDSGRLYMEKRKLQSIADTSALEAAGR----G 59 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + ND++ T + + S+ + D K+ + Sbjct: 60 GLCSPTTTANDYAKENATRNGFTVVAGDNSRGLVVTCGLLTTNANSVRVFTPDATKNDAV 119 Query: 128 SAVSRYEMPFIFCTFPWCANSSH 150 V+ + T W S Sbjct: 120 RVVATRSVMTSIATGIWSMFSGA 142 >gi|56675030|gb|AAW19657.1| matrilin-3 [Cervus elaphus] Length = 146 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 30/149 (20%), Positives = 53/149 (35%), Gaps = 19/149 (12%) Query: 218 SGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLE 274 +V ++S + F L Q ++ + R+ + T S ++ A ++ F Sbjct: 1 VAVVNYASTVKIEFHLQTHSDKQSLKRAVARITPLSTGTMSGLAIQTAMDEAFT---VEA 57 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + K I +TDG + A+ G +YA+GV A + LK Sbjct: 58 GARGPSSNIPKVAIIVTDGRPQD------QVNEVAARARASGIELYAVGVD-RADMESLK 110 Query: 335 NCAS---PDRFYSVQN---SRKLHDAFLR 357 AS + + V+ KL F Sbjct: 111 MMASEPLDEHVFYVETYGVIEKLSSRFQE 139 >gi|293351261|ref|XP_002727737.1| PREDICTED: RIKEN cDNA 6330505F04-like [Rattus norvegicus] Length = 856 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 47/143 (32%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 +I +TDG + +E L Sbjct: 122 PSILITITDGNKLTSTASVQEEL 144 >gi|290999659|ref|XP_002682397.1| predicted protein [Naegleria gruberi] gi|284096024|gb|EFC49653.1| predicted protein [Naegleria gruberi] Length = 1065 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 60/176 (34%), Gaps = 28/176 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +SK+ L +++ LD S SM + A ++ +L ++ + R + Sbjct: 78 TSKNQGKL-LIIALDKSGSMAGS------GISEAKLALETLLSNVEGCNE-----RILFI 125 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F S +++ + + ++ G T + + N ++ Sbjct: 126 VFDSNSELIDMTNMELENKLQVVKKVSAGGGTDFSSVFKIIRNYGGSLNGQVA------- 178 Query: 282 DYKKYIIFLTDGENSSPNIDNKE----SLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 IIF TDG++ + +E SL + + IG + + L Sbjct: 179 -----IIFFTDGQDQYSSNSTREGSIKSLQERLNTESESYEFHTIGFTSVHDARLL 229 >gi|198426777|ref|XP_002120162.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 367 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 78/231 (33%), Gaps = 47/231 (20%) Query: 145 CANSSHAPLLITSSVKISSKSDIG---LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + H + S+ SD +D++ VLD S S+N+ + + ++I + Sbjct: 140 YTGTGHGTVWSPLSIVEGFNSDCPHDQVDLLFVLDGSTSINEADPNNFNTVKNWVKNITK 199 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 DI S G + +L T + L Sbjct: 200 RFDITSS---------------------------GSAAVAMDQIKLRL-GATFTAAALSK 231 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A ++ KK ++ LTDG+++ D + + + A Sbjct: 232 ATTVFKNSSRF------NDPLTKKVLVLLTDGQSN----DREGLNASATQVRNLNITTIA 281 Query: 322 IGVQAEAADQFLKNCASP-----DRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 +GV+A+ + L+ A+ DR Y +++ L I +E+ K + Sbjct: 282 VGVKADVLQE-LQIIANGVIGNNDRVYQLRDFSNLDSIVQSIFQEIEKVSL 331 >gi|157265460|ref|YP_001468018.1| Von Willebrand factor type A domain [Thermus phage P74-26] gi|156905355|gb|ABU96998.1| Von Willebrand factor type A domain [Thermus phage P74-26] Length = 563 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 76/238 (31%), Gaps = 40/238 (16%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG--------------V 194 I+ ++I + + L +++++D S SM D Sbjct: 341 KALAANISPFMRIEPEEERSLHLVVLVDESGSMGTSIYGLTDISQGRSGEPFLGAASGVA 400 Query: 195 ATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGS 251 + R+ L I + N VR + PL ++++E + + Sbjct: 401 SNRAFLAKLTSIILYEGLKNADVRMQFFGYGDTA-LPPPLDELGNHEYVRELVLPYALAT 459 Query: 252 TTKST-----PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + L +A+ + + K + +I+L DGE I + + Sbjct: 460 ITHKSNNGDLSALSHAFGVLRASPAKH-----------RVVIYLADGE-----ISSSRLV 503 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + G VY + + + + + R + V + A I +E+ + Sbjct: 504 EAFRKVTSSGIKVYWLDLSG-RKRPYSSSALAAARRFVVNTFDDVLKAIQAIFEEVAE 560 >gi|157265344|ref|YP_001467903.1| Von Willebrand factor type A domain [Thermus phage P23-45] gi|156905239|gb|ABU96883.1| Von Willebrand factor type A domain [Thermus phage P23-45] Length = 563 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 76/238 (31%), Gaps = 40/238 (16%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG--------------V 194 I+ ++I + + L +++++D S SM D Sbjct: 341 KALAANISPFMRIEPEEERSLHLVVLVDESGSMGTSIYGLTDISQGRSGEPFLGAASGVA 400 Query: 195 ATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGS 251 + R+ L I + N VR + PL ++++E + + Sbjct: 401 SNRAFLAKLTSIILYEGLKNADVRMQFFGYGDTA-LPPPLDELGNHEYVRELVLPYALAT 459 Query: 252 TTKST-----PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T + L +A+ + + K + +I+L DGE I + + Sbjct: 460 ITHKSNNGDLSALSHAFGVLRASPAKH-----------RVVIYLADGE-----ISSSRLV 503 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + G VY + + + + + R + V + A I +E+ + Sbjct: 504 EAFRKVTSSGIKVYWLDLSG-RKRPYSSSALAAARRFVVNTFDDVLKAIQAIFEEVAE 560 >gi|52082206|ref|YP_080997.1| YwmD protein [Bacillus licheniformis ATCC 14580] gi|52005417|gb|AAU25359.1| YwmD [Bacillus licheniformis ATCC 14580] Length = 230 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 70/211 (33%), Gaps = 22/211 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 S +++ +VLD S SM G K +A S+ + D++ S +V L F Sbjct: 32 NSKKDVNVAVVLDASGSMAQKVE-GERKFDIAKESVTDFADLLSSDANVM------LNVF 84 Query: 224 SS---KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE-YAYNKIFDAKEKLEHIAKG 279 ++ G + F S + G++ ++ I + Sbjct: 85 GHKGNNKNSGKEVSCGTTETVYDLQPFSFNSFENALSGIKPTGWSPIAKSLYD-VKDDLA 143 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI----VYAIGVQAEAADQFLKN 335 D K Y+ +TDGE + + + + ++ + + V+ LK Sbjct: 144 DKDGKNYVYIVTDGEETCGG----DPVQAAKDLRKSNIKTIVNIVGLDVKTVKEKAKLKK 199 Query: 336 C--ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A + ++ + G ++ + Sbjct: 200 VADAGGGKLIEADSASDFKKVWKEEGVKLSQ 230 >gi|158318118|ref|YP_001510626.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158113523|gb|ABW15720.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 946 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 71/217 (32%), Gaps = 50/217 (23%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 A +S L + +D ++++DVS S+++ + + R+ Sbjct: 61 AAASSGSLSGPLPGSAGAADSGAMDAVILVDVSRSLDEP------TIALEARAAG----- 109 Query: 206 IKSIPDVNNVVRSGLVTFSSK-----------IVQTFPLAWG--------VQHIQEKINR 246 + + D++ R + F+S PLA + I R Sbjct: 110 VIAATDLSTRTRIAVTAFASMGPSGSSTARPAAETLCPLAPVDDPDDRDRISGCLSGIMR 169 Query: 247 LIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE--------NS 296 G T L+ ++ + A A G +K I LTDG+ + Sbjct: 170 RTAGQGQDTDYVTALKSGFDTLLSAP------ATGSGPARKVIFILTDGQLSTAGTDSSG 223 Query: 297 SPNIDNKESLFYCNE----AKRRGAIVYAIGVQAEAA 329 + D ES +E A++ G ++ G + Sbjct: 224 GSSTDRSESGQIRDEILPAARKAGIEIWPFGFGQKPN 260 >gi|332216197|ref|XP_003257231.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 2 [Nomascus leucogenys] Length = 769 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 73/208 (35%), Gaps = 36/208 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L +P + LV F Sbjct: 145 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKIL---GDMPPGDYFD---LVLFG 192 Query: 225 SKIVQTFPLAW----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 +++ +W ++ Q+ + T GL + +E L Sbjct: 193 TRVQ-----SWKGSLVQASEANLRAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLP 247 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ +I LTDG+ + D + L A R +Y +G FL+ Sbjct: 248 ELSNHAA----ILIMLTDGDPTEGVTDRSQILKNIRSAIRGRFPLYNLGFGHNVDFNFLE 303 Query: 335 NCASPDR-----FYSVQNSRKLHDAFLR 357 + + Y +++ + F Sbjct: 304 VMSMENNGRAQRIYEDRDATQQLQGFYS 331 >gi|332216195|ref|XP_003257230.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 1 [Nomascus leucogenys] Length = 911 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 73/208 (35%), Gaps = 36/208 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +++ V+D+S SM K+ ++ ++L +P + LV F Sbjct: 287 TNMNKNVVFVIDISGSMRGQ------KVKQTKEALLKIL---GDMPPGDYFD---LVLFG 334 Query: 225 SKIVQTFPLAW----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 +++ +W ++ Q+ + T GL + +E L Sbjct: 335 TRVQ-----SWKGSLVQASEANLRAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLP 389 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ +I LTDG+ + D + L A R +Y +G FL+ Sbjct: 390 ELSNHAA----ILIMLTDGDPTEGVTDRSQILKNIRSAIRGRFPLYNLGFGHNVDFNFLE 445 Query: 335 NCASPDR-----FYSVQNSRKLHDAFLR 357 + + Y +++ + F Sbjct: 446 VMSMENNGRAQRIYEDRDATQQLQGFYS 473 >gi|315182005|gb|ADT88918.1| hypothetical protein vfu_B00697 [Vibrio furnissii NCTC 11218] Length = 653 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 67/193 (34%), Gaps = 32/193 (16%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + P T S ++ + +V+D+SLS+ + K ++ ++LD Sbjct: 65 AVTALAGPSFETQPRPSYSANNARV---VVMDMSLSL---HATDI-KPNRLAQARYKVLD 117 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIFGSTTKSTPGLE 260 ++K P+ + +GLV ++ PL I + ++ + G++ Sbjct: 118 LLKGWPEGS----TGLVAYAGDAYTVSPLTNDSATIANLVPNLSPEIMPFPGANAATGVQ 173 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + +A II L D ++S N + L + Sbjct: 174 RAIEMLKNAGLNRGD-----------IILLADDLDASENKAIRALLEGTQW------KLM 216 Query: 321 AIGVQAEAADQFL 333 +G+ +A L Sbjct: 217 IVGIGTQAGAPIL 229 >gi|300870310|ref|YP_003785181.1| hypothetical protein BP951000_0681 [Brachyspira pilosicoli 95/1000] gi|300688009|gb|ADK30680.1| conserved hypothetical protein [Brachyspira pilosicoli 95/1000] Length = 315 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 61/175 (34%), Gaps = 26/175 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K GLD+++V D++ SM+ + K ++++ +++D + R G++ + Sbjct: 54 KRKKGLDLVLVGDLTGSMSAYREKLRSKFMELSKTLFQIIDNL----------RIGIIFY 103 Query: 224 SS------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 I + L+ V + IN + G + +E A N +++ Sbjct: 104 LDHGSGDPYITKVHELSVNVDSLLNFINNVPNGHGGDANEAVEDALNDVYNIN------- 156 Query: 278 KGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K I+ D PN I + + ++ + + Sbjct: 157 -WSQINSKSIVLFGDACPHEPNECIHKYDYFELTKKLYQKQVTINTVFCNTGYGG 210 >gi|91201645|emb|CAJ74705.1| hypothetical protein kuste3942 [Candidatus Kuenenia stuttgartiensis] Length = 336 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 57/180 (31%), Gaps = 39/180 (21%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 GL+++ VLDVS+SM ++L A I ++ + R GLV F+ Sbjct: 87 EKEGLEIVFVLDVSMSMLAEDVKP-NRLECAKMEIANLVRGL-------EDDRVGLVVFA 138 Query: 225 SKIVQTFPLAWG---------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ P + + E R + T L A + Sbjct: 139 ARAFSLLPYPTKDYEMVFLRILNMVNEHYVRFVPYG-TNIGNALIAAMETFSNEAG---- 193 Query: 276 IAKGHDDYKKYIIFLTDGENS----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 KK II LTDGE S ++ L + Y IG+ Sbjct: 194 --------KKIIILLTDGEEQLLRRSQVVEAIRLLLE-----KNDISTYIIGIGDPNNST 240 >gi|265764036|ref|ZP_06092604.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] gi|263256644|gb|EEZ27990.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] Length = 289 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKPDSRRTNIRLALEYLTNVMK 173 >gi|239831665|ref|ZP_04679994.1| Protein norD [Ochrobactrum intermedium LMG 3301] gi|239823932|gb|EEQ95500.1| Protein norD [Ochrobactrum intermedium LMG 3301] Length = 633 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 70/196 (35%), Gaps = 31/196 (15%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 L +T V +S +D +D VLDV + L + + Sbjct: 433 VHLMNRPKANDLAVTILVDVSLSTDAWIDNRRVLDVE-------KEALLVLANGIAACGD 485 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 I + VR + V+ F A+G ++ +I L G T+ + + Sbjct: 486 RCSIQTFTSRRRSWVRV-------ETVKDFDEAFGPA-VEHRIAALKPGFYTRMGAAIRH 537 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 + K+ + + KK ++ LTDG+ + ++S EA+ RG Sbjct: 538 STAKLAEQP-----------NRKKLLLVLTDGKPNDVDHYEGRFALEDSRRAVQEARARG 586 Query: 317 AIVYAIGVQAEAADQF 332 V+A+ V EA+ Sbjct: 587 VNVFAVTVDREASSYL 602 >gi|21672903|ref|NP_660968.1| hypothetical protein CT0062 [Chlorobium tepidum TLS] gi|21645957|gb|AAM71310.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 307 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 34/156 (21%), Positives = 50/156 (32%), Gaps = 31/156 (19%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 T W ++ L VKI ++ + +M+VLD S SM G +L +I Sbjct: 71 TIDWNTSAHKNDLY----VKIFTEERERI-LMLVLDGSGSMLFGSGRLKKELAAEVSAIL 125 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKST 256 + V N GL+ FS + P G H +N + G T Sbjct: 126 -------AFSAVQNNDMVGLLVFSDTVETYIPPRKGRAHALVILNEIFSMRQCGRKTDID 178 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L + + K I LTD Sbjct: 179 AALSF---------------LRRTQKRKSIIFLLTD 199 >gi|326670662|ref|XP_003199263.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1024 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 96/306 (31%), Gaps = 25/306 (8%) Query: 48 LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINN 107 L Y+ D+ ++ + S+ + + L+E G Sbjct: 88 LQYLRDNVFTASSGSRRVEGVPQLLILLSGARSFDNVDT-----PASSLKELGVLIFAIG 142 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI 167 S S + Q Y LS ++P + N + T+S I+ Sbjct: 143 SRSSDSQELQRISQEPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQ 202 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D++ +LD S + F ++ M++ + + + R +V +S + Sbjct: 203 RRDVVFLLDGSDGTRNGFP-------AMKDFVQRMVEKLDVAENRD---RISVVQYSREP 252 Query: 228 VQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + I + + L G A + D ++ + + Sbjct: 253 GANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTG--EALQYVRDNVFTASSGSRRLEGVPQ 310 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ L+ G + SL K G + + IG + + + + P SV Sbjct: 311 ILVLLSGGRSFDSVNAAASSL------KELGVLTFGIGSRGSDSRELQRISYEPSYALSV 364 Query: 346 QNSRKL 351 + +L Sbjct: 365 SDFSEL 370 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 91/312 (29%), Gaps = 25/312 (8%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 + L Y+ D+ ++ + S+ + + L+E G Sbjct: 282 LYTGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVNAA-----ASSLKELGV 336 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 S S + Y LS E+P + + P+ TS Sbjct: 337 LTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSPTVT 396 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + S D++ +LD S F D + ++ N R +V Sbjct: 397 AEYSTPRKDVVFLLDGSDGTRSSFLAMRDFVQRV----------VEKFNIEANRDRVSVV 446 Query: 222 TFSSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 +S F L + I +++ L G A + D ++ Sbjct: 447 QYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTG--AALQYVRDNIFTASSGSRR 504 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + + +I L+ G + SL K G + IG + + + + Sbjct: 505 LEGVPQILILLSGGRSFDSVDAAASSL------KELGVLTLGIGSRGSDSRELQRISYEA 558 Query: 340 DRFYSVQNSRKL 351 + SV + +L Sbjct: 559 NYALSVADFSEL 570 >gi|313227391|emb|CBY22538.1| unnamed protein product [Oikopleura dioica] Length = 455 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 75/206 (36%), Gaps = 33/206 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + ++ +D+ ++D S S+ + + DI+ + + + Sbjct: 256 APCEDNVAIDLAFIIDSSSSIGEDNFNLITSFVG---------DIVSNFTISETLTHIAV 306 Query: 221 VTFSSKIVQTFPLAW-----GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLE 274 + ++S++ P+ W I I+ + GS T++ L YA N + + Sbjct: 307 LRYNSQV---SPVLWFDTFNTKSEILAAISAINYSGSGTRTGKALGYAANYVLNPTYGAR 363 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 ++ + +TDGE+ ++ + A + + V AIG+ + ++ Sbjct: 364 ------EEVPTVTLVITDGES-------QDEITTAARALKSKSAVMAIGIGDAKETELVQ 410 Query: 335 NCASPDR--FYSVQNSRKLHDAFLRI 358 +SP + + N L + Sbjct: 411 MASSPLSFGYKRISNFEDLSSVTSDV 436 >gi|294789255|ref|ZP_06754493.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] gi|294482680|gb|EFG30369.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] Length = 346 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 66/172 (38%), Gaps = 14/172 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +V+DVS SM L I ++++ ++ P V ++ F+ + Sbjct: 3 RLPIFLVIDVSESMVGS------PLRHMQEGINQLINELRKDPYALETVYLSVIAFAGVV 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL ++ RL G+ T L + ++I KG DYK + Sbjct: 57 KTLAPL---MELYAFYPPRLPVGAGTSLGAALNHVMDEITQKVILSSPTQKG--DYKPIV 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 ++DG + N D + + + +R + ++G+ A L ++ Sbjct: 112 YLMSDGSATD-NPDAAIARWKRDFMQRA--TLVSVGIGPFADLSRLSEISTS 160 >gi|229583190|ref|YP_002841589.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|228013906|gb|ACP49667.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 452 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 57/168 (33%), Gaps = 26/168 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-GLVTFSSKIVQ 229 + ++LD S SM+ + +A S + + + +R + + VQ Sbjct: 290 IYLLLDKSGSMDGEKILWAKAVALALYSRAKR-------ENRDFYLRFFDNIPYPLIKVQ 342 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + + + E I ++ G T + + A I + KG + II Sbjct: 343 KNAKSKDIIKMVEYIGKIRGGGGTDISRSIISACEDIKEGH------VKGVSE----IIL 392 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 LTDGE+ + SL N I V + L+ + Sbjct: 393 LTDGEDKIAETTVRRSLKEANSQ--------LISVMIRGDNADLRRVS 432 >gi|119476362|ref|ZP_01616713.1| hypothetical protein GP2143_07924 [marine gamma proteobacterium HTCC2143] gi|119450226|gb|EAW31461.1| hypothetical protein GP2143_07924 [marine gamma proteobacterium HTCC2143] Length = 635 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 56/178 (31%), Gaps = 30/178 (16%) Query: 136 PFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKL 192 P++ W + P I + + D +++VLD+SLSM + P Sbjct: 64 PWLALLIAWIIAAVAMSGPTWIKLPQPVHQRQDA---LVIVLDLSLSMLAEDIKPS---- 116 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS- 251 R+ ++LDI+ + N S L+ +S PL + L Sbjct: 117 -RLVRARHKVLDIL----NHRNEGLSALIAYSGDAHVVSPLTDDNPTVANLAPALTPDMM 171 Query: 252 ---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + L A +A ++ LTDG + D L Sbjct: 172 PVYGSDPVAALTLAKQLFTNAGITSGR-----------VLLLTDGVTQNDVDDIDSEL 218 >gi|301163421|emb|CBW22972.1| conserved hypothetical protein [Bacteroides fragilis 638R] Length = 289 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKPDSRRTNIRLALEYLTNVMK 173 >gi|256811138|ref|YP_003128507.1| Magnesium chelatase [Methanocaldococcus fervens AG86] gi|256794338|gb|ACV25007.1| Magnesium chelatase [Methanocaldococcus fervens AG86] Length = 246 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 57/168 (33%), Gaps = 20/168 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 V+ + I ++ V+D S SM M ++ A +I +L + + N Sbjct: 51 IVEKVRQRKISSHILFVVDASGSMG-----AMKRMEAAKGAIISLL--LDAYQKRNK--- 100 Query: 218 SGLVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 G++ F + P V+ ++ + L G T +Y E Sbjct: 101 IGMIAFRKDRAELILPFTSSVELGEKLLKDLPTGGKTPLADAFIKSYEVF-----DREIR 155 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDN--KESLFYCNEAKRRGAIVYAI 322 + +I ++D + + + KE C + +G V I Sbjct: 156 KNPNIIP--IMIVISDFKPNVAVKGDYIKEVFDACEKIVEKGINVVLI 201 >gi|115889695|ref|XP_786483.2| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115970407|ref|XP_001195936.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 2064 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 59/188 (31%), Gaps = 37/188 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVTFSSKI 227 + +++DVS SM +H G +K+ R ++E L N+V+ G + + Sbjct: 581 QVYILIDVSNSMENHLGLVKEKM---IRLMQEQLRHKTKF----NLVKFGSKASGWKDRA 633 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 V+ + +Q + + L +T + L+ A D + I Sbjct: 634 VEVSESS--LQSAWQWVRGLEVAGSTNTLMALKTALA----------------DTATQAI 675 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD-----QFLKNCASPDRF 342 LTDG P R+ ++ I D L + Sbjct: 676 YLLTDGRPDHP-----PPTILAQVQLRKSIPIHCISFNCNDRDANDFMALLSSDTGGRYH 730 Query: 343 YSVQNSRK 350 Y + Sbjct: 731 YYADDLEA 738 >gi|313221459|emb|CBY32209.1| unnamed protein product [Oikopleura dioica] Length = 1138 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 61/173 (35%), Gaps = 19/173 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV--T 222 S G++ + ++D S SM ++ + + M++ +K N+V V Sbjct: 576 SSFGVNAIFLIDSSGSMMGE------RMEQTREAFKFMIEGLKPGDTF-NIVSFESVNKV 628 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 FS + ++ ++++ G T L A + K Sbjct: 629 FSDNRMVPVNDRSMFAALK-FMDQIQAGGATDVYAALVKASLLLSQNKRTSNQEN----- 682 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLK 334 I FLTDG ++ + L + A++ ++ I EA D +K Sbjct: 683 ---IIYFLTDGAPTAGVTNLNSILDMVDFIAQKSEIVMNTIAYGEEANDDKMK 732 >gi|229578924|ref|YP_002837322.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|284997528|ref|YP_003419295.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|228009638|gb|ACP45400.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|284445423|gb|ADB86925.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] Length = 356 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 71/197 (36%), Gaps = 34/197 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + ++++D S SM KL A +S +++L ++N L+ Sbjct: 32 QSSVTSSIHYIIMIDNSPSMRGE------KLNTAVQSAQKLL------YNLNEGNYVTLI 79 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FS+ +I + + ++ +A ++AK Sbjct: 80 LFSNHP---------------EIKYQGPAKGIITFDVGKGYTTRLHEAVSFTINLAKQSQ 124 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K II LTDG+ + D + Y + IG+ ++ LK A S Sbjct: 125 VPTK-IIMLTDGKPT----DKRNVKDYEKLDIPPNTQIITIGIGNNYNERILKKLADRSS 179 Query: 340 DRFYSVQNSRKLHDAFL 356 +FY +++ +L + F Sbjct: 180 GKFYHIKDISELPNIFE 196 >gi|254480742|ref|ZP_05093989.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214039325|gb|EEB79985.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 493 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 70/204 (34%), Gaps = 37/204 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++DVS M P + + + D K+ G+ F + Sbjct: 31 DLRILIDVSGGMKTS-DPRNRRASALVLLVNLLPDGAKA----------GIWVFGKEAEL 79 Query: 230 TFP-----LAWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P AW Q Q+ I L T LE A + + Y Sbjct: 80 LVPHGLVDAAW-RQRAQQAIATLKAAGQRTNIPAALEQATADL----------EQPTSGY 128 Query: 284 KKYIIFLTDGENS---SPNIDNKESLFYCN----EAKRRGAIVYAIGVQAEAADQFLKNC 336 + ++ LT G+ SP I+ ES N E G V+ I A+A L++ Sbjct: 129 RTSVLLLTAGKVDVAESPIINVNESRKLLNGRAVELGELGVPVHTIAFSAQADAMLLRSL 188 Query: 337 A--SPDRFYSVQNSRKLHDAFLRI 358 A + ++ +L F+R+ Sbjct: 189 ARQTNGTSRQADSADELSVMFMRV 212 >gi|159041653|ref|YP_001540905.1| hypothetical protein Cmaq_1087 [Caldivirga maquilingensis IC-167] gi|157920488|gb|ABW01915.1| conserved hypothetical protein [Caldivirga maquilingensis IC-167] Length = 448 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 50/162 (30%), Gaps = 16/162 (9%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P + L S K + +V+D S SM + + Sbjct: 250 PTNRALAKFAKPIYAYKLATGSLTVKERKMMRKPKIYLVIDKSGSMFYTVMNNIFEYSSV 309 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKS 255 ++ I + + VV F +I Q L + I + + L+ T Sbjct: 310 SKITWAAALAIVMVMKGHEVV---ARFFDQQIYQ---LMTNKKDIIKTLLSLVPLGGTNI 363 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 T + AY H +YK ++ +TDGE+ Sbjct: 364 TSAIRVAY--------DDAHRNPALRNYK--LVLITDGEDDE 395 >gi|56751834|ref|YP_172535.1| magnesium-chelatase subunit ChlD [Synechococcus elongatus PCC 6301] gi|81301083|ref|YP_401291.1| protoporphyrin IX magnesium-chelatase [Synechococcus elongatus PCC 7942] gi|93141241|sp|O07345|CHLD_SYNE7 RecName: Full=Magnesium-chelatase subunit ChlD; Short=Mg-chelatase subunit D; AltName: Full=Mg-protoporphyrin IX chelatase gi|56686793|dbj|BAD80015.1| magnesium-chelatase subunit ChlD [Synechococcus elongatus PCC 6301] gi|81169964|gb|ABB58304.1| protoporphyrin IX magnesium-chelatase [Synechococcus elongatus PCC 7942] Length = 677 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 65/210 (30%), Gaps = 37/210 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ ++D S SM ++++ A ++ +L N + L+ F Sbjct: 477 KAGALVIFLVDASGSMA------LNRMQSAKGAVIRLLTEA-----YENRDQVALIPFRG 525 Query: 226 K-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + ++++ ++ G + GL A +A + D + Sbjct: 526 EQAEVLLPPTRSITAARKRLEKMPCGGGSPLAHGLTQAVRVGTNAAQ-------SGDIGQ 578 Query: 285 KYIIFLTDG--------------ENSSPNIDNKESLFYCNEAKRRGAIVYAI----GVQA 326 I+ +TDG E +E L + + I Sbjct: 579 VVIVAITDGRGNIPLARSLGQPMEEGEKPDLKEELLDIAKRIRGLSMQLLVIDTERKFVG 638 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDAFL 356 + L N A ++ + S + A Sbjct: 639 AGFGKELANAAGGQYYHLPKVSDQAIAAMA 668 >gi|126174973|ref|YP_001051122.1| TPR repeat-containing protein [Shewanella baltica OS155] gi|125998178|gb|ABN62253.1| Tetratricopeptide TPR_2 repeat protein [Shewanella baltica OS155] Length = 690 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 57/178 (32%), Gaps = 28/178 (15%) Query: 136 PFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P + S+ + + + +V+D+S+SM ++L Sbjct: 57 PLHLLAFTWLIATFALAGPAVNKQSLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 A ++L +K +GLV F+ PL + + ++ Sbjct: 113 QAKFRATDLLRNLKEGE-------TGLVAFAGDAFTISPLTRDTGTLLNLLPTLSPEIMP 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + GL A + II +TDG ++ D +L Sbjct: 166 VRGSNLAAGLTQAKTLLAQGGHIRGD-----------IIVMTDGITAAQFDDANSALS 212 >gi|149773089|emb|CAO01894.1| collagen type VI alpha 6 [Mus musculus] Length = 226 Score = 45.6 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 77/199 (38%), Gaps = 21/199 (10%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 DV ++ G+ + I +M I S+P N R L +S + F L Sbjct: 26 DVVFLVDSSDHLGLKSFPLVKTFIHKM---ISSLPIEANKYRVALAQYSDALHNEFQLGT 82 Query: 236 --GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + + G + K L+ A+ F A + + ++ L Sbjct: 83 FKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTN----GRDKKQFPPILVVL- 137 Query: 292 DGENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++++ + +A + G + ++GVQ +A+++ LK A+ ++++ +R Sbjct: 138 ------ASAESEDDVEEAAKALREDGVKIISVGVQ-KASEENLKAMATSQFHFNLRTARD 190 Query: 351 LHDAFLRIGKEMVKQRILY 369 L F E++K Y Sbjct: 191 L-SVFAPNMTEIIKDVTQY 208 >gi|302872519|ref|YP_003841155.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor obsidiansis OB47] gi|302575378|gb|ADL43169.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor obsidiansis OB47] Length = 1789 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 65/180 (36%), Gaps = 21/180 (11%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + + S +S D++ V+D + SM+ D++ ++I +D +K Sbjct: 824 KKYLDITGLKSGTVSPSGQA--DIVFVIDTTGSMS-------DEIDAVKQNINNFVDKLK 874 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + + V GLVT+ G + + K G + + Sbjct: 875 T---KDISVNLGLVTYKDITCDGPNSTVGHGFFSSADDFKNALGSIKVDGGGDTP-ETLI 930 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE----AKRRGAIVYAIG 323 D E + ++ K+I+ LTD ++ ++N+ + +E K IV + Sbjct: 931 DTLE-TARLLGFRENSTKFIVVLTD---ANYKLENRFGIKSADEIIERLKSDNIIVSVVS 986 >gi|291243650|ref|XP_002741714.1| PREDICTED: integrator complex subunit 6-like [Saccoglossus kowalevskii] Length = 911 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 49/143 (34%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L VA ++ + I+S + R L ++ T Sbjct: 4 LVFLVDTSASMNQRTYVGTTLLDVAKGAVETFM-KIRSRDPASRGDRYMLASY-DDPPAT 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L S + P L+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAISLSGLGPALKQTLDLLNVNRLHSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 II LTDG S + +E L Sbjct: 122 PAMIITLTDGNKLSSSTGTQEEL 144 >gi|221115448|ref|XP_002154505.1| PREDICTED: similar to polydom [Hydra magnipapillata] Length = 2514 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 62/189 (32%), Gaps = 35/189 (18%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + D++ ++DVS S++D D + +L I P R +VTF Sbjct: 162 NSKADIIFLIDVSGSISD------DGFNTEREFVSSLLSKISVQPSAA---RIAVVTFGR 212 Query: 226 KIVQTFP------LAWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 I + L E+ R+ T L+ A + A EK Sbjct: 213 DINKDIDYIDYGYLDKNKCTFNEEFKRVKHRKEGWTNINGALQKAKALLDSANEKKFKRH 272 Query: 278 KGHDDYKKYIIFLTDGENSSPN------IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + + LTDG + + + Y + +++IGV + Sbjct: 273 NVNTVA----VLLTDGGWNYGGSPYDTATNLRTGFHYVD--------IFSIGVGHWLDRK 320 Query: 332 FLKNCASPD 340 LKN A + Sbjct: 321 QLKNIAGKE 329 >gi|219886181|gb|ACL53465.1| unknown [Zea mays] Length = 561 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 54/162 (33%), Gaps = 23/162 (14%) Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPL----AWGVQHI 240 +D L A + + L+ + R +V FS + + L A G ++ Sbjct: 16 SRLDLLKTAAKFMVAKLE---------DGDRLSIVAFSDRPVRELSSGLLYMTADGRRNA 66 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPN 299 +++L T P E A + + +I+ LTDG E++S + Sbjct: 67 IRSLDQLEARGGTALVPAFEEAVKVLDGRQGDGGDRLG-------FIVLLTDGAEDASGS 119 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 E R V+A G+ + L A R Sbjct: 120 FTLSERRREVIRGALRKYPVHAFGLGTAHGPEVLLYLAQESR 161 >gi|115666324|ref|XP_798926.2| PREDICTED: similar to LOC594926 protein [Strongylocentrotus purpuratus] gi|115975269|ref|XP_001180487.1| PREDICTED: similar to LOC594926 protein [Strongylocentrotus purpuratus] Length = 870 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 67/199 (33%), Gaps = 20/199 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP--DVNNVVRSGLVTFSSKIV 228 ++ VLD S SM +G + + A +I + ++ ++ V SG ++S + Sbjct: 329 VVFVLDFSASM---YGNKIKQTKEAMYTILDEMNDSDRFNVLPFSDYVYSG---WNSGQM 382 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + ++ ++ I +L T L A E + + I+ Sbjct: 383 VDVNP-YNIRDAKDFIRQLDIQRGTN----LNDALLGGLSLLESTGSMNSTSSNPMVCIL 437 Query: 289 F-LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR------ 341 F LTDG+ S E A + + +G FL A +R Sbjct: 438 FVLTDGKPSEGVTSLSEIERNVRNANNQRCSIVTLGFGRLVNYNFLVRLALQNRGMARKI 497 Query: 342 FYSVQNSRKLHDAFLRIGK 360 + + +L + + Sbjct: 498 YEDSSAAGQLRGVYSEVAT 516 >gi|116251677|ref|YP_767515.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115256325|emb|CAK07406.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 517 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 66/203 (32%), Gaps = 38/203 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D+S +M D ++ A I++++D R+ +V ++ Sbjct: 94 LVIAVDLSQTM-DAIDVTPSRIERAKLKIKDLIDT-------RQGARTAIVAYAGTAHLV 145 Query: 231 FPLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P ++ + RL+ + L A + K Sbjct: 146 LPPTEDAALLESYSDALATRLMPRPGKDTAAALSLADKLLEKEKASGT------------ 193 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-----ADQFLKNCASPDR 341 I+ LTDG + + + + G I+ +G A F + AS R Sbjct: 194 ILLLTDGVEDAASKAIESN--------SNGIIILGVGTSAGGPVKTHDGGFASD-ASGSR 244 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 +S + L + G ++ Sbjct: 245 LFSKLDLAGLEETEKETGADVAT 267 >gi|254786434|ref|YP_003073863.1| hypothetical protein TERTU_2434 [Teredinibacter turnerae T7901] gi|237685047|gb|ACR12311.1| TPR domain protein [Teredinibacter turnerae T7901] Length = 655 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 52/166 (31%), Gaps = 28/166 (16%) Query: 134 EMPFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 +P W ++ P + +S +++ D+S SM Sbjct: 61 RLPVYGLLIAWLLATFAAAGPAWQKLPQPVVKESSA---LVIAWDLSPSMAAQDVKP--- 114 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRL 247 R+ +++D++K + + L+ +S + PL I + + Sbjct: 115 -SRLVRARLKLIDLLKQRKEGL----TALIAYSGEAHVVTPLTDDTDTIISLLYGLDPAI 169 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + + L A + + I+FLTDG Sbjct: 170 MPAKGSNAEMALTLANQLLKEGANGRGD-----------IVFLTDG 204 >gi|125973851|ref|YP_001037761.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|256004304|ref|ZP_05429286.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281418012|ref|ZP_06249032.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|125714076|gb|ABN52568.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|255991738|gb|EEU01838.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281409414|gb|EFB39672.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|316939952|gb|ADU73986.1| hypothetical protein Clo1313_0918 [Clostridium thermocellum DSM 1313] Length = 233 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 70/195 (35%), Gaps = 21/195 (10%) Query: 2 SFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT 61 S+ N + + KGS ++ AI++P + + M +I S + K+ + I D + Sbjct: 6 SWRNCKGLVRDRKGSFTVEAAIIIPAVILTMFALILVSEFLYQKSCIQAIADRTA-QRGA 64 Query: 62 KILNQENGNNGKKQKNDFSYRIIKNIWQ----------TDFRNELRENGFAQDINNIERS 111 +I N + + Q + I W+ + + E D + Sbjct: 65 EIWNSPSKDMIYGQITLDNMDDIDLYWRIWEMSKRKKQKEEKIEKYAGYLLSDSPILGEP 124 Query: 112 TSLSIIIDDQH-----KDYNLSAVSRYEMPFIFCTFPWCANSS-----HAPLLITSSVKI 161 L I K +S ++Y+ PF + + H+ +I V+ Sbjct: 125 IELEIEAGIVEDYIVYKKLRVSVKAKYKNPFSSLLRVFGIGKTITIKAHSDAVINEPVEF 184 Query: 162 SSKSDIGLDMMMVLD 176 +D +D++ +D Sbjct: 185 IRTTDFVIDVVKEVD 199 >gi|294828490|ref|NP_714387.2| hypothetical protein LA_4207 [Leptospira interrogans serovar Lai str. 56601] gi|293386343|gb|AAN51405.2| hypothetical protein LA_4207 [Leptospira interrogans serovar Lai str. 56601] Length = 344 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 85/225 (37%), Gaps = 42/225 (18%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 +SS++ + + +D S SMN++ G K+ +A + + I ++P Sbjct: 21 TNVSSENQTPSQLFI-IDASGSMNEYLGIYQ-KIHLAKKHVSHY---ISTLPQETE---I 72 Query: 219 GLVTF--------SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 G + + SS++ Q + + ++ L T + A I K Sbjct: 73 GFLAYGNRLPGCSSSRLYQPLEMG-NRDTFKNRLFSLTPSGATPLAESIRIAGTLISQRK 131 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEA 328 ++ E II +TDG S D K+ L K++G + +G+ ++ Sbjct: 132 KETE------------IILVTDGIESCYG-DPKKEL---QALKQKGIPFQFHVLGLGLKS 175 Query: 329 ADQFLKNCAS---PDRFYSVQNSRKLHDAFLRI----GKEMVKQR 366 ++ + +++S+++ + A + K+ ++ Sbjct: 176 HEELQMKILTEEGNGKYFSIEDDSSFYTALDSLRNSPAKKTSQKS 220 >gi|163754946|ref|ZP_02162067.1| hypothetical protein KAOT1_02992 [Kordia algicida OT-1] gi|161325013|gb|EDP96341.1| hypothetical protein KAOT1_02992 [Kordia algicida OT-1] Length = 593 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 74/222 (33%), Gaps = 22/222 (9%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 PW ++ + + +I + + ++DVS SM +KL + + Sbjct: 205 LGQTPWNKDTRLLKIALKGK-EIPLEEIPPSNFTFLIDVSGSM-----FAQNKLPLLKSA 258 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 M++ ++ V VV +G +V Q I + +N L G +T G Sbjct: 259 FTLMVNKMRPEDKVAIVVYAGAAG----MVLEPTSGNNKQKILDALNNLRAGGSTAGGAG 314 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 +E AY + K + +I TDG+ + + + G Sbjct: 315 IELAYKTATENFIKNGNNR---------VIMATDGDFNVGATSETAMETLIEQKRETGVY 365 Query: 319 VYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 + +G D L+ A V ++ H F + Sbjct: 366 LSVLGFGMGNYQDDRLELLADKGNGNHAYVDTMQEAHKIFGK 407 >gi|52787597|ref|YP_093426.1| YwmD [Bacillus licheniformis ATCC 14580] gi|319648079|ref|ZP_08002296.1| YwmD protein [Bacillus sp. BT1B_CT2] gi|52350099|gb|AAU42733.1| YwmD [Bacillus licheniformis ATCC 14580] gi|317389714|gb|EFV70524.1| YwmD protein [Bacillus sp. BT1B_CT2] Length = 223 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 70/211 (33%), Gaps = 22/211 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 S +++ +VLD S SM G K +A S+ + D++ S +V V F Sbjct: 25 NSKKDVNVAVVLDASGSMAQKVE-GERKFDIAKESVTDFADLLSSDANVMLNV------F 77 Query: 224 SS---KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE-YAYNKIFDAKEKLEHIAKG 279 ++ G + F S + G++ ++ I + Sbjct: 78 GHKGNNKNSGKEVSCGTTETVYDLQPFSFNSFENALSGIKPTGWSPIAKSLYD-VKDDLA 136 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI----VYAIGVQAEAADQFLKN 335 D K Y+ +TDGE + + + + ++ + + V+ LK Sbjct: 137 DKDGKNYVYIVTDGEETCGG----DPVQAAKDLRKSNIKTIVNIVGLDVKTVKEKAKLKK 192 Query: 336 C--ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A + ++ + G ++ + Sbjct: 193 VADAGGGKLIEADSASDFKKVWKEEGVKLSQ 223 >gi|326916561|ref|XP_003204575.1| PREDICTED: matrilin-3-like [Meleagris gallopavo] Length = 363 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 61/202 (30%), Gaps = 68/202 (33%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ + LD++ ++D S S+ + + EM+D + R ++ Sbjct: 3 TACKNRPLDLVFIIDSSRSVRPE------EFEKVKIFLSEMIDTLDV---GERTTRVAVM 53 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++S + FPL Sbjct: 54 NYASTVKVEFPLR----------------------------------------------T 67 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--- 338 + K +I +TDG + A+ G +YA+GV A Q L+ AS Sbjct: 68 YFDKVVIVVTDGRPQD------QVENVAANARTAGIEIYAVGVG-RADMQSLRIMASEPL 120 Query: 339 PDRFYSVQN---SRKLHDAFLR 357 + + V+ KL F Sbjct: 121 DEHVFYVETYGVIEKLTAKFRE 142 >gi|218780334|ref|YP_002431652.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] gi|218761718|gb|ACL04184.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] Length = 387 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 62/187 (33%), Gaps = 21/187 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-GLVTFSSKIV 228 ++LD S SM +G G+ K A + + M + ++ D+ +R+ GL S Sbjct: 52 SFYVILDGSESMAGSYG-GLKKFTCAKKIVLFMNQAVANL-DLKAGLRTYGLNAVPSAPQ 109 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + E N ++ + A + +I Sbjct: 110 TDLQYGFATYSSTEFANAVMPLWGPNGPSPMGLAITEADKDLCSTSGRTA--------LI 161 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRR---GAIVYAIGVQAEA-ADQFLKNCASPDRFYS 344 L+DG S +++ + K R + I V A+A + L A + Sbjct: 162 ILSDGHPSD------DAVKAARKLKDRLGGKLCIATIAVGADASGKEILSGIAKIGGCDA 215 Query: 345 VQNSRKL 351 +++ L Sbjct: 216 AFDAKNL 222 >gi|67924730|ref|ZP_00518134.1| Magnesium chelatase, ChlI subunit [Crocosphaera watsonii WH 8501] gi|67853417|gb|EAM48772.1| Magnesium chelatase, ChlI subunit [Crocosphaera watsonii WH 8501] Length = 682 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 68/219 (31%), Gaps = 45/219 (20%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 481 ARKAGALIVFVVDASGSMA------LNRMQSAKGAVMRLLTEA-----YENRDQVSLIPF 529 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + ++++ L G + GL A + +AK D Sbjct: 530 RGEQADVLLPPTRSISLARKRLETLPCGGGSPLAHGLTQAVHVGMNAK-------MSGDI 582 Query: 283 YKKYIIFLTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI-----G 323 + I+ +TDG + P +E L + + G + I Sbjct: 583 GQVVIVAITDGRGNIPLAKSLGEPIPEGEKPDIKQELLDIAGKIRALGIKLLMIDTEKKF 642 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 V + + + +Y + + I + Sbjct: 643 VSTGFGKEMAQT--AGGTYYQLPRATD-----QAIAQMA 674 >gi|238618655|ref|YP_002913480.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|238379724|gb|ACR40812.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 452 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 57/168 (33%), Gaps = 26/168 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-GLVTFSSKIVQ 229 + ++LD S SM+ + +A S + + + +R + + VQ Sbjct: 290 IYLLLDKSGSMDGEKILWAKAVALALYSRAKR-------ENRDFYLRFFDNIPYPLIKVQ 342 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + + + E I ++ G T + + A I + KG + II Sbjct: 343 KNAKSKDIIKMVEYIGKIRGGGGTDISRSIISACEDIKEGH------VKGVSE----IIL 392 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 LTDGE+ + SL N I V + L+ + Sbjct: 393 LTDGEDKIAETTVRRSLKEANSQ--------LISVMIRGDNADLRRVS 432 >gi|115345667|ref|YP_771848.1| von Willebrand factor type A domain-containing protein [Roseobacter denitrificans OCh 114] gi|115292988|gb|ABI93440.1| von Willebrand factor type A domain protein, putative [Roseobacter denitrificans OCh 114] Length = 349 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 50/148 (33%), Gaps = 19/148 (12%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T S GLD+++VLD S SM D G + + +++ + + Sbjct: 60 TGVASGVGSSGDGLDLVLVLDSSGSMAD-INNGKSRQQWQRDAAIALVNSLPAA-----T 113 Query: 216 VRSGLVTFSSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 +V F S L + I IN + T G+ A ++ A Sbjct: 114 TSVSIVEFDSNANVVTGLTSLTPASNIPAIIAAINGVNASGGTNIASGIAAAAGELTGAN 173 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSP 298 K ++ ++DG+ ++ Sbjct: 174 ATTGR--------SKQMVVISDGDPTAG 193 >gi|118357550|ref|XP_001012024.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293791|gb|EAR91779.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 853 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 58/167 (34%), Gaps = 29/167 (17%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +LD S SM+ + A ++ L + I NVV G Sbjct: 319 LLDRSGSMSGQ------SIKQACEALVLFLQSL-PIDSYFNVVSFGSSFEKLFPSSQKYN 371 Query: 234 AWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + ++ + IN+ T+ L +N+ K + Y K I LTD Sbjct: 372 SQNLEQAVQIINQYSANLGGTEIYQPLHNVFNE------------KKIEGYNKQIFLLTD 419 Query: 293 GENSSPNIDNKESLFYCNEAKRRG--AIVYAIGVQAEAADQFLKNCA 337 G+ +P N K+ + +++IG +A Q ++ A Sbjct: 420 GQVDNPK-------QVVNLIKKNNKFSRIHSIGFGNDADKQLIQETA 459 >gi|120553803|ref|YP_958154.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Marinobacter aquaeolei VT8] gi|120323652|gb|ABM17967.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter aquaeolei VT8] Length = 1056 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 60/217 (27%), Gaps = 45/217 (20%) Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT--------FPLAWGVQHIQEKI 244 V + M D+ K + D V GL+ F+ + + K+ Sbjct: 176 TVRKTRMEIMQDVAKRLADTVTGVNIGLMAFNQSQNGEGGRVLNNVSNVKDNASAFKAKV 235 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD--YKKY------------IIFL 290 + L T + L A K L+ Y +I L Sbjct: 236 DGLYPSGQTPLSETLFGAMRYFQGGKPFLDRNPVSGTVDGSNNYKSPIELECQANNVILL 295 Query: 291 TDG----ENSSPNIDNKESLFYCN----------------EAKRRG---AIVYAIGVQAE 327 TDG + + + C+ A G Y +G + Sbjct: 296 TDGAPTSDTNHNSFIGSAIGKTCSGNCLDEIAGYMATNDMSAAFSGDQTIKTYTVGFSID 355 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A +Y N+++L DAF I + ++ Sbjct: 356 DPLLGAAATAGGGEYYVANNAQQLADAFDDILRSVMD 392 >gi|332216199|ref|XP_003257232.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 3 [Nomascus leucogenys] gi|332216201|ref|XP_003257233.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 4 [Nomascus leucogenys] Length = 623 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 69/202 (34%), Gaps = 36/202 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V+D+S SM K+ ++ ++L +P + LV F +++ Sbjct: 5 VVFVIDISGSMRGQ------KVKQTKEALLKIL---GDMPPGDYFD---LVLFGTRVQ-- 50 Query: 231 FPLAW----------GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +W ++ Q+ + T GL + +E L ++ Sbjct: 51 ---SWKGSLVQASEANLRAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHA 107 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 +I LTDG+ + D + L A R +Y +G FL+ + + Sbjct: 108 A----ILIMLTDGDPTEGVTDRSQILKNIRSAIRGRFPLYNLGFGHNVDFNFLEVMSMEN 163 Query: 341 R-----FYSVQNSRKLHDAFLR 357 Y +++ + F Sbjct: 164 NGRAQRIYEDRDATQQLQGFYS 185 >gi|254497958|ref|ZP_05110722.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] gi|254352852|gb|EET11623.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] Length = 607 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 71/201 (35%), Gaps = 38/201 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++DVS SM + A + +++ + + TFS+ + Q Sbjct: 34 IHLLIDVSGSM--KHTDPQNLRVTAVKLFNYLVNNRAVVS---------VSTFSNDLEQI 82 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P + +++K G+ T L + K KK Sbjct: 83 IPPQIVTAKFQESFLKKKKQIKSDGAWTNIDAALN--------------GVNKSWSKNKK 128 Query: 286 YIIFLTDGENSSPNID-NKESLFYCNE-----AKRRGAIVYAIGVQAEAADQFLKNCASP 339 II LTDG + NK+S NE +R VY IG+ EA L N + Sbjct: 129 VIILLTDGMLDLGSDALNKKSTQQLNETTIPILQREHVQVYTIGLSNEADSTLLSNISLK 188 Query: 340 DR--FYSVQNSRKLHDAFLRI 358 F V +++ L +A I Sbjct: 189 TNALFQPVISAKDLDNALYAI 209 >gi|227826569|ref|YP_002828348.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|227829211|ref|YP_002830990.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|229577980|ref|YP_002836378.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|229583733|ref|YP_002842234.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|284996566|ref|YP_003418333.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|227455658|gb|ACP34345.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] gi|227458364|gb|ACP37050.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228008694|gb|ACP44456.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|228018782|gb|ACP54189.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|284444461|gb|ADB85963.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|323473648|gb|ADX84254.1| von Willebrand factor type A domain protein [Sulfolobus islandicus REY15A] gi|323476297|gb|ADX81535.1| von Willebrand factor type A domain protein [Sulfolobus islandicus HVE10/4] Length = 452 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 57/168 (33%), Gaps = 26/168 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-GLVTFSSKIVQ 229 + ++LD S SM+ + +A S + + + +R + + VQ Sbjct: 290 IYLLLDKSGSMDGEKILWAKAVALALYSRAKR-------ENRDFYLRFFDNIPYPLIKVQ 342 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + + + E I ++ G T + + A I + KG + II Sbjct: 343 KNAKSKDIIKMVEYIGKIRGGGGTDISRSIISACEDIKEGH------VKGVSE----IIL 392 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 LTDGE+ + SL N I V + L+ + Sbjct: 393 LTDGEDKIAETTVRRSLKEANSQ--------LISVMIRGDNADLRRVS 432 >gi|31873374|emb|CAD97678.1| hypothetical protein [Homo sapiens] Length = 974 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 291 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 338 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 339 IEPCFKGILVQADRDNREHFKLLVEELMVNGVGVVDQALREAFQILKQFQE-----AKQG 393 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 394 SLCNQAIMLISDG 406 >gi|160889561|ref|ZP_02070564.1| hypothetical protein BACUNI_01985 [Bacteroides uniformis ATCC 8492] gi|270296685|ref|ZP_06202884.1| conserved hypothetical protein [Bacteroides sp. D20] gi|317480053|ref|ZP_07939165.1| hypothetical protein HMPREF1007_02282 [Bacteroides sp. 4_1_36] gi|156861078|gb|EDO54509.1| hypothetical protein BACUNI_01985 [Bacteroides uniformis ATCC 8492] gi|270272672|gb|EFA18535.1| conserved hypothetical protein [Bacteroides sp. D20] gi|316903795|gb|EFV25637.1| hypothetical protein HMPREF1007_02282 [Bacteroides sp. 4_1_36] Length = 289 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 45/108 (41%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEF------GTVKQMKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T GLEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFHAESRRTNIRLGLEYLTNVMK 173 >gi|148253705|ref|YP_001238290.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] gi|146405878|gb|ABQ34384.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] Length = 409 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 57/159 (35%), Gaps = 2/159 (1%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G IS++ AI I + +G ++ + VKA+L +D ++L + N Sbjct: 15 RRFGWASDGQISMIFAIASIPILVSVGAAVDFAKSSDVKAQLQKSIDAAVLAGVVR-PND 73 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + + F + + +++ S + + Sbjct: 74 QQISTAAAVFSGAYRGRFDTAATASFASNTDGSLTGTATTSVKTSFLNVMGTSALGVTAS 133 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS 165 +A + + C + + LL+ S ++++ S Sbjct: 134 ATAKAGAQAQSSVCIL-LVSTVNAQSLLVNSGAQLNAPS 171 >gi|82593966|ref|XP_725227.1| ookinete protein [Plasmodium yoelii yoelii str. 17XNL] gi|23480151|gb|EAA16792.1| ookinete protein-related [Plasmodium yoelii yoelii] gi|110681630|dbj|BAE98270.1| circumsporozoite protein/thrombospondin-related anonymous protein-related protein [Plasmodium yoelii] Length = 2401 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 37/179 (20%), Positives = 65/179 (36%), Gaps = 31/179 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS----S 225 D+ ++LD S S+ +++ T L+I + + V G++ F+ Sbjct: 816 DLTVILDESGSIGS--YNWKNQVYPFTEQFINNLEISE------DKVHVGIMLFAQFNRD 867 Query: 226 KIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++ + ++ +H+ + I L G T L Y E H Sbjct: 868 FVMFSDKESYDKEHMMKLIKGLKDSYKSGGYTYIIEALNYGL-------ENYTHHKDSRS 920 Query: 282 DYKKYIIFLTDGENSSPNID--NKESLFYCNEAKRRGAIVYAIGVQAE--AADQFLKNC 336 D K + TDG N++P + SL Y K + +GV A A + L C Sbjct: 921 DVPKVTMLFTDGNNTNPGDKLLSDASLLY----KEENVKLLVVGVGASTMANLRLLAGC 975 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 69/205 (33%), Gaps = 25/205 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS----S 225 D+ ++LD S S + K + I+K + N + G++ F+ Sbjct: 65 DLTLILDESGS--------IKKKHWVKYVVPFTEQIVKGLKVGENDIHVGILLFALKNRD 116 Query: 226 KIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 I + + + +K+N L GS T L+Y+ NK +K D Sbjct: 117 YITFDNDIRYKKTELLKKVNDLNDDYRSGSDTYILEALKYSMNKYSMSKNAR-------D 169 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 D K I TDG + + E +E + + + +GV A + + Sbjct: 170 DAPKVTILFTDG--NDRHASKSEFHKMYSEYQEKHIKLLVLGVSAAEEKKLKVIAGCEEH 227 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQR 366 + + I ++ + Sbjct: 228 SSCPSAMKAEWETINNITNKLTNKI 252 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 60/173 (34%), Gaps = 29/173 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ + ++++ + + V G++ FS IV Sbjct: 291 DLTLILDDSGSITLNKWE-----KDVIPFSEKLINNLNI---GKDNVHVGIMRFSKDIVI 342 Query: 230 TFPLAWGVQHIQEKI--------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + ++I+ ++ + +GS T L+Y+ Sbjct: 343 DVDYSQDTRYIKNELANIVEGLSKKYRYGSRTDIVDALDYSLKNFT-------RHPNSRT 395 Query: 282 DYKKYIIFLTDGENSSPNIDNKE--SLFYCNEAKRRGAIVYAIGVQAEAADQF 332 D K I TDG ++S + + + Y + + +GV ++ Sbjct: 396 DAPKVTILFTDGNDTSKTLAEERNMGILY----RSEQIRLILVGVGQASSIDL 444 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 62/174 (35%), Gaps = 20/174 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ + ++ + A DII + + V G++ FS + Sbjct: 595 DVTLILDESASIGESRWV-LEVIPFAK-------DIINHLNIDYDSVHVGVLLFSHYALD 646 Query: 230 TFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P + + + +KI+ L T G E K G + K Sbjct: 647 LVPFSDEARYNKISLLKKIDSLK----TNYGNGYESFIVKTLKYALYNYIKDSGRSNAPK 702 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCA 337 + TDG + + + + + + + IGV + ++ L CA Sbjct: 703 ITMLFTDG--NDSSESDMDMYNIGSLYRTERVKLLVIGVSMASENKLKQLVGCA 754 >gi|311070191|ref|YP_003975114.1| hypothetical protein BATR1942_16320 [Bacillus atrophaeus 1942] gi|310870708|gb|ADP34183.1| hypothetical protein BATR1942_16320 [Bacillus atrophaeus 1942] Length = 226 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 41/225 (18%), Positives = 80/225 (35%), Gaps = 17/225 (7%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS 208 + + S + K ++ ++ D S SM G G K+ VA +S++ +++ Sbjct: 11 GLLTVSLASPSFAAEKKTPDTNVAVLFDGSGSMVQKTG-GERKIDVAKKSVKSFAELL-- 67 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE----YAYN 264 D N ++R F + E I L + L + Sbjct: 68 PEDTNLMLRV----FGHEGNNKLSGKALSCSTTETIYGLHPYEGSLFNNALSQINPTGWT 123 Query: 265 KIFDA-KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 I A + E K + + K + +TDGE + D + + EA V IG Sbjct: 124 PIAKALSDTREEFQKVNAEGKNVVYLITDGEETCGG-DPAKEIEKLREANVDTI-VNIIG 181 Query: 324 VQAE-AADQFLKN--CASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + +Q +K A + S +N+ + A+ ++ ++ Sbjct: 182 FNFDVKGNQKMKQAAVAGGGEYISAKNAEEFEQAWENEAQKFTEE 226 >gi|307176121|gb|EFN65819.1| Voltage-dependent calcium channel subunit alpha-2/delta-4 [Camponotus floridanus] Length = 953 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 28/129 (21%), Positives = 53/129 (41%), Gaps = 15/129 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-----FS 224 D+++++DVS SM GM K +A ++ +LD + + V + S T F Sbjct: 259 DVVILIDVSGSM-----TGMGK-TIAKTTVNSILDTLSNNDLVTLLKYSNETTELVPCFK 312 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 ++Q P + ++ ++ L T A+N + +E A Sbjct: 313 DMLIQATPE--NLDTFKKSLDTLKTEGPANLTDAFTKAFNLLKTYRESRGCDA--DTPCN 368 Query: 285 KYIIFLTDG 293 + I+ +TDG Sbjct: 369 QLIMVVTDG 377 >gi|255009409|ref|ZP_05281535.1| hypothetical protein Bfra3_09732 [Bacteroides fragilis 3_1_12] gi|313147168|ref|ZP_07809361.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313135935|gb|EFR53295.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 289 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLEF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T LEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKPESRRTNVRLALEYLTNVMK 173 >gi|223558080|gb|ACM91084.1| aerotolerance-related membrane protein [uncultured bacterium Rlip1] Length = 201 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 45/120 (37%), Gaps = 9/120 (7%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSM 181 +D + + S F F F + L S + G+D+ + +DVS SM Sbjct: 46 RDSLVPSYSSRRANFKFIVFLLMISCVILALANLQSGSKMEEVKREGIDLYIAVDVSNSM 105 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 N +L + ++I +++ +K R G++ F+ K P+ + Sbjct: 106 NAEDIVP-SRLERSKQAINKLISDMKG-------DRIGVIVFADKAFVQLPITTDYSAAK 157 >gi|82777418|ref|YP_403767.1| hypothetical protein SDY_2190 [Shigella dysenteriae Sd197] gi|309787854|ref|ZP_07682464.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1617] gi|81241566|gb|ABB62276.1| conserved hypothetical protein [Shigella dysenteriae Sd197] gi|308924253|gb|EFP69750.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1617] Length = 219 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRLWIFLITDGVPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|14030585|gb|AAK52967.1|AF367013_1 serum opacity factor VT2.2 [Streptococcus pyogenes] Length = 1013 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 66/169 (39%), Gaps = 13/169 (7%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + ++K++ K D G D+M +LDVS M ++F +++ ++ K + Sbjct: 203 TLDVTLKVTPKQIDEGADVMALLDVSKKMTKENFDKAKEQIKKMVTTLTGEPTDGKENHN 262 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDA 269 N VR L+TF K+ L +++ K+N L+ A +K + Sbjct: 263 RRNSVR--LMTFYRKVNDPIELT--TKNVDAKLNEFWEQAKKDWDWGVDLQGAIHKAREI 318 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + ++ K +++I+ + GE++ N S Sbjct: 319 FKNEKNSKK-----RQHIVLFSQGESTFSYDINANSKANLKAITEDKIT 362 >gi|329963579|ref|ZP_08301058.1| hypothetical protein HMPREF9446_02652 [Bacteroides fluxus YIT 12057] gi|328528568|gb|EGF55539.1| hypothetical protein HMPREF9446_02652 [Bacteroides fluxus YIT 12057] Length = 289 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 45/108 (41%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEF------GTIKQMKKDMATEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T GLEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFHAESRRTNIRLGLEYLTNVMK 173 >gi|223939328|ref|ZP_03631208.1| von Willebrand factor type A [bacterium Ellin514] gi|223892041|gb|EEF58522.1| von Willebrand factor type A [bacterium Ellin514] Length = 229 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 66/178 (37%), Gaps = 27/178 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++VLDVS SM G +D L + L + V + ++TF Sbjct: 22 CVLVLDVSSSMR---GAAIDFLNLGVDLFAHDLTRSRLACKR---VETAIITFGD----- 70 Query: 231 FPLAWGVQHIQEKIN-------RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 GV +Q+ ++ R G T + A + K K ++ A G + Sbjct: 71 -----GVHIVQDFVSPSAFVPPRFEAGGKTPMGEAVVQACELL--EKRKRKYRAAGVSYF 123 Query: 284 KKYIIFLTDGENSSP-NIDNKESLFYCNEAK-RRGAIVYAIGVQAEAADQFLKNCASP 339 + +I +TDGE + + ++++ + + + + + V + + C + Sbjct: 124 RPWIFLITDGEPTDYETANWRQAVEIVRAGEVDKKLMFFGVAVSDANQGKLNELCPAS 181 >gi|149922101|ref|ZP_01910541.1| hypothetical protein PPSIR1_23234 [Plesiocystis pacifica SIR-1] gi|149817038|gb|EDM76520.1| hypothetical protein PPSIR1_23234 [Plesiocystis pacifica SIR-1] Length = 350 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 76/219 (34%), Gaps = 22/219 (10%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 +L + ++ + ++++D+S S+ D D T + ++S Sbjct: 78 VSVLESQQTILNPEVAASHYTLLLIDMSGSVVDS-----DDATKVTEAALLFTSEVESNN 132 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYAYNKI 266 V G KI + + + N L +T + + Sbjct: 133 KVAIYAFDG----EEKIHKISDFTGSEGAAEARANSLAGFQPKDKSTNLNGAIVQGLEVL 188 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + EK ++ + ++ TDG + + + E L K V+AIG+ A Sbjct: 189 DEGLEKAKNPLRLGT-----LVVFTDGTDRASRVSEDEMLS---AVKDTPYEVFAIGLGA 240 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 E ++ L+ ++S ++ AF IG+ + K Sbjct: 241 ELSEGDLQKIGKSGTA-LAKDSGEIQTAFESIGERIEKL 278 >gi|260890922|ref|ZP_05902185.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] gi|260859475|gb|EEX73975.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] Length = 516 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 74/220 (33%), Gaps = 40/220 (18%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K + ++ +++ ++LD S SM G + +A SI K + ++ ++ Sbjct: 155 TKQKIEENMNVNLEIILDASGSMKQKIGDK-TMMEIAKESIE------KVVSEMPANTKA 207 Query: 219 GLVTFSSKIV--------------QTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 GL F K P+ ++ + + T +E Sbjct: 208 GLRVFGHKGDNTASKKQESCSANELISPIETLNKDKLKSSLAPIQPTGWTSIAKSIENGT 267 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY--A 321 N + K + YII TDG + + + + K + Sbjct: 268 NDLKALKGEKTLNIL-------YII--TDGIETC----DGNPVETAKKFKNENTDIVLGI 314 Query: 322 IGVQAEAAD-QFLKNC--ASPDRFYSVQNSRKLHDAFLRI 358 IG +A + LK A+ + SV ++ KL + RI Sbjct: 315 IGFNVDAHQNKVLKEIANAANGYYSSVNDAAKLTEELQRI 354 >gi|307324381|ref|ZP_07603589.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890112|gb|EFN21090.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 249 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/149 (20%), Positives = 53/149 (35%), Gaps = 29/149 (19%) Query: 170 DMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + +VLD S SM ++ + L T ++ LD +P +V FS++I Sbjct: 50 AVYLVLDRSGSMRPYYRDGSVQHLAEQTLALAANLDDDGIVP---------VVFFSTEIE 100 Query: 229 QTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 T ++ Q++IN L T ++ + H D Sbjct: 101 GTAEISLDAY--QDRINPLHDSMGHMGRTNYHVAMQAVID----------HYQSSGADDP 148 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAK 313 +++F TDG +S C AK Sbjct: 149 AFVVFQTDGSPTSKAAAEH---VLCTAAK 174 >gi|261880543|ref|ZP_06006970.1| von Willebrand factor [Prevotella bergensis DSM 17361] gi|270332766|gb|EFA43552.1| von Willebrand factor [Prevotella bergensis DSM 17361] Length = 289 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 22/102 (21%), Positives = 45/102 (44%), Gaps = 8/102 (7%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ M R + + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLDFGTNTQM------KRDMAAEIAATIAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLI--FGSTTKSTPGLEYAY 263 S +I + P G +HI I ++ + K+ G+ AY Sbjct: 126 SDQIEKYIPPKKGRKHILYIIREMLHFRPVSNKTDVGMATAY 167 >gi|198424466|ref|XP_002124191.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 73/202 (36%), Gaps = 11/202 (5%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML--DIIKSIPDVNNVVRSGLVTFSSK 226 LD++ V+D S S++D M ++ V E D + + S + Sbjct: 193 LDLVYVVDSSNSISDANFTIMKQIIVNASEAFEASIGDTTQVAVLQYGNLDSAAFDHTDS 252 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P G + + NR I + T + I A E +K D KK Sbjct: 253 KYYKSPTKLGDCNDIDCFNRAIKANMTHLNAANTFTSLAIRRAVEFDFAQSKNKDKAKKI 312 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA----SPDRF 342 ++ +TDG+ + + + VYAIGV A +D L+ A S +R Sbjct: 313 LVLITDGQANFQSQLIVSYRLT----QSHNITVYAIGV-ALKSDAELRISANGGVSKERV 367 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 N +L A + + + + Sbjct: 368 LDANNYSELSKALRNLTETIAQ 389 >gi|148710198|gb|EDL42144.1| RIKEN cDNA 6330505F04, isoform CRA_a [Mus musculus] gi|148710199|gb|EDL42145.1| RIKEN cDNA 6330505F04, isoform CRA_a [Mus musculus] Length = 898 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 47/143 (32%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 +I +TDG + +E L Sbjct: 122 PSILITITDGNKLTSTASVQEEL 144 >gi|148710200|gb|EDL42146.1| RIKEN cDNA 6330505F04, isoform CRA_b [Mus musculus] Length = 863 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 47/143 (32%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 6 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 63 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 64 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 123 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 +I +TDG + +E L Sbjct: 124 PSILITITDGNKLTSTASVQEEL 146 >gi|26327743|dbj|BAC27615.1| unnamed protein product [Mus musculus] gi|29881543|gb|AAH51161.1| Ddx26b protein [Mus musculus] Length = 444 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 47/143 (32%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 +I +TDG + +E L Sbjct: 122 PSILITITDGNKLTSTASVQEEL 144 >gi|27370148|ref|NP_766367.1| protein DDX26B [Mus musculus] gi|81896823|sp|Q8BND4|DX26B_MOUSE RecName: Full=Protein DDX26B gi|26350879|dbj|BAC39076.1| unnamed protein product [Mus musculus] gi|44890643|gb|AAH66792.1| Ddx26b protein [Mus musculus] Length = 861 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 47/143 (32%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 +I +TDG + +E L Sbjct: 122 PSILITITDGNKLTSTASVQEEL 144 >gi|325679008|ref|ZP_08158606.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109512|gb|EGC03730.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 270 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 73/205 (35%), Gaps = 15/205 (7%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 ++ S L + ++D S SM K+G + E++ I+ + + + V+ + Sbjct: 28 ATAVSKKSLVIFFLIDTSGSMKGK------KMGQLNTVMEELIPEIRRVGEADTDVKVAV 81 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +TF + + + ++ + RL T + A+ ++ + + Sbjct: 82 LTFDTDVKWMYSAPISIEEF--EWARLGAQGVT----SMGAAFTELAARMSRNSFLNSPS 135 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-KNCASP 339 + + +TDG S ++L N + G A+G+ EA D L + S Sbjct: 136 LSFAPVMFLMTDGYPSDDYKAGLKALQ-ANSWYKFGLKA-ALGIGDEANDDMLEEFTGSK 193 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 D +L I + Sbjct: 194 DTVVHAYTGGQLAAMIKIIAVTASQ 218 >gi|218886590|ref|YP_002435911.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757544|gb|ACL08443.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 2478 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 39/178 (21%), Positives = 66/178 (37%), Gaps = 29/178 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFG---PGMD-KLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + I +++++LDVS SMND PG D +L +A +I ++L + VN Sbjct: 1707 QGGITSNVVVMLDVSGSMNDDANGNAPGTDSRLTMAIDAINQLLHAYDDLGAVN----VK 1762 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKI----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 LV F G +Q+ + L G T + +++ Sbjct: 1763 LVWFDDSAQTHTGWLMGTTAVQQALTILEGNLNGGGATNYDAAINLVMSQLGTTGTPPAD 1822 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE-------AKRRGAIVYAIGVQA 326 + FL+DGE + P D E + + A +VYA+G+ Sbjct: 1823 KTVAY--------FLSDGEPNRP--DGSEGISGSEQTTWETFLAANNFDMVYALGIGT 1870 >gi|166367319|ref|YP_001659592.1| magnesium protoporphyrin IX chelatase subunit D [Microcystis aeruginosa NIES-843] gi|166089692|dbj|BAG04400.1| magnesium protoporphyrin IX chelatase subunit D [Microcystis aeruginosa NIES-843] Length = 665 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 69/213 (32%), Gaps = 37/213 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 465 ARKAGSLIVFVVDASGSMA------LNRMQSAKGAVMRLLTEA-----YENRDQIALIPF 513 Query: 224 SS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + ++++ L G + GL A + +A+ D Sbjct: 514 RGERADVLLPPTKSITLAKKRLETLPCGGGSPLAHGLTQAVHVGMNAR-------LSGDI 566 Query: 283 YKKYIIFLTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI----GV 324 + I+ +TDG ++ P E L + + G + I Sbjct: 567 GQVVIVAITDGRSNIPLAKSLGEPIPEGEKPDIKAELLDIAAKIRSLGIKLLVIDTEKKF 626 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + L A ++ + + + + R Sbjct: 627 VSTGFAKELALKAGGKYYHLPRATEQAIASMAR 659 >gi|159027822|emb|CAO87035.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 704 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 69/213 (32%), Gaps = 37/213 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 504 ARKAGSLIVFVVDASGSMA------LNRMQSAKGAVMRLLTEA-----YENRDQIALIPF 552 Query: 224 SS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + ++++ L G + GL A + +A+ D Sbjct: 553 RGERADVLLPPTKSITLAKKRLETLPCGGGSPLAHGLTQAVHVGMNAR-------LSGDI 605 Query: 283 YKKYIIFLTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI----GV 324 + I+ +TDG ++ P E L + + G + I Sbjct: 606 GQVVIVAITDGRSNIPLAKSLGEPIPEGEKPDIKAELLDIAAKIRSLGIKLLVIDTEKKF 665 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + L A ++ + + + + R Sbjct: 666 VSTGFAKELALKAGGKYYHLPRATEQAIASMAR 698 >gi|123509108|ref|XP_001329794.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121912842|gb|EAY17659.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 694 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 76/202 (37%), Gaps = 39/202 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-NVVRSGLVT---FSS 225 ++ +LD S SM +++ A +++ L ++ P V +VR G F Sbjct: 238 SIVFLLDCSGSMTID-----NRIENAIKAMDLFLHSLE--PGVKFEIVRFGSTFNSLFDF 290 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 K+ + + I T+ ++ YN++ Sbjct: 291 KLTEYNDDSLNTA--LAFIKGTSANLGGTEIFNPIKQIYNELSPD--------------- 333 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRF 342 + LTDG +DN +++ + + ++++G+ A A ++N AS Sbjct: 334 -VLFVLTDG-----AVDNSQAVL--DFVRDSSTKIFSLGLGAGADMNLVRNLASFTGGVS 385 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 V ++ +L D+ +R+ ++ Sbjct: 386 EHVLDASQLRDSIIRLLEDSTN 407 >gi|146303450|ref|YP_001190766.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145701700|gb|ABP94842.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 451 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 59/188 (31%), Gaps = 42/188 (22%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++LD S SM+ + +A S + + +R F I Sbjct: 289 IYLLLDKSGSMDGEKILWAKAVALALYSRARR-------ENRDFYLR-----FFDNI--P 334 Query: 231 FPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +PL + V + E I ++ G T + + A + I D + Sbjct: 335 YPLIKVIKNAKSKDVIKMVEYIGKIRGGGGTDISRSVMSACDDIKDGHVRGVSE------ 388 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +I LTDGE+ + SL N I V + LK D + Sbjct: 389 ----VIILTDGEDKIAETTVRRSLKEANAT--------LISVMIRGDNADLKRV--SDNY 434 Query: 343 YSVQNSRK 350 V + Sbjct: 435 LVVYRLDQ 442 >gi|88856118|ref|ZP_01130779.1| von Willebrand factor, type A [marine actinobacterium PHSC20C1] gi|88814686|gb|EAR24547.1| von Willebrand factor, type A [marine actinobacterium PHSC20C1] Length = 585 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/252 (15%), Positives = 82/252 (32%), Gaps = 25/252 (9%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + + P + +TS ++ + ++ ++ V+DVS S Sbjct: 332 NAEGFRDNVGGGEILASGVIAAPTATKNVGDTAELTSILRNWNALNLRSRILTVVDVSGS 391 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQH 239 M + G ++ + S + ++ G+ FS + V T W V Sbjct: 392 MLERAWDGQRRIEMFQNSAE------DAFSTLSEDDELGMWLFSKERVGTE--DWEDVAG 443 Query: 240 IQE-----KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY--IIFLTD 292 I+ + L S P Y + D + + +K ++ +TD Sbjct: 444 IRSMNDPDHVKNLQAI--IDSLPSRINGYTGLNDTVLAAVTHVREDYNSEKVNSVMLITD 501 Query: 293 GENSSPNIDNKESL-----FYCNEAKRRGAIVYAIGVQAEAADQFLKNC--ASPDRFYSV 345 G N N + + L + V +G+ + + ++ A+ Y Sbjct: 502 GRNEDDNGISLQKLLDKLTEMIQKDTDEPVPVVLVGIGPDTDVEAMRKIAQATGGTAYQA 561 Query: 346 QNSRKLHDAFLR 357 N +L + L Sbjct: 562 NNPTELSNVMLE 573 >gi|324114253|gb|EGC08225.1| von Willebrand protein type A [Escherichia fergusonii B253] Length = 219 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A N + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALNMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|323968208|gb|EGB63617.1| von Willebrand protein type A [Escherichia coli M863] gi|327253206|gb|EGE64860.1| von Willebrand factor type A domain protein [Escherichia coli STEC_7v] Length = 219 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A N + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALNMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|283779589|ref|YP_003370344.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438042|gb|ADB16484.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1740 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 43/270 (15%), Positives = 84/270 (31%), Gaps = 47/270 (17%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHFGPGMD--- 190 P + P +V I ++ VLD S +M D Sbjct: 1145 PLLLRAAGGKRIEFAKPTYGPPTVTIHGFRRRQASVVFVLDCSHTMKALDEVESPDGGVR 1204 Query: 191 ----KLGVATRSIREMLDIIKSIPDVNNVVR-----SGLVT-----------FSSKIVQT 230 ++ +A ++R ML + D VR G T ++ +I Sbjct: 1205 ATSQRMELAKGALRSMLTQLAERGDARVGVRFFGHRVGWSTVEANKLLRQNRYAGEIPPE 1264 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA--------YNKIFDAKEKLEHIAKGHDD 282 V++I + R + ++ Y I +A + Sbjct: 1265 LQPYADVENILP-LGRFDSVIAGRVFEAMKTVEPWGESPIYLSITEALRDFAQ---EDGE 1320 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG---AIVYAIGVQAE------AADQFL 333 +K I+ +TDG+N N ++ + + RG V+ +G E A+ +F Sbjct: 1321 SEKSIVVITDGKNYQFNAPSQLARTKDDVLAARGSRDIKVHIVGFNIEPSESEVASREFR 1380 Query: 334 KNC-ASPDRFYSVQNSRKLHDAFLRIGKEM 362 + A+ F ++ L + + + Sbjct: 1381 EIAEATGGEFLPATSAGSLVRSLESVLRAA 1410 >gi|194333615|ref|YP_002015475.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194311433|gb|ACF45828.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 5009 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 75/202 (37%), Gaps = 26/202 (12%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPD 211 + +T S + I ++M+++D S SM+ G PG +L + R+++D +++ D Sbjct: 4497 IPVTESATATP---IDTNIMLIMDTSGSMDWPSGIPGYTRLQATVAAARQLVDKYEALGD 4553 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDA 269 VR +V F + + ++ L+ G +T L A + D Sbjct: 4554 ----VRVNIVEFDTDGNKWTTGWVDGATADSRLTALLTQGGGSTNFDDALLTAMDAWDDT 4609 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGE-----------NSSPNIDNKESLFYCNEAKRRGAI 318 + + Y FL+DG+ + I +E ++ N + Sbjct: 4610 TGLTQIPDAQNVSY-----FLSDGDPTARTRWSSGWPNQNGIQTQEQNYWENWLESNEIT 4664 Query: 319 VYAIGVQAEAADQFLKNCASPD 340 YA G+ + + L A Sbjct: 4665 SYAFGLGTQVTEDNLDPIAYDG 4686 >gi|5732089|gb|AAD31491.3|AF138798_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 873 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 65/148 (43%), Gaps = 10/148 (6%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V ++ K D G D+M +LDVS M D F DK+ ++ D + + Sbjct: 208 TIDVTVTVTPKEIDEGADVMALLDVSKKMTDADFKNAKDKIKKLVTTLTSNSDNAEHKHN 267 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN---KIFD 268 N VR L+TF +I ++ ++++++ K+ ++ + I Sbjct: 268 SRNSVR--LMTFYREISDPIDIS---GKTDAELDKILNDLREKAKANYDWGVDLQGAIHK 322 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENS 296 A+E + + +++I+ + GE++ Sbjct: 323 AREIFKKDQEKKSGKRQHIVLFSQGEST 350 >gi|327283159|ref|XP_003226309.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1076 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 62/164 (37%), Gaps = 18/164 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L+M +LD S S +F K V ++ ++ + + R L+ + Sbjct: 49 NENCILEMAFLLDSSES-AKNFNHEQQKKFVLET--VNRMNGLQLSSNRHLSWRIALLQY 105 Query: 224 SSKIVQTFPLA-W-GVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 SS ++ W G + +I + + T +T + ++ +G Sbjct: 106 SSTVLIEQTFKDWKGPDAFKSRIAPITYIGHGTYTTYAI---------TNLTQLYMTEGT 156 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K I LTDG + N D + +AK +G + + +G+ Sbjct: 157 HGSVKVAILLTDGVDHPRNPDIFSATA---DAKHQGIVFFTVGM 197 >gi|325285570|ref|YP_004261360.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321024|gb|ADY28489.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 235 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 57/175 (32%), Gaps = 32/175 (18%) Query: 177 VSLSMNDHFGPGMDK-----LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT- 230 VS ++ G + K R+++ IK +PD + + +F++ + + Sbjct: 75 VSKAIGGKIGSILGKQVTKEATKLGAVKRKLIPAIKGLPDGKKFL---VFSFNNNVTKQA 131 Query: 231 --FPLAWGVQHIQE--KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F +A + L T + GL A + Sbjct: 132 TEFRVASNTTRTSSNIFVQNLKASGGTNTLEGLLEAL----------------STADVQE 175 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I+ ++DG PN K L + I++ I +A F++ A + Sbjct: 176 IVLMSDGL---PNSGPKAVLEEIKKVNTSNIIIHTIAFGEDADLDFMRTLAQENN 227 >gi|315452880|ref|YP_004073150.1| von Willebrand factor type A domain-containing protein [Helicobacter felis ATCC 49179] gi|315131932|emb|CBY82560.1| von Willebrand factor type A domain protein [Helicobacter felis ATCC 49179] Length = 224 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 63/170 (37%), Gaps = 14/170 (8%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + LD S SM+ G + ++ + + + P +VTF Sbjct: 15 NPTKRVPVCLCLDTSSSMS---GAPIGEVN---QGVDLFYQAVNDHPIAKQAADVCIVTF 68 Query: 224 SSKIVQTFPLAWGVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L + I++K G T + A +++ + K E+ G + Sbjct: 69 GD---SGVSLVQDFESIRDKSAPSFNAGGYTPMGAAVNEALDRL--EERKKEYKDSGTEY 123 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ +++ +TDGE + + S + A + ++ I V+ A+ Sbjct: 124 FQPWLVIITDGEPTD--DISSASRRSSDLANNKKLSIFPIIVEGGDANTL 171 >gi|313247647|emb|CBY15808.1| unnamed protein product [Oikopleura dioica] Length = 765 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 61/200 (30%), Gaps = 20/200 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + VLD S SMN G + +A +I L +++ + L+T + Sbjct: 4 LCFVLDTSASMNRRTAQGQRVIDLAKTAIDNFLKKVRAKEPSARTDQYLLLTTDREQTVR 63 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE-----HIAKGHDDYKK 285 +++ L + + A++ + + E + Sbjct: 64 VGPKDSPTIFWRELHNLQADDASDVGYAITTAFDILNMERFDHELDSIGRGRQPWSIRPS 123 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDRFYS 344 +I L+DG + + L + I V + + C + + Y Sbjct: 124 AVIVLSDGGMLTNKSLTRTDLELPDG----------ISVSIPS----VSLCEVTGGKSYQ 169 Query: 345 VQNSRKLHDAFLRIGKEMVK 364 V + + + I E+ Sbjct: 170 VAHPAAVMETLQAINGELQT 189 >gi|217972769|ref|YP_002357520.1| tetratricopeptide repeat-containing protein [Shewanella baltica OS223] gi|217497904|gb|ACK46097.1| Tetratricopeptide TPR_2 repeat protein [Shewanella baltica OS223] Length = 692 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 57/178 (32%), Gaps = 28/178 (15%) Query: 136 PFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P + S+ + + + +V+D+S+SM ++L Sbjct: 57 PLHLLAFTWLIATFALAGPAVNKQSLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 A ++L +K +GLV F+ PL + + ++ Sbjct: 113 QAKFRATDLLRNLKEGE-------TGLVAFAGDAFTISPLTRDTGTLLNLLPTLSPEIMP 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + GL A + II +TDG ++ D +L Sbjct: 166 VRGSNLAAGLTQAKTLLAQGGHIRGD-----------IIVMTDGITAAQFDDANSALS 212 >gi|313230976|emb|CBY18974.1| unnamed protein product [Oikopleura dioica] Length = 522 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 61/207 (29%), Gaps = 38/207 (18%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV------RSG 219 + LD+ +V+D S S DK +++L + V + R G Sbjct: 278 NERLDISVVIDTSGSQVG------DKQSAVKDFFKDLLKEFDTQTAVKISITDIGDGREG 331 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 V + ++ I +N L + TT I D + Sbjct: 332 QV---NTVLGPTQFV-DTSDINSALNSLTWYGTT----------TAIADGITEGASQMDT 377 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNCA 337 D+ +I +TDG + + S G AIG L++ A Sbjct: 378 TDNVNDVMIVITDGFDGDLSSLQTASAAVAT----AGITAIAIGYDENGGIIGSTLEDIA 433 Query: 338 S--PDRFYSVQNSRKL----HDAFLRI 358 + ++ +L F I Sbjct: 434 NGVSSNVIEATSTSELDGLALSVFNSI 460 >gi|300856050|ref|YP_003781034.1| hypothetical protein CLJU_c28840 [Clostridium ljungdahlii DSM 13528] gi|300436165|gb|ADK15932.1| conserved hypothetical protein [Clostridium ljungdahlii DSM 13528] Length = 297 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 28/56 (50%) Query: 12 NCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 N +G+++I++ +L+ + V++ + K KL +D L A ++ + E Sbjct: 5 NDRGNVAIISCLLITALLGFTAYVLDIGMIYIEKTKLTNAIDSGALAAALELPDNE 60 >gi|229582324|ref|YP_002840723.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|228013040|gb|ACP48801.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 356 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 71/197 (36%), Gaps = 34/197 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + ++++D S SM KL A +S +++L ++N L+ Sbjct: 32 QSSVTSSIHYIIMIDNSPSMRGE------KLNTAVQSAQKLL------YNLNEGNYVTLI 79 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FS+ +I + + ++ +A ++AK Sbjct: 80 LFSNHP---------------EIKYQGPAKGIITFDVGKGYTTRLHEAVSFTINLAKQSQ 124 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SP 339 K II LTDG+ + D + Y + IG+ ++ LK A S Sbjct: 125 VPTK-IIMLTDGKPT----DKRNVKDYEKLDIPPNTQIITIGIGNNYNERILKKLADRSS 179 Query: 340 DRFYSVQNSRKLHDAFL 356 +FY +++ +L + F Sbjct: 180 GKFYHIKDISELPNIFE 196 >gi|123444851|ref|XP_001311192.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121892992|gb|EAX98262.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 698 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 72/208 (34%), Gaps = 46/208 (22%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 S++ +S+ V+D S SM ++ A + +R + + Sbjct: 236 FEGSIESNSE------FYFVVDCSGSMAGK------RIENAVKCMRLFIQSLPV------ 277 Query: 215 VVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDA 269 R ++ F S+ V + ++ + T L++ Sbjct: 278 GCRFAILKFGSQFQTVLEPCDYTDENVARAMKLLDTIKADMGGTDILSPLQH-------- 329 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR--GAIVYAIGVQAE 327 K + + K I FLTDGE +P+ C A++ G +++IG+ + Sbjct: 330 ----VSELKAKEGFVKQIFFLTDGEVHNPD-------MICATAQKNRSGNRIFSIGLGSG 378 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHD 353 A +K A S + + + +++ Sbjct: 379 ADPGLIKGMARKSGGNYSIIGDDDNMNE 406 >gi|313200801|ref|YP_004039459.1| outer membrane adhesin-like protein [Methylovorus sp. MP688] gi|312440117|gb|ADQ84223.1| outer membrane adhesin like proteiin [Methylovorus sp. MP688] Length = 1543 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 71/183 (38%), Gaps = 29/183 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLD-------------MMMVLDVSLSMNDHF 185 T +++ + ++ ++ + D + +++ LD+S SM H Sbjct: 988 VLTLNVGVSATANGVTSVGNLTVNVEDDSPVANPITANLSTTNTNLLITLDISGSMRTHD 1047 Query: 186 GPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 G G +L A +SI+ +LD ++ D R LV FS+ Q + + ++ Sbjct: 1048 GVGDTTRLASAIQSIKVLLDKYDALGD----TRISLVVFSTTAAQVGTDWMTIDQAKAQL 1103 Query: 245 NRLI---FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 ++++ G T L A + D + + + F++DGE ++ + Sbjct: 1104 DQILVKGPGGNTNYDSALANAMDAFDDPGKL--------TNAQNVAYFISDGEPNTGSGS 1155 Query: 302 NKE 304 N Sbjct: 1156 NTS 1158 >gi|268556154|ref|XP_002636066.1| Hypothetical protein CBG01306 [Caenorhabditis briggsae] gi|187038428|emb|CAP22594.1| CBR-CLEC-218 protein [Caenorhabditis briggsae AF16] Length = 381 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 64/189 (33%), Gaps = 17/189 (8%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN--VVRSGLVTF 223 ++ LD+ +V+D S M G L S+ L + D ++ R ++T+ Sbjct: 33 NLWLDVYVVIDDSTKM------GTSGLSQVASSVFSTLANSRVGSDYSDKRGARVAVITY 86 Query: 224 SSKIVQTFPLAWGVQH--IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + L+ + ++ I+ L + S ++ I + + ++ Sbjct: 87 NENAYIRSNLSDLTSNQDLENVISSLQVSKSDISN--IQTPLKLINEMMGYKDGNGPKNN 144 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LKNCASPD 340 I++ D + D + K G + + ++ LK AS Sbjct: 145 TMSVIIMYAAD----YIDYDQPTANQLAYYLKENGVTIITVANMDDSNKIMKLKALASEG 200 Query: 341 RFYSVQNSR 349 +S+ + Sbjct: 201 YGFSLSDEN 209 >gi|303242523|ref|ZP_07329002.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302589941|gb|EFL59710.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 1855 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 55/161 (34%), Gaps = 26/161 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-- 227 D++ +D + SM D + + + + ++ +R GL+ + Sbjct: 760 DIIFAIDSTGSMGDKIENVITNVNEFAEELSKNVE-----------IRFGLIDYKDIYEV 808 Query: 228 -VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 T W V ++++++ ++ E A + + +A+ + K Sbjct: 809 GETTINCGWFTDVNELKKRVDEILVYGGGDVP---ESAVDALEEARTMGFRP-----NAK 860 Query: 285 KYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIG 323 K+I+ LTD + D + K G I + Sbjct: 861 KFIVLLTDADYKDGTHFTDVTTMAQEIDLLKNDGIITSVVS 901 >gi|268324839|emb|CBH38427.1| hypothetical protein containing von Willebrand factor type A domain [uncultured archaeon] Length = 542 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 83/207 (40%), Gaps = 20/207 (9%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + + ++D+S S + + + L +I + R +V Sbjct: 352 TENRTRDIAVAFLVDMSGSTVGSTILCEKEALILMSEALKELGDAFAIYGFSGYGRDNVV 411 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F +++ F ++ ++ +Q KI+ + +T+ P + +A K+ + + Sbjct: 412 FF---LIKDFEDSYDLR-VQCKISTMTNKQSTRIAPAIRHATTKL-----------RRRE 456 Query: 282 DYKKYIIFLTDGE----NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + + +I L+DG+ + N +++ EA+R G + I V EAA ++L Sbjct: 457 EGTRMLILLSDGKPLDRDYYGNYAIEDTRMALKEAQRYGVKSFCITVDREAA-EYLPRMY 515 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + R+ + + KL RI K Sbjct: 516 ADSRWVVIDDVLKLPAKITRIYKRFTT 542 >gi|218131128|ref|ZP_03459932.1| hypothetical protein BACEGG_02733 [Bacteroides eggerthii DSM 20697] gi|317476994|ref|ZP_07936236.1| hypothetical protein HMPREF1016_03220 [Bacteroides eggerthii 1_2_48FAA] gi|217986648|gb|EEC52982.1| hypothetical protein BACEGG_02733 [Bacteroides eggerthii DSM 20697] gi|316906787|gb|EFV28499.1| hypothetical protein HMPREF1016_03220 [Bacteroides eggerthii 1_2_48FAA] Length = 289 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEF------GTVKQMKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I L T GLEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFHPESRRTNIRLGLEYLTNVMK 173 >gi|282899257|ref|ZP_06307228.1| hypothetical protein CRC_00707 [Cylindrospermopsis raciborskii CS-505] gi|281195826|gb|EFA70752.1| hypothetical protein CRC_00707 [Cylindrospermopsis raciborskii CS-505] Length = 431 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 67/174 (38%), Gaps = 15/174 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREM--LDIIKSIPDVNNVVRSGLVTFS---- 224 ++++LD S SMN G K+ A +I + + + ++V G + Sbjct: 110 IIVLLDFSGSMNQTDSGGSKKITGAINAIGQFAKVSSERGGDTQISIVPFGEAGKNCPEY 169 Query: 225 ----SKIVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + + + +Q+ E ++ L +T L+ + + + ++ + + Sbjct: 170 PVDKDTLDKFISASDFKLQNSLEYLSTLNPCGSTNLYQPLKKSLQFLGNPEDPRFTLPEN 229 Query: 280 HDDYK--KYIIFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAAD 330 D + II L+DG +++ N + ++ + K V+ +G Sbjct: 230 SSDPQPRLSIILLSDGYHNAMN-EYRDFSELKSLLKDYENITVHTLGYGLTPEQ 282 >gi|167399327|ref|ZP_02304851.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167051831|gb|EDR63239.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] Length = 253 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 28/133 (21%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + +I + T ++ G+ + ++ + K +I L+DG++ Sbjct: 111 NSKGDINEILNMKAEGGTLASSGILVGNKMLTES-----------QNNNKLMIILSDGDD 159 Query: 296 S----SPNIDNKESLF----------YCNEAKRRGAIVYAIGVQAEAADQFL---KNCAS 338 + S D K + C + K G + IG+ + + K+C Sbjct: 160 NTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVG 219 Query: 339 PDRFYSVQNSRKL 351 FY +N+ +L Sbjct: 220 TGNFYLAKNAHEL 232 >gi|288925754|ref|ZP_06419685.1| von Willebrand factor, type A [Prevotella buccae D17] gi|288337409|gb|EFC75764.1| von Willebrand factor, type A [Prevotella buccae D17] Length = 281 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + R + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDF------GTVRQTKRDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I ++ T +EY + Sbjct: 126 SDRIEKYIPPKKGRRHILYIIREMLDFEAQSKKTDIGAAVEYLTRVMK 173 >gi|269104660|ref|ZP_06157356.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] gi|268161300|gb|EEZ39797.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] Length = 3986 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 67/191 (35%), Gaps = 36/191 (18%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH----------IQ 241 + ++ + ++D I + + + + +VTF+S + + + I Sbjct: 3555 FDMTKQAYQVLVDEILTNTNDKSSLNFNVVTFNSTVGGDSSFHYDAESNSFVNSRGTDIH 3614 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 +N LI G T+ L+ + I + FLTDG++++ + Sbjct: 3615 NYLNSLIAGGGTEFEAPLKTISDHIVTDGNTRN-----------VVYFLTDGKDNTGFSN 3663 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS------------PDRFYSVQNSR 349 + + + A + A V +I V + A P + N+ Sbjct: 3664 SANNS---DYAALKHAEVISIAVGPSGDADQVNQIAQLGEGYNNNNDSEPSYSKVITNTN 3720 Query: 350 KLHDAFLRIGK 360 +L D F IG+ Sbjct: 3721 ELTDIFKDIGQ 3731 >gi|154687788|ref|YP_001422949.1| YwmD [Bacillus amyloliquefaciens FZB42] gi|154353639|gb|ABS75718.1| YwmD [Bacillus amyloliquefaciens FZB42] Length = 226 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 76/231 (32%), Gaps = 29/231 (12%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD---- 204 L + S V + K ++ ++ D S SM G G K+ VA S+ + Sbjct: 11 GLLTLSLGSPVFAAEKKHEETNVAVLFDASGSMIQKTG-GERKIDVAKESVTSFAEVLPE 69 Query: 205 ----IIKSIPDVNNVVRSG-LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +++ N SG V+ ++ + V ++ ++++ T L Sbjct: 70 DTNLMLRVFGHKGNNKNSGKAVSCNATETLYGLQPYAVTPFEQSLSKIKPTGWTPIAKAL 129 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA-I 318 + E + K + +TDGE + + ++ Sbjct: 130 ----------SDTREEFERAGATGKNVVYLITDGEETCGG----NPQAEIQKLRKANVNT 175 Query: 319 VYAI---GVQAEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + I + ++ K A + S ++ + A+ K++ K+ Sbjct: 176 IVNIIGFNFGMKGSESLEKAAEAGGGTYVSADSADEFKQAWEDAAKDLAKE 226 >gi|189423337|ref|YP_001950514.1| von Willebrand factor A [Geobacter lovleyi SZ] gi|189419596|gb|ACD93994.1| von Willebrand factor type A [Geobacter lovleyi SZ] Length = 572 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 53/141 (37%), Gaps = 22/141 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+++ +M+++D S SM K+ VA+++ + + + SIP V Sbjct: 401 KTEVNTAIMILVDRSGSMQHQ------KIEVASKTAYVVAEALDSIPGCFAAV---AAFP 451 Query: 224 SSKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL +G + K + G T L +A ++ +E Sbjct: 452 VGNSDGVAPLVRFGERPCSSKFG-MSAGGGTPLAQALYWAGVELLKREE----------- 499 Query: 283 YKKYIIFLTDGENSSPNIDNK 303 +K ++ +TDGE + Sbjct: 500 PRKILLSVTDGEPDDLRTTKR 520 >gi|156742508|ref|YP_001432637.1| magnesium chelatase ATPase subunit D [Roseiflexus castenholzii DSM 13941] gi|156233836|gb|ABU58619.1| magnesium chelatase ATPase subunit D [Roseiflexus castenholzii DSM 13941] Length = 608 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 76/216 (35%), Gaps = 25/216 (11%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKISS-KSDIGLDM 171 SI D + ++SA R PF + LL + ++I +S G Sbjct: 367 SIPGDPRRGRIDVSATLRAAAPFQPLRRAQANAHAAPSRVLLRSDDLRIKQYRSKAGALF 426 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQT 230 + +D S SM H ++ A +++ +L D R L+ F Sbjct: 427 LFAVDASGSMALH------RMRQAKGAVQALLQKAYVHRD-----RVALLAFRGQNAELL 475 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + V+ + ++ L G T L A A+ + + ++ L Sbjct: 476 LPPSQSVELARRALDLLPTGGGTPLAAALLAAIEVAQQARARG--------IMQTVLVLL 527 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 TDG + +E + +E + G V A G+Q Sbjct: 528 TDGRANIGLRAGREGV--ADELQTIGRAVVAAGIQT 561 >gi|327266506|ref|XP_003218046.1| PREDICTED: complement factor B-like [Anolis carolinensis] Length = 764 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 70/235 (29%), Gaps = 37/235 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +I + D L++ +V+D S S D+ A ++++ I S G Sbjct: 242 RIKIEKDGSLNIYIVIDSSRS------IKKDQFQEAQNMSIKLIEKISSYDISPKY---G 292 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFG---------STTKSTPGLEYAYNKIFDAK 270 ++TF++++ + + + G T GL Y + + Sbjct: 293 VITFATEVKELVSTMDERSSDAAWVIEKLEGIKHSDHKQKPGTNIYKGLFAVYTMMISQE 352 Query: 271 EKLEHIAKGHDDYKK----YIIFLTDGENSSPNIDNKESLFYC----------NEAKRRG 316 + I+ L+DG+ + D + + Sbjct: 353 AAERRRGLNPPPVSEKTRHVIVLLSDGDYNMGG-DPTSVIKQIRGFLHIGRNHTNPREDY 411 Query: 317 AIVYAIGVQAEAADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKEMVKQRI 367 VY V + + AS + +++ L AF + E + Sbjct: 412 LDVYIFAVGGTVNMENVNKIASQKSGERHAFKLKDYNDLQVAFDEMIDESETLSM 466 >gi|94265396|ref|ZP_01289149.1| von Willebrand factor, type A [delta proteobacterium MLMS-1] gi|93454098|gb|EAT04430.1| von Willebrand factor, type A [delta proteobacterium MLMS-1] Length = 756 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 75/215 (34%), Gaps = 37/215 (17%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L+ + + L++ +++D S SM+ D + A +++ E+L K D Sbjct: 248 LVSFQPGISLPEKRLPLNIKILIDCSSSMSG------DSMEQARQALGEILKHFK-ADDH 300 Query: 213 NNVVRSGLV--TFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDA 269 N+V G + V+ P ++ + L T+ L+ Sbjct: 301 FNIVMFGDSCEALFERQVKATPA--NLKRAGRLLQVLDADMGCTELYEALQMTLQIPGPD 358 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + I+ +TDGE N D L K+ ++++GV + + Sbjct: 359 NTPAD------------ILLITDGE--VWNHDRVVGL-----VKKAKHRIFSVGVGSCVS 399 Query: 330 DQFLKNCA--SPDRFYSVQN----SRKLHDAFLRI 358 + +K A S V + K+ F RI Sbjct: 400 EDLVKRLAGISGGACELVAPNEQMAEKIIRHFQRI 434 >gi|17537923|ref|NP_496260.1| C-type LECtin family member (clec-61) [Caenorhabditis elegans] gi|3881711|emb|CAA88986.1| C. elegans protein ZK666.7, confirmed by transcript evidence [Caenorhabditis elegans] Length = 403 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 79/226 (34%), Gaps = 15/226 (6%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + FC + SS+ V ++ + LD++ V+D S M D G+ + Sbjct: 10 FLLGITFCLGSPVSQSSNCQDGYMDRVCGEDETHLWLDIVAVVDNSKGMTDK---GVVTV 66 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-- 250 S+ + P+ R G+VT++ L +++ ++FG Sbjct: 67 AGQIVSLFVDGQQLGIDPNQPRTTRIGIVTYNRDATVVADL--NKITSIDQLADIVFGAL 124 Query: 251 -STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 + +A + D + + A YKK +I + ++ L Sbjct: 125 HKASSIADSYLHAGLEAADDLLQRQSFATSRGHYKKLVIVY---ASEYSGTGTQDPLPLA 181 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLK---NCASPDRFYSVQNSRKLH 352 K + + + + FL+ A+P ++ QN +L Sbjct: 182 TRMKVD-VAIATVAYGQDNVNGFLRQLSQIATPGYNFTNQNGIQLV 226 >gi|14030591|gb|AAK52970.1|AF367016_1 serum opacity factor VT25 [Streptococcus pyogenes] Length = 963 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 67/169 (39%), Gaps = 13/169 (7%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + ++K++ K D G D+M +LDVS M ++F +++ ++ K + Sbjct: 192 TLDVTLKVTPKQIDEGADVMALLDVSKKMTKENFDKAKEQIKKMVTTLTGEPTDGKENHN 251 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDA 269 N VR L+TF K+ L +++ K+N + L+ A +K + Sbjct: 252 RRNSVR--LMTFYRKVNDPIELT--TKNVDAKLNEVWEQAKKDWDWGVDLQGAIHKAREI 307 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + ++ K +++I+ + GE++ N S Sbjct: 308 FKNEKNSKK-----RQHIVLFSQGESTFSYDINANSKANLKAITEDKIT 351 >gi|302342832|ref|YP_003807361.1| Magnesium chelatase [Desulfarculus baarsii DSM 2075] gi|301639445|gb|ADK84767.1| Magnesium chelatase [Desulfarculus baarsii DSM 2075] Length = 680 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 45/135 (33%), Gaps = 18/135 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + G +M +D S SM ++ V+ +++ +L R GLV F Sbjct: 467 ARRGRLIMFCVDASGSM-----NAAARMRVSKQAVLGLLTEA-----YQKRDRVGLVAFG 516 Query: 225 SKIV-QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P V+ ++ + L G T GL + +L K Sbjct: 517 GNAARLLLPPTGSVEVARKLLAELPTGGKTPLAAGLAVTAQAV---SRELARDPK----L 569 Query: 284 KKYIIFLTDGENSSP 298 ++ TDG + P Sbjct: 570 TPLVVVFTDGRPNVP 584 >gi|325689802|gb|EGD31806.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK115] Length = 450 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTTALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|324993014|gb|EGC24934.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK405] gi|327462310|gb|EGF08637.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1] gi|327489663|gb|EGF21455.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1058] Length = 450 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTTALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|313222038|emb|CBY39058.1| unnamed protein product [Oikopleura dioica] Length = 1721 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 44/207 (21%), Positives = 75/207 (36%), Gaps = 31/207 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV-RSGLVTFSS--K 226 D+ VLD S S SI E + I + GL+ +S + Sbjct: 219 DIFFVLDGSYS----------TKRAGWESIIEFVQRFVKINFEHGGDMNYGLLQYSDFVE 268 Query: 227 IVQTFPLAWGVQHIQEKIN-----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + +F G + I E I + G +T + + YA F ++ + Sbjct: 269 PILSFADKEGQREIAEFIEILSQIKYHSGFSTLTGTAMRYAAETEFMSERGARKNVR--- 325 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKNCASPD 340 K +I +TDG + +ID+ + G V A+GV + ++ +SPD Sbjct: 326 ---KIMIVVTDGR--AKDIDDNTVKIVGESLRAAGDLTVVAVGVNKAVESELVEIASSPD 380 Query: 341 RFYSVQNSRKLHD----AFLRIGKEMV 363 ++ + LH+ F I KE Sbjct: 381 FVHNTKKFEDLHNFLSPVFAEICKETG 407 >gi|297671393|ref|XP_002813835.1| PREDICTED: collagen alpha-1(VII) chain-like [Pongo abelii] Length = 2889 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 61/188 (32%), Gaps = 35/188 (18%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 D++ +LD S S+ + ++ VR V +S Sbjct: 35 YAADIVFLLDGSSSIGRS------NFREIRSFLEGLVLPFSGAASA-QGVRFATVQYSDD 87 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 W + S G + N + + L + + D K Sbjct: 88 PRGQ----W-------------ASDASDSFEGTGLSINVLM-IRAILSSVTQARDCVPKV 129 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS---PDRFY 343 I +TDG++ + L K +G ++A+G++ A + LK AS D F+ Sbjct: 130 CILITDGKSQDLVDTAAQRL------KGQGIKLFAVGIK-NADPEELKRVASQPTSDFFF 182 Query: 344 SVQNSRKL 351 V + L Sbjct: 183 FVNDFSIL 190 >gi|290976796|ref|XP_002671125.1| predicted protein [Naegleria gruberi] gi|284084691|gb|EFC38381.1| predicted protein [Naegleria gruberi] Length = 1058 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 37/191 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LD S SM+ + A ++ +L I + R + + + Sbjct: 65 LVIALDRSGSMSGS------AISEAKLALESLLTNIDGHNE-----RVLFIPYDTSAELI 113 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + ++ R+ G T + G D + I+F Sbjct: 114 DMSRMSLTEKLNQVQRVHAGGGTDFACVF------------DAIRLFTGALDGQIAIVFF 161 Query: 291 TDGENSSPNIDNKESLFYCNEAKRR------GAIVYAIGVQAEAADQFL----KNCASPD 340 TDG++ N+E+ + K+R + IG + + L + ++ Sbjct: 162 TDGQDGYNG--NRET--AIDMMKKRLTTESESFEFHTIGFSSGHDARLLTDMTRLGSAQG 217 Query: 341 RFYSVQNSRKL 351 F +++ + Sbjct: 218 TFQYAESASSI 228 >gi|284922067|emb|CBG35146.1| conserved hypothetical protein [Escherichia coli 042] Length = 219 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|301768028|ref|XP_002919433.1| PREDICTED: calcium-activated chloride channel regulator 2-like [Ailuropoda melanoleuca] gi|281352225|gb|EFB27809.1| hypothetical protein PANDA_008061 [Ailuropoda melanoleuca] Length = 943 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 72/212 (33%), Gaps = 39/212 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I I G+V+F+SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQAAEFYLMQIVEIHTF-----MGIVSFNSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + + + + T GL+ + + Y Sbjct: 362 AQLHQINSDDDRKLLVSHLPMTVSAEAETSICSGLKKGFEVV---------EKLNGKAYG 412 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 +I +T G++ L G+ +++I + + + L +F Sbjct: 413 SVMILVTSGDDEHMGNCLLTVL-------SSGSTIHSIALGSSVVENLEELSRRTGGLKF 465 Query: 343 YSVQ--NSRKLHDAFLRIGK---EMVKQRILY 369 + NS + +AF RI ++ +QRI Sbjct: 466 FVPDKSNSNSMTEAFSRISSGTGDIFQQRIQL 497 >gi|198430887|ref|XP_002124814.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 400 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 59/325 (18%), Positives = 110/325 (33%), Gaps = 28/325 (8%) Query: 49 HYILDHSLLYTATKIL----NQENGNNGKKQKNDFSYRI-IKNIWQTDFRNELRENGFAQ 103 ++ + K+L N N G Q + + + + + +N Sbjct: 48 QTVIVKASANGTAKLLIGAPNSRRNNFGSSQGVLYGIKFAVASGIYDIVAKDPEKNAPIV 107 Query: 104 DINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTF-PWCANSSHAP-LLITSSVKI 161 N + + ++ I + + + + A RY C NS ++ + Sbjct: 108 VTNLRKDADAIGISVSNSKEAVTVCAPMRYRHCRSHNIVSGSCFNSKDFGESWQSAKPTV 167 Query: 162 SSKSDIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSI-REMLDIIKSIPDVNNVVRSG 219 SD LD++ VLD S S+ + +FG + + +S E D I I + Sbjct: 168 PFCSDQALDLLFVLDGSFSVGETNFGLVKNWVVALAKSFDIEKQDNIGIIQYSHWYP--- 224 Query: 220 LVTFSSKIVQTFPLAWGVQ---HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V +S + + G+ + EKI + I + +A NK E Sbjct: 225 GVPYSQQPYMKTEVPLGLYKNFTLFEKIAQNISLQGFTTYTA--HALNKTVLDFMASERF 282 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 H++ K +I +TDG D + + +G I +AIG+ L+ Sbjct: 283 M--HENVTKVMILITDGR----ADDAIDLYSSAEYVRSQGIITFAIGIGNSVLRDQLQIV 336 Query: 337 AS-----PDRFYSVQNSRKLHDAFL 356 A+ R + V L+ Sbjct: 337 ANGKLGEDTRVFEVTTFTSLNSILS 361 >gi|198430141|ref|XP_002124276.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 606 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 63/165 (38%), Gaps = 27/165 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++++LD S S+N+ + + V + I + + + + + ++++ Sbjct: 410 DLVIILDASSSVNEDNWRRIKSITV--QLIIKFMS--------SESTQFAVFRYNNRPDV 459 Query: 230 TFPL----AWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + I + +G T++ L +A + I H G+ Sbjct: 460 ATQILLQNTNDAAALLNAIENIPYGGVGTRTGIALGHAADVIL-------HTDNGNRPKA 512 Query: 285 KYIIF-LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 I+ +TDG+ + N + + GA V A+G++ E Sbjct: 513 ADIVLIMTDGQVNV----NDNTQVPADALHNMGATVMAVGIEPEK 553 >gi|158337841|ref|YP_001519017.1| hypothetical protein AM1_4727 [Acaryochloris marina MBIC11017] gi|158308082|gb|ABW29699.1| conserved hypothetical protein [Acaryochloris marina MBIC11017] Length = 971 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 58/347 (16%), Positives = 110/347 (31%), Gaps = 67/347 (19%) Query: 50 YILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTD-FRNELRENGFAQDINNI 108 D S A+ + + N G + + + +D + ++ +NG + + Sbjct: 450 NASDTSYSIQASSL-SVTNSTGGPATLHAWVDFDGNGTFDSDEYTSQPVQNGISSPDGAL 508 Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP-------LLITSSVKI 161 S + + Y +R+ + L IT+ V Sbjct: 509 TWSGAGVSGMSGGTTTY-----ARFRITTDGGINANTPGGFARDGEVEDHALAITTPVNP 563 Query: 162 SSKSD---IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 ++ + DMM +LD S S++ + + + ++ ML+ + D N R Sbjct: 564 DIDNNFCQVSSDMMFILDKSGSVS------LSERRLQRDAVMAMLNYL---VDNNITSRV 614 Query: 219 GLVTFSS---------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 G+V F S + + +N I G T G + A + Sbjct: 615 GIVRFDSTSATVIGYTDVTAANLPTFESALNTNYVN--IGGGATNWEAGFQQAISLGVSP 672 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + F DG +S N E+L + K+ GA +Y IG+Q+ Sbjct: 673 G------------SPDVVFFFADGNINSGGSPNDEALQF----KQAGAHIYGIGIQSLDI 716 Query: 330 DQFLKNC--------------ASPDRFYSVQNSRKLHDAFLRIGKEM 362 D FL A+ + V + L D + + + Sbjct: 717 DDFLDITDGSNTTQFDAALDNANSADYVEVNSYDDLADDMTSLLRSL 763 >gi|17533683|ref|NP_496743.1| C-type LECtin family member (clec-63) [Caenorhabditis elegans] gi|3876684|emb|CAB03058.1| C. elegans protein F35C5.6, confirmed by transcript evidence [Caenorhabditis elegans] Length = 411 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 58/171 (33%), Gaps = 17/171 (9%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK------IVQTFPLAWGVQH 239 G+ ++ ++ I + R GLVT++ + + + Q Sbjct: 73 NEGITEIAANIVTVFGNGTRIGNQYSDPRSTRLGLVTYNGRSTIVADLNLLQSIDDLYQS 132 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + +N++ + G+ A N + + +YK+ ++ Sbjct: 133 VFSTLNQVSNSDDSFLAKGIGAAENVLQSGRTNGVR-----SNYKRLVVVYASAYKGEGE 187 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNCASPDRFYSVQN 347 +D + + K G +V + + + L N ASP+ ++ ++ Sbjct: 188 LDP---IPVADRLKSSGVVVSTVAFDQDGDEALLAGLTNIASPNYAFTSKD 235 >gi|91777715|ref|YP_552923.1| putative MxaC-like protein [Burkholderia xenovorans LB400] gi|91690375|gb|ABE33573.1| Putative MxaC-like protein [Burkholderia xenovorans LB400] Length = 328 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 78/259 (30%), Gaps = 52/259 (20%) Query: 145 CANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGP-------GMDKLGV 194 + + + + G +++++D S SM++ G K V Sbjct: 57 VLAMVAIVFGLAGPGRSQRQVLRTGSGAQILILMDRSASMDETMNSKGVESPAGESKNKV 116 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST-- 252 A S+ + R + F + V P + + I I G Sbjct: 117 ARASLTNFV-------AQRPNDRLAFMMFGTSPVLAMPFTYDHRAIAAAIAGTAVGRGMP 169 Query: 253 -TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T+ GL A E + G ++ I+ ++DG +D + + Sbjct: 170 DTQLDRGLLAAI------GEFNGRASSG----RRAIVLVSDG---GARLDTQVRRLIQDG 216 Query: 312 AKRRGAIVYAI----------------GVQAEAADQFLKNCAS---PDRFYSVQNSRKLH 352 R +Y I ++ A + + S P R + N+R + Sbjct: 217 LTRNQIALYFIYLRSGTYSPDLNAAVPANESSAEAELHRYFLSLKTPYRLFQAGNARAMK 276 Query: 353 DAFLRIGKEMVKQRILYNK 371 DA I ++ + + Sbjct: 277 DAMAEINRQQNARTSFVER 295 >gi|74312597|ref|YP_311016.1| hypothetical protein SSON_2125 [Shigella sonnei Ss046] gi|82543447|ref|YP_407394.1| hypothetical protein SBO_0899 [Shigella boydii Sb227] gi|193069246|ref|ZP_03050202.1| von Willebrand factor type A domain protein [Escherichia coli E110019] gi|218554640|ref|YP_002387553.1| hypothetical protein ECIAI1_2149 [Escherichia coli IAI1] gi|256017755|ref|ZP_05431620.1| hypothetical protein ShiD9_02482 [Shigella sp. D9] gi|300921723|ref|ZP_07137889.1| von Willebrand factor type A domain protein [Escherichia coli MS 182-1] gi|301329923|ref|ZP_07222644.1| von Willebrand factor type A domain protein [Escherichia coli MS 78-1] gi|332278773|ref|ZP_08391186.1| von Willebrand factor type A domain-containing protein [Shigella sp. D9] gi|73856074|gb|AAZ88781.1| conserved hypothetical protein [Shigella sonnei Ss046] gi|81244858|gb|ABB65566.1| conserved hypothetical protein [Shigella boydii Sb227] gi|192957379|gb|EDV87826.1| von Willebrand factor type A domain protein [Escherichia coli E110019] gi|218361408|emb|CAQ98995.1| conserved hypothetical protein [Escherichia coli IAI1] gi|300421861|gb|EFK05172.1| von Willebrand factor type A domain protein [Escherichia coli MS 182-1] gi|300844022|gb|EFK71782.1| von Willebrand factor type A domain protein [Escherichia coli MS 78-1] gi|320183712|gb|EFW58550.1| hypothetical protein SGF_04108 [Shigella flexneri CDC 796-83] gi|323168804|gb|EFZ54484.1| von Willebrand factor type A domain protein [Shigella sonnei 53G] gi|323183854|gb|EFZ69245.1| von Willebrand factor type A domain protein [Escherichia coli 1357] gi|324019209|gb|EGB88428.1| von Willebrand factor type A domain protein [Escherichia coli MS 117-3] gi|332097069|gb|EGJ02052.1| von Willebrand factor type A domain protein [Shigella boydii 3594-74] gi|332101125|gb|EGJ04471.1| von Willebrand factor type A domain-containing protein [Shigella sp. D9] Length = 219 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|308472901|ref|XP_003098677.1| hypothetical protein CRE_04172 [Caenorhabditis remanei] gi|308268277|gb|EFP12230.1| hypothetical protein CRE_04172 [Caenorhabditis remanei] Length = 392 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 53/164 (32%), Gaps = 9/164 (5%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 G+D + ++ I S R GLVT++S Q L Q I + N Sbjct: 52 NNGLDNVAANILNVFSSGTRIGSNSSEPRTTRLGLVTYNSNATQQADL-NKYQSINDAGN 110 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEK--LEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 + +T Y + A++ + + YK+ +I N +D Sbjct: 111 GIFDSLSTVVYTADSYLATGLTLAEKMFIEQSVNTIRGHYKRVVIVYASEYNEDGELDP- 169 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEAADQF---LKNCASPDRFYS 344 L N K + + + + L ASP +S Sbjct: 170 --LSVANRLKLSNVKIITVAYEQPGSVGLENGLSQIASPGFSFS 211 >gi|220914886|ref|YP_002490194.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] gi|219952637|gb|ACL63027.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] Length = 353 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 48/134 (35%), Gaps = 20/134 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + G + +++D S SMN+ F P + A S R + + + P + + Sbjct: 82 TGTGAQVSLLIDRSGSMNESFAGRQPSGAEESKAAASRRLLAEFVGRRPH----DQVAVS 137 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 FS+ + P+ + ++ I + T GL A ++ Sbjct: 138 VFSTAPIAVLPMTDRSEAVRAAIRAIDRPGLDATNVARGLGLALSQFGA----------- 186 Query: 280 HDDYKKYIIFLTDG 293 + ++ ++DG Sbjct: 187 GASTGRVLLLVSDG 200 >gi|221131058|ref|XP_002157772.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 490 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 32/144 (22%), Positives = 52/144 (36%), Gaps = 10/144 (6%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +++LDVSLSM+ DK+ + + +L + + S LV FSS Sbjct: 4 VILLDVSLSMSRSINTR-DKISLIDVAQSNLLYFFDQLSSRCKLEHSSLVVFSSLYEVLV 62 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + ++ L TK L E L G +IF+T Sbjct: 63 KFTRDYEALKTACLSLTTYDKTKIETAL--------FGIEDLVTEEWGSFVPIN-LIFIT 113 Query: 292 DGENSSPNIDNKESLFYCNEAKRR 315 DG+ +ESL N+ ++ Sbjct: 114 DGQCGIGEYSLQESLKTINQKQKD 137 >gi|332018134|gb|EGI58743.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Acromyrmex echinatior] Length = 2218 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 67/198 (33%), Gaps = 25/198 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++M+ ++D S S+ ++F ++ + +D + + + G V Sbjct: 88 RNKTDQVEMVFLVDASGSIGAENFRSELNFVTKLLSDFT--VDALAARIALVTFGGRGSV 145 Query: 222 TFSSKIVQTFPLAWGVQHI--QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + ++ ++ N G T + L A + ++E Sbjct: 146 YRNIDQISRHGPNDHKCYLLNKQFSNITYSGGGTYTRGALLEALAILEKSREAA------ 199 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 K + +TDG ++ + + K GAIV+ G++ ++ + P Sbjct: 200 ----NKVVFLITDGFSNGG-----DPRPAAHLLKNTGAIVFTFGIRTGNVEELHDIASHP 250 Query: 340 DRFYSVQNSRKLHDAFLR 357 + L D+F Sbjct: 251 EY-----THSYLLDSFAE 263 >gi|320105608|ref|YP_004181198.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924129|gb|ADV81204.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 373 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 38/264 (14%), Positives = 80/264 (30%), Gaps = 75/264 (28%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +++ L + +++DVS S ++ ++ + ML + D + L+ Sbjct: 80 RDNNLQLTLGLLVDVSGSQRQVLD---EEREASSSFLDNML-----VADRDKAF---LIQ 128 Query: 223 FSSKIVQTF-------PLAWGVQHIQEKINRLIFGSTTK------------STPGLEYAY 263 F + L G++ + + R F + G A Sbjct: 129 FGHTVELLTDVTGSIPKLQSGLKQVDTQAARPQFSNNQDPNDNSGGSSQRGRRGGGGGAG 188 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++DA ++K +I LTDGE++ ++ A VY+I Sbjct: 189 TALYDAIFLASDEVIHKQPFRKALILLTDGEDNGSKESLSSAIEAAQRADTA---VYSIY 245 Query: 324 VQAE----------------------------------------AADQFLKNCA--SPDR 341 + E + L+ + + R Sbjct: 246 FKGEEHNDTSSRRPSFGGGGFPGGGGRHGGGGGQGGGQPQRTHVDGKKILQRISDETGGR 305 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQ 365 F+ V L + + +I +E+ Q Sbjct: 306 FFEVSKKEPLAEIYKKIAQELRSQ 329 >gi|294495108|ref|YP_003541601.1| von Willebrand factor A [Methanohalophilus mahii DSM 5219] gi|292666107|gb|ADE35956.1| von Willebrand factor type A [Methanohalophilus mahii DSM 5219] Length = 996 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 78/204 (38%), Gaps = 33/204 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K + + + ++D+S S L V ++ M + ++SI D + F Sbjct: 797 KRERDVATLFLVDISASTRKKLDNSKSILDVEKEALVLMTEALESIGDKY-----AIYAF 851 Query: 224 SS----KIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 S + +G ++++ KI+ L T+ P + ++ K+ Sbjct: 852 SGDTRNDVEYYTIKDFGEVFSENVECKIDALEPADNTRLGPVIRHSITKL---------- 901 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDN---------KESLFYCNEAKRRGAIVYAIGVQAE 327 K D K ++ L+DGE I + +++ E K G + I V ++ Sbjct: 902 -KEIDAKIKLLVLLSDGEPYDFGIADGKYENEVAIEDTKMAIQEGKALGMHFFCITVDSK 960 Query: 328 AADQFLKNCASPDRFYSVQNSRKL 351 A+D ++ + S + + N+ L Sbjct: 961 ASD-YMHSIFSDVGYTIIDNATTL 983 >gi|311277028|ref|XP_003135467.1| PREDICTED: protein DDX26B-like isoform 2 [Sus scrofa] Length = 861 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|311277026|ref|XP_003135466.1| PREDICTED: protein DDX26B-like isoform 1 [Sus scrofa] Length = 898 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|301791219|ref|XP_002930578.1| PREDICTED: protein DDX26B-like isoform 1 [Ailuropoda melanoleuca] Length = 898 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|297711112|ref|XP_002832196.1| PREDICTED: protein DDX26B-like [Pongo abelii] Length = 861 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|301791221|ref|XP_002930579.1| PREDICTED: protein DDX26B-like isoform 2 [Ailuropoda melanoleuca] gi|281340117|gb|EFB15701.1| hypothetical protein PANDA_021070 [Ailuropoda melanoleuca] Length = 861 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|223461292|gb|AAI40716.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Homo sapiens] Length = 861 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|194228324|ref|XP_001490675.2| PREDICTED: similar to Protein DDX26B [Equus caballus] Length = 861 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 44/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ + Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-S 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|156382057|ref|XP_001632371.1| predicted protein [Nematostella vectensis] gi|156219426|gb|EDO40308.1| predicted protein [Nematostella vectensis] Length = 297 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 67/193 (34%), Gaps = 26/193 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMV--LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 LL S V S+ + M +V +D S S+ + ++ P Sbjct: 13 LLAYSVVTCRSEEACAVTMDLVFLVDGSGSIGSS------NFDRLKEFVSTVIGGFVISP 66 Query: 211 DVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYAYNKI 266 +V + S K +F A + ++ I+ + + S T + GL A+ + Sbjct: 67 QGTQ---ISVVVYHSSAKTHLSFGDAQDLISVRRIISSIAYPSGPQTYTDRGLVEAHQRF 123 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + ++ + DG+++ +++ K G ++ A+G + Sbjct: 124 AKENGARSSRTT------RVVVVINDGKSNGESLEASS-----KPLKDEGIVIMALGFGS 172 Query: 327 EAADQFLKNCASP 339 + L+ AS Sbjct: 173 SVRVEELQTMASS 185 >gi|119632131|gb|EAX11726.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B, isoform CRA_b [Homo sapiens] Length = 898 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|119632130|gb|EAX11725.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B, isoform CRA_a [Homo sapiens] Length = 450 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|114690272|ref|XP_001136594.1| PREDICTED: DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B isoform 1 [Pan troglodytes] Length = 894 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|114690274|ref|XP_001136749.1| PREDICTED: DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B isoform 2 [Pan troglodytes] Length = 897 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|114690278|ref|XP_521277.2| PREDICTED: protein DDX26B isoform 5 [Pan troglodytes] Length = 861 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|114690276|ref|XP_001137050.1| PREDICTED: protein DDX26B isoform 4 [Pan troglodytes] Length = 898 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|109132369|ref|XP_001100789.1| PREDICTED: protein DDX26B-like isoform 4 [Macaca mulatta] Length = 861 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|86747842|ref|YP_484338.1| von Willebrand factor, type A [Rhodopseudomonas palustris HaA2] gi|86570870|gb|ABD05427.1| von Willebrand factor, type A [Rhodopseudomonas palustris HaA2] Length = 372 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 52/145 (35%), Gaps = 20/145 (13%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M I + APL + S+ + +++ VLD + SM+ K+ Sbjct: 1 MTRITLMKALAFTALLAPLALPSTAS----ARPTVEVAFVLDTTGSMSGLIEGAKRKIWS 56 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTF----SSKIVQTFPLAWGVQHIQEKINRLIFG 250 +I + S PD + +R GLV + + + L +Q + ++ L Sbjct: 57 IATAI------LDSNPDAD--IRMGLVMYRDIGDDYVTRRVELTSDIQDLYARLLELQAR 108 Query: 251 STTK----STPGLEYAYNKIFDAKE 271 L+ A NK+ KE Sbjct: 109 GGGDWPESVNEALDVAVNKLHWTKE 133 >gi|332861720|ref|XP_003317760.1| PREDICTED: protein DDX26B [Pan troglodytes] gi|21756064|dbj|BAC04813.1| unnamed protein product [Homo sapiens] gi|119632132|gb|EAX11727.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B, isoform CRA_c [Homo sapiens] Length = 365 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|94536743|ref|NP_872346.3| protein DDX26B [Homo sapiens] gi|74742010|sp|Q5JSJ4|DX26B_HUMAN RecName: Full=Protein DDX26B gi|57209096|emb|CAI40867.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Homo sapiens] gi|57209211|emb|CAI40723.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Homo sapiens] Length = 861 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|315608296|ref|ZP_07883286.1| von Willebrand factor [Prevotella buccae ATCC 33574] gi|315250077|gb|EFU30076.1| von Willebrand factor [Prevotella buccae ATCC 33574] Length = 289 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + R + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDF------GTVRQTKRDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I ++ T +EY + Sbjct: 126 SDRIEKYIPPKKGRRHILYIIREMLDFEAQSKKTDIGAAVEYLTRVMK 173 >gi|47565090|ref|ZP_00236133.1| gram positive anchor protein, putative [Bacillus cereus G9241] gi|47557876|gb|EAL16201.1| gram positive anchor protein, putative [Bacillus cereus G9241] Length = 997 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 32/148 (21%), Positives = 50/148 (33%), Gaps = 17/148 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD--IIKSIPDVNNV 215 S K + +D ++V D S S L + D ++ S D Sbjct: 55 SFKWPAPKQKKVDFVIVQDASGSFKGTMPNVKKALSNIVDELNPATDRIMVTSYQDYKGY 114 Query: 216 -VRSGLVTFS----SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 G V S K L + IN++ S T + GL++A + Sbjct: 115 KASDGRVLESRGNGIKTTLQAGLTNNFTSAKNGINKITPDSGTPTASGLQFALAE----- 169 Query: 271 EKLEHIAKGHDDYKKYIIFL--TDGENS 296 AKG +D + +FL TDG + Sbjct: 170 ---YEKAKGQNDPDRETVFLLVTDGVAN 194 >gi|225181947|ref|ZP_03735381.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] gi|225167387|gb|EEG76204.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] Length = 272 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 65/183 (35%), Gaps = 45/183 (24%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR-----SGLVT 222 L++ +++D S SMN + + +K++ D N+VR L+T Sbjct: 91 PLEVCLLVDTSGSMN-----------------GKRIREVKTLAD--NLVRQMHEPLSLIT 131 Query: 223 FS-SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + ++ + + T G+ A N + + K Sbjct: 132 FQEGDVGVKVRSTRNDLMVRRGLAAMSAAGLTPMGEGIRTAVNYLCGRRGK--------- 182 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV-----YAIGVQAEAADQFLKNC 336 K +I +TDG + + D L +A GA++ + I + E +FL+ Sbjct: 183 --KHLVILITDGLPTWASGDKDPYL----DAIEAGALIKKHKMHLICIGLEPQRKFLEKL 236 Query: 337 ASP 339 A Sbjct: 237 AES 239 >gi|218695695|ref|YP_002403362.1| hypothetical protein EC55989_2329 [Escherichia coli 55989] gi|218352427|emb|CAU98201.1| conserved hypothetical protein [Escherichia coli 55989] Length = 219 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 62/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++I VQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIAVQGADMKTLAQ 171 >gi|218528581|ref|YP_002419397.1| hypothetical protein Mchl_0537 [Methylobacterium chloromethanicum CM4] gi|218520884|gb|ACK81469.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 480 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 11/105 (10%), Positives = 35/105 (33%), Gaps = 1/105 (0%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI-LNQE 67 F + G+++++ A+ + ++G I+ + + + D ++ + L Sbjct: 30 FHQDRGGTVTVIVALAATTLMGLVGGAIDYARLVSAQRHIQQATDAGVMAGGNALKLVVS 89 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERST 112 N + IK+ + ++ + + T Sbjct: 90 NTASVIGLTTQTIQDEIKDSAKNPVTIQVDVASDKTSVTAVVEQT 134 >gi|42526263|ref|NP_971361.1| magnesium chelatase subunit D/I family protein [Treponema denticola ATCC 35405] gi|41816375|gb|AAS11242.1| magnesium chelatase, subunit D/I family [Treponema denticola ATCC 35405] Length = 643 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 62/172 (36%), Gaps = 24/172 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 K+ IG ++ ++D S SM M ++ +I +L + ++T Sbjct: 440 RKTRIGASIIFLVDASGSMG-----AMKRMKETKNTILSLL-----MDSYQKHDEVSMIT 489 Query: 223 FSS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F+ ++ P V + ++ + T GL A H K D Sbjct: 490 FAGTRVEIILPFTRSVLLAKRELQLIPTIGKTPLALGLNKALEYF------KIHRLKNKD 543 Query: 282 DYKKYIIFLTDGENSSPNI----DNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + +TDG + ++ K++LF + K +Y++ + E+ Sbjct: 544 MIP-LLFLITDGRTNHGSVFFDEPIKDALFISKKIKNAN--IYSVVIDTESG 592 >gi|332885553|gb|EGK05799.1| hypothetical protein HMPREF9456_02063 [Dysgonomonas mossii DSM 22836] Length = 580 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 81/212 (38%), Gaps = 26/212 (12%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD-MMMVLDVSLSMNDHFGPGMDKL 192 ++ PW NS + + I + + ++ + ++DVS SM G +L Sbjct: 178 KISTEVGNCPW--NSQNRLVKIGLKARSMAGENLPASNFVFLIDVSGSM-----YGATRL 230 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFG 250 + S++ + + ++ R +V ++ + P G Q I+E ++ L G Sbjct: 231 DLVKSSLKLLTN------NLREKDRVAIVVYAGSAGEVLPSTSGANKQKIKEALDNLNAG 284 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G++ AY ++ KG ++ II TDG+ + N L Sbjct: 285 GSTTGGAGIQLAYKI------AKQNFIKGGNNR---IILCTDGDFNVGVSSNDGLLALIE 335 Query: 311 EAKRRGAIVYAIGV-QAEAADQFLKNCASPDR 341 + ++ G + +G D ++ A Sbjct: 336 QERKSGVFLSILGYGMGNYKDSKMQTLAQAGN 367 >gi|329954836|ref|ZP_08295853.1| hypothetical protein HMPREF9445_00688 [Bacteroides clarus YIT 12056] gi|328526940|gb|EGF53951.1| hypothetical protein HMPREF9445_00688 [Bacteroides clarus YIT 12056] Length = 303 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 86 EEERELTVMLLVDVSGSLEF------GTVKQMKKDMVTEIAATLAFSAIQNNDKIGVIFF 139 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I L T GLEY N + Sbjct: 140 SDRIEKFIPPKKGRKHILYIIRELLDFHPESRRTNIRLGLEYLTNVMK 187 >gi|153836054|ref|ZP_01988721.1| TPR repeat containing protein [Vibrio parahaemolyticus AQ3810] gi|149750808|gb|EDM61553.1| TPR repeat containing protein [Vibrio parahaemolyticus AQ3810] Length = 514 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 50/165 (30%), Gaps = 29/165 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 M+VLD+S SM ++L ++L K +GL+ ++ Sbjct: 1 MLVLDMSRSMYAS-DIKPNRLAQTRYKALDLLPKWKEGA-------TGLIVYAGDAYSLS 52 Query: 232 PLAWGVQHIQEKIN----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + I L+ + +E + ++ A I Sbjct: 53 PLTTDASTLAGIIENLSPELMPFQGSNLPAAIELSLSQFSQAGANQGD-----------I 101 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + L D ++D+ E + K + V + + Sbjct: 102 VVLAD------DLDDSELARSLDLVKGKNIRVSVLAIGTANGAPI 140 >gi|126340243|ref|XP_001373492.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 4, [Monodelphis domestica] Length = 1125 Score = 45.2 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 48/133 (36%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ I Sbjct: 283 DIVILVDVSGSMKGL------QMAIAKHTITTILDTLGENDFVN------IIAYNDYIHY 330 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ + L+ L A+ + +E Sbjct: 331 VEPCFKGILVQADRDNREHFKQLVEELMVKGVGIVDKALSEAFKILMQFREDGRGGLCNQ 390 Query: 281 DDYKKYIIFLTDG 293 I+ +TDG Sbjct: 391 A-----IMLITDG 398 >gi|262164787|ref|ZP_06032525.1| TPR domain protein in aerotolerance operon [Vibrio mimicus VM223] gi|262027167|gb|EEY45834.1| TPR domain protein in aerotolerance operon [Vibrio mimicus VM223] Length = 620 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 46/137 (33%), Gaps = 15/137 (10%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W + S + S + ++M +D+S SM P T+ Sbjct: 56 LLALVWLFATLALAGPSWQSAERPSVQNSAARVLM-MDMSRSMYATDLAP-----NRLTQ 109 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIFGSTT 253 + + LD++K + +GLVT+++ PL + I ++ Sbjct: 110 ARYKALDLLKGWQEGT----TGLVTYAADAYVVSPLTSDTATLANLIPNLSPEIMPYQGA 165 Query: 254 KSTPGLEYAYNKIFDAK 270 + + A + + A Sbjct: 166 NAANAVSLAISMLQQAG 182 >gi|227551457|ref|ZP_03981506.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895933|ref|ZP_05675586.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293377027|ref|ZP_06623237.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227179401|gb|EEI60373.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257832498|gb|EEV58919.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292644243|gb|EFF62343.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 477 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 54/139 (38%), Gaps = 29/139 (20%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ D++G + +D + + + + G V +SS+ Sbjct: 289 TPLDLVLVVDWSGSMNDN-----DRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 342 Query: 227 IVQTFPLAWGV------QHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + G ++ ++ + T + GL A + + Sbjct: 343 GHNY---SNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNGH----- 394 Query: 278 KGHDDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 395 ------KKVIVLLTDGVPT 407 >gi|154149676|ref|YP_001403294.1| von Willebrand factor, type A [Candidatus Methanoregula boonei 6A8] gi|153998228|gb|ABS54651.1| von Willebrand factor, type A [Methanoregula boonei 6A8] Length = 1081 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 44/175 (25%), Gaps = 53/175 (30%) Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + +N T G+ A ++ +KG+ + + II + DG Sbjct: 671 TTNQNALVNTVNAYTAYGGTDYAAGINAALQEL---------QSKGNPSHNQTIIIMGDG 721 Query: 294 ENSSPNIDN-----------------------------------KESLFYCNEAKRRGAI 318 N I +L AK G Sbjct: 722 VNMMAPIAPGSFESYWPSDWNPRNGTGISEGPNLWYLDESDVGKAAALNASTTAKNLGIT 781 Query: 319 VYAIGVQAEAA--------DQFLKNCASP-DRFYSVQNSRKLHDAFLRIGKEMVK 364 +Y I F + +SP +Y + + F +I ++ Sbjct: 782 IYGIQFPTPDNYGHNINDTAFFQQMVSSPTSTWYYAPDPTTMTGIFQQIEGQIQN 836 >gi|116327069|ref|YP_796789.1| hypothetical protein LBL_0242 [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116119813|gb|ABJ77856.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis L550] Length = 379 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 78/213 (36%), Gaps = 35/213 (16%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + + + ++D S SMN++ G K+ +A + + + + + + G + Sbjct: 59 ASENHERSKLFIVDASGSMNEYLGIYQ-KIHLAKKHVSRYISTLPTETE------IGFIA 111 Query: 223 FSSKIVQTF------PLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + ++I PL + ++ L T + A N I K++ E Sbjct: 112 YGNRIPGCSSSRLYEPLQRENHGTFKNRLFSLTPSGATPLAESIRIAGNLISQRKKETE- 170 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQA-EAADQF 332 II +TDG S D K+ L K++G + +G+ ++ Sbjct: 171 -----------IILITDGVESCYG-DPKKEL---QALKQQGIYFKFHILGLGLKPDEERK 215 Query: 333 LKNCASPDR--FYSVQNSRKLHDAFLRIGKEMV 363 +K A ++ +++ + A + + V Sbjct: 216 MKILAEEGNGKYFGIEDDSSFYTALDSLKNQTV 248 >gi|113476689|ref|YP_722750.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167737|gb|ABG52277.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 420 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 64/216 (29%), Gaps = 40/216 (18%) Query: 159 VKISSKSDI-----GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 V+I K+D + M + LD S SM KL A + + + + Sbjct: 30 VRIQPKTDANLPSLPIRMAIALDTSQSMKGE------KLQRAKEAC------LAVVSHLR 77 Query: 214 NVVRSGLVTFSSKI----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + L +S+++ + I L T+ L++ + Sbjct: 78 DPDYLSLAGYSTRVTPLLESLAGGGAAAGFAEGAIADLQARGVTRIDLALDWIEESLL-- 135 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSS----PNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + +TDG ++ P D K + K G I+ A+G+ Sbjct: 136 ---------PEKSPPLVGVLITDGHATNAGGTPLDDMKPFIVKARNMKSCGIILCAVGLG 186 Query: 326 --AEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 A FL + + F KL Sbjct: 187 DAANFNTSFLTDLSDQGGGAFIYADTPDKLLSDLQN 222 >gi|158333727|ref|YP_001514899.1| magnesium protoporphyrin IX chelatase subunit D [Acaryochloris marina MBIC11017] gi|158303968|gb|ABW25585.1| magnesium protoporphyrin IX chelatase, D subunit [Acaryochloris marina MBIC11017] Length = 682 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 42/309 (13%), Positives = 93/309 (30%), Gaps = 49/309 (15%) Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 +G Q F+ + ++ R F++D + + K Sbjct: 392 DPDGVVLDPQVLYFAQNASR-----QGKSGSRSIVFSEDRGRYVKPMLPKGGV----KRI 442 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + A R P+ N K ++ L ++ V+D S SM Sbjct: 443 AVDATLRAAAPYQKARRLRHPNRRVIVEQSDVRTKRLARKAGAL-VVFVVDASGSMA--- 498 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQTFPLAWGVQHIQEKI 244 ++++ A ++ +L N + L+ F + P ++ + ++ Sbjct: 499 ---LNRMQSAKGAVMRLLTEA-----YQNRDQVSLIPFRGERADVLLPPTRSIEAARRRL 550 Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN-- 302 + L G + GL A +A D + ++ +TDG + P + Sbjct: 551 DTLPCGGGSPLAHGLTQAVRVGMNA-------LSSGDIGQVVLVAITDGRGNIPLTTSLG 603 Query: 303 ------------KESLFYCNEAKRRGAIVYAI-----GVQAEAADQFLKNCASPDRFYSV 345 E L + + I + A + +N A + ++ Sbjct: 604 EPPVDGEKPDIKAELLEIAGRIRALNLKLLVIDTENKFISTGFAKELAQN-AGGNYYHLP 662 Query: 346 QNSRKLHDA 354 + + + A Sbjct: 663 KATDQAIAA 671 >gi|18858247|ref|NP_571413.1| complement factor B [Danio rerio] gi|1015970|gb|AAB19093.1| complement factor B [Danio rerio] Length = 737 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 78/216 (36%), Gaps = 29/216 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KIS LD+ + +D S S++ A + I+ +++ I N Sbjct: 248 KISLDRGGKLDIYIAVDASDSIDPK------DFDKAKKIIKTLIEKISYYEVSPNYE--- 298 Query: 220 LVTFSSKIVQTFPL-AWGVQHIQEKINRLIFG--------STTKSTPGLEYAYNKIFDAK 270 ++ F++ + Q + + KI ++ ++ + Y KI D+ Sbjct: 299 ILMFATDVDQIVKMRDFKTNEKARKILKIFEDLDNFNYDKKGDRTGTNIAKLYLKILDSM 358 Query: 271 EKLEHIAKGH-DDYKKYIIFLTDGE-----NSSPNIDNKESLFYCNEA-KRRGAIVYAIG 323 + K + II TDG+ N P +D ++L N A + +Y G Sbjct: 359 SLEQVQNKEDFLQTQHVIIVFTDGQANMGGNPKPKVDLIKNLVIKNNASRENKLDLYVFG 418 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAF 355 V + + + S F+ + + ++ + F Sbjct: 419 VGKDVKKEDMNGLVSEKKDERHFFKLPDLDEVQNTF 454 >gi|257127006|ref|YP_003165120.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] gi|257050945|gb|ACV40129.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] Length = 509 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 70/211 (33%), Gaps = 22/211 (10%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 K + ++ +++ ++LD S SM + +A SI K + ++ + Sbjct: 149 TKQKIEENMNVNLEIILDASGSMKQKISDK-TMMEIAKESIE------KVVSEMPANTKV 201 Query: 219 GLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE-YAYNKIFDAKEKLE 274 GL F + + I L S ++ + I + E Sbjct: 202 GLRVFGHKGDNTASKKQESCSANELISPIETLDKDKLKSSLAPIQPTGWTSIAKSIENGT 261 Query: 275 HIAK--GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY--AIGVQAEAAD 330 + K + + +TDG + + + + K + IG +A Sbjct: 262 NDLKALKGEKTLNILYIITDGIETC----DGNPVETAKKFKNENTDIVLGIIGFNVDAHQ 317 Query: 331 -QFLKNC--ASPDRFYSVQNSRKLHDAFLRI 358 + LK A+ + S ++ KL + RI Sbjct: 318 NKVLKEIANAANGYYSSANDAAKLTEELQRI 348 >gi|167763118|ref|ZP_02435245.1| hypothetical protein BACSTE_01487 [Bacteroides stercoris ATCC 43183] gi|167699458|gb|EDS16037.1| hypothetical protein BACSTE_01487 [Bacteroides stercoris ATCC 43183] Length = 289 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEF------GTVKQMKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I L T GLEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFRPESRRTNIRLGLEYLTNVMK 173 >gi|319902107|ref|YP_004161835.1| hypothetical protein Bache_2281 [Bacteroides helcogenes P 36-108] gi|319417138|gb|ADV44249.1| protein of unknown function DUF58 [Bacteroides helcogenes P 36-108] Length = 289 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 45/108 (41%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEF------GTIKQMKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E I+ T GLEY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFHAESRRTNICLGLEYLTNVMK 173 >gi|182439436|ref|YP_001827155.1| putative magnesium-chelatase subunit [Streptomyces griseus subsp. griseus NBRC 13350] gi|178467952|dbj|BAG22472.1| putative magnesium-chelatase subunit [Streptomyces griseus subsp. griseus NBRC 13350] Length = 679 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 50/140 (35%), Gaps = 18/140 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++ ++ +L + + G Sbjct: 485 QATREGREGNLVLFVVDASGSMA-----ARQRMSAVKGAVLSLL-----LDAYQRRDKVG 534 Query: 220 LVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LVTF + P V ++ L G T GL A++ + ++E + Sbjct: 535 LVTFRGREAEVALPPTSSVDAAASRLESLPTGGRTPLAAGLLKAHDVL-----RVERLRD 589 Query: 279 GHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 590 PSRRP--LLVVVTDGRATGG 607 >gi|145487105|ref|XP_001429558.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124396651|emb|CAK62160.1| unnamed protein product [Paramecium tetraurelia] Length = 891 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 60/150 (40%), Gaps = 13/150 (8%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL 169 +++S + I D + K + + T AN+ + + T+ +K + + Sbjct: 179 KNSSYTDIRDTRRKSCLFEQSTSIMVQSEKFTRSDVANA-YFDKIYTNQIKKMVQKAQVI 237 Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS-IPDVNNVVRSGLVTFSS--- 225 D++ ++D++ SM L A +I ++++ ++ I N+VR+ +V + Sbjct: 238 DVLFIIDITGSMQRW-------LTSAKSNIHKIIEEFQTQIDKKKNIVRTAIVAYRDFGD 290 Query: 226 -KIVQTFPLAWGVQHIQEKINRLIFGSTTK 254 + + I E +N+L Sbjct: 291 EDNMLYREFTSDTKIIFEFLNQLQAKGGGD 320 >gi|153872431|ref|ZP_02001326.1| PpkA [Beggiatoa sp. PS] gi|152071102|gb|EDN68673.1| PpkA [Beggiatoa sp. PS] Length = 523 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 75/191 (39%), Gaps = 19/191 (9%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T +V + +GLD+ V+D++ SM GP + + A + M+ + +++ Sbjct: 83 TETVTDTQAKTLGLDIKFVMDMTGSM----GPFIKRTKEAIAKVATMI----AKENLDIQ 134 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTPGLEYA--YNKIFDAKE 271 VR GLV + +V L + + + + + K++P + + Y + A Sbjct: 135 VRFGLVGYRDDLVTVPGLKFVINNFTPTLVTVDEFDKVIAKASPATDKSGDYQEEAFAGI 194 Query: 272 KLEHIAKGHDDYKKYIIFLTDG-------ENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + +D+ K+II + D + ++ +D + N K ++ Sbjct: 195 NEALNSDWNDNTLKFIILVGDASSHPVSHQQNTTGLDAVQLRELANSQKINIISIHLKAS 254 Query: 325 QAEAADQFLKN 335 + +A + Sbjct: 255 RFKADHALAER 265 >gi|149911740|ref|ZP_01900347.1| TPR domain protein [Moritella sp. PE36] gi|149805213|gb|EDM65231.1| TPR domain protein [Moritella sp. PE36] Length = 263 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 62/205 (30%), Gaps = 29/205 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 FI W + S + S I +MVLD+SLSM ++L Sbjct: 60 FIPLAGIWVLTTMALSGPSFSYTERPVAS-ISQAKVMVLDMSLSMRAT-DLKPNRLAQLR 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV----QHIQEKINRLIFGST 252 ++L IK GLV ++ PL + ++ Sbjct: 118 FKSTDILSAIKEGE-------IGLVAYAGDAFVISPLTTDTSTLLNLLPNLSPEIMPVKG 170 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + T G+ A + +A + I+ +TDG N DN +L Sbjct: 171 SNPTAGIRQAIALLTNAGYQQGQ-----------ILLVTDGIN-PTQADNINTLL-AGTD 217 Query: 313 KRRGAIVYAIGVQAE---AADQFLK 334 + + A Q LK Sbjct: 218 YSLSILGFGTAQGAPIKMPNGQLLK 242 >gi|316932039|ref|YP_004107021.1| von Willebrand factor type A [Rhodopseudomonas palustris DX-1] gi|315599753|gb|ADU42288.1| von Willebrand factor type A [Rhodopseudomonas palustris DX-1] Length = 372 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 41/106 (38%), Gaps = 16/106 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----S 224 +++ VLD + SM+ K+ +I + + + +R GLV + Sbjct: 31 VEVAFVLDTTGSMSGLIEGAKRKIWSIATAILD--------DNPDAEIRMGLVAYRDIGD 82 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTK----STPGLEYAYNKI 266 +V++ L +Q + ++ +L L+ A NK+ Sbjct: 83 DYVVRSVDLTTDIQDLYGQLLQLQARGGGDWPESVNEALDTAINKL 128 >gi|283769330|ref|ZP_06342229.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] gi|283103987|gb|EFC05371.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] Length = 209 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 12/49 (24%), Positives = 27/49 (55%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYIL 52 +NIR + KGS ++ A+ L V+ ++ L ++ + K +++ I+ Sbjct: 3 MNIRKWIKEEKGSYIVIFALFLTVLLGMISLAVDVGMMYLKKNRMYEIV 51 >gi|116694147|ref|YP_728358.1| hypothetical protein H16_B0192 [Ralstonia eutropha H16] gi|113528646|emb|CAJ94993.1| Hypothetical protein H16_B0192 [Ralstonia eutropha H16] Length = 562 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 13/60 (21%), Positives = 31/60 (51%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 G+IS++ A+L+ + I + I+ H F + +L ++D + + A ++ ++ N Sbjct: 17 GAISVMAALLIATVAIAALVSIDVGHVFMRQRQLQNVVDLAAMSAAQQLKRADSAANLNA 76 >gi|297539403|ref|YP_003675172.1| von Willebrand factor type A [Methylotenera sp. 301] gi|297258750|gb|ADI30595.1| von Willebrand factor type A [Methylotenera sp. 301] Length = 323 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 69/227 (30%), Gaps = 35/227 (15%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + + K G ++V+D S+SM+ F A +I D Sbjct: 68 GASRKEQKVGKGAQTVLVIDRSVSMDHPFAGQTTSGRAAEIKSAAARRLITDFIDSRPDD 127 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLE 274 G+V F++ + + I IN + T G+ A N +FD + Sbjct: 128 MMGVVGFTNSALYGMKITTNRDAIHAAINAATGSALNQTNIGAGITEAVN-LFDNIQSSG 186 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----- 329 A +I L+DG + + + L + +Y I ++ Sbjct: 187 SRA---------VILLSDG---AGKLSPRVKLKISQYFIDKKLNLYWIVLREPDDISIFT 234 Query: 330 ---------------DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 DQ+ K+ + + N L A I + Sbjct: 235 KETYSEDKVPDSIVLDQYFKSLKIKYKAFEADNPTALQSALQYIDSK 281 >gi|228473096|ref|ZP_04057853.1| conserved hypothetical protein [Capnocytophaga gingivalis ATCC 33624] gi|228275678|gb|EEK14455.1| conserved hypothetical protein [Capnocytophaga gingivalis ATCC 33624] Length = 288 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 23/110 (20%), Positives = 40/110 (36%), Gaps = 6/110 (5%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S L I + + + N ++GL+ F Sbjct: 72 EEERELTLMLLVDVSGS------ELFGSLQQFKNEILTEIAATLAFAALQNNDKTGLILF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 S +I P G HI I LI + A+ + +K Sbjct: 126 SDQIELYIPPKKGKSHILRIIRELIEFQPKNKQTDIAKAFEFLNKITKKK 175 >gi|298372682|ref|ZP_06982672.1| von Willebrand factor, type A [Bacteroidetes oral taxon 274 str. F0058] gi|298275586|gb|EFI17137.1| von Willebrand factor, type A [Bacteroidetes oral taxon 274 str. F0058] Length = 293 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 69/188 (36%), Gaps = 21/188 (11%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 + + R E++ + +I E T+ + + S V Y+ W Sbjct: 4 NELIKKVRRIEIKTRRISNNIFAGEYHTAFK------GRGMSFSEVREYQYGDEVRFIDW 57 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + + L +M+++DVS S + FG G R I + Sbjct: 58 NVTARYRR-----PYIKVYEEERELTVMLLVDVSKS--NLFGSG----EQLKRDIINEIA 106 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI----QEKINRLIFGSTTKSTPGLE 260 + + N + G++ FS KI + P G +H+ +E I+ ++T L+ Sbjct: 107 ATLAFSTIQNNDKVGVIFFSDKIEKFIPPQKGRKHVLYIIRELIDFKPESNSTDIEVVLK 166 Query: 261 YAYNKIFD 268 Y N I Sbjct: 167 YLTNVIKK 174 >gi|296220039|ref|XP_002756140.1| PREDICTED: integrin alpha-X [Callithrix jacchus] Length = 1078 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 80/204 (39%), Gaps = 28/204 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + M A S + S+ ++ ++ TF Sbjct: 151 DIVFLIDGSGSILYNNFAMMKSFVRAVMSHFQRPSTQFSLMQFSSKFKT-HFTFEEFRES 209 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + PL+ + +++L G T + ++ ++F A + K +I Sbjct: 210 SNPLS-----LLASVDQL--GGYTHTATAIQKVVKELFLASNGARR------EATKILIV 256 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-----AADQFLKNCASP--DRF 342 +TDG+ +D + + ++A G I YAIGV + + + + P + Sbjct: 257 ITDGKKEGDWLDYDDVIPMADKA---GIIRYAIGVGSAFQNRNSWKELNDIASKPSQEHI 313 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V++ DA I ++ ++ Sbjct: 314 FKVED----FDALKDIQNQLKEKI 333 >gi|194385074|dbj|BAG60943.1| unnamed protein product [Homo sapiens] Length = 506 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 27/124 (21%), Positives = 48/124 (38%), Gaps = 14/124 (11%) Query: 216 VRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKI 266 V ++TF+S+ V L + + E I+ L G+ T + L Y + Sbjct: 45 VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMM 104 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYA 321 + L + + II LTDG+++ D+ + N+ + +YA Sbjct: 105 NNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYA 164 Query: 322 IGVQ 325 IGV Sbjct: 165 IGVG 168 >gi|148749338|gb|ABR09545.1| CD11c protein [Callithrix jacchus] Length = 1161 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 80/204 (39%), Gaps = 28/204 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S+ + M A S + S+ ++ ++ TF Sbjct: 151 DIVFLIDGSGSILYNNFAMMKSFVRAVMSHFQRPSTQFSLMQFSSKFKT-HFTFEEFRES 209 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + PL+ + +++L G T + ++ ++F A + K +I Sbjct: 210 SNPLS-----LLASVDQL--GGYTHTATAIQKVVKELFLASNGARR------EATKILIV 256 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-----AADQFLKNCASP--DRF 342 +TDG+ +D + + ++A G I YAIGV + + + + P + Sbjct: 257 ITDGKKEGDWLDYDDVIPMADKA---GIIRYAIGVGSAFQNRNSWKELNDIASKPSQEHI 313 Query: 343 YSVQNSRKLHDAFLRIGKEMVKQR 366 + V++ DA I ++ ++ Sbjct: 314 FKVED----FDALKDIQNQLKEKI 333 >gi|116332274|ref|YP_801992.1| hypothetical protein LBJ_2829 [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116125963|gb|ABJ77234.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 379 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 78/213 (36%), Gaps = 35/213 (16%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + + + ++D S SMN++ G K+ +A + + + + + + G + Sbjct: 59 ASENHERSKLFIVDASGSMNEYLGIYQ-KIHLAKKHVSRYISTLPTETE------IGFIA 111 Query: 223 FSSKIVQTF------PLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + ++I PL + ++ L T + A N I K++ E Sbjct: 112 YGNRIPGCSSSRLYEPLQRENHGTFKNRLFSLTPSGATPLAESIRIAGNLISQRKKETE- 170 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQA-EAADQF 332 II +TDG S D K+ L K++G + +G+ ++ Sbjct: 171 -----------IILITDGVESCYG-DPKKEL---QALKQQGIYFKFHILGLGLKPDEERK 215 Query: 333 LKNCASPDR--FYSVQNSRKLHDAFLRIGKEMV 363 +K A ++ +++ + A + + V Sbjct: 216 MKILAEEGNGKYFGIEDDSSFYTALDSLKNQTV 248 >gi|262167806|ref|ZP_06035507.1| TPR domain protein in aerotolerance operon [Vibrio cholerae RC27] gi|262023714|gb|EEY42414.1| TPR domain protein in aerotolerance operon [Vibrio cholerae RC27] Length = 622 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +S+ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYSADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|229506664|ref|ZP_04396173.1| TPR domain protein in aerotolerance operon [Vibrio cholerae BX 330286] gi|229357015|gb|EEO21933.1| TPR domain protein in aerotolerance operon [Vibrio cholerae BX 330286] Length = 636 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +S+ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYSADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|227811795|ref|YP_002811805.1| hypothetical protein VCM66_A0167 [Vibrio cholerae M66-2] gi|227010937|gb|ACP07148.1| conserved hypothetical protein [Vibrio cholerae M66-2] Length = 628 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +S+ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYSADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|163842175|ref|YP_001626580.1| von Willebrand factor type A domain-containing protein [Renibacterium salmoninarum ATCC 33209] gi|162955651|gb|ABY25166.1| von Willebrand factor type A domain, putative [Renibacterium salmoninarum ATCC 33209] Length = 267 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 38/169 (22%), Positives = 54/169 (31%), Gaps = 29/169 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+M+ LDVS SM D +D + R GLV F Sbjct: 29 EQNNRDIMLCLDVSGSMVDTDAKIVDVFANLAQEFHGE--------------RLGLVIFD 74 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI-----------FGSTTKSTPGLEYAYNKIFDAKEKL 273 S VQ FPL +I++++N + F T S G + + A Sbjct: 75 STAVQVFPLTEDYGYIKDELNVALKAMTDQSDDTGFFGGTYSGRGSSLIGDGL--ATCVN 132 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 G + + I+F TD N A + VY I Sbjct: 133 SFPKLGAEQRSRSIVFATD--NVLLGKPLFSLTDAAGLATKNSIRVYGI 179 >gi|298500026|ref|ZP_07009832.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|297542007|gb|EFH78058.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 626 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +S+ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYSADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|153816861|ref|ZP_01969528.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|126512664|gb|EAZ75258.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] Length = 630 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +S+ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYSADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|121586717|ref|ZP_01676500.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121549014|gb|EAX59051.1| conserved hypothetical protein [Vibrio cholerae 2740-80] Length = 642 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +S+ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYSADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|147671681|ref|YP_001215943.1| hypothetical protein VC0395_1107 [Vibrio cholerae O395] gi|146314064|gb|ABQ18604.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227014796|gb|ACP11005.1| conserved hypothetical protein [Vibrio cholerae O395] Length = 646 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +S+ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYSADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|121728215|ref|ZP_01681249.1| conserved hypothetical protein [Vibrio cholerae V52] gi|121629538|gb|EAX61962.1| conserved hypothetical protein [Vibrio cholerae V52] Length = 652 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +S+ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYSADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|302754792|ref|XP_002960820.1| hypothetical protein SELMODRAFT_402207 [Selaginella moellendorffii] gi|300171759|gb|EFJ38359.1| hypothetical protein SELMODRAFT_402207 [Selaginella moellendorffii] Length = 563 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 62/188 (32%), Gaps = 15/188 (7%) Query: 170 DMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVTFSS 225 + ++LD S SM++ G + VA I+ +L+ + V G + S Sbjct: 201 SLYILLDTSASMSNPTGVLSSQTRFNVANNIIKRLLNTFTNGDQVAVSTIGGEKIGAPVS 260 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ + + I + + S T S ++ D ++ Sbjct: 261 VVLGVQETSLDLAGISSLKDSISNTSVTNSASNIKNGLQAALDFFNTSSNLN-------- 312 Query: 286 YIIFLTDGE---NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II TDG+ + N + A+ VY IG F +S + Sbjct: 313 VIILFTDGQFVTPGNFNFTQLSPVLAQLNARGVVVFVYRIGSFTTNDATFQHMQSSLNMS 372 Query: 343 YSVQNSRK 350 Y V N K Sbjct: 373 YEVINDDK 380 >gi|326674126|ref|XP_003200076.1| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 940 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 66/202 (32%), Gaps = 27/202 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D G ++ VLD S S+ D A I + +V + S Sbjct: 179 DPGTEIAFVLDGSGSIQY------DDFEKAKDFIHTTMSNCNFA----------IVQYGS 222 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 I L + T A N + E+ +K + + K Sbjct: 223 SIRTELSLLDNEDGARSLQKVKQIKQIYNLTKT-ASAINHVLTDIFIPENGSKNNSE--K 279 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAEAADQFLK---NCASPD 340 II L+DG+ D N+ + +G Y+IGV +K A P Sbjct: 280 IIIVLSDGKILG---DPMTLDEVLNKTQMKGVTRYSIGVGDGILKNKDAIKEMTQIADPG 336 Query: 341 RFYSVQNSRKLHDAFLRIGKEM 362 ++YSV + L+D + + + Sbjct: 337 KYYSVSSYGALNDILSSLERGI 358 >gi|87308731|ref|ZP_01090870.1| hypothetical protein DSM3645_10847 [Blastopirellula marina DSM 3645] gi|87288442|gb|EAQ80337.1| hypothetical protein DSM3645_10847 [Blastopirellula marina DSM 3645] Length = 625 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 67/194 (34%), Gaps = 17/194 (8%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLY-----T 59 N+ N KG +L A+++ V+F + I+T + ++ +D + L T Sbjct: 7 NLTNRRSARKGIFVVLAAVVMVVLFGFISFGIDTGLISLEQTRMQNAVDAAALAASQEIT 66 Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 + ++G + F+ ++ ++ + + + + Sbjct: 67 SAVAQAGDSGGDPNSISISFAKQMAVDVAAANGVYLNADRDIVFGKRTYDPGSGEWAYDW 126 Query: 120 DQHKDYNLSAVSRYEMPFI-------FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 + + + P + F W PL+ +++ + DM+ Sbjct: 127 TTGPYNVVKVEAHRDQPNLEAPDGRVPLAFGWAVGVPSIPLVTSATSFVE-----ARDMV 181 Query: 173 MVLDVSLSMNDHFG 186 +VLD S SMND Sbjct: 182 VVLDFSGSMNDDSQ 195 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 67/212 (31%), Gaps = 56/212 (26%) Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPL-------AWGVQHIQEKINRLIF------GSTTKS 255 + D++ G+V++ L G +I E + L S Sbjct: 414 LDDLDFGDEVGIVSYDESARVEHTLDEHGEYATLGGNNISEDYDTLDLIQRHKQSGHYGS 473 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP--------NIDNKE--- 304 G+ Y + +A E L ++ ++ +TDG+ + N + + Sbjct: 474 YTGMGYG---VKEASELLTEHSRHGARPT--MVVMTDGQANRYPNRWSLPYNWNWADYTD 528 Query: 305 ------------------SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS-------- 338 + + +A + ++ + V +A + A Sbjct: 529 FDGDGNADYSTSDRSKQYAFYEAVQAHKLDVTIHTMSVGLDADRSLMTAIAFACGGVHIA 588 Query: 339 -PDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 P + ++L DAF +I ++ +++Y Sbjct: 589 VPGGATVAEMEQQLLDAFGQIAAKVPPPQLVY 620 >gi|294673289|ref|YP_003573905.1| hypothetical protein PRU_0533 [Prevotella ruminicola 23] gi|294472665|gb|ADE82054.1| conserved domain protein [Prevotella ruminicola 23] Length = 611 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/244 (14%), Positives = 83/244 (34%), Gaps = 31/244 (12%) Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK-- 123 ++ + ++ ++ + + I N+L E + +++ + + I Sbjct: 331 EKVKDALEQLASETTISEFERILSAAGLNQLSEKDKEKLWFSVQSAGMIPIEEVGNEGAK 390 Query: 124 ---DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL-----DMMMVL 175 Y + + + + P + T + S G+ D ++V+ Sbjct: 391 DNYTYPTNWRIGDSIADLDMMLTYSTAPKLIPGITTKKWEQSVNEYFGVEQRQKDALLVV 450 Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D S SM D + A + +L+ +S LV FS + + Sbjct: 451 DTSGSMG-AVSRETDNMHQAVLAAFGILNYFESRKGK-----IALVEFSDNVKEYISWTN 504 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + I++K+ T + G ++ + I + E+ ++ + +TDGE Sbjct: 505 EYERIRDKL-------LTNGSGGTQFPIHSIREILEQSKNELIT--------VIITDGEL 549 Query: 296 SSPN 299 + N Sbjct: 550 GNIN 553 >gi|237689767|gb|ACQ59162.1| factor B/C2 [Larimichthys crocea] Length = 755 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 47/224 (20%), Positives = 83/224 (37%), Gaps = 39/224 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + L++ + +DVS S+ D + + T I++++ I S N +V FS Sbjct: 238 KEGTLNIYIAVDVSASIEDKY------ISNTTSVIKKLIQKISSFTVSPNYE---IVFFS 288 Query: 225 SKIVQTFPLA--WGVQHIQEKI-NRLIFGS-------TTKSTPGLEYAYNKI-FDAKEKL 273 S+I + + W + +++ I +L T + ++ F Sbjct: 289 SEIEEVVNILDFWEKKDVRKDILEKLDTFGINDRSSTGTDLNLVFFTFWERMSFIKTRVG 348 Query: 274 EHIAKGHDDYKKYIIFLTDG-ENSSPNIDNK----ESLFYCNEA-------KRRGAIVYA 321 + K H II TDG N + + + Y N+ + VY Sbjct: 349 ANAFKDHRH---VIILFTDGAYNMGGSPAPTLARIKDMVYMNQTGEKEVNSREEYLDVYV 405 Query: 322 IGVQAEAADQFLKNC----ASPDRFYSVQNSRKLHDAFLRIGKE 361 + AE D LK+ A F+ V+ K++DAF I E Sbjct: 406 FAIGAETFDDDLKSFTVGKAEEKHFFKVKGIDKMYDAFDDIIDE 449 >gi|254460998|ref|ZP_05074414.1| von Willebrand factor, type A [Rhodobacterales bacterium HTCC2083] gi|206677587|gb|EDZ42074.1| von Willebrand factor, type A [Rhodobacteraceae bacterium HTCC2083] Length = 627 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 80/219 (36%), Gaps = 20/219 (9%) Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 DY + PW N+ + + + ++ L+++ ++D S SMN Sbjct: 231 DYAAPTQGVFATSLAMMATPWNENTQLLRIALQGQM-PEVEARPPLNLVFLIDTSGSMNQ 289 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 +KL + +S+R +L +++ V V +G S+ V A I Sbjct: 290 P-----NKLPLLKQSLRLLLGQLRAEDQVAIVAYAG----SAGQVLEPTAAHERDTILGA 340 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 ++ L G T GL+ AY +E+ E ++ TDG+ + + + Sbjct: 341 LDNLAAGGETNGQAGLQQAYALAKQMQEEGEVSR---------VLLATDGDFNVGLSNAE 391 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 Y + + G + +G D+ ++ A Sbjct: 392 ALKGYIADKRDSGTFLSVLGFGRGNLDDETMQALAQNGN 430 >gi|194476965|ref|YP_002049144.1| Magnesium chelatase ATPase subunit D [Paulinella chromatophora] gi|171191972|gb|ACB42934.1| Magnesium chelatase ATPase subunit D [Paulinella chromatophora] Length = 702 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 26/208 (12%), Positives = 66/208 (31%), Gaps = 41/208 (19%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G ++ ++D S SM + ++ A ++ ++L N L+ F Sbjct: 501 QRKAGALVIFLVDASGSMAFN------RMESAKGAVIKLLTEA-----YKNRDEVALIPF 549 Query: 224 SS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ ++ G L + + A + + Sbjct: 550 RGERAEVLLPPTRSITAAKRRLEQMPCGGG----SPLAHG---LTQAARVGVNSLATGNL 602 Query: 283 YKKYIIFLTDGENS--------------SPNIDNKESLFY-CNEAKRRGAIVYAI----- 322 + ++ +TDG + +ID KE + + + G + I Sbjct: 603 AQIVVVAITDGRGNVSLSRSLGQLPLEGEESIDLKEEIKIVASRYRSLGIKLLVIDTERN 662 Query: 323 GVQAEAADQFLKNCASPDRFYSVQNSRK 350 + A + A+ R+ + + Sbjct: 663 FIGAGMGKDLAE--AAGGRYIQLPKASD 688 >gi|194334089|ref|YP_002015949.1| Magnesium chelatase [Prosthecochloris aestuarii DSM 271] gi|194311907|gb|ACF46302.1| Magnesium chelatase [Prosthecochloris aestuarii DSM 271] Length = 650 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 52/135 (38%), Gaps = 18/135 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + +G ++ V+D S SM ++ + ++ +L I + LV+ Sbjct: 459 REKRLGNLLIFVVDASGSMG-----ARGRMAASKGAVMSLL-----IDAYQKRDKLALVS 508 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F ++ V P+ V+ + + + G T GL Y H+ K D Sbjct: 509 FRKNEAVVNLPVTSSVELAGKLLREMPVGGRTPFAAGLVKGYEI------ARSHLMKEPD 562 Query: 282 DYKKYIIFLTDGENS 296 +IF+TDG+ + Sbjct: 563 ARP-MVIFVTDGKAN 576 >gi|228954949|ref|ZP_04116966.1| Gram positive anchor protein [Bacillus thuringiensis serovar kurstaki str. T03a001] gi|228804676|gb|EEM51278.1| Gram positive anchor protein [Bacillus thuringiensis serovar kurstaki str. T03a001] Length = 997 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 49/146 (33%), Gaps = 17/146 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD--IIKSIPDVNNV-V 216 K + +D ++V D S S L + D ++ S D Sbjct: 57 KWPAPKQKKVDFVIVQDASGSFKGTMPNVKKALSNIVDELNPATDRIMVTSYQDYKGYKA 116 Query: 217 RSGLVTFS----SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 G V S K L + IN++ S T + GL++A + Sbjct: 117 SDGRVLESRGNGIKTTLQAGLTNNFTSAKNGINKITPDSGTPTASGLQFALAE------- 169 Query: 273 LEHIAKGHDDYKKYIIFL--TDGENS 296 AKG +D + +FL TDG + Sbjct: 170 -YEKAKGQNDPDRETVFLLVTDGVAN 194 >gi|160875971|ref|YP_001555287.1| TPR repeat-containing protein [Shewanella baltica OS195] gi|160861493|gb|ABX50027.1| Tetratricopeptide TPR_2 repeat protein [Shewanella baltica OS195] gi|315268166|gb|ADT95019.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 692 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 57/178 (32%), Gaps = 28/178 (15%) Query: 136 PFIFCTFPW--CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P + S+ + + + +V+D+S+SM ++L Sbjct: 57 PLHLLAFTWLMATFALAGPAVNKQSLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 A ++L +K +GLV F+ PL + + ++ Sbjct: 113 QAKFRATDLLRNLKEGE-------TGLVAFAGDAFTISPLTRDTGTLLNLLPTLSPEIMP 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + GL A + II +TDG ++ D +L Sbjct: 166 VRGSNLAAGLTQAKTLLAQGGHIRGD-----------IIVMTDGITAAQFDDANSALS 212 >gi|332827674|gb|EGK00413.1| hypothetical protein HMPREF9455_03261 [Dysgonomonas gadei ATCC BAA-286] Length = 402 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 72/179 (40%), Gaps = 23/179 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + + ++ +LD + SM+ G +K+ T S L + +PD+ G Sbjct: 45 KADRTTRPKMQIVFLLDATGSMSGLIGAAKEKIWSITSS----LSQSEPVPDIE----VG 96 Query: 220 LVTFSSK----IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++ + + I + PL + ++ E++ + + A +++ K++ Sbjct: 97 MLFYRDRGDDFITRIIPLGTDMDNLYEQLMAMDASGGGDGPESVNQA---LYEGVNKMQW 153 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY---CNEAKRRGAIVYAIGVQAEAADQ 331 + +++ G+ P++D + + Y C+E ++G ++ I + E Sbjct: 154 DNLPNTYRAIFLV----GD-YPPHMDYRNDVHYPETCSEGIKKGIVINTILMGNEPTAA 207 >gi|15965603|ref|NP_385956.1| hypothetical protein SMc00158 [Sinorhizobium meliloti 1021] gi|15074784|emb|CAC46429.1| Hypothetical/unknown protein [Sinorhizobium meliloti 1021] Length = 577 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 15/75 (20%), Positives = 33/75 (44%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 R + G+I + A+++P++ MGL I+ + + +L + D + + A + + Sbjct: 8 FRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSVADLASIAAAADVSS 67 Query: 66 QENGNNGKKQKNDFS 80 E + ND Sbjct: 68 AEEAVLAHFRSNDLG 82 >gi|312136388|ref|YP_004003725.1| magnesium chelatase chli subunit [Methanothermus fervidus DSM 2088] gi|311224107|gb|ADP76963.1| magnesium chelatase ChlI subunit [Methanothermus fervidus DSM 2088] Length = 636 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 56/152 (36%), Gaps = 20/152 (13%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +++LD S SM ++ +A ++++ + + F S+IV Sbjct: 472 VIILDSSGSMA-----VKKRIRIAKGIAEKLIE--DGYTKRSKMAFILAKGFKSEIV--V 522 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL- 290 P I ++I R G T + L+ L K ++ K F+ Sbjct: 523 PPTKNYWQIVDEIERAPTGGRTPLSSALQQLI--------SLSKREKMKEESLKVKAFMI 574 Query: 291 TDGENSSPNIDN--KESLFYCNEAKRRGAIVY 320 TDG + P D+ KE + K+ G +Y Sbjct: 575 TDGRANVPLTDDIEKEMVNLAKNIKKEGIELY 606 >gi|156382210|ref|XP_001632447.1| predicted protein [Nematostella vectensis] gi|156219503|gb|EDO40384.1| predicted protein [Nematostella vectensis] Length = 8745 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 53/283 (18%), Positives = 98/283 (34%), Gaps = 32/283 (11%) Query: 90 TDFRNELRENGFAQDINNIERSTS-LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS 148 +D L E +++ + + ++I D Y ++++ + P A + Sbjct: 1510 SDLGKALAEATKVFRNDDVRPNAKKVLVVITDSKSSYGVASLVKAAEPLEHAGIRVLAVA 1569 Query: 149 ----------SHAPLLITSSVKISSKSD-IGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 +K SS D GL +V S + + LG A Sbjct: 1570 IGADADASELRSIVQHQGDVIKASSSEDPSGLADRLV---SAMLKETRKIPKVDLGFAVS 1626 Query: 198 S--------IREMLDIIKSIPDV--NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 + +R M DIIKSI +R GL+++ + T G+ + + + Sbjct: 1627 AGSSNSDATLRHMKDIIKSIVTRYGAERMRYGLMSYGDDVTTTIFFTDGITNPNSLLPFI 1686 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T S LE A + K+ EH K+++ +TD +++ + + Sbjct: 1687 DILQKTPSGSSLEKA---LVGGKKLFEHGGTRPG-VDKFLVVITDRKSTGDAAQTENAK- 1741 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + AK G V I V +E + L A+ V + Sbjct: 1742 -ADLAK-AGVNVLVIAVGSEVDHKELNETATTPSHKLVTAPGE 1782 Score = 40.2 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 58/178 (32%), Gaps = 21/178 (11%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 D+ +++ G + L +I+ ++D + +R G++ F V Sbjct: 5728 DMLFALSAGSGDPVSNLARMKNTIKSIIDRY-----GSTSIRYGVIVFGGVPVIKMGFEK 5782 Query: 235 --WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 I+ ++ + + LE A + ++ +T Sbjct: 5783 EYRSDSDIKRYLDSINREKQGSSLHRSLEEARKVFVQQGRPDA---------QHVLVIMT 5833 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 DG++ S + + + ++ + + E+ L+ S DR+ V + Sbjct: 5834 DGQSESLPDEIETARNPLDQM---NVTIIPVAFGDESDPAQLRVATSDDRYLIVSKTT 5888 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 67/213 (31%), Gaps = 12/213 (5%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 I +VK S+ LD+ + S + +D M + A I + +I Sbjct: 4794 IGEAVKEESQPKPSLDIGFAISASSADSDANFAKMKNIIKAVLDIYGTEGVHYTIVTFGQ 4853 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKL 273 ++ +V S L ++ I+R+ T L +A Sbjct: 4854 TAKT-VVQNSKGAELRAGLPTLATELKRLIDRITKNPEGTSIVSALNRIKQSFKEATSSR 4912 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK +I +TD + D + + G V +GV + L Sbjct: 4913 PNT-------KKTLIIMTDSNHEDGVDDV---IIAMEGVEDDGIQVLYVGVDDSVDLEEL 4962 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 K A D + R D +I + + R Sbjct: 4963 KIVAPDDNHVISADGRTPGDVAEQIARRINATR 4995 >gi|91216723|ref|ZP_01253688.1| hypothetical protein P700755_08674 [Psychroflexus torquis ATCC 700755] gi|91185192|gb|EAS71570.1| hypothetical protein P700755_08674 [Psychroflexus torquis ATCC 700755] Length = 287 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 58/173 (33%), Gaps = 24/173 (13%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L MM+++D+S S + I + + Sbjct: 58 NVTARYNEPFIKVFEEERELTMMLLVDISES------TAFGTRETFKKDIITEMSATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 N + GL+ FS +I P G H+ I LI + T L A+ + D Sbjct: 112 SATQNNDKIGLMLFSDQIELYIPPKKGRMHVLRIIRELIEFKPSHKTTDLTKAFKYLIDM 171 Query: 270 KEKLEHIAKGHDDYKKYIIF-LTDGENSSPNIDNKESLFYCNEAKR-RGAIVY 320 + KK I+F ++D + D + +L + G ++ Sbjct: 172 MK------------KKAIVFVMSD----FLSDDYEHTLKILGKKHDVTGIRIF 208 >gi|148231161|ref|NP_001079180.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit [Xenopus laevis] gi|27370986|gb|AAH41202.1| Cacna2d3-A protein [Xenopus laevis] Length = 394 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 54/133 (40%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++++I Sbjct: 259 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNTEIQY 306 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G +H +E +++L L A++ + + E Sbjct: 307 VEPCLNGTLVQADRNNKEHFREHLDKLYAKGIGMLNIALSEAFDLLSEFNHSGE-----G 361 Query: 281 DDYKKYIIFLTDG 293 + I+ +TDG Sbjct: 362 SICSQAIMLVTDG 374 >gi|157155169|ref|YP_001463425.1| von Willebrand factor type A domain-containing protein [Escherichia coli E24377A] gi|331653501|ref|ZP_08354502.1| putative von Willebrand factor type A domain protein [Escherichia coli M718] gi|157077199|gb|ABV16907.1| von Willebrand factor type A domain protein [Escherichia coli E24377A] gi|331048350|gb|EGI20426.1| putative von Willebrand factor type A domain protein [Escherichia coli M718] Length = 219 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGTPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|321464007|gb|EFX75018.1| hypothetical protein DAPPUDRAFT_323732 [Daphnia pulex] Length = 923 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 66/191 (34%), Gaps = 33/191 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V DVS SM D+ +++G S + + D+ G+V F S Sbjct: 304 VLVTDVSGSMVDY-----NRIGRLYDSAQRWIQY-----DIPEGTSLGIVKFHSYATVLS 353 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPG---LEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + ++++ + T ST L+ A + + I+ Sbjct: 354 NMTVVNDETRQQLMDKVPNKTEDSTCIGCVLQMAIDILHPGGNGG------------VIV 401 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY--S 344 +TDG + + + V +I +A ++ L + ++ Sbjct: 402 LVTDGLENEYPFIRDVTPELID----AKIQVVSIAFGRDAENEIENLATKTNGKSYFIND 457 Query: 345 VQNSRKLHDAF 355 N+ +L+DAF Sbjct: 458 NGNNDELNDAF 468 >gi|308501381|ref|XP_003112875.1| CRE-CLEC-160 protein [Caenorhabditis remanei] gi|308265176|gb|EFP09129.1| CRE-CLEC-160 protein [Caenorhabditis remanei] Length = 632 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 66/204 (32%), Gaps = 38/204 (18%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ V+DVS M G+ L + I ++ + P++ V+ GL+ +S++ Sbjct: 282 IDIVFVIDVSEGM------GLGGLMMVKAEINTLVGQMSLDPNIQKHVQVGLIKYSNEAE 335 Query: 229 QTFPLA------------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F + W + + ++++ GL+ A I + + + Sbjct: 336 IVFKPSDYDDEDEFTEDLWSDPRL-QDVDKVDE---VNLHLGLQKAAKMIGSMRRGVRKV 391 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN- 335 + N N D ++ + G + + + + Sbjct: 392 VVVYAAS----------YNDEGNDDARQ---IAANIRESGYEIITVAFVEPESSNLVMKI 438 Query: 336 --CASPDRFYSVQNSRKLHDAFLR 357 ASP ++ L + Sbjct: 439 GEIASPRMNFTSFRDDLLVEEMED 462 >gi|194016210|ref|ZP_03054824.1| YwmD [Bacillus pumilus ATCC 7061] gi|194011683|gb|EDW21251.1| YwmD [Bacillus pumilus ATCC 7061] Length = 225 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 70/221 (31%), Gaps = 29/221 (13%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +K ++++LD+S SM G K+ +A RSI+ I+ D ++R Sbjct: 20 PAHAVTKKQEPAHVVILLDLSGSMAQSVE-GEKKIDIAKRSIQSFASILS--DDTQVLLR 76 Query: 218 S-GLVTFSSKIVQTFP-------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 G + + +G L T + + A Sbjct: 77 VFGHEGTNKNAGKAISCESSEAVFGFGSYESSTFQQALNVYKPT--------GWTPLAKA 128 Query: 270 KEKLEHIAKGHDDYKKYIIF-LTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQA 326 + + H K I++ ++DG+ + E G IV IG Sbjct: 129 LTDTKQDFEDHQAEGKNIVYVVSDGQETCGG----SPSQAAKELHEGGIDTIVNIIGFDV 184 Query: 327 -EAADQFLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 E + LK+ A ++ N+ +L+ + Sbjct: 185 NEKEARSLKSVAKAGGGQYQPAANAEELNYILQNAASTFSQ 225 >gi|4454062|emb|CAA10812.1| NorD protein [Bradyrhizobium japonicum] Length = 642 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 84/211 (39%), Gaps = 30/211 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 L + ++ DVSLS D + G L V ++ + + + D +++ +TF+S Sbjct: 450 GHDLAVTLLADVSLS-TDAWVDGYRVLDVEKEALLVLAHGLSACGDHHSI-----LTFTS 503 Query: 226 KIVQTFPL----AWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + L A+G ++ +I L G T+ + +A ++ ++ Sbjct: 504 RRRSWVRLETIKAFGEPMSGLVERRIGALKPGYYTRIGAAVRHASAELARQPQR------ 557 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 KK ++ LTDG+ + +++ EA+R G V+ + V A A + Sbjct: 558 -----KKLLLVLTDGKPNDVDHYEGRFAVEDTRKSVQEARRLGIAVFGVTVDATA-QSYF 611 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + V N ++L A I +++ Sbjct: 612 PTLFGRGGYAIVGNIKRLPAALPAIYRQVAH 642 >gi|42521785|ref|NP_967165.1| hypothetical protein Bd0148 [Bdellovibrio bacteriovorus HD100] gi|39574315|emb|CAE77819.1| hypothetical protein predicted by Glimmer/Critica [Bdellovibrio bacteriovorus HD100] Length = 739 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 88/287 (30%), Gaps = 25/287 (8%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN- 68 N +G I++ A++ ++F+ +VI K L +D + Y A K N Sbjct: 12 LNNKRGQIALFVALIFQILFLFFAMVINVGLLVHHKINLQNSVDLAAYYGAMKQAEGMNV 71 Query: 69 ---GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH--- 122 N +Q + + I + SI Q+ Sbjct: 72 IAHTNYQIRQSWKLLAWRYR--MLGTAGDFEEHPFDKVGGGRIRTTDDDSINPGAQNFYD 129 Query: 123 -KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 + ++ + MP T ++ + + V I S G + L++ S+ Sbjct: 130 SPSFCITYIPFKPMPDGENTCKALSSHTGINVFKIPPVLIDLPSVSGRTRSLALNLRASL 189 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 +++ ML +++ R+ L+++ SK + I Sbjct: 190 -------IERCKDFGSFNYLMLGGFVVAFNIDQGNRALLISYLSKAMSPSSPDADFFDID 242 Query: 242 EKINRL--------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 K +L + ++ YN + + AKG Sbjct: 243 GKSVKLGMENTLKNNLTAANNTSDLKMSIYNSLGHSACNGSQGAKGE 289 >gi|27378328|ref|NP_769857.1| NorD protein [Bradyrhizobium japonicum USDA 110] gi|27351475|dbj|BAC48482.1| NorD protein [Bradyrhizobium japonicum USDA 110] Length = 639 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 84/211 (39%), Gaps = 30/211 (14%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 L + ++ DVSLS D + G L V ++ + + + D +++ +TF+S Sbjct: 447 GHDLAVTLLADVSLS-TDAWVDGYRVLDVEKEALLVLAHGLSACGDHHSI-----LTFTS 500 Query: 226 KIVQTFPL----AWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + L A+G ++ +I L G T+ + +A ++ ++ Sbjct: 501 RRRSWVRLETIKAFGEPMSGLVERRIGALKPGYYTRIGAAVRHASAELARQPQR------ 554 Query: 279 GHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 KK ++ LTDG+ + +++ EA+R G V+ + V A A + Sbjct: 555 -----KKLLLVLTDGKPNDVDHYEGRFAVEDTRKSVQEARRLGIAVFGVTVDATA-QSYF 608 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + V N ++L A I +++ Sbjct: 609 PTLFGRGGYAIVGNIKRLPAALPAIYRQVAH 639 >gi|307302722|ref|ZP_07582478.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|307318570|ref|ZP_07598004.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306895910|gb|EFN26662.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306903086|gb|EFN33677.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 577 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 15/75 (20%), Positives = 33/75 (44%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 R + G+I + A+++P++ MGL I+ + + +L + D + + A + + Sbjct: 8 FRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSVADLASIAAAADVSS 67 Query: 66 QENGNNGKKQKNDFS 80 E + ND Sbjct: 68 AEEAVLAHFRSNDLG 82 >gi|291450680|ref|ZP_06590070.1| magnesium-chelatase subunit [Streptomyces albus J1074] gi|291353629|gb|EFE80531.1| magnesium-chelatase subunit [Streptomyces albus J1074] Length = 277 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 49/140 (35%), Gaps = 18/140 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++G ++ +L + + G Sbjct: 80 QATREGREGNLVLFVVDASGSMA-----ARRRMGAVKGAVLSLL-----LDAYQRRDKVG 129 Query: 220 LVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LVTF + P V ++ L G T GL A++ + + + Sbjct: 130 LVTFRGREAEVALPPTSSVDAAAARLQALPTGGRTPLAAGLLKAHDVLRVERLRDATRRP 189 Query: 279 GHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 190 -------LLVVVTDGRATGG 202 >gi|239978792|ref|ZP_04701316.1| putative magnesium-chelatase subunit [Streptomyces albus J1074] Length = 336 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 49/140 (35%), Gaps = 18/140 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++G ++ +L + + G Sbjct: 139 QATREGREGNLVLFVVDASGSMA-----ARRRMGAVKGAVLSLL-----LDAYQRRDKVG 188 Query: 220 LVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LVTF + P V ++ L G T GL A++ + + + Sbjct: 189 LVTFRGREAEVALPPTSSVDAAAARLQALPTGGRTPLAAGLLKAHDVLRVERLRDATRRP 248 Query: 279 GHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 249 -------LLVVVTDGRATGG 261 >gi|193788347|dbj|BAG53241.1| unnamed protein product [Homo sapiens] Length = 325 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 27/124 (21%), Positives = 48/124 (38%), Gaps = 14/124 (11%) Query: 216 VRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKI 266 V ++TF+S+ V L + + E I+ L G+ T + L Y + Sbjct: 78 VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMM 137 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYA 321 + L + + II LTDG+++ D+ + N+ + +YA Sbjct: 138 NNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYA 197 Query: 322 IGVQ 325 IGV Sbjct: 198 IGVG 201 >gi|319778364|ref|YP_004129277.1| PpkA [Taylorella equigenitalis MCE9] gi|317108388|gb|ADU91134.1| PpkA [Taylorella equigenitalis MCE9] Length = 656 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 69/187 (36%), Gaps = 25/187 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + S ++ ++ V+D ++SM+ P ++K A R + + ++ + V+ Sbjct: 212 AQPSKLTEFKAAVVFVIDSTISMD----PYINKTREAVRELYKQIEKDDLLDQ----VKF 263 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 GLV F S L + + +N + +KE E Sbjct: 264 GLVAFRSSTEAVPDLEY---TSKMYVNPNEVKDGKDFMDKVASLKQAKVSSKEFNEDSYA 320 Query: 279 G----------HDDYKKYIIFLTDGE----NSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 G +D +YI+ +TD N+ + + EA+ +G +Y + + Sbjct: 321 GINQALNDINWNDFGARYIVLITDAGAIDGNNPLSSTGFGAQQLRQEAQHKGVAIYTLHL 380 Query: 325 QAEAADQ 331 + +A + Sbjct: 381 KTQAGSK 387 >gi|313669375|ref|YP_004049802.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] gi|313156572|gb|ADR35249.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 588 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 69/202 (34%), Gaps = 15/202 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + +++ DVSLS + + + ++ + + + D R + TFSS Sbjct: 397 DMSTLILADVSLSTEAGITQEIRVIDMIQDALMVFSESLHRLRD-----RFAIYTFSSIK 451 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 I+ + Y ++ + I K + Sbjct: 452 NTKV----NFHIIKNFKENYSDTIRGRIHAIKPGYYTRMGAGIRESAKILDKQQSANKLL 507 Query: 288 IFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 + L+DG+ + +++ E K++G + I + +A ++L + + Sbjct: 508 LILSDGKPNDVDRYDGRYGIEDTKKAIEEVKKKGITPFCITIDIDA-KEYLPYLFGRNSY 566 Query: 343 YSVQNSRKLHDAFLRIGKEMVK 364 +++++KL I + K Sbjct: 567 AVIRDAKKLPKVLPEIYMNLTK 588 >gi|257063144|ref|YP_003142816.1| hypothetical protein Shel_04060 [Slackia heliotrinireducens DSM 20476] gi|256790797|gb|ACV21467.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 272 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 78/229 (34%), Gaps = 21/229 (9%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 C N P+ +K+ L ++ V+D S SM +F + + A ++L Sbjct: 1 MCTNRLEDPMPNIEYTM--AKARKLLPIIYVIDTSGSM--NFYGRISAVNRAMNETLDVL 56 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIV-QTFPLAWGVQHIQE----KINRLIFGSTTKSTPG 258 + + + V+ ++ FS+ T G + + ++L +T Sbjct: 57 GDVAAKNPTAD-VKVAVLGFSTGAEWITTDPVTGKPALMDLEDFYWDKLKARGSTDLGAA 115 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF-YCNEAK-RRG 316 L ++ + G IIF++DG P D + + C + R Sbjct: 116 LIELGEQLTRDAMLVS--ETGFKVP--VIIFMSDG---GPTDDWESAFEKVCANNRWVRA 168 Query: 317 AIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMV 363 A A+ V A + L A +P+ V +S L + Sbjct: 169 ATKIALAVGDNADREVLARIADGNPEAVVPVSDSATLQKLIKVVSVTAS 217 >gi|260829815|ref|XP_002609857.1| hypothetical protein BRAFLDRAFT_115280 [Branchiostoma floridae] gi|229295219|gb|EEN65867.1| hypothetical protein BRAFLDRAFT_115280 [Branchiostoma floridae] Length = 516 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 29/136 (21%), Positives = 51/136 (37%), Gaps = 9/136 (6%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +++LD+SLSM G + +I+ + ++ + N + + LV FSS Sbjct: 4 VILLDLSLSMTRPVVSGDNDFQRKHLAIQGVSTLLDYLAAHNRLEFTALVAFSSLWELVV 63 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 P + ++E +NR+ T GL + D ++ +T Sbjct: 64 PFTRDYKELKEGLNRMDDYDKTCIETGLAGVSQIVMDEWGVGLPCQ---------VVLVT 114 Query: 292 DGENSSPNIDNKESLF 307 DG KESL Sbjct: 115 DGSPGVGKGSLKESLE 130 >gi|145516504|ref|XP_001444144.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124411548|emb|CAK76747.1| unnamed protein product [Paramecium tetraurelia] Length = 829 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 71/190 (37%), Gaps = 38/190 (20%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 ++D S SM+ ++ A ++ L KS+P + +++F + + + + Sbjct: 330 IIDRSGSMSGS------RIKKAKEALILFL---KSLPQDSEY---NIISFGTNFTKLWNV 377 Query: 234 AWG-----VQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + ++ + + + G T P + Y+K + A + Sbjct: 378 SQNYSQNTLETAIKHVEEMDADMGGTCIIAPLKQMIYHKNYGASKNTTLN---------- 427 Query: 287 IIFLTDGENS-SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFY 343 + LTDG+++ P ID + +Y +G+ E + ++ A ++ Sbjct: 428 VFLLTDGQDTADPIIDLVQKNNLAQT------RIYTLGIGRECSQYLIRRVAEVGNGKYQ 481 Query: 344 SVQNSRKLHD 353 V + +++ Sbjct: 482 IVSDKEDINE 491 >gi|301648002|ref|ZP_07247775.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|301073903|gb|EFK88709.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] Length = 219 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A N + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAHGDTPMGAAITKALNMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|301064754|ref|ZP_07205134.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441129|gb|EFK05514.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 303 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 10/103 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +M+++DVS S FG + R + + + + + N + GLV F+ Sbjct: 80 LTIMLIIDVSAS--GEFGSA----TQSKREMMAEIGSVLAFSAMRNNDKVGLVLFTDDAE 133 Query: 229 QTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIF 267 P G HI I ++F G+ T +++ I Sbjct: 134 LYIPPGKGRSHILRVIREILFFQPSGTGTNLAAAMDFTSQVIK 176 >gi|290997816|ref|XP_002681477.1| predicted protein [Naegleria gruberi] gi|284095101|gb|EFC48733.1| predicted protein [Naegleria gruberi] Length = 452 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 51/148 (34%), Gaps = 31/148 (20%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + S ++++LDVS SM + ++ ATRS+ L Sbjct: 69 WFMGTASTSK------SIIIILDVSSSMGAYH--RLENAIYATRSVINYLTEKD------ 114 Query: 214 NVVRSGLVTFSSKIVQTFPLA--------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 G+V F++ + + + I ++ +T +N Sbjct: 115 ---YVGIVLFNAGAFTCKKQTEFLLKATAQNKKTLIDCIENMMPFGSTNFEAAFNETFN- 170 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 +FD E++ + ++FLTDG Sbjct: 171 LFDRSEEIA-----SSTCDRVVLFLTDG 193 >gi|260834997|ref|XP_002612496.1| hypothetical protein BRAFLDRAFT_75382 [Branchiostoma floridae] gi|229297873|gb|EEN68505.1| hypothetical protein BRAFLDRAFT_75382 [Branchiostoma floridae] Length = 370 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 61/202 (30%), Gaps = 50/202 (24%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD ++ LD S SM G GM +L A R + S ++ V G+V F +K Sbjct: 2 PLDTVLCLDTSGSMA---GRGMRELKKAVREFILGVQETASKHNLRENV--GVVEFGAKT 56 Query: 228 VQTFPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PL + + L+ K TP Sbjct: 57 RIVQPLTNNYAAVLRAVGGVLVLSGQKKMTPR---------------------------- 88 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRG---------AIVYAIGVQAEAADQFLKNCA 337 +I +TDG D + L G + +G + L+ A Sbjct: 89 VILMTDGHPD----DKQNVLKAALSFGPAGWQAVGLPHPIPIACVGCGGDVDGDLLQAVA 144 Query: 338 --SPDRFYSVQNSRKLHDAFLR 357 + + + +L + F R Sbjct: 145 KLTNGMYVQ-GDIGQLSEFFRR 165 >gi|52550325|gb|AAU84174.1| Von Willebrand factor type A domain [uncultured archaeon GZfos37D1] Length = 568 Score = 44.8 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 80/207 (38%), Gaps = 20/207 (9%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + + ++D+S S + V + L +I + R +V Sbjct: 378 TENRTRDIAVAFLVDMSGSTVGSTIRCEKEALVLMSEALKELGDAFAIYGFSGYGRDNVV 437 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F +++ F + Q +Q KI+ + +T+ P + + K+ +E+ Sbjct: 438 FF---LIKDFEDPYD-QRVQSKISTMTNKQSTRIAPAIRHTTTKLRRREERT-------- 485 Query: 282 DYKKYIIFLTDGE----NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + +I L+DG+ + N +++ EA+ G + I V EAA+ + A Sbjct: 486 ---RMLILLSDGKPLDRDYYGNYAIEDTRMALKEAQGYGVKSFCITVDREAAEYLPRMYA 542 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + R+ + + KL D RI K Sbjct: 543 NS-RWVVIDDVLKLPDKITRIYKRFTT 568 >gi|315618148|gb|EFU98739.1| von Willebrand factor type A domain protein [Escherichia coli 3431] Length = 219 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG ++ +F E K+ + IGVQ + Sbjct: 122 YRPWIFLITDGAPTNEWQAAANKVFRGEEDKK--FAFFTIGVQGADMKTLAQ 171 >gi|299140482|ref|ZP_07033620.1| von Willebrand factor, type A [Prevotella oris C735] gi|298577448|gb|EFI49316.1| von Willebrand factor, type A [Prevotella oris C735] Length = 289 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + R + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDF------GTVRQTKREMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S ++ + P G +H I+E ++ T + Y + Sbjct: 126 SDRVEKYIPPKKGRRHILYIIRELLDFKPASVRTDIGMAVSYLTRIMK 173 >gi|290960313|ref|YP_003491495.1| hypothetical protein SCAB_59371 [Streptomyces scabiei 87.22] gi|260649839|emb|CBG72955.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 449 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 55/197 (27%), Gaps = 21/197 (10%) Query: 120 DQHKDYNLSAVSRYEMP-----FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 +++ +P + + S + ++ Sbjct: 7 SNVPQFSVEVYQNEYLPEGGREVNAIVTVSATGGGTIGTAVAAPHLYSPGQGPSAAVALM 66 Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-SIPDVNNVVRSGLVTFSSKIVQTFPL 233 +D S SM+ + +I + D + ++ D +V R + Sbjct: 67 VDCSGSMDYPPTKMRNARDATAAAIDTLRDGVHFAVIDGTHVARE---VYPGGGRLAVAD 123 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + ++ + RL G T L A + + H I LTDG Sbjct: 124 SATRAQAKQALRRLSAGGGTAIGTWLRLADRLLASEDVAIRHG-----------ILLTDG 172 Query: 294 ENSSPN-IDNKESLFYC 309 N + D K +L C Sbjct: 173 RNEHESPEDLKAALDAC 189 >gi|294673500|ref|YP_003574116.1| hypothetical protein PRU_0757 [Prevotella ruminicola 23] gi|294473545|gb|ADE82934.1| conserved hypothetical protein [Prevotella ruminicola 23] Length = 289 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 21/122 (17%), Positives = 48/122 (39%), Gaps = 6/122 (4%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLDF------GTQKQMKRDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S KI + P G +HI I ++ + ++ A + +++ D Y Sbjct: 126 SDKIEKYIPPKKGRKHILYIIREMLDFKPESTRTDIKQAIEFLSSVQKRRTTAFILSDFY 185 Query: 284 KK 285 + Sbjct: 186 VR 187 >gi|118374645|ref|XP_001020510.1| Ubiquitin-conjugating enzyme family protein [Tetrahymena thermophila] gi|89302277|gb|EAS00265.1| Ubiquitin-conjugating enzyme family protein [Tetrahymena thermophila SB210] Length = 579 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 52/147 (35%), Gaps = 10/147 (6%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV-ATRSIREMLDIIKSIPDVNNVVR 217 V+ + D ++++ D+S SM F + A + + N+VV Sbjct: 107 VEQQKEEDPKEAIVILYDISGSMGSTFYNEQGLPRIGAVNAFFSAFADKTLAMEYNHVV- 165 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 L F SKI + + + ++ T+ L N + + K+K + Sbjct: 166 -SLFWFDSKIEKKCEFIRDMNTFIKLVDDAAPRGGTRLYDSLVEGVNSLIEYKKKHPNTI 224 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKE 304 +I LTDGE++ E Sbjct: 225 LR-------MIALTDGEDNESKYKPGE 244 >gi|291438431|ref|ZP_06577821.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] gi|291341326|gb|EFE68282.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] Length = 487 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 69/194 (35%), Gaps = 22/194 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +VLD S SM ++ + +++ E + + D V+ V FS Sbjct: 298 TGTRAKVYLVLDRSASMRPYY------KDGSAQALAEQTLALAAHLDPEATVKV--VFFS 349 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 +++ T L KI+ L A ++ +K D Sbjct: 350 TELDGTGELTLTDHE--NKIDTLHASLGRMGRTSYHAAVAEVLALHDKSA----AGPDTP 403 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGVQAEAADQF--LKNCASPD- 340 ++F TDG +P+ + AK A+ ++ + F L+ +P+ Sbjct: 404 ALVVFQTDG---APDAKTPATQALTEAAKTHPAVFFSFVAFGDPENKAFDYLRKLKAPNT 460 Query: 341 -RFYSVQNSRKLHD 353 F + + ++L D Sbjct: 461 AHFLAGETPKELTD 474 >gi|239930093|ref|ZP_04687046.1| hypothetical protein SghaA1_17826 [Streptomyces ghanaensis ATCC 14672] Length = 578 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 69/194 (35%), Gaps = 22/194 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +VLD S SM ++ + +++ E + + D V+ V FS Sbjct: 389 TGTRAKVYLVLDRSASMRPYY------KDGSAQALAEQTLALAAHLDPEATVKV--VFFS 440 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 +++ T L KI+ L A ++ +K D Sbjct: 441 TELDGTGELTLTDHE--NKIDTLHASLGRMGRTSYHAAVAEVLALHDKSA----AGPDTP 494 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGVQAEAADQF--LKNCASPD- 340 ++F TDG +P+ + AK A+ ++ + F L+ +P+ Sbjct: 495 ALVVFQTDG---APDAKTPATQALTEAAKTHPAVFFSFVAFGDPENKAFDYLRKLKAPNT 551 Query: 341 -RFYSVQNSRKLHD 353 F + + ++L D Sbjct: 552 AHFLAGETPKELTD 565 >gi|53733375|gb|AAH83523.1| Cfb protein [Danio rerio] Length = 653 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 78/216 (36%), Gaps = 29/216 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KIS LD+ + +D S S++ A + I+ +++ I N Sbjct: 245 KISLDRGGKLDIYIAVDASDSIDPK------DFDKAKKIIKTLIEKISYYEVSPNYE--- 295 Query: 220 LVTFSSKIVQTFPL-AWGVQHIQEKINRLIFG--------STTKSTPGLEYAYNKIFDAK 270 ++ F++ + Q + + KI ++ ++ + Y KI D+ Sbjct: 296 ILMFATDVDQIVKMRDFKTNEKARKILKIFEDLDNFNYDKKGDRTGTNIAKLYLKILDSM 355 Query: 271 EKLEHIAKGH-DDYKKYIIFLTDGE-----NSSPNIDNKESLFYCNEA-KRRGAIVYAIG 323 + K + II TDG+ N P +D ++L N A + +Y G Sbjct: 356 SLEQVQNKEDFLQTQHVIIVFTDGQANMGGNPKPKVDLIKNLVIKNNASRENKLDLYVFG 415 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAF 355 V + + + S F+ + + ++ + F Sbjct: 416 VGKDVKKEDMNGLVSEKKDERHFFKLPDLDEVQNTF 451 >gi|120598361|ref|YP_962935.1| TPR repeat-containing protein [Shewanella sp. W3-18-1] gi|120558454|gb|ABM24381.1| TPR repeat-containing protein [Shewanella sp. W3-18-1] Length = 663 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 56/178 (31%), Gaps = 28/178 (15%) Query: 136 PFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P + S+ + + + +V+D+S+SM ++L Sbjct: 57 PLHLLAFTWLIATFALAGPAVNKQSLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 A ++L +K +GLV F+ PL + + ++ Sbjct: 113 QAKFRATDLLRNLKEGE-------TGLVAFAGDAFTISPLTRDTGTLLNLLPTLSPEIMP 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + T GL A + II +TDG + +L Sbjct: 166 VRGSNLTAGLIQAKMLLAQGGHIRGD-----------IILMTDGITPHQFNEANSALS 212 >gi|257887439|ref|ZP_05667092.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] gi|257823493|gb|EEV50425.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] Length = 1107 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 54/139 (38%), Gaps = 29/139 (20%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMN++ D++G + +D + + + + G V +SS+ Sbjct: 267 TPLDLVLVVDWSGSMNNN-----DRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 320 Query: 227 IVQTFPLAWGV------QHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + G ++ ++ + T + GL A + + Sbjct: 321 GHNY---SNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNGH----- 372 Query: 278 KGHDDYKKYIIFLTDGENS 296 KK I+ LTDG + Sbjct: 373 ------KKVIVLLTDGVPT 385 >gi|91211361|ref|YP_541347.1| hypothetical protein UTI89_C2348 [Escherichia coli UTI89] gi|117624269|ref|YP_853182.1| hypothetical protein APECO1_1163 [Escherichia coli APEC O1] gi|218558953|ref|YP_002391866.1| hypothetical protein ECS88_2172 [Escherichia coli S88] gi|237704530|ref|ZP_04535011.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA] gi|91072935|gb|ABE07816.1| hypothetical protein YegL [Escherichia coli UTI89] gi|115513393|gb|ABJ01468.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|218365722|emb|CAR03458.1| conserved hypothetical protein [Escherichia coli S88] gi|226900896|gb|EEH87155.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA] gi|294490406|gb|ADE89162.1| von Willebrand factor type A domain protein [Escherichia coli IHE3034] gi|307626381|gb|ADN70685.1| von Willebrand factor type A [Escherichia coli UM146] gi|315285696|gb|EFU45136.1| von Willebrand factor type A domain protein [Escherichia coli MS 110-3] gi|323951827|gb|EGB47701.1| von Willebrand protein type A [Escherichia coli H252] gi|323956109|gb|EGB51861.1| von Willebrand protein type A [Escherichia coli H263] Length = 219 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 64/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNTGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ + IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFTIGVQGADMKTLAQ 171 >gi|296131764|ref|YP_003639011.1| von Willebrand factor type A [Thermincola sp. JR] gi|296030342|gb|ADG81110.1| von Willebrand factor type A [Thermincola potens JR] Length = 584 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 63/219 (28%), Gaps = 35/219 (15%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + H + + KS +D + +D S SM K+ + L Sbjct: 381 FLQGQRHMSIKREDIRQYGRKSFAPIDTCLAIDCSGSMVGE------KIRAVAYLAQHFL 434 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQ-TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + + +VTF + P Q + E + + T G+ A Sbjct: 435 LTSRE--------KVSVVTFQETSSKVVIPFTKSYQKLVEGLRSIQPEGMTPLAKGILEA 486 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN---KESLFYCNEAKRRGAIV 319 I + + ++ +TDG + P ++L + Sbjct: 487 VELIKKKRARNP-----------LLVLITDGIPNYPLWTTDAQADALKAAEMIAENKIRL 535 Query: 320 YAIGVQAEAADQFLKNCA--SPDRFYSVQ--NSRKLHDA 354 IGV + FLK A Y V + L D Sbjct: 536 VCIGVL--PNESFLKELAKIGKGNLYIVDELDKNSLLDV 572 >gi|115680671|ref|XP_001197987.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] gi|115719718|ref|XP_001196973.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] Length = 528 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 61/185 (32%), Gaps = 30/185 (16%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P+ T+ V + +++VLD+S SM + ++ +S + Sbjct: 89 PVSNTTPVFEVLQLSSVRSVVLVLDISNSMKEK-----NRFDRMIQSSTVYIMSAIPADS 143 Query: 212 VNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIF 267 G+V F S + L Q +N L TT G++ + Sbjct: 144 K-----LGIVVFESASREIAELTDITDTASRQRLVNALNTSPKGTTCIGCGIKSGLKVLG 198 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + YI+ L+DG + D + Y ++ G I+ I + E Sbjct: 199 SYAQGG------------YILLLSDGVENE---DPSITDMY-DDINNSGVIIDTITISNE 242 Query: 328 AADQF 332 A Q Sbjct: 243 ADQQM 247 >gi|260868782|ref|YP_003235184.1| hypothetical protein ECO111_2793 [Escherichia coli O111:H- str. 11128] gi|257765138|dbj|BAI36633.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|323177030|gb|EFZ62620.1| von Willebrand factor type A domain protein [Escherichia coli 1180] Length = 219 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|221107805|ref|XP_002169300.1| PREDICTED: similar to high choriolytic enzyme [Hydra magnipapillata] Length = 1810 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 35/252 (13%), Positives = 87/252 (34%), Gaps = 22/252 (8%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH----APLLITSSVKISSKSDIG- 168 + II +++ ++ + E+ I + +H L I++ V S ++ Sbjct: 90 VYIIKLNENIIHSEKRDNSVELENIKLYVGSSEHITHNGFIRNLFISNQVNSKSNKNLCG 149 Query: 169 --LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD++ ++D S +M D+F + +S ++ F S+ Sbjct: 150 GLLDVVFLIDSSTNMKDNFAKVKRFMKEIAKSFGL----------SREGSHVSIILFGSE 199 Query: 227 IVQTFPLAWGV--QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + L+ + + + + ++ KE G + Sbjct: 200 AKVSLKLSDNQDESTFDSACDSIDENAFPSTKVPIKIEKALQLAQKELFVQANTGREFIP 259 Query: 285 KYIIFLTDGENSSPNIDNK--ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP-DR 341 K +I +T G+ + + + + E + +G + ++ V +E L ++ Sbjct: 260 KTVILITAGKQNPSKYGERVEKPFPFALELREKGVSIISVAVGSEIEKSELTGITGNIEK 319 Query: 342 FYSVQNSRKLHD 353 + V N + L + Sbjct: 320 VFLVNNFQDLLE 331 >gi|170679715|ref|YP_001743066.1| von Willebrand factor type A domain-containing protein [Escherichia coli SMS-3-5] gi|218699334|ref|YP_002406963.1| hypothetical protein ECIAI39_0941 [Escherichia coli IAI39] gi|170517433|gb|ACB15611.1| von Willebrand factor type A domain protein [Escherichia coli SMS-3-5] gi|218369320|emb|CAR17078.1| conserved hypothetical protein [Escherichia coli IAI39] Length = 219 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|167919151|ref|ZP_02506242.1| hypothetical protein BpseBC_11405 [Burkholderia pseudomallei BCC215] Length = 594 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 15 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 69 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 70 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 126 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 127 QSVPYFFL 134 >gi|194436063|ref|ZP_03068165.1| von Willebrand factor type A domain protein [Escherichia coli 101-1] gi|253772996|ref|YP_003035827.1| von Willebrand factor A [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|254162064|ref|YP_003045172.1| hypothetical protein ECB_01978 [Escherichia coli B str. REL606] gi|291283314|ref|YP_003500132.1| hypothetical protein G2583_2597 [Escherichia coli O55:H7 str. CB9615] gi|293415361|ref|ZP_06658004.1| yegL protein [Escherichia coli B185] gi|297520674|ref|ZP_06939060.1| von Willebrand factor type A [Escherichia coli OP50] gi|300899770|ref|ZP_07117990.1| von Willebrand factor type A domain protein [Escherichia coli MS 198-1] gi|300928180|ref|ZP_07143721.1| von Willebrand factor type A domain protein [Escherichia coli MS 187-1] gi|301026364|ref|ZP_07189811.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|168986340|dbj|BAG11880.1| predicted protein [Escherichia coli O55:H7] gi|194424791|gb|EDX40776.1| von Willebrand factor type A domain protein [Escherichia coli 101-1] gi|242377723|emb|CAQ32484.1| conserved protein [Escherichia coli BL21(DE3)] gi|253324040|gb|ACT28642.1| von Willebrand factor type A [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|253973965|gb|ACT39636.1| hypothetical protein ECB_01978 [Escherichia coli B str. REL606] gi|253978159|gb|ACT43829.1| hypothetical protein ECD_01978 [Escherichia coli BL21(DE3)] gi|290763187|gb|ADD57148.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain protein [Escherichia coli O55:H7 str. CB9615] gi|291433009|gb|EFF05988.1| yegL protein [Escherichia coli B185] gi|299879720|gb|EFI87931.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|300356649|gb|EFJ72519.1| von Willebrand factor type A domain protein [Escherichia coli MS 198-1] gi|300463775|gb|EFK27268.1| von Willebrand factor type A domain protein [Escherichia coli MS 187-1] gi|320657496|gb|EFX25294.1| von Willebrand factor type A [Escherichia coli O55:H7 str. 3256-97 TW 07815] gi|320663312|gb|EFX30617.1| von Willebrand factor type A [Escherichia coli O55:H7 str. USDA 5905] gi|323961761|gb|EGB57363.1| von Willebrand protein type A [Escherichia coli H489] gi|323972778|gb|EGB67977.1| von Willebrand protein type A [Escherichia coli TA007] Length = 219 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|167902891|ref|ZP_02490096.1| hypothetical protein BpseN_11585 [Burkholderia pseudomallei NCTC 13177] Length = 593 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 14 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 68 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 69 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 125 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 126 QSVPYFFL 133 >gi|119513236|ref|ZP_01632280.1| hypothetical protein N9414_00925 [Nodularia spumigena CCY9414] gi|119462103|gb|EAW43096.1| hypothetical protein N9414_00925 [Nodularia spumigena CCY9414] Length = 224 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 61/181 (33%), Gaps = 21/181 (11%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 H L + V+ + + +++LD S SM D + + + + D + Sbjct: 2 HDTLKLDEIVEFAENPEPRCPCVLLLDTSGSMQG------DAIEALNQGLLSLKDELVKN 55 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIF 267 V +VTF S + ++ N L T G+ A + + Sbjct: 56 SLAARRVEVAIVTFDSNVNIVQDF-----VTADQFNPPILTAQGLTTMGAGIHKALDMV- 109 Query: 268 DAKEKLEHIAKGHDDYKKYIIFL-TDGENS---SPNIDNKESLFYCNEAKRRGAIVYAIG 323 +E+ Y + +F+ TDGE I+ EA +R + +G Sbjct: 110 --QERKSLYRTNGIAYYRPWVFMITDGEPQGELDHLIEQAAQRLQGEEANKR-VAFFTVG 166 Query: 324 V 324 V Sbjct: 167 V 167 >gi|108760718|ref|YP_633803.1| hypothetical protein MXAN_5665 [Myxococcus xanthus DK 1622] gi|108464598|gb|ABF89783.1| conserved hypothetical protein [Myxococcus xanthus DK 1622] Length = 293 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 24/97 (24%), Positives = 39/97 (40%), Gaps = 10/97 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +M+V+DVS S FG + + + + N R GLV FS ++ Sbjct: 77 LTVMLVVDVSAS--KEFGSR----ERTKSEVAAEVAAQIAFSAIANNDRVGLVLFSDRVE 130 Query: 229 QTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEY 261 + P G H+ ++ ++ G T GL Y Sbjct: 131 KVVPPRKGRTHVLRLVSDILTFQPQGRGTDLAAGLNY 167 >gi|84385675|ref|ZP_00988706.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] gi|84379655|gb|EAP96507.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] Length = 520 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 60/194 (30%), Gaps = 20/194 (10%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 +G ++ +LL I + +E + ++L + + + Sbjct: 16 LQRQQGVAAVWMGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIAV--------TI 67 Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + Q + + ++N + L + F Q + + ++ Sbjct: 68 EDQPDQARGLATKYVENYVRDIKSTNLSADRFHQAEDEGAGVLEYIQYTVNAKTTHDSWF 127 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG- 188 S + +P S + D +D++ V D S SMND +G Sbjct: 128 ASSF-IPSFDQQQDLAGRSLARKYPVYL-------GDNNIDIVFVSDFSGSMNDRWGSSR 179 Query: 189 ---MDKLGVATRSI 199 +D L A +I Sbjct: 180 HIKIDDLKTAIDAI 193 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 54/150 (36%), Gaps = 10/150 (6%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +S L+ + + IN + T + G+ + D + Sbjct: 364 NSSQFSNIRLSNKLSDL-NPINSMWADGGTAAFQGILRGSQVLHDGDPNSSDQEEQQVYN 422 Query: 284 KKY--IIFLTDGENSSPNIDNKE--SLFYCNEAKRR--GAIVYAIGVQAEAADQF-LKNC 336 KK ++ L+DG+ S N K C++A+ G + IG+ A+ Q ++C Sbjct: 423 KKIKMLLILSDGQESPNNGILKGLVDRGMCDKARNEIPGLYIGVIGIDFRASQQSGFQDC 482 Query: 337 ASP--DRFYSVQNSRKLHDAFLRIGKEMVK 364 + V N +L + + ++ K Sbjct: 483 VVDPNEDIIDVSNLDELIEKIEELIRKGSK 512 >gi|307544656|ref|YP_003897135.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] gi|307216680|emb|CBV41950.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] Length = 612 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 68/199 (34%), Gaps = 33/199 (16%) Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 S SM + S E+L + +P V +G+ TF ++ PL Sbjct: 47 SGSMKRNDP------DQLAASAMELL--VSLLPGG---VSAGVWTFGERVDNPLPLG--- 92 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG---- 293 +++ P L Y + D + LE A + +++I +TDG Sbjct: 93 -----EVSEQWRERALALPPALRD-YQQYTDIETALEQAASAEANGWRHLILMTDGVIDL 146 Query: 294 ---ENSSPNIDNKESLFYCNEA----KRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYS 344 + P ID RG +V+AI EA ++ A + Sbjct: 147 PPSRGNKPGIDEASRQRLVESMAPRFADRGVVVHAIAFSDEADLALVEQLAQRTGGLASV 206 Query: 345 VQNSRKLHDAFLRIGKEMV 363 ++ L AFL I + + Sbjct: 207 AKSPESLLGAFLDIIERIF 225 >gi|238027555|ref|YP_002911786.1| membrane protein [Burkholderia glumae BGR1] gi|237876749|gb|ACR29082.1| Membrane protein [Burkholderia glumae BGR1] Length = 620 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 12/53 (22%), Positives = 28/53 (52%), Gaps = 1/53 (1%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +G+ +++T I + V+ V+G+ ++ + FF + L I D + L ++ Sbjct: 22 RRERGAFAMMTIIFMTVMIAVLGM-LDIGNVFFQRRDLQRIADMAALAGVQRL 73 >gi|218781370|ref|YP_002432688.1| magnesium chelatase [Desulfatibacillum alkenivorans AK-01] gi|218762754|gb|ACL05220.1| Magnesium chelatase [Desulfatibacillum alkenivorans AK-01] Length = 680 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 70/203 (34%), Gaps = 28/203 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + IG ++ V+D S SM ++ + +I +L + + +V+ Sbjct: 489 REKRIGNFLLFVVDASGSMG-----AKGRMAASKGAIMSLL-----LDAYQKRDKVAMVS 538 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + + P V+ + + L G T G+ E + + Sbjct: 539 FRKDEALVNLPPTSSVELAAKLLAELPIGGKTPLAAGM-------VKCAEMVRNALVRDP 591 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAADQFLKNCAS- 338 + ++F+TDG+ + K A+ + V+ + V E+ A Sbjct: 592 SMRPIVLFITDGKANVGLNGGKPVPEAMELAERLSQDERVHYVVVDTESRGLVRFGLAGR 651 Query: 339 -----PDRFYSVQN--SRKLHDA 354 R++ V++ + +L D Sbjct: 652 LAESLGARYFQVEDLRAEQLVDI 674 >gi|118361105|ref|XP_001013783.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89295550|gb|EAR93538.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 368 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 50/161 (31%), Gaps = 21/161 (13%) Query: 174 VLDVSLSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 V+D+S SM+ F G + +L + + + N++ G K Sbjct: 193 VIDISGSMDYTFKANGETISRLAFVKSQLTKTIADQLKPFQKFNIIIFGNSASQWKTDYV 252 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 VQ IN+L T + GL+ A+N K I L Sbjct: 253 DATPENVQAAIAYINKLTTNGATNISSGLDLAFNT---------------KQALKAIYLL 297 Query: 291 TDGENSSPNIDNKESLFYC---NEAKRRGAIVYAIGVQAEA 328 +DG +S Y N ++ +V I Sbjct: 298 SDGVPNSGVQTVDGIKKYLADKNASRTDKVVVNTISFIMGG 338 >gi|303237431|ref|ZP_07323997.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302482381|gb|EFL45410.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 625 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 72/216 (33%), Gaps = 26/216 (12%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS--- 157 F ++I + + + Y + + + + + P + T Sbjct: 385 FIAQSSDIIPIEEETNTGNPINYSYPNTWRIGDPIESLDLILSYITSPKLIPGITTKKWE 444 Query: 158 --SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + + D+++V+D S SM D + A + +L+ +S Sbjct: 445 RITTESHGSEKKLRDLLLVVDTSGSMK-SVTREADNMHQAVLASFGILNYFES-----KK 498 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + L+ FS + + W VQ+ + + T + G ++ ++I EK ++ Sbjct: 499 CKVALIEFSDSV--RVDITWTVQY-----DEIREKLLTNGSGGTQFPIHRIQSTLEKSKN 551 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 + +TDGE + Y NE Sbjct: 552 EL--------VTVVITDGELGNLQESITYFREYLNE 579 >gi|281423278|ref|ZP_06254191.1| von Willebrand factor, type A [Prevotella oris F0302] gi|281402614|gb|EFB33445.1| von Willebrand factor, type A [Prevotella oris F0302] Length = 289 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + R + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDF------GTVRQTKREMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S ++ + P G +H I+E ++ T + Y + Sbjct: 126 SDRVEKYIPPKKGRRHILYIIRELLDFKPASVRTDIGMAVSYLTRIMK 173 >gi|270007556|gb|EFA04004.1| hypothetical protein TcasGA2_TC014153 [Tribolium castaneum] Length = 813 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 77/223 (34%), Gaps = 34/223 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV--TFSSKIV 228 ++ VL+ L+M G +D+L A + I L + +VR G + S Sbjct: 299 VVFVLNHGLTM---HGRKIDQLIDAMQKILSELTENDAFD----IVRFGATPSVWDSTRH 351 Query: 229 QTFPLAWGVQH--IQEKINRLIFGSTTKS-TPGLEYAYNKIFDAKEKLEHIAKGHDD--- 282 + L + ++ + +L T+K+ +E A + I+D + Sbjct: 352 KFIRLPDLRHYGNLEPYVKKLFLPRTSKAVRQNIEAARSTIYDKSGLGLSNPVYALEVGL 411 Query: 283 ----------YKKY---IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +Y IIFLTD + E + + ++++ + Sbjct: 412 FLAKRIQDNLPNRYQPMIIFLTDSYPTVGMTSQNEIINTVTKVNNNRIPIFSLSFGEDVD 471 Query: 330 DQFLKNCAS-----PDRFYSVQNSR-KLHDAFLRIGKEMVKQR 366 F++ A+ Y ++ ++ + + I ++ Q Sbjct: 472 KNFMRQLAAKNLGFSGHIYEALDASVQILNFYRSISSPVLSQV 514 >gi|218189604|gb|EEC72031.1| hypothetical protein OsI_04925 [Oryza sativa Indica Group] Length = 1128 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 77/221 (34%), Gaps = 31/221 (14%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P+L+ +V ++ +D++ +LD+S G +L + +++ ++ + + Sbjct: 599 LPVLVRVAVPATAARRAPVDLVTLLDISC--GGGGGAPARRLDLLRKAMDLVIGNLGADD 656 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R +V F S +V L G K+ L TK P L A + Sbjct: 657 ------RLAIVPFHSSVVDATGLLEMSVEGRGVASRKVQSLAVAGGTKLFPALNAAVEIL 710 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + + ++ ++DG++ EA V+A G + Sbjct: 711 ------EARCWEAKRERVGAVVLISDGDD----------RTIFREAINPRYPVHAFGFRG 754 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRK-LHDAFLRIGKEMVK 364 + + + A + + + + + DAF + + Sbjct: 755 AHDARAVHHVADHTSGVYGVLDDEHDRVTDAFAACVRRVTS 795 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 66/202 (32%), Gaps = 24/202 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +S +D++ VLDVS MD L A + + L + V Sbjct: 42 PPAAASSERAPIDLVAVLDVSCCGGLGPVNRMDLLKKAMGFVIDKLGEHDRLAVVPVQAS 101 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + G + + + L K + L+ A + K Sbjct: 102 AAIAEKHDLVEMNAE---GRKEATRMVQSSLTVTGENKLSTALKKAATIL--EGRKDHDK 156 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + +I+ ++DG+++S N+A V+A G + + + Sbjct: 157 KRPG-----FIVLISDGDDAS----------VLNDAMNLNCSVHAFGFRDAHNARAMHRI 201 Query: 337 A--SPDRFYSVQNSRK-LHDAF 355 A S + + + L DAF Sbjct: 202 ANTSAGTYGILNDGHDGLADAF 223 >gi|125570256|gb|EAZ11771.1| hypothetical protein OsJ_01640 [Oryza sativa Japonica Group] Length = 1034 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 77/221 (34%), Gaps = 31/221 (14%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P+L+ +V ++ +D++ +LD+S G +L + +++ ++ + + Sbjct: 505 LPVLVRVAVPATAARRAPVDLVTLLDISC--GGGGGAPARRLDLLRKAMDLVIGNLGADD 562 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R +V F S +V L G K+ L TK P L A + Sbjct: 563 ------RLAIVPFHSSVVDATGLLEMSVEGRGVASRKVQSLAVAGGTKLFPALNAAVEIL 616 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + + ++ ++DG++ EA V+A G + Sbjct: 617 ------EARCWEAKRERVGAVVLISDGDD----------RTIFREAINPRYPVHAFGFRG 660 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRK-LHDAFLRIGKEMVK 364 + + + A + + + + + DAF + + Sbjct: 661 AHDARAVHHVADHTSGVYGVLDDEHDRVTDAFAACVRRVTS 701 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 66/202 (32%), Gaps = 24/202 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +S +D++ VLDVS MD L A + + L + V Sbjct: 42 PPAAASSERAPIDLVAVLDVSCCGGLGPVNRMDLLKKAMGFVIDKLGEHDRLAVVPVQAS 101 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + G + + + L K + L+ A + K Sbjct: 102 AAIAEKHDLVEMNAE---GRKEATRMVQSSLTVTGENKLSTALKKAATIL--EGRKDHDK 156 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + +I+ ++DG+++S N+A V+A G + + + Sbjct: 157 KRPG-----FIVLISDGDDAS----------VLNDAMNLNCSVHAFGFRDAHNARAMHRI 201 Query: 337 A--SPDRFYSVQNSRK-LHDAF 355 A S + + + L DAF Sbjct: 202 ANTSAGTYGILNDGHDGLADAF 223 >gi|6127227|gb|AAD31503.2|AF138810_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 447 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 60/153 (39%), Gaps = 19/153 (12%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V + K D G D+M +LDVS M+ D F +K+ ++ K + Sbjct: 191 TIDVTVTVKPKQIDEGADVMALLDVSKKMSEDDFKNAKEKIKTLVTTLTGKSSDGKENLN 250 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAY 263 N VR L+TF KI + L+ I +K+ + + + A Sbjct: 251 NRNTVR--LMTFYRKISEPIDLSGKTSEEVEKELDNIWDKVKKEDWDWGVDLQGAIHRAR 308 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + +EK +++I+ + GE++ Sbjct: 309 DIFKKDQEKKSGK-------RQHIVLFSQGEST 334 >gi|302542925|ref|ZP_07295267.1| putative tellurium resistance protein [Streptomyces hygroscopicus ATCC 53653] gi|302460543|gb|EFL23636.1| putative tellurium resistance protein [Streptomyces himastatinicus ATCC 53653] Length = 257 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 56/154 (36%), Gaps = 29/154 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 S + +VLD S SM ++ + L T ++ LD +P +V F Sbjct: 53 SGQRAAVYLVLDRSGSMRPYYRDGSVQHLAEQTLALAVNLDDDGVVP---------VVFF 103 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S++I T ++ ++ +++IN L T ++ + H Sbjct: 104 STEIDGTAEIS--LEAYRDRINPLHDSMGHMGRTNYHVAMQAVID----------HYQSC 151 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + +++F TDG +S C AK Sbjct: 152 GAEDPAFVVFQTDGSPTSKAAAEH---VLCTAAK 182 >gi|239817420|ref|YP_002946330.1| hypothetical protein Vapar_4453 [Variovorax paradoxus S110] gi|239803997|gb|ACS21064.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 589 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 49/159 (30%), Gaps = 3/159 (1%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 RN +GSI + TAI L +I I + + E + F++K +L D + L A +I Sbjct: 11 RNAGRQIRGSILVNTAIALSLIVITL-IGTELGYLFYMKRELQKATDLAALAGAKEISYA 69 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTD--FRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 + + K + + L + + ++TS + Sbjct: 70 GSCPSAKTAAKLSANGTGSTDRNRNLPISFSLEDAEIECGQWDPAKTTSDHFDSAPPDQQ 129 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 + + F + + S Sbjct: 130 NAIRITLNRTPATLLSFFEGNRTIQTKAVATNDPIAAFS 168 >gi|111226678|ref|XP_642029.2| type A von Willebrand factor domain-containing protein [Dictyostelium discoideum AX4] gi|122056765|sp|Q54Z23|INT6_DICDI RecName: Full=Integrator complex subunit 6 homolog gi|90970713|gb|EAL68149.2| type A von Willebrand factor domain-containing protein [Dictyostelium discoideum AX4] Length = 1107 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 47/138 (34%), Gaps = 8/138 (5%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + V+D S SM+ GM L + +I + I + N R L+T + Sbjct: 3 ITFVVDTSGSMSQKTTNGMTLLDCSKAAIEHFIKIRSKDASMRN-DRFFLITSEENPITA 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF-----DAKEKLEHIAKGHDDY 283 + W +++ L + L+ +++ + + + Sbjct: 62 VKIGWKDNFNSFIQEVKNLQTKDMSNLGFSLQKSFDFLNQFRVQSSIDNYGQGRNPWFIE 121 Query: 284 KKYIIFLTDGENSSPNID 301 II LTDG + + + Sbjct: 122 PAIIILLTDGSSLTNSSS 139 >gi|86748912|ref|YP_485408.1| TadE-like [Rhodopseudomonas palustris HaA2] gi|86571940|gb|ABD06497.1| TadE-like [Rhodopseudomonas palustris HaA2] Length = 181 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 48/152 (31%), Gaps = 13/152 (8%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYIL-DHSLLYTATKILN 65 R F N +GS +I A++ P+ ++ +IET+ F L + D + L Sbjct: 14 RRFSRNRRGSAAIEFAMIAPIFIALLFAIIETAFVFLASQVLETAVQDSARL-------- 65 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 G+ Q ++ K + +G D+ + S I Sbjct: 66 ---ILTGQAQAASYTQSQFKTDLCNRLKALFSCDGVYVDVQSYGSDFSTVSITTPIDSSK 122 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 N +Y P PL +T Sbjct: 123 NFVNTMKYS-PGAAGDIVVVRAFYQWPLFVTG 153 >gi|153001302|ref|YP_001366983.1| TPR repeat-containing protein [Shewanella baltica OS185] gi|151365920|gb|ABS08920.1| Tetratricopeptide TPR_2 repeat protein [Shewanella baltica OS185] Length = 693 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 57/178 (32%), Gaps = 28/178 (15%) Query: 136 PFIFCTFPW--CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P + S+ + + + +V+D+S+SM ++L Sbjct: 57 PLHLLAFTWLMATFALAGPAVNKQSLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 A ++L +K +GLV F+ PL + + ++ Sbjct: 113 QAKFRATDLLRNLKEGE-------TGLVAFAGDAFTISPLTRDTGTLLNLLPTLSPEIMP 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + GL A + II +TDG ++ D +L Sbjct: 166 VRGSNLAAGLTQAKTLLAQGGHIRGD-----------IIVMTDGITAAQFDDANSALS 212 >gi|327288542|ref|XP_003228985.1| PREDICTED: integrator complex subunit 6-like isoform 2 [Anolis carolinensis] Length = 861 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 44/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ + +++ + R LVTF + Sbjct: 4 LLFLIDTSASMNQRAYLGTSYLDIAKGAVELFM-KLRARDPASRGDRYMLVTF-DEAPFC 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|327288540|ref|XP_003228984.1| PREDICTED: integrator complex subunit 6-like isoform 1 [Anolis carolinensis] Length = 898 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 44/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ + +++ + R LVTF + Sbjct: 4 LLFLIDTSASMNQRAYLGTSYLDIAKGAVELFM-KLRARDPASRGDRYMLVTF-DEAPFC 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|326912337|ref|XP_003202509.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Meleagris gallopavo] Length = 1086 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 49/133 (36%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 245 DIVIIVDVSGSMKGL------RMTIAKHTIVTILDTLGENDFVN------IIAYNDYVHF 292 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ ++ L L ++ + + A Sbjct: 293 IEPCFKGILVQADRDNREHFKQLVDELQAKGVGTVNKALTESFKIL-----REFRDAGQG 347 Query: 281 DDYKKYIIFLTDG 293 + I+ +TDG Sbjct: 348 GLCNQAIMLITDG 360 >gi|171911164|ref|ZP_02926634.1| hypothetical protein VspiD_08320 [Verrucomicrobium spinosum DSM 4136] Length = 854 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 51/333 (15%), Positives = 104/333 (31%), Gaps = 51/333 (15%) Query: 55 SLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTD-FRNELRENGFAQD------INN 107 + L A + N + + ++ ++ D LR GF + + N Sbjct: 255 ARLMAADDAVAGNNAASQWVEVQGGPRVVLATAYENDPLAQALRAQGFEVEAITDLGLLN 314 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSS------HAPLLITSSVKI 161 + T +++ + Y L + F A S V+ Sbjct: 315 VGTLTGTKVVVLNNVPAYRLDPRFTKALDFFVNHQGGGLAMVGGKHSFAAGGYFGSPVEP 374 Query: 162 ---------SSKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPD 211 + + M +V+D S SM+ G + K+ +A ++++ Sbjct: 375 LLPVSMELKQEHRKLAVAMAIVMDRSGSMSMTAPGTSLVKMQLANEGAARGIELLGDSDM 434 Query: 212 VNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIF 267 + S+ + PL +Q + R+ G GL+ A+ ++ Sbjct: 435 AC------VYAVDSEPHEVSPLVAVGSNRGTLQNAVRRVESTGGGIYVYQGLKRAWAELE 488 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 AK +++II D ++ + E + + V IG+ E Sbjct: 489 KAK-----------VGQRHIILFADAADAE---EPGEYKALLEKMSKEKGTVSVIGLGTE 534 Query: 328 A--ADQFLKNCA--SPDRFYSVQNSRKLHDAFL 356 FLK+ A R + + ++L F Sbjct: 535 KDSDADFLKDVALRGNGRIFFNSDPKELPALFA 567 >gi|149601522|ref|XP_001515070.1| PREDICTED: similar to integrin alpha 2 subunit, partial [Ornithorhynchus anatinus] Length = 329 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 72/210 (34%), Gaps = 33/210 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ + ++ Sbjct: 111 IDVVVVCDESNS--------IYPWNAVKNFLEKFVQGLDIGPTKTQ---VGLIQYGNEPR 159 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFDAKEKLEHIAKGHDD-YKKY 286 F + + K + S T G L + I A++ +A G K Sbjct: 160 VVF----NMNKFKTKEEMVQETSRTNQNGGSLTNTFKAIEFARQNAFSLASGGRPHATKV 215 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV---------QAEAADQFLKNCA 337 ++ +TDGE + D ++ ++ I + I V + +K A Sbjct: 216 MVVVTDGE----SHDGSNLKKVIDQCEQDNIIRFGIAVLGYLLRNELDTKNLITEIKAIA 271 Query: 338 SP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 S F++V + L + +G+ + Sbjct: 272 SSPTEKYFFNVSDEAALLEKAGTLGERIFS 301 >gi|67078187|ref|YP_245807.1| D-amino acid dehydrogenase, large subunit [Bacillus cereus E33L] gi|66970493|gb|AAY60469.1| conserved hypothetical protein [Bacillus cereus E33L] Length = 452 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 67/198 (33%), Gaps = 18/198 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + L++ ++LD S SM G K+ A ++I LD I +V V + + Sbjct: 148 KEKSLNVEILLDASGSMAGKVN-GQVKMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNN 206 Query: 225 SKIVQTFPLAWGVQHI--------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L+ G + +E+ N + K L A + D ++ Sbjct: 207 EN---DKSLSCGSSEVMYPLQPYKKEQFNAALSKFGPKGWTPLASAIESVNDDFKEYTGE 263 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + YI+ +DGE + + + + V+ Q + Sbjct: 264 ENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLMNTA 318 Query: 337 -ASPDRFYSVQNSRKLHD 353 A + +V N+ +L+ Sbjct: 319 EAGKGNYATVSNADELYQ 336 >gi|312135597|ref|YP_004002935.1| von willebrand factor type a [Caldicellulosiruptor owensensis OL] gi|311775648|gb|ADQ05135.1| von Willebrand factor type A [Caldicellulosiruptor owensensis OL] Length = 667 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 39/270 (14%), Positives = 83/270 (30%), Gaps = 83/270 (30%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF- 223 S + +++V+D + K+ A + + ++D K+ + + G+V F Sbjct: 71 SRSPVSVVLVID-----SSGSMSASSKMTAAKNAAKNLIDSFKN--SAKSGDKLGIVDFD 123 Query: 224 -------------------------SSKIVQTFPL-------------AWGVQHIQEKIN 245 SS I + L + + I+ Sbjct: 124 TFVNDNSNFYVKGFYLQNGSWQKGNSSTIYGPYSLPNTCTSSLLDLTNTSAINSAKNLID 183 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP------- 298 + T L A N + + KY+IF+TDG + Sbjct: 184 NMNASGGTNMEAALNKAKNLLNASPSGN----------DKYVIFITDGMPTFYLNGTHNG 233 Query: 299 ------------NIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD------QFLKNCASPD 340 N E+L + G ++ +GV AD + + + A+ Sbjct: 234 YPLVDGPGLQPNNTTKSETLSAVQSLSQSGTKLFVVGVDTTGADVDKTFIELMASTANGK 293 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 +Y + ++ L+ I +++ + Y+ Sbjct: 294 SYY-ISSTNALNSILQDIF-KIINLAVTYD 321 >gi|300917093|ref|ZP_07133784.1| von Willebrand factor type A domain protein [Escherichia coli MS 115-1] gi|300415660|gb|EFJ98970.1| von Willebrand factor type A domain protein [Escherichia coli MS 115-1] gi|323172661|gb|EFZ58295.1| von Willebrand factor type A domain protein [Escherichia coli LT-68] Length = 219 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 64/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ + IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFTIGVQGADMKTLAQ 171 >gi|296208405|ref|XP_002751079.1| PREDICTED: calcium-activated chloride channel regulator 2 [Callithrix jacchus] Length = 942 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 77/213 (36%), Gaps = 41/213 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + L + + ++ I+ +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAE----ANRLLQLQQAAEFYLMQIV----EIHTFV--GIASFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + + + + + GL+ + + Y Sbjct: 362 AQLHQINSDDDRKLLVSYLPTAVTAKTEVSICSGLKKGFEVV---------EKLNGKAYG 412 Query: 285 KYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQFLKNCASPD--R 341 +I +T G++ + + C G+ +++I + + A + S + Sbjct: 413 SVMILVTSGDD--------KFIGNCLPTVLSSGSTIHSIALGSSAVPNLEEFSHSTGGLK 464 Query: 342 FY--SVQNSRKLHDAFLRIGK---EMVKQRILY 369 F+ + NS + DAF RI ++ +QRI Sbjct: 465 FFVPDISNSNSMTDAFSRISAGTGDIFQQRIQL 497 >gi|239816025|ref|YP_002944935.1| von Willebrand factor type A [Variovorax paradoxus S110] gi|239802602|gb|ACS19669.1| von Willebrand factor type A [Variovorax paradoxus S110] Length = 334 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 78/234 (33%), Gaps = 43/234 (18%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE-----MLDIIKSIP 210 S + ++ G ++++++D S SM+ + L A R +E + D++ Sbjct: 69 GQSGAVVERAGRGAEVLILMDRSSSMDATV--HTNGLKTAGRMSQEPKAKVVRDLLSEFV 126 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST---TKSTPGLEYAYNKIF 267 R +TFS+ + P +Q + G T+ L A + Sbjct: 127 AKRPDNRFAFMTFSTVPIAVVPFTQKADTVQAALAATAIGRGLPETRMGLALLAAIEEFE 186 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 + I+ ++DG + + + A+ + +Y I V++ Sbjct: 187 G----------RSYSGSRVILIVSDG--GAQLDEPTRQRVHAGLAREK-IGLYWIYVRSG 233 Query: 328 AA--------------------DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 QF K ++P R Y V +S + A I ++ Sbjct: 234 PNSPNLNTETVSAYGLGEELALHQFFKTLSTPYRLYQVDDSNAMAAAMAEIDRQ 287 >gi|30063554|ref|NP_837725.1| hypothetical protein S2315 [Shigella flexneri 2a str. 2457T] gi|56480041|ref|NP_708009.2| hypothetical protein SF2190 [Shigella flexneri 2a str. 301] gi|30041807|gb|AAP17534.1| hypothetical protein S2315 [Shigella flexneri 2a str. 2457T] gi|56383592|gb|AAN43716.2| orf, conserved hypothetical protein [Shigella flexneri 2a str. 301] gi|281601564|gb|ADA74548.1| hypothetical protein SFxv_2416 [Shigella flexneri 2002017] gi|313649826|gb|EFS14246.1| von Willebrand factor type A domain protein [Shigella flexneri 2a str. 2457T] gi|332755960|gb|EGJ86314.1| von Willebrand factor type A domain protein [Shigella flexneri K-671] gi|332756714|gb|EGJ87062.1| von Willebrand factor type A domain protein [Shigella flexneri 2747-71] gi|332766479|gb|EGJ96688.1| conserved protein [Shigella flexneri 2930-71] gi|333017105|gb|EGK36426.1| von Willebrand factor type A domain protein [Shigella flexneri K-304] Length = 378 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAMEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|320010772|gb|ADW05622.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 453 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 53/162 (32%), Gaps = 30/162 (18%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 +++++D S SM+ K+ A + E +D + + Sbjct: 53 GPSARVPGQAPAAAVVLMVDCSGSMDY----PPTKMRHARDATAEAIDTL------RDGT 102 Query: 217 RSGLVTFSSKIVQTFP----LAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 R +V + +P LA +E + RL G T L A + Sbjct: 103 RFAVVAGTHVAKDVYPGNGRLATAGPQTKAQAKEALRRLSAGGGTAIGTWLRLADRLLGA 162 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYC 309 A+ + H I LTDG N + D + +L C Sbjct: 163 AEVDIRHG-----------ILLTDGRNEHESPEDLRAALESC 193 >gi|212704357|ref|ZP_03312485.1| hypothetical protein DESPIG_02412 [Desulfovibrio piger ATCC 29098] gi|212672216|gb|EEB32699.1| hypothetical protein DESPIG_02412 [Desulfovibrio piger ATCC 29098] Length = 331 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 66/177 (37%), Gaps = 30/177 (16%) Query: 170 DMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL--VTFSSK 226 V+D S SM + DK+ VA +++ + I ++ + GL + + Sbjct: 33 SFDFVVDYSGSMMMKNDKMKQDKIEVAKIALKRVNAAIPALD-----FKGGLHTIAPNGT 87 Query: 227 IVQTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 +++ P W + IN L FG T GL+ Y + E+ Sbjct: 88 VIEQGP--WNRAAMDSGINTLRSGFATFGRMTNMGDGLQT-YEPFLSSMERSAA------ 138 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA 337 +I +TDG+N+ + + A +R +++ I + + +K A Sbjct: 139 -----LILVTDGDNNRGMDLVEVARQV--YATQRNMVIHVISLADTPQGEATVKAIA 188 >gi|77552210|gb|ABA95007.1| expressed protein [Oryza sativa Japonica Group] Length = 629 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 77/221 (34%), Gaps = 31/221 (14%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P+L+ +V ++ +D++ +LD+S G +L + +++ ++ + + Sbjct: 100 LPVLVRVAVPATAARRAPVDLVTLLDISC--GGGGGAPARRLDLLRKAMDLVIGNLGADD 157 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 R +V F S +V L G K+ L TK P L A + Sbjct: 158 ------RLAIVPFHSSVVDATGLLEMSVEGRGVASRKVQSLAVAGGTKLFPALNAAVEIL 211 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + + ++ ++DG++ EA V+A G + Sbjct: 212 ------EARCWEAKRERVGAVVLISDGDD----------RTIFREAINPRYPVHAFGFRG 255 Query: 327 EAADQFLKNCA--SPDRFYSVQNSRK-LHDAFLRIGKEMVK 364 + + + A + + + + + DAF + + Sbjct: 256 AHDARAVHHVADHTSGVYGVLDDEHDRVTDAFAACVRRVTS 296 >gi|311105412|ref|YP_003978265.1| hemolysin-type calcium-binding repeat family protein 2 [Achromobacter xylosoxidans A8] gi|310760101|gb|ADP15550.1| hemolysin-type calcium-binding repeat family protein 2 [Achromobacter xylosoxidans A8] Length = 1396 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 49/140 (35%), Gaps = 14/140 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMD---KLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + ++ ++LD+S SM D L A ++++ +L+ + + L+ Sbjct: 760 AGTSYNIALLLDLSYSMGWANSANPDGDTALDAAKKALKHLLES--QLATHEGTINVSLI 817 Query: 222 TFSSK----IVQTFPLA-WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 TF+ + L V I + L G T L +H Sbjct: 818 TFNDEDIRVQKSISDLTPDNVDEIVSSLLHLETGPNTPYGAALHETKKWFDGQPTVDDH- 876 Query: 277 AKGHDDYKKYIIFLTDGENS 296 + YK FLTDGE + Sbjct: 877 ---GNPYKNLTYFLTDGEPT 893 >gi|260892924|ref|YP_003239021.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] gi|260865065|gb|ACX52171.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] Length = 298 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 10/65 (15%), Positives = 26/65 (40%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 + +G++++L A L + + LV++ + +L +D + L + +G Sbjct: 4 KSERGTVAVLVAAALTFLLGLAALVVDGGGLLLARERLVNAVDAAALAGVQFLPGDPSGA 63 Query: 71 NGKKQ 75 Sbjct: 64 VQTAL 68 >gi|296136111|ref|YP_003643353.1| von Willebrand factor type A [Thiomonas intermedia K12] gi|295796233|gb|ADG31023.1| von Willebrand factor type A [Thiomonas intermedia K12] Length = 753 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 65/182 (35%), Gaps = 36/182 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + ++LD+S S+ND G L ++ ++ + I+ + D + F Sbjct: 560 GRDIAVTLLLDLSASLNDKVKGSEQTILELSQEAVSLLAWAIERLGDP-----LAIAGFQ 614 Query: 225 SKIVQT--------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 S F WG ++ ++ + +T+ L +A H Sbjct: 615 SNTRHEVRYLHIKGFGEPWG-DEVKARLAAMQANYSTRMGAALRHA-----------AHS 662 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + KK ++ LTDGE + ID ++ +A++ V+ D C Sbjct: 663 LRARKSDKKLLLVLTDGEPAD--IDVQDERLLIEDARQ--------AVRELERDGLFTYC 712 Query: 337 AS 338 S Sbjct: 713 IS 714 >gi|198426249|ref|XP_002120426.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1937 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 68/204 (33%), Gaps = 25/204 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LDM++++D S S+ M R+I + ++ + + +V S++I+ Sbjct: 90 LDMVILIDSSSSVGSENWNIMKNF---VRTIINSFQRSATSTQISVLRYNRVVDTSTQIL 146 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L + G T + L Y + I + + + +I Sbjct: 147 LNEYLTDQSGFLAAYDRIPYNGGGTLTGNALRYVNDVILTG----ANGDRPG--VRDVLI 200 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV----QAEAADQFLKNCASPDRFYS 344 LTDG + L + +G Y +G+ A Q L+ S DR + Sbjct: 201 TLTDGRAHDNVLAPSRML------RAKGVETYVVGIQSRLGALRESQLLEISGSRDRMFI 254 Query: 345 VQNSRKLHDAFLRIGKEMVKQRIL 368 L F + + ++ Sbjct: 255 ------LTAGFASLSRSFANMLMM 272 >gi|82617834|gb|ABB84829.1| VIT-vWFA-RpoN multidomain protein [uncultured delta proteobacterium DeepAnt-1F12] Length = 1156 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 63/213 (29%), Gaps = 34/213 (15%) Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 + A + +++H L I + + D++ ++D S SM+ Sbjct: 256 FVARAAVGASLNAARSATAGGEDAAHGLLTIVPPQSKDALPCLPRDLICLIDTSGSMS-- 313 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT-----FPLAWGVQH 239 G + + ++ + L + R L+ F+ ++ + G Sbjct: 314 -GRPLAQAQRVVAALVDRL---------GDDDRLELIEFNHQVRRFRSEPVPATTLGKAA 363 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 + L G T+ + A + ++ +I +TDG Sbjct: 364 AMTWLGSLTAGGATEMHTAVLAALRPLRS-------------RAQRQVILITDGHIGFEQ 410 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 KE + E ++ +GV Sbjct: 411 QIVKELIENLPETS----RLHTVGVGTSVNRTL 439 >gi|327542166|gb|EGF28659.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 1014 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 52/136 (38%), Gaps = 21/136 (15%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + R+ G + P + A + D K++I ++DG+ Sbjct: 505 NRRAMLAAVGRMTPGDMPEFDPAMRMAVTGLVRT-----------DASVKHLIIISDGDP 553 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA--SPDRFYSVQNSRKL 351 P+ ++ K + + V++ + L++ A + ++Y+V++ R L Sbjct: 554 GPPSNSVIQAF------KDNSITISTVAVESHGLSDSRRLQDIARATGGKYYAVKSGRAL 607 Query: 352 HDAFLRIGKEMVKQRI 367 F R + + + I Sbjct: 608 PGIFQREARRVTRPLI 623 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 53/167 (31%), Gaps = 28/167 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + +M VLD S S+ M + +++ ++ + R+G++ F Sbjct: 69 RITVMYVLDQSESIPSSKRTAM---------LDYVIESVRRHRNATRGDRAGIIVFGRDA 119 Query: 228 VQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + P V I+ + L T L A A +D + Sbjct: 120 MIEIPPYDDNVPPIRRLESLLERTDATNLETALNLA-------------QASMPEDTSRR 166 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 I+ +TDG + S G + + V EA + L Sbjct: 167 IVIVTDGNENMGEARRIGSRIV-----DAGIGIDVVPVLKEAGGEVL 208 >gi|313235811|emb|CBY19795.1| unnamed protein product [Oikopleura dioica] Length = 1120 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 70/177 (39%), Gaps = 19/177 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLG-VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D+M+++D S + A ++ L+I I ++ V G ++ + V Sbjct: 42 DVMILMDSSACHQGERWDLIRNWSFKAATQLKRQLNIDDVITMESSRVAVGSFSYRTNDV 101 Query: 229 QTFPLAWGVQHIQ-EKINRLIFGSTTKSTPGLEYAYNKIFDA--KEKLEHIAKGHDDYKK 285 L++ + + R++ T G Y+ + D ++ H D K Sbjct: 102 ----LSFDDNKLSHNDLLRIVSNEHNNLTGG-STDYSGMVDNVISAFSARQSENHKDNDK 156 Query: 286 YIIFLTDGENSSPNIDNKESL-FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 Y++ +T+G+++ NI N + + + + G V + + K C SP++ Sbjct: 157 YVVLVTNGKDAEGNIGNFDQVDELASRLRSSGIRVIPVAIA--------KRC-SPEK 204 >gi|313221851|emb|CBY38921.1| unnamed protein product [Oikopleura dioica] Length = 888 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 70/177 (39%), Gaps = 19/177 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLG-VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D+M+++D S + A ++ L+I I ++ V G ++ + V Sbjct: 42 DVMILMDSSACHQGERWDLIRNWSFKAATQLKRQLNIDDVITMESSRVAVGSFSYRTNDV 101 Query: 229 QTFPLAWGVQHIQ-EKINRLIFGSTTKSTPGLEYAYNKIFDA--KEKLEHIAKGHDDYKK 285 L++ + + R++ T G Y+ + D ++ H D K Sbjct: 102 ----LSFDDNKLSHNDLLRIVSNEHNNLTGG-STDYSGMVDNVISAFSARQSENHKDNDK 156 Query: 286 YIIFLTDGENSSPNIDNKESL-FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 Y++ +T+G+++ NI N + + + + G V + + K C SP++ Sbjct: 157 YVVLVTNGKDAEGNIGNFDQVDELASRLRSSGIRVIPVAIA--------KRC-SPEK 204 >gi|288928456|ref|ZP_06422303.1| von Willebrand factor, type A [Prevotella sp. oral taxon 317 str. F0108] gi|288331290|gb|EFC69874.1| von Willebrand factor, type A [Prevotella sp. oral taxon 317 str. F0108] Length = 292 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 21/108 (19%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + I + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLDF------GTTQRTKREMATEMAAILAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I+ + T +EY + Sbjct: 126 SDRIEKYIPPKKGRKHILYIIHEMLDFKPESKRTNVAAAIEYLTRVMK 173 >gi|156933960|ref|YP_001437876.1| hypothetical protein ESA_01786 [Cronobacter sakazakii ATCC BAA-894] gi|156532214|gb|ABU77040.1| hypothetical protein ESA_01786 [Cronobacter sakazakii ATCC BAA-894] Length = 197 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 56/172 (32%), Gaps = 10/172 (5%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + ++ +L +K P ++TF S Q PL + ++ L Sbjct: 6 IEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPLT---DLLSFQMPALTASG 62 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT L + I +K KG + +TDG SPN D ++ L Sbjct: 63 TTSLGEALTLTASSIAKEVQKTTADTKGDWRP--LVFLMTDG---SPNDDWRKGLNDFKA 117 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV 363 A+ +V A +A LK +S + F + + Sbjct: 118 ARTG--VVVACAAGHDADTSALKEITEIVVQLDTADSSTIKAFFKWVSASIS 167 >gi|32477499|ref|NP_870493.1| hypothetical protein RB12409 [Rhodopirellula baltica SH 1] gi|32448053|emb|CAD77570.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 1032 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 52/136 (38%), Gaps = 21/136 (15%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + R+ G + P + A + D K++I ++DG+ Sbjct: 523 NRRAMLAAVGRMTPGDMPEFDPAMRMAVTGLVRT-----------DASVKHLIIISDGDP 571 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA--SPDRFYSVQNSRKL 351 P+ ++ K + + V++ + L++ A + ++Y+V++ R L Sbjct: 572 GPPSNSVIQAF------KDNSITISTVAVESHGLSDSRRLQDIARATGGKYYAVKSGRAL 625 Query: 352 HDAFLRIGKEMVKQRI 367 F R + + + I Sbjct: 626 PGIFQREARRVTRPLI 641 Score = 43.3 bits (100), Expect = 0.056, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 53/167 (31%), Gaps = 28/167 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + +M VLD S S+ M + +++ ++ + R+G++ F Sbjct: 87 RVTVMYVLDQSESIPSSKRTAM---------LDYVIESVRRHRNATRGDRAGIIVFGRDA 137 Query: 228 VQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + P V I+ + L T L A A +D + Sbjct: 138 MIEIPPYDDNVPPIRRLESLLERTDATNLETALNLA-------------QASMPEDTSRR 184 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 I+ +TDG + S G + + V EA + L Sbjct: 185 IVIVTDGNENMGEARRIGSRIV-----DAGIGIDVVPVLKEAGGEVL 226 >gi|254292588|ref|YP_003058611.1| hypothetical protein Hbal_0212 [Hirschia baltica ATCC 49814] gi|254041119|gb|ACT57914.1| conserved hypothetical protein [Hirschia baltica ATCC 49814] Length = 447 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 66/179 (36%), Gaps = 15/179 (8%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 +++ F + +G++S++ A+ + ++ V+ I+ S K + +L ++D + L A ++ Sbjct: 1 MSLYKFCKSTQGNVSVMAALFIGLLIAVIAGTIDISQKSSLNRELQSVVDAAALAAAREM 60 Query: 64 -LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII---- 118 ++ + + + + W + ++ I + + +I Sbjct: 61 AVSSADQTRVQSVASSYVDAN----WTGEQATTHAVLDVSKGIITVSSTAPKTIASILKK 116 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWC------ANSSHAPLLITSSVKISSKSDIGLDM 171 D + ++ AV+ C N + + ++ S SD M Sbjct: 117 DQKETNFYAEAVAEVSGGGNVCLIGLSDHEQGTINLQQRARITAENCQVYSNSDDRYSM 175 >gi|118361111|ref|XP_001013786.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89295553|gb|EAR93541.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 357 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 54/162 (33%), Gaps = 31/162 (19%) Query: 174 VLDVSLSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 V+D S SM+ F G + ++ + + ++ + + ++TFS++ Sbjct: 182 VIDTSGSMDFKFKVNGESISRIQFVKAQLTKTIN-----EQLKKYQKFNIITFSNQATYW 236 Query: 231 FPLAWGVQ-----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P IN+L T + GL+ A+ + Sbjct: 237 KPDVIDATPENILAAITYINKLGTSGATNISGGLDLAFR---------------SKEVLN 281 Query: 286 YIIFLTDGENSSPNIDNKESLFYC---NEAKRRGAIVYAIGV 324 I L+DG +S + + Y N+ ++ + I Sbjct: 282 TIYLLSDGVPNSGVMTIEGIKKYLTDKNQNRQEKVKINTISF 323 >gi|222637454|gb|EEE67586.1| hypothetical protein OsJ_25118 [Oryza sativa Japonica Group] Length = 755 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 53/190 (27%), Gaps = 44/190 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+D S SM H L ++ L + N ++TF+ ++ Sbjct: 334 AVVFVIDTSGSMQGH------PLENVKNAMSTALSELTEGDYFN------IITFNDELHS 381 Query: 230 TFPL--AWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + I ++ + + G T L A + A H Sbjct: 382 FSSCLEKVNEKSIASALDWINLNFVAGGGTDIMHPLNEAMASLSSA-----HDVLPQ--- 433 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA-------IVYAIGVQAEAADQFLKNC 336 I +TDG + C K + G+ FL+ Sbjct: 434 ---IFLMTDG-------SVDDEHNICQTVKTELISRGSKSPRISTFGLGLYCNHYFLRML 483 Query: 337 ASPDR-FYSV 345 AS R Y Sbjct: 484 ASIGRGHYDA 493 >gi|218679029|ref|ZP_03526926.1| hypothetical protein RetlC8_09174 [Rhizobium etli CIAT 894] Length = 151 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 51/139 (36%), Gaps = 5/139 (3%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 +SF + N G+++I+ ++ + + +G I+ + + L +D ++L A Sbjct: 5 LSFFS--RLIDNRDGAVAIIVILVAVPLLLAVGASIDFIRAYNNRVDLQSAVDSAVLAAA 62 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI-NNIERSTSLSIIID 119 K + + N F ++ + + ++ ++ + + + Sbjct: 63 AKYKHGMPEASISGTVNAFLSA--NGTLKSAVIGKPEVSSDEAELCLDVGDAVPTTFMQV 120 Query: 120 DQHKDYNLSAVSRYEMPFI 138 + +S S +P + Sbjct: 121 ANIQSVPISVRSCAALPGV 139 >gi|218200012|gb|EEC82439.1| hypothetical protein OsI_26857 [Oryza sativa Indica Group] Length = 863 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 53/190 (27%), Gaps = 44/190 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+D S SM H L ++ L + N ++TF+ ++ Sbjct: 442 AVVFVIDTSGSMQGH------PLENVKNAMSTALSELTEGDYFN------IITFNDELHS 489 Query: 230 TFPL--AWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + I ++ + + G T L A + A H Sbjct: 490 FSSCLEKVNEKSIASALDWINLNFVAGGGTDIMHPLNEAMASLSSA-----HDVLPQ--- 541 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA-------IVYAIGVQAEAADQFLKNC 336 I +TDG + C K + G+ FL+ Sbjct: 542 ---IFLMTDG-------SVDDEHNICQTVKTELISRGSKSPRISTFGLGLYCNHYFLRML 591 Query: 337 ASPDR-FYSV 345 AS R Y Sbjct: 592 ASIGRGHYDA 601 >gi|156405886|ref|XP_001640962.1| predicted protein [Nematostella vectensis] gi|156228099|gb|EDO48899.1| predicted protein [Nematostella vectensis] Length = 336 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 70/204 (34%), Gaps = 24/204 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +D+ ++D S S+ + K+ + + + +I V ++T S Sbjct: 118 EASVDLGFLIDSSASIGYQNFKSVRKM-------VDRIINVFTISPKQTHV--AIITISD 168 Query: 226 KI------VQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + ++ ++ L F T+ L A+ ++F + Sbjct: 169 RPAHVLRFNTLQGADLNSASVRRVVDNLRFTRDKTRIDLALRMAHKEMFS------KLGG 222 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE--AADQFLKNC 336 G +K ++ TDG + ++ K+ G + +GV+ E L Sbjct: 223 GRKHAQKILVVFTDGIQTWRPDKMEKLSKASEPLKKMGVKIIPVGVRGEEINVGSLLDMS 282 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGK 360 ++++ +L A ++ K Sbjct: 283 LDTYSVFNMEFFPELLQALKKMSK 306 >gi|312082747|ref|XP_003143572.1| von Willebrand factor type A domain-containing protein [Loa loa] gi|307761265|gb|EFO20499.1| von Willebrand factor type A domain-containing protein [Loa loa] Length = 307 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 75/236 (31%), Gaps = 33/236 (13%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSK---SDIGLDMMMVLDVSLSMNDHFGPGM 189 Y++ I N P ++ S ++ + LD++ +LD S S+ + F + Sbjct: 34 YQLIVITTAIKVINNGLAPPEIVHSPIRTKPRCIVKAEPLDLVFMLDSSGSLKNKFQDEI 93 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 D + + R L+ FS L + + +++ + L Sbjct: 94 DIIRRIVNHVTI----------GEPATRVMLIQFSGVQH----LEFNFKKFKDRDDILGA 139 Query: 250 -------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 T+ EY + + + D K I L+DG D Sbjct: 140 LDVLRHVSGITRMGDAFEYTLSMLNEKNGMR------SSDVPKIIYLLSDGRTHDYPKDT 193 Query: 303 KESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + + +YA G A ++ L P + + QN L F R Sbjct: 194 EMAEQLRQQI--DNVDIYAYGTGEYVAINELLAITKDPKKIVTNQNLDDLEPMFDR 247 >gi|119505574|ref|ZP_01627646.1| transporter [marine gamma proteobacterium HTCC2080] gi|119458683|gb|EAW39786.1| transporter [marine gamma proteobacterium HTCC2080] Length = 554 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 74/211 (35%), Gaps = 42/211 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+DVS SM L R I+++ D++K +PD R G++ ++ Sbjct: 96 LVIVIDVSESMATTDIAPSRLL----RGIQKIRDLLKRVPDK----RIGVIAYAGSAHTV 147 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL + + + + G ++ EYA I Y + + + Sbjct: 148 LPLTSDHEITESFLAVMTPGIAPRAGKFPEYALPGI---------DRLLAQSYYRSSVLM 198 Query: 291 -TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--------FLKNCASP-- 339 TDG + +C + + IVY G + Q L N AS Sbjct: 199 VTDGL---GARSPELLKSWCRD-RDYQLIVY--GFGDPSLSQSTVPLEREALMNLASSCN 252 Query: 340 DRFYSVQ--------NSRKLHDAFLRIGKEM 362 ++Y + L DAF + E Sbjct: 253 GKYYDATIDPGDVDAIAGALTDAFKIVDDEA 283 >gi|88707027|ref|ZP_01104724.1| von Willebrand factor type A domain protein, membrane [Congregibacter litoralis KT71] gi|88698755|gb|EAQ95877.1| von Willebrand factor type A domain protein, membrane [Congregibacter litoralis KT71] Length = 326 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 52/162 (32%), Gaps = 19/162 (11%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 P A + P S + +++ +D+S SMN+ +L A Sbjct: 68 VSIVLGPLVAFAVAGPSWQRGE---SPFAQDAAALIIAIDLSTSMNESDLQP-SRLQRAR 123 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST 256 D + + + + L+ ++ PL+ + ++ L G + Sbjct: 124 -------DKVLKLAEARGDAYTALIAYAGSAHTVLPLSDDSNMLLHYLDALKVGMLPRRG 176 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 E + A+ L G ++ +TDG N+ Sbjct: 177 KAPE---EVLPIAERLLAEQGHGGS-----LLIVTDGANNQS 210 >gi|327485391|gb|AEA79797.1| TPR domain protein in aerotolerance operon [Vibrio cholerae LMA3894-4] Length = 612 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|326429223|gb|EGD74793.1| hypothetical protein PTSG_07026 [Salpingoeca sp. ATCC 50818] Length = 8642 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 42/138 (30%), Gaps = 13/138 (9%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 + +++++V+D + +M D + R +R D + VR G V + Sbjct: 8416 LEIEVVIVMDCTGTMRQWMRAARDHVQEMVRFVR---DQAEKTYTGEAKVRLGFVAYRDY 8472 Query: 226 ---KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 ++ Q L + + I R A + + + Sbjct: 8473 DCREVTQQCDLTEDIDKVTAFIARQKATGGKD------KAEEVLSGLRAAASMKWTVNPS 8526 Query: 283 YKKYIIFLTDGENSSPNI 300 K ++ + D P Sbjct: 8527 AMKMVVVVADAPQHGPEF 8544 >gi|220917018|ref|YP_002492322.1| tyrosinase [Anaeromyxobacter dehalogenans 2CP-1] gi|219954872|gb|ACL65256.1| tyrosinase [Anaeromyxobacter dehalogenans 2CP-1] Length = 874 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 52/157 (33%), Gaps = 10/157 (6%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 M+MVLD S SM G G TR+ ++ + + + +F Sbjct: 344 AMVMVLDRSNSMTFDSGVG----AGVTRADVLRFSAPTAVVVLEDTNAMAVCSFDHDAHP 399 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK-KYII 288 P+ +L + +S + I + I D++ K ++ Sbjct: 400 GIPMT-----EAAGAGKLTISAAIQSYAPNPNGWTSIGEGVALAHGIVAPVTDHEVKALV 454 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 LTDGE + + + VYAIG+ Sbjct: 455 VLTDGEENHGPHARRYIHDVEDLIASLNGRVYAIGLG 491 >gi|47216852|emb|CAG11659.1| unnamed protein product [Tetraodon nigroviridis] Length = 1042 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 58/168 (34%), Gaps = 30/168 (17%) Query: 213 NNVVRSGLVTFSSK--IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDA 269 R G+V FS V + + ++ + + G T + + A+ + Sbjct: 717 QEAARMGVVLFSHTSLAVASLQPRSSLSELKATVRSMPYLGEGTFTGSAIHRAHQLFQAS 776 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY---CNEAKRRGAIVYAIGVQ- 325 + +K + LTDG+ D ++ + EA+ RG ++ IGV Sbjct: 777 RPG----------VRKVALVLTDGQ-----ADPRDVVQVGVSAAEAQARGIEMFVIGVMN 821 Query: 326 -----AEAADQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +K AS + +Q+ LH I ++ +Q Sbjct: 822 ESHPLYPDFSAEMKAIASNPKEGHVHLIQDFGSLHVLEKVIVNQICEQ 869 >gi|291226976|ref|XP_002733462.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 394 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 76/226 (33%), Gaps = 47/226 (20%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+ +D + SM + A +++R +++ I + + VR LV + Sbjct: 21 LDLAFAMDCTGSMG-------SYIATAQQNVRTIVEEI--VCKEKSDVRLALVEYRDHPP 71 Query: 229 QTF-------PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 Q V ++ ++ + + A ++I Sbjct: 72 QESTFVTRVHDFTSSVSRMKSWLDAASASGGGDTPEAVADALHQIL--------KLDWRQ 123 Query: 282 DYKKYIIFLTDG------------ENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--QAE 327 D K +F+ D +N P D + + N+ + +Y +G Sbjct: 124 DSTKICVFIADAPPHGLGCLGDGFQNGCP--DGLDPIQTGNQLAEKCITMYMVGCEPSIS 181 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIG---KEMVKQRIL 368 +F A + ++ ++N+ L IG +E+ QR++ Sbjct: 182 PYKEFFMAIAHMTGGQYVPLRNAALLSQVI--IGGAQEEISLQRLM 225 >gi|72008858|ref|XP_787086.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115969503|ref|XP_001184155.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 429 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 61/199 (30%), Gaps = 28/199 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV- 228 ++ V+DVS SM KL +++ MLD ++N ++TFS + Sbjct: 164 QIVFVIDVSASM------YGTKLSQTKEALKTMLD------NLNPTDYFNIITFSDGVQY 211 Query: 229 ------QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + ++ L S T L A K + + + D Sbjct: 212 WRENNRLAPAQRRYMDDAMAYVDSLRDDSETN----LNEAIVKAGELLDSEARYNRPGDS 267 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA----- 337 +I LTDG S D +E L E + +G L A Sbjct: 268 VYSMMILLTDGRPSVGTTDQQEILDNAREVIAGKHSLNILGFGRLVDFDLLVKLAYENNG 327 Query: 338 SPDRFYSVQNSRKLHDAFL 356 + Y + + F Sbjct: 328 TAKMIYEGTTAAEQLREFY 346 >gi|332838292|ref|XP_001152728.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4 isoform 4 [Pan troglodytes] Length = 1166 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 320 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 367 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 368 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 422 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 423 SLCNQAIMLISDG 435 >gi|332266684|ref|XP_003282331.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like, partial [Nomascus leucogenys] Length = 666 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 208 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 255 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 256 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 310 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 311 SLCNQAIMLISDG 323 >gi|319952925|ref|YP_004164192.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421585|gb|ADV48694.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 211 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 66/198 (33%), Gaps = 14/198 (7%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +LD S SM + ++ M++ ++S + + +VTF ++ Sbjct: 3 RLPVYFLLDTSGSMVGE------PIEALNNALSGMINTLRSDAQASETLWISIVTFDREV 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL +Q Q T + L+Y + K+ K KG + Sbjct: 57 KEIMPLT-DLQSFQLPEITCPQSGPTFTGKALDYLHEKVTKDLRKGTPEQKGDWKP--LL 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 TDG+ S D ++ + K + A A D LK Sbjct: 114 FLFTDGKPS----DVQQYKEVIPKIKALNFGAIVACAAGHLADDDKLKELTDTVVHLQTV 169 Query: 347 NSRKLHDAFLRIGKEMVK 364 +S L F + + + Sbjct: 170 DSNTLKQFFTWVSDTIEQ 187 >gi|152012483|gb|AAI50187.1| CACNA2D4 protein [Homo sapiens] Length = 601 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 291 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 338 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 339 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 393 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 394 SLCNQAIMLISDG 406 >gi|119609324|gb|EAW88918.1| calcium channel, voltage-dependent, alpha 2/delta subunit 4, isoform CRA_a [Homo sapiens] Length = 1120 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 274 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 321 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 322 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 376 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 377 SLCNQAIMLISDG 389 >gi|119609326|gb|EAW88920.1| calcium channel, voltage-dependent, alpha 2/delta subunit 4, isoform CRA_c [Homo sapiens] Length = 824 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 274 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 321 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 322 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 376 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 377 SLCNQAIMLISDG 389 >gi|114642818|ref|XP_522312.2| PREDICTED: voltage-gated calcium channel alpha(2)delta-4 subunit isoform 6 [Pan troglodytes] Length = 980 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 297 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 344 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 345 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 399 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 400 SLCNQAIMLISDG 412 >gi|114642814|ref|XP_001152605.1| PREDICTED: voltage-gated calcium channel alpha(2)delta-4 subunit isoform 3 [Pan troglodytes] Length = 1111 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 280 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 327 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 328 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 382 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 383 SLCNQAIMLISDG 395 >gi|114642812|ref|XP_001152399.1| PREDICTED: voltage-gated calcium channel alpha(2)delta-4 subunit isoform 2 [Pan troglodytes] Length = 1136 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 297 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 344 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 345 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 399 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 400 SLCNQAIMLISDG 412 >gi|114642816|ref|XP_001152346.1| PREDICTED: voltage-gated calcium channel alpha(2)delta-4 subunit isoform 1 [Pan troglodytes] Length = 1128 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 297 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 344 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 345 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 399 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 400 SLCNQAIMLISDG 412 >gi|114642810|ref|XP_001152797.1| PREDICTED: voltage-gated calcium channel alpha(2)delta-4 subunit isoform 5 [Pan troglodytes] Length = 1143 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 297 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 344 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 345 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 399 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 400 SLCNQAIMLISDG 412 >gi|22770594|gb|AAN06672.1| voltage-gated calcium channel alpha(2)delta-4 subunit [Homo sapiens] Length = 1120 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 274 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 321 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 322 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 376 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 377 SLCNQAIMLISDG 389 >gi|74205761|dbj|BAE23197.1| unnamed protein product [Mus musculus] Length = 751 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 63/176 (35%), Gaps = 23/176 (13%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIF 249 + + + I + + +V F+ F L + + + I + + Sbjct: 3 FNKIINFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFKLDSYKTKETLLDAIRHISY 60 Query: 250 -GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 G TK+ +++ + +F + + K I+ +TDG + + Sbjct: 61 KGGNTKTGKAIKHVRDTLFTS------DSGTRRGIPKVIVVITDGRSQD------DVNKI 108 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEM 362 E + G ++AIGV + ++ + P + V + DAF +I E+ Sbjct: 109 SREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDD----FDAFKKIEDEL 160 >gi|31873376|emb|CAD97679.1| hypothetical protein [Homo sapiens] gi|190690077|gb|ACE86813.1| calcium channel, voltage-dependent, alpha 2/delta subunit 4 protein [synthetic construct] gi|190691451|gb|ACE87500.1| calcium channel, voltage-dependent, alpha 2/delta subunit 4 protein [synthetic construct] Length = 1137 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 291 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 338 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 339 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 393 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 394 SLCNQAIMLISDG 406 >gi|324514578|gb|ADY45916.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 432 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 61/169 (36%), Gaps = 24/169 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S LD+++V+D+S + +A + +L R +TFS Sbjct: 238 SGCELDLVLVMDLSTT---THPIYQHYKEMAVELVGRLL-------IGPRFTRVAFITFS 287 Query: 225 SKIVQTFPLAWGV----QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S Q I E I R+ G TT G+ A + ++ + Sbjct: 288 SVGKSRTHFNLNRYDNAQQIIEAIRRVESTGGTTAVGEGIRIATQQ-----QEKRMGGRP 342 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 KK ++ TDG ++ + S EAK G ++Y+IG + Sbjct: 343 ISIAKKAMLIFTDGWSNKGPDPEEMS----KEAKAAGFVLYSIGYEGNG 387 >gi|331663616|ref|ZP_08364526.1| putative von Willebrand factor type A domain protein [Escherichia coli TA143] gi|331059415|gb|EGI31392.1| putative von Willebrand factor type A domain protein [Escherichia coli TA143] Length = 378 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAMEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|295698034|ref|YP_003602691.1| putative tellurium resistance protein [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295060146|gb|ADF64883.1| putative tellurium resistance protein [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 212 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 57/172 (33%), Gaps = 20/172 (11%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +V+D S SM + I+ ML ++ P V ++T+ + Sbjct: 3 RLPVYLVIDTSGSMRGE------SIHSVNVGIQAMLSALRQDPYALESVHISIITYDNGA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL I + T + LE + + + KG + Sbjct: 57 REFIPLTPLEDFQFSDI-VVPSAGGTFTGAALECLMQCVERDVRRSDGDTKGDWRP--LV 113 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKR---RGA-IVYAIGVQAEAADQFLKN 335 +TDG ++L Y K RG + A V +A + LK Sbjct: 114 FLMTDG-------TPSDALAYGEAVKAIRGRGFGSIIACAVGPKAGHEHLKQ 158 >gi|288919097|ref|ZP_06413437.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349537|gb|EFC83774.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 222 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 62/187 (33%), Gaps = 18/187 (9%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + L VLD S SM+ D L + D + + P V+ Sbjct: 1 MTSPAGGARCLPTYAVLDTSKSMSRFQQLLNDTLE-------NVYDGLWAKPAVSEFAHL 53 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEH 275 +++F++ P++ KI+ L G +T +I K+ Sbjct: 54 SIISFNTDAHVILPMSD-----IGKIDSLPMLACGGSTNYGKAFRLIATQI--DKDVTAL 106 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 A+G + + F+TDG + E R V G +A + L+ Sbjct: 107 RAQGRKVLRPAVFFITDGAPQDAGVWESEFAKLTAPDWPRHPHVITFGFG-DANEAVLRR 165 Query: 336 CASPDRF 342 ++ F Sbjct: 166 ISTKSAF 172 >gi|218699286|ref|YP_002406915.1| hypothetical protein ECIAI39_0892 [Escherichia coli IAI39] gi|300936567|ref|ZP_07151475.1| von Willebrand factor type A domain protein [Escherichia coli MS 21-1] gi|218369272|emb|CAR17029.1| conserved hypothetical protein [Escherichia coli IAI39] gi|300458301|gb|EFK21794.1| von Willebrand factor type A domain protein [Escherichia coli MS 21-1] Length = 378 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAMEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|254390750|ref|ZP_05005963.1| conserved hypothetical protein [Streptomyces clavuligerus ATCC 27064] gi|294812670|ref|ZP_06771313.1| von Willebrand factor type A [Streptomyces clavuligerus ATCC 27064] gi|326441175|ref|ZP_08215909.1| hypothetical protein SclaA2_08914 [Streptomyces clavuligerus ATCC 27064] gi|197704450|gb|EDY50262.1| conserved hypothetical protein [Streptomyces clavuligerus ATCC 27064] gi|294325269|gb|EFG06912.1| von Willebrand factor type A [Streptomyces clavuligerus ATCC 27064] Length = 452 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 48/153 (31%), Gaps = 29/153 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 G +++++D S SM + K+ A + +I + + V +V + Sbjct: 61 GGPGAAVVIMVDCSGSM----NYPVAKMDHAREA------TAAAIDSLRDGVEFAVVAGT 110 Query: 225 SKIVQTFP--------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V FP A + + L T L A + A+ ++ H Sbjct: 111 HHAVDVFPGGRRLATADATNRARAKAALRELRPNGGTAIGTWLTLADRLLSTARSEIRHG 170 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 I LTDG N + + S Sbjct: 171 -----------ILLTDGRNEHESPEALRSALAA 192 >gi|67677889|gb|AAH97100.1| Cfb protein [Danio rerio] Length = 761 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 79/216 (36%), Gaps = 29/216 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KIS LD+ + +D S S++ A + I+ +++ I N Sbjct: 248 KISLDRGGKLDIYIAVDASDSIDPK------DFDKAKKIIKTLIEKISYYEVSPNYE--- 298 Query: 220 LVTFSSKIVQTFPL--------AWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFDAK 270 ++ F++ + Q + A + + E ++ + T + Y KI D+ Sbjct: 299 ILMFATDVDQIVKMRDFKTNEKARNIVKVFEDLDNFNYDKKGDRTGTNIAKLYLKILDSM 358 Query: 271 EKLEHIAKGH-DDYKKYIIFLTDGE-----NSSPNIDNKESLFYCNEA-KRRGAIVYAIG 323 + K + II TDG+ N P +D ++L N A + +Y G Sbjct: 359 SLEQVQNKEDFLQTQHVIIVFTDGQANMGGNPKPKVDLIKNLVIKNNASRENKLDLYVFG 418 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAF 355 V + + + S F+ + + ++ + F Sbjct: 419 VGKDVKKEDMNGLVSEKKDERHFFKLPDLDEVQNTF 454 >gi|268608354|ref|ZP_06142081.1| von Willebrand factor type A [Ruminococcus flavefaciens FD-1] Length = 245 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 69/185 (37%), Gaps = 15/185 (8%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRSGLVTFSSK 226 + + V+D S SM K+G ++ E++ ++ I D ++ ++ ++ FS+ Sbjct: 15 MTLFYVIDTSGSMQGS------KIGQVESALEEVMQTLQEISDESDDAEIKIAVLEFSTG 68 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P + + T L A ++ + E + Y Sbjct: 69 ASWVTPEPVSPEGYR--FKSFEACGVTD----LGAACKELDKKLSRNEFLKTSAGAYPPV 122 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 I+ +DG + I +L N KR I +AIG A+ D + S + V+ Sbjct: 123 ILLFSDGGPTDNWISPLNTLKENNWFKRSIKIAFAIGDDADR-DVLARFSGSIETVLDVR 181 Query: 347 NSRKL 351 N +L Sbjct: 182 NKDQL 186 >gi|119470786|ref|ZP_01613397.1| hypothetical protein ATW7_05586 [Alteromonadales bacterium TW-7] gi|119446013|gb|EAW27292.1| hypothetical protein ATW7_05586 [Alteromonadales bacterium TW-7] Length = 631 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 68/216 (31%), Gaps = 33/216 (15%) Query: 96 LRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLI 155 L ++ D+ I + +I D K + +FC+ ++ + Sbjct: 25 LIKHKKTTDVQLIAPHLAQFVISDTNTKASQPLWLIA-----VFCSLGVFFSAGPSFEKK 79 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 V S + ++V+D+S SM T++ + LD+I+ + + Sbjct: 80 QVPVFQSKSAR-----VIVMDMSFSMYSTDILP----NRLTQARFKSLDMIELFKEGD-- 128 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKE 271 + LV ++ PL + I L + + GL+ A + A Sbjct: 129 --TALVAYAGTAYTISPLTNDATTLSNLIPSLSPDIMPDKGSNVLAGLDMAKELLNQAGY 186 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 II +TDG D + Sbjct: 187 IDGD-----------IILITDGIEQEEQSDVTSFIN 211 >gi|291514856|emb|CBK64066.1| Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [Alistipes shahii WAL 8301] Length = 294 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 77/237 (32%), Gaps = 28/237 (11%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 +I + + E++ G + +I + T+ + + + S V Y W Sbjct: 6 NDILKRVRKIEIKTRGLSNEIFAGKYHTAF------RGRGMSFSEVREYRAGDDVRDIDW 59 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + L MM+++DVS S D+L ++I + Sbjct: 60 NVTARSRK-----PHIKVYEEERELTMMLLVDVSAS---RMFGSTDRL---KKNIITEIA 108 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + + N + G + FS +I + P G HI I LI + L Sbjct: 109 AVLAFSAAQNNDKVGCIFFSDRIEKFIPPKKGRSHILMIIRELIGFRPESAGTKLSEPVR 168 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVY 320 + + +K D F+ +++ ++L G VY Sbjct: 169 FLTNVNKKRCTTFILSD-------FM---DSTGDKSALDDALKIAGSKHDLVGIRVY 215 >gi|83951471|ref|ZP_00960203.1| hypothetical protein ISM_12950 [Roseovarius nubinhibens ISM] gi|83836477|gb|EAP75774.1| hypothetical protein ISM_12950 [Roseovarius nubinhibens ISM] Length = 188 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 66/153 (43%), Gaps = 10/153 (6%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R F+ GS + + I LP++F+ ++ E FF + + D + YT +L+ Sbjct: 12 LRRFWRRDDGSFVVESVIALPLLFLAAMVIYE----FFEVHRFNSARDKAS-YTVADMLS 66 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 +E G ++ + + +I + ++LR + D++ + ++ + Sbjct: 67 REMGTVNTTYIDN-TKSLFDSIVDDNAGSQLRVTAISYDVD----TDRYAVYWSEVRGTG 121 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 +S ++ ++ T P ++ H L+ + S Sbjct: 122 PMSVLTTSDIATSHATLPLMSDGEHILLIESVS 154 >gi|333002408|gb|EGK21970.1| von Willebrand factor type A domain protein [Shigella flexneri K-218] Length = 378 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAMEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALINVGAQIAA 362 >gi|332285110|ref|YP_004417021.1| hypothetical protein PT7_1857 [Pusillimonas sp. T7-7] gi|330429063|gb|AEC20397.1| hypothetical protein PT7_1857 [Pusillimonas sp. T7-7] Length = 510 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 54/157 (34%), Gaps = 27/157 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D+S SM+ P +L A ++R++ ++GL+ +S Sbjct: 96 LIVAVDLSASMDGADVPP-TRLEAAKHTLRDL-------AVRRTGAKTGLIAYSGSSHLV 147 Query: 231 FPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P + + I L + +E + + A E++ Sbjct: 148 LPPTDDLNLLDLFIQALSTDLIEHPGRDAAGAIELGASML--AAERVGGT---------- 195 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++ +TDG +E A+ ++ +G Sbjct: 196 LLLMTDGAAGQQLDAVQER---AQAARDMQILIMVVG 229 >gi|198473081|ref|XP_001356166.2| GA18279 [Drosophila pseudoobscura pseudoobscura] gi|198139291|gb|EAL33226.2| GA18279 [Drosophila pseudoobscura pseudoobscura] Length = 1138 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 73/193 (37%), Gaps = 22/193 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM++ + + +LD + VN VV++ Sbjct: 150 AASSPKDIMILLDASSSMSEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKA 203 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 204 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGS---- 257 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK-NCA 337 + I+ +T+ + S K+ + + Y IG + + C+ Sbjct: 258 -GSQCNQAIMLITESTSESHKDIIKQYNWPHMPVR---IFTYLIGSDSSSRSNLHDMACS 313 Query: 338 SPDRFYSVQNSRK 350 + F + + + Sbjct: 314 NKGFFVQINDYEE 326 >gi|195175237|ref|XP_002028364.1| GL15442 [Drosophila persimilis] gi|194117953|gb|EDW39996.1| GL15442 [Drosophila persimilis] Length = 1149 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 73/193 (37%), Gaps = 22/193 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S SM++ + + +LD + VN VV++ Sbjct: 150 AASSPKDIMILLDASSSMSEK------SFDLGMATAFNILDTLGEDDFVNLITFSEVVKA 203 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P +Q I+ + + T T GLEYA++ + + Sbjct: 204 PVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGS---- 257 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK-NCA 337 + I+ +T+ + S K+ + + Y IG + + C+ Sbjct: 258 -GSQCNQAIMLITESTSESHKDIIKQYNWPHMPVR---IFTYLIGSDSSSRSNLHDMACS 313 Query: 338 SPDRFYSVQNSRK 350 + F + + + Sbjct: 314 NKGFFVQINDYEE 326 >gi|145298663|ref|YP_001141504.1| hemolysin-type calcium-binding repeat-containing protein [Aeromonas salmonicida subsp. salmonicida A449] gi|142851435|gb|ABO89756.1| hemolysin-type calcium-binding repeat protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 1156 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 33/155 (21%), Positives = 58/155 (37%), Gaps = 21/155 (13%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG--MDKLGVATRSIREMLDIIKSIPD 211 + + ++ S+ G ++M++LD S SM+D G ++ +A SI +++ + D Sbjct: 306 PVDLNTRVEEFSNPGTNLMIILDTSGSMDDASGVAGFATRMAIAKASILQLIGDYDDVGD 365 Query: 212 VNNVVRSGLVTFSSKIVQTF---PLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYN 264 V +VR LV F+S F W + I + T L A + Sbjct: 366 V--MVR--LVGFASSATTNFLGAGDVWLTATQALNVINGITDYLGNGGTDYDDALIKAMS 421 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 G + FL+DGE + Sbjct: 422 AY-----DSAGKIIGGQS---VLYFLSDGEPTEST 448 >gi|2159935|dbj|BAA20346.1| magnesium-chelatase subunit [Synechococcus elongatus PCC 7942] Length = 677 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 64/210 (30%), Gaps = 37/210 (17%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 G ++ ++D S SM ++++ A ++ +L N + L+ F Sbjct: 477 KAGALVIFLVDASGSMA------LNRMQSAKGAVIRLLTEA-----YENRDQVALIPFRG 525 Query: 226 K-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + P + ++++ ++ G GL A +A + D + Sbjct: 526 EQAEVLLPPTRSITAARKRLEKMPCGGGLPLAHGLTQAVRVGTNAAQ-------SGDIGQ 578 Query: 285 KYIIFLTDG--------------ENSSPNIDNKESLFYCNEAKRRGAIVYAI----GVQA 326 I+ +TDG E +E L + + I Sbjct: 579 VVIVAITDGRGNIPLARSLGQPMEEGEKPDLKEELLDIAKRIRGLSMQLLVIDTERKFVG 638 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDAFL 356 + L N A ++ + S + A Sbjct: 639 AGFGKELANAAGGQYYHLPKVSDQAIAAMA 668 >gi|253998722|ref|YP_003050785.1| outer membrane adhesin-like protein [Methylovorus sp. SIP3-4] gi|253985401|gb|ACT50258.1| outer membrane adhesin like proteiin [Methylovorus sp. SIP3-4] Length = 1536 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 71/198 (35%), Gaps = 30/198 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLD-------------MMMVLDVSLSMNDHF 185 T +++ + ++ ++ + D + +++ LD+S SM Sbjct: 981 VLTLNVGVSATANGVTSVGNLTVNVEDDSPVANPISANLSTTDTNLLITLDISGSMRTQD 1040 Query: 186 GP-GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 G G +L A +SI+ +LD ++ D R LV FS+ Q + + ++ Sbjct: 1041 GVGGTTRLASAIQSIKTLLDKYDALGD----TRISLVVFSTTAAQVGTDWMTIDQAKAQL 1096 Query: 245 NRLIFG---STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNI 300 ++++ T L A + DA + + + F++DGE N+ Sbjct: 1097 DQILVNGPKGNTNYDSALANAMDAFDDAGKL--------TNAQNVAYFISDGEPNTGSGS 1148 Query: 301 DNKESLFYCNEAKRRGAI 318 + + G Sbjct: 1149 NTSLTGSTNTNGSDAGIQ 1166 >gi|167824377|ref|ZP_02455848.1| hypothetical protein Bpseu9_11953 [Burkholderia pseudomallei 9] Length = 589 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 10 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 64 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 65 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 121 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 122 QSVPYFFL 129 >gi|26248447|ref|NP_754487.1| hypothetical protein c2599 [Escherichia coli CFT073] gi|300978923|ref|ZP_07174436.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|301048799|ref|ZP_07195797.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|331658152|ref|ZP_08359114.1| putative von Willebrand factor type A domain protein [Escherichia coli TA206] gi|26108852|gb|AAN81055.1|AE016763_14 Hypothetical protein yegL [Escherichia coli CFT073] gi|300299384|gb|EFJ55769.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|300409556|gb|EFJ93094.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|315294406|gb|EFU53754.1| von Willebrand factor type A domain protein [Escherichia coli MS 153-1] gi|331056400|gb|EGI28409.1| putative von Willebrand factor type A domain protein [Escherichia coli TA206] Length = 219 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 64/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGSAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ + IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFTIGVQGADMKTLAQ 171 >gi|260912476|ref|ZP_05919012.1| von Willebrand factor [Prevotella sp. oral taxon 472 str. F0295] gi|260633395|gb|EEX51549.1| von Willebrand factor [Prevotella sp. oral taxon 472 str. F0295] Length = 292 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 21/108 (19%), Positives = 43/108 (39%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + I + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLDF------GTTERTKREMATEMAAILAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I+ + T +EY + Sbjct: 126 SDRIEKYIPPKKGRKHILYIIHEMLDFKPESKRTNVAAAIEYLTRVMK 173 >gi|256820509|ref|YP_003141788.1| hypothetical protein Coch_1682 [Capnocytophaga ochracea DSM 7271] gi|256582092|gb|ACU93227.1| conserved hypothetical protein [Capnocytophaga ochracea DSM 7271] Length = 287 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 48/145 (33%), Gaps = 26/145 (17%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L +M+++DVS S + FG + + + + Sbjct: 58 NVTARYNEPFVKVFEEERELTLMLMIDVSGS--ELFGTKQE----FKSEVITEIAATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLEYAYNK 265 + N ++GL+ FS +I P G H+ I LI T + L+ Sbjct: 112 SALQNNDKTGLILFSDQIELYIPPKKGKSHVLRIIRELIEFQPKSFKTNISEALQ----- 166 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFL 290 KK I+F+ Sbjct: 167 -----------FLSRVTKKKAIVFM 180 >gi|167816001|ref|ZP_02447681.1| hypothetical protein Bpse9_12727 [Burkholderia pseudomallei 91] Length = 588 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 9 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 63 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 64 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 120 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 121 QSVPYFFL 128 >gi|294340316|emb|CAZ88697.1| putative Rubisco activation protein CbbO [Thiomonas sp. 3As] Length = 752 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 79/215 (36%), Gaps = 41/215 (19%) Query: 166 DIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + ++LD+S S+ND G L ++ ++ + I + D + F Sbjct: 559 GRDIAVTLLLDLSASLNDKVKGSDQTILELSQEAVSLLAWAIDRLGDP-----LAIAGFQ 613 Query: 225 SKIVQT--------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 S F WG ++ ++ + +T+ L +A H Sbjct: 614 SNTRHEVRYQHFKGFGEPWG-DEVKARLAAMQANYSTRMGAALRHA-----------AHS 661 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK-------RRGAIVYAIGVQAEAA 329 + KK ++ LTDGE + ID ++ +A+ R G Y I + A Sbjct: 662 LRARKSDKKLLLILTDGEPAD--IDVQDERLLIEDARQAVRELERNGLFTYCISLD-PRA 718 Query: 330 DQFLKNCASPDRFYSVQN----SRKLHDAFLRIGK 360 D ++ + R+ + N +L + FL + K Sbjct: 719 DAYVSD-IFGRRYTVIDNIERLPERLPEVFLALTK 752 >gi|239617788|ref|YP_002941110.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239506619|gb|ACR80106.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 365 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 75/248 (30%), Gaps = 51/248 (20%) Query: 127 LSAVSRYEMP-FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 L+ R +P + P ++ ++ G D++ ++DV+ SM Sbjct: 66 LAVKIRVSIPGYAEGLTPDNFLVFEDDRAQGFALVKEAEERRGADIVFIVDVTGSMG--- 122 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV-------- 237 ++ S+ + +++ VR G+V + + + Sbjct: 123 ----TEIAGVKNSMVNFIQALEA---GGLDVRVGIVPYGDYAPARADTSDNIGFDPAFLN 175 Query: 238 ----QHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + N L G + + YA+N + + ++ I Sbjct: 176 LSDPTVAEGYANGLGVGYGADGPENAYGAIMYAWNNMAWRRGT-----------QRIFIL 224 Query: 290 LTDGENSSPNIDNKESLFYCN--------EAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +TD S D + L + + + K V + A L AS Sbjct: 225 ITDAF-SHYTGDIADGLEWPSGWGPSNDFDPKYTKDEV----INALYGFATLHIVASTGG 279 Query: 342 FYSVQNSR 349 +Y+ ++ Sbjct: 280 YYNTTDTD 287 >gi|150021377|ref|YP_001306731.1| von Willebrand factor, type A [Thermosipho melanesiensis BI429] gi|149793898|gb|ABR31346.1| von Willebrand factor, type A [Thermosipho melanesiensis BI429] Length = 349 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 67/200 (33%), Gaps = 38/200 (19%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +++ +D+M+VLD + SM + + S++ +++ +++ + G++ Sbjct: 100 ETETRNQVDIMIVLDTTGSM-------YNAIEGVKNSVQNLIETLQA---SGLDAKVGII 149 Query: 222 TFSSKI-VQTFPLAWGV------QHIQEKINRLIFGSTTK----STPGLEYAYNKIFDAK 270 F + + L ++ ++ + G+ YA+N Sbjct: 150 PFDDAVPSKDITLTPNWLDLSDAGSAKDFVSNITAYGGADFPENPYAGIMYAFN------ 203 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 A +K II +TD ++ + + ++ I Sbjct: 204 -----NASWRASSQKIIILITD-ASAHYKSETYPGDAEGETLYDKDKVIDTIQGFVTVHG 257 Query: 331 QFLKNCASPDRFYSVQNSRK 350 F+ P +YS +S Sbjct: 258 AFI-----PGYYYSSTDSED 272 >gi|76811718|ref|YP_333349.1| hypothetical protein BURPS1710b_1950 [Burkholderia pseudomallei 1710b] gi|237812074|ref|YP_002896525.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254188638|ref|ZP_04895149.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254261132|ref|ZP_04952186.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|76581171|gb|ABA50646.1| putative membrane protein [Burkholderia pseudomallei 1710b] gi|157936317|gb|EDO91987.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|237503192|gb|ACQ95510.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254219821|gb|EET09205.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 602 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 23 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 78 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 134 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 135 QSVPYFFL 142 >gi|237724723|ref|ZP_04555204.1| conserved hypothetical protein [Bacteroides sp. D4] gi|229436918|gb|EEO46995.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] Length = 516 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 39/256 (15%), Positives = 83/256 (32%), Gaps = 31/256 (12%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 F NI S + Y + + + + P + T + Sbjct: 276 FNVQSANIIPIVEFSNTGNKSSYTYPSVWRIGDPIEELDLMLTYMTSPRLIPGMTTKKWE 335 Query: 161 ISSKSDIG-----LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S D G +D+++V+D S SM + + A + +L +S Sbjct: 336 YVSNDDNGTESKQMDLLLVVDTSGSMGSAMKESAN-MHQAVLASYGILSYFESTKSK--- 391 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + FS KI + ++E++ T G ++ ++I + Sbjct: 392 --VAFIGFSDKIDAYVDWSDKYDDVRERL-------LTNGHGGTKFPISRIKSTLDVRSR 442 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY-AIGVQAEAADQFLK 334 + +TDG+ + N Y N+ ++ +G ++ + + LK Sbjct: 443 DL--------VTVLITDGDLGNINESVSYFRDYLND--DNKLYIFLLVGSKSLHSYEPLK 492 Query: 335 NCASPDRFYSVQNSRK 350 N + Y+ N+ + Sbjct: 493 NI--GAKIYNANNANE 506 >gi|221197788|ref|ZP_03570834.1| membrane protein [Burkholderia multivorans CGD2M] gi|221181720|gb|EEE14121.1| membrane protein [Burkholderia multivorans CGD2M] Length = 634 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 41/267 (15%), Positives = 83/267 (31%), Gaps = 26/267 (9%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R +GS++++ AI + V IV+G I+ + +F + L + D + L + Sbjct: 16 RAGSRAQRGSVAVVAAIWIAVALIVLG-SIDVGNLYFQRRDLQRVADMTALAAVQSV--- 71 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-----SIIIDDQ 121 N+ Q + N T NGF D S S+ + Sbjct: 72 ---NDLCPQTDTTVTASGSNAVVTAAYRGAALNGF--DAQASGNSMSIACGRWDVSDYGA 126 Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 Y +A ++ + T P I+++ + + + L + S Sbjct: 127 AAGYFGTATNQLNAVRVVATKTVPLFFIGPPRTISAASTAKASNIDTFSIGTTLAMFGSN 186 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 D G + T + +L + + + S I LA + Sbjct: 187 QDCAGNSVSADQRNTGLVNALLGALLN------------TSLSLNIGSYQALACTRVKVG 234 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFD 268 + + + T + N++ Sbjct: 235 DLVKAQVGAGTVDQLLATKLTLNQLVS 261 >gi|218460583|ref|ZP_03500674.1| hypothetical protein RetlK5_14236 [Rhizobium etli Kim 5] Length = 309 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 38/266 (14%), Positives = 92/266 (34%), Gaps = 18/266 (6%) Query: 50 YILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKN-IWQTDFRNELRENGFAQDINNI 108 LD +L+ +I N E+ + KK+ +D+ + + N + + + + Sbjct: 1 SDLDAALIAAVKQIDNVEDADTLKKKVSDWFHAQVDNSYSLGEIEIDTANHNITATASGT 60 Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 +T + I + ++S S + P + + +L+ ++ + G Sbjct: 61 VPTTFMKI---ANIESVDVSVASAVKGPATSYLNVYIVIDTSPSMLLAATTSGQATMYAG 117 Query: 169 LDMMMVLDVSLS-------MNDHFGPGMDKL-----GVATRSIREMLDIIKSIPDVNNVV 216 + + +++ +K VA ++RE+LD+I + + Sbjct: 118 IGCQFACHTGDAHTIGKKKYANNYEYSSEKKIKLRADVAGDAVREVLDMIDESDANHERI 177 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK--LE 274 + GL + + + + ++ +G T+ ++ Y + K+K Sbjct: 178 KVGLYSLGDTLTEVLTPTLSTDTARTRLADASYGLTSSTSKAATYFDVSLATLKQKVGAG 237 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNI 300 K ++ LTDG S Sbjct: 238 GDGSTSGSPLKLVLLLTDGVQSQREW 263 >gi|153214418|ref|ZP_01949389.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|124115367|gb|EAY34187.1| conserved hypothetical protein [Vibrio cholerae 1587] Length = 644 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 VLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|114765751|ref|ZP_01444846.1| hypothetical protein 1100011001350_R2601_23570 [Pelagibaca bermudensis HTCC2601] gi|114541858|gb|EAU44894.1| hypothetical protein R2601_23570 [Roseovarius sp. HTCC2601] Length = 493 Score = 44.4 bits (103), Expect = 0.025, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 46/120 (38%), Gaps = 5/120 (4%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 + GS++ TA+LLP I I M ++ + ++ L D + L A + + Sbjct: 1 MLRDESGSVTAATAVLLPGILIGMAMLFDLLWLNNHRSHLQAQADMAALEAA-RYTGERP 59 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNEL---RENGFA-QDINNIERSTSLSIIIDDQHKD 124 + + + + + EL ++ F+ D ++ R + + + + K Sbjct: 60 SAVRQARVSVAVNDSFRAERLASRQIELGRWQDGSFSDMDASDPRRPNAARVTVRSEAKT 119 >gi|326671946|ref|XP_002663860.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 [Danio rerio] Length = 1082 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 51/133 (38%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ +I Sbjct: 251 DVVILVDVSGSMKGL------RLTIARQTVASILDTLGDDDFFN------IIAYNQEIHY 298 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P + H +E +++L L A+ + + + Sbjct: 299 VEPCLNGTLVQADSTNKDHFKEHLDKLFAKGIGLLGNALSEAFTILNEINQTGR-----G 353 Query: 281 DDYKKYIIFLTDG 293 + I+ +TDG Sbjct: 354 SSCSQAIMLITDG 366 >gi|302552806|ref|ZP_07305148.1| toxic cation resistance protein [Streptomyces viridochromogenes DSM 40736] gi|302470424|gb|EFL33517.1| toxic cation resistance protein [Streptomyces viridochromogenes DSM 40736] Length = 271 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 66/195 (33%), Gaps = 25/195 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +VLD S SM ++ L T ++ LD ++P +V F Sbjct: 83 TGTRAKVYLVLDRSASMRPYYKDGSAQALAEQTLALAAHLDPEATVP---------VVFF 133 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S+++ T + KI+ L G A + K A Sbjct: 134 STELDGTGEITLTDHE--NKIDTLHAGLGRMGRTSYHAAVEAVLAHHTKHATPATPAL-- 189 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGVQAEAADQF--LKNCASPD 340 +IF TDG +P+ + A + ++ + F L+ +P+ Sbjct: 190 ---VIFQTDG---APDAKTPATQALTEAAATHPTVFFSFVAFGDPENKAFDYLRKLKTPN 243 Query: 341 --RFYSVQNSRKLHD 353 F + + R+L D Sbjct: 244 TSHFLAGETPRELTD 258 >gi|146293561|ref|YP_001183985.1| TPR repeat-containing protein [Shewanella putrefaciens CN-32] gi|145565251|gb|ABP76186.1| TPR repeat-containing protein [Shewanella putrefaciens CN-32] Length = 663 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 56/178 (31%), Gaps = 28/178 (15%) Query: 136 PFIFCTFPWCAN--SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P F W + P + S+ + + + +V+D+S+SM ++L Sbjct: 57 PLHLLAFTWLIATFALAGPAVNKQSLPVFAAEQGRV---LVMDMSVSM-FATDLAPNRLT 112 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIF 249 A ++L +K +GLV F+ PL + + ++ Sbjct: 113 QAKFRATDLLRNLKEGE-------TGLVAFAGDAFTISPLTRDTGTLLNLLPTLSPEIMP 165 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + T GL A + II +TDG + +L Sbjct: 166 VRGSNLTAGLIQAKMLLAQGGHIRGD-----------IILMTDGITPHQFNEANSALS 212 >gi|54293628|ref|YP_126043.1| structural toxin protein RtxA [Legionella pneumophila str. Lens] gi|53753460|emb|CAH14915.1| structural toxin protein RtxA [Legionella pneumophila str. Lens] Length = 7919 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 49/256 (19%), Positives = 96/256 (37%), Gaps = 14/256 (5%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNI---WQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 ++ + N FSY IK + L ++ + ++IE + S S+ Sbjct: 6760 VSNSSLNGETFDIGLFSYNTIKTTPSEININMGLSLTDSDGDKVTSSIEINLSPSVFKVG 6819 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 ++ D S+ + + + + I + + I +LD S S Sbjct: 6820 ENVDDTSSSNVSHRVGGDTGVIDSSGGADILVGDVGGVEIIGTTARIAF----ILDESGS 6875 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 M+ +FG G +L V +++ ++L + + P+ + V LV F+S + T + Sbjct: 6876 MDQNFG-GTTRLEVLKQTMTDILTELSNTPNASITVH--LVKFASVVNGTGTFEITGGEL 6932 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 Q+ ++ I G + Y + + G D ++ + F TDG + Sbjct: 6933 QQALD-FISGLQIQQGLLAGTNYEAALGQTLQWYNSQSGTVDVQQTL-FFTDGAPTFYMD 6990 Query: 301 DNKESLFYCNEAKRRG 316 N S Y N A+ G Sbjct: 6991 GN--STEYTNIARVYG 7004 >gi|4139906|pdb|1AOX|A Chain A, I Domain From Integrin Alpha2-Beta1 gi|4139907|pdb|1AOX|B Chain B, I Domain From Integrin Alpha2-Beta1 Length = 203 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 74/214 (34%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 8 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 56 Query: 229 QTFPLAWGVQHIQEKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L +E++ G T + ++YA + A + G Sbjct: 57 VVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAA------SGGRRS 108 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFLK-- 334 K ++ +TDGE+ ++ K + CN + + I V A +K Sbjct: 109 ATKVMVVVTDGESHDGSM-LKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEI 164 Query: 335 ----NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + F++V + L + +G+++ Sbjct: 165 KAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 198 >gi|226196351|ref|ZP_03791933.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|254297788|ref|ZP_04965241.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157807412|gb|EDO84582.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|225931568|gb|EEH27573.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 602 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 23 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 78 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 134 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 135 QSVPYFFL 142 >gi|320187063|gb|EFW61773.1| Mg-chelatase subunit ChlD [Shigella flexneri CDC 796-83] Length = 1135 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 59/191 (30%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D P + L +R+ LV F + +V Sbjct: 973 QLVLLVDQSGSMVDSVIPS-----AVMAACLWQL----------PGIRTHLVAFDTSVV- 1016 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 1017 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 1061 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 1062 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 1108 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 1109 ALVNVGAQIAA 1119 >gi|82543488|ref|YP_407435.1| hypothetical protein SBO_0944 [Shigella boydii Sb227] gi|81244899|gb|ABB65607.1| conserved hypothetical protein [Shigella boydii Sb227] gi|332097119|gb|EGJ02102.1| VWA domain containing CoxE-like family protein [Shigella boydii 3594-74] Length = 378 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 59/191 (30%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D P + L +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDSVIPS-----AVMAACLWQL----------PGIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|320652019|gb|EFX20372.1| von Willebrand factor type A [Escherichia coli O157:H- str. H 2687] Length = 219 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADIKTLAQ 171 >gi|254413290|ref|ZP_05027061.1| magnesium chelatase ATPase subunit D [Microcoleus chthonoplastes PCC 7420] gi|196179910|gb|EDX74903.1| magnesium chelatase ATPase subunit D [Microcoleus chthonoplastes PCC 7420] Length = 676 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 67/207 (32%), Gaps = 40/207 (19%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ ++D S SM ++++ A ++ ++L N + L+ F Sbjct: 473 ARKAGALVVFLVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQNRDQVSLIPF 521 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + +++RL G + GL A +AK D Sbjct: 522 RGEQAEVLLPPTRSIALARGRLDRLPCGGGSPLAHGLTQAVRVGTNAK-------MSGDI 574 Query: 283 YKKYIIFLTDGENSSPNI--------------DNKESLFYCNEAKRRGAIVYAI-----G 323 + I+ +TDG + P E L + G + I Sbjct: 575 GQIVIVAITDGRGNIPLARSLGEPIPEGEKPDIKGELLEIAARIRAMGIQLLVIDTENKF 634 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRK 350 V A + K + ++Y + + Sbjct: 635 VSTGFAKELAKQ--AGGKYYHLPKATD 659 >gi|149376602|ref|ZP_01894362.1| hypothetical protein MDG893_00582 [Marinobacter algicola DG893] gi|149359120|gb|EDM47584.1| hypothetical protein MDG893_00582 [Marinobacter algicola DG893] Length = 602 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 24/128 (18%), Positives = 49/128 (38%), Gaps = 15/128 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P + + ++D +++VLD+SLSM D+L VA R IR++L Sbjct: 79 PSWRKAPTPLQQQND---SLVIVLDLSLSMLATDVTP-DRLTVAKRKIRDILKA------ 128 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIF 267 + LV +++ PL I+ ++ L + ++ G++ + + Sbjct: 129 -RQGSLTALVVYAADAHAVTPLTDDRNTIEGMLDVLEPVIMPAAGNRTDLGIQRGLDLLE 187 Query: 268 DAKEKLEH 275 Sbjct: 188 QGAPGTGR 195 >gi|148657455|ref|YP_001277660.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569565|gb|ABQ91710.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 429 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 80/229 (34%), Gaps = 34/229 (14%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIG---LDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 P ++ L T V+++S + ++ ++LD S SM + ++ A +++ Sbjct: 141 PTATPAASVVLPGTLGVRMASGERLPGATRNLAIILDASGSM-------LARIDGAPKTV 193 Query: 200 REMLDIIKSIPDVNNVVRSGLVTF--------SSKIVQTFPLAWGVQHIQEKINRLIFGS 251 +I + + L T+ S + P + +IN + + Sbjct: 194 IARQALIALVERLPATTNVALRTYGHRRADDCSDTELVQAPAPIQRADLINRINAIRPVN 253 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 ++ + D L D + I+ ++DG+ + D + + Sbjct: 254 GGRTPIA-----QSLEDMARDLA-----GVDGEVLIVLVSDGDETCGG-DPVATAAALHT 302 Query: 312 AKRRGAIVYAIGVQAEAA--DQFLKNCAS--PDRFYSVQNSRKLHDAFL 356 A R V IG E + L+ A+ ++ N+ +L DA Sbjct: 303 ANPR-LRVSVIGFNIEQEEWRRRLEGIAAYGGGAYFDAANAVQLADALE 350 >gi|15802553|ref|NP_288580.1| hypothetical protein Z3241 [Escherichia coli O157:H7 EDL933] gi|15832135|ref|NP_310908.1| hypothetical protein ECs2881 [Escherichia coli O157:H7 str. Sakai] gi|168750415|ref|ZP_02775437.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4113] gi|168757974|ref|ZP_02782981.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4401] gi|168761818|ref|ZP_02786825.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4501] gi|168769829|ref|ZP_02794836.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4486] gi|168775476|ref|ZP_02800483.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4196] gi|168782359|ref|ZP_02807366.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4076] gi|168788745|ref|ZP_02813752.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC869] gi|168799042|ref|ZP_02824049.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC508] gi|195938267|ref|ZP_03083649.1| hypothetical protein EscherichcoliO157_17851 [Escherichia coli O157:H7 str. EC4024] gi|208809428|ref|ZP_03251765.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4206] gi|208817158|ref|ZP_03258250.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4045] gi|208821068|ref|ZP_03261388.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4042] gi|209395812|ref|YP_002271316.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4115] gi|217329554|ref|ZP_03445633.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. TW14588] gi|254793857|ref|YP_003078694.1| hypothetical protein ECSP_2829 [Escherichia coli O157:H7 str. TW14359] gi|261225077|ref|ZP_05939358.1| hypothetical protein EscherichiacoliO157_10843 [Escherichia coli O157:H7 str. FRIK2000] gi|261257577|ref|ZP_05950110.1| hypothetical protein EscherichiacoliO157EcO_17392 [Escherichia coli O157:H7 str. FRIK966] gi|12516275|gb|AAG57135.1|AE005433_7 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933] gi|13362350|dbj|BAB36304.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai] gi|187768972|gb|EDU32816.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4196] gi|188015377|gb|EDU53499.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4113] gi|189000214|gb|EDU69200.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4076] gi|189355130|gb|EDU73549.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4401] gi|189361283|gb|EDU79702.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4486] gi|189367794|gb|EDU86210.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4501] gi|189371544|gb|EDU89960.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC869] gi|189378529|gb|EDU96945.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC508] gi|208729229|gb|EDZ78830.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4206] gi|208730777|gb|EDZ79467.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4045] gi|208741191|gb|EDZ88873.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4042] gi|209157212|gb|ACI34645.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4115] gi|209747972|gb|ACI72293.1| hypothetical protein ECs2881 [Escherichia coli] gi|209747974|gb|ACI72294.1| hypothetical protein ECs2881 [Escherichia coli] gi|209747976|gb|ACI72295.1| hypothetical protein ECs2881 [Escherichia coli] gi|217317322|gb|EEC25751.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. TW14588] gi|254593257|gb|ACT72618.1| conserved protein [Escherichia coli O157:H7 str. TW14359] gi|320191760|gb|EFW66408.1| hypothetical protein ECoD_01435 [Escherichia coli O157:H7 str. EC1212] gi|320641372|gb|EFX10827.1| von Willebrand factor type A [Escherichia coli O157:H7 str. G5101] gi|320646742|gb|EFX15629.1| von Willebrand factor type A [Escherichia coli O157:H- str. 493-89] gi|320668101|gb|EFX34980.1| von Willebrand factor type A [Escherichia coli O157:H7 str. LSU-61] gi|326338430|gb|EGD62258.1| hypothetical protein ECoA_05044 [Escherichia coli O157:H7 str. 1044] gi|326347047|gb|EGD70780.1| hypothetical protein ECF_00219 [Escherichia coli O157:H7 str. 1125] Length = 219 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADIKTLAQ 171 >gi|296412728|ref|XP_002836073.1| hypothetical protein [Tuber melanosporum Mel28] gi|295629876|emb|CAZ80230.1| unnamed protein product [Tuber melanosporum] Length = 997 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 87/241 (36%), Gaps = 56/241 (23%) Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 G D ++ R+T + + + Y+ T + AP T+S+ Sbjct: 431 GSNSDEDDDFRATKTTRRVSSVNSAYS-------------GTHGGGRSVVTAPTEYTASI 477 Query: 160 -----KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +IS I LD+++V+ VS SM K+ + S++ ++ ++ Sbjct: 478 RGAKPQISPAVHIPLDIVVVIPVSSSMQGL------KINLIRDSLKFLIH------NLGE 525 Query: 215 VVRSGLVTF--SSKIVQTFPL---AW-GVQHIQEKINRLIFGST----TKSTPGLEYAYN 264 R GLVTF SS V PL +W G + +N + T G A + Sbjct: 526 RDRMGLVTFGSSSGGVALTPLSVKSWSGWAKV---VNSIRPVGQKSLRTDVVDGANVAMD 582 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + + + I+ ++D S D + F + A+ +++ G+ Sbjct: 583 LLMQ---------RKSSNPIASILLISD----SSTSDTENVDFVVSRAEAAKITIHSFGL 629 Query: 325 Q 325 Sbjct: 630 G 630 >gi|224096382|ref|XP_002192789.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 4 [Taeniopygia guttata] Length = 1068 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 49/133 (36%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 245 DIVIIVDVSGSMKGL------RMTIAKHTIVTILDTLGENDFVN------IIAYNDYVHF 292 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H ++ + L + L ++ + + A Sbjct: 293 IEPCFKGILVQADRDNREHFKQLVEELQAKGVGTVSKALTESFKIL-----REFRDAGQG 347 Query: 281 DDYKKYIIFLTDG 293 + I+ +TDG Sbjct: 348 GLCNQAIMLITDG 360 >gi|325687363|gb|EGD29384.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK72] Length = 450 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 61/199 (30%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM G +D + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSM----GSTIDIVRANIN------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|223558083|gb|ACM91087.1| hypothetical protein BACUNI_01985 [uncultured bacterium Rlip1] Length = 292 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 49/129 (37%), Gaps = 13/129 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + +M+++DVS S + + R + L + + + N + G++ F Sbjct: 72 EEEREMTVMLLIDVSGSNDFGSQSQ------SKRDLTAELAAVLAFSAIQNNDKVGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 SSKI + P G HI I ++ + + + A ++ D Sbjct: 126 SSKIEKFIPPKKGSSHILRIIREIVDFKPVERGTDIGEGLRFLTSAIKRRTTAFLISD-- 183 Query: 284 KKYIIFLTD 292 F+TD Sbjct: 184 -----FMTD 187 >gi|296109872|ref|YP_003616821.1| von Willebrand factor type A [Methanocaldococcus infernus ME] gi|295434686|gb|ADG13857.1| von Willebrand factor type A [Methanocaldococcus infernus ME] Length = 370 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 55/168 (32%), Gaps = 39/168 (23%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 +++ LDVS SM L +A ++ +N + ++ F + Sbjct: 214 LVICLDVSGSMRGKKEEWAKALALAITNL-----------SINEGKKVHIIFFDDGVREV 262 Query: 228 -VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L+ I + +G T L+ A D Sbjct: 263 KEFNKKLSMND---ILYIASVFYGGGTNFEKPLKKAMEYNGD------------------ 301 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 IIF+TDGE I + + E K+RG +Y+ + ++ K Sbjct: 302 IIFITDGE---AEISKNFTNKFIEEKKKRGIKLYSFFINTKSTYSLKK 346 >gi|307154849|ref|YP_003890233.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306985077|gb|ADN16958.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 251 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 63/189 (33%), Gaps = 18/189 (9%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATR-SIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 L + D S SM ++ L A R SI EM + P+ +VR V FS Sbjct: 21 PLHFFWICDCSGSM--EIDGKIEALNYAIRESIPEMRKVANENPNAQLLVR--AVKFSDG 76 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 A + + + L T L E+L+ Sbjct: 77 AQWHVADATPIDNFTW--SDLTAQGMTDMGKALSL-------VAEQLKMPPMTDRALPPV 127 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY--S 344 ++ ++DG+ + + L ++ R A+ AI + +A + L+ S F Sbjct: 128 LVLISDGQPTDDFNRGLKDLL--DQPWGRKAVRIAISIGRDADKEVLEKFISHPEFQPLE 185 Query: 345 VQNSRKLHD 353 N+ L Sbjct: 186 ANNAAALIK 194 >gi|223936325|ref|ZP_03628237.1| conserved hypothetical protein [bacterium Ellin514] gi|223894843|gb|EEF61292.1| conserved hypothetical protein [bacterium Ellin514] Length = 318 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 22/99 (22%), Positives = 43/99 (43%), Gaps = 6/99 (6%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +M+++DVS S FG G + R + + + + + N + GL+ F+ ++ Sbjct: 77 LTLMLLVDVSGS--GLFGSG----DQSKRELAAEIASVMAFSAIRNNDKVGLILFTEEVE 130 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + P G +H+ I ++F K L A + Sbjct: 131 KFIPPRKGRRHVLRVIREILFYEPKKRGTNLNLALEFLT 169 >gi|126342421|ref|XP_001375707.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 895 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R LVT S Sbjct: 4 LLFLIDTSASMNQRTELGTTYLDIAKNAVEIFL-KLRARDPASRGDRYMLVT-SEDPPYC 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L+ A+ N++ + Sbjct: 62 IKAGWKENHATFMTELKNLQASGLTTLGQALKSAFDLLNVNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II TDG Sbjct: 122 PALIIAFTDG 131 >gi|319426862|gb|ADV54936.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 663 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 56/179 (31%), Gaps = 30/179 (16%) Query: 136 PFIFCTFPWCANSSHA--PLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKL 192 P F W + P + S+ + + + +V+D+S+SM P ++L Sbjct: 57 PLYLLAFTWLIATLALAGPAVNKQSLPVFAAEQGRV---LVMDMSVSMFATDLVP--NRL 111 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLI 248 A ++L +K +GLV F+ PL + + ++ Sbjct: 112 TQAKFRATDLLRNLKEGE-------TGLVAFAGDAFTISPLTRDTGTLLNLLPTLSPEIM 164 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 + GL A + II +TDG + +L Sbjct: 165 PVRGSNLAAGLIQAKMLLAQGGHIRGD-----------IILMTDGITPHQFNEANSALS 212 >gi|269969412|sp|C7G0B5|PIF_PINFU RecName: Full=PIF; Contains: RecName: Full=Pif97; Contains: RecName: Full=Pif80; AltName: Full=Aragonite-binding protein; Flags: Precursor gi|256252203|dbj|BAH97338.1| Pif177 [Pinctada fucata] Length = 1007 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 75/202 (37%), Gaps = 23/202 (11%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V + S D+++ +D S +D DKL A ML +++ + +N +R Sbjct: 18 VGVKSDECKTADVVVNVDAS---DDVSDQDFDKLKRA------MLMMVRGLSIDDNQIRL 68 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G+VT+ S++ + PL + I + +K G+ A Sbjct: 69 GMVTYGSEVCDSIPLQGDRLDLARTIRYMKKPTGPSKPFKGMGEARRMFSSRGRYNVPHI 128 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + I+ + + K+ + ++A+ V AIG+ A+ +++ A Sbjct: 129 TMNLGGD--IV----------DTEVKDLMDETDKARDEDIKVMAIGLGAKVDRDEIESIA 176 Query: 338 SP-DRFYSVQNSRKLHDAFLRI 358 D+ Y + + L I Sbjct: 177 YDRDQAYFMDDEDDLIRKVKEI 198 >gi|298491820|ref|YP_003721997.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233738|gb|ADI64874.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 224 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 L + V+ + + +++LD S SM ++ + + D + Sbjct: 4 TLKLDEVVEFAENPEPRCPCVLLLDTSGSMQGE------RIEALNQGLLTFKDELVKNSL 57 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDA 269 V ++TF S + ++ N L T G+ + I Sbjct: 58 AARRVEVAIITFDSHVNVVQDF-----VTADQFNPPILTAQGLTTMGAGINKSLEII--- 109 Query: 270 KEKLEHIAKGHDDYKKYIIFL-TDGENS---SPNIDNKESLFYCNEAKRRGAIVYAIGV 324 +E+ Y + +F+ TDGE I+ +EA +R + +GV Sbjct: 110 QERKSQYRTNGIAYYRPWVFMITDGEPQGELDNVIEQAVQRLQGDEANKR-VAFFTVGV 167 >gi|254281814|ref|ZP_04956782.1| hypothetical protein NOR51B_303 [gamma proteobacterium NOR51-B] gi|219678017|gb|EED34366.1| hypothetical protein NOR51B_303 [gamma proteobacterium NOR51-B] Length = 631 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 72/206 (34%), Gaps = 34/206 (16%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMD 190 R++ P I + P V + +++ ++++LD+S SM P Sbjct: 59 RWKNPVITVLLLIAVVAIAGPSTQKVDVPVYQRANA---VVVILDLSASMLAADVQPS-- 113 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 A R+ +++LD++ + +GLV ++ PL V+ I+ + L Sbjct: 114 ---RAQRARQKILDLLDRRTEGL----TGLVAYAGDAHVVAPLTDDVRTIENLLPVLSPD 166 Query: 251 ----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 + + +E A + A ++ +TD P +D + + Sbjct: 167 IMPLPGSDAGSAIEKAAELLGAAGMSSGT-----------LLLITD---EIPKLDTERTR 212 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 +GA + +GV Sbjct: 213 SL---LSDQGARLSILGVGTATGAPI 235 >gi|282861923|ref|ZP_06270986.1| cobaltochelatase subunit [Streptomyces sp. ACTE] gi|282562948|gb|EFB68487.1| cobaltochelatase subunit [Streptomyces sp. ACTE] Length = 677 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 50/140 (35%), Gaps = 18/140 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++G ++ +L + + G Sbjct: 483 QATREGREGNLVLFVVDASGSMA-----ARRRMGAVKGAVLSLL-----LDAYQRRDKVG 532 Query: 220 LVTFSS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LVTF P V ++ L G T GL A++ + ++E + Sbjct: 533 LVTFRGKDAEVALPPTSSVDAAAARLESLPTGGRTPLAAGLLKAHDVL-----RVERLRD 587 Query: 279 GHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 588 PSRRP--LLVVVTDGRATGG 605 >gi|282879635|ref|ZP_06288366.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1] gi|281306583|gb|EFA98612.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1] Length = 298 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 21/108 (19%), Positives = 41/108 (37%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + + + N + G+V F Sbjct: 74 EEERELTVMLLIDVSGSLDF------GTKKQTKREMVTEIAATLAFSAIQNNDKIGVVFF 127 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I + T +EY + Sbjct: 128 SDRIEKYIPPKKGRKHILYIIREMLDFEPQSRKTDVGMAIEYLTRMMK 175 >gi|114778217|ref|ZP_01453089.1| TPR domain protein [Mariprofundus ferrooxydans PV-1] gi|114551464|gb|EAU54019.1| TPR domain protein [Mariprofundus ferrooxydans PV-1] Length = 648 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 25/138 (18%), Positives = 52/138 (37%), Gaps = 23/138 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M+++LD+S SM D +L A + ++++L +GLV F+ Sbjct: 98 MVIILDLSRSM-DAGDLKPSRLQRAKQKVQDIL-------HKRTEGETGLVVFAGSAFDV 149 Query: 231 FPLAWGVQHIQEKI----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P+ + I + ++ + ++ L++A + K Sbjct: 150 VPMTTDNKAILALLPSLDTSMMPVQGSVASDALKHAGAMLKRGAIKHGS----------- 198 Query: 287 IIFLTDGENSSPNIDNKE 304 ++ LTDG ++ KE Sbjct: 199 VVMLTDGVDADATAVAKE 216 >gi|303252178|ref|ZP_07338346.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302648961|gb|EFL79149.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] Length = 532 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 84/245 (34%), Gaps = 21/245 (8%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 +S R F + G +++ +L I ++ + +E++ +A+L L+ ++L Sbjct: 4 ISLSQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLT 63 Query: 61 TKILNQENGNN------GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL 114 + + N+ K+ + F + + + Q T + Sbjct: 64 AENNSGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMATKFVKAFLPQTNEEKMHLTPV 123 Query: 115 SIIIDD-QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI------ 167 I++ K + S+ + W +I V ++SKS Sbjct: 124 CKTINNTNGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPHEVNVASKSKAFKKNTF 183 Query: 168 --GLDMMMVLDVSLSMNDHF------GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 +D+M+V D+S SM KLG+ + E+ + D N R Sbjct: 184 NIPIDLMVVADLSGSMRYDITNKYETNNETSKLGILKDVLIELAEKTLLSEDANQHNRIY 243 Query: 220 LVTFS 224 + F+ Sbjct: 244 VTPFA 248 >gi|108945862|gb|ABG23473.1| integrin subunit alpha 11 [Tupaia belangeri] Length = 139 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 48/125 (38%), Gaps = 12/125 (9%) Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGL 259 +++I+K ++ G+V + +V F L V+ + E + + T++ Sbjct: 8 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHLNDYRNVKDVVEAASHIEQRGGTETRTAF 67 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 + G KK +I +TDGE + D+ + ++++ Sbjct: 68 GIEF------ARSEAFQKGGRKGAKKVMIVITDGE----SHDSPDLEKVIQQSEKDNVTR 117 Query: 320 YAIGV 324 YA+ V Sbjct: 118 YAVAV 122 >gi|332534651|ref|ZP_08410483.1| TPR domain protein in aerotolerance operon [Pseudoalteromonas haloplanktis ANT/505] gi|332035931|gb|EGI72412.1| TPR domain protein in aerotolerance operon [Pseudoalteromonas haloplanktis ANT/505] Length = 643 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 45/137 (32%), Gaps = 23/137 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+D+S SM + S + LD+I+ + + + LV ++ Sbjct: 91 VIVMDMSYSMYSTDILPNRLMQ----SRFKALDMIELFKEGD----TALVAYAGTAYVIS 142 Query: 232 PLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + I L + + GL+ A + A I Sbjct: 143 PLTNDATTLSNLIPSLSPEIMPDKGSNVLAGLDMAKELLTQAGYIDGD-----------I 191 Query: 288 IFLTDGENSSPNIDNKE 304 I +TDG + D Sbjct: 192 ILVTDGIDQQEQSDVSS 208 >gi|220905238|ref|YP_002480550.1| OmpA/MotB domain-containing protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219869537|gb|ACL49872.1| OmpA/MotB domain protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 331 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 62/168 (36%), Gaps = 25/168 (14%) Query: 170 DMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 V+D S SM + DK+ VA + + ++ D N + + ++ + IV Sbjct: 33 SFDFVVDYSGSMMMKNAQLKQDKIIVAKN-VLQRINAAIPALDYNGGLHT--ISPNGVIV 89 Query: 229 QTFPLAWGVQHIQEKINRL-----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P W + I +L IFG T GL+ AY + ++ Sbjct: 90 AQGP--WDRNTMAAGIKKLRSDFQIFGRMTSMGTGLQ-AYEPFLSSMKRDAA-------- 138 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 +I +TDG+N+ + + A +R +V+ I + Sbjct: 139 ---VILVTDGDNNRGADLVEVARQM--YASQRNLVVHIISFADSKNGE 181 >gi|254416017|ref|ZP_05029773.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196177192|gb|EDX72200.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 744 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 54/177 (30%), Gaps = 32/177 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S K R + L+ + ++ FS Q Sbjct: 256 DVVFLIDTSGSQKGD---PFRKSQELMRRFIQGLNPQDTFTILD---------FSDITTQ 303 Query: 230 --TFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 PLA IN+L T + + + + + Sbjct: 304 LSAKPLANTPQNRIKALTYINQLKANGGT-----------YLLNGIRAVLNFPAAPEGRL 352 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 + I+ +TDG + + E Y + G +Y+ GV + L A R Sbjct: 353 RSIVLITDGYIGNESEILAEVKQYL----KSGNRLYSFGVGSSPNRFLLNRMAELGR 405 >gi|113867619|ref|YP_726108.1| hypothetical protein H16_A1608 [Ralstonia eutropha H16] gi|113526395|emb|CAJ92740.1| conserved hypothetical protein containing von Willebrand factor (vWF) type A [Ralstonia eutropha H16] Length = 547 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 60/198 (30%), Gaps = 29/198 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + + +D+S +M D +L A IK++ R+ + ++ Sbjct: 99 LAIAVDLSRTM-DAVDVTPTRLERAKL-------KIKALLARRGGGRTAIYAYAGSTHLV 150 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 PL +Q ++ L + A I L I+FL Sbjct: 151 LPLTDDTNLLQTFVDALQTRIMPVPGRDMAQALRTI---DADLAREPVPGT-----ILFL 202 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 TDG +D + +A + + + E L+N A + +R Sbjct: 203 TDG------VDPAAGRAFRAQADSGRSQPVVLAIGTEQGGP-LRNAAGG--YVEQDGARV 253 Query: 351 L----HDAFLRIGKEMVK 364 AF R G + Sbjct: 254 FARLDVAAFKRFGDDSGT 271 >gi|5748800|gb|AAD32681.3|AF141140_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 454 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 65/175 (37%), Gaps = 23/175 (13%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V ++ K D G D+M +LDVS M ++F +++ ++ K + Sbjct: 180 TIDVTVTVTPKEIDEGADVMALLDVSQKMTKENFDKAKEQIKKMVTTLTGEPTDGKENRN 239 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAY 263 N VR L+TF KI + L+ I +K+ + + + A Sbjct: 240 RRNSVR--LMTFYRKISEPIDLSGKTSEEVEKELDNIWDKVKKEDWDWGVDLQGAIHKAR 297 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + KE + ++I+ + GE++ N + N ++ Sbjct: 298 DIFKKEKESKKR---------QHIVLFSQGESTFSYDINDKDKN--NTVRKNRIT 341 >gi|229527850|ref|ZP_04417241.1| TPR domain protein in aerotolerance operon [Vibrio cholerae 12129(1)] gi|229334212|gb|EEN99697.1| TPR domain protein in aerotolerance operon [Vibrio cholerae 12129(1)] Length = 620 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|258623678|ref|ZP_05718664.1| hypothetical protein VMD_37100 [Vibrio mimicus VM573] gi|258584044|gb|EEW08807.1| hypothetical protein VMD_37100 [Vibrio mimicus VM573] Length = 562 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 47/139 (33%), Gaps = 17/139 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S + +M++LD S SM D+L + + I ++ + ++G Sbjct: 91 EASPFGEDSASLMVLLDSSESMQQKDIAP-DRLTRSKQKILDLTEA-------RKGGKTG 142 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L+ F+ PL + +Q + + + A N + + Sbjct: 143 LMVFAGSAHVAMPLTSDNRVLQPYLAAINPNVMPVEGKAAQSALNLLHEQLPPYVGNT-- 200 Query: 280 HDDYKKYIIFLTDGENSSP 298 ++ +TDG S Sbjct: 201 -------LLLVTDGVTDST 212 >gi|239940333|ref|ZP_04692270.1| putative magnesium-chelatase subunit [Streptomyces roseosporus NRRL 15998] gi|291443766|ref|ZP_06583156.1| chelatase [Streptomyces roseosporus NRRL 15998] gi|291346713|gb|EFE73617.1| chelatase [Streptomyces roseosporus NRRL 15998] Length = 678 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 50/140 (35%), Gaps = 18/140 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++ ++ +L + + G Sbjct: 484 QATREGREGNLVLFVVDASGSMA-----ARQRMSAVKGAVLSLL-----LDAYQRRDKVG 533 Query: 220 LVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LVTF + P V ++ L G T GL A++ + ++E + Sbjct: 534 LVTFRGREAEVALPPTSSVDAAAARLESLPTGGRTPLAAGLLKAHDVL-----RVERLRD 588 Query: 279 GHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 589 PSRRP--LLVVVTDGRATGG 606 >gi|160720|gb|AAA29775.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 574 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 69/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYASVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGKTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|118361109|ref|XP_001013785.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89295552|gb|EAR93540.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 368 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 48/161 (29%), Gaps = 21/161 (13%) Query: 174 VLDVSLSMNDHF---GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 V+D+S SM+ F G + +L + + L NV+ G K Sbjct: 193 VIDISGSMDYTFKANGETISRLAFVKSQLTKTLAEQLKPYQKFNVIIFGNSASQWKTDYI 252 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 +Q IN+L T + GL+ A+N I L Sbjct: 253 DATPENIQAAIAYINKLTTNGATNISSGLDLAFNTKQALNG---------------IYLL 297 Query: 291 TDGENSSPNIDNKESLFYC---NEAKRRGAIVYAIGVQAEA 328 +DG +S Y N ++ + I Sbjct: 298 SDGVPNSGVQTVDGIKKYLADKNASRNEKVHINTISFIMGG 338 >gi|124513464|ref|XP_001350088.1| Thrombospondin-related anonymous protein, TRAP [Plasmodium falciparum 3D7] gi|160691|gb|AAA29767.1| sporozoite surface protein 2 [Plasmodium falciparum] gi|23615505|emb|CAD52497.1| Thrombospondin-related anonymous protein, TRAP [Plasmodium falciparum 3D7] Length = 574 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 69/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYASVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGKTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|326670400|ref|XP_003199206.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Danio rerio] Length = 1109 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 73/193 (37%), Gaps = 34/193 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ + Sbjct: 276 DVVILVDVSGSMKGL------RLTIARQTVSSILDTLGDDDFFN------VIAYNEDLHY 323 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P G H ++++++L+ L A+ + + + Sbjct: 324 VEPCLNGTLVQADITNKDHFRQRLDKLLAKGIGMLDVALTEAFELLSNFNQTGR-----G 378 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV--YAIGVQAEAADQFL-KNCA 337 + + I+ +TDG ++ +++F R V Y IG ++ AD CA Sbjct: 379 SECSQAIMLVTDG-----AVETYDAVFAVYNWPDRKVRVFPYLIGRESAFADNLKWMACA 433 Query: 338 SPDRFYSVQNSRK 350 + F + Sbjct: 434 NKGYFTQISTLAD 446 >gi|225310537|emb|CAQ19229.1| collagen type XXVIII alpha 1 b precursor [Danio rerio] Length = 215 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 70/192 (36%), Gaps = 20/192 (10%) Query: 146 ANSSHAPLLITSSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 N ITS+V +K + L++ ++D S S D+ G + + Sbjct: 40 RNGRRKSKPITSNVIPKNKDENCNLELAFLVDSSESAKDNHGQE----KSFVTDLVNHIP 95 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-W-GVQHIQEKINRLIFGS-TTKSTPGLEY 261 I+ R+ L+ +SS ++ W GV Q ++ + F T +T + Sbjct: 96 NIRLQTGQGLNFRTALLQYSSHVITEQSFKDWRGVPSFQSRVASIPFIGHGTYTTYAI-- 153 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 ++ + K I + G + N D +L +AK +G + Sbjct: 154 -------TNLTRIYLEESGPGTVKVAILMYGGASHPKNPDIFSALA---DAKNQGIKFFI 203 Query: 322 IGVQAEAADQFL 333 +G+ + A + L Sbjct: 204 VGLTSAANMEKL 215 >gi|191170005|ref|ZP_03031559.1| von Willebrand factor type A domain protein [Escherichia coli F11] gi|300981669|ref|ZP_07175655.1| von Willebrand factor type A domain protein [Escherichia coli MS 200-1] gi|190909521|gb|EDV69106.1| von Willebrand factor type A domain protein [Escherichia coli F11] gi|281179171|dbj|BAI55501.1| conserved hypothetical protein [Escherichia coli SE15] gi|300307470|gb|EFJ61990.1| von Willebrand factor type A domain protein [Escherichia coli MS 200-1] gi|324006449|gb|EGB75668.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] gi|324013878|gb|EGB83097.1| von Willebrand factor type A domain protein [Escherichia coli MS 60-1] Length = 219 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 64/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ + IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFTIGVQGADMKTLAQ 171 >gi|254179953|ref|ZP_04886552.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184210493|gb|EDU07536.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 602 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 78 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 134 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 135 QSVPYFFL 142 >gi|283778759|ref|YP_003369514.1| hypothetical protein Psta_0969 [Pirellula staleyi DSM 6068] gi|283437212|gb|ADB15654.1| hypothetical protein Psta_0969 [Pirellula staleyi DSM 6068] Length = 343 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 69/211 (32%), Gaps = 48/211 (22%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + ++ + G + V D S SM + G + + ++++ I + +V+ Sbjct: 159 SEAETQVFGITGKGSRFVYVFDRSGSMEGYGGRPI------AAAKEQLIESISHLGNVHQ 212 Query: 215 VVRSGLVTFSSKIVQTFPL-----------AWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 ++ ++ + P + Q + ++ T+ L+ A Sbjct: 213 FQ---IIFYNERPEVMNPFRGQSLQLFFGNERDKELAQRFVRGIVASGGTEHLVPLKMAL 269 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 D I FLTD + + + ES+ N RG ++AI Sbjct: 270 KLDPD-----------------VIFFLTDADEPALRPEELESIHRMN----RGTTIHAIE 308 Query: 324 VQAEAADQ---FLKNCASPD----RFYSVQN 347 A FL+ A + R+ V + Sbjct: 309 FGAGPKASRWNFLQQIAKENGGEHRYIDVTS 339 >gi|260575021|ref|ZP_05843022.1| von Willebrand factor type A [Rhodobacter sp. SW2] gi|259022643|gb|EEW25938.1| von Willebrand factor type A [Rhodobacter sp. SW2] Length = 558 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 72/229 (31%), Gaps = 46/229 (20%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 +CA PL + K ++VLD S SM G KL +A ++ Sbjct: 12 LAFCALGLATPLAAQAEGKS----------IIVLDGSGSM-WGQIDGRAKLEIAREALSG 60 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQT--------FPLAWGVQHIQEKINRLIFGSTT 253 +L + + GL+ + + P A I + F T Sbjct: 61 VLAGVAPETE------LGLMVYGHRSKGDCNDIELMVPPAAGTGPAIAAAAAGMQFLGKT 114 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 T + A ++ +EK +I +TDG + + + E + Sbjct: 115 PLTEAVRRAAAELRSTEEKAT------------VILITDGIETC----DADPCALGAELE 158 Query: 314 RRGA--IVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLHDAFLR 357 G + +G A + C A+ + +++ L +A Sbjct: 159 ASGVDFTAHVVGFGLTAEEGKQVACLADATGGLYIEAKDAGSLTEALKT 207 >gi|4650825|gb|AAD26900.1|AF082074_1 serum opacity factor [Streptococcus pyogenes] Length = 1026 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 63/145 (43%), Gaps = 6/145 (4%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 L+ T + + D G D+M +LDVS M ++F +++ ++ K + Sbjct: 214 LMFTGNGLKPKQIDEGADVMALLDVSQKMTKENFDKAKEQIKKMVTTLTGEPTDGKENHN 273 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 N VR L+TF K+ + L +++ + ++ + + G++ I A+E Sbjct: 274 RRNSVR--LMTFYRKVNEPIELT--AENVDKTLDEVWKKAKEDWDWGVDLQ-GAIHKARE 328 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENS 296 + +++I+ + GE++ Sbjct: 329 IFNKEKEKKSGKRQHIVLFSQGEST 353 >gi|327470147|gb|EGF15611.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK330] Length = 450 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 61/199 (30%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM G +D + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSM----GSTIDIVRANIN------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|315223610|ref|ZP_07865464.1| von Willebrand factor [Capnocytophaga ochracea F0287] gi|314946391|gb|EFS98386.1| von Willebrand factor [Capnocytophaga ochracea F0287] Length = 287 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 42/115 (36%), Gaps = 10/115 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L +M+++DVS S + FG + + + + Sbjct: 58 NVTARYNEPFVKVFEEERELTLMLMIDVSGS--ELFGTKQE----FKSEVITEIAATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLE 260 + N ++GL+ FS +I P G H+ I LI T + L+ Sbjct: 112 SALQNNDKTGLILFSDQIELYIPPKKGKSHVLRIIRELIEFQPKSFKTNISEALQ 166 >gi|269125512|ref|YP_003298882.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310470|gb|ACY96844.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 814 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 78/224 (34%), Gaps = 33/224 (14%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P + V+ SK + + M+ ++D+S SM G G+ ++ R + L +P+ Sbjct: 594 PATVQELVQAWSKLSLSIRMLSIIDISGSMLAPVGGGLTRMQATARVAQGGL---SLLPN 650 Query: 212 VNNVVRSGLVTFSSKIVQTF---------PLA------WGVQHIQEKINRLIFGSTTKST 256 + G FS+K+ PL Q + ++R+ T + Sbjct: 651 DSE---LGQWVFSTKLDGDKDYKEIVPMGPLGERVGSVTRRQLLLSSLSRIEPKPTGDTG 707 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN-SSPNIDNKESLFYCNEAKR- 314 Y+ I A + K ++ I+ TDG+N +++L Sbjct: 708 L-----YDTILAAYRYMSKTYKP--EFGNSILLFTDGKNEDDDGPTLRQTLRELESMIDP 760 Query: 315 -RGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 R V +G LK A + Y QN ++ F Sbjct: 761 TRPIQVIMLGFGPGVDVNELKQIAKVTRGDVYVTQNPNEIQKIF 804 >gi|239979500|ref|ZP_04702024.1| hypothetical protein SalbJ_08693 [Streptomyces albus J1074] gi|291451369|ref|ZP_06590759.1| conserved hypothetical protein [Streptomyces albus J1074] gi|291354318|gb|EFE81220.1| conserved hypothetical protein [Streptomyces albus J1074] Length = 452 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 45/154 (29%), Gaps = 36/154 (23%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT--------FS 224 +++D S SM+ K+ A + R ++ + + R ++ + Sbjct: 61 LMVDCSGSMD----HPSSKMRHARDATRT------AVAALRDGTRFAVIAGTHVAREVYP 110 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + +E + RL G T L A + + H Sbjct: 111 GGGRLATADDRTREEAREAVLRLRAGGGTAIGSWLRLAEQLLTSEPAGVRHG-------- 162 Query: 285 KYIIFLTDGENSS-------PNIDNKESLFYCNE 311 I LTDG N +D F C+ Sbjct: 163 ---ILLTDGRNEHETPEELRAALDLCAGRFTCDA 193 >gi|194288834|ref|YP_002004741.1| flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] gi|193222669|emb|CAQ68672.1| putative flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] Length = 418 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 11/50 (22%), Positives = 25/50 (50%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 G+++I+ + L V+ +GL ++ + K++L +D L A + Sbjct: 19 GAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVT 68 >gi|134282290|ref|ZP_01768995.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246328|gb|EBA46417.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 602 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 78 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 134 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 135 QSVPYFFL 142 >gi|311978223|ref|YP_003987343.1| putative ariadne-like ring finger protein [Acanthamoeba polyphaga mimivirus] gi|82000061|sp|Q5UQ35|YR811_MIMIV RecName: Full=Putative ariadne-like RING finger protein R811 gi|55417421|gb|AAV51071.1| unknown [Acanthamoeba polyphaga mimivirus] gi|308205062|gb|ADO18863.1| putative ariadne-like ring finger protein [Acanthamoeba polyphaga mimivirus] Length = 990 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 60/175 (34%), Gaps = 33/175 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + +D+ +V+D + SM G + L + + I +++DI I ++N ++ + Sbjct: 2 DNQAVVDLAIVVDATGSM----GTFLSSLSESLQQIVQIIDITNVIQNIN------IIMY 51 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL------ 273 + G + I L + + A N + D + Sbjct: 52 RDYCDSVITASSGWVSKIDDLIPFIRGLRASGGGDTPEAGKTAANNLLDVVKNNTIVIWY 111 Query: 274 -----EHIAKGHDDYKKYIIFLTDGENSSPNIDNK-ESLFYCNEAKRRGAIVYAI 322 H + D++ + I N+ D + + C+ R IVY I Sbjct: 112 ADAPPHHKSNARDNFAREI-------NTLIGSDKIFDWIELCDTLAARNIIVYPI 159 >gi|126440828|ref|YP_001058814.1| hypothetical protein BURPS668_1775 [Burkholderia pseudomallei 668] gi|126220321|gb|ABN83827.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 602 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 78 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 134 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 135 QSVPYFFL 142 >gi|224024931|ref|ZP_03643297.1| hypothetical protein BACCOPRO_01662 [Bacteroides coprophilus DSM 18228] gi|224018167|gb|EEF76165.1| hypothetical protein BACCOPRO_01662 [Bacteroides coprophilus DSM 18228] Length = 289 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 43/109 (39%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSNSLDF------GTIQQMKKDMVTEIAATIAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFD 268 S +I + P G +HI I L T +EY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLGFKPESKRTNLQGAIEYLTNVLKK 174 >gi|167894484|ref|ZP_02481886.1| hypothetical protein Bpse7_12104 [Burkholderia pseudomallei 7894] Length = 587 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 8 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 62 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 63 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 119 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 120 QSVPYFFL 127 >gi|160882570|ref|ZP_02063573.1| hypothetical protein BACOVA_00521 [Bacteroides ovatus ATCC 8483] gi|156112014|gb|EDO13759.1| hypothetical protein BACOVA_00521 [Bacteroides ovatus ATCC 8483] Length = 475 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 61/184 (33%), Gaps = 34/184 (18%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM +S + + + V + FS I + Sbjct: 313 LDTSGSMAGER-------ERIAKSTLLAIAELTEVQHRKCYV----ILFSDDIE-CIEIT 360 Query: 235 WGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ L F T P + +A KI + II ++D Sbjct: 361 DLGSSFDRLVDFLSQSFHGGTDMEPVITHALRKISEEGYMEAD-----------IITVSD 409 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASPDRF--YSVQNSR 349 E + ++ AK + +YAI + A +LK C D++ YSVQN+ Sbjct: 410 FEMRPVDQLLSRTI---EHAKAKQTKMYAISLGGKSAETSYLKLC---DKYWEYSVQNAE 463 Query: 350 KLHD 353 L+ Sbjct: 464 SLNK 467 >gi|310824462|ref|YP_003956820.1| hypothetical protein STAUR_7237 [Stigmatella aurantiaca DW4/3-1] gi|309397534|gb|ADO74993.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 913 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 66/199 (33%), Gaps = 26/199 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + +++D S SM G K+ VA + L ++ D + + Sbjct: 390 QRRTSVALSVLMDCSCSMGVTVPDGRTKMEVAAEGVVGALTLLNEKDDAS------VHMV 443 Query: 224 SSKIVQTFPLAWGVQHI-QEKINRLIFGSTTK-STPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++ + F L+ + + K+ R G L +I + + H Sbjct: 444 DTEPHEIFSLSSVGEGLPLNKVARGFSGGGGIFVGEALREGKTQILRSDKATRH------ 497 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA-- 337 ++ +D +S D + +L +R V IG+ L+ A Sbjct: 498 -----VLLFSDAADSEEPDDYRATLA---ALRRENVTVSVIGLGTPKDSDADLLREVAQL 549 Query: 338 SPDRFYSVQNSRKLHDAFL 356 R Y +++ L F Sbjct: 550 GGGRIYFAEDALSLPRIFS 568 >gi|291443930|ref|ZP_06583320.1| toxic cation resistance protein [Streptomyces roseosporus NRRL 15998] gi|291346877|gb|EFE73781.1| toxic cation resistance protein [Streptomyces roseosporus NRRL 15998] Length = 294 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 55/160 (34%), Gaps = 21/160 (13%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVN 213 +S++ + +VLD S SM ++ G + L + LD ++P Sbjct: 78 AGASLEKHGLGGHRAAVYLVLDYSGSMKPYYQDGSVQALADRVLGLSSHLDDDGTVP--- 134 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +V FS+ + +A H +I++++ G A + + D Sbjct: 135 ------VVFFSTDVDAVTDIALDNHH--GRIDKIVAGLGHMGKTSYHLAMDAVID----- 181 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 H ++F TDG P + C AK Sbjct: 182 -HYLDSGSTAPALVVFQTDG---GPINKLAAERYLCKAAK 217 >gi|239940501|ref|ZP_04692438.1| hypothetical protein SrosN15_05838 [Streptomyces roseosporus NRRL 15998] gi|239986983|ref|ZP_04707647.1| hypothetical protein SrosN1_06727 [Streptomyces roseosporus NRRL 11379] Length = 250 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 55/160 (34%), Gaps = 21/160 (13%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVN 213 +S++ + +VLD S SM ++ G + L + LD ++P Sbjct: 34 AGASLEKHGLGGHRAAVYLVLDYSGSMKPYYQDGSVQALADRVLGLSSHLDDDGTVP--- 90 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +V FS+ + +A H +I++++ G A + + D Sbjct: 91 ------VVFFSTDVDAVTDIALDNHH--GRIDKIVAGLGHMGKTSYHLAMDAVID----- 137 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 H ++F TDG P + C AK Sbjct: 138 -HYLDSGSTAPALVVFQTDG---GPINKLAAERYLCKAAK 173 >gi|196003032|ref|XP_002111383.1| hypothetical protein TRIADDRAFT_55341 [Trichoplax adhaerens] gi|190585282|gb|EDV25350.1| hypothetical protein TRIADDRAFT_55341 [Trichoplax adhaerens] Length = 1173 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 49/145 (33%), Gaps = 19/145 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI-PDVNNVVRSGLV 221 + + D+++V+D LSM + + + D + + ++ +G Sbjct: 235 AATSGPKDVVIVIDCGLSMQGNRFKIAKSVAKTVLATLTRNDYVNIVCTRFSHWDETGKW 294 Query: 222 TFSSKIVQ--------TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 F V L + + I+ L G T++ G + A+ + + Sbjct: 295 HFYETTVLGCYKDQLIPASLT-NRKSLSNAIDNLKAGGTSEMKKGFQKAFKLLRGSHRTG 353 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSP 298 + +I +TDGE + Sbjct: 354 C---------QSIMIVITDGEKTDG 369 >gi|115377018|ref|ZP_01464236.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365996|gb|EAU65013.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 884 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 66/199 (33%), Gaps = 26/199 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + +++D S SM G K+ VA + L ++ D + + Sbjct: 361 QRRTSVALSVLMDCSCSMGVTVPDGRTKMEVAAEGVVGALTLLNEKDDAS------VHMV 414 Query: 224 SSKIVQTFPLAWGVQHI-QEKINRLIFGSTTK-STPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++ + F L+ + + K+ R G L +I + + H Sbjct: 415 DTEPHEIFSLSSVGEGLPLNKVARGFSGGGGIFVGEALREGKTQILRSDKATRH------ 468 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA-- 337 ++ +D +S D + +L +R V IG+ L+ A Sbjct: 469 -----VLLFSDAADSEEPDDYRATLA---ALRRENVTVSVIGLGTPKDSDADLLREVAQL 520 Query: 338 SPDRFYSVQNSRKLHDAFL 356 R Y +++ L F Sbjct: 521 GGGRIYFAEDALSLPRIFS 539 >gi|331647724|ref|ZP_08348816.1| putative von Willebrand factor type A domain protein [Escherichia coli M605] gi|330911909|gb|EGH40419.1| hypothetical protein ECAA86_02279 [Escherichia coli AA86] gi|331043448|gb|EGI15586.1| putative von Willebrand factor type A domain protein [Escherichia coli M605] Length = 219 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 64/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGSAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ + IGVQ + Sbjct: 122 YRPWIFLITDGTPTDEWQAAANKVFQGEEDKK--FAFFTIGVQGADMKTLAQ 171 >gi|307294184|ref|ZP_07574028.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] gi|306880335|gb|EFN11552.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] Length = 417 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 8/47 (17%), Positives = 19/47 (40%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDH 54 + G++ ++ A +P++ GL +T K ++ D Sbjct: 8 RLLRDRTGNVLMMAAACMPILIGAAGLATDTVQWTLWKRQVQRQADS 54 >gi|218440957|ref|YP_002379286.1| magnesium chelatase ATPase D [Cyanothece sp. PCC 7424] gi|218173685|gb|ACK72418.1| magnesium chelatase ATPase subunit D [Cyanothece sp. PCC 7424] Length = 671 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 68/213 (31%), Gaps = 37/213 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A ++ +L N + L+ F Sbjct: 470 ARKAGALIIFVVDASGSMA------LNRMQSAKGAVMRLLTEA-----YENRDQVALIPF 518 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + ++++ L G + + GL A + +A D Sbjct: 519 RGEKADVLLPPTRSIALAKKRLETLPCGGGSPLSHGLTQAVHVGMNA-------MMSGDI 571 Query: 283 YKKYIIFLTDGENSSP--------------NIDNKESLFYCNEAKRRGAIVYAI----GV 324 + I+ +TDG + P E L + + G + I Sbjct: 572 GQVVIVAITDGRGNIPLVKSLGEPLPEGEKPDIKAELLEIAGKIRGVGMKLLVIDTEKKF 631 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLR 357 + + L A ++ + + + + + Sbjct: 632 VSTGFAKELATQAGGKYYHLPRATDQAIASMAK 664 >gi|167515430|ref|XP_001742056.1| hypothetical protein [Monosiga brevicollis MX1] gi|163778680|gb|EDQ92294.1| predicted protein [Monosiga brevicollis MX1] Length = 2049 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 53/165 (32%), Gaps = 29/165 (17%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +VLD S SM G + ++ + + L+ R +V+++S Sbjct: 890 LVLDTSGSMA---GRPIREINTVLHELPKRLEGNA---------RVHVVSYNSTASLYP- 936 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + ++ + S + G+ ++ + I F+TD Sbjct: 937 ---DARLEDMELRGMGLTSFMSAFHGMIRLLEQLKPQAQDSVR-----------ICFMTD 982 Query: 293 GENSSPNIDNK--ESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 GEN+ + D E N R +V + Q L+ Sbjct: 983 GENTRDSYDLAMIELREALNALNVRACVVDVVAFGVTRVHQLLET 1027 >gi|132566536|ref|NP_758952.4| voltage-dependent calcium channel subunit alpha-2/delta-4 [Homo sapiens] gi|296434419|sp|Q7Z3S7|CA2D4_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-4; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-4; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-4; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-4; Flags: Precursor Length = 1137 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + +N ++ ++ + Sbjct: 291 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFIN------IIAYNDYVHY 338 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 339 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 393 Query: 281 DDYKKYIIFLTDG 293 + I+ ++DG Sbjct: 394 SLCNQAIMLISDG 406 >gi|126451635|ref|YP_001066065.1| hypothetical protein BURPS1106A_1796 [Burkholderia pseudomallei 1106a] gi|242315320|ref|ZP_04814336.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|126225277|gb|ABN88817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|242138559|gb|EES24961.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 602 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 78 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 134 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 135 QSVPYFFL 142 >gi|47207521|emb|CAG14087.1| unnamed protein product [Tetraodon nigroviridis] Length = 1048 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 54/164 (32%), Gaps = 26/164 (15%) Query: 218 SGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKL 273 G++ + + + L Q + E + T++ + A + F + Sbjct: 1 VGILQYGEVALHEWSLKDYQTTQEVVEAAKNISRQEGRETRTAYAIHKACTEAFSPERGA 60 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF- 332 A K +I +TDGE+ ++L C R YAI V + Sbjct: 61 REGAT------KVMIVVTDGESHDGEELP-DALQECE---DRNITRYAIAVLGHYIRRQQ 110 Query: 333 --------LKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 +K AS F++V + L+D +G + Sbjct: 111 DPETFISEIKYIASDPDDKYFFNVTDEAALNDIVDALGDRIFTL 154 >gi|17231895|ref|NP_488443.1| hypothetical protein all4403 [Nostoc sp. PCC 7120] gi|17133539|dbj|BAB76102.1| all4403 [Nostoc sp. PCC 7120] Length = 570 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 58/192 (30%), Gaps = 30/192 (15%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 D S SM + KL +++ + + LV F S+I + + Sbjct: 396 DSSGSMEGN------KLPAVQNTLQNYIKNLGKKEQ------IALVDFDSEIREPVLVDG 443 Query: 235 --WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 G + I+ L T+ A N + + + A ++ LTD Sbjct: 444 TPQGRDRGVQFISGLRADGGTRLYDAAMQARNWLQKNRREGAINA---------VLILTD 494 Query: 293 GENSSPNIDNKESLFYCNE---AKRRGAIVYAIGVQAEAA---DQFLKNCASPDRFYSVQ 346 GE+S I + + + + +G E D K +YS Sbjct: 495 GEDSGSQISLDNLSAELQKSGFSTDQRIGFFTVGYGEEGEFNPDALKKIAELNGGYYSKG 554 Query: 347 NSRKLHDAFLRI 358 + + + Sbjct: 555 DPETISRLMSDL 566 >gi|294630425|ref|ZP_06708985.1| toxic cation resistance protein [Streptomyces sp. e14] gi|292833758|gb|EFF92107.1| toxic cation resistance protein [Streptomyces sp. e14] Length = 244 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 82/248 (33%), Gaps = 30/248 (12%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS------SVKISSKSDIGLD 170 +D S + + P P S APLL T+ ++K + + Sbjct: 3 TPEDSTPTPEDSTATPEDSPTPKPATPLARVKSRAPLLTTAYKAAAPAIKKNDLTGARAK 62 Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +VLD S SM ++ + +S+ E + ++ + FS+++ T Sbjct: 63 TYLVLDRSASMRPYY------KDGSAQSLAE--QTLALAAHLDPEATVHVTFFSTEVDGT 114 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L KI+ + G A E L H H ++F Sbjct: 115 ATLTLADHE--NKIDEVHAGLGRMGRTTYHAAIE------EVLAHHDAHHPTTPALVVFQ 166 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGVQAEAADQF--LKNCASPD-RFYSVQ 346 TDG +P+ + + AK + ++ + F L+ + + F+ Sbjct: 167 TDG---APDAKTPATQALADAAKTHPHVFFSFVAFGEHDNKAFDYLRKLKTGNTSFFHAG 223 Query: 347 N-SRKLHD 353 R+L D Sbjct: 224 PTPRELTD 231 >gi|293371931|ref|ZP_06618334.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] gi|292633100|gb|EFF51678.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] Length = 448 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 62/184 (33%), Gaps = 34/184 (18%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM +S + + + V + FS I + Sbjct: 286 LDTSGSMAGER-------ERIAKSTLLAIAELTEVQHRKCYV----ILFSDDIE-CIEIT 333 Query: 235 WGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ L F T P + +A KI + II ++D Sbjct: 334 DLGSSFDRLVDFLSQSFHGGTDMEPVITHALRKISEEGYMEAD-----------IITVSD 382 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASPDRF--YSVQNSR 349 E + +S+ AK + +YAI + A +LK C D++ YSVQN+ Sbjct: 383 FEMRPVDKPLSQSI---EHAKAKQTKMYAISLGGKSAETSYLKLC---DKYWEYSVQNAE 436 Query: 350 KLHD 353 L+ Sbjct: 437 SLNK 440 >gi|288576300|ref|ZP_05978557.2| pilus-associated protein [Neisseria mucosa ATCC 25996] gi|288565840|gb|EFC87400.1| pilus-associated protein [Neisseria mucosa ATCC 25996] Length = 1081 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 60/153 (39%), Gaps = 13/153 (8%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH-----FGPGMDKLGVATRSIREMLDI- 205 PL + + +I + I ++M +D S SMN + + PG ++ V ++ +L+ Sbjct: 26 PLYLQNESQIIEQPKIKHNIMFFIDDSQSMNRNAVTGEYTPGPTRMQVTKNALNGILENH 85 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + +G + ++ P Q + +K+N++ T P Y Sbjct: 86 KDKFNWGLQTLYNGGSSDTTPEETFDPEKASWQKMIDKVNKMKPTGLT---PATSRYYEV 142 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + + + K Y++ ++DG+ + Sbjct: 143 VT----QTVMPNIKYRCQKSYVVMMSDGDANFG 171 Score = 37.5 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 27/68 (39%), Gaps = 5/68 (7%) Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQFLKNCAS-PDRFYSVQNSRKLHDAFL 356 D + + +K+ + +G ++A FL AS D +++ L +AF Sbjct: 286 DPSKDPKGVDYSKQL-VQTFTVGFGQGISDAGKAFLTRGASQDDWYFNADKPEDLENAFN 344 Query: 357 RIGKEMVK 364 +I + Sbjct: 345 KIISLIST 352 >gi|194220392|ref|XP_001916340.1| PREDICTED: von Willebrand factor A domain containing 3B [Equus caballus] Length = 1301 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 56/171 (32%), Gaps = 32/171 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 VYILIDTSHSMK-------GKLDLVKDKIIQFIQEQLKYKRKFNFVQFDAQAIAWREKLV 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q + + GS+T + L+ A+ KE I L Sbjct: 562 EIDEDNLKRAQSWVRDIKIGSSTNTLNALQIAFA----DKETQA------------IYLL 605 Query: 291 TDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEA--ADQFLKNCAS 338 TDG P ++ K + +YAI A+ FLK A+ Sbjct: 606 TDGRPDQP------LEMVIDQVKVFQKIPIYAISFNYNDEIANGFLKELAA 650 >gi|149188657|ref|ZP_01866949.1| transporter [Vibrio shilonii AK1] gi|148837567|gb|EDL54512.1| transporter [Vibrio shilonii AK1] Length = 562 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 67/225 (29%), Gaps = 35/225 (15%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 S+ + + P S ++++VLD S SM P Sbjct: 62 WKSQLPLKLLSAILSLAIIVCAGPTWQRESSPFGEDQG---ELVIVLDTSQSMMQSDLPP 118 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 +L A IR++L ++ L+ F+ P+ Q +N + Sbjct: 119 -SRLERAKYKIRDLLQA-------RQGGKTSLIVFAGSAHTAMPMTEDNQVFLPFLNAIS 170 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 + A +I + KG II +DG S K++ Sbjct: 171 PEVIPVQGKSAQAAIPQI-------QSQLKGSSSGSVLII--SDGVASEAIEQYKKAFE- 220 Query: 309 CNEAKRRGAIVYAIGVQAEAAD-------QFLKNCA--SPDRFYS 344 +V + V +E LK A + R+Y+ Sbjct: 221 -----DSNVLVMVMAVGSERNQSSAATDWDSLKQLANDTGGRYYA 260 >gi|32477849|ref|NP_870843.1| hypothetical protein RB13068 [Rhodopirellula baltica SH 1] gi|32448406|emb|CAD77921.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 499 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 14/59 (23%), Positives = 26/59 (44%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNN 71 G++ IL ILL +F + GL+I+ +A + + D + L ++ N Sbjct: 108 RGGAVLILIVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGANQTT 166 >gi|268324651|emb|CBH38239.1| hypothetical protein BSM_17160 [uncultured archaeon] Length = 537 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 70/182 (38%), Gaps = 19/182 (10%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + + ++D+S S + V + L +I + R +V Sbjct: 347 TENRTRDIAVAFLVDMSGSTVGSTILCEKEALVLMSEALKELGDAFAIYGFSGYGRDNVV 406 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F +++ F + Q +Q KI+ + +T+ P + + K+ +E+ Sbjct: 407 FF---LIKDFEDPYD-QRVQSKISTMTNKQSTRIAPAIRHTTTKLRRREERT-------- 454 Query: 282 DYKKYIIFLTDGE----NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + +I L+DG+ + N +++ EA+ G + I V EAA+ + A Sbjct: 455 ---RMLILLSDGKPLDRDYYGNYAIEDTRMALKEAQGYGVKSFCITVDREAAEYLPRMYA 511 Query: 338 SP 339 + Sbjct: 512 NS 513 >gi|153802361|ref|ZP_01956947.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|124122080|gb|EAY40823.1| conserved hypothetical protein [Vibrio cholerae MZO-3] Length = 634 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 LLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|114800198|ref|YP_761811.1| hypothetical protein HNE_3135 [Hyphomonas neptunium ATCC 15444] gi|114740372|gb|ABI78497.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 253 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 65/172 (37%), Gaps = 13/172 (7%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI-PDVNNVV 216 V I + + ++LD + SM++ + + + S + D+ V Sbjct: 24 VVPIKPPKEDTVQSYILLDRTGSMSNIWDEALSSVNAYADSFAADAPGAEIAGADIKTSV 83 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + +++K++ + + T +FDA K+ Sbjct: 84 TVAVFDYQDGMQFDV--------LRDKVDPSTWKTITN-DEANPRGMTPLFDAIGKIITR 134 Query: 277 AKGHDDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 A+ + K I+ +TDG ENSS + + + A+ RG V I + AE Sbjct: 135 AEADNPEKAVIVIMTDGLENSSKEFTKEGAKAALDRAEARGWEV--IFLGAE 184 >gi|289621292|emb|CBI52075.1| unnamed protein product [Sordaria macrospora] Length = 914 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 69/192 (35%), Gaps = 35/192 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V D S SM ++ ++R L KSIP + +F S+ Sbjct: 299 IVFVCDRSGSMGG------TRIEGLKSALRIFL---KSIPVGAKF---NICSFGSRHEFL 346 Query: 231 FP---LAWGVQHIQ---EKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 FP ++ + +Q E N + T+ LE A+ K + + Sbjct: 347 FPEGSRSYDQETLQRAMEYTNLMRADFGGTEMYRPLEAAFEKRYKDMDLE---------- 396 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 + LTDGE + +E+K ++ +G+ + + ++ A + Sbjct: 397 ---VFLLTDGEIWDQGQLFTMTNEKVSESKGA-IRLFTLGIGNDVSHALIEGVARAGNGF 452 Query: 344 S--VQNSRKLHD 353 + V + K++ Sbjct: 453 AQSVTDGEKMNA 464 >gi|157109793|ref|XP_001650826.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] gi|108878928|gb|EAT43153.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] Length = 1173 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 61/156 (39%), Gaps = 13/156 (8%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVA----TRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D++++LD S SM + + +L V T S + ++I K DV+ +V F+ Sbjct: 221 DIVILLDNSGSMTG-YRNYIAQLTVKSILDTFSNNDFINIYKYSNDVDPLVDC----FAD 275 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++Q P ++ + EK+ L A+ + +E + + Sbjct: 276 MLIQATPE--NIRFMNEKVRGLEPDGYANVKKAFVKAFELLQHYREMR-RCNETVSGCNQ 332 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 I+ +TDG S+ D E + + V+ Sbjct: 333 AIMLITDGVPSNI-TDVFEQYNWFENGTKIPVRVFT 367 >gi|157128122|ref|XP_001661316.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] gi|108872689|gb|EAT36914.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] Length = 1100 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 61/156 (39%), Gaps = 13/156 (8%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVA----TRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D++++LD S SM + + +L V T S + ++I K DV+ +V F+ Sbjct: 195 DIVILLDNSGSMTG-YRNYIAQLTVKSILDTFSNNDFINIYKYSNDVDPLVDC----FAD 249 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++Q P ++ + EK+ L A+ + +E + + Sbjct: 250 MLIQATPE--NIRFMNEKVRGLEPDGYANVKKAFVKAFELLQHYREMR-RCNETVSGCNQ 306 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 I+ +TDG S+ D E + + V+ Sbjct: 307 AIMLITDGVPSNI-TDVFEQYNWFENGTKIPVRVFT 341 >gi|308500716|ref|XP_003112543.1| hypothetical protein CRE_30864 [Caenorhabditis remanei] gi|308267111|gb|EFP11064.1| hypothetical protein CRE_30864 [Caenorhabditis remanei] Length = 661 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 70/198 (35%), Gaps = 23/198 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ +D+M ++D S S G++ + I E+L + P + R +V Sbjct: 448 PARKLPPIDLMFLVDTSSS------IGINNFDIQKNFICEILKDVDIAPGRS---RISMV 498 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYA------YNKIFDAKEK 272 ++ F + + ++ + RL G T + L +A + ++ Sbjct: 499 QYAQDPSVVFGFDQYYSYESVRRGVMRLSYTGGATMLSKALAFAGGIMYHEQNLKKTTKR 558 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + D + + ++DG + D+ N R ++A+ ++ D+ Sbjct: 559 HQFLPTPKHDRLQVLCLVSDGYS-----DDSADKESVNLHDRLHVKIFAVVTRSFNKDKL 613 Query: 333 LKNCASPDRFYSVQNSRK 350 + ++V Sbjct: 614 VPITRFDGSVFTVHQRES 631 >gi|299145326|ref|ZP_07038394.1| conserved hypothetical protein [Bacteroides sp. 3_1_23] gi|298515817|gb|EFI39698.1| conserved hypothetical protein [Bacteroides sp. 3_1_23] Length = 475 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 61/184 (33%), Gaps = 34/184 (18%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM +S + + + V + FS I + Sbjct: 313 LDTSGSMAGER-------ERIAKSTLLAIAELTEVQHRKCYV----ILFSDDIE-CIEIT 360 Query: 235 WGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ L F T P + +A KI + II ++D Sbjct: 361 DLGSSFDRLVDFLSQSFHGGTDMEPVITHALRKISEEGYMEAD-----------IITVSD 409 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASPDRF--YSVQNSR 349 E + ++ AK + +YAI + A +LK C D++ YSVQN+ Sbjct: 410 FEMRPVDKLLSRTI---EHAKAKQTKMYAISLGGKSAETSYLKLC---DKYWEYSVQNAE 463 Query: 350 KLHD 353 L+ Sbjct: 464 SLNK 467 >gi|58429533|gb|AAW78170.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429547|gb|AAW78177.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYASVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG +S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPNSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|75907582|ref|YP_321878.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701307|gb|ABA20983.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 570 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 58/192 (30%), Gaps = 30/192 (15%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA- 234 D S SM + KL +++ + + L+ F S+I + + Sbjct: 396 DSSGSMEGN------KLPAVQNTLQNYIKNLGKKEQ------IALIDFDSEIREPVLVDG 443 Query: 235 --WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 G + I+ L TK A N + + + A ++ LTD Sbjct: 444 TPQGRDRGVQFISGLRADGGTKLYDAAIQARNWLQKNRRQGAINA---------VLILTD 494 Query: 293 GENSSPNIDNKESLFYCNE---AKRRGAIVYAIGVQAEAA---DQFLKNCASPDRFYSVQ 346 GE+S I + + + + +G E D K +YS Sbjct: 495 GEDSGSKISLDNLSAELQKSGFSTDQRIGFFTVGYGEEGEFNPDALKKIAELNGGYYSKG 554 Query: 347 NSRKLHDAFLRI 358 + + + Sbjct: 555 DPETISRLMSDL 566 >gi|289642330|ref|ZP_06474478.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] gi|289507862|gb|EFD28813.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] Length = 426 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 65/203 (32%), Gaps = 29/203 (14%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VR---SGLVTFSSK 226 +++LD S SM G K+ R+ +D + PD VR + + Sbjct: 49 VILLDCSGSM----GNPRTKIVEMRRAAAAAVDAL---PDGTWFAIVRGSGIAEAVYPAG 101 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 +E+I R+ T L A ++ + H A Sbjct: 102 RELVRADERTRAAAKERIGRVDADGGTAIGRWLTRA-RELMATRPDAIHHA--------- 151 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 I LTDG N + E++ C + GV A+ L+ A+ + Sbjct: 152 -ILLTDGRNGERAAEFDEAVAACVGRFQCDCR----GVGADWRVDELRRVATALLGTVDA 206 Query: 345 VQNSRKLHDAFLRIGKEMVKQRI 367 V+ + L F + + + Sbjct: 207 VRRPQDLATDFQSMINNATNRAV 229 >gi|189237279|ref|XP_973594.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 750 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 77/223 (34%), Gaps = 34/223 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV--TFSSKIV 228 ++ VL+ L+M G +D+L A + I L + +VR G + S Sbjct: 299 VVFVLNHGLTM---HGRKIDQLIDAMQKILSELTENDAFD----IVRFGATPSVWDSTRH 351 Query: 229 QTFPLAWGVQH--IQEKINRLIFGSTTKS-TPGLEYAYNKIFDAKEKLEHIAKGHDD--- 282 + L + ++ + +L T+K+ +E A + I+D + Sbjct: 352 KFIRLPDLRHYGNLEPYVKKLFLPRTSKAVRQNIEAARSTIYDKSGLGLSNPVYALEVGL 411 Query: 283 ----------YKKY---IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +Y IIFLTD + E + + ++++ + Sbjct: 412 FLAKRIQDNLPNRYQPMIIFLTDSYPTVGMTSQNEIINTVTKVNNNRIPIFSLSFGEDVD 471 Query: 330 DQFLKNCAS-----PDRFYSVQNSR-KLHDAFLRIGKEMVKQR 366 F++ A+ Y ++ ++ + + I ++ Q Sbjct: 472 KNFMRQLAAKNLGFSGHIYEALDASVQILNFYRSISSPVLSQV 514 >gi|171847247|gb|AAI61503.1| LOC548952 protein [Xenopus (Silurana) tropicalis] gi|213624339|gb|AAI70959.1| hypothetical protein LOC548952 [Xenopus (Silurana) tropicalis] Length = 896 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 47/143 (32%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ + +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTYLGTTYLDIAKGAVEIFM-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 +I +TDG + +E L Sbjct: 122 PSILITITDGNKLTSTASVQEEL 144 >gi|73958793|ref|XP_536940.2| PREDICTED: hypothetical protein XP_536940 [Canis familiaris] Length = 1110 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 87/273 (31%), Gaps = 36/273 (13%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 + + ++ I+ + ++ + R +I C + S + Sbjct: 817 SVQGLASAKHCAILPSIEINGRVRHIQWMPREIEVYITCLEKVMKRYVQRLQWLLSGSRR 876 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIK--SIPDVNNVVRS 218 + + + +++D S SM + +L + +R+ D S + + R Sbjct: 877 LFGTILERKVCILVDTSGSMGPYLQQVKTELVLLIWEQLRKHCDSFNLLSFAEDLQLWRD 936 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LV + + + + L +T L A+ H + Sbjct: 937 TLVETTDEA---------CHEAMQWVTHLRAHGSTSVLQALLKAF---------TFHDVE 978 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQ--AEAADQFLKN 335 G + LTDG+ P+ L G ++ I + AA FL+N Sbjct: 979 G-------LYLLTDGK---PDTSCSLILREVQRLTETGDVKMHTIALNHSGRAAGDFLRN 1028 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A + R++ + L + K ++ +R Sbjct: 1029 LAALTGGRYHCPVDEDTLLRIHGLLTKGIMHER 1061 >gi|62859103|ref|NP_001016198.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Xenopus (Silurana) tropicalis] Length = 896 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 47/143 (32%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ + +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTYLGTTYLDIAKGAVEIFM-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 +I +TDG + +E L Sbjct: 122 PSILITITDGNKLTSTASVQEEL 144 >gi|121599267|ref|YP_993091.1| hypothetical protein BMASAVP1_A1771 [Burkholderia mallei SAVP1] gi|124385206|ref|YP_001026132.1| hypothetical protein BMA10229_A0124 [Burkholderia mallei NCTC 10229] gi|126450090|ref|YP_001080596.1| hypothetical protein BMA10247_1041 [Burkholderia mallei NCTC 10247] gi|167002573|ref|ZP_02268363.1| conserved hypothetical protein [Burkholderia mallei PRL-20] gi|217423806|ref|ZP_03455307.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238563902|ref|ZP_00438064.2| membrane protein [Burkholderia mallei GB8 horse 4] gi|254177770|ref|ZP_04884425.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|254206222|ref|ZP_04912574.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|121228077|gb|ABM50595.1| conserved hypothetical protein [Burkholderia mallei SAVP1] gi|124293226|gb|ABN02495.1| conserved hypothetical protein [Burkholderia mallei NCTC 10229] gi|126242960|gb|ABO06053.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247] gi|147753665|gb|EDK60730.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|160698809|gb|EDP88779.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|217393664|gb|EEC33685.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238519717|gb|EEP83185.1| membrane protein [Burkholderia mallei GB8 horse 4] gi|243061787|gb|EES43973.1| conserved hypothetical protein [Burkholderia mallei PRL-20] Length = 602 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCA 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + N L D + + + + + + Sbjct: 78 QPNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTAT 134 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 135 QSVPYFFL 142 >gi|320355194|ref|YP_004196533.1| protoporphyrin IX magnesium-chelatase [Desulfobulbus propionicus DSM 2032] gi|320123696|gb|ADW19242.1| protoporphyrin IX magnesium-chelatase [Desulfobulbus propionicus DSM 2032] Length = 681 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 59/177 (33%), Gaps = 18/177 (10%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + D+ L A R P+ + L + IG ++ V+D S S Sbjct: 448 NNGDFALDATLRAAAPYQQQRAGKNGSDLAVQLRPQDIRSKVREKRIGNFLLFVVDASGS 507 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQTFPLAWGVQH 239 M ++ + ++ +L + + ++TF P V Sbjct: 508 MG-----ARGRMAASKGAVMSLL-----LDAYQKRDKVSMITFRRDAAFINLPPTTSVDM 557 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + G T + GL ++ ++ + + + ++F+TDG+ + Sbjct: 558 AGRLLAEMPVGGRTPLSAGLAKSFEQVRN-------YLIKNPTAQPIVLFITDGKCN 607 >gi|52549969|gb|AAU83818.1| conserved hypothetical protein [uncultured archaeon GZfos34A6] Length = 569 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 82/207 (39%), Gaps = 20/207 (9%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + + ++D+S S + + + L +I + R +V Sbjct: 379 TENRTRDIAVAFLVDMSGSTVGSTIRCEKEALILMSEALKELGDAFAIYGFSGYGRDNVV 438 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F +++ F ++ ++ +Q KI+ + +T+ P + + K+ +E+ Sbjct: 439 FF---LIKDFEDSYDLR-VQCKISTMTNKQSTRIAPAIRHTTTKLRRREERT-------- 486 Query: 282 DYKKYIIFLTDGE----NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + +I L+DG+ + N +++ EA+R G + I V EAA ++L Sbjct: 487 ---RMLILLSDGKPLDRDYYGNYAIEDTRMALKEAQRYGVKSFCITVDREAA-EYLPRMY 542 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + R+ + + KL RI K Sbjct: 543 ADSRWVVIDDVLKLPAKITRIYKRFTT 569 >gi|332360731|gb|EGJ38540.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK49] Length = 478 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 58/194 (29%), Gaps = 34/194 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+D S SM + + +++I R GL TFS ++ Sbjct: 206 DIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFSDEVYG 255 Query: 230 TFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +++ + + S + A N+I Sbjct: 256 RNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS----- 310 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 + KK+++ LTD E + K G V+A Sbjct: 311 TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIEG--IY 367 Query: 334 KNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 368 KNFATEGRVLDIEN 381 >gi|262402641|ref|ZP_06079202.1| TPR domain protein in aerotolerance operon [Vibrio sp. RC586] gi|262351423|gb|EEZ00556.1| TPR domain protein in aerotolerance operon [Vibrio sp. RC586] Length = 562 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 47/139 (33%), Gaps = 17/139 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S + +M++LD S SM D+L + + I ++ + ++G Sbjct: 91 EASPFGEDSASLMVLLDSSESMQQKDIAP-DRLTRSKQKILDLTEA-------RKGGKTG 142 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L+ F+ PL + +Q + + + A N + Sbjct: 143 LMVFAGSAHVAMPLTSDNRVLQPYLAAINPNVMPVEGKAAQSALNLLHKQLPPYVGNT-- 200 Query: 280 HDDYKKYIIFLTDGENSSP 298 ++ +TDG ++ Sbjct: 201 -------LLLVTDGVGNAT 212 >gi|237721534|ref|ZP_04552015.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|293369166|ref|ZP_06615760.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|229449330|gb|EEO55121.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|292635749|gb|EFF54247.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] Length = 616 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 74/195 (37%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSV-KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N+ H + I +I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMEAGACPW--NAVHRLVRIGLKAREIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V +G S+ + L Q I+E I+ L G Sbjct: 267 LDLVKSSLKLLVNNLRDKDKVAIVTYAG----SAGVKLEATLGSDKQKIREAIDELTAGG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II +DG+ + + + Sbjct: 323 STAGGAGILLAYKIAKKNFISNGNNR---------IILCSDGDFNVGVSSAEGLEQLIEK 373 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 374 ERKSGVFLTVLGYGM 388 >gi|220941747|emb|CAX15448.1| novel protein similar to vertebrate collagen, type VI, alpha 3 (COL6A3) [Danio rerio] Length = 429 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 68/218 (31%), Gaps = 26/218 (11%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 P + + S + D+ ++D S + F D + Sbjct: 218 MTPLITVVGETDTIEGAPTPGPSHGE--RDVAFLIDGSDDVRGDFPYIRDFISRV----- 270 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTT--KST 256 I+ + N VR +V S + F L + +N L + Sbjct: 271 -----IEPLDIGINKVRVSVVQHSDRPSPNFFLDTYQTKDEVLRAVNGLTLAGGRGLNTG 325 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L + N + ++ + +++I LT G + + +L K G Sbjct: 326 AALTFMKNTVLS----TARGSRAAQNVPQFLIVLTAGRSRDSVREPAVAL------KTEG 375 Query: 317 AIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 + + +GV+ + +P ++V+ +L+ Sbjct: 376 VVPFGVGVKNADPKEIEAISHNPSFAFNVKEFSQLNTI 413 >gi|330503959|ref|YP_004380828.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] gi|328918245|gb|AEB59076.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] Length = 3977 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 61/169 (36%), Gaps = 23/169 (13%) Query: 147 NSSHAPLLITSSVKISSKSDIG-------LDMMMVLDVSLSMNDH---FGPGMDKLGVAT 196 + H + S + + ++M+VLD+S SM+D KL +A Sbjct: 3126 GALHISVDDDSPAQPKDIAKSASEPQGIHTNLMVVLDLSGSMDDAPSGVSGFSTKLALAK 3185 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTT 253 +++ ++D ++ DV +VR VTF++ + + + L +T Sbjct: 3186 DAVQRLIDSYDNLGDV--MVRI--VTFANTASAVGNVWMTASDAKAWLTALANNAGNGST 3241 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 L A N + + FL+DG+ + N + Sbjct: 3242 NYDDALIKAMNAYDSTGKLT------GTGVQSVSYFLSDGQPTLSNANP 3284 >gi|218778169|ref|YP_002429487.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] gi|218759553|gb|ACL02019.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] Length = 381 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 74/229 (32%), Gaps = 37/229 (16%) Query: 142 FPWCANSSH---APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 F CA S P + S++ +++LD S SM + G K +A + Sbjct: 18 FAGCAGKSAEVFVPSDLNGSIRTGHLVQKADHFLVILDCSGSMAE-MVDGEKKFVLAREA 76 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTF---------SSKIVQTFPLAWGVQHIQEKINRLIF 249 +R+M I IP+ +GL F S+ + + + Sbjct: 77 VRKM---IAGIPEAEMNFEAGLRVFGLTMNPFVESTTALVDMEPLDKAAYDKALDKVTFA 133 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 + + + + + ++ +I +TDG+ + +L Sbjct: 134 TGKSNLALAIAQSSDDLDKTTGEIS------------LIIVTDGKETDGEA--ARALEVV 179 Query: 310 NEAKRRGAIVYAIGVQAEAADQ-FLKN------CASPDRFYSVQNSRKL 351 R VY I V + A + L+ C + +V + + + Sbjct: 180 KNTYRDRLCVYTIQVGHDPAGEKLLERLSRKGQCGYSENLDNVNSPKAM 228 >gi|308501124|ref|XP_003112747.1| CRE-ROP-1 protein [Caenorhabditis remanei] gi|308267315|gb|EFP11268.1| CRE-ROP-1 protein [Caenorhabditis remanei] Length = 644 Score = 44.0 bits (102), Expect = 0.032, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 58/169 (34%), Gaps = 35/169 (20%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQT 230 + LDVS SM P + M S+ +++N V F K Sbjct: 477 CLALDVSGSM---CSPVSSSPLSCREAATGM-----SLINLHNEAEVKCVAFCDKLTELP 528 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 F W + + + I++L F T + +A + + +II+ Sbjct: 529 FTKDWKIGQVNDYIDKLSF-GNTDCGLPMTWATE--------------NNLKFDVFIIY- 572 Query: 291 TDGENSSPNIDNKESLFYCNEAK---RRGAIV-------YAIGVQAEAA 329 TD + + N+ E++ EA IV Y+I ++A Sbjct: 573 TDNDTWAGNVHPFEAIKRYREASGIHDAKVIVMAMHAYNYSIADPSDAG 621 >gi|262196568|ref|YP_003267777.1| FHA domain containing protein [Haliangium ochraceum DSM 14365] gi|262079915|gb|ACY15884.1| FHA domain containing protein [Haliangium ochraceum DSM 14365] Length = 564 Score = 44.0 bits (102), Expect = 0.032, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 61/170 (35%), Gaps = 10/170 (5%) Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 ++ + + R +V + ++ + + V + I L + L A Sbjct: 96 QALEFLQKLPRDARVVVVGYDDEVHASRRVG-DVARARRDIEALEIN-PLSTELQLIEAV 153 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N+ D +LE + +K I ++DG ++ P+ + A R ++ IG Sbjct: 154 NRARDTLARLE-PEREGVPMRKLIAVVSDGRDADPS--PENYRRVAKRAARNDIRIHTIG 210 Query: 324 VQAEAADQFLKNCA-----SPDRFYSVQNSRKLHDAFLRIGKEMVKQRIL 368 A+ L A S F V F ++ +E+ +Q +L Sbjct: 211 FPADRNRYPLYGLAEMSKQSEGTFRLVLTESAFGSHFGQLAREINEQYVL 260 >gi|196233185|ref|ZP_03132032.1| hypothetical protein CfE428DRAFT_5199 [Chthoniobacter flavus Ellin428] gi|196222829|gb|EDY17352.1| hypothetical protein CfE428DRAFT_5199 [Chthoniobacter flavus Ellin428] Length = 252 Score = 44.0 bits (102), Expect = 0.032, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 71/209 (33%), Gaps = 15/209 (7%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 ID S + +P ++ P+ + +++ Sbjct: 16 TIDVVQPYNGTSVKIDFALPAPDRVSSLDVTANGTPVEKDKVKFTPADKLPNYHCAVLVL 75 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQTFPLAW 235 V ++ ++ ++IR+ + + ++ + + G+ TFS I P+ Sbjct: 76 VDKTLGNNKDTNEKAREKLWKTIRDTMSKVSAVAETAPY-QVGVATFSAGNIDLMAPMGS 134 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + I ++ F + +++ ++ G +KYI+ ++DG + Sbjct: 135 KKSIVDSAIEKVTFNGVS----------PELYLGAKRAIEWFSGTPADRKYIVLISDGIS 184 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + + ++ + +AK + IG Sbjct: 185 NDKVVSQQDVVQAALKAK---VHICTIGF 210 >gi|167911128|ref|ZP_02498219.1| hypothetical protein Bpse112_11583 [Burkholderia pseudomallei 112] Length = 579 Score = 44.0 bits (102), Expect = 0.032, Method: Composition-based stats. Identities = 19/127 (14%), Positives = 44/127 (34%), Gaps = 8/127 (6%) Query: 14 KGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGK 73 +GS +++ AI + V V+G ++ + FFV+ L + D + L A ++ + + Sbjct: 1 RGSFALVAAIWMLVAIAVLG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCAQ 55 Query: 74 KQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY 133 + N L D + + + + + + Sbjct: 56 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTATQ 112 Query: 134 EMPFIFC 140 +P+ F Sbjct: 113 SVPYFFL 119 >gi|109009556|ref|XP_001109351.1| PREDICTED: calcium-activated chloride channel regulator 2-like [Macaca mulatta] Length = 943 Score = 44.0 bits (102), Expect = 0.032, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 79/214 (36%), Gaps = 43/214 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQAAEFYLMQI---VEIHTFV--GIASFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + + + T+ + GL+ + + Y Sbjct: 362 AQLHQINSNDDRKLLVSYL-PTTVSAKTEVSICSGLKKGFEVV---------EKLNGKAY 411 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPD 340 +I +T G++ + + C G+ +++I + + AA L Sbjct: 412 GTVMILVTSGDD--------KLIGNCLPTVLSSGSTIHSIALGSSAAPNLEELSRLTGGL 463 Query: 341 RFY--SVQNSRKLHDAFLRIGK---EMVKQRILY 369 +F+ + NS + DAF RI ++ +QRI Sbjct: 464 KFFVPDISNSNSMIDAFSRISSGTGDIFQQRIQL 497 >gi|22298517|ref|NP_681764.1| hypothetical protein tlr0974 [Thermosynechococcus elongatus BP-1] gi|22294697|dbj|BAC08526.1| tlr0974 [Thermosynechococcus elongatus BP-1] Length = 241 Score = 44.0 bits (102), Expect = 0.032, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 72/232 (31%), Gaps = 28/232 (12%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 S+ P V + L + ++LD S SM G ++ L + Sbjct: 1 MTIQSNVPEWANVEVPGGERH---LPVYLLLDTSSSME---GAPIESLHQGLEQFQRE-- 52 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAY 263 + S ++V+ G++TF+S G+ I + L T+ Sbjct: 53 -VSSDQFARDIVKVGVITFASDAQLVTG---GLVPISDFQPPMLTASGVTRLDLAFTVLL 108 Query: 264 NKIFDAKEKLEHIAKGHD--DYKKYIIFLTDGENSSPNIDNKESL------FYCNEAKRR 315 I + KG D+K + LTDG + + + L N K Sbjct: 109 ESI---DRDVVRPVKGGQKGDWKPAVFVLTDGRPTDRHGIATDELWRPARDALVNRPKGE 165 Query: 316 GAI---VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + A+G D LK ++ F + F + + + Sbjct: 166 -IKPSVIVAVGCGPHVDDDTLKAISTGTAFKMGTSEAAFVALFQYLSQSLTT 216 >gi|224043354|ref|XP_002197844.1| PREDICTED: integrator complex subunit 6 [Taeniopygia guttata] Length = 887 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRTHLGTTYLDIAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITVTDG 131 >gi|118084887|ref|XP_417071.2| PREDICTED: similar to candidate tumor suppressor protein DICE1 [Gallus gallus] Length = 888 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRTHLGTTYLDIAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPY-A 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITVTDG 131 >gi|322790292|gb|EFZ15291.1| hypothetical protein SINV_15500 [Solenopsis invicta] Length = 2314 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 72/216 (33%), Gaps = 46/216 (21%) Query: 169 LDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 ++++ ++D S S+ D+F + + +D + + R L+TF + Sbjct: 123 VELVFLVDASGSVGADNFRSELSFVTKLLSDFT--VDTMAA--------RIALITFGGRG 172 Query: 228 VQTFPLAWGVQHIQEK---------INRLI-----FGSTTKSTPGLEYAYNKIFDAKEKL 273 V I +N+ G T + L A + ++E Sbjct: 173 NVY----RNVDQISRHGPNDHKCYLLNKQFRNITYSGGGTYTRGALLEALAILEKSREAA 228 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 K + +TDG ++ + + K GAIV+A G++ + L Sbjct: 229 ----------SKVVFLITDGFSNGG-----DPRPAAHLLKNTGAIVFAFGIRT-GNVEEL 272 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 + ASP + F + + + + + Sbjct: 273 HDIASPPWYAHSYFLDS-FAEFEALARRALHRAKRF 307 >gi|300868463|ref|ZP_07113083.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] gi|300333545|emb|CBN58271.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] Length = 220 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 72/191 (37%), Gaps = 27/191 (14%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + +V+ + + +++LD S SM G +D L ++ D + Sbjct: 1 MRLDEAVEFAENPEPRCPCVLLLDTSGSMQ---GAPLDALN---EGLQTFRDDLTRDELA 54 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-------LIFGSTTKSTPGLEYAYNK 265 V +VTF ++I + +Q+ + L T G+ YA + Sbjct: 55 KKRVEVAIVTFDNQI----------KVVQDFVTADQFESPLLTAQGQTHMGAGISYALDM 104 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIG 323 I A K E+ G Y+ ++ +TDGE + + + + E + +A+G Sbjct: 105 I--AARKSEYRNNGITYYRPWVFMITDGEPQGESEQLVEQAAQRIREEEANKRVAFFAVG 162 Query: 324 VQAEAADQFLK 334 V+ + + Sbjct: 163 VEGANISRLAQ 173 >gi|300781256|ref|ZP_07091110.1| secreted Mg-chelatase subunit [Corynebacterium genitalium ATCC 33030] gi|300532963|gb|EFK54024.1| secreted Mg-chelatase subunit [Corynebacterium genitalium ATCC 33030] Length = 528 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 65/198 (32%), Gaps = 33/198 (16%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG-LVTFSSKIVQTFP 232 VLD S SM ++ + + ++D + + +R G LVTF S Sbjct: 347 VLDTSGSMEGE------RIASLQQIMTSLIDGSAATATGDVALRDGELVTFQSFSTAPHE 400 Query: 233 LAWG---------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 G Q +N L+ T Y+ +FDA + A Sbjct: 401 PLLGEFLRDDRITKAKYQGYVNDLVADGQTAI-------YDTLFDALRSSDPNAGISS-- 451 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 I+ L+DGE + Y ++R V+ I EA + N A + Sbjct: 452 ---IVLLSDGEVTHGMDYYAFEKQYQGLSPEQRSIPVFVILYG-EANASEMNNLAELTGG 507 Query: 341 RFYSVQNSRKLHDAFLRI 358 + N L AF I Sbjct: 508 AVFDALN-GDLDAAFKEI 524 >gi|255746017|ref|ZP_05419964.1| TPR domain protein in aerotolerance operon [Vibrio cholera CIRS 101] gi|262162144|ref|ZP_06031159.1| TPR domain protein in aerotolerance operon [Vibrio cholerae INDRE 91/1] gi|255735771|gb|EET91169.1| TPR domain protein in aerotolerance operon [Vibrio cholera CIRS 101] gi|262028219|gb|EEY46877.1| TPR domain protein in aerotolerance operon [Vibrio cholerae INDRE 91/1] Length = 632 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 52/159 (32%), Gaps = 26/159 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W + S + S + + +++D+S SM P T+ Sbjct: 56 VLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLAP-----NRLTQ 109 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTT 253 + + LD++K + + +GLV +++ PL + + L + + Sbjct: 110 ARYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGS 165 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 166 DAAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|229605141|ref|YP_002875845.1| TPR domain protein in aerotolerance operon [Vibrio cholerae MJ-1236] gi|229371627|gb|ACQ62049.1| TPR domain protein in aerotolerance operon [Vibrio cholerae MJ-1236] Length = 624 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 52/159 (32%), Gaps = 26/159 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W + S + S + + +++D+S SM P T+ Sbjct: 56 VLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLAP-----NRLTQ 109 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTT 253 + + LD++K + + +GLV +++ PL + + L + + Sbjct: 110 ARYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGS 165 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 166 DAAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|229517330|ref|ZP_04406775.1| TPR domain protein in aerotolerance operon [Vibrio cholerae RC9] gi|229345366|gb|EEO10339.1| TPR domain protein in aerotolerance operon [Vibrio cholerae RC9] Length = 652 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 52/159 (32%), Gaps = 26/159 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W + S + S + + +++D+S SM P T+ Sbjct: 56 VLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLAP-----NRLTQ 109 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTT 253 + + LD++K + + +GLV +++ PL + + L + + Sbjct: 110 ARYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGS 165 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 166 DAAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|153822653|ref|ZP_01975320.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229510538|ref|ZP_04400018.1| TPR domain protein in aerotolerance operon [Vibrio cholerae B33] gi|126519810|gb|EAZ77033.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229352983|gb|EEO17923.1| TPR domain protein in aerotolerance operon [Vibrio cholerae B33] Length = 622 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 52/159 (32%), Gaps = 26/159 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W + S + S + + +++D+S SM P T+ Sbjct: 56 VLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLAP-----NRLTQ 109 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTT 253 + + LD++K + + +GLV +++ PL + + L + + Sbjct: 110 ARYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGS 165 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 166 DAAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|254850437|ref|ZP_05239787.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|254846142|gb|EET24556.1| conserved hypothetical protein [Vibrio cholerae MO10] Length = 644 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 52/159 (32%), Gaps = 26/159 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W + S + S + + +++D+S SM P T+ Sbjct: 56 VLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLAP-----NRLTQ 109 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTT 253 + + LD++K + + +GLV +++ PL + + L + + Sbjct: 110 ARYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGS 165 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 166 DAAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|15600941|ref|NP_232571.1| hypothetical protein VCA0171 [Vibrio cholerae O1 biovar eltor str. N16961] gi|9657561|gb|AAF96084.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] Length = 646 Score = 44.0 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 52/159 (32%), Gaps = 26/159 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W + S + S + + +++D+S SM P T+ Sbjct: 56 VLALSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLAP-----NRLTQ 109 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTT 253 + + LD++K + + +GLV +++ PL + + L + + Sbjct: 110 ARYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGS 165 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 166 DAAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|262191199|ref|ZP_06049399.1| TPR domain protein in aerotolerance operon [Vibrio cholerae CT 5369-93] gi|262032939|gb|EEY51477.1| TPR domain protein in aerotolerance operon [Vibrio cholerae CT 5369-93] Length = 624 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 VLVLSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|260820654|ref|XP_002605649.1| hypothetical protein BRAFLDRAFT_150471 [Branchiostoma floridae] gi|229290984|gb|EEN61659.1| hypothetical protein BRAFLDRAFT_150471 [Branchiostoma floridae] Length = 168 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 54/135 (40%), Gaps = 16/135 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +D+ V+D S S+ + + + +R+M+D + R G+V F+ Sbjct: 1 NTPMDLAFVIDGSASVGPL------QFEKSKKFVRDMVDGFDI---GSAQTRVGVVQFAW 51 Query: 226 KIVQTFPLAW--GVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L ++ I R+ + G T+ L + ++F L Sbjct: 52 MVQAEFNLGDYLDGTDLRNAIARIRYMDGPGTEIGKALVFTKRRLFSE---LYGARPETQ 108 Query: 282 DYKKYIIFLTDGENS 296 D + +I +TDG +S Sbjct: 109 DVPRIVILITDGRSS 123 >gi|262165252|ref|ZP_06032989.1| TPR domain protein in aerotolerance operon [Vibrio mimicus VM223] gi|262024968|gb|EEY43636.1| TPR domain protein in aerotolerance operon [Vibrio mimicus VM223] Length = 562 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 49/139 (35%), Gaps = 17/139 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S + +M++LD S SM TRS +++LD+ K+ ++G Sbjct: 91 EASPFGEDSASLMVLLDSSESMQQKDIAP----DRLTRSKQKILDLTKA----RKGGKTG 142 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L+ F+ PL + +Q + + + A N + + Sbjct: 143 LMVFAGSAHVAMPLTSDNRVLQPYLAAINPNVMPVEGKAAQSALNLLHEQLPPYVGNT-- 200 Query: 280 HDDYKKYIIFLTDGENSSP 298 ++ +TDG S Sbjct: 201 -------LLLVTDGVTDST 212 >gi|83720829|ref|YP_443052.1| hypothetical protein BTH_I2535 [Burkholderia thailandensis E264] gi|167582052|ref|ZP_02374926.1| hypothetical protein BthaT_28172 [Burkholderia thailandensis TXDOH] gi|167620215|ref|ZP_02388846.1| hypothetical protein BthaB_28164 [Burkholderia thailandensis Bt4] gi|83654654|gb|ABC38717.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 602 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 45/128 (35%), Gaps = 8/128 (6%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 +GS ++ AI + V +G V++ + FFV+ L + D + L A ++ + Sbjct: 23 ERGSFAVAAAIWMLVAIAALG-VVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCS 77 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + + + N L + D + + + S + + Sbjct: 78 QPNAAAAANASSNGFDRAASGNTLTVSCGRWDTQS---NAAPSYFSTTSTPLNAVQVTAT 134 Query: 133 YEMPFIFC 140 +P+ F Sbjct: 135 QNVPYFFL 142 >gi|86132310|ref|ZP_01050905.1| conserved hypothetical protein [Dokdonia donghaensis MED134] gi|85817229|gb|EAQ38412.1| conserved hypothetical protein [Dokdonia donghaensis MED134] Length = 351 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 52/142 (36%), Gaps = 3/142 (2%) Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + PL +QE I+ + + + L A K E + + D + Sbjct: 163 DVLHELQPLTTSATQLQEAIDGVTDDISNDPSTDLYGAVIKAATNAENIVETLENEDLFA 222 Query: 285 KY-IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ TDG + + +E+L + A + IG+ +E ++ L + Sbjct: 223 AASVVIFTDGTDQAARYSEQEALDAVSNAGEE-ISFFTIGLGSEIDEEVLTAIGKTGSAF 281 Query: 344 SVQNSRKLHDAFLRIGKEMVKQ 365 +NS +L F I + Q Sbjct: 282 -AENSNELEAVFNDISNGVAGQ 302 >gi|323351676|ref|ZP_08087330.1| peptidoglycan binding domain protein [Streptococcus sanguinis VMC66] gi|322122162|gb|EFX93888.1| peptidoglycan binding domain protein [Streptococcus sanguinis VMC66] Length = 450 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPTELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|317055297|ref|YP_004103764.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315447566|gb|ADU21130.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 271 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 15/205 (7%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 ++ S L + ++D S SM K+G + E++ I+ + + + V+ + Sbjct: 29 ATAVSKKSLVIFFLIDTSGSMKGK------KMGQLNTVMEELIPEIRKVGEADTDVKVAV 82 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +TF + + W + + + A+ ++ + + Sbjct: 83 LTFDTDV------KWMYSAPISIEDFEWARLGAQGVTSMGAAFTELAARMSRNSFLNSPS 136 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-KNCASP 339 + + +TDG S ++L N + G A+G+ EA D+ L + S Sbjct: 137 LSFAPVMFLMTDGYPSDDYKSGLKALQT-NSWYKFGLKA-ALGIGDEANDEMLAEFTGSK 194 Query: 340 DRFYSVQNSRKLHDAFLRIGKEMVK 364 + +L I + Sbjct: 195 ETVVHAYTGGQLASMIKIIAVTASQ 219 >gi|261415601|ref|YP_003249284.1| protein of unknown function DUF58 [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372057|gb|ACX74802.1| protein of unknown function DUF58 [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325950|gb|ADL25151.1| conserved hypothetical protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 301 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 49/128 (38%), Gaps = 25/128 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + M++++D S S FG G + L + + + N + GL+ ++ ++ Sbjct: 82 MTMLLMVDASSS--SEFGSG----KQMKGEVMATLTALLAFAAIKNNDKVGLLIYTDQVE 135 Query: 229 QTFPLAWGVQHIQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P G +H+ I +++ T + LEYA + + K Sbjct: 136 LFIPPEKGRKHVLRLIREILYFKPQHHGTNTQVALEYAGKIL---------------NRK 180 Query: 285 KYIIFLTD 292 ++ ++D Sbjct: 181 AVVVVMSD 188 >gi|227821561|ref|YP_002825531.1| NorD protein required for nitric oxide reductase (Nor) activity [Sinorhizobium fredii NGR234] gi|227340560|gb|ACP24778.1| NorD protein required for nitric oxide reductase (Nor) activity [Sinorhizobium fredii NGR234] Length = 631 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 65/193 (33%), Gaps = 31/193 (16%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + L +T V +S +D D + VLDV + L + + Sbjct: 431 IHMMSRPQAHDLAVTILVDVSLSTDAWFDDLRVLDVE-------KQALLVLAHGLSACGD 483 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +I+ + VR V + + I+ +I L G T+ + + Sbjct: 484 AHEILTFTSRRRDWVRIETVKAFDEAMSAT--------IEARIAALKPGYYTRIGTAIRH 535 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A + + + K +I LTDG+ + ++S EA+R G Sbjct: 536 AAAGLVERPNRR-----------KLLIVLTDGKPNDVDHYEGRFALEDSRRAVGEARRSG 584 Query: 317 AIVYAIGVQAEAA 329 V+ + V EA Sbjct: 585 VSVFGVTVDREAK 597 >gi|226510867|gb|ACO59960.1| matrilin-1 [Salmo salar] Length = 108 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 26/110 (23%), Positives = 45/110 (40%), Gaps = 15/110 (13%) Query: 218 SGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLE 274 GLV +SS + Q FPL + +++ + ++ T + L Y + D+ Sbjct: 11 VGLVQYSSSVKQEFPLGRYNNKKDLKDAVKKMAYMERGTMTGQALRY----LTDSSFAPA 66 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 A+ K I TDG + D +AK +G ++A+GV Sbjct: 67 GGARPGVA--KVGIVFTDGRSQDYIGD------AAKKAKEQGFKMFAVGV 108 >gi|323977766|gb|EGB72852.1| von Willebrand protein type A [Escherichia coli TW10509] Length = 219 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A N + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALNMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFMITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|289548270|ref|YP_003473258.1| von Willebrand factor type A [Thermocrinis albus DSM 14484] gi|289181887|gb|ADC89131.1| von Willebrand factor type A [Thermocrinis albus DSM 14484] Length = 621 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 75/214 (35%), Gaps = 37/214 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 L +V+DVS SM +KL A +++ + + + S+ +V FS Sbjct: 437 ERKELAFKLVIDVSSSMRKE-----EKLVSAMKALLLVAETLSSMGMPLSVD-----LFS 486 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +++ ++ + K ++ G T L A++ + + Sbjct: 487 ERVMTLKDFDEDYRNFRSKFMQIPSMVGGATNIELALLKAFDHLSSYCKTTHRRG----- 541 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGVQAEAADQFLKNCASPDR 341 +I +DGE + + E + K+ Y +GV + ++ + Sbjct: 542 ---VLILFSDGEPTRG-LKGTELKKIIDTMKKE----YPLVGVGVGQSRNYI------EE 587 Query: 342 FYS-----VQNSRKLHDAFLRIGKEMVKQRILYN 370 ++ V + +L AF + + ++ + Sbjct: 588 YFDRTAIKVSDISRLPSAFSFVLENYFRRLTSVD 621 >gi|182438656|ref|YP_001826375.1| hypothetical protein SGR_4863 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178467172|dbj|BAG21692.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 453 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 51/176 (28%), Gaps = 37/176 (21%) Query: 152 PLLITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 PL ++ + +++++D S SM+ K+ A + +D + Sbjct: 47 PLAGSAPAPAYAPGRGPDAAVVLMVDCSGSMDY----PPTKMRNARDATAAAVDAL---- 98 Query: 211 DVNNVVRSGLVT--------FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 R +V + +E + +L G T L A Sbjct: 99 --REGTRFAVVAGTHVAKDVYPGNGELAVADRRTKARAKEALRKLSAGGGTAIGTWLRLA 156 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-------PNIDNKESLFYCNE 311 + A + H I LTDG N +D+ F C+ Sbjct: 157 DRLLGAADVDIRHG-----------ILLTDGRNEHEAPEDLRAALDSCAGRFTCDA 201 >gi|47213219|emb|CAF89740.1| unnamed protein product [Tetraodon nigroviridis] Length = 1349 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 79/221 (35%), Gaps = 33/221 (14%) Query: 114 LSIIIDDQHKDYNLSAVS--RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM 171 L++ D + + ++ +E + C+ S + + + +D+ Sbjct: 88 LTLTSDPRDSSFVVACGPLWSHECGSSLYSTGICSRVSRTFRPTGTIAPALQRCETFMDI 147 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++VLD S S + + +L P G+V + S +VQ F Sbjct: 148 IIVLDGSNS--------IYPWSEVQNFLINILHKFFMGPGQTQ---VGVVQYGSSVVQEF 196 Query: 232 PLAWGVQHIQEKIN------RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L G E++ R G T++ G+E A ++ F + +K Sbjct: 197 RL--GEHRTVEEVVVAARSIRQRGGEETRTALGMEVARSQGFGRGGRPG--------ARK 246 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 +I +TDGE + D+ + + R YAI V A Sbjct: 247 VMIVITDGE----SHDSAQLPQAVARSNRDNITTYAIAVSA 283 >gi|219126324|ref|XP_002183410.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217405166|gb|EEC45110.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 969 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 56/188 (29%), Gaps = 32/188 (17%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 +S K +D++ ++D S SM D ++ T ++ S N R GL Sbjct: 21 VSVKGQAPVDLVFIIDESGSMGDDQAQIANRANQITAAL-------DSATAGN--FRVGL 71 Query: 221 VTFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLE----YAYNKIFDAKE 271 V + + FP L + L T+ PG A + + Sbjct: 72 VGYGASAFGGFPRKVGTLTDDASMFGAAVASLETSGGTE--PGFVATELTAEDSLLYTTT 129 Query: 272 KLEHIAKGHDDYKKY-------IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 G + + +TD N D +L A + ++ Sbjct: 130 SDSTKLDGSSAGTSFPGPAGFCAVLITD---EPSNGDGATTLADAKTALDNASN--SVFF 184 Query: 325 QAEAADQF 332 ++Q Sbjct: 185 GVVPSNQL 192 >gi|254474080|ref|ZP_05087472.1| nitric oxide reductase D protein [Pseudovibrio sp. JE062] gi|211956776|gb|EEA91984.1| nitric oxide reductase D protein [Pseudovibrio sp. JE062] Length = 650 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 47/336 (13%), Positives = 102/336 (30%), Gaps = 49/336 (14%) Query: 59 TATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII 118 AT++ + + S + W R L ++ + + E ++ Sbjct: 334 AATRLRLHLDLSPEDADHERLSDKYTYPEWDHRQRKFLEDHCRVLE-KDAEPDYESVLLT 392 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG---------- 168 D H+ ++E + + L + + V Sbjct: 393 DAYHRRRIRQVKRQFE-ALRPKRIMQMRQAEGSELDLDALVTAQVDLKASGYASDRIFQD 451 Query: 169 -------LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 L + M+LD S S G + +A ++ + I + D G+ Sbjct: 452 ARAVERDLSVAMLLDTSRSTESAVGDS-SVIEIAGAALAALSGGIDASGDH-----LGVW 505 Query: 222 TFS----SKIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 FS ++ + + KI L T+ + + ++ Sbjct: 506 GFSSLKRDRVFMNKAKGFDEPMTDEVIAKIGGLKPCYYTRLGAAIRHTTAQLAL------ 559 Query: 275 HIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +K ++ LTDG+ + ++S EA+R+G V+ + V + Sbjct: 560 -----QQTQRKLLLVLTDGKPNDLDHYEGIHGIEDSHMAVREARRQGMAVHGVIVDEDGQ 614 Query: 330 DQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 D F + + N +L A I + + ++ Sbjct: 615 DWFARIFGKGG-YTLFPNPERLTRALPDIYRSLTRE 649 >gi|153824999|ref|ZP_01977666.1| transporter [Vibrio cholerae MZO-2] gi|153830336|ref|ZP_01983003.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|148874179|gb|EDL72314.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|149741324|gb|EDM55358.1| transporter [Vibrio cholerae MZO-2] Length = 628 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 VLVLSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|126306639|ref|XP_001365694.1| PREDICTED: similar to Von Willebrand factor A domain containing 1 [Monodelphis domestica] Length = 593 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 68/200 (34%), Gaps = 31/200 (15%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 SS + D++ +LD S S++ + + + +++ + P V+ Sbjct: 4 PATPSSAAHG--DLLFLLDSSASVSYY------EFSRVKEFVGQLVQPLPLGPGS---VQ 52 Query: 218 SGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + +V S+ FP G + T + L A ++F Sbjct: 53 TSMVHVGSEPTVEFPFDRHHSGAAAQEAIQAAKQLMGDTNTGLALALAKKQLFT------ 106 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-L 333 A +K ++++TDG++S + L K G V+ + + L Sbjct: 107 TEAGARSGVRKVLVWVTDGDSSDDVQAPMQVL------KDLGVTVFIVSTG--RGNFLDL 158 Query: 334 KNCAS--PDRFYSVQNSRKL 351 AS P++ + L Sbjct: 159 SAAASQPPEKHLRFVDVDDL 178 >gi|66819075|ref|XP_643197.1| hypothetical protein DDB_G0276383 [Dictyostelium discoideum AX4] gi|60471310|gb|EAL69272.1| hypothetical protein DDB_G0276383 [Dictyostelium discoideum AX4] Length = 2026 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 62/202 (30%), Gaps = 45/202 (22%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + LD++ +D + SM+ + ++ SI + ++ K V GLV + Sbjct: 1686 QVSKSLDLVFCIDSTGSMSGE----IKEVKEKITSIVDRIEKAKV------NVNVGLVFY 1735 Query: 224 SSKIVQTFP----------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + P + +++ I + + ++ Sbjct: 1736 NDHEIIYLPTDKSPTIVYEFTDDIPLVKKNIGTIKAFGGNDHPEAVADGLYEVS------ 1789 Query: 274 EHIAKGHDDYKKYIIFLTDG--------------ENSSPNIDN--KESLFYCNEAKRRGA 317 K + K I +TD E+ PN +S+ RRG Sbjct: 1790 --KLKFRSNASKICILITDAPAHGFDSKLNETYSEDYHPNGCPCGHDSIELVRSLVRRGI 1847 Query: 318 IVYAIGVQ-AEAADQFLKNCAS 338 Y + + + +L AS Sbjct: 1848 TFYTVSCRPTPNSSDYLNAIAS 1869 >gi|229522839|ref|ZP_04412253.1| TPR domain protein in aerotolerance operon [Vibrio cholerae TM 11079-80] gi|229340056|gb|EEO05064.1| TPR domain protein in aerotolerance operon [Vibrio cholerae TM 11079-80] Length = 623 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 VLVLSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|18490111|gb|AAH22236.1| Unknown (protein for IMAGE:4178997) [Homo sapiens] Length = 439 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 54/162 (33%), Gaps = 21/162 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ H A R L ++ P + VR +V +S Q Sbjct: 240 DITILLDGSASVGSHNFDTTKHF--AKRLAERFLTAGRTDPAHD--VRVAVVQYSGTGQQ 295 Query: 230 TFP---LAW--GVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + ++ + F T L Y +A Sbjct: 296 RPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYVTRFYREASSGAA--------- 346 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 KK ++ +DG +S EA+R G ++ + V Sbjct: 347 KKRLLLFSDG--NSQGATPAAIEKAVQEAQRAGIEIFVVVVG 386 >gi|323968040|gb|EGB63450.1| VWA domain containing CoxE protein [Escherichia coli M863] gi|323977690|gb|EGB72776.1| VWA domain containing CoxE protein [Escherichia coli TW10509] gi|327253248|gb|EGE64902.1| von Willebrand factor type A domain protein [Escherichia coli STEC_7v] Length = 378 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTNVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|118081932|ref|XP_414993.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 360 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 53/148 (35%), Gaps = 18/148 (12%) Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTT-KSTPGLEYAYNKIFD 268 +R + + K + L ++ I I + ++ K L +A + + Sbjct: 11 GKGGIRMAVALYGEKPRMSIELTDYVTIEEILVAIQEISIKGSSLKVGSALAFAAHAMSQ 70 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 ++ A K ++ +T G++S D + L G V+A+G++ Sbjct: 71 PATLRDNAA-------KVVVLITSGKSSDLVEDKAQVLQ------DAGVTVFAVGIKDAD 117 Query: 329 ADQFLKNCASP--DRFYSVQNSRKLHDA 354 + K + P + V + LH A Sbjct: 118 KHELNKIASEPTAEHVIYVDDFHLLHSA 145 >gi|55670411|pdb|1V7P|C Chain C, Structure Of Ems16-Alpha2-I Domain Complex Length = 200 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 74/214 (34%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 7 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 55 Query: 229 QTFPLAWGVQHIQEKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L +E++ G T + ++YA + A + G Sbjct: 56 VVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAA------SGGRRS 107 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFLK-- 334 K ++ +TDGE+ ++ K + CN + + I V A +K Sbjct: 108 ATKVMVVVTDGESHDGSM-LKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEI 163 Query: 335 ----NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + F++V + L + +G+++ Sbjct: 164 KAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 197 >gi|85117086|ref|XP_965177.1| hypothetical protein NCU00984 [Neurospora crassa OR74A] gi|28926981|gb|EAA35941.1| predicted protein [Neurospora crassa OR74A] gi|38567055|emb|CAE76353.1| hypothetical protein [Neurospora crassa] Length = 1086 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 75/193 (38%), Gaps = 37/193 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ V D S SM ++ ++R L KSIP + +F S Sbjct: 296 IVFVCDRSGSMGGA------RIEGLKSALRIFL---KSIPVGAKF---NICSFGSTFEFL 343 Query: 231 FPL---AWGVQHIQ---EKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 F ++ + ++ + ++R+ T+ LE A+ K ++ + Sbjct: 344 FSDGSRSYDHESLRLAMDYVSRMDADLGGTEMYQPLEAAFEKRYNDMDLE---------- 393 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-AIVYAIGVQAEAADQFLKNCASPDRF 342 + LTDGE N ++ ++ ++ +G ++ +G+ + + ++ A Sbjct: 394 ---VFLLTDGE--IWNQEHLFTMINKKVSESQGAIRLFTLGIGNDVSHALIEGAARAGNG 448 Query: 343 YS--VQNSRKLHD 353 ++ V +S K++ Sbjct: 449 FAQSVTDSEKMNA 461 >gi|319652226|ref|ZP_08006344.1| hypothetical protein HMPREF1013_02957 [Bacillus sp. 2_A_57_CT2] gi|317396049|gb|EFV76769.1| hypothetical protein HMPREF1013_02957 [Bacillus sp. 2_A_57_CT2] Length = 463 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 62/205 (30%), Gaps = 27/205 (13%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + + +++LD S SM G M K+ +A +++ I +V+ VV Sbjct: 145 SEPGEEKKASEQAIILLDASSSMLLQAGGKM-KMDIAKSAVKSFAQTIGQSSEVSLVVYG 203 Query: 219 --GLVTFSSK------IVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 G + K + + +P+ + + E ++ T ++ A Sbjct: 204 HKGSEADADKEISCSGVEEVYPMGKYSKKEFHEAVDSFESKGWTPLAGAIQKAAEM---- 259 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 D I ++DG + S + IG + Sbjct: 260 --------SSGYDGSTTIYIVSDGAETCDGDPVSASKNLVKNNSSNSVNI--IGFGVDGK 309 Query: 330 DQ-FLKNCASPDR--FYSVQNSRKL 351 + LK A + N +L Sbjct: 310 AENQLKAVAEAGNGEYLKADNPDEL 334 >gi|291443250|ref|ZP_06582640.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] gi|291346197|gb|EFE73101.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] Length = 597 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 68/203 (33%), Gaps = 29/203 (14%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + V+D S SM G ++ V S+ + LD + GL F++ + Sbjct: 388 LTTVVDASGSMATLVPGRNQSRMDVTKESLIQALDQFTPNDE------IGLWEFATTLDG 441 Query: 230 TFPLAWGVQHIQEKINRLIFGSTT------KSTPGLEY---AYNKIFDAKEKLEHIAKGH 280 + +++ G T + GL+ ++D A+ Sbjct: 442 EK--DYRRLMETKRLGDPAAGGGTHREKLTAAFAGLQPVPGGATGLYDTTLASYKEARST 499 Query: 281 DDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAK-----RRGAIVYAIGVQAEAADQFL 333 K+ ++ LTDG N N ++ L E K R V AI V +A + Sbjct: 500 YVKGKFNALVILTDGSNQDTNGISRSGLI--TELKELVDPERPVPVIAIAVGPDADRDEV 557 Query: 334 KNCA--SPDRFYSVQNSRKLHDA 354 A + Y V + ++ Sbjct: 558 AEIARITGGDGYEVSDPAEIQAV 580 >gi|241767270|ref|ZP_04765006.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] gi|241362052|gb|EER58187.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] Length = 374 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 70/204 (34%), Gaps = 35/204 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD----------VNN 214 + L +++ D+SLS + + + V ++ + + D Sbjct: 180 GERSLATLLLADLSLSTDAYATSNARVIDVIRDALYVFGEALSGTGDAFEMLGFSSVRRQ 239 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 VR + ++ F W ++ ++ L G T+ L A ++ E+ Sbjct: 240 HVRI-------QHIKGFGERWN-DTVRSRVGALKPGFYTRMGAALRDATRRLGARPER-- 289 Query: 275 HIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 ++ ++ LTDG+ + +++ EA+ G + + + EA Sbjct: 290 ---------QRLLLVLTDGKPNDLDIYEGRYGLEDTRHAVQEARDAGLTPFCVTIDHEA- 339 Query: 330 DQFLKNCASPDRFYSVQNSRKLHD 353 ++L + V+ + L Sbjct: 340 HEYLPMLFGSQGYALVRRPQDLVQ 363 >gi|222081473|ref|YP_002540836.1| Tetratricopeptide TPR_4 [Agrobacterium radiobacter K84] gi|221726152|gb|ACM29241.1| Tetratricopeptide TPR_4 [Agrobacterium radiobacter K84] Length = 516 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 55/163 (33%), Gaps = 26/163 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++ +D+S +M D +L A IR+++ R+ ++ ++ Sbjct: 93 SLVIAVDLSQTM-DAIDVSPSRLERAKLKIRDVIAA-------RPGARTAIIAYAGSAHL 144 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 PL I+ ++ L T+ P + + K + + I+ Sbjct: 145 VLPLTEDASLIETYVDAL----ATRIMP--KPGKDTTAALKLAAALLKQDGAPGT--ILL 196 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +TDG +S K + + A+G+ A Sbjct: 197 MTDGVETSA----------IEPMKSMTSGIVALGIGTAAGGPV 229 >gi|115622698|ref|XP_001202504.1| PREDICTED: similar to LOC594926 protein, partial [Strongylocentrotus purpuratus] gi|115631776|ref|XP_785188.2| PREDICTED: similar to LOC594926 protein, partial [Strongylocentrotus purpuratus] Length = 338 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 38/103 (36%), Gaps = 10/103 (9%) Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + + +NR+ G T GL A + + + G+ + II LTDG Sbjct: 6 SDNIAAAKRHVNRIRAGGGTNLYDGLRNAVDLLME---------HGNGEAMPLIIMLTDG 56 Query: 294 ENSSPNI-DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + +S ++ E + ++++ FL+ Sbjct: 57 QPTSGSVKSTSEIIQRITNLIDGRLSLFSVSFGNGVDFSFLEK 99 >gi|1575515|gb|AAC47461.1| thrombospondin-related anonymous protein [Plasmodium gallinaceum] Length = 614 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 61/172 (35%), Gaps = 27/172 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ +++D S S + T ++ + +I++++ + L F+ + Sbjct: 43 VDLYLLMDGSGS--------IGYYNWVTYAVPLVEEIVQNLNISKQGIHLYLSVFTHILK 94 Query: 229 QTFPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + PL + + I+ + +T T L F K E Sbjct: 95 EYIPLNSIFSTNRDFALNVIRSLRTKYSQNGSTNLTLALSRVLKNYFLTKGSRE------ 148 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 D + +I TDG + + ++ N+ K+ A IGV +F Sbjct: 149 -DAVQLVIIFTDGSPD----NKESAMKEVNKLKKMKAKFAVIGVGMGINKEF 195 >gi|163848376|ref|YP_001636420.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526300|ref|YP_002570771.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669665|gb|ABY36031.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450179|gb|ACM54445.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 446 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 38/169 (22%), Positives = 55/169 (32%), Gaps = 34/169 (20%) Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 +S + +D A S+ E LD R GL+ +S + LA G Sbjct: 87 LSAEQRSRYRSPIDYTTHALHSLIERLDHND---------RLGLIACASDAIV---LASG 134 Query: 237 -----VQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + I RL G TT GL+ A + A + I+ Sbjct: 135 IPGSRRAELVAAIARLPALRLGETTNLAQGLQLALAQFVAADDATVRR----------IV 184 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG + D L EA RG + IG+ + L A Sbjct: 185 LITDGFTT----DQTLCLTLAREAAARGISLSTIGLGGSFEEHLLTQLA 229 >gi|325179989|emb|CCA14391.1| conserved hypothetical protein [Albugo laibachii Nc14] Length = 2099 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 63/196 (32%), Gaps = 25/196 (12%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRS-IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 VLD S SM+ G D L A R I L D+ ++TF+ + Sbjct: 1862 VLDSSGSMS---GTPWDHLLAAYREYISNRLTSGGLYRDI-----VTVITFACQANIIVE 1913 Query: 233 LAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 G+ E ++ G T + L A + + YK IIF + Sbjct: 1914 ---GLCISSAEHLHIPYNGGGTCYSLALREALGVLS---------RTNYSTYKPAIIFFS 1961 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSR 349 DG P + R G +A+G + LK AS + Sbjct: 1962 DGHPHDPVDGARMGEEISRSYARYGLRAFAVGFGSINLH-VLKKVASKLGGTYMHTMVGN 2020 Query: 350 KLHDAFLRIGKEMVKQ 365 +L F +I + + Sbjct: 2021 ELRTTFCKISASLSTK 2036 >gi|325279874|ref|YP_004252416.1| hypothetical protein Odosp_1180 [Odoribacter splanchnicus DSM 20712] gi|324311683|gb|ADY32236.1| protein of unknown function DUF58 [Odoribacter splanchnicus DSM 20712] Length = 289 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 26/109 (23%), Positives = 46/109 (42%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S +FG + ++ + + + + N + G++ F Sbjct: 72 EEERELTVMLMIDVSAS--RNFGT----ISKLKKNQITEIAAVLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S KI + P G H I+E I+ T LEY N I Sbjct: 126 SDKIEKFIPPKKGRTHILHIIRELIDFYPEDKQTDIEQALEYMTNSIKK 174 >gi|323187784|gb|EFZ73082.1| von Willebrand factor type A domain protein [Escherichia coli RN587/1] Length = 219 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 62/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + R+ L V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNAGLVTFRDELLANSLALKR---VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|260753146|ref|YP_003226039.1| isoleucyl-tRNA synthetase [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|258552509|gb|ACV75455.1| isoleucyl-tRNA synthetase [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 941 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 50/143 (34%), Gaps = 6/143 (4%) Query: 174 VLDVSL-SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 LD + M+ G +D L V S ++L + + D N VR S Sbjct: 623 TLDGTGRKMSKSVGNVIDPLKVINESGADILRMWVASTDYNEDVRISKEVLSGTSDGYRK 682 Query: 233 LAWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L +++ + P LE + ++ H + + +Y+ L Sbjct: 683 LRNSFRYLLGALEGFSEEEKVDLADLPELEKYILHLLAELDQALHESVNGFAFNRYLRLL 742 Query: 291 TDGENSSPN---IDNKESLFYCN 310 +D N+ + D ++ YC+ Sbjct: 743 SDFVNNDLSAFFFDIRKDRLYCD 765 >gi|241760884|ref|ZP_04758973.1| isoleucyl-tRNA synthetase [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374503|gb|EER63964.1| isoleucyl-tRNA synthetase [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 941 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 50/143 (34%), Gaps = 6/143 (4%) Query: 174 VLDVSL-SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 LD + M+ G +D L V S ++L + + D N VR S Sbjct: 623 TLDGTGRKMSKSVGNVIDPLKVINESGADILRMWVASTDYNEDVRISKEVLSGTSDGYRK 682 Query: 233 LAWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L +++ + P LE + ++ H + + +Y+ L Sbjct: 683 LRNSFRYLLGALEGFSEEEKVDLADLPELEKYILHLLAELDQALHESVNGFAFNRYLRLL 742 Query: 291 TDGENSSPN---IDNKESLFYCN 310 +D N+ + D ++ YC+ Sbjct: 743 SDFVNNDLSAFFFDIRKDRLYCD 765 >gi|77919348|ref|YP_357163.1| hypothetical protein Pcar_1750 [Pelobacter carbinolicus DSM 2380] gi|77545431|gb|ABA88993.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380] Length = 421 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 41/125 (32%), Gaps = 1/125 (0%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 + G++ +L AILL + + L I+ H + V+ +L D L A ++ + Sbjct: 8 LKKDQNGAVIVLVAILLILFLGIAALAIDVYHVYVVRNELQNAADAGALAGARELYLESG 67 Query: 69 GNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLS 128 + N + D E+ N A + S + + Sbjct: 68 ASV-NPNANVIANNTAIENISEDVPVEVNYNAAANTGDVQRGHWSFAARQFTPNGSLTAI 126 Query: 129 AVSRY 133 V Y Sbjct: 127 DVGNY 131 >gi|5932379|gb|AAD56932.1|AF180145_24 isoleucyl-tRNA synthetase [Zymomonas mobilis subsp. mobilis ZM4] Length = 941 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 50/143 (34%), Gaps = 6/143 (4%) Query: 174 VLDVSL-SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 LD + M+ G +D L V S ++L + + D N VR S Sbjct: 623 TLDGTGRKMSKSVGNVIDPLKVINESGADILRMWVASTDYNEDVRISKEVLSGTSDGYRK 682 Query: 233 LAWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L +++ + P LE + ++ H + + +Y+ L Sbjct: 683 LRNSFRYLLGALEGFSEEEKVDLADLPELEKYILHLLAELDQALHESVNGFAFNRYLRLL 742 Query: 291 TDGENSSPN---IDNKESLFYCN 310 +D N+ + D ++ YC+ Sbjct: 743 SDFVNNDLSAFFFDIRKDRLYCD 765 >gi|56551219|ref|YP_162058.1| isoleucyl-tRNA synthetase [Zymomonas mobilis subsp. mobilis ZM4] gi|81820925|sp|Q5NQQ7|SYI_ZYMMO RecName: Full=Isoleucyl-tRNA synthetase; AltName: Full=Isoleucine--tRNA ligase; Short=IleRS gi|56542793|gb|AAV88947.1| isoleucyl-tRNA synthetase [Zymomonas mobilis subsp. mobilis ZM4] Length = 941 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 50/143 (34%), Gaps = 6/143 (4%) Query: 174 VLDVSL-SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 LD + M+ G +D L V S ++L + + D N VR S Sbjct: 623 TLDGTGRKMSKSVGNVIDPLKVINESGADILRMWVASTDYNEDVRISKEVLSGTSDGYRK 682 Query: 233 LAWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L +++ + P LE + ++ H + + +Y+ L Sbjct: 683 LRNSFRYLLGALEGFSEEEKVDLADLPELEKYILHLLAELDQALHESVNGFAFNRYLRLL 742 Query: 291 TDGENSSPN---IDNKESLFYCN 310 +D N+ + D ++ YC+ Sbjct: 743 SDFVNNDLSAFFFDIRKDRLYCD 765 >gi|222033838|emb|CAP76579.1| Uncharacterized protein yegl [Escherichia coli LF82] gi|312946695|gb|ADR27522.1| hypothetical protein NRG857_10535 [Escherichia coli O83:H1 str. NRG 857C] Length = 219 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNTGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|158261719|dbj|BAF83037.1| unnamed protein product [Homo sapiens] Length = 943 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 77/214 (35%), Gaps = 43/214 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQAAEFYLMQI---VEIHTFV--GIASFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + + + T + GL+ + + Y Sbjct: 362 AQLHQINSNDDRKLLVSYL-PTTVSAKTDISICSGLKKGFEVV---------EKLNGKAY 411 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPD 340 +I +T G++ + L C G+ +++I + + AA L Sbjct: 412 GSVMILVTSGDD--------KLLGNCLPTVLSSGSTIHSIALGSSAAPNLEELSRLTGGL 463 Query: 341 RFY--SVQNSRKLHDAFLRIGK---EMVKQRILY 369 +F+ + NS + DAF RI ++ +Q I Sbjct: 464 KFFVPDISNSNSMIDAFSRISSGTGDIFQQHIQL 497 >gi|126309704|ref|XP_001376377.1| PREDICTED: similar to complement C2 [Monodelphis domestica] Length = 822 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 60/171 (35%), Gaps = 23/171 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L++ ++LD S S++ + + S + M+D I S V G+V F+ Sbjct: 318 LNLYLLLDSSQSVS------TEDFAIFKESAQLMVDRIFSFDVN---VSVGIVIFAKSPK 368 Query: 229 QTFPLA----WGVQHIQEKINRLI-----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + I +K+ L G+ T + Y + + K Sbjct: 369 VILSVTHKDSRDEMEIAKKLEDLKYGDPDIGTGTNINKAMMQIYEMMNNEMAIFGGRQKD 428 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQ 325 + + II LTDG+++ + + + +YAIGV Sbjct: 429 WEKIRHVIILLTDGKSNMGGSPTEAVKKIKEVLNIRQERTDYLDIYAIGVG 479 >gi|4887601|dbj|BAA77810.1| chloride channel protein [Homo sapiens] Length = 943 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 77/214 (35%), Gaps = 43/214 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQAAEFYLMQI---VEIHTFV--GIASFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + + + T + GL+ + + Y Sbjct: 362 AQLHQINSNDDRKLLVSYL-PTTVSAKTDISICSGLKKGFEVV---------EKLNGKAY 411 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPD 340 +I +T G++ + L C G+ +++I + + AA L Sbjct: 412 GSVMILVTSGDD--------KLLGNCLPTVLSSGSTIHSIALGSSAAPNLEELSRLTGGL 463 Query: 341 RFY--SVQNSRKLHDAFLRIGK---EMVKQRILY 369 +F+ + NS + DAF RI ++ +Q I Sbjct: 464 KFFVPDISNSNSMIDAFSRISSGTGDIFQQHIQL 497 >gi|281207766|gb|EFA81946.1| hypothetical protein PPL_05180 [Polysphondylium pallidum PN500] Length = 1990 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 73/224 (32%), Gaps = 43/224 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-- 226 LD++ V+D + SM ++ + I+ ++D I SI + VR +V ++ Sbjct: 1662 LDLVFVVDDTGSMG-------SEIAKVKQEIQNIVDDIVSIGSIE--VRVAMVFYNDHTP 1712 Query: 227 -------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + F + ++ ++ ++ + + ++ Sbjct: 1713 NSDHSKSVCKVFKFTSDIPELRRGLDSVVVHGGADHPEAMADGFYEVT--------KLDF 1764 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYC----------NEAKRRGAIVYAIGVQAEAA 329 K I + D + C + + G Y + + ++ Sbjct: 1765 AKSSTKVCIVIGDAPPHGFSGSGDSFPQGCPCGHDLIASVRQLVQGGVTFYTVMCRGDSQ 1824 Query: 330 D-QFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 + L A S RF + N+ +L + I K IL + Sbjct: 1825 TYETLNAIADLSEGRFVLLNNASELTE----IITGSAKASILLD 1864 >gi|258627122|ref|ZP_05721918.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258580640|gb|EEW05593.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 560 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 47/139 (33%), Gaps = 17/139 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S + +M++LD S SM D+L + + I ++ + ++G Sbjct: 89 EASPFGEDSASLMVLLDSSESMQQKDIAP-DRLTRSKQKILDLTEA-------RKGGKTG 140 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L+ F+ PL + +Q + + + A N + + Sbjct: 141 LMVFAGSAHVAMPLTSDNRVLQPYLAAINPNVMPVEGKAAQSALNLLHEQLPPYVGNT-- 198 Query: 280 HDDYKKYIIFLTDGENSSP 298 ++ +TDG S Sbjct: 199 -------LLLVTDGVTDST 210 >gi|237715741|ref|ZP_04546222.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262408749|ref|ZP_06085295.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|229444450|gb|EEO50241.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262353614|gb|EEZ02708.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] Length = 616 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 74/195 (37%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N++H + I K I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMESGACPW--NTNHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V SG S+ + Q I+E I+ L G Sbjct: 267 LDLVKSSLKLLVNNLRDKDKVAIVTYSG----SAGVKLEATPGSDKQKIREAIDELTAGG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II +DG+ + + + Sbjct: 323 STAGGAGILLAYKIAKKNLISNGNNR---------IILCSDGDFNVGVSSAEGLEQLIEK 373 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 374 ERKSGVFLTVLGYGM 388 >gi|5729769|ref|NP_006527.1| calcium-activated chloride channel regulator 2 precursor [Homo sapiens] gi|189082520|sp|Q9UQC9|CLCA2_HUMAN RecName: Full=Calcium-activated chloride channel regulator 2; AltName: Full=Calcium-activated chloride channel family member 2; Short=hCLCA2; AltName: Full=Calcium-activated chloride channel protein 3; Short=CaCC-3; Short=hCaCC-3; Contains: RecName: Full=Calcium-activated chloride channel regulator 2, 109 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 2, 35 kDa form; Flags: Precursor gi|5138900|gb|AAD40367.1|AF043977_1 calcium-activated chloride channel-2 [Homo sapiens] gi|5726287|gb|AAD48397.1|AF127980_1 calcium-activated chloride channel protein 3 [Homo sapiens] gi|26996807|gb|AAH41096.1| Chloride channel accessory 2 [Homo sapiens] gi|119593593|gb|EAW73187.1| chloride channel, calcium activated, family member 2 [Homo sapiens] Length = 943 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 77/214 (35%), Gaps = 43/214 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQAAEFYLMQI---VEIHTFV--GIASFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + + + T + GL+ + + Y Sbjct: 362 AQLHQINSNDDRKLLVSYL-PTTVSAKTDISICSGLKKGFEVV---------EKLNGKAY 411 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPD 340 +I +T G++ + L C G+ +++I + + AA L Sbjct: 412 GSVMILVTSGDD--------KLLGNCLPTVLSSGSTIHSIALGSSAAPNLEELSRLTGGL 463 Query: 341 RFY--SVQNSRKLHDAFLRIGK---EMVKQRILY 369 +F+ + NS + DAF RI ++ +Q I Sbjct: 464 KFFVPDISNSNSMIDAFSRISSGTGDIFQQHIQL 497 >gi|84498675|ref|ZP_00997432.1| Mg-chelatase subunit ChlI [Janibacter sp. HTCC2649] gi|84381072|gb|EAP96958.1| Mg-chelatase subunit ChlI [Janibacter sp. HTCC2649] Length = 730 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 43/140 (30%), Gaps = 18/140 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQ 229 +++ +D S SM ++ ++ +L + R GLVTF + Sbjct: 541 VLLAVDASGSMG-----ARRRMEEVKTAVLSLL-----LDAYQRRDRVGLVTFRGQGAEV 590 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 V + L G T GL A + E + ++ Sbjct: 591 ALHPTSSVDAAARSLTTLAHGGRTPLAEGLLMATKVL-----AREKLRDPKRRP--LLVV 643 Query: 290 LTDGENSSPNIDNKESLFYC 309 +TDG ++ + S Sbjct: 644 VTDGRATAGPDALERSRTIA 663 >gi|317122034|ref|YP_004102037.1| hypothetical protein Tmar_1197 [Thermaerobacter marianensis DSM 12885] gi|315592014|gb|ADU51310.1| hypothetical protein Tmar_1197 [Thermaerobacter marianensis DSM 12885] Length = 207 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 19/159 (11%), Positives = 49/159 (30%), Gaps = 5/159 (3%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 G+++ ++LPV+ +GL ++ V+A + D + L +I Sbjct: 49 GAVAAAFVLVLPVLLAAVGLGLDAGRLVVVRAHAQAVADLAGLAAVQEIDEDAFARGEPA 108 Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDIN---NIERSTSLSIIIDDQHKDYNLSAVS 131 + + + + R + A+D + ++ S Sbjct: 109 LREAAAAATARQWAEDGLRRAFGD-AVAEDATVDVVVVNASPASPRRHPWSGRRVAEPTV 167 Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLD 170 + + W + PL +T+ ++ + Sbjct: 168 GVRL-VVPVRLGWLPAVAAIPLTVTADASVAVERQTAAS 205 >gi|290957873|ref|YP_003489055.1| hypothetical protein SCAB_34071 [Streptomyces scabiei 87.22] gi|260647399|emb|CBG70504.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 253 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 48/192 (25%), Positives = 69/192 (35%), Gaps = 27/192 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKL-GVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + + +++ LD S SM GP + L RE+ D + ++ V +VT Sbjct: 20 NNAQRMPLVLCLDTSSSMA---GPPIQTLNNALAEWTRELHDDVS----LSYSVEVAVVT 72 Query: 223 FSSKIVQT---FPLAWGVQHIQEKINR-------LIFGSTTKSTPGLEYAYNKIFDAKEK 272 F + V L I L T T LE A + + A K Sbjct: 73 FGGQGVGAWRGPQLLDPRTRTSPFIPAHAFQAPQLTAAGVTLMTEALELAMHIV--AARK 130 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPN---IDNKESLFY--CNEAKRRGAIVYAIGVQ-- 325 E A G Y+ I +TDG + P D+ L E R +YAIGV Sbjct: 131 SELRASGLQYYRPQICLVTDGLPTDPTGHLTDSWHRLVPVLAEEQSARRFRLYAIGVGGI 190 Query: 326 AEAADQFLKNCA 337 + +Q LK A Sbjct: 191 TDRGEQVLKAFA 202 >gi|29830947|ref|NP_825581.1| hypothetical protein SAV_4404 [Streptomyces avermitilis MA-4680] gi|29608060|dbj|BAC72116.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 582 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 65/195 (33%), Gaps = 23/195 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +VLD S SM ++ + +++ E + ++ +V FS Sbjct: 394 TGTRAKVYLVLDRSASMRPYY------KDGSAQALGE--QTLALAAHLDPEATVHVVFFS 445 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 +++ T L KI+ L A + EK Sbjct: 446 TELDGTGELTLTEHE--NKIDELHTALGRMGRTSYHAAVEAVLAHHEKSADPKAPAL--- 500 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA-IGVQAEAADQF--LKNCASPDR 341 ++F TDG +P+ + + AK + ++ + F L+ + + Sbjct: 501 --VVFQTDG---APDAKTPATQALTDAAKNHPTVFFSFVAFGEHDNKAFDYLRKLKTGNT 555 Query: 342 -FYSVQN-SRKLHDA 354 F+ R+L DA Sbjct: 556 AFFHAGPTPRELTDA 570 >gi|282877525|ref|ZP_06286343.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310] gi|281300349|gb|EFA92700.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310] Length = 296 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 41/108 (37%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLDF------GTKKQTKREMATEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I + T +EY + Sbjct: 126 SDRIEKYIPPKKGRKHILYIIREMLDFDPESKKTDVAMAIEYLTRMMK 173 >gi|228995959|ref|ZP_04155616.1| Von Willebrand factor type A domain protein [Bacillus mycoides Rock3-17] gi|229003576|ref|ZP_04161393.1| Von Willebrand factor type A domain protein [Bacillus mycoides Rock1-4] gi|228757703|gb|EEM06931.1| Von Willebrand factor type A domain protein [Bacillus mycoides Rock1-4] gi|228763818|gb|EEM12708.1| Von Willebrand factor type A domain protein [Bacillus mycoides Rock3-17] Length = 627 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSATPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRQEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|254225238|ref|ZP_04918851.1| conserved hypothetical protein [Vibrio cholerae V51] gi|125622337|gb|EAZ50658.1| conserved hypothetical protein [Vibrio cholerae V51] Length = 619 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 VLVLSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|302754784|ref|XP_002960816.1| hypothetical protein SELMODRAFT_402200 [Selaginella moellendorffii] gi|300171755|gb|EFJ38355.1| hypothetical protein SELMODRAFT_402200 [Selaginella moellendorffii] Length = 686 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 64/191 (33%), Gaps = 25/191 (13%) Query: 170 DMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVV----RSGL-VT 222 + ++LD S SM++ G + VA I ++L+ + + V + G V Sbjct: 196 SLYILLDTSTSMSNPTGVLSSQTRFNVANNIINQLLNTLTNGDQVAVSTIGGEKIGAPVK 255 Query: 223 FSSKI-----VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 FSS+ + L + I + + S T S ++ D ++ Sbjct: 256 FSSETRIISDYEETSLY--LAGISSLKDSISNTSVTNSASNIKNGLQAALDFFNTSSNLN 313 Query: 278 KGHDDYKKYIIFLTDGE---NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 I TDG+ + N + A+ VY IG F + Sbjct: 314 --------VITLFTDGQLVIPGNFNFTQLSPVLAQLNARNVVVFVYRIGSFTSNDATFQQ 365 Query: 335 NCASPDRFYSV 345 +S + Y V Sbjct: 366 MQSSLNMSYEV 376 >gi|262381908|ref|ZP_06075046.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|262297085|gb|EEY85015.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] Length = 289 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 47/109 (43%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S + + V + + + + + N + G+V F Sbjct: 72 EEERELTVMLLIDVSGSRDF------GSVNVMKKEVITEIAATLAFSAIQNNDKIGVVFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S KI + P G +H I+E I+ + T + L+Y N I Sbjct: 126 SDKIEKFIPPQKGKKHILYIIRELIDFHPEETRTDISQVLKYLTNAIKK 174 >gi|195398375|ref|XP_002057797.1| GJ17905 [Drosophila virilis] gi|194141451|gb|EDW57870.1| GJ17905 [Drosophila virilis] Length = 411 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 75/195 (38%), Gaps = 26/195 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRS 218 + D+M++LD S M++ +A + +LD + VN +V++ Sbjct: 150 AASSPKDIMILLDASSGMSEK------SFELAMATAFNILDTLGEDDFVNLITFSEMVKT 203 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + F ++V+ P VQ I+ + + T T GLEYA++ + + Sbjct: 204 PVPCFKDRMVRATP--DNVQEIKSAVKAVKLQDTANFTAGLEYAFSLLHKYNQSGA---- 257 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF-LK--N 335 + I+ +T+ + S K+ + ++ + +++ + L Sbjct: 258 -GSQCNQAIMLITESTSESHKEIIKQYNWP-----HMPVRIFTYLIGSDSGSRSNLHEMA 311 Query: 336 CASPDRFYSVQNSRK 350 C++ F + + + Sbjct: 312 CSNKGFFVQINDYEE 326 >gi|121602946|ref|YP_980275.1| vault protein inter-alpha-trypsin subunit [Polaromonas naphthalenivorans CJ2] gi|120591915|gb|ABM35354.1| Vault protein inter-alpha-trypsin domain protein [Polaromonas naphthalenivorans CJ2] Length = 701 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 76/232 (32%), Gaps = 29/232 (12%) Query: 72 GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS 131 Q + ++ I + E+ + + D+ + + N + Sbjct: 219 PAGQASATAFDIKVKLATPIGIKEVSSHSHSIDVTKDGDERAAVSLRSGDKPGNNRDFIL 278 Query: 132 RYEMP---------FIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSM 181 Y + T A+ + L + K +++++ D + V+D+S SM Sbjct: 279 DYRLAGERIESGVMLYQGTPGNGASGENFFLAMIEPPKQVAAQAISPRDYIFVVDISGSM 338 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 G +D R + L + N ++ SG F S A ++ Sbjct: 339 ---HGFPLDTAKTLMRELIGKLRPSDTF---NVLLFSGSNRFLSPASVPATQA-NIEQAV 391 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 I+ + G T+ P L+ Y + A D + ++ +TDG Sbjct: 392 RTIDEMGGGGGTELIPALKRVYAEPKAA------------DVSRTVVVVTDG 431 >gi|325694418|gb|EGD36328.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK150] Length = 450 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|303272389|ref|XP_003055556.1| predicted protein [Micromonas pusilla CCMP1545] gi|226463530|gb|EEH60808.1| predicted protein [Micromonas pusilla CCMP1545] Length = 802 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 68/236 (28%), Gaps = 27/236 (11%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGV 194 P P + + ++ + K G ++ V+D S SM + + L Sbjct: 268 PTHILGQPPPRDPRSSFVVALAPPKPERCLAFGRSVVFVIDRSGSMNGEPMEAANEALTT 327 Query: 195 ATRSIREM-LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW-GVQHIQEKINRLIFGST 252 RS+ E I + D + +T ++ +AW + T Sbjct: 328 GLRSLTEHDYFNICAFDDGQEYFDANAMTQATPKNVERAMAWMNEHCVARY--------T 379 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T L A + A ++ +TDG S K + + Sbjct: 380 TDIYTPLSEALKLL-------AGCAGNGTVP--FVFLITDGAVSDEKEICKMLMAESQQK 430 Query: 313 KRRGAIVYAIGVQAEAADQFLKNCASPDR-FYSVQNSRKLHDAFLRIGKEMVKQRI 367 V G+ FLK A+ R + + +I +M K Sbjct: 431 GEALPRVCTFGIGQYCNHYFLKMLANIGRGLFDAAFTND------KIATQMSKMLT 480 >gi|226308012|ref|YP_002767972.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] gi|226187129|dbj|BAH35233.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] Length = 551 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 57/376 (15%), Positives = 111/376 (29%), Gaps = 60/376 (15%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 + G ++ ++A L+PV G + ET ++ +D S T E Sbjct: 196 QSRGGDLTSISAALVPVAQAQYGNLKETGSTG----RIQT-VDASGGTAVTTEQTFEAYL 250 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 + D L G + + + D + + Sbjct: 251 AAHADSRLTASVPSSGSVFMD--YPLASLGADSSAADDAGKLLAAALASDTGRS----VL 304 Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM-------MVLDVSLSMND 183 + P L+ S + +K+ +++ V+DVS SM D Sbjct: 305 AGLGFRGADQAPPVAGGVGKVGSLVPSDPSVVTKALARYEILSRPSRALAVVDVSGSM-D 363 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS---------------SKIV 228 + G+ ++ ++ I+ P GL FS +++ Sbjct: 364 YMQDGVTRMAATAQAGDI---AIRMFPANAQ---LGLWAFSVDLGDGTDYRELEPVARMD 417 Query: 229 QTFPLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 T + +I+ L G T + AY + + + + Sbjct: 418 ATEGGTDHRSKLLSRIDNLSSIVGGGTGLYDSVLAAYRSMQQSYDPASINS--------- 468 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKR-----RGAIVYAIGVQAEAADQFLKNCA--SP 339 +I LTDG N P+ + L + R R + IGV +A L+ + + Sbjct: 469 VILLTDGANDDPSGIALQELL--DTLTREQDPMRPVPIITIGVTDDADTDVLEQISTLTG 526 Query: 340 DRFYSVQNSRKLHDAF 355 + + F Sbjct: 527 GNSHFAPTPADIPKVF 542 >gi|222478564|ref|YP_002564801.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451466|gb|ACM55731.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 571 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 45/274 (16%), Positives = 82/274 (29%), Gaps = 58/274 (21%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGM---DKLGVATRSIREML----- 203 L +D+M+ LD S SM G+ D + V +S E Sbjct: 207 KLWYDEDCDNVHDDAGPIDIMLTLDFSGSMLYDQYGGVVSTDPIQVDGQSYGETTKIDLV 266 Query: 204 -----DIIKSIPDVNNVVRSGLVTFSSKIVQTF--------PLAWGVQHIQEKINRL--- 247 I + N V+ G+V F + PL + + ++ L Sbjct: 267 ELGTRQFIDYLQAQNADVQVGVVYFDGEGSGENTPRTGILEPLTTNLSAVDTALSNLRQK 326 Query: 248 IFGSTTKSTP-----------------GLEYA-YNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + + + P G+ Y + E + G +K I Sbjct: 327 LANVVSDAAPSTPFDNDGNPDPYSNADGIATGTYISEGLDDAQTELASNGRASAEKRNIV 386 Query: 290 LTDGE------NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--- 340 L+DGE N++ + + VY I V A+ L+ A P Sbjct: 387 LSDGESFNGDGNTNYAPPASAAANARAASPAPATDVYTINVNGSAST--LQAMAGPAGGS 444 Query: 341 -----RFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 F + + + F + + V ++++ Sbjct: 445 GGDPVFFNDINDPLNIPTVFGNLAAQTVAEKVIM 478 >gi|325696397|gb|EGD38287.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK160] Length = 450 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|228471021|ref|ZP_04055865.1| conserved hypothetical protein [Porphyromonas uenonis 60-3] gi|228307241|gb|EEK16264.1| conserved hypothetical protein [Porphyromonas uenonis 60-3] Length = 290 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 56/131 (42%), Gaps = 17/131 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + + ++N R GL+ + Sbjct: 70 EEERELTIMLLVDVSHSLDF------GSTSETKRDLVATIAATIAFACIHNNDRVGLMLY 123 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + ++ + P G +H+ + I ++ +++ +I + E L + K Sbjct: 124 TDRVERYIPAGQGRKHVLQLIREILTYRPERNS-------TQISSSLEMLSRVVKKRCSA 176 Query: 284 KKYII--FLTD 292 +I+ F+TD Sbjct: 177 --FIVSDFITD 185 >gi|239939820|ref|ZP_04691757.1| hypothetical protein SrosN15_02385 [Streptomyces roseosporus NRRL 15998] gi|239986306|ref|ZP_04706970.1| hypothetical protein SrosN1_03262 [Streptomyces roseosporus NRRL 11379] Length = 592 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 68/203 (33%), Gaps = 29/203 (14%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + V+D S SM G ++ V S+ + LD + GL F++ + Sbjct: 383 LTTVVDASGSMATLVPGRNQSRMDVTKESLIQALDQFTPNDE------IGLWEFATTLDG 436 Query: 230 TFPLAWGVQHIQEKINRLIFGSTT------KSTPGLEY---AYNKIFDAKEKLEHIAKGH 280 + +++ G T + GL+ ++D A+ Sbjct: 437 EK--DYRRLMETKRLGDPAAGGGTHREKLTAAFAGLQPVPGGATGLYDTTLASYKEARST 494 Query: 281 DDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAK-----RRGAIVYAIGVQAEAADQFL 333 K+ ++ LTDG N N ++ L E K R V AI V +A + Sbjct: 495 YVKGKFNALVILTDGSNQDTNGISRSGLI--TELKELVDPERPVPVIAIAVGPDADRDEV 552 Query: 334 KNCA--SPDRFYSVQNSRKLHDA 354 A + Y V + ++ Sbjct: 553 AEIARITGGDGYEVSDPAEIQAV 575 >gi|226360795|ref|YP_002778573.1| hypothetical protein ROP_13810 [Rhodococcus opacus B4] gi|226239280|dbj|BAH49628.1| hypothetical protein [Rhodococcus opacus B4] Length = 891 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 33/234 (14%), Positives = 73/234 (31%), Gaps = 51/234 (21%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIV 228 D+M+++D S S+ D + +L+ + + V + + F+ Sbjct: 45 DLMLMVDQSGSLQGS-----DPDAARVSAANYLLEQLNTFGGSAGVELNVAIAGFADDFT 99 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK-LEHIAKGHDDYKKYI 287 P + + T++ +N + A+ EH ++ + + + Sbjct: 100 VHAPWTRLDNGSLPALQGEVEKFRTRTDGIDTDYWNALDGARRTLAEHDSQSEANRCQAV 159 Query: 288 IFLTDGENSSPNIDNKESLF---------------------------YCNEAKRRGAIVY 320 + +DG+ D ++ ++ + G + + Sbjct: 160 AWFSDGKLDFTVRDAEKPYAQGISLRSDQGVQQVVAAARESICRPAGIADQLRSSGIVTF 219 Query: 321 AIGVQAEAAD----QFLKNCA-------------SPDRFYSVQNSRKLHDAFLR 357 A+G+ A A +++ A SP FY QN L AF Sbjct: 220 AVGLAAGTAQPSDFDLMRSIATGGDGACGKTTSPSPGDFYLAQNIDDLLFAFDA 273 >gi|327474184|gb|EGF19594.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK408] Length = 450 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|323490190|ref|ZP_08095408.1| von Willebrand factor type A [Planococcus donghaensis MPA1U2] gi|323396119|gb|EGA88947.1| von Willebrand factor type A [Planococcus donghaensis MPA1U2] Length = 617 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 61/170 (35%), Gaps = 28/170 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + ++ +++D S SM +DKL +++ D+++ + + +V Sbjct: 412 KTAPSKELDAVFCLLIDGSASM-------LDKLEETKQAVLLFHDVLRGLNVPHEIVLFY 464 Query: 220 LVTF-SSKIVQTFPLAW-----GVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + +S Q W E I L + + + N++ EK Sbjct: 465 EDAYEASDAEQPNYFEWMHKFEDRNKDHAETIASLDAHEDNRDGFAIRWMNNRLKRRPEK 524 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN---EAKRRGAIV 319 ++++ +DGE S+ N + N EAK+ G V Sbjct: 525 H-----------RFLLVFSDGEPSAYNYAENGVVDTANAVSEAKKMGIEV 563 >gi|270014558|gb|EFA11006.1| hypothetical protein TcasGA2_TC004591 [Tribolium castaneum] Length = 1235 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 75/200 (37%), Gaps = 24/200 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT------F 223 D+++++D S SM D G + L V +LD + +N + S T F Sbjct: 234 DVIILVDNSGSM-DGMGRHIASLTV-----NTILDTFSNNDYINILYYSNQTTNYTIPCF 287 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG---- 279 + +VQ P + +E I L T L+ A++ + + +E + Sbjct: 288 RNLLVQATPE--NIVLFKEAIRHLGPSGKTDFPQALQMAFDILENYREIRGCNNEEIDEE 345 Query: 280 --HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLK-- 334 + I+ +TDG + + + + + K ++ + E + ++ Sbjct: 346 GKSKACNQAIMLITDGISRNFSDIVMRNNQL-DGGKTIPVRIFTYLIGKEVTNVEEIRWM 404 Query: 335 NCASPDRFYSVQNSRKLHDA 354 CA+ + VQ ++ A Sbjct: 405 ACANRGFYTQVQTLEQVTSA 424 >gi|228989769|ref|ZP_04149749.1| Von Willebrand factor type A domain protein [Bacillus pseudomycoides DSM 12442] gi|228769916|gb|EEM18499.1| Von Willebrand factor type A domain protein [Bacillus pseudomycoides DSM 12442] Length = 627 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSATPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRQEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|189233997|ref|XP_971945.2| PREDICTED: similar to AGAP009579-PA [Tribolium castaneum] Length = 1056 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 75/200 (37%), Gaps = 24/200 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT------F 223 D+++++D S SM D G + L V +LD + +N + S T F Sbjct: 234 DVIILVDNSGSM-DGMGRHIASLTV-----NTILDTFSNNDYINILYYSNQTTNYTIPCF 287 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG---- 279 + +VQ P + +E I L T L+ A++ + + +E + Sbjct: 288 RNLLVQATPE--NIVLFKEAIRHLGPSGKTDFPQALQMAFDILENYREIRGCNNEEIDEE 345 Query: 280 --HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLK-- 334 + I+ +TDG + + + + + K ++ + E + ++ Sbjct: 346 GKSKACNQAIMLITDGISRNFSDIVMRNNQL-DGGKTIPVRIFTYLIGKEVTNVEEIRWM 404 Query: 335 NCASPDRFYSVQNSRKLHDA 354 CA+ + VQ ++ A Sbjct: 405 ACANRGFYTQVQTLEQVTSA 424 >gi|340082|gb|AAA36794.1| undulin 1 [Homo sapiens] Length = 843 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 61/162 (37%), Gaps = 15/162 (9%) Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPL-AWGVQ-HIQEKINRLIFGSTTKSTPGLEYAYNKI 266 + R GL +S + L A+ + + E + L + T A N I Sbjct: 1 FDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG---LALNYI 57 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 F+ K E ++ K I +TDG++ I +L + G ++AIGV+ Sbjct: 58 FENSFKPEAGSRTG--VSKIGILITDGKSQDDIIPPSRNL------RESGVELFAIGVKN 109 Query: 327 EAADQFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEMVKQR 366 ++ + + PD Y+V +H + + + + Sbjct: 110 ADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 151 >gi|4433775|dbj|BAA61542.1| CbbO [Hydrogenophilus thermoluteolus] Length = 779 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 69/202 (34%), Gaps = 37/202 (18%) Query: 166 DIGLDMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSI----------PDVNN 214 + + ++LD+S S N+ G L + + + + I+ I D Sbjct: 586 KRDIAVCLLLDLSASTNETPHGAHHSVLDASRAATLLLAEAIERIGDPLAILGFHSDGRE 645 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 VR ++ F WG ++ ++++ + +T+ L +A + +K Sbjct: 646 DVRL-------YPIKRFGERWG-ENAKKRLAGINGAFSTRMGAALRHAGTLLKQQPQKR- 696 Query: 275 HIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 K ++ +TDGE + PN +++ +E G VY + + Sbjct: 697 ----------KLLLLVTDGEPADIDERDPNHLRRDARRAVDELWGNG--VYTYCLTLDPG 744 Query: 330 DQFLKNCASPDRFYSVQNSRKL 351 + + KL Sbjct: 745 ADRYVAQIFGKHYTVIDRVDKL 766 >gi|116670917|ref|YP_831850.1| von Willebrand factor, type A [Arthrobacter sp. FB24] gi|116611026|gb|ABK03750.1| von Willebrand factor, type A [Arthrobacter sp. FB24] Length = 340 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 66/227 (29%), Gaps = 48/227 (21%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S D+++ LD S SM+ ++ R GL Sbjct: 88 SPEQRNRDIILCLDASGSMSSADAAVVEVFARLAAGFDGE--------------RLGLTV 133 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRL------------------IFGSTTKSTPGLEYAYN 264 F S VQ FPL +Q ++ ++ GL Sbjct: 134 FDSSAVQVFPLTDDYDVVQGQLEAARKAFDGAPGSAAFLDGTWNGAGSSLIGDGLASCVQ 193 Query: 265 KIFDAKEKL---EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE-AKRRGAIVY 320 E G ++ + ++ TD + + + +L AK++ VY Sbjct: 194 GFPSNGGDTGTGEQAGSGREERSRSVVLATD---NFISGEPIFTLQEAAALAKKQDVRVY 250 Query: 321 AI-------GVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 A+ G + L+ + +Y + + + + R+ Sbjct: 251 ALNPGDFDYGTDPDQPGVQLRTAVEGTGGAYYPLDSPEAVGEIIRRV 297 >gi|297193194|ref|ZP_06910592.1| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC 25486] gi|297151687|gb|EDY64371.2| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC 25486] Length = 336 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 29/130 (22%), Positives = 48/130 (36%), Gaps = 18/130 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +VLD S SM +F G +LG ++ LD ++ +V F Sbjct: 151 TGARARVYLVLDRSGSMRPYFKDGSAQRLGEQALALSAHLDADATVD---------VVFF 201 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S++I T L +++ L G A ++ AK + D Sbjct: 202 STEIDGTGELTLDSHE--GRVDELHAGLGRMGRTNYHLAVEEVMSLH------AKKNTDA 253 Query: 284 KKYIIFLTDG 293 +IF TDG Sbjct: 254 PALVIFQTDG 263 >gi|331663568|ref|ZP_08364478.1| putative von Willebrand factor type A domain protein [Escherichia coli TA143] gi|331673599|ref|ZP_08374362.1| putative von Willebrand factor type A domain protein [Escherichia coli TA280] gi|331059367|gb|EGI31344.1| putative von Willebrand factor type A domain protein [Escherichia coli TA143] gi|331068872|gb|EGI40264.1| putative von Willebrand factor type A domain protein [Escherichia coli TA280] Length = 219 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|221113899|ref|XP_002160633.1| PREDICTED: similar to Collagen alpha-5(VI) chain [Hydra magnipapillata] Length = 9981 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 62/174 (35%), Gaps = 19/174 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D++ ++DVS+ M +D++ IR+ L + R G++TFS Sbjct: 3131 FDILYIIDVSVQML-----SVDRI---KEFIRKQLPTFNISFNE---TRIGIITFSDVAN 3179 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L+ G Q+ +N + + + P A+ K+ + K + Sbjct: 3180 TLLTLSQGTST-QQVLNAVKNIQLSLNNPQFSVAFEKLSSTLSDSFR-----SNVIKIVT 3233 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +T +SS D + + K IG+ L + A+ + + Sbjct: 3234 LIT--SSSSNGFDIERINSVTKKLKSSNLKFIVIGIGENVKKDELLSIATDESY 3285 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 52/146 (35%), Gaps = 8/146 (5%) Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 +KS + G+V + + + L V + + L T + L YA+ Sbjct: 9051 LKSYKISTDATYVGVVANGDQPIVSIKL-NAVSSYESIVGYLKNLKYTGESRKLSYAFQI 9109 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + E+ + K I+F NS +ID + K G + I + Sbjct: 9110 VRTSLFSEENGGR-ESIPKTLIVF----SNSGFSIDMNDLSDEAQALKDMGVKIVFIALG 9164 Query: 326 AEAADQFLKNCASPDRFYSVQNSRKL 351 +A + LK D F+ ++ L Sbjct: 9165 EDARSEMLKQVV--DVFFFAEDLPNL 9188 Score = 37.5 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 81/221 (36%), Gaps = 28/221 (12%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK----SIP 210 I+ + + LD++ +LD S DK+ + ++LD +K S+P Sbjct: 1612 ISVVQEAPCAFNANLDVVFLLDSS-----------DKIT--DYAWFQILDFVKNMATSLP 1658 Query: 211 DVNNVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 +R ++ S + L + +++ +N++ + + L + + Sbjct: 1659 ISKQKLRVSIINTGSHVKVEIALNESKSQEYVIHTVNKIK-----RISGQLNLMKSLLIV 1713 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 E + + K +I + +GE ++ I SL E + + + I V A Sbjct: 1714 QNELFGENQRENAG--KLVISILNGEVNNNLI--TNSLNIIQEFQNKFVEMLVITVGDNA 1769 Query: 329 ADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILY 369 + P+ F + NS L++A I K + Sbjct: 1770 NYNLKSFVSYPEYFINFNNSGNLYEATHIISKAAGNAAVTL 1810 >gi|209549949|ref|YP_002281866.1| hypothetical protein Rleg2_2365 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209535705|gb|ACI55640.1| protein of unknown function DUF1194 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 258 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 67/194 (34%), Gaps = 16/194 (8%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 + L + +++ + +++ +D S SM+ + V +E Sbjct: 1 MLTTVAVLIGLSGLVPAAQAGTNEVDVSLVLAVDTSRSMDFEEIGIQREGYVEALKHKEF 60 Query: 203 LDIIKSIPDVNNVVRSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 LD +K +++ + +VQ + W Q I+ + + + F ++ P Sbjct: 61 LDAVKGGLTGRI-----AISYFEWAGYVVQDSVIDW--QVIETEEDAIAFAGKIEARPIA 113 Query: 260 EYAYNKIFDAKEKLEH--IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 I A + I ++ I DG N+S N + ++A G Sbjct: 114 TQRRTSISTAIGQGASMIITSPFRGRREVIDVSGDGPNNSGN----PVIPARDKAVASGM 169 Query: 318 IVYAIGVQAEAADQ 331 I+ + + +D Sbjct: 170 IINGLAIMLRPSDA 183 >gi|254286662|ref|ZP_04961617.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|150423246|gb|EDN15192.1| conserved hypothetical protein [Vibrio cholerae AM-19226] Length = 525 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 VLVLSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|149636532|ref|XP_001512091.1| PREDICTED: similar to calcium-activated chloride channel-2 [Ornithorhynchus anatinus] Length = 768 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 75/212 (35%), Gaps = 39/212 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M++ D+L ++ L I ++ G+V+F+S+ Sbjct: 315 VCLVLDVSERMSEA-----DRLHRLRQAAELYLLQIAE-----SLSYVGIVSFNSEGRVR 364 Query: 231 FPLAW-GVQHIQEKINRLIFGSTTK-----STPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L+ ++ ++ + + T GL+ + I Sbjct: 365 AQLSQITHDGVRRQLASHLPTTVTADKVASVCAGLKTGFEVIKKLNGNTHGS-------- 416 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RF 342 +I +T GE+ L G+ V+ I + + + + +F Sbjct: 417 -VVILVTTGEDQGEVSCVPSLL-------DSGSTVHLITLGSSGSADLEETATLTGGLKF 468 Query: 343 YSVQ--NSRKLHDAFLRIGK---EMVKQRILY 369 ++ +S L DAF RI ++ KQ + Sbjct: 469 FASDRADSNSLMDAFCRISSGSGDVSKQSLQL 500 >gi|170742540|ref|YP_001771195.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] gi|168196814|gb|ACA18761.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] Length = 303 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 10/48 (20%), Positives = 25/48 (52%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLY 58 + +G+I I+ A ++P + + +G I+ + K+ LD +++ Sbjct: 18 SDARGTIGIMFAGMMPAVLLAIGCGIDLQRALAYRGKVQAALDGAVMA 65 >gi|282892469|ref|ZP_06300803.1| hypothetical protein pah_c260o014 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281497751|gb|EFB40115.1| hypothetical protein pah_c260o014 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 364 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 35/229 (15%), Positives = 73/229 (31%), Gaps = 41/229 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 M +LD S SM + + TRS +I+ + + + L F+ + + Sbjct: 101 QMQFILDTSASMQ------VKDIRNQTRSEYGK-EIVDELARQLDGKSASLWGFAGQATR 153 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P ++ I L + L A I + I++ D K + Sbjct: 154 LSPATMDALFLRLMIRDLQINEGNVTGTSLINAVKAI------QKEISELPQDRKLVAVL 207 Query: 290 LTDGENSSPNIDNKES---LFYCNEAKRRG---AIVYAIGVQAEAADQ------------ 331 L+DGE++ +++ ++ K + +Y IG+ + + Sbjct: 208 LSDGEDTENISAEEKAKNLRVLLDDLKTKFNDRLTIYTIGIGSREGGEIPDVLEQGQRIH 267 Query: 332 -------FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI--LYNK 371 + S + + A I +M K+ + +K Sbjct: 268 SKRDDTWLKQIGESSGEYIIADQESSIEIA-QDILSKMKKKNLEEWISK 315 >gi|125554127|gb|EAY99732.1| hypothetical protein OsI_21717 [Oryza sativa Indica Group] Length = 391 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 49/134 (36%), Gaps = 21/134 (15%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISS---KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +F P ++ L + VK + ++ LD++MVLD+ M + +++L Sbjct: 254 VFTEVPAISSQRREKLAVMVRVKAPAYTKQTRAPLDLVMVLDIGGRMRE-----LEQLKQ 308 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFG 250 + I I ++ R +VTF + + L + + + L Sbjct: 309 GAKFI---------IHNLTQQDRLSIVTFGPRADRLSELTPMTEQDKRSSNDAVQALEAS 359 Query: 251 STTKSTPGLEYAYN 264 K GL AY Sbjct: 360 GGVKIGAGLNVAYQ 373 >gi|77359907|ref|YP_339482.1| hypothetical protein PSHAa0961 [Pseudoalteromonas haloplanktis TAC125] gi|76874818|emb|CAI86039.1| conserved protein of unknown function; putative TPR domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 625 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 58/200 (29%), Gaps = 33/200 (16%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 A + I + I+ D K + S P V + Sbjct: 31 ASNEQLIAPHLAQFIMSDASTKTNQPLW-------LLALFCSLGIIFSAGPSFEEKQVPV 83 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + +V+D+S SM + + + LD+I + + + LV Sbjct: 84 FQSKNARV---IVMDMSYSMYSTDILPNRLMQ----ARFKALDMIDLFKEGD----TALV 132 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ PL ++ I L + + GL+ A + A Sbjct: 133 AYAGSAYTISPLTNDATTLENLIPSLSPEIMPDKGSNVLAGLDIAKELLGQAGYLDGD-- 190 Query: 278 KGHDDYKKYIIFLTDGENSS 297 II +TDG + Sbjct: 191 ---------IILITDGIDQQ 201 >gi|55296638|dbj|BAD69340.1| hypothetical protein [Oryza sativa Japonica Group] gi|55297431|dbj|BAD69282.1| hypothetical protein [Oryza sativa Japonica Group] Length = 393 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 49/134 (36%), Gaps = 21/134 (15%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISS---KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 +F P ++ L + VK + ++ LD++MVLD+ M + +++L Sbjct: 254 VFTEVPAISSQRREKLAVMVRVKAPAYTKQTRAPLDLVMVLDIGGRMRE-----LEQLKQ 308 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFG 250 + I I ++ R +VTF + + L + + + L Sbjct: 309 GAKFI---------IHNLTQQDRLSIVTFGPRADRLSELTPMTEQDKRSSNDAVQALEAS 359 Query: 251 STTKSTPGLEYAYN 264 K GL AY Sbjct: 360 GGVKIGAGLNVAYQ 373 >gi|329946213|ref|ZP_08293826.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 170 str. F0386] gi|328527811|gb|EGF54802.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 170 str. F0386] Length = 370 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 41/196 (20%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V S + D+++ LDVS SM + + + + DI++ R Sbjct: 106 VTERSDALANRDIVLCLDVSTSM----------VKIDSSVLTTFADILEDFDGE----RV 151 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKIN--------------------RLIFGSTTKSTP- 257 GLV ++S PL ++++++ L S T++T Sbjct: 152 GLVAWNSAAQTIVPLTDDYDLLRQQMDDLGDVLDIDPQNPTYKQQLRYLEAFSGTQNTSL 211 Query: 258 -GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL-FYCNEAKRR 315 G A + + + ++ + + II TD + P + +L + R Sbjct: 212 DGSSLAGDGLASCAQAFDNQ---GLERSRSIILATDNQVIDPKKEQIYTLPDAVDLLAER 268 Query: 316 GAIVYAIGVQAEAADQ 331 +++I A+ Sbjct: 269 KIRLFSI-YGADDDQS 283 >gi|326336011|ref|ZP_08202187.1| von Willebrand factor [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691808|gb|EGD33771.1| von Willebrand factor [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 288 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 41/105 (39%), Gaps = 6/105 (5%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S + FG L I + + + N ++GL+ F Sbjct: 72 EEERELTLMLLVDVSGS--EFFGS----LQQFKNEIITEIAATLAFAALQNNDKTGLILF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S +I P G HI I LI + +A + Sbjct: 126 SDQIELYIPPKKGKSHILRIIRELIEFQPQSKRTNIAHALEFLNK 170 >gi|309356755|emb|CAP36239.2| hypothetical protein CBG_18899 [Caenorhabditis briggsae AF16] Length = 643 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 70/198 (35%), Gaps = 23/198 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ +D+M ++D S S G++ + I E+L + P + R +V Sbjct: 427 PARKLPPIDLMFLVDTSSS------IGINNFDIQKNFICEILKDVDVAPGRS---RISMV 477 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYA------YNKIFDAKEK 272 ++ F + + ++ + RL G T + L +A + ++ Sbjct: 478 QYAQDPSVVFGFDQYYSYESVRRGVMRLSYTGGATMLSKALAFAGGIMYHEQNLKKTTKR 537 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + D + + ++DG + D+ N R ++A+ ++ D+ Sbjct: 538 HQFLPTPKHDRLQVLCLVSDGYS-----DDSADKESVNLHDRLHVKIFAVVTRSFNKDKL 592 Query: 333 LKNCASPDRFYSVQNSRK 350 + ++V Sbjct: 593 VPITRFDGSVFTVHQRES 610 >gi|150007593|ref|YP_001302336.1| hypothetical protein BDI_0946 [Parabacteroides distasonis ATCC 8503] gi|255013878|ref|ZP_05286004.1| hypothetical protein B2_08217 [Bacteroides sp. 2_1_7] gi|256839780|ref|ZP_05545289.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|298375539|ref|ZP_06985496.1| von Willebrand factor, type A [Bacteroides sp. 3_1_19] gi|301310441|ref|ZP_07216380.1| von Willebrand factor, type A [Bacteroides sp. 20_3] gi|149936017|gb|ABR42714.1| conserved hypothetical protein [Parabacteroides distasonis ATCC 8503] gi|256738710|gb|EEU52035.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|298268039|gb|EFI09695.1| von Willebrand factor, type A [Bacteroides sp. 3_1_19] gi|300832015|gb|EFK62646.1| von Willebrand factor, type A [Bacteroides sp. 20_3] Length = 289 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 47/109 (43%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S + + V + + + + + N + G+V F Sbjct: 72 EEERELTVMLLIDVSGSRDF------GSVNVMKKEVITEIAATLAFSAIQNNDKIGVVFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S KI + P G +H I+E I+ + T + L+Y N I Sbjct: 126 SDKIEKFIPPQKGKKHILYIIRELIDFHPEETRTDISQVLKYLTNAIKK 174 >gi|74182490|dbj|BAE42868.1| unnamed protein product [Mus musculus] Length = 219 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 41/130 (31%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L A ++ + +++ + R LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFM-KLRARDPASRGDRYMLVTFEEPPYAI 62 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L + N++ + Sbjct: 63 KG-GWKENHATFMNELKNLQAEGLTTLGQSLRTGFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PAIIITITDG 131 >gi|268558482|ref|XP_002637231.1| Hypothetical protein CBG18899 [Caenorhabditis briggsae] Length = 615 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 70/198 (35%), Gaps = 23/198 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ +D+M ++D S S G++ + I E+L + P + R +V Sbjct: 403 PARKLPPIDLMFLVDTSSS------IGINNFDIQKNFICEILKDVDVAPGRS---RISMV 453 Query: 222 TFSSKIVQTFPLA--WGVQHIQEKINRLI-FGSTTKSTPGLEYA------YNKIFDAKEK 272 ++ F + + ++ + RL G T + L +A + ++ Sbjct: 454 QYAQDPSVVFGFDQYYSYESVRRGVMRLSYTGGATMLSKALAFAGGIMYHEQNLKKTTKR 513 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + D + + ++DG + D+ N R ++A+ ++ D+ Sbjct: 514 HQFLPTPKHDRLQVLCLVSDGYS-----DDSADKESVNLHDRLHVKIFAVVTRSFNKDKL 568 Query: 333 LKNCASPDRFYSVQNSRK 350 + ++V Sbjct: 569 VPITRFDGSVFTVHQRES 586 >gi|313221050|emb|CBY31881.1| unnamed protein product [Oikopleura dioica] Length = 1282 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 71/197 (36%), Gaps = 22/197 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 S +DM+ +LD S S+ G V +R ++ + P + + Sbjct: 477 NSTKEVDMIFLLDSSGSV------GKPNFQVMKSWMRRLISGLNIAPGRTQ---VSVYLY 527 Query: 224 SSKIVQTFPLA--WGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 ++ F L + IN++++ T+ L+ A +K+ E + + Sbjct: 528 NNIFRTIFNLNEHQNAYDMITAINKMVYSGKGTRIARALQSAMSKVLIP----ESGLRPN 583 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP- 339 + Y+ LTDG+ S D +A + A+G+ + L AS Sbjct: 584 SEI--YLYLLTDGKESD-VADVNNMANDIKDAFSDRITLTAVGISRSVENAELYAIASAP 640 Query: 340 --DRFYSVQNSRKLHDA 354 D + ++N R L Sbjct: 641 KKDNVFLLENYRDLDTI 657 >gi|212702323|ref|ZP_03310451.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] gi|212674201|gb|EEB34684.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] Length = 1151 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 53/150 (35%), Gaps = 10/150 (6%) Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++ + G + + +I+ + T + G E A + G + Sbjct: 643 SWEDVRTFSTESLTGYEAVLAQIDDMEAIGGTVYSDGYEAAKDWFGGKTSPDSLQNNGGE 702 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--QFLKNCASP 339 + +IF+TDGE ++ K + A V +G+ D L + Sbjct: 703 N---IVIFVTDGEPNN-EWSAKNAYNQLVAAVDN-ITVETVGIAITDKDATDLLNGLTTN 757 Query: 340 DR--FYSVQNSRKLHDAFLRIGKEMVKQRI 367 + + ++++ KL D F I ++ + Sbjct: 758 NNGAHF-IEDASKLGDVFGEIVSDITTSTV 786 Score = 42.5 bits (98), Expect = 0.095, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 33/77 (42%), Gaps = 6/77 (7%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + + + + L++ +V+D S SM+ ++ +++++ D +K D Sbjct: 500 GGATESTQTTYTDLNVALVVDTSGSMDG------TRMSETKEALKDLCDQLKEHADEGAD 553 Query: 216 VRSGLVTFSSKIVQTFP 232 V L+ FS + P Sbjct: 554 VNLSLIGFSGALNINLP 570 >gi|148692918|gb|EDL24865.1| RIKEN cDNA 1700112N15 [Mus musculus] Length = 492 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 23/82 (28%), Positives = 35/82 (42%), Gaps = 2/82 (2%) Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + A E++ G II LTDG +++ +A+R GAIVY +GV Sbjct: 5 LRKANEQIRKSTLGGRIVNSVIIALTDGLLLLKPY--LDTMEEAKKARRMGAIVYTVGVF 62 Query: 326 AEAADQFLKNCASPDRFYSVQN 347 + Q + PDR + V Sbjct: 63 MYSKQQLVNIAGDPDRCFGVDE 84 >gi|15678583|ref|NP_275698.1| magnesium chelatase subunit [Methanothermobacter thermautotrophicus str. Delta H] gi|2621631|gb|AAB85061.1| magnesium chelatase subunit [Methanothermobacter thermautotrophicus str. Delta H] Length = 182 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 25/191 (13%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQTFPL 233 +D+S SM K I ++ + + R +V F + P Sbjct: 2 VDISGSMFSDR-----KAARVKGLIERFIEDAQ-----RHRDRISVVGFRGRDARVIIPS 51 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 ++ + + G TT G++ + K H +Y +++ L+DG Sbjct: 52 TAHASSFRDAVESIRVGGTTPMAQGIQRGLEIL--------REEKRHGEYVPFMVILSDG 103 Query: 294 ENSSPN--IDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLK---NCASPDRFYSVQN 347 + +E++ + + I + + L AS +Y + + Sbjct: 104 MPNVGTGRDPKREAVEAASRLREEEIPSTVINFERGSRGGRDLNMEIALASGGSYYDLHD 163 Query: 348 SRKLHDAFLRI 358 R A +I Sbjct: 164 LRDPSGAVAKI 174 >gi|58429455|gb|AAW78131.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 557 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 70/224 (31%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDVYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + K RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLKDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|5305316|gb|AAD41583.1|AF057703_1 structural toxin protein RtxA [Legionella pneumophila 130b] Length = 1208 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 48/256 (18%), Positives = 95/256 (37%), Gaps = 14/256 (5%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNI---WQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 ++ + N FSY IK + L ++ + ++IE + + S+ Sbjct: 50 VSNSSLNGETFDIGLFSYNTIKTTPSEININMGLSLTDSDGDKITSSIEINLAPSVFKVG 109 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 ++ D S+ + + + + + + + I VLD S S Sbjct: 110 ENVDDTSSSNVLHRVGGDTGVVDGSGGADILVGDVGGVEIVGTTARIAF----VLDESGS 165 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 M +FG G +L V +++ ++L + + P+ + V LV F+S + T + Sbjct: 166 MGQNFG-GTTRLEVLKQTMTDILTELSNTPNASITVH--LVKFASVVNGTGTFEITGGEL 222 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 Q+ ++ I G + Y + + G D ++ + F TDG + Sbjct: 223 QQALD-FISGLQIQQGLLAGTNYEAALGQTLQWYNSQSGTADVQQTL-FFTDGAPTFYMD 280 Query: 301 DNKESLFYCNEAKRRG 316 N S Y N A+ G Sbjct: 281 GN--STEYTNIARVYG 294 >gi|302801818|ref|XP_002982665.1| hypothetical protein SELMODRAFT_421953 [Selaginella moellendorffii] gi|300149764|gb|EFJ16418.1| hypothetical protein SELMODRAFT_421953 [Selaginella moellendorffii] Length = 2190 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 27/206 (13%) Query: 171 MMMVLDVSLSMND-HFGPGMDKL--GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 ++ V+D S SM P M K M+ I++ N +V F Sbjct: 1990 VIFVVDRSGSMGSPDIKPKMVKFPENRLGCVFEAMVRFIRTRIAANLQDVMSVVLFDDHG 2049 Query: 228 VQTFPLAWGVQHIQE-KINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +H+ E ++++L+ T + G+ + + K Sbjct: 2050 QIAME----REHMSEPQVDKLLTFEDAGGTVYSSGIARVEEILVRSVSDPAVAGKSPA-- 2103 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLK-----NC 336 ++FL+DG+N + + N+ K+ + I + I + + K Sbjct: 2104 ---VVFLSDGDNYGG----LDPVHCVNQLKKLEQSLIFHTIMFATDPTNSAKKLLTDMAA 2156 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEM 362 A F + +L +F + K + Sbjct: 2157 AGDGMFQVSIDEIQLSRSFEDLAKSL 2182 >gi|260425523|ref|ZP_05779503.1| von Willebrand factor, type A [Citreicella sp. SE45] gi|260423463|gb|EEX16713.1| von Willebrand factor, type A [Citreicella sp. SE45] Length = 737 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 70/178 (39%), Gaps = 17/178 (9%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + +S ++ + + ++LD S S D G G L + + + + + D Sbjct: 539 EGTSAANRSISVHLLLDSSRSTGDTTGSG-SVLELERDAAGILALAMDRLGDP-----LA 592 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK-IFDAKEKLEHIAK 278 + FSS+ + + +++ +RL + T GL A++ I A + Sbjct: 593 ISAFSSRGREDMRIT----EVKKFDDRLGMATGMGLT-GLRPAWSTRIGAAIRYGGRSLE 647 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNK-----ESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 +++ ++ LTDGE S ++ ++ ++ +G + I + A + Sbjct: 648 QMASHRRLVLLLTDGEPSDIDVPDRAYLVADARRAVQMLSAKGIDCFCIALGDTAGGR 705 >gi|229514669|ref|ZP_04404130.1| TPR domain protein in aerotolerance operon [Vibrio cholerae TMA 21] gi|229348649|gb|EEO13607.1| TPR domain protein in aerotolerance operon [Vibrio cholerae TMA 21] Length = 656 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 W + S + S + + +++D+S SM T++ Sbjct: 56 VLVLSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLTP----NRLTQA 110 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTK 254 + LD++K + + +GLV +++ PL + + L + + Sbjct: 111 RYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGSD 166 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 167 AAAAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|157374762|ref|YP_001473362.1| TPR repeat-containing protein [Shewanella sediminis HAW-EB3] gi|157317136|gb|ABV36234.1| tetratricopeptide TPR_2 repeat protein [Shewanella sediminis HAW-EB3] Length = 644 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 36/283 (12%), Positives = 85/283 (30%), Gaps = 51/283 (18%) Query: 111 STSLSIIIDDQHKDYNLSAVSRYEMP-FIFCTFPWCAN--SSHAPLLITSSVKISSKSDI 167 +++ + I ++ + P + W + P S+ + + Sbjct: 30 NSTWNRYISPHLAALLVTKTKNVKRPSLSYLAVSWLIAVFALSGPAFTQQSLPVFEAAQG 89 Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + +V+D+SLSM ++L A +++ + +GLV ++ Sbjct: 90 RV---IVMDMSLSMYATDQAP-NRLSQAKFKATDLIGELTEGE-------TGLVAYAGDA 138 Query: 228 VQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PL + + L + + LE + N + Sbjct: 139 YTISPLTRDRSTLLNLLPTLSPDIMPSRGSNLVAALEQSKNLLAQGGHIRGD-------- 190 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------DQFLKNC 336 I+ L+DG + K+ L K + + +E Q L++ Sbjct: 191 ---ILLLSDGIPPRQLNEAKKVL------KGTQYRLGILAFGSEQGSPIRLPDGQLLRDN 241 Query: 337 A-----SPDRFYS----VQNSRKLHDAFLRIGKEMVKQRILYN 370 A + + Q + + F G+++ + + Sbjct: 242 ANQVVVAKTNYLQLNELAQEADGILIPFRTDGQDLEQLLTWLS 284 >gi|187608071|ref|NP_001119900.1| integrator complex subunit 6 [Danio rerio] gi|169158663|emb|CAQ14208.1| novel protein similar to vertebrate integrator complex subunit 6 (INTS6) [Danio rerio] Length = 854 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 49/143 (34%), Gaps = 9/143 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L ++S + R LV+F + Sbjct: 4 LLFLIDTSASMNQRSHLGTSYLDIAKGAVETFL-KLRSRDPASRGDRYMLVSF-EEAPAG 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKDSHATFMTELRNLQAVGLTSFGQALRTAFDLLNLNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDGENSSPNIDNKESL 306 I+ +TDG + + ++ L Sbjct: 122 PAIIVAITDGSKLTGSSGVQDEL 144 >gi|58429517|gb|AAW78162.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + S+ + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYSEEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYASVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 GSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|300902281|ref|ZP_07120278.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|301304488|ref|ZP_07210599.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|307312362|ref|ZP_07591996.1| von Willebrand factor type A [Escherichia coli W] gi|331683757|ref|ZP_08384353.1| putative von Willebrand factor type A domain protein [Escherichia coli H299] gi|300405643|gb|EFJ89181.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|300840214|gb|EFK67974.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|306907533|gb|EFN38036.1| von Willebrand factor type A [Escherichia coli W] gi|315061348|gb|ADT75675.1| conserved protein [Escherichia coli W] gi|315255415|gb|EFU35383.1| von Willebrand factor type A domain protein [Escherichia coli MS 85-1] gi|323378072|gb|ADX50340.1| von Willebrand factor type A [Escherichia coli KO11] gi|331078709|gb|EGI49911.1| putative von Willebrand factor type A domain protein [Escherichia coli H299] Length = 219 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 64/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++I VQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIAVQGADMKTLAQ 171 >gi|260785838|ref|XP_002587967.1| hypothetical protein BRAFLDRAFT_87362 [Branchiostoma floridae] gi|229273122|gb|EEN43978.1| hypothetical protein BRAFLDRAFT_87362 [Branchiostoma floridae] Length = 192 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 75/194 (38%), Gaps = 26/194 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+++ LD+S S D+ +A + +D + + + +R ++ ++ + Sbjct: 7 AIDIVLALDLSSS------IPQDQFELARDFMVAFVD-CEVFQEKD--IRIAVLNYTCEA 57 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F LA + +I +L+ G + G + ++ H A Sbjct: 58 DTYFDLAPIAYGMSYEIGQLMRGDGGITRTGHAINHMRLTSKFGAESHHAA--------- 108 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + LTDG+ + D++++ +A+ G +YA+ L+ + V Sbjct: 109 VILTDGQ----SEDDQQTAAA--DARAAGIGLYAVEFGKYVNMYALEAMTTSGS--RVFT 160 Query: 348 SRKLHDAFLRIGKE 361 + + DA +I + Sbjct: 161 TSQACDAAQKIVDD 174 >gi|260823627|ref|XP_002606182.1| hypothetical protein BRAFLDRAFT_126499 [Branchiostoma floridae] gi|229291521|gb|EEN62192.1| hypothetical protein BRAFLDRAFT_126499 [Branchiostoma floridae] Length = 951 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 76/205 (37%), Gaps = 27/205 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ D S SM+ + + + +L +I + VN V +V ++ K Sbjct: 82 VIVAADKSGSMSGNPWRQVQ---------QALLYMIGDVASVNPSVALDVVIYNDKASL- 131 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L + + Q+ +NR+ T A++ I D + G K ++F+ Sbjct: 132 --LQYAGSY-QDAVNRVNADGMT----SFAAAFSCIKDCLKTEIQ---GTPVSKTVVVFM 181 Query: 291 TDGENSS--PNIDNKESLFYCNEAKRRG--AIVYAIGVQAEAADQF---LKNCASPDRFY 343 TDG ++ ++ + R G AIV+ +G A+ F L+N + + Sbjct: 182 TDGADTCNRGADIDRSVRSWKEALARLGHEAIVHVVGFSAQHDYNFLGRLRNTGTTAGLF 241 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRIL 368 +A +E+ L Sbjct: 242 RYTEPSDGTEALKAKLQELFDFVAL 266 >gi|189461335|ref|ZP_03010120.1| hypothetical protein BACCOP_01985 [Bacteroides coprocola DSM 17136] gi|189431864|gb|EDV00849.1| hypothetical protein BACCOP_01985 [Bacteroides coprocola DSM 17136] Length = 289 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 47/109 (43%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + +++ + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSNSLDF------GTVKQLKKNMVAEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S +I + P G +H I+E ++ + T +EY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFKPESTRTNIQCAIEYLTNVLKK 174 >gi|119962625|ref|YP_948082.1| hypothetical protein AAur_2345 [Arthrobacter aurescens TC1] gi|119949484|gb|ABM08395.1| conserved hypothetical protein [Arthrobacter aurescens TC1] Length = 338 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 60/203 (29%), Gaps = 25/203 (12%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD-KLGVATRSIREMLDIIKSI 209 P + + LD+ V+D S SM G +L + + + + Sbjct: 63 RPGWAGGQAET---ATADLDVFFVVDTSTSMGAEDYNGTGPRLSGVKQDVMAIAKELAGA 119 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + L+TF SK PL +Q + L ++ + K+ Sbjct: 120 -------KFSLITFDSKASVRMPLTRDATALQTGMTTLQPQNSRYAKGSSVTGAAKLLKE 172 Query: 270 KEKLEHIAKGHDDYKKYIIFLT-DGENSSPNI-DNKESLFYCNEAKRRGAIVYAIGVQAE 327 + + ++F DGEN+S ++ A +G Sbjct: 173 RLAAANQQHPGRPA---LVFYAGDGENTSAEAPAPMDTGNVAGGA--------VLGYGTG 221 Query: 328 AADQFLKNCASPDRFYSVQNSRK 350 + K A PD Y + Sbjct: 222 EGGRM-KESADPDAGYVKDKTTD 243 >gi|332520546|ref|ZP_08397008.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332043899|gb|EGI80094.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 698 Score = 43.6 bits (101), Expect = 0.042, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 81/220 (36%), Gaps = 26/220 (11%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 PW N+ + I K ++ ++ ++DVS SM+ +L + + + Sbjct: 317 TPW--NNQTKLVKIGLQGKTYENKELPAANLTFLIDVSGSMSH-------ELKLLKSAFK 367 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 ++D ++ V+ VV +G +V + I + +N+L G +T G+ Sbjct: 368 LLVDQLRDKDKVSIVVYAGAAG----VVLEPTSGKDKKKILKALNKLQSGGSTAGGAGIN 423 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 AY E+ K ++ +I TDG+ + N+ E ++ G + Sbjct: 424 LAYKL------AEENFNKNGNNR---VILATDGDFNVGASSNQAMEDLIIEKRKSGVFLS 474 Query: 321 AIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 +G D L+ A + ++ F + Sbjct: 475 VLGFGYGNYKDDKLETLADKGNGNHAYIDTMQEAKLIFGK 514 >gi|212693199|ref|ZP_03301327.1| hypothetical protein BACDOR_02709 [Bacteroides dorei DSM 17855] gi|237709941|ref|ZP_04540422.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725392|ref|ZP_04555873.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753593|ref|ZP_06088948.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664304|gb|EEB24876.1| hypothetical protein BACDOR_02709 [Bacteroides dorei DSM 17855] gi|229436079|gb|EEO46156.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456034|gb|EEO61755.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235307|gb|EEZ20831.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 289 Score = 43.6 bits (101), Expect = 0.042, Method: Composition-based stats. Identities = 19/110 (17%), Positives = 45/110 (40%), Gaps = 6/110 (5%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSNSLDF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 S +I + P G +HI I L+ ++ A + + +K Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFKPESKRTDIKTAVEYLTNVIKKR 175 >gi|260790216|ref|XP_002590139.1| hypothetical protein BRAFLDRAFT_90872 [Branchiostoma floridae] gi|229275328|gb|EEN46150.1| hypothetical protein BRAFLDRAFT_90872 [Branchiostoma floridae] Length = 1143 Score = 43.6 bits (101), Expect = 0.042, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 54/176 (30%), Gaps = 28/176 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM H +L D + + N++R + ++ Sbjct: 934 VAILVDTSGSMGPHLPELKKELASLV------WDQLCLNTEKFNLIRFSSDVDTWQVQLV 987 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + T + LE A+ + L Sbjct: 988 EPTDEFCHDAVQWQATFVAEGNTNTLGALEEAFQDSAVDG----------------VYLL 1031 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCAS--PDRF 342 TDG+ + + NE RG V I + A+ FLK AS R+ Sbjct: 1032 TDGKPDQSASMVLKEVAKMNE--GRGVHVNTISFNCDDSTANSFLKQLASETGGRY 1085 >gi|194223901|ref|XP_001494668.2| PREDICTED: similar to integrin alpha 2 subunit [Equus caballus] Length = 1194 Score = 43.6 bits (101), Expect = 0.042, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 73/212 (34%), Gaps = 37/212 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 186 IDVVVVCDESNS--------IYPWEAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 234 Query: 229 QTFPL-AWGVQH--IQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L + + ++ FG T + ++YA + + A A G Sbjct: 235 VVFNLNTFKTKAEMVEATSQTFQFGGDLTNTFKAIQYARDSAYSAA------AGGRRGAT 288 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQF---LKN 335 K ++ +TDGE + D ++ + + I V A +K Sbjct: 289 KVMVVVTDGE----SHDGSMLKAVIDQCNSDNILRFGIAVLGYLNRNALDTKNLIKEIKA 344 Query: 336 CAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 AS F++V + L + +G+++ Sbjct: 345 IASIPTERYFFNVSDEVALLEKAGTLGEQIFS 376 >gi|21673259|ref|NP_661324.1| magnesium-chelatase subunit D/I family protein [Chlorobium tepidum TLS] gi|21646346|gb|AAM71666.1| magnesium-chelatase, subunit D/I family [Chlorobium tepidum TLS] Length = 649 Score = 43.6 bits (101), Expect = 0.042, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 48/137 (35%), Gaps = 18/137 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + +G ++ V+D S SM ++ + ++ +L + + +V+ Sbjct: 458 REKRLGNLLIFVVDASGSMG-----ARGRMAASKGAVMSLL-----LDAYQKRDKLAMVS 507 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F ++ P+ ++ + + G T + GL Y + G Sbjct: 508 FRKNEAFVNLPVTSSIELAARMLKEMPVGGRTPFSAGLLKGYEI----AQNYLRKEPGGR 563 Query: 282 DYKKYIIFLTDGENSSP 298 II +TDG+ + Sbjct: 564 PL---IILVTDGKANRA 577 >gi|13475442|ref|NP_107006.1| hypothetical protein mlr6511 [Mesorhizobium loti MAFF303099] gi|14026194|dbj|BAB52792.1| mlr6511 [Mesorhizobium loti MAFF303099] Length = 537 Score = 43.6 bits (101), Expect = 0.042, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 69/199 (34%), Gaps = 40/199 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---- 226 ++++LD S SM G KL +A S+R +L + + + G + + + Sbjct: 11 VIIILDASGSM-WAQIDGKPKLEIARESLRTVLQSVPADDE------IGFMAYGHREKGS 63 Query: 227 ---IVQTFPLAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 I P G I + + L F T T ++ A + + + Sbjct: 64 CDDIQLIVPPQPGSASAITDAADSLKFLGKTPLTAAVKQAAEALK------------YTE 111 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA----IVYAIGVQAEAADQFLKNCA- 337 K ++ +TDG + + E K G V G+ A+ Q C Sbjct: 112 DKATVVLITDGLETCGG----DPCALGKELKASGVDFTADVVGFGLTADEGKQI--ACLA 165 Query: 338 --SPDRFYSVQNSRKLHDA 354 + ++ + + L +A Sbjct: 166 ENTGGKYIQASDEKALQEA 184 >gi|294788348|ref|ZP_06753591.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] gi|294483779|gb|EFG31463.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] Length = 212 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 67/196 (34%), Gaps = 16/196 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S SM+ + ++ ++ ++ P ++TF S+ Sbjct: 4 RLPVYLLVDTSGSMHGE------AIEAVRNGLQVLVSALRQDPYALETAYLSVITFDSQA 57 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + + +I + T L + I +E ++ A+ D+K + Sbjct: 58 KQVTPLT---ELMNFQIPNIEANGATAMGGALTLLADCI--NREVVKGSAEVKGDWKPVV 112 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 L+DG + + +A + G V A A LK + Sbjct: 113 FLLSDGSPTDSISKGIADI----KAVKTGIFV-ACAAGPHADTSTLKQITETVVSLDTAD 167 Query: 348 SRKLHDAFLRIGKEMV 363 + + F + + Sbjct: 168 ANSIKAYFKWVSASIS 183 >gi|289192722|ref|YP_003458663.1| Magnesium chelatase [Methanocaldococcus sp. FS406-22] gi|288939172|gb|ADC69927.1| Magnesium chelatase [Methanocaldococcus sp. FS406-22] Length = 317 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 57/168 (33%), Gaps = 20/168 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 V+ + I ++ V+DVS SM M ++ A +I +L + + N Sbjct: 121 IVEKVRQRKISSHILFVVDVSGSMG-----AMRRMEAAKGAIISLL--LDAYQKRNK--- 170 Query: 218 SGLVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 G++ F + P V+ ++ + L G T +Y E Sbjct: 171 IGMIAFRKDRAELILPFTSSVELGEKLLKDLPTGGKTPLADAFIKSYEVF-----DRELR 225 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDN--KESLFYCNEAKRRGAIVYAI 322 + +I ++D + + + KE C + + V I Sbjct: 226 KNPNIIP--IMIVISDFKPNVAVKGDYVKEVFDACEKIAEKCINVILI 271 >gi|197322496|ref|YP_002154769.1| putative von Willebrand factor [Feldmannia species virus] gi|197130563|gb|ACH46899.1| putative von Willebrand factor [Feldmannia species virus] Length = 253 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 53/158 (33%), Gaps = 9/158 (5%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + K+ K + ++++D S SM G + A + ++ + + Sbjct: 41 SAAETKMLEKEALVRQFVLLIDRSGSMGWPDGDDKTRWERAKEVTKALVPSLFKYDVDKS 100 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 L F S++ I+ G+TT + LE A +K Sbjct: 101 ---IPLFLFDSEVSFVGECT-NASQIETVFTEYQPGTTTNLSGALEQAMEMYLGSKRVNY 156 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + G +I+ L DG +P+ + Y + A Sbjct: 157 EVVPGTT----FIVLL-DGGADNPDEVVQVLQKYADPA 189 >gi|149925200|ref|ZP_01913495.1| hypothetical protein PPSIR1_06648 [Plesiocystis pacifica SIR-1] gi|149813928|gb|EDM73579.1| hypothetical protein PPSIR1_06648 [Plesiocystis pacifica SIR-1] Length = 353 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 69/230 (30%), Gaps = 47/230 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D S SM+D+ + I E+ + ++ GL F Sbjct: 66 VVLVVDRSGSMSDNPLGDETRWEALHGVISEV------VSGQEASLQLGLTMF---PAAD 116 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY---NKIFDAKE--------KLEHIAKG 279 W Q +L + + A N I +H+ Sbjct: 117 AGTTWE-QGACLAPTQLDVAVGADTGAAILGALPGPNAITQGGTPAAAAVQLAADHLRAR 175 Query: 280 HDDYKKYIIFLTDGENSSP--NIDNKESLFY-------CNEAKRRGAIVYAIGVQAEAA- 329 K ++ +TDG + + D + SL Y A G + +G+Q + Sbjct: 176 DTQDPKLLVLVTDGAANCAADSADWQASLVYDEALQDAVANASMDGITTHVVGIQIDTEL 235 Query: 330 --------DQFLKNCASPD--------RFYSVQNSRKLHDAFLRIGKEMV 363 + L A FY V++ L A I ++ Sbjct: 236 DAQAGVVPAEQLHEVAQLGGAGLDGEYAFYQVEDQAMLSAALSSITADIS 285 >gi|15779150|gb|AAH14640.1| COL14A1 protein [Homo sapiens] Length = 759 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 60/154 (38%), Gaps = 21/154 (13%) Query: 214 NVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 + + +V F+ F L A+ + + + I + + G TK+ ++Y + +F A Sbjct: 31 DGTQVAMVQFTDDPRTEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA- 89 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 E K I+ +TDG + + E + G ++AIGV Sbjct: 90 ESGTRRGIP-----KVIVVITDGRSQD------DVNKISREMQLDGYSIFAIGVADADYS 138 Query: 331 QFLKNCASPD--RFYSVQNSRKLHDAFLRIGKEM 362 + + + P + V + DAF +I E+ Sbjct: 139 ELVSIGSKPSARHVFFVDD----FDAFKKIEDEL 168 >gi|171911941|ref|ZP_02927411.1| Vault protein inter-alpha-trypsin domain protein [Verrucomicrobium spinosum DSM 4136] Length = 679 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L + K + D + VLDVS SMN G ++ + + L+ + + Sbjct: 300 LNVQPPAKWEAGQTPPRDYLFVLDVSGSMN---GFPIETSKRLMSDLLKGLNPGDTFNIL 356 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + S ++ PLA ++I + ++R T+ P L+ A + Sbjct: 357 H-------FASDSAVLSPKPLAATPENIHLATKDLSRHRGNGGTELLPALQRA---LATP 406 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 +E + I+ LTDG + + A V+ G+ Sbjct: 407 RE---------VGVSRSIVILTDGYVTIEKEAFRLVRKELQNA-----NVFTFGIGTAVN 452 Query: 330 DQFLKNCASPDR 341 ++ A + Sbjct: 453 RWLIEGLAHAGQ 464 >gi|149177287|ref|ZP_01855892.1| hypothetical protein PM8797T_23741 [Planctomyces maris DSM 8797] gi|148843812|gb|EDL58170.1| hypothetical protein PM8797T_23741 [Planctomyces maris DSM 8797] Length = 338 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 64/227 (28%), Gaps = 28/227 (12%) Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG 186 A S + + P++ +V D G + VLD S SM G Sbjct: 125 TQAPSAPTTDLTTDQLMSPSTALAPPVMGAGNVNFFDAVDSGKRFVFVLDCSGSMAAPQG 184 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQTFPLAWGVQHIQEKIN 245 + K S L+ + + + + K + ++ I Sbjct: 185 APIRKARSELISSLAGLNHHQQFQIIFYNTTTRAMQHRGKSAELLYATDINRTLARQFIQ 244 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + T P L+ A I FLTD ++ + + Sbjct: 245 SVEPDGGTDHLPALKRAL-----------------SFNPDVIFFLTDAKHPQLSSADLND 287 Query: 306 LFYCNEAKRRGAIVYAIGVQ-------AEAADQFLKNCASPDRFYSV 345 + N K A ++ I + D+ + R+Y+V Sbjct: 288 IREQNGGK---AKIHCIEFGEGFPVKEGNSLDKLARQNKGSYRYYNV 331 >gi|134288259|ref|YP_001110422.1| von Willebrand factor, type A [Burkholderia vietnamiensis G4] gi|134132909|gb|ABO59619.1| von Willebrand factor, type A [Burkholderia vietnamiensis G4] Length = 623 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 71/210 (33%), Gaps = 34/210 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSM----NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 K + ++ M +++D S SM + P + K + D+ +++ Sbjct: 423 KKARTEELDTCMYLLVDESSSMSACFDRERAPQLHKAQANPNDPHQHFDVSRAVAAGRVA 482 Query: 216 -----------VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + G+ ++++ + + + ++ +T + + +A Sbjct: 483 VAAGEVLDGAQIPFGVASYNTAVREWQDFDGNWSNTLQRYEA-AATGSTNTHLAVVWALR 541 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K D E+ K + +TDG+ D EA+R G V I + Sbjct: 542 KFVDRNEQR-----------KVLAVVTDGDPG----DATVLEAALKEAERFGVEVRFILI 586 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDA 354 A ++ A+ + N R+L A Sbjct: 587 GASEEVRYKGLSAA---YGVATNVRELAKA 613 >gi|152974363|ref|YP_001373880.1| von Willebrand factor type A [Bacillus cereus subsp. cytotoxis NVH 391-98] gi|152023115|gb|ABS20885.1| von Willebrand factor type A [Bacillus cytotoxicus NVH 391-98] Length = 627 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 65/200 (32%), Gaps = 22/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMQETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSATPENKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRTEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKNGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN F ++ Sbjct: 591 QLMKN-IYNHHFLIANDAED 609 >gi|117928940|ref|YP_873491.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] gi|117649403|gb|ABK53505.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] Length = 177 Score = 43.6 bits (101), Expect = 0.043, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 37/128 (28%), Gaps = 4/128 (3%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 + G IS+L VI ++++ S F + L D + L A + Sbjct: 27 RDDGGQISLLIVFFGLVILGFTTVIVDLSTVFLAQRVLQATADGAALTAAQHVSLAGAYT 86 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 + S + R + + +L D + ++S Sbjct: 87 TELAEWLPLSDAEVYAAVADYVGEPGRAPQSCRSGTLSITAATL----DATDRTVSVSLS 142 Query: 131 SRYEMPFI 138 +P + Sbjct: 143 CTVSLPIV 150 >gi|123438167|ref|XP_001309871.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] gi|121891616|gb|EAX96941.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] Length = 957 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 66/182 (36%), Gaps = 18/182 (9%) Query: 176 DVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW 235 D+S SM + +KL A + I + + GL+ FSS + PL Sbjct: 462 DLSRSMENS-----NKLKNAKKVISILAKNVFEFGIGTMW---GLINFSSTVKTVLPLTA 513 Query: 236 GVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 +N G TK ++ A I E +++ K I+ +TDG Sbjct: 514 IASEFSMAVNEDSELGDDTKLFEAIKVASETITSKSEYFDNVYKR-------IVVVTDGI 566 Query: 295 NSSPNIDNKESLFYCNE-AKRRGAIVYAIGVQ-AEAADQFLKNCASPDRFYSVQNSRKLH 352 ++ + + ESL + I+ I + +++ + F+ + + L Sbjct: 567 DNDNHYKSDESLQKLTKILTDNKIILDVIFIDESDSRAAVMSQATGGLAFFFKGSEQNLM 626 Query: 353 DA 354 ++ Sbjct: 627 ES 628 >gi|111224529|ref|YP_715323.1| hypothetical protein FRAAL5146 [Frankia alni ACN14a] gi|111152061|emb|CAJ63786.1| hypothetical protein FRAAL5146 [Frankia alni ACN14a] Length = 209 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 65/187 (34%), Gaps = 22/187 (11%) Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 S SM L S+ + ++S P V + R +VTFS PL + Sbjct: 2 SASMAG------GPLEALNDSLPALQKEMQSNPTVGEIARISIVTFSDVGRTVVPLC-DL 54 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + + L+ T + I L KG Y+ + F++DGE+ + Sbjct: 55 AEV--YLPELMVEGGTNFAAAFQETRRAIEGGLRSL---PKGTPIYRPVVFFMSDGEHQA 109 Query: 298 PNIDNKESLFYCNEAKRRGA----IVYAIGVQAEAADQFLKNCASPDRFY--SVQNSRKL 351 P D +L N+ + R V A G + ++ A+ F + ++ Sbjct: 110 PG-DWTAAL---NDLRDRSWRFAPEVVAFGFGDQVNVDSIRRIATRFSFLARDADPATQV 165 Query: 352 HDAFLRI 358 + + Sbjct: 166 REIMNAL 172 >gi|291520528|emb|CBK75749.1| Gram positive anchor./von Willebrand factor type A domain [Butyrivibrio fibrisolvens 16/4] Length = 605 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 51/133 (38%), Gaps = 15/133 (11%) Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 M + G ++ + ++ M+D I DV+ + ++ F+++ ++ Sbjct: 1 MTETHGDK-TRIQLLKSAVDNMIDNIAEKEDVD--AKWEVIDFATRAAVRGGGWLNTSNV 57 Query: 241 QEKI-------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 ++ + N + G T G++ A + + + + KK ++FLTDG Sbjct: 58 KQYVTTAINEDNNVDIGRGTNYQAGMDLAQKEFEKKQPESDRPN-----AKKIVLFLTDG 112 Query: 294 ENSSPNIDNKESL 306 + + L Sbjct: 113 QPTYYGSGVGNDL 125 >gi|251795401|ref|YP_003010132.1| von Willebrand factor A [Paenibacillus sp. JDR-2] gi|247543027|gb|ACT00046.1| von Willebrand factor type A [Paenibacillus sp. JDR-2] Length = 429 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 41/293 (13%), Positives = 86/293 (29%), Gaps = 43/293 (14%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTS 113 ++ +++ ++N + + N + F ++NG N S Sbjct: 137 RAVSDARVRLIGEQNQTVWEAKTNAEGNAFV-------FAGLFKDNG----QQNQRTRYS 185 Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 + ++ D Q K VS E+P + + + S++ +D+M Sbjct: 186 VEVMADQQKKK-----VSNIEVP----------GQGALKVDLDGELPASNQ----VDVMF 226 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V+D + SM D +L + E I N R +V+ P Sbjct: 227 VMDTTGSMQDEMDYLEAELNDVITRVGEKHANQLDIRMSTNFYRD---IHDDYVVKANPF 283 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + I +E A + E+ + + + D Sbjct: 284 TTHIDQAVKLIAMQKAQGGGDYPEAVEQAMRNAVSDHKWSENARA------RLLFLVLDA 337 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--FLK--NCASPDRF 342 +E +A + G + + AD L+ A+ + Sbjct: 338 PPHHETQIIQEMHSVIADAAKAGIRIIPVASSGVDADTEYLLRFAAVATGGTY 390 >gi|119872363|ref|YP_930370.1| von Willebrand factor, type A [Pyrobaculum islandicum DSM 4184] gi|119673771|gb|ABL88027.1| von Willebrand factor, type A [Pyrobaculum islandicum DSM 4184] Length = 358 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 68/191 (35%), Gaps = 38/191 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV-- 228 + + LDVS SM ++ G + KL VA +I L + + +V LV F++ Sbjct: 201 IYIALDVSGSMKEYIG-ALTKLKVAKNAIARYLHQ---MAHLRGLV--SLVLFNTDADFM 254 Query: 229 -QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 P+ ++ + E + + T+ LE + + I Sbjct: 255 WTPHPVNIYLRDMIEILKYIYAMGGTELASALELL----------------QSHEISRDI 298 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSV 345 + +TDG D + L KR + A QFLK+ A + ++ + Sbjct: 299 VIITDGR----THDPDKVLNLAKRFKRLHIV-------ATEKSQFLKSLAKTTGGKYREL 347 Query: 346 QNSRKLHDAFL 356 + + + Sbjct: 348 TPTLNILEVLS 358 >gi|262171975|ref|ZP_06039653.1| TPR domain protein in aerotolerance operon [Vibrio mimicus MB-451] gi|261893051|gb|EEY39037.1| TPR domain protein in aerotolerance operon [Vibrio mimicus MB-451] Length = 562 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 47/139 (33%), Gaps = 17/139 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + S + +M++LD S SM D+L + + I ++ + ++G Sbjct: 91 EASPFGEDSASLMVLLDSSESMQQKDIAP-DRLTRSKQKILDLTEA-------RKGGKTG 142 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L+ F+ PL + +Q + + + A N + + Sbjct: 143 LMVFAGSAHVALPLTSDNRVLQPYLAAINPNVMPIEGKAAQSALNLLHEQLPPYVGNT-- 200 Query: 280 HDDYKKYIIFLTDGENSSP 298 ++ +TDG S Sbjct: 201 -------LLLVTDGVTDST 212 >gi|47212619|emb|CAF92825.1| unnamed protein product [Tetraodon nigroviridis] Length = 533 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 52/164 (31%), Gaps = 45/164 (27%) Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRL---IFGSTTKSTPGLEYAYNKIFDAKEKL 273 R +TFSS+ L +I++ +N L I G T GL+ A Sbjct: 61 RMSFITFSSRASTIMKLTENRINIRKGLNALKREIPGGDTIMHLGLQKA----------N 110 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKE----------------------------- 304 E I + + II LTDGE + + + Sbjct: 111 EQIKRENFGPASVIIALTDGELQEDELISAQQEVAASRSMSAVAAVSALAAFSIPDVMDP 170 Query: 305 ---SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 A+ GAIVY +GV+ Q + + + V Sbjct: 171 FPVFPQQAETARSLGAIVYCVGVKDFNETQLATIADTIEHVFPV 214 >gi|301789441|ref|XP_002930137.1| PREDICTED: von Willebrand factor-like [Ailuropoda melanoleuca] Length = 2813 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 60/188 (31%), Gaps = 28/188 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ + LD+ ++LD S S + M A I + + Sbjct: 1679 TLAPTPDCSQPLDVALLLDGSSSFPASYFEEMKSFAKA---------FISRANIGPQLTQ 1729 Query: 218 SGLVTFSSKIVQTFPLAWGVQ----HIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEK 272 ++ + S P W V H+ ++ + G + L+YA + Sbjct: 1730 VSVLQYGSTTTAAVP--WNVAYEKAHLLSHVDLMQREGGLSHIGDALDYAVRYVTS---- 1783 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H A+ K +I +TD + + A V+ +G+ + Sbjct: 1784 EVHGARPGAS-KAVVILVTD-------VSADTVDAAADAATSNRVTVFPVGIGDRYDEAQ 1835 Query: 333 LKNCASPD 340 L+ A P+ Sbjct: 1836 LRRLAGPN 1843 >gi|222056730|ref|YP_002539092.1| Vault protein inter-alpha-trypsin domain protein [Geobacter sp. FRC-32] gi|221566019|gb|ACM21991.1| Vault protein inter-alpha-trypsin domain protein [Geobacter sp. FRC-32] Length = 665 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 64/209 (30%), Gaps = 36/209 (17%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 L++ K+ + + + + VLDVS SM G +D R + L + Sbjct: 290 LMVQPPEKVQAAEILPREYIFVLDVSGSM---HGFPLDTAKTLIRDLIGNLRPTDTFN-- 344 Query: 213 NNVVRSGLVTFSSKIVQTFP-----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 LV F+ P + + I+ G T+ L A + Sbjct: 345 -------LVLFAGGSQVMDPSSIPATSENITKAIRLIDSQQGGGGTELAAALNKA---LS 394 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 +EK + + +TDG + + + V++ G+ + Sbjct: 395 LPREKGKARTA---------VIITDG---FISAERESFKLISENLDTTN--VFSFGIGSS 440 Query: 328 AADQFLKNCASPDR--FYSVQNSRKLHDA 354 + A + + V + +A Sbjct: 441 INRYLVDGIAQAGQGESFVVTKPEEAKEA 469 >gi|330500925|ref|YP_004377794.1| PpkA-like protein [Pseudomonas mendocina NK-01] gi|328915211|gb|AEB56042.1| PpkA-related protein [Pseudomonas mendocina NK-01] Length = 1008 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 77/216 (35%), Gaps = 22/216 (10%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 ++ ++ S +++V+D S+SM P +D++ ++ L Sbjct: 566 PGNAAQSQTPAGKPATATDSGFRTGIVLVVDTSVSMQ----PYIDRVRQVVSELQSQLQA 621 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR------LIFGSTTKSTPGL 259 + +V+ GLV + + +T L + + + + L ++T Sbjct: 622 RGELDNVS----FGLVGYRNSTERTPGLQYLSKTLVSLQDGGDPQRFLRAAEQVQATSVS 677 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDY-KKYIIFLTDG----ENSSPNIDNKESLFYCNEAKR 314 +++N+ A +Y + I+ ++D +N + A Sbjct: 678 SHSFNEDAFAGVMQAVEGMDWSNYGGRLILLVSDAGALRKNDPHSSTRMNEAEVRQAALS 737 Query: 315 RGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 + ++A+ ++ A N AS ++ Y V + Sbjct: 738 KQIKIFALHLRTPAGKA---NHASAEQQYRVLTADS 770 >gi|327459721|gb|EGF06061.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1057] Length = 450 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|330805799|ref|XP_003290865.1| hypothetical protein DICPUDRAFT_155398 [Dictyostelium purpureum] gi|325078990|gb|EGC32613.1| hypothetical protein DICPUDRAFT_155398 [Dictyostelium purpureum] Length = 942 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 63/181 (34%), Gaps = 21/181 (11%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVL--DVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + + + +K +S I D+ +V D + SM K+ + Sbjct: 599 AVEDSINESELLKSFVESSISSDVEIVFCFDTTGSMASVIESVKSKVNQTVTRL------ 652 Query: 206 IKSIPDVNNVVRSGLVTFSSK--IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 +++IP++ + GL + + ++ T L V+ + I ++ S EYA Sbjct: 653 MQTIPNIKIGI-MGLGDYCDRENVITTLDLTENVEKLTTFITKIPHTSGGDVPEAYEYA- 710 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN--KESLFYCNEAKRRGAIVYA 321 ++ AKE K + + D P+ N C+ G +Y Sbjct: 711 --LYKAKELSWSKHT-----SKAFVMIGDSNPHEPSFTNLHINWFEECDNLFDMGIKIYG 763 Query: 322 I 322 I Sbjct: 764 I 764 >gi|310823569|ref|YP_003955927.1| hypothetical protein STAUR_6343 [Stigmatella aurantiaca DW4/3-1] gi|309396641|gb|ADO74100.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 293 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 24/97 (24%), Positives = 39/97 (40%), Gaps = 10/97 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +M+++DVS S FG + I + + N R GL+ FS ++ Sbjct: 77 LTVMLLVDVSAS--KEFGSH----ERSKSEIAAEAAAQIAFSAIANNDRVGLILFSDRVE 130 Query: 229 QTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEY 261 + P G H+ I+ ++ G T GL Y Sbjct: 131 KVVPPRKGRSHVLRLISDILTFKPQGKGTDLGAGLMY 167 >gi|281343742|gb|EFB19326.1| hypothetical protein PANDA_020489 [Ailuropoda melanoleuca] Length = 2801 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 60/188 (31%), Gaps = 28/188 (14%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++ + LD+ ++LD S S + M A I + + Sbjct: 1667 TLAPTPDCSQPLDVALLLDGSSSFPASYFEEMKSFAKA---------FISRANIGPQLTQ 1717 Query: 218 SGLVTFSSKIVQTFPLAWGVQ----HIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEK 272 ++ + S P W V H+ ++ + G + L+YA + Sbjct: 1718 VSVLQYGSTTTAAVP--WNVAYEKAHLLSHVDLMQREGGLSHIGDALDYAVRYVTS---- 1771 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 H A+ K +I +TD + + A V+ +G+ + Sbjct: 1772 EVHGARPGAS-KAVVILVTD-------VSADTVDAAADAATSNRVTVFPVGIGDRYDEAQ 1823 Query: 333 LKNCASPD 340 L+ A P+ Sbjct: 1824 LRRLAGPN 1831 >gi|39963539|ref|XP_364917.1| hypothetical protein MGG_09762 [Magnaporthe oryzae 70-15] gi|145015458|gb|EDJ99994.1| hypothetical protein MGG_09762 [Magnaporthe oryzae 70-15] Length = 777 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 68/222 (30%), Gaps = 35/222 (15%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDII-----KSIPD 211 + D+++V+DVS SM P D+ V R +LD++ + Sbjct: 40 PRVPKTNEPTPTDLVLVIDVSPSMQTEMVVPTEDENQVRERFGFTVLDLVGHACLTILET 99 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRL---------IFGSTTKSTPG 258 + R G+V F + L + + + L G Sbjct: 100 LTERDRLGIVMFKGRATVLQGLTLQDPQAKERSAKYLGDLRRLSEKWHCNMLGERDVMDG 159 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR---R 315 L+ E +H ++ +TD ++D+ E K Sbjct: 160 LQVGLQLF---NEVRDHSLPYRVPA---VMLVTD-----SHLDSTEYTKPVASLKETNPE 208 Query: 316 GAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAF 355 A ++ G + K + S R+ + +S + AF Sbjct: 209 KAQIHTFGFGYNSEAGVFKAFSEISGGRYTFIPDSSMIGTAF 250 >gi|58616384|ref|YP_195514.1| tellurium resistance protein [Azoarcus sp. EbN1] gi|56315846|emb|CAI10490.1| tellurium resistance protein [Aromatoleum aromaticum EbN1] Length = 214 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 49/134 (36%), Gaps = 9/134 (6%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S L + +++D S SM G ++ + V R+ M ++ P V + TF Sbjct: 3 SSRRLPVYLLIDTSGSMR---GEPVESVNVGLRA---MQTSLRQNPYAIETVHLSVTTFD 56 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 S+I PL +I T LE+ ++ + KG Sbjct: 57 SQIKDVLPLTALEDATIPEI-VCPASGATLLGEALEHILDRAKKEVRQSSAEQKGDWAPL 115 Query: 285 KYIIFLTDGENSSP 298 +I +TDG+ + Sbjct: 116 LFI--MTDGKPTDT 127 >gi|86143677|ref|ZP_01062053.1| hypothetical protein MED217_00250 [Leeuwenhoekiella blandensis MED217] gi|85829720|gb|EAQ48182.1| hypothetical protein MED217_00250 [Leeuwenhoekiella blandensis MED217] Length = 288 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 63/206 (30%), Gaps = 37/206 (17%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + S V +Y+ W + + + + L MM+V DVS S Sbjct: 34 KGRGMTFSEVRQYQFGDDVRNIDWNVTARY-----NEPFVKVFEEERELTMMLVADVSGS 88 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 + FG + + + + N + GL+ FS I P G H+ Sbjct: 89 --EFFGTD----QQLKSEVVTEIAATLAFSALQNNDKIGLILFSDGIELYIPPKKGKSHV 142 Query: 241 QEKINRLIF----GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF-LTDGEN 295 I L+ T L+ KK I+F L+D Sbjct: 143 LRIIRELLEFKPKSKKTDVAQALK----------------FLSGVMKKKAIVFVLSD--- 183 Query: 296 SSPNIDNKESLFYCNEAKR-RGAIVY 320 D ++++ + G +Y Sbjct: 184 -FIADDYQDTMKIAAKRHDITGIRIY 208 >gi|288932694|ref|YP_003436754.1| hypothetical protein Ferp_2362 [Ferroglobus placidus DSM 10642] gi|288894942|gb|ADC66479.1| conserved hypothetical protein [Ferroglobus placidus DSM 10642] Length = 403 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 82/258 (31%), Gaps = 46/258 (17%) Query: 42 FFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGF 101 +F + L +++ ++ TK + ++ + + +++K + + G Sbjct: 119 YFFEEALKELIEMGIIEGVTKRFFRRKVKFSRQAERIIAQKVMKEVSKEAKGYYAESEG- 177 Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 + + L + H Y++ I T A + + V Sbjct: 178 --ETLSYIPGYELVEYDEYLH---------SYDLIDIPETMIRAAKNEDFEIREKDIVSR 226 Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + K +M++DVS SM G ++ SIR+ D ++ Sbjct: 227 NPKKVGKRHFVMLIDVSDSMRGKKIVGAIEAALALKMSIRKGFDDLEVF----------- 275 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 F+ + + I + T L+ A N + Sbjct: 276 -VFNHRTEKIRE---------GDIVNVDVEGRTDIALALKTARNALR------------G 313 Query: 281 DDYKKYIIFLTDGENSSP 298 D KY+I +TDGE ++ Sbjct: 314 KDGAKYVILITDGEPTAS 331 >gi|254505611|ref|ZP_05117757.1| TPR repeat-containing protein [Vibrio parahaemolyticus 16] gi|219551264|gb|EED28243.1| TPR repeat-containing protein [Vibrio parahaemolyticus 16] Length = 603 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 20/125 (16%), Positives = 44/125 (35%), Gaps = 23/125 (18%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+D+S+SM ++L A + ++L + +GLV ++ Sbjct: 80 VVVMDMSMSM-YATDIKPNRLTQARYKVTDLLSHWQEGS-------TGLVAYAGDAYMVS 131 Query: 232 PLAWGVQHIQEKI----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 P+ I + L+ + G++ A + + H I Sbjct: 132 PMTSDANTIANLVPNLSPELMPYPGANAASGIKLAIEMMQNTGLATGH-----------I 180 Query: 288 IFLTD 292 + +TD Sbjct: 181 VLVTD 185 >gi|167845909|ref|ZP_02471417.1| hypothetical protein BpseB_11520 [Burkholderia pseudomallei B7210] Length = 579 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 43/127 (33%), Gaps = 8/127 (6%) Query: 14 KGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGK 73 +GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + + Sbjct: 1 RGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCAQ 55 Query: 74 KQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRY 133 + N L D + + + + + + Sbjct: 56 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTATQ 112 Query: 134 EMPFIFC 140 +P+ F Sbjct: 113 SVPYFFL 119 >gi|39937341|ref|NP_949617.1| hypothetical protein RPA4281 [Rhodopseudomonas palustris CGA009] gi|192293121|ref|YP_001993726.1| hypothetical protein Rpal_4760 [Rhodopseudomonas palustris TIE-1] gi|39651199|emb|CAE29722.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286870|gb|ACF03251.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 390 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 9/53 (16%), Positives = 24/53 (45%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 F + +++++ AI L + +G ++ + + K+ LD ++L Sbjct: 4 RFLRDRSANVAVIFAIALIPLLGAVGSAVDYTIASNQRMKMQTALDSAVLAGV 56 >gi|332823606|ref|XP_003311226.1| PREDICTED: complement C2 isoform 3 [Pan troglodytes] Length = 538 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 13/110 (11%) Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +FS + T P Q ++ N G+ T + L Y + + L Sbjct: 99 TSFSHMLGATNP----TQKTKDHEN----GTGTNTYAALNSVYLMMNNQMRLLGMETMAW 150 Query: 281 DDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 + + II LTDG+++ D+ + N+ + +YAIGV Sbjct: 151 QEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVG 200 >gi|313227043|emb|CBY22190.1| unnamed protein product [Oikopleura dioica] Length = 1109 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 73/200 (36%), Gaps = 28/200 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 S +DM+ +LD S S+ G V +R ++ + P + + Sbjct: 529 NSTKEVDMIFLLDSSGSV------GKPNFQVMKSWMRRLISGLNIAPGRTQ---VSVYLY 579 Query: 224 SSKIVQTFPLA--WGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 ++ F L + IN++++ T+ L+ A +K+ + + + Sbjct: 580 NNIFRTIFNLNEHQNAYDMITAINKMVYSGKGTRIARALQSAMSKVLIP----QSGLRPN 635 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG---AIVYAIGVQAEAADQFLKNCA 337 + Y+ LTDG+ S D + N+ K + A+G+ + L A Sbjct: 636 SEI--YLYLLTDGKES----DVADVNNMANDIKDAFGDRITLTAVGISRSVENAELYAIA 689 Query: 338 SP---DRFYSVQNSRKLHDA 354 S D + ++N R L Sbjct: 690 SAPKKDNVFLLENYRDLDTI 709 >gi|295849303|ref|NP_001171534.1| complement C2 isoform 3 [Homo sapiens] gi|194390502|dbj|BAG60565.1| unnamed protein product [Homo sapiens] Length = 538 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 13/110 (11%) Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +FS + T P Q ++ N G+ T + L Y + + L Sbjct: 99 TSFSHMLGATNP----TQKTKDHEN----GTGTNTYAALNSVYLMMNNQMRLLGMETMAW 150 Query: 281 DDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 + + II LTDG+++ D+ + N+ + +YAIGV Sbjct: 151 QEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVG 200 >gi|123283203|emb|CAI17449.2| complement component 2 [Homo sapiens] gi|123857991|emb|CAM25861.1| complement component 2 [Homo sapiens] gi|168983783|emb|CAQ06834.1| complement component 2 [Homo sapiens] gi|168984349|emb|CAI41857.2| complement component 2 [Homo sapiens] gi|168984417|emb|CAQ09273.1| complement component 2 [Homo sapiens] gi|168985078|emb|CAQ07482.1| complement component 2 [Homo sapiens] gi|168985956|emb|CAQ07112.1| complement component 2 [Homo sapiens] Length = 525 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 13/110 (11%) Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +FS + T P Q ++ N G+ T + L Y + + L Sbjct: 86 TSFSHMLGATNP----TQKTKDHEN----GTGTNTYAALNSVYLMMNNQMRLLGMETMAW 137 Query: 281 DDYKKYIIFLTDGENSSPNI-----DNKESLFYCNEAKRRGAIVYAIGVQ 325 + + II LTDG+++ D+ + N+ + +YAIGV Sbjct: 138 QEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVG 187 >gi|297664536|ref|XP_002810695.1| PREDICTED: LOW QUALITY PROTEIN: calcium-activated chloride channel regulator 2-like [Pongo abelii] Length = 945 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 77/214 (35%), Gaps = 43/214 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQAAEFYLMQI---VEIHTFV--GIASFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + + + T + GL+ + + Y Sbjct: 362 AQLHQINSNDDRKLLVSYL-PTTVSAKTDVSICSGLKKGFEVV---------EKLNGKAY 411 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPD 340 +I LT G++ + L C G+ +++I + + AA L Sbjct: 412 GSVMILLTSGDD--------KLLGNCLPTVLSSGSTIHSIALGSSAAPNLEELSRLTGGL 463 Query: 341 RFY--SVQNSRKLHDAFLRIGK---EMVKQRILY 369 +F+ + +S + DAF RI ++ +Q I Sbjct: 464 KFFVPDISDSNSMIDAFSRISSGTGDIFQQHIQL 497 >gi|295087036|emb|CBK68559.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bacteroides xylanisolvens XB1A] Length = 614 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 74/195 (37%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N++H + I K I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMESGVCPW--NTNHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V SG S+ + Q I+E I+ L G Sbjct: 267 LDLVKSSLKLLVNNLRDKDKVAIVTYSG----SAGVKLEATPGSDKQKIREAIDELTAGG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II +DG+ + + + Sbjct: 323 STAGGAGILLAYRIAKKNLISNGNNR---------IILCSDGDFNVGVSSAEGLEQLIEK 373 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 374 ERKSGVFLTVLGYGM 388 >gi|126729349|ref|ZP_01745163.1| hypothetical protein SSE37_24154 [Sagittula stellata E-37] gi|126710339|gb|EBA09391.1| hypothetical protein SSE37_24154 [Sagittula stellata E-37] Length = 248 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 64/189 (33%), Gaps = 29/189 (15%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +VLD S SM + A +++ + ++ + + R +V ++ +V F Sbjct: 67 AIVLDDSGSMG-------SDMEAAKQAV------VDALSAMQDTDRVAVVALNAGVVLPF 113 Query: 232 -PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 +A + + + + +T L A E +G ++ +I Sbjct: 114 ASVADARRTLPAALAPIRDTGST----PLTRAILDTQAMLEAEASSVRGFGTFR--MIVT 167 Query: 291 TDGENSSPNIDNKESLFYC--NEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 TDG D+ E+L + A + + IG+ + A F V N Sbjct: 168 TDG-----AADDGEALQRAIEDLAAKTPIQLTTIGIGIRGNHVLRR--ADLGSFVDVANV 220 Query: 349 RKLHDAFLR 357 L A Sbjct: 221 AALEGALQA 229 >gi|115379097|ref|ZP_01466221.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|115363880|gb|EAU62991.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] Length = 270 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 24/97 (24%), Positives = 39/97 (40%), Gaps = 10/97 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +M+++DVS S FG + I + + N R GL+ FS ++ Sbjct: 54 LTVMLLVDVSAS--KEFGSH----ERSKSEIAAEAAAQIAFSAIANNDRVGLILFSDRVE 107 Query: 229 QTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEY 261 + P G H+ I+ ++ G T GL Y Sbjct: 108 KVVPPRKGRSHVLRLISDILTFKPQGKGTDLGAGLMY 144 >gi|332361361|gb|EGJ39165.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1056] Length = 451 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 174 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 223 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 224 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 283 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 284 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 337 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 338 G--IYKNFATEGRVLDIEN 354 >gi|326501022|dbj|BAJ98742.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 284 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 70/212 (33%), Gaps = 25/212 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++++ + + ++D S S G A + ++ +++ Sbjct: 52 SVTTRCNKYQSISFLVDESGS------IGASAFQYAKSFLYAYVNQT-----YDDLSIMS 100 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + F S I I + +T N I ++ +++ Sbjct: 101 IHFFDSTFDPYIYYGNNRATILNMIQSKAYRGAGTATG------NAINNSVALIKNKNFP 154 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-DQFLKNCAS 338 + K ++ LTDG + D + N A++ G +++ +G+ + Q ++ S Sbjct: 155 NGVP-KILVILTDG----GSYD--SVIEAANNARKNGIMLFCVGIGSNVNTAQLIQIAGS 207 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQRILYN 370 + + L + I KQ I N Sbjct: 208 TSNIVYISSYSSLTNLVNLIENYFCKQIIDVN 239 >gi|319782123|ref|YP_004141599.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168011|gb|ADV11549.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 554 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 65/198 (32%), Gaps = 32/198 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---- 226 ++++LD S SM G KL +A S+R +L + + + G + + + Sbjct: 27 VIIILDASGSM-WAQIDGKPKLEIARESLRTVLQSVPTDDE------IGFMAYGHREKGS 79 Query: 227 ---IVQTFPLAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 I P G I + L F T T ++ A + + + Sbjct: 80 CDDIQLIVPPQAGSGSAISAAADSLKFLGKTPLTAAVKQAAEALR------------YTE 127 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA---SP 339 K ++ +TDG + A V G+ A+ Q C + Sbjct: 128 DKATVVLITDGLETCGGDPCALGKELEASGVDFTADVVGFGLTADEGKQI--ACLADNTG 185 Query: 340 DRFYSVQNSRKLHDAFLR 357 ++ + + L +A Sbjct: 186 GKYIQASDEKALQEALAE 203 >gi|215487295|ref|YP_002329726.1| hypothetical protein E2348C_2215 [Escherichia coli O127:H6 str. E2348/69] gi|312967306|ref|ZP_07781522.1| von Willebrand factor type A domain protein [Escherichia coli 2362-75] gi|168986456|dbj|BAG11994.1| conserved predicted protein [Escherichia coli O55:H6] gi|215265367|emb|CAS09763.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] gi|312288114|gb|EFR16018.1| von Willebrand factor type A domain protein [Escherichia coli 2362-75] Length = 219 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLITFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|168704873|ref|ZP_02737150.1| hypothetical protein GobsU_35372 [Gemmata obscuriglobus UQM 2246] Length = 748 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 56/172 (32%), Gaps = 20/172 (11%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + SD +D +++ D S SM G +L A + +LD + V + Sbjct: 82 SGAASDGPVDAVLMFDTSYSMAARDGEK-TRLERAKDAAVAVLDALPDQSSVQ------I 134 Query: 221 VTFSSKIVQTFPLA-WGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + + P++ + ++ + + S +T PGL A A+ Sbjct: 135 YACADRAQALGPVSRYNRDQAKQLVRSIEVTSLSTDVLPGLTDALAA-----------AE 183 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K I +D + S C EA+ + A + + Sbjct: 184 TGTAPAKEIYVFSDLQKSGFERQQPGLRAKCEEAREKKAGLVFVRCGNPGRK 235 >gi|156976370|ref|YP_001447276.1| hypothetical protein VIBHAR_05143 [Vibrio harveyi ATCC BAA-1116] gi|156527964|gb|ABU73049.1| hypothetical protein VIBHAR_05143 [Vibrio harveyi ATCC BAA-1116] Length = 601 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 41/138 (29%), Gaps = 13/138 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F W S S ++ + ++VLD+S SM ++L Sbjct: 54 FAIWGLAWAIACIALASPSWQSNTRPS-FELSQNRVLVLDMSRSM-YATDVKPNRLSQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GLV ++ PL + I L+ Sbjct: 112 YKASDLLPKWKEGS-------TGLVAYAGDAYTLSPLTTDSSTLAGIIENLSPELMPYQG 164 Query: 253 TKSTPGLEYAYNKIFDAK 270 + +E A + A Sbjct: 165 SNLPSAIETALGQFTQAG 182 >gi|148706513|gb|EDL38460.1| vitrin, isoform CRA_b [Mus musculus] Length = 643 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 45/131 (34%), Gaps = 29/131 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ V+D S S+ G + + + K + R G V ++ + Sbjct: 484 DIGFVIDGSSSV------GTSNFRTVLQFVANL---SKEFEISDTDTRVGAVQYTYEQR- 533 Query: 230 TFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 L +G K + L T + ++YA ++F K + Sbjct: 534 ---LEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPN 581 Query: 283 YKKYIIFLTDG 293 +K +I +TDG Sbjct: 582 KRKVMIIITDG 592 >gi|333003181|gb|EGK22729.1| von Willebrand factor type A domain protein [Shigella flexneri K-272] gi|333017016|gb|EGK36338.1| von Willebrand factor type A domain protein [Shigella flexneri K-227] Length = 378 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 60/191 (31%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRPNLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAMEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L +I Sbjct: 352 ALVSVGAQIAA 362 >gi|170038914|ref|XP_001847292.1| dihydropyridine-sensitive l-type calcium channel [Culex quinquefasciatus] gi|167862533|gb|EDS25916.1| dihydropyridine-sensitive l-type calcium channel [Culex quinquefasciatus] Length = 1109 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 68/204 (33%), Gaps = 44/204 (21%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + D++++LD S SM+ +L +AT S + + + L++F Sbjct: 153 AASSPKDVIILLDSSGSMSGKEY----QLAMATASAI--------MDTLGDDDYFNLISF 200 Query: 224 SSKIVQTFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 S + P V+ ++ I + +T + LE A+ + + Sbjct: 201 SDQAKVIVPCFQDKMVRATPDNVKEVKTAIQTVECENTANFSAALESAFELL-----RRY 255 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYA--IGVQAE 327 + + + I+ +TDG + + + K ++ IG Sbjct: 256 NQSSLGSQCNQAIMLITDGPSD----------TFADIIKHYNHPHMPVRIFTYLIGKDKS 305 Query: 328 AADQFLK-NCASPDRFYSVQNSRK 350 + + C + + + + + Sbjct: 306 SGKNLYQMACDNKGFYVQINSVEE 329 >gi|150005793|ref|YP_001300537.1| hypothetical protein BVU_3286 [Bacteroides vulgatus ATCC 8482] gi|149934217|gb|ABR40915.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482] Length = 289 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 45/109 (41%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSNSLDF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S +I + P G +H I+E ++ T +EY N I Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFKPESKRTDIKIAVEYLTNVIKK 174 >gi|126458984|ref|YP_001055262.1| hypothetical protein Pcal_0361 [Pyrobaculum calidifontis JCM 11548] gi|126248705|gb|ABO07796.1| conserved hypothetical protein [Pyrobaculum calidifontis JCM 11548] Length = 429 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 33/159 (20%), Positives = 56/159 (35%), Gaps = 24/159 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM G + + + + + VV + F +IV Sbjct: 269 IYLLVDKSGSMFYTLYDGF-AMDMTQKITWATALAVALMKRSRRVV----MRFFDQIV-Y 322 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P+ + I+ + L G T + L Y + DAK H K ++ + Sbjct: 323 PPITNTREVIKALLKVLPLGG-TDISAAL---YTAVRDAKAYGLHSYK--------LVLV 370 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 TDGE+ N + + AK V AI V Sbjct: 371 TDGEDDMINPEAIRA------AKAAFREVKAILVGGSNE 403 >gi|332707319|ref|ZP_08427370.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] gi|332353913|gb|EGJ33402.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] Length = 464 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 58/197 (29%), Gaps = 38/197 (19%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSM------------------NDHFGPGMDKLGVATRSI 199 K + V+D S SM + + + I Sbjct: 34 PTKEVAVELPSTSFAFVIDTSGSMYEVVEGDTKPTGRVYTQDGNDYEEVIGGKTKIDIVI 93 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKST 256 +L++++S + R ++ F + L ++ I +L + T Sbjct: 94 ESLLNLVRS-NQLGGSDRIAIIQFDDQASTIVGLTPATETSQLEAGIEKLRNYSGGTCMG 152 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 G+E + G ++ + TDG+ D +E + G Sbjct: 153 EGMEQTLTMLS-----------GQTMTSRHALIFTDGQ----AFDEEECRELAKQFSENG 197 Query: 317 AIVYAIGVQAEAADQFL 333 + A+GV + + L Sbjct: 198 IPITALGVG-DYNEDLL 213 >gi|260171427|ref|ZP_05757839.1| hypothetical protein BacD2_06135 [Bacteroides sp. D2] Length = 608 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 74/195 (37%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N++H + I K I + + +++ ++DVS SM G ++ Sbjct: 206 VKITMEAGACPW--NANHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 258 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V +G S+ + Q I+E I+ L G Sbjct: 259 LDLVKSSLKLLVNNLRDKDKVAIVTYAG----SAGVKLEATPGSDKQKIREAIDELTAGG 314 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II +DG+ + + + Sbjct: 315 STAGGTGILLAYKIAKKNFISNGNNR---------IILCSDGDFNVGVSSAEGLEQLIEK 365 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 366 ERKSGVFLTVLGYGM 380 >gi|320011517|gb|ADW06367.1| cobaltochelatase subunit [Streptomyces flavogriseus ATCC 33331] Length = 680 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 49/140 (35%), Gaps = 18/140 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++ ++ +L + + G Sbjct: 486 QATREGREGNLVLFVVDASGSMA-----ARQRMSAVKGAVMSLL-----LDAYQRRDKVG 535 Query: 220 LVTFSS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LVTF P V ++ L G T GL A++ + ++E + Sbjct: 536 LVTFRGKDAEVALPPTSSVDAAAARLESLPTGGRTPLAAGLLKAHDVL-----RVERLRD 590 Query: 279 GHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 591 PSRRP--LLVVVTDGRATGG 608 >gi|160887255|ref|ZP_02068258.1| hypothetical protein BACOVA_05272 [Bacteroides ovatus ATCC 8483] gi|156107666|gb|EDO09411.1| hypothetical protein BACOVA_05272 [Bacteroides ovatus ATCC 8483] Length = 616 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 74/195 (37%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N++H + I K I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMEAGACPW--NANHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V +G S+ + Q I+E I+ L G Sbjct: 267 LDLVKSSLKLLVNNLRDKDKVAIVTYAG----SAGVKLEATPGSDKQKIREAIDELTAGG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II +DG+ + + + Sbjct: 323 STAGGTGILLAYKIAKKNFISNGNNR---------IILCSDGDFNVGVSSAEGLEQLIEK 373 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 374 ERKSGVFLTVLGYGM 388 >gi|149202327|ref|ZP_01879300.1| von Willebrand factor, type A [Roseovarius sp. TM1035] gi|149144425|gb|EDM32456.1| von Willebrand factor, type A [Roseovarius sp. TM1035] Length = 747 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 64/177 (36%), Gaps = 30/177 (16%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGV-ATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 + + +VLD+S S D PG+ L + + +++ D + FSS Sbjct: 556 SIAVHLVLDISQSTADLAAPGISILDMECDAAAILAGTMLQLGDD------LAITAFSSA 609 Query: 226 --KIVQTFPL-AWGV---QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 V+ P+ +G + + L G +T+ L +A ++ D Sbjct: 610 GRHDVRVIPIKTFGSPLDEATGRALAGLRPGYSTRIGAALRFASQRMQDVSR-------- 661 Query: 280 HDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 Y+K ++ +TDG S P ++ + +G + + +A + Sbjct: 662 ---YRKLVLLVTDGAPSDIDVADPEYLVADARRAVQTMRSQGIDAVCVALGPDAGQR 715 >gi|332366663|gb|EGJ44406.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1059] Length = 450 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 47/170 (27%), Gaps = 32/170 (18%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGSTIDIVRTNIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + KK+++ LTD E + K G Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIE 326 >gi|315919023|ref|ZP_07915263.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313692898|gb|EFS29733.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 448 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 62/184 (33%), Gaps = 34/184 (18%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM +S + + + V + FS I + Sbjct: 286 LDTSGSMAGER-------ERIAKSTLLAIAELTEVQHRKCYV----ILFSDDIE-CIEIT 333 Query: 235 WGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ L F T P + +A KI + II ++D Sbjct: 334 DLGSSFDRLVDFLSQSFHGGTDMEPVITHALRKISEEGYMEAD-----------IITVSD 382 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASPDRF--YSVQNSR 349 E + +S+ AK + +YAI + A +LK C D++ YSVQN+ Sbjct: 383 FEMRPVDKLLSQSI---EHAKAKQTKMYAISLGGKSAESSYLKLC---DKYWEYSVQNAE 436 Query: 350 KLHD 353 L+ Sbjct: 437 NLNK 440 >gi|260170679|ref|ZP_05757091.1| hypothetical protein BacD2_02345 [Bacteroides sp. D2] Length = 477 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 62/184 (33%), Gaps = 34/184 (18%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM +S + + + V + FS I + Sbjct: 315 LDTSGSMAGER-------ERIAKSTLLAIAELTEVQHRKCYV----ILFSDDIE-CIEIT 362 Query: 235 WGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ L F T P + +A KI + II ++D Sbjct: 363 DLGSSFDRLVDFLSQSFHGGTDMEPVITHALRKISEEGYMEAD-----------IITVSD 411 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASPDRF--YSVQNSR 349 E + +S+ AK + +YAI + A +LK C D++ YSVQN+ Sbjct: 412 FEMRPVDKLLSQSI---EHAKAKQTKMYAISLGGKSAESSYLKLC---DKYWEYSVQNAE 465 Query: 350 KLHD 353 L+ Sbjct: 466 NLNK 469 >gi|8131972|gb|AAF73158.1|AF149771_1 ookinete protein [Plasmodium berghei] gi|5139521|emb|CAB45562.1| CTRP protein [Plasmodium berghei] gi|5420459|dbj|BAA82322.1| adhesive protein-like molecule [Plasmodium berghei] Length = 1905 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 60/174 (34%), Gaps = 20/174 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ + M+ + A I + S V G++ FS + Sbjct: 512 DVTLILDESSSIGEFRWT-MEVIPFAKDVINNLNIDYDS-------VHVGVLLFSHYALD 563 Query: 230 TFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P + + + +KI+ L T G E K G + K Sbjct: 564 LVPFSDEARYNKYTLIKKIDSLK----TNYGNGHESFIVKTLKYALSNYTKGSGRTNAPK 619 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCA 337 + TDG + + + + + + + IGV + ++ L CA Sbjct: 620 ITMLFTDG--NDSSESDIDMYNIGSLYRTERVKLLVIGVSMASENKLKQLVGCA 671 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 69/209 (33%), Gaps = 34/209 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS----SK 226 + ++LD S S+ + + L+ ++ + V G++ F+ Sbjct: 734 LTVILDESGSIGAYNWEKQ-----VYPFTEKFLNNLEISENK---VHVGIMLFAQFNRDF 785 Query: 227 IVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + ++ +++ ++I L G T L Y H D Sbjct: 786 VKFSDKESYDKENLMKQIKGLKESYKSGGYTYIIEALNYGLA-------NYTHHEASRSD 838 Query: 283 YKKYIIFLTDGENSSPNID--NKESLFYCNEAKRRGAIVYAIGVQAE--AADQFLKNC-A 337 K + TDG N++P + SL Y K+ + +GV A A + L C Sbjct: 839 VPKVTMLFTDGNNTNPGDKLLSDVSLLY----KQENVKLLVVGVGASTMANLRLLAGCHK 894 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + D I K M + Sbjct: 895 TDGNCPLATKTE--WDNLQDISKLMADKI 921 Score = 39.8 bits (91), Expect = 0.63, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 66/205 (32%), Gaps = 25/205 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS----S 225 D+ ++LD S S + K + I+K + N + G++ F+ Sbjct: 65 DLTLILDESRS--------IRKSNWVEYVVPFTEQIVKGLKIGENDIHVGILLFALRNRD 116 Query: 226 KIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 I + + + +K+N L G T L+Y+ K K D Sbjct: 117 YITFDNDIRYKKTELLKKVNDLNDDYRAGGDTYILEALKYSLKKYSMNKNAR-------D 169 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 D K I TDG + + E +E + + + +GV A + + Sbjct: 170 DAPKVTILFTDG--NDIHASKSEFHKMYSEYQEKHVKLLVLGVSAAEESKLKVIAGCENH 227 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQR 366 + + I ++ + Sbjct: 228 SSCPSAMKAEWETINNITNKLTNKI 252 >gi|302343421|ref|YP_003807950.1| TadE family protein [Desulfarculus baarsii DSM 2075] gi|301640034|gb|ADK85356.1| TadE family protein [Desulfarculus baarsii DSM 2075] Length = 138 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 48/156 (30%), Gaps = 30/156 (19%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDH-----SLLYTA 60 R + +GS+++ A+ LPV +V+ +IE ++ K L +L T+ Sbjct: 4 FRRLAADGRGSVAVEFALFLPVFLLVIFSIIELGAAWYQKQMLVNASREGARLGALFSTS 63 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 + QE Q L ++GF + ++ + Sbjct: 64 GGLTAQE--------------------VQERVNQYLSDSGFPS--QAVVQAVGVD---GA 98 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 ++ + YE P + + Sbjct: 99 SGDPVTVNVSADYEFPVLSAFIGAVPGTISLSATTV 134 >gi|302551500|ref|ZP_07303842.1| von Willebrand factor type A domain-containing protein [Streptomyces viridochromogenes DSM 40736] gi|302469118|gb|EFL32211.1| von Willebrand factor type A domain-containing protein [Streptomyces viridochromogenes DSM 40736] Length = 448 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 49/153 (32%), Gaps = 22/153 (14%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + +++D S SM+ K+ A + +D ++ V+ V G Sbjct: 54 SPGQGPSAAVAIMVDCSGSMDY----PPTKMRNARDATAAAIDTLR--DGVHFAVIGGTH 107 Query: 222 ----TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + A ++ + L G T L+ A + A + H Sbjct: 108 VAKEVYPGGGSLAVADATTRAQAKQALRSLSAGGGTAIGTWLKLADRLLSSADVAIRHG- 166 Query: 278 KGHDDYKKYIIFLTDGENSSPN-IDNKESLFYC 309 I LTDG N + D K +L C Sbjct: 167 ----------ILLTDGRNEHESPQDLKAALDAC 189 >gi|223933146|ref|ZP_03625138.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|330833239|ref|YP_004402064.1| LPXTG-motif cell wall anchor domain-containing protein [Streptococcus suis ST3] gi|223898207|gb|EEF64576.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|329307462|gb|AEB81878.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis ST3] Length = 997 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 43/240 (17%), Positives = 72/240 (30%), Gaps = 52/240 (21%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA--------TRSIREMLDIIKSIPDVNNVV 216 + D ++V+D S SM D + T + +L + N V Sbjct: 465 TKEPFDTLIVVDRSTSMTDPMNSVDTQARYLAVYKALNGTAGRQGLLSKLVGFHPENQVA 524 Query: 217 RSGLVTF----SSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKE 271 G + S T WG N + T T GL A + Sbjct: 525 IVGFQGYPGYPSGDQDSTVIANWGRSTSVALSNIQPPYNNGTNYTAGLRTAGVVL----- 579 Query: 272 KLEHIAKGHDDYKKYIIFLTDGEN-----------------SSPNIDNKESLFYCNE--A 312 + KK +IF++DG ++P +L Y N Sbjct: 580 -----DQNQSSRKKVMIFISDGVPTFAFVNGVRYGNGTISGNNPYYTRDWTLNYFNSWIG 634 Query: 313 KRRGAIVYAIGVQAE--------AADQFLKNCAS--PDRFYSVQNSRKLHDAFLRIGKEM 362 K +Y +G+ +E A L + +S + V +S+ L +I + Sbjct: 635 KYPKLPIYTLGISSEFGNSDNLSANPYVLNHMSSQTGGFYSHVADSQALERTLQKIVDDT 694 >gi|149410925|ref|XP_001511539.1| PREDICTED: similar to Leucine-rich repeat and calponin homology domain-containing protein 3 isoform 1 [Ornithorhynchus anatinus] Length = 898 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ + +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVEIFM-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|149410927|ref|XP_001511563.1| PREDICTED: similar to Leucine-rich repeat and calponin homology domain-containing protein 3 isoform 2 [Ornithorhynchus anatinus] Length = 861 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 43/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ + +++ + R LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVEIFM-KLRARDPASRGDRYMLVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTLGQALRSSFDLLNLNRLVSGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|332022412|gb|EGI62720.1| Voltage-dependent calcium channel subunit alpha-2/delta-4 [Acromyrmex echinatior] Length = 2087 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 72/192 (37%), Gaps = 18/192 (9%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML--DIIKSIPDVNNVVRSGLVTFSSKI 227 DM++++DVS SM G G SI + L + ++ NN + F + Sbjct: 1164 DMVILMDVSGSMK---GFGKTIAKTTVNSILDTLSNNDFVTLLKYNNETTDFVPCFKDML 1220 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 +Q P + ++ +N++ + T A++ + ++K A + I Sbjct: 1221 IQATPE--NLDTFKKSMNKIDTDNVANLTEAFTKAFSLLKTYRQKRGCNA--DSPCNQLI 1276 Query: 288 IFLTDGENSSPNIDNKESLF----YCNEAKRRGAIVYAIGVQAEAADQ-----FLKNCAS 338 + +TD +N + +F + + V+ + EA +++C + Sbjct: 1277 MLVTDDVPGGTLGNNLKKVFKKWNWNENSTHIPVRVFTYLIGKEATKMNELQWMVRSCLN 1336 Query: 339 PDRFYSVQNSRK 350 + VQ + Sbjct: 1337 LGDYKQVQTQEE 1348 >gi|330820715|ref|YP_004349577.1| von Willebrand factor type A [Burkholderia gladioli BSR3] gi|327372710|gb|AEA64065.1| von Willebrand factor type A [Burkholderia gladioli BSR3] Length = 225 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 55/167 (32%), Gaps = 14/167 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++VLD S SM D + + D + + V + +++F V Sbjct: 28 CVLVLDRSGSMAG------DAIAQLNEGLVTFKDELAADSLAMKRVDTAIISF-GPAVLE 80 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P L T + A + + K E+ A G Y+ +I + Sbjct: 81 MPFHTAPNFFPPT---LTAQGDTPMGSAINLALDTL--EARKSEYKANGISYYRPWIFLI 135 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 TDG + + +K+ +A+GVQ D + + Sbjct: 136 TDGGPTDTWQSAAARVREGEASKK--FAFFAVGVQGANMDILAQISS 180 >gi|301614659|ref|XP_002936803.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Xenopus (Silurana) tropicalis] Length = 720 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 60/167 (35%), Gaps = 24/167 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV---RSGL 220 D L+M +LD S S D+ + + +M+D +K + + R L Sbjct: 51 DEDCILEMAFLLDSSESAKDYNHNRE------KKFVLQMVDKLKEVKPNSGRSFSWRMAL 104 Query: 221 VTFSSKIVQTFPL-AW-GVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIA 277 + +SS +V W G ++ + +I + + T ++ + + Sbjct: 105 LQYSSTVVIEQTFRDWKGPENFKSRIAPIAYIGHGTYTSYAITNLTQIYMNEGTHKS--- 161 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 K I +TDG + N D + AK ++ +G+ Sbjct: 162 ------VKVAILITDGVDHPRNPDIFAATS---NAKHHDIKLFTVGM 199 >gi|297696959|ref|XP_002825643.1| PREDICTED: integrin alpha-11-like [Pongo abelii] Length = 1145 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 59/161 (36%), Gaps = 25/161 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +D+++VLD S S+ P ++ + +L P ++ G+V + Sbjct: 160 QTYMDIVIVLDGSNSI----YPWVE----VQHFLINILKKFYIGPGQ---IQVGVVQYGE 208 Query: 226 KIVQTFPLAWGVQHIQEKINR---LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +V F L + +++ + + T++ + G Sbjct: 209 DVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRTAFGIEF------ARSEAFQKGGRKG 261 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 KK +I +TDGE + D+ + +++R YA+ Sbjct: 262 AKKVMIVITDGE----SHDSPDLEKVIQQSERDNVTRYAVA 298 >gi|85705522|ref|ZP_01036620.1| Nitric oxide reductase activation protein [Roseovarius sp. 217] gi|85669947|gb|EAQ24810.1| Nitric oxide reductase activation protein [Roseovarius sp. 217] Length = 747 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 60/179 (33%), Gaps = 34/179 (18%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRS----------IREMLDIIKSIPDVNNVVR 217 + + +VLD+S S D PG+ L + + + + L I + VR Sbjct: 556 SIAVHLVLDISQSTADLAAPGISILDMERDAAAILAGTMLQLGDDLAITAFSSAGRHDVR 615 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + PL + + L G +T+ L +A ++ D Sbjct: 616 VIPIK-----TFGSPLD---EATGRALAGLRPGFSTRIGAALRFASQRMQDVSR------ 661 Query: 278 KGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 Y+K ++ +TDG S P ++ + +G + + +A + Sbjct: 662 -----YRKLVLLVTDGAPSDIDVADPEYLVADARRAVQSMRSQGIDAVCVALGPDAGQR 715 >gi|320172679|gb|EFW47914.1| hypothetical protein SDB_04813 [Shigella dysenteriae CDC 74-1112] Length = 219 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E KR ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQGADMKTLAQ 171 >gi|315919742|ref|ZP_07915982.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313693617|gb|EFS30452.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 616 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 74/195 (37%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N++H + I K I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMEAGACPW--NANHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V +G S+ + Q I+E I+ L G Sbjct: 267 LDLVKSSLKLLVNNLRDKDKVAIVTYAG----SAGVKLEATPGSDKQKIREAIDELTAGG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II +DG+ + + + Sbjct: 323 STAGGTGILLAYKIAKKNFISNGNNR---------IILCSDGDFNVGVSSAEGLEQLIEK 373 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 374 ERKSGVFLTVLGYGM 388 >gi|307609442|emb|CBW98936.1| structural toxin protein RtxA [Legionella pneumophila 130b] Length = 1557 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 48/256 (18%), Positives = 95/256 (37%), Gaps = 14/256 (5%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNI---WQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 ++ + N FSY IK + L ++ + ++IE + + S+ Sbjct: 399 VSNSSLNGETFDIGLFSYNTIKTTPSEININMGLSLTDSDGDKITSSIEINLAPSVFKVG 458 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 ++ D S+ + + + + + + + I VLD S S Sbjct: 459 ENVDDTSSSNVLHRVGGDTGVVDGSGGADILVGDVGGVEIVGTTARIAF----VLDESGS 514 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 M +FG G +L V +++ ++L + + P+ + V LV F+S + T + Sbjct: 515 MGQNFG-GTTRLEVLKQTMTDILTELSNTPNASITVH--LVKFASVVNGTGTFEITGGEL 571 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 Q+ ++ I G + Y + + G D ++ + F TDG + Sbjct: 572 QQALD-FISGLQIQQGLLAGTNYEAALGQTLQWYNSQSGTADVQQTL-FFTDGAPTFYMD 629 Query: 301 DNKESLFYCNEAKRRG 316 N S Y N A+ G Sbjct: 630 GN--STEYTNIARVYG 643 >gi|145503451|ref|XP_001437701.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404855|emb|CAK70304.1| unnamed protein product [Paramecium tetraurelia] Length = 588 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 64/197 (32%), Gaps = 35/197 (17%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRSGLVTFSSKIVQTF 231 ++D S SM+ ++ A ++ L KS+P + + G +S Sbjct: 345 IIDRSGSMSGS------RISKAKEALILFL---KSLPQDSEFNIISFGSNFYSLWNESKM 395 Query: 232 PLAWGVQHIQEKINRLIFG-STTKSTPGLEY-AYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 ++ + + T+ L+ YNK + A K + Sbjct: 396 YSQNSLEQAINHVQSMDANLGGTRIIVPLKEMVYNKYYGASNKTTLN----------VFL 445 Query: 290 LTDGEN-SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 LTDGE+ + P ID + +Y +G+ + ++ A Sbjct: 446 LTDGEDFADPIIDLVQKNNRAQT------RIYTLGIGEGCSQYLIRRVAEVGN-----GK 494 Query: 349 RKLHDAFLRIGKEMVKQ 365 ++ I ++++ Sbjct: 495 HQIVSDKEDISEKIIDL 511 >gi|191168866|ref|ZP_03030639.1| von Willebrand factor type A domain protein [Escherichia coli B7A] gi|193062939|ref|ZP_03044032.1| von Willebrand factor type A domain protein [Escherichia coli E22] gi|194428768|ref|ZP_03061304.1| von Willebrand factor type A domain protein [Escherichia coli B171] gi|260844683|ref|YP_003222461.1| hypothetical protein ECO103_2551 [Escherichia coli O103:H2 str. 12009] gi|260856053|ref|YP_003229944.1| hypothetical protein ECO26_2985 [Escherichia coli O26:H11 str. 11368] gi|293446428|ref|ZP_06662850.1| yegL protein [Escherichia coli B088] gi|300818929|ref|ZP_07099134.1| von Willebrand factor type A domain protein [Escherichia coli MS 107-1] gi|300821822|ref|ZP_07101967.1| von Willebrand factor type A domain protein [Escherichia coli MS 119-7] gi|309792952|ref|ZP_07687380.1| von Willebrand factor type A domain protein [Escherichia coli MS 145-7] gi|331668769|ref|ZP_08369617.1| putative von Willebrand factor type A domain protein [Escherichia coli TA271] gi|331677984|ref|ZP_08378659.1| putative von Willebrand factor type A domain protein [Escherichia coli H591] gi|168986397|dbj|BAG11936.1| conserved predicted protein [Escherichia coli O55:H7] gi|190901105|gb|EDV60881.1| von Willebrand factor type A domain protein [Escherichia coli B7A] gi|192931582|gb|EDV84183.1| von Willebrand factor type A domain protein [Escherichia coli E22] gi|194413215|gb|EDX29501.1| von Willebrand factor type A domain protein [Escherichia coli B171] gi|257754702|dbj|BAI26204.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257759830|dbj|BAI31327.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009] gi|291323258|gb|EFE62686.1| yegL protein [Escherichia coli B088] gi|300525664|gb|EFK46733.1| von Willebrand factor type A domain protein [Escherichia coli MS 119-7] gi|300528548|gb|EFK49610.1| von Willebrand factor type A domain protein [Escherichia coli MS 107-1] gi|308123238|gb|EFO60500.1| von Willebrand factor type A domain protein [Escherichia coli MS 145-7] gi|320198764|gb|EFW73364.1| hypothetical protein ECoL_04189 [Escherichia coli EC4100B] gi|323152293|gb|EFZ38582.1| von Willebrand factor type A domain protein [Escherichia coli EPECa14] gi|323948445|gb|EGB44427.1| von Willebrand type A protein [Escherichia coli H120] gi|324119102|gb|EGC12991.1| von Willebrand protein type A [Escherichia coli E1167] gi|331063963|gb|EGI35874.1| putative von Willebrand factor type A domain protein [Escherichia coli TA271] gi|331074444|gb|EGI45764.1| putative von Willebrand factor type A domain protein [Escherichia coli H591] Length = 219 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E KR ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQGADMKTLAQ 171 >gi|118578654|ref|YP_899904.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] gi|118501364|gb|ABK97846.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] Length = 572 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 53/141 (37%), Gaps = 22/141 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K+++ +M+++D S SM K+ VAT++ + + + SIP V Sbjct: 401 KTEVNTAIMILVDRSGSMQHQ------KIEVATKTAFVVAEALDSIPGCFAAV---AAFP 451 Query: 224 SSKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL +G + + + G T L ++ ++ +E Sbjct: 452 VGNSEGVAPLVRFGERPCSSRFG-MTAGGGTPLAQALYWSGVELLKREE----------- 499 Query: 283 YKKYIIFLTDGENSSPNIDNK 303 +K ++ +TDGE + Sbjct: 500 PRKILLTVTDGEPDDRRTSKR 520 >gi|115653686|ref|XP_001201102.1| PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4, partial [Strongylocentrotus purpuratus] gi|115678385|ref|XP_795437.2| PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4, partial [Strongylocentrotus purpuratus] Length = 310 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 69/185 (37%), Gaps = 26/185 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 I S+ +++++LD S SM P D + I++S+P+ + R Sbjct: 74 TIQSELVADPEVVLLLDCSTSMKGE--PKQDAKKICKM-------ILQSLPEKS---RFN 121 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++TF + + FP ++ + L F +S G A+ + Sbjct: 122 VITFGTDFTELFP-TVEPVGQRQLLEALEFIEGARSVGGSSEAWRPL------RSLSLLP 174 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + + ++ ++DG + + K +L ++ K ++ V + L+ A Sbjct: 175 MMNSARNVLLVSDGHLT----NEKLTLEIASKYKHVN-RIFTCAVSSAGNRHILRALADV 229 Query: 338 SPDRF 342 S F Sbjct: 230 SGGAF 234 >gi|332221821|ref|XP_003260063.1| PREDICTED: LOW QUALITY PROTEIN: calcium-activated chloride channel regulator 2-like [Nomascus leucogenys] Length = 943 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 78/214 (36%), Gaps = 43/214 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQAAEFYLMQI---VEIHTFV--GIASFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + + + T+ + GL+ + + Y Sbjct: 362 AQLHQINSNDDRKLLVSYL-PTTVSAKTEVSICSGLKKGFEVV---------EKLNGKAY 411 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQ--AEAADQFLKNCASPD 340 +I +T G++ + L C G+ +++I + A + L Sbjct: 412 GSVMILVTSGDD--------KLLGNCLPTVFSSGSTIHSIALGSTAAPNLEELSRLTGGL 463 Query: 341 RFY--SVQNSRKLHDAFLRIGK---EMVKQRILY 369 +F+ + NS + DAF RI ++ +QRI Sbjct: 464 KFFVPDISNSNSMIDAFSRISSGTGDIFQQRIQL 497 >gi|260769475|ref|ZP_05878408.1| TPR domain protein in aerotolerance operon [Vibrio furnissii CIP 102972] gi|260614813|gb|EEX39999.1| TPR domain protein in aerotolerance operon [Vibrio furnissii CIP 102972] Length = 647 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 67/193 (34%), Gaps = 32/193 (16%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + P T S ++ + +V+D+SLS+ + K ++ ++LD Sbjct: 65 AVTALAGPSFETQPRPSYSANNARV---VVMDMSLSL---HATDI-KPNRLAQARYKVLD 117 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIFGSTTKSTPGLE 260 ++K P+ + +GLV ++ PL I + ++ + G++ Sbjct: 118 LLKGWPEGS----TGLVAYAGDAYTVSPLTSDSATIANLVPNLSPEIMPFPGANAATGVQ 173 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY 320 A + +A II L D ++D ES + + Sbjct: 174 RAIEMLKNAGLNRGD-----------IILLAD------DLDASESKAIRALLEGTQWKLM 216 Query: 321 AIGVQAEAADQFL 333 +G+ +A L Sbjct: 217 IVGIGTQAGAPIL 229 >gi|290243156|ref|YP_003494826.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] gi|288945661|gb|ADC73359.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] Length = 615 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 79/211 (37%), Gaps = 45/211 (21%) Query: 147 NSSHAPLLITSSVKISSKSDIG----LDMMMVLDVSLSM-NDHFGPGMD---KLGVATRS 198 +S H ++T ++ K + +++D S SM D M + +AT S Sbjct: 417 SSRHLSRVVTGDHRVFGKRQESGTPNTAVQILVDRSGSMAGDPIETAMTAALAIQLATDS 476 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 +R + + + P ++ GLV + ++G ++ FG + Sbjct: 477 LRGINTQVSAFPASSSG---GLVPIT---------SFGENG---RMKADNFGVGSTGATP 521 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 + A + + + +K ++ +TDG + D++ ++ A+ Sbjct: 522 MSNAILGVLPS-------MFARSESRKVMLVITDGAPN----DSESAMEAIRMARDVNVE 570 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 +YAIG++ + P + V+N+ Sbjct: 571 MYAIGIETD-----------PSHLFGVENTT 590 >gi|6006035|gb|AAD42203.3|AF139749_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 862 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 90/256 (35%), Gaps = 38/256 (14%) Query: 54 HSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL--RENGFAQDINNIERS 111 S +T + +GN + + ++ + EL ++ ++ + ++++ Sbjct: 98 TSSTSPSTPAVASSDGNQATGTEVETPMMEVEQYTVDNKATELNIKDGKNLKNGSRVDKN 157 Query: 112 TSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDM 171 T L DD+ +D ++ + + + D G D+ Sbjct: 158 TKLIRNRDDEQRD--------------IVDIKREVKTNADGTIDVTVTVTPKEIDEGADV 203 Query: 172 MMVLDVSLSM-NDHFGPGMDKLGVATRSIREM----LDIIKSIPDVNNVVRSGLVTFSSK 226 M +LDVS M D F DK+ ++ D + + N VR L+TF K Sbjct: 204 MALLDVSKKMTEDDFKNAKDKIKKLVTTLTSRSASNSDNDEHKHNSRNSVR--LMTFYRK 261 Query: 227 IVQTFPLAWGVQHIQEKINRL------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + L +++ + ++ + + + A KEK Sbjct: 262 VNEPIQLT--AENVDKTLDEVWKKAKEDWNWGVDLQGAIHKAREIFNKEKEKKSGK---- 315 Query: 281 DDYKKYIIFLTDGENS 296 +++I+ + GE++ Sbjct: 316 ---RQHIVLFSQGEST 328 >gi|73960097|ref|XP_855330.1| PREDICTED: similar to calcium activated chloride channel 2 [Canis familiaris] Length = 943 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 68/204 (33%), Gaps = 36/204 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + + +L A + I + G+V+F+SK Sbjct: 312 VCLVLDVSSKMAE--ANRLLRLQQAVEFYLMQIVEIHTF--------VGIVSFNSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L + + + + T GL+ + + Y Sbjct: 362 AQLHQINSDDDRKLLVSHLPMTVSAEAETSVCSGLKKGFEVV---------EKLNGKAYG 412 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRF 342 +I +T G++ IDN A G+ +++I + + + L +F Sbjct: 413 SVMILVTSGDDEH--IDNCLL-----TALSSGSTIHSIAMGSSVVENLEELSRRTGGLKF 465 Query: 343 YSVQ--NSRKLHDAFLRIGKEMVK 364 + NS + DAF RI Sbjct: 466 FVPDESNSNSMIDAFSRISSGTGD 489 >gi|119963136|ref|YP_948083.1| von Willebrand factor type A domain-containing protein [Arthrobacter aurescens TC1] gi|119949995|gb|ABM08906.1| putative von Willebrand factor type A domain protein [Arthrobacter aurescens TC1] Length = 467 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 64/200 (32%), Gaps = 36/200 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++ LDVS SM + R G+V F S VQ Sbjct: 234 DIVLCLDVSGSMTSTDAALASVFQGLAKEFDGE--------------RIGMVIFDSSSVQ 279 Query: 230 TFPLAWGVQHIQEKINRL---------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 FPL Q+ E++ R F T + G + + + G Sbjct: 280 LFPLTDDYQYAAEQLTRAKEALDSGAGSFFDGTWNGEGSSLIGDGLASCIQSFPDTDTGD 339 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNE-AKRRGAIVYAI-------GVQAEAADQF 332 + ++ TD + + + +L A + VYA+ G Q + Sbjct: 340 AKRSRSVVLATD---NFLSGEPIFTLDEATALATGKSVKVYALNPGDMDYGEQPDQPGVQ 396 Query: 333 LKNCA--SPDRFYSVQNSRK 350 L+ A + ++++ + Sbjct: 397 LRAAAERTGGSYFTLDSPDA 416 >gi|229526204|ref|ZP_04415608.1| TPR domain protein in aerotolerance operon [Vibrio cholerae bv. albensis VL426] gi|229336362|gb|EEO01380.1| TPR domain protein in aerotolerance operon [Vibrio cholerae bv. albensis VL426] Length = 632 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 52/159 (32%), Gaps = 26/159 (16%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W + S + S + + +++D+S SM P T+ Sbjct: 56 VLVLSWIVATLAMAGPSWQSAERPSVQNSAARV-LIMDMSRSMYATDLAP-----NRLTQ 109 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTT 253 + + LD++K + + +GLV +++ PL + + L + + Sbjct: 110 ARYKALDLLKGWQEGS----TGLVAYAADAYVVSPLTSDSATLANLLPNLSPDIMPYQGS 165 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + + A + + + +I +TD Sbjct: 166 DAATAVSLAITMLQQSGHQQGD-----------LILITD 193 >gi|168701952|ref|ZP_02734229.1| hypothetical protein GobsU_20668 [Gemmata obscuriglobus UQM 2246] Length = 864 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 64/180 (35%), Gaps = 33/180 (18%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV--- 228 ++V+D S SM L ++ ++L + N +V L+++SS+ Sbjct: 83 VIVVDRSGSM-------YSALPETKETLLKIL-TLDEYAQYNLLVT--LISYSSQGDVIC 132 Query: 229 --QTFPLAWGVQHIQEK---INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P+ ++ I + T + GL+ A K+ A E Sbjct: 133 HFERVPIREIMKKDSRYQKDIKSIQTSCATCISQGLKLASEKVM-AGELTA--------- 182 Query: 284 KKYIIFLTDGENSSPNIDNKES--LFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I TDG + P+ ++ + L C++ K + V + + L A+ Sbjct: 183 ---ITIHTDGYANDPSSTSEAATLLKLCDDMKGKDVFVNTLAYGDYTDFRLLSRIANAGS 239 >gi|148234334|ref|NP_001089951.1| integrator complex subunit 6-A [Xenopus laevis] gi|126258172|sp|Q2TAF4|INT6A_XENLA RecName: Full=Integrator complex subunit 6-A; Short=Int6-A gi|83405213|gb|AAI10953.1| MGC132177 protein [Xenopus laevis] Length = 883 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LD S SMN G L +A ++ + ++S + R LVT Sbjct: 4 LLFLLDTSASMNQRSHLGTTYLDIAKGAVETFM-KLRSRDPASRGDRYMLVTVEEPPYG- 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAVGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PSIIIVITDG 131 >gi|109897979|ref|YP_661234.1| TPR repeat-containing protein [Pseudoalteromonas atlantica T6c] gi|109700260|gb|ABG40180.1| TPR repeat [Pseudoalteromonas atlantica T6c] Length = 647 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 30/140 (21%), Positives = 58/140 (41%), Gaps = 23/140 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++VLD+SLSM TR+ + +D++K+I + +GLV ++ Sbjct: 99 VVVLDMSLSMRATDVTP----NRLTRAKYKAIDLVKAIAEGE----TGLVAYAGDAFTIS 150 Query: 232 PLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL+ Q++ I L + ++ GLE A + +A + I Sbjct: 151 PLSSDGQNLTALIPSLSPEIMPVAGSEPFLGLESAIALLRNAGYQQGE-----------I 199 Query: 288 IFLTDGENSSPNIDNKESLF 307 ++TDG ++ + + L Sbjct: 200 FWITDGIENTQVAEVSKLLE 219 >gi|308473978|ref|XP_003099212.1| CRE-CLEC-66 protein [Caenorhabditis remanei] gi|308267685|gb|EFP11638.1| CRE-CLEC-66 protein [Caenorhabditis remanei] Length = 375 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 65/207 (31%), Gaps = 33/207 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S + LD++ V+D S +M + D + + I D + R G +T++ Sbjct: 31 SRLWLDIVFVVDNSKNM--NMYKVFDTISSL---FSPYVQIGTGYDDPRST-RVGFITYN 84 Query: 225 SKI---------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 L+ +Q + L + GL A + + Sbjct: 85 WNATDVADFYKLQSYSDLSNQIQQLSTT--PLSRRDESYIDTGLAAAIRMVNATQGLR-- 140 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ--FL 333 D+YKK ++F T N ++ K GA + + + Q Sbjct: 141 -----DNYKKVVVFFTSQYNYYNTYPEDQAKLL----KSWGATLITVNTGGDDNTQENLH 191 Query: 334 KNCASPDRFY---SVQNSRKLHDAFLR 357 A+ + +++L A L Sbjct: 192 DKIANKGMAFLMSDGNTTQELQRALLA 218 >gi|218705608|ref|YP_002413127.1| hypothetical protein ECUMN_2411 [Escherichia coli UMN026] gi|256022235|ref|ZP_05436100.1| hypothetical protein E4_02577 [Escherichia sp. 4_1_40B] gi|293405548|ref|ZP_06649540.1| yegL protein [Escherichia coli FVEC1412] gi|298381232|ref|ZP_06990831.1| yegL protein [Escherichia coli FVEC1302] gi|218432705|emb|CAR13599.1| conserved hypothetical protein [Escherichia coli UMN026] gi|291427756|gb|EFF00783.1| yegL protein [Escherichia coli FVEC1412] gi|298278674|gb|EFI20188.1| yegL protein [Escherichia coli FVEC1302] Length = 219 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|218690131|ref|YP_002398343.1| hypothetical protein ECED1_2419 [Escherichia coli ED1a] gi|306814804|ref|ZP_07448966.1| von Willebrand factor type A [Escherichia coli NC101] gi|218427695|emb|CAR08602.2| conserved hypothetical protein [Escherichia coli ED1a] gi|305852198|gb|EFM52650.1| von Willebrand factor type A [Escherichia coli NC101] Length = 219 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|156546336|ref|XP_001606541.1| PREDICTED: similar to ENSANGP00000020925 [Nasonia vitripennis] Length = 2053 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 48/124 (38%), Gaps = 5/124 (4%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 DM++++D S SM + K V+T + ++ + ++ + + F K+VQ Sbjct: 1109 DMVILVDNSGSMTG-MSNAIAKTTVSTIMSTLSNNDFVAVFNFSDSTQQVVSCFQDKLVQ 1167 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P ++ I + I + T A+ + + E + I+ Sbjct: 1168 ATPE--NIRRINDDILTMKPEGVANITEAFLAAFTIL--ENYRNESRCGSDLSCNQMIML 1223 Query: 290 LTDG 293 +TDG Sbjct: 1224 VTDG 1227 >gi|187730403|ref|YP_001879807.1| von Willebrand factor type A domain-containing protein [Shigella boydii CDC 3083-94] gi|301021255|ref|ZP_07185286.1| von Willebrand factor type A domain protein [Escherichia coli MS 69-1] gi|187427395|gb|ACD06669.1| von Willebrand factor type A domain protein [Shigella boydii CDC 3083-94] gi|300398181|gb|EFJ81719.1| von Willebrand factor type A domain protein [Escherichia coli MS 69-1] Length = 219 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|307943680|ref|ZP_07659024.1| von Willebrand factor type A domain-containing protein [Roseibium sp. TrichSKD4] gi|307773310|gb|EFO32527.1| von Willebrand factor type A domain-containing protein [Roseibium sp. TrichSKD4] Length = 746 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 66/226 (29%), Gaps = 26/226 (11%) Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 ID++ S + M T L I +++ VLD Sbjct: 290 EIDNRDFVLRYSMAAEKRMATGALTHFDETRGGFLSLHIEPPKLAPEDLVTPRELVFVLD 349 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW- 235 S SM + + + LD ++ ++R T S+ P Sbjct: 350 TSGSMGGQ------PMDASKSFMHAALDGLRENDQFR-ILRFANNT-SAFAKSAMPATRA 401 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 ++ + + L T+ + A+ + ++FLTDG Sbjct: 402 NIKAGKNFVTGLSARGGTEMNNAINAAF------------DLPPVPGTMRIVVFLTDGYI 449 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++E + + A ++A G+ L+ A R Sbjct: 450 GG----DREVIQTVYDRIGN-ARIHAFGIGKAINRYLLEGLAREGR 490 >gi|260793652|ref|XP_002591825.1| hypothetical protein BRAFLDRAFT_125323 [Branchiostoma floridae] gi|229277036|gb|EEN47836.1| hypothetical protein BRAFLDRAFT_125323 [Branchiostoma floridae] Length = 2660 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 39/91 (42%), Gaps = 6/91 (6%) Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 T + L++ + E + + + + KK + +TDG++++ ++E+ Sbjct: 2069 AKGGTATRLALKFLRESVIP--EAVAELDRPDES-KKALFLITDGKSNTGGDPSEEARKL 2125 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 E G +Y IG+ + + L + AS Sbjct: 2126 REEL---GLEIYTIGISNDVSKTELASVASS 2153 >gi|116249090|ref|YP_764931.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253740|emb|CAK12133.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 176 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 54/159 (33%), Gaps = 16/159 (10%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F ++R + KG +I AIL +FI++ +IE S FFV +S L + Sbjct: 7 FASLRRLLGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFV---------NSALDASVH 57 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 +++ N I + +G + N+ S + D Sbjct: 58 KISRMIRTGEVASSNITLADFKAGICNDMLLSFSCSSGLLVKV-NVLSDLSSAASADPID 116 Query: 123 KDYNLSAVSRYEMP------FIFCTFPWCANSSHAPLLI 155 NL+ Y++ + PW A + L Sbjct: 117 DSGNLTVTETYDIGKGSDYILVQTFLPWTAVVNFFSLSS 155 >gi|118388811|ref|XP_001027501.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89309271|gb|EAS07259.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 1543 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 61/168 (36%), Gaps = 23/168 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----S 224 LD+M ++D + SM+ + I ++ IK I N +R + + S Sbjct: 922 LDLMFIMDCTGSMSGW-------IQAVKDEILSIIAAIKDINKGNTSIRISFIGYRDYGS 974 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKS----TPGLEYA-YNKIFDAKEKLEHIAKG 279 + F + + Q +N++ S G ++A + + IA Sbjct: 975 IQRFSIFDFSSEIDSFQNFLNQIQAEGGNDSEEDVAGGFKHANLQQWKSQAKYAVFIADC 1034 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK---RRGAIVYAIGV 324 ++Y +DG + + + + E K ++G +Y I + Sbjct: 1035 PAHGREY----SDGHDDRYPDGDPDGVDLKQEFKNLIKKGVQLYTIQI 1078 >gi|88800837|ref|ZP_01116392.1| hypothetical protein MED297_17512 [Reinekea sp. MED297] gi|88776410|gb|EAR07630.1| hypothetical protein MED297_17512 [Reinekea sp. MED297] Length = 716 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 73/218 (33%), Gaps = 42/218 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++ DVS SM + + ++ ML+ I R+ + TF + Sbjct: 13 DVRVLWDVSQSMAQNDPENYRE-----DALLLMLEAIPQGE------RAAVWTFGQYVNL 61 Query: 230 TFP-----LAWGVQHIQEKINRLIFGST-TKSTPGL-EYAYNKIFDAKEKLEHIAKGHDD 282 P W + +I + +T T L E AY+ + H Sbjct: 62 LVPHDTIDAEW-RALARTRIQQQGAPATRTNLGRALDEAAYDFAYSTYTGPTH------- 113 Query: 283 YKKYIIFLTDGE-NSSPNIDNKES------LFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 ++ +TDG+ + +PN D + A ++ I + +A L+ Sbjct: 114 ----VVLITDGQVDIAPNADVNQVERGRILSQLVPRYNSANARIHTIALSDDADHALLRQ 169 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGKE---MVKQRIL 368 + + ++ L +G E + + R+ Sbjct: 170 LSEQTGGQYLRANQGADLLPLLTSLGNEVAPVSQLRVR 207 >gi|325284294|ref|YP_004256834.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] gi|324316358|gb|ADY27471.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] Length = 594 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 16/123 (13%), Positives = 40/123 (32%), Gaps = 23/123 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + + +D+S SM +L A ++ ++ + R+ L+ F+ ++ Sbjct: 426 LKVAIDISSSMQGP------RLKAARQAALMVIRAAELSQG-----RTDLMAFNDQVYSI 474 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + RL T ++ E ++ I+ + Sbjct: 475 GDYGTSTKEQYRAVARLRAEGGTNLAQAMDMLLLNTAQPGE------------EEVIVVI 522 Query: 291 TDG 293 +DG Sbjct: 523 SDG 525 >gi|294140885|ref|YP_003556863.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella violacea DSS12] gi|293327354|dbj|BAJ02085.1| inter-alpha-trypsin inhibitor domain protein [Shewanella violacea DSS12] Length = 765 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 68/172 (39%), Gaps = 33/172 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL---VTFSSKI 227 + MV+D S SM D + A+ +++ +++ + D N++ G + F+ + Sbjct: 264 IKMVVDCSGSMTG------DSINQASIALQAIVEQLAD-DDWFNIILFGSHHKLIFNKSV 316 Query: 228 VQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 T +Q +++ + L T+ L+ AY +K + Sbjct: 317 QATP---NNLQRVEKTLQNLRADFGGTEMDSALQAAY------------SSKTPKNIPTD 361 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 I+ +TDG+ D L +K R + +GV + ++ FL AS Sbjct: 362 ILLITDGQ----IWDQDYLLTNAQASKHRH---FVVGVGSAVSEAFLSKLAS 406 >gi|255602535|ref|XP_002537872.1| conserved hypothetical protein [Ricinus communis] gi|223514758|gb|EEF24510.1| conserved hypothetical protein [Ricinus communis] Length = 120 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 25/47 (53%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYI 51 +R G+ +I+ A++LP++ ++G I+ S + K +L + Sbjct: 15 RLRPGRRAESGAFAIMAALVLPIMIAMLGFAIDLSRVYNRKVELQSV 61 >gi|14248689|gb|AAK57630.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTIDLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|332197210|gb|AEE35331.1| uncharacterized protein [Arabidopsis thaliana] Length = 756 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 66/229 (28%), Gaps = 45/229 (19%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + PW ++ L ++ V+D+S SM L Sbjct: 293 LVKSPSPWDSDDRGIFCLYLFPGTTKHTKLFKRRVVFVIDISASMKWK------PLEDVK 346 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINR-----LIFG 250 +++ E L +++ N ++ F+ +I + + + + LI Sbjct: 347 KALLECLAKLQAEDVFN------IIAFNDEILEFSTSMEFATDETISAVTEWLDSNLIAN 400 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L+ A + +G + + +TDG + C+ Sbjct: 401 GGTNMLLPLKQAMKLL-----------EGSNIGVPLVYLVTDG-------SVENEREICH 442 Query: 311 EAKRR--------GAIVYAIGVQAEAADQFLKNCASPDR-FYSVQNSRK 350 K + G+ + FL+ A +Y N+ Sbjct: 443 AMKESCSRNGKSISPRISTFGIGSFCNHYFLQMLARIGNGYYDGTNNTD 491 >gi|33239771|ref|NP_874713.1| protoporphyrin IX magnesium chelatase subunit ChlD [Prochlorococcus marinus subsp. marinus str. CCMP1375] gi|33237297|gb|AAP99365.1| Protoporphyrin IX Mg-chelatase subunit ChlD [Prochlorococcus marinus subsp. marinus str. CCMP1375] Length = 707 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 69/221 (31%), Gaps = 41/221 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G ++ ++D S SM ++++ A ++ +L D L+ F Sbjct: 505 QKKAGSLVIFLVDASGSMA------LNRMQSAKGALIRLLTEAYESRDE-----VSLIPF 553 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + + ++ + G L + + A + D Sbjct: 554 RGEQAEVLLPPTRSITAAKRRLETMPCGGG----SPLAHG---LTQAARVGANALSTGDL 606 Query: 283 YKKYIIFLTDGENS--------SPNIDNKES-------LFYCNEAKRRGAIVYAI----G 323 + ++ +TDG + P +D +S L + G + I Sbjct: 607 GQVVVVAITDGRGNIPLGKSLGQPELDGDDSVDLKQEVLDIATRYRTLGIKLLVIDTERK 666 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A + L + A+ ++ + + A I + + Sbjct: 667 FIASGIGKDLAD-AAGGKYVQLPKATD--QALASIAMDAIN 704 >gi|15218501|ref|NP_177394.1| inter-alpha-trypsin inhibitor heavy chain-related [Arabidopsis thaliana] gi|12325279|gb|AAG52586.1|AC016529_17 hypothetical protein; 14673-17893 [Arabidopsis thaliana] Length = 758 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 66/229 (28%), Gaps = 45/229 (19%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + PW ++ L ++ V+D+S SM L Sbjct: 295 LVKSPSPWDSDDRGIFCLYLFPGTTKHTKLFKRRVVFVIDISASMKWK------PLEDVK 348 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINR-----LIFG 250 +++ E L +++ N ++ F+ +I + + + + LI Sbjct: 349 KALLECLAKLQAEDVFN------IIAFNDEILEFSTSMEFATDETISAVTEWLDSNLIAN 402 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T L+ A + +G + + +TDG + C+ Sbjct: 403 GGTNMLLPLKQAMKLL-----------EGSNIGVPLVYLVTDG-------SVENEREICH 444 Query: 311 EAKRR--------GAIVYAIGVQAEAADQFLKNCASPDR-FYSVQNSRK 350 K + G+ + FL+ A +Y N+ Sbjct: 445 AMKESCSRNGKSISPRISTFGIGSFCNHYFLQMLARIGNGYYDGTNNTD 493 >gi|296506525|ref|YP_003667759.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] gi|296327112|gb|ADH10039.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] Length = 452 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 70/204 (34%), Gaps = 22/204 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 L++ ++LD S SM + K+ A ++I LD I +V V + + Sbjct: 148 KAKSLNVEILLDASGSMAGKVNGEV-KMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNN 206 Query: 225 SKIVQTFPLAWGVQHI--------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L+ G + +E+ N + K L A I D ++ Sbjct: 207 EN---DKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESINDDFKEYTGE 263 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKN 335 + YI+ +DGE + + + + IG + Q LKN Sbjct: 264 ENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSSTHAVVNI--IGFDVKNSEQQQLKN 316 Query: 336 CASP--DRFYSVQNSRKLHDAFLR 357 A + +V ++ +LH + Sbjct: 317 TAEAGKGNYATVSSADELHQTLNK 340 >gi|71904378|ref|YP_281181.1| serum opacity factor [Streptococcus pyogenes MGAS6180] gi|71803473|gb|AAX72826.1| serum opacity factor [Streptococcus pyogenes MGAS6180] Length = 1026 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 63/145 (43%), Gaps = 7/145 (4%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V + K D G D+M +LDVS M ++F +++ ++ K + Sbjct: 213 TIDVTVTVKPKQIDEGADVMALLDVSQKMTKENFDKAKEQIKKMVTTLTGEPTDGKENHN 272 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 N VR L+TF K+ + L +++ + ++ + + G++ I A+E Sbjct: 273 RRNSVR--LMTFYRKVNEPIELT--AENVDKTLDEVWKKAKEDWDWGVDLQ-GAIHKARE 327 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENS 296 + +++I+ + GE++ Sbjct: 328 IFNKEKEKKSGKRQHIVLFSQGEST 352 >gi|297261544|ref|XP_001117901.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Macaca mulatta] Length = 1119 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 50/133 (37%), Gaps = 26/133 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM ++ +A +I +LD + VN ++ ++ + Sbjct: 230 DIVILVDVSGSMKGL------RMTIAKHTITTILDTLGENDFVN------IIAYNDYVHY 277 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L+ L A+ + +E AK Sbjct: 278 IEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQILKQFQE-----AKQG 332 Query: 281 DDYKKYIIFLTDG 293 + ++ ++DG Sbjct: 333 SLCNQAVMLISDG 345 >gi|212635917|ref|YP_002312442.1| TPR domain-containing protein [Shewanella piezotolerans WP3] gi|212557401|gb|ACJ29855.1| TPR repeat protein [Shewanella piezotolerans WP3] Length = 639 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 29/209 (13%), Positives = 60/209 (28%), Gaps = 37/209 (17%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + WC + + ++V+D+SLSM + ++L + Sbjct: 56 IAYLAMAWCIAVVALSGPAIEKTALPV-YESAQGRVIVMDMSLSMYANDLSP-NRLTKSK 113 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG----VQHIQEKINRLIFGST 252 ++L I +GLV ++ PL + + ++ Sbjct: 114 YRATDLLGSIAEGE-------TGLVAYAGDAFTISPLTRDNATLLNLLPTLTPSIMPTKG 166 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 + LE+A + + + II TDG I + + Sbjct: 167 SNIEAALEHAKSLLSQSGHISGD-----------IILFTDG------ISPSQLSAAQSVL 209 Query: 313 KRRGAIVYAIGVQAEAA-------DQFLK 334 K + + +E Q L+ Sbjct: 210 KGSAYRLGVLAFGSEQGAPIKLPDGQLLR 238 >gi|241113142|ref|YP_002972977.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861350|gb|ACS59016.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 176 Score = 43.3 bits (100), Expect = 0.055, Method: Composition-based stats. Identities = 33/159 (20%), Positives = 57/159 (35%), Gaps = 16/159 (10%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F ++R F + KG +I AIL +FI++ +IE S FFV LD S+ + Sbjct: 7 FASLRRLFGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFVN----SALDASVHKISRM 62 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I E ++ + I + +G + + +S + D Sbjct: 63 IRTGEVASSKITLAD-----FKARICNDMLLSFSCSSGLLVKVIVLSDLSSAA-STDPID 116 Query: 123 KDYNLSAVSRYEMP------FIFCTFPWCANSSHAPLLI 155 NL+ Y++ + PW A + L Sbjct: 117 DSGNLTVTETYDIGKGSDYILVQTFLPWTAVVNFFSLSS 155 >gi|225010239|ref|ZP_03700711.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C] gi|225005718|gb|EEG43668.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C] Length = 289 Score = 43.3 bits (100), Expect = 0.055, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 42/111 (37%), Gaps = 10/111 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +MMV+DVS S FG + + + + + N + G++ F Sbjct: 72 EEERELTLMMVVDVSGS--GLFGTT----TAFKKDLLIEITATLAFSALQNNDKVGVLLF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAK 270 + +I P G HI I L+ T + L+Y N + Sbjct: 126 TDQIELFIPPKKGKSHILRIIRELLEFKPKSQKTDLSFALKYLGNVLKKKA 176 >gi|307109997|gb|EFN58234.1| hypothetical protein CHLNCDRAFT_50640 [Chlorella variabilis] Length = 1183 Score = 43.3 bits (100), Expect = 0.056, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 68/209 (32%), Gaps = 29/209 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 + + +VLD S SM L A R++ + V LV F++ Sbjct: 134 AVRLYLVLDNSGSMAGS------PLDTAKRAVIRFAQAAPAAAGVMEA--CWLVEFNTST 185 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE----KLEHIAKGHDDY 283 L G I+E + + G +T L + + A Sbjct: 186 RAHSLLGLGGAQIEEAVGTIPAGGSTCFHRALSTLADLAAPSAPLGATAAAAAAAAVAAA 245 Query: 284 KKYIIFLTDGENSSPN-----IDNKESLFYCNEAKRRGAI--------VYAIGVQAEAAD 330 + +++F +DG +S +++ ++ G V+ +G + Sbjct: 246 RHFVVFFSDGCDSGYGGVEAALESLQARLGGGSGGGSGGGGGTGAEWCVHTLGFGRDHDA 305 Query: 331 QFLKNC----ASPDRFYSVQNSRKLHDAF 355 FL + ++P F +Q+ + F Sbjct: 306 AFLSSLTLAGSAPGSFQYIQSMAEQMGPF 334 >gi|198284579|ref|YP_002220900.1| von Willebrand factor type A [Acidithiobacillus ferrooxidans ATCC 53993] gi|218665977|ref|YP_002427250.1| von Willebrand factor type A domain protein [Acidithiobacillus ferrooxidans ATCC 23270] gi|198249100|gb|ACH84693.1| von Willebrand factor type A [Acidithiobacillus ferrooxidans ATCC 53993] gi|218518190|gb|ACK78776.1| von Willebrand factor type A domain protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 225 Score = 43.3 bits (100), Expect = 0.056, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 58/172 (33%), Gaps = 16/172 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++LDVS SM G +D+L R ++E + +VTF Sbjct: 17 NPQARCPCILLLDVSASM---LGAPIDQLNEGLRHLQEEIQSDSLAAKRVEF---AIVTF 70 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + ++ T +E A + +E+ + + Sbjct: 71 -GPVNTEMVFTSATNFFPPHLSTQ---GNTPMGEAIETALRML---RERKDRYRANGVSF 123 Query: 284 KKYIIFL-TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + +FL TDG + + + + +K + YAIGV+ + + Sbjct: 124 YRPWVFLITDGAPTDDWRNAAQQIREGETSKA--FMFYAIGVEGANLETLAQ 173 >gi|115928324|ref|XP_781157.2| PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4 [Strongylocentrotus purpuratus] gi|115975608|ref|XP_001193639.1| PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4 [Strongylocentrotus purpuratus] Length = 1119 Score = 43.3 bits (100), Expect = 0.056, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 69/185 (37%), Gaps = 26/185 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 I S+ +++++LD S SM P D + I++S+P+ + R Sbjct: 740 TIQSELIADPEVVLLLDCSTSMKGE--PKQDAKKICKM-------ILQSLPEKS---RFN 787 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++TF + + FP ++ + L F +S G A+ + Sbjct: 788 VITFGTDFTELFP-TVEPVGQRQLLEALEFIEGARSVGGSSEAWRPL------RSLSLLP 840 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 + + ++ ++DG + + K +L ++ K ++ V + L+ A Sbjct: 841 MMNSARNVLLVSDGHLT----NEKLTLEIASKYKHVN-RIFTCAVSSAGNRHILRALADV 895 Query: 338 SPDRF 342 S F Sbjct: 896 SGGAF 900 >gi|158314291|ref|YP_001506799.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158109696|gb|ABW11893.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 432 Score = 43.3 bits (100), Expect = 0.056, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 66/203 (32%), Gaps = 28/203 (13%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK--SIPDVNNVVRSGLVTFSSKIVQ 229 ++V+D S SM+ K+ A R+ +D ++ ++ V + V + Sbjct: 45 VIVIDCSGSMS----TPATKIYAARRAACAAVDGLRPGTLFAVIRGTGTAQVVYPPGGGL 100 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + INR+ T L A + GH D ++ I Sbjct: 101 ARAAVDTRDQARRTINRMSAAGGTAIGSWLLAARDLFA-----------GHPDAVRHAIL 149 Query: 290 LTDGENSSPN-IDNKESLFYCN---EAKRRGA-IVYAIGVQAEAADQFLKNCASPDRFYS 344 LTDG N + D ++L C + RG + +D L Sbjct: 150 LTDGRNEHESAADFTDALAACRGHFQCDSRGVGRGWVAAELTAVSDTLL------GTARD 203 Query: 345 VQNSRKLHDAFLRIGKEMVKQRI 367 + + L F + K + + + Sbjct: 204 IADPADLVADFQAMTKAAMARAL 226 >gi|291296803|ref|YP_003508201.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290471762|gb|ADD29181.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 406 Score = 43.3 bits (100), Expect = 0.056, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 81/261 (31%), Gaps = 49/261 (18%) Query: 90 TDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM--PFIFCTFPWCAN 147 R L G ++ R + + + K Y + N Sbjct: 140 KSLRTLLGSLGKNAPGAHLTRHFAPGVESSGETKPYEFGDQPNINIGETLKQVVMKGLEN 199 Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 L I S+ ++ +++LD S SM G D+ A R + +I+ Sbjct: 200 IEERDLTIE-----LSEYTAAMNTVVLLDCSHSM---ILYGEDRFTPAKRVALGLAHLIR 251 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + + VR G+ F + PL G + + T + GL+ A + Sbjct: 252 TQYPGDQ-VRFGV--FHDSAEE-VPL--GRLPTVQ-VGPYH----TNTAEGLKLARKMLR 300 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGEN------------SSPNIDN---KESLFYCNEA 312 ++ K II +TDG+ ++ +D E+L A Sbjct: 301 KMSGEM-----------KQIIMITDGKPSALTLPSGQIYKNAWGLDPVILAETLKEATLA 349 Query: 313 KRRGAIVYAIGVQAEAADQFL 333 ++ G ++ + L Sbjct: 350 RKEGIPIHT--FMLAREPELL 368 >gi|260463531|ref|ZP_05811730.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] gi|259030622|gb|EEW31899.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] Length = 552 Score = 43.3 bits (100), Expect = 0.057, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 32/195 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK---- 226 ++++LD S SM G KL +A S+R +L + + + G + + + Sbjct: 27 VLIILDASGSM-WAQIDGKPKLEIARESLRTVLQSVPADDE------IGFMAYGHREKGS 79 Query: 227 ---IVQTFPLAWGV-QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 I P G I + + L F T T ++ A + + + Sbjct: 80 CDDIQLIVPPQAGSASAITDAADSLKFLGKTPLTAAVKQAAEALK------------YTE 127 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA---SP 339 K ++ +TDG + A V G+ A+ Q C + Sbjct: 128 DKATVVLITDGLETCGGDPCALGKELEASGVDFTADVVGFGLTADEGKQI--ACLADNTG 185 Query: 340 DRFYSVQNSRKLHDA 354 ++ + + L +A Sbjct: 186 GKYIQASDEKALQEA 200 >gi|3236372|gb|AAC23668.1| type VI collagen alpha 3 subunit N7 domain [Mus musculus] Length = 200 Score = 43.3 bits (100), Expect = 0.057, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 63/184 (34%), Gaps = 19/184 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S ++ + P + +++ S+ ++ +R GLV FS V Sbjct: 7 DIIFLLDGSDNVGKNNFPYVRDFVT---------NLVNSLDVGSDNIRVGLVQFSDTPVT 57 Query: 230 TFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L + + RL + G +Y E + H + + Sbjct: 58 EFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSYIHANHFTEAGGSRTRAH-VPQLLL 116 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 + + + P+ D L + R G + +G + +P Y + + Sbjct: 117 LLM-----AGPSEDAY--LQAADAMVRSGVLTLCVGTTRADKAESEHIAFNPSLVYVMDD 169 Query: 348 SRKL 351 R L Sbjct: 170 FRSL 173 >gi|312073180|ref|XP_003139404.1| hypothetical protein LOAG_03819 [Loa loa] gi|307765434|gb|EFO24668.1| hypothetical protein LOAG_03819 [Loa loa] Length = 444 Score = 43.3 bits (100), Expect = 0.057, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 68/201 (33%), Gaps = 32/201 (15%) Query: 129 AVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG 188 + + F T P + + + +K K LD++ ++D S ++D Sbjct: 227 VDTSSKRTVNFSTLPTVSTITTRNI----PMKFDIKPGCLLDVVFLMDFSGGVSDKRDVY 282 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPL---AWGVQHIQEK 243 +D + + RS+ + +S + F L + + I++ Sbjct: 283 IDFVSILIRSL----------DLNRTSAHVAAIYYSGPKRARTLFHLRKHSRTEEAIKDL 332 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 G TT++ + YA N+ + + KK II TDG + + Sbjct: 333 HQAPSNGGTTRTGEAIYYAINEFSEKFGARKGA-------KKMIIIFTDGYSQDNPAEAS 385 Query: 304 ESLFYCNEAKRRGAIVYAIGV 324 + +G + A+ V Sbjct: 386 RAAHI------KGIELKAVSV 400 >gi|251779520|ref|ZP_04822440.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] gi|243083835|gb|EES49725.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 815 Score = 43.3 bits (100), Expect = 0.057, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 40/140 (28%), Gaps = 35/140 (25%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVN----NVVRSGLV---TFSSKIVQTFP------ 232 K+ ++ +D +K +PD+ N + ++ Sbjct: 158 YSKSTKMEELKKAANNFIDKMKDVPDLKICIVNYSSEATINPCGYNGDKNSASVEEDRHH 217 Query: 233 ------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 L + IN L T + GL A + + Sbjct: 218 TIPNYKSLGTKFLNSNDNTLHSMINGLKALGGTNTGEGLRKAEYMLEQGDKDA------- 270 Query: 281 DDYKKYIIFLTDGENSSPNI 300 KK I+F++DG + ++ Sbjct: 271 ---KKTIVFMSDGLPTYYSV 287 >gi|319794497|ref|YP_004156137.1| von willebrand factor type a [Variovorax paradoxus EPS] gi|315596960|gb|ADU38026.1| von Willebrand factor type A [Variovorax paradoxus EPS] Length = 177 Score = 43.3 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 57/166 (34%), Gaps = 22/166 (13%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 AN L + ++ G +LD S SM + R+ +L Sbjct: 1 MANRGADTLRAEHLRRRPLQARSGALHCFLLDCSASMRNDGN--------LARAKGLLLS 52 Query: 205 IIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 +++ + V L+ F+ ++ P + I + G T G++ A Sbjct: 53 LMEEAYQRRDHV--ALLCFAGEVVELRLPPRRASAWNDDWIAPIAAGGGTPLALGVQRAD 110 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 + + + ++++ LTDG ++ + + + F C Sbjct: 111 QLLAHSAAR-----------QRWLWLLTDGRSNESPMRPEAADFAC 145 >gi|323345323|ref|ZP_08085546.1| von Willebrand factor [Prevotella oralis ATCC 33269] gi|323093437|gb|EFZ36015.1| von Willebrand factor [Prevotella oralis ATCC 33269] Length = 290 Score = 43.3 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 41/108 (37%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLDF------GTRRQFKRDMATEIAATIAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIF 267 S +I + P G +HI I + T +EY + Sbjct: 126 SDRIEKYIPPKKGRKHILYIIREMLDFHPQSKRTDIGAAIEYLTRVMK 173 >gi|288940759|ref|YP_003442999.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288896131|gb|ADC61967.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 213 Score = 43.3 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 28/130 (21%), Positives = 48/130 (36%), Gaps = 9/130 (6%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM G + + V ++ ML ++ P V ++TF + Sbjct: 5 RLPVYILLDTSGSMR---GEPIHSVNV---GLQAMLSALRQDPYALESVHLSIITFDVEA 58 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL Q +I + T LE + K +KG ++ Sbjct: 59 REYLPLTPLDQVQLGEIQ-VPSSGATFLGAALELLIRHVDRDVRKSTGESKGDWRPLLFV 117 Query: 288 IFLTDGENSS 297 +TDG S Sbjct: 118 --MTDGSPSD 125 >gi|332027661|gb|EGI67729.1| Voltage-dependent calcium channel subunit alpha-2/delta-3 [Acromyrmex echinatior] Length = 1252 Score = 43.3 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 51/131 (38%), Gaps = 11/131 (8%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML---DIIKSIPDVNNVVRSGLVTFSSK 226 DM++++DVS SM G G SI + L D + + N + F Sbjct: 286 DMVILMDVSGSMT---GFGKTIAKTTVNSILDTLSNNDFVTLLKYSNETTEL-VPCFKDM 341 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 ++Q P + ++ ++++ + T A++ + +E+ A + Sbjct: 342 LIQATPE--NLDTFKKSMDKIDTDNVANLTEAFTKAFSLLKTYREERGCDA--DSPCNQL 397 Query: 287 IIFLTDGENSS 297 I+ +TDG Sbjct: 398 IMLVTDGVPGG 408 >gi|324503178|gb|ADY41386.1| C-type lectin domain-containing protein 160 [Ascaris suum] Length = 358 Score = 43.3 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 57/177 (32%), Gaps = 26/177 (14%) Query: 191 KLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 L VA L + + S +R LV F+ V L + + L Sbjct: 25 ALKVAADLATLFLPMNVSSESAQGQFIRVALVAFADNAVIVGDL-NKYHNYASLVEGLFT 83 Query: 250 ----GSTT-KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 G T GL+ A + E H AK I+ + ++ D Sbjct: 84 IDYHGGKTLNIEAGLKAASTVL----ESSRHYAKT------VILLYSSAYSAGGFADPN- 132 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKN---CASPDRFYSVQNSR---KLHDAF 355 N+ K G + I + + +K +SP+ + ++ K+ +AF Sbjct: 133 --AIANQIKESGTKIITIAFRQQPEGTLVKKLGHLSSPNFAFGSMDTSIIAKITNAF 187 >gi|313835899|gb|EFS73613.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927103|gb|EFS90934.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314970746|gb|EFT14844.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] gi|328906009|gb|EGG25784.1| von Willebrand factor type A domain-containing protein [Propionibacterium sp. P08] Length = 323 Score = 43.3 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+M+ LDVS SM G+D+ + I L + I D R G V F S V Sbjct: 95 DVMLCLDVSGSME-----GVDR-----QVINTYLQLADHISD----DRIGFVMFDSSAVT 140 Query: 230 TFPLAWGVQHIQEKINRLIFG---STTKSTPGLEY---AYNKIFDAKEKLEHIAKGHDDY 283 FPL ++ + G PG+ Y L + Sbjct: 141 VFPLTHDRDSVKAGLK--QAGERLGRAGLDPGVRYGPGGSLVGDGLASCLSRFDQLDQPR 198 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 + ++ TD N + A +R +V+ I +D A + Sbjct: 199 SRSVVLATD--NMVAGPSVYTVPQAVDLAVKRHIMVFGI---VPDSDDPDYRAARDELHQ 253 Query: 344 SVQNSR 349 V + Sbjct: 254 QVVRTH 259 >gi|153822291|ref|ZP_01974958.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229509084|ref|ZP_04398571.1| hypothetical protein VCE_000486 [Vibrio cholerae B33] gi|229608756|ref|YP_002879404.1| hypothetical protein VCD_003678 [Vibrio cholerae MJ-1236] gi|255743768|ref|ZP_05417726.1| hypothetical protein VCH_000062 [Vibrio cholera CIRS 101] gi|126520187|gb|EAZ77410.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229353841|gb|EEO18776.1| hypothetical protein VCE_000486 [Vibrio cholerae B33] gi|229371411|gb|ACQ61834.1| hypothetical protein VCD_003678 [Vibrio cholerae MJ-1236] gi|255738629|gb|EET94016.1| hypothetical protein VCH_000062 [Vibrio cholera CIRS 101] gi|259156250|gb|ACV96197.1| von Willebrand factor, type A [Vibrio cholerae Ban5] gi|259156425|gb|ACV96370.1| von Willebrand factor, type A [Vibrio cholerae Ind5] Length = 551 Score = 43.3 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 65/168 (38%), Gaps = 26/168 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDK-LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + +++D+S SM G G K VA + ++ ++ IP V V++ I Sbjct: 378 AVHLLVDISGSMGKPIGEGNRKYFHVANEAALALVMALEGIPGV-----VPAVSYFPGIH 432 Query: 229 QTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q +A V+H + T + +A N + K+K Sbjct: 433 QEVSVALLPKQSVRHRAAYFD-QKPRGCTPMAQAMWFAANSLLAQKQKR----------- 480 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K +I LTDG+ D + + +R G + IG+Q + ++F Sbjct: 481 KLMIVLTDGDPD----DWAATHDIVDRCRRSGFELLGIGIQTRSVERF 524 >gi|114557511|ref|XP_001142936.1| PREDICTED: calcium-activated chloride channel regulator 2 isoform 1 [Pan troglodytes] Length = 943 Score = 43.3 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 77/214 (35%), Gaps = 43/214 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQATEFYLMQI---VEIHTFV--GIASFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + + + T + GL+ + + Y Sbjct: 362 AQLHQINSNDDRKLLVSYL-PTTVSAKTDISICSGLKKGFEVV---------EKLNGKAY 411 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPD 340 +I +T G++ + L C G+ +++I + + AA L Sbjct: 412 GSVMILVTSGDD--------KLLGNCLPTVLSSGSTIHSIALGSSAAPNLEELSRLTGGL 463 Query: 341 RFY--SVQNSRKLHDAFLRIGK---EMVKQRILY 369 +F+ + NS + DAF RI ++ +Q I Sbjct: 464 KFFVPDISNSNSMIDAFSRISSGTGDIFQQHIQL 497 >gi|194335383|ref|YP_002017177.1| protein of unknown function DUF58 [Pelodictyon phaeoclathratiforme BU-1] gi|194307860|gb|ACF42560.1| protein of unknown function DUF58 [Pelodictyon phaeoclathratiforme BU-1] Length = 302 Score = 43.3 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 46/123 (37%), Gaps = 17/123 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++V+D S SM + + + + + + + N + GL+ F+ ++ Sbjct: 89 SLLLVVDASASMLFGSREH------SKKEVALEVSAVLAFSALQNNDKVGLLVFTDRVET 142 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P G H+ + LI S + A + + +++ ++ I Sbjct: 143 YIPPRKGRHHVLVILEELIRMKPGNSATNINAALSFVRYTRQR-----------QEIIFL 191 Query: 290 LTD 292 LTD Sbjct: 192 LTD 194 >gi|332877591|ref|ZP_08445335.1| hypothetical protein HMPREF9074_01069 [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684440|gb|EGJ57293.1| hypothetical protein HMPREF9074_01069 [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 287 Score = 43.3 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 58/178 (32%), Gaps = 34/178 (19%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L +M+++DVS S + FG I + + Sbjct: 58 NVTARYNEPFVKVFEEERELTLMLMVDVSGS--ELFGTQ----QQFKSEIITEIAATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLEYAYNK 265 + N ++GL+ FS +I P G H+ I LI T ++ Sbjct: 112 SALQNNDKTGLILFSDQIELFIPPKKGKSHVLRIIRELIEFEPKSFKTNIGEAFQFLSRV 171 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR---RGAIVY 320 + KK I+F+ + + ++ + AK+ G +Y Sbjct: 172 MK----------------KKAIVFM---LSDFIDKGYEKPIQIA--AKKHDITGIRIY 208 >gi|300727147|ref|ZP_07060566.1| von Willebrand factor type A domain protein [Prevotella bryantii B14] gi|299775691|gb|EFI72282.1| von Willebrand factor type A domain protein [Prevotella bryantii B14] Length = 289 Score = 43.3 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 40/107 (37%), Gaps = 10/107 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+ + + + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLEF------GTSRQMKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKI 266 S +I + P G +HI I + T +EY + Sbjct: 126 SDRIEKYIPPQKGKKHILYLIREMLDFHPQSLRTDIGAAMEYLTRVM 172 >gi|297158891|gb|ADI08603.1| toxic cation resistance protein [Streptomyces bingchenggensis BCW-1] Length = 743 Score = 43.3 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 30/150 (20%), Positives = 52/150 (34%), Gaps = 21/150 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +VLD S SM ++ G + L ++ LD ++P +V F Sbjct: 537 AGRRAAVYLVLDRSGSMRSYYKDGTVQHLAEQALALSANLDDDGTVP---------VVFF 587 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S+ I T + + + +I L +A + D H Sbjct: 588 STDIDGTADV--DLANYSGRIEELHSSLGHMGRTNYHHAIKAVID------HYQASGATD 639 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++IF TDG +S K C A+ Sbjct: 640 PAFVIFQTDGAPTSKAAAEK---ALCEAAR 666 >gi|111223304|ref|YP_714098.1| hypothetical protein FRAAL3897 [Frankia alni ACN14a] gi|111150836|emb|CAJ62540.1| conserved hypothetical protein [Frankia alni ACN14a] Length = 442 Score = 43.3 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 69/233 (29%), Gaps = 31/233 (13%) Query: 151 APLLITSSVKISSKSDI-GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-S 208 P+ + D G +++LD S SM ++ A +I + D + + Sbjct: 43 RPVPGPGETHGETHGDAPGAAEVIILDCSGSMEYPQSKIIEARRAAQAAIDALPDGVAFA 102 Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + + R V + + A + RL T L ++ Sbjct: 103 VVEGTEQAR---VVYPERRELVTASAETRAAAGRAVARLRPHGGTAMGRWLRL-TAQLMA 158 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-----KRRGA-----I 318 ++ H A I LTDG+N + +L C A + GA Sbjct: 159 SRPDAIHHA----------ILLTDGQNGESGRALEAALAACQGAFQCDCRGVGADWRVDE 208 Query: 319 VYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 + I + L+ A + ++ L + R+ K Sbjct: 209 LRQISTRLLGTVSLLREPAEMADDFRALVAKALARGVADVA-----LRVWTPK 256 >gi|326925142|ref|XP_003208779.1| PREDICTED: epithelial chloride channel protein-like, partial [Meleagris gallopavo] Length = 395 Score = 43.3 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 49/149 (32%), Gaps = 16/149 (10%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 ++ + S +E+ F + + ++ ++ D + +VLDVS Sbjct: 259 PNMQNKMCNYRSTWEIIMESDDFRNSSVVNGLAPPFETTFQLMQTQDRAVS--LVLDVSG 316 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGV 237 SM + L A + + S R G+VTF S + PL V Sbjct: 317 SM--LSYNRITNLRTAAEVFLIQIIEVGS--------RVGIVTFESSAYERSPLVQITSV 366 Query: 238 QHIQEKINRLI--FGSTTKSTPGLEYAYN 264 Q + L G T G+E Sbjct: 367 ATRQRLVQNLPTTAGGGTNICSGIEKGLQ 395 >gi|317486519|ref|ZP_07945343.1| von Willebrand factor type A domain-containing protein [Bilophila wadsworthia 3_1_6] gi|316922248|gb|EFV43510.1| von Willebrand factor type A domain-containing protein [Bilophila wadsworthia 3_1_6] Length = 523 Score = 43.3 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 60/168 (35%), Gaps = 30/168 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK--- 226 + ++LD S SM ++ +AT S I S + + F+ Sbjct: 349 SLHILLDASASMYGK------RMELATAS----CHAIASACSGIRGLNITITAFNGNHRG 398 Query: 227 -IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 +PL Q + +IN L+ T P L + ++ +E+ K Sbjct: 399 DACSVYPLLKSGQPVHARIN-LMPSGGTPLAPALWWVMQQLLFTREQR-----------K 446 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++ LTDG+ N + A + G VY +G+ + FL Sbjct: 447 MLLVLTDGQPHDMNA----TQKAIETASKIGLEVYGLGMLDRSIGDFL 490 >gi|255316498|ref|ZP_05358081.1| hypothetical protein CdifQCD-7_19217 [Clostridium difficile QCD-76w55] Length = 256 Score = 43.3 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 39/166 (23%), Positives = 67/166 (40%), Gaps = 28/166 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ D+++VLD SLS +D F G + I E L I D + ++ L F+ Sbjct: 56 KNMKFDVVLVLDCSLSTSDLFENG------TIQDIFERLLPISLSFDNDGMLDVWL--FN 107 Query: 225 SKIVQTFPLAWGVQHIQEKIN-----RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + Q + + ++ + + TK P ++ + + E E + Sbjct: 108 EEAYQLTSI--DMNNLFNYVKNEKLFKKYVRGGTKYAPVIKEIVKEKTELNEAKEPV--- 162 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 YIIF+TDG+NS D KE+ EA + IG+ Sbjct: 163 ------YIIFITDGDNS----DKKEAELVIREASNKPIFFQFIGIG 198 >gi|298480894|ref|ZP_06999089.1| conserved hypothetical protein [Bacteroides sp. D22] gi|298272917|gb|EFI14483.1| conserved hypothetical protein [Bacteroides sp. D22] Length = 475 Score = 43.3 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 68/199 (34%), Gaps = 36/199 (18%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 ++S +++ + LD S SM +S + + + V Sbjct: 300 EVSEEAESPFIVC--LDTSGSMAGER-------ERIAKSTLLAIAELTEVQHRKCYV--- 347 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + FS I + ++ L F T P + +A KI + Sbjct: 348 -ILFSDDIE-CIEITDLGSSFDRLVDFLCQSFHGGTDMEPVITHALRKISEEGYMEAD-- 403 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNC 336 II ++D E + ++ AK + +YAI + A +LK C Sbjct: 404 ---------IITVSDFEMRPVDQLLSRTI---EHAKAKQTKMYAISLGGKSAETSYLKLC 451 Query: 337 ASPDRF--YSVQNSRKLHD 353 D++ YS+QN+ L+ Sbjct: 452 ---DKYWEYSIQNAESLNK 467 >gi|146186309|ref|XP_001033348.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|146143017|gb|EAR85685.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 1149 Score = 43.3 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 45/140 (32%), Gaps = 21/140 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS---- 225 D++ ++D + SM+ + + + +++ +K + ++ G V F Sbjct: 148 DLLFIMDCTGSMS-------SYINMCRTQLFLIVEQVKK-QFQKSTLKIGFVGFRDFGDN 199 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + + ++ IN + + + A+ K + E K Sbjct: 200 NQFEIYQFTTEYDKLKSFINSVSATGGNDTAEDVAGAFEKALEMNWTCE---------AK 250 Query: 286 YIIFLTDGENSSPNIDNKES 305 Y I +TD S Sbjct: 251 YAILITDAYPHGNKYCKSNS 270 >gi|47218486|emb|CAF97220.1| unnamed protein product [Tetraodon nigroviridis] Length = 900 Score = 43.3 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 29/133 (21%), Positives = 53/133 (39%), Gaps = 18/133 (13%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN-----NVVRSGLVTFS 224 D+++V+DVS SM +L +A +I+ +LD + VN + VR F Sbjct: 266 DLVIVVDVSGSMKGL------RLTIAKHTIKTILDTLGENDFVNIIAYSDYVRYVEPCFK 319 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 +VQ L +H + I L K ++ ++ + +A + Sbjct: 320 GTLVQA-DLD-NREHFKLLIEELHVKGEGKVKKAMKESFKILNEAAALGQ-----GSLCN 372 Query: 285 KYIIFLTDGENSS 297 + I+ +TDG Sbjct: 373 QAIMLITDGAMED 385 >gi|54302734|ref|YP_132727.1| dinitrification protein NorD [Photobacterium profundum SS9] gi|46916158|emb|CAG22927.1| hypothetical dinitrification protein NorD [Photobacterium profundum SS9] Length = 613 Score = 43.3 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 77/208 (37%), Gaps = 45/208 (21%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD----------VN 213 + +++ D+S+S + + + V S+ + + ++ D Sbjct: 415 NCQRDISTLLLSDLSMSTDAYINNEYRVIDVIKDSMLLFSEALAAVGDPFAVYGFSSVKR 474 Query: 214 NVVRSGLV-----TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + VR L+ +++ HI+ +I L G T+ + A N + Sbjct: 475 HHVRFTLLKNFAESYND-------------HIRGRILSLRPGFYTRMGAAIRQASNIL-- 519 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDG-----ENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 E+ +H K ++ LTDG +N +++ A+R G + + I Sbjct: 520 -AEQNQHR--------KLLLILTDGKPNDIDNYDGRHGIEDTRQAIIAARRLGLVPFCIT 570 Query: 324 VQAEAADQFLKNCASPDRFYSVQNSRKL 351 + + ADQ+L + F + + +L Sbjct: 571 ID-QKADQYLPYIFGSNGFTVIFDPSQL 597 >gi|295084552|emb|CBK66075.1| hypothetical protein [Bacteroides xylanisolvens XB1A] Length = 475 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 61/184 (33%), Gaps = 34/184 (18%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM +S + + + V + FS I + Sbjct: 313 LDTSGSMAGER-------ERIAKSTLLAIAELTEVQHRKCYV----ILFSDDIE-CIEIT 360 Query: 235 WGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ L F T P + +A KI + II ++D Sbjct: 361 DLGSSFDRLVDFLCQSFHGGTDMEPVITHALRKISEEGYMEAD-----------IITVSD 409 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASPDRF--YSVQNSR 349 E + ++ AK + +YAI + A +LK C D++ YS+QN+ Sbjct: 410 FEMRPVDQLLSRTI---EHAKAKQTKMYAISLGGKSAETSYLKLC---DKYWEYSIQNAE 463 Query: 350 KLHD 353 L+ Sbjct: 464 SLNK 467 >gi|295840162|ref|ZP_06827095.1| von Willebrand factor, type A [Streptomyces sp. SPB74] gi|197698094|gb|EDY45027.1| von Willebrand factor, type A [Streptomyces sp. SPB74] Length = 588 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 73/201 (36%), Gaps = 25/201 (12%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++V+D S SM + G G ++ V S+ + L D GL F++++ Sbjct: 382 LLVVVDASPSMAEPVPGRGQSRMDVTKASLLQALAQFTPADD------IGLWEFATRLDG 435 Query: 230 TFP----LAWGVQHIQEKINRLIFGSTTKSTPGLEY---AYNKIFDAKEKLEHIAKGHDD 282 + ++ + + T + L+ ++D A Sbjct: 436 DRDYRELVPTARLGARKGSSGTQREALTAAFGALQPQPGGATGLYDTTLAAYREASEDYA 495 Query: 283 YKKY--IIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQAEAADQFLKN 335 K+ ++ LTDG N P + L + A + AI V EAA+ ++ Sbjct: 496 ADKFNAVVLLTDGTNEEPGSMTRGELLTRLRDLADPAHP--LPLVAIAVGPEAAEDDMEA 553 Query: 336 C--ASPDRFYSVQNSRKLHDA 354 A+ + V + ++H+ Sbjct: 554 IGGATGGSGFKVDDPAQIHEV 574 >gi|126306131|ref|XP_001365438.1| PREDICTED: similar to calcium-activated chloride channel-2 [Monodelphis domestica] Length = 1061 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 64/201 (31%), Gaps = 29/201 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L + V G+V+FSS + Sbjct: 429 VCLVLDVSAKMAEA-----DRLHRLRQAAEFYL-----LQVVETHTYVGIVSFSSTAMVK 478 Query: 231 FPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + I+ +R L T A + ++ + Y II Sbjct: 479 AQP----REIKNHQDRRQLSSALPTSVMAKNAGASVCLGLQMGLQVIESQHGNAYGSVII 534 Query: 289 FLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFYSV 345 T G + C + G+ ++ I + A+ L +F+ Sbjct: 535 LATSG--------GIGDISSCLSTMVNSGSTIHTIALGPSVAENLEELSTLTGGLKFFVS 586 Query: 346 Q--NSRKLHDAFLRIGKEMVK 364 NS L DAF I Sbjct: 587 DRANSNGLMDAFSGISSGTGD 607 >gi|114557509|ref|XP_001143172.1| PREDICTED: calcium-activated chloride channel regulator 2 isoform 2 [Pan troglodytes] Length = 943 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 77/214 (35%), Gaps = 43/214 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS M + D+L ++ L I +++ V G+ +F SK Sbjct: 312 VCLVLDVSSKMAEA-----DRLLQLQQATEFYLMQI---VEIHTFV--GIASFDSKGEIR 361 Query: 231 FPL-----AWGVQHIQEKINRLIFGSTTKST--PGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + + + + T + GL+ + + Y Sbjct: 362 AQLHQINSNDDRKLLVSYL-PTTVSAKTDISICSGLKKGFEVV---------EKLNGKAY 411 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPD 340 +I +T G++ + L C G+ +++I + + AA L Sbjct: 412 GSVMILVTSGDD--------KLLGNCLPTVLSSGSTIHSIALGSSAAPNLEELSRLTGGL 463 Query: 341 RFY--SVQNSRKLHDAFLRIGK---EMVKQRILY 369 +F+ + NS + DAF RI ++ +Q I Sbjct: 464 KFFVPDISNSNSMIDAFSRISSGTGDIFQQHIQL 497 >gi|320104266|ref|YP_004179857.1| hypothetical protein Isop_2740 [Isosphaera pallida ATCC 43644] gi|319751548|gb|ADV63308.1| protein of unknown function DUF1355 [Isosphaera pallida ATCC 43644] Length = 1239 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 57/172 (33%), Gaps = 30/172 (17%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +++ + L + V+D S S+ ++ + L V RS SG++ Sbjct: 70 TTRVNDALTTLFVIDDSESIPSNYRSAI--LDVVNRS---------QERQRRPEDLSGVI 118 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F P + + +Q N T GL+ A + Sbjct: 119 VFGKNARVETPPSLNPRRLQGLENP-PDPQYTDIAAGLKLALAVFPKDTARR-------- 169 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 I+ ++DG + ++ L AKR+G + + V + D L Sbjct: 170 -----IVLISDGNANRGSV-----LEQAAAAKRQGIPIDTLVVDYKYDDDVL 211 >gi|319640967|ref|ZP_07995675.1| hypothetical protein HMPREF9011_01272 [Bacteroides sp. 3_1_40A] gi|317387412|gb|EFV68283.1| hypothetical protein HMPREF9011_01272 [Bacteroides sp. 3_1_40A] Length = 289 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 45/109 (41%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSNSLDF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S +I + P G +H I+E ++ T +EY N I Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFKPESKRTDIKMAVEYLTNVIKK 174 >gi|229188238|ref|ZP_04315314.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] gi|228595258|gb|EEK53002.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] Length = 425 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 78/209 (37%), Gaps = 23/209 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 I SK+ L++ ++LD S SM + K+ A ++I LD I +V V Sbjct: 117 SIKSKAK-SLNVEILLDASGSMAGKVNGEV-KMEAAKKAIYNYLDKIPDNANVMLRVYGH 174 Query: 220 LVTFSSKIVQTFPLAWGVQHI--------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + + L+ G + +E+ N + K L A + D + Sbjct: 175 KGSNNEN---DKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDFK 231 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAAD 330 + + YI+ +DGE + D + N++ A+V IG + Sbjct: 232 EYTGEENLNVV---YIV--SDGEETCGG-DPVNAAKNLNQS-NTHAVVNIIGFDVKNSEQ 284 Query: 331 QFLKNCASP--DRFYSVQNSRKLHDAFLR 357 Q LK A + +V ++ +LH + Sbjct: 285 QQLKYTAEAGKGNYATVSSADELHQTLNK 313 >gi|57958|emb|CAA79153.1| collagen alpha 2 chain type VI [Mus musculus] Length = 371 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 44/122 (36%), Gaps = 19/122 (15%) Query: 210 PDVNNVVRSGLVTFSSK-----IVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAY 263 P R G+V +S + I + +E + L T + L++AY Sbjct: 4 PKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAY 63 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 N++ + + + + +TDG + P D+ C+ R V AIG Sbjct: 64 NQLIKESRRQKTRV--------FAVVITDGRH-DPRDDDLNLRALCD----RDVTVTAIG 110 Query: 324 VQ 325 + Sbjct: 111 IG 112 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 65/180 (36%), Gaps = 17/180 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ +LD S + + + + + L + + D N R L+ + Sbjct: 180 TQRPVDIVFLLDGSERLGEQNFHKVRRF---VEDVSRRLTLARRDDDPLNA-RMALLQYG 235 Query: 225 SKIVQT--FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ Q FPL + V I E + R S + G+ +A N + Sbjct: 236 SQNQQQVAFPLTYNVTTIHEALERATYLNSFSHVGTGIVHAINNVVRGARGGARRHAELS 295 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +FLTDG + +++ + +++ + + V + L + DR Sbjct: 296 -----FVFLTDGVTGNDSLEES-----VHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDR 345 >gi|311251228|ref|XP_003124501.1| PREDICTED: integrin alpha-D-like [Sus scrofa] Length = 230 Score = 43.3 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 80/206 (38%), Gaps = 30/206 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ ++D S S++ + +R ++D ++ + L+ +S+ + Sbjct: 14 IDIAFLIDGSGSID------QNDFKQMKNFVRAVMDQF-----MDTSILFSLMQYSNLLK 62 Query: 229 QTFPLAW--GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F + + Q ++ ++ T + G++ ++F +K AK K Sbjct: 63 THFTFSQFQTHRSPQSLVDPIVQLKGLTYTATGIQTVVKELFHSKNGARSSAK------K 116 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE----AADQFLKNCA---S 338 ++ +TDG+ D E +A++ I YAIGV +A Q L S Sbjct: 117 ILLVITDGQ---KYKDPLEYEDVIPQAEKANVIRYAIGVGDAFQEHSAKQELSIIGSLPS 173 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVK 364 D + V N L ++ +++ Sbjct: 174 KDHVFKVDNFAALSSIQKKLQEKIFA 199 >gi|282866763|ref|ZP_06275802.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282558391|gb|EFB63954.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 451 Score = 43.3 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 56/163 (34%), Gaps = 23/163 (14%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMND---HFGPGMDKLGVATRSIRE-MLDIIK 207 PL S +++ ++ G +++++D S SM+ D A ++R+ L + Sbjct: 47 PLTDASPARVAGQA-PGAAVVLMVDCSGSMDYPPTKMRNARDATAAAIDTLRDGTLFAVV 105 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + V V + ++ + RL G T L A + Sbjct: 106 AGTHVAKDV------YPGNGRLATADVHTKAQAKDALRRLNAGGGTAIGTWLRLADRLLG 159 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYC 309 A + H I LTDG N + D + +L C Sbjct: 160 AADTGIRHG-----------ILLTDGRNEHESPEDLRAALDAC 191 >gi|259156141|gb|ACV96089.1| von Willebrand factor, type A [Providencia alcalifaciens Ban1] Length = 551 Score = 43.3 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 65/168 (38%), Gaps = 26/168 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDK-LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + +++D+S SM G G K VA + ++ ++ IP V V++ I Sbjct: 378 AVHLLVDISGSMGKPIGEGNRKYFHVANEAALALVMALEGIPGV-----VPAVSYFPGIH 432 Query: 229 QTFPLAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 Q +A V+H + T + +A N + K+K Sbjct: 433 QEVSVALLPKQSVRHRAAYFD-QKPRGCTPMAQAMWFAANSLLAQKQKR----------- 480 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 K +I LTDG+ D + + +R G + IG+Q + ++F Sbjct: 481 KLMIVLTDGDPD----DWAATHDIIDRCRRSGFELLGIGIQTRSVERF 524 >gi|251795506|ref|YP_003010237.1| von Willebrand factor A [Paenibacillus sp. JDR-2] gi|247543132|gb|ACT00151.1| von Willebrand factor type A [Paenibacillus sp. JDR-2] Length = 276 Score = 43.3 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 53/165 (32%), Gaps = 44/165 (26%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR----SGLVTFSS- 225 ++ ++D S SM+D FG K + +I +L + + +R G++ + + Sbjct: 14 ILFIIDQSGSMSDPFGAS-TKSESVSDAINRLLQNLVIKCAKSEGIRDYYHVGVIGYGAG 72 Query: 226 ------------KIVQTFPLAWGVQHIQEKINRLI--------------------FGSTT 253 ++V +A + +++ ++ T Sbjct: 73 VGPAFGGVLAGKEVVPISEIADNPSKLDKRLKKVPDGAGGLVDQLVKFPVWFEPVANGGT 132 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + A + + H +I +TDGE++ Sbjct: 133 PMCEAMRTAQTVLTQWISEYTHCFPP------VVIHITDGESTDG 171 >gi|150378222|ref|YP_001314817.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150032769|gb|ABR64884.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 631 Score = 43.3 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 65/196 (33%), Gaps = 31/196 (15%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + L +T + +S +D D + VLDV + L + + Sbjct: 431 IHMMSRPQAHDLAVTILMDVSLSTDAWFDDLRVLDVE-------KQALQVLAHGLSACGD 483 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 +I+ + VR V + + I+ +I L G T+ + + Sbjct: 484 AHEILTFTSRRRDWVRIETVKAFDEAMSAT--------IEARIAALKPGYYTRMGAAIRH 535 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRG 316 A ++ + + K +I LTDG+ + ++S E +R G Sbjct: 536 AAARLVERPNRR-----------KLLIVLTDGKPNDVDHYEGRFALEDSRRAVGETRRSG 584 Query: 317 AIVYAIGVQAEAADQF 332 V+ + V EA Sbjct: 585 ISVFGVTVDREAKSYV 600 >gi|312883317|ref|ZP_07743043.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309368933|gb|EFP96459.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 432 Score = 43.3 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 83/261 (31%), Gaps = 41/261 (15%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 N N G+ + + ++LPV+ VM L ++ S ++K+ + + L Sbjct: 7 NRLKNSCGAAAFIFILILPVLICVMALSLQASQILLAQSKITEASEVTSLA--------- 57 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + + + + + ++ L + +S++ D + + Sbjct: 58 ----LSALSEERAQQKLSSYATRVLKHYLVGTDDVK-GQATMQSSTFQFQTDLVGEATHE 112 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 F + H P +D+ + D+S SMN Sbjct: 113 FWFKHKPQADTFKVSGASTSRKHKPQP--------------MDVYFITDLSESMNRSEPS 158 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 + + A R + + +P + R+ + ++++ V+ G + + Sbjct: 159 RLTIVKDAIRQV------VSKLPKGS---RAAFIGYNTENVKLT----GRYFDKRTGREI 205 Query: 248 IFGSTTKSTPGLEYAYNKIFD 268 T+ +A I+D Sbjct: 206 TSKKPTELQGPNIWAEKSIYD 226 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 52/165 (31%), Gaps = 31/165 (18%) Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI--NRLIFGSTTKSTPGLEYAYNKIF 267 PD++ +R + L + +E I N + T S +N I Sbjct: 263 PDLDQKIRDFETKY---PFYDISLTTDLGSFKETIKSNAINANGNTHS-------WNGII 312 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGE------------------NSSPNIDNKESLFYC 309 A + + ++ + LTDGE N + N+ + Sbjct: 313 AAAREAHRQPSSVFNPQQVFVLLTDGEDSKKFPKGYYAPLCEKIRNDISDKQNRSQIQNA 372 Query: 310 NEAKRRGAIVYAIGVQA-EAADQFLKNCASPDRFYSVQNSRKLHD 353 + ++ + IGV+ ++ + C + + + +L Sbjct: 373 SVEEKTKVTMSVIGVEFNPYKNEGVTECFGRENIFEAKREDELVK 417 >gi|260559816|ref|ZP_05831995.1| von Willebrand factor [Enterococcus faecium C68] gi|260074040|gb|EEW62363.1| von Willebrand factor [Enterococcus faecium C68] Length = 857 Score = 43.3 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 50/132 (37%), Gaps = 23/132 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +SS+ Sbjct: 17 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 70 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A + + Sbjct: 71 GYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGH-------- 122 Query: 281 DDYKKYIIFLTD 292 KK I+ LTD Sbjct: 123 ---KKVIVLLTD 131 >gi|237721165|ref|ZP_04551646.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|229450000|gb|EEO55791.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] Length = 448 Score = 43.3 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 61/184 (33%), Gaps = 34/184 (18%) Query: 175 LDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA 234 LD S SM +S + + + V + FS I + Sbjct: 286 LDTSGSMAGER-------ERIAKSTLLAIAELTEVQHRKCYV----ILFSDDIE-CIEIT 333 Query: 235 WGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ L F T P + +A KI + II ++D Sbjct: 334 DLGSSFDRLVDFLCQSFHGGTDMEPVITHALRKISEEGYMEAD-----------IITVSD 382 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASPDRF--YSVQNSR 349 E + ++ AK + +YAI + A +LK C D++ Y++QN+ Sbjct: 383 FEMRPVDQLLSRTI---EHAKAKQTKMYAISLGGKSAGTSYLKLC---DKYWEYTIQNTE 436 Query: 350 KLHD 353 L+ Sbjct: 437 SLNK 440 >gi|307256565|ref|ZP_07538346.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306864975|gb|EFM96877.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 531 Score = 43.3 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 32/238 (13%), Positives = 77/238 (32%), Gaps = 20/238 (8%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G +++ +L I ++ + +E++ +A+L L+ ++L + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 67 ENGNN-----GKKQKNDF-----SYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 N+ K+ + F + + + L + + T + Sbjct: 70 RKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTNDDKMNLIPICKTVNNT 129 Query: 117 IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS----DIGLDMM 172 ++ + P + + + + +I +D+M Sbjct: 130 SGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTLEVIPQQVNVASQSRAFKKNTFNIPIDLM 189 Query: 173 MVLDVSLSMNDHF------GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +V D+S SM KLG+ + E+ + D N R + F+ Sbjct: 190 VVADLSGSMRYDITNKYETNNETSKLGILKDVLIELAEKTLLSEDANQHNRIYVTPFA 247 >gi|88801579|ref|ZP_01117107.1| hypothetical protein PI23P_02932 [Polaribacter irgensii 23-P] gi|88782237|gb|EAR13414.1| hypothetical protein PI23P_02932 [Polaribacter irgensii 23-P] Length = 289 Score = 43.3 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 61/188 (32%), Gaps = 21/188 (11%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 K I + + E++ + DI E +S + S V +Y+ W Sbjct: 4 KEILKKVRKIEIKTKRLSNDIFGGEYHSSFK------GRGMTFSEVRQYQFGDDVRAIDW 57 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + + L MM+++DVS S + FG + + Sbjct: 58 NVTARY-----NEPYIKVFEEERELTMMLLVDVSGS--ELFGTA----TQFKKDTVTEIA 106 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLE 260 + N + GL+ FS I P G H+ I LI T L+ Sbjct: 107 ATLAFSATQNNDKVGLILFSDDIELFIPPKKGKSHVLRIIRELIEFQPKSKKTDIAAALK 166 Query: 261 YAYNKIFD 268 + + + Sbjct: 167 FLSSVLKK 174 >gi|299136257|ref|ZP_07029441.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] gi|298602381|gb|EFI58535.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] Length = 806 Score = 43.3 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 71/208 (34%), Gaps = 39/208 (18%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 V + D++++LD SLSM D L + +++ +L +K + Sbjct: 302 PVAAAPTQTAPRDVVLLLDTSLSMQW------DNLERSFAALQAVLLSLKPADHFS---- 351 Query: 218 SGLVTFSSKIV--QTFPLAWGVQHIQEKINRLIFG---STTKSTPGLEYAYNKIFDAKEK 272 L+ F+ + + PLA + +Q+ ++ + T L + Sbjct: 352 --LLLFNQDVTLFKPEPLAATPEAVQQALDFVRASKLRGGTDLGKALTAGLAQAKLPNSS 409 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRR----GAIVYAIGVQA 326 + LTDG + I K + Y + K+ ++A+G Sbjct: 410 --------------LFLLTDGNSDRGTTVITGKIATAYAQQWKQSPTHPRTNIFAVG--D 453 Query: 327 EAADQFLKNCASPDRFYSVQNSRKLHDA 354 +A LK A D + + +A Sbjct: 454 DANLPLLKLLAQNDGVLENVLATEPVEA 481 >gi|148975971|ref|ZP_01812760.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] gi|145964716|gb|EDK29969.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] Length = 520 Score = 43.3 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 64/194 (32%), Gaps = 20/194 (10%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 +G ++ +LL I + +E + ++L + + + + ++ Sbjct: 16 LQRQQGVAAVWMGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIA----VTIEDQP 71 Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 + + + +++I T+ + R + + I ++ K + S Sbjct: 72 DLARGLATQYVENYVRDIKSTNLSAQ-RFHQTEDEGAGILEYIQYTV----NAKTTHDSW 126 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG- 188 + +P S + D +D++ V D S SM+D +G Sbjct: 127 FASSFIPSFDEQQDLAGRSLARKYPVYL-------GDNNIDIVFVSDFSGSMDDRWGSSR 179 Query: 189 ---MDKLGVATRSI 199 +D L A I Sbjct: 180 HKKIDDLKTAIDQI 193 Score = 37.5 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 55/150 (36%), Gaps = 10/150 (6%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 SS + L+ + + I+ + T + G+ + D + Sbjct: 364 SSTQFKNIRLSNKLSDL-NPISSMWADGGTAAFQGILRGSQILKDGDPNSSDDEEQQAYN 422 Query: 284 KKY--IIFLTDGENSSPNIDNKE--SLFYCNEAKRR--GAIVYAIGVQAEAADQF-LKNC 336 KK ++ L+DG+ S N K C++A+ G + IG+ A+ Q ++C Sbjct: 423 KKIKMLLILSDGQESPNNGILKGLVDRGMCDKAREEIPGLYIGVIGIDFRASQQSGFQDC 482 Query: 337 ASP--DRFYSVQNSRKLHDAFLRIGKEMVK 364 + V N +L + + ++ K Sbjct: 483 VIDPNEDIIDVSNLDELIEKIEELIRKGSK 512 >gi|119478002|ref|ZP_01618102.1| transporter [marine gamma proteobacterium HTCC2143] gi|119448915|gb|EAW30157.1| transporter [marine gamma proteobacterium HTCC2143] Length = 330 Score = 43.3 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 19/116 (16%), Positives = 41/116 (35%), Gaps = 14/116 (12%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M + P ++ + ++ +++ LD+S SM D ++ Sbjct: 73 MTLLSILLPIALAGPSWEKRLSPFTEDNAA------LVIALDLSESM-DQRDIQPSRIQR 125 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 A + I ++LD +GL+ F+ PL+ Q I ++ + Sbjct: 126 AKQKIMDLLD-------KRGDSYTGLIAFAGTSHTVIPLSNDRQVISHFLDAISTN 174 >gi|254414399|ref|ZP_05028165.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196178629|gb|EDX73627.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 801 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 54/175 (30%), Gaps = 28/175 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ ++D S S G K R L+ + ++ ++ + Sbjct: 326 DVVFLMDTSGS---QRGEPFLKSQELMRRFINGLNPDDTFTIID-------FANTTTQLS 375 Query: 230 TFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 PLA IN L T+ G+ N A +L + Sbjct: 376 PQPLANTAPNRTQALNYINGLQANGGTELMNGIRAVLNFPPSAPNRL-----------RS 424 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 ++ LTDG + + + + G +Y+ GV + L A R Sbjct: 425 VVLLTDGYIG----NESQVIAEVQRQLKPGNRLYSFGVGSSVNRFLLNRLAEVGR 475 >gi|163786713|ref|ZP_02181161.1| hypothetical protein FBALC1_16047 [Flavobacteriales bacterium ALC-1] gi|159878573|gb|EDP72629.1| hypothetical protein FBALC1_16047 [Flavobacteriales bacterium ALC-1] Length = 288 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 37/125 (29%), Gaps = 10/125 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L MM++ DVS S I + + Sbjct: 58 NVTARYNEPYIKVFEEERELTMMLMADVSGS------KLFGTKNQFKDEIVTEIAATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNK 265 N + GL+ FS +I P G H+ I L+ T L++ N Sbjct: 112 SATQNNDKIGLILFSDEIELYIPPKKGRSHVLRIIRELLEFKPKSKNTNIAEALKFLSNV 171 Query: 266 IFDAK 270 + Sbjct: 172 MKKKA 176 >gi|332360778|gb|EGJ38585.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK355] Length = 452 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 58/201 (28%), Gaps = 36/201 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP------------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQI 282 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 + KK+++ LTD E + K G V+A Sbjct: 283 IS-----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKA 336 Query: 327 EAADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 IEG--IYKNFATEGRVLDIEN 355 >gi|99033852|gb|ABF61893.1| magnesium chelatase D subunit [Arthrospira platensis] Length = 627 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 50/129 (38%), Gaps = 19/129 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQ 229 ++ V+D S SM ++++ A ++ ++L + + L+ F + Sbjct: 468 VVFVVDASGSMA------LNRMQSAKGAVMQLLTEA-----YQSRDQVSLIPFRGEQAEV 516 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + + ++ R+ G + GL A +AK+ D + I+ Sbjct: 517 LLPPTRSIAAAKRRLERMPCGGGSPLAHGLTQAVRVGMNAKQ-------SGDIGQVVIVA 569 Query: 290 LTDGENSSP 298 +TDG + P Sbjct: 570 ITDGRGNIP 578 >gi|291612496|ref|YP_003522653.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291582608|gb|ADE10266.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 754 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 64/177 (36%), Gaps = 37/177 (20%) Query: 163 SKSDIGLDMMMVLDVSLSMNDH--------FGPGMDKLGVATRSIREMLDIIKSI---PD 211 + +M++LD+S S+N+ + + + ++ ++ D + Sbjct: 558 RNDGRNIAVMLLLDLSESLNEKAAGSDQTILELSQEAVSLLGWAVEKLGDPFAIAGFHSN 617 Query: 212 VNNVVR-SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + VR + +S K W Q ++ ++ + +T+ + +A Sbjct: 618 TRHDVRFLHIKGYSEK--------WDDQ-VKGRLAAMEASYSTRMGAAMRHA-------- 660 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN-----KESLFYCNEAKRRGAIVYAI 322 H + KK ++ LTDG+ S + + ++ E ++G Y I Sbjct: 661 ---AHYLEKQQADKKLMLILTDGQPSDVDSKDGELLIADTRQAVKELDQQGIYSYCI 714 >gi|187918197|ref|YP_001883760.1| hypothetical membrane associated protein [Borrelia hermsii DAH] gi|119861045|gb|AAX16840.1| hypothetical membrane associated protein [Borrelia hermsii DAH] Length = 373 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 62/173 (35%), Gaps = 31/173 (17%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV-- 215 +K S + LD++ V+DV+ SM +H + + + +M++ P +N Sbjct: 221 ILKKSEEPLADLDLVFVIDVTDSMKNH-------IEILREHLLDMIE-----PQLNQFRS 268 Query: 216 VRSGLVTFSSKIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 R G V + + ++ +++ + G + N + Sbjct: 269 YRVGFVFYKDYLEDFLTRSFDFNSREYLSNVFEGINVGGGGDYPEAVFEGINSAVTQFDW 328 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN---EAKRRGAIVYAI 322 ++II L N+ P+ + + Y + AK + I+Y I Sbjct: 329 RAD--------SRFIIVL---GNAPPHEYPRGPIVYEDVIRAAKEKDIIIYGI 370 >gi|302804190|ref|XP_002983847.1| hypothetical protein SELMODRAFT_423092 [Selaginella moellendorffii] gi|300148199|gb|EFJ14859.1| hypothetical protein SELMODRAFT_423092 [Selaginella moellendorffii] Length = 557 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 65/188 (34%), Gaps = 15/188 (7%) Query: 170 DMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVTFSS 225 + ++L+ S SM++ G + VA I+++L+ + + V G + S Sbjct: 195 SLYILLETSTSMSNPTGVLSSQTRFNVANNIIKKLLNTLTNGDQVAVSTIGGEKIGAPVS 254 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ + + I + + S T S ++ D ++ Sbjct: 255 VVLGVQETSLDLAGISSLKDSISNTSVTNSASNIKNGLQAALDFFNTSSNLN-------- 306 Query: 286 YIIFLTDGE---NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II TDG+ + N + A+ VY IG F + +S + Sbjct: 307 VIILFTDGQFVTPGNFNFTQLSPVLAQLNARGVVVFVYRIGSFTTNDATFQQMQSSLNMS 366 Query: 343 YSVQNSRK 350 Y V N K Sbjct: 367 YEVINDDK 374 >gi|257069049|ref|YP_003155304.1| von Willebrand factor type A-like protein [Brachybacterium faecium DSM 4810] gi|256559867|gb|ACU85714.1| von Willebrand factor type A-like protein [Brachybacterium faecium DSM 4810] Length = 331 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 56/165 (33%), Gaps = 14/165 (8%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREM 202 A L + ++ + ++ V+D + SM + + +L + + Sbjct: 43 MVALLLAVALRPVTPIESEQTERMNANVFFVVDRTGSMNAEDYAGDRPRLEGVRADMTRV 102 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEY 261 +++ + R ++ F S + PL I+ L T ST ++ Sbjct: 103 MEMTEGA-------RYSILAFDSTATRQLPLTTDAGAAAAWIDTLTTEPTAYSTGSNVDR 155 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIF-LTDGENSSPNIDNKES 305 A N + AK D +++ L DGEN+ + Sbjct: 156 ALNPLL----VEISEAKREDPDSSVLVYVLADGENTDGQDAESFT 196 >gi|254882025|ref|ZP_05254735.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|294776172|ref|ZP_06741661.1| conserved hypothetical protein [Bacteroides vulgatus PC510] gi|254834818|gb|EET15127.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|294449995|gb|EFG18506.1| conserved hypothetical protein [Bacteroides vulgatus PC510] Length = 301 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 45/109 (41%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + + + + + + N + G++ F Sbjct: 84 EEERELTVMLLIDVSNSLDF------GTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 137 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S +I + P G +H I+E ++ T +EY N I Sbjct: 138 SDRIEKFIPPKKGRKHILYIIRELLDFKPESKRTDIKMAVEYLTNVIKK 186 >gi|146302762|ref|YP_001197353.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146157180|gb|ABQ08034.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] Length = 709 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 76/175 (43%), Gaps = 19/175 (10%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + + + ++ +++ ++DVS SM D M+KL + +S++ + Sbjct: 321 PWNSQNKILKIGLQGKNIATNDLPSS-NLVFLIDVSGSMED-----MNKLPLLKQSMKIL 374 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ ++ V+ VV +G +V + I + +++L G +T G+E A Sbjct: 375 VNELRPTDKVSIVVYAGAAG----MVLPPTSGNEKKTIIKALDQLEAGGSTAGGAGIELA 430 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 Y E+ KG ++ +I TDG+ + + N + E ++ G Sbjct: 431 YKIAT------ENFIKGGNNR---VILATDGDFNVGSSSNSDMEKLIEEKRKTGV 476 >gi|58429525|gb|AAW78166.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ + Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRN--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG +S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPNSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|217970531|ref|YP_002355765.1| von Willebrand factor type A [Thauera sp. MZ1T] gi|217507858|gb|ACK54869.1| von Willebrand factor type A [Thauera sp. MZ1T] Length = 833 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 60/181 (33%), Gaps = 28/181 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + L + +++D S SM D + A R+++ + + R L Sbjct: 262 PAAAHPLAVKILVDCSGSMQG------DSIAAARRALQ------AIVAGLREGERFSLSR 309 Query: 223 FSSKIVQTFPLAWGVQHI-----QEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHI 276 F S + W Q +L T+ L+ + Sbjct: 310 FGSTVEHRSRALWRTSPATRLAGQRWAAQLQADLGGTEMEKALDS-TLALAGDASVSPGA 368 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 +G ++ +TDG+ ID + +A+ G V+ +G+ + A+ L+ Sbjct: 369 GEGAAPVD--LLLITDGQ--IHAIDRTVA-----KARALGHRVFVVGIGSAPAEGVLRRL 419 Query: 337 A 337 A Sbjct: 420 A 420 >gi|119481411|ref|XP_001260734.1| von Willebrand domain protein [Neosartorya fischeri NRRL 181] gi|119408888|gb|EAW18837.1| von Willebrand domain protein [Neosartorya fischeri NRRL 181] Length = 941 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 69/188 (36%), Gaps = 33/188 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSI-REMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+D S SM D L V +S+ + I S ++ + + ++++ Sbjct: 290 IIFVIDRSGSMMDKIDTLKSALRVFLKSLPVGVCFNICSFGSAHSFLWIQSLFYTAES-- 347 Query: 230 TFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 +Q ++ + T+ +E + KE ++ Sbjct: 348 -------LQEALSFVDGVRADMGGTEMQEAVEATVHSRMKDKELE-------------VL 387 Query: 289 FLTDGENSSPNIDNKESL--FYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS-- 344 LTDG+ I N+++L F A A +++G+ A+ ++ A +S Sbjct: 388 ILTDGQ-----IWNQQTLFKFIRETAADNSARFFSLGIGNGASHSLVEGIARAGNGFSQL 442 Query: 345 VQNSRKLH 352 V N +L Sbjct: 443 VVNYEELD 450 >gi|58429477|gb|AAW78142.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429505|gb|AAW78156.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 581 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 69/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNENAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGKTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429527|gb|AAW78167.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 575 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 69/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNENAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGKTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|16130013|ref|NP_416577.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|89108893|ref|AP_002673.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|157161564|ref|YP_001458882.1| von Willebrand factor type A domain-containing protein [Escherichia coli HS] gi|170019600|ref|YP_001724554.1| von Willebrand factor type A [Escherichia coli ATCC 8739] gi|170081703|ref|YP_001731023.1| hypothetical protein ECDH10B_2223 [Escherichia coli str. K-12 substr. DH10B] gi|188494220|ref|ZP_03001490.1| von Willebrand factor type A domain protein [Escherichia coli 53638] gi|209919537|ref|YP_002293621.1| hypothetical protein ECSE_2346 [Escherichia coli SE11] gi|238901263|ref|YP_002927059.1| hypothetical protein BWG_1863 [Escherichia coli BW2952] gi|300947800|ref|ZP_07161956.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|300955096|ref|ZP_07167500.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|312973679|ref|ZP_07787851.1| von Willebrand factor type A domain protein [Escherichia coli 1827-70] gi|13878870|sp|P76396|YEGL_ECOLI RecName: Full=Uncharacterized protein yegL gi|1788388|gb|AAC75134.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|85675210|dbj|BAE76580.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|157067244|gb|ABV06499.1| von Willebrand factor type A domain protein [Escherichia coli HS] gi|169754528|gb|ACA77227.1| von Willebrand factor type A [Escherichia coli ATCC 8739] gi|169889538|gb|ACB03245.1| conserved protein [Escherichia coli str. K-12 substr. DH10B] gi|188489419|gb|EDU64522.1| von Willebrand factor type A domain protein [Escherichia coli 53638] gi|209912796|dbj|BAG77870.1| conserved hypothetical protein [Escherichia coli SE11] gi|238860337|gb|ACR62335.1| conserved protein [Escherichia coli BW2952] gi|260448827|gb|ACX39249.1| von Willebrand factor type A [Escherichia coli DH1] gi|300317939|gb|EFJ67723.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|300452620|gb|EFK16240.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|309702397|emb|CBJ01719.1| conserved hypothetical protein [Escherichia coli ETEC H10407] gi|310332274|gb|EFP99509.1| von Willebrand factor type A domain protein [Escherichia coli 1827-70] gi|315136707|dbj|BAJ43866.1| hypothetical protein ECDH1ME8569_2010 [Escherichia coli DH1] gi|323161851|gb|EFZ47728.1| von Willebrand factor type A domain protein [Escherichia coli E128010] gi|323936878|gb|EGB33162.1| von Willebrand type A protein [Escherichia coli E1520] gi|323940856|gb|EGB37044.1| von Willebrand protein type A [Escherichia coli E482] gi|332343854|gb|AEE57188.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 219 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + D + + P V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNA---GLVTFRDELLADPLALKRVELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E KR ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQGADMKTLAQ 171 >gi|327274818|ref|XP_003222173.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1097 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 55/170 (32%), Gaps = 19/170 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ VLD S S D + ++ + + ++ ++ FSS + Sbjct: 48 IDIVYVLDSSESAKDVLFDKQKEFVTLLSDKLFLMKPTRVL---RYDIKLAIMQFSSSVR 104 Query: 229 QTFPLA-W-GVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 +P W + + ++ + + T S + A K K Sbjct: 105 IDYPFDEWRSLPDFKLRVKEMTYIGHGTYSYYAISNATQLFKTEGRKSS---------VK 155 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLK 334 ++ + DG + + D + A+ G IG+ + L Sbjct: 156 VVVLMADGIDHPKSPDVQ---AISEAARTFGISFITIGLSNVADKVKLLS 202 >gi|320105086|ref|YP_004180677.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319752368|gb|ADV64128.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 356 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 45/125 (36%), Gaps = 8/125 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++++LDVS SM R+ M + +P +V V F + Sbjct: 109 ILLLLDVSPSMRLKDAGPEGNQTRRRRAAELMESFFERVPISQALVSI--VAFYNGAKPV 166 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + ++ I+ +N L +F A E+ IAK + ++ + Sbjct: 167 VVDSRDLEVIRNILNDLPLEQ------AFPTGRTTLFTALEEAAAIAKPWNPKSALVVIV 220 Query: 291 TDGEN 295 TDG+ Sbjct: 221 TDGDT 225 >gi|226310168|ref|YP_002770062.1| hypothetical protein BBR47_05810 [Brevibacillus brevis NBRC 100599] gi|226093116|dbj|BAH41558.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 437 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 68/216 (31%), Gaps = 43/216 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-- 226 ++ ++LD S SM G K+ +A +I+E + + R L + K Sbjct: 136 FNVEIILDASGSMAGKIGDK-TKMQLAKEAIQEFAEAL------PEDARISLRVYGHKGS 188 Query: 227 ------------IVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +PL A+ + +++ ++ T L A + + Sbjct: 189 NADEHKQLSCGSSEMVYPLQAYDAKRLEQALDMFEPTGWTSIAHSLRLAQEDLAGFEADK 248 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGV--QAEAA 329 I ++DG + + + E + + + IG AE Sbjct: 249 NTN---------VIYLVSDGIETC----DGNPVAVAKELSQSKIMPLLNVIGFDVNAEGQ 295 Query: 330 DQFLKNC-ASPDRFYSVQNSRKLHDAF---LRIGKE 361 Q + AS + +V N + I ++ Sbjct: 296 KQLKEIAHASEGLYANVTNREQFKQELERAKEIAQK 331 >gi|134102411|ref|YP_001108072.1| putative magnesium-chelatase subunit [Saccharopolyspora erythraea NRRL 2338] gi|291004204|ref|ZP_06562177.1| putative magnesium-chelatase subunit [Saccharopolyspora erythraea NRRL 2338] gi|133915034|emb|CAM05147.1| putative magnesium-chelatase subunit [Saccharopolyspora erythraea NRRL 2338] Length = 716 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 52/150 (34%), Gaps = 21/150 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQ 229 ++ V+D S SM +++ + ++ +L + G+VTF Sbjct: 523 VLFVVDASGSMA-----ARERMSAVSGAVLSLLR-----DAYQRRDKVGVVTFRGDSAEV 572 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P V ++ RL G T GL A + + + ++ Sbjct: 573 ALPPTSSVDTAAVRMRRLRTGGRTPLADGLLKANKVVGTERTRDPRRRP-------LLVL 625 Query: 290 LTDGENSSP---NIDNKESLFYCNEAKRRG 316 LTDG+ + P ++D + + + G Sbjct: 626 LTDGKATVPLKSDVDGRARRAVDDALRAAG 655 >gi|145480079|ref|XP_001426062.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124393134|emb|CAK58664.1| unnamed protein product [Paramecium tetraurelia] Length = 2123 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 32/161 (19%), Positives = 59/161 (36%), Gaps = 23/161 (14%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +LD S SM G D++ + R + L N + +++F+ K Sbjct: 1945 ILDDSFSMEGKKG---DEMMESLRQQLKFLKS-------NKYAKVSVISFNYKASLQIEF 1994 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + ++I + G T P L+ ++I ++K++ YI+ +DG Sbjct: 1995 KKPKAKLIKQIT--LVGGITNFDPPLKLCLDQILKYEKKIDQA---------YILLYSDG 2043 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 E S P E + E + + I + I LK Sbjct: 2044 EGSYPQQSLAEYITLSQELRNK--ISFLICTAGSKPQTLLK 2082 >gi|324499488|gb|ADY39781.1| Mesocentin [Ascaris suum] Length = 3894 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 71/187 (37%), Gaps = 19/187 (10%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 W + + T+ +I ++ D D+M VLD S D+F P ++ G + ++ Sbjct: 3119 WPSRKTTLESQRTTPARICTRIDYQADVMFVLDSS----DNFSP--EQYGHLKEGLSTLI 3172 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D ++ +VV+ G V +S K + P+A G + ++ S A Sbjct: 3173 D--ETFDLSPDVVQVGFVEYSDKA--SVPVALGHYEDKVQL----LTDIANSEQLFGEAI 3224 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + G + + ++ +T+G N + E L R ++ + Sbjct: 3225 VLKGLNAARQQFQLHGRKNVPRVLLLITNGVNRGNAANAAEDLRE-----RYNVELFILA 3279 Query: 324 VQAEAAD 330 V A A Sbjct: 3280 VNASADA 3286 >gi|254514589|ref|ZP_05126650.1| TPR repeat-containing protein [gamma proteobacterium NOR5-3] gi|219676832|gb|EED33197.1| TPR repeat-containing protein [gamma proteobacterium NOR5-3] Length = 608 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 57/169 (33%), Gaps = 30/169 (17%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGPG 188 R +P + + ++ P + K D +++VLD+S SM D Sbjct: 60 GRSGLPALLLAWMIGVLAAAGPSWQQLPQPVLQKQDA---LVLVLDLSYSMLATDLQPSR 116 Query: 189 MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI 248 D++ R++LD+++ + + L+ ++ PL I + L Sbjct: 117 QDRVR------RKLLDLLRERREGL----TALIAYAGDAHIVAPLTDDNPTIANLLPALT 166 Query: 249 FGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + LE A + A + I+ +TDG Sbjct: 167 PEMMPLPGSNPVDALERAVALLDSAGVRRGR-----------ILLVTDG 204 >gi|167590268|ref|ZP_02382656.1| putative transmembrane protein [Burkholderia ubonensis Bu] Length = 377 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 MS ++ +G+++I A+ + ++ L I+ + + +L D + + A Sbjct: 1 MSIMHTGRSGKKQEGAVAITVALCMVILLGFAALAIDIGNLLIARNELQNSADAAAMAGA 60 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 ++ + +N + D+ + N + TS + Sbjct: 61 GCLIRRTACSNTSASQPDWPT-----------ADATASAFSTSATTNQVQGTS--VQTST 107 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 Y + + Y + + P+ ++ P + + K S ++ + + + Sbjct: 108 VATGYWNTTGTPYGLE----SLPFTPGANDLPAVQVTIRKDGSNANGAVPIFL 156 >gi|94995219|ref|YP_603317.1| Fibronectin-binding protein [Streptococcus pyogenes MGAS10750] gi|94548727|gb|ABF38773.1| Fibronectin-binding protein [Streptococcus pyogenes MGAS10750] Length = 990 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 28/144 (19%), Positives = 55/144 (38%), Gaps = 18/144 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + D G D+M +LDVS M ++F +++ ++ K + N VR L Sbjct: 218 PKQIDEGADVMALLDVSQKMTQENFNKAKEQIKRLVTTLTGKSSDGKENHNRRNSVR--L 275 Query: 221 VTFSSKIVQTFPLAWGV--------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +TF KI + L+ I +K+ + + + A + EK Sbjct: 276 MTFYRKISEPIDLSGKTSDEVEKELNKIWDKVKKEDWDWGVDLQGAIHKAREIFRSSYEK 335 Query: 273 LEHIAKGHDDYKKYIIFLTDGENS 296 +++I+ + GE++ Sbjct: 336 KSGK-------RQHIVLFSQGEST 352 >gi|74318639|ref|YP_316379.1| rubisco activation protein CbbO [Thiobacillus denitrificans ATCC 25259] gi|74058134|gb|AAZ98574.1| rubisco activation protein CbbO [Thiobacillus denitrificans ATCC 25259] Length = 785 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 76/223 (34%), Gaps = 26/223 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS----MNDHFGPGMDKLG 193 I + P ++ SV+ + ++++LD+S S + D +D Sbjct: 567 IQSFTDIRLGNQPDPRIMMRSVRKTRD----FSILVLLDLSESTNETVQDQEYSVLDLTR 622 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 A + + ++ + ++ G + F W + + K+ + +T Sbjct: 623 QACVLLADAINKVGDPFAIHGFCSDGRHDVEYYRFKDFDQHWN-EVPKAKLAGMTGQLST 681 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFY 308 + + +A H K KK +I +TDGE + P ++ Sbjct: 682 RMGAAIRHA-----------GHHLKLQRSAKKLLIVITDGEPADIDVRDPQYLRYDTKKA 730 Query: 309 CNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKL 351 E R G Y + + AD ++ + V + ++L Sbjct: 731 VEEVARNGVTTYCMSLD-PRADNYVSRIFGQKNYMVVDHVQRL 772 >gi|296270453|ref|YP_003653085.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296093240|gb|ADG89192.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 690 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 48/143 (33%), Gaps = 18/143 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQ 229 ++ V+D S SM ++ ++ +L + + GLVTF Sbjct: 515 VLFVVDASGSMG-----ARRRMTAVKTAVLSLL-----LDAYQRRDKVGLVTFRGTAAEV 564 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P V+ ++ L G + GL A + + E + ++ Sbjct: 565 PLPPTSSVEAGAARLRALATGGRSPLGAGLAKAAEVL-----RAERLRDPARRP--LLVL 617 Query: 290 LTDGENSSPNIDNKESLFYCNEA 312 +TDG + + ++ + A Sbjct: 618 VTDGRATDGDAADRAARLLTGTA 640 >gi|268317869|ref|YP_003291588.1| hypothetical protein Rmar_2321 [Rhodothermus marinus DSM 4252] gi|262335403|gb|ACY49200.1| protein of unknown function DUF58 [Rhodothermus marinus DSM 4252] Length = 292 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 38/95 (40%), Gaps = 8/95 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+DVS S G G R + + + + N R GL+ FS +I Sbjct: 79 LLLVVDVSGS------QGFGVRGRTKRELAAEICAVLGFSALRNHDRVGLLLFSDRIEAF 132 Query: 231 FPLAWGVQHIQEKINRLIF--GSTTKSTPGLEYAY 263 P G +H+ + L +T++ + Y Sbjct: 133 VPPRKGRRHVLRLVRDLYACRPGSTRTDLRVALDY 167 >gi|224051386|ref|XP_002199700.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 565 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 70/212 (33%), Gaps = 29/212 (13%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y+MP F T L + S +++ ++D S S+ + Sbjct: 345 SYQMPTWFGTTK-YVKPLVQKLCSHEQMLCSKTCYNSVNIGFLIDGSSSIGE-------- 395 Query: 192 LGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRL 247 + R + E + + +++++ + V F+ + F + + I N Sbjct: 396 --INFRLMLEFVSNVAKAFEISDIGSKVAAVQFTYNQRKEFGFTDHVTKEKVLSAIHNIQ 453 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T + + + +F + + K ++I LTDG++ + Sbjct: 454 YMSGGTATGDAISFTTRTVFGPVKDGPN--------KNFLIVLTDGQSYDDVTGPAAAAK 505 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 G V+++GV D + + P Sbjct: 506 K------AGITVFSVGVAWAPLDDLKEMASEP 531 >gi|54296668|ref|YP_123037.1| structural toxin protein RtxA [Legionella pneumophila str. Paris] gi|53750453|emb|CAH11847.1| structural toxin protein RtxA [Legionella pneumophila str. Paris] Length = 7679 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 49/253 (19%), Positives = 92/253 (36%), Gaps = 8/253 (3%) Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 ++ + NN FSY IK ++ + + D + I S +++ Sbjct: 6505 VSNSSLNNETFDIGLFSYNTIKTT-PSEININMGLSLTDSDGDKINSSIEINLAPSVFKV 6563 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 N+ S +P + S ++ V + +LD S SM+ Sbjct: 6564 GENVDDTSSSNVPHRVGGDTGVIDGSGGADILVGDVGGVEVVGTTARLAFILDESGSMSQ 6623 Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEK 243 +FG G +L V +++ ++L + + P+ + V LV F+S + T +Q+ Sbjct: 6624 NFG-GTTRLEVLKQAMTDILTELSNTPNASITVH--LVKFASVVNGTGTFEITGGGLQQA 6680 Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 ++ I G + Y + G D ++ + F TDG + N Sbjct: 6681 LD-FISGLQIQQGLLAGTNYEAALGQTVQWFSSQSGTVDVQQTL-FFTDGVPTFYMDGN- 6737 Query: 304 ESLFYCNEAKRRG 316 S Y N A+ G Sbjct: 6738 -STEYTNLARVYG 6749 >gi|326382924|ref|ZP_08204614.1| ATPase [Gordonia neofelifaecis NRRL B-59395] gi|326198514|gb|EGD55698.1| ATPase [Gordonia neofelifaecis NRRL B-59395] Length = 624 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 52/153 (33%), Gaps = 18/153 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V+D+S SM +L + ++L D V VV G Sbjct: 447 VVFVVDLSGSM-----TARSRLAAVRNACVDLLRDSYTRRDRVAVVVACG-----KDAYV 496 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P V+ ++ ++ G T GL I E+ + ++ Sbjct: 497 AVPPTRSVEIAVARLAQVRTGGRTPLAEGLHRGLEVI----ERAARVDPTRRPL---LVV 549 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 +TDG ++ ++ + +E RRG + Sbjct: 550 MTDGRATAGPDASRRADAAADEIARRGISAVVV 582 >gi|86134837|ref|ZP_01053419.1| conserved hypothetical protein [Polaribacter sp. MED152] gi|85821700|gb|EAQ42847.1| conserved hypothetical protein [Polaribacter sp. MED152] Length = 289 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 81/243 (33%), Gaps = 45/243 (18%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 K I + + E++ + DI E +S + S V +Y+ T W Sbjct: 4 KEILKKVRKIEIKTKRLSNDIFGGEYHSSFK------GRGMTFSEVRQYQFGDDVRTIDW 57 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + + L M++++DVS S + FG + + Sbjct: 58 NVTARY-----NEPYVKVFEEERELTMLLMVDVSGS--EFFGTT----NQFKKDTITEIA 106 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLE 260 + N + GLV FS I P G H+ I LI T + L+ Sbjct: 107 ATLAFSATQNNDKVGLVLFSDDIELYIPPKKGKSHVLRIIRELIEFKPKSKKTNISVALK 166 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR---RGA 317 + + + K+ I+F+ + + D +++ AK+ G Sbjct: 167 FLSSVLK----------------KRAIVFM---LSDFMDDDYEKTAKIA--AKKHDLTGI 205 Query: 318 IVY 320 VY Sbjct: 206 RVY 208 >gi|332519336|ref|ZP_08395803.1| protein of unknown function DUF58 [Lacinutrix algicola 5H-3-7-4] gi|332045184|gb|EGI81377.1| protein of unknown function DUF58 [Lacinutrix algicola 5H-3-7-4] Length = 287 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 40/111 (36%), Gaps = 10/111 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L M++++D+S S + FG I + + N + GL+ F Sbjct: 72 EEERELTMLLMVDISGS--ELFGTD----QQFKNEIITEISATLAFSATQNNDKIGLILF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNKIFDAK 270 S +I P G H+ I LI T L++ N + Sbjct: 126 SDEIELYIPPKKGRSHVLRIIRELIEFEPKSKGTNVAEALKFMRNVMKKKA 176 >gi|302336991|ref|YP_003802197.1| protein of unknown function DUF58 [Spirochaeta smaragdinae DSM 11293] gi|301634176|gb|ADK79603.1| protein of unknown function DUF58 [Spirochaeta smaragdinae DSM 11293] Length = 290 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 45/125 (36%), Gaps = 6/125 (4%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + + + + + +V+DVS S+ FG G ++ +L + Sbjct: 58 NVSSRMGQPYVKTFREEREMALFLVIDVSASLG--FGNGKLSKQETVATLAALL----AF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 V+N R G FS +I + P G I ++ L A + ++ Sbjct: 112 SAVHNNDRVGAAFFSDRIEKWVPPRKGRNQIFRLAGDIMEVEPKGKGSDLALALRGVHES 171 Query: 270 KEKLE 274 ++ Sbjct: 172 VKRRG 176 >gi|58429511|gb|AAW78159.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG +S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPNSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|327313517|ref|YP_004328954.1| hypothetical protein HMPREF9137_1259 [Prevotella denticola F0289] gi|326944811|gb|AEA20696.1| conserved hypothetical protein [Prevotella denticola F0289] Length = 290 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 21/102 (20%), Positives = 40/102 (39%), Gaps = 10/102 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ +G R + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDF------GTVGQLKRECATEIAATLAFSAIQNNDKIGIIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEY 261 S + + G +HI I + T GLEY Sbjct: 126 SDHVEKYIAPKKGRKHILYLIREMLTFTPESRKTDVGAGLEY 167 >gi|325860284|ref|ZP_08173406.1| hypothetical protein HMPREF9303_2130 [Prevotella denticola CRIS 18C-A] gi|325482163|gb|EGC85174.1| hypothetical protein HMPREF9303_2130 [Prevotella denticola CRIS 18C-A] Length = 290 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 21/102 (20%), Positives = 40/102 (39%), Gaps = 10/102 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ +G R + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDF------GTVGQLKRECATEIAATLAFSAIQNNDKIGIIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEY 261 S + + G +HI I + T GLEY Sbjct: 126 SDHVEKYIAPKKGRKHILYLIREMLTFTPESRKTDVGAGLEY 167 >gi|291239167|ref|XP_002739504.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 603 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 72/195 (36%), Gaps = 31/195 (15%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +++VLDVS SM+D+ ++L + ++ + V N G+V FS+ Sbjct: 318 LRIVLVLDVSGSMDDN-----NRLDLLLQASTRYIGYT-----VPNATWIGIVEFSNDAT 367 Query: 229 QTFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 L GV+ +E I L T GL + + E G Sbjct: 368 ILSELVQIVGVETRKELIEELPDDAKGATSIGSGLLAGLSVL-------ERGPGGAAGGI 420 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC-ASPDR-- 341 ++I +DGE ++P E +V + + EA + K A+ Sbjct: 421 IFLI--SDGEENTPPYMKDVVDLLVQEE----VVVDTLALSDEADEGLAKLSDATGGTAY 474 Query: 342 -FYSVQNSRKLHDAF 355 + S LHDAF Sbjct: 475 WYSESDESTALHDAF 489 >gi|198424351|ref|XP_002122740.1| PREDICTED: similar to collagen, type XXVIII [Ciona intestinalis] Length = 870 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 58/175 (33%), Gaps = 19/175 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + D++M +D S DH M L R I ++I R G++ F Sbjct: 67 QPKCANDIIMAVDSSACFRDHHAKMMRFLRKLVRRIGR----TENIQYGGTETRLGIMQF 122 Query: 224 SSKIVQTFPLA-----WGVQHIQEKINRLIFGSTTKSTPG----LEYAYNKIFD-----A 269 SS I+ L + ++ + +T G L A N D Sbjct: 123 SSDILFPLHLNSFEDYTNPSTRRGLMSGIEQALSTLGFLGEGSFLNKALNATVDHFQSEP 182 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 + + K +I +T+G+ S P++ + K G V + V Sbjct: 183 RPEELLTQISPQTPKPVVILMTNGK-SHPSVTMDDIELSIAGLKAAGVTVIPVSV 236 >gi|196011371|ref|XP_002115549.1| hypothetical protein TRIADDRAFT_59612 [Trichoplax adhaerens] gi|190581837|gb|EDV21912.1| hypothetical protein TRIADDRAFT_59612 [Trichoplax adhaerens] Length = 795 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 26/130 (20%), Positives = 44/130 (33%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ VLD S SMN G L + ++ L ++S + L+T + Sbjct: 3 LIFVLDTSASMNQQSYLGTSYLDITKGAVEYFL-KLRSRDPACRTDKCMLIT-TDDPPAM 60 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF-----DAKEKLEHIAKGHDDY 283 + W ++ L ++ L A+N + +K H Sbjct: 61 IKVGWRESHYSFLNELKNLQATGYSQMDLALREAFNLLNVNRMVSGIDKYGQGRDPHSIE 120 Query: 284 KKYIIFLTDG 293 II LTDG Sbjct: 121 PAVIITLTDG 130 >gi|319945321|ref|ZP_08019583.1| group 2 glycosyl transferase [Lautropia mirabilis ATCC 51599] gi|319741891|gb|EFV94316.1| group 2 glycosyl transferase [Lautropia mirabilis ATCC 51599] Length = 224 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 58/165 (35%), Gaps = 16/165 (9%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++LDVS SM K+ ++ + +KS V +VTF +V Sbjct: 23 CLLLLDVSGSMAG------GKIEELNAGLQAFEEELKSDSLSAKRVEVAIVTFGPVLVAQ 76 Query: 231 FPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + L T +E + K + G Y+ +I Sbjct: 77 -----DFTSASQFQAPQLQAQGLTPMGQAIEEGIELLRQ--RKALYRQSGITYYRPWIFL 129 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 +TDG + +++ EA++ A YA+GV DQ + Sbjct: 130 ITDGAPTDSWRQAAQAIAS-GEARKEFA-FYAVGVDDADMDQLRQ 172 >gi|229113208|ref|ZP_04242703.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] gi|228670234|gb|EEL25582.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] Length = 425 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 70/204 (34%), Gaps = 22/204 (10%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 L++ ++LD S SM + K+ A ++I LD I +V V + + Sbjct: 121 KAKSLNVEILLDASGSMAGKVNGEV-KMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNN 179 Query: 225 SKIVQTFPLAWGVQHI--------QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 L+ G + +E+ N + K L A + D ++ Sbjct: 180 EN---DKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDFKEYTGE 236 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKN 335 + YI+ +DGE + + + + IG + Q LKN Sbjct: 237 ENLNVV---YIV--SDGEETCGGDPVNAAKNLNQSSTHAVVNI--IGFDVKNSEQQQLKN 289 Query: 336 CASP--DRFYSVQNSRKLHDAFLR 357 A + +V ++ +LH + Sbjct: 290 TAEAGKGNYATVSSADELHQTLNK 313 >gi|218517234|ref|ZP_03514074.1| hypothetical protein Retl8_28685 [Rhizobium etli 8C-3] Length = 176 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 62/154 (40%), Gaps = 30/154 (19%) Query: 28 IFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNI 87 + + +G + + V+ K+ LD +L+ +I N + + K + D+ + ++N Sbjct: 3 MLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQINNTGDTDALKLKVTDWFHAQVENS 62 Query: 88 WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCAN 147 + ID ++N++A + +P F AN Sbjct: 63 YTLG-------------------------EIDIDTTNHNITATASGTVPTTFMKI---AN 94 Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 P+ + S+VK + S L++ +V+D S SM Sbjct: 95 IDTVPVSVASAVKGPATS--YLNVYIVIDTSPSM 126 >gi|6225079|sp|Q9ZGE6|BCHD_HELMO RecName: Full=Magnesium-chelatase 67 kDa subunit; Short=Mg-chelatase subunit D; AltName: Full=Mg-protoporphyrin IX chelatase gi|3820559|gb|AAC84032.1| Mg chelatase subunit D BchD [Heliobacillus mobilis] Length = 666 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 48/129 (37%), Gaps = 20/129 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQ 229 ++ V+D S SM + ++ A ++ +L+ N + L+ F + Sbjct: 476 IIFVVDASGSMAFN------RMSSAKGAVSVLLNEA-----YVNRDKVALIIFRGQQAET 524 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P V+ +++ +++ G + + A ++ G D + I Sbjct: 525 LVPPTRSVELAKKRFDQVPVGGGSPLAGAIAQAIEVGVNS--------IGSDVGQVIITL 576 Query: 290 LTDGENSSP 298 +TDG + P Sbjct: 577 ITDGRGNVP 585 >gi|328885837|emb|CCA59076.1| hypothetical protein SVEN_5790 [Streptomyces venezuelae ATCC 10712] Length = 865 Score = 42.9 bits (99), Expect = 0.072, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 63/211 (29%), Gaps = 44/211 (20%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + +D +V+D S S++D + +++ R+ ++ Sbjct: 57 PGEGPDPVDFAVVVDQSASLSDKDLARETEAAGLLSQG-----------EISERSRAAVI 105 Query: 222 TF-SSKIVQTFPL----------AWGVQHIQEKINRL------IFGSTTKSTPGLEYAYN 264 F SS+ P+ A G + + + + L G T + A + Sbjct: 106 GFGSSEKPGQSPVREVCPLTVADAAGRERLSDCVQELGRRDAARMGPGTDFPAAIRQAVS 165 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDG--------------ENSSPNIDNKESLFYCN 310 ++ K + LTDG E+ N + + + Sbjct: 166 RLTADGTAGG-TGAAGKPAPKVVFLLTDGKLDVKDSPEYGTDPESRQSNGEKRLTEELA- 223 Query: 311 EAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 A+ G ++ +G +E L A Sbjct: 224 RARAAGVQIWPLGFGSEIDRAALTAMAEGGY 254 >gi|331647768|ref|ZP_08348860.1| conserved hypothetical protein [Escherichia coli M605] gi|330911953|gb|EGH40463.1| Mg-chelatase subunit ChlD [Escherichia coli AA86] gi|331043492|gb|EGI15630.1| conserved hypothetical protein [Escherichia coli M605] Length = 378 Score = 42.9 bits (99), Expect = 0.072, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSSATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|212633291|ref|YP_002309816.1| VCBS [Shewanella piezotolerans WP3] gi|212554775|gb|ACJ27229.1| VCBS [Shewanella piezotolerans WP3] Length = 1477 Score = 42.9 bits (99), Expect = 0.072, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 66/200 (33%), Gaps = 27/200 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDK--LGVATRSIREMLDIIKSIPDV--NNVVRSGL 220 + ++ ++D S SM D L V I +V + ++ Sbjct: 1022 AGADYNLAFLIDSSGSMGDSAVATAKAQILSVLATLITNANQPSAGTVNVLLVDFDQTAK 1081 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + + + PLA I + + G TT + AYN D + + Sbjct: 1082 ILIAIDLSSNDPLA----SITTALEAMSSGGTTNYSAAFTAAYNWFNDNYPQGNNRT--- 1134 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI------VYAIGVQAEAADQFLK 334 F+TDGE PN DN + Y A+ A+ V AIG+ L+ Sbjct: 1135 -------FFITDGE---PNTDNGQPGDYFENAQNAFALLNALSYVEAIGLGGNVNSSTLQ 1184 Query: 335 NCASPDRFYSVQNSRKLHDA 354 + + + L DA Sbjct: 1185 QFDTEAPIINNVDVDDLADA 1204 >gi|29831916|ref|NP_826550.1| hypothetical protein SAV_5373 [Streptomyces avermitilis MA-4680] gi|29609033|dbj|BAC73085.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 458 Score = 42.9 bits (99), Expect = 0.072, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 52/161 (32%), Gaps = 32/161 (19%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 + D + +M +D S SM+ K+ A + +D + + V Sbjct: 62 PYASGAAPDAAVAVM--VDCSGSMDY----PPTKMRNARDATAAAIDTL------RDGVH 109 Query: 218 SGLVTFSSKIVQTFP--------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 ++ + + +P A ++ + +L G T L A + A Sbjct: 110 FAVIGGTHVAKEVYPGGGRLAVADARTRDQAKQALRKLSAGGGTAIGTWLRLADRLLSSA 169 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYC 309 + H I LTDG N + D + +L C Sbjct: 170 DVSIRHG-----------ILLTDGRNEHESPQDLRAALDAC 199 >gi|71028594|ref|XP_763940.1| thrombospondin-related protein [Theileria parva strain Muguga] gi|68350894|gb|EAN31657.1| thrombospondin-related protein [Theileria parva] Length = 931 Score = 42.9 bits (99), Expect = 0.072, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 63/176 (35%), Gaps = 9/176 (5%) Query: 170 DMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 D ++VLD S SM++ ++ + ++ + S L+ ++ V+ + + + + Sbjct: 271 DFVLVLDESESMSNYNWKKYVKEVTLLLASSISHLNKDNTLSIVH-YSNVPTLRLNFQKI 329 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH-DDYKKYI 287 + I E + + LEY ++ E E + K + Sbjct: 330 DPEAFQNTLDKINEMFQMRRSYGKSYTGKALEYVRQQLLHLPEIPEGSSSDSPKASNKVV 389 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAEAADQFLKNCASPDR 341 I +TDG ESL + G V+A+G E + L C + Sbjct: 390 ILMTDGAAKDIEKAYNESL----ALRYNGVELFVFAVGFVNEENCRKLVGCPNEGH 441 >gi|257878265|ref|ZP_05657918.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257812493|gb|EEV41251.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1107 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 50/132 (37%), Gaps = 23/132 (17%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 LD+++V+D S SMND+ +++G + +D + + + + G V +SS+ Sbjct: 267 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTLAD-SGITDKINMGYVGYSSE 320 Query: 227 IVQTFPLAW---GVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 A ++ ++ + T + L A + + Sbjct: 321 GYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGH-------- 372 Query: 281 DDYKKYIIFLTD 292 KK I+ LTD Sbjct: 373 ---KKVIVLLTD 381 >gi|224370039|ref|YP_002604203.1| hypothetical protein HRM2_29520 [Desulfobacterium autotrophicum HRM2] gi|223692756|gb|ACN16039.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 292 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 11/105 (10%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 +K+ + + +M+V+D+S S G+ A + IK N + Sbjct: 68 IKLYREERESV-VMLVIDMSASQGFGTFSGLKLEKAAEVASVLAFSAIK------NNDKV 120 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGL 259 G++ F+ ++ + P G H+ I + G T + L Sbjct: 121 GVIFFTDQVEKYIPPKKGTGHVWRLIKEIFTFVPQGRGTDISAAL 165 >gi|182439279|ref|YP_001826998.1| hypothetical protein SGR_5486 [Streptomyces griseus subsp. griseus NBRC 13350] gi|326779931|ref|ZP_08239196.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|178467795|dbj|BAG22315.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] gi|326660264|gb|EGE45110.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 249 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 50/145 (34%), Gaps = 21/145 (14%) Query: 170 DMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 + +V+D S SM ++ G + L + LD ++P +V FS+ + Sbjct: 48 AVYLVVDHSGSMRPYYNDGSVQALADRVLGLSSHLDDDGTVP---------VVFFSTDVD 98 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 +A G H +I+ + G A + + D H ++ Sbjct: 99 AVTDIALGNHH--GRIDEIRAGLGHMGKTSYHLAMDAVID------HYLDSGSTAPALVV 150 Query: 289 FLTDGENSSPNIDNKESLFYCNEAK 313 F TDG P + C AK Sbjct: 151 FQTDG---GPINKLAAERYLCKAAK 172 >gi|313885994|ref|ZP_07819732.1| conserved hypothetical protein [Porphyromonas asaccharolytica PR426713P-I] gi|332299335|ref|YP_004441256.1| protein of unknown function DUF58 [Porphyromonas asaccharolytica DSM 20707] gi|312924524|gb|EFR35295.1| conserved hypothetical protein [Porphyromonas asaccharolytica PR426713P-I] gi|332176398|gb|AEE12088.1| protein of unknown function DUF58 [Porphyromonas asaccharolytica DSM 20707] Length = 290 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 55/131 (41%), Gaps = 17/131 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + + ++N R GL+ + Sbjct: 70 EEERELTIMLLVDVSHSLDF------GSTSETKRDLVATIAATIAFACIHNNDRVGLMLY 123 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + ++ + P G +H+ + I ++ + + +I + E L + K Sbjct: 124 TDRVERYIPAGQGRKHVLQLIREILTYRPERHS-------TQISSSLEMLSRVVKKRCSA 176 Query: 284 KKYII--FLTD 292 +I+ F+TD Sbjct: 177 --FIVSDFITD 185 >gi|167738778|ref|ZP_02411552.1| hypothetical protein Bpse14_11973 [Burkholderia pseudomallei 14] Length = 578 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 42/126 (33%), Gaps = 8/126 (6%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 GS +++ AI + V +G ++ + FFV+ L + D + L A ++ + + Sbjct: 1 GSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCAQP 55 Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 + N L D + + + + + + Sbjct: 56 NAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTATQS 112 Query: 135 MPFIFC 140 +P+ F Sbjct: 113 VPYFFL 118 >gi|167535479|ref|XP_001749413.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772041|gb|EDQ85698.1| predicted protein [Monosiga brevicollis MX1] Length = 2014 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 52/131 (39%), Gaps = 10/131 (7%) Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 S + Q + + ++ + F T + GL++ ++F + ++ Sbjct: 392 SWTLYQDLIETTDQYALNDLLDNVEFPAGATHLSWGLDFIDREMF---RLAAGMRSSNNS 448 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-- 340 + +I LTDG ++ P + E Y K +G +YAIGV + + + + P Sbjct: 449 IPRVLIVLTDGRSN-PGFEPDE---YSTALKDKGIEIYAIGVGDYYSIEVQEMASEPKDR 504 Query: 341 RFYSVQNSRKL 351 + + N L Sbjct: 505 HAFELSNQDDL 515 >gi|118595076|ref|ZP_01552423.1| hypothetical protein MB2181_05370 [Methylophilales bacterium HTCC2181] gi|118440854|gb|EAV47481.1| hypothetical protein MB2181_05370 [Methylophilales bacterium HTCC2181] Length = 700 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 17/106 (16%), Positives = 33/106 (31%), Gaps = 15/106 (14%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + L T +++A+ + IIFLTDG+ Sbjct: 398 NKKSGLQFTKSLEADGGTDPLEAIKFAFTS----------KKIPSQPLLRQIIFLTDGQV 447 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 S+ + Y ++ K + IG+ + + A R Sbjct: 448 SNEHEIIDTVRQYIDQDK-----FFTIGIGSAPNSYLMTKLADYGR 488 >gi|226137|prf||1411304A thrombospondin related protein TRAP Length = 559 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 71/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + S+ + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYSEEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|136153|sp|P16893|TRAP_PLAFA RecName: Full=Thrombospondin-related anonymous protein; Flags: Precursor gi|9978|emb|CAA31440.1| unnamed protein product [Plasmodium falciparum] Length = 559 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 71/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + S+ + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYSEEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|227661|prf||1708291A thrombospondin related protein Length = 559 Score = 42.9 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 71/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + S+ + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYSEEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|327401667|ref|YP_004342506.1| Magnesium chelatase [Archaeoglobus veneficus SNP6] gi|327317175|gb|AEA47791.1| Magnesium chelatase [Archaeoglobus veneficus SNP6] Length = 628 Score = 42.9 bits (99), Expect = 0.074, Method: Composition-based stats. Identities = 38/263 (14%), Positives = 84/263 (31%), Gaps = 31/263 (11%) Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + NN N K +L + G + S + Y Sbjct: 357 EHKQNNSNADGNAKGNATSK-YDAASVDVDLSKGGVSTAKRASRGSRDERATVIGHPHGY 415 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKIS-SKSDIGLDMMMVLDVSLSMNDH 184 +S+V A + + +++ K + +++LD S SM Sbjct: 416 PISSVPGVASDVDIVATIRTAAMNGRRQICDEDIRVRVRKVKLPRLTVIMLDASGSMA-- 473 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR---SGLVTFSSK-IVQTFPLAWGVQHI 240 M ++ +A ++++ N+ V+ L+TF P + Sbjct: 474 ---AMRRIRIAKGVAAKLIE--------NSYVKRDSLALITFRGHAAEVLVPPTRRYSGV 522 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAK-EKLEHIAKGHDDYKKYIIFLTDGENSSP- 298 + ++ ++ G T + L+ + + + I KG + +TDG+ ++P Sbjct: 523 MDALHNVVIGGRTPLSSALQTLLVLARSFRLKNRDSIVKG--------VLITDGKANTPL 574 Query: 299 --NIDNKESLFYCNEAKRRGAIV 319 +E + K+ G + Sbjct: 575 YGKSIKEELQMLASAIKKSGIKL 597 >gi|302894803|ref|XP_003046282.1| hypothetical protein NECHADRAFT_33272 [Nectria haematococca mpVI 77-13-4] gi|256727209|gb|EEU40569.1| hypothetical protein NECHADRAFT_33272 [Nectria haematococca mpVI 77-13-4] Length = 728 Score = 42.9 bits (99), Expect = 0.074, Method: Composition-based stats. Identities = 16/110 (14%), Positives = 36/110 (32%), Gaps = 15/110 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K D++ ++D + SM + A ++ ++D I VR +V + Sbjct: 57 KKAYATDLLFLIDTTGSM-------YSYIKAARDQVKRIMDDITKAFFNEAEVRIAVVGY 109 Query: 224 SSKIV----QTFPLAWGVQHIQEKINRLIFGSTTKST----PGLEYAYNK 265 Q + ++ I++ + G++ A N Sbjct: 110 KDHADKPNIQFLDFTTSIDDVRSFIDKFKASGGGDAPEDVLGGIDQAINA 159 >gi|227887129|ref|ZP_04004934.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|227835479|gb|EEJ45945.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|307554146|gb|ADN46921.1| von Willebrand factor type A domain protein [Escherichia coli ABU 83972] Length = 219 Score = 42.9 bits (99), Expect = 0.074, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 66/172 (38%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SMN G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMN---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVRVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG ++ +F E K+ ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTNEWQAAANKVFQGEEDKK--FAFFSIGVQGADMKTLAQ 171 >gi|89096888|ref|ZP_01169779.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89088268|gb|EAR67378.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 459 Score = 42.9 bits (99), Expect = 0.074, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 80/216 (37%), Gaps = 28/216 (12%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 P L +I + ++++++D S SM G +K+ +A +I+E ++ Sbjct: 136 PELPDGEDEIQQAKNQKSNIVILMDASGSMKADVSGG-NKMMLAKETIKEFTSSLEDDAS 194 Query: 212 VNNVVRSGLVTFSS--------KIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 V+ + + T + +I + FPL A+ + ++ T ++ A Sbjct: 195 VSLMAYGHVGTGNDEDKAESCSRIDEVFPLGAYEKTAFNKSMDSFEASGWTPLAGAIDKA 254 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVY 320 + A DYK + ++DG + + + + + + V Sbjct: 255 RELLS---------AYNSTDYKNTLYIVSDGVETC----DGDPVEAAQQLQGSNIEAKVN 301 Query: 321 AIGVQAEAADQ-FLKNC--ASPDRFYSVQNSRKLHD 353 IG + Q LK A + +V++ +L D Sbjct: 302 IIGFDVDDEGQKQLKEVAEAGGGTYATVRDKDELED 337 >gi|166367043|ref|YP_001659316.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089416|dbj|BAG04124.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 218 Score = 42.9 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 71/216 (32%), Gaps = 22/216 (10%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + + ++++LD S SM + I+ + + V Sbjct: 4 GVAEFVENQETRCPVVLLLDTSGSMQGE------PIKALNDGIKTFQEDVMRDIQATLSV 57 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP---GLEYAYNKIFDAKEKL 273 + +VTF GV+ +Q+ + F T + + A D E Sbjct: 58 ETAIVTFGRG---------GVKTVQDFVGIDQFTPPTLTAGDLTPMGKAIELALDLIEDR 108 Query: 274 EHIAKGH--DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + I + H Y+ +I +TDG + + + + +++GVQ ++ Sbjct: 109 KAIYRNHGIQYYRPWIFLITDGAPTDQWNLAAQRVKQAEA--ENRVLFFSVGVQGADMEK 166 Query: 332 FLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 + +P + + R L K + +I Sbjct: 167 LKQISNNPPVLLNGLDFRDLFQWLSNSMKRVSGGKI 202 >gi|284050388|ref|ZP_06380598.1| hypothetical protein AplaP_02827 [Arthrospira platensis str. Paraca] gi|291570280|dbj|BAI92552.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 220 Score = 42.9 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 65/171 (38%), Gaps = 31/171 (18%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++++LD S SM+ G +D+L + + + V+ +VTF S + Sbjct: 17 VVLLLDTSGSMS---GQPIDQLN---QGLVTFQQELSQDSLAARRVQVAIVTFDSYV--- 67 Query: 231 FPLAWGVQHIQEKIN-------RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 +Q+ ++ L TT G+ A N I K+ L + Sbjct: 68 -------NVVQDFVDFDQFNPPHLSTTGTTAMGEGIGTALNLIESHKQLLRANGIDYYRP 120 Query: 284 KKYIIFLTDGENSSPNIDNKESL--FYCNEAKRRGAIVYAIGVQAEAADQF 332 +++ +TDG + + + L FY N+ + +AI VQ + Sbjct: 121 --WLLMITDGAPTDSWQNAAQMLHQFYANK----KVVFFAIAVQGADMNTL 165 >gi|269962785|ref|ZP_06177126.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269832475|gb|EEZ86593.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 582 Score = 42.9 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 41/138 (29%), Gaps = 13/138 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F W S S ++ + ++VLD+S SM ++L Sbjct: 54 FAIWGLAWAIACIALAGPSWQSNTRPS-FELSQNRVLVLDMSRSM-YATDVKPNRLSQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GLV ++ PL + I L+ Sbjct: 112 YKALDLLPKWKEGS-------TGLVAYAGDAYTLSPLTTDSSTLAGIIENLSPELMPYQG 164 Query: 253 TKSTPGLEYAYNKIFDAK 270 + +E A + A Sbjct: 165 SNLPSAIETALGQFTQAG 182 >gi|261855659|ref|YP_003262942.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261836128|gb|ACX95895.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 625 Score = 42.9 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 74/209 (35%), Gaps = 32/209 (15%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + +++ ++ +++DVS SM + + A R + ++L P G+ Sbjct: 31 VPAQAATPPELHVLIDVSGSMKQTDPNNLRR--PALRLLGDLL------PPSAQ---IGI 79 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG- 279 F K+ A ++E++ + + N+ F A Sbjct: 80 WFFGDKVSLMLKTAGADPKVKERVRQT----------AKKIRSNEPFTDIPAALAAAAAT 129 Query: 280 -HDDYKKYIIFLTDG--ENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQAEAADQ 331 +D + I+ L+DG + S N + + + V+ I + +A + Sbjct: 130 WNDGTDRNILLLSDGMVDISPEKAINVRAQEELLQKLVPQLRAEHIRVHTIALSKDADSK 189 Query: 332 FLKNCA--SPDRFYSVQNSRKLHDAFLRI 358 L A + F ++ L AFL+I Sbjct: 190 LLSQIAADTGGIFVEADSADALQRAFLKI 218 >gi|156977400|ref|YP_001448306.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156528994|gb|ABU74079.1| hypothetical protein VIBHAR_06187 [Vibrio harveyi ATCC BAA-1116] Length = 515 Score = 42.9 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 62/203 (30%), Gaps = 17/203 (8%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 G +I +LL I +E + ++L + + + + Sbjct: 16 GVAAIWMGLLLVPIMGFTFWAVEGTRYVQESSRLRDSAEAAAMAV--------TIEDQPG 67 Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 + + ++N + L F Q + + Y ++A + ++ Sbjct: 68 AARALATKYVENYVRDIKSTNLSAQRFYQAEDKGTGALEYI--------QYTVNARTTHD 119 Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLG 193 F P + + D +D++ V D S SMND +G K+ Sbjct: 120 SWFASSFIPSFDKQQELAGRSLARKYPAYLGDNNIDIVFVSDFSRSMNDKWGSSWNKKID 179 Query: 194 VATRSIREMLDIIKSIPDVNNVV 216 +I ++ + I V Sbjct: 180 DLKTAIDQISNNILCKSTRQEYV 202 Score = 37.1 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 59/152 (38%), Gaps = 11/152 (7%) Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 ++SS+ L+ + + I + T + G+ + + + Sbjct: 358 SYSSQ-FHNIQLSNKLSDLDS-IKSMWADGGTAAFQGILRGSQVLHEGDPNSSDQEEQQA 415 Query: 282 DYKKY--IIFLTDGENSSPNIDNKESLFY--CNEAKRR--GAIVYAIGVQAEAADQF-LK 334 KK ++ L+DG+ S N K + + C++A++ G + IG+ A+ Q + Sbjct: 416 YNKKIKMLLILSDGQESPDNGILKGLVDWGMCDKARQEIPGLYIGVIGIDFRASQQSGFQ 475 Query: 335 NCASP--DRFYSVQNSRKLHDAFLRIGKEMVK 364 +C + V N +L + + ++ K Sbjct: 476 DCVVDPREDIIDVSNLDELIEKIEELIRKGSK 507 >gi|324499478|gb|ADY39777.1| Mesocentin [Ascaris suum] Length = 4390 Score = 42.9 bits (99), Expect = 0.076, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 71/187 (37%), Gaps = 19/187 (10%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 W + + T+ +I ++ D D+M VLD S D+F P ++ G + ++ Sbjct: 3615 WPSRKTTLESERTTPARICTRIDYQADVMFVLDSS----DNFSP--EQYGHLKEGLSTLI 3668 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 D ++ +VV+ G V +S K + P+A G + ++ S A Sbjct: 3669 D--ETFDLSPDVVQVGFVEYSDKA--SVPVALGHYEDKVQL----LTDIANSEQLFGEAI 3720 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + G + + ++ +T+G N + E L R ++ + Sbjct: 3721 VLKGLNAARQQFQLHGRKNVPRVLLLITNGVNRGNAANAAEDLRE-----RYNVELFILA 3775 Query: 324 VQAEAAD 330 V A A Sbjct: 3776 VNASADA 3782 >gi|31873561|emb|CAD97767.1| hypothetical protein [Homo sapiens] Length = 363 Score = 42.9 bits (99), Expect = 0.076, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 32/81 (39%), Gaps = 5/81 (6%) Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI-----DNKE 304 G+ T + L Y + + L + + II LTDG+++ D+ Sbjct: 5 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 64 Query: 305 SLFYCNEAKRRGAIVYAIGVQ 325 + N+ + +YAIGV Sbjct: 65 EILNINQKRNDYLDIYAIGVG 85 >gi|332828720|gb|EGK01412.1| hypothetical protein HMPREF9455_02245 [Dysgonomonas gadei ATCC BAA-286] Length = 289 Score = 42.9 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 26/109 (23%), Positives = 44/109 (40%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S + R + + + + N + G+V F Sbjct: 72 EEERELTVMLLVDVSASQDF------GTRNSLKRDMVTEIAATLAFSAIQNNDKIGVVFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S KI + P G +H I+E IN + T L+Y N I Sbjct: 126 SDKIEKFIPPKKGKKHILYIIRELINFQADSAKTDVGMALKYLTNVIKK 174 >gi|330508119|ref|YP_004384547.1| magnesium-chelatase subunit ChlD [Methanosaeta concilii GP-6] gi|328928927|gb|AEB68729.1| magnesium-chelatase subunit ChlD (Mg-protoporphyrin IXchelatase) (Mg-chelatase subunit D) [Methanosaeta concilii GP-6] Length = 661 Score = 42.9 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 56/160 (35%), Gaps = 22/160 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQ 229 ++ V+D S SM + ++ A ++ +L + + G++ F + Sbjct: 462 VLFVVDASGSMGAN-----QRMESAKGAVLSLL-----MDSYQKRDKIGMIAFKGKEAEI 511 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P V ++ L G T + GL + + + ++ K ++ Sbjct: 512 ILPPCTSVDLALGRLRELPTGGKTPLSAGLSRGLQLL-------QGELRKDEESKLMMVL 564 Query: 290 LTDGENSS--PNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 ++DG + E + K G ++ I + +E Sbjct: 565 ISDGRANEGMGGKIKDELMAISERIKHLG--IHTIVIDSE 602 >gi|326674112|ref|XP_002664610.2| PREDICTED: calcium-activated chloride channel regulator 4 [Danio rerio] Length = 828 Score = 42.9 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 50/161 (31%), Gaps = 29/161 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LDVS SM + AT +R ++ S G+V FS+ Sbjct: 299 AVCLILDVSGSMATE-SRILRMRQAATHLLRNYVEEQAS---------VGIVKFSTAASI 348 Query: 230 TFPLAWGVQHIQEK--INRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L IN L G +T GL + + Sbjct: 349 VSSLTIIESDATRDHLINLLPETPGGSTNMCNGLRLGLQVLSEDDMDAIGDE-------- 400 Query: 286 YIIFLTDGENSSP---NIDNKESLFYCNEA---KRRGAIVY 320 IIFLTDG+ S D ++ K I+Y Sbjct: 401 -IIFLTDGQLESVGTRTSDWFNGTVSVDQTVGNKTSFVIIY 440 >gi|315639746|ref|ZP_07894885.1| conserved hypothetical protein [Enterococcus italicus DSM 15952] gi|315484523|gb|EFU74980.1| conserved hypothetical protein [Enterococcus italicus DSM 15952] Length = 1228 Score = 42.9 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 29/150 (19%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 +D+++V+D+S SM P + + + D V+ V G V +SS Sbjct: 356 KPIDIVLVIDMSGSMEPSRAPAVR--QGIKDFFKYIQDA-----GVSQYVNVGFVGYSSI 408 Query: 226 ----------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 + +Q +I K+NR F T + G++ + + Sbjct: 409 NQSNYVGLIQEGIQPISNTTHTNNINTKLNR-TFIGGTFTQLGIKTGADMLTS------- 460 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + +K +I LTDG + N+ + Sbjct: 461 ---YNSTNEKMMILLTDGVPTYSYKVNEST 487 >gi|209808861|ref|YP_002264399.1| hemolysin-type calcium-binding protein [Aliivibrio salmonicida LFI1238] gi|208010423|emb|CAQ80774.1| hemolysin-type calcium-binding protein [Aliivibrio salmonicida LFI1238] Length = 2890 Score = 42.9 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 57/155 (36%), Gaps = 33/155 (21%) Query: 171 MMMVLDVSLSMN------DHFGPGMDK--LGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + +VLD S SM+ D+ P DK + + + MLD +K + + + V+ LV Sbjct: 2404 ISLVLDSSGSMDHKPFKSDNSNPDQDKTRMELVLEASIAMLDNVK-VQEGSEEVKVQLVD 2462 Query: 223 FSSKIVQTFPL---AWGVQHIQEKINRLIFG----------------STTKSTPGLEYAY 263 F + + + G +Q I+ L T G+ Y Sbjct: 2463 FDDQKHSSQDKDVESLGWFTVQSAIDALNAALVDIAEKDKDEHFYPKGGTDYEEGI---Y 2519 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + ++ G + + FL+DG+N+ Sbjct: 2520 AVMSGYQDTQITNITG--ETNDVVYFLSDGDNNGG 2552 >gi|160725|gb|AAA29777.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 42.9 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNENAIHLYANIFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|3925399|gb|AAC79447.1| TrkA-like protein [Pseudomonas sp. G-179] Length = 632 Score = 42.9 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 66/180 (36%), Gaps = 35/180 (19%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD----------VNNVVR 217 L + +++DVSLS + F L V ++ + + + + D + VR Sbjct: 442 DLAVTLLVDVSLSTDSWFNDR-RVLDVEKEALMVLAEGLSACGDNHSILTFTSRRRDWVR 500 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + + + ++ +I L G T+ + +A K+ + Sbjct: 501 VETIKAFDEPMS--------HAVRRRIASLKPGYYTRIGAAIRHAAAKLSEQP------- 545 Query: 278 KGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K ++ LTDG+ + +++ EA+R+G V+ + V +A Sbjct: 546 ----NRKHLMLVLTDGKPNDVDHYEGRFALEDTRRSVIEARRKGVQVFGVTVDQDAKSYV 601 >gi|294791467|ref|ZP_06756624.1| putative von Willebrand factor type A domain protein [Scardovia inopinata F0304] gi|294457938|gb|EFG26292.1| putative von Willebrand factor type A domain protein [Scardovia inopinata F0304] Length = 358 Score = 42.9 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 51/141 (36%), Gaps = 10/141 (7%) Query: 159 VKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + ++K+ D+ + +D + SM +D L + + + DI D + Sbjct: 71 TENTTKAVKATDVFIAVDTTGSMAVSDAHYKSEKTLTRLAAARQAITDIASMYSDAS--- 127 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEH 275 ++F + P+ + + N L+ + T L+ N + A EK Sbjct: 128 -FSAISFGASTTIDLPMTPDSNAVTQWANTLVTEATATSRGSSLDAPINTLITAMEKTRQ 186 Query: 276 IAKGHDDYKKYIIFLTDGENS 296 YI +DGE++ Sbjct: 187 AHPNDSIVLYYI---SDGEST 204 >gi|91974615|ref|YP_567274.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB5] gi|91681071|gb|ABE37373.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB5] Length = 372 Score = 42.9 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 72/204 (35%), Gaps = 27/204 (13%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M I + APL I S ++++ +++ VLD + SM+ K+ Sbjct: 1 MKRITMLRALAFAALIAPLAIPS----AARARPAVEVAFVLDTTGSMSGLIEGAKRKIWS 56 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTF----SSKIVQTFPLAWGVQHIQEKINRLIFG 250 +I + + +R GLV + + + L +Q + ++ L Sbjct: 57 IATAIVD--------SNPGADIRMGLVAYRDIGDDYVTRNVELTPDIQDLYARLLELQAR 108 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 + A + + + + I+FL G+ + P++D + Y Sbjct: 109 GGGDWPESVNEALDVAVNKLRWSKDGDT------RRIVFLV-GD-APPHMDYAQDTKYPT 160 Query: 311 E---AKRRGAIVYAIGVQAEAADQ 331 A+++ IV A+ A + Sbjct: 161 TLSVARQKDIIVNAVQAGAARDTE 184 >gi|284799403|ref|ZP_06390123.1| PilC protein [Neisseria subflava NJ9703] gi|284797751|gb|EFC53098.1| PilC protein [Neisseria subflava NJ9703] Length = 1126 Score = 42.9 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 53/151 (35%), Gaps = 13/151 (8%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR--EMLDIIKSI 209 P + + +S + + ++M ++D S SM + + R E L + Sbjct: 44 PFYLQNKTDVSGQPKVKHNIMFLIDDSGSMQWNVQGKETSVRADKRITITKEALKSVLKE 103 Query: 210 PDVNNVVRSGLVTFSSKIVQTFP----LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + GL T + P + +Q +++ + G T T + Sbjct: 104 YGEKQRFQWGLQTLHNNGRTDTPDEGGFTDDWKDVQRRVDGIDPGHATPIT-------RR 156 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 ++ + + K Y+I ++DG+ + Sbjct: 157 YYEVVKNFVMPNIKYRCQKSYVIVMSDGDAN 187 Score = 36.7 bits (83), Expect = 5.8, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 28/77 (36%), Gaps = 13/77 (16%) Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQFLKNCAS-PDRFYSVQN 347 DG+ S P + + +G +E ++L+ AS PD +++ Sbjct: 351 DGDPSDPKGIDYSKQL---------VQTFTVGFGEGISEVGREYLEKGASRPDWYFNAAK 401 Query: 348 SRKLHDAFLRIGKEMVK 364 L +AF I + Sbjct: 402 KEDLLEAFKTIVDNIEN 418 >gi|297194950|ref|ZP_06912348.1| toxic cation resistance protein [Streptomyces pristinaespiralis ATCC 25486] gi|197723405|gb|EDY67313.1| toxic cation resistance protein [Streptomyces pristinaespiralis ATCC 25486] Length = 238 Score = 42.9 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 51/163 (31%), Gaps = 25/163 (15%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDV 212 S++ + +V+D S SM D++ G + L + LD ++P Sbjct: 21 TAGVSLQKHGLHGQRAAVYLVVDYSGSMKDYYKDGSVQALADRVLGLSANLDDDGTVP-- 78 Query: 213 NNVVRSGLVTFSSKIV--QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 +V FS+ + L I E ++ L T ++ + Sbjct: 79 -------VVFFSTDVDAVTDIALENHHGRIDEIVSGLGHMGKTSYHLAMDAVID------ 125 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 H ++F TDG P + C AK Sbjct: 126 ----HYLDSGSTDPALVVFQTDG---GPINKPAAERYLCKAAK 161 >gi|110636656|ref|YP_676863.1| hypothetical protein CHU_0231 [Cytophaga hutchinsonii ATCC 33406] gi|110279337|gb|ABG57523.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406] Length = 251 Score = 42.9 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 60/154 (38%), Gaps = 24/154 (15%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + + I + + D++ ++D + SM D + +++ +++D Sbjct: 75 YFSQTKDSVYSVITNVIKNHATQDADIVFLIDNTGSM-------TDDIENVKKNLNKLID 127 Query: 205 IIKSIPDVNNVVRSGLVTFSSKI------VQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 ++K++ VR + + K + L ++ ++ I+ + + Sbjct: 128 LLKTLQH----VRVAVALYGDKNSDGSAWYKRTELTPDLETTRKFIHSIYVNGGGDTP-- 181 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 E AY+ ++ K + K K+ I+ + D Sbjct: 182 -ESAYDALY----KTTNELKWKSSSKRMILLIGD 210 >gi|160712|gb|AAA29771.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 42.9 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYASVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L K + ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDAL--------LEVRKHLNDRINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|302391027|ref|YP_003826847.1| hypothetical protein Acear_0232 [Acetohalobium arabaticum DSM 5501] gi|302203104|gb|ADL11782.1| Protein of unknown function DUF2134, membrane [Acetohalobium arabaticum DSM 5501] Length = 307 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 12/79 (15%), Positives = 30/79 (37%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQEN 68 + KG++ ++ A+++ V + LVI+ + + +L LD + L + + Sbjct: 3 LINSQKGTVIVVVALMMTVFISFLALVIDIGSLYLERIRLVNTLDAAALAGVQDLPDDSQ 62 Query: 69 GNNGKKQKNDFSYRIIKNI 87 + N+ Sbjct: 63 QAETVALDYASRNGLDNNV 81 >gi|221309837|ref|ZP_03591684.1| hypothetical protein Bsubs1_10701 [Bacillus subtilis subsp. subtilis str. 168] gi|221314159|ref|ZP_03595964.1| hypothetical protein BsubsN3_10622 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221319082|ref|ZP_03600376.1| hypothetical protein BsubsJ_10553 [Bacillus subtilis subsp. subtilis str. JH642] gi|221323355|ref|ZP_03604649.1| hypothetical protein BsubsS_10667 [Bacillus subtilis subsp. subtilis str. SMY] gi|255767454|ref|NP_389820.2| activator of nitric oxide reductase [Bacillus subtilis subsp. subtilis str. 168] gi|264675898|sp|O31849|YOJO_BACSU RecName: Full=Uncharacterized protein yojO; Flags: Precursor gi|225185081|emb|CAB13830.2| putative activator of nitric oxide reductase [Bacillus subtilis subsp. subtilis str. 168] Length = 638 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 62/174 (35%), Gaps = 29/174 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S+I +++D S SM DK+ R I + +KS+ + +V Sbjct: 434 KQEPSSEIDAVFTLLVDCSASM-------FDKMDETKRGIVLFHEALKSVAVPHQIV--- 483 Query: 220 LVTF----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY----NKIFDAKE 271 F + ++ P + + S P + N+ A Sbjct: 484 --GFWEDTNDATEKSQPNYFNT------VIPFQSSLRQDSGPAIMQLEPEEDNRDGYAIR 535 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 ++ + +K++I +DGE ++ + ++ EA++RG V + Sbjct: 536 QMTKKMLHRSEAQKFLIVFSDGEPAAFGYEQNGIVDTSEAVIEARKRGIEVINV 589 >gi|212702918|ref|ZP_03311046.1| hypothetical protein DESPIG_00954 [Desulfovibrio piger ATCC 29098] gi|212673780|gb|EEB34263.1| hypothetical protein DESPIG_00954 [Desulfovibrio piger ATCC 29098] Length = 643 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 41/269 (15%), Positives = 85/269 (31%), Gaps = 35/269 (13%) Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG---LDMM 172 + + + ++ Y MP + + + + S + G + Sbjct: 157 LASEPAEQKGAVAQKRFYLMPILGMNRAFE-GVNLLQVASIDPGSSQSGAVAGPPKTGIA 215 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +V+D S+SM P +D+ R + + L+ K +V V V F S +T Sbjct: 216 LVMDTSISMK----PYIDQSRDIIRQLYDRLEKDKMTDNVGFAV----VAFRSSTEKTPK 267 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD-------YKK 285 L + Q I + + +E A D E D + Sbjct: 268 LGYTSQVISDFATAKDRKALESRLAKVEQATVSSHDFNEDSLAGIYTAIDSLNWGPYSTR 327 Query: 286 YIIFLTD-----GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---------- 330 I+ +TD G++ + + + A+++G + + V+ Sbjct: 328 LILLVTDAGPLRGDDPYAS-QRLGAAEMNDLARQKGIWITTMHVKTPGGSKNHAYAEQAY 386 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLRIG 359 + L + Y N+ +A + G Sbjct: 387 RALSRLSGDQANYQAVNASSHKEAARQFG 415 >gi|170096674|ref|XP_001879557.1| predicted protein [Laccaria bicolor S238N-H82] gi|164645925|gb|EDR10172.1| predicted protein [Laccaria bicolor S238N-H82] Length = 360 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 62/178 (34%), Gaps = 32/178 (17%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN-VVRSGLV 221 S + +D++ + D + S + + A ++I+++ + + +++ +R L+ Sbjct: 9 STAGKSIDIVFLQDATGSQGPY-------IQAARQAIQQICLKVSASAELSKGAIRFALI 61 Query: 222 TFSSK-------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 F + + F +++ ++ LI A + Sbjct: 62 AFRDHPPQDMSFVTKNFGFTAEQSVMKKNLDGLIASGGGDGPEAQTAALADALN------ 115 Query: 275 HIAKGHDDYKKYIIFLTD------GENSSP---NIDNKESLFYCNEAKRRGAIVYAIG 323 + + K ++ +TD GE+ + D + L + RG ++ I Sbjct: 116 --LEWAEGAAKMVVLITDAPPHGIGEDGDGFTESPDQNDPLQLARQMAERGITLFVIA 171 >gi|126343558|ref|XP_001372639.1| PREDICTED: similar to ITI-like protein, partial [Monodelphis domestica] Length = 1002 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 40/260 (15%), Positives = 84/260 (32%), Gaps = 43/260 (16%) Query: 97 RENGFAQDINNIERSTSLSIIIDDQHK-----------DYNLSAVSR--YEMPFIFCTFP 143 ++ + + + + + I ++ + L+ + R ++P Sbjct: 224 YQHAVSVRPGQVVPNLRVEVTISERTGIDYIHVLPLQSSHLLTNIVRGEADLPTSTTIEK 283 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 + + S S I D ++ DV +M D G K +++ +L Sbjct: 284 EKTCARVIFMPSPSEQAAYSSQGITGDFVIQYDV--TMKDVIGDVQTK-----KAMHVIL 336 Query: 204 DIIKSIPDVNNVVRSGLVTFSSKI--------VQTFPLAWGVQHIQEKINRLIFGSTTKS 255 + N +VTFS + +Q P +Q + ++R+ T Sbjct: 337 GDLCPKDHFN------IVTFSDTVHIWKAAGSIQAIPP--NIQRAKAYVSRMKAARWTDM 388 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR 315 L A + + + G + IIFLTDGE ++ L A Sbjct: 389 NAALLAAASILNQS-------IAGPLGEARLIIFLTDGEPTAGVTSPARILANAQRALAG 441 Query: 316 GAIVYAIGVQAEAADQFLKN 335 ++ + + +A L+ Sbjct: 442 QVALFGLALGDDADLPLLRR 461 >gi|309358923|emb|CAP33622.2| CBR-DPY-1 protein [Caenorhabditis briggsae AF16] Length = 1470 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 61/165 (36%), Gaps = 19/165 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S D+F + A + I E + + ++ V+ LV ++ + Sbjct: 857 DILFLLDSS----DNFNEQ--RFHRAIKLIGETVSKFNNF--GSDGVQVSLVQYNDEPYL 908 Query: 230 TFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 F L +H+ + I F + L A K+ + + D + + Sbjct: 909 EFSLRKHNCKKHLLDDIADTEFMTVGCGGSQLTKALEKVSQFAFTKKRGDRP--DAENVL 966 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 I +TDG+++ + AK V I EA QF Sbjct: 967 IIVTDGQSNGRIQEPTRL------AKENNVTVLVI-TTIEADKQF 1004 >gi|305663839|ref|YP_003860127.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304378408|gb|ADM28247.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 443 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 60/173 (34%), Gaps = 33/173 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++LD S SM+ + K+ A + + + I +R F S+ Sbjct: 276 IYVLLDKSGSMDGN------KILWAKATALALF-MKSRIERRPYYIR----FFDSEPYDL 324 Query: 231 FPLAWGVQ-----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + G + + E I + G T + + A N I + + Sbjct: 325 IKVKSGAKPSEVMKLIEYIAMVRNGGGTDISKAIITACNDILRNEVVRDVSD-------- 376 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 II +TDGE+ ++SL AK + I V + LK +S Sbjct: 377 -IIIITDGEDRIAKSLVRKSL---QHAKAK-----LISVMVMGENDDLKQISS 420 >gi|220922039|ref|YP_002497340.1| hypothetical protein Mnod_2052 [Methylobacterium nodulans ORS 2060] gi|219946645|gb|ACL57037.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 418 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 54/175 (30%), Gaps = 12/175 (6%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +R + G I++L ++LLP+ + L I+ S VK +L D + L + Sbjct: 1 MRRLLRDRSGQITVLASLLLPIGLGIAALAIDLSTLQLVKHRLKVAADAASLAAVAVL-- 58 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + + +++R + + + Sbjct: 59 PDTTTALDRALSIAADNAGTGAGTVTAASDVRFGSYNSAAKSFTPGATP-----ANAVQV 113 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 S + P + A + S ++ + +VLD S S Sbjct: 114 TASRNQAHGNPVVLAF----AKALGWSTPDISVSAVAVRFSPAY-CFLVLDPSAS 163 >gi|254296485|ref|ZP_04963941.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157806387|gb|EDO83557.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] Length = 418 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 41/126 (32%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L A N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-SIIIDDQHKDYNLSA 129 + + + F +++ N +++ + I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 130 VSRYEM 135 SR + Sbjct: 135 TSRTGI 140 >gi|268530522|ref|XP_002630387.1| Hypothetical protein CBG04326 [Caenorhabditis briggsae] Length = 1016 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 62/180 (34%), Gaps = 24/180 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++ D+S S++ P + + N+ R G++TF+ ++ Sbjct: 375 LDIIIAFDISESLSRIILPKYVAFAERLVAQYKY--------KANDFTRVGVLTFNDQVT 426 Query: 229 QTFPLAWG--VQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + L G + + I+ + G T T L+ A K Sbjct: 427 EKLTLVNGNDLAAVNAAIDSVQYVGGLTDVTKALKTAQQLFTTES---------DASRSK 477 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRFY 343 +I L+D + ++ + + G + IG ++ + L +P+ + Sbjct: 478 VLIVLSDAVPTVDTYTDE--IQAGKQLSAMGVATFFIGYNHYSDDVKKELGQVTNPNYVF 535 >gi|3273249|dbj|BAA31168.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273271|dbj|BAA31179.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 42.9 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + L FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|255531383|ref|YP_003091755.1| hypothetical protein Phep_1479 [Pedobacter heparinus DSM 2366] gi|255344367|gb|ACU03693.1| conserved hypothetical protein [Pedobacter heparinus DSM 2366] Length = 291 Score = 42.9 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 24/109 (22%), Positives = 44/109 (40%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + +M+++DVS S +FG + + +L + + N + G++ F Sbjct: 72 DEEREMTVMLLVDVSGS--KNFGTQLQLKQELATEVCAVL----AFSAIQNNDKVGVLFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S K+ + P G H I+E I+ T GL Y + I Sbjct: 126 SDKVEKFIPPKKGRSHILMIIRELIDFKPENKGTHVAEGLRYFTSAIKK 174 >gi|282900192|ref|ZP_06308148.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194941|gb|EFA69882.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 232 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 59/185 (31%), Gaps = 17/185 (9%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + V+ + + +++LD S SM D++ + + D + Sbjct: 12 TFTLDEVVEFAENPEPRCPCVLLLDTSGSMQG------DRIEALNQGLLSFKDELVKNTL 65 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAK 270 V +VTF S + + + L T G+ + I + Sbjct: 66 AARRVEVAIVTFDSHVNVVQ----DFVTVDQFTPPILTAQGLTTMGAGINKSLEII---Q 118 Query: 271 EKLEHIAKGHDDYKKYIIFL-TDGENSSPNIDNKE--SLFYCNEAKRRGAIVYAIGVQAE 327 E+ Y + +F+ TDGE + E + + + + +GV+ Sbjct: 119 ERKSQYRANGIAYYRPWVFMITDGEPQGEIDEVIEQATQRLRGDESNKKVAFFTVGVENA 178 Query: 328 AADQF 332 D+ Sbjct: 179 NMDRL 183 >gi|163816348|ref|ZP_02207714.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] gi|158448345|gb|EDP25340.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] Length = 612 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 56/164 (34%), Gaps = 23/164 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS--SKIV 228 ++ ++D S SM D +KL + +S + +++ R +VT++ +V Sbjct: 230 LVFLIDTSGSMYDD-----NKLPLVQQSFAML------AENLDENDRVSIVTYAGEDTVV 278 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + I E ++ + T + AY E E + + +I Sbjct: 279 LSGTPGSEQYTISEALSNMTAEGCTNGGDAIITAY-------ELAEKNFINGGNNR--VI 329 Query: 289 FLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 TDG+ N ++ E K + +G + Sbjct: 330 LATDGDLNVGLTSESDLVDLITEEKKENNIFLSVLGFGTDNLKD 373 >gi|254187133|ref|ZP_04893648.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254263081|ref|ZP_04953946.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|157934816|gb|EDO90486.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254214083|gb|EET03468.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 418 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 41/126 (32%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L A N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-SIIIDDQHKDYNLSA 129 + + + F +++ N +++ + I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 130 VSRYEM 135 SR + Sbjct: 135 TSRTGI 140 >gi|300929295|ref|ZP_07144770.1| von Willebrand factor type A domain protein [Escherichia coli MS 187-1] gi|300462753|gb|EFK26246.1| von Willebrand factor type A domain protein [Escherichia coli MS 187-1] Length = 378 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVIAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|301026447|ref|ZP_07189884.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|299879670|gb|EFI87881.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] Length = 378 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVIAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|254182581|ref|ZP_04889175.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184213116|gb|EDU10159.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 418 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 41/126 (32%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L A N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-SIIIDDQHKDYNLSA 129 + + + F +++ N +++ + I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 130 VSRYEM 135 SR + Sbjct: 135 TSRTGI 140 >gi|192289227|ref|YP_001989832.1| hypothetical protein Rpal_0799 [Rhodopseudomonas palustris TIE-1] gi|192282976|gb|ACE99356.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 468 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 65/182 (35%), Gaps = 21/182 (11%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 MS + F + K +I+++TA+++ I ++G+ ++ + K +L D + + Sbjct: 1 MSDALLSRFVRDRKANIAVITALVMIPIIFLLGMTLDFTQALRKKQQLDAAADAAAIAAV 60 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 + + + Q ++ N L + + +I I D Sbjct: 61 RPAMLMQ--TDAVAQNTAYA-------IFMSTANRLASGLTSVP--------TPTITITD 103 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + Y + N+ + S+ + SS + ++ +++D S S Sbjct: 104 V--GLQRTVKVSYNAASLNNFPQLLMNNVSWAISGASTAQASSAPN--MNFYLLMDDSPS 159 Query: 181 MN 182 M Sbjct: 160 MG 161 >gi|126457247|ref|YP_001076991.1| hypothetical protein BURPS1106A_A2962 [Burkholderia pseudomallei 1106a] gi|242311800|ref|ZP_04810817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|254192526|ref|ZP_04898965.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|126231015|gb|ABN94428.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|169649284|gb|EDS81977.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|242135039|gb|EES21442.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 418 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 41/126 (32%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L A N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-SIIIDDQHKDYNLSA 129 + + + F +++ N +++ + I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 130 VSRYEM 135 SR + Sbjct: 135 TSRTGI 140 >gi|194436110|ref|ZP_03068212.1| von Willebrand factor type A domain protein [Escherichia coli 101-1] gi|253772952|ref|YP_003035783.1| VWA containing CoxE family protein [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|254162135|ref|YP_003045243.1| hypothetical protein ECB_02050 [Escherichia coli B str. REL606] gi|297516876|ref|ZP_06935262.1| hypothetical protein EcolOP_04496 [Escherichia coli OP50] gi|194424838|gb|EDX40823.1| von Willebrand factor type A domain protein [Escherichia coli 101-1] gi|242377762|emb|CAQ32525.1| conserved protein [Escherichia coli BL21(DE3)] gi|253323996|gb|ACT28598.1| VWA containing CoxE family protein [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|253974036|gb|ACT39707.1| hypothetical protein ECB_02050 [Escherichia coli B str. REL606] gi|253978204|gb|ACT43874.1| hypothetical protein ECD_02050 [Escherichia coli BL21(DE3)] gi|323972895|gb|EGB68093.1| VWA domain containing CoxE protein [Escherichia coli TA007] Length = 378 Score = 42.9 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVIAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|325497936|gb|EGC95795.1| hypothetical protein ECD227_2033 [Escherichia fergusonii ECD227] Length = 378 Score = 42.9 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|324114461|gb|EGC08430.1| VWA domain containing CoxE protein [Escherichia fergusonii B253] Length = 378 Score = 42.9 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|323172713|gb|EFZ58347.1| VWA domain containing CoxE-like family protein [Escherichia coli LT-68] Length = 376 Score = 42.9 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 61/191 (31%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHHVKKCVQ---SGIKV--------PGLAALDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|218549535|ref|YP_002383326.1| hypothetical protein EFER_2207 [Escherichia fergusonii ATCC 35469] gi|218357076|emb|CAQ89709.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469] Length = 378 Score = 42.9 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|170743327|ref|YP_001771982.1| hypothetical protein M446_5224 [Methylobacterium sp. 4-46] gi|168197601|gb|ACA19548.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 478 Score = 42.9 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 63/468 (13%), Positives = 110/468 (23%), Gaps = 140/468 (29%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 GS++IL A+ L + ++GL ++ K L DH+ A + Sbjct: 23 GSVNILFALSLIPVLGLVGLAVDYGLAAADKTTL----DHAADTAALAAVVTAKSYIAAN 78 Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 Q N N + + + + + Sbjct: 79 QGQANLTANAIAAGLAQAANVFAVNAGSVPFAQVTLQPP---QLVRSGQTLTATVSYGAT 135 Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN------------ 182 + F L+ +SV S+ LD +++DVS SM Sbjct: 136 IQNSFGKL-----LGTPTTLLGNSVTASADLPSYLDFYLLVDVSGSMGLPATPGGMTQLA 190 Query: 183 -------DHFGPGM----------------DKLGVATRSIREMLDII---KSIPDVNNVV 216 + G K+ + + ++ + + S P V N Sbjct: 191 SVNKDMWSDYQQGCQFACHFPGFTGWGLAAGKIQLRSDAVNAAVCSLIQRASTPAVPNQY 250 Query: 217 RSGLVTF---------------SSKIVQTFPLAW----------GVQHIQEKINRL--IF 249 R G+ F S L+W G + + Sbjct: 251 RVGIYPFINQMATLVGITGSVASLNAAAQCALSWPLAFTNLLDTGTTQLFAYGDPTTGTA 310 Query: 250 GSTTK---STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS---------- 296 T P L+ A D K ++ +TDG + Sbjct: 311 SGGTHFEVVMPQLQAAIKAFGDGS--------SSTSPKPFVFLITDGMQNGQHYGAPANG 362 Query: 297 ---------------SPNIDNKESLFY----CNEAKRRGAIV------YAIGVQAEAADQ 331 D + C K GA + Y + Sbjct: 363 TYAYPGNPSSFWGYADAWWDGSQPSQIDPTVCAGLKSAGATISILSIPYNLITFVNNGGG 422 Query: 332 F-----------------LKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 LK CASP F + + + + + Sbjct: 423 VAWENNRVSGFSPTLATPLKACASPGFFATANTPADITASLNAMFDQA 470 >gi|309364385|emb|CAP25058.2| hypothetical protein CBG_04326 [Caenorhabditis briggsae AF16] Length = 862 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 62/180 (34%), Gaps = 24/180 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LD+++ D+S S++ P + + N+ R G++TF+ ++ Sbjct: 390 LDIIIAFDISESLSRIILPKYVAFAERLVAQYKY--------KANDFTRVGVLTFNDQVT 441 Query: 229 QTFPLAWG--VQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + L G + + I+ + G T T L+ A K Sbjct: 442 EKLTLVNGNDLAAVNAAIDSVQYVGGLTDVTKALKTAQQLFTTES---------DASRSK 492 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRFY 343 +I L+D + ++ + + G + IG ++ + L +P+ + Sbjct: 493 VLIVLSDAVPTVDTYTDE--IQAGKQLSAMGVATFFIGYNHYSDDVKKELGQVTNPNYVF 550 >gi|253996475|ref|YP_003048539.1| von Willebrand factor type A [Methylotenera mobilis JLW8] gi|253983154|gb|ACT48012.1| von Willebrand factor type A [Methylotenera mobilis JLW8] Length = 321 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 67/227 (29%), Gaps = 35/227 (15%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 K K G ++V+D S+SM+ F A +I D Sbjct: 68 GESKQEQKIGKGAQTVLVIDRSVSMDHPFAGQATGGRAAEIKSMAARRLITDFIDSRPDD 127 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLE 274 G+V F++ + + I I + T G+ +FD + Sbjct: 128 MMGVVGFTNSALYGMKITANRDAIHAAIKAATGPALNQTNIGAGITQGV-TLFDNIQSSG 186 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY----------AIGV 324 A +I L+DG + + + + + +Y +I Sbjct: 187 SRA---------VILLSDG---AGKLSPRVKYKISQQLTGKKINLYWIVLREPDDISIFN 234 Query: 325 QAEAADQFLKNCASPDRF----------YSVQNSRKLHDAFLRIGKE 361 A +D+ + + DRF + N L A I + Sbjct: 235 GANYSDEQAPDAINLDRFFKSLNIKYKAFEADNPTTLESALREINAK 281 >gi|213963762|ref|ZP_03392012.1| conserved hypothetical protein [Capnocytophaga sputigena Capno] gi|213953642|gb|EEB64974.1| conserved hypothetical protein [Capnocytophaga sputigena Capno] Length = 288 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 47/145 (32%), Gaps = 26/145 (17%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L +M+++DVS S + FG + + + Sbjct: 58 NVTARYNEPFVKVFEEERELTLMLMIDVSGS--ELFGTE----QQFKSELITEIAATLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLEYAYNK 265 + N ++GL+ FS +I P G H+ I LI T + L+ Sbjct: 112 SALQNNDKTGLILFSDQIELYIPPKKGKSHVLRIIRELIEFQPKSLKTNISEALQ----- 166 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFL 290 KK I+F+ Sbjct: 167 -----------FLSRVSKKKAIVFM 180 >gi|212639262|ref|YP_002315782.1| nitric oxide reductase activation protein [Anoxybacillus flavithermus WK1] gi|212560742|gb|ACJ33797.1| Nitric oxide reductase activation protein [Anoxybacillus flavithermus WK1] Length = 641 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 62/155 (40%), Gaps = 13/155 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++++D S SM DK+ + I + +K++ + +V G ++ + Sbjct: 449 FVLLVDCSASM-------YDKMEETKKGIVLFHESLKALRVPHKIV--GFWEDTNDATSS 499 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + Q + + + L G + N+ A + + ++ +K+++ Sbjct: 500 YQPNY-FQTVIDFGSSLKKGGGAEIMQLEPQEDNRDGLAIRLMTEELQKRNEKQKFLLVF 558 Query: 291 TDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 +DGE ++ D ++ EA++ G V + Sbjct: 559 SDGEPAAFGYDQNGIVDTHEAVTEARKLGIEVVNV 593 >gi|66576258|gb|AAY51689.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 350 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 55/141 (39%), Gaps = 11/141 (7%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE-- 67 ++ +G ++IL A++L V+ ++ L ++ + VKA+L D + L A +++ Sbjct: 10 LHSQRGVVTILFALVLMVLVGLIALAVDLTRLHLVKAELQNAADAAALAGAGSLIDTSLQ 69 Query: 68 --NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS-IIIDDQHKD 124 N + + +F+ + + ++ QD+N + + I Sbjct: 70 TFNWSAATAKAQEFADVNSAD------GKTIGQHRQEQDVNVAIQPGYWNLITPSFTSNT 123 Query: 125 YNLSAVSRYEMPFIFCTFPWC 145 ++ +P + T Sbjct: 124 GLVTHTGDGNIPAVQVTITLS 144 >gi|34099642|gb|AAO23011.1| serum opacity factor [Streptococcus pyogenes] Length = 954 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 69/197 (35%), Gaps = 21/197 (10%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + T L++ D Q + + PF + + +K + D G Sbjct: 131 KEKTELTVKDDKQQLKIRKDVELKNKDPFDVKREVKDNGDGTLDVTLKVMLK---QIDEG 187 Query: 169 LDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 D+M +LDVS M ++F +++ ++ K + N VR L+TF KI Sbjct: 188 ADVMALLDVSQKMTQENFNKAKEQIKRLVTTLTGKSSDGKENHNRRNSVR--LMTFYRKI 245 Query: 228 VQTFPLAWGV--------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + L+ I +K+ + + + A + EK Sbjct: 246 SEPIDLSGKTSDEVEKELNKIWDKVKKEDWDWGVDLQGAIHKAREIFRSSYEKKSGK--- 302 Query: 280 HDDYKKYIIFLTDGENS 296 +++I+ + GE + Sbjct: 303 ----RQHIVLFSQGEPT 315 >gi|332140757|ref|YP_004426495.1| TPR domain protein [Alteromonas macleodii str. 'Deep ecotype'] gi|327550779|gb|AEA97497.1| TPR domain protein [Alteromonas macleodii str. 'Deep ecotype'] Length = 667 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 51/162 (31%), Gaps = 24/162 (14%) Query: 136 PFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVA 195 P + W + + + + ++V+D+SLSM D+L A Sbjct: 64 PIMLLAIAWFVSVIALAGPTWERL-PQPVYQLKMGHVIVIDMSLSMRATDMTP-DRLTRA 121 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----S 251 ++++ I GLV ++ PL +I I L Sbjct: 122 KYKAIDLVNAIGEGE-------MGLVAYAGDAFVISPLTEDAANITTLIPSLSPEIMPVP 174 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + G+E A + +A I ++TDG Sbjct: 175 GSDPLLGIESAAALLTNAGYNSG-----------MIYWITDG 205 >gi|302546412|ref|ZP_07298754.1| subunit D/I family magnesium chelatase [Streptomyces hygroscopicus ATCC 53653] gi|302464030|gb|EFL27123.1| subunit D/I family magnesium chelatase [Streptomyces himastatinicus ATCC 53653] Length = 195 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 26/130 (20%), Positives = 46/130 (35%), Gaps = 18/130 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQ 229 M+ V+D S SM ++G ++ +L + + GLVTF S Sbjct: 1 MLFVVDASGSMA-----ARKRMGAIKGAVLSLL-----LDAYQRRDKVGLVTFRGSDAAL 50 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P V ++ +L G T GL A + ++E + ++ Sbjct: 51 ALPPTSSVDAAAARLEQLPTGGRTPLAAGLLKAREVL-----RVERLRDPSRRP--LLVV 103 Query: 290 LTDGENSSPN 299 +TDG + Sbjct: 104 VTDGRATESK 113 >gi|260824347|ref|XP_002607129.1| hypothetical protein BRAFLDRAFT_118666 [Branchiostoma floridae] gi|229292475|gb|EEN63139.1| hypothetical protein BRAFLDRAFT_118666 [Branchiostoma floridae] Length = 642 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 44/131 (33%), Gaps = 12/131 (9%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 ++ + + T + A + + D + + H + I+ LTDG+ Sbjct: 351 NIRAADTYVTSMAAHGGTNINDAILEA-SVLLDPELRSRHDSHASM-----IVLLTDGQP 404 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-----SPDRFYSVQNSR- 349 + + + ++ ++ +G + + +FL+ A R Y + Sbjct: 405 TGGVTNTNHIIANARDSLAGNHALFCLGFGYDVSFEFLERLALQNGGFARRIYPDDDGEL 464 Query: 350 KLHDAFLRIGK 360 +L F + Sbjct: 465 QLTSFFDEVAD 475 >gi|3169331|gb|AAC17863.1| YojO [Bacillus subtilis subsp. subtilis str. 168] Length = 661 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 62/174 (35%), Gaps = 29/174 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S+I +++D S SM DK+ R I + +KS+ + +V Sbjct: 457 KQEPSSEIDAVFTLLVDCSASM-------FDKMDETKRGIVLFHEALKSVAVPHQIV--- 506 Query: 220 LVTF----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY----NKIFDAKE 271 F + ++ P + + S P + N+ A Sbjct: 507 --GFWEDTNDATEKSQPNYFNT------VIPFQSSLRQDSGPAIMQLEPEEDNRDGYAIR 558 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 ++ + +K++I +DGE ++ + ++ EA++RG V + Sbjct: 559 QMTKKMLHRSEAQKFLIVFSDGEPAAFGYEQNGIVDTSEAVIEARKRGIEVINV 612 >gi|297162409|gb|ADI12121.1| Na-Ca exchanger/integrin-beta4 [Streptomyces bingchenggensis BCW-1] Length = 879 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 55/164 (33%), Gaps = 17/164 (10%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++D + SM P +D + + I + + R + TF K+ Sbjct: 94 DVVLLVDGTRSMQ----PTIDNVQ------GNLDQITDRVREEQPDSRFAVATFGDKVDG 143 Query: 230 T------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 L + ++ ++ L S E N ++ + Sbjct: 144 DRVFTVLQGLTDNLDEVRRGVDGLASDRGLGSPGPAEDWINALWQIGNGAGGKTVFREGA 203 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 ++ + D + P++D+ S N K G V + V E Sbjct: 204 SPIVVLVGDASSHDPSMDHSLSDGI-NALKSAGVRVLGVDVATE 246 >gi|91977980|ref|YP_570639.1| TadE-like [Rhodopseudomonas palustris BisB5] gi|91684436|gb|ABE40738.1| TadE-like [Rhodopseudomonas palustris BisB5] Length = 181 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 51/153 (33%), Gaps = 13/153 (8%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYIL-DHSLLYTATKIL 64 +R F N +GS +I A++ P+ F ++ +IE + FF L + D + L Sbjct: 13 MRRFGRNRRGSAAIEFALIAPIFFGLLFAIIEVALMFFAGQVLETAVQDSARL------- 65 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 G+ Q FS ++ +G D+ + S I + Sbjct: 66 ----ILTGQAQGGSFSQSKFRDDVCGRLGGLFTCSGVFVDVQSYGTDFSKVDISPPIDAN 121 Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITS 157 N +Y P PL +T Sbjct: 122 KNFVDNMKYA-PGQAGDVVVVRAFYQWPLFVTG 153 >gi|198420538|ref|XP_002128804.1| PREDICTED: similar to hedgling [Ciona intestinalis] Length = 420 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 71/216 (32%), Gaps = 28/216 (12%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIRE 201 P A + + D++M+LD S S+ F + ++ A + Sbjct: 174 PVGAGRKRRQVSSGFPIV-----QTKRDVLMLLDESGSVGYRRFHSKVKRIAAAI--VEV 226 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAW---GVQHIQEKIN-------RLIFGS 251 + + I P R + +F L + I Sbjct: 227 LCNDIAVAPHK---TRVAVTSFDQITHDHIRLKDFYPSPAALAGYIRSPYNIRYSYYTSR 283 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS--SPNIDNKESLF-Y 308 T L +AY ++ A ++ + + II +TDG + + ++ L Sbjct: 284 RTCLVDALIHAYVEM-SATINGGRQSQNNVEQD--IILITDGCANCHHYGVSTEQVLQNL 340 Query: 309 CNEAKRRGAIVYAIGVQAE-AADQFLKNCASPDRFY 343 + ++G +Y IGV + A Q L+ A R Y Sbjct: 341 ADFFVQQGMHIYVIGVGLQHACRQKLRILAQGGRCY 376 >gi|321311581|ref|YP_004203868.1| von Willebrand factor type A [Bacillus subtilis BSn5] gi|320017855|gb|ADV92841.1| von Willebrand factor type A [Bacillus subtilis BSn5] Length = 638 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 62/174 (35%), Gaps = 29/174 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S+I +++D S SM DK+ R I + +KS+ + +V Sbjct: 434 KQEPSSEIDAVFTLLVDCSASM-------FDKMDETKRGIVLFHEALKSVAVPHQIV--- 483 Query: 220 LVTF----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY----NKIFDAKE 271 F + ++ P + + S P + N+ A Sbjct: 484 --GFWEDTNDATEKSQPNYFNT------VIPFQSSLRQDSGPAIMQLEPEEDNRDGYAIR 535 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 ++ + +K++I +DGE ++ + ++ EA++RG V + Sbjct: 536 QMTKKMLHRSEAQKFLIVFSDGEPAAFGYEQNGIVDTSEAVIEARKRGIEVINV 589 >gi|237507530|ref|ZP_04520245.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] gi|234999735|gb|EEP49159.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] Length = 418 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 41/126 (32%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L A N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-SIIIDDQHKDYNLSA 129 + + + F +++ N +++ + I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 130 VSRYEM 135 SR + Sbjct: 135 TSRTGI 140 >gi|260820612|ref|XP_002605628.1| hypothetical protein BRAFLDRAFT_150512 [Branchiostoma floridae] gi|229290963|gb|EEN61638.1| hypothetical protein BRAFLDRAFT_150512 [Branchiostoma floridae] Length = 168 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 53/135 (39%), Gaps = 16/135 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + +D+ V+D S S+ + + + +R+M+D R G+V F+ Sbjct: 1 NTPVDLAFVIDGSASVGPL------QFEKSKKFVRDMVDGFNIGAAQ---TRVGVVQFAW 51 Query: 226 KIVQTFPLAW--GVQHIQEKINRLIF--GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + F L ++ I R+ + G T+ L + ++F L Sbjct: 52 MVQAEFNLGDYLDGTDLRNAIARIRYMDGPGTEIGKALVFTKRRLFSE---LYGARPETQ 108 Query: 282 DYKKYIIFLTDGENS 296 D + +I +TDG +S Sbjct: 109 DVPRIVILITDGRSS 123 >gi|126442905|ref|YP_001064078.1| hypothetical protein BURPS668_A3087 [Burkholderia pseudomallei 668] gi|126222396|gb|ABN85901.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 418 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 41/126 (32%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L A N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-SIIIDDQHKDYNLSA 129 + + + F +++ N +++ + I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 130 VSRYEM 135 SR + Sbjct: 135 TSRTGI 140 >gi|225174961|ref|ZP_03728958.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] gi|225169601|gb|EEG78398.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] Length = 357 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 10/64 (15%), Positives = 26/64 (40%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 R F + +G++ ++ A+ L + +VI+ + + + D L A ++ Sbjct: 9 RFRYLFKDERGNVLVIFAVALIALLGFAAIVIDVGGMYVERRSMVTAADAGALAGARELA 68 Query: 65 NQEN 68 + Sbjct: 69 ESGD 72 >gi|14248665|gb|AAK57618.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYKNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E +T T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGSTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|226194158|ref|ZP_03789758.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|225933851|gb|EEH29838.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 418 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 41/126 (32%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L A N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-SIIIDDQHKDYNLSA 129 + + + F +++ N +++ + I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 130 VSRYEM 135 SR + Sbjct: 135 TSRTGI 140 >gi|14248701|gb|AAK57636.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNLTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|330816722|ref|YP_004360427.1| Membrane protein [Burkholderia gladioli BSR3] gi|327369115|gb|AEA60471.1| Membrane protein [Burkholderia gladioli BSR3] Length = 622 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 19/127 (14%), Positives = 41/127 (32%), Gaps = 16/127 (12%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSL-----------LYT 59 +GS +++ A+ L VI + G V++ + + + L I D + + Sbjct: 22 RRERGSFAVMAAVFLVVIAAIFG-VLDVGNTYLQRRDLQQIADMAAAAGVQRVDNLCVQA 80 Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKN---IWQTDFRNELRENGFAQDINNIE-RSTSLS 115 T N N + D L + D N ++ + ++ Sbjct: 81 PTSATNSATVNGLNTSQGDTIAVTCGRWDPTVNPAPSYYLANTNTSGDPNRLQLNAVQVN 140 Query: 116 IIIDDQH 122 + +H Sbjct: 141 VTRQVRH 147 >gi|134281810|ref|ZP_01768517.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246872|gb|EBA46959.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 418 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 41/126 (32%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L A N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-SIIIDDQHKDYNLSA 129 + + + F +++ N +++ + I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 130 VSRYEM 135 SR + Sbjct: 135 TSRTGI 140 >gi|21228942|ref|NP_634864.1| hypothetical protein MM_2840 [Methanosarcina mazei Go1] gi|20907478|gb|AAM32536.1| hypothetical protein MM_2840 [Methanosarcina mazei Go1] Length = 548 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 67/184 (36%), Gaps = 32/184 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQ 229 M+ ++D S SM G A ++ +++ + + ++ F S+ Sbjct: 369 MIALVDTSGSM---HGTPQTLAKSAVLAMAKLM-----LSQQRD---MKVILFASTSQHL 417 Query: 230 TFPLAWGVQHIQEKINRL--IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 L+ + + +N L FG T L + + + + Sbjct: 418 EIELSNRKKMSERFLNFLLYTFGGGTDFNTALASGLKSLKEKDFQGAD-----------L 466 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAADQFLKNCASPDRFYSVQ 346 +F+TDG++ I ++ L EAK++ A VY++ V A + D Y V+ Sbjct: 467 LFITDGKS---EISDELVLARWEEAKKKYNAKVYSLIVGGSGAGGLSQI---SDYTYIVE 520 Query: 347 NSRK 350 + Sbjct: 521 MEQD 524 >gi|85706962|ref|ZP_01038052.1| NorD Nitric oxide reductase activation protein [Roseovarius sp. 217] gi|85668573|gb|EAQ23444.1| NorD Nitric oxide reductase activation protein [Roseovarius sp. 217] Length = 627 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 74/218 (33%), Gaps = 30/218 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + ++D L + ++D S S G + + + R G Sbjct: 427 QSARQTDRDLSVAFLIDTSRSTEAAIG------DTSVIEVAREAMAALAAGINAAGDRLG 480 Query: 220 LVTFS----SKIVQTFPLAWGVQ---HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + FS ++ + +G Q I I L G T+ + + ++ Sbjct: 481 IWGFSSLRRDRVFLSRCKRFGDQMSPEIIANIGALKPGHYTRLGAAIRHVSTQLAAEP-- 538 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 +K +I LTDG+ + ++S EA++ G ++ I + + Sbjct: 539 ---------SARKLLIVLTDGKPNDLDHYEGRYGIEDSHMAVREARKAGHSLHGIIIDED 589 Query: 328 AADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 D F + F + N +L A I + + ++ Sbjct: 590 GQDWFARIFGRGG-FSLLPNPERLTRALPDIYRTLTQE 626 >gi|67468139|ref|XP_650129.1| elongation factor-2 kinase [Entamoeba histolytica HM-1:IMSS] gi|56466695|gb|EAL44742.1| elongation factor-2 kinase, putative [Entamoeba histolytica HM-1:IMSS] Length = 402 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 67/207 (32%), Gaps = 34/207 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI- 227 LD+++++D + SM + + +I +R +V + Sbjct: 5 LDLVLLVDTTGSMGQYLHSAQSNINKIVNTITN---------SEKIDLRFAIVEYKDHQP 55 Query: 228 -VQTFPLA-----WGVQHIQEKINRLIFGSTTKSTP-----GLEYAYNKIFDAKEKLEHI 276 Q F L ++ IQ IN+L P + A N + I Sbjct: 56 NQQQFALKKYDWMNDIKDIQNAINQLSAYGGGMDGPESVTCAFDCAVNLGYRGYAAKVII 115 Query: 277 AKGHDDYKKYIIFLTDGENSS--PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA----- 329 + I DG + ID +E + +A + +Y++ + Sbjct: 116 WIADAPPHGFNIQY-DGYPNGCPCGIDFQEVVL---KAIKNDIQIYSVACEPIRPIYRHF 171 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDA 354 ++ A + +F ++ ++ L D Sbjct: 172 RDLMRAVAMMTGGQFIALNSADCLADV 198 >gi|308501637|ref|XP_003113003.1| hypothetical protein CRE_25217 [Caenorhabditis remanei] gi|308265304|gb|EFP09257.1| hypothetical protein CRE_25217 [Caenorhabditis remanei] Length = 376 Score = 42.9 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 67/193 (34%), Gaps = 25/193 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQ 229 +M+V+D S M P + V ++++ L+ + N + + GL+T + + Sbjct: 65 VMIVIDCSRFMTSKAMPP-SRFVVVMKALQSFLE---RFFEQNPIAQIGLITCKDRKAER 120 Query: 230 TFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + ++ ++E +N L G L+ A + ++ Sbjct: 121 MTMMTGNIRVLKESLNSLTEAFCGGDFSLQNALQLACANLKGM---------PGHVSREV 171 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCASPDRFYS 344 +I + + +ID KR AIG+ AE + K A+ + Sbjct: 172 VIVMA----ALSSIDPGNIFSTIESMKRMNIRCSAIGLSAEMFICKEMAK--ATKGEYSV 225 Query: 345 VQNSRKLHDAFLR 357 + L F + Sbjct: 226 ALDPDHLQLLFSK 238 >gi|262203982|ref|YP_003275190.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262087329|gb|ACY23297.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 461 Score = 42.9 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 76/239 (31%), Gaps = 52/239 (21%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + A ++ + ++ ++++LD S SM PG ++ A Sbjct: 11 LSAVSMMLAACTTSDGTPTGAGGLSTAADKPTTPVVVILDGSESMQIADAPG-PRIDAAR 69 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTF-SSKIVQTFPLAWGVQHIQ-------------- 241 ++ + + S GLV + +++ +T P A G + + Sbjct: 70 NAVSTFISDLTSGTPF------GLVAYGNTESAKTTPQAVGCEDVSTLARLGPIDKEAAR 123 Query: 242 EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 I+ + T + L A + ++ ++DGE + Sbjct: 124 SAIDGVRAQGWTPLSAALTRAAEMLGTEAGS--------------VVLVSDGEANCLPD- 168 Query: 302 NKESLFYCNEAKRR-----GAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHD 353 C A+ + +G +++AA L+ A F + N+ +L Sbjct: 169 ------PCATARSLREQNPNLTISTVGFKSDAAQ--LQCVAREGGGVFVTADNTAQLSA 219 >gi|6202031|gb|AAD42207.2|AF139753_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 452 Score = 42.9 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 29/136 (21%), Positives = 56/136 (41%), Gaps = 6/136 (4%) Query: 162 SSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + D G D+M +LDVS M D F DK+ ++ + + N VR L Sbjct: 193 PKEIDEGADVMALLDVSKKMTEDDFNNAKDKIKKLVTTLTSKSPDSQPNHNARNSVR--L 250 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 +TF K+ L ++ K++ + + G++ I A+E + Sbjct: 251 MTFYRKVNDPIEL--NADNVDAKLDEVWKKAKEDWDWGVDL-QGAIHKAREIFNKGKRKK 307 Query: 281 DDYKKYIIFLTDGENS 296 +++I+ + GE++ Sbjct: 308 SGKRQHIVLFSQGEST 323 >gi|74136087|ref|NP_001027973.1| complement factor B-1 [Ciona intestinalis] gi|58531104|dbj|BAD89299.1| complement factor B-1 [Ciona intestinalis] Length = 999 Score = 42.9 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 63/192 (32%), Gaps = 24/192 (12%) Query: 146 ANSSHAPLLITSSV--KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 S H + I++ + G ++ ++D S S+ D L + + ++++ Sbjct: 405 RISVHMSTPSSGVGGRSITADHEGGNEIYFLIDFSRSVTDE------ALDHSLKFAQKLV 458 Query: 204 DIIKSIPDVNNVVRSGLVTFSSK----IVQTFPLAWGVQHIQEKINRL---------IFG 250 N G++ F+S + I + G Sbjct: 459 TRFSG--GTNKTAHYGVIIFASHSKVVLDSRKNQKLNSTKIIAYFKGIFKDKKAMRDTVG 516 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T + L+ + + I G ++ +++ LTDG+++ K N Sbjct: 517 GGTNTGAALQDLRKMLGISYRAD-RINNGGNNRQRHCFILTDGKSNEGENPVKMVKRMEN 575 Query: 311 EAKRRGAIVYAI 322 + K+ Y+I Sbjct: 576 DFKQNPPQFYSI 587 >gi|39936736|ref|NP_949012.1| hypothetical protein RPA3674 [Rhodopseudomonas palustris CGA009] gi|192292562|ref|YP_001993167.1| TadE family protein [Rhodopseudomonas palustris TIE-1] gi|39650592|emb|CAE29115.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286311|gb|ACF02692.1| TadE family protein [Rhodopseudomonas palustris TIE-1] Length = 177 Score = 42.9 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 45/158 (28%), Gaps = 11/158 (6%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F N KGS ++ AI+ P+ F ++ +IE + FF L + + Sbjct: 13 RKFRRNRKGSAAVEFAIVAPIFFALLFAIIEVAMIFFASQVLETAVQD----------SS 62 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + Q + K + + D+ N I + Sbjct: 63 RLIFTRQAQDASMTQDQFKTEVCKRLISLFDCSIVRVDVQNYGSDFGTVSITTPIDSNKK 122 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 +Y + PL +T +S Sbjct: 123 FVDNMQYNI-GKAGDIIVVRAFYQWPLFVTGLGFDTSN 159 >gi|290961436|ref|YP_003492618.1| chelatase [Streptomyces scabiei 87.22] gi|260650962|emb|CBG74080.1| putative chelatase [Streptomyces scabiei 87.22] Length = 684 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 49/140 (35%), Gaps = 18/140 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++ ++ +L + + G Sbjct: 486 QATREGREGNLVLFVVDASGSMA-----ARQRMSAVKGAVLSLL-----LDAYQRRDKVG 535 Query: 220 LVTFSS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LVTF P V ++ L G T GL A++ + ++E + Sbjct: 536 LVTFRGASAEVALPPTSSVDAAAARLESLPTGGRTPLAAGLLKAHDVL-----RVERLRD 590 Query: 279 GHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 591 PARRP--LVVVVTDGRATGG 608 >gi|218130608|ref|ZP_03459412.1| hypothetical protein BACEGG_02197 [Bacteroides eggerthii DSM 20697] gi|217986952|gb|EEC53283.1| hypothetical protein BACEGG_02197 [Bacteroides eggerthii DSM 20697] Length = 247 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 64/204 (31%), Gaps = 27/204 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS---- 224 + + V+D S SM G + + A ++ MLD I S + + ++ + FS Sbjct: 15 MTLFFVIDTSGSMA---GNKIGAVNDAVENVLPMLDEI-SASNPDAEIKVAALEFSSGCN 70 Query: 225 ---SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + W + T L A ++ + + Sbjct: 71 WLYDEPKLASEFVW---------QDVTASGLT----SLGAACQELNTKLSRNGFMQTPSG 117 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-KNCASPD 340 + II L+DG + L N K + AI + +A L + + + Sbjct: 118 SFAPAIILLSDGGPTDDFYGGLSKLKANNWFKNA-IKI-AIAIGDDADKDVLTQFTGTNE 175 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 ++V N L + + Sbjct: 176 AVFTVHNIDALKQIIRVVAVTSSQ 199 >gi|58429519|gb|AAW78163.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 565 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 71/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG +S Sbjct: 118 IKSLLSTNLPFGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPNSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429497|gb|AAW78152.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 71/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG +S Sbjct: 118 IKSLLSTNLPFGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPNSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429469|gb|AAW78138.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 71/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG +S Sbjct: 118 IKSLLSTNLPFGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPNSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|296123244|ref|YP_003631022.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296015584|gb|ADG68823.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 365 Score = 42.5 bits (98), Expect = 0.094, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 58/188 (30%), Gaps = 46/188 (24%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D G ++ V+D S SM + + VA ++ L + + ++ ++ Sbjct: 191 KDQGSRVVFVIDCSGSM-----TNYNAMRVAKTALVSSLQALDTGQQFQ------IIFYN 239 Query: 225 SKIVQTFPLA-------W-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + W ++I+ + T+ P L+ A Sbjct: 240 DSPTFLKGTSRDGKASLWFATEINKTLATQQISAVQPDRGTQHLPALKLALKF------- 292 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--- 329 + I FLTD + ++ L N+ + R ++ I Sbjct: 293 ----------SPEVIYFLTDADEPELTSIERKELIRLNQGRSR---IHTIEFGQGPELKT 339 Query: 330 DQFLKNCA 337 + FLK A Sbjct: 340 ENFLKKVA 347 >gi|218779582|ref|YP_002430900.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218760966|gb|ACL03432.1| Putative uncharacterized protein (AssF1) [Desulfatibacillum alkenivorans AK-01] Length = 570 Score = 42.5 bits (98), Expect = 0.094, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 53/164 (32%), Gaps = 34/164 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--SSKIV 228 + ++LD + SM K + ++ M + + R G + I Sbjct: 389 VTLLLDATGSMRG------GKWRMVENTVGNMHKALSGSQN-----RLGAWAYFEMDGIC 437 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 L G + + T S + A + K + K +I Sbjct: 438 MMSRLISGRNLLS-----VPPSGQTASGQAIIAAAYFMPKDKRR------------KLLI 480 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +TDGE++ +D + YC +++ + IG + + Sbjct: 481 HVTDGESNFG-VDASCGIDYC---RQQNINLVTIGCGVKDRSRM 520 >gi|58429465|gb|AAW78136.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 42.5 bits (98), Expect = 0.094, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + S+ + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYSEEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 GSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|115522260|ref|YP_779171.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] gi|115516207|gb|ABJ04191.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] Length = 372 Score = 42.5 bits (98), Expect = 0.094, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 47/148 (31%), Gaps = 20/148 (13%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M + PL S +++ VLD + SM+ K+ Sbjct: 1 MRSFVTLRGLALATLAVPLAFASPALAKP----VVEVAFVLDTTGSMSGLIEGAKRKIWS 56 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTF----SSKIVQTFPLAWGVQHIQEKINRLIFG 250 SI + + + V+ GLV + + +TF L +Q + + + Sbjct: 57 IATSIVD--------ENPDAEVKLGLVAYRDIGDDYVTRTFDLTTDIQDLYANLLEMKAR 108 Query: 251 STTK----STPGLEYAYNKIFDAKEKLE 274 L+ A NK+ ++ K Sbjct: 109 GGGDWPESVNEALDIAVNKLHWSQGKES 136 >gi|322435085|ref|YP_004217297.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX9] gi|321162812|gb|ADW68517.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX9] Length = 794 Score = 42.5 bits (98), Expect = 0.095, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 61/196 (31%), Gaps = 42/196 (21%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 ++ + +++++LD SLSM DKL + ++ +L ++ R L Sbjct: 294 ATTHASEPRNVILLLDTSLSMQW------DKLERSYAALETVLRSLQPTD------RFSL 341 Query: 221 VTFSS-------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + F+ Q P + VQ + I T L A + Sbjct: 342 MLFNQDLSWFRPDPTQATPES--VQEALQFIRASRLRGGTDLGKALAAALTQAKS----- 394 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC-----NEAKRRGAIVYAIGVQAEA 328 + + TDG + + N + V+AIG +A Sbjct: 395 ---------PNQSLYLFTDGNSDRGVTILDNKIAAAYTQQWNHSTHPRTNVFAIG--DDA 443 Query: 329 ADQFLKNCASPDRFYS 344 L+ A D + Sbjct: 444 NLPLLRLLARNDGLFE 459 >gi|153006973|ref|YP_001381298.1| vault protein inter-alpha-trypsin subunit [Anaeromyxobacter sp. Fw109-5] gi|152030546|gb|ABS28314.1| Vault protein inter-alpha-trypsin domain protein [Anaeromyxobacter sp. Fw109-5] Length = 1362 Score = 42.5 bits (98), Expect = 0.095, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 74/225 (32%), Gaps = 34/225 (15%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 +KD+ L+ +P L + ++ +++ ++D S SM Sbjct: 332 NKDFVLTWRPAGVVPGAHALVQREKGEDFLMLFVQPPAGVAPALVRPKELVFLVDKSGSM 391 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF--SSKIVQTFPLAWGVQH 239 G D++ + + + +V F S++ + PL Sbjct: 392 ---MGAPFDRVRALVARALDAMGPDDTFQ---------VVAFDGSAQAMSEAPLPATPSA 439 Query: 240 I---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 I +E + L G T+ G+ A + D + + ++F TDG Sbjct: 440 IARAKEWLASLEGGGGTEMLEGVRAALSPPEDPRRL------------RMVVFCTDGF-- 485 Query: 297 SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 I N+ + EA R A V+ G+ + ++ R Sbjct: 486 ---IGNEPEIIEAVEALRGRARVFGFGIGSSVNRYLVEGVGRAGR 527 >gi|60219503|emb|CAI56763.1| hypothetical protein [Homo sapiens] Length = 860 Score = 42.5 bits (98), Expect = 0.095, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN G L +A ++ L +++ + R VT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFL-KLRARDPASRGDRYMPVTYDEPPY-C 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L ++ N++ + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 +I +TDG Sbjct: 122 PSILITITDG 131 >gi|74316573|ref|YP_314313.1| putative nitric oxide reductase activation protein [Thiobacillus denitrificans ATCC 25259] gi|74056068|gb|AAZ96508.1| putative nitric oxide reductase activation protein [Thiobacillus denitrificans ATCC 25259] Length = 614 Score = 42.5 bits (98), Expect = 0.095, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 65/183 (35%), Gaps = 28/183 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S K L +++ D+SLS + + V S+ + + + D R L Sbjct: 415 SRKQQRDLSCLLLADLSLSTDAWVNNSARVIEVIRDSLLLFGEALAATGD-----RFALY 469 Query: 222 TFSS--KIVQTFPLAWGVQ-----HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 FSS + F L G I+ ++ L G T+ + +A + K Sbjct: 470 GFSSVRRDNVRFHLLKGFDERYDDGIRGRLATLKPGYYTRMGAAIRHAAAILATRKSSR- 528 Query: 275 HIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + ++ LTDG+ + +++ EA+R+G + + + A Sbjct: 529 ----------RLLLILTDGKPNDLDKYEGRYGIEDTRMAILEARRQGLTPFCVTIDEAAG 578 Query: 330 DQF 332 D Sbjct: 579 DYL 581 >gi|315186710|gb|EFU20468.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 289 Score = 42.5 bits (98), Expect = 0.096, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 39/117 (33%), Gaps = 10/117 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + + L + +V+DVS SM + E+L I S+ Sbjct: 58 NVTSRFGEPHTKVFREERELVLFLVVDVSRSMRSGSNQY------SKFDCLEILFSIFSL 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYA 262 + N R G + F+ ++ P G H+ + R G + L A Sbjct: 112 VTLENNDRVGALFFTDEVEDVIPPRKGKTHVLALLRRFRDMRPRGRGSDLALALRTA 168 >gi|218672134|ref|ZP_03521803.1| hypothetical protein RetlG_11055 [Rhizobium etli GR56] Length = 125 Score = 42.5 bits (98), Expect = 0.096, Method: Composition-based stats. Identities = 16/110 (14%), Positives = 35/110 (31%), Gaps = 29/110 (26%) Query: 278 KGHDDYKKYIIFLTDG------------ENSSPNIDNKESLFYCNEAKRRGAIV---YAI 322 + +K + F++DG + + +C K RG + Y Sbjct: 4 TSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTT 63 Query: 323 GVQAEAADQF--------------LKNCASPDRFYSVQNSRKLHDAFLRI 358 + + + ++ CASP ++ V + + DA + Sbjct: 64 YLPLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTEGITDAMKAL 113 >gi|217958211|ref|YP_002336755.1| hypothetical protein BCAH187_A0752 [Bacillus cereus AH187] gi|229137422|ref|ZP_04266033.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-ST26] gi|217065519|gb|ACJ79769.1| conserved hypothetical protein [Bacillus cereus AH187] gi|228645980|gb|EEL02203.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-ST26] Length = 627 Score = 42.5 bits (98), Expect = 0.096, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKGPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|169627635|ref|YP_001701284.1| hypothetical protein MAB_0531c [Mycobacterium abscessus ATCC 19977] gi|169239602|emb|CAM60630.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 522 Score = 42.5 bits (98), Expect = 0.097, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 67/186 (36%), Gaps = 25/186 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M+ VLDVS SM GP ++G+ +++I +P + GL FS Sbjct: 319 MLAVLDVSGSMKFSAGP-TTRVGLLSQAIDN------GLPLFPENAQIGLWAFSIDKGGP 371 Query: 231 -------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P+ + + +K R + L ++D + D Sbjct: 372 GQDWKELLPIRTLGEKVGDKTQRQLLADEGHGLDALVGGGTGLYDTARAAFRKVQSTYDP 431 Query: 284 KKY--IIFLTDGENSSPNIDNKESLFYCNEAKR-----RGAIVYAIGVQAEAADQFLKNC 336 ++ +TDG N PN + E L K+ R ++ +G+ +A LK Sbjct: 432 HYINSVVIITDGSNEDPNGISLEQLLA--TLKKEQDPARPVVLITLGITEDADATVLKQI 489 Query: 337 --ASPD 340 A+P Sbjct: 490 SDATPG 495 >gi|168216759|ref|ZP_02642384.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] gi|182381207|gb|EDT78686.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] Length = 1341 Score = 42.5 bits (98), Expect = 0.097, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 64/183 (34%), Gaps = 26/183 (14%) Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDH 184 Y + ++ + + + + I K ++ D+++VLD S ++N+ Sbjct: 29 YGNTYKEQFTINANYNLSELKHGETEDIKYTITPKPIKFKDNVEKDVVLVLDTSQTVNES 88 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 +G D A + +L K ++ G+V ++ K I I Sbjct: 89 YG---DISTAAQTFVTNILG--KQYKNLK----VGVVCYNEKAEILHGFDNSTYSINNDI 139 Query: 245 NR------LIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + G+ T A + + + + +K +IF+++G+ + Sbjct: 140 KKCYQNYNIKSGNHLGTNVGDAFRLAISMLGKDS---------NPNKEKIVIFMSNGKPN 190 Query: 297 SPN 299 + Sbjct: 191 AYT 193 >gi|148725344|emb|CAI12059.2| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta 3 subunit (CACNA2D3) [Danio rerio] Length = 742 Score = 42.5 bits (98), Expect = 0.097, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 48/137 (35%), Gaps = 26/137 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++ +D+S SM +L +A +I +LD + VN ++ +S + Sbjct: 215 DIIIAVDISGSMKGL------RLTIAKHTINTILDTLGENDFVN------VIAYSDYVQY 262 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L K ++ ++ + +A Sbjct: 263 VEPCFKGTLVQADLDNREHFKLLVQELQVKGEGKVKKAMKESFKILNEAA-----AEGRG 317 Query: 281 DDYKKYIIFLTDGENSS 297 + I+ +TDG Sbjct: 318 SLCNQAIMLITDGAMED 334 >gi|47229095|emb|CAG03847.1| unnamed protein product [Tetraodon nigroviridis] Length = 608 Score = 42.5 bits (98), Expect = 0.097, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 74/236 (31%), Gaps = 68/236 (28%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ V+DVS SM K+ +++ +L ++ N ++TFS ++ Sbjct: 249 DVIFVIDVSGSMIG------TKIQQTKQAMSTILADLREGDHFN------IITFSDQVR- 295 Query: 230 TFPLAW-----------GVQHIQEKINRLIFGS--------------------------- 251 W V+ +E + R+I Sbjct: 296 ----TWKRGRTVRATRQNVRDAKEFVRRIIAEGCESEATEHHLTASLCLFLLLYEFSFSF 351 Query: 252 --TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T L A ++ + H++ +IFLTDGE + L Sbjct: 352 PSGTNINAALLSA-AQLINPPSSSRHLSSHRVPL---VIFLTDGEATIGVTAGDTILTNA 407 Query: 310 NEAKRRGAIVYAIGVQAEAADQFLKNCASPDR------FYSVQNSRKLHDAFLRIG 359 +A A ++ + +A LK A +R + + +L + + Sbjct: 408 KKALGS-ASLFGLAFGDDADFLLLKRLALDNRGVARMVYEDADAALQLKGFYDEVA 462 >gi|319782171|ref|YP_004141647.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168059|gb|ADV11597.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 509 Score = 42.5 bits (98), Expect = 0.098, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 25/58 (43%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 IR F+ + +G IL +I LP I L I+ S + L D L A ++ Sbjct: 5 IRAFWNDQRGIALILVSITLPAIIGFSLLAIDMSRINNLHNDLQKGADSFALAAAAEL 62 >gi|58429471|gb|AAW78139.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 42.5 bits (98), Expect = 0.098, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 63/181 (34%), Gaps = 27/181 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ +++D S S+ H ++ + +I+ + +N + + FS+ Sbjct: 47 VDLYLLMDCSGSIRRH--------NWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAR 98 Query: 229 QTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L I+ ++ + T T L + D Sbjct: 99 EIIRLHSDASKNKEKALSIIKSLLSTNLPFGRTNLTDALLQVRKHLND--------RINR 150 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASP 339 ++ + ++ LTDG S KES + + V+ IG A ++FL C Sbjct: 151 ENANQLVVILTDGIPDSIQDSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPS 208 Query: 340 D 340 D Sbjct: 209 D 209 >gi|307595146|ref|YP_003901463.1| hypothetical protein Vdis_1023 [Vulcanisaeta distributa DSM 14429] gi|307550347|gb|ADN50412.1| conserved hypothetical protein [Vulcanisaeta distributa DSM 14429] Length = 441 Score = 42.5 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 78/227 (34%), Gaps = 26/227 (11%) Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV 159 ++ +E + + K ++ R +P + L + + + Sbjct: 208 DTRYTVDVVEDVDAERVGTVGGVKRMTRASELRDMIPSERMLMKFAKPVFAYKLAMGNVL 267 Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR-SIREMLDIIKSIPDVNNVVRS 218 ++ + M++D S SM + ++G ++ + L +I ++ N VR Sbjct: 268 VRERRAIKKPKIYMLIDKSGSMFYTVNINIFEVGAISKITWATALAVILAMKGGNLAVR- 326 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 F ++ P+ I + + LI T T + A D ++ Sbjct: 327 ---FFDQQVY---PMLTNKNDIIKMLLSLIPLGGTDITNAVRAAVQDAVDKPSLRDYK-- 378 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR--RGAIVYAIG 323 ++ +TDGE+ N+D N+AK R V IG Sbjct: 379 --------LVVITDGEDD--NVDPTVF----NKAKSIFRSVKVLLIG 411 >gi|304393682|ref|ZP_07375610.1| magnesium-chelatase 60 kDa subunit [Ahrensia sp. R2A130] gi|303294689|gb|EFL89061.1| magnesium-chelatase 60 kDa subunit [Ahrensia sp. R2A130] Length = 595 Score = 42.5 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 42/134 (31%), Gaps = 21/134 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K ++ +D S S +++L A ++ +L GL++F Sbjct: 409 KEKSASSVIFAVDASGS------TALNRLAEAKGAVELLL-----GESYARRDHVGLISF 457 Query: 224 SSKIVQTF-PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + + + + L G T L A +A + + Sbjct: 458 RGQTAEMLLAPTRSLLRAKRALAALPGGGGTPLADALRLA----AEASRDEQDKGRTPT- 512 Query: 283 YKKYIIFLTDGENS 296 I+ +TDG + Sbjct: 513 ----IVIMTDGSAN 522 >gi|282897191|ref|ZP_06305193.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281197843|gb|EFA72737.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 232 Score = 42.5 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 58/185 (31%), Gaps = 17/185 (9%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 + V+ + + +++LD S SM D++ + + D + Sbjct: 12 TFTLDEVVEFAENPEPRCPCVLLLDTSGSMQG------DRIEALNQGLLSFKDELVKNTL 65 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAK 270 V +VTF S + + + L T G+ A I + Sbjct: 66 AARRVEVAIVTFDSYVSVVQ----DFVTVDQFTPPILTAQGLTTMGAGINKALEII---Q 118 Query: 271 EKLEHIAKGHDDYKKYIIFL-TDGENSSP--NIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 E+ Y + +F+ TDGE + + + + + + +GV+ Sbjct: 119 ERKSQYRANGIAYYRPWVFMITDGEPQGEIDEVIEEATQRLRGDEFNKKVAFFTVGVENA 178 Query: 328 AADQF 332 + Sbjct: 179 NMHRL 183 >gi|238922693|ref|YP_002936206.1| Carbohydrate-Binding Module Family 2 candidate secreted modular protein with two N-terminal CBM2 domains [Eubacterium rectale ATCC 33656] gi|238874365|gb|ACR74072.1| Carbohydrate-Binding Module Family 2 candidate secreted modular protein with two N-terminal CBM2 domains [Eubacterium rectale ATCC 33656] Length = 1186 Score = 42.5 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 49/144 (34%), Gaps = 25/144 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT----FSS 225 D++ V+D + SM + + + S+ + +R GLV ++ Sbjct: 572 DVVFVIDTTGSMGNEIQNVKNNIETVVSSL----------EENKVDIRLGLVEYRDIYAD 621 Query: 226 KIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 I T W V + ++ L + + A + Sbjct: 622 GIGSTKSYDWYTSVSSFKSELATLGVSGGGDTPESVVDAL--------YCARNMEYRTGV 673 Query: 284 KKYIIFLTDG-ENSSPNIDNKESL 306 KKY+I LTD + ++D+ +L Sbjct: 674 KKYVILLTDANYKNGTSVDSGATL 697 >gi|229816859|ref|ZP_04447141.1| hypothetical protein BIFANG_02107 [Bifidobacterium angulatum DSM 20098] gi|229785875|gb|EEP21989.1| hypothetical protein BIFANG_02107 [Bifidobacterium angulatum DSM 20098] Length = 373 Score = 42.5 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 57/158 (36%), Gaps = 12/158 (7%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP-GMDKLGVATRSIREML 203 C L + +V +S++ D++M +DV+ SM G D+L + ++ + Sbjct: 63 CLTVGAMALGPSMTVSTTSRAVNNTDVVMAVDVTGSMAVKDATYGSDELTTRLNAAKQAV 122 Query: 204 DIIKSIPDVNNV--VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF-GSTTKSTPGLE 260 D + ++ VR G PL I+ L + S L+ Sbjct: 123 DDVTKAYADSSFAAVRFGASG-----TLDVPLTPDAAAIRNWATTLAPEATGVSSGSSLD 177 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 +++ I H D + +TDGE +S Sbjct: 178 APLDQLITTLND---IRTTHPDDAIVLYLITDGEQTSA 212 >gi|124006282|ref|ZP_01691117.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] gi|123988206|gb|EAY27864.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] Length = 224 Score = 42.5 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 64/166 (38%), Gaps = 14/166 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++VLDVS SM + + ++ + + + + ++TFS ++ Sbjct: 20 CILVLDVSGSMIGN------PIQQLNEGLQRFKQQVMNDEIASQRLEICIITFSGRVACI 73 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + ++ L G +T GL A K+ K + G Y+ ++I + Sbjct: 74 QEPSLIHNF---EMPTLKAGGSTALVDGLRRAILKVTT--RKNWYKQTGQPYYRPFVIMI 128 Query: 291 TDGENSSPNIDNKESLFYCN-EAKRRGAIVYAIGVQAEAADQFLKN 335 TDGE + D K + + + + IGVQ +A L+ Sbjct: 129 TDGEP-DADQDVKGVSQDIDIRVDNKEFLFFPIGVQ-DANMDVLRQ 172 >gi|119510748|ref|ZP_01629875.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119464612|gb|EAW45522.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 217 Score = 42.5 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 37/166 (22%), Positives = 66/166 (39%), Gaps = 18/166 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++++LDVS SM+ G + +L + ++ D++K V +V+F Sbjct: 18 VILLLDVSGSMS---GQPIQELNRGLAAFKK--DVVKD-SQAALSVEVAIVSF-----GP 66 Query: 231 FPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L I+ L T +EYA + + K + G Y+ ++ Sbjct: 67 VRLTQDFVTIENFTPPELKSDGLTPMGEAIEYALDLL--ESRKTAYKDNGILYYRPWVFL 124 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAEAADQFLK 334 +TDG +P D K + EA+ R +A+GVQ D + Sbjct: 125 ITDG---APTDDWKYAAQRVKEAEASRRLCFFAVGVQGADFDTLKQ 167 >gi|91842247|gb|ABE66386.1| truncated integrin beta1 subunit-like protein 3 [Danio rerio] Length = 619 Score = 42.5 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 53/153 (34%), Gaps = 33/153 (21%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++K D +D+ ++D+S SM + L E+ + +K D+ +R Sbjct: 126 TLKFKRAEDYPIDLYFLMDLSHSM-------LSNLENLKNLGTELANEMK---DITKDLR 175 Query: 218 SGLVTF----SSKIVQTFP--------------LAWGVQHIQEKINRLIFGSTTKSTPGL 259 G +F S + FP L ++I++L S+ Sbjct: 176 IGFGSFFRKPSIQTNPCFPDNCIAPFSYFNVLSLTDDHALFTQEISKLKTSGNLDSSEA- 234 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + A + G + + ++F+TD Sbjct: 235 --GLEALMQAA--VCTDVIGWRNVTRVLVFITD 263 >gi|46143335|ref|ZP_00135441.2| COG4961: Flp pilus assembly protein TadG [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 520 Score = 42.5 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 83/250 (33%), Gaps = 30/250 (12%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTAT----- 61 R F + G +++ +L I ++ + +E++ +A+L L+ ++L Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 62 ------KILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 K+ N N + + + T + L + + + + + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDNQMVTTFVQAFLPQT----NEKAMRLTPTCK 125 Query: 116 IIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI-------- 167 + D K + S+ + W +I V ++SKS Sbjct: 126 TVTTDNKKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPQQVDVASKSRAFKKNTFNI 185 Query: 168 GLDMMMVLDVSLSMNDHFGPGM-------DKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 +D+M+V D+S SMN K+ + + E+ D NN R + Sbjct: 186 PIDLMVVADLSGSMNFDLDNKKIINNAKPSKIRILKEVLEELAAKSLFNQDSNNNNRIAV 245 Query: 221 VTFSSKIVQT 230 F+ + Sbjct: 246 APFALGAQHS 255 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 37/303 (12%), Positives = 86/303 (28%), Gaps = 62/303 (20%) Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 + + + + ++ ++ + C P+ + ++K Sbjct: 224 LEELAAKSLFNQDSNNNNRIAVAPFALGAQHSNNQCIIPFILKKYSKDRISEKNIKSYLS 283 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ + D +LS+ + + SI I + N G + Sbjct: 284 ANNNIS---AKDFALSL-----SYLVDIDKTINSIGGTFSSNSIIFNKNKFC-LGRSNKN 334 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + I RL +T ++ GL A N + ++ + K Sbjct: 335 THHWYNRDES---SNFFSFIKRLHAEGSTLASSGLITASNIML---KEESRSKSLGEQTK 388 Query: 285 KYIIFLTDGE-------------------------------------NSSPNIDNKESLF 307 + I+ L+DG S P ++ Sbjct: 389 RVILVLSDGNDELRLNDEGTPFTQYSRITENLLLGQEEQTTDTYPYFMSKPPKKLTSNIN 448 Query: 308 YCNEAK--------RRGAIVYAIGVQ-AEAADQFLKNCASPDRFYSVQNSRKLHDAFL-R 357 C+ + + + + A A Q ++C +YS + L ++F Sbjct: 449 VCDRIRNKLDEHNEDKNTKIVFVEFGYASKAKQAWQHCVGNGNYYSANDKASLLNSFKQA 508 Query: 358 IGK 360 IG+ Sbjct: 509 IGE 511 >gi|332884777|gb|EGK05033.1| hypothetical protein HMPREF9456_03186 [Dysgonomonas mossii DSM 22836] Length = 289 Score = 42.5 bits (98), Expect = 0.100, Method: Composition-based stats. Identities = 26/109 (23%), Positives = 45/109 (41%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S FG + R + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSAS--QDFGTHLS----VKRDVVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S KI + P G +H I+E IN T L++ N I Sbjct: 126 SDKIEKFIPPKKGKKHILYIIRELINFQAESPKTDMGMALKFLTNAIKK 174 >gi|328883597|emb|CCA56836.1| hypothetical protein SVEN_3550 [Streptomyces venezuelae ATCC 10712] Length = 642 Score = 42.5 bits (98), Expect = 0.100, Method: Composition-based stats. Identities = 30/147 (20%), Positives = 49/147 (33%), Gaps = 29/147 (19%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDV 212 + +K + + +V+D S SM +F G + +L ++ LD ++ Sbjct: 444 AAGAQLKKQGLAGARAAVYLVVDRSGSMRGYFKDGSVQRLAEQVTALAAHLDEDATVT-- 501 Query: 213 NNVVRSGLVTFSSKIVQTFPLA------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 V FS+ I T L V + + RL T +E Sbjct: 502 -------TVFFSTDIDGTVDLTPADLTPTRVDEVNATLGRL---GRTNYHRAVE------ 545 Query: 267 FDAKEKLEHIAKGHDDYKKYIIFLTDG 293 E L H K ++F TDG Sbjct: 546 ----EVLAHHEKADASRPALVVFQTDG 568 >gi|323137991|ref|ZP_08073065.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] gi|322396710|gb|EFX99237.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] Length = 296 Score = 42.5 bits (98), Expect = 0.100, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 52/144 (36%), Gaps = 2/144 (1%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 +R + G+++++ A + +++G ++ S ++ L LD +L A K Sbjct: 15 RLRTLLRDRNGAVAVIFAFATLPMVLLIGGAVDYSRAIGARSNLQQALDAGVLSAAVKGG 74 Query: 65 NQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKD 124 N ++G + ++ S + G + + + S + + + Sbjct: 75 NPDSGQLARYLNSNMSPGGAATNVT--LTRSVATGGAVTFVGDADFSVATNFLKMAGLGA 132 Query: 125 YNLSAVSRYEMPFIFCTFPWCANS 148 L + S +P T + S Sbjct: 133 IKLHSHSEATLPAQIVTATFKPTS 156 >gi|261883870|ref|ZP_06007909.1| hypothetical protein CfetvA_01086 [Campylobacter fetus subsp. venerealis str. Azul-94] Length = 216 Score = 42.5 bits (98), Expect = 0.100, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 63/202 (31%), Gaps = 28/202 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV--NNVVRSGLVTF 223 L++M++LD S SM + ++ A +I ++ + +V G + Sbjct: 28 KESLNIMVILDASGSMA-NIEGDQTRMEAAKSAINSFMESLPQEANVGLRIYGHEGSGSK 86 Query: 224 SSKIVQTFP-------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + K + + + + T L+ A + + Sbjct: 87 TDKARSCSSSELVYSIDKYDPAKFNQALVKAKPAGWTPIGFALKEAQKDLAAFQGDANTN 146 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAEAADQ-FL 333 + ++DG ++ + + + + IG + Q L Sbjct: 147 ---------IVYLVSDGISTC----DDDPVGSAKALYDSDITPIINVIGFNVDQEGQKQL 193 Query: 334 KNCA--SPDRFYSVQNSRKLHD 353 + A + + +VQ + L+D Sbjct: 194 QEVAKVTEGTYQNVQMAEGLYD 215 >gi|298481574|ref|ZP_06999766.1| von Willebrand factor type A domain protein [Bacteroides sp. D22] gi|298272438|gb|EFI14007.1| von Willebrand factor type A domain protein [Bacteroides sp. D22] Length = 616 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 73/195 (37%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N++H + I K I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMESGACPW--NTNHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ + + ++ V V SG S+ + Q I+E I+ L G Sbjct: 267 LDLVKSSLKLLENNLRDKDKVAIVTYSG----SAGVKLEATPGSDKQKIREAIDELTAGG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II +DG+ + + + Sbjct: 323 STAGGAGIMLAYKIAKKNLISNGNNR---------IILCSDGDFNVGVSSAEGLEQLIEK 373 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 374 ERKSGVFLTVLGYGM 388 >gi|196041820|ref|ZP_03109109.1| conserved hypothetical protein [Bacillus cereus NVH0597-99] gi|196027314|gb|EDX65932.1| conserved hypothetical protein [Bacillus cereus NVH0597-99] Length = 626 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 422 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 469 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 470 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 529 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 530 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGIEVIGIFIEEGEAKEATY 589 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 590 QLMKNIY--NHHFLVANHAE 607 >gi|14248693|gb|AAK57632.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248695|gb|AAK57633.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|14248707|gb|AAK57639.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|14248697|gb|AAK57634.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248699|gb|AAK57635.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|27378609|ref|NP_770138.1| hypothetical protein blr3498 [Bradyrhizobium japonicum USDA 110] gi|27351757|dbj|BAC48763.1| blr3498 [Bradyrhizobium japonicum USDA 110] Length = 445 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 36/119 (30%), Gaps = 6/119 (5%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +RN + +GS + T I L + + L E + D + A ++ Sbjct: 1 MRNLLRSRQGSAAFATVIALVPLIGAVALGAEAGSWYVTHQHAQNAADSAAYSGALRLSC 60 Query: 66 QENGNNGKKQKNDFSYRII---KNIWQTDFRNELRENGFA---QDINNIERSTSLSIII 118 G Q D+ + + ++ G N I R+ + I Sbjct: 61 TMAGAACGTQSVDYLAKEFAAQNGFCNSSPQDSTPYPGTQCAPSLPNRISRAVQIDIGT 119 >gi|85715076|ref|ZP_01046060.1| Nitric oxide reductase activation protein [Nitrobacter sp. Nb-311A] gi|85697991|gb|EAQ35864.1| Nitric oxide reductase activation protein [Nitrobacter sp. Nb-311A] Length = 786 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 69/195 (35%), Gaps = 26/195 (13%) Query: 168 GLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL--V 221 L ++++LD+S S ND MD AT + LD I ++ G V Sbjct: 594 DLAVLVLLDMSESSNDKVRGHDYTVMDLTRAATVLLAGALDRIGDPFAIHGFCSDGRHDV 653 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + P ++ ++ + +T+ L +A H Sbjct: 654 HYHRFKDFDQPYD---DVVKARLAGMKGRLSTRMGAALRHA-----------AHYLALQP 699 Query: 282 DYKKYIIFLTDGEN-----SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 K+ + +TDGE + P ++ E R+G VYA+ + ADQ++ Sbjct: 700 RSKRVVFVVTDGEPADNDVTDPKYLRHDTKAAVEELSRQGITVYALSLD-PHADQYVSRI 758 Query: 337 ASPDRFYSVQNSRKL 351 F + + +L Sbjct: 759 FGIKNFTVIDQAERL 773 >gi|326779305|ref|ZP_08238570.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|326659638|gb|EGE44484.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 453 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 51/176 (28%), Gaps = 37/176 (21%) Query: 152 PLLITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 PL ++ + +++++D S SM+ K+ A + +D + Sbjct: 47 PLAGSAPAPAYAPGRGPDAAVVLMVDCSGSMDY----PPTKMRNARDATAAAVDAL---- 98 Query: 211 DVNNVVRSGLVT--------FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 R +V + +E + +L G T L A Sbjct: 99 --REGTRFAVVAGTHVAKDVYPGNGELAVADRPTKARAKEALRKLSAGGGTAIGTWLRLA 156 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS-------PNIDNKESLFYCNE 311 + A + H I LTDG N +D+ F C+ Sbjct: 157 DRLLGAADVDIRHG-----------ILLTDGRNEHEAPEDLRAALDSCAGRFTCDA 201 >gi|223996681|ref|XP_002288014.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220977130|gb|EED95457.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 790 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 21/140 (15%), Positives = 49/140 (35%), Gaps = 19/140 (13%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 G ++ V+D S SM ++++ A + +L + + Sbjct: 572 TKKMARKAGSLIIFVVDASGSMA------LNRMNAAKGAAMSLLTEA-----YQSRDQIC 620 Query: 220 LVTFS-SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 L+ F + P + ++++ + G + L+ A +A+ K Sbjct: 621 LIPFQGDRADVLLPPTRSIAMAKKRLEAMPCGGGSPLADALQTATLTGLNAQ-------K 673 Query: 279 GHDDYKKYIIFLTDGENSSP 298 D K ++ ++DG + P Sbjct: 674 TGDVGKVVVVCISDGRANVP 693 >gi|90412304|ref|ZP_01220309.1| hypothetical dinitrification protein NorD [Photobacterium profundum 3TCK] gi|90326795|gb|EAS43188.1| hypothetical dinitrification protein NorD [Photobacterium profundum 3TCK] Length = 613 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 79/201 (39%), Gaps = 31/201 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + +++ D+S+S + + + V S+ + + ++ D + F Sbjct: 415 NCQRDISTLLLSDLSMSTDAYINNEYRVIDVIKDSMLLFSEALAAVEDA-----FAIYGF 469 Query: 224 SS--------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 SS +++ F ++ HI+ +I L G T+ + A N + E+ +H Sbjct: 470 SSVKRHHVRFTLLKNFAESYD-DHIRGRILSLRPGFYTRMGAAIRQASNILV---EQTQH 525 Query: 276 IAKGHDDYKKYIIFLTDG-----ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 K ++ LTDG +N +++ A+R G I + I + + AD Sbjct: 526 R--------KLLLILTDGKPNDIDNYDGRHGVEDTRQAIIAARRLGLIPFCITID-QKAD 576 Query: 331 QFLKNCASPDRFYSVQNSRKL 351 Q+L + F + + +L Sbjct: 577 QYLPYIFGNNGFTVIFDPSQL 597 >gi|145551564|ref|XP_001461459.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429293|emb|CAK94086.1| unnamed protein product [Paramecium tetraurelia] Length = 610 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 66/200 (33%), Gaps = 31/200 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K +D+M V+DVS SMN K+ + S+R + I+K R LVTF Sbjct: 123 KQRANIDLMCVVDVSGSMNGE------KIKLVQNSLRYIQKILKPTD------RLALVTF 170 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGS------TTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ L W + K +T G+ I D Sbjct: 171 GTQA--GINLQWTRNIAENKKKIKKAIKDIKIRDSTNIASGVALGLRMIRD------RKF 222 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K + L+DG + D + + + G ++ + + N A Sbjct: 223 KNPVTS---MFVLSDGVDDDRGADLRCQQALHQYNIQDTLTINTFGYGSDHDAKVMNNIA 279 Query: 338 --SPDRFYSVQNSRKLHDAF 355 +F + +++ + F Sbjct: 280 NLKGGQFVYIDQIQRVSEHF 299 >gi|320450072|ref|YP_004202168.1| von Willebrand factor, type A [Thermus scotoductus SA-01] gi|320150241|gb|ADW21619.1| von Willebrand factor, type A [Thermus scotoductus SA-01] Length = 407 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 64/186 (34%), Gaps = 42/186 (22%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S+ + +++LD S SM G D+ A + + +I++ + R V Sbjct: 211 SEYTASMSTVVLLDCSHSM---ILYGEDRFTPAKKVALALAHLIRTQYPGD---RVRFVL 264 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + PL+ + ++ T + GLE A + ++ Sbjct: 265 FHDTAEE-IPLS---RLPLAQVGPYH----TNTKAGLELARTLLKKMGGEM--------- 307 Query: 283 YKKYIIFLTDGENSSPNIDN---------------KESLFYCNEAKRRGAIVYAIGVQAE 327 + II +TDG+ S+ + + E+L A++ G ++ Sbjct: 308 --RQIILITDGKPSAITLPSGEIYKNAWGLDPLILAETLKEATLARKEGIPIHT--FMLA 363 Query: 328 AADQFL 333 + L Sbjct: 364 REPELL 369 >gi|126208028|ref|YP_001053253.1| tight adherence protein G [Actinobacillus pleuropneumoniae L20] gi|126096820|gb|ABN73648.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 520 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 40/246 (16%), Positives = 88/246 (35%), Gaps = 22/246 (8%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F + G +++ +L I ++ + +E++ +A+L L+ ++L + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 67 ENGNN----GKKQKNDFSYRIIKNIWQTDFR---NELRENGFAQDINNIERSTSLSIIID 119 N+ G K + S+ I + + D + ++ + + + + + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDTQMVTTFVQAFLPQTNEKAMRLTPTCKTVTT 129 Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDI--------GLDM 171 D K + S+ + W +I V ++SKS +D+ Sbjct: 130 DNKKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPQQVDVASKSRAFKKNTFNIPIDL 189 Query: 172 MMVLDVSLSMNDHFGPGM-------DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 M+V D+S SMN K+ + + E+ D NN R + F+ Sbjct: 190 MVVADLSGSMNFDLDNKKIINNAKPSKIRILKEVLEELAAKSLFNQDSNNNNRIAVAPFA 249 Query: 225 SKIVQT 230 + Sbjct: 250 LGAQHS 255 Score = 40.2 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 37/303 (12%), Positives = 86/303 (28%), Gaps = 62/303 (20%) Query: 105 INNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 + + + + ++ ++ + C P+ + ++K Sbjct: 224 LEELAAKSLFNQDSNNNNRIAVAPFALGAQHSNNQCIIPFILKKYSKDRISEKNIKSYLS 283 Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ + D +LS+ + + SI I + N G + Sbjct: 284 ANNNIS---AKDFALSL-----SYLVDIDKTINSIGGTFSSNSIIFNKNKFC-LGRSNKN 334 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + + I RL +T ++ GL A N + ++ + K Sbjct: 335 THHWYNRDES---SNFFSFIKRLHAEGSTLASSGLITASNIML---KEESRSKSLGEQTK 388 Query: 285 KYIIFLTDGE-------------------------------------NSSPNIDNKESLF 307 + I+ L+DG S P ++ Sbjct: 389 RVILVLSDGNDELRLNDEGTPFTQYSRITENLLLGQEEQTTDTYPYFMSKPPKKLTSNIN 448 Query: 308 YCNEAK--------RRGAIVYAIGVQ-AEAADQFLKNCASPDRFYSVQNSRKLHDAFL-R 357 C+ + + + + A A Q ++C +YS + L ++F Sbjct: 449 VCDRIRNKLDEHNEDKNTKIVFVEFGYASKAKQAWQHCVGNGNYYSANDKASLLNSFKQA 508 Query: 358 IGK 360 IG+ Sbjct: 509 IGE 511 >gi|330829743|ref|YP_004392695.1| von Willebrand factor type A domain-containing protein [Aeromonas veronii B565] gi|328804879|gb|AEB50078.1| von Willebrand factor type A domain protein [Aeromonas veronii B565] Length = 483 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 64/191 (33%), Gaps = 31/191 (16%) Query: 129 AVSRYEMPFIFCTFPWCANSSH-APLLITSSV---KISSKSDIGLDMMMVLDVSLSMNDH 184 A+ Y +P PW ++ L ++ ++ + LD+ +LD+S SM Sbjct: 34 AMQAYLLPASRPQRPWLWLATLPVILALSGPALRGELQQQPAAPLDI-WLLDLSRSMTAT 92 Query: 185 FGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI 244 D+ ++++L K R L+ ++ P Q + + Sbjct: 93 -DLKPDRATRVRWQLQQLLSRAKGE-------RIALILYAGDAYLAMPPTRDHQALSLLL 144 Query: 245 NRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 L + +E A ++ ++ ++ +TD + + + Sbjct: 145 PDLRPDIMPLQGSNPARAVELAMTQLAPGEQAR-------------LLLITD-DLTHNQM 190 Query: 301 DNKESLFYCNE 311 +L+ C + Sbjct: 191 TQIAALWPCQQ 201 >gi|326428371|gb|EGD73941.1| hypothetical protein PTSG_05636 [Salpingoeca sp. ATCC 50818] Length = 1446 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 54/152 (35%), Gaps = 14/152 (9%) Query: 170 DMMMVLDVSLSM---NDHFGPGMDKLGVATRSIREMLDII------KSIPDVNNVVRSGL 220 D+++VLD S SM + M K V + S + +++ K D R+ + Sbjct: 487 DIVIVLDCSRSMRGDKWNDAVAMTKFLVNSLSRDDRYNVVCFSSSHKDYNDNFVYRRTEV 546 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA----KEKLEHI 276 ++ K + + +++ T G++ + + Sbjct: 547 LS-CRKHELLRGTSSNKEDTFTRLDGYTPAGGTDPLTGIQVGFRLLRGECNGLDTDCPMR 605 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 D ++ ++FL+DG++ + +Y Sbjct: 606 DPPRTDCQRLMVFLSDGKDRDNEVRCGRGRYY 637 >gi|313159753|gb|EFR59110.1| conserved hypothetical protein [Alistipes sp. HGB5] Length = 294 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 65/189 (34%), Gaps = 17/189 (8%) Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 +I + + E++ G + +I + T+ + + + S V Y W Sbjct: 6 NDILKRVRKIEIKTRGLSNEIFAGKYHTAF------RGRGMSFSEVREYRAGDDVRDIDW 59 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + + L MM+++DVS S D+L ++I + Sbjct: 60 NVTARSRK-----PHIKIYEEERELTMMLLVDVSAS---RMFGSTDRL---KKNIITEIA 108 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 + + N + G + FS K+ + P G HI I L+ + L Sbjct: 109 AVLAFSAAQNNDKVGCIFFSDKVEKFIPPKKGRSHILMIIRELVGFRPESTGTKLSEPVR 168 Query: 265 KIFDAKEKL 273 + + +K Sbjct: 169 FLTNVNKKR 177 >gi|260642112|ref|ZP_05414581.2| von Willebrand factor type A domain protein [Bacteroides finegoldii DSM 17565] gi|260623548|gb|EEX46419.1| von Willebrand factor type A domain protein [Bacteroides finegoldii DSM 17565] Length = 615 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 71/183 (38%), Gaps = 23/183 (12%) Query: 147 NSSHAPLLITSSVK-ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 N++H + I K I ++ +++ ++D+S SM G ++L + S++ +++ Sbjct: 225 NTAHRLVRIGLKAKEIPTEQLPASNLVFLIDISGSM-----WGANRLDLVKSSLKLLVN- 278 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAY 263 ++ N + +VT++ G Q I+E I+ L G +T G+ AY Sbjct: 279 -----NLRNKDKVAIVTYAGSAGVKLEATSGGDKQKIREAIDELTAGGSTAGGAGIHLAY 333 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + II +DG+ + + + ++ G + +G Sbjct: 334 QIAKKNFISDGNNR---------IILCSDGDFNVGVSSAEGLEQLIEKERKSGVHLTVLG 384 Query: 324 VQA 326 Sbjct: 385 YGM 387 >gi|310772247|ref|NP_001185582.1| von Willebrand factor A domain-containing protein 3A isoform 1 [Rattus norvegicus] gi|149068054|gb|EDM17606.1| rCG39867, isoform CRA_b [Rattus norvegicus] Length = 1148 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 70/203 (34%), Gaps = 34/203 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM H +L + +R+ D R L++F+ Sbjct: 924 VCILLDTSGSMGPHLQWMKTELVLLIWEQLRKHCD------------RFNLLSFARDPQ- 970 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY--I 287 N L+ + +++ + + ++ + Sbjct: 971 ------------PWQNTLVESTDAACHEAMQWVAHLQAQGSTSVLTALMKAFSFQDVQGL 1018 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKNCA--SPDRFY 343 LTDG+ + +++ K RG V+ I + + AA +FL+ A S R++ Sbjct: 1019 YLLTDGKPDTSCSLILDTVQRFQ--KERGVKVHTISLTSTDRAAIEFLRKLASLSGGRYH 1076 Query: 344 SVQNSRKLHDAFLRIGKEMVKQR 366 + L + + +++R Sbjct: 1077 CPVSDEALSRIQGLLARGFIEER 1099 >gi|325963533|ref|YP_004241439.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] gi|323469620|gb|ADX73305.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] Length = 326 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 54/133 (40%), Gaps = 10/133 (7%) Query: 167 IGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 L++ +V+D + SM + +G G+ ++ R I +I R ++TF + Sbjct: 37 TDLNVFLVVDTTTSMVAEDYGDGLPRMEGVRR-------DIAAIAGELPGARFSVITFDT 89 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 K PL + + L T + A ++ E+LE + H + + Sbjct: 90 KAHVRMPLTTDTLALDTITSVLEPQVTAYAKGSSITAARQVLS--ERLEAARESHPERPR 147 Query: 286 YIIFLTDGENSSP 298 + +L DGE ++ Sbjct: 148 LVFYLGDGEQTTG 160 >gi|313904292|ref|ZP_07837670.1| von Willebrand factor type A [Eubacterium cellulosolvens 6] gi|313470842|gb|EFR66166.1| von Willebrand factor type A [Eubacterium cellulosolvens 6] Length = 317 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 75/181 (41%), Gaps = 27/181 (14%) Query: 123 KDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM- 181 K Y + + + + + P+ ++ + + +D++ V+D ++SM Sbjct: 30 KKYKI-WIKVAAIARVSLILLLVFVINLRPVKKVYDMETTLNN---VDVLFVVDSTISMW 85 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 + + ++ +R++++ + +N GL+ F ++ P +++ Sbjct: 86 AEDYQGDQPRMNG----VRDLIEHVMFSMQGSNF---GLIRFDNQAQILAPFTGDAENVL 138 Query: 242 EKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + +RL +T L+ AY + E+L + ++ + + F++DGEN Sbjct: 139 DAVEMIEIPSRLYAKGST-----LDTAYEAM----EELLQSSSKKENRQTVVFFISDGEN 189 Query: 296 S 296 + Sbjct: 190 T 190 >gi|198274640|ref|ZP_03207172.1| hypothetical protein BACPLE_00792 [Bacteroides plebeius DSM 17135] gi|198272087|gb|EDY96356.1| hypothetical protein BACPLE_00792 [Bacteroides plebeius DSM 17135] Length = 289 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 45/109 (41%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ + + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSNSLDF------GTIQQLKKEMVTEIAATIAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 S +I + P G +H I+E ++ T +EY N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFKPESQRTNLQCVIEYLTNVLKK 174 >gi|58429485|gb|AAW78146.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGKTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429473|gb|AAW78140.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429483|gb|AAW78145.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429501|gb|AAW78154.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGKTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429487|gb|AAW78147.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGKTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429453|gb|AAW78130.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGKTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|149911640|ref|ZP_01900251.1| hypothetical protein PE36_02984 [Moritella sp. PE36] gi|149805310|gb|EDM65324.1| hypothetical protein PE36_02984 [Moritella sp. PE36] Length = 2370 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 60/168 (35%), Gaps = 20/168 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHF--GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + ++++VLD S SM+ G G ++ +A ++ + + ++ +VN Sbjct: 1790 AAGGYTTNLIIVLDKSGSMDYSANDGTGRSRMDIAKEALSSLFNEYDNVGNVN----IQF 1845 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 V F ++ + Q+ +N + T L + Sbjct: 1846 VAFDYYATKSSWYEDDINAAQDYLNDVDADGGTSYDVALNKVMDDYQAPVAD-------- 1897 Query: 281 DDYKKYIIFLTDGENSS--PNIDNKESLFYCNEAKRRGA-IVYAIGVQ 325 K + F++DGE S I++ E + N +AIG+ Sbjct: 1898 ---KSVVYFISDGEPSYQEAGINSNEQATWENFLVDNNIDKSFAIGIG 1942 >gi|94312593|ref|YP_585802.1| hypothetical protein Rmet_3661 [Cupriavidus metallidurans CH34] gi|93356445|gb|ABF10533.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] gi|222832771|gb|EEE71248.1| predicted protein [Populus trichocarpa] Length = 575 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 12/80 (15%), Positives = 35/80 (43%), Gaps = 2/80 (2%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN--QENGN 70 +G++S++ A+L+ I + + ++ F+ + +L ++D + L A ++ + + Sbjct: 13 QRGTVSLMAALLIAAIGVAALVSLDVGFVFYTQRQLQKLVDVAALSGAQQLKSADDQATT 72 Query: 71 NGKKQKNDFSYRIIKNIWQT 90 N + S + Sbjct: 73 NANVLSSVTSAAAQNGYTKA 92 >gi|320166340|gb|EFW43239.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864] Length = 1035 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 17/115 (14%), Positives = 36/115 (31%), Gaps = 7/115 (6%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIKSIP 210 + +V ++ + +D+ +++D S SM+D + ++ D Sbjct: 50 GQPLLVTVSVTPQPCKPVDLYLLMDFSGSMDDDLTKVRSLAQPLATKVTQLCTDTSNPTC 109 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 N R G +F K G I + T + +A+ Sbjct: 110 SNTNCARLGFGSFLEKPAYPM----GRWTTSGWIKDSTYTGGT--SLPTNHAFRA 158 >gi|297572374|gb|ADI46542.1| integrin beta 1 [Capsaspora owczarzaki] Length = 1064 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 17/115 (14%), Positives = 36/115 (31%), Gaps = 7/115 (6%) Query: 152 PLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIKSIP 210 + +V ++ + +D+ +++D S SM+D + ++ D Sbjct: 95 GQPLLVTVSVTPQPCKPVDLYLLMDFSGSMDDDLTKVRSLAQPLATKVTQLCTDTSNPTC 154 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 N R G +F K G I + T + +A+ Sbjct: 155 SNTNCARLGFGSFLEKPAYPM----GRWTTSGWIKDSTYTGGT--SLPTNHAFRA 203 >gi|218515243|ref|ZP_03512083.1| hypothetical protein Retl8_16920 [Rhizobium etli 8C-3] Length = 94 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 13/86 (15%), Positives = 31/86 (36%), Gaps = 10/86 (11%) Query: 9 FFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHY----------ILDHSLLY 58 F + G+ I+TA+L+ + G+ ++ +H ++ +L+ + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 59 TATKILNQENGNNGKKQKNDFSYRII 84 A + + GK D + Sbjct: 68 AAMAMNGNGTISLGKTDARDIFMSQV 93 >gi|197118219|ref|YP_002138646.1| hypothetical protein Gbem_1835 [Geobacter bemidjiensis Bem] gi|197087579|gb|ACH38850.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 356 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 21/114 (18%), Positives = 49/114 (42%), Gaps = 7/114 (6%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 + KG + + AILL ++F +GL ++ H F V+ +L D + L A + + Sbjct: 6 KLIRSEKGMVIVYVAILLMMMFGFLGLAVDGGHLFKVRGELQNAADAAALKGAWHLYTRP 65 Query: 68 -NGNNGKKQKNDFSYRIIKNIWQTD------FRNELRENGFAQDINNIERSTSL 114 + + + + + + + ++ + + G+ +NI + T+L Sbjct: 66 TDPTQLPTLQWEVARFQAQQMITENSSDNTALKDAMVDVGYWNTNSNILQPTTL 119 >gi|294632153|ref|ZP_06710713.1| LOW QUALITY PROTEIN: secreted protein [Streptomyces sp. e14] gi|292835486|gb|EFF93835.1| LOW QUALITY PROTEIN: secreted protein [Streptomyces sp. e14] Length = 403 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 43/121 (35%), Gaps = 16/121 (13%) Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + L T P L A + D + I+ ++DGE+ Sbjct: 95 NRTEAKTAVATLSPTGWTPIGPALLKAATDL------------EGGDGTRRIVLISDGED 142 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC---ASPDRFYSVQNSRKLH 352 + +D E AK G + +G+ +A + +C A+ + V++ +L Sbjct: 143 TCQPLDPCEVAREI-AAKGIGLTIDTLGLVPDAKTRDQLSCIADATGGTYTDVRHKDELS 201 Query: 353 D 353 D Sbjct: 202 D 202 >gi|323168757|gb|EFZ54437.1| von Willebrand factor type A domain protein [Shigella sonnei 53G] Length = 378 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|307138777|ref|ZP_07498133.1| hypothetical protein EcolH7_11726 [Escherichia coli H736] gi|331642742|ref|ZP_08343877.1| conserved hypothetical protein [Escherichia coli H736] gi|331039540|gb|EGI11760.1| conserved hypothetical protein [Escherichia coli H736] Length = 378 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|301644136|ref|ZP_07244145.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|301077521|gb|EFK92327.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|315618099|gb|EFU98690.1| von Willebrand factor type A domain protein [Escherichia coli 3431] Length = 378 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|256022195|ref|ZP_05436060.1| hypothetical protein E4_02377 [Escherichia sp. 4_1_40B] gi|312973630|ref|ZP_07787802.1| von Willebrand factor type A domain protein [Escherichia coli 1827-70] gi|310332225|gb|EFP99460.1| von Willebrand factor type A domain protein [Escherichia coli 1827-70] Length = 378 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|16130059|ref|NP_416625.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|89108937|ref|AP_002717.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|170081743|ref|YP_001731063.1| hypothetical protein ECDH10B_2276 [Escherichia coli str. K-12 substr. DH10B] gi|238901305|ref|YP_002927101.1| hypothetical protein BWG_1905 [Escherichia coli BW2952] gi|300948218|ref|ZP_07162340.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|465583|sp|P33352|YEHP_ECOLI RecName: Full=Uncharacterized protein yehP gi|405852|gb|AAA60484.1| yehP [Escherichia coli] gi|1788440|gb|AAC75182.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|85675234|dbj|BAE76597.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|169889578|gb|ACB03285.1| conserved protein [Escherichia coli str. K-12 substr. DH10B] gi|238860503|gb|ACR62501.1| conserved protein [Escherichia coli BW2952] gi|260448784|gb|ACX39206.1| VWA containing CoxE family protein [Escherichia coli DH1] gi|300452252|gb|EFK15872.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|315136753|dbj|BAJ43912.1| hypothetical protein ECDH1ME8569_2056 [Escherichia coli DH1] gi|323940811|gb|EGB36999.1| VWA domain containing CoxE protein [Escherichia coli E482] gi|332343898|gb|AEE57232.1| conserved hypothetical protein [Escherichia coli UMNK88] gi|744221|prf||2014253Q yehP gene Length = 378 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|193062933|ref|ZP_03044026.1| von Willebrand factor type A domain protein [Escherichia coli E22] gi|194428909|ref|ZP_03061443.1| von Willebrand factor type A domain protein [Escherichia coli B171] gi|209919580|ref|YP_002293664.1| hypothetical protein ECSE_2389 [Escherichia coli SE11] gi|260844728|ref|YP_003222506.1| hypothetical protein ECO103_2597 [Escherichia coli O103:H2 str. 12009] gi|260856095|ref|YP_003229986.1| hypothetical protein ECO26_3030 [Escherichia coli O26:H11 str. 11368] gi|260868825|ref|YP_003235227.1| hypothetical protein ECO111_2838 [Escherichia coli O111:H- str. 11128] gi|293446472|ref|ZP_06662894.1| yehP protein [Escherichia coli B088] gi|300818880|ref|ZP_07099085.1| von Willebrand factor type A domain protein [Escherichia coli MS 107-1] gi|300821775|ref|ZP_07101920.1| von Willebrand factor type A domain protein [Escherichia coli MS 119-7] gi|300924507|ref|ZP_07140475.1| von Willebrand factor type A domain protein [Escherichia coli MS 182-1] gi|301325228|ref|ZP_07218747.1| von Willebrand factor type A domain protein [Escherichia coli MS 78-1] gi|309792999|ref|ZP_07687427.1| von Willebrand factor type A domain protein [Escherichia coli MS 145-7] gi|331653547|ref|ZP_08354548.1| conserved hypothetical protein [Escherichia coli M718] gi|331668817|ref|ZP_08369665.1| conserved hypothetical protein [Escherichia coli TA271] gi|331678069|ref|ZP_08378744.1| conserved hypothetical protein [Escherichia coli H591] gi|331683796|ref|ZP_08384392.1| conserved hypothetical protein [Escherichia coli H299] gi|192931576|gb|EDV84177.1| von Willebrand factor type A domain protein [Escherichia coli E22] gi|194413077|gb|EDX29365.1| von Willebrand factor type A domain protein [Escherichia coli B171] gi|195183024|dbj|BAG66586.1| predicted protein [Escherichia coli O111:H-] gi|209912839|dbj|BAG77913.1| conserved hypothetical protein [Escherichia coli SE11] gi|257754744|dbj|BAI26246.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257759875|dbj|BAI31372.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009] gi|257765181|dbj|BAI36676.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|291323302|gb|EFE62730.1| yehP protein [Escherichia coli B088] gi|300419287|gb|EFK02598.1| von Willebrand factor type A domain protein [Escherichia coli MS 182-1] gi|300525617|gb|EFK46686.1| von Willebrand factor type A domain protein [Escherichia coli MS 119-7] gi|300528499|gb|EFK49561.1| von Willebrand factor type A domain protein [Escherichia coli MS 107-1] gi|300847913|gb|EFK75673.1| von Willebrand factor type A domain protein [Escherichia coli MS 78-1] gi|308123285|gb|EFO60547.1| von Willebrand factor type A domain protein [Escherichia coli MS 145-7] gi|320200872|gb|EFW75457.1| Mg-chelatase subunit ChlD [Escherichia coli EC4100B] gi|323152342|gb|EFZ38631.1| von Willebrand factor type A domain protein [Escherichia coli EPECa14] gi|323161787|gb|EFZ47667.1| von Willebrand factor type A domain protein [Escherichia coli E128010] gi|323176985|gb|EFZ62575.1| von Willebrand factor type A domain protein [Escherichia coli 1180] gi|323944937|gb|EGB41002.1| VWA domain containing CoxE protein [Escherichia coli H120] gi|331048396|gb|EGI20472.1| conserved hypothetical protein [Escherichia coli M718] gi|331064011|gb|EGI35922.1| conserved hypothetical protein [Escherichia coli TA271] gi|331074529|gb|EGI45849.1| conserved hypothetical protein [Escherichia coli H591] gi|331078748|gb|EGI49950.1| conserved hypothetical protein [Escherichia coli H299] Length = 378 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|254229333|ref|ZP_04922750.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262393419|ref|YP_003285273.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] gi|151938141|gb|EDN56982.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262337013|gb|ACY50808.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] Length = 422 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 31/92 (33%), Gaps = 10/92 (10%) Query: 30 IVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQ 89 V I+ +H+ K +L +D + L A E+ N Q + I Sbjct: 30 GVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVN----QAETAVRTTLGGIST 85 Query: 90 TDFRNELRENG------FAQDINNIERSTSLS 115 +EL + F+ D+ + S + Sbjct: 86 EPGNSELTFSDSNTAVTFSHDMQTFVSAASFT 117 >gi|160710|gb|AAA29770.1| thrombospondin related anonymous protein [Plasmodium falciparum] gi|33333554|gb|AAQ11893.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNENAIHLYANIFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|307107471|gb|EFN55714.1| hypothetical protein CHLNCDRAFT_134017 [Chlorella variabilis] Length = 611 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 67/179 (37%), Gaps = 26/179 (14%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-V 228 ++ ++D S S+ + F + A R + ++P + + V ++ FS+ I V Sbjct: 431 VVFLVDGSGSVTEEDFRVMTGFMLAAVRGLAGGAAGQGAVPPLRSKV--AVIQFSNDIRV 488 Query: 229 QTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + P+ V + + + T ++ A + + + Sbjct: 489 EQGPVDVDVGAFEALMAGMARMNGGTNIALAVQKAGQLLKPL----------SATAHRVL 538 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAE-AADQFLKNCASPD 340 + LTDG ID+ +S + A R G V+A GV + L+ CA+ D Sbjct: 539 VLLTDG-----RIDSHQSREARDMAARLGDEQANVRVHAYGVGRGVDKQELLRICAARD 592 >gi|161524898|ref|YP_001579910.1| hypothetical protein Bmul_1725 [Burkholderia multivorans ATCC 17616] gi|160342327|gb|ABX15413.1| conserved hypothetical protein [Burkholderia multivorans ATCC 17616] Length = 626 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 34/80 (42%), Gaps = 1/80 (1%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 +GS++++ AI + + +V+G I+ + +F + L I D + + + + +Q Sbjct: 10 RGKSRQRGSVAVMAAIWVMIAIVVLG-AIDVGNLYFQRRNLQRIADMAAIASVESMTDQC 68 Query: 68 NGNNGKKQKNDFSYRIIKNI 87 + N S + Sbjct: 69 SQQNSPAMMAAQSNALANGF 88 >gi|163800204|ref|ZP_02194105.1| hypothetical protein 1103602000595_AND4_05974 [Vibrio sp. AND4] gi|159175647|gb|EDP60441.1| hypothetical protein AND4_05974 [Vibrio sp. AND4] Length = 638 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 40/138 (28%), Gaps = 13/138 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F W S S + + ++VLD+S SM ++L Sbjct: 54 FTLWGLAWAIACVALAGPSWQSNTRPS-FQLSQNRVLVLDMSRSM-YATDVKPNRLSQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GL+ ++ PL + I L+ Sbjct: 112 YKALDLLPKWKEGS-------TGLIAYAGDAYTLSPLTTDSNTLAGIIENLSPELMPYQG 164 Query: 253 TKSTPGLEYAYNKIFDAK 270 + +E A + A Sbjct: 165 SNLPSAIETALGQFNQAG 182 >gi|120554866|ref|YP_959217.1| TPR repeat-containing protein [Marinobacter aquaeolei VT8] gi|120324715|gb|ABM19030.1| TPR repeat-containing protein [Marinobacter aquaeolei VT8] Length = 612 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 44/120 (36%), Gaps = 12/120 (10%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + ++++LD+SLSM D+L +A R +R++L + L Sbjct: 86 PTPLKQTQDSLVILLDLSLSMLATDVEP-DRLTLAKRKVRDIL-------AQRQGGLTAL 137 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V +S PL + I+ ++ L + ++ + A + + I Sbjct: 138 VVYSGDAHTVTPLTGDTRTIEAMLSVLDPTIMPAQGNRADLAVRQAVDLLDQGAPGQGRI 197 >gi|261855517|ref|YP_003262800.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261835986|gb|ACX95753.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 788 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 24/186 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRS------IREMLDIIKSIPDVNNVVRSGLV 221 L +++++D+S S ND G ++ V + + + L+ I ++ Sbjct: 594 DLSVILLIDLSESTNDLVQTGEEEKTVLDMAREAAALLGDALNRIGDPYAIHGFDS---- 649 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + + + + + T ++ A H+ Sbjct: 650 --NGRSDVEYFRFKDFDDAFDDLAKARLAGMT------GQLSTRMGTALRHAGHLLDSRP 701 Query: 282 DYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 KK I+ LTDGE S P +++ E +RG + + + AD +++ Sbjct: 702 SQKKLILLLTDGEPSDNDVRDPQYLRQDTKRVVEELGKRGINTFCVTLD-PRADDYVERI 760 Query: 337 ASPDRF 342 P + Sbjct: 761 FGPKHY 766 >gi|153831726|ref|ZP_01984393.1| TPR repeat containing protein [Vibrio harveyi HY01] gi|148872236|gb|EDL71053.1| TPR repeat containing protein [Vibrio harveyi HY01] Length = 650 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 42/138 (30%), Gaps = 13/138 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F W S + S ++ + ++VLD+S SM ++L Sbjct: 54 FAIWGITWAIACIALAGPSWQSNTLPS-FELSQNRVLVLDMSRSM-YATDVKPNRLSQTR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN----RLIFGST 252 ++L K +GLV ++ PL + I L+ Sbjct: 112 YKALDLLPKWKEGS-------TGLVAYAGDAYTLSPLTTDSSTLAGIIENLSPELMPYQG 164 Query: 253 TKSTPGLEYAYNKIFDAK 270 + +E A + A Sbjct: 165 SNLPSAIETALGQFTQAG 182 >gi|153006798|ref|YP_001381123.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] gi|152030371|gb|ABS28139.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] Length = 381 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 13/53 (24%), Positives = 24/53 (45%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 +G+++++ AI+L V+ M L + H V+ +L D L A Sbjct: 4 RGKRRERGAVAVIVAIVLLVLGGFMALSLNVGHLLSVRGELQNASDAGALGGA 56 >gi|87307101|ref|ZP_01089247.1| hypothetical protein DSM3645_01570 [Blastopirellula marina DSM 3645] gi|87290474|gb|EAQ82362.1| hypothetical protein DSM3645_01570 [Blastopirellula marina DSM 3645] Length = 1670 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 67/220 (30%), Gaps = 40/220 (18%) Query: 170 DMMMVLDVSLSMNDHFGP---GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + +VLD S SM G K A R++ +L+ I V+ + Sbjct: 1116 AISIVLDASGSMGAAAGQPFGPQTKYAEAVRALDRLLETIPDGTQVSVWTFGQAMGSQKT 1175 Query: 227 IVQT--------FPLAWGVQHIQEK---INRLIFGSTT--KSTPGLEYAYNKIFDAKEKL 273 +V+ P+ W + + + + + +P L + AK L Sbjct: 1176 VVEAERTIQRLVAPIIWNSKDKSQYQQLVKAITYPQVEPWNESPILR----AMISAKGDL 1231 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKR----RGAIVYAIGVQ- 325 + K ++ +TDG ++ D+ G + +G + Sbjct: 1232 -----TGVNGPKTMLVITDGADNRFVNDSTVNPRGRSVAEALFDIFDGSGISIQVVGFKV 1286 Query: 326 -------AEAADQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 A+ + ++ P FY + L + Sbjct: 1287 VSAEAALAQKQFELVEQLYPPGGFYLIDRVEALEAHLANV 1326 >gi|297617300|ref|YP_003702459.1| von Willebrand factor A [Syntrophothermus lipocalidus DSM 12680] gi|297145137|gb|ADI01894.1| von Willebrand factor type A [Syntrophothermus lipocalidus DSM 12680] Length = 588 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 52/175 (29%), Gaps = 25/175 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+ +++D S SM DK A +L V ++ +K Sbjct: 407 PIDVCLLIDASASMAG------DKRQAACYLAEHLLLT------GREKVAV-VIFQQTKA 453 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 P + + + + + T G+ + I K + Sbjct: 454 KVVVPFTRNHKLLTKGLASIRPEGLTPLADGIATSLKLI-----------KSSRVSNPLL 502 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAADQFLKNCASPDR 341 + +TDG + P AK+ + + + E+ FL A+ Sbjct: 503 VLITDGMPNFPLWTVDAREDALTAAKKIANTKIKFVCIGLESNKVFLNELAARAH 557 >gi|288941616|ref|YP_003443856.1| TPR repeat-containing protein [Allochromatium vinosum DSM 180] gi|288896988|gb|ADC62824.1| TPR repeat-containing protein [Allochromatium vinosum DSM 180] Length = 630 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 64/204 (31%), Gaps = 34/204 (16%) Query: 136 PFIFCTFPWCA--NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 P + W + P+ + S + ++LD+S S+N Sbjct: 58 PLVLLGVGWLILVVALAGPVWRQLPQPVFSLDARSV---ILLDLSPSLNAADVSP----S 110 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-----NRLI 248 TR+ E LD++K++ + + GL+ F PL+ I ++ + + Sbjct: 111 RLTRARFETLDLLKAMREG----QVGLIAFGPDPFLVSPLSADANTIAAQVPMLTTDLIP 166 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 ++ LE A + A H +I +TDG + SL Sbjct: 167 VPGARRTERALEMAAGMLERAGGGAGH-----------VILITDGVGN-----LAGSLES 210 Query: 309 CNEAKRRGAIVYAIGVQAEAADQF 332 G + + V Sbjct: 211 ARRLADAGHRLSVLAVGTLDGAPV 234 >gi|193208855|ref|NP_505147.3| C-type LECtin family member (clec-218) [Caenorhabditis elegans] gi|163644494|gb|AAY43994.2| C-type lectin protein 218 [Caenorhabditis elegans] Length = 389 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 65/188 (34%), Gaps = 18/188 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI-IKSIPDVNNVVRSGLVTFSSKI 227 LD+ +++D S M G+ ++ S+ I + S R ++T+S Sbjct: 37 LDVYLLIDNSAKMG---SVGLLEVASNVNSVFGFTQIRVGSNYPDKRGARVSVLTYSDSP 93 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L+ + I L ++ S L+ + + + + A ++ Sbjct: 94 TVHANLSDFKSTDELTSMIYALKPSTSYDSN--LQSSLKLVKNMMNYKDINAPRNNTQTV 151 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ------FLKNCASP 339 II+ D + D + ++ K G + + + Q +LK AS Sbjct: 152 IIIYAGD----YVDYDEPTIAQFGDQLKADGVKIITVADISNTDHQHVSKLKWLKELASE 207 Query: 340 DRFYSVQN 347 +++ + Sbjct: 208 GNGFNIND 215 >gi|306814764|ref|ZP_07448926.1| hypothetical protein ECNC101_22022 [Escherichia coli NC101] gi|222033883|emb|CAP76624.1| Uncharacterized protein yehP [Escherichia coli LF82] gi|305852158|gb|EFM52610.1| hypothetical protein ECNC101_22022 [Escherichia coli NC101] gi|312946740|gb|ADR27567.1| hypothetical protein NRG857_10760 [Escherichia coli O83:H1 str. NRG 857C] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGQQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|145591210|ref|YP_001153212.1| hypothetical protein Pars_0980 [Pyrobaculum arsenaticum DSM 13514] gi|145282978|gb|ABP50560.1| conserved hypothetical protein [Pyrobaculum arsenaticum DSM 13514] Length = 431 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 31/142 (21%), Positives = 54/142 (38%), Gaps = 22/142 (15%) Query: 159 VKISSKSDIGLD----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 K S D+ LD + +++D S SM G+ + L +I + Sbjct: 255 TKSLSIYDLALDTRDRVYLLVDKSGSMFYSLYDGVAMDMTQKITWATAL----AIAVMKK 310 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R+ L F + P V+ I + R++ T T + A + DAK++ Sbjct: 311 SKRTVLRFFDQMVY---PPITNVKDIIRSLLRVLPLGGTDITAAVHTA---VRDAKQQSL 364 Query: 275 HIAKGHDDYKKYIIFLTDGENS 296 H K ++ +TDGE+ Sbjct: 365 HNYK--------LVIITDGEDD 378 >gi|126437258|ref|YP_001072949.1| von Willebrand factor, type A [Mycobacterium sp. JLS] gi|126237058|gb|ABO00459.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 665 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 41/279 (14%), Positives = 87/279 (31%), Gaps = 35/279 (12%) Query: 31 VMGLVIETSHKFFVKAKL--HYILDHSLLYTA-TKILNQE---NGNNGKKQKNDFSYRII 84 G ++ + +L +D L ++NQ G++GK + + + R + Sbjct: 328 YAGATMDDVDLDALARQLGDQAAVDARTLAELERALMNQGFLDRGSDGKWRLSPKAMRQL 387 Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 D +L +D + L+ + Sbjct: 388 GQAALRDVAQQLSGRHGERDTRRAGAAGELTGATRPWQFGDTEPWNVTRTLTNAVLRQAG 447 Query: 145 CANSSHAPLLITSSVKIS-SKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREM 202 + + V+IS +++ + +++D S SM ++ M + +A Sbjct: 448 SSVREIPVSITVDDVEISETETRTQAAVALLVDTSFSMVMENRWLPMKRTALA----LNH 503 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 L + D +V G + V L G++ + E T L A Sbjct: 504 LVSTRFRSDALQIVAFGRYA---RTVTAAELT-GLEGVYE--------QGTNLHHALALA 551 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + H + + I+ +TDGE ++ D Sbjct: 552 TRHL-----------RRHPNAQPVILVVTDGEPTAHLED 579 >gi|108801277|ref|YP_641474.1| von Willebrand factor, type A [Mycobacterium sp. MCS] gi|119870428|ref|YP_940380.1| von Willebrand factor, type A [Mycobacterium sp. KMS] gi|108771696|gb|ABG10418.1| von Willebrand factor, type A [Mycobacterium sp. MCS] gi|119696517|gb|ABL93590.1| von Willebrand factor, type A [Mycobacterium sp. KMS] Length = 665 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 41/279 (14%), Positives = 87/279 (31%), Gaps = 35/279 (12%) Query: 31 VMGLVIETSHKFFVKAKL--HYILDHSLLYTA-TKILNQE---NGNNGKKQKNDFSYRII 84 G ++ + +L +D L ++NQ G++GK + + + R + Sbjct: 328 YAGATMDDVDLDALARQLGDQAAVDARTLAELERALMNQGFLDRGSDGKWRLSPKAMRQL 387 Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 D +L +D + L+ + Sbjct: 388 GQAALRDVAQQLSGRHGERDTRRAGAAGELTGATRPWQFGDTEPWNVTRTLTNAVLRQAG 447 Query: 145 CANSSHAPLLITSSVKIS-SKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREM 202 + + V+IS +++ + +++D S SM ++ M + +A Sbjct: 448 SSVREIPVSITVDDVEISETETRTQAAVALLVDTSFSMVMENRWLPMKRTALA----LNH 503 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 L + D +V G + V L G++ + E T L A Sbjct: 504 LVSTRFRSDALQIVAFGRYA---RTVTAAELT-GLEGVYE--------QGTNLHHALALA 551 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 + + H + + I+ +TDGE ++ D Sbjct: 552 TRHL-----------RRHPNAQPVILVVTDGEPTAHLED 579 >gi|14248671|gb|AAK57621.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248673|gb|AAK57622.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|268575744|ref|XP_002642852.1| Hypothetical protein CBG15118 [Caenorhabditis briggsae] Length = 1268 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 61/166 (36%), Gaps = 24/166 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +LD S D+F + A + I E + + ++ V+ LV ++ + Sbjct: 671 DILFLLDSS----DNFNEQ--RFHRAIKLIGETVSKFNNF--GSDGVQVSLVQYNDEPYL 722 Query: 230 TFPLA--WGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 F L +H+ + I F ++ T LE F K A+ Sbjct: 723 EFSLRKHNCKKHLLDDIADTEFMTGGSQLTKALEKVSQFAFTKKRGDRPDAEN------V 776 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +I +TDG+++ + AK V I EA QF Sbjct: 777 LIIVTDGQSNGRIQEPTRL------AKENNVTVLVI-TTIEADKQF 815 >gi|328865688|gb|EGG14074.1| type A von Willebrand factor domain-containing protein [Dictyostelium fasciculatum] Length = 908 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 56/369 (15%), Positives = 111/369 (30%), Gaps = 51/369 (13%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLL---YTATKILNQE 67 N KG ++ L P + + + I S LHY L + + K+ + Sbjct: 155 KNDKGYFNLSIGNLPPGKEVTIAITI-VSEVGLHLQDLHYCLHRYMFPKNSLSLKLDMEV 213 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + + Y+ K I Q L NG N+ Sbjct: 214 DLSTDITDIQIDHYQFKKEINQNRATISLEHNGVVN--KNLIVVVKPK-----------T 260 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGP 187 Y + + + P S ++ KS+ + +LD S SM+ Sbjct: 261 EEKPGYFLEYNKEDKTAALALNFYPRFQISPDEVDQKSE----FVFLLDCSGSMSG---- 312 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 + A R++ ++ + N+V G S ++ I ++ Sbjct: 313 --GAITKAKRALEILMRSLTE-NSKFNIVLFGSNFKSLFPESMPYDDANLEIASTYIQKI 369 Query: 248 IFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T+ P ++ +K +D + Y + + LTDGE S + Sbjct: 370 QADLGGTELLPPIKSILSKPYDPQ------------YPRQVFILTDGEVSERDQLIDFVG 417 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFL----KNCASPDRFYSVQNSRKLHDA-FLRIGKE 361 N + ++ +G+ + + + K+C F + + + I E Sbjct: 418 KEANTTR-----IFTLGIGSGVDRELVIGLSKSCKGYYEFIEENSMMETQVVKLMSIAME 472 Query: 362 MVKQRILYN 370 I + Sbjct: 473 PTISNIRVD 481 >gi|313896128|ref|ZP_07829682.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312975553|gb|EFR41014.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 243 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 66/185 (35%), Gaps = 22/185 (11%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 P + ++ +++++V+D S SM + +++ Sbjct: 21 VPRAYAADQTTGAEKPKRIELVLVIDKSGSMGGLESDTIGGFNS----------MVEKQR 70 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ VR V F+ ++ + W V + + + G TT + ++ Sbjct: 71 ALHIPVRVTAVLFNDEVQTLYDRKAIWRVPPLTDH--DYVTGGTTALLDAVGTTIRRM-- 126 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES--LFYCNEAKRRGAIVYAIGVQA 326 E+ + I K + +TDG ++ + + ++ ++ G +G Sbjct: 127 --ERADGITAPGT--KVVFVIITDGLENASTEFTRAAVKRMISDKQEKAGWSFIYLGANI 182 Query: 327 EAADQ 331 +AA++ Sbjct: 183 DAAEE 187 >gi|313884738|ref|ZP_07818493.1| von Willebrand factor type A domain protein [Eremococcus coleocola ACS-139-V-Col8] gi|312619999|gb|EFR31433.1| von Willebrand factor type A domain protein [Eremococcus coleocola ACS-139-V-Col8] Length = 549 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 59/188 (31%), Gaps = 15/188 (7%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 VLD S SM G G ++ A + + ++ S ++ FS + Sbjct: 370 VLDYSGSME---GEGNKQMTEAMAQVLLPENAKANLLQGTQQDISLVIPFSDMVDAATVA 426 Query: 234 AWGVQHI---QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 Q + + + + G T G+ A + + + I+ L Sbjct: 427 KGNGQELVDLNQYVQDYVVGGDTAMYEGIIAALDTMVSDYGQDLEDYSPA------IVIL 480 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRK 350 TDG+ + S Y +++I + K A + + Sbjct: 481 TDGQPNGAKTFKDLSQRYQQAQVD--IPMFSILFGEAEEGKM-KEIADLTKARVFDGRKD 537 Query: 351 LHDAFLRI 358 L +AF ++ Sbjct: 538 LINAFKQV 545 >gi|288921206|ref|ZP_06415492.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347413|gb|EFC81704.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 625 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 67/222 (30%), Gaps = 50/222 (22%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 +MVLD S SM + D+ +R + ++P + GL FSS++ Sbjct: 417 LMVLDSSGSMAEPVPASGDR----SRLAVALDAAKAALPMFADGSNLGLWRFSSRLHGA- 471 Query: 232 PLAWG--------------------------VQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 W Q + ++++R+ T A+ Sbjct: 472 -DDWDELVGLGPVEEQVGGADGADEAGGVPRRQAVIDEMSRIEPRGDTGLYETTLAAFRH 530 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP---NIDNKESLFYCNEAKRRGAIVYAI 322 + E ++ LTDG NS P ++D + + I Sbjct: 531 LNQHYEDGWPNQ---------VVLLTDGRNSDPGSMSLDELVRTLRREYSALHPVRIITI 581 Query: 323 GVQAEAADQFLKNC--ASPDRFYSVQNSRK----LHDAFLRI 358 G +A L A+ + Y + L AF I Sbjct: 582 GYGEDADLGALARISDATGAQSYPALDPNSIFVVLVGAFTEI 623 >gi|268558158|ref|XP_002637069.1| C. briggsae CBR-ROP-1 protein [Caenorhabditis briggsae] gi|187031741|emb|CAP29040.1| CBR-ROP-1 protein [Caenorhabditis briggsae AF16] Length = 644 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 57/169 (33%), Gaps = 35/169 (20%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQT 230 + LDVS SM P + M S+ +++N V F K Sbjct: 477 CLALDVSGSM---CSPVSSSPLSCREAATGM-----SLINLHNEAEVKCVAFCDKLTELP 528 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 F W + + + +N L F +T + +A + + +II+ Sbjct: 529 FTKDWKIGQVNDYVNNLSF-GSTDCGLPMTWATE--------------NNLKFDVFIIY- 572 Query: 291 TDGENSSPNIDNKESLFYCNEAK---RRGAIV-------YAIGVQAEAA 329 TD + + I E++ EA IV Y+I ++A Sbjct: 573 TDNDTWAGEIHPFEAIKKYREASGIHDAKVIVMAMQAYNYSIADPSDAG 621 >gi|310119165|ref|XP_003118918.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] gi|310126588|ref|XP_003120448.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 535 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 40/89 (44%), Gaps = 12/89 (13%) Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 G + +Y+I +TDG++S + E L + G +YAIG++ EA LK Sbjct: 6 ADTGRINVARYVIVITDGKSSDSVAEAAEGL------RANGVNIYAIGIR-EANIDELKE 58 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A F+ + D I KE+V+ Sbjct: 59 IAKDKIFFVYE-----FDLLKDIQKEVVQ 82 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 42/111 (37%), Gaps = 13/111 (11%) Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T++ L + +K G ++Y+I +TDG I ++L Sbjct: 138 GTRTGKALNFTLPFFDSSKG-------GRPSVQQYLIVITDGVAQDNVIIPAKAL----- 185 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + + I++AIGV Q L+ D+ Y N L + I ++ Sbjct: 186 -RDKNTIIFAIGVGEAKKSQLLEITNDEDKVYHDVNFEALQNLEKEILSKV 235 >gi|218512349|ref|ZP_03509189.1| hypothetical protein Retl8_00989 [Rhizobium etli 8C-3] Length = 222 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 50/142 (35%), Gaps = 16/142 (11%) Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 + VA ++RE+LD+I + ++ GL + + + +++++ Sbjct: 75 RADVAGDAVREVLDMIDESDSNHERIKVGLYSLGDTTKEVLAPTLDTSNARKRLSD-DSY 133 Query: 251 STTKSTPGLEYAYNKIFDAKEKL----EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 T +T + Y Y + A + + K ++ LTDG S K S Sbjct: 134 GLTSATS-MNYTYFDVALAALQKIVGTGGDGTSSANPLKLVLLLTDGVQSQRGWVVKNSS 192 Query: 307 F----------YCNEAKRRGAI 318 +C K + A Sbjct: 193 NLKKVAPLNPDWCGYVKNKSAT 214 >gi|333002369|gb|EGK21932.1| von Willebrand factor type A domain protein [Shigella flexneri VA-6] gi|333003420|gb|EGK22964.1| von Willebrand factor type A domain protein [Shigella flexneri K-272] gi|333017250|gb|EGK36570.1| von Willebrand factor type A domain protein [Shigella flexneri K-227] Length = 219 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNTGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E KR ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQGADMKTLAQ 171 >gi|307109844|gb|EFN58081.1| hypothetical protein CHLNCDRAFT_142385 [Chlorella variabilis] Length = 654 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 43/256 (16%), Positives = 83/256 (32%), Gaps = 42/256 (16%) Query: 110 RSTSLSIIIDDQHKDYNLSAVSRYEMP------FIFCTFPWCANSSHAPLLITSSVKISS 163 + ++ + Y V + +P C + IT V I Sbjct: 410 PEVQVEVLAKAEASVYLFIRVQQPAIPWWSGPTATICISDSEGACNVNTPPITPPVVIQ- 468 Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 D ++ +LD S S+ + G + + + RSI L+ ++ DV F Sbjct: 469 --DCSANVCFLLDGSKSLTN--VDGWNDVVASARSIMYSLNDPAAVFDV--------FWF 516 Query: 224 SSKIVQT-----FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 S+ + + +A + +N T + + + Sbjct: 517 SNDVEKIGHATGAEVAANNSFVSMVVNTTPDAHGTWMAQAITTCQGVLLEE--------- 567 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + I+ +TDG+ + D + + + AK RG + +G E LK AS Sbjct: 568 -DTQASRTIVLITDGKPT----DPQPTFEAADAAKARGIKMVVVG-AGEIDYATLKALAS 621 Query: 339 PDRFYSVQ---NSRKL 351 +F +S +L Sbjct: 622 GPQFVFANTNLDSSQL 637 >gi|218200417|gb|EEC82844.1| hypothetical protein OsI_27660 [Oryza sativa Indica Group] Length = 423 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 57/165 (34%), Gaps = 26/165 (15%) Query: 216 VRSGLVTFSSKIVQTFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R +V+FS + L G + + L T GL A K+FD + Sbjct: 20 DRLAVVSFSYNARRVIRLTRMSEDGKASAKSAVESLHADGCTNILKGLVEA-AKVFDGRR 78 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-------KRRG---AIVYA 321 +A +I L+DG+++ + N + KR G V+ Sbjct: 79 YRNAVAS--------VILLSDGQDNYNVNGGWGASNSKNYSVLVPPSFKRSGDRRLPVHT 130 Query: 322 IGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 G + + A + F ++N + DAF + IG + Sbjct: 131 FGFGTDHDAAAMHAIAEETGGTFSFIENQAVVQDAFAQCIGGLLS 175 >gi|319784283|ref|YP_004143759.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170171|gb|ADV13709.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 407 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 34/77 (44%), Gaps = 2/77 (2%) Query: 4 LNIRNFFYNCKGSISILTA-ILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 ++ F+ + +G+ +I TA +LP++ + G ++ ++L LD + L TK Sbjct: 2 FALKAFWSSERGNFAITTAFAMLPIMIGLAG-AVDLIGTSHDASQLQNSLDAAGLAIGTK 60 Query: 63 ILNQENGNNGKKQKNDF 79 + ++ F Sbjct: 61 FSPDMAAGDVQQLGLQF 77 >gi|291220856|ref|XP_002730441.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 925 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 70/200 (35%), Gaps = 33/200 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + L ++++LDVS SM ++ + ++ + + V N G+V FS Sbjct: 294 KEGDLRIVLILDVSGSM-----STQNRFMLMIQASTKYIGYT-----VPNGTWIGIVEFS 343 Query: 225 SKIVQTFPLAW--GVQHIQEKINRLIFGSTTKS--TPGLEYAYNKIFDAKEKLEHIAKGH 280 + L ++ QE I+ L + GLE + + Sbjct: 344 TNATILSYLEQVGDLESRQEIISDLPTTVVEDTCIGCGLEAGIEVLEYGGKDPAGG---- 399 Query: 281 DDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + LTDG EN +P I + +E + +V I ++A + A Sbjct: 400 -----IFLLLTDGRENVAPYISD-----VIDELVEKEIVVDTIAFSSQADPGLAELSAET 449 Query: 340 D----RFYSVQNSRKLHDAF 355 + S LHDAF Sbjct: 450 GGTAYWYSESDESTALHDAF 469 >gi|220897449|emb|CAX15333.1| complement component 2 (within H-2S) [Mus musculus] Length = 623 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 69/180 (38%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + + +S M++ I S V Sbjct: 114 KIIIQRSGHLNLYLLLDASQSVTEK------DFDIFKKSAELMVERIFSFEVN---VSVA 164 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS--------TTKSTPGLEYAYNKIFDAK 270 ++TF+S+ L+ Q + E I L S T + L Y+ + Sbjct: 165 IITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQM 224 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-----ESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ + K L + + +YAIGV Sbjct: 225 DRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVG 284 >gi|156097055|ref|XP_001614561.1| CTRP adhesive protein (invasive stage) [Plasmodium vivax SaI-1] gi|148803435|gb|EDL44834.1| CTRP adhesive protein (invasive stage), putative [Plasmodium vivax] Length = 2061 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 76/212 (35%), Gaps = 37/212 (17%) Query: 170 DMMMVLDVSLSMN-----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ +++D S S+ H P DK+ IK + N V +G++ FS Sbjct: 108 DLTLIIDESGSIGIKNWEKHVIPFTDKI-------------IKDLHIGENEVHAGILLFS 154 Query: 225 SKIVQTFPLAWGVQHIQ--------EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + I + + + + G+ TK L+YA EK H Sbjct: 155 NFIRDYVTFDEDESYKKDKLLKKVDQLKKKYAAGAGTKIVSALDYAL-------EKYTHH 207 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 KG + K I TDG N + + + + L +++ + +GV A A D L+ Sbjct: 208 KKGRPNAPKVTILFTDG-NDTSSSSSTKLLDMGLTYRKKNVKLLVLGV-AAAKDVNLRAI 265 Query: 337 A--SPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 A + D I K++ + Sbjct: 266 AGCGDKNVPCPYAMKAEWDTINDITKKLTNKI 297 >gi|148694788|gb|EDL26735.1| complement component 2 (within H-2S), isoform CRA_b [Mus musculus] Length = 753 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 69/180 (38%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + + +S M++ I S V Sbjct: 251 KIIIQRSGHLNLYLLLDASQSVTEK------DFDIFKKSAELMVERIFSFEVN---VSVA 301 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS--------TTKSTPGLEYAYNKIFDAK 270 ++TF+S+ L+ Q + E I L S T + L Y+ + Sbjct: 302 IITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQM 361 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-----ESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ + K L + + +YAIGV Sbjct: 362 DRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVG 421 >gi|148694789|gb|EDL26736.1| complement component 2 (within H-2S), isoform CRA_c [Mus musculus] Length = 809 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 69/180 (38%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + + +S M++ I S V Sbjct: 251 KIIIQRSGHLNLYLLLDASQSVTEK------DFDIFKKSAELMVERIFSFEVN---VSVA 301 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS--------TTKSTPGLEYAYNKIFDAK 270 ++TF+S+ L+ Q + E I L S T + L Y+ + Sbjct: 302 IITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQM 361 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-----ESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ + K L + + +YAIGV Sbjct: 362 DRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVG 421 >gi|148694790|gb|EDL26737.1| complement component 2 (within H-2S), isoform CRA_d [Mus musculus] Length = 755 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 69/180 (38%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + + +S M++ I S V Sbjct: 251 KIIIQRSGHLNLYLLLDASQSVTEK------DFDIFKKSAELMVERIFSFEVN---VSVA 301 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS--------TTKSTPGLEYAYNKIFDAK 270 ++TF+S+ L+ Q + E I L S T + L Y+ + Sbjct: 302 IITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQM 361 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-----ESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ + K L + + +YAIGV Sbjct: 362 DRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVG 421 >gi|157951694|ref|NP_038512.2| complement C2 precursor [Mus musculus] gi|3986766|gb|AAC84162.1| C2 [Mus musculus] gi|15029737|gb|AAH11086.1| Complement component 2 (within H-2S) [Mus musculus] gi|74147034|dbj|BAE27452.1| unnamed protein product [Mus musculus] gi|148694787|gb|EDL26734.1| complement component 2 (within H-2S), isoform CRA_a [Mus musculus] gi|220897448|emb|CAX15332.1| complement component 2 (within H-2S) [Mus musculus] Length = 760 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 69/180 (38%), Gaps = 23/180 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI + L++ ++LD S S+ + + +S M++ I S V Sbjct: 251 KIIIQRSGHLNLYLLLDASQSVTEK------DFDIFKKSAELMVERIFSFEVN---VSVA 301 Query: 220 LVTFSSKIVQTFP-LAWGVQHIQEKINRLIFGS--------TTKSTPGLEYAYNKIFDAK 270 ++TF+S+ L+ Q + E I L S T + L Y+ + Sbjct: 302 IITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMMQSQM 361 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-----ESLFYCNEAKRRGAIVYAIGVQ 325 ++L + + II LTDG+++ + K L + + +YAIGV Sbjct: 362 DRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYAIGVG 421 >gi|188494036|ref|ZP_03001306.1| von Willebrand factor type A domain protein [Escherichia coli 53638] gi|188489235|gb|EDU64338.1| von Willebrand factor type A domain protein [Escherichia coli 53638] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y +R Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAR 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|320195990|gb|EFW70614.1| Mg-chelatase subunit ChlD [Escherichia coli WV_060327] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|331658198|ref|ZP_08359160.1| conserved hypothetical protein [Escherichia coli TA206] gi|315299430|gb|EFU58681.1| von Willebrand factor type A domain protein [Escherichia coli MS 16-3] gi|331056446|gb|EGI28455.1| conserved hypothetical protein [Escherichia coli TA206] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|307311300|ref|ZP_07590944.1| VWA containing CoxE family protein [Escherichia coli W] gi|306908806|gb|EFN39303.1| VWA containing CoxE family protein [Escherichia coli W] gi|315061432|gb|ADT75759.1| conserved hypothetical protein [Escherichia coli W] gi|323377988|gb|ADX50256.1| VWA containing CoxE family protein [Escherichia coli KO11] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|300902231|ref|ZP_07120230.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|301305164|ref|ZP_07211263.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|300405673|gb|EFJ89211.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|300839568|gb|EFK67328.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|315255366|gb|EFU35334.1| von Willebrand factor type A domain protein [Escherichia coli MS 85-1] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|256017712|ref|ZP_05431577.1| hypothetical protein ShiD9_02267 [Shigella sp. D9] gi|332278727|ref|ZP_08391140.1| conserved hypothetical protein [Shigella sp. D9] gi|323183736|gb|EFZ69128.1| von Willebrand factor type A domain protein [Escherichia coli 1357] gi|332101079|gb|EGJ04425.1| conserved hypothetical protein [Shigella sp. D9] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|326442267|ref|ZP_08217001.1| hypothetical protein SclaA2_14434 [Streptomyces clavuligerus ATCC 27064] Length = 748 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 61/196 (31%), Gaps = 24/196 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-KSIPDVNNVVRSGLVT 222 + + +VLD S SM ++ G S + + + + ++ R +V Sbjct: 560 AAKGDARVYLVLDRSGSMRPYYKDG---------SAQNLGERVLALAAHLDEAARVEVVF 610 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 FS++I T L + +I+ L A E L AK Sbjct: 611 FSTEIDGTGELTATAH--RGRIDELHGSLGRMGRTSYHLAVE------EVLARHAKAAPG 662 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASP- 339 ++F TDG S G + A F L+ A+P Sbjct: 663 APALVVFQTDGAPESKTAATAALASA--ATAEPGMFWQFVAFGERDAKAFDYLRRLAAPN 720 Query: 340 -DRFYSVQNSRKLHDA 354 F++ L DA Sbjct: 721 AGFFHAGPEPAALTDA 736 >gi|254458950|ref|ZP_05072373.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084221|gb|EDZ61510.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 481 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 67/199 (33%), Gaps = 35/199 (17%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 TF + + P++ +K + D+++ +D+S SM +L A Sbjct: 35 IVTFVFIVLALTRPVVEQEPIKS---EQVLSDVVIAVDLSYSMQ-AMDVPPSRLKKAK-- 88 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI----QEKINRLIFGSTTK 254 +I+KS+ R G++ F++ + P+ + + ++LI + Sbjct: 89 -----EILKSLIKSEQKSRFGILGFTTNAIILSPMTEDSELLEHLFSSLDDKLIITKGSS 143 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 P LE A K ++ L+DG D AK+ Sbjct: 144 IMPALELA--------------RKMSQSKNLSMVILSDG------ADEISYEDEAKFAKK 183 Query: 315 RGAIVYAIGVQAEAADQFL 333 +V + + + + Sbjct: 184 NYLVVNVLMLATKTGGTLM 202 >gi|193069304|ref|ZP_03050260.1| von Willebrand factor type A domain protein [Escherichia coli E110019] gi|192957437|gb|EDV87884.1| von Willebrand factor type A domain protein [Escherichia coli E110019] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|110642328|ref|YP_670058.1| hypothetical protein ECP_2158 [Escherichia coli 536] gi|191169979|ref|ZP_03031533.1| von Willebrand factor type A domain protein [Escherichia coli F11] gi|300981544|ref|ZP_07175591.1| von Willebrand factor type A domain protein [Escherichia coli MS 200-1] gi|110343920|gb|ABG70157.1| hypothetical protein YehP [Escherichia coli 536] gi|190909495|gb|EDV69080.1| von Willebrand factor type A domain protein [Escherichia coli F11] gi|300307501|gb|EFJ62021.1| von Willebrand factor type A domain protein [Escherichia coli MS 200-1] gi|324013910|gb|EGB83129.1| von Willebrand factor type A domain protein [Escherichia coli MS 60-1] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|191165430|ref|ZP_03027272.1| von Willebrand factor type A domain protein [Escherichia coli B7A] gi|190904593|gb|EDV64300.1| von Willebrand factor type A domain protein [Escherichia coli B7A] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|157161603|ref|YP_001458921.1| von Willebrand factor type A domain-containing protein [Escherichia coli HS] gi|157067283|gb|ABV06538.1| von Willebrand factor type A domain protein [Escherichia coli HS] gi|323936734|gb|EGB33019.1| VWA domain containing CoxE protein [Escherichia coli E1520] Length = 378 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|89100228|ref|ZP_01173095.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085078|gb|EAR64212.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 469 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 69/214 (32%), Gaps = 39/214 (18%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-- 226 L++ ++LD S SM + G + +A SI E D+ L + K Sbjct: 164 LNVEILLDASGSMANQID-GKTMMDIAKESIGEF------AEDLPEGANVALRVYGHKGT 216 Query: 227 -IVQTFPLA------------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 Q L+ + + + + + T +E A N + K Sbjct: 217 GSDQDKELSCKSNELIYEMGDYNGEKLNSTLQTVKPAGWTPLAAAIEEAKNDLLSYKGDT 276 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ-F 332 I ++DG + +++ + IV IG +A Q Sbjct: 277 NTN---------IIYLVSDGVETCDGNPVEQAKSLGDS--DIQPIVNVIGFDLDAEGQNQ 325 Query: 333 LKNCASP--DRFYSVQNSRKL---HDAFLRIGKE 361 LK A F + +N +L D +I K+ Sbjct: 326 LKEVAKSAKGLFSNARNQSELKEELDKAKQIAKK 359 >gi|291236633|ref|XP_002738245.1| PREDICTED: calcium channel-like protein [Saccoglossus kowalevskii] Length = 1069 Score = 42.1 bits (97), Expect = 0.12, Method: Composition-based stats. Identities = 25/126 (19%), Positives = 52/126 (41%), Gaps = 8/126 (6%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FSSKIV 228 DMM+++DVS S+ G ++ + + S+ + L + + R+ V+ F + + Sbjct: 234 DMMILIDVSGSV---HGLVLELIKASAVSLIDTLGENDFVNIASFNERARFVSCFETFVQ 290 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG-HDDYKKYI 287 +++KI L+ G +A+ + KE + + + I Sbjct: 291 AN---ERNKNVMKDKIRLLVDNGIASFDIGFTFAFEQFKKFKETSAFQEENQGANCSQVI 347 Query: 288 IFLTDG 293 + LTDG Sbjct: 348 MLLTDG 353 >gi|281208974|gb|EFA83149.1| hypothetical protein PPL_03939 [Polysphondylium pallidum PN500] Length = 1103 Score = 42.1 bits (97), Expect = 0.12, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 53/161 (32%), Gaps = 19/161 (11%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 ++++ V D + SM+ ++ + K IP++ + GL F Sbjct: 171 SDVEIVFVFDTTGSMSSIIANVKSQVEATITRLT------KDIPNIKIGI-MGLGDFCDG 223 Query: 226 -KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 +++T L + I ++ + E+A + Sbjct: 224 QNVLKTLDLTDDKTKLINFIKKVPMTGGGDAPEAYEFALYM--------ANSLSWSAHTS 275 Query: 285 KYIIFLTD-GENSSPNIDNK-ESLFYCNEAKRRGAIVYAIG 323 K ++ + D G + D K + ++ +G +Y I Sbjct: 276 KALVMIGDEGPHPPSTTDLKINWIQQADDLAAKGVKIYGIR 316 >gi|284033848|ref|YP_003383779.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283813141|gb|ADB34980.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 425 Score = 42.1 bits (97), Expect = 0.12, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 47/137 (34%), Gaps = 32/137 (23%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +++D S SM G D + A + + LD I + V +++ + + FP Sbjct: 47 IIVDTSGSM------GADGVRAAAYAAQTALDQI------LDGVWFAVISGNDRAELAFP 94 Query: 233 LA----------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + Q ++ + R T L A A Sbjct: 95 PSAEPVMVRMDPYTRQAAKDAVARFYADGGTAMGTWLRLASRVF----------ATVPTL 144 Query: 283 YKKYIIFLTDGENSSPN 299 +K+ I LTDGEN + Sbjct: 145 TQKHAILLTDGENQHES 161 >gi|323493531|ref|ZP_08098653.1| hypothetical protein VIBR0546_14460 [Vibrio brasiliensis LMG 20546] gi|323312354|gb|EGA65496.1| hypothetical protein VIBR0546_14460 [Vibrio brasiliensis LMG 20546] Length = 624 Score = 42.1 bits (97), Expect = 0.12, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 53/146 (36%), Gaps = 25/146 (17%) Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NR 246 K T++ + LD++K+ + + +GLV ++ Q P+ Q I + Sbjct: 104 KPNRLTQARFKALDLLKNWQEGS----TGLVAYAGDAYQVSPMTNDSQTIANLLPNLSPE 159 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 L+ + G++ A + +A I+ ++D +IDN E Sbjct: 160 LMPYPGADAGAGVKLAIEMMTNAGLASGD-----------IVLISD------DIDNAEQQ 202 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 N+ K + +GV + Sbjct: 203 AIENQLKGTSWRLVILGVGTRSGAPI 228 >gi|14248651|gb|AAK57611.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.12, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E +T T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGSTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|14248653|gb|AAK57612.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248655|gb|AAK57613.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248657|gb|AAK57614.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248659|gb|AAK57615.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248661|gb|AAK57616.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.12, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E +T T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGSTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|14248663|gb|AAK57617.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.12, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E +T T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGSTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|14248649|gb|AAK57610.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.12, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E +T T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGSTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|323464488|gb|ADX76641.1| conserved hypothetical protein [Staphylococcus pseudintermedius ED99] Length = 629 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 56/164 (34%), Gaps = 19/164 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++D S SM+ DK+ + + + +K++ + + + F+ Sbjct: 437 FTLLIDASASMH-------DKMDETIKGVVLFHETLKALNVKHEI-----LAFNEDA-FD 483 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK---EKLEHIAKGHDDYKKYI 287 + I E IN T++ + + ++++ Sbjct: 484 ADAQYQPNIIDEIINYHQSTFNTEAPRIMSLTPQDDNRDGVAIRVASDRLLSRSEQQRFL 543 Query: 288 IFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGVQAEA 328 I +DGE S+ N ++ +++ G V+ + + E Sbjct: 544 IVFSDGEPSAFNYSQDGILDTYEAVENSRKLGIEVFNVFLSQEP 587 >gi|313676404|ref|YP_004054400.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312943102|gb|ADR22292.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 618 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 65/184 (35%), Gaps = 19/184 (10%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 PW + + V + +++ +LDVS SM +KL + ++ + Sbjct: 228 PWNKKHQLVHIGLQGKVIPTENLPAS-NLVFLLDVSGSM-----FAQNKLPLLKSGLKML 281 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 +D ++ V+ VV +G V I E + L G +T G+E A Sbjct: 282 VDQLREEDKVSIVVYAGAAG----CVLPPTSGNEKDKIIEALQNLQAGGSTAGGAGIELA 337 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y + K + II TDG+ + N+ + ++ G + + Sbjct: 338 YKIAKENFIKEGNNR---------IILATDGDFNVGASSNEAMEDLIEKKRKEGVFLTVL 388 Query: 323 GVQA 326 G Sbjct: 389 GFGM 392 >gi|291386245|ref|XP_002710074.1| PREDICTED: von Willebrand factor A domain containing 3B [Oryctolagus cuniculus] Length = 1381 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 56/170 (32%), Gaps = 30/170 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 IYVLIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWQEKLA 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GSTT + L+ A+ KE I L Sbjct: 562 EINEENLEQAQSWIRDIKVGSTTNTLDALQIAFT----DKETQA------------IYLL 605 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKNCAS 338 TDG P + + + +YAI E A+ FLK AS Sbjct: 606 TDGRPDQPPEMVMDQVRVFQK-----IPIYAISFNYHDEIANTFLKELAS 650 >gi|256394484|ref|YP_003116048.1| von Willebrand factor type A [Catenulispora acidiphila DSM 44928] gi|256360710|gb|ACU74207.1| von Willebrand factor type A [Catenulispora acidiphila DSM 44928] Length = 583 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 65/213 (30%), Gaps = 27/213 (12%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF-GPGMDKLGVATRSIREM 202 A ++ + + L M+ V+DVS SM G G +L + + + Sbjct: 354 LAAAKRADYGAAGQALTLWASLTKQLRMLAVVDVSGSMAQAVPGTGQTRLQLTAAASEKA 413 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + + S GL TF++ ++ G T + A Sbjct: 414 MALFGSHAA------MGLWTFTTTHDAAGSTVIDQVLPIAELGAAEPGGGTH-GQRMIAA 466 Query: 263 YNKIFDAKEKL-----------EHIAKGHDDY-KKYIIFLTDGENSSPNIDNKESL---- 306 Y + D +++ KG D + TDG++ N + L Sbjct: 467 YGALADKAGSRNGLYDVLLAAYQNVQKGWDPTRTNTVAVFTDGKDDDLNSMTSDQLIAKL 526 Query: 307 -FYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 + A+ V+ + + + L + Sbjct: 527 QAAVDPARP--IRVFVVALGTDVDLTLLNKITA 557 >gi|254390370|ref|ZP_05005587.1| hypothetical protein SSCG_02914 [Streptomyces clavuligerus ATCC 27064] gi|294813849|ref|ZP_06772492.1| toxic cation resistance protein [Streptomyces clavuligerus ATCC 27064] gi|197704074|gb|EDY49886.1| hypothetical protein SSCG_02914 [Streptomyces clavuligerus ATCC 27064] gi|294326448|gb|EFG08091.1| toxic cation resistance protein [Streptomyces clavuligerus ATCC 27064] Length = 791 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 61/196 (31%), Gaps = 24/196 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-KSIPDVNNVVRSGLVT 222 + + +VLD S SM ++ G S + + + + ++ R +V Sbjct: 603 AAKGDARVYLVLDRSGSMRPYYKDG---------SAQNLGERVLALAAHLDEAARVEVVF 653 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 FS++I T L + +I+ L A E L AK Sbjct: 654 FSTEIDGTGELTATAH--RGRIDELHGSLGRMGRTSYHLAVE------EVLARHAKAAPG 705 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASP- 339 ++F TDG S G + A F L+ A+P Sbjct: 706 APALVVFQTDGAPESKTAATAALASA--ATAEPGMFWQFVAFGERDAKAFDYLRRLAAPN 763 Query: 340 -DRFYSVQNSRKLHDA 354 F++ L DA Sbjct: 764 AGFFHAGPEPAALTDA 779 >gi|167519555|ref|XP_001744117.1| hypothetical protein [Monosiga brevicollis MX1] gi|163777203|gb|EDQ90820.1| predicted protein [Monosiga brevicollis MX1] Length = 3700 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 68/220 (30%), Gaps = 36/220 (16%) Query: 118 IDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDV 177 +D KDY + + F FC F C VK ++ +G + LD Sbjct: 3353 VDKLTKDYEAGKIRTPGL-FHFCKFQACTR-----------VKSAADVAVGQGSLFGLDE 3400 Query: 178 SLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 S SM D S + + ++N R +V +S + LA Sbjct: 3401 SGSMGG------DNWDALLNSYSSFMQS-RVHDELNLADRVTVVQYSDRA--RTTLAKAS 3451 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 + G T A ++ + + +I + +IF++DG+ Sbjct: 3452 MREAAAFVPQMNGGGTD----FNVAIQELRGQGKTMSNIFRP------VLIFMSDGQAYD 3501 Query: 298 PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 P + + + + + A L+ A Sbjct: 3502 PRTELERMKAELPHMSS-----FMVALGPNAQVAVLQGMA 3536 >gi|241113475|ref|YP_002973310.1| Tetratricopeptide TPR_2 repeat protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861683|gb|ACS59349.1| Tetratricopeptide TPR_2 repeat protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 517 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 40/252 (15%), Positives = 77/252 (30%), Gaps = 38/252 (15%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + R M + P A T + + +++ +D+S +M D Sbjct: 49 VVDTSGRSRMRPSWLLAPLLATGIAGAAGPTWQREPPPFVEDTAPLIIAVDLSQTM-DAI 107 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 +L A I+++++ V R+ ++ ++ P ++ Sbjct: 108 DVTPSRLERAKLKIKDVIE-------VRQGARTAIIAYAGTAHLVLPPTEDAALLESYSE 160 Query: 246 RLIFGSTTKS--TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK 303 L T+ TPG + A + + G I+ LTDG ++ K Sbjct: 161 AL----ATRIMPTPGKDTAAALLLGQSLLDKEGVAGT------ILLLTDGVEAAAVQTLK 210 Query: 304 ESLFYCNEAKRRGAIVYAIGVQAEA-----ADQFLKNCASPDRFYSVQNSRKLHDAFLRI 358 S G ++ IG A FL + AS R + + + Sbjct: 211 SS--------GDGVVILGIGTSAGGPVKTPDGGFLSD-ASGARLFPKLD----VAGLQAV 257 Query: 359 GKEMVKQRILYN 370 G+E Sbjct: 258 GREAGADVATIT 269 >gi|332797831|ref|YP_004459331.1| von Willebrand factor type A domain-containing protein [Acidianus hospitalis W1] gi|332695566|gb|AEE95033.1| von Willebrand factor type A domain protein [Acidianus hospitalis W1] Length = 452 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 62/188 (32%), Gaps = 42/188 (22%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++LD S SM+ + +A S + + +R F I Sbjct: 290 IYLLLDKSGSMDGEKILWAKAVALALYSRARR-------ENRDFYLR-----FFDNI--P 335 Query: 231 FPL--------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 +PL + V + E I ++ G T + + A I + KG + Sbjct: 336 YPLIKVIKNAKSKDVIKMIEYIGKIRGGGGTDISRSVISACEDIKEGH------VKGVSE 389 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 +I LTDGE+ + SL K A + I V + L+ D + Sbjct: 390 ----VIILTDGEDKIAETTVRRSL------KDANATL--ISVMIRGDNADLRRI--SDTY 435 Query: 343 YSVQNSRK 350 V + Sbjct: 436 LVVYKLDQ 443 >gi|299138185|ref|ZP_07031365.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600115|gb|EFI56273.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 382 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 60/170 (35%), Gaps = 16/170 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNVVRSGLVT 222 +D+ L + +++D S S + + LD ++ ++ +V Sbjct: 79 DTDVPLTLGLLVDTSQSQR-------TVIDSERSASGTFLDKMLAPASANRESDKAFVVQ 131 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL-----EYAYNKIFDAKEKLEHIA 277 F+ ++ +++ + L + + S+ + ++DA Sbjct: 132 FAREVELLQDPTDSRSKLKQALKELDTTAPSTSSSTGDDSGHAHGGTTLYDAVFLSADEV 191 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 ++ +I LTDG + + E++ A I+YAI + E Sbjct: 192 TSKQKGRRALILLTDGVDRNSKESIAEAIEAAQRADT---IIYAIYFKGE 238 >gi|224051388|ref|XP_002199708.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 417 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 70/212 (33%), Gaps = 29/212 (13%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y+MP F T L + S +++ ++D S S+ + Sbjct: 197 SYQMPTWFGTTK-YVKPLVQKLCSHEQMLCSKTCYNSVNIGFLIDGSSSIGE-------- 247 Query: 192 LGVATRSIREMLDIIKSIPDVNNV-VRSGLVTFSSKIVQTFPLAW--GVQHIQEKI-NRL 247 + R + E + + +++++ + V F+ + F + + I N Sbjct: 248 --INFRLMLEFVSNVAKAFEISDIGSKVAAVQFTYNQRKEFGFTDHVTKEKVLSAIHNIQ 305 Query: 248 IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 T + + + +F + + K ++I LTDG++ + Sbjct: 306 YMSGGTATGDAISFTTRTVFGPVKDGPN--------KNFLIVLTDGQSYDDVTGPAAAAK 357 Query: 308 YCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 G V+++GV D + + P Sbjct: 358 K------AGITVFSVGVAWAPLDDLKEMASEP 383 >gi|332092340|gb|EGI97414.1| von Willebrand factor type A domain protein [Shigella dysenteriae 155-74] Length = 232 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E KR ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQGADMKTLAQ 171 >gi|328946175|gb|EGG40320.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1087] Length = 450 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 61/199 (30%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM G +D + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSM----GATIDIVRANIN------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVFGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|320177885|gb|EFW52869.1| hypothetical protein SGB_04927 [Shigella boydii ATCC 9905] Length = 219 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E KR ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQGADMKTLAQ 171 >gi|291484531|dbj|BAI85606.1| hypothetical protein BSNT_03154 [Bacillus subtilis subsp. natto BEST195] Length = 638 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 61/174 (35%), Gaps = 29/174 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++I +++D S SM DK+ R I + +KS+ + +V Sbjct: 434 KQEPSTEIDAVFTLLVDCSASM-------FDKMDETKRGIVLFHEALKSVAVPHQIV--- 483 Query: 220 LVTF----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY----NKIFDAKE 271 F + + P + + S P + N+ A Sbjct: 484 --GFWEDTNDATETSQPNYFNT------VIPFQSSLRQDSGPAIMQLEPEEDNRDGYAIR 535 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 ++ + +K++I +DGE ++ + ++ EA++RG V + Sbjct: 536 QMTKKMLHRSEAQKFLIVFSDGEPAAFGYEQNGIVDTSEAVIEARKRGIEVINV 589 >gi|296124235|ref|YP_003632013.1| hypothetical protein Plim_4003 [Planctomyces limnophilus DSM 3776] gi|296016575|gb|ADG69814.1| protein of unknown function DUF1355 [Planctomyces limnophilus DSM 3776] Length = 1023 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 51/163 (31%), Gaps = 32/163 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +M VLD S S+ + +R + + + P + GL F Sbjct: 112 SLMYVLDTSDSIGRS---------AKDQVLRYIAETVTKKPARDEA---GLSVFGRNAAV 159 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + E +N I G T L + + D + I+ Sbjct: 160 ELPPR--TTFLAEALNTDIRGDATNIEQALSLSSAMLPDDQAGK-------------IVL 204 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 +DG + ++D +E K R V + ++ + + Sbjct: 205 FSDGSQTEGSLD-----RILDELKSRKISVDVVPIEYDYEHEV 242 >gi|194431507|ref|ZP_03063799.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|194420332|gb|EDX36409.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] Length = 238 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 65/172 (37%), Gaps = 14/172 (8%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNAGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P L T + A + + + K E+ A G Sbjct: 68 F-GPVHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISY 121 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E KR ++IGVQ + Sbjct: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQGADMKTLAQ 171 >gi|291232483|ref|XP_002736186.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii] Length = 1797 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 58/187 (31%), Gaps = 34/187 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + +++DVS SM H +KL R + E L + R +T+ ++V Sbjct: 523 QVYILIDVSNSMEPHLDLVKEKL---IRLMEEQLRHKMKFNLIKFGTR--AMTWRDRMVD 577 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + + L +T + L+ A D + + Sbjct: 578 VNEA--NLHSAWSWVRGLTVTGSTNTLSALKLAL----------------SDPNTQAVYL 619 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ---AEAADQFLKNCASPD---RFY 343 LTDG P + ++ ++ I EA + A ++ Sbjct: 620 LTDGRPDMPQKTVLAQVQL-----QQKVPIHTISFNCADTEANQFLAQLAADTGGRYHYF 674 Query: 344 SVQNSRK 350 S Q +R Sbjct: 675 SEQGTRD 681 >gi|171186268|ref|YP_001795187.1| hypothetical protein Tneu_1824 [Thermoproteus neutrophilus V24Sta] gi|170935480|gb|ACB40741.1| conserved hypothetical protein [Thermoproteus neutrophilus V24Sta] Length = 426 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 54/142 (38%), Gaps = 22/142 (15%) Query: 159 VKISSKSDIGLD----MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 K S D LD + +++D S SM G+ + + + + + Sbjct: 250 TKSLSIYDTSLDTREKIYLLIDKSGSMFYTLYDGV-AMDMTQKITWATALAVALMKRSR- 307 Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 R+ L F + P + + I + + R++ T T + A I DAK++ Sbjct: 308 --RTVLRFFDQMVY---PPIYSTKEIIKSLLRVLPLGGTDITAAVHTA---IRDAKQQGL 359 Query: 275 HIAKGHDDYKKYIIFLTDGENS 296 H K ++ +TDGE+ Sbjct: 360 HSYK--------LVIVTDGEDD 373 >gi|302804200|ref|XP_002983852.1| hypothetical protein SELMODRAFT_423100 [Selaginella moellendorffii] gi|300148204|gb|EFJ14864.1| hypothetical protein SELMODRAFT_423100 [Selaginella moellendorffii] Length = 532 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 63/183 (34%), Gaps = 15/183 (8%) Query: 170 DMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVTFSS 225 + ++LD S SM++ G + VA+ I ++L+ + + V G + S Sbjct: 196 SLYILLDTSTSMSNPTGVLSSQTRFNVASNIITQLLNTLTNGDQVAVSTIGGEKIGAPVS 255 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ + + I + + S T S ++ D ++ Sbjct: 256 VVLDVQETSLYLAGISSLKDSISNTSVTNSASNIKNGLQAALDFFNTSSNLN-------- 307 Query: 286 YIIFLTDGE---NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRF 342 II TDG+ + N + A+ VY IG F + +S + Sbjct: 308 VIILFTDGQFVTPGNFNFTQLSPVLAQLNARNVVVFVYRIGSFTSNDATFQQMQSSLNMS 367 Query: 343 YSV 345 Y V Sbjct: 368 YEV 370 >gi|298251755|ref|ZP_06975558.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297546347|gb|EFH80215.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 224 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 68/201 (33%), Gaps = 39/201 (19%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI----PDVNNVVRSGL 220 S L + + D S SM DK+ +IRE + ++ + P +VR Sbjct: 2 SGRPLHFIWIADCSGSMAG------DKIQSLNTAIREAIPEMRGVAHSNPHARVLVR--A 53 Query: 221 VTFS----SKIVQTFPLA---WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + FS + Q P+ W L G T + A + D + Sbjct: 54 LKFSSGATWHVTQPVPIEQFQW---------QDLQAGGVTD----MGKALLMVADELKMP 100 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 +G ++ ++DG + ++ K+ + AIG+ + + L Sbjct: 101 PMDPRGLPP---VLVLISDGYPTDDVNKGISAILDQPWGKKA-VRI-AIGIGHDVDNNVL 155 Query: 334 KN-CASPD-RFYSVQNSRKLH 352 + P+ + N+ +L Sbjct: 156 QRFINHPEIQPLQAHNAEQLV 176 >gi|153809330|ref|ZP_01961998.1| hypothetical protein BACCAC_03643 [Bacteroides caccae ATCC 43185] gi|149128100|gb|EDM19321.1| hypothetical protein BACCAC_03643 [Bacteroides caccae ATCC 43185] Length = 611 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 72/196 (36%), Gaps = 23/196 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 ++ PW + + + + +I + + +++ ++DVS SM G ++L Sbjct: 211 VKITMEAGACPWNPANRLVRIGLKAK-EIPTDNLPASNLVFLIDVSGSM-----WGANRL 264 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFG 250 + S++ +++ ++ + + +VT+S G Q I+E I+ L G Sbjct: 265 DLVKSSLKLLVN------NLRDKDKVAIVTYSGSAGVKLESTSGSDKQKIREAIDELTAG 318 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+ AY + II +DG+ + + Sbjct: 319 GSTAGGAGIMLAYKIAKKNFISNGNNR---------IILCSDGDFNVGVSSAEGLEQLIE 369 Query: 311 EAKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 370 RERKSGVFLTVLGYGM 385 >gi|296121941|ref|YP_003629719.1| hypothetical protein Plim_1690 [Planctomyces limnophilus DSM 3776] gi|296014281|gb|ADG67520.1| conserved hypothetical protein [Planctomyces limnophilus DSM 3776] Length = 299 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 22/104 (21%), Positives = 43/104 (41%), Gaps = 12/104 (11%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLG--VATRSIREMLDIIKSIPDVNNVVRSGLV 221 +++ + M+++D+S SM F ++K + + L I + P GL+ Sbjct: 79 EAETNISGMLLMDLSASMGYTFRQQLNKFEYSICLAAALAYLMIHQQDP-------VGLM 131 Query: 222 TFSSKIVQTFPLAWGVQHIQE---KINRLIFGSTTKSTPGLEYA 262 TF + I + P + E ++RL TT ++ A Sbjct: 132 TFDTNINASLPPRSRRSQLAELLGLLSRLSPTGTTDFGTSMKRA 175 >gi|227819050|ref|YP_002823021.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] gi|227338049|gb|ACP22268.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] Length = 155 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 32/74 (43%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R + + G+++++ A+ LPV+ MGL ET + + KL + D S A + Sbjct: 18 RGWLKDESGTVAVIAAVTLPVLVGAMGLGAETGYWYLKDRKLQHAADVSAHAAAVRYRAG 77 Query: 67 ENGNNGKKQKNDFS 80 + + + Sbjct: 78 DQKPALETTAKRIA 91 >gi|260824043|ref|XP_002606977.1| hypothetical protein BRAFLDRAFT_64962 [Branchiostoma floridae] gi|229292323|gb|EEN62987.1| hypothetical protein BRAFLDRAFT_64962 [Branchiostoma floridae] Length = 219 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 42/112 (37%), Gaps = 4/112 (3%) Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 G GM +L A R ++ + + V +V F + L + +Q ++ Sbjct: 3 GTGMLELRKAVRQFLAGVEQTATQTGLKENV--AVVEFGGGVRIVQHLTNDYRCVQRAVD 60 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 L G TT GL A ++ L + G +I +TDG+ + Sbjct: 61 NLRPGGTTPMFEGLMEALKELCQNGGVL--VLPGGIRMTPRVILMTDGKPDN 110 >gi|294628791|ref|ZP_06707351.1| toxic cation resistance protein [Streptomyces sp. e14] gi|292832124|gb|EFF90473.1| toxic cation resistance protein [Streptomyces sp. e14] Length = 241 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 61/170 (35%), Gaps = 24/170 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +V+D S SM +F G + +++ + + + +++ + +V FS Sbjct: 33 AGQRAAVYLVVDHSGSMRPYFRNG------SVQALADRV--LGLAAHLDDDGQVPVVFFS 84 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + + +A Q ++ R+ G A + + D H Sbjct: 85 TDVDAVTEIALDDH--QGRVERIAAGLGHMGRTNYHLAMDAVID------HYLDSGATDP 136 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY--AIGVQAEAADQF 332 ++F TDG +P + C AK ++ IG + QF Sbjct: 137 ALVVFQTDG---APTSRPAAERYLCKAAK---LPLFWQFIGFGNPRSRQF 180 >gi|91842225|gb|ABE66384.1| truncated integrin beta1 subunit-like protein 1 [Danio rerio] Length = 613 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 51/153 (33%), Gaps = 33/153 (21%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++K D +D+ ++D+S SM + L ++ K + D+ +R Sbjct: 126 TLKFKRAEDYPIDLYFLMDLSHSMLSNLEN----LKNLG------FELAKEMKDITKDLR 175 Query: 218 SGLVTF----SSKIVQTFP--------------LAWGVQHIQEKINRLIFGSTTKSTPGL 259 G +F S + FP L ++I++L S+ Sbjct: 176 IGFGSFFRKPSIQTNPCFPDNCIAPFSYFNVLSLTDDHALFTQEISKLKTSGNLDSSEA- 234 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + A + G + + ++F TD Sbjct: 235 --GLEALMQAA--VCTDVIGWRNATRVLVFFTD 263 >gi|307596010|ref|YP_003902327.1| magnesium chelatase ChlI subunit [Vulcanisaeta distributa DSM 14429] gi|307551211|gb|ADN51276.1| magnesium chelatase ChlI subunit [Vulcanisaeta distributa DSM 14429] Length = 647 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 29/128 (22%), Positives = 47/128 (36%), Gaps = 19/128 (14%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQT 230 ++VLD S SMN M ++ VA +R + + GL++F + + Sbjct: 483 LIVLDASGSMNF-----MRRIEVAKGLVRRIAE-----ESYVKRSYVGLISFRGRGVDVI 532 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + + L G T + L A + I K DY Y+I Sbjct: 533 IEPTRNYWQVLSTLEGLPSGGATPLSAALACAVDLIKRLG------LKLRGDYWVYVI-- 584 Query: 291 TDGENSSP 298 TDG+ + P Sbjct: 585 TDGKANVP 592 >gi|301772358|ref|XP_002921601.1| PREDICTED: integrin alpha-2-like [Ailuropoda melanoleuca] Length = 1369 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 43/240 (17%), Positives = 85/240 (35%), Gaps = 33/240 (13%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + T C++ S L+TS + +D+++V D S S + Sbjct: 331 YYTTGVCSDVSADFQLLTSFAPAAQACPSLIDVVVVCDESNS--------IYPWEAVKNF 382 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + + + + P GL+ +++K F + + K + S T G Sbjct: 383 LEKFVQSLDIGPKKTQ---VGLIQYANKPRVIF----NLNTFKTKAEMIEATSQTYQYGG 435 Query: 259 -LEYAYNKIFDAKE-KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L + I AK+ A G K ++ +TDGE+ ++ K + C+ Sbjct: 436 DLTNTFKAIQYAKDFAYAAGAGGRLGAAKVMVVVTDGESHDGSM-LKAVIDQCD---NDN 491 Query: 317 AIVYAIGV------QAEAADQF---LKNCAS---PDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + I V A +K AS F++V + L + +G+++ Sbjct: 492 ILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTETFFFNVSDEAALLEKAGTLGEQIFS 551 >gi|270008952|gb|EFA05400.1| hypothetical protein TcasGA2_TC015572 [Tribolium castaneum] Length = 767 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 53/367 (14%), Positives = 101/367 (27%), Gaps = 54/367 (14%) Query: 38 TSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK-QKNDFSYRIIKNIWQTDFRNEL 96 TS K++ +L I E +GK + K I+ + L Sbjct: 29 TSKVKNTKSQAEEATFSVILPENAFISEFEMEIDGKVYKAYVKEKDEAKTIYTRAVSSGL 88 Query: 97 RENGFAQDINNIER-STSLSIIIDD--------------QHKDYNLSAVSRYEMPFIFCT 141 + + ++ + S++I Q+ Y L Sbjct: 89 SAGHVELNARDSKKFTVSVNIEPSSETIFRLTYEELLQRQNGQYELIINVHPG----QIV 144 Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIR 200 C S + ++ VLD S SM + M + + Sbjct: 145 DDLCVEVLLRDGYFVHFFAPSGLQTLPKHVVFVLDHSGSMRGRKYEQLMQAMDKILSDLN 204 Query: 201 --EMLDIIKSIPDV-------NNVVRSGLVTFSSKIVQTFPLA------------WGVQH 239 ++ I++ DV N + LA ++ Sbjct: 205 PDDLFHIVRFSDDVSVWNLEKNKFDQIRFEQMPDYENLDTSLAELNLGEAIQVTEDNIKK 264 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK--LEHIAKGHDDYKKYIIFLTDGENSS 297 + + + T GL + +K +++A H IIFLTDG + Sbjct: 265 AKRIKDDDVNMGCTNIIGGLAVGLYLVRRTLQKFYEKNVATKHQPM---IIFLTDGLPNV 321 Query: 298 PNIDNKESLFYCNEAK--RRGAIVYAIGVQAEAADQFLKNCAS-----PDRFYSVQNSRK 350 + E + A ++++ +A FLK ++ Y ++ Sbjct: 322 GISNPDEITKIVTKINQGTNRAAIFSMSFGEDADKNFLKKLSAQNLGFSRHIYEAADAAL 381 Query: 351 LHDAFLR 357 F R Sbjct: 382 QLQNFYR 388 >gi|226228509|ref|YP_002762615.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226091700|dbj|BAH40145.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 699 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 20/102 (19%), Positives = 36/102 (35%), Gaps = 12/102 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++++LD S+SM G L A R++ + L+ R LV + Sbjct: 94 AVVILLDRSMSMGYT-GVWPRALDSA-RAVIDRLEGKD---------RVALVAYDDAAEV 142 Query: 230 TFPLAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAK 270 L ++ + + T+ P L A + DA Sbjct: 143 MQRLTDDRAAVRGSLGAVQPMRRGTRLAPALRTARQLLLDAP 184 >gi|218458154|ref|ZP_03498245.1| hypothetical protein RetlK5_01327 [Rhizobium etli Kim 5] Length = 156 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 44/143 (30%), Gaps = 33/143 (23%) Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG---------------ENSSP 298 + A I + D +K + F+ DG N Sbjct: 11 DQQTSFDEALKGIEGEITGNIGKGTSNADRQKIVFFVADGVADSYKPSGCTSPKGANGGR 70 Query: 299 NIDNKESLFYCNEAKRRGAIV---YAIGVQAEAADQF--------------LKNCASPDR 341 I+ ++ YC + K RG V Y + + ++ CA+P Sbjct: 71 CIEPIDT-TYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDWVKPFETKIAAKMEECATPGF 129 Query: 342 FYSVQNSRKLHDAFLRIGKEMVK 364 +++V + + A + ++V Sbjct: 130 YFAVSPTEGIEAAMKALFLKIVS 152 >gi|198419856|ref|XP_002122374.1| PREDICTED: similar to predicted protein, partial [Ciona intestinalis] Length = 960 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSM--NDHFGPGMDKLGVATRSI--REMLDIIKSIPDVN 213 + + + D+++ +DVS SM N+ G +D ++ + + +I D Sbjct: 204 PCNVQASTPKPKDVIIAIDVSGSMIINNRIGAAIDAATTVLNTLSPNDRVTVITFSDDAK 263 Query: 214 ------------NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 RS L F++ + P + H+ + ++ T L+ Sbjct: 264 SLGAVHCMKASAQPTRSSLC-FNNMMASATP--HNIHHLVGLVKQIKPHGDTYYVTALKL 320 Query: 262 AYNKIFDA-------KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKR 314 +++ + A K + I+FL+DG S + + N A + Sbjct: 321 SFDFLESAYKWDLINSSNSFTKDKVAKSRDRVILFLSDGVPSDSPFRIFKLIKLRNLAMQ 380 Query: 315 RGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ + LK AS + + Sbjct: 381 NSVVLLCYELGKGTFGPALKLMASQNFTF 409 >gi|119358222|ref|YP_912866.1| hypothetical protein Cpha266_2454 [Chlorobium phaeobacteroides DSM 266] gi|119355571|gb|ABL66442.1| protein of unknown function DUF58 [Chlorobium phaeobacteroides DSM 266] Length = 296 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 43/126 (34%), Gaps = 25/126 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M++++D S SM + R + + + + + N + GL+ F+ +I Sbjct: 84 MLLLVDGSASMLF------GSRKRSKRDLALEVSAVLAYSAIQNNDKVGLLVFTDRIETF 137 Query: 231 FPLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P A G + + ++ L T T L + + K Sbjct: 138 IPPAKGRRQVLVILDALFNLQPENRNTDITAALSF---------------VRFTQKRKAI 182 Query: 287 IIFLTD 292 I LTD Sbjct: 183 IFLLTD 188 >gi|14248675|gb|AAK57623.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|213966336|ref|ZP_03394518.1| secreted Mg-chelatase subunit [Corynebacterium amycolatum SK46] gi|213951042|gb|EEB62442.1| secreted Mg-chelatase subunit [Corynebacterium amycolatum SK46] Length = 543 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 68/195 (34%), Gaps = 29/195 (14%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT----FSSKIVQ 229 VLD S SM FG + L R+I +D + +R+ + FS+ + + Sbjct: 364 VLDTSGSM---FGDRIADLQATMRAI---VDGSARTETGSVGLRNREIATILPFSTSVGE 417 Query: 230 TFPLAWG----VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + ++ L T L A++ + + + Sbjct: 418 PTTTTIDGPESRAQLTAAVDGLYAEGETALYDALIQAFDLLGSSDKNSIPS--------- 468 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGA-IVYAIGVQAEAADQFLKNC-ASPDRFY 343 I+ LTDG+ +S ++ FY ++K V+ I + + + + + Sbjct: 469 -IVVLTDGQVTSGKTFSEFRDFY--QSKGGNLPPVFVIRYGEADPGEMQELANLTGGKVF 525 Query: 344 SVQNSRKLHDAFLRI 358 + + +L D F I Sbjct: 526 ESRET-ELADVFKEI 539 >gi|209560161|ref|YP_002286633.1| Serum opacity factor [Streptococcus pyogenes NZ131] gi|209541362|gb|ACI61938.1| Serum opacity factor [Streptococcus pyogenes NZ131] Length = 1026 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 58/162 (35%), Gaps = 18/162 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + D G D+M +LDVS M ++F +++ ++ K + N VR L Sbjct: 221 PKQIDEGADVMALLDVSKKMTQENFNKAKEQIKKMVTTLTGESTDGKENHNRRNSVR--L 278 Query: 221 VTFSSKIVQTFPLAW-GVQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 +TF K+ L V E++ + + + A + KE + Sbjct: 279 MTFYRKVSDPIELTTKNVDAKLEEVWEQAKKDWDWGVDLQGAIHRARDIFKKEKESKKR- 337 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 ++I+ + GE++ N + N ++ Sbjct: 338 --------QHIVLFSQGESTFSYDINDKDKN--NTVRKNRIT 369 >gi|162448738|ref|YP_001611105.1| hypothetical protein sce0468 [Sorangium cellulosum 'So ce 56'] gi|161159320|emb|CAN90625.1| hypothetical protein sce0468 [Sorangium cellulosum 'So ce 56'] Length = 1041 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 67/198 (33%), Gaps = 32/198 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 L ++ +D+S SM G KL +A + +S + R G+ + Sbjct: 439 RASLAEVIGIDISGSMAATAGAH-TKLELANEA------AARSASLLGAGDRLGVAHVDT 491 Query: 226 KIVQTFPLA--WGVQHIQEKINRLIFGSTTK-STPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + + PL I++ I + G LE AY + L+H Sbjct: 492 AVRWSVPLGPVADGAAIEKAIRAVGPGGGGIYVDITLEAAYQALDRDTSSLKH------- 544 Query: 283 YKKYIIFLTDG---ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-- 337 ++ DG EN P A RRG + + + L+ + Sbjct: 545 ----VLLFADGSDAENMGP------CRAMVEAAMRRGTTTSVVALGQGSDVPELEALSRL 594 Query: 338 SPDRFYSVQNSRKLHDAF 355 RFY V+++ +L F Sbjct: 595 GGGRFYLVEDATRLPAVF 612 >gi|153833207|ref|ZP_01985874.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|148870478|gb|EDL69393.1| conserved hypothetical protein [Vibrio harveyi HY01] Length = 515 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 37/277 (13%), Positives = 85/277 (30%), Gaps = 45/277 (16%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNG 72 KG ++ +LL I +E + ++L + + + + ++ + Sbjct: 14 QKGVAAVWMGLLLVPIMGFTFWAVEGTRYVQETSRLRDSAEAAAMA----VTIEDQAGSA 69 Query: 73 KKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSR 132 + +++I + + +E + Y ++A + Sbjct: 70 STLAAKYVESYVRDIKSMNVSAQRYYRAADDRAGVLE------------YIQYTVNAKTT 117 Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFG----PG 188 ++ F P + + +D +D++ V D S SM + +G Sbjct: 118 HDSWFASSFIPSFDEQQDLAGRSLARKYPAYLADNNIDIVFVSDFSGSMREQWGFNRHIK 177 Query: 189 MDKLGVATRSIREML-------------------DIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +D L A I + + + + R G V ++ + + Sbjct: 178 IDDLKTAISQISNNILCTSTRQEYVDGEWKDVCDEPGEDTTSDKLLNRVGFVPYNVRTRE 237 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI 266 W + ++N G T +P Y YN I Sbjct: 238 IIG--WNQANTTSQLNYTN-GYNTHLSP---YTYNDI 268 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 51/130 (39%), Gaps = 9/130 (6%) Query: 244 INRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSSPNID 301 IN + T + G+ + + + KK ++ L+DG+ S N Sbjct: 378 INAMWADGGTAAFQGILRGAQILHEGDPNSSDQEEQQAYNKKIKMLLILSDGQESPNNGI 437 Query: 302 NKESLFY--CNEAKRR--GAIVYAIGVQAEAADQF-LKNCASP--DRFYSVQNSRKLHDA 354 K + + C++A++ G + IG+ A+ Q ++C + V N +L + Sbjct: 438 LKGLVDWGMCDKARQEIPGLYIGVIGIDFRASQQSGFQDCVVDPREDIIDVSNLDELIEK 497 Query: 355 FLRIGKEMVK 364 + ++ K Sbjct: 498 IEELIRKGSK 507 >gi|14248629|gb|AAK57600.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248679|gb|AAK57625.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248681|gb|AAK57626.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248683|gb|AAK57627.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248685|gb|AAK57628.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248687|gb|AAK57629.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248691|gb|AAK57631.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|304441881|gb|ADM34176.1| martilin [Aplysia californica] Length = 348 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 68/207 (32%), Gaps = 42/207 (20%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 +++ +VLD S+S++ +++ L + + ++ Sbjct: 3 EQEPIELGIVLDSSVSIDRK------DFKKGKEFLQDFLQQFEIGGGKDELI-------- 48 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY-AYNKIFDAKEKLEHIAKGHDDY 283 + P G T + G+EY A ++ Sbjct: 49 -AAIGNIPHRLGRY--------------TSTGEGIEYMATAQLAS--------QFTRSWA 85 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFY 343 ++ + +TDG + ++ ++ +A+ G ++AIGV + + R + Sbjct: 86 ERVGLVITDG----NSQESAKTKEAARQARESGITMFAIGVGNVKDQELVNIAGDASRVF 141 Query: 344 SVQNSRKLHDAFLRIGKEMVKQRILYN 370 V + +L + + + +++ Sbjct: 142 KVDSYDELENIKQTLAHQTCIRQLKTT 168 >gi|299148843|ref|ZP_07041905.1| von Willebrand factor type A domain protein [Bacteroides sp. 3_1_23] gi|298513604|gb|EFI37491.1| von Willebrand factor type A domain protein [Bacteroides sp. 3_1_23] Length = 616 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 73/195 (37%), Gaps = 21/195 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSV-KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++ PW N+ H + I +I + + +++ ++DVS SM G ++ Sbjct: 214 VKITMEAGACPW--NADHRLVRIGLKAREIPTDNLPASNLVFLIDVSGSM-----WGANR 266 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + S++ +++ ++ V V +G S+ + Q I+E I+ L G Sbjct: 267 LDLVKSSLKLLVNNLRDKDKVAIVTYAG----SAGVKLEATPGSDKQKIREAIDELTAGG 322 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 +T G+ AY + II +DG+ + + + Sbjct: 323 STAGGAGILLAYKIAKKNFISNGNNR---------IILCSDGDFNVGVSSAEGLEQLIEK 373 Query: 312 AKRRGAIVYAIGVQA 326 ++ G + +G Sbjct: 374 ERKSGVFLTVLGYGM 388 >gi|303278902|ref|XP_003058744.1| predicted protein [Micromonas pusilla CCMP1545] gi|226459904|gb|EEH57199.1| predicted protein [Micromonas pusilla CCMP1545] Length = 654 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 48/135 (35%), Gaps = 14/135 (10%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML-DIIKSIPDVNNVVRS 218 K S+ L + +V D S SM M+++ A + ++ + V VV Sbjct: 418 KRMSRRAGSLTIFLV-DASGSMA------MNRMAAAKGAALRLISESYTKRDSVALVVAR 470 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G P + V + ++ L G T GL A +A++ Sbjct: 471 G-----DAAAVALPPSRSVVLARRRLAELPCGGGTPLAHGLVTAARVAINAEKTGRSGGG 525 Query: 279 GHDDYKKYIIFLTDG 293 G + ++ LTDG Sbjct: 526 GGASRVR-VVCLTDG 539 >gi|123446482|ref|XP_001311991.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121893822|gb|EAX99061.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 722 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 69/208 (33%), Gaps = 46/208 (22%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 + +S+ ++D S SM+ ++ A + +R + + Sbjct: 236 FEGPIDSNSE------FYFIVDCSGSMS------CSRINNAIKCMRLFIQSLPV------ 277 Query: 215 VVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDA 269 R ++ F S P V + ++ + T L++ Sbjct: 278 GCRFSILRFGSHFETVLPPCDYTDENVANAMNLLDNISANMGGTNILAPLQH-------- 329 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIGVQAE 327 + + + K I FLTDGE S C A + + +++IG+ + Sbjct: 330 ----VSDLQASEGFVKQIFFLTDGE-------VDNSDIICATALKNRSTNRIFSIGLGSG 378 Query: 328 AADQFLKNCA--SPDRFYSVQNSRKLHD 353 A +K A S + + ++ +++ Sbjct: 379 ADPGLIKGMARKSGGNYAIIGDNDNMNE 406 >gi|170742065|ref|YP_001770720.1| hypothetical protein M446_3920 [Methylobacterium sp. 4-46] gi|168196339|gb|ACA18286.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 418 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 16/44 (36%), Positives = 25/44 (56%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLH 49 +R+FF + G I++L ++L PV V L I+ S VK +L Sbjct: 1 MRHFFRDRAGQITVLASLLSPVGLGVAALAIDLSTLQMVKQRLK 44 >gi|17533681|ref|NP_496741.1| C-type LECtin family member (clec-62) [Caenorhabditis elegans] gi|3876681|emb|CAB03055.1| C. elegans protein F35C5.5a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 389 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 47/147 (31%), Gaps = 18/147 (12%) Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKS-----TPGLEYAYNK 265 R GLVT++++ L + + +++ + S G+ A Sbjct: 75 KRTTRVGLVTYNTEATIQADLNRFQSPDDLFSTVFQILPNDLSTSEDVFLAKGIGAAEQL 134 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + ++ +YK+ +I N D + K G + + Sbjct: 135 LAAGRKNNTRK-----NYKQMVIVYASAYNDEGEEDPRP---IAERLKASGVSIATVAFD 186 Query: 326 AEAADQFLK---NCASPDRFYSVQNSR 349 ++ +K A+P ++ ++ Sbjct: 187 QTGDEEMIKLIGEIATPGFNFTNEDEN 213 >gi|330688339|gb|AEC32940.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 483 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 49/151 (32%), Gaps = 24/151 (15%) Query: 192 LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 L S+ D I ++ ++R G I + L+ + E Sbjct: 2 LNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSIDKRQALS----KVTELRKS 54 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 TT T L D +K + + + +I +TDG +S +L Sbjct: 55 YSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVILMTDGVPNS----KYRAL 102 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + K+R + IG+ QF + A Sbjct: 103 EVAKKLKQRNVTLAVIGIGQGINHQFNRLIA 133 >gi|324995677|gb|EGC27589.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK678] Length = 450 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGTIDIVRANIN----------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVFGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLA-ALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|218660616|ref|ZP_03516546.1| hypothetical protein RetlI_13974 [Rhizobium etli IE4771] Length = 251 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 43/112 (38%), Gaps = 2/112 (1%) Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG 250 + VA ++RE+LD+I + ++ GL + + + + + ++ +G Sbjct: 39 RADVAGDAVREVLDMIDEADSNHERIKVGLYSLGDTLTEVLTPTLNTETARNRLADASYG 98 Query: 251 STTKSTPGLEYAYNKIFDAKEK--LEHIAKGHDDYKKYIIFLTDGENSSPNI 300 T+ ++ Y + K+K K ++ LTDG S Sbjct: 99 LTSATSKAATYFDVSLATLKQKVGAGGDGTSSGSPLKLVLLLTDGVQSQREW 150 >gi|17533679|ref|NP_496742.1| C-type LECtin family member (clec-62) [Caenorhabditis elegans] gi|3876680|emb|CAB03054.1| C. elegans protein F35C5.5b, confirmed by transcript evidence [Caenorhabditis elegans] Length = 387 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 47/147 (31%), Gaps = 18/147 (12%) Query: 213 NNVVRSGLVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKS-----TPGLEYAYNK 265 R GLVT++++ L + + +++ + S G+ A Sbjct: 75 KRTTRVGLVTYNTEATIQADLNRFQSPDDLFSTVFQILPNDLSTSEDVFLAKGIGAAEQL 134 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + ++ +YK+ +I N D + K G + + Sbjct: 135 LAAGRKNNTRK-----NYKQMVIVYASAYNDEGEEDPRP---IAERLKASGVSIATVAFD 186 Query: 326 AEAADQFLK---NCASPDRFYSVQNSR 349 ++ +K A+P ++ ++ Sbjct: 187 QTGDEEMIKLIGEIATPGFNFTNEDEN 213 >gi|300917042|ref|ZP_07133736.1| von Willebrand factor type A domain protein [Escherichia coli MS 115-1] gi|300415690|gb|EFJ99000.1| von Willebrand factor type A domain protein [Escherichia coli MS 115-1] Length = 378 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPSKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|218442319|ref|YP_002380645.1| hypothetical protein PCC7424_5589 [Cyanothece sp. PCC 7424] gi|218175423|gb|ACK74152.1| hypothetical protein PCC7424_5589 [Cyanothece sp. PCC 7424] Length = 143 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 18/95 (18%), Positives = 34/95 (35%), Gaps = 12/95 (12%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + L++ +V+D S SM+ KL A +S I + +V F Sbjct: 41 DTKKALNLCLVIDRSGSMSGE------KLETAKKSC------IDIFKQLGEKDLLTVVVF 88 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + + +KIN++ +T + G Sbjct: 89 DDEAEVIVNPQVPKAEVIKKINQINDRGSTNLSLG 123 >gi|118790455|ref|XP_318596.3| AGAP009579-PA [Anopheles gambiae str. PEST] gi|116117953|gb|EAA14572.4| AGAP009579-PA [Anopheles gambiae str. PEST] Length = 1164 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 59/156 (37%), Gaps = 13/156 (8%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVA----TRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D++++LD S SM + + +L V T S + ++I K DV +V F Sbjct: 224 DIVILLDNSGSMTG-YRNYIAQLTVKSILDTFSNNDFINIYKYSNDVEPLV----PCFKD 278 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 +VQ P ++ E + L+ A+ + +E + + Sbjct: 279 MLVQATPE--NMRFFNEYVRELLPEGYANVKKAFVAAFELLQKYREIR-RCNESVSGCNQ 335 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 I+ +TDG S+ + E+ + + V+ Sbjct: 336 AIMLITDGVPSNI-TEVFEAYNWFENGTKIPVRVFT 370 >gi|40062742|gb|AAR37636.1| CbbO protein/von Willebrand factor type A domain protein [uncultured marine bacterium 439] Length = 787 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 71/193 (36%), Gaps = 14/193 (7%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + + +++D+S S ND + + +L +N+ SG + Sbjct: 591 QHTRDVAVNLLMDLSESTNDMVVGSEHSILELMQEATSLL-SWAINKIGDNLTISGFASD 649 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S VQ + +++ + G + + A Sbjct: 650 SRHDVQYYRFKPFHHSFNDEVKARLAGIKGGLSTRMGTAIRHAGVD-------LLTQSSA 702 Query: 284 KKYIIFLTDGENSSPNIDNKE-----SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 KK ++ LTDGE + ++D+ + + E + RG + + I + AD+++ Sbjct: 703 KKILLVLTDGEPADIDVDDPQHLRMDAKKAVEELRSRGIVTFCISLD-PYADEYVARIFG 761 Query: 339 PDRFYSVQNSRKL 351 +RF + + +KL Sbjct: 762 KNRFMVIDDIKKL 774 >gi|147901239|ref|NP_001088543.1| integrator complex subunit 6-B [Xenopus laevis] gi|82180065|sp|Q5U4W6|INT6B_XENLA RecName: Full=Integrator complex subunit 6-B; Short=Int6-B gi|54647561|gb|AAH84927.1| LOC495417 protein [Xenopus laevis] Length = 883 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LD S SMN G L +A ++ + ++S + R LVT Sbjct: 4 LLFLLDTSASMNQRSHLGTTYLDIAKGAVETFM-KLRSRDPASRGDRYMLVTLEELPYG- 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAVGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 I+ +TDG Sbjct: 122 PSIIVAITDG 131 >gi|326432703|gb|EGD78273.1| hypothetical protein PTSG_09337 [Salpingoeca sp. ATCC 50818] Length = 218 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 64/162 (39%), Gaps = 22/162 (13%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 +L S K +K+ ++++ D S SM+D + DI ++ + Sbjct: 42 MLSGSKAKRHAKTQANTRVLVIADNSYSMSDSMNALNSGI----------CDIYRTCCEA 91 Query: 213 NNVVRSG---LVTFSSKIVQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 G LV F+ K+ + L G+ + +KI T ++ +I Sbjct: 92 K----LGAFHLVYFNDKLEE-MDLTEAEGIDAVCDKIMHAGPNGCTDFDIVVDRLAREID 146 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 A E AK + ++ Y++ +TDG S P+ D ++L Sbjct: 147 AAME--VSTAKPDEQHQLYLVVMTDGVASMPSDDRFQALSSA 186 >gi|318059204|ref|ZP_07977927.1| hypothetical protein SSA3_14746 [Streptomyces sp. SA3_actG] gi|318081092|ref|ZP_07988424.1| hypothetical protein SSA3_31430 [Streptomyces sp. SA3_actF] Length = 184 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 44/123 (35%), Gaps = 16/123 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M +VLD S SM ++ + +++ E + + + D VVR LV FS+ I Sbjct: 1 MYLVLDRSGSMRPYY------KDGSAQALGEQVLALAAHLDTEAVVR--LVFFSTAIDAM 52 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L ++ L G+ A E AK ++F Sbjct: 53 GTLTLDAYE--GVVDGLHEGAGRMGRTNYALAIE------EVRALHAKEAAGEPGLVVFQ 104 Query: 291 TDG 293 TDG Sbjct: 105 TDG 107 >gi|313219556|emb|CBY30479.1| unnamed protein product [Oikopleura dioica] Length = 359 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 82/223 (36%), Gaps = 39/223 (17%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 M + F A +TS D +D++ ++D S S+ G + Sbjct: 1 MQLLRSFFLLAA-------ALTSPATADCPPDAKMDLVFLVDTSSSIR---KAGHKAIES 50 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIF--G 250 I +++D PD +S V F L + + ++ ++ + F G Sbjct: 51 IRSFIYKVVDGFTMGPD-----------HTSDPVINFVLNEHYNQEGVKMAVDTIDFESG 99 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 T++ + + I + +D K I +TDG +S + E+ Sbjct: 100 KGTETGKAMNFMAQMIDMGFGQR-------NDSKVVAIVITDGRSSEKHDFVAEASK--- 149 Query: 311 EAKRRGAIVYAIGVQAEAADQF---LKNCAS-PDRFYSVQNSR 349 K+ IV A+GV + ++ +K AS PD Y+++ Sbjct: 150 NLKKVVDIVIAVGVNIKKENELSREIKTIASEPDEHYAIEAES 192 >gi|119952970|ref|YP_945179.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] gi|119861741|gb|AAX17509.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] Length = 329 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 64/210 (30%), Gaps = 23/210 (10%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSV---KISSKSDIGLDMMMVLDVSLSM-NDHF 185 + Y + I F + + + I K + L + V+D+S SM Sbjct: 46 IKNYYIKKILMMFFFVLSLGSLIVSILDISWGQKATEDKRSNLRISFVVDISRSMLTFDE 105 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN 245 +++L A I +L+ ++ V L F K + P + + + +N Sbjct: 106 EKSINRLESAKNFISLILNNFEN-------VEYSLTIFKGKSLLVLPFSKDKASLHKILN 158 Query: 246 RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + + L + + ++I LTDG+ N Sbjct: 159 YIEPNLISSPGSFLGEGVFSAIQGIKDDSYYN--------FLIILTDGDEWGENNYYTFP 210 Query: 306 LFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + +G+ ++ + N Sbjct: 211 KLI----DALNVTSFVVGIGSDKPSPLIDN 236 >gi|260427430|ref|ZP_05781409.1| tetratricopeptide TPR_2 repeat protein [Citreicella sp. SE45] gi|260421922|gb|EEX15173.1| tetratricopeptide TPR_2 repeat protein [Citreicella sp. SE45] Length = 510 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 67/207 (32%), Gaps = 37/207 (17%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + + + P +++S + +VL+V+ SM +L + Sbjct: 70 VALILVLASIGAAGPTWSRQPDPFAAQSAPAV---VVLEVTDSMEASDVAP-TRLERGKQ 125 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFG----STT 253 IR+ LD + R+ LV ++ P+ Q + + L T Sbjct: 126 KIRDFLD-------LRAGARTALVAYAGTAHAVLPMTEDPQVMVPYLEGLSPQVMPREGT 178 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC---- 309 ++ L+ A + + I+F++DG +++ S Sbjct: 179 RAAEALQLAQSLLASEASTGG------------ILFVSDGIDAADVAALNASPQPVAVLG 226 Query: 310 ---NEAKRRGA---IVYAIGVQAEAAD 330 + RG V + V A+A D Sbjct: 227 MLPEGTRDRGIDALSVPVVTVTADAGD 253 >gi|229095272|ref|ZP_04226264.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-29] gi|228688131|gb|EEL42017.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-29] Length = 610 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 406 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 453 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 454 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 513 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 514 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 573 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 574 QLMKNIY--NHHFLVANHAE 591 >gi|119512000|ref|ZP_01631096.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119463351|gb|EAW44292.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 202 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 48/134 (35%), Gaps = 19/134 (14%) Query: 168 GLDMMMVLDVSLSMNDHFG-PGMDK---LGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G +++D S SM PG + + T ++ D + PD NV F Sbjct: 6 GRQYTLIIDKSGSMESPGDLPGKTRWETMQENTLALARKCDQLD--PDGINV-----YLF 58 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S + + +Q+ +T L+ A++ F +E+ G Sbjct: 59 SGRHKFYSEVT--ASKVQQIFQENHPAGSTNLGGVLQAAFDDFFKRREQ------GLTPN 110 Query: 284 KKYIIFLTDGENSS 297 + II +TDGE Sbjct: 111 GETIIVVTDGEPDD 124 >gi|186470651|ref|YP_001861969.1| putative transmembrane protein [Burkholderia phymatum STM815] gi|184196960|gb|ACC74923.1| putative transmembrane protein [Burkholderia phymatum STM815] Length = 372 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 59/165 (35%), Gaps = 17/165 (10%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 +GS++++TA+ + + + L I+ + + +L D + L A + + Sbjct: 6 KQKQRGSVAVITAVSMVSLLGLAALAIDIGNLLVSRNELQNAADAAALAGAPCLYQRAQC 65 Query: 70 NNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA 129 N + D++ K + FA + + S+I Q + Sbjct: 66 GNTTATEPDWTTATQK------------ASSFATASTS--NTVQGSVIKVAQTGSGYWNI 111 Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMV 174 P T P+ ++ P + + K ++ ++ + + + Sbjct: 112 TGS---PGTLETVPFTPGTNDLPAIQVTITKSAANANGSVPVYLA 153 >gi|2159|emb|CAA78125.1| VLA-2 [Sus scrofa] Length = 191 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 60/186 (32%), Gaps = 25/186 (13%) Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + + + + P GL+ +++ + + + K + S Sbjct: 2 DAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPRVV----FNLNTFKTKAEMVEATSH 54 Query: 253 TKSTPG-LEYAYNKIFDAKEKLEHIAKGHDD-YKKYIIFLTDGENSSPNIDNKESLFYCN 310 T G L + I A++ A G K ++ +TDGE + D + Sbjct: 55 TTQYGGDLTNTFKAIQYARDSAYSAAAGGRPGATKVMVVVTDGE----SHDGSMLKAVID 110 Query: 311 EAKRRGAIVYAIGV------QAEAADQFLK------NCASPDRFYSVQNSRKLHDAFLRI 358 + + + I V A +K + + F++V + L + + Sbjct: 111 QCNNDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEADLLEKAGTL 170 Query: 359 GKEMVK 364 G+++ Sbjct: 171 GEQIFS 176 >gi|17548846|ref|NP_522186.1| chelatase protein [Ralstonia solanacearum GMI1000] gi|17431095|emb|CAD17776.1| probable magnesium chelatase (subunits dI protein [Ralstonia solanacearum GMI1000] Length = 637 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 48/151 (31%), Gaps = 26/151 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQ 229 +++V D S SM ++ + S+ +L + L+ F + Sbjct: 458 ILLVADASGSMA-----ARRRMEMVKASVLGLLQ-----DAYQRRDQVALICFRGEQAEL 507 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P V+ + + L G T L+ A + A D ++ Sbjct: 508 VLPPTRQVELAERALAALPTGGRTPLAHALQLAAQTLAQA-----------CDLTPLLVV 556 Query: 290 LTDGENS----SPNIDNKESLFYCNEAKRRG 316 ++DG + + +E+L RG Sbjct: 557 ISDGRANIALDAGQDPWRETLALAEHLAARG 587 >gi|229825155|ref|ZP_04451224.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] gi|229790527|gb|EEP26641.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] Length = 1109 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 64/168 (38%), Gaps = 21/168 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 LD+++V+D S D+ ++ A +++D + NN +R GLV+F+ Sbjct: 83 EQKNLDVVLVVDRSY---DNMR-KNGRMAAAKAEAAKLVDYF--LQSGNNKIRVGLVSFA 136 Query: 225 SKIVQTFPLAWGVQHIQEKINRLI------FGSTTKSTPGLEYAYNK--IFDAKEKLEHI 276 + GV + + + L + ++P L A+ + + A E Sbjct: 137 GNNGGSPSPVLGVTQLTQDADELKNAIRGYNTAGWNNSPVLREAFTQAGLIKANEMFGAS 196 Query: 277 AKGHDDYKKYIIFLTDGENS---SPNIDNKESLFYCNEAKRRGAIVYA 321 KK I+ ++ G + +D + + + G +++ Sbjct: 197 NTN----KKIIVLISGGAPTISYGLTLDFNQREEALSNTPKEGYEIWS 240 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 12/44 (27%), Positives = 25/44 (56%), Gaps = 2/44 (4%) Query: 302 NKESLFYCNEAK-RRGAIVYAIGVQAEA-ADQFLKNCASPDRFY 343 ++ N+ K G ++++G++A+ A + LK AS +R+Y Sbjct: 286 KTNTIAEANKIKTDSGVEIFSVGIKADGDAAEVLKKIASDNRYY 329 >gi|156397430|ref|XP_001637894.1| predicted protein [Nematostella vectensis] gi|156225010|gb|EDO45831.1| predicted protein [Nematostella vectensis] Length = 757 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 63/215 (29%), Gaps = 32/215 (14%) Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAP-L 153 LR G+ NI S+ + + + P + Sbjct: 31 YLRLKGYGCREENISNPQSIFSETKNDPVGATIQVTPQ------------RVTLKLRPGI 78 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI---- 209 V I + +D+ ++D+S SM + G S++ + K Sbjct: 79 PSILPVTIRPAENYPVDLYYLMDMSWSMENDLDNLKKLAGKIAESMKNITKNFKLGFGSF 138 Query: 210 --PDVNNVVRSGLVTFSSKIVQTF---------PLAWGVQHIQEKINRLIFGSTTKSTPG 258 V+ VR+ L S + PL +EK+N G Sbjct: 139 VDKTVSPYVRTELTKPCSDTSECVASYGFKHILPLVRNDTMFKEKVNEQKISGNLDEPEG 198 Query: 259 LEYAYNKIFD-AKEKLEHIAKGHDDYKKYIIFLTD 292 ++ + A + + ++ ++F+TD Sbjct: 199 ---GFDALMQVAVCDQQIGWSVNGTSRRLVVFVTD 230 >gi|1813523|gb|AAB63302.1| PbTRAP [Plasmodium berghei] Length = 606 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 74/213 (34%), Gaps = 34/213 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDH---------FG 186 + F C + I +K S + + +D+ ++LD S S+ Sbjct: 9 YFFVVLLLCISVFLNGQEILDEIKYSEEVCNEQIDLHILLDGSGSIGHSNWISHVIPMLT 68 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 +D L ++ I + + + +VR +SK F I + N Sbjct: 69 TLVDNLNISRDEINISMTLFSTYA--RELVRLKRYGSTSKASLRF-------IIAQLQNN 119 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 TT T L + + + + + + +I LTDG + + K+S Sbjct: 120 YSPHGTTNLTSAL------LNVDNLIQKKMNRPNAI--QLVIILTDGIPN----NLKKST 167 Query: 307 FYCNEAKRRGAIVYAIGVQAEAA---DQFLKNC 336 N+ K++ V IGV A ++ L C Sbjct: 168 TVVNQLKKKDVNVAIIGVGAGVNNMFNRILVGC 200 >gi|68070931|ref|XP_677379.1| sporozoite surface protein 2 [Plasmodium berghei strain ANKA] gi|56497475|emb|CAH99602.1| sporozoite surface protein 2, putative [Plasmodium berghei] Length = 421 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 74/213 (34%), Gaps = 34/213 (15%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDH---------FG 186 + F C + I +K S + + +D+ ++LD S S+ Sbjct: 9 YFFVVLLLCISVFLNGQEILDEIKYSEEVCNEQIDLHILLDGSGSIGHSNWISHVIPMLT 68 Query: 187 PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 +D L ++ I + + + +VR +SK F I + N Sbjct: 69 TLVDNLNISRDEINISMTLFSTYA--RELVRLKRYGSTSKASLRF-------IIAQLQNN 119 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 TT T L + + + + + + +I LTDG + + K+S Sbjct: 120 YSPHGTTNLTSAL------LNVDNLIQKKMNRPNAI--QLVIILTDGIPN----NLKKST 167 Query: 307 FYCNEAKRRGAIVYAIGVQAEAA---DQFLKNC 336 N+ K++ V IGV A ++ L C Sbjct: 168 TVVNQLKKKDVNVAIIGVGAGVNNMFNRILVGC 200 >gi|257415126|ref|ZP_05592120.1| predicted protein [Enterococcus faecalis AR01/DG] gi|257156954|gb|EEU86914.1| predicted protein [Enterococcus faecalis ARO1/DG] Length = 711 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 30/187 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ +D+++V+D S SM + L + E+ D + + VR G+V + Sbjct: 139 QTESPIDLVLVIDYSSSMKGE--KLNNALKGLQQFGEELSDSLT-----DGHVRIGIVAY 191 Query: 224 SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + T + + +++ + N S T GL + + Sbjct: 192 NRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQKGLLEGQRLLAEKSRPNA-------- 243 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV---------YAIGVQAEAADQFL 333 KK ++ + DG ++ + + + Y N G I+ Y Q E+ + Sbjct: 244 -KKMLVHIGDGSANASFLPRENAQSYPN----NGEIIDYNGYHTSSYMEEFQTESNQYYT 298 Query: 334 KNCASPD 340 N AS D Sbjct: 299 SNSASTD 305 >gi|241205402|ref|YP_002976498.1| hypothetical protein Rleg_2696 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240859292|gb|ACS56959.1| protein of unknown function DUF1194 [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 258 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 69/194 (35%), Gaps = 18/194 (9%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVL--DVSLSMNDHFGPGMDKLGVATRSIREM 202 + + ++ V I+ +D+ +VL D S SM+ + V +E Sbjct: 1 MLTTLAVLMGLSGLVPIAQAGGSEVDVTLVLAVDTSRSMDFEEIGIQREGYVEALKHKEF 60 Query: 203 LDIIKSIPDVNNVVRSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +D +K +++ + +VQ + W Q I+ + + + F ++ P Sbjct: 61 IDAVKDGLTGRI-----AISYFEWAGYVVQDSVIDW--QVIETEEDAIAFADKLEARPIA 113 Query: 260 EYAYNKIFDAKEKLEH--IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 I A + ++ ++ I DG N+S N ++A G Sbjct: 114 TQRRTSISTAIAQGASMIVSSPFQSRRQVIDVSGDGPNNSGNPVTPAR----DKAVEAGM 169 Query: 318 IVYAIGVQAEAADQ 331 I+ + + +D Sbjct: 170 IINGLAIMLRPSDA 183 >gi|219849983|ref|YP_002464416.1| magnesium chelatase ATPase subunit D [Chloroflexus aggregans DSM 9485] gi|219544242|gb|ACL25980.1| magnesium chelatase ATPase subunit D [Chloroflexus aggregans DSM 9485] Length = 636 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 46/135 (34%), Gaps = 20/135 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G ++D S SM H ++ A ++ +L D + L+ F Sbjct: 439 SKAGTLFCFLVDASGSMALH------RMRQAKGAVNSLLQQAYVHRD-----QVALLAFR 487 Query: 225 S-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + V+ + ++ L G T L AY A+ + Sbjct: 488 GERADLLLPPSQSVELAKRALDVLPTGGGTPLAAALLAAYQISEQARARGIFRTT----- 542 Query: 284 KKYIIFLTDGENSSP 298 I+ +TDG + P Sbjct: 543 ---IVLITDGRPNVP 554 >gi|301021143|ref|ZP_07185182.1| von Willebrand factor type A domain protein [Escherichia coli MS 69-1] gi|300398262|gb|EFJ81800.1| von Willebrand factor type A domain protein [Escherichia coli MS 69-1] Length = 378 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQTAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|126282074|ref|XP_001368568.1| PREDICTED: similar to Coch-5B2 gene product [Monodelphis domestica] Length = 549 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 61/173 (35%), Gaps = 22/173 (12%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +++ ++D S S+ + M + I E+ DI I V T+ + Sbjct: 364 SVNIAFLIDGSSSVGESNFRLMLEFVSNIAKIFEISDIGAKIAAVQ-------FTYDQRP 416 Query: 228 VQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 +F +++ I + T + + + +F + K + Sbjct: 417 EFSFTDYTTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRDGPN--------KNF 468 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 ++ +TDG+ + D+ + A + G VY+IGV D + P Sbjct: 469 LVIVTDGQ----SYDDVRAPAA--AAHKAGITVYSIGVAWAPLDDLKDMASEP 515 >gi|282858822|ref|ZP_06267967.1| conserved hypothetical protein [Prevotella bivia JCVIHMP010] gi|282588391|gb|EFB93551.1| conserved hypothetical protein [Prevotella bivia JCVIHMP010] Length = 290 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 45/123 (36%), Gaps = 6/123 (4%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDF------GTREQMKRDRVTEIAATLAFSAIQNNDKVGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S +I + + G +HI I ++ ++ A + ++ D Y Sbjct: 126 SDQIEKYIAPSKGRKHILYCIREMLTFKPESKRTDIKIAIEYLTKVLKRRCTAFLISDFY 185 Query: 284 KKY 286 K Sbjct: 186 TKI 188 >gi|149067641|gb|EDM17193.1| rCG39970 [Rattus norvegicus] Length = 1163 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 78/217 (35%), Gaps = 28/217 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + +D+ ++D S S+N ++ ++ S + +++ Sbjct: 139 VPASMPECPRQEMDIAFLIDGSGSIN------QRDFAQMKDFVKALMGEFASTSTLFSLM 192 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + F Q + + I +L T + G+ ++F +K Sbjct: 193 QYSNILKTHFTFTEFKNILDPQSLVDPIVQLQ--GLTYTATGIRTVVEELFHSKNGSRKS 250 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQF 332 AK K ++ +TDG+ ++ + + ++A G I YAIG Q A + Sbjct: 251 AK------KILLVITDGQKYRDPLEYSDVIPAADKA---GIIRYAIGVGDAFQEPTALKE 301 Query: 333 LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 L S D + V N A I +++ ++ Sbjct: 302 LNTIGSAPPQDHVFKVGN----FAALRSIQRQLQEKI 334 >gi|88603963|ref|YP_504141.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] gi|88189425|gb|ABD42422.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] Length = 233 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 74/197 (37%), Gaps = 16/197 (8%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++VLD S SM+ + K+ +R + D +K + ++TF + Sbjct: 23 VLVLDTSASMSGN------KIAELNEGLRILTDELKEDDLAVKRIDLAVITFGKGVELVR 76 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 P G+ L G T + A + + K E+ G D Y+ +I +T Sbjct: 77 PFT-GISAFDP--PELSAGGYTPMGQAILEAVRLV--EERKAEYRTIGTDYYRPWIFLIT 131 Query: 292 DGENSSPNIDNKESLFYCNEA----KRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 DG+ + ++ + + +A+GV +A L+ + P R + Sbjct: 132 DGQPTDMRKGDEIWEKVIEAVHGGERDHKFLFWALGVD-QANMTVLREISPPGRTPLMLK 190 Query: 348 SRKLHDAFLRIGKEMVK 364 K + FL + K + + Sbjct: 191 EAKWAEMFLWLSKSLSQ 207 >gi|320667910|gb|EFX34813.1| hypothetical protein ECOSU61_19691 [Escherichia coli O157:H7 str. LSU-61] Length = 378 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 64/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D+S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDLSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|320663159|gb|EFX30468.1| hypothetical protein ECO5905_22618 [Escherichia coli O55:H7 str. USDA 5905] Length = 378 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 64/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D+S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDLSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|317056550|ref|YP_004105017.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315448819|gb|ADU22383.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 279 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 51/135 (37%), Gaps = 21/135 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLDVS SM F GM ++ +L + + D + + TF + + Sbjct: 38 VAVVLDVSGSMTKAFQSGM-----VQATLERLLPLAMAFDDDGS---MEVWTFDHEFKRY 89 Query: 231 FPLAWGVQHIQEKI--NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P+ + + I N++ T Y + + Y+I Sbjct: 90 PPIT--RTNFYDYIKDNKISARGGT--------MYAPVLRDVGSYFVQEEPAKIPT-YVI 138 Query: 289 FLTDGENSSPNIDNK 303 F+TDG+N+ + +K Sbjct: 139 FITDGDNADESDTDK 153 >gi|302754782|ref|XP_002960815.1| hypothetical protein SELMODRAFT_402199 [Selaginella moellendorffii] gi|300171754|gb|EFJ38354.1| hypothetical protein SELMODRAFT_402199 [Selaginella moellendorffii] Length = 878 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 64/203 (31%), Gaps = 21/203 (10%) Query: 170 DMMMVLDVSLSMNDHFG--PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG--LVTFSS 225 + ++LD S SM++ G + VA+ I+++L+ + + V G + S Sbjct: 591 SLYILLDTSTSMSNPTGVLSSQSRFNVASNIIKQLLNTLTNGDQVAVSTIGGEKIGAPVS 650 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ + + I + + S T S I + + H + Sbjct: 651 VVLDVQETSLNLTGISSLKDSISNTSVTNSAS-------NIKNGLQGALHFFNTSSNLN- 702 Query: 286 YIIFLTDGE---NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--- 339 II TDG+ + N + A+ VY IG + S Sbjct: 703 VIILFTDGQFVTPGNFNFTQLSPVLAQLNARNVVVFVYRIGSFTSNDSAAFQQMQSSLNM 762 Query: 340 DRFYSVQNSRKLHDA---FLRIG 359 + + L F I Sbjct: 763 SYEFISDDKNPLLKIHSYFDYIA 785 >gi|295675528|ref|YP_003604052.1| von Willebrand factor type A [Burkholderia sp. CCGE1002] gi|295435371|gb|ADG14541.1| von Willebrand factor type A [Burkholderia sp. CCGE1002] Length = 225 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 56/164 (34%), Gaps = 14/164 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++VLD S SM G +++L + D + + V + +V+F Sbjct: 28 CILVLDRSGSMG---GQPIEQLNA---GLVTFKDELSADSLAMKRVDTAIVSF-GPATLE 80 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P L T + A + + K E+ A G Y+ ++ + Sbjct: 81 MPFHTAPNFFPPT---LTAQGDTPMGSAINLALDTL--EARKAEYKANGISYYRPWVFLI 135 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 TDG + + +K+ +A+GVQ D + Sbjct: 136 TDGGPTDAWQQAAARVREGEASKK--FAFFAVGVQGANMDTLAQ 177 >gi|291283369|ref|YP_003500187.1| hypothetical protein G2583_2658 [Escherichia coli O55:H7 str. CB9615] gi|209766534|gb|ACI81579.1| hypothetical protein ECs2930 [Escherichia coli] gi|226237475|dbj|BAH46997.1| conserved protein [Escherichia coli O55:H7] gi|290763242|gb|ADD57203.1| hypothetical protein G2583_2658 [Escherichia coli O55:H7 str. CB9615] gi|320657553|gb|EFX25351.1| hypothetical protein ECO7815_05277 [Escherichia coli O55:H7 str. 3256-97 TW 07815] Length = 378 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 64/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D+S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDLSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|168750096|ref|ZP_02775118.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4113] gi|168756390|ref|ZP_02781397.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4401] gi|168770105|ref|ZP_02795112.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4486] gi|168775913|ref|ZP_02800920.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4196] gi|168783905|ref|ZP_02808912.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4076] gi|195937890|ref|ZP_03083272.1| hypothetical protein EscherichcoliO157_15866 [Escherichia coli O157:H7 str. EC4024] gi|208807119|ref|ZP_03249456.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4206] gi|208812301|ref|ZP_03253630.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4045] gi|208819003|ref|ZP_03259323.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4042] gi|209396016|ref|YP_002271402.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4115] gi|254793951|ref|YP_003078788.1| hypothetical protein ECSP_2926 [Escherichia coli O157:H7 str. TW14359] gi|187768638|gb|EDU32482.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4196] gi|188015615|gb|EDU53737.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4113] gi|188998833|gb|EDU67819.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4076] gi|189356440|gb|EDU74859.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4401] gi|189360921|gb|EDU79340.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4486] gi|208726920|gb|EDZ76521.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4206] gi|208733578|gb|EDZ82265.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4045] gi|208739126|gb|EDZ86808.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4042] gi|209157416|gb|ACI34849.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4115] gi|209766536|gb|ACI81580.1| hypothetical protein ECs2930 [Escherichia coli] gi|254593351|gb|ACT72712.1| conserved protein [Escherichia coli O157:H7 str. TW14359] Length = 378 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 64/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D+S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDLSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|163732407|ref|ZP_02139853.1| von Willebrand factor type A domain [Roseobacter litoralis Och 149] gi|161394705|gb|EDQ19028.1| von Willebrand factor type A domain [Roseobacter litoralis Och 149] Length = 748 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 63/180 (35%), Gaps = 36/180 (20%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGV----------ATRSIREMLDIIKSIPDVNNVVR 217 + + ++LD+S S D GPG L + + + L I + +R Sbjct: 557 SIAVHLLLDISRSTADPAGPGFTILDMERDAAAILACTMSQLGDDLAITAFSSSGRHDMR 616 Query: 218 SGLV-TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V TF + + + +A ++ L +T+ L YA Sbjct: 617 VVPVKTFDTALDEVTGMA---------LSGLRPEYSTRIGAALRYA-----------AGC 656 Query: 277 AKGHDDYKKYIIFLTDGE-----NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 +G+ ++K ++ +TDGE + P +++ + G + + E + Sbjct: 657 MEGYSRHRKIVLLVTDGEPSDIDMTDPEYLVEDARRAVQRIRSEGIDAICVALGNETGQR 716 >gi|154507989|ref|ZP_02043631.1| hypothetical protein ACTODO_00475 [Actinomyces odontolyticus ATCC 17982] gi|153797623|gb|EDN80043.1| hypothetical protein ACTODO_00475 [Actinomyces odontolyticus ATCC 17982] Length = 338 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 49/273 (17%), Positives = 80/273 (29%), Gaps = 53/273 (19%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 ++K + M F+ A S A + VK D+++ LD S Sbjct: 48 PKYKALVRRTRASLAMAFVCFLIAVIATSVSAGAPVDRYVK--HDKSASRDIVLCLDASG 105 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--- 236 SM P K+G A R I + R L + + + FPL Sbjct: 106 SML----PYDSKIGAAFREIISHFE----------GERISLQLWDAYSMTMFPLTDDYDM 151 Query: 237 VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD-----AKEKLEHIAK------------- 278 + + ++ I T+ L A ++FD E E + Sbjct: 152 ATDVLQDMSDTIDTGLTRIGGRLS-ATQELFDYLAPVMDENQEVSSIVGDGLASCVMGFD 210 Query: 279 -GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + I+ TD N AK +G V A+ +D L + A Sbjct: 211 HNDKQRSRTILLATD--NEVYGDGVYNLSEAIQFAKSQGVTVTAL---YPGSDITLSSEA 265 Query: 338 ---------SPDRFYSVQNSRKLHDAFLRIGKE 361 + FY + + +I E Sbjct: 266 LQLRDEVRKTGGDFYDASSPSSVDRVVKQIEAE 298 >gi|11875069|dbj|BAB19548.1| hypothetical protein [Escherichia coli O157:H7] Length = 306 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 64/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D+S SM D V ++ + +P +R+ LV F + +V Sbjct: 144 QLVLLVDLSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 187 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 188 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 232 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 233 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDMAQ 279 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 280 ALVNVGAQIAAMTPGELATWL 300 >gi|15802605|ref|NP_288632.1| hypothetical protein Z3294 [Escherichia coli O157:H7 EDL933] gi|15832184|ref|NP_310957.1| hypothetical protein ECs2930 [Escherichia coli O157:H7 str. Sakai] gi|168761872|ref|ZP_02786879.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4501] gi|168789093|ref|ZP_02814100.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC869] gi|168801647|ref|ZP_02826654.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC508] gi|217329612|ref|ZP_03445691.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. TW14588] gi|261224987|ref|ZP_05939268.1| hypothetical protein EscherichiacoliO157_10378 [Escherichia coli O157:H7 str. FRIK2000] gi|261257665|ref|ZP_05950198.1| hypothetical protein EscherichiacoliO157EcO_17847 [Escherichia coli O157:H7 str. FRIK966] gi|12516345|gb|AAG57187.1|AE005439_6 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933] gi|13362399|dbj|BAB36353.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai] gi|189367798|gb|EDU86214.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4501] gi|189371315|gb|EDU89731.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC869] gi|189376233|gb|EDU94649.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC508] gi|209766528|gb|ACI81576.1| hypothetical protein ECs2930 [Escherichia coli] gi|209766530|gb|ACI81577.1| hypothetical protein ECs2930 [Escherichia coli] gi|209766532|gb|ACI81578.1| hypothetical protein ECs2930 [Escherichia coli] gi|217317380|gb|EEC25809.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. TW14588] gi|320191714|gb|EFW66362.1| Mg-chelatase subunit ChlD [Escherichia coli O157:H7 str. EC1212] gi|320641186|gb|EFX10664.1| hypothetical protein ECO5101_07587 [Escherichia coli O157:H7 str. G5101] gi|320646574|gb|EFX15485.1| hypothetical protein ECO9389_10747 [Escherichia coli O157:H- str. 493-89] gi|320651829|gb|EFX20204.1| hypothetical protein ECO2687_07504 [Escherichia coli O157:H- str. H 2687] gi|326338476|gb|EGD62304.1| Mg-chelatase subunit ChlD [Escherichia coli O157:H7 str. 1044] gi|326347000|gb|EGD70733.1| Mg-chelatase subunit ChlD [Escherichia coli O157:H7 str. 1125] Length = 378 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 64/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D+S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDLSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|14248609|gb|AAK57590.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248611|gb|AAK57591.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|58429509|gb|AAW78158.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 61/183 (33%), Gaps = 31/183 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ +++D S S+ H ++ + +I+ + +N + + FS+ Sbjct: 47 VDLYLLMDCSGSIRRH--------NWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAK 98 Query: 229 QTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L I+ ++ + T T L + D Sbjct: 99 EIIRLHSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLND--------RINR 150 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCA 337 ++ + ++ LTDG S KES + G + G+ ++FL C Sbjct: 151 ENANQLVVILTDGIPDSIQDSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCH 206 Query: 338 SPD 340 D Sbjct: 207 PSD 209 >gi|58429493|gb|AAW78150.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 569 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 61/183 (33%), Gaps = 31/183 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ +++D S S+ H ++ + +I+ + +N + + FS+ Sbjct: 47 VDLYLLMDCSGSIRRH--------NWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAK 98 Query: 229 QTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L I+ ++ + T T L + D Sbjct: 99 EIIRLHSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLND--------RINR 150 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCA 337 ++ + ++ LTDG S KES + G + G+ ++FL C Sbjct: 151 ENANQLVVILTDGIPDSIQDSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCH 206 Query: 338 SPD 340 D Sbjct: 207 PSD 209 >gi|297565533|ref|YP_003684505.1| putative cytoplasmic protein [Meiothermus silvanus DSM 9946] gi|296849982|gb|ADH62997.1| putative cytoplasmic protein [Meiothermus silvanus DSM 9946] Length = 407 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 41/258 (15%), Positives = 83/258 (32%), Gaps = 45/258 (17%) Query: 91 DFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSH 150 R+ + G + R + + + K Y + + T A Sbjct: 142 SLRSLMGALGKNNPGAHATRHYASGVESSGETKPYEFGDQPNFN---VGETLKRVAVKGV 198 Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 L V ++ ++ +++LD S SM G D+ A R + +I++ Sbjct: 199 ENLEEGDLVVELAEYTAAMNTVVLLDCSHSM---ILYGEDRFTPAKRVALGLSHLIRTQY 255 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK 270 + VR G+ F + PL G + + T + G++ A + Sbjct: 256 PGDQ-VRFGV--FHDSAEE-VPL--GRLPTVQ-VGPYH----TNTAEGIKLARKMLKKMG 304 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGEN------------SSPNIDN---KESLFYCNEAKRR 315 ++ K II +TDG+ ++ +D E+L A++ Sbjct: 305 GEM-----------KQIIMITDGKPSALTLPSGQIYKNAWGLDPVILAETLKEATLARKE 353 Query: 316 GAIVYAIGVQAEAADQFL 333 G ++ + L Sbjct: 354 GIPIHT--FMLAREPELL 369 >gi|221193525|gb|ACM07795.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 68/211 (32%), Gaps = 50/211 (23%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEK 243 R LVT+ S I L + I ++ Sbjct: 274 DNRVALVTYGSDIFDGRSVDVVKGFKEDDKYYGLQTKFTIQTENYSHKQLTNNAEEIIKR 333 Query: 244 I----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---------KGHDDYKKYIIFL 290 I + +GSTT + + EK A + + + +K I+ + Sbjct: 334 IPTEAPKAKWGSTTNGLTPEQQKEYYLSKVGEKFTMKAFMEADDILSQVNRNSQKIIVHV 393 Query: 291 TDGENSSPNIDNKESLFYC-----NEAKRRG 316 TDG + N L + K+ G Sbjct: 394 TDGVPTRSYAINNFKLGASYESQFEQMKKNG 424 >gi|121608786|ref|YP_996593.1| von Willebrand factor, type A [Verminephrobacter eiseniae EF01-2] gi|121553426|gb|ABM57575.1| von Willebrand factor, type A [Verminephrobacter eiseniae EF01-2] Length = 326 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 69/217 (31%), Gaps = 34/217 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + G + +VLD S SM D F + L + + GL FS Sbjct: 77 TGTGAHLAIVLDRSASMADGFAAARQAGEESKSEAAARL--LDGFVKERPLDLFGLTLFS 134 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + +Q L+ + + +TPG+ N L + Sbjct: 135 TAPMQVLGLSSDHE-------AVRAALRAAATPGIGL-TNVAAGLALALAQFGEQPHTGS 186 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA--------------- 329 + I+F++DG + I+ + + + +Y + +++ + Sbjct: 187 RVILFVSDG---AAQIEPRAQIKLRRLFEEYRVQLYWVYLRSPGSNSPTRPPDPQAGADV 243 Query: 330 ------DQFLKNCASPDRFYSVQNSRKLHDAFLRIGK 360 F ++ +P R Y N + L A + + Sbjct: 244 APEYHLHTFFQDLGTPYRLYEADNPQALAAAISDVSR 280 >gi|296446920|ref|ZP_06888856.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296255595|gb|EFH02686.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 486 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 58/168 (34%), Gaps = 17/168 (10%) Query: 15 GSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKK 74 G+++I+ A+ + I G ++ + V+ +L+ I D + L T + Q+ K Sbjct: 19 GNVAIIFALAAIPLLIAAGGAVDFAIASRVQTQLYAICDSATLAATTPAMMQQTTATAKT 78 Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 + I N L N + + +++ + Y + + Sbjct: 79 VATSMFAAQVAQI------NRLTYNSANLTVTVNDDTSASPVKTRTVTVSYLAQVGNAFG 132 Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 F P + T ++ + +D +VLD S SM Sbjct: 133 ---SFYHVP--------TSIFTVKASSTASTARNIDFYLVLDNSPSME 169 >gi|226327208|ref|ZP_03802726.1| hypothetical protein PROPEN_01074 [Proteus penneri ATCC 35198] gi|225204426|gb|EEG86780.1| hypothetical protein PROPEN_01074 [Proteus penneri ATCC 35198] Length = 109 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 42/111 (37%), Gaps = 9/111 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM +G + + +L ++ P ++TF+S Sbjct: 3 RLPVYLLLDTSGSM---YGEPI---AAVKNGVEMLLSTLRQDPYALETAYISIITFNSTA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 Q PL I K+ L+ TT L N+I +KL K Sbjct: 57 QQIVPLT---DLINFKVPDLVASGTTALGSALTLVSNRIEKEVQKLRLRQK 104 >gi|167520728|ref|XP_001744703.1| hypothetical protein [Monosiga brevicollis MX1] gi|163777034|gb|EDQ90652.1| predicted protein [Monosiga brevicollis MX1] Length = 785 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 46/143 (32%), Gaps = 25/143 (17%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTF 223 + + +VLDVS SM D+ ++D++K V +VTF Sbjct: 136 IHLNLVLDVSGSMGAAVTAR-DESNTLIEYNLCVMDLVKFASQVAVKCLAPGDVISIVTF 194 Query: 224 SSKIVQTFPL----------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 S V + KI+ + G +T G+E + + Sbjct: 195 SDAAKIIVEPISVPDPKMGADTTVADVLGKIDAIYHGGSTNLWAGIETGLQLLASCAQPH 254 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 H + LTDGE + Sbjct: 255 LHN---------VCVALTDGEPN 268 >gi|114665186|ref|XP_001171182.1| PREDICTED: similar to leukointegrin alpha d chain, partial [Pan troglodytes] Length = 129 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 27/100 (27%), Positives = 39/100 (39%), Gaps = 14/100 (14%) Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAA 329 H KK +I +TDG+ D E +A++ G I YAIG Q A Sbjct: 5 HHKNGARKSAKKILIVITDGQ---KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTA 61 Query: 330 DQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 Q L +S D + V N A I K++ ++ Sbjct: 62 RQELNTISSAPPQDHVFKVDN----FAALGSIQKQLQEKI 97 >gi|3273261|dbj|BAA31174.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + L FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNENAIHLYLNVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|258624109|ref|ZP_05719059.1| von Willebrand factor type A domain protein [Vibrio mimicus VM603] gi|258583540|gb|EEW08339.1| von Willebrand factor type A domain protein [Vibrio mimicus VM603] Length = 598 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 64/179 (35%), Gaps = 22/179 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K +K + +++D+S SM G ++ +A + + +++IP VN V Sbjct: 412 KPEAKKRPNTAVHILVDMSSSMAYKAANGKERQDIAREASLAISMALEAIPGVNPAV--- 468 Query: 220 LVTFSSKIVQT-FPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F Q F + +Q + R T + YA ++ +E+ Sbjct: 469 -TFFGGNRNQPVFSVVKHGDTVQNRAGRFGFKATGGTPMAEAMWYAAFELTKTREER--- 524 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 K +I +TDG+ S + + C R V IGV+ A + Sbjct: 525 --------KMLIVVTDGQPQSAPA-CRSVIDLCE---RSDVEVIGIGVETAAVSGLFQK 571 >gi|288800167|ref|ZP_06405626.1| von Willebrand factor, type A [Prevotella sp. oral taxon 299 str. F0039] gi|288333415|gb|EFC71894.1| von Willebrand factor, type A [Prevotella sp. oral taxon 299 str. F0039] Length = 289 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 46/108 (42%), Gaps = 10/108 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ G+ R + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDF------GTTGMFKRDMATEIAATIAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIF 267 S +I + P G +H I+E +N T + LEY + Sbjct: 126 SDRIEKYIPPQKGRKHILYIIREMLNFESKSKKTNLSVPLEYLTKIVK 173 >gi|297198202|ref|ZP_06915599.1| von Willebrand factor [Streptomyces sviceus ATCC 29083] gi|197714651|gb|EDY58685.1| von Willebrand factor [Streptomyces sviceus ATCC 29083] Length = 592 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 70/202 (34%), Gaps = 27/202 (13%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + V+D S SM++ G G ++ V S+ + L + GL FS+++ Sbjct: 384 ITTVVDASASMSEAVPGTGRSRMDVTRASLLQALATFTQEDE------IGLWEFSTELDG 437 Query: 230 TFP---------LAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 L + RL + PG ++D A Sbjct: 438 DKDYKILVPTDRLGDSTAAGTTQRERLSAAFGGLEPVPGGATG---LYDTTLAAYKAATS 494 Query: 280 HDDYKKY--IIFLTDGENSSPNIDNKESL-----FYCNEAKRRGAIVYAIGVQAEAADQF 332 K+ ++ LTDG N P ++ +L + A+ IV A+G A+ A+ Sbjct: 495 SYAKGKFNALVVLTDGVNQDPGSISRGALISELEKLSSPARPVPLIVIAVGPDADRAEAE 554 Query: 333 LKNCASPDRFYSVQNSRKLHDA 354 A+ V + ++H Sbjct: 555 QLAEATGGSGQQVNDPAQIHTV 576 >gi|154503613|ref|ZP_02040673.1| hypothetical protein RUMGNA_01437 [Ruminococcus gnavus ATCC 29149] gi|260589891|ref|ZP_05855804.1| putative von Willebrand factor type A domain protein [Blautia hansenii DSM 20583] gi|153795713|gb|EDN78133.1| hypothetical protein RUMGNA_01437 [Ruminococcus gnavus ATCC 29149] gi|260539698|gb|EEX20267.1| putative von Willebrand factor type A domain protein [Blautia hansenii DSM 20583] Length = 709 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 70/208 (33%), Gaps = 33/208 (15%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP-----DVNNVVR 217 + L + +++D S SM D++ A ++ + D S+ ++ Sbjct: 523 PDEEQKLSVGLLVDESGSMGWG-----DRITHARKTAIVLYDFCTSLGIPITIYGHSTDS 577 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 G+ +S A + + + + + L Y + E Sbjct: 578 KGVALYS-YAEFDSLDASDRYRLMD----MSARNGNRDGAALRYVAEHLAKRPES----- 627 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKES----LFYCNEAKRRGAIVYAIGVQAEAADQFL 333 +K +I ++DG+ + E+ NE ++RG +++A + + + + Sbjct: 628 ------QKLLIIISDGQPADCGYSGTEAEADLRGIKNEYRKRGIVIFAAAIGDDKEN--I 679 Query: 334 KNCASPDRFYSVQNSRKLHDAFLRIGKE 361 + F + L ++ K+ Sbjct: 680 RRIYQDG-FLDITKLEDLPKNMTQLVKQ 706 >gi|149916833|ref|ZP_01905335.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] gi|149822550|gb|EDM81939.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] Length = 447 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 77/251 (30%), Gaps = 62/251 (24%) Query: 171 MMMVLDVSLSM---NDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 +MMVLD S SM + S+ +++D I + D + + G F S Sbjct: 146 VMMVLDKSGSMFTNTWDHDNNGGTPQITRWNSLYDVVDNITTTFD--DSINFGANLFPST 203 Query: 227 IVQTF------------PLAWGVQHIQEKINRLI------FGSTTKSTPGLEYAYNKIFD 268 + Q + G + + + + T +T G+ AYN + Sbjct: 204 LAQNIYGPQACTTSNFPEVTVGENNSAQILATIPGPGVTASYGGTPATLGVTTAYNHLTS 263 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKESLF---------YCNEAKRRGAI 318 + + +I +TDG N N N LF A G Sbjct: 264 LDP----------ELPRAMILVTDGAANCDQNAANNFQLFDVYDDGLPVIVGTAAANGVP 313 Query: 319 VYAI-----------GVQAEAAD----QFLKNCASP---DRFYSVQNSRKLHDAFLRIGK 360 Y + G+ + L A+ F++ ++ +L A + Sbjct: 314 TYVVGIDIINQTINDGIGGDPNGINPTVVLNEVAAAGGTGSFFNTEDQAELEAALTDVVA 373 Query: 361 EMVKQRILYNK 371 + I ++ Sbjct: 374 SVQTCTIPLSE 384 >gi|14248669|gb|AAK57620.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|301614604|ref|XP_002936771.1| PREDICTED: integrator complex subunit 6-A-like [Xenopus (Silurana) tropicalis] Length = 877 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 42/130 (32%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ +LD S SMN G L +A ++ + ++S + R LVT Sbjct: 4 LLFLLDTSASMNQRTHLGTTYLDIAKGAVETFM-KLRSRDPASRGDRYMLVTLEEPPYG- 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W ++ L T L A+ N++ + Sbjct: 62 IKAGWKENHATFMNELKNLQAVGLTTLGQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 284 KKYIIFLTDG 293 II +TDG Sbjct: 122 PSIIIAITDG 131 >gi|330448513|ref|ZP_08312161.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492704|dbj|GAA06658.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 899 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 50/274 (18%), Positives = 93/274 (33%), Gaps = 51/274 (18%) Query: 113 SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV-KISSKSDIGLDM 171 +L + D K Y + S + T S P +T S ++ + Sbjct: 393 TLDAVKDHVAKSYEVYVDSNSTLTASDGT-----KLSDRPGWVTVSYDQLKAGLQYDAGS 447 Query: 172 MMVLDVSLSMNDHFGPGMDKL----GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D+ S ++F D L + +S + + I D + + +VTFS+ + Sbjct: 448 RAGIDIKASDGEYFHTKFDALPYLMDIVKQSYQTLTQEILHNIDDKSKLEFNIVTFSNAV 507 Query: 228 VQTFPLAWGVQH----------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + I+ I+ L G T+ L+ A I D+ ++ Sbjct: 508 RGNTTFHYDDSTHQFVNKQNVTIENYIHDLTAGGGTQFEWPLKDASAHITDSSKRN---- 563 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + FL+DG++ +D F +G + +IGV A + + A Sbjct: 564 --------VVYFLSDGKDED-KLDTTGIHFL------KGTEIVSIGVGPSADAKQMGEIA 608 Query: 338 ------------SPDRFYSVQNSRKLHDAFLRIG 359 +P + N +L+D F IG Sbjct: 609 QMGTGYDKDNPNAPSYSKVITNGNELNDIFHNIG 642 >gi|307288765|ref|ZP_07568743.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0109] gi|306500234|gb|EFM69573.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0109] gi|315166099|gb|EFU10116.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX1302] Length = 711 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 30/187 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ +D+++V+D S SM + L + E+ D + + VR G+V + Sbjct: 139 QTESPIDLVLVIDYSSSMKGE--KLNNALKGLQQFGEELSDSLT-----DGHVRIGIVAY 191 Query: 224 SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + T + + +++ + N S T GL + + Sbjct: 192 NRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQKGLLEGQRLLAEKSRPNA-------- 243 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV---------YAIGVQAEAADQFL 333 KK ++ + DG ++ + + + Y N G I+ Y Q E+ + Sbjct: 244 -KKMLVHIGDGSANASFLPRENAQIYPN----NGEIIDYNGYHTSSYMEEFQTESNQYYT 298 Query: 334 KNCASPD 340 N AS D Sbjct: 299 SNSASTD 305 >gi|257083477|ref|ZP_05577838.1| predicted protein [Enterococcus faecalis Fly1] gi|256991507|gb|EEU78809.1| predicted protein [Enterococcus faecalis Fly1] Length = 711 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 30/187 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ +D+++V+D S SM + L + E+ D + + VR G+V + Sbjct: 139 QTESPIDLVLVIDYSSSMKGE--KLNNALKGLQQFGEELSDSLT-----DGHVRIGIVAY 191 Query: 224 SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + T + + +++ + N S T GL + + Sbjct: 192 NRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQKGLLEGQRLLAEKSRPNA-------- 243 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV---------YAIGVQAEAADQFL 333 KK ++ + DG ++ + + + Y N G I+ Y Q E+ + Sbjct: 244 -KKMLVHIGDGSANASFLPRENAQIYPN----NGEIIDYNGYHTSSYMEEFQTESNQYYT 298 Query: 334 KNCASPD 340 N AS D Sbjct: 299 SNSASTD 305 >gi|229101379|ref|ZP_04232123.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-28] gi|228682084|gb|EEL36217.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-28] Length = 627 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|221369934|ref|YP_002521030.1| hypothetical protein RSKD131_4097 [Rhodobacter sphaeroides KD131] gi|221162986|gb|ACM03957.1| Hypothetical Protein RSKD131_4097 [Rhodobacter sphaeroides KD131] Length = 1043 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 52/170 (30%), Gaps = 22/170 (12%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG-LDMMMVLDVSLSMNDHFGPGMDK 191 Y P+ + A I +++I + + LDVS SM+ Sbjct: 221 YLKPWAVRVTRVLTGEAGAAQWYPEKAPIVPEANISDAAIYITLDVSGSMSGTRMAAQKA 280 Query: 192 -LGVATRSIREMLDIIKSIPDVNNVVRSGL--VTFSSKIVQTFPLAWGVQHIQEKINRL- 247 + R I +D PD N +R L + I + +++ + L Sbjct: 281 GVAALIREIGASVD-----PDRPNDIRIVLWNAGLAGSIERRNMEPDDYTALEDWMLALS 335 Query: 248 -IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 T A G ++ +IF+TDGE S Sbjct: 336 NSTSGGTNFNAAFAEA-----------STFFAGGGSKRRIVIFVTDGEPS 374 >gi|56697084|ref|YP_167447.1| von Willebrand factor type A domain-containing protein [Ruegeria pomeroyi DSS-3] gi|56678821|gb|AAV95487.1| von Willebrand factor type A domain protein [Ruegeria pomeroyi DSS-3] Length = 565 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 56/196 (28%), Gaps = 37/196 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++VLD S SM G K+ +A + +L + GL + + Sbjct: 24 SAILVLDGSGSM-WGQIEGKAKIEIAREVVTGLLADLPQDQP------LGLTVYGHRRKG 76 Query: 230 T--------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 P I + L T + A + +EK Sbjct: 77 DCTDIETLVAPAPGNRAAIASAVAGLKPKGKTPMLEAVRQAAEALRYTEEKAT------- 129 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA--IVYAIGVQAEAADQFLK-NC-- 336 +I ++DG + + + + G + +G + + C Sbjct: 130 -----VILVSDGVETC----DADPCATAAALEAAGVDFTAHVVGFDIDDPQALAQMQCLA 180 Query: 337 -ASPDRFYSVQNSRKL 351 + F S N+ +L Sbjct: 181 DQTGGTFRSAANADEL 196 >gi|17229534|ref|NP_486082.1| hypothetical protein alr2042 [Nostoc sp. PCC 7120] gi|17131133|dbj|BAB73741.1| alr2042 [Nostoc sp. PCC 7120] Length = 608 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 28/136 (20%), Positives = 50/136 (36%), Gaps = 18/136 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+D ++V+D S SM G + SI E ++ + V + F Sbjct: 36 NPKGGIDWIVVVDTSASMRGVGGTR-NIFTQVKNSINEFVNTARVGDTVT------IYNF 88 Query: 224 SSKIV---QTFPLAWG--VQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIA 277 S + Q P+A +++ IN L T + ++ A + + + + Sbjct: 89 DSDVTLQAQEIPIASNPDRGKLKQIINNLKADGVRTHTGKAVQQA---LSTSAKLNQRPN 145 Query: 278 KGHDDYKKYIIFLTDG 293 I+FLTDG Sbjct: 146 TADRTVS--IVFLTDG 159 >gi|317502944|ref|ZP_07961036.1| von Willebrand factor [Prevotella salivae DSM 15606] gi|315665943|gb|EFV05518.1| von Willebrand factor [Prevotella salivae DSM 15606] Length = 289 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 47/122 (38%), Gaps = 6/122 (4%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S+N + R + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLNF------GTVHQTKREMVTEIAATMAFSAIQNNDKIGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S I + P G HI I L+ + + A + + ++ + D Y Sbjct: 126 SDHIEKYIPPKKGRHHILYIIRELLDFKPKSARTDIGMAVSYLTRMMKRRATVFLLSDFY 185 Query: 284 KK 285 + Sbjct: 186 TQ 187 >gi|326772465|ref|ZP_08231749.1| conserved hypothetical protein [Actinomyces viscosus C505] gi|326637097|gb|EGE37999.1| conserved hypothetical protein [Actinomyces viscosus C505] Length = 367 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 59/189 (31%), Gaps = 18/189 (9%) Query: 151 APLLITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD---- 204 L + S+ S +++ MV+D + SM G +S LD Sbjct: 62 IVLALAGPAIRGSEAISVSNVEIYMVVDRTGSMAAEDYQGKGP-DGVDQSASTRLDGVRA 120 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAY 263 +++I + R ++ + + PL + I S+ + LE A Sbjct: 121 DMRAIREAFPDSRFSIIALDNTAARELPLTHDTNAVDAWIGSFKQEVSSHATGSSLEVAL 180 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + + L + + + +DGE + DN + A G ++ Sbjct: 181 PVLG---QTLAQARQSDPKDIRLVYIFSDGEAT----DNGRGAQTADNA---GISWQSLA 230 Query: 324 VQAEAADQF 332 + Sbjct: 231 GLVDGGAVL 239 >gi|296206489|ref|XP_002750241.1| PREDICTED: von Willebrand factor A domain-containing protein 1 [Callithrix jacchus] Length = 445 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 66/200 (33%), Gaps = 28/200 (14%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + + S G D+M +LD S S++ + + + ++ +P + Sbjct: 21 GAARGPPASAPGGDLMFLLDSSASVSHYEFSRVREFVG---------QLVAPLPLDTGAL 71 Query: 217 RSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R+ LV S+ F + G + T + L Y ++F Sbjct: 72 RASLVHVGSRPYTEFSFGQHSSGKAAQDAVLASAQRMGDTHTGLALAYVKEQLF----AE 127 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 A+ K ++++TDG +S P + E K G V+ + + Sbjct: 128 AAGARPGVP--KVLVWVTDGGSSDPVGPPMQ------ELKDLGVTVFIVSTGRGNFLELS 179 Query: 334 KNCASPD----RFYSVQNSR 349 ++P F V + Sbjct: 180 AVASAPAEKHLHFVDVDDLH 199 >gi|296122919|ref|YP_003630697.1| hypothetical protein Plim_2675 [Planctomyces limnophilus DSM 3776] gi|296015259|gb|ADG68498.1| hypothetical protein Plim_2675 [Planctomyces limnophilus DSM 3776] Length = 645 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 75/207 (36%), Gaps = 19/207 (9%) Query: 120 DQHKDYNLSA-VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 Q +S S+ P + AN + I + S + ++ +LDVS Sbjct: 147 SQTPGEVISFETSQLSQPVLRAGKMTRANGVSGAVDILTEEIAQSLQEGPTRVVWILDVS 206 Query: 179 LSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI-VQTFPLAWGV 237 SM ++ D++ ++ M I++ + +GL+ F K + Sbjct: 207 PSMTEYRRVIADRMEKIYAQLKSMKLPIETSLE------TGLMAFDDKPHALSTKTVKDP 260 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSS 297 I++ INRL G+E Y + +A AK ++ I +TD E Sbjct: 261 LKIKDLINRLPSTER-----GIENTYLAVMNA-RNRYGPAKERVPFRVMFIVVTDEE--- 311 Query: 298 PNIDNKESLFYC-NEAKRRGAIVYAIG 323 D +L +C ++ + IG Sbjct: 312 -GSDVSRNLDHCIELMQKDLIRCFVIG 337 >gi|163889361|gb|ABY48131.1| zinc finger protein [Medicago truncatula] Length = 691 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 67/202 (33%), Gaps = 30/202 (14%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 L + + +D+++VLDV +M+ KL + ++R + I Sbjct: 275 LKLKTPAPAPVKVLRRAPVDVVIVLDVGGAMSGQ------KLRLMKNTMRLV------IS 322 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIF-GSTTKSTPGLEYAYNK 265 +N R +V FS + PL G + + + L + P A K Sbjct: 323 SLNATDRLSIVAFSGGSKRLLPLKRMTGGGQRSARRIVEALAAIDQIRDAVPAKNDALKK 382 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI--VYAIG 323 E + I+ L+D +S + + + + + ++A+ Sbjct: 383 AAKVLEDRREK-----NPVACIVVLSDIVDSRAVNASFQKISLVSSTRLSNMEVPIHAVR 437 Query: 324 VQAEAA------DQFLKNCASP 339 E+ D+ L C + Sbjct: 438 FPKESECTHALPDEILSKCINS 459 >gi|163846015|ref|YP_001634059.1| magnesium chelatase ATPase subunit D [Chloroflexus aurantiacus J-10-fl] gi|222523743|ref|YP_002568213.1| magnesium chelatase ATPase subunit D [Chloroflexus sp. Y-400-fl] gi|163667304|gb|ABY33670.1| magnesium chelatase ATPase subunit D [Chloroflexus aurantiacus J-10-fl] gi|222447622|gb|ACM51888.1| magnesium chelatase ATPase subunit D [Chloroflexus sp. Y-400-fl] Length = 634 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 46/135 (34%), Gaps = 20/135 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G ++D S SM H ++ A ++ +L D + L+ F Sbjct: 437 SKAGTLFCFLVDASGSMALH------RMRQAKGAVNALLQQAYVHRD-----QVALLAFR 485 Query: 225 S-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + V+ + ++ L G T L AY A+ + Sbjct: 486 GERADLLLPPSQSVELAKRALDVLPTGGGTPLAAALLAAYQISEQARSRGIFRTT----- 540 Query: 284 KKYIIFLTDGENSSP 298 I+ +TDG + P Sbjct: 541 ---IVLITDGRPNVP 552 >gi|297159760|gb|ADI09472.1| hypothetical protein SBI_06352 [Streptomyces bingchenggensis BCW-1] Length = 545 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 HA + + + + + ++V+D S SM K+ A ++ + ++ Sbjct: 61 HAIVRVEARGLGPAAARAAASEVIVIDCSGSMTW----PPTKIMAARKA------TVAAV 110 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWG-------VQHIQEKIN-RLIFGSTTKSTPGLEY 261 + R +V + + +P G + +L T L+ Sbjct: 111 SALRKGTRFAVVQGTERAEVVYPPTGGMAVAGPDTKAAASHAAMKLAALGGTAIGTWLDL 170 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN-IDNKESLFYC 309 A + H + ++ + LTDG+N + D L C Sbjct: 171 A---------RRLHAEQP--TALRHTLLLTDGKNEHEDPGDLARVLDAC 208 >gi|242780420|ref|XP_002479591.1| von Willebrand domain protein [Talaromyces stipitatus ATCC 10500] gi|218719738|gb|EED19157.1| von Willebrand domain protein [Talaromyces stipitatus ATCC 10500] Length = 952 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 64/197 (32%), Gaps = 27/197 (13%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D +++ + D S SM ++ + + RS+ E S + R + S Sbjct: 339 GDFAGEIIFMADRSGSMMSKIPSLINVMNIFLRSLPEKC----SFNISSFGSRPTWLWPS 394 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 SK + + +N+ T+ LE + H K +D Sbjct: 395 SKRYSQEDMD----IASQHVNKFQANYGGTEIYGALESVLD----------HYNKQNDVP 440 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEA---KRRGAIVYAIGVQAEAADQFLKNCASPD 340 I LTDGE D + +A +++G+ + + + ++ Sbjct: 441 TSVI-LLTDGE----VWDVDNVIKLVRKAVSESDTNIRFFSLGIGGQVSHRLVEGIGEQG 495 Query: 341 RFYSVQNSRKLHDAFLR 357 Y+ L +++ Sbjct: 496 GGYAEIVPESLMNSWQE 512 >gi|281357360|ref|ZP_06243849.1| protein of unknown function DUF58 [Victivallis vadensis ATCC BAA-548] gi|281316391|gb|EFB00416.1| protein of unknown function DUF58 [Victivallis vadensis ATCC BAA-548] Length = 293 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 20/125 (16%), Positives = 38/125 (30%), Gaps = 10/125 (8%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + + S + L +++++DVS S R L + + Sbjct: 58 NVSARMGSPYVKKYMEERELTVLLLVDVSAS------GAFGSAEKTKRRTAAELAALLAF 111 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----GSTTKSTPGLEYAYNK 265 +N + GL+ FS +I P G +H I ++ T L Sbjct: 112 SAGHNGDKVGLLMFSDRIELFVPPRSGRRHTLRLIREMLAFEPQSKGTDIDLALRETLQV 171 Query: 266 IFDAK 270 + Sbjct: 172 LKKKS 176 >gi|51893578|ref|YP_076269.1| magnesium chelatase [Symbiobacterium thermophilum IAM 14863] gi|51857267|dbj|BAD41425.1| magnesium chelatase [Symbiobacterium thermophilum IAM 14863] Length = 741 Score = 41.7 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 65/196 (33%), Gaps = 31/196 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 ++ LD+ +++D S SM ++ A R +L V+ R ++ F Sbjct: 558 AEQSLDICLLIDASASMAG------RRILAAKHLARHLL--------VSTRDRIAVIAFQ 603 Query: 225 SK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + + P +++ + R+ T GL + I A+ + Sbjct: 604 ERDVRVYVPFTRSYSAVEDGLARIQPMGLTPLAHGLIRSMELIHSARVRRP--------- 654 Query: 284 KKYIIFLTDGENSSPNIDN---KESLFYCNEAKRRGAIVYAIGVQAEAA--DQFLKNCAS 338 ++ +TDG + P +++ + + + IG+Q +Q ++ Sbjct: 655 --LLLLITDGIPTVPKWSVDPLADAVEAARQLRAQRIPFTCIGLQPSRRYLEQLVRQAGG 712 Query: 339 PDRFYSVQNSRKLHDA 354 + L Sbjct: 713 TLHVVDELSEESLIRI 728 >gi|228919479|ref|ZP_04082843.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] gi|229068309|ref|ZP_04201612.1| Von Willebrand factor type A domain protein [Bacillus cereus F65185] gi|228714770|gb|EEL66642.1| Von Willebrand factor type A domain protein [Bacillus cereus F65185] gi|228840122|gb|EEM85399.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] Length = 610 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 406 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 453 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 454 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 513 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 514 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 573 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 574 QLMKNIY--NHHFLVANHAE 591 >gi|169604987|ref|XP_001795914.1| hypothetical protein SNOG_05509 [Phaeosphaeria nodorum SN15] gi|160706682|gb|EAT86573.2| hypothetical protein SNOG_05509 [Phaeosphaeria nodorum SN15] Length = 1180 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 45/250 (18%), Positives = 90/250 (36%), Gaps = 28/250 (11%) Query: 109 ERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIG 168 + S SL++ + KD+ + +++ ++H L+ T K S ++ Sbjct: 238 KASASLTLGTAELDKDFVMQVIAKNTGVPKAVLENHPTIANHRALMTTLVPKFSLPAEKP 297 Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +++ V D S SMN + +A ++++ L KS+P V F+ Sbjct: 298 -ELVFVCDRSGSMNG------TSMELAKQALKVFL---KSLPVG--------VKFNICSF 339 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + ++ + N+ + YN I + + D II Sbjct: 340 GSS-YSFLWKKSASY-NQENLDEAVRHAEQFSANYNGTEMLAPLKATIDQRYKDMPLDII 397 Query: 289 FLTDGENSSPNIDNKESLFYCNEAK---RRGAIVYAIGVQAEAADQFLKNCASPDR-FYS 344 LTDG+ D + Y NEA ++ V+ +G+ + ++ A F Sbjct: 398 LLTDGQ----IWDQERLFSYLNEAITGSKQPVRVFTLGIGNGVSHALIEGVAKAGNGFSQ 453 Query: 345 VQNSRKLHDA 354 + + DA Sbjct: 454 AVGAGEKMDA 463 >gi|293357059|ref|XP_345157.4| PREDICTED: integrin, alpha 2 [Rattus norvegicus] Length = 1269 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 39/289 (13%), Positives = 92/289 (31%), Gaps = 36/289 (12%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS---RYEMPFIFCTFPWCANSS 149 + L+ + ++ I+ + SL + + ++ + C++ S Sbjct: 182 KLNLQNSASISNVTEIKTNMSLGLTLTRNPGTGGFLTCGPLWAHQCGNQYYATGICSDVS 241 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 +TS +D+++V D S S + + + + + Sbjct: 242 PDFQSLTSFSPAVQACPSLVDVVVVCDESNS--------IYPWEAVKNFLEKFVQGLDIG 293 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 P L+ +++ F L + K + + S T+ G K Sbjct: 294 PKKTQ---VALIQYANDPRVVFNLT----TYKNKEDMVQATSETRQYGGDLTNTFKAIQF 346 Query: 270 KEKLEHIAKGHDDY--KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--- 324 + ++ + K ++ +TDGE+ + + + CN + + I V Sbjct: 347 ARDIAYLPESGGRPGATKVMVVVTDGESHDGSK-LQTVIQQCN---DDEILRFGIAVLGY 402 Query: 325 ---QAEAADQF---LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 A +K AS F++V + L + +G+ + Sbjct: 403 LNRNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAGTLGEHIFS 451 >gi|229114220|ref|ZP_04243641.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock1-3] gi|228669240|gb|EEL24661.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock1-3] Length = 627 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|10198152|gb|AAG15217.1|AF288461_3 BchD [Chloroflexus aurantiacus] Length = 634 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 46/135 (34%), Gaps = 20/135 (14%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S G ++D S SM H ++ A ++ +L D + L+ F Sbjct: 437 SKAGTLFCFLVDASGSMALH------RMRQAKGAVNALLQQAYVHRD-----QVALLAFR 485 Query: 225 S-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + P + V+ + ++ L G T L AY A+ + Sbjct: 486 GERADLLLPPSQSVELAKRALDVLPTGGGTPLAAALLAAYQISEQARSRGIFRTT----- 540 Query: 284 KKYIIFLTDGENSSP 298 I+ +TDG + P Sbjct: 541 ---IVLITDGRPNVP 552 >gi|83647103|ref|YP_435538.1| nitric oxide reductase activation protein [Hahella chejuensis KCTC 2396] gi|83635146|gb|ABC31113.1| Nitric oxide reductase activation protein [Hahella chejuensis KCTC 2396] Length = 618 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 62/165 (37%), Gaps = 26/165 (15%) Query: 180 SMNDHFGPGMDK--LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGV 237 SM+ D+ + V + ++ + + ++ D R L FSS Sbjct: 435 SMSTDAYVDNDRRVIDVISDALLLFSEALDAVGD-----RFALYGFSSCK---------R 480 Query: 238 QHIQEKINRLIFGSTTKSTPGLEYA-----YNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 QH++ + + T + G A Y +I A + I + ++ ++ LTD Sbjct: 481 QHVRFHVIKNFAEPYTNAVRGRIVALRPGFYTRIGAAIRQSTTILRDQKASRRLLLILTD 540 Query: 293 GENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 G+ + +++ EA+R G I + I V EA D Sbjct: 541 GKPNDLDLYEGRYGLEDTRQAVLEARRAGLIPFCITVDKEAGDYL 585 >gi|322436386|ref|YP_004218598.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164113|gb|ADW69818.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 384 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 52/170 (30%), Gaps = 19/170 (11%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 +D+ L + +++D S S L + LD + V + Sbjct: 90 RDADLPLTLGLLVDTSQSQR-------AALDDERAASTTFLDQMLKGKQDKAFV----IQ 138 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIA 277 F+ + L +Q + L S + ++DA Sbjct: 139 FAHETELLQDLTDSRPKLQAALKDLDSSPQRSSDSSDSDGHRDRGGTVLYDAAFLAADEI 198 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 +K II LTDG++ +++ A VYAI + E Sbjct: 199 LSKPKGRKAIILLTDGDDRGSKETITKAIEAAQRADT---TVYAIYFKGE 245 >gi|260769129|ref|ZP_05878062.1| putative hemolysin [Vibrio furnissii CIP 102972] gi|260614467|gb|EEX39653.1| putative hemolysin [Vibrio furnissii CIP 102972] Length = 1476 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 55/160 (34%), Gaps = 24/160 (15%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHF--------GPGMDKLGVATRSIREMLDIIKSIP-- 210 +S + + ++ +++D S SM + ++ ++D + ++ Sbjct: 857 VSVQPGVNYNIALIIDTSGSMKFDLAGNDNGFSNSYQSQSQYNASRMKLVIDALTNLATD 916 Query: 211 --DVNNVVRSGLVTFSSKIVQTFPL---AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 + + V+ L+ F S L A +Q + +I + T + A N Sbjct: 917 LVNHDGVININLIGFESSAHSALTLQLTADNLQQLLTEIQDMDAEGGTNYEAAFDLASNW 976 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 H + Y FLTDG+ + N + + Sbjct: 977 F-------SHQPTEGYENLTY--FLTDGDPTFSNSGDNGA 1007 >gi|254503616|ref|ZP_05115767.1| von Willebrand factor type A domain protein [Labrenzia alexandrii DFL-11] gi|222439687|gb|EEE46366.1| von Willebrand factor type A domain protein [Labrenzia alexandrii DFL-11] Length = 609 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 46/155 (29%), Gaps = 17/155 (10%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS--S 225 G + +V+D+S SM + L S E+++ I R G+VTF+ Sbjct: 146 GQSVALVIDLSGSMG-------NDLASVKSSALELINAIFGTDAAPVASRLGIVTFNNTD 198 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN-KIFDAKEKLEHIAKGHDDYK 284 I P K + + G N + A + + Sbjct: 199 SIQIVLPFTEQANISDRKSAAIDAINGLAILGGGAEPLNGALLTALRGDIGSWRTGTNK- 257 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 II TD P D + A G + Sbjct: 258 --IIVFTD----EPAADPELRDDVITLANDLGVQI 286 >gi|195978918|ref|YP_002124162.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195975623|gb|ACG63149.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 967 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 35/256 (13%), Positives = 81/256 (31%), Gaps = 69/256 (26%) Query: 165 SDIGLDMMMVLDVSLSMNDHFG----PGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-- 218 LD+++V+D S SM + P + A +L +I N + Sbjct: 411 KKQPLDVLVVVDRSASMKEGISQNDIPRDQAVKNALTGAGGLLQKFININAENKLSVIGF 470 Query: 219 -GLVTFSSKIVQTFPLAW-------------GVQHIQEKINRLIFG---------STTKS 255 G + ++S+ + ++W ++ + + T Sbjct: 471 QGSLNYNSREGKPERISWRSIIYQPSINNNKDADVLKNWESSSALNRDDLSYKDKNGTNY 530 Query: 256 TPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG------------------ENSS 297 L A + + K ++F++DG + S+ Sbjct: 531 HAALVKADEMLNKVADDGHR---------KIMVFVSDGVPTFYFGSDHYRAGNGTSDASN 581 Query: 298 PNIDNKESLFYCNEAKRR--GAIVYAIGVQAEAADQ-------FLKNCASPDRFYSVQNS 348 + ++ K++ +Y++GV + LK + D +Y + N+ Sbjct: 582 IKSSQDGTRAAIDDFKKKHPNLSIYSLGVSKDINSDTASSSPVVLKYLSGEDHYYGITNT 641 Query: 349 RKLHDAFLRIGKEMVK 364 +L +I ++V+ Sbjct: 642 VEL----EKIANKIVE 653 >gi|170750593|ref|YP_001756853.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] gi|170657115|gb|ACB26170.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] Length = 359 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 51/160 (31%), Gaps = 23/160 (14%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF---GPGMDKLGVA 195 + P + ++ IG + +++D S SMN+ F P + A Sbjct: 60 ILAVAGLIAALAGPYRTGARIER---LGIGAQVSLLIDRSGSMNETFAGRQPSGAEESKA 116 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST--T 253 S R + D + + V FS+ + P+ ++ I + T Sbjct: 117 AASRRLLADFVGARAHDLFAV----TAFSTAPMLVMPMTDRHDAVRAAIAAIDRPGLDYT 172 Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 GL A ++ + ++ ++DG Sbjct: 173 NIGRGLGMALSQFGIDVG-----------ASRVLLLVSDG 201 >gi|148656939|ref|YP_001277144.1| magnesium chelatase ATPase subunit D [Roseiflexus sp. RS-1] gi|148569049|gb|ABQ91194.1| protoporphyrin IX magnesium-chelatase [Roseiflexus sp. RS-1] Length = 610 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 74/216 (34%), Gaps = 25/216 (11%) Query: 115 SIIIDDQHKDYNLSAVSRYEMPFIFC--TFPWCANSSHAPLLITSSVKISS-KSDIGLDM 171 SI D + ++SA R PF + S LL ++I +S G Sbjct: 367 SIPGDPRRGRIDVSATLRVAAPFQPLRRSQAGADAQSSRVLLRADDLRIKQYRSKAGALF 426 Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQT 230 + +D S SM H ++ A ++ +L D R L+ F + Sbjct: 427 LFAVDASGSMALH------RMRQAKGAVHALLQKAYVHRD-----RVALLAFRGQSAELL 475 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + V+ + ++ L G T L A + A+ + ++ L Sbjct: 476 LPPSQSVELARRALDLLPTGGGTPLAAALLAAI--------DVAQQARARGIMQTVLVLL 527 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 TDG + ++ E + G V A G+Q Sbjct: 528 TDGRANVGLRAGRD--QVAEELQTLGRAVVASGIQT 561 >gi|297663822|ref|XP_002810364.1| PREDICTED: integrin alpha-10-like [Pongo abelii] Length = 1177 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 166 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAILVACTEGFSQSHGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|159465497|ref|XP_001690959.1| hypothetical protein CHLREDRAFT_188557 [Chlamydomonas reinhardtii] gi|158279645|gb|EDP05405.1| predicted protein [Chlamydomonas reinhardtii] Length = 434 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 53/200 (26%), Gaps = 58/200 (29%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K + + VLD S SM+ ++ + + ++D + PD G+V++ Sbjct: 165 KQRAHVALTCVLDRSGSMSGE------RIALVRETCHFLIDQLT--PD----DYLGIVSY 212 Query: 224 SSKIVQTFPLAWGVQHIQ----EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 S + PL + ++ L +T GL + Sbjct: 213 SGGVRADVPLLRMTPAARGLAHAMVDALEADGSTALYDGLVAGVRQ-------------- 258 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC--A 337 + V+ G A + + L+ A Sbjct: 259 --------------------------QMEAEAPTDQHVTVHTFGFGAGHSVELLQAVADA 292 Query: 338 SPDRFYSVQNSRKLHDAFLR 357 +Y + + F Sbjct: 293 QSGVYYYISCVDDIPSGFGD 312 >gi|14248631|gb|AAK57601.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E +T T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGSTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|310114429|ref|XP_003119946.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 481 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 41/111 (36%), Gaps = 13/111 (11%) Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T++ L + +K G ++Y+I +TDG I ++L Sbjct: 138 GTRTGKALNFTLPFFDSSKG-------GRPSVQQYLIVITDGVAQDNVIIPAKAL----- 185 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEM 362 + + I++AIGV Q L+ D Y N L + I ++ Sbjct: 186 -RDKNTIIFAIGVGEAKKSQLLEITNDEDNVYHDVNFEALQNLEKEILSKV 235 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 40/89 (44%), Gaps = 12/89 (13%) Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 G + +Y+I +TDG++S + E L + G +YAIG++ EA LK Sbjct: 6 ADTGRINVARYLIVITDGKSSDSVAEAAEGL------RANGVNIYAIGIR-EANIDELKE 58 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 A F+ + D I KE+V+ Sbjct: 59 IAKDKIFFVYE-----FDLLKDIQKEVVQ 82 >gi|304373664|ref|YP_003858409.1| hypothetical protein RB16p109 [Enterobacteria phage RB16] gi|299829620|gb|ADJ55413.1| conserved hypothetical phage protein [Enterobacteria phage RB16] Length = 739 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 40/129 (31%), Gaps = 12/129 (9%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DVS SM L S+ + D + + + G V K+ L Sbjct: 23 VCDVSGSMYSELPKIRKHLKANLASLVKQDDTVSILYFSSKGDY-GTVFRGEKVSNVSDL 81 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 I L T L A D + + ++ +IFLTDG Sbjct: 82 TNICTAIDRY---LKPTGCTGFVEPLNLAAEIATDLQSENGNLNS--------LIFLTDG 130 Query: 294 ENSSPNIDN 302 ++ D+ Sbjct: 131 YDNCWRTDD 139 >gi|301109958|ref|XP_002904059.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262096185|gb|EEY54237.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 1481 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 55/191 (28%), Gaps = 26/191 (13%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 VLD S SM+ P D L L LV++ + Sbjct: 1294 VLDNSGSMSG--QPWKDLLCACDEFGISRLKDGGEKD---------LVSY---------V 1333 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH--DDYKKYIIFLT 291 + + + + + + + + +++K +IF + Sbjct: 1334 TFDHEG-RIFCEGVPLPEALEMSVPFGGGGTSYGKGLRAANEVLSRNDFEEFKVVLIFFS 1392 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS--PDRFYSVQNSR 349 DG+ + + + +A+G L+ AS R+ V ++ Sbjct: 1393 DGQPCDIEMGVALARHIRLSYAKYDLKAFAVGFGCI-NLPVLQRVASEMGGRYRQVLDAN 1451 Query: 350 KLHDAFLRIGK 360 L F RI Sbjct: 1452 ALRTEFQRIAA 1462 >gi|147677783|ref|YP_001211998.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] gi|146273880|dbj|BAF59629.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] Length = 312 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 30/82 (36%), Gaps = 4/82 (4%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 NF N G +++ + +F LV + + +L +D + L A ++ + Sbjct: 13 NFLNNQNGLAAVMLCAGMAALFGFAALVTDIGLLAAKRQQLINTMDAAALAGAQELPDNP 72 Query: 68 NGNNGKKQ----KNDFSYRIIK 85 + KN F+ + Sbjct: 73 AQAVQVARDYAGKNGFAPDSLN 94 >gi|14248677|gb|AAK57624.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|75812416|ref|YP_320035.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75705172|gb|ABA24846.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 218 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 60/179 (33%), Gaps = 31/179 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS------ 224 ++++LD S SM+ + R + + + + V ++TF Sbjct: 18 VILLLDTSGSMSGQ------PIQELNRGLATFKEDVIKDSQASLSVEVAIITFGPVRLVQ 71 Query: 225 --SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 I Q P +L T +EYA + + K + G Sbjct: 72 DFVNIDQFTPP------------QLEAEGVTPMGEAIEYALDLL--ETRKSAYKENGILY 117 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 Y+ +I +TDG + + + R + +GVQ ++ L+ A +R Sbjct: 118 YRPWIFLITDGAPTDYYHLAAQRVKEAEA--NRRLCFFTVGVQGADFNK-LRQIAPAER 173 >gi|309787993|ref|ZP_07682602.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1617] gi|308924127|gb|EFP69625.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1617] Length = 378 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFGTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|300787704|ref|YP_003767995.1| hypothetical protein AMED_5848 [Amycolatopsis mediterranei U32] gi|299797218|gb|ADJ47593.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 476 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 50/134 (37%), Gaps = 29/134 (21%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ-- 229 ++++D S SM+ K+ A + R +I + + V G+V +S+ Sbjct: 50 LLLVDCSSSMDW----PPTKIEAARHACR------AAIGSLRDGVLFGVVECTSRARLVY 99 Query: 230 --TFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 PLA + + + LI G T + L+ A + + H Sbjct: 100 PAEPPLAVAGERTRREAKAAASGLIAGGGTAMSTWLDLARDLFGKTPAVIRHA------- 152 Query: 284 KKYIIFLTDGENSS 297 + LTDG+N S Sbjct: 153 ----VLLTDGKNES 162 >gi|298370193|ref|ZP_06981509.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281653|gb|EFI23142.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1071 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 65/178 (36%), Gaps = 32/178 (17%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL--DMMMVLDVSLSMND-------- 183 + F A++ A + SS +G+ ++M+ +D S SM Sbjct: 20 SIASALSLFSLEAHAQFASTPLYLQNDSSSAGQLGVKHNIMLFIDDSGSMKWIPGEKRPP 79 Query: 184 HFGPGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 + +L + + ++LD + S+ ++N R+ P + Sbjct: 80 RYYGEKSRLAITKSVLNKVLDKYQDQFNWSLQTLHNNGRT------DTADFKTP----WK 129 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 ++++++ + + T +T Y +I + K +++ ++DG+ + Sbjct: 130 SVKDRVDGIEAANGTPTT----RRYYEIVSGIVMPS---VEYRCQKAFVVLMSDGDAN 180 >gi|119871810|ref|YP_929817.1| hypothetical protein Pisl_0296 [Pyrobaculum islandicum DSM 4184] gi|119673218|gb|ABL87474.1| conserved hypothetical protein [Pyrobaculum islandicum DSM 4184] Length = 429 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 47/131 (35%), Gaps = 18/131 (13%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D + +++D S SM F D + + + + R L F Sbjct: 264 DAREKIYLLVDKSGSM---FYTLYDGIAMDMTQKITWATALAIALMKRSK-RVILRFFDQ 319 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + P ++ + + R++ T T + Y + DAK++ H K Sbjct: 320 MVY---PPITNIKEVIRSLLRVLPLGGTDITAAV---YTAVRDAKQQGLHSYK------- 366 Query: 286 YIIFLTDGENS 296 +I +TDGE+ Sbjct: 367 -LIVITDGEDD 376 >gi|319892409|ref|YP_004149284.1| Nitric oxide reductase activation protein NorD [Staphylococcus pseudintermedius HKU10-03] gi|317162105|gb|ADV05648.1| Nitric oxide reductase activation protein NorD [Staphylococcus pseudintermedius HKU10-03] Length = 629 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 57/164 (34%), Gaps = 19/164 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++D S SM+ DK+ + + + +K++ + + + F+ Sbjct: 437 FTLLIDASASMH-------DKMDETIKGVVLFHETLKALNVKHEI-----LAFNEDA-FD 483 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK---EKLEHIAKGHDDYKKYI 287 + I E IN T++ + + ++++ Sbjct: 484 ADAQYQPNIIDEIINYHQSTFNTEAPRIMSLTPQDDNRDGVAIRVASDRLLSRSEQQRFL 543 Query: 288 IFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGVQAEA 328 I +DGE S+ N ++ +++ G V+ + + E+ Sbjct: 544 IVFSDGEPSAFNYSQDGILDTYEAVENSRKLGIEVFNVFLSQES 587 >gi|308462088|ref|XP_003093330.1| hypothetical protein CRE_03438 [Caenorhabditis remanei] gi|308250341|gb|EFO94293.1| hypothetical protein CRE_03438 [Caenorhabditis remanei] Length = 382 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 45/147 (30%), Gaps = 19/147 (12%) Query: 215 VVRSGLVTFSSKIVQTFPL----AWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 R LVT++S L + G ++ + L + GL A ++ Sbjct: 72 TTRVALVTYNSDSYTNADLNQFQSTGDLFNNVFSALATLSSTDQSYLETGLSTA-EQLLK 130 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 A + + A +K+ +I + + + K G + + Sbjct: 131 AGKNQFNRA----HFKRVVIVYASAYEGEG---ERAPMPVADRLKGDGVKIITVAYDQRG 183 Query: 329 ADQFLK---NCASPDRFYSVQNSRKLH 352 L ASP + N+R L Sbjct: 184 DGALLDQLAKIASPRMNF--TNNRDLV 208 >gi|6198477|gb|AAD31490.2|AF138797_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 434 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 63/145 (43%), Gaps = 7/145 (4%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V + K D G D+M +LDVS M ++F +++ ++ K + Sbjct: 191 TIDVTVTVKPKQIDEGADVMALLDVSQKMTKENFDKAKEQIKKMVTTLTGEPTDGKENHN 250 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 N VR L+TF K+ + L +++ + ++ + + G++ I A+E Sbjct: 251 RRNSVR--LMTFYRKVNEPIELT--AENVDKTLDEVWKKAKEDWDWGVDLQ-GAIHKARE 305 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENS 296 + +++I+ + GE++ Sbjct: 306 IFNKEKEKKSGKRQHIVLFSQGEST 330 >gi|221193615|gb|ACM07840.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 67/211 (31%), Gaps = 50/211 (23%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEK 243 R LVT+ S I L + I ++ Sbjct: 274 DNRVALVTYGSDIFDGRSVDVVKGFKEDDKYYGLQTKFTIQTENYSHKQLTNNAEEIIKR 333 Query: 244 I----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---------KGHDDYKKYIIFL 290 I + +GSTT + + E A + + + +K I+ + Sbjct: 334 IPTEAPKAKWGSTTNGLTPEQQKEYYLSKVGETFTMKAFMEADDILSQVNRNSQKIIVHV 393 Query: 291 TDGENSSPNIDNKESLFYC-----NEAKRRG 316 TDG + N L + K+ G Sbjct: 394 TDGVPTRSYAINNFKLGASYESQFEQMKKNG 424 >gi|189518251|ref|XP_696635.3| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4 [Danio rerio] Length = 1094 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 46/137 (33%), Gaps = 26/137 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++ +D+S SM +L +A +I +LD + VN ++ +S + Sbjct: 243 DIIIAVDISGSMKGL------RLTIAKHTINTILDTLGENDFVN------VIAYSDYVQY 290 Query: 230 TFP---------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 P +H + + L K ++ ++ + Sbjct: 291 VEPCFKGTLVQADLDNREHFKLLVQELQVKGEGKVKKAMKESFKIL-----NEVTAEGRG 345 Query: 281 DDYKKYIIFLTDGENSS 297 + I+ +TDG Sbjct: 346 SLCNQAIMLITDGAMED 362 >gi|77552602|gb|ABA95399.1| hypothetical protein LOC_Os11g45380 [Oryza sativa Japonica Group] gi|125578051|gb|EAZ19273.1| hypothetical protein OsJ_34814 [Oryza sativa Japonica Group] Length = 553 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 22/99 (22%), Positives = 42/99 (42%), Gaps = 12/99 (12%) Query: 216 VRSGLVTFSSKIVQT-----FPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R +V F+ K+V + G K+N+L G TK P L++A + + D+ Sbjct: 31 DRLAIVPFNGKVVAAGATRLMEMTTKGRADANAKVNQLKAGGDTKFLPALKHA-SGLLDS 89 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 + + + +I L+DG+++ D + Y Sbjct: 90 RPAGDKQYRPG-----FIFLLSDGQDNGVLDDKLGGVRY 123 >gi|291451535|ref|ZP_06590925.1| conserved hypothetical protein [Streptomyces albus J1074] gi|291354484|gb|EFE81386.1| conserved hypothetical protein [Streptomyces albus J1074] Length = 444 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 49/184 (26%), Gaps = 23/184 (12%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 MP P ++ + + + + +VLD S SM ++ G Sbjct: 193 GEPRMPISLDKIPPGLVDLTKTAAVSLRKRGLADRGLRAAVYLVLDRSGSMRPYYRDGTV 252 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLI 248 + V V F + ++ +QE R Sbjct: 253 QHLAEQALALAAHLDDDGTVPV--------VFFDTDAHPATEVSLDAYEGRVQELHERYG 304 Query: 249 FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFY 308 TT + E +EH ++IF TDG P+ + Sbjct: 305 HMGTTNYAAAM----------LEVIEHYTATGATAPAFVIFQTDG---GPDAKREAERVL 351 Query: 309 CNEA 312 C A Sbjct: 352 CRAA 355 >gi|306823288|ref|ZP_07456664.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|304553920|gb|EFM41831.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] Length = 344 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 68/195 (34%), Gaps = 32/195 (16%) Query: 151 APLLITSSVKISSKSDIGL---DMMMVLDVSLSMNDH-----FGPGMDKLGVATRSIREM 202 ++I + +SS + + D+++ +DV+ SM +L VA ++++ Sbjct: 62 VAVMILTPSIVSSTHNRAINATDVVIAVDVTGSMAVKDAQYGSDELQTRLDVAKQAVK-- 119 Query: 203 LDIIKSIPDVN-NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 DI P+ + VR G PL + I + L T + G Sbjct: 120 -DITGLYPNSSFAAVRFGASG-----TLDVPLTPDSKAIDNWADTL-APEATSVSSG--- 169 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYII--FLTDGENSSPNI--DNKESLFYCNEAKRRGA 317 + + + L + I+ ++DGE +S Y N+A Sbjct: 170 STLDVPIDQLLLTCKSIHEQHPDDAIVMYLISDGEQTSSKTRRTFSSLRRYLNDA----- 224 Query: 318 IVYAIGVQAEAADQF 332 + + V +E Q Sbjct: 225 --FTVAVGSEQGGQI 237 >gi|323512335|gb|ADX87791.1| putative cobalamin biosynthesis protein CobT [Vibrio phage ICP2_2006_A] Length = 580 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 76/221 (34%), Gaps = 27/221 (12%) Query: 149 SHAPLLITSSVKISSKSDIGL----DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + T S I + + + + ++LD S SM+ L + + L+ Sbjct: 375 KAVAKVTTGSDVIFRQKEQKVVLDTAVTVLLDSSGSMSGRSKYLHGMLACCM--LNDALN 432 Query: 205 IIKSIPDVNNVVRS--GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + +V ++ G +++ ++ T +G + + +S G++ A Sbjct: 433 KVGIPIEVLGFTQTYEGSNSYNQHLIHTP---FGRRDTARDL--------VESMDGVDLA 481 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN-KESLFYCNEA-KRRGAIVY 320 N A H +K +I L+DG + ++ + E K+ +Y Sbjct: 482 NNDDGAAIMWAHSRLIRHKAKRKILIVLSDGSPACLQANSYAFTKQVVEEIEKKSPVEIY 541 Query: 321 AIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 IG+ + + ++ +L A L + K Sbjct: 542 GIGIMDDNVKRIYSQSE------VIRTPEQLESALLNVVKS 576 >gi|229089685|ref|ZP_04220947.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-42] gi|228693715|gb|EEL47416.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-42] Length = 609 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 405 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 452 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 453 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 512 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 513 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 572 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 573 QLMKNIY--NHHFLVANQAE 590 >gi|229188830|ref|ZP_04315864.1| Von Willebrand factor type A domain protein [Bacillus cereus ATCC 10876] gi|228594643|gb|EEK52428.1| Von Willebrand factor type A domain protein [Bacillus cereus ATCC 10876] Length = 610 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 406 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 453 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 454 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 513 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 514 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 573 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 574 QLMKNIY--NHHFLVANHAE 591 >gi|227519070|ref|ZP_03949119.1| conserved hypothetical protein [Enterococcus faecalis TX0104] gi|227073490|gb|EEI11453.1| conserved hypothetical protein [Enterococcus faecalis TX0104] Length = 711 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 30/187 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ +D+++V+D S SM + L + E+ D + + VR G+V + Sbjct: 139 QTESPIDLVLVIDYSSSMKGE--KLNNALKGLQQFGEELSDSLT-----DGHVRIGIVAY 191 Query: 224 SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + T + + +++ + N S T GL + + Sbjct: 192 NRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQKGLLEGQRLLAEKSRPNA-------- 243 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV---------YAIGVQAEAADQFL 333 KK ++ + DG ++ + + + Y N G I+ Y Q E+ + Sbjct: 244 -KKMLVHIGDGSANASFLPRENAQIYPN----NGEIIDYNGYHTSSYMEEFQTESNQYYT 298 Query: 334 KNCASPD 340 N AS D Sbjct: 299 SNSASTD 305 >gi|111225854|ref|YP_716648.1| hypothetical protein FRAAL6519 [Frankia alni ACN14a] gi|111153386|emb|CAJ65142.1| conserved hypothetical protein [Frankia alni ACN14a] Length = 450 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 44/150 (29%), Gaps = 29/150 (19%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VR-SGLVT--FSSK 226 +++LD S SM P ++ I +PD VR +G + + Sbjct: 73 VLLLDCSGSMA---NPP----AKIAQARHAASAAIDCLPDGVWFAIVRGTGAASMIYPDA 125 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 + + ++++ T L+ A + H Sbjct: 126 PYLVAASTVTRERARLAVSQMEPHGGTAIGRWLDLARALTATRPRAIAHA---------- 175 Query: 287 IIFLTDGENSSPNID------NKESLFYCN 310 + LTDG+N D F C+ Sbjct: 176 -LLLTDGQNGELATDLAAAVAAARGRFQCD 204 >gi|309801885|ref|ZP_07696000.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|308221441|gb|EFO77738.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 339 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 68/195 (34%), Gaps = 32/195 (16%) Query: 151 APLLITSSVKISSKSDIGL---DMMMVLDVSLSMNDH-----FGPGMDKLGVATRSIREM 202 ++I + +SS + + D+++ +DV+ SM +L VA ++++ Sbjct: 57 VAVMILTPSIVSSTHNRAINATDVVIAVDVTGSMAVKDAQYGSDELQTRLDVAKQAVK-- 114 Query: 203 LDIIKSIPDVN-NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 DI P+ + VR G PL + I + L T + G Sbjct: 115 -DITGLYPNSSFAAVRFGASG-----TLDVPLTPDSKAIDNWADTL-APEATSVSSG--- 164 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYII--FLTDGENSSPNI--DNKESLFYCNEAKRRGA 317 + + + L + I+ ++DGE +S Y N+A Sbjct: 165 STLDVPIDQLLLTCKSIHEQHPDDAIVMYLISDGEQTSSKTRRTFSSLRRYLNDA----- 219 Query: 318 IVYAIGVQAEAADQF 332 + + V +E Q Sbjct: 220 --FTVAVGSEQGGQI 232 >gi|289643958|ref|ZP_06476060.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] gi|289506234|gb|EFD27231.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] Length = 434 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 68/226 (30%), Gaps = 22/226 (9%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 ++ D +Y + A + P T + ++ + G ++ Sbjct: 2 VTFSSDVSQNEYLSEGATDVHAVVTVNAVADEAGAGAVPGGPTGAGQVGA--GPGTAEVI 59 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK-SIPDVNNVVRSGLVTFSSKIVQTFP 232 ++D S SM ++ A +I + D + ++ ++ R V +S Sbjct: 60 IIDSSGSMGGK--KIIEARQAAAVAIDTLPDGVAFAVVSGDSSARV--VYPTSGTGLAEA 115 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 A ++ + + T L + + H I LTD Sbjct: 116 SAATRSAAKDAVRGVQADGGTAIGQWLTLTRSLMTTRPGAARHA-----------ILLTD 164 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS 338 G+N +L C + GV A+ L+ AS Sbjct: 165 GQNGESEQVFAAALAQCEGVFQCDCR----GVGADWRVSELRRIAS 206 >gi|228963719|ref|ZP_04124861.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar sotto str. T04001] gi|228795955|gb|EEM43421.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar sotto str. T04001] Length = 610 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 406 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 453 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 454 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 513 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 514 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 573 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 574 QLMKNIY--NHHFLVANHAE 591 >gi|146170189|ref|XP_001017437.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|146145085|gb|EAR97192.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 865 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 74/248 (29%), Gaps = 21/248 (8%) Query: 102 AQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 Q + + S + I D ++ + + + + I S K Sbjct: 85 EQIVQDKASSDEIQIQQDSENTANKKQIIKGMTLEDFKSSLVELNQDLKIIMPIIQSEKD 144 Query: 162 SS---KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 S S LD+ ++D++ SM+ V + I + P ++ +R Sbjct: 145 QSKKWNSSCSLDLCFIVDITGSMSPWMN------QVVQYVGNTLKTIYEDFPAID--LRV 196 Query: 219 GLVTFSSKIVQ----TFPLAWGVQHIQEKINRLIFGSTT----KSTPGLEYAYN-KIFDA 269 + Q T+ V + ++ ++ T GL A N + Sbjct: 197 AFSGYRDHCDQHKYVTYDFTRNVDSFRSQLAQVTCLGGGDCPEDVTGGLHNALNFDWGSS 256 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGE-NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 ++ IA KY + D N PN E L K+ V I + Sbjct: 257 AQQAVFIADSPAHGVKYHENIADDYPNGCPNGRKLEDLMIQFSQKKIDFTVVEISGMTKK 316 Query: 329 ADQFLKNC 336 + C Sbjct: 317 MFSIMTQC 324 >gi|330914182|ref|XP_003296529.1| hypothetical protein PTT_06655 [Pyrenophora teres f. teres 0-1] gi|311331290|gb|EFQ95393.1| hypothetical protein PTT_06655 [Pyrenophora teres f. teres 0-1] Length = 1084 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 63/179 (35%), Gaps = 31/179 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSI-REMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ V D S SM G L V +S+ + I S + ++ + VT+S + Sbjct: 295 IVFVCDRSGSMQTSIGLARQALQVFLKSLPIGVKFNICSFGNTHSFLWPKSVTYS---QE 351 Query: 230 TFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 T LA +N + T+ L+ + + I+ Sbjct: 352 TLDLAMN------HVNSMTANYGGTEMLQPLQATIEERYKDMALD-------------IM 392 Query: 289 FLTDGENSSPNIDNKESLFYCNEA---KRRGAIVYAIGVQAEAADQFLKNCASPDRFYS 344 LTDGE + + Y N++ + V+ +GV + + ++ A +S Sbjct: 393 LLTDGE----IWNQHQLFSYLNQSVLESKDPIRVFTLGVGSSVSHALIEGIAKAGNGFS 447 >gi|242373708|ref|ZP_04819282.1| regulator of nitric oxide reductase [Staphylococcus epidermidis M23864:W1] gi|242348676|gb|EES40278.1| regulator of nitric oxide reductase [Staphylococcus epidermidis M23864:W1] Length = 629 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 38/300 (12%), Positives = 94/300 (31%), Gaps = 33/300 (11%) Query: 37 ETSHKFFVKAK-LHYILDH--SLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFR 93 + + K K LDH L + E N K + ++I Sbjct: 313 DMTDMMTKKGKGSQSTLDHDEGGLIGQNQAFALEGINKNVKIEWKVPDIQPQDIIN---- 368 Query: 94 NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL 153 + +N +I ++ + +I + + + NL+ R + I + Sbjct: 369 YQSSKNDVQYEIKDLIQIIKKTIDREHEDERRNLT-KGRLQKDLINWFIDDQFKLFYKKQ 427 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 ++ + + +++D S SM+ DK+ + + + +K++ + Sbjct: 428 DLSKTFDAT--------FTLLVDASASMH-------DKMDETIKGVVLFHETLKTLNIKH 472 Query: 214 NVVRSGLVTFSSKIVQT--FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + + F+ + + I + + N+ A Sbjct: 473 EI-----LAFNEDAFEADDREQPNIIDEIINYNYSIFEKEGPRIMSLEPQDDNRDGVAIR 527 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGVQAEA 328 ++++I +DGE S+ N ++ A++ G V+ + + E Sbjct: 528 VASERLLQRSHQQRFLIVFSDGEPSAFNYSQDGIIDTYEAVETARKFGIEVFNVFLSQEP 587 >gi|221193645|gb|ACM07855.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 67/211 (31%), Gaps = 50/211 (23%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEK 243 R LVT+ S I L + I ++ Sbjct: 274 DNRVALVTYGSDIFDGRSVDVVKGFKEDDKYYGLQTKFTIQTENYSHKQLTNNAEEIIKR 333 Query: 244 I----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---------KGHDDYKKYIIFL 290 I + +GSTT + + E A + + + +K I+ + Sbjct: 334 IPTEAPKAKWGSTTNGLTPEQQKEYYLSKVGETFTMKAFMEADDILSQVNRNSQKIIVHV 393 Query: 291 TDGENSSPNIDNKESLFYC-----NEAKRRG 316 TDG + N L + K+ G Sbjct: 394 TDGVPTRSYAINNFKLGASYESQFEQMKKNG 424 >gi|254513910|ref|ZP_05125971.1| transporter [gamma proteobacterium NOR5-3] gi|219676153|gb|EED32518.1| transporter [gamma proteobacterium NOR5-3] Length = 326 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 56/169 (33%), Gaps = 19/169 (11%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 P A + P S + +++ +D+S SM++ +L A I ++ Sbjct: 74 PLLAVAVAGPSWERGD---SPFAQDSAALIIAVDLSASMDESDLQP-SRLQRAREKILKL 129 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + + L+ +S PL+ + + ++ L G + + Sbjct: 130 AEA-------RGDAYTALIAYSGTAHTVLPLSNDSKLLLHYLDALSVGMLPRRGKAPQ-- 180 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 N + A+ L G ++ ++DG + ++ + Sbjct: 181 -NVLPIAQSLLATRGTGGS-----LLIVSDGASDQSAAAFRDWASQSDT 223 >gi|90408686|ref|ZP_01216836.1| hypothetical protein PCNPT3_08480 [Psychromonas sp. CNPT3] gi|90310200|gb|EAS38335.1| hypothetical protein PCNPT3_08480 [Psychromonas sp. CNPT3] Length = 592 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 65/183 (35%), Gaps = 17/183 (9%) Query: 134 EMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLG 193 ++P + + D ++++LD SLSM + ++L Sbjct: 71 QLPLKLLILVFSLGILVCAGPSWQKQVSPFEEDKA-PLLIILDSSLSMLEKDVMP-NRLS 128 Query: 194 VATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTT 253 A + I ++L + ++GL+ +S PL + + + + Sbjct: 129 RAKQKITDLL-------MLRGGGKTGLIVYSGSAHLVMPLTQDLAVLGPYLMAI----GP 177 Query: 254 KSTPGL-EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 P + AY I K++L ++K ++ +TD + +D + Y + Sbjct: 178 DIMPIEGKSAYKTIPLIKQQLLTLSKSKLQISATVLLITDAITT---LDKQMFTEYFKHS 234 Query: 313 KRR 315 K + Sbjct: 235 KNQ 237 >gi|327289127|ref|XP_003229276.1| PREDICTED: von Willebrand factor A domain-containing protein 3A-like [Anolis carolinensis] Length = 1095 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 56/181 (30%), Gaps = 38/181 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM +L N ++ L++F+ + Sbjct: 886 VCLLIDASGSMESSLEEVTKELTSLIWE-----------QLRKNNMKFNLISFAEDVEVW 934 Query: 231 FP-LAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + ++ L T L A+ + E L Sbjct: 935 QECLEEATDEACHDAVQWVSMLHAHGNTSVLKALRRAF--LLQDVEAL------------ 980 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA--EAADQFLKNCA--SPDR 341 YI LTDG+ + + + K++ ++ I A+ FLK A + R Sbjct: 981 YI--LTDGKPDTSCNLVLKEIEMLR--KKQAITIHTISFNCTDRGANDFLKKLAFQTGGR 1036 Query: 342 F 342 + Sbjct: 1037 Y 1037 >gi|167623905|ref|YP_001674199.1| vault protein inter-alpha-trypsin subunit [Shewanella halifaxensis HAW-EB4] gi|167353927|gb|ABZ76540.1| Vault protein inter-alpha-trypsin domain protein [Shewanella halifaxensis HAW-EB4] Length = 761 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 50/290 (17%), Positives = 101/290 (34%), Gaps = 44/290 (15%) Query: 92 FRNELRENGFAQDINNIERST---SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS 148 + +L + I + SL + +++Q + + + P + + Sbjct: 180 IQGQLASAALNSPTHAITKHYDNESLKVSLNNQTAAMDRDLIVEFTQPKYYAGEGLWSKD 239 Query: 149 SHAPLLITS--SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 + +TS ++ + MV+D S SM G + + G+A + I ++L+ Sbjct: 240 DDKVVALTSFYPQIDLPQTTSSRCIKMVVDCSGSM---LGDSITQAGIALKQILKLLNE- 295 Query: 207 KSIPDVNNVVRSGL---VTFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYA 262 D N++ G FS + + +++ L T+ L A Sbjct: 296 ---DDWFNIILFGSHHKSLFSESVKANRA---NLDIAAKELANLNADLGGTEMLSALNAA 349 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 Y+ A E + I+ +TDGE E C +A+ + + Sbjct: 350 YDSAAPA-ELASN-----------ILLITDGE------IWGEEQLIC-KAQESNHRHFVV 390 Query: 323 GVQAEAADQFLKNCA-----SPDRFYSVQNSRK-LHDAFLRIGKEMVKQR 366 GV + ++ FLK A + + +N + F RI + + Q Sbjct: 391 GVGSAVSEAFLKQLADKTGGASEFVTPNENMSSRIVQHFCRIKQSKLTQS 440 >gi|125717894|ref|YP_001035027.1| peptidoglycan binding domain-containing protein [Streptococcus sanguinis SK36] gi|125497811|gb|ABN44477.1| Peptidoglycan-binding domain-containing protein, putative [Streptococcus sanguinis SK36] Length = 450 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 63/199 (31%), Gaps = 34/199 (17%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D++ V+D S SM G +D + + +++I R GL TFS Sbjct: 173 KAGSADIVFVVDRSGSM----GSTIDIVRANIN------EFVRNITKEGITARFGLATFS 222 Query: 225 SKIVQTFP----------------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ +++ + + S + A N+I Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPTPALNQIIS 282 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA 328 + KK+++ LTD E P + + K G V+A Sbjct: 283 -----TYDWSKSSKNKKFVVLLTDAEIKKPLQFPRVADTL-RALKAAGIERTVATVKAIE 336 Query: 329 ADQFLKNCASPDRFYSVQN 347 KN A+ R ++N Sbjct: 337 G--IYKNFATEGRVLDIEN 353 >gi|58429515|gb|AAW78161.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 554 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429457|gb|AAW78132.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429523|gb|AAW78165.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429475|gb|AAW78141.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429513|gb|AAW78160.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 557 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|13928960|ref|NP_113879.1| integrin alpha-D precursor [Rattus norvegicus] gi|48428189|sp|Q9QYE7|ITAD_RAT RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|6648592|gb|AAF21241.1|AF021334_1 alpha D integrin [Rattus norvegicus] Length = 1161 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 78/217 (35%), Gaps = 28/217 (12%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + +D+ ++D S S+N ++ ++ S + +++ Sbjct: 139 VPASMPECPRQEMDIAFLIDGSGSIN------QRDFAQMKDFVKALMGEFASTSTLFSLM 192 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + F Q + + I +L T + G+ ++F +K Sbjct: 193 QYSNILKTHFTFTEFKNILDPQSLVDPIVQLQ--GLTYTATGIRTVMEELFHSKNGSRKS 250 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAADQF 332 AK K ++ +TDG+ ++ + + ++A G I YAIG Q A + Sbjct: 251 AK------KILLVITDGQKYRDPLEYSDVIPAADKA---GIIRYAIGVGDAFQEPTALKE 301 Query: 333 LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 L S D + V N A I +++ ++ Sbjct: 302 LNTIGSAPPQDHVFKVGN----FAALRSIQRQLQEKI 334 >gi|39933805|ref|NP_946081.1| hypothetical protein RPA0728 [Rhodopseudomonas palustris CGA009] gi|39647652|emb|CAE26172.1| conserved unknown protein [Rhodopseudomonas palustris CGA009] Length = 468 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 64/182 (35%), Gaps = 21/182 (11%) Query: 1 MSFLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTA 60 MS + F + K +I+++ A+++ I ++G+ ++ + K +L D + + Sbjct: 1 MSDALLSRFVRDRKANIAVIAALVMIPIIFLLGMTLDFTQALRKKQQLDAAADAAAIAAV 60 Query: 61 TKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDD 120 + + + Q ++ N L + + +I I D Sbjct: 61 RPAMLMQ--TDAVAQNTAYA-------IFMSTANRLASGLTSVP--------TPTITITD 103 Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + Y + N+ + S+ + SS + ++ +++D S S Sbjct: 104 V--GLQRTVKVSYNAASLNNFPQLLMNNVSWAISGASTAQASSAPN--MNFYLLMDDSPS 159 Query: 181 MN 182 M Sbjct: 160 MG 161 >gi|156602970|ref|XP_001618750.1| hypothetical protein NEMVEDRAFT_v1g153509 [Nematostella vectensis] gi|156200182|gb|EDO26650.1| predicted protein [Nematostella vectensis] Length = 133 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 46/129 (35%), Gaps = 20/129 (15%) Query: 235 WGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + + E ++++ +T + L+Y ++I+ ++ +I LTDG Sbjct: 11 NNKKDVLEAVDKMPYPKGSTYTGRALQYMNDEIYRKATRVG--------VPNILIVLTDG 62 Query: 294 ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP---DRFYSVQ--NS 348 + + ++L + G +Y+IGV L A+ +SV N Sbjct: 63 KAHDSVAEPAKAL------RDIGIEIYSIGVGESYDKAELDAIATDPDASHVFSVDFKNM 116 Query: 349 RKLHDAFLR 357 + Sbjct: 117 NSIVSTLDA 125 >gi|148980400|ref|ZP_01816043.1| Large exoprotein [Vibrionales bacterium SWAT-3] gi|145961272|gb|EDK26584.1| Large exoprotein [Vibrionales bacterium SWAT-3] Length = 1361 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 51/318 (16%), Positives = 102/318 (32%), Gaps = 38/318 (11%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL--RENGFAQDIN 106 + +L E+ + ++ S ++ L NG Sbjct: 592 EDATTTQTVSGNVDVLYAESVTLQEPSESYTSGGQSITWVSSNGGQSLVGSANGVEIVTA 651 Query: 107 NIERSTSLSIIIDD--QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSK 164 I+ S + S+ + H + + + + T S L I V ++ Sbjct: 652 TIDDSGNYSVSLSGPIDHLNGSTPVNNMSIDIGVVATNDTGTQSGTISLTIDDDVPVAQA 711 Query: 165 S--------DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 G ++ ++LDVS SM+ + + V +S +L + + + V Sbjct: 712 KVHDLAPVVKDGANVQLILDVSGSMSGN------AMTVMKQSAIALLQGYQLLGETKVQV 765 Query: 217 RSGLV---------TFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYN 264 ++SS L+ V IN L G T + + Sbjct: 766 TVFESDANVVNENGSWSSDNSLPSNLSTIWMDVDTAIGVINSLTAGGGTDYDDAV---WL 822 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAI 322 D+ + G + FL+DG+ SS N I+ E + ++ + G A Sbjct: 823 AGSDSIWGNSDMVSGGSNIS---YFLSDGDPSSANHRINASEQAAWESQLTKYGVTSLAY 879 Query: 323 GVQAEAADQFLKNCASPD 340 G+ A +++ ++ A Sbjct: 880 GMGANISNEHMEPVAFDG 897 >gi|118594950|ref|ZP_01552297.1| von Willebrand factor, type A [Methylophilales bacterium HTCC2181] gi|118440728|gb|EAV47355.1| von Willebrand factor, type A [Methylophilales bacterium HTCC2181] Length = 318 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 61/227 (26%), Gaps = 37/227 (16%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 S ++ G ++ ++D S+SM F D + +K D Sbjct: 68 SKSTSITEIGKGAQLVFLIDRSVSMAKPFIGDDDNKSEIKSLAARRI--LKDFIDQRPSD 125 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLE 274 G+V FS+ + + + I+ S T G+ + Sbjct: 126 MIGIVGFSNSALYASKITKNRSYTYAAIDAATGSSINQTNIGSGITSGLFMFSE------ 179 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 + ++ L+DG + I + + +Y I ++ Sbjct: 180 ----IETTGSQALVLLSDG---AGKISKRVKERIAEMLNEKKINLYWIIIKEPNDVSLFS 232 Query: 335 ----------NCASPDRF----------YSVQNSRKLHDAFLRIGKE 361 D F Y +N L A I ++ Sbjct: 233 GNTYLEGREPTVIKLDNFFKSLKTEYKAYEAENPDALSSAIADIDQK 279 >gi|315173854|gb|EFU17871.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX1346] Length = 683 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 30/187 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ +D+++V+D S SM + L + E+ D + + VR G+V + Sbjct: 111 QTESPIDLVLVIDYSSSMKGE--KLNNALKGLQQFGEELSDSLT-----DGHVRIGIVAY 163 Query: 224 SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + T + + +++ + N S T GL + + Sbjct: 164 NRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQKGLLEGQRLLAEKSRPNA-------- 215 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV---------YAIGVQAEAADQFL 333 KK ++ + DG ++ + + + Y N G I+ Y Q E+ + Sbjct: 216 -KKMLVHIGDGSANASFLPRENAQIYPN----NGEIIDYNGYHTSSYMEEFQTESNQYYT 270 Query: 334 KNCASPD 340 N AS D Sbjct: 271 SNSASTD 277 >gi|254562692|ref|YP_003069787.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] gi|254269970|emb|CAX25948.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] Length = 473 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 8/59 (13%), Positives = 22/59 (37%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 GS++++ A+ + ++G I+ + + L +D +L + Sbjct: 20 FARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNALK 78 >gi|229171406|ref|ZP_04298991.1| Von Willebrand factor type A domain protein [Bacillus cereus MM3] gi|228612110|gb|EEK69347.1| Von Willebrand factor type A domain protein [Bacillus cereus MM3] Length = 610 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 406 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 453 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 454 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 513 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 514 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 573 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 574 QLMKNIY--NHHFLVANHAE 591 >gi|221193678|gb|ACM07871.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193701|gb|ACM07882.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 67/211 (31%), Gaps = 50/211 (23%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEK 243 R LVT+ S I L + I ++ Sbjct: 274 DNRVALVTYGSDIFDGRSVDVVKGFKEDDKYYGLQTKFTIQTENYSHKQLTNNAEEIIKR 333 Query: 244 I----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---------KGHDDYKKYIIFL 290 I + +GSTT + + E A + + + +K I+ + Sbjct: 334 IPTEAPKAKWGSTTNGLTPEQQKEYYLSKVGETFTMKAFMEADDILSQVNRNSQKIIVHV 393 Query: 291 TDGENSSPNIDNKESL-----FYCNEAKRRG 316 TDG + N L + K+ G Sbjct: 394 TDGVPTRSYAINNFKLGGSYESQFEQMKKNG 424 >gi|218531748|ref|YP_002422564.1| hypothetical protein Mchl_3818 [Methylobacterium chloromethanicum CM4] gi|218524051|gb|ACK84636.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 473 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 8/59 (13%), Positives = 22/59 (37%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 GS++++ A+ + ++G I+ + + L +D +L + Sbjct: 20 FARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNALK 78 >gi|66391570|ref|YP_239095.1| hypothetical protein RB43ORF119c [Enterobacteria phage RB43] gi|62288658|gb|AAX78641.1| hypothetical protein RB43ORF119c [Enterobacteria phage RB43] Length = 739 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 41/129 (31%), Gaps = 12/129 (9%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DVS SM + L S+ + D + + + G V K+ L Sbjct: 23 VCDVSGSMYNELPKIRKHLKANLASLVKQDDTVSILYFSSKGDY-GTVFRGEKVSNVSDL 81 Query: 234 AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 I L T L A D + + ++ +IFLTDG Sbjct: 82 TNICTAIDRY---LKPTGCTGFVEPLNLAAEIATDLQSENGNLNS--------LIFLTDG 130 Query: 294 ENSSPNIDN 302 ++ D+ Sbjct: 131 YDNCWRTDD 139 >gi|84386030|ref|ZP_00989060.1| hypothetical protein V12B01_13425 [Vibrio splendidus 12B01] gi|84379346|gb|EAP96199.1| hypothetical protein V12B01_13425 [Vibrio splendidus 12B01] Length = 310 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 51/153 (33%), Gaps = 11/153 (7%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 VKI S+ ++ +++D SM F ++ + L K I + R Sbjct: 83 VKIFSEERER-NVYVMVDQRTSM---FFGSTGRMKSVVAAEVAALIAWKVID---STDRV 135 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G + ++ H+ + +N + A + ++ KL H A+ Sbjct: 136 GAIIYNDSTALPISPQRSANHVLKILNEIASK---NQQLKAGKAQDTQSNSFAKLFHQAQ 192 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 + +I +TDG ++ C Sbjct: 193 RLVKHDGLVILITDGY-GYNERSEEQIKALCQH 224 >gi|163852924|ref|YP_001640967.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] gi|163664529|gb|ABY31896.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] Length = 473 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 8/59 (13%), Positives = 22/59 (37%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 GS++++ A+ + ++G I+ + + L +D +L + Sbjct: 20 FARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNALK 78 >gi|110806037|ref|YP_689557.1| hypothetical protein SFV_2127 [Shigella flexneri 5 str. 8401] gi|110615585|gb|ABF04252.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401] Length = 219 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 39/174 (22%), Positives = 66/174 (37%), Gaps = 18/174 (10%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 S + +++LDVS SM+ G +++L + R+ L + S+ V G+VT Sbjct: 14 SNPEPRCPCILLLDVSGSMS---GRPINELNTGLVTFRDEL-LADSLALKR--VELGIVT 67 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL--EHIAKGH 280 F + P ++F + A K D E+ E+ A G Sbjct: 68 F-GPVHVEQPFT----SAANFFPPILFAQG---DTPMGAAITKALDMVEERKGEYRANGI 119 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 Y+ +I +TDG + +F E KR ++IGVQ + Sbjct: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQGADMKTLAQ 171 >gi|15605859|ref|NP_213236.1| hypothetical protein aq_345 [Aquifex aeolicus VF5] gi|2983035|gb|AAC06645.1| putative protein [Aquifex aeolicus VF5] Length = 625 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 75/199 (37%), Gaps = 25/199 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++D+S SM ++ L + E+LD +K + F+ + Sbjct: 447 FELLMDISSSMKKE-EKILNALKSLI-LVSEVLDKLKMEFSIK--------VFNENVYTL 496 Query: 231 FPLAWGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + + +I L+ G +T + + + + + K +I Sbjct: 497 KDFSEDYKVAKARIMDLLNDLGGSTDLSKAITVGVESL--------EVVMKKEHKKGVLI 548 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNS 348 TDG+ + + +E ++ ++ K + + AIGV EA + SV++ Sbjct: 549 LFTDGQPTKG-LRGEELKYFISQMKMK-LPIVAIGVG-EATHMVKEYFDKTG--LSVEDI 603 Query: 349 RKLHDAFLRIGKEMVKQRI 367 KL AF + + K+ + Sbjct: 604 SKLPSAFSFVMENQFKRLL 622 >gi|77413359|ref|ZP_00789553.1| cell wall surface anchor family protein [Streptococcus agalactiae 515] gi|77160601|gb|EAO71718.1| cell wall surface anchor family protein [Streptococcus agalactiae 515] gi|221193497|gb|ACM07781.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193501|gb|ACM07783.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193513|gb|ACM07789.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193519|gb|ACM07792.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193529|gb|ACM07797.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193531|gb|ACM07798.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193561|gb|ACM07813.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193581|gb|ACM07823.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193589|gb|ACM07827.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193597|gb|ACM07831.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193603|gb|ACM07834.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193607|gb|ACM07836.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193609|gb|ACM07837.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193611|gb|ACM07838.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193613|gb|ACM07839.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193617|gb|ACM07841.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193621|gb|ACM07843.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193639|gb|ACM07852.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193643|gb|ACM07854.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193663|gb|ACM07864.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193665|gb|ACM07865.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193667|gb|ACM07866.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193680|gb|ACM07872.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193699|gb|ACM07881.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193711|gb|ACM07887.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193719|gb|ACM07891.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193725|gb|ACM07894.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193743|gb|ACM07903.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193745|gb|ACM07904.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193747|gb|ACM07905.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193751|gb|ACM07907.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193753|gb|ACM07908.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 67/211 (31%), Gaps = 50/211 (23%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEK 243 R LVT+ S I L + I ++ Sbjct: 274 DNRVALVTYGSDIFDGRSVDVVKGFKEDDKYYGLQTKFTIQTENYSHKQLTNNAEEIIKR 333 Query: 244 I----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---------KGHDDYKKYIIFL 290 I + +GSTT + + E A + + + +K I+ + Sbjct: 334 IPTEAPKAKWGSTTNGLTPEQQKEYYLSKVGETFTMKAFMEADDILSQVNRNSQKIIVHV 393 Query: 291 TDGENSSPNIDNKESLFYC-----NEAKRRG 316 TDG + N L + K+ G Sbjct: 394 TDGVPTRSYAINNFKLGASYESQFEQMKKNG 424 >gi|77411517|ref|ZP_00787861.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|77162443|gb|EAO73410.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|221193483|gb|ACM07774.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193485|gb|ACM07775.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193487|gb|ACM07776.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193491|gb|ACM07778.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193503|gb|ACM07784.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193509|gb|ACM07787.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193511|gb|ACM07788.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193515|gb|ACM07790.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193527|gb|ACM07796.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193535|gb|ACM07800.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193541|gb|ACM07803.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193545|gb|ACM07805.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193551|gb|ACM07808.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193565|gb|ACM07815.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193569|gb|ACM07817.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193573|gb|ACM07819.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193579|gb|ACM07822.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193585|gb|ACM07825.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193601|gb|ACM07833.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193619|gb|ACM07842.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193629|gb|ACM07847.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193631|gb|ACM07848.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193635|gb|ACM07850.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193637|gb|ACM07851.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193655|gb|ACM07860.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193669|gb|ACM07867.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193676|gb|ACM07870.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193683|gb|ACM07873.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193685|gb|ACM07874.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193693|gb|ACM07878.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193695|gb|ACM07879.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193703|gb|ACM07883.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193705|gb|ACM07884.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193709|gb|ACM07886.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193721|gb|ACM07892.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193727|gb|ACM07895.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193749|gb|ACM07906.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 67/211 (31%), Gaps = 50/211 (23%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEK 243 R LVT+ S I L + I ++ Sbjct: 274 DNRVALVTYGSDIFDGRSVDVVKGFKEDDKYYGLQTKFTIQTENYSHKQLTNNAEEIIKR 333 Query: 244 I----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---------KGHDDYKKYIIFL 290 I + +GSTT + + E A + + + +K I+ + Sbjct: 334 IPTEAPKAKWGSTTNGLTPEQQKEYYLSKVGETFTMKAFMEADDILSQVNRNSQKIIVHV 393 Query: 291 TDGENSSPNIDNKESLFYC-----NEAKRRG 316 TDG + N L + K+ G Sbjct: 394 TDGVPTRSYAINNFKLGASYESQFEQMKKNG 424 >gi|116670916|ref|YP_831849.1| von Willebrand factor, type A [Arthrobacter sp. FB24] gi|116611025|gb|ABK03749.1| von Willebrand factor, type A [Arthrobacter sp. FB24] Length = 336 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 53/137 (38%), Gaps = 10/137 (7%) Query: 163 SKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + L++ V+D + S++ + +G G +L + I + + R ++ Sbjct: 65 RAAASDLNVFFVVDTTSSISAEDYGTGSPRLDGVRKDIMAIAGELAGA-------RFSMI 117 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 TF S+ V PL + L T ST A + + E+L + H Sbjct: 118 TFDSQTVVRMPLTTDTSALDTLTGVLEPQITLYSTGSSVTAARTVLN--ERLGAARESHP 175 Query: 282 DYKKYIIFLTDGENSSP 298 + + + + DGE +S Sbjct: 176 ERARIVFYFGDGEQTSG 192 >gi|22537555|ref|NP_688406.1| cell wall surface anchor family protein [Streptococcus agalactiae 2603V/R] gi|76797665|ref|ZP_00779934.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] gi|22534437|gb|AAN00279.1|AE014256_11 cell wall surface anchor family protein [Streptococcus agalactiae 2603V/R] gi|76586959|gb|EAO63448.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] gi|221193505|gb|ACM07785.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193507|gb|ACM07786.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193517|gb|ACM07791.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193523|gb|ACM07794.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193537|gb|ACM07801.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193549|gb|ACM07807.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193553|gb|ACM07809.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193557|gb|ACM07811.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193559|gb|ACM07812.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193563|gb|ACM07814.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193571|gb|ACM07818.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193583|gb|ACM07824.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193587|gb|ACM07826.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193591|gb|ACM07828.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193593|gb|ACM07829.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193599|gb|ACM07832.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193605|gb|ACM07835.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193627|gb|ACM07846.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193651|gb|ACM07858.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193653|gb|ACM07859.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193674|gb|ACM07869.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193691|gb|ACM07877.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193707|gb|ACM07885.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193713|gb|ACM07888.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193715|gb|ACM07889.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193717|gb|ACM07890.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193723|gb|ACM07893.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193729|gb|ACM07896.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193731|gb|ACM07897.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193735|gb|ACM07899.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193739|gb|ACM07901.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 67/211 (31%), Gaps = 50/211 (23%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEK 243 R LVT+ S I L + I ++ Sbjct: 274 DNRVALVTYGSDIFDGRSVDVVKGFKEDDKYYGLQTKFTIQTENYSHKQLTNNAEEIIKR 333 Query: 244 I----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---------KGHDDYKKYIIFL 290 I + +GSTT + + E A + + + +K I+ + Sbjct: 334 IPTEAPKAKWGSTTNGLTPEQQKEYYLSKVGETFTMKAFMEADDILSQVNRNSQKIIVHV 393 Query: 291 TDGENSSPNIDNKESLFYC-----NEAKRRG 316 TDG + N L + K+ G Sbjct: 394 TDGVPTRSYAINNFKLGASYESQFEQMKKNG 424 >gi|229083862|ref|ZP_04216171.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-44] gi|228699440|gb|EEL52116.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-44] Length = 609 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 405 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 452 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 453 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 512 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 513 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 572 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 573 QLMKNIY--NHHFLVANHAE 590 >gi|224119512|ref|XP_002331179.1| predicted protein [Populus trichocarpa] gi|222873300|gb|EEF10431.1| predicted protein [Populus trichocarpa] Length = 742 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 67/220 (30%), Gaps = 35/220 (15%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 S +++ ++D+S SM + P S+++ L+ S + V Sbjct: 320 PGDNQSMKAFRKEVIFLIDISGSMKGN--PFESAKNGLLSSLQK-LNPEDSFNIIAFNVE 376 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKIN-RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + FSS + Q A + + +N L T LE A + + + + Sbjct: 377 T--YLFSSLMEQATKEA--ILKATQWLNDNLTADGGTNILAPLEQALKLLAETTDSIP-- 430 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-------GAIVYAIGVQAEAA 329 I +TDG ++ CN K + G+ Sbjct: 431 ---------LIFLITDG-------AVEDERDICNFVKGSLTSGGSISLRICTFGIGTYCN 474 Query: 330 DQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 FL+ A F + ++ + R+ + Sbjct: 475 HYFLRMLAQIGRGHFDTAYDADSVDFRMQRLFATASSIIL 514 >gi|171743344|ref|ZP_02919151.1| hypothetical protein BIFDEN_02475 [Bifidobacterium dentium ATCC 27678] gi|171278958|gb|EDT46619.1| hypothetical protein BIFDEN_02475 [Bifidobacterium dentium ATCC 27678] Length = 344 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 68/195 (34%), Gaps = 32/195 (16%) Query: 151 APLLITSSVKISSKSDIGL---DMMMVLDVSLSMNDH-----FGPGMDKLGVATRSIREM 202 ++I + +SS + + D+++ +DV+ SM +L VA ++++ Sbjct: 62 VAVMILTPSIVSSTHNRAINATDVVIAVDVTGSMAVKDAQYGSDELQTRLDVAKQAVK-- 119 Query: 203 LDIIKSIPDVN-NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 DI P+ + VR G PL + I + L T + G Sbjct: 120 -DITGLYPNSSFAAVRFGASG-----TLDVPLTPDSKAIDNWADTL-APEATSVSSG--- 169 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYII--FLTDGENSSPNI--DNKESLFYCNEAKRRGA 317 + + + L + I+ ++DGE +S Y N+A Sbjct: 170 STLDVPIDQLLLTCKSIHEQHPDDAIVMYLISDGEQTSSKTRRTFSSLRRYLNDA----- 224 Query: 318 IVYAIGVQAEAADQF 332 + + V +E Q Sbjct: 225 --FTVAVGSEQGGQI 237 >gi|261253067|ref|ZP_05945640.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] gi|260936458|gb|EEX92447.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] Length = 424 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 44/128 (34%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 N N KG ++ + + + L I+ SH K +L +D L AT Sbjct: 17 NRLRNQKGLTLVVMTMSMVAFITIAALSIDVSHFVVNKTRLQNAVDTIALAGATVANRTN 76 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + + ++I++ + ++G ++ +IE S S + Sbjct: 77 EKGDTDTAIIESYKKVIESPGNDEIELTATDDGNGLNLLSIEYSDSPNSGFSTTFPSSPD 136 Query: 128 SAVSRYEM 135 R E+ Sbjct: 137 MVYVRVEV 144 >gi|239995769|ref|ZP_04716293.1| TPR domain protein [Alteromonas macleodii ATCC 27126] Length = 692 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 55/176 (31%), Gaps = 26/176 (14%) Query: 125 YNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSV---KISSKSDIGLDMMMVLDVSLSM 181 Y + + EM + + + + + + ++V+D+SLSM Sbjct: 49 YQYMVIGKNEMGAKPPMWMLAFVWIISVIALAGPTWERLPQPVYQLKMGHVIVIDMSLSM 108 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 D+L A ++++ I GLV ++ PL +I Sbjct: 109 RATDMTP-DRLTRAKYKAIDLVNAIGEGE-------MGLVAYAGDAFVISPLTEDAGNIT 160 Query: 242 EKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 I L + G+E A + +A I ++TDG Sbjct: 161 TLIPSLSPEIMPVPGSDPLLGIESASELLTNAGYNSG-----------MIYWITDG 205 >gi|206967848|ref|ZP_03228804.1| conserved hypothetical protein [Bacillus cereus AH1134] gi|218231623|ref|YP_002365420.1| hypothetical protein BCB4264_A0660 [Bacillus cereus B4264] gi|228951116|ref|ZP_04113232.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar kurstaki str. T03a001] gi|229077936|ref|ZP_04210546.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock4-2] gi|229177150|ref|ZP_04304539.1| Von Willebrand factor type A domain protein [Bacillus cereus 172560W] gi|206736768|gb|EDZ53915.1| conserved hypothetical protein [Bacillus cereus AH1134] gi|218159580|gb|ACK59572.1| conserved hypothetical protein [Bacillus cereus B4264] gi|228606331|gb|EEK63763.1| Von Willebrand factor type A domain protein [Bacillus cereus 172560W] gi|228705394|gb|EEL57770.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock4-2] gi|228808526|gb|EEM55029.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar kurstaki str. T03a001] Length = 627 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|325171240|ref|YP_004251212.1| putative cobalamin biosynthesis protein CobT [Vibrio phage ICP2] gi|323512266|gb|ADX87723.1| putative cobalamin biosynthesis protein CobT [Vibrio phage ICP2] Length = 580 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 76/221 (34%), Gaps = 27/221 (12%) Query: 149 SHAPLLITSSVKISSKSDIGL----DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + T S I + + + + ++LD S SM+ L + + L+ Sbjct: 375 KAVAKVTTGSDVIFRQKEQKVVLDTAVTVLLDSSGSMSGRSKYLHGMLACCM--LNDALN 432 Query: 205 IIKSIPDVNNVVRS--GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 + +V ++ G +++ ++ T +G + + +S G++ A Sbjct: 433 KVGIPIEVLGFTQTYEGSNSYNQHLIHTP---FGRRDTARDL--------VESMDGVDLA 481 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN-KESLFYCNEA-KRRGAIVY 320 N A H +K +I L+DG + ++ + E K+ +Y Sbjct: 482 NNDDGAAIMWAHSRLIRHKAKRKILIVLSDGSPACLQANSYAFTKQVVEEIEKKSPVEIY 541 Query: 321 AIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 IG+ + + ++ +L A L + K Sbjct: 542 GIGIMDDNVKRIYSQSE------VIRTPEQLESALLNVVKS 576 >gi|283455686|ref|YP_003360250.1| von Willebrand factor A [Bifidobacterium dentium Bd1] gi|283102320|gb|ADB09426.1| von Willebrand factor, type A [Bifidobacterium dentium Bd1] Length = 339 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 68/195 (34%), Gaps = 32/195 (16%) Query: 151 APLLITSSVKISSKSDIGL---DMMMVLDVSLSMNDH-----FGPGMDKLGVATRSIREM 202 ++I + +SS + + D+++ +DV+ SM +L VA ++++ Sbjct: 57 VAVMILTPSIVSSTHNRAINATDVVIAVDVTGSMAVKDAQYGSDELQTRLDVAKQAVK-- 114 Query: 203 LDIIKSIPDVN-NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY 261 DI P+ + VR G PL + I + L T + G Sbjct: 115 -DITGLYPNSSFAAVRFGASG-----TLDVPLTPDSKAIDNWADTL-APEATSVSSG--- 164 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYII--FLTDGENSSPNI--DNKESLFYCNEAKRRGA 317 + + + L + I+ ++DGE +S Y N+A Sbjct: 165 STLDVPIDQLLLTCKSIHEQHPDDAIVMYLISDGEQTSSKTRRTFSSLRRYLNDA----- 219 Query: 318 IVYAIGVQAEAADQF 332 + + V +E Q Sbjct: 220 --FTVAVGSEQGGQI 232 >gi|239908012|ref|YP_002954753.1| hypothetical protein DMR_33760 [Desulfovibrio magneticus RS-1] gi|239797878|dbj|BAH76867.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 451 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 54/428 (12%), Positives = 118/428 (27%), Gaps = 104/428 (24%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 + +++L +D + L + ++ + + + G + D L Sbjct: 17 DLGRVSVEQSRLQNAVDSAALAGSLQLPDDPDVSTGAVTAAATQNLLAN---DADATGIL 73 Query: 97 RENGFAQDINNIERSTSLSIIIDDQ----HKDYNLSAVSRYE-MPFIFCT--------FP 143 E+G A + + + + + A + Y + + P Sbjct: 74 VESGGATRSVCVSAEAKVEMTLSQVIGIGDQTVTAEACAGYNDIELVMVLDATGSMKGTP 133 Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMV-------LD-----VSLSMNDHFGPGMDK 191 A L+ + SS + + +V +D + + D GPG Sbjct: 134 IANVKEAATNLVNLIMPSSSSTSTRSKIGLVPFQGKVRIDGNDPVTAEANPDGVGPGCRN 193 Query: 192 LGVATRSIREMLDIIKSIPDVN---NVVRSGLVTFSSKIVQTFP----LAWGVQHIQEKI 244 + + + K+ N SG+ T S K L+ I I Sbjct: 194 ADGTLNNGKLRTEYSKTTTKTNIFYGYTLSGVSTTSDKTCSGMSPIRALSSDKSAILSNI 253 Query: 245 NRLIFGS---TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD--------- 292 L G T + G+++ + + +E +K +I LTD Sbjct: 254 TALNAGQVTSGTIISEGIKWGRHVLTPTAPYVEGS--TDTKVRKIMIVLTDGDTEDGRCG 311 Query: 293 ---------------------GENSSPNIDNKESLFYCN-------EAKRRG-------- 316 G + +L + K G Sbjct: 312 GSYASASKTINTYWTNAYFGQGLKPDSSASPYSTLSTAALTLAQIPDCKDGGLLNTYVVN 371 Query: 317 ------------AIVYAIGVQAEAA--DQFLKNCASP-----DRFYSVQNSRKLHDAFLR 357 +++I + +K AS D ++ + + + F + Sbjct: 372 EATLAKTDANYPIEIFSIRFGDSDSTDKNLMKQIASSKPGTEDHYFDAPDEAGIKEMFKK 431 Query: 358 IGKEMVKQ 365 IG+++ ++ Sbjct: 432 IGQQLGQR 439 >gi|221193661|gb|ACM07863.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 67/211 (31%), Gaps = 50/211 (23%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEK 243 R LVT+ S I L + I ++ Sbjct: 274 DNRVALVTYGSDIFDGRSVDVVKGFKEDDKYYGLQTKFTIQTENYSHKQLTNNAEEIIKR 333 Query: 244 I----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---------KGHDDYKKYIIFL 290 I + +GSTT + + E A + + + +K I+ + Sbjct: 334 IPTEAPKAKWGSTTNGLTPEQQKEYYLSKVGETFTMKAFMEADDILSQVNRNSQKIIVHV 393 Query: 291 TDGENSSPNIDNKESLFYC-----NEAKRRG 316 TDG + N L + K+ G Sbjct: 394 TDGVPTRSYAINNFKLGASYESQFEQMKKNG 424 >gi|160897741|ref|YP_001563323.1| hypothetical protein Daci_2300 [Delftia acidovorans SPH-1] gi|160363325|gb|ABX34938.1| conserved hypothetical protein [Delftia acidovorans SPH-1] Length = 277 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 51/166 (30%), Gaps = 36/166 (21%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLG----VATRSIREMLDIIKSIPDVNNVVRSGLVT 222 + V+D S SM D G K R++ ++ V + G++ Sbjct: 11 SPTAFLFVVDQSGSMADKMSSGRSKAEFVADALNRTLVNLVTRCSKSEGVRDYFDVGVIG 70 Query: 223 FSSKIVQT---FPLAWGVQHIQEKINRLI-------------FGS--------------T 252 +S V PL+ + + +I R G + Sbjct: 71 YSGTTVGNGFTGPLSGKILNAISEIERSPLRVEDRKRKMDDGAGGIIETSIKFPVWFEPS 130 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 + A K A+E + + Y ++ +TDGE+S Sbjct: 131 ANGGTPMHAALTKA--AEELVAWCDAHPESYPPTVLHVTDGESSDG 174 >gi|58429545|gb|AAW78176.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T + L + D ++ + ++ LTDG +S Sbjct: 118 IKSLLSTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPNSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|293190491|ref|ZP_06608878.1| putative von Willebrand factor type A domain protein [Actinomyces odontolyticus F0309] gi|292820902|gb|EFF79858.1| putative von Willebrand factor type A domain protein [Actinomyces odontolyticus F0309] Length = 338 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 48/273 (17%), Positives = 81/273 (29%), Gaps = 53/273 (19%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 +++ + M F+ A S A + VK D+++ LD S Sbjct: 48 PKYQALVRRTRASLAMAFVCFLIAVIATSVSAGAPVDRYVK--HDKSASRDIVLCLDASG 105 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ- 238 SM P K+G A R I + R L + + + FPL + Sbjct: 106 SML----PYDSKIGGAFREIISHFE----------GERISLQLWDAYSMTMFPLTDDYEM 151 Query: 239 --HIQEKINRLIFGSTTKSTPGLEYAYNKIFD-----AKEKLEHIAK------------- 278 + + ++ I T+ L A ++FD E E + Sbjct: 152 ATDVLQDMSDTIDTGLTRIGGRLS-ATQELFDYLAPVMDENQEVSSIVGDGLASCVMGFD 210 Query: 279 -GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + I+ TD N AK +G V A+ +D L + A Sbjct: 211 HNDKQRSRTILLATD--NEVYGDGVYNLSEAIEFAKSQGVTVTAL---YPGSDITLSSEA 265 Query: 338 ---------SPDRFYSVQNSRKLHDAFLRIGKE 361 + FY + + +I E Sbjct: 266 LQLRDEVRKTGGDFYDASSPSSVDRVVKQIEAE 298 >gi|296121072|ref|YP_003628850.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296013412|gb|ADG66651.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 330 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 64/185 (34%), Gaps = 31/185 (16%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV-ATRSIREMLDIIKSIPDVNNVVRSG 219 I ++S ++ V+D SLSMN P + GV ATR R L++++++ + Sbjct: 158 IFTRSSSARKVVYVVDCSLSMNAPHPPTYYRTGVPATRFQRVQLELVQAVEALPEETEFS 217 Query: 220 LVTFSSKIVQTFPLAW------GVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +V FS + A + + + + + T L A Sbjct: 218 IVFFSDRAFAMPGEAMVVASQENKVKVLKWVASSMTMSGGTDPREALGVALKM------- 270 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + I L+DG + P I + ++ I + AA+ Sbjct: 271 ----------QPEMIYLLSDG-SFHPVIQQDLFKLQQDR-----IQIHTIALGEPAAEAV 314 Query: 333 LKNCA 337 LK + Sbjct: 315 LKQIS 319 >gi|167518794|ref|XP_001743737.1| hypothetical protein [Monosiga brevicollis MX1] gi|163777699|gb|EDQ91315.1| predicted protein [Monosiga brevicollis MX1] Length = 874 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 67/190 (35%), Gaps = 14/190 (7%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ VLD S S+ G +A + + + + + +R + F ++ + Sbjct: 270 VDVLFVLDASGSV------GQANFNLAQQFV---ISAVSQLDVGLAAIRVAGMMFHAEAL 320 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 F Q + F G N + + L + G+ + + Sbjct: 321 PQFDFDDYTSAAQVQNAVANFNYPVNENWGTATG-NALDSIRTNLLQASAGYRGGEVVVY 379 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQN 347 F+TDG + + GA V AIG+ + + L+ A S D +V + Sbjct: 380 FITDGVSQE---SPSVVESAAQALRATGAQVMAIGITDQIDETQLEVIAGSADNVITVAD 436 Query: 348 SRKLHDAFLR 357 L++A Sbjct: 437 FANLNEAVRD 446 >gi|104779436|ref|YP_605934.1| surface adhesion protein [Pseudomonas entomophila L48] gi|95108423|emb|CAK13117.1| Surface adhesion protein [Pseudomonas entomophila L48] Length = 5862 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 50/137 (36%), Gaps = 14/137 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + ++D S SM G+D + S+ + L D + V LV F++++ + Sbjct: 5338 LAFIVDTSGSMGSS---GVDAAKKSLESVFKTL-AASVKGDQSGTVNILLVDFATQVKSS 5393 Query: 231 FPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + G+Q + +N L T + N +++ G + Sbjct: 5394 VAVTLNDAGLQTLLNALNNLRADGGTNYEDAFKTTANWF-------QNLKDGGNTGSNQT 5446 Query: 288 IFLTDGENSSPNIDNKE 304 F+TDG+ + + Sbjct: 5447 FFITDGKPTYYQANENS 5463 >gi|30018809|ref|NP_830440.1| von Willebrand factor type A domain-containing protein [Bacillus cereus ATCC 14579] gi|229126055|ref|ZP_04255077.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-Cer4] gi|229143348|ref|ZP_04271779.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-ST24] gi|296501384|ref|YP_003663084.1| von Willebrand factor type A domain-containing protein [Bacillus thuringiensis BMB171] gi|29894351|gb|AAP07641.1| von Willebrand factor type A domain protein [Bacillus cereus ATCC 14579] gi|228640155|gb|EEK96554.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-ST24] gi|228657377|gb|EEL13193.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-Cer4] gi|296322436|gb|ADH05364.1| von Willebrand factor type A domain-containing protein [Bacillus thuringiensis BMB171] Length = 627 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|317483712|ref|ZP_07942659.1| hemolysin-type calcium-binding protein [Bilophila wadsworthia 3_1_6] gi|316925028|gb|EFV46167.1| hemolysin-type calcium-binding protein [Bilophila wadsworthia 3_1_6] Length = 1111 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 79/233 (33%), Gaps = 27/233 (11%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDI 205 + + + + ++ VLD S SM + + + + + + Sbjct: 527 TGAGSIGDDLLQGATTTENVAMSYNISFVLDKSGSMGSSYSTAKEAVANYIEKLWDDIQN 586 Query: 206 IKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLI-----FGSTTKSTPGLE 260 +I ++ V S V + T + + +Q ++ + T L Sbjct: 587 TDAIINIQVVKFSSSVGWGDNNTFTLDKSTTYKELQAFLSAHVTNNDKASGNTNYEDALL 646 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE------SLFYCNEAKR 314 A F+++E+ + + F++DGE + P E + A Sbjct: 647 KA-ESWFNSQEENGFANR--------LYFISDGEPNRPYGKPVERAEAVYDRIVGDSAHP 697 Query: 315 RGAIVYAIGVQAEAADQF--LKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 V+AIG+ A+ L + D ++N+ +L+DA I V + Sbjct: 698 --VDVHAIGILGNGANDLDVLNKFDNTDGADQIRNAGELYDA---IASSTVTK 745 >gi|297153945|gb|ADI03657.1| von Willebrand factor type A [Streptomyces bingchenggensis BCW-1] Length = 256 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 53/157 (33%), Gaps = 21/157 (13%) Query: 171 MMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++++LD S SM P +D+L ++ D ++ + N V L+TF S + Sbjct: 18 VVLLLDTSASMGRPEEHPRIDELN---GALTRWFDGVRGQERLRNRVEVCLITFDSAVRV 74 Query: 230 TFPLAWGVQHIQEKINR--------------LIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 P G E+ + L T+ T +E A + L+ Sbjct: 75 HDP-GPGRLVPVEEADADRVFVPVDSMRPPTLRAEGLTRLTEAVEAALELVRTRYRTLQR 133 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 ++ LTDG S ++ A Sbjct: 134 QRVPVRRP--FLWVLTDGAPSDAQGRPLDATALAGTA 168 >gi|294787481|ref|ZP_06752734.1| putative von Willebrand factor type A domain protein [Parascardovia denticolens F0305] gi|315226945|ref|ZP_07868733.1| conserved hypothetical protein [Parascardovia denticolens DSM 10105] gi|294484837|gb|EFG32472.1| putative von Willebrand factor type A domain protein [Parascardovia denticolens F0305] gi|315121077|gb|EFT84209.1| conserved hypothetical protein [Parascardovia denticolens DSM 10105] Length = 369 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 60/158 (37%), Gaps = 10/158 (6%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN--DHFGPGMDKLGVATRSIRE 201 C + + + ++++++ D+ + +D + SM D D + + + Sbjct: 76 LCLLLAATVMTPSLRSQVTTRAVNATDVFIAVDTTGSMAVRDANYASPDTISRLEAASKA 135 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLE 260 + DI++ PD + + F S PL I + + L + GL+ Sbjct: 136 VKDIVRLYPDAS----FSAIHFDSTSNADLPLTPDSHAITQWADTLRTEPTAISQGSGLD 191 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 N + + + A HD Y +++DGE ++ Sbjct: 192 TPLNTLITSMKSTL-AAHPHDTIILY--YISDGETTNG 226 >gi|221193567|gb|ACM07816.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 67/211 (31%), Gaps = 50/211 (23%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI-------------------------------VQTFPLAWGVQHIQEK 243 R LVT+ S I L + I ++ Sbjct: 274 DNRVALVTYGSDIFDGRSVDVVKGFKEDDKYYGLQTKFTIQTENYSHKQLTNNAEEIIKR 333 Query: 244 I----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA---------KGHDDYKKYIIFL 290 I + +GSTT + + E A + + + +K I+ + Sbjct: 334 IPTEAPKAKWGSTTNGLTPEQQKEYYLSKVGETFTMKAFMEADDILSQVNRNSQKIIVHV 393 Query: 291 TDGENSSPNIDNKESLFYC-----NEAKRRG 316 TDG + N L + K+ G Sbjct: 394 TDGVPTRSYAINNFKLGASYESQFEQMKKNG 424 >gi|328947244|ref|YP_004364581.1| von Willebrand factor A [Treponema succinifaciens DSM 2489] gi|328447568|gb|AEB13284.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 241 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 55/165 (33%), Gaps = 18/165 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIV 228 + V+D S SM K+G +IRE+L +K + ++ +TFSS K + Sbjct: 17 LFFVVDTSGSMQG------TKIGAVNTAIREVLPELKDAGGSDVDLKVACLTFSSGCKWM 70 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + P+A + N + T K+ I Sbjct: 71 YSSPIASD----SFQWNNVDADGVTDLGSACRELSEKLSKNGFLKA----PSGSVAPAIF 122 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 ++DGE + L N K V A+ + +A L Sbjct: 123 LMSDGEPTDDFESGLNLLQQNNWFKHA-IKV-AVAIGDDANKDVL 165 >gi|228957042|ref|ZP_04118817.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar pakistani str. T13001] gi|228802673|gb|EEM49515.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar pakistani str. T13001] Length = 627 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|229010050|ref|ZP_04167264.1| Von Willebrand factor type A domain protein [Bacillus mycoides DSM 2048] gi|228751183|gb|EEM00995.1| Von Willebrand factor type A domain protein [Bacillus mycoides DSM 2048] Length = 610 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 406 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 453 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 454 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 513 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 514 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 573 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 574 QLMKNIY--NHHFLVANHAE 591 >gi|51244488|ref|YP_064372.1| hypothetical protein DP0636 [Desulfotalea psychrophila LSv54] gi|50875525|emb|CAG35365.1| conserved hypothetical protein [Desulfotalea psychrophila LSv54] Length = 334 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 61/164 (37%), Gaps = 19/164 (11%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L +M+++D+S S FG G + R + L + + + N + GL+ F+ +I Sbjct: 112 LTIMLMIDISKS--GDFGSG----ESSKREMIAELASVLAFSAIKNNDKVGLILFTDEIE 165 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 P G HI I ++F L+ N I KK ++ Sbjct: 166 LYIPAKKGRSHILRVIQEILFFKAKGVRTNLQTPLNFI------------NSVCKKKCVV 213 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 FL + S + D+ +L + + + AI + Q Sbjct: 214 FLL-SDFSLTSTDDISTLQPSLKIANKYHDLIAILISDPNERQL 256 >gi|218559039|ref|YP_002391952.1| hypothetical protein ECS88_2264 [Escherichia coli S88] gi|237704580|ref|ZP_04535061.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA] gi|218365808|emb|CAR03548.1| conserved hypothetical protein [Escherichia coli S88] gi|226900946|gb|EEH87205.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA] gi|294491981|gb|ADE90737.1| conserved hypothetical protein [Escherichia coli IHE3034] gi|307626337|gb|ADN70641.1| hypothetical protein UM146_06190 [Escherichia coli UM146] gi|315285732|gb|EFU45172.1| von Willebrand factor type A domain protein [Escherichia coli MS 110-3] gi|323951916|gb|EGB47790.1| VWA domain containing CoxE protein [Escherichia coli H252] gi|323956154|gb|EGB51906.1| VWA domain containing CoxE protein [Escherichia coli H263] gi|324006479|gb|EGB75698.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] Length = 378 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSNATP--CYDHDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|209809313|ref|YP_002264851.1| hypothetical protein VSAL_II0523 [Aliivibrio salmonicida LFI1238] gi|208010875|emb|CAQ81277.1| putative membrane protein [Aliivibrio salmonicida LFI1238] Length = 619 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 47/131 (35%), Gaps = 25/131 (19%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 +V+D+S S+ ++L A ++L K +GLV +++ P Sbjct: 90 LVMDMSRSL-YATDVKPNRLTQAKYKANDLLPYWKEGM-------TGLVAYANNSYLISP 141 Query: 233 LAWGVQHIQEKINRLIF------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + ++ I L G + + + + + + + Sbjct: 142 LTEDSKTLENLIQNLSPEIMPYKGKGSNLSSAISQSIEMMKKSGHQQGD----------- 190 Query: 287 IIFLTDGENSS 297 II +TDG +++ Sbjct: 191 IIVITDGISNA 201 >gi|117624324|ref|YP_853237.1| hypothetical protein APECO1_4428 [Escherichia coli APEC O1] gi|115513448|gb|ABJ01523.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 448 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 286 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 329 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 330 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 374 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 375 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSNATP--CYDHDTAQ 421 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 422 ALVNVGAQIAA 432 >gi|91211406|ref|YP_541392.1| hypothetical protein UTI89_C2393 [Escherichia coli UTI89] gi|91072980|gb|ABE07861.1| conserved hypothetical protein [Escherichia coli UTI89] Length = 399 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 237 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 280 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 281 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 325 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 326 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSNATP--CYDHDTAQ 372 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 373 ALVNVGAQIAA 383 >gi|34481898|emb|CAE46497.1| trap [Plasmodium falciparum] Length = 331 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 63/183 (34%), Gaps = 31/183 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ +++D S S+ H ++ + +I+ + +N + + FS+ Sbjct: 47 VDLYLLMDCSGSIRRH--------NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAK 98 Query: 229 QTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L I+ ++ + T T L + D Sbjct: 99 EIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLND--------RINR 150 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCA 337 ++ K+ ++ LTDG S KES + RG + G+ ++FL C Sbjct: 151 ENAKQLVVILTDGIPDSIQDSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCH 206 Query: 338 SPD 340 D Sbjct: 207 PSD 209 >gi|300727566|ref|ZP_07060956.1| von Willebrand factor, type A [Prevotella bryantii B14] gi|299775155|gb|EFI71757.1| von Willebrand factor, type A [Prevotella bryantii B14] Length = 245 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 64/204 (31%), Gaps = 27/204 (13%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS---- 224 + + V+D S SM G + + A ++ MLD I S + + ++ + FS Sbjct: 9 MTLFFVIDTSGSM---IGNKIGAVNDAVENVLPMLDEI-SASNPDAEIKVAALEFSSGCN 64 Query: 225 ---SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + W + T L A ++ + + Sbjct: 65 WLYDEPKLASEFVW---------QDVTASGLT----SLGAACLELNSKLSRSGFMQTPSG 111 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL-KNCASPD 340 + II L+DG + L N K + AI + +A L + + + Sbjct: 112 SFAPAIILLSDGGPTDDFHGGLSKLKANNWFKNA-IKI-AIAIGDDADKDVLTQFTGTNE 169 Query: 341 RFYSVQNSRKLHDAFLRIGKEMVK 364 ++V N L + + Sbjct: 170 AVFTVHNIDALKQIIRVVAVTSSQ 193 >gi|258647265|ref|ZP_05734734.1| von Willebrand factor, type A [Prevotella tannerae ATCC 51259] gi|260852914|gb|EEX72783.1| von Willebrand factor, type A [Prevotella tannerae ATCC 51259] Length = 289 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 10/109 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ R + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLSF------GSQRCTQRDLIAEIAATLAFSAIQNNDKIGVLFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGST----TKSTPGLEYAYNKIFD 268 S ++ + P G +HI I L+ + T GLEY I Sbjct: 126 SDRVEKFIPPQKGRRHILYIIRELLTFTPQSRRTDLGVGLEYLVRAISK 174 >gi|50085105|ref|YP_046615.1| putative tellurium resistance protein (TerY-like) [Acinetobacter sp. ADP1] gi|49531081|emb|CAG68793.1| putative tellurium resistance protein (TerY-like) [Acinetobacter sp. ADP1] Length = 211 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/130 (21%), Positives = 48/130 (36%), Gaps = 9/130 (6%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM G + + V ++ ML ++ P V ++TF + Sbjct: 3 RLPVYILLDTSGSMR---GEPIHSVNV---GLQSMLSALRQDPYALESVHLSIITFDLEA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL Q I+ + T LE ++ +K KG ++ Sbjct: 57 KVYLPLTPLDQVQLANID-VPSAGATFMGAALELLAEQVAQHLQKSTDEVKGDWRPLLFV 115 Query: 288 IFLTDGENSS 297 +TDG S Sbjct: 116 --MTDGSPSD 123 >gi|25011520|ref|NP_735915.1| hypothetical protein gbs1478 [Streptococcus agalactiae NEM316] gi|24413059|emb|CAD47137.1| Unknown [Streptococcus agalactiae NEM316] gi|221193659|gb|ACM07862.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193672|gb|ACM07868.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193737|gb|ACM07900.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193741|gb|ACM07902.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 1/73 (1%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI 227 R LVT+ S I Sbjct: 274 DNRVALVTYGSDI 286 >gi|326925129|ref|XP_003208773.1| PREDICTED: hypothetical protein LOC100547685, partial [Meleagris gallopavo] Length = 721 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 38/89 (42%), Gaps = 16/89 (17%) Query: 287 IIFLTDGENSSPNIDNKESLFYC-NEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFY 343 I+ LTDGE+S S+ C + GAI++ I + AA + N + Y Sbjct: 14 IVLLTDGEDS--------SMSVCRERVRESGAIIHTIALGPAAAKELEEFSNITGGLQLY 65 Query: 344 --SVQNSRKLHDAFLRIGK---EMVKQRI 367 V KL +AF I ++ +Q I Sbjct: 66 AVDVDVPSKLVEAFSEITTGSGDISEQSI 94 >gi|149919655|ref|ZP_01908134.1| hypothetical protein PPSIR1_07278 [Plesiocystis pacifica SIR-1] gi|149819598|gb|EDM79026.1| hypothetical protein PPSIR1_07278 [Plesiocystis pacifica SIR-1] Length = 435 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 61/213 (28%), Gaps = 46/213 (21%) Query: 170 DMMMVLDVSLSMN----DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D++ +LD S SM D P + ++ ++D D + + G TFS+ Sbjct: 161 DVLFLLDRSGSMLEVGFDVQDPDKTRWQALYEAVEGVVD-----EDADQQIAFGAKTFST 215 Query: 226 K----------IVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + L+ Q + L T + L N + + Sbjct: 216 QGFGACGVSPTPDVPIALSNAAQLLSTIPGPLAQVNGGTPTNLALTNTMNYMESYQADG- 274 Query: 275 HIAKGHDDYKKYIIFLTDG----ENSSPNIDNKESLFYCNEAKRRGAIVYAI----GVQA 326 K++I +TDG EN + G Y + + Sbjct: 275 ---------DKFVILITDGRIGCENDDDAATAAAVSTITAAREDHGITTYVVCIAPSIFG 325 Query: 327 EAADQFLKNCASPD--------RFYSVQNSRKL 351 DQ + + +Y ++ L Sbjct: 326 PIIDQLNQMAVAGGAPSGIFGQEYYLADDADML 358 >gi|114563847|ref|YP_751361.1| TPR repeat-containing protein [Shewanella frigidimarina NCIMB 400] gi|114335140|gb|ABI72522.1| TPR repeat-containing protein [Shewanella frigidimarina NCIMB 400] Length = 701 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 41/126 (32%), Gaps = 23/126 (18%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++++D+S SM ++L A ++L+ +K +GL+ ++ Sbjct: 91 VLIMDMSQSMYATDLSP-NRLSYAKFRATDLLNELKEGE-------TGLIAYAGDAYTIS 142 Query: 232 PLAWGVQHIQEKINRL----IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL I + L + + L A + I Sbjct: 143 PLTRDSDTILNLLPTLSPDIMPTKGSNLAAALSLAEKLLAQGGHVSGD-----------I 191 Query: 288 IFLTDG 293 I +TDG Sbjct: 192 IVMTDG 197 >gi|330688331|gb|AEC32936.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 482 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 50/151 (33%), Gaps = 24/151 (15%) Query: 192 LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 L S+ D I ++ ++R G I + L+ + E Sbjct: 2 LNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSIDKRQALS----KVTELRKS 54 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 TT T L+ + + D + + +I +TDG +S +L Sbjct: 55 YSPYGTTNMTAALDEVHKHLND--------RVNREKAIQLVILMTDGIPNSKYT----AL 102 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 N+ K+R + IG+ QF + A Sbjct: 103 EVANKLKQRNVSLAVIGIGQGINHQFNRLIA 133 >gi|308472951|ref|XP_003098702.1| hypothetical protein CRE_04179 [Caenorhabditis remanei] gi|308268302|gb|EFP12255.1| hypothetical protein CRE_04179 [Caenorhabditis remanei] Length = 363 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 47/146 (32%), Gaps = 21/146 (14%) Query: 215 VVRSGLVTFS---------SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 R G VT++ +K L+ GV + ++ + T+ GL A + Sbjct: 44 YTRVGFVTYNYVATVNADLNKFKSPSALSQGVYNSYN-LDNISPEKTSFLGTGLTTAGDI 102 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + + K II N + +D L N K G + I + Sbjct: 103 LTVQGSADGRVNNP-----KVIIVYASVLNGTGFVDP---LLVANTLKSAGITIITIPLD 154 Query: 326 AEAAD---QFLKNCASPDRFYSVQNS 348 + + L + ASP + N Sbjct: 155 TDHNGVIQKQLVSIASPGFAFDYLNP 180 >gi|294628673|ref|ZP_06707233.1| cobaltochelatase subunit [Streptomyces sp. e14] gi|292832006|gb|EFF90355.1| cobaltochelatase subunit [Streptomyces sp. e14] Length = 675 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 47/141 (33%), Gaps = 20/141 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++ ++ +L + + G Sbjct: 480 QATREGREGNLVLFVVDASGSMA-----ARQRMSAVKGAVLSLL-----LDAYQRRDKVG 529 Query: 220 LVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK-EKLEHIA 277 LVTF + P V ++ L G T GL A+ + + A Sbjct: 530 LVTFRGRDAQVALPPTSSVDAAAVRLESLPTGGRTPLAAGLLRAHEVLRVERLRDPARRA 589 Query: 278 KGHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 590 --------LVVVVTDGRATGG 602 >gi|228937862|ref|ZP_04100490.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228970743|ref|ZP_04131383.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228977318|ref|ZP_04137713.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis Bt407] gi|229148962|ref|ZP_04277207.1| Von Willebrand factor type A domain protein [Bacillus cereus m1550] gi|228634502|gb|EEK91086.1| Von Willebrand factor type A domain protein [Bacillus cereus m1550] gi|228782295|gb|EEM30478.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis Bt407] gi|228788868|gb|EEM36807.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228821768|gb|EEM67768.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar berliner ATCC 10792] gi|326938340|gb|AEA14236.1| von Willebrand factor type A [Bacillus thuringiensis serovar chinensis CT-43] Length = 627 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|228983824|ref|ZP_04144020.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar tochigiensis BGSC 4Y1] gi|229154325|ref|ZP_04282445.1| Von Willebrand factor type A domain protein [Bacillus cereus ATCC 4342] gi|228629149|gb|EEK85856.1| Von Willebrand factor type A domain protein [Bacillus cereus ATCC 4342] gi|228775931|gb|EEM24301.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar tochigiensis BGSC 4Y1] Length = 610 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 406 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 453 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 454 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 513 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 514 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 573 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 574 QLMKNIY--NHHFLVANHAE 591 >gi|221193499|gb|ACM07782.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193641|gb|ACM07853.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193647|gb|ACM07856.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193649|gb|ACM07857.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 1/73 (1%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 ++ LD++ VLD S SMN+ GP + A ++ + +K I N+ Sbjct: 215 SGKTIVKPVDKQKPLDVVFVLDNSNSMNND-GPNFQRHNKAKKAAEALGTAVKDILGANS 273 Query: 215 VVRSGLVTFSSKI 227 R LVT+ S I Sbjct: 274 DNRVALVTYGSDI 286 >gi|58429463|gb|AAW78135.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDVYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|218673728|ref|ZP_03523397.1| hypothetical protein RetlG_20318 [Rhizobium etli GR56] Length = 176 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 52/159 (32%), Gaps = 16/159 (10%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F R + KG +I AIL +FI++ +IE S FFV LD S+ + Sbjct: 7 FAPFRRLLGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFVN----SALDASVHKISRM 62 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I E ++ I +G ++ + +S + D Sbjct: 63 IRTGEVASSKITLAG-----FKSKICDDMLLAFSCSSGLVVKVSVLSDLSSAT-SADPID 116 Query: 123 KDYNLSAVSRYEMP------FIFCTFPWCANSSHAPLLI 155 L+ Y++ + PW A + L Sbjct: 117 NSGKLTVTETYDIGKGSDYILVQAFLPWGATVNFFSLSS 155 >gi|218895679|ref|YP_002444090.1| hypothetical protein BCG9842_B4677 [Bacillus cereus G9842] gi|218545918|gb|ACK98312.1| conserved hypothetical protein [Bacillus cereus G9842] Length = 627 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|156404065|ref|XP_001640228.1| predicted protein [Nematostella vectensis] gi|156227361|gb|EDO48165.1| predicted protein [Nematostella vectensis] Length = 230 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 71/189 (37%), Gaps = 18/189 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + + +F +S + ++ K D +D+ +VLD S SM + D +A Sbjct: 4 YKYSSFVLLVAASVIYAQVKAAKTEHVKCDKKVDLAIVLDASASMGE------DSYKLAK 57 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTK 254 +E++ PD R L FS+ V L+ + + +R+++ K Sbjct: 58 TLTKEIISRFTISPDK---TRVSLNFFSANHVIMSKLSDNFSISKLFSLTDRMMY---EK 111 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY-IIFLTDGENSSPNIDNKESLFYCNEAK 313 S L + + + A+ K ++ +TDG +S +ES+ + Sbjct: 112 SFSILSTSLETVHFEVLAKKGGARPKQKGVKMAVVLVTDGFGTSG---YEESIAQAKSLQ 168 Query: 314 RRGAIVYAI 322 ++ + Sbjct: 169 NYHVEMFTV 177 >gi|92109483|ref|YP_571771.1| von Willebrand factor, type A [Nitrobacter hamburgensis X14] gi|91802565|gb|ABE64939.1| von Willebrand factor, type A [Nitrobacter hamburgensis X14] Length = 802 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 68/195 (34%), Gaps = 26/195 (13%) Query: 168 GLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL--V 221 L ++++LD S S ND MD AT + LD I ++ G V Sbjct: 610 DLAVLVLLDTSESSNDKVRGHDYTVMDLTRAATVLLAGALDRIGDPFAIHGFCSDGRHDV 669 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + P ++ ++ + +T+ L +A H Sbjct: 670 HYQRFKDFDQPYD---DSVKTRLAGMKGRLSTRMGAALRHA-----------AHYLAEQP 715 Query: 282 DYKKYIIFLTDGEN-----SSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 K+ + +TDGE + P ++ E R+G VYA+ + ADQ++ Sbjct: 716 KSKRVVFVVTDGEPADNDVTDPQYLRHDAKAAVEELGRQGITVYALSLD-PHADQYVSRI 774 Query: 337 ASPDRFYSVQNSRKL 351 F + + +L Sbjct: 775 FGMKNFTVIDQAERL 789 >gi|75759395|ref|ZP_00739490.1| von Willebrand factor type A domain protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228899309|ref|ZP_04063572.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis IBL 4222] gi|74493107|gb|EAO56228.1| von Willebrand factor type A domain protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228860340|gb|EEN04737.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis IBL 4222] Length = 627 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|291452386|ref|ZP_06591776.1| predicted protein [Streptomyces albus J1074] gi|291355335|gb|EFE82237.1| predicted protein [Streptomyces albus J1074] Length = 789 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 60/211 (28%), Gaps = 42/211 (19%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 + Y+ ++ Y + F + S G+D+ +VLD+S S+ Sbjct: 164 GQTYSSTSDFMYSTEYRESPFTASTG-----IWQQSRNNPPLNGACGIDVALVLDLSASV 218 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 L + + D + P R L +F + G ++ Sbjct: 219 G-------SALPQLKDAADQFTDALAGTPS-----RLSLFSFDRN-----SPSTGTENHP 261 Query: 242 EKINRLIFGSTTKSTPGLEYAYN--KIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSS 297 E + + Y + + + +Y + LTDG + Sbjct: 262 EPTSVSTQAG----ADAFKSLYADWDLGSGTNWDQGLYAVAKAPTRYDLTVVLTDGNPTR 317 Query: 298 PN------------IDNKESLFYCNEAKRRG 316 + D + +F N K +G Sbjct: 318 FSKPIEGDGSRTHFADTEGGIFAANAVKAQG 348 >gi|239980526|ref|ZP_04703050.1| putative surface-anchored fimbrial subunit [Streptomyces albus J1074] Length = 779 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 60/211 (28%), Gaps = 42/211 (19%) Query: 122 HKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM 181 + Y+ ++ Y + F + S G+D+ +VLD+S S+ Sbjct: 154 GQTYSSTSDFMYSTEYRESPFTASTG-----IWQQSRNNPPLNGACGIDVALVLDLSASV 208 Query: 182 NDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQ 241 L + + D + P R L +F + G ++ Sbjct: 209 G-------SALPQLKDAADQFTDALAGTPS-----RLSLFSFDRN-----SPSTGTENHP 251 Query: 242 EKINRLIFGSTTKSTPGLEYAYN--KIFDAKEKLEHIAKGHDDYKKY--IIFLTDGENSS 297 E + + Y + + + +Y + LTDG + Sbjct: 252 EPTSVSTQAG----ADAFKSLYADWDLGSGTNWDQGLYAVAKAPTRYDLTVVLTDGNPTR 307 Query: 298 PN------------IDNKESLFYCNEAKRRG 316 + D + +F N K +G Sbjct: 308 FSKPIEGDGSRTHFADTEGGIFAANAVKAQG 338 >gi|21220339|ref|NP_626118.1| chelatase [Streptomyces coelicolor A3(2)] gi|256788538|ref|ZP_05526969.1| chelatase [Streptomyces lividans TK24] gi|289772430|ref|ZP_06531808.1| cobaltochelatase subunit [Streptomyces lividans TK24] gi|6117893|emb|CAB59466.1| putative chelatase [Streptomyces coelicolor A3(2)] gi|289702629|gb|EFD70058.1| cobaltochelatase subunit [Streptomyces lividans TK24] Length = 672 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 49/140 (35%), Gaps = 18/140 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++ ++ +L + + G Sbjct: 477 QATREGREGNLVLFVVDASGSMA-----ARQRMSAVKGAVLSLL-----LDAYQRRDKVG 526 Query: 220 LVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 LVTF S P V ++ L G T GL A+ + ++E + Sbjct: 527 LVTFRGSAADVALPPTSSVDAAAVRLESLPTGGRTPLAAGLLRAHEVL-----RVERLRD 581 Query: 279 GHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 582 PARRP--LMVVVTDGRATGG 599 >gi|323187867|gb|EFZ73163.1| von Willebrand factor type A domain protein [Escherichia coli RN587/1] Length = 378 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|331673646|ref|ZP_08374409.1| conserved hypothetical protein [Escherichia coli TA280] gi|331068919|gb|EGI40311.1| conserved hypothetical protein [Escherichia coli TA280] Length = 378 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|290961218|ref|YP_003492400.1| hypothetical protein SCAB_68641 [Streptomyces scabiei 87.22] gi|260650744|emb|CBG73860.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 239 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 63/205 (30%), Gaps = 32/205 (15%) Query: 135 MPFIFCTFPWCANSSHAPL--LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDK 191 MP I T + L S+ + +V+D S SM ++ G + Sbjct: 1 MPAISLTKMMETAPALVDLYKAAGHSLDRHGLRGQRAAVYLVIDHSGSMRPYYKDGSVQA 60 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS--KIVQTFPLAWGVQHIQEKINRLIF 249 L + LD ++P +V FS+ V LA I + L Sbjct: 61 LADRVLGLSAQLDDDGTVP---------VVFFSTGVDAVTEIGLADHHGRIDRIVAGLGH 111 Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 T ++ + H + +++F TDG P + C Sbjct: 112 MGRTDYHLAMDAVID----------HYLDCGAEDPAFVVFQTDG---GPTSRLAAERYMC 158 Query: 310 NEAKRRGAIVY--AIGVQAEAADQF 332 A+ ++ IG + QF Sbjct: 159 KAAR---LPLFWQFIGFGDPRSKQF 180 >gi|238064373|ref|ZP_04609082.1| conserved hypothetical protein [Micromonospora sp. ATCC 39149] gi|237886184|gb|EEP75012.1| conserved hypothetical protein [Micromonospora sp. ATCC 39149] Length = 483 Score = 41.3 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 44/134 (32%), Gaps = 29/134 (21%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+D S SM P + R + +PD VR +V + + + Sbjct: 57 VLVVDCSGSMA---QPP----TKLGAARRATAAAVGMLPDG---VRFAVVEGTHEARMVY 106 Query: 232 PLAWG--------VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P G + + ++ RL T L A + + E + H Sbjct: 107 PRHRGLATASPATREEARRELGRLAAAGGTAIGTWLALARELLAEHPEAIRHT------- 159 Query: 284 KKYIIFLTDGENSS 297 + LTDG N Sbjct: 160 ----LLLTDGRNEH 169 >gi|329946212|ref|ZP_08293825.1| hypothetical protein HMPREF9056_01718 [Actinomyces sp. oral taxon 170 str. F0386] gi|328527810|gb|EGF54801.1| hypothetical protein HMPREF9056_01718 [Actinomyces sp. oral taxon 170 str. F0386] Length = 367 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 56/193 (29%), Gaps = 16/193 (8%) Query: 146 ANSSHAPLLITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREML 203 A L + S+ S +++ MV+D + SM G GV + + Sbjct: 57 AIVVVVVLAMAGPAIRGSEAISVSNVEIYMVVDRTGSMAAEDYQGQGPEGVDQPASTRLD 116 Query: 204 DIIKSIPDVNNVV---RSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGL 259 + + V R ++ S PL + I L S + L Sbjct: 117 GVRADMRAVREAFPDSRFSIIALDSAAATELPLTHDTNAVDSWIGSLKQEVSAHATGSSL 176 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 E A + + L + + + +DGE + D + A G Sbjct: 177 EVALPLLG---QTLAQSRQSEPKDIRLVYIFSDGEATD---DGRG----AQAADNAGISW 226 Query: 320 YAIGVQAEAADQF 332 ++ + Sbjct: 227 QSLAGLVDGGAVL 239 >gi|298674367|ref|YP_003726117.1| von Willebrand factor type A [Methanohalobium evestigatum Z-7303] gi|298287355|gb|ADI73321.1| von Willebrand factor type A [Methanohalobium evestigatum Z-7303] Length = 1141 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 66/183 (36%), Gaps = 23/183 (12%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNN-VVRSG 219 + K + + + ++DVS S G G L + ++ M +++I D SG Sbjct: 939 NVKKERDVATLFLVDVSASTQKKLGMGGKTILDIEKEALIVMSHALENIGDKYAIYAFSG 998 Query: 220 LVTFSSKIVQTFPLAWGVQH-IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + + G+ ++ +I+ L S T+ P + ++ K K Sbjct: 999 TTHRNVEYYVVKKFDEGLSTDMKSRISALEPISNTRLGPAIRHSIKK-----------HK 1047 Query: 279 GHDDYKKYIIFLTDGENSSPNIDN---------KESLFYCNEAKRRGAIVYAIGVQAEAA 329 + K II ++DGE + +++ E +G + + V EA Sbjct: 1048 DIEAKTKLIILISDGEPYDFGTNGPPYQGDFAERDTKMAIREGYEKGIHFFCVTVDKEAK 1107 Query: 330 DQF 332 D Sbjct: 1108 DYM 1110 >gi|229022165|ref|ZP_04178716.1| Von Willebrand factor type A domain protein [Bacillus cereus AH1272] gi|228739166|gb|EEL89611.1| Von Willebrand factor type A domain protein [Bacillus cereus AH1272] Length = 627 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|159478148|ref|XP_001697166.1| hypothetical protein CHLREDRAFT_39139 [Chlamydomonas reinhardtii] gi|158274640|gb|EDP00421.1| predicted protein [Chlamydomonas reinhardtii] Length = 562 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 52/159 (32%), Gaps = 24/159 (15%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQT 230 ++ LDVS SM GM L + ++ ++++ P V + FS + + Sbjct: 393 LLGLDVSGSMGCANCSGMTSLTARQAAAAVVMTLVRTEPWVKT------MAFSHQLVEFD 446 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + ++ + + R+ G T + YA K + L Sbjct: 447 VRESDRLEEVVRRAERIPMGG-TDCALPMIYATEK---------------QLPVDVFVVL 490 Query: 291 TDGENSSPNIDNKESLF-YCNEAKRRGAIVYAIGVQAEA 328 TD E + E+L Y K A + + Sbjct: 491 TDNETWFGGVHPTEALKRYRTAMKMPDAKLVVLAFSVND 529 >gi|332845505|ref|XP_003315058.1| PREDICTED: von Willebrand factor A domain-containing protein 3A [Pan troglodytes] Length = 1184 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + L ++ +V+ Sbjct: 960 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAES-----LQSWQDTLVE 1014 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 1015 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 1056 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 1057 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 1111 >gi|326912615|ref|XP_003202644.1| PREDICTED: von Willebrand factor-like [Meleagris gallopavo] Length = 2810 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 47/343 (13%), Positives = 102/343 (29%), Gaps = 46/343 (13%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRN-- 94 +T + ++ S++ I Q D+ + +N Sbjct: 1527 DTITLEYSFREIQSK--ESIIEKVRSIPYQGGKATNTGHALDYISKHTFTSANGGRQNVP 1584 Query: 95 ELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSA-----VSRYEMPFIFCTFPWCANSS 149 L + ++ +SI + + + +S+ P I + Sbjct: 1585 HLVYMVSSNPSTDVITRHPMSINVIPIGISPSANVQELRMISQPNRPII--LQSYSTLIE 1642 Query: 150 HAPLLITSS-------VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREM 202 AP L+ S +I + +D+M +LD S S G+ + ++ Sbjct: 1643 EAPELVLQSCCSHKAWTEIPELCNKPMDVMFLLDGSPS------IGVSEFEEMKNFVQAF 1696 Query: 203 LDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEY 261 ++ N V ++ ++ + L Q + I + + P L Sbjct: 1697 IESADI---SNTSVHVSVLQYARENNLEISL-NKPQETENLIKMVHSIKQREQGPTRLGK 1752 Query: 262 AYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 A + + + + K II + G+ + D E+ A+ ++ Sbjct: 1753 AIDFVVQNALSESYGGRPGA--SKVIIVIVSGK----SEDTMETAALI--ARMNKVSLFP 1804 Query: 322 IGVQAEAADQFLKNCASPD---------RFYSVQNSRKLHDAF 355 IG+ +Q L+ P F + L+ F Sbjct: 1805 IGIGNGYDEQQLRTLTGPSAVNRITKLQNFEDLSTMVTLNSEF 1847 >gi|144853481|gb|AAI09296.1| VWA3A protein [Homo sapiens] Length = 793 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + L ++ +V+ Sbjct: 569 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAES-----LQSWQDTLVE 623 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 624 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 665 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 666 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 720 >gi|119570984|gb|EAW50599.1| hCG2039627, isoform CRA_c [Homo sapiens] Length = 730 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + L ++ +V+ Sbjct: 506 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAES-----LQSWQDTLVE 560 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 561 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 602 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 603 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 657 >gi|119570985|gb|EAW50600.1| hCG2039627, isoform CRA_d [Homo sapiens] Length = 1208 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + L ++ +V+ Sbjct: 984 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAES-----LQSWQDTLVE 1038 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 1039 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 1080 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 1081 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 1135 >gi|119570982|gb|EAW50597.1| hCG2039627, isoform CRA_a [Homo sapiens] Length = 1184 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + L ++ +V+ Sbjct: 960 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAES-----LQSWQDTLVE 1014 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 1015 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 1056 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 1057 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 1111 >gi|34536064|dbj|BAC87526.1| unnamed protein product [Homo sapiens] Length = 808 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + L ++ +V+ Sbjct: 584 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAES-----LQSWQDTLVE 638 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 639 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 680 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 681 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 735 >gi|34223450|gb|AAQ62968.1| hypothetical protein LOC146177-like protein [Homo sapiens] Length = 286 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + L ++ +V+ Sbjct: 62 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAES-----LQSWQDTLVE 116 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 117 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 158 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 159 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 213 >gi|21752350|dbj|BAC04176.1| unnamed protein product [Homo sapiens] gi|119570983|gb|EAW50598.1| hCG2039627, isoform CRA_b [Homo sapiens] Length = 262 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + L ++ +V+ Sbjct: 38 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAES-----LQSWQDTLVE 92 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 93 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 134 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 135 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 189 >gi|23468244|gb|AAH38400.1| VWA3A protein [Homo sapiens] Length = 365 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + L ++ +V+ Sbjct: 141 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAES-----LQSWQDTLVE 195 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 196 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 237 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 238 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 292 >gi|314964167|gb|EFT08267.1| cobaltochelatase subunit [Propionibacterium acnes HL082PA1] gi|315078927|gb|EFT50945.1| cobaltochelatase subunit [Propionibacterium acnes HL053PA2] gi|327457301|gb|EGF03956.1| cobaltochelatase subunit [Propionibacterium acnes HL092PA1] Length = 654 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 19/151 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 451 RAGRAASCVIFVVDASGSMG-----SRGRMTASKGAVLSLL-----LDAYVKRDRVCLIG 500 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 501 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLIKACEVV-----RPLLLKDPGL 555 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +I +TDG + ++D + + +EA Sbjct: 556 RP--LLILVTDGRGN-VSLDGRPNSQATDEA 583 >gi|313814112|gb|EFS51826.1| cobaltochelatase subunit [Propionibacterium acnes HL025PA1] Length = 654 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 19/151 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 451 RAGRAASCVIFVVDASGSMG-----SRGRMTASKGAVLSLL-----LDAYVKRDRVCLIG 500 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 501 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLIKACEVV-----RPLLLKDPGL 555 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +I +TDG + ++D + + +EA Sbjct: 556 RP--LLILVTDGRGN-VSLDGRPNSQATDEA 583 >gi|313792794|gb|EFS40875.1| cobaltochelatase subunit [Propionibacterium acnes HL110PA1] Length = 654 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 19/151 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 451 RAGRAASCVIFVVDASGSMG-----SRGRMTASKGAVLSLL-----LDAYVKRDRVCLIG 500 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 501 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLIKACEVV-----RPLLLKDPGL 555 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +I +TDG + ++D + + +EA Sbjct: 556 RP--LLILVTDGRGN-VSLDGRPNSQATDEA 583 >gi|313771227|gb|EFS37193.1| cobaltochelatase subunit [Propionibacterium acnes HL074PA1] gi|313811880|gb|EFS49594.1| cobaltochelatase subunit [Propionibacterium acnes HL083PA1] gi|313832069|gb|EFS69783.1| cobaltochelatase subunit [Propionibacterium acnes HL007PA1] gi|313832876|gb|EFS70590.1| cobaltochelatase subunit [Propionibacterium acnes HL056PA1] gi|314975309|gb|EFT19404.1| cobaltochelatase subunit [Propionibacterium acnes HL053PA1] gi|314977724|gb|EFT21819.1| cobaltochelatase subunit [Propionibacterium acnes HL045PA1] gi|314985301|gb|EFT29393.1| cobaltochelatase subunit [Propionibacterium acnes HL005PA1] gi|315097050|gb|EFT69026.1| cobaltochelatase subunit [Propionibacterium acnes HL038PA1] gi|327332611|gb|EGE74346.1| cobaltochelatase subunit [Propionibacterium acnes HL096PA2] gi|327446609|gb|EGE93263.1| cobaltochelatase subunit [Propionibacterium acnes HL043PA2] gi|327448948|gb|EGE95602.1| cobaltochelatase subunit [Propionibacterium acnes HL043PA1] gi|328759695|gb|EGF73292.1| cobaltochelatase subunit [Propionibacterium acnes HL099PA1] Length = 654 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 19/151 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 451 RAGRAASCVIFVVDASGSMG-----SRGRMTASKGAVLSLL-----LDAYVKRDRVCLIG 500 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 501 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLIKACEVV-----RPLLLKDPGL 555 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +I +TDG + ++D + + +EA Sbjct: 556 RP--LLILVTDGRGN-VSLDGRPNSQATDEA 583 >gi|313764908|gb|EFS36272.1| cobaltochelatase subunit [Propionibacterium acnes HL013PA1] gi|313815530|gb|EFS53244.1| cobaltochelatase subunit [Propionibacterium acnes HL059PA1] gi|313828869|gb|EFS66583.1| cobaltochelatase subunit [Propionibacterium acnes HL063PA2] gi|314916327|gb|EFS80158.1| cobaltochelatase subunit [Propionibacterium acnes HL005PA4] gi|314917324|gb|EFS81155.1| cobaltochelatase subunit [Propionibacterium acnes HL050PA1] gi|314921929|gb|EFS85760.1| cobaltochelatase subunit [Propionibacterium acnes HL050PA3] gi|314930806|gb|EFS94637.1| cobaltochelatase subunit [Propionibacterium acnes HL067PA1] gi|314955268|gb|EFS99673.1| cobaltochelatase subunit [Propionibacterium acnes HL027PA1] gi|314959270|gb|EFT03372.1| cobaltochelatase subunit [Propionibacterium acnes HL002PA1] gi|314969278|gb|EFT13376.1| cobaltochelatase subunit [Propionibacterium acnes HL037PA1] gi|315099657|gb|EFT71633.1| cobaltochelatase subunit [Propionibacterium acnes HL059PA2] gi|315102111|gb|EFT74087.1| cobaltochelatase subunit [Propionibacterium acnes HL046PA1] gi|315110018|gb|EFT81994.1| cobaltochelatase subunit [Propionibacterium acnes HL030PA2] gi|327334665|gb|EGE76376.1| cobaltochelatase subunit [Propionibacterium acnes HL097PA1] gi|327454362|gb|EGF01017.1| cobaltochelatase subunit [Propionibacterium acnes HL087PA3] gi|327456428|gb|EGF03083.1| cobaltochelatase subunit [Propionibacterium acnes HL083PA2] gi|328756121|gb|EGF69737.1| cobaltochelatase subunit [Propionibacterium acnes HL087PA1] gi|328758500|gb|EGF72116.1| cobaltochelatase subunit [Propionibacterium acnes HL025PA2] Length = 654 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 19/151 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 451 RAGRAASCVIFVVDASGSMG-----SRGRMTASKGAVLSLL-----LDAYVKRDRVCLIG 500 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 501 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLIKACEVV-----RPLLLKDPGL 555 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +I +TDG + ++D + + +EA Sbjct: 556 RP--LLILVTDGRGN-VSLDGRPNSQATDEA 583 >gi|301060207|ref|ZP_07201074.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300445719|gb|EFK09617.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 768 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 65/184 (35%), Gaps = 35/184 (19%) Query: 164 KSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSI-----REMLDIIKSIPDVNNVVR 217 D++ V+D S SM D L V +++ ++ + + R Sbjct: 261 NKSGNEDLLFVVDCSGSMQGDSIYEARQALDVCLKALEEGRRFNIIRFGSRFESLFSEPR 320 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 +S K ++ L+W + T+ L++ Y D+ + Sbjct: 321 ----AYSEKTLERA-LSWSRNMQADL-------GGTEILQPLQHIYKVQGDSDSRYGS-- 366 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 I+ LTDG + N++ +FY + G V+ +G+ A + F+K A Sbjct: 367 ---------ILLLTDG-----AVGNEDDIFYLVRNRS-GPRVFPVGIGAGCNEAFIKGLA 411 Query: 338 SPDR 341 + Sbjct: 412 RAGK 415 >gi|291398101|ref|XP_002715428.1| PREDICTED: integrin, alpha 10 [Oryctolagus cuniculus] Length = 1169 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ ++ R + L I ++ GLV + V Sbjct: 166 MDVVIVLDGSNSI-----YPWSEVQTFLRRLVGRL----FIDPEQ--IQVGLVQYGESAV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAARNLSRREGRETKTAQAIMVACTEGFSQSRGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFLK---NCAS- 338 +TDGE+ +L C + Y I V + FLK AS Sbjct: 273 VVTDGESHDGEELPT-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLKEIRAIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|228944375|ref|ZP_04106748.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] gi|228815277|gb|EEM61525.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] Length = 609 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 405 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 452 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 453 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 512 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 513 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 572 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 573 QLMKNIY--NHHFLVANHAE 590 >gi|225874357|ref|YP_002755816.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] gi|225791485|gb|ACO31575.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] Length = 333 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 49/148 (33%), Gaps = 12/148 (8%) Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-- 247 L + ++ L+ + S + R +V F+ + + Q+ ++++ Sbjct: 57 SALHEKVHASKQFLNSLLSAHASGSAPRVFVVQFNRDVDLLEDPSASASKAQQALSQVGV 116 Query: 248 -------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 ST + + ++DA H +K I+ L+DG + Sbjct: 117 AQFHGDSNADSTANKGRHAKASGAALYDAIYLATHNVLNTTAGRKVIVVLSDGIDQGSKT 176 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEA 328 ++ A G VYAI + Sbjct: 177 TLNGAVEAAQRA---GVAVYAIYFKGGR 201 >gi|295129632|ref|YP_003580295.1| cobaltochelatase subunit [Propionibacterium acnes SK137] gi|291376336|gb|ADE00191.1| cobaltochelatase subunit [Propionibacterium acnes SK137] gi|313839735|gb|EFS77449.1| cobaltochelatase subunit [Propionibacterium acnes HL086PA1] Length = 654 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 19/151 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 451 RAGRAASCVIFVVDASGSMG-----SRGRMTASKGAVLSLL-----LDAYVKRDRVCLIG 500 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 501 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLIKACEVV-----RPLLLKDPGL 555 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +I +TDG + ++D + + +EA Sbjct: 556 RP--LLILVTDGRGN-VSLDGRPNSQATDEA 583 >gi|156400040|ref|XP_001638808.1| predicted protein [Nematostella vectensis] gi|156225932|gb|EDO46745.1| predicted protein [Nematostella vectensis] Length = 798 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 59/172 (34%), Gaps = 31/172 (18%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V+D S SM+ ++ A R+++ L KS+PD G + S K + + Sbjct: 292 VVDRSGSMSGS------RIKDAARTLQLFL---KSLPDGCYFNIVGFGS-SYKTLFSKSK 341 Query: 234 AWGVQHIQEKIN---RLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + + ++ N L T+ L + Y + + Sbjct: 342 TYNDETLKTATNHAAHLAADLGGTEILEPLRWVY----------SQSLIEGAPRQLF--L 389 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 LTDGE + + A+ V++ G+ A+ + +K A Sbjct: 390 LTDGEVGNTAQVISLVAENASTAR-----VFSFGIGDGASTELIKGVARAGH 436 >gi|34481888|emb|CAE46492.1| trap [Plasmodium falciparum] Length = 331 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 69/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + T + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNTVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + + + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNESAIHLYVNIFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENASQLVVMLTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLAGCHPSD 209 >gi|14248667|gb|AAK57619.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 56/169 (33%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D + ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTVNLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E +T T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGSTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + +GV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVVGVGQGINHQFNRLIA 148 >gi|50841592|ref|YP_054819.1| magnesium-chelatase 67 kDa subunit [Propionibacterium acnes KPA171202] gi|289424508|ref|ZP_06426291.1| cobaltochelatase subunit [Propionibacterium acnes SK187] gi|289427519|ref|ZP_06429232.1| cobaltochelatase subunit [Propionibacterium acnes J165] gi|50839194|gb|AAT81861.1| magnesium-chelatase 67 kDa subunit [Propionibacterium acnes KPA171202] gi|289155205|gb|EFD03887.1| cobaltochelatase subunit [Propionibacterium acnes SK187] gi|289159449|gb|EFD07640.1| cobaltochelatase subunit [Propionibacterium acnes J165] gi|313806971|gb|EFS45469.1| cobaltochelatase subunit [Propionibacterium acnes HL087PA2] gi|313817755|gb|EFS55469.1| cobaltochelatase subunit [Propionibacterium acnes HL046PA2] gi|313821418|gb|EFS59132.1| cobaltochelatase subunit [Propionibacterium acnes HL036PA1] gi|313824636|gb|EFS62350.1| cobaltochelatase subunit [Propionibacterium acnes HL036PA2] gi|313826305|gb|EFS64019.1| cobaltochelatase subunit [Propionibacterium acnes HL063PA1] gi|314926461|gb|EFS90292.1| cobaltochelatase subunit [Propionibacterium acnes HL036PA3] gi|314961443|gb|EFT05544.1| cobaltochelatase subunit [Propionibacterium acnes HL002PA2] gi|314980141|gb|EFT24235.1| cobaltochelatase subunit [Propionibacterium acnes HL072PA2] gi|314986996|gb|EFT31088.1| cobaltochelatase subunit [Propionibacterium acnes HL005PA2] gi|314990509|gb|EFT34600.1| cobaltochelatase subunit [Propionibacterium acnes HL005PA3] gi|315082979|gb|EFT54955.1| cobaltochelatase subunit [Propionibacterium acnes HL027PA2] gi|315086723|gb|EFT58699.1| cobaltochelatase subunit [Propionibacterium acnes HL002PA3] gi|315088127|gb|EFT60103.1| cobaltochelatase subunit [Propionibacterium acnes HL072PA1] gi|315107610|gb|EFT79586.1| cobaltochelatase subunit [Propionibacterium acnes HL030PA1] gi|327333787|gb|EGE75504.1| cobaltochelatase subunit [Propionibacterium acnes HL096PA3] gi|327444750|gb|EGE91404.1| cobaltochelatase subunit [Propionibacterium acnes HL013PA2] gi|328757863|gb|EGF71479.1| cobaltochelatase subunit [Propionibacterium acnes HL020PA1] gi|332674502|gb|AEE71318.1| magnesium-chelatase 67 kDa subunit [Propionibacterium acnes 266] Length = 654 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 19/151 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 451 RAGRAASCVIFVVDASGSMG-----SRGRMTASKGAVLSLL-----LDAYVKRDRVCLIG 500 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 501 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLIKACEVV-----RPLLLKDPGL 555 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +I +TDG + ++D + + +EA Sbjct: 556 RP--LLILVTDGRGN-VSLDGRPNSQATDEA 583 >gi|327538080|gb|EGF24770.1| conserved hypothetical protein, secreted [Rhodopirellula baltica WH47] Length = 359 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 36/97 (37%), Gaps = 2/97 (2%) Query: 19 ILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKND 78 IL ILL +F + GL+I+ +A + + D + L ++ + + Sbjct: 2 ILVVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGADQTT-TRIAVV 60 Query: 79 FSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLS 115 + W R EL + D E S +++ Sbjct: 61 DRAAEMSESW-GPHRIELEDGYDLNDDGKPESSQTIN 96 >gi|198419820|ref|XP_002120348.1| PREDICTED: similar to type A von Willebrand factor (VWFA) domain-containing protein [Ciona intestinalis] Length = 863 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 67/227 (29%), Gaps = 66/227 (29%) Query: 143 PWCANSSHAPLLITSSVKISSKSDIGLDMMM-VLDVSLSM-------NDHFGPGMDKLGV 194 P + SH + ++ D M++ +DVS SM + + + +L Sbjct: 326 PLIPSVSHGDVTYLVQPSVTRNQDNNDPMLIFCIDVSGSMCVSSQVKSGNGNVFVTRLQG 385 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ------------------------- 229 ++I + ++ I+ R GLVTFSS + Sbjct: 386 VQQAIVDQINKIEV---TRPNTRIGLVTFSSLVNGIGDGSPEQEILLSDFQLYDFDFIFS 442 Query: 230 -------TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 ++ + +N L T P L Y +A Sbjct: 443 TGKTAPIPGTISNTRVQLSRALNSLNTSGGTALGPALLY-------------SVALASQR 489 Query: 283 YKKYIIFLTDG--ENSSPNIDNKESLFYCNE--------AKRRGAIV 319 +I TDG +++ ++ C+ A G IV Sbjct: 490 PGSQVIVCTDGRANMGIGSLETEDDYQICHHFYDSTTEMALSNGVIV 536 >gi|156408866|ref|XP_001642077.1| predicted protein [Nematostella vectensis] gi|156229218|gb|EDO50014.1| predicted protein [Nematostella vectensis] Length = 251 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 51/134 (38%), Gaps = 12/134 (8%) Query: 237 VQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + +++++ F TT + L A ++F K ++ +TDG + Sbjct: 120 RKEAYRQLSKVPFIAGTTNTQEALNLAQRELFGKKNSGATPGAIGR-----VLIITDGLS 174 Query: 296 SSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCASPDRF-YSVQNSRKLH 352 N+ +LF + K G +Y + V + + +S + + Q+ R L Sbjct: 175 ---NVQRNLTLFNAYKLKMAGPEIYVVAVGQYLYGLHELVGLASSTENHLFRAQSMRGLE 231 Query: 353 DAFLRIGKEMVKQR 366 A I K + ++ Sbjct: 232 GAVRLIPKPSMYRK 245 >gi|160714|gb|AAA29772.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 565 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429459|gb|AAW78133.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429495|gb|AAW78151.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429503|gb|AAW78155.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429541|gb|AAW78174.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|324500920|gb|ADY40417.1| Integrator complex subunit 6 [Ascaris suum] Length = 892 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 28/130 (21%), Positives = 46/130 (35%), Gaps = 9/130 (6%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SM G L A RSI E L + R L++F V Sbjct: 4 LLFLVDTSASMAQKTYQGTTILDTA-RSIVEQLLKQRGRDAGARYDRYMLMSFEEFPV-N 61 Query: 231 FPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAY-----NKIFDAKEKLEHIAKGHDDY 283 W G +++ L T L A+ N++ + + + Sbjct: 62 VKAGWREGQAIFHQQLKALRPKGATTFGAALGSAFRFVNANRLNSCIDNYGYGRYPYCLE 121 Query: 284 KKYIIFLTDG 293 +II +TDG Sbjct: 122 PVFIISITDG 131 >gi|302384159|ref|YP_003819982.1| von Willebrand factor A [Brevundimonas subvibrioides ATCC 15264] gi|302194787|gb|ADL02359.1| von Willebrand factor type A [Brevundimonas subvibrioides ATCC 15264] Length = 591 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 55/172 (31%), Gaps = 34/172 (19%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR---SGLVTFSSK- 226 ++ V+D S S + +L ++ +L VR L+ F + Sbjct: 411 VIFVVDASGS------AALQRLAETKGAVELLL--------AEAYVRRTQVALIAFRGEG 456 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 P + ++++ L G T GLE A A + AKG Sbjct: 457 ADLLLPPTRSLARARKQLAELAGGGATPLAAGLEMA------AILAVAERAKGRTPL--- 507 Query: 287 IIFLTDGENSSP-------NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 ++F+TDG + +++L C + G I + Sbjct: 508 LVFMTDGRGNIALDGGAFRTRAEQDALNACRRIRAAGLRAALIDISPRPRGD 559 >gi|291613456|ref|YP_003523613.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291583568|gb|ADE11226.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 796 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 74/205 (36%), Gaps = 26/205 (12%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDH-FGPGMDKLGVATRSIREMLDIIKSIPD---VN 213 + S + + +M++LD+S S ND G L + ++ + D I I D ++ Sbjct: 594 IMMRSVRKVRDISVMVLLDLSESTNDKVSGQDYSVLDLQRQAAVLLADAIHKIGDPFAIH 653 Query: 214 NVVRSGL--VTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 G V +S P + + ++ + +T+ + +A Sbjct: 654 GFCSDGRHNVEYSRFKDFDQPYN---EVPKSRLAGMRGQLSTRMGAAIRHA--------- 701 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 H K KK ++ +TDGE + P ++ E R G + + + Sbjct: 702 --GHYLKLQKSAKKLLLVITDGEPADVDVRDPQYLRYDTKKAVEEMARSGVTTFCMSLD- 758 Query: 327 EAADQFLKNCASPDRFYSVQNSRKL 351 ADQ++ + V + +L Sbjct: 759 PRADQYVSRIFGARNYMVVDHVERL 783 >gi|294661523|ref|YP_003579976.1| hypothetical protein KP-KP15_gp109 [Klebsiella phage KP15] gi|292660684|gb|ADE34932.1| hypothetical protein [Klebsiella phage KP15] Length = 738 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 62/197 (31%), Gaps = 18/197 (9%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 V DVS SM + L ++ + D + + + + G V K+ L Sbjct: 22 VCDVSGSMYNELPRIRQHLKNNLATLVKQDDTVSILYFSSRG-QFGSVFVGEKVSSVSDL 80 Query: 234 AWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + I + I+R L T L+ A D + ++ +IFLTD Sbjct: 81 S----KINDAIDRYLRPTGCTGFVEPLQLAVEVATDLQADNGNLNS--------LIFLTD 128 Query: 293 GENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--FYSVQNSRK 350 G ++ D + L C+ + + L+ A V+ Sbjct: 129 GYDNCWRTD--DILKACSTLPLTFNNIAFLEYGYYVNRPLLEKMAEATNALHKFVEGFDA 186 Query: 351 LHDAFLRIGKEMVKQRI 367 AF I R+ Sbjct: 187 YVPAFDEIITAQTSARV 203 >gi|258624849|ref|ZP_05719777.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582847|gb|EEW07668.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 610 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 42/121 (34%), Gaps = 13/121 (10%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 S + S + + +++D+S SM T++ + LD++K + Sbjct: 71 PSWQSAERPSVQNSAARV-LLMDMSRSMYATDLTP----NRLTQARYKALDLLKGWQEGT 125 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIFGSTTKSTPGLEYAYNKIFDA 269 +GLV +++ PL + I ++ + + A + + A Sbjct: 126 ----TGLVAYAADAYVVSPLTSDTATLANLIPNLSPEIMPYQGANAANAVSLAISMLQQA 181 Query: 270 K 270 Sbjct: 182 G 182 >gi|254563469|ref|YP_003070564.1| MxaC protein [Methylobacterium extorquens DM4] gi|254270747|emb|CAX26752.1| MxaC protein [Methylobacterium extorquens DM4] Length = 355 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 50/159 (31%), Gaps = 22/159 (13%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF---GPGMDKLGVAT 196 + P V + IG + M++D S SMN+ F P + A Sbjct: 61 LAIGGLVLALAGPYRAGERVTRT---GIGAQISMLIDRSGSMNETFAGRQPSGAEESKAA 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TK 254 S R + D + + + FS+ + P+ ++ I + T Sbjct: 118 ASRRILRDFVGE----RAHDQFAVTAFSTAPMLVVPMTDRHDAVRAAIAAIDRPGLDYTN 173 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 GL A ++ + ++ ++DG Sbjct: 174 VARGLGMALSQFGAGAPG----------VSRALLLVSDG 202 >gi|240140959|ref|YP_002965439.1| MxaC protein [Methylobacterium extorquens AM1] gi|240010936|gb|ACS42162.1| MxaC protein [Methylobacterium extorquens AM1] Length = 355 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 50/159 (31%), Gaps = 22/159 (13%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF---GPGMDKLGVAT 196 + P V + IG + M++D S SMN+ F P + A Sbjct: 61 LAIGGLVLALAGPYRAGERVTRT---GIGAQISMLIDRSGSMNETFAGRQPSGAEESKAA 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TK 254 S R + D + + + FS+ + P+ ++ I + T Sbjct: 118 ASRRILRDFVGE----RAHDQFAVTAFSTAPMLVVPMTDRHDAVRAAIAAIDRPGLDYTN 173 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 GL A ++ + ++ ++DG Sbjct: 174 VARGLGMALSQFGAGAPG----------VSRALLLVSDG 202 >gi|228906371|ref|ZP_04070255.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis IBL 200] gi|228853283|gb|EEM98056.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis IBL 200] Length = 627 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|221058683|ref|XP_002259987.1| sporozoite surface protein 2 [Plasmodium knowlesi strain H] gi|193810060|emb|CAQ41254.1| sporozoite surface protein 2, putative [Plasmodium knowlesi strain H] Length = 579 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 67/221 (30%), Gaps = 36/221 (16%) Query: 166 DIGLDMMMVLDVSLSMNDH---------FGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + +D+ +++D S S+ ++ L ++ SI + + S ++ Sbjct: 40 NEKVDLYLLVDGSGSIGYANWITRVIPMLTGLIENLNLSKDSINLYMSLFAS--HTTELI 97 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G + + L +++ T + L E Sbjct: 98 RLGS---GPSMDKKQAL----NVVRDLRKGYEPYGNTSMSSAL--------SEVEMHLKD 142 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ---FL 333 + + +I +TDG + + +L K R + IG+ Q + Sbjct: 143 RVNRPNAIQLVILMTDGIPN----NKYRALELSRALKERNVKLAVIGIGQGINHQYNKLM 198 Query: 334 KNCASPDR---FYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 C +R FYS + + ++ + K Sbjct: 199 AGCRPRERSCKFYSSADWSEAISLIKPFIAKVCTEVERIAK 239 >gi|10945609|gb|AAG24613.1|AF298217_1 sporozoite surface protein 2 [Plasmodium knowlesi] Length = 572 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 67/221 (30%), Gaps = 36/221 (16%) Query: 166 DIGLDMMMVLDVSLSMNDH---------FGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + +D+ +++D S S+ ++ L ++ SI + + S ++ Sbjct: 40 NEKVDLYLLVDGSGSIGYANWITRVIPMLTGLIENLNLSKDSINLYMSLFAS--HTTELI 97 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G + + L +++ T + L E Sbjct: 98 RLGS---GPSMDKKQAL----NVVRDLRKGYEPYGNTSMSSAL--------SEVEMHLKD 142 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ---FL 333 + + +I +TDG + + +L K R + IG+ Q + Sbjct: 143 RVNRPNAIQLVILMTDGIPN----NKYRALELSRALKERNVKLAVIGIGQGINHQYNKLM 198 Query: 334 KNCASPDR---FYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 C +R FYS + + ++ + K Sbjct: 199 AGCRPRERSCKFYSSADWSEAISLIKPFIAKVCTEVERIAK 239 >gi|1164996|gb|AAA85569.1| mxaC [Methylobacterium extorquens AM1] Length = 355 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 50/159 (31%), Gaps = 22/159 (13%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF---GPGMDKLGVAT 196 + P V + IG + M++D S SMN+ F P + A Sbjct: 61 LAIGGLVLALAGPYRAGERVTRT---GIGAQISMLIDRSGSMNETFAGRQPSGAEESKAA 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TK 254 S R + D + + + FS+ + P+ ++ I + T Sbjct: 118 ASRRILRDFVGE----RAHDQFAVTAFSTAPMLVVPMTDRHDAVRAAIAAIDRPGLDYTN 173 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 GL A ++ + ++ ++DG Sbjct: 174 VARGLGMALSQFGAGAPG----------VSRALLLVSDG 202 >gi|1589785|gb|AAC47462.1| thrombospondin-related anonymous protein [Plasmodium knowlesi] Length = 533 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 67/221 (30%), Gaps = 36/221 (16%) Query: 166 DIGLDMMMVLDVSLSMNDH---------FGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + +D+ +++D S S+ ++ L ++ SI + + S ++ Sbjct: 1 NEKVDLYLLVDGSGSIGYANWITRVIPMLTGLIENLNLSKDSINLYMSLFAS--HTTELI 58 Query: 217 RSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 R G + + L +++ T + L E Sbjct: 59 RLGS---GPSMDKKQAL----NVVRDLRKGYEPYGNTSMSSAL--------SEVEMHLKD 103 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ---FL 333 + + +I +TDG + + +L K R + IG+ Q + Sbjct: 104 RVNRPNAIQLVILMTDGIPN----NKYRALELSRALKERNVKLAVIGIGQGINHQYNKLM 159 Query: 334 KNCASPDR---FYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 C +R FYS + + ++ + K Sbjct: 160 AGCRPRERSCKFYSSADWSEAISLIKPFIAKVCTEVERIAK 200 >gi|326922791|ref|XP_003207628.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1224 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 65/167 (38%), Gaps = 24/167 (14%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV---RSGL 220 D L++ +LD S S H+ K + + E++D +K + + R L Sbjct: 18 DEDCILEIAFLLDSSES-AKHYNHEQQK-----KFVLEVVDQMKGLQLSSGRTLSWRMAL 71 Query: 221 VTFSSKIVQTFPL-AW-GVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHIA 277 + +SS + W G + + I + + T +T + ++ Sbjct: 72 LQYSSTVSTEQTFHDWKGPEAFKSHIAPITYIGHGTYTTYAI---------TNLTQLYMT 122 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 +G D K + TDG + N D + +AK +G +++ +G+ Sbjct: 123 EGTPDSLKLAVLFTDGVDHPRNPDIFAATA---DAKNQGIVLFTMGM 166 >gi|307565295|ref|ZP_07627788.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A] gi|307345964|gb|EFN91308.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A] Length = 290 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 47/123 (38%), Gaps = 6/123 (4%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ M + V + I+ N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLDFGTQLQMKRDRVTEVAATLAFSAIE------NNDKVGVIFF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S I + + G +HI I ++ L+ A + ++ D Y Sbjct: 126 SDTIEKYIVPSKGRKHILYCIREMLTFKPKSKHTDLQIAIEYLTKVLKRKCTAFIISDFY 185 Query: 284 KKY 286 K+ Sbjct: 186 TKF 188 >gi|115537688|ref|NP_872008.2| hypothetical protein T19D12.4 [Caenorhabditis elegans] gi|82654523|gb|ABB88212.1| Hypothetical protein T19D12.4b [Caenorhabditis elegans] Length = 1015 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 58/160 (36%), Gaps = 23/160 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LDM++ D S S++ P ++ N+ R G++TFSS +V Sbjct: 375 LDMIIAFDTSESLSSLIVPQYVDFAKK---------LVAQYKYGNDNTRVGIITFSSDVV 425 Query: 229 QTFPLAWG--VQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + L G + + I+ + + T T A N + + + K Sbjct: 426 EVRKLTDGNTLDAVNAAIDTVHYTGGLTNVTKAQLTAKNLF---------DTESNANRNK 476 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + LTDG + ++ + ++ K I + +G Sbjct: 477 VLFILTDGVPTVDTYTDE--VAAGDKLKSISVISFFVGYS 514 >gi|4996347|dbj|BAA78416.1| complement B/C2-A2 [Cyprinus carpio] Length = 750 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 84/225 (37%), Gaps = 29/225 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 KI+ LD+ + +D S S++ A +I+ ++D I P N Sbjct: 250 KITMNQGGKLDIYIAVDASESIDKK------DFENAKTTIKMLIDKISYYPVSPNYE--- 300 Query: 220 LVTFSSKIVQTFPL--------AWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAK 270 ++ F++ + + + A + +I + ++ K+ + AY I ++ Sbjct: 301 ILMFATDVTRITSMRDFKNNEDARNLMNIFKDLDDFNYERVGVKTGTNIAKAYTAILESI 360 Query: 271 EKLE-HIAKGHDDYKKYIIFLTDGE-NSSPNIDNK-------ESLFYCNEAKRRGAIVYA 321 + E + A ++ + +I TDG+ N N K + + N K V+ Sbjct: 361 KLEELNNAAIFNETQHIVILFTDGQANMGGNPRPKVEQIKHLVTKNHPNREKNLDLYVFG 420 Query: 322 IG--VQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 +G V E + + F+ +++ ++ F + E Sbjct: 421 VGDDVNQEDINGLVSQRDQEKYFFKLKDLTEVQKMFDDMIDESTS 465 >gi|302561365|ref|ZP_07313707.1| magnesium chelatase [Streptomyces griseoflavus Tu4000] gi|302478983|gb|EFL42076.1| magnesium chelatase [Streptomyces griseoflavus Tu4000] Length = 201 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 47/141 (33%), Gaps = 20/141 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++G ++ +L + + G Sbjct: 9 QATREGREGNLVLFVVDASGSMA-----ARQRMGAVKGAVLSLL-----LDAYQRRDKVG 58 Query: 220 LVTFSSKI-VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK-EKLEHIA 277 LVTF P V ++ L G T GL A+ + + A Sbjct: 59 LVTFRGSATEVALPPTSSVDAAAARLESLPTGGRTPLAAGLLRAHEVLRVERLRDPARRA 118 Query: 278 KGHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 119 --------LVVVVTDGRATGG 131 >gi|254383386|ref|ZP_04998738.1| hypothetical protein SSAG_03040 [Streptomyces sp. Mg1] gi|194342283|gb|EDX23249.1| hypothetical protein SSAG_03040 [Streptomyces sp. Mg1] Length = 686 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 58/202 (28%), Gaps = 16/202 (7%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 ++ K+ + +VLD S SM + + + + + + + D Sbjct: 488 RAAGQVLRAKGKTGARAKVYLVLDRSGSMRPFY------KDGSAQFLADHTLALAAHLDA 541 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + V + V FS+ + T L G H A + Sbjct: 542 DATVHT--VFFSTDVDGTAELTLG-SHDAAWAEARHAELGRMGRTSYHVAVEAV------ 592 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 LEH K ++F TDG + + + A A G A F Sbjct: 593 LEHYRKDGGTGPALVVFQTDGAPDNRQPARQAIIDAAATAPDVHWQFVAFGDHENKAFDF 652 Query: 333 LKNC-ASPDRFYSVQNSRKLHD 353 L+ A F+ + Sbjct: 653 LRKLDAGNAGFFHAGPAPATLT 674 >gi|58429481|gb|AAW78144.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|17536325|ref|NP_495348.1| hypothetical protein T19D12.4 [Caenorhabditis elegans] gi|3258584|gb|AAC24429.1| Hypothetical protein T19D12.4a [Caenorhabditis elegans] Length = 1028 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 58/160 (36%), Gaps = 23/160 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 LDM++ D S S++ P ++ N+ R G++TFSS +V Sbjct: 388 LDMIIAFDTSESLSSLIVPQYVDFAKK---------LVAQYKYGNDNTRVGIITFSSDVV 438 Query: 229 QTFPLAWG--VQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 + L G + + I+ + + T T A N + + + K Sbjct: 439 EVRKLTDGNTLDAVNAAIDTVHYTGGLTNVTKAQLTAKNLF---------DTESNANRNK 489 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + LTDG + ++ + ++ K I + +G Sbjct: 490 VLFILTDGVPTVDTYTDE--VAAGDKLKSISVISFFVGYS 527 >gi|228913311|ref|ZP_04076945.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar pulsiensis BGSC 4CC1] gi|228846362|gb|EEM91380.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar pulsiensis BGSC 4CC1] Length = 609 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 405 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 452 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 453 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 512 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 513 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 572 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 573 QLMKNIY--NHHFLVANHAE 590 >gi|228925811|ref|ZP_04088895.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|229120272|ref|ZP_04249523.1| Von Willebrand factor type A domain protein [Bacillus cereus 95/8201] gi|228663313|gb|EEL18902.1| Von Willebrand factor type A domain protein [Bacillus cereus 95/8201] gi|228833826|gb|EEM79379.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] Length = 609 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 405 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 452 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 453 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 512 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 513 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 572 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 573 QLMKNIY--NHHFLVANHAE 590 >gi|213626825|gb|AAI70192.1| Complement factor B [Xenopus laevis] Length = 745 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 76/222 (34%), Gaps = 35/222 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D +++ +VLD S S+ + + I+ + D + R +++++ Sbjct: 236 KDGLMNIFIVLDTSKSVGEEKFEEAKEASKL---------FIEKMADYDIKPRYCIISYA 286 Query: 225 SKIVQTFPL----AWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEH 275 S + L + + + + + + T + L Y + + + E Sbjct: 287 SVAIAVVSLRDPDSNDAEAVTKHLEDFQYNNHADKQGTNTRAALHSIYEHLIEQELAYEK 346 Query: 276 IAKGHDDYK--KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI----------VYAIG 323 K D K I+ +TDG+ + D +E + G VY G Sbjct: 347 EGKKADFMKIHNVILLMTDGKFNMGG-DPREEMKLIIRFLDIGIRTENPRLEYLDVYVFG 405 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKE 361 + ++ + AS + ++N K+ + F + E Sbjct: 406 LGSDIDQPEINELASKKDKEVHTFHLENVNKMKEFFELMLDE 447 >gi|167640939|ref|ZP_02399197.1| conserved hypothetical protein [Bacillus anthracis str. A0193] gi|167511159|gb|EDR86547.1| conserved hypothetical protein [Bacillus anthracis str. A0193] Length = 609 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 405 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 452 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 453 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 512 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 513 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 572 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 573 QLMKNIY--NHHFLVANHAE 590 >gi|52080631|ref|YP_079422.1| von Willebrand factor type A domain-containing protein [Bacillus licheniformis ATCC 14580] gi|52786005|ref|YP_091834.1| YojO [Bacillus licheniformis ATCC 14580] gi|319645408|ref|ZP_07999640.1| YojO protein [Bacillus sp. BT1B_CT2] gi|52003842|gb|AAU23784.1| von Willebrand factor, type A domain containing protein [Bacillus licheniformis ATCC 14580] gi|52348507|gb|AAU41141.1| YojO [Bacillus licheniformis ATCC 14580] gi|317392294|gb|EFV73089.1| YojO protein [Bacillus sp. BT1B_CT2] Length = 637 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 65/166 (39%), Gaps = 13/166 (7%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++I +++D S SM+ DK+ R I + +KS+ + +V G Sbjct: 433 KQEPSTEIDAVFTLLVDCSASMH-------DKMAETKRGIVLFHEALKSVAVPHQIV--G 483 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ +T + + + L G+ N+ A ++ + Sbjct: 484 FWEDTNDATETSQPNY-FHTVVSFADSLKAGAGPHIMQLEPEEDNRDGYAIRQMTKMLVQ 542 Query: 280 HDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 + +K++I +DGE ++ + + ++ EA++R V + Sbjct: 543 RSEAQKFLIVFSDGEPAAFDYEQNGIVDTHEAVMEARKRNIEVINV 588 >gi|148237705|ref|NP_001079047.1| complement factor B [Xenopus laevis] gi|1030718|dbj|BAA08371.1| Xenopus Bf B [Xenopus laevis] Length = 745 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 76/222 (34%), Gaps = 35/222 (15%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D +++ +VLD S S+ + + I+ + D + R +++++ Sbjct: 236 KDGLMNIFIVLDTSKSVGEEKFEEAKEASKL---------FIEKMADYDIKPRYCIISYA 286 Query: 225 SKIVQTFPL----AWGVQHIQEKINRLIFG-----STTKSTPGLEYAYNKIFDAKEKLEH 275 S + L + + + + + + T + L Y + + + E Sbjct: 287 SVAIAVVSLRDPDSNDAEAVTKHLEDFQYNNHADKQGTNTRAALHSIYEHLIEQELAYEK 346 Query: 276 IAKGHDDYK--KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI----------VYAIG 323 K D K I+ +TDG+ + D +E + G VY G Sbjct: 347 EGKKADFMKIHNVILLMTDGKFNMGG-DPREEMKLIIRFLDIGIRTENPRLEYLDVYVFG 405 Query: 324 VQAEAADQFLKNCASPD----RFYSVQNSRKLHDAFLRIGKE 361 + ++ + AS + ++N K+ + F + E Sbjct: 406 LGSDIDQPEINELASKKDKEVHTFHLENVNKMKEFFELMLDE 447 >gi|297685993|ref|XP_002820555.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Pongo abelii] Length = 150 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 28/79 (35%), Gaps = 4/79 (5%) Query: 243 KINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDN 302 I+ + T L+ A + + + H G I+FLTDG+ + Sbjct: 70 YIHHMSPTGGTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPTVGETHT 125 Query: 303 KESLFYCNEAKRRGAIVYA 321 + L EA R ++ Sbjct: 126 LKILNNTREAARGQVCIFT 144 >gi|229165565|ref|ZP_04293342.1| Von Willebrand factor type A domain protein [Bacillus cereus AH621] gi|228617918|gb|EEK74966.1| Von Willebrand factor type A domain protein [Bacillus cereus AH621] Length = 627 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|254464781|ref|ZP_05078192.1| nitric oxide reductase D protein [Rhodobacterales bacterium Y4I] gi|206685689|gb|EDZ46171.1| nitric oxide reductase D protein [Rhodobacterales bacterium Y4I] Length = 625 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 71/208 (34%), Gaps = 30/208 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +++D S S G + A S+ + I + D R G+ FSS Sbjct: 433 DLSVAVLMDCSRSTEATVGDR-PVIETARESLSALAGGIATAGD-----RLGIWGFSSLR 486 Query: 228 VQTFPLAWGV-------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 L + + +I G T+ + +A + + + Sbjct: 487 RDRVFLTRAKGFEDPMSEAVTARIGGFKPGHYTRLGAAIRHASALLAEEGSER------- 539 Query: 281 DDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + ++ LTDG+ + ++S EA+ G V+ + + A+ D F + Sbjct: 540 ----RLLLVLTDGKPNDLDHYEGVHGIEDSRMAVREARALGQSVHGVVIDADGQDWFARI 595 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMV 363 F + + +L A I + + Sbjct: 596 FGRAG-FTLLPDPDRLPRALPEIYQSLT 622 >gi|47086179|ref|NP_998093.1| integrin, beta 1b.2 [Danio rerio] gi|45709487|gb|AAH67552.1| Integrin, beta 1b.2 [Danio rerio] Length = 619 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 54/174 (31%), Gaps = 33/174 (18%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++K D +D+ ++D+S SM + + + D+ +R Sbjct: 126 TLKFKRAEDYPIDLYFLMDLSHSMLSNLENFKNLGTELAN----------EMKDITKDLR 175 Query: 218 SGLVTF----SSKIVQTFP--------------LAWGVQHIQEKINRLIFGSTTKSTPGL 259 G +F S + FP L ++I++L S+ Sbjct: 176 IGFGSFFRKPSIQTNPCFPDNCIAPFSYFNVLSLTDDHALFTQEISKLKTSGNLDSSEA- 234 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 + A + G + + ++F TD + + + N+ K Sbjct: 235 --GLEALMQAA--VCTDVIGWRNATRVLVFFTDAGLRFSGDGKRGGIVHLNDGK 284 >gi|291446035|ref|ZP_06585425.1| predicted protein [Streptomyces roseosporus NRRL 15998] gi|291348982|gb|EFE75886.1| predicted protein [Streptomyces roseosporus NRRL 15998] Length = 537 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 47/129 (36%), Gaps = 16/129 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + + +VLD S SM + + + + + + + D + VR V FS Sbjct: 341 TGLRARVYLVLDRSGSMRPFY------KDGSAQHLGDRTLALAAHLDEDATVRV--VFFS 392 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + I T L + + +++ L G E A ++ E K Sbjct: 393 TDIDGTGSLE--LSGHEGRVDELHAGLGRLGRTHYERAVEEVVADYE------KAEATGP 444 Query: 285 KYIIFLTDG 293 +IF TDG Sbjct: 445 ALVIFQTDG 453 >gi|239942573|ref|ZP_04694510.1| hypothetical protein SrosN15_16393 [Streptomyces roseosporus NRRL 15998] gi|239989034|ref|ZP_04709698.1| hypothetical protein SrosN1_17140 [Streptomyces roseosporus NRRL 11379] Length = 531 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 47/129 (36%), Gaps = 16/129 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + + +VLD S SM + + + + + + + D + VR V FS Sbjct: 335 TGLRARVYLVLDRSGSMRPFY------KDGSAQHLGDRTLALAAHLDEDATVRV--VFFS 386 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + I T L + + +++ L G E A ++ E K Sbjct: 387 TDIDGTGSLE--LSGHEGRVDELHAGLGRLGRTHYERAVEEVVADYE------KAEATGP 438 Query: 285 KYIIFLTDG 293 +IF TDG Sbjct: 439 ALVIFQTDG 447 >gi|229131564|ref|ZP_04260450.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-ST196] gi|228651908|gb|EEL07859.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-ST196] Length = 627 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|167624594|ref|YP_001674888.1| TPR repeat-containing protein [Shewanella halifaxensis HAW-EB4] gi|167354616|gb|ABZ77229.1| Tetratricopeptide TPR_2 repeat protein [Shewanella halifaxensis HAW-EB4] Length = 672 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 57/173 (32%), Gaps = 28/173 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+D+S SM + T++ + D+I +PD +GL+ ++ Sbjct: 90 VIVMDMSQSMYANDLAP----NRLTQAKFKATDLIDELPDGE----TGLIAYAGDAFTIS 141 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 PL + + L LE A + + HI II T Sbjct: 142 PLTRDKATLLNLLPTLTPSIMPIKGSNLEAAITQAKSLLSQGGHIRGD-------IILFT 194 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA-------DQFLKNCA 337 DG I + + + K + I ++ Q L++ A Sbjct: 195 DG------ISSTQFKRAESVLKDSQYRLAIIAFGSQQGSPIKLPDGQLLRDSA 241 >gi|333028008|ref|ZP_08456072.1| putative magnesium chelatase [Streptomyces sp. Tu6071] gi|332747860|gb|EGJ78301.1| putative magnesium chelatase [Streptomyces sp. Tu6071] Length = 718 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 48/140 (34%), Gaps = 20/140 (14%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 S + G ++ V+D S SM ++GV ++ +L + + GL Sbjct: 520 ASREGREGNLVLFVVDASGSMA-----ARQRMGVVKGAVLSLL-----LDAYQRRDKVGL 569 Query: 221 VTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK-EKLEHIAK 278 VTF + P V ++ L G T GL A + + + A Sbjct: 570 VTFRGREAGLALPPTSSVDTAAARLESLPTGGRTPLAAGLLKARDVLRIERLRDPARRA- 628 Query: 279 GHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 629 -------LLVVVTDGRATGG 641 >gi|307294185|ref|ZP_07574029.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] gi|306880336|gb|EFN11553.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] Length = 157 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 50/156 (32%), Gaps = 5/156 (3%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSL-LYTATKILN 65 R FF + G+ ++ A+++P++ ++ + + + F K L + + L TA N Sbjct: 3 RRFFADPMGTSTVELALIMPILVLLACMAGDVAMAFKAKIALQRAAERTGQLATAGGYTN 62 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + + + D S + Q K Y Sbjct: 63 DTSKTQAAYNNLAADAAAAAGVSTNNVTVTPT---LLCDATVQTASPEVPCADGQQTKRY 119 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 +S P P +N S + IT S + Sbjct: 120 VAITISGSYTPMFAKLMP-GSNWSTQGIPITGSASV 154 >gi|229056399|ref|ZP_04195812.1| Von Willebrand factor type A domain protein [Bacillus cereus AH603] gi|228720873|gb|EEL72422.1| Von Willebrand factor type A domain protein [Bacillus cereus AH603] Length = 627 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|222094375|ref|YP_002528434.1| von willebrand factor type a domain protein [Bacillus cereus Q1] gi|229194935|ref|ZP_04321717.1| Von Willebrand factor type A domain protein [Bacillus cereus m1293] gi|221238432|gb|ACM11142.1| von Willebrand factor type A domain protein [Bacillus cereus Q1] gi|228588566|gb|EEK46602.1| Von Willebrand factor type A domain protein [Bacillus cereus m1293] gi|324324664|gb|ADY19924.1| von Willebrand factor type A [Bacillus thuringiensis serovar finitimus YBT-020] Length = 627 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|109466064|ref|XP_001075558.1| PREDICTED: integrin, alpha 2 [Rattus norvegicus] Length = 1178 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 39/289 (13%), Positives = 92/289 (31%), Gaps = 36/289 (12%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS---RYEMPFIFCTFPWCANSS 149 + L+ + ++ I+ + SL + + ++ + C++ S Sbjct: 91 KLNLQNSASISNVTEIKTNMSLGLTLTRNPGTGGFLTCGPLWAHQCGNQYYATGICSDVS 150 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 +TS +D+++V D S S + + + + + Sbjct: 151 PDFQSLTSFSPAVQACPSLVDVVVVCDESNS--------IYPWEAVKNFLEKFVQGLDIG 202 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 P L+ +++ F L + K + + S T+ G K Sbjct: 203 PKKTQ---VALIQYANDPRVVFNLT----TYKNKEDMVQATSETRQYGGDLTNTFKAIQF 255 Query: 270 KEKLEHIAKGHDDY--KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--- 324 + ++ + K ++ +TDGE+ + + + CN + + I V Sbjct: 256 ARDIAYLPESGGRPGATKVMVVVTDGESHDGSK-LQTVIQQCN---DDEILRFGIAVLGY 311 Query: 325 ---QAEAADQF---LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 A +K AS F++V + L + +G+ + Sbjct: 312 LNRNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAGTLGEHIFS 360 >gi|70943972|ref|XP_741969.1| hypothetical protein [Plasmodium chabaudi chabaudi] gi|56520682|emb|CAH74986.1| hypothetical protein PC000452.00.0 [Plasmodium chabaudi chabaudi] Length = 831 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 60/183 (32%), Gaps = 31/183 (16%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS----S 225 D+ ++LD S S+ + + L+ ++ V G++ F+ Sbjct: 647 DLTVILDESGSIGANNWERQ-----VYPFTEKFLNNLEISE---KNVHVGIMLFAQYNRD 698 Query: 226 KIVQTFPLAWGVQHIQEKINRLI----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + + ++ +H+ + L G T L Y A Sbjct: 699 FVKFSDKESYDKEHLMTHVKTLKKSYKSGGYTYIISALNYGLV-------NYTRHADSRS 751 Query: 282 DYKKYIIFLTDGENSSPNID--NKESLFYCNEAKRRGAIVYAIGVQAE--AADQFLKNCA 337 D K + TDG N+ P + SL Y K+ + +GV A A + L C Sbjct: 752 DVPKVTMLFTDGNNTDPGDKLLSDASLLY----KQENVKLLVVGVGASTMANLRLLAGCH 807 Query: 338 SPD 340 D Sbjct: 808 KTD 810 Score = 39.8 bits (91), Expect = 0.63, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 61/174 (35%), Gaps = 20/174 (11%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ ++LD S S+ + M+ + A D+I ++ N V +G++ FS + Sbjct: 426 DVTLILDESASIGESRWT-MEVIPFAK-------DVINNLNIGYNSVHAGILLFSHYALD 477 Query: 230 TFPLA----WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P + + + +I+ L T G E K + K Sbjct: 478 LVPFSDAARYNKDSLINRIDSLK----TNYGNGHESFIVKTLKYALYNYTKGSDRTNAPK 533 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCA 337 + TDG + + + E + + + IGV + + L CA Sbjct: 534 ITMLFTDG--NDSSESDIEMYNIGSLYRTERVKLLVIGVSMASESKLKQLVGCA 585 >gi|3273247|dbj|BAA31167.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 66/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNENAIHLYANDFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|327403930|ref|YP_004344768.1| hypothetical protein Fluta_1943 [Fluviicola taffensis DSM 16823] gi|327319438|gb|AEA43930.1| protein of unknown function DUF58 [Fluviicola taffensis DSM 16823] Length = 289 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 54/152 (35%), Gaps = 15/152 (9%) Query: 121 QHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLS 180 + + + S V Y+ T W + + + L +++++DVS S Sbjct: 34 KGRGMSFSEVRSYQFGDDVRTIDWNVTARFR-----EPYIKIFEEERELTVLLIIDVSGS 88 Query: 181 MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 M +FG G D + S+ L + + G + S ++ P G H+ Sbjct: 89 M--YFGQGKD----SKISLAVELAATLAFSAAKKNDKVGAILISDEVEYYVPPKKGFGHV 142 Query: 241 ----QEKINRLIFGSTTKSTPGLEYAYNKIFD 268 ++ IN + T GL YA N Sbjct: 143 HFLLRKLINLNPKSTGTSLDMGLRYARNIFKQ 174 >gi|229028418|ref|ZP_04184540.1| Von Willebrand factor type A domain protein [Bacillus cereus AH1271] gi|228732867|gb|EEL83727.1| Von Willebrand factor type A domain protein [Bacillus cereus AH1271] Length = 627 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|221213132|ref|ZP_03586108.1| membrane protein [Burkholderia multivorans CGD1] gi|221167345|gb|EED99815.1| membrane protein [Burkholderia multivorans CGD1] Length = 609 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 40/259 (15%), Positives = 76/259 (29%), Gaps = 30/259 (11%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK 76 ++++ AI + V IV+G I+ + +F + L + D + L + N+ Q Sbjct: 1 MAVVAAIWIAVALIVLG-SIDVGNLYFQRRDLQRVADMTALAAVQSV------NDLCPQT 53 Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMP 136 + N T NGF D S S++ D DY A + Sbjct: 54 DTTVTASGSNAVVTAAYRGAALNGF--DAQASGNSMSIACGRWDV-SDYGA-AAGYFGTA 109 Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIG-------LDMMMVLDVSLSMNDHFGPGM 189 + PL + S + + L + S D G + Sbjct: 110 TNQLNAVRVVAAKTVPLFFIGPPRTISAASTAKASNIDTFSIGTTLAMFGSNQDCAGNSV 169 Query: 190 DKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 T + +L + + + S I LA + + + + Sbjct: 170 SADQRNTGLVNALLGALLN------------TSLSLNIGSYQALACTRVKVGDLVKAQVG 217 Query: 250 GSTTKSTPGLEYAYNKIFD 268 T + N++ Sbjct: 218 AGTVDQLLATKLTLNQLVS 236 >gi|189238321|ref|XP_972429.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 653 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 67/218 (30%), Gaps = 34/218 (15%) Query: 171 MMMVLDVSLSM-NDHFGPGMDKLGVATRSIR--EMLDIIKSIPDV-------NNVVRSGL 220 ++ VLD S SM + M + + ++ I++ DV N + Sbjct: 248 VVFVLDHSGSMRGRKYEQLMQAMDKILSDLNPDDLFHIVRFSDDVSVWNLEKNKFDQIRF 307 Query: 221 VTFSSKIVQTFPLA------------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 LA ++ + + + T GL + Sbjct: 308 EQMPDYENLDTSLAELNLGEAIQVTEDNIKKAKRIKDDDVNMGCTNIIGGLAVGLYLVRR 367 Query: 269 AKEK--LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK--RRGAIVYAIGV 324 +K +++A H IIFLTDG + + E + A ++++ Sbjct: 368 TLQKFYEKNVATKHQPM---IIFLTDGLPNVGISNPDEITKIVTKINQGTNRAAIFSMSF 424 Query: 325 QAEAADQFLKNCAS-----PDRFYSVQNSRKLHDAFLR 357 +A FLK ++ Y ++ F R Sbjct: 425 GEDADKNFLKKLSAQNLGFSRHIYEAADAALQLQNFYR 462 >gi|218532399|ref|YP_002423215.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|218524702|gb|ACK85287.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] Length = 355 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 50/159 (31%), Gaps = 22/159 (13%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF---GPGMDKLGVAT 196 + P V + IG + M++D S SMN+ F P + A Sbjct: 61 LAIGGLVLALAGPYRAGERVTRT---GIGAQISMLIDRSGSMNETFAGRQPSGAEESKAA 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TK 254 S R + D + + + FS+ + P+ ++ I + T Sbjct: 118 ASRRILRDFVGE----RAHDQFAVTAFSTAPMLVVPMTDRHDAVRAAIAAIDRPGLDYTN 173 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 GL A ++ + ++ ++DG Sbjct: 174 VARGLGMALSQFGAGAPG----------VSRALLLVSDG 202 >gi|163853540|ref|YP_001641583.1| von Willebrand factor type A [Methylobacterium extorquens PA1] gi|163665145|gb|ABY32512.1| von Willebrand factor type A [Methylobacterium extorquens PA1] Length = 355 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 55/164 (33%), Gaps = 22/164 (13%) Query: 138 IFCTFPWCANSSHAPLLITSSVKIS---SKSDIGLDMMMVLDVSLSMNDHF---GPGMDK 191 I T L + + S +++ IG + M++D S SMN+ F P + Sbjct: 53 IVLTAAGLLAIGGLVLALAGPYRASERVTRTGIGAQISMLIDRSGSMNETFAGRQPSGAE 112 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 A S R + D + + + FS+ + P+ ++ I + Sbjct: 113 ESKAAASRRILRDFVGE----RAHDQFAVTAFSTAPMLVVPMTDRHDAVRAAIAAIDRPG 168 Query: 252 T--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 T GL A ++ + ++ ++DG Sbjct: 169 LDYTNVARGLGMALSQFGAGAPG----------VSRALLLVSDG 202 >gi|21219969|ref|NP_625748.1| hypothetical protein SCO1467 [Streptomyces coelicolor A3(2)] gi|7209227|emb|CAB76889.1| conserved hypothetical protein SCL6.24c [Streptomyces coelicolor A3(2)] Length = 491 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 48/147 (32%), Gaps = 24/147 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRS---GLVTF 223 L ++V+D S SM ++KL A R+ I+ +PD VR V + Sbjct: 58 LAQVLVMDCSSSMTW----PVEKLHAAQRAAVA---AIRKLPDGTPFAVVRGNEQAAVVY 110 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 A + + + G T L+ + + + + H Sbjct: 111 PDTPRMARASARTRSRAERAVRETVAGGGTCIGAWLDLSRRLLTEQDAPIGH-------- 162 Query: 284 KKYIIFLTDGENSSPNIDN-KESLFYC 309 ++ LTDG+N L C Sbjct: 163 ---VLLLTDGKNQHDEQMPLARVLEEC 186 >gi|84503110|ref|ZP_01001206.1| von Willebrand factor type A domain [Oceanicola batsensis HTCC2597] gi|84388654|gb|EAQ01526.1| von Willebrand factor type A domain [Oceanicola batsensis HTCC2597] Length = 750 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 70/190 (36%), Gaps = 35/190 (18%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGM------DKLGVATRSIREMLDIIKS---IP 210 + +S + + + ++LD+S S D G D + R++ + D + Sbjct: 552 EGTSPPERSIAVHLLLDMSRSTADRVGSQTVLSLERDAAAILARAMDRLGDPLAITAFAS 611 Query: 211 DVNNVVRSGLVT-FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 + + +R+ V F ++ +A ++ L G +T+ L A + Sbjct: 612 NGRDDLRTVPVKRFPDELGLLSGMA---------LSGLTPGYSTRIGAALRLAGRSV--- 659 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK-----ESLFYCNEAKRRGAIVYAIGV 324 E H + ++ LTDGE S ++ ++ ++ + + + I + Sbjct: 660 SEVPCHR--------RLVLLLTDGEPSDVDVPDRDYLVADARRAVHGLSAKNIDTFCIAL 711 Query: 325 QAEAADQFLK 334 ++ D + Sbjct: 712 GSDVGDSVAR 721 >gi|291223813|ref|XP_002731902.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 1001 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 36/171 (21%), Positives = 58/171 (33%), Gaps = 29/171 (16%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 L ++ VLD+S SM F + L I+ + + V G+V F Sbjct: 334 KTSPLRIVFVLDISGSM--SFNNRIGILESLATKF------IRYVVPGGHFV--GIVEFD 383 Query: 225 SKIVQTFPLAWGV-QHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 S L +E + LI T GL + E A+G Sbjct: 384 SHATVKSNLTEITSSTTREYLVSLIPSYTSGATCIGCGLLSGIQVL----ESNGQSARGG 439 Query: 281 DDYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD 330 ++ ++DG EN SP I+ + +G +V + + A D Sbjct: 440 -----ILLLISDGQENVSPYINTMKPDLI-----SKGVVVDTVALSGNAED 480 >gi|206978157|ref|ZP_03239039.1| conserved hypothetical protein [Bacillus cereus H3081.97] gi|206743629|gb|EDZ55054.1| conserved hypothetical protein [Bacillus cereus H3081.97] Length = 627 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|289772819|ref|ZP_06532197.1| conserved hypothetical protein [Streptomyces lividans TK24] gi|289703018|gb|EFD70447.1| conserved hypothetical protein [Streptomyces lividans TK24] Length = 450 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 48/147 (32%), Gaps = 24/147 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRS---GLVTF 223 L ++V+D S SM ++KL A R+ I+ +PD VR V + Sbjct: 17 LAQVLVMDCSSSMTW----PVEKLHAAQRAAVA---AIRKLPDGTPFAVVRGNEQAAVVY 69 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 A + + + G T L+ + + + + H Sbjct: 70 PDTPRMARASARTRSRAERAVRETVAGGGTCIGAWLDLSRRLLTEQDAPIGH-------- 121 Query: 284 KKYIIFLTDGENSSPNIDN-KESLFYC 309 ++ LTDG+N L C Sbjct: 122 ---VLLLTDGKNQHDEQMPLARVLEEC 145 >gi|171693029|ref|XP_001911439.1| hypothetical protein [Podospora anserina S mat+] gi|170946463|emb|CAP73264.1| unnamed protein product [Podospora anserina S mat+] Length = 1378 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 47/352 (13%), Positives = 102/352 (28%), Gaps = 50/352 (14%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 +++ F + KG +T + +P S + S+ T + Sbjct: 200 LKHHFADRKG----ITTLTIPTCIA--------SRYGDKPQDYNNAASTSIPEGLTIQIE 247 Query: 66 QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 + ++N T + + +D + + + + D Sbjct: 248 VVEAGKIASIVSPTHKVTVENWLGTRAASSFAD-LVGEDTRSSVETALVKLETGSAFLDR 306 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS---------VKISSKSDIGLDMMMVLD 176 + P P H L + +S +++++ D Sbjct: 307 DFVLDIATGGPNDEAESPQAWIEKHPTLPNQQALMVTIPPGFTTRTSNPTDQTEILLLAD 366 Query: 177 VSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG 236 +S SM+ DKL +++ L K IP+ F+ + +W Sbjct: 367 LSGSMD-------DKLTSLRAAMQFFL---KGIPNGRK--------FNVWCFGSSYKSWQ 408 Query: 237 VQHI-QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + + S + ++ A + + II LTDGE Sbjct: 409 PHSVDYGEASYQSASSWVDTNFHANMGGTELLPAVQAIVTARDKRLPTD--IIILTDGE- 465 Query: 296 SSPNIDNKESLFYCNEAKRR---GAIVYAIGVQAEAADQFLKNCASPDRFYS 344 E+L Y + + G +A+G+ + ++ A Y+ Sbjct: 466 ---TWRLDETLEYIRKQRDLTEGGIRFFALGIGPAVSHALVEGIAKVGGGYA 514 >gi|314982666|gb|EFT26758.1| cobaltochelatase subunit [Propionibacterium acnes HL110PA3] gi|315091325|gb|EFT63301.1| cobaltochelatase subunit [Propionibacterium acnes HL110PA4] Length = 635 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 54/158 (34%), Gaps = 19/158 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 432 RAGRAASCVIFVVDASGSMG-----SRGRMVASKGAVLSLL-----LDAYIKRDRVCLIG 481 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 482 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLVKACEVV-----RPLLLKDPGL 536 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 ++ +TDG ++ +D + + +EA R + Sbjct: 537 RP--LLVLVTDGRSNVG-LDGRPNSRATDEAIRVATKI 571 >gi|314965508|gb|EFT09607.1| cobaltochelatase subunit [Propionibacterium acnes HL082PA2] gi|315094559|gb|EFT66535.1| cobaltochelatase subunit [Propionibacterium acnes HL060PA1] gi|315104978|gb|EFT76954.1| cobaltochelatase subunit [Propionibacterium acnes HL050PA2] gi|327329053|gb|EGE70813.1| cobaltochelatase subunit [Propionibacterium acnes HL103PA1] Length = 635 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 54/158 (34%), Gaps = 19/158 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 432 RAGRAASCVIFVVDASGSMG-----SRGRMVASKGAVLSLL-----LDAYIKRDRVCLIG 481 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 482 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLVKACEVV-----RPLLLKDPGL 536 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 ++ +TDG ++ +D + + +EA R + Sbjct: 537 RP--LLVLVTDGRSNVG-LDGRPNSRATDEAIRVATKI 571 >gi|282853148|ref|ZP_06262485.1| cobaltochelatase subunit [Propionibacterium acnes J139] gi|282582601|gb|EFB87981.1| cobaltochelatase subunit [Propionibacterium acnes J139] Length = 635 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 54/158 (34%), Gaps = 19/158 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 432 RAGRAASCVIFVVDASGSMG-----SRGRMVASKGAVLSLL-----LDAYIKRDRVCLIG 481 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 482 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLVKACEVV-----RPLLLKDPGL 536 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 ++ +TDG ++ +D + + +EA R + Sbjct: 537 RP--LLVLVTDGRSNVG-LDGRPNSRATDEAIRVATKI 571 >gi|256788931|ref|ZP_05527362.1| hypothetical protein SlivT_30978 [Streptomyces lividans TK24] Length = 477 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 48/147 (32%), Gaps = 24/147 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV--VRS---GLVTF 223 L ++V+D S SM ++KL A R+ I+ +PD VR V + Sbjct: 44 LAQVLVMDCSSSMTW----PVEKLHAAQRAAVA---AIRKLPDGTPFAVVRGNEQAAVVY 96 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 A + + + G T L+ + + + + H Sbjct: 97 PDTPRMARASARTRSRAERAVRETVAGGGTCIGAWLDLSRRLLTEQDAPIGH-------- 148 Query: 284 KKYIIFLTDGENSSPNIDN-KESLFYC 309 ++ LTDG+N L C Sbjct: 149 ---VLLLTDGKNQHDEQMPLARVLEEC 172 >gi|124028121|ref|YP_001013441.1| hypothetical protein Hbut_1264 [Hyperthermus butylicus DSM 5456] gi|123978815|gb|ABM81096.1| conserved archaeal protein [Hyperthermus butylicus DSM 5456] Length = 439 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 51/318 (16%), Positives = 91/318 (28%), Gaps = 54/318 (16%) Query: 44 VKAKLHYILDHSLLYTATKILNQE----NGNNGKKQKNDFSYRIIKNIWQTDFRNELREN 99 V+ L D + A ++ N GN +D +I TD + L Sbjct: 148 VRKALETARDVA--QQAKELTNLAMRFTAGNASMLSLDDVIQDVINLARNTDVKVLLEAL 205 Query: 100 GFAQDINNIERSTSLSI---IIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 + R+ + +D ++ V E+ F + L Sbjct: 206 KTIESTEAYIRTRKIRSPRGELDGYELGSDIERVVASELALPTDLFLLKFAERNLLLYKK 265 Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 + K ++LD S SM M + +++ L + Sbjct: 266 VVSEEYGK------FYVLLDKSGSM-------MGMKIIWAKAVALALAQRAIREKREFYI 312 Query: 217 RSGLVTFSSKIVQTFPL-------AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 R F S PL V + E + R+ T T + A + I Sbjct: 313 R----FFDSIPY--PPLYIPKRVHGRDVVKLLEYVARIRANGGTDITRAILTAVDDIATK 366 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 ++ + II +TDGE+ + SL N A ++ + Sbjct: 367 LQRSKVSD---------IILITDGEDKIAIDTIRRSLNKVN------ARLHTV--MISGN 409 Query: 330 DQFLKNCASPDRFYSVQN 347 + L+ A D + Sbjct: 410 NPDLR--AISDSYMVATK 425 >gi|42779770|ref|NP_977017.1| hypothetical protein BCE_0692 [Bacillus cereus ATCC 10987] gi|42735687|gb|AAS39625.1| conserved hypothetical protein [Bacillus cereus ATCC 10987] Length = 627 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|163938547|ref|YP_001643431.1| von Willebrand factor type A [Bacillus weihenstephanensis KBAB4] gi|163860744|gb|ABY41803.1| von Willebrand factor type A [Bacillus weihenstephanensis KBAB4] Length = 627 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|313238221|emb|CBY13313.1| unnamed protein product [Oikopleura dioica] Length = 759 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 63/191 (32%), Gaps = 22/191 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 + LD++ V+D S SM + I + ++V+ G++T+S+ Sbjct: 520 EKPLDILFVVDRSTSMRGENQF----FDASKAWIENFIGEFDL-----SLVKIGVITYSN 570 Query: 226 KIVQTFPLAWG-VQHIQEKINRLIFGS--TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 PL ++ I ++++ + + + + E D Sbjct: 571 NATLDIPLKNHLLEKITKRMDGIELDEKVGSDLSDAANLVRKVVSRRDED--------SD 622 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPD 340 + +I L+D +S + S + V ++G+ + + A D Sbjct: 623 AESIVIILSDFWDSGFEVPEFLSDTPFSALATLRTSVISVGLGEAPNEIIGQITALNQAD 682 Query: 341 RFYSVQNSRKL 351 F Q L Sbjct: 683 NFMKAQTPDDL 693 >gi|256425856|ref|YP_003126509.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256040764|gb|ACU64308.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 588 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 78/223 (34%), Gaps = 22/223 (9%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + T PW + + + ++ S +++ ++DVS SM +KL + Sbjct: 212 NYATCPWAEDHRLLQIAVRGK-SVNLDSLPPSNLVFLIDVSGSMAMP-----NKLPLLQA 265 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 + R +++ ++S V V +G+ ++ I I+ L G T Sbjct: 266 AFRILVNNLRSNDHVAIVAYAGVPG----VILPSTPGSAKSKILNAIDYLSAGGATAGEA 321 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 ++ AY + K + +I TDG+ + + + K G Sbjct: 322 AIKLAYQIAEENFIKEGNNR---------VILATDGDFNVGQTSDHDMEQLILGKKETGV 372 Query: 318 IVYAIGV-QAEAADQFLKNCAS--PDRFYSVQNSRKLHDAFLR 357 ++ +G D L+ +S F + N + F R Sbjct: 373 LLTCLGFGMKNYKDSKLETLSSKGNGNFAYIDNLEEASKIFAR 415 >gi|229182952|ref|ZP_04310185.1| Von Willebrand factor type A domain protein [Bacillus cereus BGSC 6E1] gi|228600576|gb|EEK58163.1| Von Willebrand factor type A domain protein [Bacillus cereus BGSC 6E1] Length = 609 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 405 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 452 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 453 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 512 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 513 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 572 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 573 QLMKNIY--NHHFLVANHAE 590 >gi|167041143|gb|ABZ05903.1| putative protein of unknown function DUF58 [uncultured marine microorganism HF4000_001A02] Length = 291 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 58/156 (37%), Gaps = 17/156 (10%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L + +++DVS S HFG + + + + + N + GL+ F Sbjct: 72 EEERELTVYLMVDVSRS--GHFGT----IDQFKSELAAEIAAVLGFSAIKNKDKVGLILF 125 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 S + + HI + ++F + L+ + + + ++ + D Sbjct: 126 SDHVEKFIAPKKDRSHILRVVREVLFHEPEGTGTSLQSGLDFLMNVAKRKSVVFLISD-- 183 Query: 284 KKYIIFLTDGENSSPNIDNKE----SLFYCNEAKRR 315 FL DG S + NK+ + + A+ + Sbjct: 184 -----FLDDGYWKSLKLANKKHDMIGIRIADPAESK 214 >gi|182625113|ref|ZP_02952890.1| von Willebrand factor type A domain protein [Clostridium perfringens D str. JGS1721] gi|177909733|gb|EDT72159.1| von Willebrand factor type A domain protein [Clostridium perfringens D str. JGS1721] Length = 620 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 60/152 (39%), Gaps = 27/152 (17%) Query: 155 ITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 IT +VK K +D+++++D S SM + D+L + S+ + + I +IP+ Sbjct: 72 ITLTVKGKPKKVTKPVDILLIMDASNSMYYNM----DELKASMNSLVDKV--IDNIPNS- 124 Query: 214 NVVRSGLVTFSSKIVQTFPL--AWGVQHIQEKINRLIFG-------STTKSTPGLEYAYN 264 R +V F +++ + F +E N + T A Sbjct: 125 ---RIAVVAFGTEVEEVFSFNDKNNFTSKEEYKNAIKDSYYYITGRGNTNIEGTWRRA-- 179 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 E ++ + + KK +IF +DG + Sbjct: 180 -----NEIFKNELNNNSNSKKDVIFFSDGYPN 206 >gi|154250522|ref|YP_001411346.1| hypothetical protein Plav_0066 [Parvibaculum lavamentivorans DS-1] gi|154154472|gb|ABS61689.1| conserved hypothetical protein [Parvibaculum lavamentivorans DS-1] Length = 566 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 46/123 (37%), Gaps = 5/123 (4%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 + + +G IS++ L ++ V +VI+T + + L D + L +I N E Sbjct: 16 RYRRDERGVISVMAVGALFLVLAVAMVVIDTGSMLYARRDLQAATDAAALGAVRQIGNAE 75 Query: 68 NGN----NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI-NNIERSTSLSIIIDDQH 122 N + + +++ I+ D R+ D + ++ +I + Sbjct: 76 NAARSILDLNGYSPGDAPQVVTGIYSADPSLAPRDRFVEADGATEASQINAVRVIKYAEA 135 Query: 123 KDY 125 Y Sbjct: 136 PTY 138 >gi|293342302|ref|XP_001059971.2| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] gi|293354106|ref|XP_344486.4| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] gi|149034121|gb|EDL88891.1| similar to hypothetical protein 4933430J11 (predicted) [Rattus norvegicus] Length = 535 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 49/118 (41%), Gaps = 2/118 (1%) Query: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 + +SG V + + +F + + +I++ + GL + + A E+++ Sbjct: 71 LDKSGSVAKNWIYIYSFAEGLVKKFTKNEIHKSLLLLKNIEPQGLTHMQKGLIKANEQIQ 130 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 A I+ LTDG +++ +A++ GAI+Y +GV + Q Sbjct: 131 KSAARGHRAVSVIVALTDGLLLLKPY--LDTMEEAKKARKLGAIIYTVGVFMYSKQQL 186 >gi|58429479|gb|AAW78143.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429539|gb|AAW78173.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 29/222 (13%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPFGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASPD 340 KES + + V+ IG A ++FL C D Sbjct: 170 DSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|47568068|ref|ZP_00238773.1| von Willebrand factor type A domain protein [Bacillus cereus G9241] gi|47555222|gb|EAL13568.1| von Willebrand factor type A domain protein [Bacillus cereus G9241] Length = 627 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 591 QLMKNIY--NHHFLVANHAE 608 >gi|149059383|gb|EDM10390.1| integrin, alpha 2 [Rattus norvegicus] Length = 1140 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 39/289 (13%), Positives = 92/289 (31%), Gaps = 36/289 (12%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS---RYEMPFIFCTFPWCANSS 149 + L+ + ++ I+ + SL + + ++ + C++ S Sbjct: 91 KLNLQNSASISNVTEIKTNMSLGLTLTRNPGTGGFLTCGPLWAHQCGNQYYATGICSDVS 150 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 +TS +D+++V D S S + + + + + Sbjct: 151 PDFQSLTSFSPAVQACPSLVDVVVVCDESNS--------IYPWEAVKNFLEKFVQGLDIG 202 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDA 269 P L+ +++ F L + K + + S T+ G K Sbjct: 203 PKKTQ---VALIQYANDPRVVFNLT----TYKNKEDMVQATSETRQYGGDLTNTFKAIQF 255 Query: 270 KEKLEHIAKGHDDY--KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--- 324 + ++ + K ++ +TDGE+ + + + CN + + I V Sbjct: 256 ARDIAYLPESGGRPGATKVMVVVTDGESHDGSK-LQTVIQQCN---DDEILRFGIAVLGY 311 Query: 325 ---QAEAADQF---LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 A +K AS F++V + L + +G+ + Sbjct: 312 LNRNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAGTLGEHIFS 360 >gi|197118194|ref|YP_002138621.1| hypothetical protein Gbem_1809 [Geobacter bemidjiensis Bem] gi|197087554|gb|ACH38825.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 294 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 20/97 (20%), Positives = 37/97 (38%), Gaps = 10/97 (10%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L++M+++D S S+ A + + ++ + R GL+T S ++ Sbjct: 78 LNLMLLVDRSASLA------CPTPRGAKSRLAAEAAALLALAAAKSNDRVGLITCSDRVE 131 Query: 229 QTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLEY 261 P A G + Q + L G T L+Y Sbjct: 132 SFIPPAKGARQAQRIVASLSCNASSGGGTDLAAALDY 168 >gi|91842240|gb|ABE66385.1| truncated integrin beta1 subunit-like protein 2 [Danio rerio] Length = 613 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 48/153 (31%), Gaps = 33/153 (21%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++K D +D+ ++D+S SM + + + D+ +R Sbjct: 126 TLKFKRAEDYPIDLYFLMDLSHSMLSNLENFKNLGTELAN----------EMKDITKDLR 175 Query: 218 SGLVTF----SSKIVQTFP--------------LAWGVQHIQEKINRLIFGSTTKSTPGL 259 G +F S + FP L ++I++L S+ Sbjct: 176 IGFGSFFRKPSIQTNPCFPDNCIAPFSYFNVLSLTDDHALFTQEISKLKTSGNLDSSEA- 234 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + A + G + + ++F TD Sbjct: 235 --GLEALMHAA--VCTDVIGWRNVTRVLVFFTD 263 >gi|268573216|ref|XP_002641585.1| Hypothetical protein CBG09887 [Caenorhabditis briggsae] gi|187031370|emb|CAP29431.1| hypothetical protein CBG_09887 [Caenorhabditis briggsae AF16] Length = 376 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 74/207 (35%), Gaps = 27/207 (13%) Query: 159 VKISSKSDIGL--DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV 216 K K +G+ +M+V+D S M P + V +++++ L+ + N + Sbjct: 51 TKRPEKIRLGIMRHVMIVIDCSRFMTSKAMPP-SRFVVVMKALQQFLE---RFFEQNPIA 106 Query: 217 RSGLVTFSSK-IVQTFPLAWGVQHIQEKINRLIF---GSTTKSTPGLEYAYNKIFDAKEK 272 + GL+T + + + ++ ++E ++ L G L+ A + Sbjct: 107 QIGLITCKDRKADRLTMMTGNIRILKESLSSLTEAFCGGDFSLQNALQLACVNLKGM--- 163 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--AD 330 ++ +I + + +ID KR AIG+ AE Sbjct: 164 ------PGHVSREVVIVMA----ALSSIDPGNIFSTIETMKRMNIRCSAIGLSAEMFICK 213 Query: 331 QFLKNCASPDRFYSVQNSRKLHDAFLR 357 + K A+ + + LH F + Sbjct: 214 EMAK--ATKGEYSVALDPDHLHLLFSK 238 >gi|84684502|ref|ZP_01012403.1| NorD Nitric oxide reductase activation protein [Maritimibacter alkaliphilus HTCC2654] gi|84667481|gb|EAQ13950.1| NorD Nitric oxide reductase activation protein [Rhodobacterales bacterium HTCC2654] Length = 624 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 71/208 (34%), Gaps = 30/208 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS--- 224 L + +++D S S H G + V ++ + I + D R + FS Sbjct: 432 DLSVAILMDCSRSTEAHIGDR-SVIDVGREALAALAAGIDTAGD-----RLAIWGFSSLR 485 Query: 225 -SKIVQTFPLAWG---VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 ++ + + +I G T+ + +A ++ D Sbjct: 486 RDRVFLHRAKDFDEDMSGEVTARIGGFRPGHYTRLGAAIRHASAQLADEGAAR------- 538 Query: 281 DDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 + ++ LTDG+ + ++S EA G V+ + + A+ D F + Sbjct: 539 ----RLLLVLTDGKPNDLDHYEGQHGIEDSRMAVREASASGQAVHGVIIDADGQDWFARI 594 Query: 336 CASPDRFYSVQNSRKLHDAFLRIGKEMV 363 F + ++ +L A I + + Sbjct: 595 FGRSG-FTLLPHADRLPRALPDIYQTLT 621 >gi|165928916|gb|ABY74499.1| integrin beta 2 [Acropora millepora] Length = 771 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 64/215 (29%), Gaps = 44/215 (20%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 N I +S +I ++ D N+ + S T Sbjct: 63 NHTRNGCQKIANPSSEAITHQNKPLDTNVKVKPQ-----------NITLSLRPGQPATVK 111 Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V + + +D+ ++D+S SM + + + SI + + I N R Sbjct: 112 VDVKMPGNYPVDLYYLMDLSGSMVED----LKRFPTLGDSIAKEITNIT-----RNF-RL 161 Query: 219 GLVTFSSKIVQTF---------------------PLAWGVQHIQEKINRLIFGSTTKSTP 257 G F K + + L + + INR+ + Sbjct: 162 GFGAFVDKPLAPYIETQPGSNPNMVPAFGFVNQLSLTENISEFESVINRVNISGNIDNPE 221 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 G A +I A E + ++ +I +TD Sbjct: 222 GTLDALMQI--AVCDKEIGWRDKQSARRIVIVITD 254 >gi|118601973|ref|YP_908673.1| hypothetical protein P91278ORF_075 [Photobacterium damselae subsp. piscicida] gi|118614711|ref|YP_908494.1| hypothetical protein P99018ORF_083 [Photobacterium damselae subsp. piscicida] gi|134044562|ref|YP_001101800.1| von Willebrand factor type A domain-containing protein [Yersinia ruckeri] gi|134044805|ref|YP_001102190.1| von Willebrand factor type A domain-containing protein [Yersinia pestis biovar Orientalis str. IP275] gi|165938076|ref|ZP_02226636.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|229516181|ref|ZP_04405630.1| hypothetical protein VCC_000196 [Vibrio cholerae RC9] gi|118596802|dbj|BAF38106.1| hypothetical protein P99018ORF_083 [Photobacterium damselae subsp. piscicida] gi|118596982|dbj|BAF38285.1| hypothetical protein P91278ORF_075 [Photobacterium damselae subsp. piscicida] gi|133904925|gb|ABO40942.1| von Willebrand factor type A domain protein [Yersinia ruckeri] gi|133905339|gb|ABO42101.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|165914099|gb|EDR32716.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|229346831|gb|EEO11800.1| hypothetical protein VCC_000196 [Vibrio cholerae RC9] gi|324008185|gb|EGB77404.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] Length = 598 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 64/179 (35%), Gaps = 22/179 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K +K + +++D+S SM G ++ +A + + +++IP VN V Sbjct: 412 KPEAKKRPNTAVHILVDMSSSMAYKAANGKERQDIAREASLAISMALEAIPGVNPAV--- 468 Query: 220 LVTFSSKIVQT-FPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F Q F + +Q + R T + YA ++ +E+ Sbjct: 469 -TFFGGNRNQPVFSVVKHGDTVQNRAGRFGFKATGGTPMAEAMWYAAFELTKTREER--- 524 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 K +I +TDG+ S + + C R V IGV+ A + Sbjct: 525 --------KMLIVVTDGQPQSAPA-CRSVIDLCE---RSDVEVIGIGVETTAVSGLFQK 571 >gi|330688333|gb|AEC32937.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 482 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 50/151 (33%), Gaps = 24/151 (15%) Query: 192 LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR 246 L S+ D I ++ ++R G I + L+ + E Sbjct: 2 LNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSIDKRQALS----KVTELRKS 54 Query: 247 LIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESL 306 TT T L D +K + + + +I +TDG +S +L Sbjct: 55 YSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVILMTDGIPNSKYT----AL 102 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 N+ K+R + IG+ QF + A Sbjct: 103 EVANKLKQRNVNLAVIGIGQGINHQFNRLIA 133 >gi|314922607|gb|EFS86438.1| cobaltochelatase subunit [Propionibacterium acnes HL001PA1] Length = 635 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 54/158 (34%), Gaps = 19/158 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + ++ +L + R L+ Sbjct: 432 RAGRAASCVIFVVDASGSMG-----SRGRMVASKGAVLSLL-----LDAYIKRDRVCLIG 481 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 482 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLVKACEVV-----RPLLLKDPGL 536 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 ++ +TDG ++ +D + + NEA R + Sbjct: 537 RP--LLVLVTDGRSNVG-LDGRPNSRATNEAIRVATKI 571 >gi|218505703|ref|NP_775886.3| von Willebrand factor A domain-containing protein 3A precursor [Homo sapiens] gi|172049040|sp|A6NCI4|VWA3A_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 3A; Flags: Precursor Length = 1184 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 63/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + +L + +R+ D + + ++ +V+ Sbjct: 960 VCILLDTSGSMGPYLQQVKTELVLLIWEQLRKCCDSFNLLSFAESF-----QSWQDTLVE 1014 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 1015 TTDAA--CHEAMQWVTHLQAQGSTSILQALLKAF---------SFHDLEG-------LYL 1056 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + + +R V+ I + AA +FL+ AS R+ Sbjct: 1057 LTDGK---PDTSCSLVLNEVQKLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 1111 >gi|134047208|ref|YP_001101986.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|237810014|ref|YP_002894453.1| hypothetical protein pAR060302_0107 [Escherichia coli] gi|237810210|ref|YP_002894649.1| hypothetical protein pAM04528_0113 [Salmonella enterica] gi|133905127|gb|ABO41142.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|229561694|gb|ACQ77896.1| conserved hypothetical protein [Salmonella enterica] gi|229561869|gb|ACQ78070.1| conserved hypothetical protein [Escherichia coli] gi|327536555|gb|AEA95388.1| hypothetical protein pSD853_174_122 [Salmonella enterica subsp. enterica serovar Dublin] gi|332144451|dbj|BAK19671.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhimurium] Length = 598 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 64/179 (35%), Gaps = 22/179 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K +K + +++D+S SM G ++ +A + + +++IP VN V Sbjct: 412 KPEAKKRPNTAVHILVDMSSSMAYKAANGKERQDIAREASLAISMALEAIPGVNPAV--- 468 Query: 220 LVTFSSKIVQT-FPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F Q F + +Q + R T + YA ++ +E+ Sbjct: 469 -TFFGGNRNQPVFSVVKHGDTVQNRAGRFGFKATGGTPMAEAMWYAAFELTKTREER--- 524 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 K +I +TDG+ S + + C R V IGV+ A + Sbjct: 525 --------KMLIVVTDGQPQSAPA-CRSVIDLCE---RSDVEVIGIGVETTAVSGLFQK 571 >gi|11641062|gb|AAG39436.1|AF296184_1 Sof precursor [Streptococcus pyogenes] Length = 471 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 60/152 (39%), Gaps = 14/152 (9%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V ++ K D G D+M +LDVS M+ D F K+ +++ + + Sbjct: 206 TIDVTVTVTPKEIDEGADVMALLDVSKKMSEDDFNNAKTKIKQLVKTLT---EKNGENHN 262 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGV-QHIQEKINRLIFGS------TTKSTPGLEYAYN 264 N VR L+TF +I ++ +++ +N L + + A Sbjct: 263 SRNSVR--LMTFYREISDPIDISGKTDAELEKLLNELREKAKANYDWGVDLQGAIHKART 320 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 + + K +++I+ + GE++ Sbjct: 321 VFKNENNDENNDNKKKSGKRQHIVLFSQGEST 352 >gi|109732398|gb|AAI15869.1| Vwa2 protein [Mus musculus] Length = 573 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 54/161 (33%), Gaps = 22/161 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 LD++ +LD S S+ IR+ PDV GLV + S++ Sbjct: 310 SLDLVFLLDASASVGRE------NFAQMQSFIRKCTLRFDVNPDVTQ---VGLVVYGSRV 360 Query: 228 VQTFPLAW--GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 F L + +++ + S A I D ++ A+ K Sbjct: 361 QTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAG---TALLHIEDKVMTVQRGARPGVP--K 415 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 ++ LT G +++ + + G V + V A Sbjct: 416 AVVMLTGG------SGAEDAAVPAQKLRGNGISVLVMSVGA 450 >gi|320101673|ref|YP_004177264.1| hypothetical protein Isop_0118 [Isosphaera pallida ATCC 43644] gi|319748955|gb|ADV60715.1| hypothetical protein Isop_0118 [Isosphaera pallida ATCC 43644] Length = 980 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 37/123 (30%), Gaps = 12/123 (9%) Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAK---------GHDDYKKYIIFLTDGENSSPNID 301 T T L YN+ + G +K ++F+TDG+ +S + Sbjct: 764 GGTNYTHPLMLVYNQFSENPGLRTFNPNRPLGDAGGNGRRGAQKLVVFVTDGQPNSGSSA 823 Query: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 E+ N A Y + + A F + +++D I + Sbjct: 824 PFEASLQSNGA---NYSYYRVRFRESGGSSDNVTSAGGGAFNNPTLRAQIYDVTQTICND 880 Query: 362 MVK 364 Sbjct: 881 SEH 883 >gi|302389924|ref|YP_003825745.1| von Willebrand factor type A [Thermosediminibacter oceani DSM 16646] gi|302200552|gb|ADL08122.1| von Willebrand factor type A [Thermosediminibacter oceani DSM 16646] Length = 548 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 49/159 (30%), Gaps = 26/159 (16%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V + G D+ +++D S SM +L A I+ + + R Sbjct: 364 VTVKRYPGRGQDICLIIDASASMAGE------RLRSAK--------ILAKHIVLKSNRRV 409 Query: 219 GLVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 ++ F + + P I ++ + T L+ + + H+ Sbjct: 410 SVLAFKERNVSLHVPFTKNFSTIDAGVSSITSSGLTPLALALDQGLSYMCS-----RHMK 464 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 I+ +TDG + P A++ G Sbjct: 465 NP------LIMLITDGIPTVPMWSTDPVKDAITAAEKIG 497 >gi|119470035|ref|ZP_01612840.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] gi|119446745|gb|EAW28018.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] Length = 1090 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 66/238 (27%), Gaps = 79/238 (33%) Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPLAWGVQHIQEKINRLI 248 ++ VA ++ ++++ I GL+ F S + L + IN L Sbjct: 134 RIDVAKNAMTQLVNDNSDID-------FGLMRFRSNVGGYVVAKLGTDKTSLISDINSLP 186 Query: 249 FGSTTKSTPGLEYAYNKIF----------DAKEKLEHIAKGHDDYKK------------- 285 +T T L AY I ++K + + K Sbjct: 187 ASGSTPMTETLWEAYRYITGQSLDYAFNVSDRDKSADNSVVYTSPFKPNNGDPLRCDNSI 246 Query: 286 YIIFLTDGE---------------NSSPNID------NKESLFYCNEAK----------- 313 +I +TDG+ NS N D + + AK Sbjct: 247 NVILMTDGDPTNDDGRDTSIAQTHNSYFNDDIPFDDGTYDDSYLVAMAKILHGTSDTKVD 306 Query: 314 ---------RRGAIVYAIGV---QAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 G +Y IG ++ L A ++ + + L +A Sbjct: 307 LYTPSTDVLDTG-RLYTIGFGTGMSQGGIDLLDKAARVGGGQYIEARTAEALSEALKN 363 >gi|7453543|gb|AAF62872.1|AF181974_1 serum opacity factor [Streptococcus pyogenes] Length = 1013 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 32/146 (21%), Positives = 64/146 (43%), Gaps = 9/146 (6%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V ++ K D G D+M +LDVS M+ + F K+ +++ + + Sbjct: 210 TIDVTVTVTPKEIDKGADVMALLDVSKKMSKEDFNNAKTKIKQLVKTLT---EKNGENHN 266 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAK 270 N VR L+TF +I ++ + +KI N L + G++ I A+ Sbjct: 267 SRNSVR--LMTFYREISDPIDISGKTEEQLDKILNDLRKKAKANYDWGVDL-QGAIHKAR 323 Query: 271 EKLEHIAKGHDDYKKYIIFLTDGENS 296 E + + +++I+ + GE++ Sbjct: 324 EIFKRDQEKKSGKRQHIVLFSQGEST 349 >gi|313902415|ref|ZP_07835818.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] gi|313467346|gb|EFR62857.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] Length = 170 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 13/55 (23%), Positives = 25/55 (45%) Query: 13 CKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 G++ +LLPVI +GLV++ S +A + D + L ++ + Sbjct: 17 QGGAVGAAFLLLLPVILAALGLVLDGSRLVLTRAHAQAVADFASLAGVQEVDEEA 71 >gi|198429151|ref|XP_002122318.1| PREDICTED: similar to novel EGF domain containing protein [Ciona intestinalis] Length = 7911 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 77/214 (35%), Gaps = 25/214 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + K + +D++++LD S S+ + I +++ K + + G Sbjct: 6756 TVECKGNNKIDLVVLLDTSSSIKSK------NFELIREFIANLINQFKI---GKDGLLVG 6806 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + T+S + + + + + + + ST A I D K Sbjct: 6807 VATYSRSVQNLWEMNKYSDKDSLLRAVRGIPYNGGGTSTGA---AITNITDIKYTELAGR 6863 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + + + LTDG +S D+ Y + K +V A+GV+ Q + Sbjct: 6864 RKSA--QAVTLVLTDGVSS----DDVSGPAYILQQKS---VVIALGVKGANLKQLNEIAT 6914 Query: 338 SPDRFYSVQNSRKLHDAFLRIGKEMVKQRILYNK 371 PD +++ + D I + +V+ ++ Sbjct: 6915 EPDSIFAI--MLQNFDELKGITRTIVEAMCYVDE 6946 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 57/163 (34%), Gaps = 28/163 (17%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 K++ LD++ VLD S S+ + ++ ++ K PD V+ GL + Sbjct: 6992 KANAKLDLVFVLDSSGSVQRV------NFRLVLDFVKSVVSGFKIGPDG---VQVGLARY 7042 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLE-YAYNKIFDAKEKLEH 275 +S+ + W + K L +T++ L A ++ + + Sbjct: 7043 NSRWKKL----WDLNEYSTKDRLLRAISGVDYISGSTRTGVALRSTALTMFGSSQGRRK- 7097 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 D K + +TDG++ + L + G Sbjct: 7098 ------DVKAVTVVVTDGKSWDKVEEPALLLRNKSSVIALGIK 7134 >gi|167034051|ref|YP_001669282.1| TPR repeat-containing protein [Pseudomonas putida GB-1] gi|166860539|gb|ABY98946.1| TPR repeat-containing protein [Pseudomonas putida GB-1] Length = 533 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 51/162 (31%), Gaps = 22/162 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D+S SM+ P +L A ++ +++ R+GL+ ++ Sbjct: 97 LIVAVDLSPSMDADDVPP-SRLAAARHTLHDLV-------MRRGGARTGLIAYAGSAHLV 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + I L +T + D KL K ++ L Sbjct: 149 LPCTEDPGLLDTFIQAL----STDLMSAPGKNVLGVIDEALKLLGAEKSPGT----LLLL 200 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 TDG D + V + V +++ Sbjct: 201 TDG------ADASQFDAIAKRLAGTQLQVLVLAVGSQSTGPL 236 >gi|332225082|ref|XP_003261706.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A domain-containing protein 3A-like [Nomascus leucogenys] Length = 1277 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 64/178 (35%), Gaps = 32/178 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM+ + +L + +R+ D + + L ++ +V+ Sbjct: 1053 VCILLDTSGSMDPYLQQVKTELVLLIWEQLRKRCDSFNLLSFAES-----LQSWQDTLVE 1107 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 T A + + L +T L A+ H +G + Sbjct: 1108 TTDEA--CHEAMQWVTHLQAQGSTSILQALLKAF---------GFHDLEG-------LYL 1149 Query: 290 LTDGENSSPNIDNKESLFYCNEAK-RRGAIVYAIGVQAE--AADQFLKNCAS--PDRF 342 LTDG+ P+ L + +R V+ I + AA +FL+ AS R+ Sbjct: 1150 LTDGK---PDTSCSLVLNEVQRLREKRDVKVHTISLNCSDRAAVEFLRKLASFTGGRY 1204 >gi|14248581|gb|AAK57576.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 493 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNS----KYRALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|149921342|ref|ZP_01909796.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] gi|149817775|gb|EDM77239.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] Length = 739 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 46/122 (37%), Gaps = 10/122 (8%) Query: 250 GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 GS T + GLE ++ + D + + +D + I +TDG+ S+ + D + Sbjct: 438 GSGTFTHRGLELVFDNLLDYQANPPALYPADEDTQYVNILITDGQYSTYSTD-AQVQNAL 496 Query: 310 NEAKRRGAIVYAIGVQ-----AEAADQFLKNCA----SPDRFYSVQNSRKLHDAFLRIGK 360 G+ Y IG EA Q + + + + +L A I + Sbjct: 497 EALLDAGSKTYVIGFGDGLNTTEAQLQLMNMATWGSGGTEMPFDADSQAELELALGAIIE 556 Query: 361 EM 362 ++ Sbjct: 557 DI 558 >gi|33333556|gb|AAQ11894.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNENAIHLYVNIFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|291223805|ref|XP_002731898.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 885 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 65/192 (33%), Gaps = 31/192 (16%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+DVS SM D++ + ++ + + G+V FS+ T Sbjct: 306 VLVMDVSGSMFTD-----DRIHLLVQAATRYIRYTLPLGSW-----LGMVEFSTSSRITL 355 Query: 232 PLA--WGVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL Q ++ I+ L T L + + E E + Sbjct: 356 PLTQITDDQVREDLIDMLPKIVNGHTCIGCALLDGIEVLDRSTEGTEGGR---------L 406 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNCASPDRFYSV 345 +TDG + N + G +V + + EA Q L +Y Sbjct: 407 FLITDGLENYEPFIND----VIGGVIKSGVVVDTLALSDEADPQLTMLSQVTGGRSYYYS 462 Query: 346 QN--SRKLHDAF 355 ++ S LHDAF Sbjct: 463 ESEISTALHDAF 474 >gi|284053419|ref|ZP_06383629.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] Length = 250 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 72/200 (36%), Gaps = 27/200 (13%) Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 W + + + +V+ + + +++LD S SM G +D L V + Sbjct: 20 VMWMVRVKNQNMRLEEAVEFAENPEPRCPCVLLLDTSASMQ---GEPLDGLNV---GLMT 73 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-------LIFGSTTK 254 + + V ++TF +++ + IQ+ + L T Sbjct: 74 FRENLIKDELAKKRVEIAVITFDNQV----------KIIQDFVTADRFEPPMLTAQGQTY 123 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN--IDNKESLFYCNEA 312 + A + I A K E+ G Y+ ++ +TDGE + I + +E Sbjct: 124 MGTAIGEALDMI--ASRKAEYRNNGITYYRPWVFMITDGEPQGESDRITEQAIKRIRDEE 181 Query: 313 KRRGAIVYAIGVQAEAADQF 332 + +A+GV+ ++ Sbjct: 182 ANKQVAFFAVGVEGANMERL 201 >gi|225862598|ref|YP_002747976.1| hypothetical protein BCA_0658 [Bacillus cereus 03BB102] gi|225789196|gb|ACO29413.1| conserved hypothetical protein [Bacillus cereus 03BB102] Length = 626 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 422 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 469 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 470 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 529 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 530 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 589 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 590 QLMKNIY--NHHFLVANHAE 607 >gi|260785923|ref|XP_002588009.1| hypothetical protein BRAFLDRAFT_125403 [Branchiostoma floridae] gi|229273165|gb|EEN44020.1| hypothetical protein BRAFLDRAFT_125403 [Branchiostoma floridae] Length = 948 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 48/136 (35%), Gaps = 19/136 (13%) Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P+ + ++EK+ L G TT P L + D II Sbjct: 529 QPIKDSLATLKEKVCDLSAGGTTALGPALAVCAGVVADKPRSE-------------IILC 575 Query: 291 TDGENSSPNIDNKES----LFYCNEAKRRGAIVYAIGVQAE--AADQFLKNCASPDRFYS 344 TDG + D K+ + A+ ++ IG++ E A +Q A+ + Sbjct: 576 TDGAANVGVGDVKKDPGFYKKIGDFARSHKIVISIIGIEGENVALEQVSAAAATSGGTVN 635 Query: 345 VQNSRKLHDAFLRIGK 360 + + +L +I + Sbjct: 636 ILHPLELVRQIRQIAQ 651 >gi|188583113|ref|YP_001926558.1| hypothetical protein Mpop_3896 [Methylobacterium populi BJ001] gi|179346611|gb|ACB82023.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 473 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 19/169 (11%), Positives = 54/169 (31%), Gaps = 27/169 (15%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQE 67 + G+++++ + + ++G IE + + +L +D ++ + Sbjct: 22 RLRHESDGAVAVIFGLAASTLIGLVGGGIEYARVLSARTQLQSAVDAGVMAGGNALKLVV 81 Query: 68 NGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 + + I + + E + +++ + ++ Sbjct: 82 SSTDS-----------IVGLTTQTIQTEAKAPA----------DVPVTVQVTVAPDKTSV 120 Query: 128 SAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLD 176 A + + F F A + I++ K S + L M+ LD Sbjct: 121 EARAEQVIKLTFGAFVGMA-----SMPISARAKASVVGRMRL-CMLALD 163 >gi|49476847|ref|YP_034881.1| von Willebrand factor type A domain-containing protein [Bacillus thuringiensis serovar konkukian str. 97-27] gi|196035654|ref|ZP_03103057.1| conserved hypothetical protein [Bacillus cereus W] gi|218901817|ref|YP_002449651.1| hypothetical protein BCAH820_0678 [Bacillus cereus AH820] gi|301052272|ref|YP_003790483.1| von Willebrand factor (vWF) type A domain-containing protein [Bacillus anthracis CI] gi|49328403|gb|AAT59049.1| von Willebrand factor (vWF) type A domain protein [Bacillus thuringiensis serovar konkukian str. 97-27] gi|195991621|gb|EDX55586.1| conserved hypothetical protein [Bacillus cereus W] gi|218540167|gb|ACK92565.1| conserved hypothetical protein [Bacillus cereus AH820] gi|300374441|gb|ADK03345.1| von Willebrand factor (vWF) type A domain protein [Bacillus cereus biovar anthracis str. CI] Length = 626 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 422 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 469 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 470 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 529 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 530 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 589 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 590 QLMKNIY--NHHFLVANHAE 607 >gi|52144692|ref|YP_082140.1| hypothetical protein BCZK0534 [Bacillus cereus E33L] gi|51978161|gb|AAU19711.1| conserved hypothetical protein [Bacillus cereus E33L] Length = 626 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 422 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 469 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 470 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 529 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 530 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 589 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 590 QLMKNIY--NHHFLVANHAE 607 >gi|30260777|ref|NP_843154.1| hypothetical protein BA_0623 [Bacillus anthracis str. Ames] gi|47525901|ref|YP_017250.1| hypothetical protein GBAA_0623 [Bacillus anthracis str. 'Ames Ancestor'] gi|49183615|ref|YP_026867.1| hypothetical protein BAS0590 [Bacillus anthracis str. Sterne] gi|65318048|ref|ZP_00391007.1| COG4548: Nitric oxide reductase activation protein [Bacillus anthracis str. A2012] gi|165872961|ref|ZP_02217584.1| conserved hypothetical protein [Bacillus anthracis str. A0488] gi|170689000|ref|ZP_02880200.1| conserved hypothetical protein [Bacillus anthracis str. A0465] gi|170708459|ref|ZP_02898901.1| conserved hypothetical protein [Bacillus anthracis str. A0389] gi|177652877|ref|ZP_02935250.1| conserved hypothetical protein [Bacillus anthracis str. A0174] gi|227816507|ref|YP_002816516.1| hypothetical protein BAMEG_3964 [Bacillus anthracis str. CDC 684] gi|229600632|ref|YP_002865221.1| hypothetical protein BAA_0705 [Bacillus anthracis str. A0248] gi|254684299|ref|ZP_05148159.1| hypothetical protein BantC_10622 [Bacillus anthracis str. CNEVA-9066] gi|254722102|ref|ZP_05183891.1| hypothetical protein BantA1_06497 [Bacillus anthracis str. A1055] gi|254738763|ref|ZP_05196466.1| hypothetical protein BantWNA_26679 [Bacillus anthracis str. Western North America USA6153] gi|254754988|ref|ZP_05207022.1| hypothetical protein BantV_21132 [Bacillus anthracis str. Vollum] gi|254762279|ref|ZP_05214123.1| hypothetical protein BantA9_27682 [Bacillus anthracis str. Australia 94] gi|30254226|gb|AAP24640.1| conserved hypothetical protein [Bacillus anthracis str. Ames] gi|47501049|gb|AAT29725.1| conserved hypothetical protein [Bacillus anthracis str. 'Ames Ancestor'] gi|49177542|gb|AAT52918.1| conserved hypothetical protein [Bacillus anthracis str. Sterne] gi|164711315|gb|EDR16869.1| conserved hypothetical protein [Bacillus anthracis str. A0488] gi|170126580|gb|EDS95465.1| conserved hypothetical protein [Bacillus anthracis str. A0389] gi|170666985|gb|EDT17748.1| conserved hypothetical protein [Bacillus anthracis str. A0465] gi|172081911|gb|EDT66980.1| conserved hypothetical protein [Bacillus anthracis str. A0174] gi|227006795|gb|ACP16538.1| conserved hypothetical protein [Bacillus anthracis str. CDC 684] gi|229265040|gb|ACQ46677.1| conserved hypothetical protein [Bacillus anthracis str. A0248] Length = 626 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 422 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 469 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 470 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 529 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 530 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 589 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 590 QLMKNIY--NHHFLVANHAE 607 >gi|323491534|ref|ZP_08096713.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] gi|323314110|gb|EGA67195.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] Length = 411 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 46/135 (34%), Gaps = 6/135 (4%) Query: 10 FYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG 69 + KG + ++ + + + V L I+ SH KA+L +D + L A + ++ Sbjct: 11 YRRQKGLVLVMVTVAMLALVGVAALAIDVSHAVLNKARLQNSVDAAALAAAIVMDSEGTN 70 Query: 70 NNGKKQKNDFSYRIIKNIWQ--TDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 N + DF + ++ D +++ +D Y Sbjct: 71 AEATTAANTTLTNLASATGNSEMDFSSSTVTVQYSNDPTIFPQTSGYDADLD----TYVR 126 Query: 128 SAVSRYEMPFIFCTF 142 VS Y + F Sbjct: 127 VTVSAYSLDNFFAYL 141 >gi|313233582|emb|CBY09754.1| unnamed protein product [Oikopleura dioica] Length = 620 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 68/204 (33%), Gaps = 29/204 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L+++ +D+S SM + R D K +N TF+ Sbjct: 368 LNIVFSVDISASME-----SVRDFWGWFRGFVRFFDFSKQKISIN--------TFADDAN 414 Query: 229 QTFPLAW-GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 LA QHI + I L + S + + KG + + Sbjct: 415 VILDLAHHDSQHIMDTIFNLQQQGASSSQDS------VLLKGMKTSRVSLKGLEGENSVV 468 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD--RFYSV 345 I TDG NS P +F+ EA + G + ++G+ + ++L A+ + Y V Sbjct: 469 IVFTDGWNSEPKT----PVFHGEEAYKDGINMISVGIGEKLMKEYLYTIATGNQRNVYEV 524 Query: 346 --QNSRKLHDAFLR-IGKEMVKQR 366 + + I KQ Sbjct: 525 KTDELNSITASLQDQICSSTSKQV 548 >gi|221107176|ref|XP_002169824.1| PREDICTED: similar to predicted protein, partial [Hydra magnipapillata] Length = 228 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 68/187 (36%), Gaps = 24/187 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +DM+ +D+S S + L ++IR ++D +P +N GL+T+S Sbjct: 7 VDMVFAMDLSSSSEEI-------LQKQKKAIRSLIDY--HLPSKSNE--LGLITYSDIAN 55 Query: 229 QTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + + + + +N G + A IF + E + K ++ Sbjct: 56 VNSELTSKFNSEILDKIVNN---GRRQNVASAITVASENIFKIRNNDEKLKKK----QRV 108 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP--DRFYS 344 ++ G+ SS N Y + IG+ + D F KN ++ Sbjct: 109 LVLFVVGKPSS-NYPPVVPERYGKLLEENNVKTIIIGLD-DVGDDFNKNIPGSKLSKYSE 166 Query: 345 VQNSRKL 351 + +L Sbjct: 167 NDTANEL 173 >gi|13399532|pdb|1DZI|A Chain A, Integrin Alpha2 I Domain COLLAGEN COMPLEX Length = 185 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 69/203 (33%), Gaps = 41/203 (20%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++ Sbjct: 3 IDVVVVCDESNS--------IYPWDAVKNFLEKFVQGLDIGPTKTQ---VGLIQYANNPR 51 Query: 229 QTFPLAWGVQHIQEKI------NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L +E++ G T + ++YA + A + G Sbjct: 52 VVFNL--NTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAA------SGGRRS 103 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFLK-- 334 K ++ +TDGE+ ++ K + CN + + I V A +K Sbjct: 104 ATKVMVVVTDGESHDGSM-LKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEI 159 Query: 335 ----NCASPDRFYSVQNSRKLHD 353 + + F++V + L + Sbjct: 160 KAIASIPTERYFFNVSDEAALLE 182 >gi|332817222|ref|XP_516539.3| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 [Pan troglodytes] Length = 1053 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 45/108 (41%), Gaps = 21/108 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+++++DVS SM +L +A +++ +LD + N ++ ++ ++ Sbjct: 256 DVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYNEELHY 303 Query: 230 TFPLAWGV---------QHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 P G +H +E +++L L A+N + D Sbjct: 304 VEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSD 351 >gi|322435057|ref|YP_004217269.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162784|gb|ADW68489.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 393 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 51/182 (28%), Gaps = 34/182 (18%) Query: 209 IPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + ++T+ + + + +N L + T + Y + Sbjct: 169 YHSLRPDDYIAVITYDLRTHILTDFTNNKDTVAQSLNSLQIPGFSD-TNMFDALYETLDR 227 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-- 326 KYII + G ++ + + L +++IG A Sbjct: 228 TSRIEGR---------KYIILIGSGRDTFSKLTLDKMLAKIKATPN--VTIFSIGTGALA 276 Query: 327 ----EAADQF--------------LKNCAS--PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 +A Q LK A+ ++ L D F +I + Q Sbjct: 277 QELGDARGQIGGIGRMNILQAQNQLKTFATMTGGLYFDPMFQGALPDIFSQINDSIRNQY 336 Query: 367 IL 368 IL Sbjct: 337 IL 338 >gi|256086242|ref|XP_002579311.1| hypothetical protein [Schistosoma mansoni] gi|238664738|emb|CAZ35550.1| expressed protein [Schistosoma mansoni] Length = 1678 Score = 41.0 bits (94), Expect = 0.27, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 45/139 (32%), Gaps = 29/139 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++++D S SM+ H L + I E L K V F+ + Sbjct: 1387 IVILIDTSGSMDAHLNEIKTYLKLL---IWEQLHKNK-------------VFFNMVSFSS 1430 Query: 231 FPLAWGVQHI-----------QEKINRLIFGSTTKSTPGLEYAYNKI--FDAKEKLEHIA 277 L W + I++L T + + N + ++ E Sbjct: 1431 TLLEWQASGVVMADEVYCHDAIAWIDQLKATGGTCTGEAILCGLNHLQIMNSSNIKEMYD 1490 Query: 278 KGHDDYKKYIIFLTDGENS 296 ++ K I +TDG+ Sbjct: 1491 VNFNEMSKGIYLITDGKPD 1509 >gi|14248643|gb|AAK57607.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.0 bits (94), Expect = 0.27, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|88704963|ref|ZP_01102675.1| TPR (repeat) domain protein [Congregibacter litoralis KT71] gi|88700658|gb|EAQ97765.1| TPR (repeat) domain protein [Congregibacter litoralis KT71] Length = 594 Score = 41.0 bits (94), Expect = 0.27, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 57/170 (33%), Gaps = 30/170 (17%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM--NDHFGP 187 R +P + + ++ P + K D +++VLD+S SM D Sbjct: 44 KGRSGLPALLAAWIIAVLAAAGPSWQQLPQPVLQKQDA---LVLVLDLSYSMLATDLQPS 100 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL 247 D++ R++LD+++ + + L+ ++ PL I + L Sbjct: 101 RQDRVR------RKLLDLLRERREGL----TALIAYAGDAHIVAPLTDDNPTIANLLPAL 150 Query: 248 IFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 + +E A + A + I+ +TDG Sbjct: 151 TPEMMPLPGSNPVEAVERALALLDSAGVRRGR-----------ILLVTDG 189 >gi|328555379|gb|AEB25871.1| hypothetical protein BAMTA208_18605 [Bacillus amyloliquefaciens TA208] Length = 226 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 75/231 (32%), Gaps = 29/231 (12%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD---- 204 L + S V + K ++ ++ D S SM G G K+ VA S+ Sbjct: 11 GLLTLSLGSPVFAAEKKHEETNVAVLFDASGSMIQKTG-GERKIDVAKDSVTSFAKVLPE 69 Query: 205 ----IIKSIPDVNNVVRSG-LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +++ N SG V+ ++ + V ++ ++++ T L Sbjct: 70 DTNLMLRVFGHKGNNKNSGKAVSCNATETLYGLQPYAVTPFEQSLSQIKPTGWTPIAKAL 129 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA-I 318 + E + K + +TDGE + + ++ Sbjct: 130 ----------SDTREEFERAGAKGKNVVYLITDGEETCGG----NPQAEIQKLRKANVNT 175 Query: 319 VYAI---GVQAEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + I + ++ K A + S ++ + A+ K++ K+ Sbjct: 176 IVNIIGFNFGMKGSESLEKAAEAGGGTYVSADSAAEFKQAWEDAAKDLAKE 226 >gi|308175401|ref|YP_003922106.1| hypothetical protein BAMF_3510 [Bacillus amyloliquefaciens DSM 7] gi|307608265|emb|CBI44636.1| putative exported protein [Bacillus amyloliquefaciens DSM 7] gi|328913750|gb|AEB65346.1| hypothetical protein LL3_03820 [Bacillus amyloliquefaciens LL3] Length = 229 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 75/231 (32%), Gaps = 29/231 (12%) Query: 149 SHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD---- 204 L + S V + K ++ ++ D S SM G G K+ VA S+ Sbjct: 14 GLLTLSLGSPVFAAEKKHEETNVAVLFDASGSMIQKTG-GERKIDVAKDSVTSFAKVLPE 72 Query: 205 ----IIKSIPDVNNVVRSG-LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +++ N SG V+ ++ + V ++ ++++ T L Sbjct: 73 DTNLMLRVFGHKGNNKNSGKAVSCNATETLYGLQPYAVTPFEQSLSQIKPTGWTPIAKAL 132 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA-I 318 + E + K + +TDGE + + ++ Sbjct: 133 ----------SDTREEFERAGAKGKNVVYLITDGEETCGG----NPQAEIQKLRKANVNT 178 Query: 319 VYAI---GVQAEAADQFLKNC-ASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + I + ++ K A + S ++ + A+ K++ K+ Sbjct: 179 IVNIIGFNFGMKGSESLEKAAEAGGGTYVSADSAAEFKQAWEDAAKDLAKE 229 >gi|317056370|ref|YP_004104837.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315448639|gb|ADU22203.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 554 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 47/303 (15%), Positives = 100/303 (33%), Gaps = 37/303 (12%) Query: 65 NQENGNNGKKQKNDFSYRIIKNI--WQTDFRNELRENGFAQDINNIERSTSLSIIIDDQ- 121 ++E ++ +++Y + + + NI R + ID Sbjct: 67 SEEAISDMDINTEEYNYYAENSYLSVAEHPLSTFSTDVDTASFTNIRRMIENNQNIDPNA 126 Query: 122 --HKDYNLSAVSRYEMP-------FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMM 172 +++ YE P N+ + I K DI +++ Sbjct: 127 VRTEEFINYFKYNYEYPDGDDKIAITTELSDCPWNADAKLMQIGLQAKDIDVQDIDSNIV 186 Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK--IVQT 230 ++DVS SM D +KL + ++ + ++ R +VT++ + I Sbjct: 187 FLIDVSGSMADE-----NKLPLVVQAFAML------AENLGENDRISIVTYAGRDTIELE 235 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + I + L G +T G+ AY ++ KG ++ +I Sbjct: 236 GESGANYETIASTLAGLTAGGSTAGAAGINTAYEL------AEKYFIKGGNNR---VILA 286 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQN 347 TDG+ + +E E + +G + +G D L+ A + + + Sbjct: 287 TDGDLNVGLSSEEELKALIEEKRDKGVFLSVLGFGMGNYKDNRLEALADNGNGNYAYIDS 346 Query: 348 SRK 350 + Sbjct: 347 VDE 349 >gi|260881326|ref|ZP_05404133.2| putative von Willebrand factor type A domain protein [Mitsuokella multacida DSM 20544] gi|260849116|gb|EEX69123.1| putative von Willebrand factor type A domain protein [Mitsuokella multacida DSM 20544] Length = 428 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 47/141 (33%), Gaps = 8/141 (5%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G+I +LTA LLP I G+ ++ + +++L D + L A + Sbjct: 6 RGQQGAILVLTAFLLPFIIAFTGMAVDFGSAYVRRSQLQNAADAAALAGAYHL------- 58 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAV 130 Q +D + +K F + + G + D+ S Sbjct: 59 -DDNQADDVVLKYLKTNLDPHFTSYSYQTGDDFPDKFETLNYHTDKQKDELDVTLRSSVE 117 Query: 131 SRYEMPFIFCTFPWCANSSHA 151 + + F T P A + Sbjct: 118 ASFLKLFDIDTIPVYATAKAK 138 >gi|270208337|ref|YP_003329114.1| Bee1 [Enterococcus faecalis] gi|72388797|gb|AAZ68037.1| Bee1 [Enterococcus faecalis] Length = 1083 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 72/233 (30%), Gaps = 43/233 (18%) Query: 91 DFRNELRENGFAQDINNIERST-SLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSS 149 +N + D N ++ ++ D+ + Y++ + I Sbjct: 268 MIKNYNYGSAGDSDSNVEIQNILEGNLNFDNGYHAYDVGNNQNINLKKIVIPTDNPTQFQ 327 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 +I S+K + +D+ V+D S SM ++ D+ ++ + Sbjct: 328 IQLDIIGGSLK----TRKNVDVAFVVDKSGSMANN-----DRWKNLKSALNTFAHGL-LD 377 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAW------------------GVQHIQEKINRLI--- 248 + V+ G+ FSS T + ++ + Sbjct: 378 DNPGGSVQLGIAGFSSVQETTTYFPYYKINVPYGKVGNFGGTQGNYTGFTTSVDAFLNHA 437 Query: 249 -----FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 T + GL+ + +++ G D +K +I LTDG + Sbjct: 438 LLNESPSGGTPTFLGLDAGLELLTNSQYN------GRKDAQKVLIILTDGLPT 484 >gi|14248625|gb|AAK57598.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248627|gb|AAK57599.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248619|gb|AAK57595.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248621|gb|AAK57596.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248623|gb|AAK57597.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248705|gb|AAK57638.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|315174683|gb|EFU18700.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX1346] Length = 657 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVHVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|315150240|gb|EFT94256.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0012] Length = 711 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 73/186 (39%), Gaps = 28/186 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ +D+++V+D S SM + L + E+ D + + VR G+V + Sbjct: 139 QTESPIDLVLVIDYSSSMKGE--KLNNALNGLQQFGEELSDSLT-----DGHVRIGIVAY 191 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 + T + + +++ L + S ++ + + + L ++ + Sbjct: 192 NRLTYSTADFSTDMNDLEDF---LRTTAEPHSGTFMQKG---LLEGQRLLAEKSRPN--A 243 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV---------YAIGVQAEAADQFLK 334 KK ++ + DG ++ + + + Y N G I+ Y Q E+ + Sbjct: 244 KKMLVHIGDGSANASFLPRENAQSYPN----NGEIIDYNGYHTSSYMEEFQTESNQYYTS 299 Query: 335 NCASPD 340 N AS D Sbjct: 300 NSASTD 305 >gi|229051629|ref|ZP_04195099.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] gi|228721740|gb|EEL73214.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] Length = 452 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 64/195 (32%), Gaps = 12/195 (6%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 L++ ++LD S SM + K+ A ++I LD I +V V + + Sbjct: 148 KAKSLNVEILLDASGSMAGKVNGEV-KMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNN 206 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE-YAYNKIFDAKEKLEHIAK--GHD 281 L+ G + + + + + A E + K + Sbjct: 207 EN---DKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNADFKEYTGE 263 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA-EAADQFLKNCASP- 339 + + ++DGE + + + + IG + Q LKN A Sbjct: 264 ENLNVVYIVSDGEETCGGDPVNAAKNLNQSSTHAVVNI--IGFDVKNSEQQQLKNTAEAG 321 Query: 340 -DRFYSVQNSRKLHD 353 + +V + +L+ Sbjct: 322 KGNYATVSTADELYQ 336 >gi|227112734|ref|ZP_03826390.1| putative hemagglutinin/hemolysin-related protein [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 1159 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 77/255 (30%), Gaps = 26/255 (10%) Query: 69 GNNGKKQKNDFSYRIIKNIWQTDF---RNELRENGFAQDINNIERSTSLSIIIDDQHKDY 125 NN K D IIKN + L A N + ++TS I D Y Sbjct: 532 INNVKYTVGDDGSYIIKNANNAQTLAGKITLDVPVDAGKFNVVAQATSTEIANHDTATGY 591 Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 + V +Y M + + ++I + ++ ++D S SM+ Sbjct: 592 SAGGVEQYGMSIGTIGDDTMSGTHSHDVMIADVSGLQIIEGQNYNIAFMVDSSGSMSS-- 649 Query: 186 GPGMDKLGVATRSIREMLDIIKSIPDVNN--VVRSGLVTFSSKIVQTFPLAWG------- 236 + A S+ + + + N V L F +++ +T + Sbjct: 650 ----TDIDNARTSLSNVFKSLINSAGGANSGTVNVFLADFDTQVGKTVSVNLNDTNALNK 705 Query: 237 -VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGEN 295 + + G T + N ++ + Y F+TDGE Sbjct: 706 LTAVLNSMVGGSSAGGGTNYEDVFKTTANWFQSDV--VKKNVGNNLTY-----FITDGEP 758 Query: 296 SSPNIDNKESLFYCN 310 + + + + Sbjct: 759 TYYQTNEANEVRVSD 773 >gi|218690185|ref|YP_002398397.1| hypothetical protein ECED1_2477 [Escherichia coli ED1a] gi|218427749|emb|CAR08536.1| conserved hypothetical protein [Escherichia coli ED1a] Length = 378 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|90410253|ref|ZP_01218270.1| transporter [Photobacterium profundum 3TCK] gi|90329606|gb|EAS45863.1| transporter [Photobacterium profundum 3TCK] Length = 564 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 67/206 (32%), Gaps = 25/206 (12%) Query: 134 EMPFIFCTF--PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 ++P T P S +M+V+DVS SM + Sbjct: 63 QLPLKILTIVIAVSIIICAGPTWQRQSSPFGEDKSP---VMIVMDVSQSMLQT-DTAPSR 118 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 L + + ++L + + R+GL+ ++ PL + + +N + Sbjct: 119 LSRTKQKVEDLLTM-------RDGGRTGLIVYAGSAHLALPLTQDIDIYKPLLNAVTPKI 171 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN- 310 + ++A + +++ + + TD N+ + N + Sbjct: 172 MPRDG---KFAEYALPLVEQQFASNQRDNAQP-------TDASNTDTSKTNTSEQVTQDP 221 Query: 311 EAKRRG-AIVYAIGVQAEAADQFLKN 335 A G +++ G+ A + F + Sbjct: 222 PASIAGTVVLFTDGMGANTVEAFQQY 247 >gi|189346545|ref|YP_001943074.1| cobaltochelatase subunit [Chlorobium limicola DSM 245] gi|189340692|gb|ACD90095.1| cobaltochelatase subunit [Chlorobium limicola DSM 245] Length = 653 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 70/218 (32%), Gaps = 21/218 (9%) Query: 83 IIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN-LSAVSRYEMPFIFCT 141 +++I R R +G ++ + + D L A R PF Sbjct: 383 RVRSIVTPKDRKLRRGSGKRSRSRVSQKQGRYTRSTMPRGTDDIALDATLRAAAPFQR-- 440 Query: 142 FPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE 201 + N L + + +G ++ V+D S SM ++ + +I Sbjct: 441 YRLNPNGMAVVLQNEDIREKIREKRLGNLLIFVVDASGSMG-----ARGRMAASKGAIMS 495 Query: 202 MLDIIKSIPDVNNVVRSGLVTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLE 260 +L + + +V+F + V P+ ++ + + G T + GL Sbjct: 496 LL-----LDAYQKRDKLAMVSFRKEGAVVNLPVTSSIELAARLLRDMPVGGRTPFSAGLV 550 Query: 261 YAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 Y + K +I +TDG+ + Sbjct: 551 KGYEIAMNYLRKEPQGRP-------LVILVTDGKANRS 581 >gi|229159703|ref|ZP_04287713.1| Von Willebrand factor type A domain protein [Bacillus cereus R309803] gi|228623782|gb|EEK80598.1| Von Willebrand factor type A domain protein [Bacillus cereus R309803] Length = 627 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 65/200 (32%), Gaps = 22/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 423 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 470 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 530 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 531 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 590 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN F ++ Sbjct: 591 QLMKN-IYNHHFLIANHAED 609 >gi|332237901|ref|XP_003268145.1| PREDICTED: integrin alpha-10 isoform 3 [Nomascus leucogenys] Length = 1032 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 31 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 79 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 80 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 137 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 138 VVTDGESHDGEQLPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 193 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 194 PDERFFFNVTDEAALTDIVDALGDRIF 220 >gi|332237899|ref|XP_003268144.1| PREDICTED: integrin alpha-10 isoform 2 [Nomascus leucogenys] Length = 1036 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 35 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 83 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 84 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 141 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 142 VVTDGESHDGEQLPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 197 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 198 PDERFFFNVTDEAALTDIVDALGDRIF 224 >gi|332237897|ref|XP_003268143.1| PREDICTED: integrin alpha-10 isoform 1 [Nomascus leucogenys] Length = 1175 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 174 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 222 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 223 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 280 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 281 VVTDGESHDGEQLPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 336 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 337 PDERFFFNVTDEAALTDIVDALGDRIF 363 >gi|226329024|ref|ZP_03804542.1| hypothetical protein PROPEN_02926 [Proteus penneri ATCC 35198] gi|225202210|gb|EEG84564.1| hypothetical protein PROPEN_02926 [Proteus penneri ATCC 35198] Length = 227 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 62/167 (37%), Gaps = 13/167 (7%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 S+ +++VLD S SM +G + +L ++ + +K+ VR ++ + Sbjct: 15 NSEQRTPLILVLDSSGSM---YGQPIQQLN---EGLKLLEQELKNDVIAAKRVRILVIEY 68 Query: 224 SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 T W + L TT + A +I K++ + + Sbjct: 69 GGYDQCTIHGDW-KDAMDFTAPVLEANGTTPMGQAITLALEEIEAEKQRFKQAGVAYTRP 127 Query: 284 KKYIIFLTDGENSSPNIDNKESLFYCNEAKRR-GAIVYAIGVQAEAA 329 ++ ++DG P +++ C +A+ V+ I V +A Sbjct: 128 --WLFLMSDGV---PTDQWEQAAQLCRQAEESQKTAVFPIMVDGASA 169 >gi|281358203|ref|ZP_06244686.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281315293|gb|EFA99323.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 232 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 64/187 (34%), Gaps = 39/187 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL---------V 221 ++ ++DVS SM G +L V R +R + + + R G Sbjct: 62 VVFLVDVSGSMGAVTPEGGSRLDVMKRELRRAVGSAVASAN-----RIGAPKEAGNFRVW 116 Query: 222 TFSSKIVQTFPLA-------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 FSS + L V+ + + L G +T + A+ KI E + Sbjct: 117 AFSSGLQLFPDLEPCGFRDRSAVERLNRFVGALGAGGSTN----MLMAWRKIL---ELTK 169 Query: 275 HIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK 334 H + FL+DG+ S + + + + K +AIG+ L+ Sbjct: 170 HGQLDT------VYFLSDGDPSDCSAE-ELTRLLTRLPKDVTVHCFAIGL----DSSLLR 218 Query: 335 NCASPDR 341 A+ Sbjct: 219 EIAAAHN 225 >gi|41054731|ref|NP_032422.2| integrin alpha-2 precursor [Mus musculus] gi|40781666|gb|AAH65139.1| Integrin alpha 2 [Mus musculus] gi|148686423|gb|EDL18370.1| integrin alpha 2 [Mus musculus] Length = 1178 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 41/289 (14%), Positives = 97/289 (33%), Gaps = 36/289 (12%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS---RYEMPFIFCTFPWCANSS 149 + L+ + ++ I+ + SL + + ++ + C++ S Sbjct: 91 KLNLQNSASISNVTEIKTNMSLGLTLTRNPGTGGFLTCGPLWAHQCGNQYYATGICSDVS 150 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 +TS +D+++V D S S + + + + + Sbjct: 151 PDFQFLTSFSPAVQACPSLVDVVVVCDESNS--------IYPWEAVKNFLVKFVTGLDIG 202 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFD 268 P L+ ++++ + + + + K + + S T+ G L + I Sbjct: 203 PKKTQ---VALIQYANEPR----IIFNLNDFETKEDMVQATSETRQHGGDLTNTFRAIEF 255 Query: 269 AKEKLEHIAKGHDD-YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--- 324 A++ G K ++ +TDGE+ + K + CN + + I V Sbjct: 256 ARDYAYSQTSGGRPGATKVMVVVTDGESHDGSK-LKTVIQQCN---DDEILRFGIAVLGY 311 Query: 325 ---QAEAADQF---LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 A +K AS F++V + L + +G+++ Sbjct: 312 LNRNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAGTLGEQIFS 360 >gi|13242646|ref|NP_077661.1| EsV-1-176 [Ectocarpus siliculosus virus 1] gi|13177446|gb|AAK14590.1|AF204951_175 EsV-1-176 [Ectocarpus siliculosus virus 1] Length = 254 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 71/224 (31%), Gaps = 45/224 (20%) Query: 144 WCANSSHAPLLITSSVKISSKSDI------GLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 + + P +I+ + S+ ++++D S SM G + A + Sbjct: 25 FALEARKNPEVISQMAETPSEKSALEKEALARQYVLLIDRSGSMGWADGS-TTRWESARK 83 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 ++ ++++ + R + F K+ + + TT Sbjct: 84 AVEKLVEAAFIYDTDH---RVPVYLFDDKVEFVGECT-SSSQVVDVFKNYQPRGTTDLAQ 139 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 LE A + K + G +I+ L DG C + K Sbjct: 140 CLEVAMEEYAGRKRPNYEVCPGTT----FIVVL-DG---------------CADDK---- 175 Query: 318 IVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 +A + L+ + P Y V N +L +FL+I + Sbjct: 176 ---------DAVKRVLRKFSDPVSGY-VANHTQLAISFLQIADD 209 >gi|119775308|ref|YP_928048.1| TPR domain-containing protein [Shewanella amazonensis SB2B] gi|119767808|gb|ABM00379.1| TPR domain protein [Shewanella amazonensis SB2B] Length = 701 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 56/160 (35%), Gaps = 18/160 (11%) Query: 148 SSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIK 207 + P + + + S + +V+D+S+SM P T++ +D+I Sbjct: 71 ALSGPAITKEPMPVFSAGAARV---LVMDMSISMYATDLPP----NRLTQARFRAIDLIN 123 Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 + D + + LV ++ + PL + + L+ + P L + Sbjct: 124 QLSDGD----TALVAYAGEAFVISPLTRD----KATLLNLLPSLSPDIMPLLGSSPAAGI 175 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLF 307 L I+ +TDG ++ D K SL Sbjct: 176 SLARDL---LTQGGHPDGDIVLMTDGLDAVDVADVKRSLK 212 >gi|332092683|gb|EGI97753.1| von Willebrand factor type A domain protein [Shigella boydii 5216-82] Length = 378 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G + + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKLLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|320177873|gb|EFW52858.1| Mg-chelatase subunit ChlD [Shigella boydii ATCC 9905] Length = 378 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 62/191 (32%), Gaps = 44/191 (23%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G + + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKLLGLAA--------LDSTATP--CYDRDTAQ 351 Query: 350 KLHDAFLRIGK 360 L + +I Sbjct: 352 ALVNVGAQIAA 362 >gi|297279798|ref|XP_002801794.1| PREDICTED: integrin alpha-10-like isoform 4 [Macaca mulatta] gi|297279800|ref|XP_002801795.1| PREDICTED: integrin alpha-10-like isoform 5 [Macaca mulatta] Length = 1024 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 23 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 71 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 72 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 129 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 130 VVTDGESHDGEELPA-ALKTCEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 185 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 186 PDERFFFNVTDEAALTDIVDALGDRIF 212 >gi|297279796|ref|XP_002801793.1| PREDICTED: integrin alpha-10-like isoform 3 [Macaca mulatta] Length = 1036 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 35 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 83 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 84 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 141 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 142 VVTDGESHDGEELPA-ALKTCEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 197 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 198 PDERFFFNVTDEAALTDIVDALGDRIF 224 >gi|297279792|ref|XP_002801791.1| PREDICTED: integrin alpha-10-like isoform 1 [Macaca mulatta] gi|297279794|ref|XP_002801792.1| PREDICTED: integrin alpha-10-like isoform 2 [Macaca mulatta] Length = 1167 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 166 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKTCEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|78355641|ref|YP_387090.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78218046|gb|ABB37395.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 686 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 49/141 (34%), Gaps = 18/141 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++V+D+S SM++ G + + L S ++ + + TF T Sbjct: 224 VLVVIDMSYSMSESISDGNGGTQARYEAAAKALSDFISKYAGHDNINLAVTTF-GYTTLT 282 Query: 231 FPLAWGVQH------IQEKINRL------IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 P+ + + I ++ L + T +E A + H Sbjct: 283 GPVTIDLTNPDMATVISNALHALGIDQPPVNPQGTNFDAAMETGREWFDAANQDPGHAG- 341 Query: 279 GHDDYKKYIIFLTDGENSSPN 299 YK IF+TDG + N Sbjct: 342 ----YKNVAIFVTDGAPTVHN 358 >gi|332809817|ref|XP_003308326.1| PREDICTED: integrin alpha-10 [Pan troglodytes] Length = 1024 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 23 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 71 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 72 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 129 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 130 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 185 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 186 PDERFFFNVTDEAALTDIVDALGDRIF 212 >gi|332809815|ref|XP_001161858.2| PREDICTED: integrin alpha-10 isoform 1 [Pan troglodytes] Length = 1036 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 35 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 83 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 84 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 141 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 142 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 197 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 198 PDERFFFNVTDEAALTDIVDALGDRIF 224 >gi|223043129|ref|ZP_03613176.1| conserved hypothetical protein [Staphylococcus capitis SK14] gi|222443340|gb|EEE49438.1| conserved hypothetical protein [Staphylococcus capitis SK14] Length = 629 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/300 (13%), Positives = 95/300 (31%), Gaps = 33/300 (11%) Query: 37 ETSHKFFVKAK-LHYILDH--SLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFR 93 + + K K LDH + E N K + ++I Sbjct: 313 DMTDMMTKKGKGSQNTLDHDEGGFIGQNQAFALEGINKNVKVEWKVPNIQPQHILD---- 368 Query: 94 NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL 153 + +N +I ++ + +I + Q + +NL+ R + I + Sbjct: 369 YQHSKNDVQFEIKDLIQIIKKTIDREHQDERHNLT-KGRLQKDLINWFIDDQYKLFYKKQ 427 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 ++ + + +++D S SM+ DK+ + + + +KS+ + Sbjct: 428 DLSKTFDAT--------FTLLVDASASMH-------DKMDETIKGVVLFHETLKSLNIKH 472 Query: 214 NVVRSGLVTFSSKIVQT--FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + + F+ + + I + + N+ A Sbjct: 473 EI-----LAFNEDAFEADDREQPNIIDEIINYNYSIFEKEGPRIMSLEPQDDNRDGVAIR 527 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGVQAEA 328 ++++I +DGE S+ N ++ A++ G V+ + + EA Sbjct: 528 IASERLLQRIHQQRFLIVFSDGEPSAFNYSQDGIIDTYEAVETARKFGIEVFNVFLSQEA 587 >gi|194385372|dbj|BAG65063.1| unnamed protein product [Homo sapiens] Length = 1095 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 132 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 180 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 181 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 238 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 239 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 294 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 295 PDERFFFNVTDEAALTDIVDALGDRIF 321 >gi|194385334|dbj|BAG65044.1| unnamed protein product [Homo sapiens] Length = 1036 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 35 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 83 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 84 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 141 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 142 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 197 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 198 PDERFFFNVTDEAALTDIVDALGDRIF 224 >gi|187954851|gb|AAI40832.1| ITGA10 protein [Homo sapiens] gi|219519694|gb|AAI44638.1| ITGA10 protein [Homo sapiens] Length = 1024 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 23 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 71 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 72 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 129 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 130 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 185 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 186 PDERFFFNVTDEAALTDIVDALGDRIF 212 >gi|150170950|emb|CAO03559.1| collagen type XXI alpha 1 [Homo sapiens] Length = 136 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 55/139 (39%), Gaps = 13/139 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 F ++ L V+ S ++ D++ +LD S S+ + K Sbjct: 5 ITFLCMVLVLLLQNSVLAEDGEVRSSCRT-APTDLVFILDGSYSVGPENFEIVKKW---- 59 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL-AWGV-QHIQEKINR-LIFGSTT 253 +++I K+ ++ G+V +S V PL ++ +H+ + L G T Sbjct: 60 -----LVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNT 114 Query: 254 KSTPGLEYAYNKIFDAKEK 272 K+ +++A + +F + Sbjct: 115 KTGKAIQFALDYLFAKSSR 133 >gi|119591830|gb|EAW71424.1| integrin, alpha 10, isoform CRA_a [Homo sapiens] Length = 1177 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 166 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|114558438|ref|XP_514418.2| PREDICTED: integrin alpha-10 isoform 2 [Pan troglodytes] Length = 1167 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 166 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|7385003|gb|AAF61638.1|AF172723_1 integrin alpha 10 subunit [Homo sapiens] Length = 517 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 5 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 53 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 54 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 111 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 112 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 167 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 168 PDERFFFNVTDEAALTDIVDALGDRIF 194 >gi|3420888|gb|AAC31952.1| integrin subunit alpha 10 precursor [Homo sapiens] Length = 1167 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 166 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|38569398|ref|NP_003628.2| integrin alpha-10 precursor [Homo sapiens] gi|115502407|sp|O75578|ITA10_HUMAN RecName: Full=Integrin alpha-10; Flags: Precursor gi|6650628|gb|AAF21944.1|AF112345_1 integrin alpha 10 subunit [Homo sapiens] gi|119591831|gb|EAW71425.1| integrin, alpha 10, isoform CRA_b [Homo sapiens] gi|182887771|gb|AAI60008.1| Integrin, alpha 10 [synthetic construct] gi|189055316|dbj|BAG36921.1| unnamed protein product [Homo sapiens] Length = 1167 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ P + R + ++ + I GLV + V Sbjct: 166 MDVVIVLDGSNSI----YPWSEVQTFLRRLVGKLFIDPEQIQ-------VGLVQYGESPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|17230314|ref|NP_486862.1| hypothetical protein alr2822 [Nostoc sp. PCC 7120] gi|17131916|dbj|BAB74521.1| alr2822 [Nostoc sp. PCC 7120] Length = 224 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 53/151 (35%), Gaps = 17/151 (11%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + L + V+ + + +++LD S SM G ++ L S+++ L Sbjct: 2 NDTLTLDEVVEFAENPEPRCPCVLLLDTSGSMQ---GAAIEALNQGLLSLKDELMKNSIA 58 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIF 267 V +VTF S I ++ N L T G+ A + + Sbjct: 59 ARR---VEIAIVTFDSHINVIQDF-----VTADQFNPPILTAQGLTSMGAGIHKALDMV- 109 Query: 268 DAKEKLEHIAKGHDDYKKYIIFL-TDGENSS 297 +E+ Y + +F+ TDGE Sbjct: 110 --QERKSLYRANGVAYYRPWVFMITDGEPQG 138 >gi|302545372|ref|ZP_07297714.1| putative von Willebrand factor type A domain protein [Streptomyces hygroscopicus ATCC 53653] gi|302462990|gb|EFL26083.1| putative von Willebrand factor type A domain protein [Streptomyces himastatinicus ATCC 53653] Length = 455 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 52/148 (35%), Gaps = 30/148 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++++D S SM+ K+ A + +D + + V ++ + K V+ Sbjct: 69 VVIMVDCSGSMDY----PPTKMRNARDATAAAIDTV------RDGVAFAVIAGTHKAVEV 118 Query: 231 FP----LAW----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 FP LA ++ + RL T L A + A + H Sbjct: 119 FPGNGRLATADPVTRAEAKDALRRLGAAGGTAIGTWLRLADRLLGTADVSIRHG------ 172 Query: 283 YKKYIIFLTDGENSSPN-IDNKESLFYC 309 I LTDG N D + +L C Sbjct: 173 -----ILLTDGRNEHETPEDLRAALDAC 195 >gi|162450490|ref|YP_001612857.1| hypothetical protein sce2218 [Sorangium cellulosum 'So ce 56'] gi|161161072|emb|CAN92377.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 482 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 50/167 (29%), Gaps = 20/167 (11%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 LD+ +V+D + SM+D + + SI P+ R L Sbjct: 236 APPGKKETLDVSLVIDTTGSMSDEITYLQTEFIALSDSI------FARYPNAQQ--RWSL 287 Query: 221 VTF----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 V + IV+ F QEK+ G + A +K D Sbjct: 288 VLYKDTEDDYIVRWFDFRSDPDEFQEKLAEQAAGGGGDFPEAPDMALSKAADLS------ 341 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 + D + ++ D + N S K G VY + Sbjct: 342 WRTGGDTARLAFWVADAPHHDSNAAAMASGIRA--LKDLGVAVYPVA 386 >gi|296330466|ref|ZP_06872945.1| putative activator of nitric oxide reductase [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305674748|ref|YP_003866420.1| putative activator of nitric oxide reductase [Bacillus subtilis subsp. spizizenii str. W23] gi|296152363|gb|EFG93233.1| putative activator of nitric oxide reductase [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305412992|gb|ADM38111.1| putative activator of nitric oxide reductase [Bacillus subtilis subsp. spizizenii str. W23] Length = 638 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 62/174 (35%), Gaps = 29/174 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S+I +++D S SM DK+ R I + +KS+ + +V Sbjct: 434 KQEPSSEIDAVFTLLVDCSASM-------FDKMDETKRGIVLFHEALKSVAVPHQIV--- 483 Query: 220 LVTF----SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY----NKIFDAKE 271 F + + P + + ++ P + N+ A Sbjct: 484 --GFWEDTNDATETSQPNYFNT------VIPFQSSLRQEAGPAIMQLEPEEDNRDGYAIR 535 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 ++ + +K++I +DGE ++ + ++ EA++RG V + Sbjct: 536 QMTKKMLQRSEAQKFLIVFSDGEPAAFGYEQNGIVDTSEAVIEARKRGIEVINV 589 >gi|154489100|ref|ZP_02029949.1| hypothetical protein BIFADO_02412 [Bifidobacterium adolescentis L2-32] gi|154083237|gb|EDN82282.1| hypothetical protein BIFADO_02412 [Bifidobacterium adolescentis L2-32] Length = 560 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 49/161 (30%), Gaps = 15/161 (9%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 + V+D S SM+ G + ++ I ++ V + SS V Sbjct: 385 LWVVDYSGSMSGKGKSG--AVAGLQAALDTDQARASHIEPGDDDVNVFIPFNSSAKVAQV 442 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + + LE A + ++ I LT Sbjct: 443 AQGKQTATLLAASENQVANGNADIYNALEVALKNLPSDRDDYTVA----------IALLT 492 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 DG++ + +D + + + +G +++I + Q Sbjct: 493 DGQSDTAKLDEFKQQYASD---GKGVPIFSIMFGDADSQQL 530 >gi|118476306|ref|YP_893457.1| hypothetical protein BALH_0561 [Bacillus thuringiensis str. Al Hakam] gi|118415531|gb|ABK83950.1| conserved hypothetical protein [Bacillus thuringiensis str. Al Hakam] Length = 626 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 422 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 469 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 470 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 529 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 530 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 589 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 590 QLMKNIY--NHHFLVANHAE 607 >gi|62078605|ref|NP_001013960.1| von Willebrand factor A domain-containing protein 1 precursor [Rattus norvegicus] gi|81884089|sp|Q642A6|VWA1_RAT RecName: Full=von Willebrand factor A domain-containing protein 1; Flags: Precursor gi|51980318|gb|AAH81983.1| Von Willebrand factor A domain containing 1 [Rattus norvegicus] gi|149024826|gb|EDL81323.1| von Willebrand factor A domain containing 1 [Rattus norvegicus] Length = 415 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 67/194 (34%), Gaps = 28/194 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F T A S L + + + S D++ +LD S S++ + + + Sbjct: 1 MLFWTVLSMALSLRLALAQSGIERGPTASAPQGDLLFLLDSSASVSHYEFSRVREFVG-- 58 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKIN-----RLIFGS 251 ++ ++P +R+ LV S+ + + I Sbjct: 59 -------QLVATMPFGPGALRASLVHVGSRPH--TEFTFDQYSSGQAIQDAVRVAPQRMG 109 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + L YA ++F A+E + K ++++TDG +S + E Sbjct: 110 DTNTGLALAYAKEQLF-AEEAGARLGVP-----KVLVWVTDGASSDSVGPPMQ------E 157 Query: 312 AKRRGAIVYAIGVQ 325 K G ++ + Sbjct: 158 LKDLGVTIFIVSTG 171 >gi|25028446|ref|NP_738500.1| putative protoporphyrin IX magnesium chelatase [Corynebacterium efficiens YS-314] gi|259507505|ref|ZP_05750405.1| magnesium chelatase [Corynebacterium efficiens YS-314] gi|23493731|dbj|BAC18700.1| putative protoporphyrin IX magnesium chelatase [Corynebacterium efficiens YS-314] gi|259164890|gb|EEW49444.1| magnesium chelatase [Corynebacterium efficiens YS-314] Length = 248 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 64/198 (32%), Gaps = 21/198 (10%) Query: 101 FAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVK 160 + + T + + + ++ V P + S++ Sbjct: 5 DSTPGRRSKSYTRQGADVRPKKDGHGINLVGTLMAAADRGAAVVDGMVDFRPEDLRGSLR 64 Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 ++++ ++ V+D S SM ++ T +I ML + + Sbjct: 65 RGREANL---IVFVVDTSGSMA-----ARSRVRAVTGAIMSMLT-----DAYQRRDKVAV 111 Query: 221 VTFSSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + + V Q+ ++ + G T GL A + + + EH + Sbjct: 112 IAVNGNKPTLVLAPTSSVDMAQKSLDAMPMGGRTPLAEGLIMARDLM-----EREHRKEP 166 Query: 280 HDDYKKYIIFLTDGENSS 297 ++ +TDGE++S Sbjct: 167 SRRP--LLVVMTDGEDTS 182 >gi|3183039|sp|Q62469|ITA2_MOUSE RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|473099|emb|CAA82877.1| VLA-2 homologue [Mus musculus] Length = 1178 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 41/289 (14%), Positives = 97/289 (33%), Gaps = 36/289 (12%) Query: 93 RNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS---RYEMPFIFCTFPWCANSS 149 + L+ + ++ I+ + SL + + ++ + C++ S Sbjct: 91 KLNLQNSASISNVTEIKTNMSLGLTLTRNPGTGGFLTCGPLWAHQCGNQYYATGICSDVS 150 Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 +TS +D+++V D S S + + + + + Sbjct: 151 PDFQFLTSFSPAVQACPSLVDVVVVCDESNS--------IYPWEAVKNFLVKFVTGLDIG 202 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG-LEYAYNKIFD 268 P L+ ++++ + + + + K + + S T+ G L + I Sbjct: 203 PKKTQ---VALIQYANEPR----IIFNLNDFETKEDMVQATSETRQHGGDLTNTFRAIEF 255 Query: 269 AKEKLEHIAKGHDD-YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV--- 324 A++ G K ++ +TDGE+ + K + CN + + I V Sbjct: 256 ARDYAYSQTSGGRPGATKVMVVVTDGESHDGSK-LKTVIQQCN---DDEILRFGIAVLGY 311 Query: 325 ---QAEAADQF---LKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVK 364 A +K AS F++V + L + +G+++ Sbjct: 312 LNRNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAGTLGEQIFS 360 >gi|325066441|ref|ZP_08125114.1| von Willebrand factor type A [Actinomyces oris K20] Length = 367 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 57/187 (30%), Gaps = 18/187 (9%) Query: 153 LLITSSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD----II 206 L + S+ S +++ MV+D + SM G +S LD + Sbjct: 64 LALAGPAIRGSEAISVSNVEIYMVVDRTGSMAAEDYQGKGP-DGVDQSASTRLDGVRADM 122 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNK 265 ++I + R ++ + + PL + I S + LE A Sbjct: 123 RAIREAFPDSRFSIIALDNTAARELPLTHDTNAVDAWIGSFKQEVSGHATGSSLEVALPM 182 Query: 266 IFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ 325 + L + + + +DGE + DN + A G ++ Sbjct: 183 LGL---TLAQARQSDPKDIRLVYIFSDGEAT----DNGRGAQAADNA---GISWQSLAGL 232 Query: 326 AEAADQF 332 + Sbjct: 233 VDGGAVL 239 >gi|229042482|ref|ZP_04190227.1| Von Willebrand factor type A domain protein [Bacillus cereus AH676] gi|228726835|gb|EEL78047.1| Von Willebrand factor type A domain protein [Bacillus cereus AH676] Length = 610 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 66/194 (34%), Gaps = 23/194 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 ++ + +++D S SM +K+ +S+ + +KS+ + + F Sbjct: 412 ELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----AISGFWE 459 Query: 226 KIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P + + N + + E N+ + + Sbjct: 460 DASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKLAKRPEK 519 Query: 284 KKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAADQFLKNC 336 K+++ TDGE S+ + ++ A++ G V I + EA Q +KN Sbjct: 520 HKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATYQLMKNI 579 Query: 337 ASPDRFYSVQNSRK 350 + + V N + Sbjct: 580 Y--NHHFLVANHAE 591 >gi|168705806|ref|ZP_02738083.1| hypothetical protein GobsU_40112 [Gemmata obscuriglobus UQM 2246] Length = 188 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 55/172 (31%), Gaps = 35/172 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ D S SM G + L S ++ + + FSS+ Sbjct: 32 FAILADNSGSMQ---GTPLTVLKAEITS---------TLARSRGSAQFYVTFFSSEADPQ 79 Query: 231 FPLAWG-----VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 W V + + + + + T G + H+ K Sbjct: 80 PLKRWTADRNEVAAVSKWVQGIQTANGTSPVVGFQ--------------HVLKLK-PPPD 124 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 + +TDGE ++ +L N+A R A+++ + + + LK A Sbjct: 125 VVYLMTDGEFDPREVEQIRTL---NQALRPPAVIHTVAFGGKQGEAELKMIA 173 >gi|118346597|ref|XP_976944.1| TPR Domain containing protein [Tetrahymena thermophila] gi|89288544|gb|EAR86532.1| TPR Domain containing protein [Tetrahymena thermophila SB210] Length = 1163 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 65/201 (32%), Gaps = 26/201 (12%) Query: 176 DVSLSMNDHFGPGMDKLGVATRS-----IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 D S + G +K RS +++ L I + V N R + F+ + Sbjct: 958 DESGKTENKEKIGSNKNQRQARSQKYIAVKKFLQIFDNY--VQNDDRVAFIKFNENVDVI 1015 Query: 231 FPL---AWGVQHIQEKIN---RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 F L ++ I ++ + + A A K Sbjct: 1016 FELNEKENNTIFLRNSIKSFLKICPEGESAVRQAIYTAIKLFQKAVPKDH---------S 1066 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR--F 342 K+I+ TDG +S I E + +++ ++ G+ A F + C F Sbjct: 1067 KWIVIFTDGGDSCSKISEPELMQMLSQS-DVNIMIIGCGLDKYATKIFRQYCEKTKNGIF 1125 Query: 343 YSVQN-SRKLHDAFLRIGKEM 362 N ++ AF I ++ Sbjct: 1126 IKTNNKKEEIDIAFQAITNKI 1146 >gi|218680613|ref|ZP_03528510.1| TadE family protein [Rhizobium etli CIAT 894] Length = 176 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 32/157 (20%), Positives = 54/157 (34%), Gaps = 16/157 (10%) Query: 3 FLNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK 62 F +R + KG+ +I AIL +FI++ +IE S FFV LD S+ + Sbjct: 7 FAPLRRLVGDHKGAAAIEFAILALPLFIILFGIIEVSLMFFVN----SALDASVHKISRM 62 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQH 122 I E ++ I +G + N+ S + D Sbjct: 63 IRTGEVASSKITLAG-----FKAKICDDMLLTFDCSSGLVVKV-NVLSDMSSAASTDPID 116 Query: 123 KDYNLSAVSRYEMP------FIFCTFPWCANSSHAPL 153 L+ Y++ + PW A ++ L Sbjct: 117 NSGKLTVTETYDIGKGSDYILVQAFLPWTAVANFLSL 153 >gi|260805092|ref|XP_002597421.1| hypothetical protein BRAFLDRAFT_122634 [Branchiostoma floridae] gi|229282686|gb|EEN53433.1| hypothetical protein BRAFLDRAFT_122634 [Branchiostoma floridae] Length = 705 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 57/186 (30%), Gaps = 29/186 (15%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ VLD S S+ + A + I +D ++ I D + G + + Sbjct: 506 DVVFVLDRSSSIE------LSIFNQAKQFI---VDTLQCIADRGVQIGVGYIVYDCVPKT 556 Query: 230 TFPLAWGVQ---HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 L + I+ + T TP L Y ++ A Sbjct: 557 IITLGTYTSDDPAVSGIIHYEMTEGGTTRTP-LAIRYMRLTSKFRDGAARAA-------- 607 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD-RFYSV 345 + LTDG+ D+ G +YA+ + + L+ A + Sbjct: 608 -VILTDGQTEGDAADDASDARD------AGIEMYAVAIGSFVDGSALQAIAGSGANVFDS 660 Query: 346 QNSRKL 351 + L Sbjct: 661 SDPCAL 666 >gi|91775988|ref|YP_545744.1| membrane protein-like protein [Methylobacillus flagellatus KT] gi|91709975|gb|ABE49903.1| membrane protein-like protein [Methylobacillus flagellatus KT] Length = 542 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 19/133 (14%), Positives = 50/133 (37%), Gaps = 10/133 (7%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 +N +G+I + + L + + + + +++ + K KL I D + + ++ Sbjct: 3 KNTKKKQQGAIGLFGVLTLLMAVLFVAVAVDSGRLWMEKRKLQNIADMAAIAAGGQVGGC 62 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDF---RNELRENGFAQDINNIERSTSLSIIIDDQHK 123 N+ + K N +Q + N ++ G+ D + I ++ Sbjct: 63 AQNNSSEAYKAAAQAAAAANGYQGNLLAAPNAVQLGGYHTDSDGIR-------TFAANNE 115 Query: 124 DYNLSAVSRYEMP 136 + ++ E+P Sbjct: 116 RSAVRVLATQEVP 128 >gi|332799397|ref|YP_004460896.1| von Willebrand factor type A [Tepidanaerobacter sp. Re1] gi|332697132|gb|AEE91589.1| von Willebrand factor type A [Tepidanaerobacter sp. Re1] Length = 547 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 62/194 (31%), Gaps = 30/194 (15%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + + V I + D+ +++D S SM +L A R ++ Sbjct: 350 YIENKKFSIEQEDIVAIKRTPKLQQDICLIIDASASMAGF------RLRNAKYLARYLV- 402 Query: 205 IIKSIPDVNNVVRSGLVTFSS-KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 + + ++ F ++ P ++E IN+++ T L+ Sbjct: 403 -------LKPNTQVSIMAFQEKEVNVCVPFTRNYDIMEEGINKIVATGLTPLALALDKGI 455 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVY 320 I K I+ +TDG + + +++ ++ + Sbjct: 456 CHINKKNLKNP-----------LIMLITDGIPTVSLWTSDPINDAVSAADKIAKNKINFC 504 Query: 321 AIGVQAEAADQFLK 334 IG+Q D +K Sbjct: 505 CIGLQ-PNKDCLIK 517 >gi|294102191|ref|YP_003554049.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] gi|293617171|gb|ADE57325.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] Length = 329 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 13/89 (14%), Positives = 33/89 (37%), Gaps = 4/89 (4%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R +G++ + A + V+ L ++ + +L +D L A ++ Sbjct: 6 RKELKRSRGAVLVWVAASMVVLLGAGALSLDYGRLVVARWRLQTAVDAGSLAGAWEL--- 62 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNE 95 N + + + S + +D ++E Sbjct: 63 GNKSASQALREA-SAAQVAGSVASDNKSE 90 >gi|226228456|ref|YP_002762562.1| hypothetical protein GAU_3050 [Gemmatimonas aurantiaca T-27] gi|226091647|dbj|BAH40092.1| hypothetical protein [Gemmatimonas aurantiaca T-27] Length = 642 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 69/188 (36%), Gaps = 23/188 (12%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIR 200 T PW + + S +I + S +++ ++DVS SM DKL + +S+R Sbjct: 247 TAPWQPRHQLVRIAL-QSRRIETASLPPNNLVFLIDVSGSMQ-----SPDKLPLVKQSLR 300 Query: 201 EMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPG 258 ++D + R +V ++ P G + I + I RL G +T G Sbjct: 301 LLVD------QMRPQDRVAIVAYAGAAGLVLPSTSGDEKETIIQAIERLEAGGSTAGGAG 354 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 +E AY + + +I +DG+ + + E + G Sbjct: 355 IELAYRTAREHFMDHGNNR---------VILASDGDFNVGVSSDGELERLIERKRTEGTY 405 Query: 319 VYAIGVQA 326 + +G Sbjct: 406 LTILGFGT 413 >gi|167635158|ref|ZP_02393474.1| conserved hypothetical protein [Bacillus anthracis str. A0442] gi|254742025|ref|ZP_05199712.1| hypothetical protein BantKB_13606 [Bacillus anthracis str. Kruger B] gi|167529417|gb|EDR92168.1| conserved hypothetical protein [Bacillus anthracis str. A0442] Length = 626 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 422 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 469 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 470 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 529 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 530 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 589 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 590 QLMKNIY--NHHFLVANHAE 607 >gi|119493558|ref|ZP_01624223.1| hypothetical protein L8106_25942 [Lyngbya sp. PCC 8106] gi|119452612|gb|EAW33794.1| hypothetical protein L8106_25942 [Lyngbya sp. PCC 8106] Length = 757 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 38/249 (15%), Positives = 81/249 (32%), Gaps = 33/249 (13%) Query: 100 GFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANS----SHAPLLI 155 + IN E +++ +D++ N + RY++ S H L Sbjct: 211 STSHQINITENGEIVTVKLDNEDTIPNKDLILRYQVSGDNTQTTVLTQSDERGGHFALYF 270 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 +++ + + D++ ++D S S + K R L+ + ++ Sbjct: 271 IPAIEYKTDEIVAKDVLFLMDTSGSQQGD---PLFKCQELMRRFINGLNPNDTFNIMD-- 325 Query: 216 VRSGLVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 ++ + PLA IN+L T+ G+ ++ E Sbjct: 326 -----FAHTTCTLSETPLANSPENRSLAIHYINQLRANGGTELLNGIR----EVLKFPEL 376 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + + I+ LTDG + N L + + G +++ GV + Sbjct: 377 TGRL--------RSIVLLTDGYIGNENAI----LSEVQDNLKPGNRLHSFGVGSSVNRFL 424 Query: 333 LKNCASPDR 341 + A R Sbjct: 425 INRIAEIGR 433 >gi|300776965|ref|ZP_07086823.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] gi|300502475|gb|EFK33615.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] Length = 634 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 76/226 (33%), Gaps = 26/226 (11%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + + PW + + I +++ ++DVS SM++ Sbjct: 240 FSINTEYGNSPWNPKHKLLKIGLQGK-NIPMDKLPASNIVFLIDVSGSMSEENKLP---- 294 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPLAWGVQHIQEKINRLIFG 250 + + + + G+V ++ V A + I + ++ L G Sbjct: 295 -------LLKSSLKVLLKQLRPKDKVGIVVYAGNAGMVLPSTSAGEKEKIIKALDNLQAG 347 Query: 251 STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCN 310 +T G+E AY E+ KG ++ ++ TDG+ + + Sbjct: 348 GSTAGGAGIELAYKL------AKENFIKGGNNR---VVLATDGDFNVGASSTSDIETLIT 398 Query: 311 EAKRRGAIVYAIGV-QAEAADQFLKNCA--SPDRFYSVQNSRKLHD 353 E ++ G + +G D ++ A + + N ++ + Sbjct: 399 EKRKTGIFLTCLGYGMGNYKDNTMEVLADKGNGNYAYIDNMQEANK 444 >gi|167842450|ref|ZP_02469134.1| hypothetical protein Bpse38_37650 [Burkholderia thailandensis MSMB43] Length = 418 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 39/126 (30%), Gaps = 3/126 (2%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 +G +SIL A++L V+ +GL ++ + +++L A N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAIN 74 Query: 71 NGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIII-DDQHKDYNLSA 129 + + + F ++ N N++ I Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMLTNSNVTFSNSLSNPFQPKSSIASPSSIKYVKCT 134 Query: 130 VSRYEM 135 SR + Sbjct: 135 TSRTGI 140 >gi|156404067|ref|XP_001640229.1| predicted protein [Nematostella vectensis] gi|156227362|gb|EDO48166.1| predicted protein [Nematostella vectensis] Length = 218 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 62/170 (36%), Gaps = 18/170 (10%) Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 T++ K D +D+ +VLD S SM + D +A +E++ PD Sbjct: 11 TAAKTEQVKCDKKVDLAIVLDASASMGE------DNYKLAKTLTKEIISRFTISPDK--- 61 Query: 216 VRSGLVTFSSKIVQTFPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R L FS+ V L+ + +++++ KS L + Sbjct: 62 TRVSLNFFSAHHVIVSKLSDNFSKSKLMSLTDQMMY---EKSFSKLATTLEAVHYELLVK 118 Query: 274 EHIAKGHDDYKKY-IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + A+ K + +TDG ++ +E+ K+ ++ + Sbjct: 119 KGGARPKQKGVKMATVLVTDGYGTAG---FEETSDEAKSMKKYNVEMFTV 165 >gi|323705583|ref|ZP_08117157.1| von Willebrand factor type A [Thermoanaerobacterium xylanolyticum LX-11] gi|323535060|gb|EGB24837.1| von Willebrand factor type A [Thermoanaerobacterium xylanolyticum LX-11] Length = 229 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 37/108 (34%), Gaps = 10/108 (9%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 + ++LD+S SM KL A + ++L +K + G + + Sbjct: 125 SDVRCAILLDLSGSMAR-------KLKKAVNGVMDLLSTMKVRKGKSQFCLIG-FPYGND 176 Query: 227 IVQTF--PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + P + I+ + L G T + ++ A + E Sbjct: 177 VYAKVICPFTSKINDIEAHLKNLKAGGNTPTYHAIKLATSLFDTEVES 224 >gi|58429521|gb|AAW78164.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNENAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T + L + D ++ + ++ LTDG +S Sbjct: 118 IKSLLSTNLPFGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPNSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|77552209|gb|ABA95006.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 605 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 66/202 (32%), Gaps = 24/202 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 +S +D++ VLDVS MD L A + + L + V Sbjct: 42 PPAAASSERAPIDLVAVLDVSCCGGLGPVNRMDLLKKAMGFVIDKLGEHDRLAVVPVQAS 101 Query: 218 SGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + + G + + + L K + L+ A + K Sbjct: 102 AAIAEKHDLVEMNAE---GRKEATRMVQSSLTVTGENKLSTALKKAATIL--EGRKDHDK 156 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 + +I+ ++DG+++S N+A V+A G + + + Sbjct: 157 KRPG-----FIVLISDGDDAS----------VLNDAMNLNCSVHAFGFRDAHNARAMHRI 201 Query: 337 A--SPDRFYSVQNSRK-LHDAF 355 A S + + + L DAF Sbjct: 202 ANTSAGTYGILNDGHDGLADAF 223 >gi|325268971|ref|ZP_08135592.1| von Willebrand factor [Prevotella multiformis DSM 16608] gi|324988592|gb|EGC20554.1| von Willebrand factor [Prevotella multiformis DSM 16608] Length = 292 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 20/101 (19%), Positives = 38/101 (37%), Gaps = 10/101 (9%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L +M+++DVS S++ G R + + + N + G++ F Sbjct: 74 EEERELTVMLLIDVSGSLDF------GTTGQLKRECATEIAATLAFSAIQNNDKIGIIFF 127 Query: 224 SSKIVQTFPLAWGVQHIQEKINRL----IFGSTTKSTPGLE 260 S + + G +HI I + T GLE Sbjct: 128 SDHVEKYIAPKKGRKHILYLIREMLTFTPESRKTDVGTGLE 168 >gi|300779171|ref|ZP_07089029.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] gi|300504681|gb|EFK35821.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] Length = 396 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 74/218 (33%), Gaps = 32/218 (14%) Query: 146 ANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK-------LGVATRS 198 + +TS+VK D + + ++LD S SM+ + L + Sbjct: 36 TVQKTTLIPVTSTVK-----DNKIQVALLLDTSNSMDGLIDQAKSRLWNIVNTLTTLKYN 90 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + I N+ +R + I Q PL + + EK+ L ++ Sbjct: 91 GKAPEIEIALYEYGNDGIRD-----ENYIRQVTPLTQDLDLVSEKLFALRTNGGSEYCGA 145 Query: 259 LEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 I DA L + YI N + + ++AK + Sbjct: 146 ------VIRDAAANLNWDSNDKSMKLIYI-----AGNEAFDQGKINYREVVSKAKNKNIY 194 Query: 319 VYAI--GVQAEAADQFLKNCAS--PDRFYSVQNSRKLH 352 + I G + E F +N AS +++++ + RK+ Sbjct: 195 INTIFCGSREEGIQTFWQNGASLGGGKYFNIDSDRKVL 232 >gi|322388234|ref|ZP_08061838.1| peptidoglycan binding domain protein [Streptococcus infantis ATCC 700779] gi|321140906|gb|EFX36407.1| peptidoglycan binding domain protein [Streptococcus infantis ATCC 700779] Length = 459 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 56/179 (31%), Gaps = 40/179 (22%) Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 + K D++ V+D S SM + R++ + D + + R Sbjct: 174 TPPTVKKAGAADIVFVVDRSGSMGGTINTVRKNVNEFARNLAK--DGVAA--------RF 223 Query: 219 GLVTFSSKIV----------------QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 GL T+S ++ +++ + ++ S A Sbjct: 224 GLATYSDEVYGRRLGKTDEDTILTKFGETYFTTDPVELEKALEKIKIAHGGDSPETATPA 283 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG----ENSSPNIDNKESLFYCNEAKRRGA 317 KI K KK+++ LTD + S P+ID + K+ G Sbjct: 284 LTKIVSTY-DWSKSPKN----KKFVVLLTDARMKEDPSIPSIDETLT-----TLKKAGI 332 >gi|315581432|gb|EFU93623.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309A] Length = 609 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|75907326|ref|YP_321622.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701051|gb|ABA20727.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 224 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 53/151 (35%), Gaps = 17/151 (11%) Query: 150 HAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 + L + V+ + + +++LD S SM G ++ L S+++ L Sbjct: 2 NDTLTLDEVVEFAENPEPRCPCVLLLDTSGSMQ---GAAIEALNQGLLSLKDELMKNSIA 58 Query: 210 PDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIF 267 V ++TF S I ++ N L T G+ A + + Sbjct: 59 ARR---VEIAIITFDSHINVIQDF-----VTADQFNPPILTAQGLTSMGAGIHKALDMV- 109 Query: 268 DAKEKLEHIAKGHDDYKKYIIFL-TDGENSS 297 +E+ Y + +F+ TDGE Sbjct: 110 --QERKSLYRANGVAYYRPWVFMITDGEPQG 138 >gi|330834711|ref|YP_004409439.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] gi|329566850|gb|AEB94955.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] Length = 379 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 74/205 (36%), Gaps = 38/205 (18%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP 232 ++LD SLSM KL +A E+ + S+P + L+ F K+ Sbjct: 42 ILLDKSLSMKGE------KLQMAKEGANEL---VSSLPQESYF---SLLAFDEKV---SI 86 Query: 233 LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 L Q I+ + GS T Y + +A + E + Y I LTD Sbjct: 87 LKEHSQSHLIDIDEIKAGSGT-------SLYKALEEASKLAERYRQPS-----YFILLTD 134 Query: 293 GENSS----PNIDNKESLFYCNEAKR-----RGAIVYAIGVQAEAADQFLKNCASPDR-- 341 G + + K L C + V + G+ + ++ L + Sbjct: 135 GVPTDRGCTHGLSRKFDLERCLPVYQGLSLPHNVQVISFGIGQDYNEKILSLISEKGNGF 194 Query: 342 FYSVQNSRKLHDAFLRIGKEMVKQR 366 FY +++ +++ + ++ K V + Sbjct: 195 FYHIKDPKEIVEKMPKLAKSSVAAK 219 >gi|58429499|gb|AAW78153.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 69/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ + Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRN--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|242348028|ref|YP_002995589.1| von Willebrand factor type A domain protein [Aeromonas hydrophila] gi|224831847|gb|ACN66978.1| von Willebrand factor type A domain protein [Aeromonas hydrophila] Length = 593 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 64/179 (35%), Gaps = 22/179 (12%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K +K + +++D+S SM G ++ +A + + +++IP VN V Sbjct: 407 KPEAKKRPNTAVHILVDMSSSMAYKAANGKERQDIAREASLAISMALEAIPGVNPAV--- 463 Query: 220 LVTFSSKIVQT-FPLAWGVQHIQEKINR--LIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 F Q F + +Q + R T + YA ++ +E+ Sbjct: 464 -TFFGGNRNQPVFSVVKHGDTVQNRAGRFGFKATGGTPMAEAMWYAAFELTKTREER--- 519 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 K +I +TDG+ S + + C R V IGV+ A + Sbjct: 520 --------KMLIVVTDGQPQSAPA-CRSVIDLCE---RSDVEVIGIGVETTAVSGLFQK 566 >gi|162454181|ref|YP_001616548.1| hypothetical protein sce5904 [Sorangium cellulosum 'So ce 56'] gi|161164763|emb|CAN96068.1| hypothetical protein sce5904 [Sorangium cellulosum 'So ce 56'] Length = 317 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 62/207 (29%), Gaps = 33/207 (15%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + + L + +LD S SM G + ++ + + N+ R +VT Sbjct: 100 EEEEDLSIYFILDTSASMAFGDGEKLRHAKRLAAAL--------AYVGLANLDRIAIVTA 151 Query: 224 SSKIVQTFPLAWGVQ---HIQEKINRLIFGSTTKSTPGLEY--------AYNKIFDAKEK 272 + +I G I + + T ++ I Sbjct: 152 TDEISGRMQSTRGKARIFRIFRFLGQARAEGPTDLGEAMKTFVAQHKRRGLAVILSDLYD 211 Query: 273 LEHIAKG-----HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV-------Y 320 +G ++ ++ +++ L D + P + ++ C R V Y Sbjct: 212 PAGFERGINVLRYNRFEPFVLHLVDAREARPALRGDVRVYDCETGDEREVTVTPKVLERY 271 Query: 321 AIGVQAEAADQFLKNCASPD-RFYSVQ 346 A + D+ + C S ++ Sbjct: 272 AEAYG-QYLDEIRRFCTSRQVSYFRAD 297 >gi|63102459|gb|AAH95480.1| VWA3B protein [Homo sapiens] gi|158258226|dbj|BAF85086.1| unnamed protein product [Homo sapiens] Length = 624 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 40/131 (30%), Gaps = 23/131 (17%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D S SM KL + I + + N V+ + + Sbjct: 509 IYILIDTSHSMK-------SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 561 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 ++ Q I + GS+T + L+ A+ KE I L Sbjct: 562 EVNEDNLEQAQSWIRDIKIGSSTNTLSALKTAFA----DKETQA------------IYLL 605 Query: 291 TDGENSSPNID 301 TDG Sbjct: 606 TDGRPDQGTSS 616 >gi|288919402|ref|ZP_06413735.1| hypothetical protein FrEUN1fDRAFT_3432 [Frankia sp. EUN1f] gi|288349190|gb|EFC83434.1| hypothetical protein FrEUN1fDRAFT_3432 [Frankia sp. EUN1f] Length = 587 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 25/120 (20%), Positives = 47/120 (39%), Gaps = 6/120 (5%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS----- 225 ++++LDVS SMN+ DK AT+ + +++ +++ GL TFSS Sbjct: 425 VLLLLDVSGSMNEPVDDP-DKAVDATKLQLMIPAAERALTLLDDDDEVGLWTFSSDPAYT 483 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++V P+ +QE++ L T A K+ + + Sbjct: 484 EVVPVSPVGEVRAALQERLRGLQARGDTALFEATRRASEKMTRNASTPSCCSATGRTPSR 543 >gi|14248703|gb|AAK57637.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYTPYGTTNMTAA--------QDEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L N+ K+R + IGV QF + A Sbjct: 104 LMTDGVPNS----KYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 148 >gi|260221155|emb|CBA29437.1| hypothetical protein Csp_A12140 [Curvibacter putative symbiont of Hydra magnipapillata] Length = 325 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 46/135 (34%), Gaps = 19/135 (14%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI--- 227 +M+ +D S SM D G G+ K+ A ++ K++ N +R G+ F Sbjct: 9 VMLHIDNSGSMADSVG-GVPKIQTAR-------NVGKALIAANPDLRWGVFAFDRNAGAR 60 Query: 228 --VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 P+ ++ IN L + T + I + + Sbjct: 61 GGRLQAPVGSSQAVLEASINALGANTNTPLAEAMLELTRYFAGEPSYYGKIIGNYTSPIQ 120 Query: 286 Y------IIFLTDGE 294 Y I +TDGE Sbjct: 121 YRCQKNFAIVITDGE 135 >gi|319428191|gb|ADV56265.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 599 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 66/197 (33%), Gaps = 29/197 (14%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM + S +++ + I N+V++ ++ F + Sbjct: 425 AVTLLLDQSNSMCGKAY------QTSVESTYALVEALSKI----NLVKTSVLGFGNSTES 474 Query: 230 TFPLAWGVQHIQEKINRLIFGST----TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 L + + + +L S T GL A N+++ E K Sbjct: 475 VIALKGFEETPAKCLTKLASSSADGYCTPLATGLWAALNQLYTRTEDR-----------K 523 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 ++ +TDG+ E + VY IG+ + L++ V Sbjct: 524 VVLVVTDGQPHGFQY----CKNLIAEMQASNVEVYGIGIGNDLNLPTLQSLFGKQFAIKV 579 Query: 346 QNSRKLHDAFLRIGKEM 362 L + +I + + Sbjct: 580 DQLSDLGNEVFKIAEGI 596 >gi|58429543|gb|AAW78175.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 63/181 (34%), Gaps = 27/181 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ +++D S S+ H ++ + +I+ + +N + + FS+ Sbjct: 47 VDLYLLMDCSGSIRRH--------NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAR 98 Query: 229 QTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L I+ ++ + T T L + D Sbjct: 99 EIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINR 150 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE-AADQFLKNCASP 339 ++ + ++ LTDG S KES + + V+ IG A ++FL C Sbjct: 151 ENANQLVVILTDGIPDSIQDSLKESRKLSD--RGVKIAVFGIGQGINVAFNRFLVGCHPS 208 Query: 340 D 340 D Sbjct: 209 D 209 >gi|307717905|ref|YP_003873437.1| hypothetical protein STHERM_c01900 [Spirochaeta thermophila DSM 6192] gi|306531630|gb|ADN01164.1| hypothetical protein STHERM_c01900 [Spirochaeta thermophila DSM 6192] Length = 367 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 63/208 (30%), Gaps = 24/208 (11%) Query: 120 DQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSL 179 ++Y +AV + P ++ +L +S + LD+++VLD + Sbjct: 175 PPEENYMPAAVESFRKLAEESRTPLLYSTGQEDVLKRIEEVLSEREGDTLDLVVVLDTTQ 234 Query: 180 SMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT----FPLAW 235 SM D +LG R + VR G+V + + + P Sbjct: 235 SMEDDIPYVKRELGPLVRRVCAPYRS----------VRIGVVLYKDYMEEYLNKVIPFER 284 Query: 236 GVQHIQEKINRLIFGSTTKSTPGLEYA-YNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + Q + + + A Y + + + II + D Sbjct: 285 DMGEFQRLVQGIRVSGGRDLPEAVHEALYAAL---------VRFDWQAEDRKIILVGDAP 335 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAI 322 EA+R+ +Y + Sbjct: 336 PHPRPRGAVTEEMVRREAERKKVRIYPV 363 >gi|154686346|ref|YP_001421507.1| YojO [Bacillus amyloliquefaciens FZB42] gi|154352197|gb|ABS74276.1| YojO [Bacillus amyloliquefaciens FZB42] Length = 638 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 64/169 (37%), Gaps = 19/169 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K SDI +++D S SM DK+ R I + +KS+ + +V G Sbjct: 434 KQDPSSDIDAVFTLLVDCSASM-------FDKMDETKRGIVLFHEALKSVSVPHQIV--G 484 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK---EKLEHI 276 +++ +T + I + + L ++ A + ++ Sbjct: 485 FWEDTNEATETSQPNFFNTVIPFEESLLQAAGP----LIMQLAPEEDNRDGYAIRQMTKQ 540 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 + +K++I +DGE ++ + ++ EA+++G V + Sbjct: 541 LMRRSEAQKFLIVFSDGEPAAFGYEQNGIVDTSEAVIEARKKGIEVINV 589 >gi|109732283|gb|AAI15771.1| Integrin, alpha 10 [Mus musculus] Length = 1166 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 73/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ ++ R + L I ++ GLV + V Sbjct: 166 MDVVIVLDGSNSI-----YPWSEVQTFLRRLVGRL----FIDPEQ--IQVGLVQYGENPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSQSRGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|124486692|ref|NP_001074522.1| integrin alpha-10 [Mus musculus] Length = 1167 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 73/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ ++ R + L I ++ GLV + V Sbjct: 166 MDVVIVLDGSNSI-----YPWSEVQTFLRRLVGRL----FIDPEQ--IQVGLVQYGENPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSQSRGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|42524019|ref|NP_969399.1| norD protein [Bdellovibrio bacteriovorus HD100] gi|39576227|emb|CAE80392.1| norD protein [Bdellovibrio bacteriovorus HD100] Length = 609 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 68/193 (35%), Gaps = 28/193 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + K + L +++VLD+SLS D + L ++ + +++ N +V +G Sbjct: 414 QTQVKRERDLQIVIVLDLSLS-TDSYVSDRRVLDTELEAVG-LWGLLQPSGPDNTLV-AG 470 Query: 220 LVTFSSKIVQTFPLAW---GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI 276 + + L + +++ T+ P L +A + Sbjct: 471 AFSETRHKCAFEILKDQGEDWSAYFSRAQQIVPRGYTRLGPALRHATRIL---------- 520 Query: 277 AKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 + +K +I LTDG+ + ++ C EA+ A ++ A Sbjct: 521 -RECSARQKVLIILTDGKPTDYDGYEGRYGIEDMRKACMEAESAQISTRAFAIEKAAKHY 579 Query: 332 FLKNCASPDRFYS 344 F P FYS Sbjct: 580 F------PQMFYS 586 >gi|134298678|ref|YP_001112174.1| hypothetical protein Dred_0811 [Desulfotomaculum reducens MI-1] gi|134051378|gb|ABO49349.1| hypothetical protein Dred_0811 [Desulfotomaculum reducens MI-1] Length = 183 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 57/165 (34%), Gaps = 12/165 (7%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATK----- 62 KG IL + +I + M I+ + KL I D L A+ Sbjct: 4 KVLKGKKGFTFILFVPVFLIIMLFMARGIDWGMATVARGKLQTISDAGSLAGASAVEPIT 63 Query: 63 ---ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDI-NNIERSTSLSIII 118 ++N ++G+ ++K +I Q R NG QD + + Sbjct: 64 KVELVNNDDGSLELREK-VTGIKINSEEAQRRARLARELNGGTQDYWQGVGGHWEGTEER 122 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISS 163 + Y + + +R ++PF+ + S H L IT S Sbjct: 123 IEGDDIYCVKSTARVKLPFLSRIY--EKVSGHKDLTITMPGDARS 165 >gi|221204654|ref|ZP_03577671.1| membrane protein [Burkholderia multivorans CGD2] gi|221175511|gb|EEE07941.1| membrane protein [Burkholderia multivorans CGD2] Length = 609 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 38/257 (14%), Positives = 79/257 (30%), Gaps = 26/257 (10%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK 76 ++++ AI + V IV+G I+ + +F + L + D + L + N+ Q Sbjct: 1 MAVVAAIWIAVALIVLG-SIDVGNLYFQRRDLQRVADMTALAAVQSV------NDLCPQT 53 Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-----SIIIDDQHKDYNLSAVS 131 + N T NGF D S S+ + Y +A + Sbjct: 54 DTTVTASGSNAVVTAAYRGAALNGF--DAQASGNSMSIACGRWDVSDYGAAAGYFGTATN 111 Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 + + T P I+++ + + + L + S D G + Sbjct: 112 QLNAVRVVATKTVPLFFIGPPRTISAASTAKASNIDTFSIGTTLAMFGSNQDCAGNSVSA 171 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 T + +L + + + S I LA + + + + Sbjct: 172 DQRNTGLVNALLGALLN------------TSLSLNIGSYQALACTRVKVGDLVKAQVGAG 219 Query: 252 TTKSTPGLEYAYNKIFD 268 T + N++ Sbjct: 220 TVDQLLATKLTLNQLVS 236 >gi|170748502|ref|YP_001754762.1| hypothetical protein Mrad2831_2084 [Methylobacterium radiotolerans JCM 2831] gi|170655024|gb|ACB24079.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 463 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 22/57 (38%) Query: 8 NFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKIL 64 + GS+++ + V+ + G I+ + +++L D +L + Sbjct: 7 RAVTDRSGSVALTFGLSAVVLLGLTGGGIDYARLAARRSQLQNAADAGVLAAGNYLK 63 >gi|255576003|ref|XP_002528897.1| protein binding protein, putative [Ricinus communis] gi|223531651|gb|EEF33477.1| protein binding protein, putative [Ricinus communis] Length = 705 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 58/163 (35%), Gaps = 26/163 (15%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 ++ +D++MVLDVS M G + + R + L+ R +V Sbjct: 302 TAARRPPIDLVMVLDVSQRM---CGVKLQVMKRIMRVVMSSLNSND---------RLSIV 349 Query: 222 TFSSKIVQTFPL----AWGVQHIQEKINRLIFGS-TTKSTPGLEYAYNKIFDAKEKLEHI 276 FS+ + PL A G + + I+ L + L+ A I D Sbjct: 350 AFSATSKRLSPLKRMTADGRRSARRIIDALGSTGQGMSANDALKKAAKVIED------RR 403 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV 319 K II +++G++ +I++ + + + Sbjct: 404 VKNPVAS---IIIISNGQDDRSHINSVNQKMTVSSTRFSHLEI 443 >gi|33333552|gb|AAQ11892.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 60/183 (32%), Gaps = 31/183 (16%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+ +++D S S+ H ++ + +I+ + N + FS+ Sbjct: 47 VDLYLLMDCSGSIRRH--------NWVNHAVPLAMKLIQQLNLNENAIHLYANVFSNNAR 98 Query: 229 QTFPLAWGVQH--------IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGH 280 + L I+ ++ + T T L + D Sbjct: 99 EIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINR 150 Query: 281 DDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCA 337 ++ + ++ LTDG S KES + RG + G+ ++FL C Sbjct: 151 ENANQLVVILTDGIPDSIQDSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCH 206 Query: 338 SPD 340 D Sbjct: 207 PSD 209 >gi|75910929|ref|YP_325225.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75704654|gb|ABA24330.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 608 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 28/136 (20%), Positives = 49/136 (36%), Gaps = 18/136 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G+D ++V+D S SM G + SI E ++ K V + F Sbjct: 36 NPKGGIDWIVVVDTSASMRGVGGTR-NIFAQVKNSINEFVNTAKLGDTVT------IYNF 88 Query: 224 SSKIV---QTFPLAWG--VQHIQEKINRLIFGST-TKSTPGLEYAYNKIFDAKEKLEHIA 277 S + Q P+ +++ IN L T + ++ A + + + + Sbjct: 89 DSDVTLQAQEIPIVSNPDRGKLKQIINNLKADGVRTHTGKAVQQA---LSTSAKLNQRPN 145 Query: 278 KGHDDYKKYIIFLTDG 293 I+FLTDG Sbjct: 146 TADRTVS--IVFLTDG 159 >gi|50085107|ref|YP_046617.1| tellurium resistance protein [Acinetobacter sp. ADP1] gi|49531083|emb|CAG68795.1| tellurium resistance protein [Acinetobacter sp. ADP1] Length = 212 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 64/197 (32%), Gaps = 16/197 (8%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM+ + ++ +L ++ P L++F S+ Sbjct: 3 RLPVYLLLDTSGSMHGE------PIEAVKNGVQILLSTLRQDPYALETAYLSLISFDSQA 56 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 Q PL + + L TT L +KI K KG I Sbjct: 57 KQLVPLT---ELAVFQAPDLQVTGTTALGGALALLADKIEQEVAKTTVDVKGDWKP--LI 111 Query: 288 IFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 +TDG P D ++ L + K G + A A LK + + Sbjct: 112 FIMTDGV---PTDDWRKGLQRLQQVKT-GV-IVACAAGHGADTSILKQITEVVVELATAD 166 Query: 348 SRKLHDAFLRIGKEMVK 364 S + F + + Sbjct: 167 SNTIKAFFKWVSASIST 183 >gi|307289533|ref|ZP_07569478.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|306499494|gb|EFM68866.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] Length = 463 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|284167111|ref|YP_003405389.1| ATPase AAA [Haloterrigena turkmenica DSM 5511] gi|284016766|gb|ADB62716.1| ATPase associated with various cellular activities AAA_5 [Haloterrigena turkmenica DSM 5511] Length = 665 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 46/128 (35%), Gaps = 20/128 (15%) Query: 171 MMMVLDVSLS-MNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IV 228 ++ V+D S S M+ ++ R I +++ R +V F + Sbjct: 464 VVFVVDASGSVMSG------RQMFETKRGILSLVE-----DAYRARDRVAVVVFREEGAF 512 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + + +++L G T GL AY E +E + +D ++ Sbjct: 513 TLVEPTRNLSAARRAVSKLTVGGNTPLAHGLVEAY-------ELVERERRRDEDLYPLVV 565 Query: 289 FLTDGENS 296 +DG+ + Sbjct: 566 LFSDGQTN 573 >gi|281339018|gb|EFB14602.1| hypothetical protein PANDA_010506 [Ailuropoda melanoleuca] Length = 1096 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 85/240 (35%), Gaps = 33/240 (13%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRS 198 + T C++ S L+TS + +D+++V D S S + Sbjct: 122 YYTTGVCSDVSADFQLLTSFAPAAQACPSLIDVVVVCDESNS--------IYPWEAVKNF 173 Query: 199 IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 + + + + P GL+ +++K F + + K + S T G Sbjct: 174 LEKFVQSLDIGPKKTQ---VGLIQYANKPRVIF----NLNTFKTKAEMIEATSQTYQYGG 226 Query: 259 -LEYAYNKIFDAKE-KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRG 316 L + I AK+ A G K ++ +TDGE+ ++ K + C+ Sbjct: 227 DLTNTFKAIQYAKDFAYAAGAGGRLGAAKVMVVVTDGESHDGSM-LKAVIDQCD---NDN 282 Query: 317 AIVYAIGV------QAEAADQFLK------NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + I V A +K + + F++V + L + +G+++ Sbjct: 283 ILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTETFFFNVSDEAALLEKAGTLGEQIFS 342 >gi|293605450|ref|ZP_06687832.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292816178|gb|EFF75277.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 509 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 52/153 (33%), Gaps = 19/153 (12%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ +D+S SM+ P +L A R++R++ ++GL+ ++ Sbjct: 96 LIVAVDLSPSMDGADVPP-SRLEAAKRALRDL-------TAQRAGAKTGLIAYAGSSHLV 147 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + L T ++ + I A L G ++ L Sbjct: 148 LPPTDDANLLDLFGQALSTDLTGRAGRDVA---GVIALAARVLAAERAGGT-----LLLL 199 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 TDG + A+ +V A+G Sbjct: 200 TDGADPGQ---MDRVRERAQAARDLQILVMAVG 229 >gi|218693343|gb|ACL01175.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|168701746|ref|ZP_02734023.1| von Willebrand factor, type A [Gemmata obscuriglobus UQM 2246] Length = 249 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 67/183 (36%), Gaps = 24/183 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++++D S SM + LG + + ++ ++V GL + + + Sbjct: 22 CVLLIDTSGSMAEVVSGTGRDLGRTAQVDGKTYRVVSGGTTRIDLVNEGLRVYQADVTND 81 Query: 231 FPLA------------WGVQHIQEKINR-------LIFGSTTKSTPGLEYAYNKIFDAKE 271 PLA V+ + + L T + A + + + Sbjct: 82 -PLAAQRVEVSVVTFGDTVRTVTPFVTTSQFTPPVLTANGETPMGAAILKAIDAVTE--R 138 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 K E+ G Y+ +I +TDGE + + + E K++ A +A+GV+ D+ Sbjct: 139 KREYRQNGLHFYRPWIFLITDGEPTDA-WEAAAARVREGEEKKQFA-FFAVGVEGANMDR 196 Query: 332 FLK 334 + Sbjct: 197 LKQ 199 >gi|29833533|ref|NP_828167.1| hypothetical protein SAV_6991 [Streptomyces avermitilis MA-4680] gi|29610656|dbj|BAC74702.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 623 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 74/217 (34%), Gaps = 34/217 (15%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + V+D+S SM++ G ++ V S+ + L + GL FS+K+ Sbjct: 420 ITTVVDISASMSEAVPGSSRSRMDVTKASLLQTLTTFTPDDE------IGLWNFSAKLDG 473 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKS-TPGLEYAYNKI----------FDAKEKLEHIAK 278 + RL + L A++ + +D A Sbjct: 474 DKD-----YRVLVPTGRLGDRGGRDTQRDRLSAAFSALEPVRGGATGLYDTTLAAYKAAT 528 Query: 279 GHDDYKKY--IIFLTDGENSSPNIDNKESLFYCNEAKRRG-----AIVYAIGVQAEAADQ 331 K+ ++ LTDG N P ++ +L + ++ + I V EA Q Sbjct: 529 ASYVKGKFNALVILTDGVNEDPGSISRSTLL--TQLRKLADPRHPVPLIMIAVGPEAHRQ 586 Query: 332 FLKNC--ASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + A+ + V + ++H L+ E Q Sbjct: 587 EAERIAGATGGSGHQVDSPAQIHSVILKAIMEAGNQS 623 >gi|11641066|gb|AAG39438.1|AF296186_1 Sof precursor [Streptococcus pyogenes] Length = 423 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 32/157 (20%), Positives = 65/157 (41%), Gaps = 7/157 (4%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V ++ K D G D+M +LDVS M D F +K+ +++ + + Sbjct: 191 TIDVTVTVTPKEIDEGADVMALLDVSKKMTDADFNNAKEKIKKLVKTLTSKPTDNQPNHN 250 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQH-IQEKINRL--IFGSTTKSTPGLEYAYNKIFD 268 N VR L+TF +I ++ + + ++ L + L+ A +K Sbjct: 251 ARNSVR--LMTFYREISDPIDISGKTDGDLDQLLDELRKKAKANYDWGVDLQGAIHKART 308 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKES 305 + + K +++I+ + GE++ N +S Sbjct: 309 VFKNENNDNKKKSGKRQHIVLFSQGESTFSYDINDKS 345 >gi|3273265|dbj|BAA31176.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 69/225 (30%), Gaps = 35/225 (15%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRHNWV 65 Query: 188 G---------MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 + +L + +I L+ + + ++R S + A + Sbjct: 66 NHAVPLAMKLIQQLNLNESAIHLYLNDFSN--NAREIIRL-----HSDASKNKEKA--LI 116 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I+ +N + T T L + D ++ + ++ LTDG S Sbjct: 117 IIKSLLNTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSI 168 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 169 QDSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|3273301|dbj|BAA31194.1| thrombospondin-related protein [Plasmodium falciparum] Length = 568 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 69/225 (30%), Gaps = 35/225 (15%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRHNWV 65 Query: 188 G---------MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 + +L + +I L+ + + ++R S + A + Sbjct: 66 NHAVPLAMKLIQQLNLNESAIHLYLNDFSN--NAREIIRL-----HSDASKNKEKA--LI 116 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I+ +N + T T L + D ++ + ++ LTDG S Sbjct: 117 IIKSLLNTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSI 168 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 169 QDSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|3273303|dbj|BAA31195.1| thrombospondin-related protein [Plasmodium falciparum] Length = 568 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 69/225 (30%), Gaps = 35/225 (15%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRHNWV 65 Query: 188 G---------MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQ 238 + +L + +I L+ + + ++R S + A + Sbjct: 66 NHAVPLAMKLIQQLNLNESAIHLYLNDFSN--NAREIIRL-----HSDASKNKEKA--LI 116 Query: 239 HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 I+ +N + T T L + D ++ + ++ LTDG S Sbjct: 117 IIKSLLNTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSI 168 Query: 299 NIDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 169 QDSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|294054131|ref|YP_003547789.1| protein of unknown function DUF58 [Coraliomargarita akajimensis DSM 45221] gi|293613464|gb|ADE53619.1| protein of unknown function DUF58 [Coraliomargarita akajimensis DSM 45221] Length = 286 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 36/104 (34%), Gaps = 9/104 (8%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +++D+S SM + + + ++ + N R GL+ FS +I + Sbjct: 78 IYLLIDLSGSMLHDPNKRQSMVELC---------SLLAMAAIKNQDRVGLILFSDEIEEI 128 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 P G H ++ L+ +S + K Sbjct: 129 LPPGKGRSHAMRIMDLLMNYQPKRSGTDFRAMLARFGHMARKHS 172 >gi|218548279|ref|YP_002382070.1| hypothetical protein EFER_0898 [Escherichia fergusonii ATCC 35469] gi|218355820|emb|CAQ88433.1| conserved hypothetical protein; putative exported protein [Escherichia fergusonii ATCC 35469] Length = 543 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 60/174 (34%), Gaps = 24/174 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN ++L + S++ +++ ++ + R +VT++ Sbjct: 184 LVFLIDTSGSMNSD-----ERLPLIKSSLKLLVN------ELRDQDRISIVTYAGSARLL 232 Query: 231 FPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 G I I L G T G+ AY + KG + I+ Sbjct: 233 LSSTSGAEKNTILNAIANLQAGGGTNGGAGVAMAYE------QAQAGYIKGGVNR---IL 283 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 TDG+ + D + + G + +GV + + A Sbjct: 284 LATDGDFNIG-DDPSSVEDLVKKQRESGITLSTLGVGDNNYNEAMMVKIADTGN 336 >gi|154432890|gb|ABS82078.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432914|gb|ABS82099.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432922|gb|ABS82106.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|332077748|gb|EGI88209.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA41301] Length = 886 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 35/80 (43%), Gaps = 4/80 (5%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNN 214 +V + LD++++LD S SM++ + A + R ++D I S P+ Sbjct: 204 SGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLIDKITSDPEN-- 261 Query: 215 VVRSGLVTFSSKIVQTFPLA 234 R LVT++S I Sbjct: 262 --RVALVTYASTIFDGTEFT 279 >gi|86739043|ref|YP_479443.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86565905|gb|ABD09714.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 655 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 66/210 (31%), Gaps = 36/210 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT- 230 ++ LD S SM + ++ L ++ + GL FS + Sbjct: 451 LVALDSSGSMAEPVPGTDPSRTRLQAAVETSLAGLRLFAPDSQ---VGLWRFSGEQGAQG 507 Query: 231 -------FPLA---------WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLE 274 P+A + ++ G T A+ + + Sbjct: 508 VQGYRDLVPMAALNAPGRGGTHRDELIAAQAGIVPGGATDLYATTLAAFRFLT------Q 561 Query: 275 HIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 H G + ++ LTDG +++ P++D+ + +R + I +A Sbjct: 562 HYVPGRLNQ---VVILTDGRDTNRGAAAPSLDSLIEQLHAEYDPKRPVAIIMIAYSNDAD 618 Query: 330 DQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 L A+ R Y + ++ D +L Sbjct: 619 LTALSRIAAATAGRSYLRPDPSQVVDIYLD 648 >gi|58429531|gb|AAW78169.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 69/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429467|gb|AAW78137.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 69/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429537|gb|AAW78172.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 69/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429535|gb|AAW78171.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 69/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|58429461|gb|AAW78134.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 69/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|313803457|gb|EFS44639.1| cobaltochelatase subunit [Propionibacterium acnes HL110PA2] Length = 654 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 51/151 (33%), Gaps = 19/151 (12%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 ++ V+D S SM ++ + + +L + R L+ Sbjct: 451 RAGRAASCVIFVVDASGSMG-----SRGRMTASKGAALSLL-----LDAYVKRDRVCLIG 500 Query: 223 F-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 F + P+ V+ Q + L G T + GL A + + + Sbjct: 501 FRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLIKACEVV-----RPLLLKDPGL 555 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 +I +TDG + ++D + + +EA Sbjct: 556 RP--LLILVTDGRGN-VSLDGRPNSQATDEA 583 >gi|58429491|gb|AAW78149.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 69/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDQVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|332993942|gb|AEF03997.1| hypothetical protein ambt_12385 [Alteromonas sp. SN2] Length = 671 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 27/127 (21%), Positives = 47/127 (37%), Gaps = 25/127 (19%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+D+SLSM D+L A ++++ I GLV ++ Sbjct: 98 VIVMDMSLSMRATDITP-DRLTRAKYKAIDLVNQIGEGE-------MGLVAYAGDAFVVS 149 Query: 232 PLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL +I + L + G+E A + +A Y K + Sbjct: 150 PLTQDAANITTLLPSLSPEIMPVPGSDPLLGIETAAELLTNAG------------YSKGM 197 Query: 288 IF-LTDG 293 I+ +TDG Sbjct: 198 IYWITDG 204 >gi|320158178|ref|YP_004190556.1| TPR domain-containing protein in aerotolerance operon [Vibrio vulnificus MO6-24/O] gi|319933490|gb|ADV88353.1| TPR domain protein in aerotolerance operon [Vibrio vulnificus MO6-24/O] Length = 608 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 41/142 (28%), Gaps = 13/142 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + W S + + ++V+D+S SM +L A Sbjct: 54 ILLWLLAWVIAILALASPSWQHTTRPS-FNSHQNRVLVMDMSQSM-YAQDIAPSRLQQAR 111 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGST 252 ++L K + L+ ++ PL Q + I L + Sbjct: 112 YKALDLLPQWKEGN-------TALIAYAGDAYLLSPLTSDSQTLANLIQNLSPDIMPYQG 164 Query: 253 TKSTPGLEYAYNKIFDAKEKLE 274 ++ T L A ++ A Sbjct: 165 SRLTSALVLAKAQLEKAGAAKG 186 >gi|303241097|ref|ZP_07327606.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302591357|gb|EFL61096.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 689 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 51/131 (38%), Gaps = 13/131 (9%) Query: 184 HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA-----WGVQ 238 + +L +++ +D S ++ + G+V+++ K L V Sbjct: 331 KYVEYETRLYKEKTAVKSFIDSFSS----DDRTKVGVVSYAEKAKIVSGLTSTLDTTNVS 386 Query: 239 HIQEKINRLIFGST--TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY--KKYIIFLTDGE 294 ++ I+ L + + + + I DA + ++ K D KYII LTD Sbjct: 387 IMKANIDALSPSNAIVSNVDGSYKSSRRNIGDAMRRAYYMLKNSTDAGASKYIIVLTDEI 446 Query: 295 NSSPNIDNKES 305 ++S D+ S Sbjct: 447 SNSWTWDSITS 457 >gi|229108230|ref|ZP_04237852.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock1-15] gi|228675246|gb|EEL30468.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock1-15] Length = 627 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 66/194 (34%), Gaps = 23/194 (11%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 ++ + +++D S SM +K+ +S+ + +KS+ + + F Sbjct: 429 ELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----AISGFWE 476 Query: 226 KIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 P + + N + + E N+ + + Sbjct: 477 DASSAKPEDKPNVIHEVVNYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKLAKRPEK 536 Query: 284 KKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAADQFLKNC 336 K+++ TDGE S+ + ++ A++ G V I + EA Q +KN Sbjct: 537 HKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATYQLMKNI 596 Query: 337 ASPDRFYSVQNSRK 350 + + V N + Sbjct: 597 Y--NHHFLVANHAE 608 >gi|167719780|ref|ZP_02403016.1| hypothetical protein BpseD_12240 [Burkholderia pseudomallei DM98] Length = 577 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 17/125 (13%), Positives = 41/125 (32%), Gaps = 8/125 (6%) Query: 16 SISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQ 75 S +++ AI + V +G ++ + FFV+ L + D + L A ++ + + Sbjct: 1 SFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM----DDQCAQPN 55 Query: 76 KNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM 135 + N L D + + + + + + + Sbjct: 56 AAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLN---AVQVTATQSV 112 Query: 136 PFIFC 140 P+ F Sbjct: 113 PYFFL 117 >gi|188583812|ref|YP_001927257.1| von Willebrand factor type A [Methylobacterium populi BJ001] gi|179347310|gb|ACB82722.1| von Willebrand factor type A [Methylobacterium populi BJ001] Length = 354 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 49/156 (31%), Gaps = 16/156 (10%) Query: 140 CTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSI 199 + P V + IG + M++D S SMN+ F ++++ Sbjct: 61 LAIGGLVLALAGPYRAGERVTRT---GIGAQVSMLIDRSGSMNETFAGRQPSGAEESKAM 117 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST--TKSTP 257 I++ + + FS+ + P+ ++ I + T Sbjct: 118 ASR-RILRDFVGERAHDQFAVTAFSTAPMLVVPMTDRHDAVRAAIAAIDRPGLDYTNVAR 176 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG 293 GL A ++ + ++ ++DG Sbjct: 177 GLGMALSQFGAGAPG----------VSRALLLVSDG 202 >gi|109129708|ref|XP_001116037.1| PREDICTED: integrin alpha-D-like, partial [Macaca mulatta] Length = 101 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 27/100 (27%), Positives = 38/100 (38%), Gaps = 14/100 (14%) Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG----VQAEAA 329 H KK +I +TDG+ D E +A++ G I YAIG Q A Sbjct: 5 HHKNGARKSAKKILIVITDGQ---KYKDPLEYRDVIPQAEKAGIIRYAIGVGRAFQEPTA 61 Query: 330 DQFLKNCASP---DRFYSVQNSRKLHDAFLRIGKEMVKQR 366 Q L S D + V N A I K++ ++ Sbjct: 62 RQELNTIGSAPPQDHVFKVDN----FAALSSIQKQLQEKI 97 >gi|121605804|ref|YP_983133.1| type 4 fimbrial biogenesis protein PilY1 [Polaromonas naphthalenivorans CJ2] gi|120594773|gb|ABM38212.1| type 4 fimbrial biogenesis protein PilY1 [Polaromonas naphthalenivorans CJ2] Length = 1148 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 56/181 (30%), Gaps = 38/181 (20%) Query: 171 MMMVLDVSLSMNDH------FGPGMDKLGVATRSIREMLDIIKSIPDVNNVV-----RSG 219 +++ +D S SMN ++ + S++ + + P N VV R Sbjct: 80 VIISVDDSGSMNWDVNGNSTNTTNNKRITLLKNSLKSVFGNPTANPPTNGVVPDNRIRLA 139 Query: 220 --LVTFSSKIVQTFPLAWG------------VQHIQEKINRLIFGSTTKSTPGLEYAYNK 265 ++ + L G + IN L T S + A + Sbjct: 140 WQVMHNNGDASGASSLTPGNTNSMKAFSGTHRTNFNTFINSLSASGDTPSHKMAKQALDY 199 Query: 266 IFDAKEKLE--HIAKGHDDYKKYI-------IFLTDGENSSPNIDNKESLFYCN-EAKRR 315 + G +Y+ IF+TDG N D +E + + A+ Sbjct: 200 MKSPAGTNSPWADMPGTAQTTQYLSCRRSYHIFMTDG---GWNSDPREFVGNADGTARTL 256 Query: 316 G 316 G Sbjct: 257 G 257 >gi|182439948|ref|YP_001827667.1| hypothetical protein SGR_6155 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178468464|dbj|BAG22984.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 596 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 64/202 (31%), Gaps = 27/202 (13%) Query: 171 MMMVLDVSLSMNDHF-GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + V+D S SM G G ++ V S+ + L+ + GL F++ + Sbjct: 387 LTTVVDASGSMATPVPGRGQSRMDVTKESLIQALEQFTPNDE------IGLWEFATTLDG 440 Query: 230 T------------FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 A G +E++ G + KE Sbjct: 441 DRDYRRLMPTKRLGDPAEGGGTHREELTAAFAGLRPVPGGATGLYDTTLASYKEARSTFV 500 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA---KRRGAIVYAIGVQAEAADQFLK 334 KG + ++ LTDG N ++ L +A R V AI V +A + Sbjct: 501 KGKFNA---LVILTDGSNQDDRSISRSGLVAELKALRDPERPVPVIAIAVGPDADRDEVA 557 Query: 335 NCA--SPDRFYSVQNSRKLHDA 354 A + Y V + ++ Sbjct: 558 EIARVTGGDGYEVSDPAEIRAV 579 >gi|94498563|ref|ZP_01305118.1| TadE-like protein [Sphingomonas sp. SKA58] gi|94422006|gb|EAT07052.1| TadE-like protein [Sphingomonas sp. SKA58] Length = 193 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 12/81 (14%), Positives = 30/81 (37%), Gaps = 1/81 (1%) Query: 5 NIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLH-YILDHSLLYTATKI 63 + + G ++ AI+LP + +M IE H + L + + + + TA+ Sbjct: 3 HFAPLAKDRTGVSTVEFAIILPALLTLMCGAIELGHMLLARVVLEGAMTEAARISTASLE 62 Query: 64 LNQENGNNGKKQKNDFSYRII 84 + ++ + + Sbjct: 63 TAEAQRTTLMEESIEQAMGNF 83 >gi|223934362|ref|ZP_03626283.1| hypothetical protein Cflav_PD5900 [bacterium Ellin514] gi|223896825|gb|EEF63265.1| hypothetical protein Cflav_PD5900 [bacterium Ellin514] Length = 576 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 52/158 (32%), Gaps = 27/158 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++ LD S SM M +L A +L +P R G+V + F Sbjct: 85 LVALDTSRSMKQADEEKMTRLEAAE----NLLMDADIVP------RDGVVKDPGVRLFEF 134 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + ++ G++T+ + N + + +I T Sbjct: 135 G-EDATAVTKSMMDLAPAGNSTRFHRSINTMVNSLGADEGAKA------------LILFT 181 Query: 292 DGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 DG + +K +L A+ R +YA+ + + Sbjct: 182 DGHDFELVNPSKTALI----ARARQVPIYAVALGKQGK 215 >gi|218693363|gb|ACL01185.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|167924336|ref|ZP_02511427.1| hypothetical protein BpseBC_37618 [Burkholderia pseudomallei BCC215] Length = 396 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 40/120 (33%), Gaps = 3/120 (2%) Query: 17 ISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQK 76 +SIL A++L V+ +GL ++ + +++L A N + Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS--ADACALAAARDLTGAINLSVPEA 58 Query: 77 NDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSL-SIIIDDQHKDYNLSAVSRYEM 135 + + + F +++ N +++ + I Y SR + Sbjct: 59 AGITAGHLNHALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGI 118 >gi|150024647|ref|YP_001295473.1| outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] gi|149771188|emb|CAL42655.1| Probable outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] Length = 631 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 68/185 (36%), Gaps = 19/185 (10%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKL 192 + + + PW + + + I + +++ ++DVS SMN +KL Sbjct: 237 FSITTQYSDCPWNSKRKLVQIGLQGK-NIPTNDLPASNLVFLIDVSGSMNAD-----NKL 290 Query: 193 GVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGST 252 + S++ +++ ++ V VV +G +V Q I + +L G + Sbjct: 291 PLLIESLKILVEQLRKQDKVAIVVYAGAAG----LVLPPTAGNEKQTIINALEKLNAGGS 346 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T G+E AY + K + +I TDG+ + + + + Sbjct: 347 TAGGAGIELAYKTAQENFIKDGNNR---------VILATDGDFNVGSTSDSAMQTLIEDK 397 Query: 313 KRRGA 317 + G Sbjct: 398 RESGV 402 >gi|329934513|ref|ZP_08284554.1| hypothetical protein SGM_0266 [Streptomyces griseoaurantiacus M045] gi|329305335|gb|EGG49191.1| hypothetical protein SGM_0266 [Streptomyces griseoaurantiacus M045] Length = 449 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 50/166 (30%), Gaps = 38/166 (22%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S + +++D S SM+ K+ A + +D + + V ++ Sbjct: 54 SPGRGPAAAVALMVDCSGSMDY----PPTKMRNAREATAAAVDTL------RDGVHFAVI 103 Query: 222 TFSSKIVQTFPLAWGVQHIQEK---------INRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 + + +P G + + + RL G T L A + A Sbjct: 104 GGTHVAKEVYP-GGGRLAVASEATRAAATYALRRLSAGGGTAIGTWLRLADRLLSSADVT 162 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPN-------IDNKESLFYCNE 311 + H I LTDG N +D F C+ Sbjct: 163 IRHG-----------ILLTDGRNEHETPEALRSALDGCAGRFTCDA 197 >gi|298370192|ref|ZP_06981508.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281652|gb|EFI23141.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1092 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 70/186 (37%), Gaps = 39/186 (20%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGL--DMMMVLDVSLSMN-------- 182 Y + A++ +P + S+K + + ++M+++D S SM Sbjct: 19 YSVASALLFISLEAHAQFSPTPPYLQNESSTKGQLTVKHNIMLLIDDSGSMEYVPSSDRR 78 Query: 183 -------DHFGPGMDKLGVATRSIREMLDIIK-----SIPDVNNVVRSGLVTFSSKIVQT 230 + + +L + + ++LD + S+ ++N R+ FSS Sbjct: 79 PYSSYYYGYGAREISRLDITKNVLNKVLDKYQDRFNWSLQTLHNNGRTDTQDFSSS---- 134 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 + I+ K+N + + T +T Y + + + + K Y++ + Sbjct: 135 ------WRTIKNKVNGIRAENGTPTT---RRYYEVVSN----IVMPNIKYRCQKSYVVLM 181 Query: 291 TDGENS 296 +DG+ + Sbjct: 182 SDGDAN 187 Score = 37.5 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 24/56 (42%), Gaps = 4/56 (7%) Query: 313 KRRGAIVYAIGVQA---EAADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVK 364 K + Y +G + + +L+N AS + +++ ++ L AF I + Sbjct: 295 KNQLVQTYTVGFGSGVSATGEAYLRNGASGSNNYFNARDENGLFAAFDAITDSIAN 350 >gi|291222148|ref|XP_002731080.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 954 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 74/199 (37%), Gaps = 33/199 (16%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 +I L ++++LD+S SMN + D+ + ++ + + V + G+V FS+ Sbjct: 310 EIELRIVLILDISGSMNTN-----DRFDLMIQASTKYIGYT-----VPSDTWLGIVEFST 359 Query: 226 KIVQTFPLAW--GVQHIQEKINRLI--FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 L ++ I++L G +T GL + E Sbjct: 360 DATILSYLVQLSDTDTRKQLIDKLPTGTGGSTCIGCGLNSGIEVLEHGWESPSGG----- 414 Query: 282 DYKKYIIFLTDG-ENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCAS 338 + +TDG EN P I++ +E + +V + + A+ L Sbjct: 415 ----ILFLITDGDENVRPYIED-----VIDELVEKEIVVDTLALSDEADPGLAELSEETG 465 Query: 339 PDRFY--SVQNSRKLHDAF 355 ++ +S LHDAF Sbjct: 466 GTAYWYSESDSSTGLHDAF 484 >gi|262172999|ref|ZP_06040676.1| TPR domain protein in aerotolerance operon [Vibrio mimicus MB-451] gi|261890357|gb|EEY36344.1| TPR domain protein in aerotolerance operon [Vibrio mimicus MB-451] Length = 636 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 43/122 (35%), Gaps = 15/122 (12%) Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 S + S + ++M +D+S SM P T++ + LD++K + Sbjct: 71 PSWQSAERPSVQNRAARVLM-MDMSRSMYATDLAP-----NRLTQARYKALDLLKGWQEG 124 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI----NRLIFGSTTKSTPGLEYAYNKIFD 268 +GLV +++ PL + I ++ + + A + + Sbjct: 125 T----TGLVAYAADAYVVSPLTSDTATLANLIPNLSPEIMPYQGANAANAVSLAISMLQQ 180 Query: 269 AK 270 A Sbjct: 181 AG 182 >gi|300772297|ref|ZP_07082167.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] gi|300760600|gb|EFK57426.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] Length = 256 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 47/130 (36%), Gaps = 9/130 (6%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + +LD S SM+ + ++ M++ +++ + ++TF ++ Sbjct: 48 RLPVYFLLDTSGSMHGE------PIQALNNALSGMINNLRTDAQAAETLWISMITFDREV 101 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 + PL ++ Q T + LE Y+ K KG +I Sbjct: 102 KEIVPLT-ALESFQLPEISCPESGPTFTGKALEILYDTATREVIKGSPEQKGDWRPLLFI 160 Query: 288 IFLTDGENSS 297 TDG+ S Sbjct: 161 --FTDGKPSD 168 >gi|300859618|ref|ZP_07105706.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecalis TUSoD Ef11] gi|300850436|gb|EFK78185.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecalis TUSoD Ef11] gi|315145275|gb|EFT89291.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX2141] gi|315161973|gb|EFU05990.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0645] Length = 657 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|156402363|ref|XP_001639560.1| predicted protein [Nematostella vectensis] gi|156226689|gb|EDO47497.1| predicted protein [Nematostella vectensis] Length = 260 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 63/192 (32%), Gaps = 31/192 (16%) Query: 171 MMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ VLD S S+ F G+ + + T N + L+ ++++ V Sbjct: 90 VVYVLDSSTSITKVDFENGIKAIQLLTA-------------KARNDTKFALILYAAEPVM 136 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 +Q N T + LE F L ++ Sbjct: 137 VTNFTTQASMLQLLTNVQRLYGKTNTFSALELCRKLFFVKSGSLNR-----------VLL 185 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ-AEAADQFLKNCAS--PDRFYSVQ 346 +TDG + N++ + L+ + K ++ + V + + +S Y V+ Sbjct: 186 VTDGLS---NMNQDKMLYEAFQLKMSNIEIFVVAVGRYNYGADEISSLSSIPTTHIYRVE 242 Query: 347 NSRKLHDAFLRI 358 + R L + I Sbjct: 243 SMRGLVRIIMWI 254 >gi|311254427|ref|XP_003125838.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-10-like [Sus scrofa] Length = 1177 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ ++ R + L I ++ GLV + V Sbjct: 166 MDVVIVLDGSNSI-----YPWSEVQTFLRRLVGRL----FIDPEQ--IQVGLVQYGESSV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAARNLSRREGRETKTAQAIMMACTEGFSQSHGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAAD---------QFLKNCAS- 338 +TDGE+ E+L C + Y I V Q ++ AS Sbjct: 273 VVTDGESHDGEELP-EALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLQEIRAIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|229015946|ref|ZP_04172909.1| Von Willebrand factor type A domain protein [Bacillus cereus AH1273] gi|228745362|gb|EEL95401.1| Von Willebrand factor type A domain protein [Bacillus cereus AH1273] Length = 412 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 208 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 255 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 256 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 315 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 316 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 375 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 376 QLMKNIY--NHHFLVANHAE 393 >gi|297563324|ref|YP_003682298.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296847772|gb|ADH69792.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 450 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 57/212 (26%), Gaps = 45/212 (21%) Query: 114 LSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMM 173 + + +P + + T SV + ++ Sbjct: 2 PDTTERAAEPGFRIEVDQNVLLPV-----GGREVHAIVSVTSTGSVVVGDSVRAAE--VI 54 Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFP- 232 ++D S SM G +D A R+ +L V +V S FP Sbjct: 55 IVDTSGSM---HGAKIDAAKQAARAAVGVL---------REGVHFAVVAGHSDASVLFPE 102 Query: 233 --------LAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 A E I L T+ L A + ++H Sbjct: 103 GGGRMVRADAVTRAEAAEAIGGLRADGGTRMGSWLVRAAELFATVEGGIKHA-------- 154 Query: 285 KYIIFLTDGENSSPNI------DNKESLFYCN 310 I LTDG+N+ P D F C+ Sbjct: 155 ---ILLTDGQNNEPAAVFGQALDRVAGSFVCD 183 >gi|256829450|ref|YP_003158178.1| von Willebrand factor type A [Desulfomicrobium baculatum DSM 4028] gi|256578626|gb|ACU89762.1| von Willebrand factor type A [Desulfomicrobium baculatum DSM 4028] Length = 702 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 56/147 (38%), Gaps = 19/147 (12%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 D+ D++ V+D S SM P +D A + + K + + R GLV F Sbjct: 287 DVKTDVVFVIDTSRSMQ----PFIDMTREAVAGMTK-----KISAETADRYRFGLVVFRD 337 Query: 226 KIVQTFPLAWGVQHIQ-------EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 + L + +++ + ++ L + ++YA A +K Sbjct: 338 SLEAAPQLEYVTRNLTPELVPGEQLVDLLEKEGGATAVGSVDYAEEAF--AGVDEALRSK 395 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKES 305 + +++IF+ D +S P + + Sbjct: 396 WREGALRFVIFIGD-ASSHPKGHPQNT 421 >gi|224282727|ref|ZP_03646049.1| von Willebrand factor, type A [Bifidobacterium bifidum NCIMB 41171] gi|310287186|ref|YP_003938444.1| Conserved hypothetical membrane spanning protein with a von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|313139885|ref|ZP_07802078.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|309251122|gb|ADO52870.1| Conserved hypothetical membrane spanning protein with a von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|313132395|gb|EFR50012.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 342 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 73/202 (36%), Gaps = 32/202 (15%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM---NDHFGPG--MDKLGVA 195 F A P ++T +S++ D+M+ +DV+ SM + +G + +L A Sbjct: 53 MFLLAAVMMLTPSIVT---TTTSRAINATDVMVAVDVTGSMAVKDAEYGSSGTISRLDAA 109 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTK 254 R ++ + +R G PL I + L + ++T Sbjct: 110 KRIVKGITSTY--ADSSFAALRFGASG-----TLDVPLTPDSIAIDGWADTLAVESTSTS 162 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF--LTDGENSSPNIDNKES--LFYCN 310 + L+ +++ L + I+ +TDGE +S S Y + Sbjct: 163 AGSSLDTPLDQL-----MLSLKSIRDQHPDDIIVLYVITDGEQTSDTARRSYSALRRYLD 217 Query: 311 EAKRRGAIVYAIGVQAEAADQF 332 ++ + IGV ++A + Sbjct: 218 DS-------FTIGVGSDAGGKI 232 >gi|254462465|ref|ZP_05075881.1| magnesium chelatase ATPase subunit D [Rhodobacterales bacterium HTCC2083] gi|206679054|gb|EDZ43541.1| magnesium chelatase ATPase subunit D [Rhodobacteraceae bacterium HTCC2083] Length = 547 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 49/157 (31%), Gaps = 21/157 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQ 229 ++ +D S S M +L A ++ +L + D L++F Sbjct: 366 LIFTVDASGS------AAMARLAEAKGAVEMLLSEAYARRDH-----VALISFRGLDAEV 414 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P + + ++ L G T GL A E H I+ Sbjct: 415 LLPPTRSLVQTKRRLAALPGGGGTPLASGLTAA----LSLAETASHKGMSAT-----IVL 465 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQA 326 LTDG + ++A+ + + GV A Sbjct: 466 LTDGRANIALDGQANRTQAGDDAQTIARNILSAGVDA 502 >gi|228932050|ref|ZP_04094942.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar andalousiensis BGSC 4AW1] gi|228827633|gb|EEM73375.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar andalousiensis BGSC 4AW1] Length = 609 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 405 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 452 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 453 ISGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 512 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 513 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 572 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 573 QLMKNIY--NYHFLVANHAE 590 >gi|218693409|gb|ACL01208.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|218693367|gb|ACL01187.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693469|gb|ACL01238.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|218693353|gb|ACL01180.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|218693287|gb|ACL01147.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|311068613|ref|YP_003973536.1| putative activator of nitric oxide reductase [Bacillus atrophaeus 1942] gi|310869130|gb|ADP32605.1| putative activator of nitric oxide reductase [Bacillus atrophaeus 1942] Length = 638 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 63/166 (37%), Gaps = 13/166 (7%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S+I +++D S SM DK+ R I + +KS+ + +V G Sbjct: 434 KQEPSSEIDAVFTLLVDCSASM-------FDKMEETKRGIVLFHEALKSVSVPHQIV--G 484 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ +T + + + L G+ N+ A ++ Sbjct: 485 FWEDTNDATETSQPNY-FNTVIPFQDSLQKGAGPAIMQLEPEEDNRDGYAIRQMTKQVLQ 543 Query: 280 HDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 + +K+++ +DGE ++ + ++ EA++RG V + Sbjct: 544 RREAQKFLLVFSDGEPAAFGYEQNGIVDTSEAVIEARKRGIEVINV 589 >gi|294674673|ref|YP_003575289.1| tellurium resistance protein [Prevotella ruminicola 23] gi|294473007|gb|ADE82396.1| putative tellurium resistance protein [Prevotella ruminicola 23] Length = 211 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 70/190 (36%), Gaps = 18/190 (9%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 L + +++D S SM + + +M+ ++ P V ++TF ++ Sbjct: 4 LPVYILIDTSGSMKGE------PIESVKVGLADMVATLRQDPYALETVCISIITFDREVS 57 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 Q PL ++++Q T L+ +KI ++ KG +I Sbjct: 58 QVLPLT-ELENLQMPDILTPDTGPTHLGKALKMLCDKIAVEVKRGTPEQKGDWRPLLFI- 115 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKR-RGAIVYAIGVQAEAADQFLKNCASPDRFYSVQN 347 LTDG+ S D ++ + K A + A +A + L+ D+ + + Sbjct: 116 -LTDGKPS----DVQDYNQIIPKVKSLNFASIVACAAGPKAKVEPLQQLT--DQVFRLDT 168 Query: 348 SRKLHDAFLR 357 +F + Sbjct: 169 MDA--TSFKK 176 >gi|221130655|ref|XP_002156983.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1a [Hydra magnipapillata] Length = 1029 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 74/230 (32%), Gaps = 49/230 (21%) Query: 151 APLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIP 210 + SV D+++V+D S SM +A + R ++D + Sbjct: 139 RRVWYQQSVSSPK------DVVIVIDRSGSMTGK------SFSIAKIAARMIIDALGE-- 184 Query: 211 DVNNVVRSGLVTFSSKI------VQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAY 263 N+ V+ +KI + + +Q IN++ + T G+ A+ Sbjct: 185 --NDYFNVIAVSNKAKIIEPCVPYLIQATKFNKEKMQIAINKIEKPNNVLNLTNGILLAF 242 Query: 264 NKIFD--AKEKLEHIAKGHDDY----------------------KKYIIFLTDGENSSPN 299 N + ++ + K K II ++DG + N Sbjct: 243 NILNSGCDEKFMISPWKTGLSPWKTGLSVKIAERGNNRTYRVGCNKLIIVISDGLEGNYN 302 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLK--NCASPDRFYSVQN 347 K N K Y +G D+ LK +C + FY ++ Sbjct: 303 NAGKVVFDKMNSEKNVRVFSYLVGRVKNPNDRALKEMSCNNRGYFYKIET 352 >gi|218693397|gb|ACL01202.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|218693385|gb|ACL01196.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|218693373|gb|ACL01190.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|218693235|gb|ACL01121.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|163738938|ref|ZP_02146351.1| von Willebrand factor type A domain protein [Phaeobacter gallaeciensis BS107] gi|161387743|gb|EDQ12099.1| von Willebrand factor type A domain protein [Phaeobacter gallaeciensis BS107] Length = 229 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 57/158 (36%), Gaps = 11/158 (6%) Query: 168 GLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK 226 L +++ +DVS S++ + + M+ L A + D + + V R L+ +S Sbjct: 23 DLALVLAVDVSGSVDVEEYRTQMEGLAAA------LQDGVVAEALVRAQARVALIQWSGS 76 Query: 227 IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 Q L+W + +++L S P YA + +E ++ Sbjct: 77 GRQELTLSWRHASDFQALDQLAADIAAASRPWRNYATGIGEALQLAIEQFQTVPGCRRRV 136 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 I DG P+ + E G V A+ + Sbjct: 137 IDVSGDG----PSNEGVEPTAIRGALSAAGITVNALAI 170 >gi|163751138|ref|ZP_02158368.1| TPR domain protein [Shewanella benthica KT99] gi|161329094|gb|EDQ00166.1| TPR domain protein [Shewanella benthica KT99] Length = 618 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 44/130 (33%), Gaps = 23/130 (17%) Query: 172 MMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTF 231 ++V+D+SLSM ++L A +++ +K +GL+ ++ Sbjct: 90 VIVMDMSLSMYATDLSP-NRLSQAKFKATDLIGELKEGE-------TGLIAYAGDAFTIS 141 Query: 232 PLAWGVQHIQEKINRLIFG----STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYI 287 PL + + L + L+ N + I Sbjct: 142 PLTRDRATLLNLLPTLSPDIMPVRGSNLVAALQQGKNLLAQGGHIRGD-----------I 190 Query: 288 IFLTDGENSS 297 + LTDG ++S Sbjct: 191 LLLTDGVSTS 200 >gi|293384735|ref|ZP_06630589.1| putative gram positive anchor protein [Enterococcus faecalis R712] gi|291077985|gb|EFE15349.1| putative gram positive anchor protein [Enterococcus faecalis R712] Length = 434 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|59808775|gb|AAH89923.1| Col6a2 protein [Rattus norvegicus] Length = 225 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 64/180 (35%), Gaps = 17/180 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ +LD S + + + + L + + D N R L+ + Sbjct: 34 TQRPVDIVFLLDGSERLGEQNFYKARRF---VEEVSRRLTLARRDDDPLNA-RMALLQYG 89 Query: 225 SKIVQT--FPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 S+ Q FPL + V I E + R S + G+ +A N + Sbjct: 90 SQNQQQVAFPLTYNVTTIHEALERTTYLNSFSHVGTGIVHAINNVVRGARGGARRHAELS 149 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 +FLTDG + +++ + +++ + + V + L + DR Sbjct: 150 -----FVFLTDGVTGNDSLEES-----VHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDR 199 >gi|22536818|ref|NP_687669.1| cell wall surface anchor family protein [Streptococcus agalactiae 2603V/R] gi|25010692|ref|NP_735087.1| hypothetical protein gbs0632 [Streptococcus agalactiae NEM316] gi|76788219|ref|YP_329407.1| Cna B domain-containing protein [Streptococcus agalactiae A909] gi|77411235|ref|ZP_00787585.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|22533664|gb|AAM99541.1|AE014220_13 cell wall surface anchor family protein, putative [Streptococcus agalactiae 2603V/R] gi|23095046|emb|CAD46276.1| Unknown [Streptococcus agalactiae NEM316] gi|76563276|gb|ABA45860.1| cna B-type domain protein [Streptococcus agalactiae A909] gi|77162661|gb|EAO73622.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|218693223|gb|ACL01115.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693225|gb|ACL01116.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693227|gb|ACL01117.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693229|gb|ACL01118.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693231|gb|ACL01119.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693233|gb|ACL01120.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693237|gb|ACL01122.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693239|gb|ACL01123.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693243|gb|ACL01125.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693245|gb|ACL01126.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693249|gb|ACL01128.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693251|gb|ACL01129.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693253|gb|ACL01130.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693255|gb|ACL01131.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693257|gb|ACL01132.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693259|gb|ACL01133.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693261|gb|ACL01134.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693263|gb|ACL01135.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693265|gb|ACL01136.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693267|gb|ACL01137.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693269|gb|ACL01138.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693271|gb|ACL01139.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693273|gb|ACL01140.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693275|gb|ACL01141.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693277|gb|ACL01142.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693279|gb|ACL01143.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693281|gb|ACL01144.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693283|gb|ACL01145.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693285|gb|ACL01146.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693289|gb|ACL01148.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693291|gb|ACL01149.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693293|gb|ACL01150.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693295|gb|ACL01151.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693297|gb|ACL01152.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693299|gb|ACL01153.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693301|gb|ACL01154.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693303|gb|ACL01155.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693305|gb|ACL01156.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693307|gb|ACL01157.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693309|gb|ACL01158.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693311|gb|ACL01159.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693313|gb|ACL01160.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693315|gb|ACL01161.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693317|gb|ACL01162.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693319|gb|ACL01163.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693321|gb|ACL01164.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693323|gb|ACL01165.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693325|gb|ACL01166.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693327|gb|ACL01167.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693331|gb|ACL01169.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693333|gb|ACL01170.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693337|gb|ACL01172.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693339|gb|ACL01173.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693341|gb|ACL01174.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693345|gb|ACL01176.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693347|gb|ACL01177.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693349|gb|ACL01178.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693355|gb|ACL01181.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693359|gb|ACL01183.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693369|gb|ACL01188.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693371|gb|ACL01189.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693377|gb|ACL01192.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693379|gb|ACL01193.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693381|gb|ACL01194.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693383|gb|ACL01195.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693387|gb|ACL01197.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693389|gb|ACL01198.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693391|gb|ACL01199.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693393|gb|ACL01200.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693395|gb|ACL01201.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693399|gb|ACL01203.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693401|gb|ACL01204.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693403|gb|ACL01205.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693405|gb|ACL01206.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693407|gb|ACL01207.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693411|gb|ACL01209.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693415|gb|ACL01211.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693419|gb|ACL01213.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693421|gb|ACL01214.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693433|gb|ACL01220.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693435|gb|ACL01221.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693439|gb|ACL01223.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693447|gb|ACL01227.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693453|gb|ACL01230.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693455|gb|ACL01231.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693457|gb|ACL01232.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693459|gb|ACL01233.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693461|gb|ACL01234.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693463|gb|ACL01235.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693465|gb|ACL01236.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693467|gb|ACL01237.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693471|gb|ACL01239.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693473|gb|ACL01240.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693475|gb|ACL01241.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.6 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + I N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|330684715|gb|EGG96413.1| von Willebrand factor type A domain protein [Staphylococcus epidermidis VCU121] Length = 629 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 40/304 (13%), Positives = 103/304 (33%), Gaps = 41/304 (13%) Query: 37 ETSHKFFVKAK-LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNE 95 + + K K LD N +G Q D +++ + + E Sbjct: 313 DMTDMMTKKGKGSQNTLDRE--EGGFIGQNSAFALDGINQNVDIKWKVPDILPEYIQAYE 370 Query: 96 LRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLI 155 +N +I ++ + +I + Q + +NL+ R + I + + Sbjct: 371 DVKNDVQFEIKDLIQIIKKTIEREHQDERHNLT-KGRLQKNLINWFIDDQYKLFYKKQDL 429 Query: 156 TSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + S + +++D S SM+ DK+ + + + +K + + + Sbjct: 430 SQSFDAT--------FTLLIDASASMH-------DKMDETIKGVVLFHETLKELNVKHEI 474 Query: 216 VRSGLVTFSSKIVQTFPL-----AWGVQHIQE---KINRLIFGSTTKSTPGLEYAYNKIF 267 + F S + + + +++ +I L + + A +++ Sbjct: 475 LAFNEDAFDSDDTKQPNIIDEIIHYDYSTLKKDGPRIMALEPQDDNRDGVAIRIASDRLI 534 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV 324 + H ++++I +DGE S+ N ++ A++ G V+ + + Sbjct: 535 ---RRSHH--------QRFLIVFSDGEPSAYNYSQDGIIDTYEAVENARKFGIEVFNVFL 583 Query: 325 QAEA 328 + Sbjct: 584 SQDP 587 >gi|330794740|ref|XP_003285435.1| hypothetical protein DICPUDRAFT_76377 [Dictyostelium purpureum] gi|325084610|gb|EGC38034.1| hypothetical protein DICPUDRAFT_76377 [Dictyostelium purpureum] Length = 1962 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 54/182 (29%), Gaps = 38/182 (20%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 S ++ LD++++ DV+ SM KLG I+ I + VR V Sbjct: 1629 SRHNNQYLDLVILCDVTGSMGSELTQVKTKLG----------QIVDRIKESKVNVRVANV 1678 Query: 222 TFSSK----------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 ++ + I+ KI L + + ++ Sbjct: 1679 FYNDHHQVYLQTDESPTVQVDFTSDLDEIKSKIESLNVDGGDDNAEAVADGLYQVSKLNF 1738 Query: 272 KLEHIAKGHDDYKKYIIFLTD--------GENSSPNIDN--KESLFYCNEAKRRGAIVYA 321 + + K IF+ D ++ PN + + + + G Y Sbjct: 1739 RR--------NSTKVCIFIADAPAHGFDEAIDNFPNGCPCGHDCIELVRKIVKMGVTFYT 1790 Query: 322 IG 323 + Sbjct: 1791 VS 1792 >gi|262403350|ref|ZP_06079910.1| TPR domain protein in aerotolerance operon [Vibrio sp. RC586] gi|262350849|gb|EEY99982.1| TPR domain protein in aerotolerance operon [Vibrio sp. RC586] Length = 624 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 39/112 (34%), Gaps = 11/112 (9%) Query: 139 FCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM-NDHFGPGMDKLGVATR 197 W + S + S + ++V+D+S SM P T+ Sbjct: 56 LLAIVWLFATLALAGPSWQSAERPSVQNSA-SRVLVMDMSRSMYATDLAP-----NRLTQ 109 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF 249 + + LD++K + + +GLV +++ PL + I L Sbjct: 110 ARYKALDLLKGWHEGS----TGLVAYAADAYVVSPLTSDSATLANLIPNLSP 157 >gi|296125272|ref|YP_003632524.1| hypothetical protein Bmur_0218 [Brachyspira murdochii DSM 12563] gi|296017088|gb|ADG70325.1| protein of unknown function DUF58 [Brachyspira murdochii DSM 12563] Length = 295 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 66/191 (34%), Gaps = 40/191 (20%) Query: 75 QKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYE 134 Q + +I+ + + + + + +G D ++ D + D+ +SA RY Sbjct: 18 QIEIKTSKIVNSYFAGQYHSAFKGHGIEFDEVR-----KYTVGDDVRTMDWKVSA--RYN 70 Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 PFI + + L ++++ D S S + F Sbjct: 71 EPFIKRF----------------------REERELSVIILADFSASTDFGF-------TK 101 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIF----G 250 ++ L + + N + GL+ F+ + + PL G H+ I LI Sbjct: 102 TKHNLIVELSALLGFSALKNNDKVGLLIFTDTVEKFIPLNKGRNHVLRIIRELIEFEPKS 161 Query: 251 STTKSTPGLEY 261 + T LEY Sbjct: 162 ANTNIASALEY 172 >gi|123498822|ref|XP_001327483.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121910413|gb|EAY15260.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 729 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 69/200 (34%), Gaps = 40/200 (20%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 KI + S+ ++D S SM + ++ A + ++ + R Sbjct: 226 EGKIYANSE----FYFIIDCSGSMEES------RIKNAKFCLNLLIHSLPV------GCR 269 Query: 218 SGLVTFSSKIVQTFPLAW----GVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEK 272 ++ F S P V E+IN++ T L++ ++ Sbjct: 270 FSIIKFGSMYEVVLPTCDYTDENVAKAMEQINQMDANMEGTDILSPLKFVSDQSTKEG-- 327 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF 332 + K + LTDGE+ D +L N R ++ IG+ + A Sbjct: 328 ----------FIKQVFLLTDGEDIH--TDQIYALVQANRTNNR---IFTIGIGSGADRNL 372 Query: 333 LKNCA--SPDRFYSVQNSRK 350 +KN A S ++++ + Sbjct: 373 IKNIARISGGNNALIEDNDE 392 >gi|26352267|dbj|BAC39770.1| unnamed protein product [Mus musculus] Length = 1121 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 63/186 (33%), Gaps = 30/186 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM H +L + +R+ + + L + +V+ Sbjct: 924 VCILLDTSGSMGPHLQWIKTELVLLIWEQLRKHCARFNLLSFAED-----LQLWQDTLVE 978 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + A + + L +T L A+ + + Sbjct: 979 STEAA--CHKAMQWVAHLQAQGSTSVLAALTKAF----------------SFQDVQGLYL 1020 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA--SPDRFYSV 345 LTDG+ + ++ K RG V+ I + + A +FL+ A S R++ Sbjct: 1021 LTDGKPDTSCSLILNTVQSFQ--KERGVKVHTISLTSADRTATEFLRELASLSGGRYHCP 1078 Query: 346 QNSRKL 351 + + L Sbjct: 1079 VSDKAL 1084 >gi|117320533|ref|NP_808365.2| von Willebrand factor A domain-containing protein 3A [Mus musculus] gi|123785278|sp|Q3UVV9|VWA3A_MOUSE RecName: Full=von Willebrand factor A domain-containing protein 3A gi|74205319|dbj|BAE23160.1| unnamed protein product [Mus musculus] gi|148685277|gb|EDL17224.1| RIKEN cDNA E030013G06, isoform CRA_b [Mus musculus] Length = 1148 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 63/186 (33%), Gaps = 30/186 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVAT-RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 + ++LD S SM H +L + +R+ + + L + +V+ Sbjct: 924 VCILLDTSGSMGPHLQWIKTELVLLIWEQLRKHCARFNLLSFAED-----LQLWQDTLVE 978 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + A + + L +T L A+ + + Sbjct: 979 STEAA--CHKAMQWVAHLQAQGSTSVLAALTKAF----------------SFQDVQGLYL 1020 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA--ADQFLKNCA--SPDRFYSV 345 LTDG+ + ++ K RG V+ I + + A +FL+ A S R++ Sbjct: 1021 LTDGKPDTSCSLILNTVQSFQ--KERGVKVHTISLTSADRTATEFLRELASLSGGRYHCP 1078 Query: 346 QNSRKL 351 + + L Sbjct: 1079 VSDKAL 1084 >gi|58616210|ref|YP_195339.1| hypothetical protein p1B95 [Azoarcus sp. EbN1] gi|56315671|emb|CAI10315.1| hypothetical protein p1B95 [Aromatoleum aromaticum EbN1] Length = 637 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 76/214 (35%), Gaps = 32/214 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + +++D S SM P D ++R+ ++ + + + S Sbjct: 446 KKEETEGTNTAVALLVDYSGSM-WTTMPLTD---GSSRTTAGQTTAQEAAYETHFALASV 501 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIF-------GSTTKSTPGLEYAYNKIFDAKEK 272 L T+ + A V ++E L T + +E A +I +E+ Sbjct: 502 LSTY-EVPFLSIAFANHVCVLKEFDEPLSAHRWDDGSSGDTATGDAVEEALIRITAREEE 560 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYA--IGVQAEAAD 330 K+ ++ +TDGE + D+ + C A++ G + IG A + Sbjct: 561 -----------KRILVVVTDGEPN----DHDSLMAGCRFARQNGIQIAVVFIGSMGHALE 605 Query: 331 QFLKNCASPDRFYSVQNSRKLHDA-FLRIGKEMV 363 LK F ++ +L A F + K + Sbjct: 606 AMLKE--EEVAFDRAMSADRLSQAVFGAVAKAVG 637 >gi|86147465|ref|ZP_01065777.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] gi|85834758|gb|EAQ52904.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] Length = 421 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 26/65 (40%) Query: 11 YNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGN 70 G + I I L V V L ++ +H K +L +D + L AT + N ++ Sbjct: 10 KKQGGLVVIFVTIALLVFLAVSALAVDINHMLVNKTRLQNAVDSAALAAATILDNSKDQA 69 Query: 71 NGKKQ 75 + Sbjct: 70 AVSAE 74 >gi|254781007|ref|YP_003065420.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] gi|254040684|gb|ACT57480.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] Length = 411 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 62/397 (15%), Positives = 139/397 (35%), Gaps = 50/397 (12%) Query: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 L + +G ++TAILL ++ +V++ A L +LDH + T+ K Sbjct: 9 LGVLRLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTAWLQEVLDHVIYRTSPKN 68 Query: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 L + +++F I+ T +L G + I ++ + L+ ++ Sbjct: 69 LYD----LREAGRDNFIRHQIEKALNTYNSRDLSNTGSIESI--VKDAVILTKNVNSLPL 122 Query: 124 DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMND 183 + + + + S + I+ K+ K +IGL +M + + Sbjct: 123 QFTVDIALSTTVQLRGSLLQMFSQSKGK-VDISRRKKVMYKQNIGLMIM-----PFAWDG 176 Query: 184 HFGPGMDKLGVATRSIREMLD---IIKSIPDVNNVVRSGLVT-----FSSKIVQTFP-LA 234 ++ K+ + + L+ + + N +V++ L F L Sbjct: 177 YWLASRGKVADSKVHPPKYLEYSHYYQQYLNRNTLVKNFLSQIPYKNFCMAPYHYSSILY 236 Query: 235 WGVQHIQEKIN---------RLIFGSTTKSTP------GLEYAYNKIFDAK--EKLEHIA 277 W V + ++ + + +T P + N+ D + H Sbjct: 237 WAVGTLTYSVDNKTTTREYYKDPYYATWDHFPYSFIKNVFDMTSNQFGDGQVLTNTNHCF 296 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNKE----SLFYCNEAKRR------GAIVYAIGVQAE 327 KY++ L G S + KE L C+ +R ++++G + Sbjct: 297 PHGASQNKYMLMLAIGNQLSRSSVEKEKIEKVLQDCHYMHKRHRTGRDAITIFSVGFSPD 356 Query: 328 AADQF-LKNCAS-PDRFYSVQNSRKLHDAFLRIGKEM 362 ++ L+ CAS P ++Y + + + + + + Sbjct: 357 QDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNV 393 >gi|227554889|ref|ZP_03984936.1| von Willebrand factor domain LPTXG domain protein [Enterococcus faecalis HH22] gi|227175982|gb|EEI56954.1| von Willebrand factor domain LPTXG domain protein [Enterococcus faecalis HH22] Length = 498 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|194016713|ref|ZP_03055326.1| von Willebrand factor, type A domain containing protein [Bacillus pumilus ATCC 7061] gi|194011319|gb|EDW20888.1| von Willebrand factor, type A domain containing protein [Bacillus pumilus ATCC 7061] Length = 638 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 68/166 (40%), Gaps = 14/166 (8%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + S+I +++D S SM DK+ + I + +KS+ + +V G Sbjct: 434 KQAPSSEIDAVFTLLVDCSASM-------FDKMDETKKGIVLFHEALKSVQVPHQIV--G 484 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ +T + + + L + + E + ++ + I K Sbjct: 485 FWEDTNDASETSQPNY-FNTVVSFKDSLFDAGPSIMSLEPEEDNRDGYAIRQMTKMILKR 543 Query: 280 HDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 ++ +K++I +DGE ++ + + ++ EA+++G V + Sbjct: 544 REE-QKFLIVFSDGEPAAFSYEQNGIVDTHEAVLEARKKGIEVINV 588 >gi|196047158|ref|ZP_03114375.1| conserved hypothetical protein [Bacillus cereus 03BB108] gi|196022028|gb|EDX60718.1| conserved hypothetical protein [Bacillus cereus 03BB108] Length = 626 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K ++ + +++D S SM +K+ +S+ + +KS+ + Sbjct: 422 KGQESQELDVAFQLLVDCSGSM-------YNKMEETKKSVVLFHEALKSLKIPH-----A 469 Query: 220 LVTFSSKIVQTFPLA--WGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIA 277 + F P + + N + + E N+ + Sbjct: 470 ICGFWEDASSAKPEDKPNVIHEVVTYKNSTLPNVGPEIMQLREEEDNRDGYIIRIVSEKL 529 Query: 278 KGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGV----QAEAAD 330 + K+++ TDGE S+ + ++ A++ G V I + EA Sbjct: 530 AKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKEATY 589 Query: 331 QFLKNCASPDRFYSVQNSRK 350 Q +KN + + V N + Sbjct: 590 QLMKNIY--NHHFLVANHAE 607 >gi|156408870|ref|XP_001642079.1| predicted protein [Nematostella vectensis] gi|156229220|gb|EDO50016.1| predicted protein [Nematostella vectensis] Length = 274 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 69/199 (34%), Gaps = 34/199 (17%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D++ +D S S+ + R++R ++ K + V++S Sbjct: 94 DVIFAVDSSGSIKE------TGFRAGIRALRILISRAKPTTIYS------AVSYSEHAHI 141 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK---EKLEHIAKGHDDYKKY 286 F + + + G T + L N + H K+ Sbjct: 142 EFSFKSPKEAVSLLHSARFLGFRTNTQDALRECRNLFTNKTSGVRMRSH--------KRV 193 Query: 287 IIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQ--AEAADQFLKNCASPD-RFY 343 +I TDG N+D + +L+ + K G V+ + V + ++ L +S + + Sbjct: 194 LIV-TDG---RSNVDKELTLYQAFQLKLMGVEVFVVAVGKYIKGIEEILGMASSFEKHLF 249 Query: 344 SVQNSRKLHDAFLRIGKEM 362 V++ F R+ + + Sbjct: 250 RVES----LAGFARVVRMI 264 >gi|333025630|ref|ZP_08453694.1| hypothetical protein STTU_3134 [Streptomyces sp. Tu6071] gi|332745482|gb|EGJ75923.1| hypothetical protein STTU_3134 [Streptomyces sp. Tu6071] Length = 518 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 48/147 (32%), Gaps = 17/147 (11%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +VLD S SM ++ + +++ E + + + D VVR LV FS Sbjct: 329 TGTRARVYLVLDRSGSMRPYY------KDGSAQALGEQVLALAAHLDTEAVVR--LVFFS 380 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + I L ++ L G+ A E AK Sbjct: 381 TAIDAMGTLTLDAYE--GVVDGLHEGAGRMGRTNYALAIE------EVRALHAKEAAGEP 432 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNE 311 ++F TDG ++L Sbjct: 433 GLVVFQTDG-PPDARTAATQALKAAET 458 >gi|320533437|ref|ZP_08034120.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 171 str. F0337] gi|320134345|gb|EFW26610.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 171 str. F0337] Length = 367 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 58/183 (31%), Gaps = 18/183 (9%) Query: 157 SSVKISSK--SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKS----IP 210 S+ S +++ MV+D + SM G +S LD ++S I Sbjct: 68 GPAIRGSEAISVSNVEIYMVVDRTGSMAAEDYQGKGP-EGVDQSASTRLDGVRSDMRAIR 126 Query: 211 DVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTKSTPGLEYAYNKIFDA 269 D R ++ + + PL + I + + LE A + + Sbjct: 127 DAFPDSRFSIIALDNTAARELPLTHDTNAVDAWIGSFKQEVTGHATGSSLEVALPLLGQS 186 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAA 329 + + + + +DGE + + + +L A+ G ++ + Sbjct: 187 LAQSRQSDPNNI---RLVYIFSDGEATD---EGRGAL----AAESAGVSWQSLADIVDGG 236 Query: 330 DQF 332 Sbjct: 237 AVL 239 >gi|168208905|ref|ZP_02634530.1| von Willebrand factor type A domain protein [Clostridium perfringens B str. ATCC 3626] gi|170712845|gb|EDT25027.1| von Willebrand factor type A domain protein [Clostridium perfringens B str. ATCC 3626] Length = 620 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 61/152 (40%), Gaps = 27/152 (17%) Query: 155 ITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 IT +VK K +D+++++D S SM + D+L + S+ + + I +IP+ Sbjct: 72 ITLTVKGKPKKVTKPVDILLIMDASNSMYYNM----DELKASMNSLVDKV--IDNIPNS- 124 Query: 214 NVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLIFG-------STTKSTPGLEYAYN 264 R +V F +++ + F + +E N + T A Sbjct: 125 ---RIAVVAFGTEVEEVFSFSNKNNFTSKEEYKNAIKDSYYYITGRGNTNIEGTWRVA-- 179 Query: 265 KIFDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 E ++ + + KK +IF +DG + Sbjct: 180 -----DEIFKNELNNNSNSKKDVIFFSDGYPN 206 >gi|156349150|ref|XP_001621939.1| predicted protein [Nematostella vectensis] gi|156208297|gb|EDO29839.1| predicted protein [Nematostella vectensis] Length = 147 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 14/105 (13%), Positives = 37/105 (35%), Gaps = 16/105 (15%) Query: 236 GVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGE 294 + + ++ + T+ L+ +F+ + + ++ +TDG Sbjct: 34 DLNSAKFAVDAIQISNGGTRIGEALKLVKTDMFNTAR---------SNVPRMLLVMTDGR 84 Query: 295 NSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 +S + +L + G + +G+ ++ LK AS Sbjct: 85 SSDDVVAPSRAL------RDIGVTILTLGLGSDYDLDQLKMIASG 123 >gi|116252902|ref|YP_768740.1| hypothetical protein RL3160 [Rhizobium leguminosarum bv. viciae 3841] gi|115257550|emb|CAK08647.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 258 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 69/194 (35%), Gaps = 18/194 (9%) Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVL--DVSLSMNDHFGPGMDKLGVATRSIREM 202 + + ++ V I+ +D+ +VL D S SM+ + V +E Sbjct: 1 MLTTLAVLMGLSGLVPIAQAGGSEVDVTLVLAVDTSRSMDFEEIGIQREGYVEALKHKEF 60 Query: 203 LDIIKSIPDVNNVVRSGLVTF---SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGL 259 +D +K +++ + +VQ + W Q I+ + + + F ++ P Sbjct: 61 IDAVKGGLTGRI-----AISYFEWAGYVVQDSVIDW--QVIETEEDAIAFADKLEARPIA 113 Query: 260 EYAYNKIFDAKEKLEH--IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 I A + ++ ++ I DG N+S D ++A G Sbjct: 114 TQRRTSISTAIAQGASMIVSSPFQSRRQVIDVSGDGPNNSG--DPVTPAR--DKAVEAGM 169 Query: 318 IVYAIGVQAEAADQ 331 I+ + + +D Sbjct: 170 IINGLAIMLRPSDA 183 >gi|92113590|ref|YP_573518.1| von Willebrand factor, type A [Chromohalobacter salexigens DSM 3043] gi|91796680|gb|ABE58819.1| von Willebrand factor, type A [Chromohalobacter salexigens DSM 3043] Length = 596 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 63/210 (30%), Gaps = 40/210 (19%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ M+ DVS SM + + + + S+ VR + Sbjct: 29 DVRMIFDVSGSMKANDPANLRASALQLAAALLPSQARGSVWTFGTQVR------NPLPDG 82 Query: 230 TFPLAWGVQHIQ---EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKY 286 W + + ++ F T L A K++ Sbjct: 83 KVDAEWRRRARSLSPQLVDYQQF---TDIEQALREASQAAGG---------------KRH 124 Query: 287 IIFLTDGENSSPN---------IDNKESL--FYCNEAKRRGAIVYAIGVQAEAADQFLKN 335 +I LTDG P ++E+L E + +V+ I + L+ Sbjct: 125 VILLTDGMVDLPGSGEVKRKRDAASRETLIASLAPELATQDVVVHTIALSRNVDRDLLER 184 Query: 336 C--ASPDRFYSVQNSRKLHDAFLRIGKEMV 363 ++ + +L AFL + + +V Sbjct: 185 VSQSTDGLAAVAETPEELLRAFLDVLERIV 214 >gi|5731352|gb|AAD31497.2|AF138804_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 865 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 33/148 (22%), Positives = 66/148 (44%), Gaps = 12/148 (8%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPD 211 + ++K++ K D G D+M +LDVS M D F DK+ ++ + + + Sbjct: 191 TLDVTLKVTPKEIDEGADVMALLDVSQKMTDADFKNAKDKIKKLVTTLTSKSNSDEHKHN 250 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQ-HIQEKINRLIFGSTTKSTPG--LEYAYNKIFD 268 N VR L+TF +I ++ + + + +N L + G L+ A +K + Sbjct: 251 SRNSVR--LMTFYREISDPIDISGKTEAELDQLLNELREKAKANYDWGVDLQGAIHKTRE 308 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENS 296 K + K +++I+ + GE++ Sbjct: 309 IFNKEQKSKK-----RQHIVLFSQGEST 331 >gi|296273948|ref|YP_003656579.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296098122|gb|ADG94072.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 592 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 68/190 (35%), Gaps = 22/190 (11%) Query: 138 IFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATR 197 I F + +T+ K S + + +++ +D S SM +L +A + Sbjct: 56 ILLFFALISMVIALARPVTNE-KEQSINQEVIPVVVAIDASKSMLAQDVFP-SRLKMAKK 113 Query: 198 SIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTP 257 ++LD+IK+ ++ G++ F PL + + + + Sbjct: 114 ---KVLDLIKAPNQLS----VGVIIFGQSSFILSPLTNDFTSLDFLLQNFDYNLNINNGS 166 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 IF A E + K + K IIFLTDG N D K+ + Y N+ Sbjct: 167 -------NIFSALEASNKLLKNYK--SKNIIFLTDGGNEG---DYKKEIEYANK-NNLNI 213 Query: 318 IVYAIGVQAE 327 + I Sbjct: 214 YIITIATNKP 223 >gi|291452540|ref|ZP_06591930.1| predicted protein [Streptomyces albus J1074] gi|291355489|gb|EFE82391.1| predicted protein [Streptomyces albus J1074] Length = 591 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 66/192 (34%), Gaps = 30/192 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM ++ G + + + P V+ V FS+ I Sbjct: 404 VYLVLDRSGSMRGYYRDG--SAQCLGEQVLALATHLD--PAGAPAVQV--VFFSTDIDGK 457 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L Q + +I+ L A ++ H A G + +IF Sbjct: 458 GELT--PQAYEGRIDALHAECGRMGRTSYHRAIEEVT-----ALHEASGATEPA-LVIFQ 509 Query: 291 TDGENSSPNIDNKESLFYCNEAKRR--GAIVY--AIGVQAE--AADQFLKNCASP--DRF 342 TDG ++ +A + GA ++ +G ++L+ +P F Sbjct: 510 TDG--------PPDAKTAATQALKAVSGAPLFWQFVGFGESEHKNFEYLRKLNTPHTGYF 561 Query: 343 YSVQNSRKLHDA 354 + R+L DA Sbjct: 562 PAGPAPRELTDA 573 >gi|239980680|ref|ZP_04703204.1| hypothetical protein SalbJ_14645 [Streptomyces albus J1074] Length = 530 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 66/192 (34%), Gaps = 30/192 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 + +VLD S SM ++ G + + + P V+ V FS+ I Sbjct: 343 VYLVLDRSGSMRGYYRDG--SAQCLGEQVLALATHLD--PAGAPAVQV--VFFSTDIDGK 396 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 L Q + +I+ L A ++ H A G + +IF Sbjct: 397 GELT--PQAYEGRIDALHAECGRMGRTSYHRAIEEVT-----ALHEASGATEPA-LVIFQ 448 Query: 291 TDGENSSPNIDNKESLFYCNEAKRR--GAIVY--AIGVQAE--AADQFLKNCASP--DRF 342 TDG ++ +A + GA ++ +G ++L+ +P F Sbjct: 449 TDG--------PPDAKTAATQALKAVSGAPLFWQFVGFGESEHKNFEYLRKLNTPHTGYF 500 Query: 343 YSVQNSRKLHDA 354 + R+L DA Sbjct: 501 PAGPAPRELTDA 512 >gi|260826343|ref|XP_002608125.1| hypothetical protein BRAFLDRAFT_91397 [Branchiostoma floridae] gi|229293475|gb|EEN64135.1| hypothetical protein BRAFLDRAFT_91397 [Branchiostoma floridae] Length = 1803 Score = 40.6 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 63/188 (33%), Gaps = 32/188 (17%) Query: 147 NSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII 206 S T + + ++S +D++ LD S S+ + ++ +L+ Sbjct: 24 VMSAWEAASTLNACLPTRSG-AVDIIFALDRSGSVGRSNYDKI------IDFVKAVLNHF 76 Query: 207 KSIPDVNNVVRSGLVTFSSKIVQTFPL----AWGVQH---IQEKINRLIFGSTTKSTPGL 259 P R +V+F + F L + ++ + +L + Sbjct: 77 SVSP---TTTRVAVVSFGTSARVEFDLLRSSSNDNNKCELLRTHLPKLSYTGG------- 126 Query: 260 EYAYNKIFDAKEKLEHIAKGHDD--YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 A N + + L + K + +TDG + + + E + RG Sbjct: 127 --ATNTVGALRLALALLKNPGVRSYSTKVVFTITDGYWNRGG----DPAYVVRELQSRGV 180 Query: 318 IVYAIGVQ 325 I++A G+ Sbjct: 181 IMFAFGIG 188 >gi|322788194|gb|EFZ13976.1| hypothetical protein SINV_08974 [Solenopsis invicta] Length = 1955 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 61/189 (32%), Gaps = 14/189 (7%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 M++++D S SM A D + + + + F + ++Q Sbjct: 1 MVILMDASGSMKGMENTIAKTTVSAILDTLSNNDFVAFLNFSKEATET-VPCFKNMLIQA 59 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + +++ +++ T+ A++ + +E + I+ + Sbjct: 60 TPE--NLDTLKKSMDKFQINGTSDLPAAFTKAFSLLETYRETRGCDV--DLPCNQLIMLI 115 Query: 291 TDGENSSPNIDNKESLFYCNEAKRRG----AIVYAI-----GVQAEAADQFLKNCASPDR 341 TD +N + +F K V+ + +++C + Sbjct: 116 TDNVPGGTLGNNLKEVFKKWNWKENSTHVPVRVFTYLIGKEATMTDDVQWMVRSCLNRGD 175 Query: 342 FYSVQNSRK 350 Y+V+ + Sbjct: 176 CYNVRTLEE 184 >gi|302520599|ref|ZP_07272941.1| toxic cation resistance protein [Streptomyces sp. SPB78] gi|302429494|gb|EFL01310.1| toxic cation resistance protein [Streptomyces sp. SPB78] Length = 345 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 28/129 (21%), Positives = 45/129 (34%), Gaps = 16/129 (12%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + + +VLD S SM ++ + +++ E + + + D VVR LV FS Sbjct: 156 TGTRARVYLVLDRSGSMRPYY------KDGSAQALGEQVLALAAHLDTEAVVR--LVFFS 207 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + I L I+ L G+ A E AK Sbjct: 208 TAIDAMGTLTLDAYE--GVIDGLHEGAGRMGRTNYALAIE------EVRALHAKEAAGEP 259 Query: 285 KYIIFLTDG 293 ++F TDG Sbjct: 260 GLVVFQTDG 268 >gi|94991307|ref|YP_599407.1| serum opacity factor [Streptococcus pyogenes MGAS10270] gi|94544815|gb|ABF34863.1| Fibronectin-binding protein [Streptococcus pyogenes MGAS10270] Length = 1007 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 95/258 (36%), Gaps = 28/258 (10%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIW-QTDFRNELRENGFAQDINN 107 + + T + + + + + K + + + +++ + ++ Sbjct: 117 QAVTSSTSPSTPAAASSNGSNQEASAETEPQTMEVEKYTVDKENSKLNIKDGKTPKTGSS 176 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS-D 166 + +I + K ++ V+R ++ I +V + K D Sbjct: 177 VNNEKDTKLIRNRDGKQRDIVDVTR-------------TVKTNEDGTIDVTVTVKPKQID 223 Query: 167 IGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIP----DVNNVVRSGLV 221 G D+M +LDVS M+ D F +K+ +++ + N VR L+ Sbjct: 224 EGADVMALLDVSKKMSEDDFNNAKNKIKKLVKTLTSKSASNSDNDEHKYNSRNSVR--LM 281 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN---KIFDAKEKLEHIAK 278 TF +I ++ +E++++L+ K+ ++ + I A+E + Sbjct: 282 TFYREISNPIDIS---GKTEEQLDKLLDDLRKKAKANYDWGVDLQGAIHKAREIFNKEKE 338 Query: 279 GHDDYKKYIIFLTDGENS 296 +++I+ + GE++ Sbjct: 339 KKFGKRQHIVLFSQGEST 356 >gi|118388807|ref|XP_001027499.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89309269|gb|EAS07257.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 1029 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 94/304 (30%), Gaps = 44/304 (14%) Query: 36 IETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIK--NIWQTDFR 93 I+ H + K L T K L+ N ++ + + + Sbjct: 47 IQILHISLEENKFITPNKQQSLQTDKKSLDNSVTKNQTPEQLQSDKKQNNLITVPKELED 106 Query: 94 NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL 153 E+++N Q ++ ++ L + I D K + F + L Sbjct: 107 EEVKQNLQVQPPKDVVINSQLPVKIQDLIKSHYNQCQESAN---FISNFYQLSEDMKQFL 163 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 I ++ + LD+M ++D + SM+ D++ +I+ D+ K D Sbjct: 164 PIYQNILP----RMELDLMFIMDCTGSMSSWIQAVKDEILCIIATIK---DVNKGYRDYG 216 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 R + FS + + V I + G ++A Sbjct: 217 VSQRYSIFNFSDDMDEFEKFLINVNAIAN------SDTPEDVAGGFKHA----------- 259 Query: 274 EHIAKGHDDYKKYIIFLTD----G--ENSSPNI-------DNKESLFYCNEAKRRGAIVY 320 + KY +F+ D G N+ + D + ++G +Y Sbjct: 260 --NLQQWKSQAKYAVFIADCPAHGKEYNNDYSDRYPDGDPDGIDLKQEFKNLIKKGVKLY 317 Query: 321 AIGV 324 AI + Sbjct: 318 AIQI 321 >gi|325981556|ref|YP_004293958.1| von Willebrand factor type A [Nitrosomonas sp. AL212] gi|325531075|gb|ADZ25796.1| von Willebrand factor type A [Nitrosomonas sp. AL212] Length = 775 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 72/204 (35%), Gaps = 24/204 (11%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVN 213 + S + + ++++LD+S S N+ +D AT + ++ I ++ Sbjct: 573 IMMRSVRKVRDISVLVLLDLSESTNEKVAGQDYSVLDLTRQATVLLANAINKIGDPFAIH 632 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHI-QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 G + + I + ++ + +T+ + +A Sbjct: 633 GFCSDG--RHDVEYYRFKDFDQPYNEIPKARLAGMTGQLSTRMGAAMRHAT--------- 681 Query: 273 LEHIAKGHDDYKKYIIFLTDGENS-----SPNIDNKESLFYCNEAKRRGAIVYAIGVQAE 327 H K KK ++ +TDGE + P ++ EA R G + Y + + Sbjct: 682 --HYLKLQKSAKKLLLVITDGEPADVDVRDPQYLRHDTKKAVEEAGRSGILTYCMSLD-P 738 Query: 328 AADQFLKNCASPDRFYSVQNSRKL 351 ADQ++ + V + +L Sbjct: 739 RADQYVSRIFGERNYLVVDHVERL 762 >gi|226358202|ref|YP_002787941.1| magnesium chelatase, chlD subunit [Deinococcus deserti VCD115] gi|226319845|gb|ACO47839.1| putative magnesium chelatase, chlD subunit [Deinococcus deserti VCD115] Length = 589 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 44/153 (28%), Gaps = 28/153 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G ++ V D S SM ++G ++ +L+ R L+TF Sbjct: 418 EKRGGRRVLFVADTSGSMG-----AQGRMGAVKGAMLAVLE------QQARRDRVALITF 466 Query: 224 -SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 ++ V+ + I G T L A + Sbjct: 467 RATGAVRALEWTADATLAEAAITAAPTGGRTPLAHALVLAREVLATEPGAE--------- 517 Query: 283 YKKYIIFLTDGENS---SPNIDNKESLFYCNEA 312 ++ TDG + SP+ D A Sbjct: 518 ----LVLFTDGRANVALSPSGDAWADALEAARA 546 >gi|160716|gb|AAA29773.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 568 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDVYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPFGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES RG + G+ ++FL C D Sbjct: 170 DSLKESR----NLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|5748802|gb|AAD42206.2|AF139752_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 872 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 33/148 (22%), Positives = 66/148 (44%), Gaps = 12/148 (8%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPD 211 + ++K++ K D G D+M +LDVS M D F DK+ ++ + + + Sbjct: 198 TLDVTLKVTPKEIDEGADVMALLDVSQKMTDADFKNAKDKIKKLVTTLTSKSNSDEHKHN 257 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQ-HIQEKINRLIFGSTTKSTPG--LEYAYNKIFD 268 N VR L+TF +I ++ + + + +N L + G L+ A +K + Sbjct: 258 SRNSVR--LMTFYREISDPIDISGKTEAELDQLLNELREKAKANYDWGVDLQGAIHKTRE 315 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENS 296 K + K +++I+ + GE++ Sbjct: 316 IFNKEQKSKK-----RQHIVLFSQGEST 338 >gi|315573470|gb|EFU85661.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309B] Length = 659 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|296816633|ref|XP_002848653.1| von Willebrand factor type A domain-containing protein [Arthroderma otae CBS 113480] gi|238839106|gb|EEQ28768.1| von Willebrand factor type A domain-containing protein [Arthroderma otae CBS 113480] Length = 1002 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 65/231 (28%), Gaps = 36/231 (15%) Query: 126 NLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHF 185 SA R + A P + +++ ++D S SM Sbjct: 283 IASAKPRASLETHPSIEGHSAIMIEIPPDFMLESQEPVDDK---EIIFLVDRSGSMAGKI 339 Query: 186 GPGMDKLGVATRS-----IREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHI 240 + + RS + + S + R L + ++ Sbjct: 340 HGLISSMQFYLRSLPMSTLFNICSFGSSYQLLWEQSR---------AYSEITLNEALYYV 390 Query: 241 QEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNI 300 + L T P L EH+ + K II LTDGE Sbjct: 391 SSFSSNL---GGTDLLPAL--------------EHVVLQQNHSSKDIIVLTDGEVWRLE- 432 Query: 301 DNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA-SPDRFYSVQNSRK 350 + + + ++ +A+G+ + + ++ A S + + + Sbjct: 433 ETIRFVRLTHIVSKKAIRFFALGIGNAVSHELVEGIANSGGGYAEIIPATS 483 >gi|189201073|ref|XP_001936873.1| von Willebrand domain containing protein [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187983972|gb|EDU49460.1| von Willebrand domain containing protein [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 933 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 74/196 (37%), Gaps = 33/196 (16%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSI-REMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 ++ ++D S SM+ + L V +SI + I S ++ + S ++ + Sbjct: 294 IIFIVDRSGSMSHQIPTLVSALKVFLKSIPVGCMFNICSFGSSHSCLWSESKGYNQET-- 351 Query: 230 TFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 ++ ++ T++ ++ + + +H + ++ Sbjct: 352 -------LEEAINCVDAFQADMGGTETLAAVQSCFKM------RNKHCSTE-------MV 391 Query: 289 FLTDGENSSPNIDNKESLF--YCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYS-- 344 LTDG NI ++ LF E K R V+ IG+ + + ++ A ++ Sbjct: 392 LLTDG-----NIWAQQQLFNYIIEETKSRDVRVFPIGIGGQVSSALIEGVARAGGGFAEM 446 Query: 345 VQNSRKLHDAFLRIGK 360 V + KL +RI K Sbjct: 447 VAENEKLDRKIIRILK 462 >gi|315166935|gb|EFU10952.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1341] Length = 718 Score = 40.6 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 44/140 (31%), Gaps = 14/140 (10%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDG 293 + Y + +TDG Sbjct: 188 TNRKT-------YFLLVTDG 200 >gi|302550736|ref|ZP_07303078.1| toxic cation resistance protein [Streptomyces viridochromogenes DSM 40736] gi|302468354|gb|EFL31447.1| toxic cation resistance protein [Streptomyces viridochromogenes DSM 40736] Length = 248 Score = 40.6 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 62/181 (34%), Gaps = 26/181 (14%) Query: 155 ITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVN 213 S+ + + +V+D S SM ++ G + L + LD ++P Sbjct: 29 AGVSLTKHGLDGLRAAVYLVIDHSGSMRPYYKDGSVQALADRVLGLSAHLDDDGTVP--- 85 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 +V FS+ + +A + Q +I+R++ G A + + D Sbjct: 86 ------VVFFSTDVDAETEIA--LADHQGRIDRIVAGLGHMGRTSYHLAMDAVID----- 132 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVY--AIGVQAEAADQ 331 H ++F TDG P + C A+ ++ IG + Q Sbjct: 133 -HYLDSGSKDPALVVFQTDG---GPVSKLAAERYLCKAAR---LPLFWQFIGFGDPGSKQ 185 Query: 332 F 332 F Sbjct: 186 F 186 >gi|149005573|ref|ZP_01829312.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|168482703|ref|ZP_02707655.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|307126680|ref|YP_003878711.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|147762513|gb|EDK69473.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|154432930|gb|ABS82113.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432938|gb|ABS82120.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432946|gb|ABS82127.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432954|gb|ABS82134.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432970|gb|ABS82148.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|172043667|gb|EDT51713.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|306483742|gb|ADM90611.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|332076908|gb|EGI87370.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA17545] gi|332203636|gb|EGJ17703.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA47368] Length = 883 Score = 40.6 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 5/78 (6%) Query: 151 APLLITSSVKI-SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 L ++ + + ++ LD++++LD S SM++ + A + R ++D I S Sbjct: 199 IELTVSGKTTVETKEASTPLDVVILLDNSNSMSNIRHNHAHRAEKAGEATRALVDKITSN 258 Query: 210 PDVNNVVRSGLVTFSSKI 227 PD R LVT+ S I Sbjct: 259 PDN----RVALVTYGSTI 272 >gi|29832956|ref|NP_827590.1| magnesium-chelatase subunit [Streptomyces avermitilis MA-4680] gi|29610077|dbj|BAC74125.1| putative magnesium-chelatase subunit [Streptomyces avermitilis MA-4680] Length = 689 Score = 40.6 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 47/141 (33%), Gaps = 20/141 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++ ++ +L + + G Sbjct: 494 QATREGREGNLVLFVVDASGSMA-----ARQRMSAVKGAVLSLL-----LDAYQRRDKVG 543 Query: 220 LVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK-EKLEHIA 277 LVTF S P V ++ L G T GL A++ + + Sbjct: 544 LVTFRGSAAEVALPPTSSVDAAAVRLESLPTGGRTPLATGLLKAHDVLRVERLRDAARRP 603 Query: 278 KGHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 604 --------LVVVVTDGRATGG 616 >gi|78778675|ref|YP_396787.1| protoporphyrin IX magnesium chelatase subunit ChlD [Prochlorococcus marinus str. MIT 9312] gi|78712174|gb|ABB49351.1| protoporphyrin IX magnesium-chelatase [Prochlorococcus marinus str. MIT 9312] Length = 725 Score = 40.6 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 17/136 (12%), Positives = 50/136 (36%), Gaps = 19/136 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G ++ ++D S SM ++++ A ++ +L ++ + + L+ F Sbjct: 522 QKKAGALVIFLVDASGSMA------LNRMQSAKGAVIRLLT--EAYENRDE---VALIPF 570 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P + + ++ + G L + + + + ++ D Sbjct: 571 RGNQAEVLLPPTRSITAAKRRLETMPCGGG----SPLAHG---LTQSAKVAKNALSTGDI 623 Query: 283 YKKYIIFLTDGENSSP 298 + ++ +TDG + P Sbjct: 624 GQVIVVAITDGRGNVP 639 >gi|217976668|ref|YP_002360815.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217502044|gb|ACK49453.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 342 Score = 40.6 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 69/199 (34%), Gaps = 36/199 (18%) Query: 144 WCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN----DHFGPGMDKLGVATRSI 199 W ++ L ++ ++ G+D+++V+D++ SMN G + +L S+ Sbjct: 19 WLLCATAFLLAVSFALPSFRSKANGVDLLVVVDITGSMNTRDYRENGHPVSRLEHVKASL 78 Query: 200 REMLDIIKSIPDVNNVVRSGLVTFSSKI--VQTFPLAWGVQHIQEKINRLIFG------- 250 R +L P++ R GL F+ + + P+ V + I G Sbjct: 79 RSLL------PELPCPARVGLAIFTERQPFLLFEPI--DVCGAFAPVEGAIAGLDWRMAW 130 Query: 251 -STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 + GL A + D ++F TDG+ + P + F Sbjct: 131 EGDSHIAAGLYRAIDIAHDVNAD--------------LVFFTDGQEAPPLPASGGPEFTG 176 Query: 310 NEAKRRGAIVYAIGVQAEA 328 +G IV G Sbjct: 177 ERGAVKGLIVGTGGFALSP 195 >gi|3142286|gb|AAC18657.1| thrombospondin related adhesive protein [Plasmodium falciparum] Length = 562 Score = 40.6 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNENAIHLYANIFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|307304370|ref|ZP_07584121.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|306902572|gb|EFN33166.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 431 Score = 40.6 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 57/172 (33%), Gaps = 4/172 (2%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R F G+++++ A+ PV+ MGL +ET + + K KL + D S A + Sbjct: 22 RRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQHAADVSAYAAAVRHRAG 81 Query: 67 ENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYN 126 + + + + + G + + T + Sbjct: 82 DQQSALEAAARRVAGGSGFSPGGLTVSTAPGSAGGSNKVTVELTETHPRMF-SSVFGTGT 140 Query: 127 LSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVS 178 ++ +R T A + +V ++ +++ L V+ S Sbjct: 141 ITMKAR---AVAQVTGGSKACVLALSNSASGAVTVTGSTEVLLSGCSVVSNS 189 >gi|313216074|emb|CBY37452.1| unnamed protein product [Oikopleura dioica] Length = 392 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 74/199 (37%), Gaps = 33/199 (16%) Query: 162 SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + D D++ V+D S S + + L +++ I+ P +N VR G+V Sbjct: 41 NPQLDANYDLVFVVDRSDSTPEEY------LDAYKSFMKK---IVIERPVSDNNVRVGIV 91 Query: 222 TFSSKIVQTFPL--AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 T++S + L + I I+ L + + T A + + D ++ Sbjct: 92 TYASDVEFEVNLEDSNSQADILNIIDNLRIYGSGRKT---ANAIDFLLDQVDRKGREHIP 148 Query: 280 HDDYKKYIIFLTDGENSSPNIDN------KESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 I LT G + + ++ +CN Y+IGV +++ + Sbjct: 149 MVS-----ILLTSGISDEDRLKVILAGWRVQAATWCNA--------YSIGVGPFISEEEM 195 Query: 334 KNCASPDRFYSVQNSRKLH 352 + + Y + +S L Sbjct: 196 FLLSGDLQHYQLISSPDLL 214 >gi|311064048|ref|YP_003970773.1| hypothetical protein BBPR_0642 [Bifidobacterium bifidum PRL2010] gi|310866367|gb|ADP35736.1| Conserved hypothetical membrane spanning protein with a von Willebrand factor type A domain [Bifidobacterium bifidum PRL2010] Length = 342 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 73/202 (36%), Gaps = 32/202 (15%) Query: 141 TFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSM---NDHFGPG--MDKLGVA 195 F A P ++T +S++ D+M+ +DV+ SM + +G + +L A Sbjct: 53 MFLLAAVMMLTPSIVT---TTTSRAINATDVMVAVDVTGSMAVKDAEYGSSGTISRLDAA 109 Query: 196 TRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL-IFGSTTK 254 R + + +R G + PL I + L + ++T Sbjct: 110 KRIAKGITSTY--ADSSFAALRFGASS-----TLDVPLTPDSIAIDGWADTLAVESTSTS 162 Query: 255 STPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF--LTDGENSSPNIDNKES--LFYCN 310 + L+ +++ L + I+ +TDGE +S S Y + Sbjct: 163 AGSSLDTPLDQL-----MLSLKSIRDQHPDDIIVLYVITDGEQTSDTARRSYSALRRYLD 217 Query: 311 EAKRRGAIVYAIGVQAEAADQF 332 ++ + IGV ++A + Sbjct: 218 DS-------FTIGVGSDAGGKI 232 >gi|257083695|ref|ZP_05578056.1| conserved hypothetical protein [Enterococcus faecalis Fly1] gi|256991725|gb|EEU79027.1| conserved hypothetical protein [Enterococcus faecalis Fly1] Length = 809 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 48/138 (34%), Gaps = 18/138 (13%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIRE----MLDIIKS-----IPDVNNVVRS 218 +D+++V D S S +D+F + + + + ML + PD + Sbjct: 75 PVDLVVVEDASGSFSDNFPMVRSAIDEVVQGLTDKDRVMLTSYRGGRQFMFPDGTTKIND 134 Query: 219 GLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 G + ++ PL + + T + G++ A ++ L + Sbjct: 135 GDKELNVRVN--TPLTFDKSQFVSNFGDVRTYGGTPTAQGMKLALDEYNATHGDLTNRKT 192 Query: 279 GHDDYKKYIIFLTDGENS 296 Y + +TDG + Sbjct: 193 -------YFLLVTDGVAN 203 >gi|239979671|ref|ZP_04702195.1| hypothetical protein SalbJ_09552 [Streptomyces albus J1074] Length = 248 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 49/180 (27%), Gaps = 23/180 (12%) Query: 135 MPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGV 194 MP P ++ + + + + +VLD S SM ++ G + Sbjct: 1 MPISLDKIPPGLVDLTKTAAVSLRKRGLADRGLRAAVYLVLDRSGSMRPYYRDGTVQHLA 60 Query: 195 ATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH--IQEKINRLIFGST 252 V V F + ++ +QE R T Sbjct: 61 EQALALAAHLDDDGTVPV--------VFFDTDAHPATEVSLDAYEGRVQELHERYGHMGT 112 Query: 253 TKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA 312 T + E +EH ++IF TDG P+ + C A Sbjct: 113 TNYAAAM----------LEVIEHYTATGATAPAFVIFQTDG---GPDAKREAERVLCRAA 159 >gi|262201266|ref|YP_003272474.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262084613|gb|ACY20581.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 594 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 64/213 (30%), Gaps = 32/213 (15%) Query: 166 DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT-FS 224 + + ++V D S SM + A I + ++ P R+ +V ++ Sbjct: 382 GVPIRTLVVEDTSGSME----TPVGNTTRAGLLIDASMTGLEMFP------RNAMVGGWA 431 Query: 225 SKIVQTFPL-AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHI---AKGH 280 I + P W +++ T L A + Sbjct: 432 FGIDKGGPDQDWTEMAPIRRLDAPSGSGGTH-REALARAVREGLAPARLGGGTGLYDTTL 490 Query: 281 DDYKKY-----------IIFLTDGENSSPNIDNKESLFYCNEAK---RRGAIVYAIGVQA 326 +KK +I +TDG+N P L + R ++ IG+ Sbjct: 491 AAFKKVQSTYDPNYSNSVIIMTDGQNEDPGSITLTELLAELKELEDPARPVLILTIGISE 550 Query: 327 EAADQFLKNC--ASPDRFYSVQNSRKLHDAFLR 357 +A L+ A+ Y + + + F Sbjct: 551 DADTNALRQIAQATGGTTYVAKTAADIKQVFTN 583 >gi|29375367|ref|NP_814521.1| anchor protein, putative [Enterococcus faecalis V583] gi|257418380|ref|ZP_05595374.1| predicted protein [Enterococcus faecalis T11] gi|29342827|gb|AAO80591.1| gram positive anchor protein, putative [Enterococcus faecalis V583] gi|257160208|gb|EEU90168.1| predicted protein [Enterococcus faecalis T11] Length = 962 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|268561242|ref|XP_002646398.1| Hypothetical protein CBG15368 [Caenorhabditis briggsae] Length = 377 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 35/169 (20%), Positives = 62/169 (36%), Gaps = 13/169 (7%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 S++ LD++ V+D S M + + + I + + R GLVT+S Sbjct: 30 SNLWLDVVAVVDNSQGMTNDGLTAV--------FVFSEGTKIGTNSNDPRTTRLGLVTYS 81 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 SK + L Q I + + + T S + A+E LE + Sbjct: 82 SKATKNAYLD-KFQSIDDLYDNIFTDLATVSQTDDSNLETGLEAAEEILEAGKNEKRKFY 140 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL 333 K +I + S+ N + E++ N K G + + D+ L Sbjct: 141 KKLILI---YASTFNRNG-EAISIANRLKSAGTKLVTVAYDQGGGDEQL 185 >gi|255950230|ref|XP_002565882.1| Pc22g19800 [Penicillium chrysogenum Wisconsin 54-1255] gi|211592899|emb|CAP99268.1| Pc22g19800 [Penicillium chrysogenum Wisconsin 54-1255] Length = 896 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 43/273 (15%), Positives = 94/273 (34%), Gaps = 57/273 (20%) Query: 96 LRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS-RYEMPFIFC-TFPWCANSSHAPL 153 + E+ F + +I+ +++I D + ++ + + ++PF F T P N + Sbjct: 210 MPESTFESNYASIKLRENVTIDED-----FVITVNADKQDLPFAFLETHPTLPNQKALMV 264 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 + + +++ V+D S SM D+ ++ L + N Sbjct: 265 SLVPKFSLPPDLS---EIVFVVDRSGSMTDNMH-------TLRSALGLFLKSLPLGVPFN 314 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQ---------EKINRLIFG-STTKSTPGLEYAY 263 L++F S +F W + + + T+ GLE A Sbjct: 315 ------LISFGS----SFEAIWARSKVSTRESLEEALQHTKNIQADLGGTEILSGLEAAV 364 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-KRRGAIVYAI 322 K + K ++ LTDGE + E N+A ++ + + Sbjct: 365 EKRYQDKVLE-------------VLVLTDGE----VWNQSEVFDLVNQANQQHSTRFFTL 407 Query: 323 GVQAEAADQFLKNCASPDRFY--SVQNSRKLHD 353 G+ + + + + + +V N+ L+ Sbjct: 408 GLGDSVSHSLINGISRAGKGFTQTVLNNEDLNK 440 >gi|111022920|ref|YP_705892.1| hypothetical protein RHA1_ro05957 [Rhodococcus jostii RHA1] gi|110822450|gb|ABG97734.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 548 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 67/203 (33%), Gaps = 21/203 (10%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS- 225 + + ++ +DVS SM G ++ + + L PD V +GL FS Sbjct: 343 LPIRTLVAIDVSGSMETPAGNR-SRMDLTVDAA---LAGNAMFPDS---VSAGLWAFSQG 395 Query: 226 ------KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + P+ + R + GL ++D + Sbjct: 396 LGGGPQDYKEMVPIRRYDTVVDGLTQRQLMAQQAGKVDGLIGGGTGLYDTTLAAFRTVQE 455 Query: 280 HDDYKKY--IIFLTDGENSSPNIDNKESLFYC---NEAKRRGAIVYAIGVQAEAADQFLK 334 D + +I LTDG N P+ KE L R I+ IG+ +A L Sbjct: 456 TYDPRAVNSVIILTDGANEDPDSITKEQLLSILQRETDPARPVIIVTIGITGDADAATLA 515 Query: 335 NCA--SPDRFYSVQNSRKLHDAF 355 + + Y ++ + + F Sbjct: 516 EISRVTGGSSYVAKDPADIANVF 538 >gi|328906008|gb|EGG25783.1| von Willebrand factor type A [Propionibacterium sp. P08] Length = 316 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 56/173 (32%), Gaps = 21/173 (12%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S M + ++ S + SS D D+++++D + SM G Sbjct: 32 SGVAMTILDILLAVLVGLIGLHPVVGESGRQSSSVDA--DIVIMIDTTTSMAARDNGGKT 89 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--- 247 +L A + ++ V R LVTF + + I N L Sbjct: 90 RLSAAVH-------DVAALAAVFGGARFTLVTFDNDARVSVASTTDTGTIVSAANALTWR 142 Query: 248 --IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 G+ T + + + A++ + + +L DGE ++ Sbjct: 143 EDTKGTGTDVSVAVPTVVRVLQKARQASPQA-------TRMVFYLGDGEQTAT 188 >gi|297198775|ref|ZP_06916172.1| magnesium chelatase ATPase subunit D [Streptomyces sviceus ATCC 29083] gi|297147205|gb|EDY61425.2| magnesium chelatase ATPase subunit D [Streptomyces sviceus ATCC 29083] Length = 670 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 47/141 (33%), Gaps = 20/141 (14%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + G ++ V+D S SM ++ ++ +L + + G Sbjct: 478 QATREGREGNLVLFVVDASGSMA-----ARQRMSAVKGAVLSLL-----LDAYQRRDKVG 527 Query: 220 LVTF-SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK-EKLEHIA 277 LVTF S P V ++ L G T GL A+ + + A Sbjct: 528 LVTFRGSAAEVALPPTSSVDAAAARLETLPTGGRTPLAAGLLRAHEVLRVERLRDPARRA 587 Query: 278 KGHDDYKKYIIFLTDGENSSP 298 ++ +TDG + Sbjct: 588 --------LVVVVTDGRATGG 600 >gi|188026532|ref|ZP_02962486.2| hypothetical protein PROSTU_04608 [Providencia stuartii ATCC 25827] gi|188019325|gb|EDU57365.1| hypothetical protein PROSTU_04608 [Providencia stuartii ATCC 25827] Length = 197 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 63/182 (34%), Gaps = 12/182 (6%) Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGS 251 + ++ +L ++ P ++TF+S Q PL I + L Sbjct: 6 IEAVKNGVQTLLSTLRQDPYALETAHVSIITFNSTAQQIVPLT---DLINFSLPDLQASG 62 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT L + I + ++ +KG +I +TDG +P D K L Sbjct: 63 TTALGDALSVVAHCIENEVQRTTVESKGDWRPLVFI--MTDG---APTDDWKAGLNKFKA 117 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMV--KQRILY 369 A+ G ++ A A + LK +S + F + + Q++ Sbjct: 118 ART-GLVI-ACAAGQSAQTKVLKEITEVVLQLDTADSTTIKSFFKWVSASISVGSQKVDL 175 Query: 370 NK 371 +K Sbjct: 176 SK 177 >gi|163744007|ref|ZP_02151375.1| NorD Nitric oxide reductase activation protein [Phaeobacter gallaeciensis 2.10] gi|161382695|gb|EDQ07096.1| NorD Nitric oxide reductase activation protein [Phaeobacter gallaeciensis 2.10] Length = 645 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 71/221 (32%), Gaps = 36/221 (16%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + + L + ++D S S + +A S+ E+ Sbjct: 445 QSARQQQRDLSVSFLIDTSRS---------TEAAIADSSVIEIARNA-----------MA 484 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEK---------INRLIFGSTTKSTPGLEYA-YNKIFDA 269 + F + WG ++ + + + T + L+ Y ++ A Sbjct: 485 ALAFGIDVAGDRLAIWGFSSLRRDRVFLTRCKGFDMPMSDAVTANIGALQPGHYTRLGAA 544 Query: 270 KEKLEHIAKGHDDYKKYIIFLTDGENSS-----PNIDNKESLFYCNEAKRRGAIVYAIGV 324 + +K +I LTDG+ + ++S A+ G ++ I + Sbjct: 545 IRHVSTQLAAEPSSRKLLIVLTDGKPNDLDHYEGQHGIEDSRMAVRSARSAGQSLHGIII 604 Query: 325 QAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKEMVKQ 365 + D F + F + N +L A I + + ++ Sbjct: 605 DEDGQDWFARIFGRGG-FSLLPNPARLSRALPDIYRSLTQE 644 >gi|119510959|ref|ZP_01630081.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119464398|gb|EAW45313.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 464 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 55/196 (28%), Gaps = 35/196 (17%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVV- 216 K + V+D S SM + G + + ++V Sbjct: 33 PTKDVAAIPPSTTFTFVIDTSGSMYEIVAGDTTPTGKTYTVDGKEYTQVTGGKSKIDIVS 92 Query: 217 ----------------RSGLVTFSSKIVQTFPLA--WGVQHIQEKINRL-IFGSTTKSTP 257 R +V F Q L + +++ I +L F T+ Sbjct: 93 ESLLALIRSGRLGASDRIAIVQFDDTASQIIGLTSATEINKLEDAIAQLRTFSGGTRMGL 152 Query: 258 GLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGA 317 GL A + + + + TDG+ D+ + ++ Sbjct: 153 GLRRALEMLNN-----------QQMTVRRTLLFTDGQ----TFDDDQCRAIASDFATNNI 197 Query: 318 IVYAIGVQAEAADQFL 333 + A+GV + + L Sbjct: 198 PITALGVGEDFNEDLL 213 >gi|313835900|gb|EFS73614.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927102|gb|EFS90933.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314970745|gb|EFT14843.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 316 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 56/173 (32%), Gaps = 21/173 (12%) Query: 131 SRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMD 190 S M + ++ S + SS D D+++++D + SM G Sbjct: 32 SGVAMTILDILLAVLVGLIGLHPVVGESGRQSSSVDA--DIVIMIDTTTSMAARDNGGKT 89 Query: 191 KLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL--- 247 +L A + ++ V R LVTF + + I N L Sbjct: 90 RLSAAVH-------DVAALAAVFGGARFTLVTFDNDARVSVASTTDTGTIVSAANALTWR 142 Query: 248 --IFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 G+ T + + + A++ + + +L DGE ++ Sbjct: 143 EDTKGTGTDVSVAVPTVVRVLQKARQASPQA-------TRMVFYLGDGEQTAT 188 >gi|293387453|ref|ZP_06632005.1| putative gram positive anchor protein [Enterococcus faecalis S613] gi|291083104|gb|EFE20067.1| putative gram positive anchor protein [Enterococcus faecalis S613] Length = 506 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|281208683|gb|EFA82859.1| type A von Willebrand factor domain-containing protein [Polysphondylium pallidum PN500] Length = 2327 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 46/259 (17%), Positives = 92/259 (35%), Gaps = 28/259 (10%) Query: 52 LDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFR---------NELRENGFA 102 LD S + ++ Q+ + + ++ S +K I +++F N ++ N Sbjct: 806 LDISFILPSSISPAQQAVSGAQVTQSSTSTVQVKEIGKSNFTVQIGIEMPYNIVKLNSPT 865 Query: 103 QDINNIERSTSLSIIIDDQHK-DYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKI 161 I + + T ++ +D N + E P+ + NS H ++ K+ Sbjct: 866 HQIRSKKTHTKATVELDRVESLGTNFQLLIGLEDPYSPRMWVEVDNSGHHASMLAFYPKL 925 Query: 162 SSKSDIGLDMM-MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + ++ +VLD+S SM+ D R++R + ++ + N+V+ G Sbjct: 926 DIEHGDQPSIVTIVLDLSASMHG------DPFEDMMRAVRLTITNLRGMNIKFNIVQFGD 979 Query: 221 VTFSSKIVQTFPLAWGVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 + I P +Q IN L T L+ H Sbjct: 980 IFDWLFIEHVPPTEANLQLAWSHINSLRPSYGGTALHLPLQSLILMSDQTPSNRPHN--- 1036 Query: 280 HDDYKKYIIFLTDGENSSP 298 I+ TDG+ ++P Sbjct: 1037 -------IVLFTDGQIANP 1048 >gi|118398621|ref|XP_001031638.1| hypothetical protein TTHERM_00760220 [Tetrahymena thermophila] gi|89285970|gb|EAR83975.1| hypothetical protein TTHERM_00760220 [Tetrahymena thermophila SB210] Length = 326 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 36/97 (37%), Gaps = 18/97 (18%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF----- 223 +D++ VLD + SM +F P +D + +M IK GL + Sbjct: 43 VDILFVLDTTGSMGSYFQPAIDTIKKIVEKFNKMEFNIK----------FGLCAYRDHPP 92 Query: 224 --SSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPG 258 SS + + L +++ + +++L Sbjct: 93 QESSYVTEFTDLTT-SKNLVQVLSKLSAQGGGDGPEA 128 >gi|3273269|dbj|BAA31178.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + L FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPYGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|3273273|dbj|BAA31180.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + L FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPYGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|3273295|dbj|BAA31191.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + L FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ +N + T + L + D ++ + ++ LTDG S Sbjct: 118 IKSLLNTNLPYGRTNLSDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|297171348|gb|ADI22352.1| hypothetical protein [uncultured nuHF2 cluster bacterium HF0500_02A10] Length = 266 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 53/126 (42%), Gaps = 18/126 (14%) Query: 245 NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKE 304 +R+I + T P LE A+ ++ + ++ H +I L+DG+ ++ Sbjct: 1 SRIIASAQTNIYPALEMAFEELSEIDAEVRH-----------VILLSDGQTYPDKYESLV 49 Query: 305 SLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 + AK V ++ V E+ L + A R Y + ++ ++ F++ + Sbjct: 50 TRM----AKDD-ISVSSVAVGQESDRALLADIAEWGNGRSYFILDAARVPQVFIQETQIA 104 Query: 363 VKQRIL 368 +Q ++ Sbjct: 105 SQQTLI 110 >gi|315164636|gb|EFU08653.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX1302] Length = 811 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|293410481|ref|ZP_06654057.1| yehP protein [Escherichia coli B354] gi|291470949|gb|EFF13433.1| yehP protein [Escherichia coli B354] Length = 378 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGVKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|284922107|emb|CBG35188.1| conserved hypothetical protein [Escherichia coli 042] Length = 378 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 63/201 (31%), Gaps = 45/201 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSR 349 ++D + + C + G V + L + A+P Y ++ Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGVKVLGLAA--------LDSTATP--CYDRDMAQ 351 Query: 350 KLHDAFLRIGKEM-VKQRILY 369 L + +I + Sbjct: 352 ALVNVGAQIAAMTPGELATWL 372 >gi|257457979|ref|ZP_05623138.1| conserved hypothetical protein [Treponema vincentii ATCC 35580] gi|257444692|gb|EEV19776.1| conserved hypothetical protein [Treponema vincentii ATCC 35580] Length = 379 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 28/229 (12%), Positives = 63/229 (27%), Gaps = 22/229 (9%) Query: 99 NGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSS 158 G D + R T +I + ++ A + + + ++ Sbjct: 160 TGSFSDNPFVLRVTQKTI-KEKNEPAFSTPATDAFSELARLSGNKLIYSDTVDGIMPAIE 218 Query: 159 VKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS 218 V ++ K LD + V+D + SM D + L D + + + R Sbjct: 219 VILTPKKHKKLDAVFVIDATESMKDDIQKIRELLA----------DSLNKVLPKYDSYRI 268 Query: 219 GLVTFSSKIV-----QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 GLV + + + + + D + Sbjct: 269 GLVLYKDYYDDFLTKTATDFTSDINRFSAALRGFSVFGGRDIPEAVYEGIDAGLDLE--- 325 Query: 274 EHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + D + +I + D + + + AK + ++ I Sbjct: 326 ---WRTDKDTDRKLILIGDAPPHARPQGSITKEAVMDAAKSKDVKLFPI 371 >gi|260794975|ref|XP_002592482.1| hypothetical protein BRAFLDRAFT_68970 [Branchiostoma floridae] gi|229277702|gb|EEN48493.1| hypothetical protein BRAFLDRAFT_68970 [Branchiostoma floridae] Length = 806 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 60/172 (34%), Gaps = 31/172 (18%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +LD S SM+ + + +L +KS+P G + + + + Sbjct: 279 ILDRSGSMSGN---------KIKNARETLLLFLKSLPIGCYFNIVGFGS-THESLFKGSE 328 Query: 234 AWGVQHIQE---KINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + + ++ + ++ T+ L+Y Y + H + + Sbjct: 329 KYDNKSLKTACKALGKMEADLGGTEILQPLQYVYKQ----PPIAGH--------PRQLFL 376 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDR 341 LTDGE +E + + A+ +++G+ A+ +K A R Sbjct: 377 LTDGEVWDTQACVREVAKHADSAR-----CFSVGIGEGASTALVKGVARAGR 423 >gi|3450826|gb|AAC32596.1| serum opacity factor [Streptococcus pyogenes] Length = 1046 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 95/258 (36%), Gaps = 28/258 (10%) Query: 49 HYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIW-QTDFRNELRENGFAQDINN 107 + + T + + + + + K + + + +++ + ++ Sbjct: 117 QAVTSSTSPSTPAAASSNGSNQEASAETEPQTMEVEKYTVDKENSKLNIKDGKTPKTGSS 176 Query: 108 IERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS-D 166 + +I + K ++ V+R ++ I +V + K D Sbjct: 177 VNNEKDTKLIRNRDGKLRDIVDVTR-------------TVKTNEDGTIDVTVTVKPKQID 223 Query: 167 IGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIP----DVNNVVRSGLV 221 G D+M +LDVS M+ D F +K+ +++ + N VR L+ Sbjct: 224 EGADVMALLDVSKKMSEDDFNNAKNKIKKLVKTLTSKSASNSDNDEHKYNSRNSVR--LM 281 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN---KIFDAKEKLEHIAK 278 TF +I ++ +E++++L+ K+ ++ + I A+E + Sbjct: 282 TFYREISNPIDIS---GKTEEQLDKLLDDLRKKAKANYDWGVDLQGAIHKAREIFNKEKE 338 Query: 279 GHDDYKKYIIFLTDGENS 296 +++I+ + GE++ Sbjct: 339 KKFGKRRHIVLFSQGEST 356 >gi|66802592|ref|XP_635168.1| hypothetical protein DDB_G0291658 [Dictyostelium discoideum AX4] gi|60463628|gb|EAL61813.1| hypothetical protein DDB_G0291658 [Dictyostelium discoideum AX4] Length = 409 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 25/115 (21%), Positives = 43/115 (37%), Gaps = 23/115 (20%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 I SKS+ + +++DVS SM +L +A + ++ + G Sbjct: 256 IKSKSEKSV---LLIDVSRSMTGA------QLDIAKNNSKKFIAESDKFA-------IGA 299 Query: 221 VTFSSKIVQTFPLAWGVQH----IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 ++S + F +WG I+ L T+ +E A K DA E Sbjct: 300 WSYSI---KFFSDSWGTSSKISAANSWIDGLCSDGHTEIKQAIEEAITKFKDADE 351 >gi|330904068|gb|EGH34640.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 84 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 14/72 (19%), Positives = 28/72 (38%), Gaps = 3/72 (4%) Query: 254 KSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK 313 ++ G+ + DA + + ++ +TDG N++ ID + A Sbjct: 15 EARIGIAGKNTALGDAIGLALKRLRMRPATSRALVLVTDGANNAGQIDP---ITAARLAA 71 Query: 314 RRGAIVYAIGVQ 325 G +Y IG+ Sbjct: 72 EEGVKIYPIGIG 83 >gi|325496693|gb|EGC94552.1| hypothetical protein ECD227_0790 [Escherichia fergusonii ECD227] Length = 496 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 61/174 (35%), Gaps = 24/174 (13%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ ++D S SMN ++L + S++ +++ ++ + R +VT++ Sbjct: 137 LVFLIDTSGSMNSD-----ERLPLIKSSLKLLVN------ELRDQDRISIVTYAGSARLL 185 Query: 231 FPLAWG--VQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 G I I L G T G+ AY + KG + I+ Sbjct: 186 LSSTSGSEKNTILNAIANLQAGGGTNGGAGVAMAYE------QAQAGYIKGGVNR---IL 236 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEA-ADQFLKNCASPDR 341 TDG+ + D + + G + +GV ++ + A Sbjct: 237 LATDGDFNIG-DDPASVEDLVKKQRESGITLSTLGVGDNNYNEEMMVKIADTGN 289 >gi|111026154|ref|YP_708437.1| hypothetical protein RHA1_ro10086 [Rhodococcus jostii RHA1] gi|110824997|gb|ABH00279.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 419 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 34/154 (22%), Positives = 53/154 (34%), Gaps = 35/154 (22%) Query: 157 SSVKISSKSDIGLDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 S + + + + +V+D S SM+ D M ++ R+ V + Sbjct: 271 SGIVAPAMRGPSITVSIVVDTSGSMSADDLDAAMSEVAGVLRAGGV----------VRDR 320 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 VR SS Q V+ I + +LI G T G+E A +A H Sbjct: 321 VRILACDASSTTAQP------VRSIAD--VKLIGGGGTDMRVGIEAA-----NAARPQPH 367 Query: 276 IAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYC 309 +I LTDG+ P+ S C Sbjct: 368 ----------VVIVLTDGDTPWPDR-PSRSHLVC 390 >gi|307284464|ref|ZP_07564626.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] gi|306503141|gb|EFM72395.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] Length = 748 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 46/143 (32%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDII-------KSIPDVN 213 + + +D+++V D S S +D+F + + + + ++ K + Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKKFMFPD 127 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|256959601|ref|ZP_05563772.1| predicted protein [Enterococcus faecalis Merz96] gi|256950097|gb|EEU66729.1| predicted protein [Enterococcus faecalis Merz96] Length = 370 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 30/187 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ +D+++V+D S SM + L + E+ D + + VR G+V + Sbjct: 100 QTESPIDLVLVIDYSSSMKGE--KLNNALKGLQQFGEELSDSLT-----DGHVRIGIVAY 152 Query: 224 SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + T + + +++ + N S T GL + + Sbjct: 153 NRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQKGLLEGQRLLAEKSRPNA-------- 204 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV---------YAIGVQAEAADQFL 333 KK ++ + DG ++ + + + Y N G I+ Y Q E+ + Sbjct: 205 -KKMLVHIGDGSANASFLPRENAQIYPN----NGEIIDYNGYHTSSYMEEFQTESNQYYT 259 Query: 334 KNCASPD 340 N AS D Sbjct: 260 SNSASTD 266 >gi|169832403|ref|YP_001693966.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] gi|154432882|gb|ABS82071.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432906|gb|ABS82092.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|168994905|gb|ACA35517.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] Length = 883 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 5/78 (6%) Query: 151 APLLITSSVKI-SSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI 209 L ++ + + ++ LD++++LD S SM++ + A + R ++D I S Sbjct: 199 IELTVSGKTTVETKEASTPLDVVILLDNSNSMSNIRHNHAHRAEKAGEATRALVDKITSN 258 Query: 210 PDVNNVVRSGLVTFSSKI 227 PD R LVT+ S I Sbjct: 259 PDN----RVALVTYGSTI 272 >gi|120603022|ref|YP_967422.1| outer membrane adhesin like proteiin [Desulfovibrio vulgaris DP4] gi|120563251|gb|ABM28995.1| putative outer membrane adhesin like proteiin [Desulfovibrio vulgaris DP4] Length = 3038 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 65/174 (37%), Gaps = 24/174 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + ++++VLD S SM G G ++ +A +I +++ + VN +V Sbjct: 2236 PQVNTTTNLVIVLDTSGSMAWDSGVDGKSRMELAQEAIAKLMHAYDDMGHVN----IKIV 2291 Query: 222 TFSSKIVQTFPLAWG---VQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 F S + P G V + + N+ + G T Y+ A Sbjct: 2292 DFYSDADASKPWFEGNDAVAKATQYLTTDNKFVPGGGTD--------YDDATAATASALQ 2343 Query: 276 IAKGHDDYKKYIIFLTDG--ENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQ 325 + D + F +DG + S ++ E + + K I YAIG+ Sbjct: 2344 TGMPNADRT-VLYFFSDGAPDPISEALNATEEKAWVDALKGVDNLDIAYAIGIG 2396 >gi|3273251|dbj|BAA31169.1| thrombospondin-related protein [Plasmodium falciparum] Length = 574 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 66/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSYRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNENAIHLYANVFSNNAREIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|114330826|ref|YP_747048.1| von Willebrand factor, type A [Nitrosomonas eutropha C91] gi|114307840|gb|ABI59083.1| von Willebrand factor, type A [Nitrosomonas eutropha C91] Length = 786 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 40/192 (20%), Positives = 72/192 (37%), Gaps = 22/192 (11%) Query: 169 LDMMMVLDVSLSMNDHF----GPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 L +M++LD+S S ND MD AT + LD I ++ G Sbjct: 595 LAVMVLLDMSESSNDKVRGHDYTIMDLTRAATVLFADALDRIGDPFAIHGFCSDGRHDVH 654 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 + F +G + ++ + +T+ L +A + + K Sbjct: 655 YHRFKDFDQPYG-DIAKARLAGMKGHFSTRMGAALRHAGYYLAQQPKS-----------K 702 Query: 285 KYIIFLTDGENSSPNIDNKESLFY-----CNEAKRRGAIVYAIGVQAEAADQFLKNCASP 339 + I +TDGE + ++ + + L Y E R+G VYA+ + ADQ++ Sbjct: 703 RVIFVITDGEPADNDVHDPQYLRYDTKHAVEELARKGITVYALSLD-PHADQYVSRIFGA 761 Query: 340 DRFYSVQNSRKL 351 F + +L Sbjct: 762 KNFTVIDEVERL 773 >gi|312910491|ref|ZP_07769336.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|311289187|gb|EFQ67743.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 507 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|149030577|gb|EDL85614.1| integrin, alpha 10 (predicted) [Rattus norvegicus] Length = 746 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 73/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ ++ R + L I ++ GLV + V Sbjct: 166 MDVVIVLDGSNSI-----YPWSEVQTFLRRLVGRL----FIDPEQ--IQVGLVQYGENPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSESRGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|164565428|ref|NP_001101169.2| integrin, alpha 10 [Rattus norvegicus] Length = 1167 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 73/207 (35%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ ++ R + L I ++ GLV + V Sbjct: 166 MDVVIVLDGSNSI-----YPWSEVQTFLRRLVGRL----FIDPEQ--IQVGLVQYGENPV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSESRGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCAS- 338 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPA-ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASD 328 Query: 339 PDR--FYSVQNSRKLHDAFLRIGKEMV 363 PD F++V + L D +G + Sbjct: 329 PDERFFFNVTDEAALTDIVDALGDRIF 355 >gi|314933586|ref|ZP_07840951.1| von Willebrand factor, type A domain containing protein [Staphylococcus caprae C87] gi|313653736|gb|EFS17493.1| von Willebrand factor, type A domain containing protein [Staphylococcus caprae C87] Length = 629 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 40/300 (13%), Positives = 95/300 (31%), Gaps = 33/300 (11%) Query: 37 ETSHKFFVKAK-LHYILDH--SLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFR 93 + + K K LDH + E N K + ++I Sbjct: 313 DMTDMMTKKGKGSQNTLDHDEGGFIGQNQAFALEGINKNVKVEWKVPNIQPQHILD---- 368 Query: 94 NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPL 153 + +N +I ++ + +I + Q + +NL+ R + I + Sbjct: 369 YQHSKNDVQFEIKDLIQIIKKTINREHQDERHNLT-KGRLQKDLINWFIDDQYKLFYKKQ 427 Query: 154 LITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 ++ + + +++D S SM+ DK+ + + + +KS+ + Sbjct: 428 DLSKTFDAT--------FTLLVDASASMH-------DKMDETIKGVVLFHETLKSLNIKH 472 Query: 214 NVVRSGLVTFSSKIVQT--FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKE 271 + + F+ + + I + ++ N+ A Sbjct: 473 EI-----LAFNEDAFEADDREQPNIIDEIINYNYSIFEKEGSRIMSLEPQDDNRDGVAIR 527 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAIGVQAEA 328 + ++I +DGE S+ N ++ A++ G V+ + + EA Sbjct: 528 IASERLLQRSHQQCFLIVFSDGEPSAFNYSQDGIIDTYEAVETARKFGIEVFNVFLSQEA 587 >gi|227517736|ref|ZP_03947785.1| von Willebrand factor domain LPTXG domain protein [Enterococcus faecalis TX0104] gi|227074817|gb|EEI12780.1| von Willebrand factor domain LPTXG domain protein [Enterococcus faecalis TX0104] Length = 619 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 44/143 (30%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVKGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVDTQLTHDKSKFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|218705650|ref|YP_002413169.1| hypothetical protein ECUMN_2455 [Escherichia coli UMN026] gi|293405589|ref|ZP_06649581.1| conserved hypothetical protein [Escherichia coli FVEC1412] gi|298381271|ref|ZP_06990870.1| yehP protein [Escherichia coli FVEC1302] gi|300901263|ref|ZP_07119363.1| von Willebrand factor type A domain protein [Escherichia coli MS 198-1] gi|218432747|emb|CAR13641.1| conserved hypothetical protein [Escherichia coli UMN026] gi|291427797|gb|EFF00824.1| conserved hypothetical protein [Escherichia coli FVEC1412] gi|298278713|gb|EFI20227.1| yehP protein [Escherichia coli FVEC1302] gi|300355295|gb|EFJ71165.1| von Willebrand factor type A domain protein [Escherichia coli MS 198-1] Length = 378 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 50/154 (32%), Gaps = 34/154 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++D + + C + G V + Sbjct: 305 VSDFYEGGSSSMLTHQVKKCVQ---SGIKVLGLA 335 >gi|16080726|ref|NP_391554.1| hypothetical protein BSU36730 [Bacillus subtilis subsp. subtilis str. 168] gi|221311632|ref|ZP_03593479.1| hypothetical protein Bsubs1_19856 [Bacillus subtilis subsp. subtilis str. 168] gi|221320871|ref|ZP_03602165.1| hypothetical protein BsubsJ_19720 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325156|ref|ZP_03606450.1| hypothetical protein BsubsS_19886 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313222|ref|YP_004205509.1| hypothetical protein BSn5_09310 [Bacillus subtilis BSn5] gi|8928526|sp|P70961|YWMD_BACSU RecName: Full=Uncharacterized protein ywmD; Flags: Precursor gi|1648854|emb|CAB03681.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636198|emb|CAB15690.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019496|gb|ADV94482.1| hypothetical protein BSn5_09310 [Bacillus subtilis BSn5] Length = 224 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 79/226 (34%), Gaps = 15/226 (6%) Query: 147 NSSHAPLLITSSVKIS--SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + LL S S ++ ++ ++ D S SM G G K+ +A +S++ + Sbjct: 6 AAGIIGLLTVSIASPSFAAEKQADTNVAVLFDGSGSMVQKTG-GERKIDIAKKSVKSFAE 64 Query: 205 IIKSIPD--VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYA 262 ++ + + +G S K + + + L S ++ P Sbjct: 65 LLPKDTNLMLRVFGHAGNNKLSGKALSCST-TETIYGLHPYEGSLFDNSLSELKP---TG 120 Query: 263 YNKIFDAKEKLEHIAKGHDDYKKYIIFL-TDGENSSPNIDNKESLFYCNEAKRRGAIVYA 321 + I A + D K +++L TDGE + E A IV Sbjct: 121 WTPIAKALADTRKEFEAFDADGKNVVYLITDGEETCGGDPAAEIEKL--RASNVDTIVNI 178 Query: 322 IGVQAE--AADQFLK-NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 IG + ++ + A + S ++ + A+ + ++ + Sbjct: 179 IGFNFDVKGNEEMKQAAVAGGGEYISANSADEFEQAWEKEAQKFTE 224 >gi|326333218|ref|ZP_08199465.1| PE-PGRS family protein [Nocardioidaceae bacterium Broad-1] gi|325948862|gb|EGD40955.1| PE-PGRS family protein [Nocardioidaceae bacterium Broad-1] Length = 539 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 43/208 (20%), Positives = 72/208 (34%), Gaps = 28/208 (13%) Query: 174 VLDVSLSMNDHFGPG-MDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-------- 224 V+DVS SM G G +L + + + + +P GL FS Sbjct: 345 VVDVSGSMAWPAGSGATTRLQLTQAAAS---EAVDLLPGGAA---VGLWAFSEKSAGMLD 398 Query: 225 -SKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDY 283 V + + ++ I G T ++ G A + A + A Sbjct: 399 EDHTVLVPTRQLATKDQRAQLTEAIAGLTARTGGG--TALHDTALAAYQAAVAAYDPLAS 456 Query: 284 KKYIIFLTDGENSSP-NIDNKESLFYCNEAKR--RGAIVYAIGVQAEAADQFLKNC--AS 338 ++F TDG N P ++D E + A R V IG+ A+A L+ A+ Sbjct: 457 NTVLLF-TDGTNDDPDSMDLDELVRQLEAASDPRRPVRVLGIGITADADLGALQAIADAT 515 Query: 339 PDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 Y + + + +E + QR Sbjct: 516 GGAAYVAERPEDV----GTVLREALAQR 539 >gi|293384268|ref|ZP_06630156.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387001|ref|ZP_06631569.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312906299|ref|ZP_07765310.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312909645|ref|ZP_07768499.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078416|gb|EFE15780.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083543|gb|EFE20506.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310627674|gb|EFQ10957.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311290047|gb|EFQ68603.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 383 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 30/187 (16%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 +++ +D+++V+D S SM + L + E+ D + + VR G+V + Sbjct: 113 QTESPIDLVLVIDYSSSMKGE--KLNNALKGLQQFGEELSDSLT-----DGHVRIGIVAY 165 Query: 224 SSKIVQTFPLAWGVQHIQEKI-NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + T + + +++ + N S T GL + + Sbjct: 166 NRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQKGLLEGQRLLAEKSRPNA-------- 217 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIV---------YAIGVQAEAADQFL 333 KK ++ + DG ++ + + + Y N G I+ Y Q E+ + Sbjct: 218 -KKMLVHIGDGSANASFLPRENAQIYPN----NGEIIDYNGYHTSSYMEEFQTESNQYYT 272 Query: 334 KNCASPD 340 N AS D Sbjct: 273 SNSASTD 279 >gi|118357564|ref|XP_001012031.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293798|gb|EAR91786.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 896 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 56/173 (32%), Gaps = 33/173 (19%) Query: 174 VLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPL 233 +LD S SM+ + A ++ L + I NVV G S + + + Sbjct: 317 LLDRSGSMSGQ------PIQKACEALILFLKSL-PIDSYFNVVSFGS---SHEKLFQTSI 366 Query: 234 AWGVQHIQEKI----NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + +++ I N T+ L+ + + Y K I Sbjct: 367 KYDTNSLEKAIKIIKNYTADLGGTEIYKPLQSVFKETKIDG------------YNKQIFL 414 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGA-IVYAIGVQAEAADQFLKNCASPDR 341 LTDGE SP K K + +IG + A ++ A + Sbjct: 415 LTDGEVESPKEVVKLIK------KNNKYNRINSIGFGSGADKYLIEESAIAGK 461 >gi|15678479|ref|NP_275594.1| magnesium chelatase subunit ChlI [Methanothermobacter thermautotrophicus str. Delta H] gi|2621518|gb|AAB84957.1| magnesium chelatase subunit ChlI [Methanothermobacter thermautotrophicus str. Delta H] Length = 591 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 38/122 (31%), Gaps = 22/122 (18%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-KIVQTF 231 +VLD S SM K+ A +L R L+ F + Sbjct: 431 IVLDTSSSMRLER-----KIKFAKTVSWLLLR-----DSYEKRNRIALIAFRGYEANLVV 480 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 ++ ++E + L G T TP L A + ++ + ++ Sbjct: 481 EPTSNLETVEEALEGLRSGGRTPLTPALRLAAEVASSSSDEACTA-----------VVIS 529 Query: 292 DG 293 DG Sbjct: 530 DG 531 >gi|115757185|ref|XP_784676.2| PREDICTED: similar to parturition-related protein PRP3 [Strongylocentrotus purpuratus] Length = 768 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 68/206 (33%), Gaps = 29/206 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y + + P+ T+ V + +++VLD+S SM + + Sbjct: 219 NYRSAWSVMRNHTDFQAVVPPVSNTTPVFEVLQLSSVRSVVLVLDISNSMLGNR-----R 273 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI- 248 +S + + G+V F S L Q + L Sbjct: 274 FDRMIQSSTVYIMNVIPADSK-----LGIVVFDSTSQIRANLTDITNTASRQRLVKALPK 328 Query: 249 -FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKESL 306 TT G+ Y +E L A+G YI+ L+DG EN++P I + Sbjct: 329 SPKGTTCIGCGILYWVRI----REVLGSYAQGG-----YILLLSDGVENNAPYI-----I 374 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ + G I I + A Q Sbjct: 375 DIYDDINKSGVIFDTITISNAADQQM 400 >gi|46579425|ref|YP_010233.1| hemolysin-type calcium-binding repeat-containing protein [Desulfovibrio vulgaris str. Hildenborough] gi|46448839|gb|AAS95492.1| hemolysin-type calcium-binding repeat protein [Desulfovibrio vulgaris str. Hildenborough] gi|311233245|gb|ADP86099.1| outer membrane adhesin like proteiin [Desulfovibrio vulgaris RCH1] Length = 3038 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 65/174 (37%), Gaps = 24/174 (13%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFG-PGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 + + ++++VLD S SM G G ++ +A +I +++ + VN +V Sbjct: 2236 PQVNTTTNLVIVLDTSGSMAWDSGVDGKSRMELAQEAIAKLMHAYDDMGHVN----IKIV 2291 Query: 222 TFSSKIVQTFPLAWG---VQHIQEKI---NRLIFGSTTKSTPGLEYAYNKIFDAKEKLEH 275 F S + P G V + + N+ + G T Y+ A Sbjct: 2292 DFYSDADASKPWFEGNDAVAKATQYLTTDNKFVPGGGTD--------YDDATAATASALQ 2343 Query: 276 IAKGHDDYKKYIIFLTDG--ENSSPNIDNKESLFYCNEAK--RRGAIVYAIGVQ 325 + D + F +DG + S ++ E + + K I YAIG+ Sbjct: 2344 TGMPNADRT-VLYFFSDGAPDPISEALNATEEKAWVDALKGVDNLDIAYAIGIG 2396 >gi|219112735|ref|XP_002178119.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217411004|gb|EEC50933.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 800 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 50/136 (36%), Gaps = 19/136 (13%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 G ++ V+D S SM ++++ A + +L + + L+ F Sbjct: 586 ARKAGSLIIFVVDASGSMA------LNRMNAAKGAAVSLLTEA-----YQSRDKISLIPF 634 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 + P + ++++ ++ G + L+ A +A+ K D Sbjct: 635 QGEMADVLLPPTKSITMARQRLEQMPCGGGSPLAHALQLATLTGINAQ-------KSGDV 687 Query: 283 YKKYIIFLTDGENSSP 298 K ++ ++DG + P Sbjct: 688 GKVVVVLISDGRANVP 703 >gi|329663456|ref|NP_001192519.1| integrin alpha-10 [Bos taurus] gi|297472754|ref|XP_002686129.1| PREDICTED: integrin, alpha 10-like [Bos taurus] gi|296489504|gb|DAA31617.1| integrin, alpha 10-like [Bos taurus] Length = 1167 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 72/207 (34%), Gaps = 29/207 (14%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++VLD S S+ ++ R + L I ++ GLV + V Sbjct: 166 MDVVIVLDGSNSI-----YPWSEVQTFLRRLVGRL----FIDPEQ--IQVGLVQYGESSV 214 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L G +E++ R + + + A + E G + + ++ Sbjct: 215 HEWSL--GDFRTKEEVVRAARNLSRREGRETKTAQAIMMACTEGFSQSRGGRPEAARLLV 272 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFL---KNCASP 339 +TDGE+ +L C + Y I V + FL + AS Sbjct: 273 VVTDGESHDGEELPT-ALQACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASD 328 Query: 340 ---DRFYSVQNSRKLHDAFLRIGKEMV 363 F++V + L D +G + Sbjct: 329 PDEKFFFNVTDEAALTDIVDALGDRIF 355 >gi|7800480|gb|AAF70056.1|AF251439_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 383 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 56/166 (33%), Gaps = 19/166 (11%) Query: 162 SSKSDIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGL 220 + D G D+M +LDVS M D F +K+ ++ + + N VR L Sbjct: 196 PKQIDEGADVMALLDVSKKMTDADFNNAKEKIKKLVTTLTSKSPDGQQNLNNRNTVR--L 253 Query: 221 VTFSSKIVQTFPLAWGVQ--------HIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEK 272 +TF KI + L+ I K+ + + + A + EK Sbjct: 254 MTFYRKISEPIDLSGKTSDEVEKELDDIWNKVKKEDWDWGVDLQGAIHKAREIFRSSYEK 313 Query: 273 LEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAI 318 +++I+ + GE++ + D K Sbjct: 314 KSGK-------RQHIVLFSQGEST-FSYDIKNKSNIAKTRITEKVT 351 >gi|157412650|ref|YP_001483516.1| protoporphyrin IX magnesium chelatase subunit ChlD [Prochlorococcus marinus str. MIT 9215] gi|157387225|gb|ABV49930.1| Protoporphyrin IX Magnesium chelatase, ChlD subunit [Prochlorococcus marinus str. MIT 9215] Length = 724 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 69/215 (32%), Gaps = 40/215 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G ++ ++D S SM ++++ A ++ +L ++ + + L+ F Sbjct: 521 QKKAGALVIFLVDASGSMA------LNRMQSAKGAVIRLLT--EAYENRDE---VALIPF 569 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P + + ++ + G L + + + + ++ D Sbjct: 570 RGNQAEVLLPPTRSITAAKRRLETMPCGGG----SPLAHG---LTQSAKVAKNALSTGDI 622 Query: 283 YKKYIIFLTDG----------------ENSSPNID-NKESLFYCNEAKRRGAIVYAI--- 322 + ++ +TDG EN + N++ +E L + G + I Sbjct: 623 GQVIVVGITDGRGNVPLGLSLGQNEVEENDNSNLNLKQEVLDIAAKYPMLGIKLLIIDTE 682 Query: 323 -GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL 356 A + L A + + K A Sbjct: 683 RKFIASGFGKELAEAAQGKYVQLPKATDKAIAAMA 717 >gi|7266944|gb|AAD42198.2|AF139744_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 456 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 64/148 (43%), Gaps = 10/148 (6%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPD 211 I +V + K D G D+M +LDVS M D F DK+ ++ + Sbjct: 199 TIDVTVTVKPKQIDEGADVMALLDVSKKMTDADFNNAKDKIKKLVTTLTSK--SPDGQQN 256 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYN---KIFD 268 +NN R L+TF +I + ++ ++++ L+ ++ ++ + I Sbjct: 257 LNNRNRVRLMTFYREISDSIDIS---GKTDDELDGLLNKLRQEAKDEYDWGVDLQGAIHK 313 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENS 296 A+E + + +++I+ + GE++ Sbjct: 314 AREIFNKEKEKNSGKRQHIVLFSQGEST 341 >gi|5802676|gb|AAD51756.1|AF177977_1 serum opacity factor precursor [Streptococcus pyogenes] Length = 854 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 33/148 (22%), Positives = 66/148 (44%), Gaps = 12/148 (8%) Query: 154 LITSSVKISSKS-DIGLDMMMVLDVSLSMND-HFGPGMDKLGVATRSIREMLDIIKSIPD 211 + ++K++ K D G D+M +LDVS M D F DK+ ++ + + + Sbjct: 180 TLDVTLKVTPKEIDEGADVMALLDVSQKMTDADFKNAKDKIKKLVTTLTSKSNSDEHKHN 239 Query: 212 VNNVVRSGLVTFSSKIVQTFPLAWGVQ-HIQEKINRLIFGSTTKSTPG--LEYAYNKIFD 268 N VR L+TF +I ++ + + + +N L + G L+ A +K + Sbjct: 240 SRNSVR--LMTFYREISDPIDISGKTEAELDQLLNELREKAKANYDWGVDLQGAIHKTRE 297 Query: 269 AKEKLEHIAKGHDDYKKYIIFLTDGENS 296 K + K +++I+ + GE++ Sbjct: 298 IFNKEQKSKK-----RQHIVLFSQGEST 320 >gi|34481892|emb|CAE46494.1| trap [Plasmodium falciparum] Length = 331 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + N + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNENAIHLYVNIFSNNAREIIRLHSDASKNKEKALFI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENASQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + G + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|124504959|ref|XP_001351221.1| CSP and TRAP-related protein (CTRP) [Plasmodium falciparum 3D7] gi|4493942|emb|CAB38978.1| CSP and TRAP-related protein (CTRP) [Plasmodium falciparum 3D7] Length = 2114 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 42/181 (23%), Positives = 67/181 (37%), Gaps = 34/181 (18%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +VLD S S++D L I L+IIK I V G++ FS Sbjct: 569 DITLVLDESASISD--------LIWRNEVIPFSLEIIKRINISYKNVHMGVLLFSEYTRD 620 Query: 230 TFPLAWGVQ----HIQEKINRLI----FGSTTKSTPGLEYA---YNKIFDAKEKLEHIAK 278 + +Q KIN L G T L+YA Y+K+ + KE Sbjct: 621 IVRFYDNARYEKGTLQTKINDLKRDYRSGKKTYIIQALQYALTYYSKLSNRKE------- 673 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQF--LKNC 336 K + TDG +S + + + ++ + +GV + ++ L C Sbjct: 674 ----APKVTMLFTDGNDSYESEKGLQDIAL--LYRKENVKLLVVGVSTASENKLKMLVGC 727 Query: 337 A 337 A Sbjct: 728 A 728 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 59/179 (32%), Gaps = 23/179 (12%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D+ +++D S S+ + + + ++ V G++ FS KI + Sbjct: 849 DLTLIIDESASIGYSNWE--------KEVVPFTIGLASNLEISEKKVNMGILLFSDKIRE 900 Query: 230 TFPL----AWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 ++ ++ +I+ L + G Y + + K Sbjct: 901 FIKYGQKESYDKNNLVRRIHDLKKYYK---SGGFSYIVEALKYGLYSYAKSTSSRLNVPK 957 Query: 286 YIIFLTDGENSSPN--IDNKESLFYCNEAKRRGAIVYAIGVQAEAAD--QFLKNCASPD 340 I LTDG N+ + I + S Y K+ + IG+ + L C D Sbjct: 958 VNILLTDGNNTDTSDFILTEVSSLY----KKENVKLLLIGIGGPTIHKLRLLGGCDKSD 1012 Score = 35.9 bits (81), Expect = 9.8, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 73/210 (34%), Gaps = 35/210 (16%) Query: 170 DMMMVLDVSLSMND-----HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 D+ ++LD S S+ H P DK+ IK + N V G++ FS Sbjct: 85 DLTLILDESASIGSKNWKNHVIPFTDKI-------------IKDLTISKNEVHVGILLFS 131 Query: 225 SKIVQTFPLAWGVQHIQEKINRLIF--------GSTTKSTPGLEYAYNKIFDAKEKLEHI 276 SK +++ ++++ + + G TK L+Y+ H Sbjct: 132 SKNRDYVTYGDELRYQKDELLKKVEKLKKDYYCGGGTKILGALKYSLENYTK------HK 185 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNC 336 +D K I TDG + + NK+ L +R + +GV A ++ Sbjct: 186 NIRYDAP-KVTILFTDG--NENSASNKQLLEMGLTYRRERVKLLVLGVAAAEDNKLKLIA 242 Query: 337 ASPDRFYSVQNSRKLHDAFLRIGKEMVKQR 366 + + + + I K + + Sbjct: 243 GCEENTNCPYSMKAEWETINDITKRLTNKI 272 >gi|87311516|ref|ZP_01093635.1| hypothetical protein DSM3645_01931 [Blastopirellula marina DSM 3645] gi|87285772|gb|EAQ77687.1| hypothetical protein DSM3645_01931 [Blastopirellula marina DSM 3645] Length = 640 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 30/155 (19%), Positives = 58/155 (37%), Gaps = 16/155 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++ + D S SM D D++G + + L I+ D + +G+ +F S Sbjct: 182 VVWLFDQSGSMKDDQAKIRDRIGR----VYDELGIVGVSQD--EALTTGVASFGSNFQLH 235 Query: 231 FP-LAWGVQHIQEKINRLIF-GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 I E I + S + T A ++ ++ I+K II Sbjct: 236 TKTPTSNRDEIAEAIAEVPIDPSGEELT---TQAVMEVITRHKRYAQISKRQMA----II 288 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++D E+ +P+ + + A + VY +G Sbjct: 289 LVSD-ESGNPDSNQRYLEETIQAANKANCRVYVMG 322 >gi|239636912|ref|ZP_04677911.1| von Willebrand factor type A domain protein [Staphylococcus warneri L37603] gi|239597586|gb|EEQ80084.1| von Willebrand factor type A domain protein [Staphylococcus warneri L37603] Length = 629 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 42/315 (13%), Positives = 105/315 (33%), Gaps = 63/315 (20%) Query: 37 ETSHKFFVKAK-LHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFR-- 93 + + K K LD +E G G Q + F+ I ++ Sbjct: 313 DMTDMMTKKGKGSQNTLDR-----------EEGGFIG--QNSAFALEGINQNVDIKWKVP 359 Query: 94 ---------NELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPW 144 E +N +I ++ + +I + Q + +NL+ R + I Sbjct: 360 DILPEYIQAYEDVKNDVQFEIKDLIQIIKKTIEREHQDERHNLT-KGRLQKNLINWFIDD 418 Query: 145 CANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD 204 + ++ S + +++D S SM+ DK+ + + + Sbjct: 419 QYKLFYKKQDLSQSFDAT--------FTLLIDASASMH-------DKMDETIKGVVLFHE 463 Query: 205 IIKSIPDVNNVVRSGLVTFSSKIVQTFPL-----AWGVQHIQE---KINRLIFGSTTKST 256 +K + + ++ F S + + + +++ +I L + Sbjct: 464 TLKELNVKHEILAFNEDAFDSDDTKQPNIIDEIIHYDYSTLKKDGPRIMALEPQDDNRDG 523 Query: 257 PGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAK 313 + A +++ + H ++++I +DGE S+ N ++ +A+ Sbjct: 524 VAIRIASDRLI---RRSHH--------QRFLIVFSDGEPSAYNYSQDGIIDTYEAVEDAR 572 Query: 314 RRGAIVYAIGVQAEA 328 + G V+ + + + Sbjct: 573 KFGIEVFNVFLSQDP 587 >gi|154411703|ref|XP_001578886.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121913087|gb|EAY17900.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 710 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 40/192 (20%), Positives = 71/192 (36%), Gaps = 40/192 (20%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 ++D S SM +G ++ + +L I S+P + R ++ F + + Sbjct: 243 FYFLIDCSGSM---YGSRIE----NAKFCLNLL--IHSLPIDS---RFSIIKFGTSYEEI 290 Query: 231 FPLAW----GVQHIQEKINRLIFG-STTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 FP+ V+ +I L T LEY Y + Y + Sbjct: 291 FPICDYTNKNVKIAMRQIKDLDADMDGTDILSPLEYVYTQTTKNG------------YHR 338 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEA--KRRGAIVYAIGVQAEAADQFLKNCA--SPDR 341 I LTDG+ S C+ A KR +YAIG+ + A +KN + S Sbjct: 339 KIFLLTDGQ-------VHNSDVICSLAQEKRDNNRIYAIGLGSGADPGLIKNVSLKSWGN 391 Query: 342 FYSVQNSRKLHD 353 + + + +++ Sbjct: 392 YVLIADKDNMNE 403 >gi|14248575|gb|AAK57573.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248577|gb|AAK57574.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248589|gb|AAK57580.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248593|gb|AAK57582.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248597|gb|AAK57584.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248583|gb|AAK57577.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248579|gb|AAK57575.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 493 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248585|gb|AAK57578.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248595|gb|AAK57583.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248569|gb|AAK57570.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248571|gb|AAK57571.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248573|gb|AAK57572.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248591|gb|AAK57581.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T LE + D + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAALEEVQKHLND--------RVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|18309499|ref|NP_561433.1| von Willebrand factor type A domain-containing protein [Clostridium perfringens str. 13] gi|18144176|dbj|BAB80223.1| hypothetical protein [Clostridium perfringens str. 13] Length = 614 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 64/157 (40%), Gaps = 33/157 (21%) Query: 155 ITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVN 213 IT +VK K +D+++++D S SM + D+L + S+ + + I +IPD Sbjct: 72 ITLTVKGKPKKVTKPVDILLIIDASNSMYYNM----DELKASMNSLVDKV--IDNIPDS- 124 Query: 214 NVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD----- 268 R +V F + + + N+ F S + ++ +YN I Sbjct: 125 ---RIAVVAFGTY----------SEEVFSFNNKNNFTSKEEYKKAIKKSYNNIEGRGNTN 171 Query: 269 -------AKEKLEHIAKGHDDYKKYIIFLTDGENSSP 298 A E ++ + + KK +IF +DG + Sbjct: 172 IESSWRLADEIFKNELNNNFNSKKDVIFFSDGYPNES 208 >gi|297527013|ref|YP_003669037.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297255929|gb|ADI32138.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 443 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 60/176 (34%), Gaps = 28/176 (15%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR-SGLVTFSSKIVQ 229 + ++LD S SM+ K+ A ++ L + R + + + Sbjct: 281 LYVLLDKSGSMDG------TKMTWAK-AVALSLYMRAVREHREFYFRFFDSIPYPLAKIS 333 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 P A V + + I R+ T + + A N I + + II Sbjct: 334 RRPRASNVLKLIDYIARVRGSGGTDISKAIITACNDIRTSSVRNTSD----------IIL 383 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSV 345 +TDG + +L N + V ++ LKN ++ ++++V Sbjct: 384 ITDGVDRIAEQLVTYNLRKANTR--------LVAVMIMGDNRSLKNIST--KYFTV 429 >gi|157692632|ref|YP_001487094.1| regulator of nitric oxide reductase [Bacillus pumilus SAFR-032] gi|157681390|gb|ABV62534.1| possible regulator of nitric oxide reductase [Bacillus pumilus SAFR-032] Length = 638 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 68/166 (40%), Gaps = 14/166 (8%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K + ++I +++D S SM DK+ + I + +KS+ + +V G Sbjct: 434 KQAPSTEIDAVFTLLVDCSASM-------FDKMDETKKGIVLFHEALKSVQVPHQIV--G 484 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 ++ +T + + + L + + E + ++ + I K Sbjct: 485 FWEDTNDASETSQPNY-FNTVVSFKDSLFDAGPSIMSLEPEEDNRDGYAIRQMTKMILKR 543 Query: 280 HDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 ++ +K++I +DGE ++ + + ++ EA+++G V + Sbjct: 544 REE-QKFLIVFSDGEPAAFSYEQNGIVDTHEAVLEARKKGIEVINV 588 >gi|145526919|ref|XP_001449265.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124416842|emb|CAK81868.1| unnamed protein product [Paramecium tetraurelia] Length = 470 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 85/231 (36%), Gaps = 34/231 (14%) Query: 98 ENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEM-PFIFCTFPWCANSSHAPLLIT 156 +N ++ + + T SI L +++M + P+++ Sbjct: 68 KNSQSESKHRFSQRTQQSIPSRSSINKQPLDDDIQFDMFSVNPGSNILNLGQQTIPIVLQ 127 Query: 157 SSVKISSK-SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 K + IG+D+ ++D+S SM + ++ + + ++ Sbjct: 128 LRTKTIEELDQIGVDLFCLIDISNSMQGQKIEYVKQI------------LHSILSNLREQ 175 Query: 216 VRSGLVTFSSKIVQTFPLAWGVQHIQEK----INRLIFGSTTKSTPGLEYAYNKIFDAKE 271 R L++F+++ L Q QE I+ L TT+ G E A++ I Sbjct: 176 DRLCLISFNNEGKLLTGLQKVTQETQEYFAFVIDDLQCIGTTQLWKGTEVAFDVI----- 230 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + K +++ + +IF +DG++ ++ L Y ++ I Sbjct: 231 ---NQRKNKNNWARILIF-SDGQDEIALTKIRKQLEY-------NYDIFTI 270 >gi|14248613|gb|AAK57592.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248635|gb|AAK57603.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248599|gb|AAK57585.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248601|gb|AAK57586.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248603|gb|AAK57587.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248563|gb|AAK57567.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248565|gb|AAK57568.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248567|gb|AAK57569.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248607|gb|AAK57589.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248633|gb|AAK57602.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248615|gb|AAK57593.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248617|gb|AAK57594.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248639|gb|AAK57605.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248641|gb|AAK57606.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248605|gb|AAK57588.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|14248637|gb|AAK57604.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKSYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGIPNSKYT----ALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|320162872|gb|EFW39771.1| ubiquitin [Capsaspora owczarzaki ATCC 30864] Length = 563 Score = 40.2 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 61/195 (31%), Gaps = 28/195 (14%) Query: 169 LDMMMVLDVSLSMND-------HFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLV 221 +++ D + SM+ KL S+R + D Sbjct: 190 VEICFAFDTTGSMSSYIAAVRAELKSISTKLMSNISSLRISICAFGDYCDQ--------- 240 Query: 222 TFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 FSS +++ L ++ + + ++ + EY +I D + Sbjct: 241 -FSSYVLKQEDLTSDIKKLCSFVEKVGNTGGGDAPECYEYVLRRIQD--------LSWSE 291 Query: 282 DYKKYIIFLTDGENSSPNIDN--KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCAS- 338 D K ++ + D + + + + + G VY++ + F ++ A+ Sbjct: 292 DASKALVLIGDADPHPADYTSLHIDWRLEAEQLAHMGIKVYSVQAGGSSGTHFYRDVAAM 351 Query: 339 PDRFYSVQNSRKLHD 353 ++ +L Sbjct: 352 TGGYFVPFTQFQLIT 366 >gi|113476000|ref|YP_722061.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167048|gb|ABG51588.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 220 Score = 40.2 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 62/185 (33%), Gaps = 19/185 (10%) Query: 153 LLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDV 212 + + V+ + + +++LD S SM + + D + + Sbjct: 1 MRLDELVEFAENPEPRCPCVLLLDTSASMTG------TPIKALNDGLETFRDNLITDDLA 54 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINR-LIFGSTTKSTPGLEYAYNKIFDAKE 271 V +++F +++ + N L T + A + I A Sbjct: 55 KKRVEVAIISFDNQVKIVQ----DFITADQFENPILTAQGQTYMGTAIHKALDMI--AAR 108 Query: 272 KLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAK----RRGAIVYAIGVQAE 327 K E+ G Y+ ++ +TDGE + K + K + +A+GV+ Sbjct: 109 KSEYRNNGITYYRPWVFLITDGEPQGES--EKVFKEAGDRIKQEEDNKHVAFFAVGVEGA 166 Query: 328 AADQF 332 D+ Sbjct: 167 NMDKL 171 >gi|239906055|ref|YP_002952794.1| hypothetical protein DMR_14170 [Desulfovibrio magneticus RS-1] gi|239795919|dbj|BAH74908.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 137 Score = 40.2 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 50/149 (33%), Gaps = 25/149 (16%) Query: 12 NCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNN 71 + +G+ ++ A++LP++ V+ +I+ S FF+++ + + + AT++ + Sbjct: 9 DQRGASAVEMALILPLLLTVVFAIIDYSRFFFLRSTV-----TAAVADATRLAVLPGTTD 63 Query: 72 GKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVS 131 + + D N + ++ + + Sbjct: 64 A------------------MIAAAISQALL--DPINQADGQTPNVSVTPSQRSAGQPVTV 103 Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVK 160 +PF P S P I ++ Sbjct: 104 TASLPFSPLILPQFLGMSLFPQNINAAAT 132 >gi|254525434|ref|ZP_05137486.1| magnesium chelatase ATPase subunit D [Prochlorococcus marinus str. MIT 9202] gi|221536858|gb|EEE39311.1| magnesium chelatase ATPase subunit D [Prochlorococcus marinus str. MIT 9202] Length = 725 Score = 40.2 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 69/215 (32%), Gaps = 40/215 (18%) Query: 164 KSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTF 223 + G ++ ++D S SM ++++ A ++ +L ++ + + L+ F Sbjct: 522 QKKAGALVIFLVDASGSMA------LNRMQSAKGAVIRLLT--EAYENRDE---VALIPF 570 Query: 224 SSK-IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 P + + ++ + G L + + + + ++ D Sbjct: 571 RGNQAEVLLPPTRSITAAKRRLETMPCGGG----SPLAHG---LTQSAKVAKNALSTGDI 623 Query: 283 YKKYIIFLTDG----------------ENSSPNID-NKESLFYCNEAKRRGAIVYAI--- 322 + ++ +TDG EN + N++ +E L + G + I Sbjct: 624 GQVIVVGITDGRGNVPLGLSLGQNEVEENDNSNVNLKQEVLDIAAKYPMLGIKLLIIDTE 683 Query: 323 -GVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFL 356 A + L A + + K A Sbjct: 684 RKFIASGFGKELAEAAQGKYVQLPKATDKAIAAMA 718 >gi|125540180|gb|EAY86575.1| hypothetical protein OsI_07955 [Oryza sativa Indica Group] Length = 477 Score = 40.2 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 59/169 (34%), Gaps = 22/169 (13%) Query: 208 SIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIF 267 D ++++ + + I T G + + L G T GL A K+F Sbjct: 144 IYKDEDDIIDTAKEQYRRVIRLTRMSDDGKASAKSAVESLAAGGGTNILKGLVEA-AKVF 202 Query: 268 DAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEA-------KRRG---A 317 D + +A +I L+DG+++ + N + KR G Sbjct: 203 DGRRYRNAVAS--------VILLSDGQDTYNVNSGWGASNSKNYSVLVPPSFKRSGDRRL 254 Query: 318 IVYAIGVQAEAADQFLKNCA--SPDRFYSVQNSRKLHDAFLR-IGKEMV 363 V+ G + + A + F ++N + DAF + IG + Sbjct: 255 PVHTFGFGTDHDAAAMHAIAEETGGTFSFIENQAVVQDAFAQCIGGLLS 303 >gi|14248645|gb|AAK57608.1| thrombospondin-related adhesive protein [Plasmodium vivax] gi|14248647|gb|AAK57609.1| thrombospondin-related adhesive protein [Plasmodium vivax] Length = 490 Score = 40.2 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 30/169 (17%) Query: 178 SLSMNDHFGPGMDK----LGVATRSIREMLDIIKSIPDV-----NNVVRSGLVTFSSKIV 228 S S+ + + K L S+ D I ++ ++R G I Sbjct: 1 SGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRLGS---GQSID 55 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 + L+ + E TT T L D +K + + + +I Sbjct: 56 KRQALS----KVTELRKTYSPYGTTNMTAAL--------DEVQKHLNDRVNREKAIQLVI 103 Query: 289 FLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 +TDG +S +L + K+R + IG+ QF + A Sbjct: 104 LMTDGVPNS----KYRALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 148 >gi|58429529|gb|AAW78168.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 40.2 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + +N + + FS+ + L Sbjct: 63 -----NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALSI 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IKSLLSTNLPFGRTNLTDALLQVRKHLND--------RINRENANQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES RG + G+ ++FL C D Sbjct: 170 DSLKESR----NLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|328876382|gb|EGG24745.1| hypothetical protein DFA_02989 [Dictyostelium fasciculatum] Length = 684 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 58/177 (32%), Gaps = 25/177 (14%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSS-- 225 ++++ V D + SM+ V ++++ I + +R G++ Sbjct: 192 DVEIVFVFDTTGSMSS----------VLKEVRTKVVETITRLMKEIPSIRIGIMGLGDYC 241 Query: 226 ---KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 ++ T L + + +N+ EYA + AKE Sbjct: 242 DGLNVLNTCDLTSDISKLVNFVNKTPSTGGGDEPEAYEYA---LLKAKELSWSKHT---- 294 Query: 283 YKKYIIFLTDGENSSPNIDN--KESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K ++ + D SP + + + + + G +Y + E F + + Sbjct: 295 -SKALVMIGDSPPHSPQFTSLKIDWVAETDNLEAMGVKIYGVVCMHERHRFFYQTIS 350 >gi|221135718|ref|XP_002171014.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 132 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 38/103 (36%), Gaps = 8/103 (7%) Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 TT+ L A +F + G K II LTDG + P D ++ +E Sbjct: 4 TTRIDRALRLAQKDMFTSA------NGGRVGVSKLIILLTDGSQT-PGGDAEDPERIADE 56 Query: 312 AKRRGAIVYAIGVQAEAADQFLKNCASP-DRFYSVQNSRKLHD 353 + G ++ +G+ + + L + Y+ L D Sbjct: 57 LRNDGVVILGVGIGSAVNETELSHITGGKKNAYTAATFDSLTD 99 >gi|220925466|ref|YP_002500768.1| hypothetical protein Mnod_5624 [Methylobacterium nodulans ORS 2060] gi|219950073|gb|ACL60465.1| hypothetical protein Mnod_5624 [Methylobacterium nodulans ORS 2060] Length = 359 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 11/78 (14%), Positives = 34/78 (43%), Gaps = 5/78 (6%) Query: 7 RNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQ 66 R+F +G+++++ A + +F + G ++ ++ V+ L I D + L ++ Sbjct: 6 RHFHSCDRGAVALIFAAAMIPLFGLAGAALDYANARRVRDVLQSISDATAL-----LVAD 60 Query: 67 ENGNNGKKQKNDFSYRII 84 + ++ + + Sbjct: 61 ADTPTVAERSFKLAENQL 78 >gi|2240037|gb|AAB66910.1| integrin subunit betaCn1 [Acropora millepora] Length = 792 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 54/149 (36%), Gaps = 16/149 (10%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSI------PD 211 ++ + + +D+ ++D+S SM D G G +++E+ K Sbjct: 107 TLTVRPAENYPVDLYYLMDMSSSMKDDLGNLRSLAGQMATTMKEITSNFKLGFGSFVDKA 166 Query: 212 VNNVVRSGLVTFSSK--------IVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAY 263 V+ VR+ L PL Q KIN+ I + G A Sbjct: 167 VSPFVRTELKLPCDNNSCVATYGFKNVLPLVNDTVEFQTKINQQIISGNLDAPEGGFDAL 226 Query: 264 NKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 ++ A EK ++ ++ ++F TD Sbjct: 227 MQVA-ACEKEIGWSRNGTS-RRLVVFATD 253 >gi|227887173|ref|ZP_04004978.1| VWA containing CoxE family protein [Escherichia coli 83972] gi|300978749|ref|ZP_07174379.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|301048851|ref|ZP_07195846.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|227835523|gb|EEJ45989.1| VWA containing CoxE family protein [Escherichia coli 83972] gi|300299336|gb|EFJ55721.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|300409587|gb|EFJ93125.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|307554189|gb|ADN46964.1| von Willebrand factor type A domain protein YehP [Escherichia coli ABU 83972] gi|315294438|gb|EFU53786.1| von Willebrand factor type A domain protein [Escherichia coli MS 153-1] Length = 378 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 50/154 (32%), Gaps = 34/154 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 216 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 259 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 260 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 304 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++D + + C + G V + Sbjct: 305 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLA 335 >gi|261289465|ref|XP_002604708.1| hypothetical protein BRAFLDRAFT_80321 [Branchiostoma floridae] gi|229290037|gb|EEN60719.1| hypothetical protein BRAFLDRAFT_80321 [Branchiostoma floridae] Length = 395 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 28/136 (20%), Positives = 45/136 (33%), Gaps = 18/136 (13%) Query: 133 YEMPFIFCTFPWCANSSHAPLLITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y F+ S + L+ +K S+S DM VLD S S Sbjct: 5 YFKTFVQFGILLLVGSDTSTLVGALDIKPKSQSCRGAFDMYFVLDRSAS----------- 53 Query: 192 LGVATRSIREMLDIIKSIPDV--NNVVRSGLVTFSSKIVQTFPLAWGVQHIQ---EKINR 246 E +D ++ I + + VR +TFS L+ I+ + + Sbjct: 54 -VTPKNFREETVDFVEKIVNSFHSPKVRFSFITFSDDASVVMELSNDPDVIKHGLSHLRQ 112 Query: 247 LIFGSTTKSTPGLEYA 262 + T + GL A Sbjct: 113 IDTYGGTFMSHGLRKA 128 >gi|168216419|ref|ZP_02642044.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] gi|182381279|gb|EDT78758.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] Length = 620 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 65/156 (41%), Gaps = 35/156 (22%) Query: 155 ITSSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDV 212 IT +VK K +D+++++D S SM + + S+ ++D +I +IP+ Sbjct: 72 ITLTVKGKPKKVTKPVDILLIMDASNSM-------WNDMDALKESMNSLVDKVIDNIPNS 124 Query: 213 NNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKI------ 266 R ++ F +++ + + N+ F S + ++ +YN I Sbjct: 125 ----RIAVIAFGTEV----------EEVFSFNNKNKFTSKEEYKKAIKKSYNNIERRGNT 170 Query: 267 ------FDAKEKLEHIAKGHDDYKKYIIFLTDGENS 296 A E ++ + + KK +IF +DG + Sbjct: 171 NIEGTWRRADEIFKNELNNNSNSKKDVIFFSDGYPN 206 >gi|123475372|ref|XP_001320864.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121903678|gb|EAY08641.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 665 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 62/173 (35%), Gaps = 37/173 (21%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 D V+D S SM ++ A + + ML + R + F ++ Q Sbjct: 240 DYFFVIDCSGSMEGK------RIQKAVKCLHLMLQSLPMK------CRFSIFCFGTEFKQ 287 Query: 230 TFPLA-WGVQHIQEKINRLIFGS----TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYK 284 P+A + +++ +N + T L+ + Sbjct: 288 IMPIAEYNNENVLLAMNMIKNIQADMYGTNIYDPLK---------------CIFSLEGMT 332 Query: 285 KYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCA 337 K I LTDGE ++ +E L E K+ G +Y +G+ + A +++ A Sbjct: 333 KKIFLLTDGEVNNA----EEILNLAEENKKFG-NIYTVGIGSGADQCLIRDLA 380 >gi|91842253|gb|ABE66387.1| beta1 subunit-like protein [Danio rerio] Length = 613 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 48/153 (31%), Gaps = 33/153 (21%) Query: 158 SVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVR 217 ++K D +D+ ++D+S SM + + + D+ +R Sbjct: 126 TLKFKRAEDYPIDLYFLMDLSHSMLSNLENFKNLGTELAN----------EMKDITKDLR 175 Query: 218 SGLVTF----SSKIVQTFP--------------LAWGVQHIQEKINRLIFGSTTKSTPGL 259 G +F S + FP L ++I++L S+ Sbjct: 176 IGFGSFFRKPSIQTNPCFPDNCIAPFSYFNVLSLTDDHALFTQEISKLKTSGNLDSSEA- 234 Query: 260 EYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTD 292 + A + G + + ++F TD Sbjct: 235 --GLEALMQAA--VCTDVIGWRNVTRLLVFFTD 263 >gi|26248499|ref|NP_754539.1| hypothetical protein c2651 [Escherichia coli CFT073] gi|26108904|gb|AAN81107.1|AE016763_66 Hypothetical protein yehP [Escherichia coli CFT073] Length = 399 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 50/154 (32%), Gaps = 34/154 (22%) Query: 170 DMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQ 229 +++++D S SM D V ++ + +P +R+ LV F + +V Sbjct: 237 QLVLLVDQSGSMVDS---------VIHSAVMAAC--LWQLP----GIRTHLVAFDTSVV- 280 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 L V E + ++ G T +EY I K II Sbjct: 281 --DLTADVADPVELLMKVQLGGGTNIASAVEYGRQLI-------------EQPAKSVIIL 325 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 ++D + + C + G V + Sbjct: 326 VSDFYEGGSSSLLTHQVKKCVQ---SGIKVLGLA 356 >gi|328875636|gb|EGG24000.1| hypothetical protein DFA_06138 [Dictyostelium fasciculatum] Length = 832 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 72/196 (36%), Gaps = 31/196 (15%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 + + ++ ++++D S SM+ D++ A R++ M+ + N Sbjct: 332 TPEDELNQKMECIIIVDRSGSMSG------DRIESAKRALEIMMRSLNENTMFN------ 379 Query: 220 LVTFSSKIVQTFPLAWGVQHIQ-EKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAK 278 +V+F S + FP + EK + LI ++F+ + + + Sbjct: 380 IVSFGSGFKKLFPSSTRYNDASLEKASSLIASMDADMGG------TELFEPIKDVLAMPA 433 Query: 279 GHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFL----K 334 + ++ I LTDG S + K CN + ++ +G+ + + K Sbjct: 434 LPEYPRQVFI-LTDGAVSQREVLIKYVASECNTTR-----IFTLGIGNGVDKELVIGLSK 487 Query: 335 NCASPDRFYSVQNSRK 350 C + V ++ Sbjct: 488 AC--KGHYEMVSDTSS 501 >gi|34481890|emb|CAE46493.1| trap [Plasmodium falciparum] Length = 331 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 33/224 (14%) Query: 130 VSRYEMPFIFCTFPWCANSSHAPLLITSSVKISSKS--DIGLDMMMVLDVSLSMNDHFGP 187 +Y + F + + + +D+ +++D S S+ H Sbjct: 6 NVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYREEVCNDEVDLYLLMDCSGSIRRH--- 62 Query: 188 GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQH-------- 239 ++ + +I+ + + + + FS+ + L Sbjct: 63 -----NWVKHAVPLAMKLIQQLNLNESAIHLYVNIFSNNAKEIIRLHSDASKNKEKALII 117 Query: 240 IQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPN 299 I+ ++ + T T L + D ++ + ++ LTDG S Sbjct: 118 IRSLLSTNLPYGRTNLTDALLQVRKHLND--------RINRENASQLVVILTDGIPDSIQ 169 Query: 300 IDNKESLFYCNEAKRRGAIVYAIGVQAEAA---DQFLKNCASPD 340 KES + RG + G+ ++FL C D Sbjct: 170 DSLKESR----KLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSD 209 >gi|73954244|ref|XP_546326.2| PREDICTED: similar to Integrin alpha-2 precursor (Platelet membrane glycoprotein Ia) (GPIa) (Collagen receptor) (VLA-2 alpha chain) (CD49b) [Canis familiaris] Length = 1171 Score = 40.2 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 70/214 (32%), Gaps = 41/214 (19%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D+++V D S S + + + + + P GL+ +++K Sbjct: 163 IDVVVVCDESNS--------IYPWEAVKNFLEKFVQGLDIGPKKTQ---VGLIQYANKPR 211 Query: 229 QTFPLAWGVQHIQEKINRLIF------GSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F L + ++ G T + ++YA + A G Sbjct: 212 VVFNL--NTFKTKAEMTEATSQTYQYGGDLTNTFKAIQYAKDF------AYAAAAGGRPG 263 Query: 283 YKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV------QAEAADQFLK-- 334 K ++ +TDGE + D ++ + + I V A +K Sbjct: 264 ATKVMVVVTDGE----SHDGSMLKAVIDQCNNDNILRFGIAVLGYLNRNALDTKNLIKEI 319 Query: 335 ----NCASPDRFYSVQNSRKLHDAFLRIGKEMVK 364 + + F++V + L + +G+++ Sbjct: 320 KAIASIPTERYFFNVSDEAALLEKAGTLGEQIFS 353 >gi|308173901|ref|YP_003920606.1| activator of nitric oxide reductase [Bacillus amyloliquefaciens DSM 7] gi|307606765|emb|CBI43136.1| putative activator of nitric oxide reductase [Bacillus amyloliquefaciens DSM 7] Length = 638 Score = 40.2 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 64/169 (37%), Gaps = 19/169 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S+I +++D S SM DK+ R I + +KS+ + +V G Sbjct: 434 KQDPSSEIDAVFTLLVDCSASM-------FDKMDETKRGIVLFHEALKSVSVPHQIV--G 484 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK---EKLEHI 276 +++ +T + I + + L ++ A + ++ Sbjct: 485 FWEDTNEATETSQPNFFNTVIPFEESLLQAAGP----LIMQLAPEEDNRDGYAIRQMTKQ 540 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 + +K++I +DGE ++ + ++ EA++RG V + Sbjct: 541 LMRRSEAQKFLIVFSDGEPAAFGYEQNGIVDTSEAVIEARKRGIEVINV 589 >gi|171914256|ref|ZP_02929726.1| TerY3 [Verrucomicrobium spinosum DSM 4136] Length = 345 Score = 40.2 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 35/158 (22%), Positives = 62/158 (39%), Gaps = 17/158 (10%) Query: 168 GLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 L + ++LD S SM G G+ + S ML ++ P +TF S+ Sbjct: 3 RLPVYVLLDCSESM---IGNGLRGMRTGISS---MLKALRQNPHALETAWISFITFDSRA 56 Query: 228 VQTFPLAWGVQHIQEKIN--RLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 PL +++ RL+ T L +I ++ + D++ Sbjct: 57 ELKSPL-----QSLDEVQPPRLLVRPGTSLGSALLLLSERILQEVKRTQPGTLTKGDFRP 111 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIG 323 +I +TDG+ P D + +L N + A +YA+G Sbjct: 112 IVILITDGQ---PTDDWRSALREMNSTVKI-ANLYAVG 145 >gi|149918235|ref|ZP_01906727.1| hypothetical protein PPSIR1_11520 [Plesiocystis pacifica SIR-1] gi|149820995|gb|EDM80402.1| hypothetical protein PPSIR1_11520 [Plesiocystis pacifica SIR-1] Length = 429 Score = 40.2 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 52/181 (28%), Gaps = 15/181 (8%) Query: 167 IGLDMMMVLDVSLSMNDHFGPGMDKLG-VATRSIREMLDIIKSIPDVNNVVRSGLVTFSS 225 GLD+M V+D + SM D +L V R ++ + + VN G Sbjct: 220 TGLDLMFVVDTTGSMGDELSYLQVELEDVINRVSSDLANDVDLRLSVNFYRDEG----DD 275 Query: 226 KIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKK 285 +V+ FP ++ ++ A EH + + Sbjct: 276 YVVEPFPFTDDPSVALTQLAAQSTDGGGDWPEAVDAALV-----NAVYEHQWRDSARA-R 329 Query: 286 YIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV-QAEAADQFLKN---CASPDR 341 + + D +A G V + + +FL A+ Sbjct: 330 LMFLILDAPPHDTPQIRDVLTSATEDAAALGVRVIPVAASGIDKPTEFLLRSLDIATGGT 389 Query: 342 F 342 + Sbjct: 390 Y 390 >gi|326795600|ref|YP_004313420.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546364|gb|ADZ91584.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 635 Score = 40.2 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 72/209 (34%), Gaps = 22/209 (10%) Query: 163 SKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVT 222 K+ +++D S SM + T +++ + P+ + G+ Sbjct: 22 PKAFADTQFRIIVDASGSM------VISDPDKLTSEALKLISNL--APEEHAT--LGVWL 71 Query: 223 FSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDD 282 F + P A + + K+ + T+ L+ I + K D Sbjct: 72 FGEEPRVLLPEAPINKETKAKLASYVDNYVTQD---LKTDLESIIELLLKTPDTPSLSDG 128 Query: 283 YKKYIIFLTDG------ENSSPNIDNKESLFYCNEA-KRRGAIVYAIGVQAEAADQFLKN 335 + ++ I +TDG + + + EA ++RG ++ + + + LK Sbjct: 129 FDRHWILVTDGMVDVSLDKDVNEASRRRIMTTLTEALEKRGIHLHTVSMTGYTDEDMLKQ 188 Query: 336 CA--SPDRFYSVQNSRKLHDAFLRIGKEM 362 A + V + +L D F RI + Sbjct: 189 LAVKTNASHTEVASPEELLDTFDRIFAQA 217 >gi|326424412|ref|NP_763435.2| TPR domain-containing protein in aerotolerance operon [Vibrio vulnificus CMCP6] gi|319999802|gb|AAO08425.2| TPR domain protein in aerotolerance operon [Vibrio vulnificus CMCP6] Length = 656 Score = 40.2 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 41/142 (28%), Gaps = 13/142 (9%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 + W S + + ++V+D+S SM +L A Sbjct: 60 ILLWLLAWVIAILALASPSWQHTTRPS-FNSHQNRVLVMDMSQSM-YAQDIAPSRLQQAR 117 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRL----IFGST 252 ++L K + L+ ++ PL Q + I L + Sbjct: 118 YKALDLLPQWKEGN-------TALIAYAGDAYLLSPLTSDSQTLANLIQNLSPDIMPYQG 170 Query: 253 TKSTPGLEYAYNKIFDAKEKLE 274 ++ T L A ++ A Sbjct: 171 SRLTSALVLAKAQLEKAGAAKG 192 >gi|115964593|ref|XP_001196489.1| PREDICTED: similar to calcium-activated chloride channel family member 1 [Strongylocentrotus purpuratus] Length = 843 Score = 40.2 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 68/206 (33%), Gaps = 29/206 (14%) Query: 132 RYEMPFIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDK 191 Y + + P+ T+ V + +++VLD+S SM + + Sbjct: 294 NYRSAWSVMRNHTDFQAVVPPVSNTTPVFEVLQLSSVRSVVLVLDISNSMLGNR-----R 348 Query: 192 LGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLA--WGVQHIQEKINRLI- 248 +S + + G+V F S L Q + L Sbjct: 349 FDRMIQSSTVYIMNVIPADSK-----LGIVVFDSTSQIRANLTDITNTASRQRLVKALPK 403 Query: 249 -FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDG-ENSSPNIDNKESL 306 TT G+ Y +E L A+G YI+ L+DG EN++P I + Sbjct: 404 SPKGTTCIGCGILYWVRI----REVLGSYAQGG-----YILLLSDGVENNAPYI-----I 449 Query: 307 FYCNEAKRRGAIVYAIGVQAEAADQF 332 ++ + G I I + A Q Sbjct: 450 DIYDDINKSGVIFDTITISNAADQQM 475 >gi|92020110|dbj|BAE93251.1| type VI collagen alpha-2 chain [Sus scrofa] Length = 246 Score = 40.2 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 63/179 (35%), Gaps = 17/179 (9%) Query: 165 SDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS 224 + +D++ +LD S + + K + L + + D N R L+ F Sbjct: 70 TQRPVDIVFLLDGSERLGEQ---NFHKAQRFVEDVSRRLTLARRDDDPLNA-RVALLQFG 125 Query: 225 S--KIVQTFPLAWGVQHIQEKINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHD 281 + FPL + + IQE + S + G+ +A N++ Sbjct: 126 GPREQQVAFPLTFNLTVIQEALASSRYLNSFSHVGTGIVHAINQVVLGAPAGARRHAELA 185 Query: 282 DYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPD 340 +FLTDG + +++ + +++ + + V ++ L + D Sbjct: 186 -----FVFLTDGITGNESLE-----EAVHSMRKQNVVPTVVAVGSDVDADVLSKISLGD 234 >gi|54026054|ref|YP_120296.1| putative magnesium chelatase subunit [Nocardia farcinica IFM 10152] gi|54017562|dbj|BAD58932.1| putative magnesium chelatase subunit [Nocardia farcinica IFM 10152] Length = 637 Score = 40.2 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 47/163 (28%), Gaps = 18/163 (11%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRS-G 219 + G ++ V+D S SM D+L T +I +L D VV G Sbjct: 442 AYREGREGNLVVFVVDASGSMA-----ARDRLSAVTGAIVTLLRDAYQRRDKVAVVSVRG 496 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKG 279 P V ++ L G T G + A+E + Sbjct: 497 AT-----ADLVLPPTSSVDIAVRRLAGLRTGGKTPLAAG-------LLRAREIVARERVR 544 Query: 280 HDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAI 322 + ++ LTDG + + R G + Sbjct: 545 DPRRRPMLVLLTDGRATGGADPVGRAKTAAGLLARDGVTAMVV 587 >gi|300864750|ref|ZP_07109602.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506] gi|300337260|emb|CBN54750.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506] Length = 845 Score = 40.2 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 70/194 (36%), Gaps = 24/194 (12%) Query: 173 MVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFS-SKIVQTF 231 +++D S SM P D L + + + + + + G VT +I Sbjct: 65 IIIDRSGSMYYDLEPLKDTLIKLLT--LDEYNNYQLLISLISYASHGDVTCHFQRIPIQE 122 Query: 232 PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLT 291 + +I ++I ++ GS T + ++ A + E I T Sbjct: 123 VMQLNSSYI-QEIKKIKTGSFTGMSQSMKLA-KSLIKEGELTA------------INLHT 168 Query: 292 DGENSSPNIDNKESLF--YCNEAKRRGAIVYAIGVQAEAAD-QFLKNCAS--PDRFYSVQ 346 DG + P+ +++ + C E K G V+A + +AD + L A+ Sbjct: 169 DGYANDPSANSEAAALERICEELK--GMDVFANTISYSSADFKLLSKIANTLSGNCIRAG 226 Query: 347 NSRKLHDAFLRIGK 360 + ++DA K Sbjct: 227 EVKMVYDALYDTAK 240 >gi|300022836|ref|YP_003755447.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299524657|gb|ADJ23126.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 334 Score = 40.2 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 28/133 (21%), Positives = 54/133 (40%), Gaps = 14/133 (10%) Query: 169 LDMMMVLDVSLSMN-DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKI 227 +D + ++D++ SMN G + ++D+++S+P + R GL F+ +I Sbjct: 38 VDAVAIVDITGSMNTRDMGSPPGSASRLQAARNALVDLVQSVPCQS---RLGLGVFTERI 94 Query: 228 VQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEY--AYNKIFDAKEKLEHIAKGHDDYKK 285 L + I + L + L++ A+ + L K Sbjct: 95 SF---LLFNPVEICNNYDGLE-----DALSELDWRMAWQGDSYIAKGLYSAIDIAASLKA 146 Query: 286 YIIFLTDGENSSP 298 +IFLTDG+ + P Sbjct: 147 DLIFLTDGQEAPP 159 >gi|77407764|ref|ZP_00784518.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|77173630|gb|EAO76745.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|218693241|gb|ACL01124.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693247|gb|ACL01127.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693329|gb|ACL01168.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693351|gb|ACL01179.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693361|gb|ACL01184.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693365|gb|ACL01186.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693375|gb|ACL01191.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693413|gb|ACL01210.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693417|gb|ACL01212.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693437|gb|ACL01222.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693441|gb|ACL01224.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693443|gb|ACL01225.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693445|gb|ACL01226.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693449|gb|ACL01228.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693451|gb|ACL01229.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.2 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 70/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKNTGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|332992330|gb|AEF02385.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas sp. SN2] Length = 1360 Score = 40.2 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 60/187 (32%), Gaps = 42/187 (22%) Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLI------FGSTTKSTPGLEYAYNKIFDAKEKL 273 V + + L G + + + + + + + D+ Sbjct: 501 AVNYYGGRDVDYGLTRGNYKVSNTVRKNTRVSHRSAYLGGDAVQPNGCSDDNLSDSDCIN 560 Query: 274 EHIAKGHD----------DYKKYIIFLTDGE-NSSPNIDNKESLF--YCN---------- 310 E+I G +I+ L+DGE N++ ++ ESL C Sbjct: 561 EYIPSGATYISPVNDLQCQTNNHIVLLSDGEANNNHSVSLIESLLGDSCTGSGGEKCGLN 620 Query: 311 ----------EAKRRGAIVYAIGVQAEA-ADQFLKNCA--SPDRFYSVQNSRKLHDAFLR 357 R I + IG A + A+ FL A S FY +S +L +AF Sbjct: 621 LVKNVSDSDTSVIDRRVITHTIGFAANSTANNFLNQLAVQSGGGFYQADDSTELLEAFNT 680 Query: 358 IGKEMVK 364 I + + Sbjct: 681 ILRSVKD 687 >gi|209886526|ref|YP_002290383.1| TadE family protein [Oligotropha carboxidovorans OM5] gi|209874722|gb|ACI94518.1| TadE family protein [Oligotropha carboxidovorans OM5] Length = 187 Score = 40.2 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 33/162 (20%), Positives = 58/162 (35%), Gaps = 8/162 (4%) Query: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYIL-DHSLL--YTATK 62 +R F N KGS ++ A++ P+ F ++ ++E + FF L D + L + Sbjct: 15 LRRFGANRKGSAAVQFALVAPLFFALIFAIVEVALVFFANQILETGTQDTARLLFTHQAQ 74 Query: 63 ILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQ----DINNIERSTSLSIII 118 N N S + + D R+ F D S++ Sbjct: 75 DQNLTGEQVRTNLCNRVSVLLACDGVILDVRSYPAGQPFTVPPFFDGGGNPISSNFLYQP 134 Query: 119 DDQHKDYNLSAVSRYEMPFIFCTFPWC-ANSSHAPLLITSSV 159 D + + + Y+ P IF + N L+TS+V Sbjct: 135 PDPNSSNIVVVRAFYKWPLIFTNLGFSLINIGTDKRLLTSTV 176 >gi|70730212|ref|YP_259951.1| TPR domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344511|gb|AAY92117.1| TPR domain protein, putative [Pseudomonas fluorescens Pf-5] Length = 530 Score = 40.2 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 53/136 (38%), Gaps = 16/136 (11%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQT 230 +++ LD+S SMN P +L A +++++ R+GL+ ++ Sbjct: 97 LIIALDLSPSMNASDLPP-SRLEAAKHKLQQLVQQ-------RRGARTGLIAYAGSAHLV 148 Query: 231 FPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFL 290 P + + I L GS PG + I A+ L+ ++ + Sbjct: 149 LPASDDPALLDSFIQAL--GSDLIQRPG-KDVLGVIRQAQRLLQAEHSPGS-----LLLI 200 Query: 291 TDGENSSPNIDNKESL 306 +DG ++S K+ L Sbjct: 201 SDGADTSQLAAIKQQL 216 >gi|328553175|gb|AEB23667.1| von Willebrand factor type A [Bacillus amyloliquefaciens TA208] gi|328912051|gb|AEB63647.1| putative activator of nitric oxide reductase [Bacillus amyloliquefaciens LL3] Length = 638 Score = 40.2 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 64/169 (37%), Gaps = 19/169 (11%) Query: 160 KISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSG 219 K S+I +++D S SM DK+ R I + +KS+ + +V G Sbjct: 434 KQDPSSEIDAVFTLLVDCSASM-------FDKMDETKRGIVLFHEALKSVSVPHQIV--G 484 Query: 220 LVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAK---EKLEHI 276 +++ +T + I + + L ++ A + ++ Sbjct: 485 FWEDTNEATETSQPNFFNTVIPFEESLLQAAGP----LIMQLAPEEDNRDGYAIRQMTKQ 540 Query: 277 AKGHDDYKKYIIFLTDGENSSPNIDNK---ESLFYCNEAKRRGAIVYAI 322 + +K++I +DGE ++ + ++ EA++RG V + Sbjct: 541 LMRRSEAQKFLIVFSDGEPAAFGYEQNGIVDTSEAVIEARKRGIEVINV 589 >gi|256959904|ref|ZP_05564075.1| anchor protein [Enterococcus faecalis Merz96] gi|256950400|gb|EEU67032.1| anchor protein [Enterococcus faecalis Merz96] Length = 729 Score = 40.2 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 45/143 (31%), Gaps = 14/143 (9%) Query: 161 ISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLD-IIKSIPDVNNV---- 215 + + +D+++V D S S +D+F + + + + ++ S Sbjct: 68 VQAGETEPVDLVVVEDASGSFSDNFPHVRQAIDEVVQGLSDQDRVMLASYRGGKQFMFPD 127 Query: 216 --VRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 + + + L + + T + PGL+ A + L Sbjct: 128 GKTKINSADYDMNVRVNTQLTYDKSQFVSGFGDVRTYGGTPTAPGLKLALDTYNQTHGDL 187 Query: 274 EHIAKGHDDYKKYIIFLTDGENS 296 + Y + +TDG + Sbjct: 188 TNRKT-------YFLLVTDGVAN 203 >gi|240255600|ref|NP_680085.3| von Willebrand factor A domain-containing protein 1 precursor [Mus musculus] gi|81901574|sp|Q8R2Z5|VWA1_MOUSE RecName: Full=von Willebrand factor A domain-containing protein 1; AltName: Full=von Willebrand factor A domain-related protein; Flags: Precursor gi|20071274|gb|AAH26919.1| Von Willebrand factor A domain containing 1 [Mus musculus] gi|26346104|dbj|BAC36703.1| unnamed protein product [Mus musculus] gi|54611403|gb|AAH36166.1| Von Willebrand factor A domain containing 1 [Mus musculus] gi|123295149|emb|CAM18384.1| von Willebrand factor A domain containing 1 [Mus musculus] gi|148683083|gb|EDL15030.1| von Willebrand factor A domain containing 1 [Mus musculus] Length = 415 Score = 40.2 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 69/194 (35%), Gaps = 28/194 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F T A S L +S + S+ SD D++ +LD S S++ + + + Sbjct: 1 MLFWTAFSMALSLRLALARSSIERGSTASDPQGDLLFLLDSSASVSHYEFSRVREFVG-- 58 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLIFG----S 251 ++ ++ +R+ LV + + I + + Sbjct: 59 -------QLVATMSFGPGALRASLV--HVGSQPHTEFTFDQYSSGQAIQDAIRVAPQRMG 109 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + L YA ++F E A+ K ++++TDG +S P + E Sbjct: 110 DTNTGLALAYAKEQLF----AEEAGARPGVP--KVLVWVTDGGSSDPVGPPMQ------E 157 Query: 312 AKRRGAIVYAIGVQ 325 K G ++ + Sbjct: 158 LKDLGVTIFIVSTG 171 >gi|15072454|gb|AAK38350.1| von Willebrand factor A-related protein [Mus musculus] Length = 415 Score = 40.2 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 69/194 (35%), Gaps = 28/194 (14%) Query: 137 FIFCTFPWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMNDHFGPGMDKLGVAT 196 +F T A S L +S + S+ SD D++ +LD S S++ + + + Sbjct: 1 MLFWTAFSMALSLRLALARSSIERGSTASDPQGDLLFLLDSSASVSHYEFSRVREFVG-- 58 Query: 197 RSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQEKI-NRLIFG----S 251 ++ ++ +R+ LV + + I + + Sbjct: 59 -------QLVATMSFGPGALRASLV--HVGSQPHTEFTFDQYSSGQAIRDAIRVAPQRMG 109 Query: 252 TTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNE 311 T + L YA ++F E A+ K ++++TDG +S P + E Sbjct: 110 DTNTGLALAYAKEQLF----AEEAGARPGVP--KVLVWVTDGGSSDPVGPPMQ------E 157 Query: 312 AKRRGAIVYAIGVQ 325 K G ++ + Sbjct: 158 LKDLGVTIFIVSTG 171 >gi|221067364|ref|ZP_03543469.1| conserved hypothetical protein [Comamonas testosteroni KF-1] gi|220712387|gb|EED67755.1| conserved hypothetical protein [Comamonas testosteroni KF-1] Length = 419 Score = 40.2 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 33/103 (32%), Gaps = 5/103 (4%) Query: 30 IVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENG-----NNGKKQKNDFSYRII 84 MG+ ++ F V+ +L LD L A ++ Q + + G N + Sbjct: 43 GFMGIALDFGRLFIVRTELQTALDSCALAAARELNKQPDAISRAVSAGAAAGNLNGVNLQ 102 Query: 85 KNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHKDYNL 127 W + + F S + ++ H N+ Sbjct: 103 SANWSGQGQITAADISFRDASYLSTTSATAAVYSQCTHTQSNI 145 >gi|17555086|ref|NP_499239.1| hypothetical protein T16H12.4 [Caenorhabditis elegans] gi|27923998|sp|P34567|TF2H2_CAEEL RecName: Full=General transcription factor IIH subunit 2; AltName: Full=TFIIH basal transcription factor complex subunit 2 gi|15718246|emb|CAA83139.3| C. elegans protein T16H12.4, confirmed by transcript evidence [Caenorhabditis elegans] Length = 376 Score = 40.2 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 69/190 (36%), Gaps = 19/190 (10%) Query: 171 MMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSK-IVQ 229 +M+V+D S M P + V ++++ LD + N + + GL+T + + Sbjct: 65 VMIVIDCSRFMTSKAMPP-SRFVVVMKALQTFLD---RFFEQNPIAQIGLITCKDRKADR 120 Query: 230 TFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIF 289 + ++ ++E +N L T++ G +++ GH + ++ Sbjct: 121 LTMMTGNIRVLKESLNTL-----TEAFCGGDFSLQNALQLACANLKGMPGHVSREVVLVI 175 Query: 290 LTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGVQAE--AADQFLKNCASPDRFYSVQN 347 ++ ID KR AIG+ AE + K A+ + + Sbjct: 176 -----SALSTIDPGNIYSTIETMKRMNIRCSAIGLSAEMFVCKEMAK--ATKGEYSVALD 228 Query: 348 SRKLHDAFLR 357 L F + Sbjct: 229 PDHLQLLFSK 238 >gi|218693423|gb|ACL01215.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693429|gb|ACL01218.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693431|gb|ACL01219.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 70/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKNTGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|145494260|ref|XP_001433124.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124400241|emb|CAK65727.1| unnamed protein product [Paramecium tetraurelia] Length = 903 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 15/105 (14%), Positives = 37/105 (35%), Gaps = 13/105 (12%) Query: 169 LDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 +D++ +LD++ SMN ++ +E ++ VR G V + Sbjct: 247 IDVIFILDLTKSMNRWLEAIKKQIAKLIYQFKEKINGYG--------VRIGFVGYRDTCD 298 Query: 229 QTFPLAW-----GVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFD 268 + L + + I + I++ + + A + + Sbjct: 299 EEEQLVYRCLTENFEEIIKVISKQVAQGGGDQAEDIVTALEQGLN 343 >gi|317486290|ref|ZP_07945122.1| OmpA family protein [Bilophila wadsworthia 3_1_6] gi|316922460|gb|EFV43714.1| OmpA family protein [Bilophila wadsworthia 3_1_6] Length = 340 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 58/165 (35%), Gaps = 19/165 (11%) Query: 170 DMMMVLDVSLSMNDHFGP-GMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIV 228 ++D S SM +F G K+ VA I+ + D++ + + GL TF+ Sbjct: 36 SFNYLVDYSGSMMMNFPSVGKTKMAVAKEVIKRVNDMVPEL-----GYQGGLYTFAPYGA 90 Query: 229 QTFPLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYII 288 W + ++ L T +A + + + +I Sbjct: 91 VVNQGPWVRSTLAAGVDSLKDNLET-------FARFTPMGDGIQAHNGIISQMTPRAAVI 143 Query: 289 FLTDGENSSP--NIDNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 ++DGE++ ID ++++ N + I V + Q Sbjct: 144 LVSDGESNRGISPIDEVKAIYAANP----NVCFHVISVASSPEGQ 184 >gi|218693357|gb|ACL01182.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693425|gb|ACL01216.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693427|gb|ACL01217.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 70/192 (36%), Gaps = 11/192 (5%) Query: 37 ETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNEL 96 ET+ + K + + + I+ + + +K+K + + K+ D + Sbjct: 95 ETAPIGYKKTDKTWKVKVA--DNGATIIEGMDADKAEKRKEVLNAQYPKSAIYEDTK--- 149 Query: 97 RENGFAQDINNIERSTSLSIIIDDQHKDYNLSAVSRYEMPFIFCTFPWCANSSHAPLLIT 156 EN ++ + + KD + N L + Sbjct: 150 -ENYPLVNVEGSKVGEQYKALNPINGKDGRREIAEGWLSKKNTGVNDLDKNKYKIELTVE 208 Query: 157 SSVKISSKS-DIGLDMMMVLDVSLSMNDHFGPGMDKLGVATRSIREMLDIIKSIPDVNNV 215 + +K + LD++++LD S SMN+ + A ++ +++D I S N Sbjct: 209 GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITS----NKD 264 Query: 216 VRSGLVTFSSKI 227 R LVT++S I Sbjct: 265 NRVALVTYASTI 276 >gi|118363890|ref|XP_001015168.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila] gi|89296935|gb|EAR94923.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila SB210] Length = 714 Score = 40.2 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 65/198 (32%), Gaps = 10/198 (5%) Query: 60 ATKILNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIID 119 E+G + D +KN T N+ + D N ++ D Sbjct: 378 GRTARQGEDGTYSIVLQKDTLQAFLKNDKDTMQINKR